ansaurus

Question

SQL 2005 Query Optimisation

Answer 1

+2 A:

Short answer is you have a CURSOR here. The scalar UDF is run per row of output.

The udf could be 2 LEFT JOINs onto derived tables. A rough outline:

...
COALESCE (F.xxx, L.xxx) --etc
...
FROM
 dbo.Logs l (readuncommitted)
 LEFT JOIN
 (select DISTINCT --added after comment
FileId, FileUrl from dbo.[Rollup]) R ON L.FileUrl = R.FileUrl
 LEFT JOIN
 (SELECT DISTINCT --added after comment
                f.FileId,
FileName ,
left(@PrependURLProtocol, 4) + '%' AS Left4
                FROM
                dbo.[Files] f

                INNER JOIN
                dbo.[Servers] s on s.[ServerId] = f.[ServerId]

                INNER JOIN
                dbo.[URLs] u on 
                           u.[ServerId] = f.[ServerId]
) F ON L.CleanFileName = R.FileName AND L.FileURL LIKE F.Left4
...

I'm also not sure if you need the NOT EXISTS because of how the udf works. If you do, make sure the columns are indexed.

gbn 2009-09-22 13:22:32

Hi gbn, haven't had much luck implementing this ... it turns up duplicates in the SELECT query which it must not do, i.e. it must select distinct URLs in dbo.Logs and retrieve corresponding FileIds for each. Perhaps I've misinterpreted your instructions?

Richard 2009-09-22 14:06:22

@Richard: You probably need DISTINCT in the derived tables

gbn 2009-09-22 14:10:27

@KM: np, I got a cleanup badge out of it ;-)

gbn 2009-09-22 14:11:27

@gbn - right, so I'll give that a go to see what that turns up; just wondering what DISTINCT will do for performance ;) Thanks

Richard 2009-09-23 16:59:10

@Richard: DISTINCT replaces the udf though, so it should still be quicker

gbn 2009-09-24 04:36:52

@gbn: thanks for the help; your advice has put me on the right track. Still got plenty of tweaking to do elsewhere, though! All the best...

Richard 2009-09-24 14:01:06

Answer 2

+1 A:

Hello,

I think your hotspot is located here:

Left(u.[PrependURLProtocol],4) = left(@URL, 4)

This will cause the server to do a scan on the url table. You should not use a function on a field in a join clause. try to rewrite that to something like

... where PrependURLProtocol like left(@URL, 4) +"%"

And make sure you have an index on the field.

Heiko Hatzfeld 2009-09-22 13:23:44

Answer 3

A:

INSERT INTO dbo.Rollup ([FileURL], [FileId])
SELECT  
 logs.RequestedFile As [URL], 
 FileId = dbo.fn_GetFileIdFromURL(l.RequestedFile, l.CleanFileName)
FROM dbo.Logs l (readuncommitted) LEFT OUTER JOIN dbo.Rollup
 on FileUrl = RequestedFile
WHERE FileUrl IS NULL

The logic here is that if dbo.Rollup does not exist for the given FileUrl, then the left outer join will turn up null. The NOT EXISTS now becomes an IS NULL, which is faster.

Ryan Michela 2009-09-22 13:24:19

Hi Ryan, this looks good - I'll try it out now.

Richard 2009-09-22 14:09:15

ansaurus

tags:

views:

answers:

SQL 2005 Query Optimisation

related questions