ansaurus

Question

Answer 1

+2 A:

your problem is not the WHERE but the JOIN, you are getting an implicit conversion and a scan on the JOIN, on the WHERE condition you are getting a SEEK

ON p4f.FileRelease = (af.path+'#'+cast(af.revision as varchar))

Parallelism could also be a problem, try adding MAXDOP=1

Are your statistics up to date? Is there excessive fragmentation?

SQLMenace 2010-07-06 18:13:32

Yep, that's what I'm saying: i'm getting a seek on the where and a scan on the join. And I'd like to understand why. I actually am using MAXDOP=1, I just omitted it here. So all the data I've included is for non-parallel processing.

Dmitry Beransky 2010-07-06 18:51:24

the optimizer doesn't know whet the value of af.path+'#'+cast(af.revision as varchar) is so it has to scan the whole table

SQLMenace 2010-07-06 19:04:47

Answer 2

A:

Try moving "af.tracked_change_id = 1" into the join clause.

INNER JOIN AnalyzedFileView af 
ON p4f.FileRelease = (af.path+'#'+cast(af.revision as varchar))
AND af.tracked_change_id = 1

WHERE is applied after the INNER JOIN

ThatSteveGuy 2010-07-06 18:19:21

Doing so had no affect. Shouldn't the optimizer be smart enough to make rearrangements like that automatically anyway?

Dmitry Beransky 2010-07-06 18:55:22

Answer 3

+2 A:

You are joining varchar column p4f.FileRelease with an nvarchar column (af.path). Since the data types don't match, SQL has to convert one's type to the other's (and of course it can't go from nvarchar to varchar). In converting af.path to nvarchar, it loses the ability to use the index to lookup/filter those values, resulting in the need to scan and convert all possible rows.

The best solution is to store the data as matching data types (change column p4f.FileRelase to nvarchar, or af.path to varchar). Since no one ever gets to modify existing database structures, a work-around might be to explicitly cast af.path to varchar in the query. Test it and see... though of course you can't do this if the data truly requires double-byte formatting.

Philip Kelley 2010-07-06 18:39:12

Even if it was stored as matching types the fact that he is concating two fields with a # inbetween will force a scan won't it?

Mike M. 2010-07-06 18:42:27

Awesome! Looks like varchar vs nvarchar was the problem. thanks!

Dmitry Beransky 2010-07-06 19:14:18

@Mike M, there is little difference *to SQL* between "...WHERE ColName = @Variable" and "...WHERE ColName = @Variable + @Variable". It takes whatever value you've generated, and looks it up in the table.

Philip Kelley 2010-07-06 20:34:56

Answer 4

A:

Philip Kelley spotted the problem. It was a datatype mismatch between varchar in P4FileReleases and nvarchar in AnalyzedFileView.

Dmitry Beransky 2010-07-06 19:16:34

ansaurus

tags:

views:

answers:

Getting rid of full index scan

related questions