ansaurus

Question

Answer 1

A:

My guess is that when you take the non paramaterized route, your guid has to be converted from a varchar to a UniqueIdentifier which may cause an index not to be used, while it will be used taking the paramatarized route.

I've seen this happen with using queries that have a smalldatetime in the where clause against a column that uses a datetime.

Matt Wrock 2009-11-03 12:58:19

Answer 2

+2 A:

If you provide an explicit value, SQL Server can use statistics of this field to make a "better" query plan decision. Unfortunately (as I've experienced myself recently), if the information contained in the statistics is misleading, sometimes SQL Server just makes the wrong choices.

If you want to dig deeper into this issue, I recommend you to check what happens if you use other GUIDs: If it uses a different query plan for different concrete GUIDs, that's an indication that statistics data is used. In that case, you might want to look at sp_updatestats and related commands.

EDIT: Have a look at DBCC SHOW_STATISTICS: The "slow" and the "fast" GUID are probably in different buckets in the histogram. I've had a similar problem, which I solved by adding an INDEX table hint to the SQL, which "guides" SQL Server towards finding the "right" query plan. Basically, I've looked at what indices are used during a "fast" query and hard-coded those into the SQL. This is far from an optimal or elegant solution, but I haven't found a better one yet...

Heinzi 2009-11-03 13:03:24

Just tried running the slow, non-parameterized with another GUID and it produced a nice query plan and executed as expected. Could you perhaps elaborate on the what I need to be looking for in regards to the statistics? Is it a specific index that needs to be rebuild or similar?

soren.enemaerke 2009-11-03 13:20:36

I've edited my post to add some more details.

Heinzi 2009-11-03 13:39:32

Admittedly my first look into the DBBC SHOW_STATISTICS but I seem to decipher that the GUIDs are in seperate buckets (the "slow" with RANGE_ROWS equal to 316 and the "fast" with RANGE_ROWS equal to 0 (?)). Unfortunately I'm using Linq2SQL so I have no real path for setting query hints. Could I recalculate statistics somehow?

soren.enemaerke 2009-11-03 13:57:24

Yes, both UPDATE STATISTICS and sp_updatestats should do that.

Heinzi 2009-11-03 14:13:26

Answer 3

A:

Its difficult to tell without looking at the execution plans, however if I was going to guess at a reason I'd say that its a combinaton of parameter sniffing and poor statistics - In the case where you hard-code the GUID into the query, the query optimiser attempts to optimise the query for that value of the parameter. I believe that the same thing happens with the parameterised / prepared query (this is called parameter sniffing - the execution plan is optimised for the parameters used the first time that the prepared statement is executed), however this definitely doesn't happen when you declare the parameter and use it in the query.

Like I said, SQL server attempt to optimise the execution plan for that value, and so usually you should see better results. It seems here that that information it is basing its decisions on is incorrect / misleading, and you are better off (for some reason) when it optimises the query for a generic parameter value.

This is mostly guesswork however - its impossible to tell really without the execution - if you can upload the executuion plan somewhere then I'm sure someone will be able to help you with the real reason.

Kragen 2009-11-03 13:25:24

Uploaded the execution plans, see my edited post

soren.enemaerke 2009-11-03 14:01:17

Answer 4

+2 A:

I'm not looking for advise on which indexes to create or the like, I'm just trying to understand why the query plan and execution are so dissimilar on three seemingly similar queries.

You seem to have two indexes:

IX_NonCluster_Config (ProductID, ServerTime)
IX_NonCluster_ProductID_CookieID_With_ServerTime (ProductID, CookieID) INCLUDE (ServerTime)

The first index does not cover CookieID but is ordered on ServerTime and hence is more efficient for the less selective ProductID's (i. e. those that you have many)

The second index does cover all columns but is not ordered, and hence is more efficient for more selective ProductID's (those that you have few).

In average, you ProductID cardinality is so that SQL Server expects the second method to be efficient, which is what it uses when you use parametrized queries or explicitly provide selective GUID's.

However, your original GUID is considered less selective, that's why the first method is used.

Unfortunately, the first method requires additional filtering on CookieID which is why it's less efficient in fact.

Quassnoi 2009-11-03 14:11:02

Ahh...perhaps this also explains why I get the "fast" execution if I remove the redundant parts of the where clause (the OR parts that checks for IS NULL is redundant) since no additional filtering is then required on the CookieID. The query is generated through Linq2SQL to I can't really modify but I will check to see if I can make the column Not-Null which seems to remove the additonal clauses and generate the fast query.

soren.enemaerke 2009-11-03 14:37:41

ansaurus

tags:

views:

answers:

SQL Server query plan differences

related questions