ansaurus

Question

SUM/GROUP performance and the Primary Key

Answer 1

+3 A:

A GUID is a poor choice of clustered index key, simply because it's so big: a uniqueidentifier takes 16 bytes, against 4 for an INTEGER. Using an INTEGER field allows the database to pack far more index entries into a 'page', so fewer pages need to be fetched from disk for any given query.

Also note that the cluster key(s) are stored in every non-clustered index (since that's what's used to locate the data), which compounds the problem.
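
To put rough numbers on the key-size argument, here is a back-of-the-envelope sketch. It assumes SQL Server's 8 KB page with a 96-byte header, a 16-byte GUID and a 4-byte integer; the per-entry overhead constant and the function names are hypothetical and purely illustrative, not the engine's real storage-format calculation.

```python
PAGE_BYTES = 8192         # SQL Server page size
PAGE_HEADER_BYTES = 96    # fixed page header
ASSUMED_OVERHEAD = 9      # hypothetical per-entry overhead, for illustration only

INT_BYTES = 4             # INTEGER key
GUID_BYTES = 16           # uniqueidentifier key


def entries_per_page(key_bytes, payload_bytes=0, overhead=ASSUMED_OVERHEAD):
    """Estimate how many index entries fit on one page."""
    usable = PAGE_BYTES - PAGE_HEADER_BYTES
    return usable // (key_bytes + payload_bytes + overhead)


def pages_needed(rows, key_bytes, payload_bytes=0):
    """Estimate how many leaf pages are needed to hold `rows` entries."""
    per_page = entries_per_page(key_bytes, payload_bytes)
    return -(-rows // per_page)  # ceiling division


if __name__ == "__main__":
    for name, size in (("int", INT_BYTES), ("guid", GUID_BYTES)):
        # A non-clustered index on a 4-byte column also carries the cluster key,
        # modelled here as extra payload on every entry.
        print(name,
              entries_per_page(size),
              pages_needed(1_000_000, size),
              pages_needed(1_000_000, 4, payload_bytes=size))
```

Under these assumed numbers a 4-byte key fits 622 entries per page against 323 for a 16-byte key, so the same scan touches roughly twice as many pages. That is a noticeable factor, but on its own it is not an order-of-magnitude one.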

Gary McGill 2009-11-09 19:54:41

@GUID PK: Isn't it a sorting issue? Primary keys are clustered indexes by default, and GUIDs do not grow sequentially, so you should end up with a fragmented clustered index.

Arthur 2009-11-09 20:00:21

@Arthur: you're right that clustered indexes can allow more efficient sorting (and range filtering), but in this case since there's no sorting going on I don't think it'd make a difference. SUM() doesn't need sorted data.

Gary McGill 2009-11-09 20:04:11

@Sum(): Yes, but the WHERE clause would benefit from a clustered index.

Arthur 2009-11-09 20:05:46

@Arthur: I'm not sure why you think that this particular WHERE clause would benefit from the PK being sorted? Unless the GUIDs in ##test happen to be in a contiguous range (so that the sorting means all the hits are from the same page), I wouldn't expect there to be any benefit.

Gary McGill 2009-11-09 20:35:35

OK - it's late here in Klosterneuburg. You are totally right. What I wanted to point out is that when you create a primary key (a clustered index by default, so sorted) and your PK column is a GUID, you should get fragmented pages. The column used in the WHERE clause should be indexed - but that's another discussion. Sorry - I didn't get to the point. It has nothing to do with sorting but with page fragmentation.

Arthur 2009-11-09 20:54:04
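
The fragmentation point can be illustrated with a toy model. The sketch below is not how SQL Server manages pages internally; it assumes fixed-capacity pages, a 50/50 split when a key lands in the middle of a full page, and a fresh page when a key is appended past the end of the index. The function names are hypothetical, and random 128-bit integers stand in for randomly generated GUIDs.

```python
import bisect
import random


def simulate_inserts(keys, capacity=100):
    """Insert keys one by one into a toy index of fixed-capacity sorted pages.

    Returns (page_count, mid_index_splits, average_fill).
    """
    pages = [[]]   # pages in key order; each page is a sorted list of keys
    lows = []      # lows[i] is the smallest key on pages[i]
    splits = 0
    total = 0
    for key in keys:
        total += 1
        # Pick the last page whose smallest key is <= key (page 0 otherwise).
        idx = max(bisect.bisect_right(lows, key) - 1, 0)
        page = pages[idx]
        if len(page) >= capacity:
            if idx == len(pages) - 1 and key > page[-1]:
                # Appending past the end of the index: start a fresh page.
                pages.append([key])
                lows.append(key)
                continue
            # Insert into the middle of a full page: split it in half.
            half = capacity // 2
            new_page = page[half:]
            del page[half:]
            pages.insert(idx + 1, new_page)
            lows.insert(idx + 1, new_page[0])
            splits += 1
            if key >= new_page[0]:
                idx, page = idx + 1, new_page
        bisect.insort(page, key)
        if idx < len(lows):
            lows[idx] = page[0]
        else:
            lows.append(page[0])
    return len(pages), splits, total / (len(pages) * capacity)


def random_guid_like_keys(n, seed=0):
    """Random 128-bit integers as a stand-in for random GUIDs (an assumption)."""
    rng = random.Random(seed)
    return [rng.getrandbits(128) for _ in range(n)]


if __name__ == "__main__":
    print("sequential:", simulate_inserts(range(10_000)))
    print("random:    ", simulate_inserts(random_guid_like_keys(10_000)))
```

In this model, ever-increasing keys fill each page completely and never split one, while random keys keep splitting pages and leave them partly empty, so the same rows occupy more pages.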

@Gary: Your idea makes sense. However, I'm still hesitant to accept that the key size can make *that much* of a difference...

Heinzi 2009-11-09 22:34:28

@Heinzi: Yes; while I stand by my advice not to use a GUID as a cluster key, I think Remus' answer is more relevant to your problem.

Gary McGill 2009-11-10 12:31:22

Answer 2

+3 A:

Looking at the plan in the 'slow' case, it shows that the query does a seek on the index [myDB].[dbo].[myTable].[myGuid] followed by a clustered index seek on [myDB].[dbo].[myTable].[PK__myTable__2334397B]. This only makes sense if you have both created a non-clustered index on myTable(myGuid) and declared myTable(myGuid) as the clustered index key (it appears so, judging by the clustered index's name, 'PK_...', which follows the naming convention for auto-generated 'PRIMARY KEY' constraint names).

Other than that, the plans are very similar, and they're both quite bad in that they include a SORT. The difference in width between the autoInc column and the GUID, and the resulting larger width of the non-clustered index in the first case, may explain the difference, but I doubt that is the full story.

Please redo the test, making sure that you have a clustered key on myGuid and that you do NOT also have a non-clustered index on the same key. The plan should include only a single seek on myTable, using the clustered index, so that you compare exactly the cases you wanted to compare.

Also, obviously, make sure you compare the same ##test content and that the buffer pool cache is warmed up identically in both cases. Run DBCC FREESYSTEMCACHE('All') before each test, then run the query at least 5 times, neglecting the first run (it will be the run that warms up the buffer pool).
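
The measuring procedure above can be sketched as a small timing harness. This is only an illustration: `run_query` is a hypothetical callable that executes the query under test (for example through a database driver of your choice), and nothing here is specific to SQL Server.

```python
import statistics
import time


def time_runs(run_query, runs=5):
    """Call run_query `runs` times; report the first (warm-up) run separately."""
    if runs < 2:
        raise ValueError("need at least one warm-up run and one measured run")
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        run_query()
        timings.append(time.perf_counter() - start)
    warm = timings[1:]  # discard the first run, which warms up the cache
    return {
        "first_run": timings[0],
        "warm_runs": warm,
        "warm_mean": statistics.mean(warm),
    }
```

Running the same harness against both table variants, with the same ##test content, keeps the comparison like for like.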

Also, as Arthur already noted, having an order guarantee on ##test (i.e. a clustered key) could speed things up, as the nested loops can be replaced with a merge join if the ##test content is large enough. If ##test has only a few rows, then the nested loop is better and order makes no difference.
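
To make the nested loops versus merge join distinction concrete, here is a minimal sketch of both strategies over plain lists of keys. It illustrates the general techniques only, not SQL Server's operators; the function names are hypothetical and unique join keys are assumed.

```python
import bisect


def nested_loops_join(outer_keys, inner_sorted):
    """For each outer key, do a binary-search 'seek' into the sorted inner keys.

    The outer input needs no particular order.
    """
    matches = []
    for key in outer_keys:
        pos = bisect.bisect_left(inner_sorted, key)
        if pos < len(inner_sorted) and inner_sorted[pos] == key:
            matches.append(key)
    return matches


def merge_join(outer_sorted, inner_sorted):
    """Single forward pass over two inputs that are both sorted on the join key."""
    matches = []
    i = j = 0
    while i < len(outer_sorted) and j < len(inner_sorted):
        if outer_sorted[i] == inner_sorted[j]:
            matches.append(outer_sorted[i])
            i += 1
            j += 1
        elif outer_sorted[i] < inner_sorted[j]:
            i += 1
        else:
            j += 1
    return matches
```

The nested loops variant pays one seek per outer row, which is cheap for a handful of rows. The merge variant reads each input once but requires both to arrive sorted, which is what an order guarantee on ##test would provide without an extra sort.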

Remus Rusanu 2009-11-09 20:45:37

Thanks for noticing the extra index; I have removed it and updated the query plan. Unfortunately, no change in performance. Thanks also for the `FREESYSTEMCACHE` hint (still no change, though).

Heinzi 2009-11-09 22:23:32

@Heinzi: can you post the new plans? Also, please make sure you post the **actual** execution plan, not the estimate. Afaik `SET SHOWPLAN_TEXT` shows the *estimated* plan. Please use `SET STATISTICS IO ON` instead; it is more relevant. Also, if possible, post the actual execution plan XML (captured from SSMS 'show execution plan' or from Profiler); just make sure it is the actual plan, not the estimate.

Remus Rusanu 2009-11-09 23:09:11

@Remus: Thanks a lot for looking into this. I've added the `STATISTICS IO` output and links to the `STATISTICS XML` output -- you should be able to open the latter with SSMS.

Heinzi 2009-11-11 11:15:10

The 'slow' one has 38k logical reads, a simpler plan and the .sqlplan file shows a compile time of 59ms. The 'fast' plan by comparison has 67k logical reads, a more complex plan and a compile time of 284ms. There is one critical difference though: the 'slow' plan has degree of parallelism 1, the 'fast' one has DOP **4**. So although the GUID index makes a better plan, it does not get parallelized. I'm not sure why yet.

Remus Rusanu 2009-11-11 19:19:01

I see. Thanks for the analysis; I'm quite surprised that the parallelism makes that much of a difference (in particular, since I usually expect the HDDs or the RAM to be the bottleneck rather than the CPU). I guess I'll have to read up on SQL Server's Parallel Query Processing...

Heinzi 2009-11-13 08:49:04
