ansaurus

Question

SQL SERVER 2008 JOIN hints

Answer 1

+3 A:

Can someone please tell me why applying LOOP hints to all my queries is a bad idea. I read somewhere that a LOOP JOIN is default JOIN method for query optimiser but couldn't verify the validity of the statement?

Because this robs the optimizer of the opportunity to consider other methods which can be more efficient.

When are JOIN hints used? When the sh*t hits the fan and ghost busters ain't in town?

When the data distribution (on which the optimizer makes its decisions) is severely skewed and the statistics are no able to represent it correctly.

What's the difference between LOOP, HASH and MERGE hints? BOL states that MERGE seems to be the slowest but what is the application of each hint?

These are different algorithms.

LOOP is nested loops: for each record from the outer table, the inner table is searched for matches (using the index of available). Fastest when only a tiny portion of records from both tables satisfy the JOIN and the WHERE conditions.
MERGE sorts both tables are traverses them in the sort order, skipping the unmatched records. Fastest for the FULL JOINs and when both recordsets are already sorted (from previous sort operations or when the index access path is used)
HASH build a hash table in the temporary storage (memory or tempdb) from one of the tables and searches it for each record from the other one. Fastest if the large portion of records from either table matches the WHERE and JOIN condition.

Quassnoi 2010-03-15 12:18:25

Great explanation! I don't suppose you could give your 2 cents on what Martin's reply?

Nai 2010-03-15 12:52:21

Answer 2

+2 A:

The Estimated execution plan show 57% on the Table Update and 40% on a Hash Match (Aggregate). I did some snooping around and came across the topic of JOIN hints. So I added a LOOP hint to my inner join and WA-ZHAM! The new execution plan shows 38% on the Table Update and 58% on an Index Seek.

Surely that means that your proposed plan is worse? Assuming the table update takes a constant time it is now being out costed by the index activity.

Martin Smith 2010-03-15 12:22:26

Is that a fair assumption? I have always been under the impression that having the bulk of the work on Index Seek is the way to go.

Nai 2010-03-15 12:49:15

I think its a fair assumption though I'm happy to stand corrected if I'm wrong. Is there a clustered index on Analytics.UserGUID? If so updating the GUID to a different value will likely cause a fair bit of IO which may explain any performance issue you are getting.

Martin Smith 2010-03-15 13:07:25

I have a non-clustered index on Analytics.UserGUID. I'm updating the UserID columns not the UserGUID. I have no indexes on the Analytics.UserID column.

Nai 2010-03-15 13:11:41

Sorry didn't read that bit properly!

Martin Smith 2010-03-15 13:23:49

ansaurus

tags:

views:

answers:

SQL SERVER 2008 JOIN hints

related questions