ansaurus

Question

Same query has nested loops when used with INSERT, but Hash Match without.

Answer 1

+2 A:

A few thoughts:

Make sure your statistics are up to date. Bad statistics account for many of the bizarre "intermittent" query plan problems.
Make sure your indexes are covering, otherwise there's a much higher probability of the optimizer ignoring them.
If none of that helps, you can always force a specific join by writing INNER HASH JOIN as opposed to just INNER JOIN.

Aaronaught 2010-04-22 23:27:06

+1 for the third thought (first two don't make a whole lot of sense to me tho)

Andomar 2010-04-23 00:15:45

@Andomar: Sometimes even a *slight* difference in the parse tree (and an `INSERT` tacked on at the end counts) can result in a totally different plan if the optimizer thinks it's crossing a threshold. Anyway, I'm not 100% sure that either of those will work, but can't hurt to try!

Aaronaught 2010-04-23 01:50:51

@Aaronaught: Exactly. Of the 3 options only an explicit `hash join` is a permanent fix :)

Andomar 2010-04-23 02:10:20

@Andomar: Yeah, but it's a hacky fix. I use join hints only as a last resort!

Aaronaught 2010-04-23 02:21:53

@Aaronaught I was able to use `HASH JOIN` to force the hash match so that I at least could test the difference in performance and see if it was worth my time investigating further.

AaronLS 2010-04-23 17:06:55

Answer 2

+1 A:

Does the destination table have a clustered index? The choice of join may be necessary to facilitate the ordering of the data in the insert. I've seen execution plans differ depending on whether the destination table has a clustered index and what column(s) it is on.

Cade Roux 2010-04-23 02:14:19

Yes it does. This makes sense, because if I force the hash match with `hash join` then there are two extra steps added to the query to do a clustered index insert.

AaronLS 2010-04-23 17:04:08

@AaronLS - Yeah, that's exactly the kind of case I've seen. So does the insert with the explicit hash join perform quicker than the insert without, given the extra steps needed to accommodate the clustered index when you force the hash join?

Cade Roux 2010-04-24 01:07:31

@Cade The `hash join` is slightly faster at about 18 seconds and without it is 20 seconds. So it is not significant at this point. Later on I might test out using a non-clustered primary key and run the entire staging processes and see if overall the speed increase outweighs the decrease, since there would be other queries effected by the change.

AaronLS 2010-04-26 13:27:32

ansaurus

tags:

views:

answers:

Same query has nested loops when used with INSERT, but Hash Match without.

related questions