ansaurus

Question

Can this row-by-row T-SQL script be converted to a set-based script?

Answer 1

+1 A:

Ok the I'd sugggest the following:

Add an extra column to the temp table to @TargetColVarchar value, do this one hit
Join the temp table and TargetColTable to the do the update

You can then optimise based on the execution plans

Update:

Looking at your amended results, I'd say the following is in order:

use a #temp table, these tend to be more performant on large datasets.

a. add more columns to the temp table to record things like : (select ColIntA from TargetColTable where ColTargetPK = @minColTargetPK and (select ColIntB from TargetColTable where ColTargetPK = @minColTargetPK in the big hit up front

b. The string replace is slow I reckon. This will still be slow. I know XML is not the fastest thing in the world. Can you replace the string comp with SQL XML specfic code

c. In the second update at the bottom the where ColTargetPK not in (select minColTargetPK from @minColTargetPKTable) is likely to be slower than a precise join, and you should do both the updated in one hit

However use the Actual Query Plan to work this out.

Preet Sangha 2010-09-06 01:45:26

Thanks for your reply, Preet - if I follow this method how would you suggest I go back and flag the duplicate entries as inactive (the ColTargetStatus = 0 portion of the above script)?

WT_W 2010-09-06 01:56:49

Nevermind, realised the answer was right in front of my face.

WT_W 2010-09-06 02:00:37

Please paste your answer after you've done it - it will make good reference material

Preet Sangha 2010-09-06 02:42:36

@WT_W: agreed with Preet: please edit your question and paste your refactored code to benefit others in the future! Interested in seeing it.

p.campbell 2010-09-06 02:59:50

I have editted the question with new script and comments from testing

WT_W 2010-09-07 01:38:45

Updating the answer:

Preet Sangha 2010-09-07 02:50:47

Thanks for your help Preet, I will implement these changes when I can and repost

WT_W 2010-09-07 05:17:56

Answer 2

A:

Try to do this to your first query

    from tct v1
    join tct v2 on v2.pk = @pk
    where v1.a = v2.a and v1.b = v2.b and v1.dt = v2.dt

and this to your second query

    from tct v1
    join tct v2 on v2.pk = @pk and v1.pk <> @pk
    where v1.a = v2.a and v1.b = v2.b and v1.dt = v2.dt

Denis Valeev 2010-09-06 06:17:07

Thanks for your comment, Denis - however I think you may misunderstand the purpose of that (admittedly inefficient) subquery. That subquery does not get a minimum ColIntA value, it gets the ColIntA value for the row that is specified by the primary key minColTargetPK.

WT_W 2010-09-06 07:19:03

Okay, I revised my answer.

Denis Valeev 2010-09-06 07:41:01

ansaurus

tags:

views:

answers:

Can this row-by-row T-SQL script be converted to a set-based script?

related questions