I'm writing code to batch-import a large number of rows into the database.
Currently I bulk copy the raw data (from a .csv file) into a staging table, so that all of it is on the database side. That leaves me with a staging table full of rows identifying 'contacts', which now need to be moved into other tables of the database.
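For reference, the staging load itself is a straight bulk copy. A minimal sketch, assuming BULK INSERT with an illustrative file path and options (and ignoring how the batch @GUID gets stamped onto the staging rows):

-- Sketch only: the file path, terminators and FIRSTROW are illustrative.
BULK INSERT ContactImportStaging
FROM 'C:\imports\contacts.csv'
WITH
(
    FIRSTROW = 2,          -- skip the CSV header row
    FIELDTERMINATOR = ',',
    ROWTERMINATOR = '\n',
    TABLOCK                -- table lock for a faster, minimally logged load
);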
Next I copy over the rows from the staging table that aren't already in the contacts table; for the ones that are already there, I update the column named "GroupToBeAssignedTo", which marks them for a later operation.
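The insert half is roughly this shape (a sketch; the exact column list is illustrative, and UserID + EmailAddress is what I'm treating as the key):

-- Sketch: insert staging rows for this batch that aren't already in Contacts.
INSERT INTO Contacts (UserID, EmailAddress, GroupToBeAssignedTo)
SELECT t2.UserID, t2.EmailAddress, t2.GroupToBeAssignedTo
FROM ContactImportStaging t2
WHERE t2.GUID = @GUID
AND NOT EXISTS
(
    SELECT 1
    FROM Contacts t1
    WHERE t1.UserID = t2.UserID
      AND t1.EmailAddress = t2.EmailAddress
);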
I have a feeling I'm going about this wrong. The update query below isn't efficient, and I'm looking for advice on how I could do this better.
-- Update existing contacts with the group they should be assigned to,
-- unless that contact/group mapping already exists.
UPDATE t1
SET t1.GroupToBeAssignedTo = t2.GroupToBeAssignedTo
FROM Contacts t1
INNER JOIN ContactImportStaging t2
    ON t1.UserID = t2.UserID
    AND t1.EmailAddress = t2.EmailAddress
    AND t2.GUID = @GUID
WHERE NOT EXISTS
(
    SELECT 1    -- EXISTS only checks for a row, so no columns are needed
    FROM ContactGroupMapping
    WHERE GroupID = t2.GroupToBeAssignedTo
      AND ContactID = t1.ID
);
Might it be better to just import all the rows without checking for duplicates first and then 'clean' the data afterwards? I'm looking for suggestions on where I'm going wrong. Thanks.
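For example, would a single MERGE be a cleaner way to fold the insert and the update into one statement? A sketch, using the same tables and keys as above, and ignoring the ContactGroupMapping check for brevity:

-- Sketch: one statement that inserts new contacts and updates existing ones.
MERGE Contacts AS t1
USING
(
    SELECT UserID, EmailAddress, GroupToBeAssignedTo
    FROM ContactImportStaging
    WHERE GUID = @GUID
) AS t2
ON t1.UserID = t2.UserID AND t1.EmailAddress = t2.EmailAddress
WHEN MATCHED THEN
    UPDATE SET GroupToBeAssignedTo = t2.GroupToBeAssignedTo
WHEN NOT MATCHED THEN
    INSERT (UserID, EmailAddress, GroupToBeAssignedTo)
    VALUES (t2.UserID, t2.EmailAddress, t2.GroupToBeAssignedTo);  -- MERGE must end with a semicolon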
EDIT: To clarify, the question is about MS SQL Server.