I have a web application that uses a fairly large table (millions of rows, about 30 columns). Let's call that TableA. Among the 30 columns, this table has a primary key named "id", and another column named "campaignID".
As part of the application, users are able to upload new sets of data pertaining to new "campaigns".
These data sets have the same structure as TableA, but typically only about 10,000-20,000 rows.
Every row in a new data set will have a unique "id", but they'll all share the same campaignID. In other words, the user is loading the complete data for a new "campaign", so all 10,000 rows have the same "campaignID".
Usually, users are uploading data for a NEW campaign, so there are no rows in TableA with the same campaignID. Since the "id" is unique to each campaign, the id of every row of new data will be unique in TableA.
However, in the rare case where a user tries to load a new set of rows for a "campaign" that's already in the database, the requirement was to remove all the old rows for that campaign from TableA first, and then insert the new rows from the new data set.
So, my stored procedure was simple:
- BULK INSERT the new data into a temporary table (#TableB)
- Delete any existing rows in TableA with the same campaignID
- INSERT INTO TableA ([columns]) SELECT [columns] FROM #TableB
- Drop #TableB
This worked just fine.
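For context, that procedure boils down to something like the sketch below. The file path and the col1/col2 columns are just placeholders, not the real names (the real table has about 30 columns):

```sql
-- Rough sketch of the current procedure (option 1 behavior).
-- File path and col1/col2 are placeholders.
CREATE TABLE #TableB (id INT PRIMARY KEY, campaignID INT, col1 VARCHAR(50), col2 VARCHAR(50));

BULK INSERT #TableB
FROM 'C:\uploads\new_campaign.csv'   -- placeholder path
WITH (FIELDTERMINATOR = ',', ROWTERMINATOR = '\n');

-- Remove any existing rows for the campaign(s) being loaded
DELETE FROM TableA
WHERE campaignID IN (SELECT DISTINCT campaignID FROM #TableB);

-- Insert the new data
INSERT INTO TableA (id, campaignID, col1, col2)
SELECT id, campaignID, col1, col2
FROM #TableB;

DROP TABLE #TableB;
```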
But the new requirement is to give users 3 options when they upload new data for handling "duplicates" - instances where the user is uploading data for a campaign that's already in TableA.
- Remove ALL data in TableA with the same campaignID, then insert all the new data from #TableB. (This is the old behavior. With this option, there will never be duplicates.)
- If a row in #TableB has the same id as a row in TableA, then update that row in TableA with the row from #TableB. (Effectively, this "replaces" the old data with the new data.)
- If a row in #TableB has the same id as a row in TableA, then ignore that row in #TableB. (Essentially, this preserves the original data and ignores the new data.)
A user doesn't get to choose this on a row-by-row basis. She chooses how the data will be merged, and this logic is applied to the entire data set.
In a similar application I worked on that used MySQL, I used the "LOAD DATA INFILE" statement with the "REPLACE" or "IGNORE" option, but I don't know how to do the equivalent with SQL Server/T-SQL.
Any solution needs to be efficient enough to handle the fact that TableA has millions of rows, and #TableB (the new data set) may have 10k-20k rows.
I googled and found the "MERGE" statement (which seems to be supported in SQL Server 2008), but I only have access to SQL Server 2005.
In rough pseudocode, I need something like this:
If user selects option 1: [I'm all set here - I have this working]
If user selects option 2 (replace):

```
merge into TableA as Target
using #TableB as Source
    on Target.id = Source.id
when matched then
    update the row in TableA with the row from #TableB
when not matched then
    insert the row from #TableB into TableA
```
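Since MERGE isn't available to me, I'm guessing option 2 would end up as an UPDATE joined to the temp table, followed by an INSERT of the rows that don't match. Something like the sketch below (col1/col2 are placeholders again), though I don't know if this is the right or most efficient way:

```sql
-- Option 2 (replace) without MERGE: sketch only; col1/col2 are placeholders

-- Update rows that already exist in TableA with the values from #TableB
UPDATE a
SET a.campaignID = b.campaignID,
    a.col1 = b.col1,
    a.col2 = b.col2
FROM TableA AS a
INNER JOIN #TableB AS b ON a.id = b.id;

-- Insert the rows from #TableB whose id isn't in TableA yet
INSERT INTO TableA (id, campaignID, col1, col2)
SELECT b.id, b.campaignID, b.col1, b.col2
FROM #TableB AS b
WHERE NOT EXISTS (SELECT 1 FROM TableA AS a WHERE a.id = b.id);
```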
If user selects option 3 (preserve):

```
merge into TableA as Target
using #TableB as Source
    on Target.id = Source.id
when matched then
    do nothing (keep the existing row in TableA)
when not matched then
    insert the row from #TableB into TableA
```
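For option 3, I imagine it's just the insert of rows whose id isn't already in TableA (again, this is just my guess at the approach, with placeholder columns):

```sql
-- Option 3 (preserve) without MERGE: only insert ids not already in TableA
INSERT INTO TableA (id, campaignID, col1, col2)
SELECT b.id, b.campaignID, b.col1, b.col2
FROM #TableB AS b
WHERE NOT EXISTS (SELECT 1 FROM TableA AS a WHERE a.id = b.id);
```

Is something along these lines the right approach in SQL Server 2005, and will it perform acceptably when TableA has millions of rows and #TableB has 10k-20k?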