ansaurus

Question

inserting rows from one table to another, which sql is more efficient (outer join vs sequential scan)

Answer 1

+4 A:

To add another option:

INSERT INTO A (x, y, z)
SELECT B.x, B.y, B.z
FROM B
WHERE NOT EXISTS(SELECT * FROM A WHERE A.id = B.id)

I usually go with the LEFT JOIN approach. But, if you want to know truly what is most efficient, run some tests on your environment. See what the execution plans for each approach are (you may find that multiple approaches actually result in the same execution plan).

AdaTheDev 2009-12-03 22:22:38

+1, the best answer IMHO. I'd use exists

gbn 2009-12-05 10:10:23

Answer 2

A:

It shouldn't matter - a good optimizer will treat these identically. In practice, I have seen to quirky execution plans in exactly this case, but I have been known to use both styles interchangeably, depending on mood, readability and complexity of the query.

In SQL Server, option A is not available when you need to JOIN on a tuple of more thana a single column without using some kind of concatenation workaround (which I do not recommend), which brings us to cat-skinning option C (which I also use, expecially with the joins are really squirrely), which extends to tuples directly:

INSERT INTO A (x, y, z) 
SELECT x, y, z 
FROM B b 
WHERE NOT EXISTS (SELECT * FROM A WHERE id = b.id); 

INSERT INTO A (x, y, z) 
SELECT x, y, z 
FROM B b 
WHERE NOT EXISTS (SELECT * FROM A WHERE id1 = b.id1 AND id2 = b.id2);

Cade Roux 2009-12-03 22:24:30

Answer 3

A:

I think option B is better, especially if Table A is bigger than Table B by a factor > 1.

If you have indexes on a.id and b.id then joining will be faster, IMHO, than using where for each row...

Leon 2009-12-03 22:26:28

But it depends on the optimiser - if the optimiser does a good job, they'll probably come out the same

AdaTheDev 2009-12-03 22:31:55

I agree about the optimizer, but it wouldn't hurt to help him a little bit :)

Leon 2009-12-03 22:33:28

Answer 4

A:

Depending on the number of rows and the activity on the database, it would help a lot to drop all indexes on the table before the insert and recreate them afterwards.

edoloughlin 2009-12-03 22:52:57

ansaurus

tags:

views:

answers:

inserting rows from one table to another, which sql is more efficient (outer join vs sequential scan)

related questions