ansaurus

Question

UPDATE FROM with a JOIN (Large Table Performance) Postgresql?

Answer 1

A:

I found an alternate way to write the query that allowed the pgsql optimizer to build the query much more efficiently:

UPDATE order_item_imprint SET item_new_id = oi.item_new_id 
FROM order_item oi where oi.item_number = order_item_imprint.item_id

apparently that join was redundant and the following execution plan could have been used:

Hash Join  (cost=1.38..5.73 rows=48 width=1407)
  Hash Cond: ((order_item_imprint.item_id)::text = (oi.item_number)::text)
  ->  Seq Scan on order_item_imprint  (cost=0.00..3.63 rows=63 width=1399)
  ->  Hash  (cost=1.17..1.17 rows=17 width=23)
        ->  Seq Scan on order_item oi  (cost=0.00..1.17 rows=17 width=23)

Martin Dale Lyness 2010-07-09 14:59:51

Answer 2

+1 A:

I found an alternate way to write the query that allowed the pgsql optimizer to build the query much more efficiently

Actually, what you did was remove the unconstrained self-join on order_item_inprint.

If you look at the first line you'll see the following row estimate:

rows=1432184234121

That's 1.4 billion updates it is trying to do. When you aliased order_item_inprint in the join it got treated as a separate table from the update target.

Richard Huxton 2010-07-10 09:46:02

ansaurus

tags:

views:

answers:

UPDATE FROM with a JOIN (Large Table Performance) Postgresql?

related questions