views: 252
answers: 5

Hi,

I have a table with a lot of records (possibly more than 500,000 or 1,000,000). I added a new column to this table, and I need to fill in a value for every row, based on the corresponding value of another column in the same table.

I tried using separate transactions, selecting the next chunk of 100 records each time and updating their values, but it still takes hours to update all the records in Oracle 10, for example.

What is the most efficient way to do this in SQL, without using any dialect-specific features, so that it works everywhere (Oracle, MSSQL, MySQL, PostgreSQL, etc.)?

ADDITIONAL INFO: There are no calculated fields. There are indexes. I was using generated SQL statements that update the table row by row.

+8  A: 

The usual way is to use UPDATE:

UPDATE mytable
   SET new_column = <expr containing old_column>

You should be able to do this in a single transaction.
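
For example, assuming the existing column is called old_column and the new value is simply derived from it (the UPPER() expression is just a stand-in for whatever your real derivation is), the whole table can be filled in one statement:

UPDATE mytable
   SET new_column = UPPER(old_column);
COMMIT;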

Marcelo Cantos
It sounds like the OP knows how to do this in a single transaction, but there is a performance problem, so he tried to batch it into separate transactions.
Tim Drisdelle
That is possible, but it is extraordinary that 1M rows should take so long to update a single column. It's also possible that the OP is updating one record at a time, either through a lack of understanding of set operations, or because they are trying to compute the new value in client code (either out of necessity or, again, because of a lack of understanding). Whatever the case, I'll be able to update my answer if the OP indicates which of the foregoing cases applies to them.
Marcelo Cantos
Fair enough. I agree that more information is needed.
Tim Drisdelle
OP: If it's a performance problem, do it at a quiet time. If your DBMS can't handle an update of a million rows, it's time to start looking at a new DBMS :-)
paxdiablo
Thank you all for the fast response. I left out the fact that I'm using generated SQL statements. I've now looked deeper into it, and it turns out the generated SQL updates the table row by row! So any attempt to split the work into chunks of 100 records was meaningless... I'll change the code to generate a proper SQL UPDATE statement, like the one shown here.
m_pGladiator
Nice! That's an epic fail for the generated SQL. Good work on the solution Marcelo.
Tim Drisdelle
+2  A: 

You could drop any indexes on the table, then run your update, and then recreate the indexes.
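
A rough sketch of that approach (the index name is made up for illustration, UPPER() is a stand-in expression, and DROP INDEX syntax varies slightly between DBMSs):

DROP INDEX idx_mytable_old;

UPDATE mytable
   SET new_column = UPPER(old_column);
COMMIT;

CREATE INDEX idx_mytable_old ON mytable (old_column);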

Tim Drisdelle
+1. It was only a matter of time before someone suggested this, but yeah, for 10M or more rows you can do it, as long as the drop and rebuild are done quickly.
Guru
For the love of whatever gods you worship, _do this at a quiet time_. Otherwise your users will track you down, torture you, kill you, quarter you, tar and feather the remains then burn them and spit on your charred body parts. At a minimum. They'll probably do far worse.
paxdiablo
Sounds like the groans of a DBA who has been sacrificed on the altar of performance...
Tim Drisdelle
If the new column is not indexed, removing indexes on the table will be useless (not that it matters, since rebuilding a 1M-row index will not take much time).
Vincent Malgrat
Agreed. I already asked OP to provide more details. So much solution guessing right now.
Tim Drisdelle
A: 

Might not work for you, but here's a technique I've used a couple of times in the past in similar circumstances.

Create updated_{table_name}, then INSERT ... SELECT into this table in batches. Once finished (and this hinges on Oracle, which I don't know or use, supporting the ability to rename tables atomically), updated_{table_name} becomes {table_name}, while {table_name} becomes original_{table_name}.

Last time I had to do this was for a heavily indexed table with several million rows that absolutely positively could not be locked for the duration needed to make some serious changes to it.
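
Roughly, in Oracle syntax (table and column names are placeholders, UPPER() is a stand-in expression, the sketch assumes the new column has not already been added to the original table, and batching of the copy plus recreation of indexes, constraints and grants is omitted):

CREATE TABLE updated_mytable AS
SELECT t.*,
       UPPER(t.old_column) AS new_column
  FROM mytable t;

ALTER TABLE mytable RENAME TO original_mytable;
ALTER TABLE updated_mytable RENAME TO mytable;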

David
+2  A: 

As Marcelo suggests:

UPDATE mytable
SET new_column = <expr containing old_column>;

If this takes too long and fails due to "snapshot too old" errors (e.g. if the expression queries another highly-active table), and if the new value for the column is always NOT NULL, you could update the table in batches:

UPDATE mytable
SET new_column = <expr containing old_column>
WHERE new_column IS NULL
AND ROWNUM <= 100000;

Just run this statement, COMMIT, then run it again; rinse, repeat until it reports "0 rows updated". It'll take longer but each update is less likely to fail.
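
In Oracle, that rinse-and-repeat cycle can also be wrapped in a small PL/SQL block, sketched here with a stand-in UPPER() expression:

BEGIN
  LOOP
    UPDATE mytable
       SET new_column = UPPER(old_column)
     WHERE new_column IS NULL
       AND ROWNUM <= 100000;
    EXIT WHEN SQL%ROWCOUNT = 0;   -- nothing left to update
    COMMIT;
  END LOOP;
  COMMIT;
END;
/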

Jeffrey Kemp
I think this is a great idea for very large and heavily used tables! I haven't had such failures yet, but +1 from me :)
m_pGladiator
A: 

What is the database version? Check out virtual columns in 11g:

Adding Columns with a Default Value http://www.oracle.com/technology/pub/articles/oracle-database-11g-top-features/11g-schemamanagement.html
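
For illustration only (column names and types here are made up): in 11g, adding a column with a NOT NULL default is a fast, metadata-only change, and a virtual column can derive its value from another column without ever storing it:

ALTER TABLE mytable ADD (status_flag VARCHAR2(1) DEFAULT 'N' NOT NULL);

ALTER TABLE mytable ADD (derived_column AS (UPPER(old_column)));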

Stellios