ansaurus

Question

How to debug differences in postgresql TRUNCATE times?

Answer 1

+1 A:

TRUNCATE needs a lock, setting this lock, might take some time because of other transactions. Try pg_locks to see what's going on.

Frank Heikens 2010-07-19 18:47:08

Is there a way to log lock events instead of polling for them with pg_locks? The case where it takes < 100ms it's difficult to get an accurate reading of how many locks it has. The case where it takes longer, there are over 70 locks at peak.

keturn 2010-07-19 20:47:24

Set a manual LOCK with NO WAIT if you don't want to wait: http://www.postgresql.org/docs/8.4/static/sql-lock.html

Frank Heikens 2010-07-19 20:54:17

Adjust log_lock_waits if you want to, as the name says, log lock waits: http://www.postgresql.org/docs/current/static/runtime-config-logging.html

Greg Smith 2010-07-25 01:45:31

Answer 2

A:

truncate needs to lock the whole table. If there are any transactions running, it needs to wait for them to finish. Another side effect which is not evidenced here is that its a single bottleneck preventing any new transactions which need this table.

When it's a live system with dozens or hundreds of transactions using this table, this bottleneck itself may be an important issue. delete only locks single rows, so it's faster in many concurrent/live environments.

I'm not sure what you need this for, but you may want to build new "version" of data for this table to a temporary table, then (to keep lock/update time as short as possible) push it to the live table with delete + insert as select:

begin;

create temp table my_data on commit drop as
---... lengthy calculation here;

delete from data;

insert into data select * from my_data;

commit;

Konrad Garus 2010-07-20 06:34:20

I edited my question to provide a little more context; the timings I gave are not from an application with lots of concurrency, they're from an idle system.

keturn 2010-07-20 16:45:14

Now that's weird. I would research the immediate locks Frank suggested or try asking at PG mailing lists.

Konrad Garus 2010-07-21 06:36:29

Answer 3

+1 A:

The way TRUNCATE works in PostgreSQL, it's very sensitive to how fast your filesystem can delete blocks, as well as whether it correctly honors the fsync system call when you write to flush the write cache out. My guess is that you have different filesystem setups on the two systems. For example, if the Lucid install is using ext4 and the Karmic one ext3, this is unsurprising behavior. Newer kernels will correctly turn fsync calls into disk cache flushing via write barriers; older ones let the drives lie to them about things being written. This is a good thing in terms of keeping the database writes safe during a crash, but performance drops a lot when the kernel does the right thing from a reliability perspective.

Greg Smith 2010-07-25 01:46:19

This was it. Turning `fsync = off` on the slow system makes it as fast as the other one. Maybe virtualbox doesn't really do fsync or something.

keturn 2010-07-26 17:07:44

ansaurus

tags:

views:

answers:

How to debug differences in postgresql TRUNCATE times?

related questions