ansaurus

Question

How can I speed up update/replace operations in PostgreSQL?

Answer 1

A:

In your insert_or_replace. try this:

WHERE EXISTS(SELECT 1 FROM item WHERE key=NEW.key LIMIT 1)

instead of

WHERE EXISTS(SELECT 1 FROM item WHERE key=NEW.key)

As noted in comments, that will probably do nothing. All I have to add, then, is that you can always speed up INSERT/UPDATE performance by removing indexes. This will likely not be something you want to do unless you find your table is overindexed, but that should at least be checked out.

chaos 2009-06-07 17:39:09

Most probably that's unnecessary - excerpt from the docs (http://www.postgresql.org/docs/current/static/functions-subquery.html#AEN15270): "The subquery will generally only be executed far enough to determine whether at least one row is returned, not all the way to completion."

Milen A. Radev 2009-06-07 17:42:51

Ah, thanks. Didn't know how smart EXISTS was. Now I do. :)

chaos 2009-06-07 17:43:55

The key is unique, so it will only ever return one row. Nevertheless, I tried, and there was no noticeable change in performance either way. Thanks though!

Henrik Gustafsson 2009-06-07 17:54:32

Answer 2

A:

In Oracle, locking the table would definitely help. You might want to try that with PostgreSQL, too.

ammoQ 2009-06-07 18:05:02

I tried running all tests with the table locked in all transactions; no change.

Henrik Gustafsson 2009-06-07 18:45:56

Answer 3

+1 A:

I had a similar situation a few months ago and ended up getting the largest speed boost from a tuned chunk/transaction size. You may also want to check the log for a checkpoint warning during the test and tune appropriately.

BML 2009-06-07 18:55:14

I will certainly look for the checkpoint warnings, it looks very relevant; thanks!

Henrik Gustafsson 2009-06-07 19:19:34

Answer 4

+1 A:

Sounds like you'd see benefits from using WAL (Write Ahead Logging) with a UPS to cache your updates between disk writes.

wal_buffers This setting decides the number of buffers WAL(Write ahead Log) can have. If your database has many write transactions, setting this value bit higher than default could result better usage of disk space. Experiment and decide. A good start would be around 32-64 corresponding to 256-512K memory.

http://www.varlena.com/GeneralBits/Tidbits/perf.html

SpliFF 2009-06-07 19:59:32

Answer 5

+2 A:

The usual way I do these things in pg is: load raw data matching target table into temp table (no constraints) using copy, merge(the fun part), profit.

I wrote a merge_by_key function specifically for these situations:

http://mbk.projects.postgresql.org/

The docs aren't terribly friendly, but I'd suggest giving it a good look.

jwp 2009-06-07 20:01:59

The points of the general process being:load into temp to avoid any network round-trip costs and to avoid creating multiple cursors(portals) for each loaded row(yeah, it's fast with executemany, but COPY wtf-pwns-it wrt to efficiency).Use a merge function/process to avoid creating rules/trigger that alter the semantics of an insert statement. I've done it both ways, and I've always preferred the merge process because it's explicit.If the merge process is not efficient enough, you'll need to look at index suspension(recreation)/partitioning orhttp://pgfoundry.org/projects/pgbulkload/

jwp 2009-06-08 20:52:08

Answer 6

A:

I am working with a database of a million rows approx.. using python to parse documents and populate the table with terms.. The insert statements work fine but the update statements get extremely time consuming as the table size grows..

Is it normal for the update statement to take significantly longer than the insert statement? In my case updates are slowing the program many times

2009-07-03 14:35:13

I think this should be a new question

Henrik Gustafsson 2009-07-24 14:34:22

Answer 7

A:

For updates, you can lower your fillfactor for the tables and the indexes and that might help

http://www.postgresql.org/docs/current/static/sql-createtable.html

http://www.postgresql.org/docs/current/static/sql-createindex.html

mikelikespie 2010-08-19 10:34:02

ansaurus

tags:

views:

answers:

How can I speed up update/replace operations in PostgreSQL?

related questions