I'm querying to return all rows from a table except those that are in some list of values that is constant at query time, e.g. `SELECT * FROM table WHERE id IN (%)`, where `%` is guaranteed to be a list of values, not a subquery. However, this list of values may be up to 1000 elements long in some cases. Should I limit this to a smaller sublist (50-100 elements is as low as I can go in this case), or will the performance gain be negligible?
Use a temporary table and JOIN against it; that gives better performance and has no limits. An `IN()` with 1000 arguments will give you problems in just about any database.
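A minimal sketch of that approach, assuming Postgres; `big_table` and the literal ids are placeholders:

```sql
-- Temp table holding the lookup keys; dropped automatically at session end.
CREATE TEMPORARY TABLE lookup_ids (id integer PRIMARY KEY);

-- Load the constant list (for 1000 values, use a multi-row INSERT or COPY).
INSERT INTO lookup_ids (id) VALUES (1), (2), (3);

-- Refresh planner statistics so it knows the temp table's size.
ANALYZE lookup_ids;

-- JOIN instead of IN(); the planner can choose a hash or merge join.
SELECT t.*
FROM big_table t
JOIN lookup_ids l ON l.id = t.id;
```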
I assume it's a large table; otherwise it wouldn't matter much.
Depending on table size and the number of keys, this may turn into a sequential scan. With many `IN` keys, Postgres often chooses not to use an index for them; the more keys, the bigger the chance of a sequential scan.
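You can check which plan you are actually getting with `EXPLAIN`; a quick sketch (`big_table` is a placeholder name):

```sql
-- Look for "Seq Scan" vs. "Index Scan" / "Bitmap Heap Scan" in the output.
EXPLAIN
SELECT * FROM big_table
WHERE id IN (1, 2, 3 /* ..., up to 1000 values */);
```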
If you use another indexed column in `WHERE`, like:

```sql
select * from table where id in (%) and my_date > '2010-01-01';
```
It's likely to fetch all rows matching the indexed column (`my_date`) and then perform an in-memory scan on them.
Using a `JOIN` against a persistent or temporary table may help, but does not have to. It will still need to locate all the rows, either with a nested loop (unlikely for large data) or with a hash/merge join.
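In Postgres you can also join against an inline `VALUES` list without creating any table; a sketch with placeholder names:

```sql
-- The VALUES list acts like a small inline table; the planner can
-- hash-join it against big_table much like a temp table.
SELECT t.*
FROM big_table t
JOIN (VALUES (1), (2), (3)) AS v(id) ON v.id = t.id;
```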
I would say the solution is:
- Use as few `IN` keys as possible.
- Use other criteria for indexing and querying whenever possible. If `IN` requires an in-memory scan of all rows, at least there will be fewer of them thanks to the additional criteria (see the sketch below).
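A minimal sketch of that second point, assuming a `my_date` column as in the earlier example; the index name and cutoff date are illustrative:

```sql
-- Index the additional criterion so the planner can narrow the row set
-- before applying the IN filter.
CREATE INDEX big_table_my_date_idx ON big_table (my_date);

-- Rows are first restricted via the my_date index; the IN list is then
-- checked against a (hopefully much smaller) set of rows in memory.
SELECT *
FROM big_table
WHERE my_date > '2010-01-01'
  AND id IN (1, 2, 3 /* ... */);
```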