ansaurus

Question

How can I "think better" when reading a PostgreSQL query plan? (Example attached)

Answer 1

A:

Did you ANALYZE the tables? And what does pg_stats have to say about these tables? The query plan is based on the statistics, so these have to be accurate. And what version do you use? 8.4?

The costs are calculated from the statistics, the number of pages (relpages) and rows in each table, and the Planner Cost Constants set in postgresql.conf.
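As a rough illustration of how those inputs combine, the sketch below follows the sequential-scan estimate described in the PostgreSQL documentation: pages read times seq_page_cost, plus rows scanned times cpu_tuple_cost, plus one cpu_operator_cost per row for each simple filter condition. The constants are the stock defaults, and the table size (358 pages, 10,000 rows) is a made-up figure, not taken from the tables in the question.

```python
# Default planner cost constants from a stock postgresql.conf.
SEQ_PAGE_COST = 1.0
CPU_TUPLE_COST = 0.01
CPU_OPERATOR_COST = 0.0025


def seq_scan_cost(relpages, reltuples, filter_ops=0):
    """Estimated total cost of a sequential scan.

    relpages and reltuples correspond to the pg_class columns of the same
    name; filter_ops is the number of operator evaluations per row in the
    WHERE clause (an assumption: simple filter conditions only).
    """
    io_cost = relpages * SEQ_PAGE_COST
    cpu_cost = reltuples * (CPU_TUPLE_COST + filter_ops * CPU_OPERATOR_COST)
    return io_cost + cpu_cost


# Hypothetical table: 358 pages, 10,000 rows.
print(round(seq_scan_cost(358, 10000), 2))                # no filter
print(round(seq_scan_cost(358, 10000, filter_ops=1), 2))  # one filter condition
```

If the second number of a Seq Scan node in your plan is far from what this arithmetic gives for the table's real relpages and reltuples, the statistics or the cost constants are probably not what you think they are.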

work_mem is also involved: if it is set too low, it may push the planner toward a plan (a seqscan, for example) that kills performance.

Frank Heikens 2010-02-25 21:23:31

I did run `VACUUM FULL ANALYZE` on all of the tables involved, and this is PostgreSQL 8.4.

Evan Carroll 2010-02-25 21:56:19

Answer 2

+2 A:

Seeing through issues like this requires some experience of where things can go wrong. To find issues in a query plan, validate the produced plan from the inside out: check whether the row-count estimates are sane and whether the cost estimates match the time actually spent. By the way, the two cost estimates aren't lower and upper bounds. The first is the estimated cost to produce the first row of output, and the second is the estimated total cost. See the EXPLAIN documentation for details; there is also some planner documentation available. It also helps to know how the different access methods work. As a starting point, Wikipedia has information on nested loop, hash and merge joins.

In your example, you'd start with:

           ->  Seq Scan on options io  (cost=0.00..20223.32 rows=23004 width=36)
                 Filter: (name IS NULL)

Run `EXPLAIN ANALYZE SELECT * FROM options WHERE name IS NULL;` and see whether the number of rows returned matches the estimate. Being off by a factor of 2 isn't usually a problem; you're trying to spot order-of-magnitude differences.

Then check whether `EXPLAIN ANALYZE SELECT * FROM vehicles WHERE date_sold IS NULL;` returns the expected number of rows.
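The comparison of estimated and actual rows can be sketched as a small script that reads node lines from EXPLAIN ANALYZE output and flags order-of-magnitude misestimates. The sample lines are hypothetical: the "actual" figures, and the costs on the index scan line, are invented for illustration and are not results from the question's database. The threshold of 10 is likewise just one way to encode "order of magnitude".

```python
import re

# Matches the estimated and actual row counts on one EXPLAIN ANALYZE node line.
NODE_RE = re.compile(
    r"\(cost=[\d.]+\.\.[\d.]+ rows=(\d+) width=\d+\)"
    r" \(actual time=[\d.]+\.\.[\d.]+ rows=(\d+) loops=\d+\)"
)


def misestimate_factor(line):
    """Return how far off the row estimate is, as a ratio >= 1, or None."""
    match = NODE_RE.search(line)
    if match is None:
        return None
    # Clamp to 1 so a zero-row estimate or result does not divide by zero.
    estimated, actual = (max(int(g), 1) for g in match.groups())
    return max(estimated, actual) / min(estimated, actual)


# Hypothetical output lines; the "actual" numbers are made up.
lines = [
    "Seq Scan on options io  (cost=0.00..20223.32 rows=23004 width=36)"
    " (actual time=0.02..310.50 rows=21500 loops=1)",
    "Index Scan using options_pkey on options co  (cost=0.00..8.30 rows=1 width=20)"
    " (actual time=0.01..0.90 rows=100 loops=229)",
]

for line in lines:
    factor = misestimate_factor(line)
    verdict = "investigate" if factor >= 10 else "fine"
    print(f"{factor:8.1f}x  {verdict}  {line.split('  (')[0]}")
```

Working from the innermost nodes outward, the first node flagged this way is usually the place where the plan started to go wrong.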

Then go up one level to the hash join:

     ->  Hash Join  (cost=5301.58..29722.32 rows=229 width=40)
           Hash Cond: ((io.lot_id = iv.lot_id) AND ((io.vin)::text = (iv.vin)::text))

See if `EXPLAIN ANALYZE SELECT * FROM vehicles AS iv INNER JOIN options io ON (io.lot_id = iv.lot_id) AND ((io.vin)::text = (iv.vin)::text) WHERE iv.date_sold IS NULL AND io.name IS NULL;` results in 229 rows.

Going up one more level adds `INNER JOIN options co ON (co.fkey_style = iv.chrome_styleid) AND (co.code = io.code)`, which is expected to return only one row. This is probably where the issue is: if the actual number of rows goes from 1 to 100, the total cost estimate for traversing the inner loop of the containing nested loop is off by a factor of 100.

The underlying mistake the planner is making is probably that it assumes the two predicates for joining in co are independent of each other and multiplies their selectivities. In reality they may be heavily correlated, so the combined selectivity is closer to MIN(s1, s2) than to s1*s2.
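To see how large the gap between the two assumptions can be, here is a small worked example. All numbers are hypothetical (a million candidate rows, each predicate matching one row in a thousand), and `estimated_rows` is an illustrative helper, not a planner function.

```python
def estimated_rows(total_rows, s1, s2, correlated=False):
    """Rows expected to pass two predicates with selectivities s1 and s2."""
    if correlated:
        # Fully correlated: the more selective predicate implies the other.
        combined = min(s1, s2)
    else:
        # Independence assumption: selectivities are multiplied.
        combined = s1 * s2
    return total_rows * combined


# Hypothetical: 1,000,000 candidate rows, each predicate matches 1 in 1,000.
independent = estimated_rows(1_000_000, 0.001, 0.001)
correlated = estimated_rows(1_000_000, 0.001, 0.001, correlated=True)
print(round(independent), round(correlated), round(correlated / independent))
```

Under independence the estimate is about one row, while full correlation gives about a thousand, so a nested loop that looked cheap on paper would do roughly a thousand times more work on its inner side.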

Ants Aasma 2010-02-26 12:19:07

This is a great answer, but which one are you speaking of when you say `joining in co`? I believe `RhodiumToad`'s explanation of the problem because it sounds accurate. Are you explaining the same thing or something different?

Evan Carroll 2010-02-26 16:38:58

Same thing. Joining in `co` is the `Index Scan using options_pkey on options co` inner node of the nested loop join. It has two conditions that the planner, probably unrealistically, thinks will result in one row of output. If you run that query and see how many rows it really returns, you can verify whether this is the case. Bad estimates for correlated predicates are a known issue. There's some discussion about this on the performance list: http://archives.postgresql.org/pgsql-performance/2009-06/msg00055.php

Ants Aasma 2010-02-26 19:46:30
