ansaurus

Question

Help in optimizing a query for a Postgresql database

Answer 1

A:

One thing that I thought ~might~ make it faster would be to crate a temporary table with a column for the date range and a column for the user id's. Then rewrite that query using a JOIN to that table rather than putting those numbers in the query itself. Does anyone know if that would work?

That would be the approach I would take. It will also make the query clearer. You can add indexes to the temp table too, though you should do this after filling it with data. Don't assume you need an index though - test.

Oh - you might want to store timestamps rather than dates (it'll save casting) and perhaps an index on the "timestamp" column in your answers table.

PS - generally considered better not to name columns the same as built-in types. Even if the database doesn't get confused a human reader can.

Richard Huxton 2009-10-30 10:32:34

Answer 2

A:

First, I'd suggest you do add a coarse filter that would use the indexes on usercontextid and timestamp:

SELECT  COUNT(uapl.id) AS numAnswered,
        SUM(CASE WHEN (a.correct OR q.survey OR uapl.answersId IS NULL) THEN 1 ELSE 0 END) AS numCorrect
FROM    questions q
JOIN    usersAnswersProgramsLink uapl
ON      uapl.questionsId = q.id
LEFT JOIN
        answers a
ON      a.id = uapl.answersId
WHERE   programsId=123
        AND timestamp >= '2009-09-01'
        AND timestamp < '2009-09-22'
        AND usercontextid IN (/* all possible values here */)
        AND 
(
  (
    CAST(timestamp AS date) >= '2009-09-01'
    AND CAST(timestamp AS date) <= '2009-09-21'
    AND usercontextid in('123','234','345','465','567')
  )
  OR
  (
    CAST(timestamp AS date) >= '2009-09-10'
    AND CAST(timestamp AS date) <= '2009-09-21'
    AND usercontextid in('321','432','543')
  )
  OR
  (
    CAST(timestamp AS date) >= '2009-09-16'
    AND CAST(timestamp AS date) <= '2009-09-21'
    AND usercontextid in('987','876')
  )
)

You also need to clarify which tables do all these field belong to.

Quassnoi 2009-10-30 10:40:51

Answer 3

+2 A:

as already mentioned: please provide the result of EXPLAIN ANALYZE <query> as well as table structures and created indexes, without that it will be difficult to help

a index on timestamp::date could help ( a index on timestamp would not be used because of the cast)

you could also post the explain analyze output into http://explain.depesz.com/ which will highlight the problematic places in the execution plan

pfote 2009-10-31 11:02:25

Yes, http://explain.depesz.com/ is great.

bortzmeyer 2009-11-02 21:50:44

ansaurus

tags:

views:

answers:

Help in optimizing a query for a Postgresql database

related questions