ansaurus

Question

Answer 1

+10 A:

Did you forget something from your actual query? It's missing ANSI-89 joins between the three tables, giving you a cartesian product but only pulling out the POINTS records.

OMG Ponies 2010-06-08 00:44:28

Not only that but `AND c.user_id=6` is doing nothing, since no results from `contacts` are returned.

VeeArr 2010-06-08 00:46:23

2010-06-08 00:48:45

@user315975: I dunno your data, but don't include tables if they serve absolutely no purpose. Still need to know how `POINTS` and `CONTACTS` relate...

OMG Ponies 2010-06-08 00:50:42

@user315975: It'll probably be more worthwhile to analyze the performance of a query that makes sense.

Thanatos 2010-06-08 01:19:19

Yep! Whats happening is he is getting all the possuble permutations of the points, areas and contacts rows, which are then being sorted to remove the duplicates as directed by the "DISTINCT" clause.

James Anderson 2010-06-08 02:09:14

Not enough sleep, and some bad thinking on my part. Made the assumption that naming the tables would not generate the joins unless they were included in the clauses that relate them. Problem solved. Will post completed code if any are interested.

2010-06-08 11:45:33

Answer 2

+5 A:

You're joining three tables, p, a, and c, but you aren't specifying how to attach them together. What you're getting is a full Cartesian join between all of the rows in all of the tables that match the criteria, then everything in areas.

You probably want to attach something in points to something in areas. And something in contacts with ... well, I don't know what your schema looks like.

Try sticking an "EXPLAIN" at the beginning for information on what's happening.

Charles 2010-06-08 01:02:19

Indeed. You might only have 2000 records in points, but if you have 2000 in areas and 2000 in contacts as well, you're generating 2000 * 2000 * 2000 = 8 billion rows, then sorting them back into distinct.

Cowan 2010-06-08 05:14:32

Answer 3

+2 A:

Probably you are missing the joins. Joining the table would be something like this.

SELECT DISTINCT p.* 
  FROM points p
  JOIN areas a p ON  a.FkPoint = p.id
  JOIN contacts c ON c.FkArea = a.id
 WHERE (    p.latitude > 43.6511659465 
        AND p.latitude < 43.6711659465 
        AND p.longitude > -79.4677941889 
        AND p.longitude < -79.4477941889) 
   AND p.resource_type = 'Contact' 
   AND c.user_id = 6

For better indexes on coordinates use Quadtree or R-Tree index implementation.

If you intentionally did not miss the joins, try a subquery like this.

select DISTINCT thePoints.*
(   
    SELECT DISTINCT p.* 
    FROM points p
    WHERE (     p.latitude > 43.6511659465 
            AND p.latitude < 43.6711659465 
            AND p.longitude > -79.4677941889 
            AND p.longitude < -79.4477941889) 
    AND p.resource_type = 'Contact' 
) as thePoints
, areas, contacts
WHERE  c.user_id = 6

2010-06-08 01:32:36

Answer 4

A:

You need a rtree index and use the @ operator, normal index won't work.

R-Tree http://www.postgresql.org/docs/8.1/static/indexes-types.html

@ operator http://www.postgresql.org/docs/8.1/static/functions-geometry.html

J-16 SDiZ 2010-06-08 02:04:23

R-tree indices don't exist 8.3+.

rfusca 2010-06-08 02:42:19

Well, GiST indices (which implement R-trees for the geom types, I think)

araqnid 2010-06-08 09:48:39

ansaurus

tags:

views:

answers:

Why is this postgresql query so slow?

related questions