ansaurus

Question

Quickly select all rows with "1 or more" matching rows in another table

Answer 1

A:

Does MYSQL support the TOP N syntax? If so:

SELECT TOP 1 identity.id FROM identity
INNER JOIN task ON
  task.identityid=identity.id
  AND task.groupid IN (78, 122, 345, 12, 234, 778, 233, 123, 33)

1800 INFORMATION 2009-05-26 02:13:58

the mysql syntax (instead of TOP 1 right after SELECT) would be to add ORDER BY identity.id DESC LIMIT 1 at the end of the query - but either TOP or LIMIT produce a single-row answer which seems quite different from what the question requests.

Alex Martelli 2009-05-26 02:19:28

Answer 2

+1 A:

Just using "SELECT DISTINCT" with what you have should be efficient in mysql. You may need to put your values in a table and join to it, rather than using "IN ( ... )".

le dorfier 2009-05-26 02:14:39

When I use 'DISTINCT' it shows 'Using temporary table'. It still seems pretty fast for my simplified tests, but doesn't that add a fair amount of overhead which could catch up with me? Are temporary tables for DISTINCT ever fast/in-memory?

thomasrutter 2009-05-26 02:17:14

Note the change. Mysql does like to do temp tables, but usually fairly efficiently. The WHERE EXISTS strategy is usally the most frequent cross-server recommendation, and should also work. (WHERE ... IN ( ... ) just makes me shudder - it usually means an automatic UNION.)

le dorfier 2009-05-26 02:22:36

Answer 3

A:

Exists should perform just fine for you, as long as the column you are comparing in the subquery is indexed.

I would expect that the exists would perform just a little better than a join-and-group-by, but I would have to try it out to be sure. I've run across enough performance stuff in MySQL where my prediction was wrong to know it's worth giving it a try.

MBCook 2009-05-26 02:18:15

I gave it a try, and EXPLAIN showed that it joined the identity table before the task table, so it executed the subquery for every row in the identity table. This isn't the order I want, but it's hard to say whether this is just because the test data I have is so small - maybe it would join it the other way with lots more identities. I'll have to test with a large amount of data to find out!

thomasrutter 2009-05-26 13:03:01

I guess I may also have misunderstood how EXPLAIN shows join order for subqueries...

thomasrutter 2009-05-26 13:06:35

ansaurus

tags:

views:

answers:

Quickly select all rows with "1 or more" matching rows in another table

related questions