ansaurus

Question

Answer 1

+2 A:

MySQL is not very good in optimizing IN conditions.

The condition in your first query cannot be easily rewritten as EXISTS, that's why MySQL checks the results for each row.

If you want to select company_id's that are mentioned more than once in tag 18, it's better to rewrite this query as such:

SELECT  tags.*, COUNT(company_id) AS count
FROM    company2tag ct
JOIN    tags
ON      tags.id = ct.tag_id
WHERE   ct.tag_id <> 18
        AND NOT EXISTS
        (
        SELECT  NULL
        FROM    company2tag cti
        WHERE   cti.tag_id = 18
                AND cti.company_id = ct.company_id
        LIMIT 1, 1
        )
GROUP BY
        ct.tag_id
ORDER BY
        count DESC

The main idea here is that you don't need to COUNT(*): it's enough just to check that at least two values exist.

See this article in my blog for the similar problem:

Counting bans

Having the following index:

CREATE INDEX ix_company2tag_tag_company_id ON company2tag (tag_id, company_id)

will greatly improve this query.

Quassnoi 2009-07-29 13:55:07

Answer 2

+3 A:

First of all, you may be experiencing problems from the first query because you have two tables aliased to ct... one in the outer query, one in the sub-query.

Secondly, you can rewrite the IN as a JOIN:

SELECT tags.*, COUNT(ct.company_id) AS count
FROM company2tag ct
INNER JOIN tags ON tags.id = ct.tag_id
INNER JOIN (
    SELECT company_id FROM company2tag
    WHERE tag_id = 18
    GROUP BY company_id
    HAVING COUNT(company_id) = 1
) ctf ON ct.company_id = ctf.company_id
WHERE tags.id != 18
GROUP BY ct.tag_id
ORDER BY count DESC
LIMIT 5;

Note that I haven't actually tested this.

R. Bemrose 2009-07-29 14:14:13

+1 for mentioning rewriting the IN as a JOIN. IN is supposed to be for matching against a short list. If you have a larger list based on other tables/queries it should be a JOIN.

Jason S 2009-07-29 14:21:19

Thanks for this! Your solution is about 10 times faster and does not block the server.

smoove666 2009-08-03 08:30:41

ansaurus

tags:

views:

answers:

Mysql Sub-Select Problem

related questions