I am having trouble optimizing a relatively simple query involving a GROUP BY, ORDER BY, and LIMIT. The table has just over 300,000 records. Here's the schema (I added some extra indexes to experiment with):

CREATE TABLE `scrape_search_results` (
  `id` int(11) NOT NULL auto_increment,
  `creative_id` int(11) NOT NULL,
  `url_id` int(11) NOT NULL,
  `access_date` datetime NOT NULL,
  PRIMARY KEY  (`id`),
  KEY `creative_url_index` (`creative_id`,`url_id`),
  KEY `access_date_index` (`access_date`),
  KEY `access_date_creative_id_index` (`access_date`,`creative_id`),
  KEY `creative_id_access_date_index` (`creative_id`,`access_date`),
  KEY `test_index` USING HASH (`creative_id`)
) ENGINE=MyISAM AUTO_INCREMENT=4252725 DEFAULT CHARSET=latin1

In the table, a single creative_id may appear many (often hundreds of) times. The query I am trying to answer is a relatively simple one: give me the first 20 creative_ids ordered by their latest access_date. Here's my SQL:

SELECT `ScrapeSearchResult`.`creative_id`, 
        MAX(`ScrapeSearchResult`.`access_date`) AS `latest_access_date` 
FROM `scrape_search_results` AS `ScrapeSearchResult` 
WHERE 1 = 1 
GROUP BY `ScrapeSearchResult`.`creative_id` 
ORDER BY `latest_access_date` DESC 
LIMIT 20;

Here are the results of executing this query; note that the 20th-largest latest_access_date is 2010-08-23 11:03:25:

+-------------+---------------------+
| creative_id | latest_access_date  |
+-------------+---------------------+
|         550 | 2010-08-23 11:07:49 | 
|        4568 | 2010-08-23 11:07:49 | 
|         552 | 2010-08-23 11:07:49 | 
|        2109 | 2010-08-23 11:07:49 | 
|        5221 | 2010-08-23 11:07:49 | 
|        1544 | 2010-08-23 11:07:49 | 
|        1697 | 2010-08-23 11:07:49 | 
|         554 | 2010-08-23 11:07:12 | 
|         932 | 2010-08-23 11:05:48 | 
|       11029 | 2010-08-23 11:05:37 | 
|       11854 | 2010-08-23 11:05:27 | 
|       11856 | 2010-08-23 11:05:05 | 
|         702 | 2010-08-23 11:03:56 | 
|        4319 | 2010-08-23 11:03:56 | 
|        7159 | 2010-08-23 11:03:56 | 
|       10610 | 2010-08-23 11:03:46 | 
|        5540 | 2010-08-23 11:03:46 | 
|           1 | 2010-08-23 11:03:46 | 
|       11942 | 2010-08-23 11:03:35 | 
|        7900 | 2010-08-23 11:03:25 | 
+-------------+---------------------+

If I were going to write this algorithm by hand, I would build a B-tree ordered on (access_date, creative_id). I'd start at MAX(access_date) and keep walking the tree until I had found 20 unique creative_ids, which I would then return in the order I found them.

Using that algorithm, I would need to consider just 94 rows (there are 94 rows for which access_date >= 2010-08-23 11:03:25, which is our 20th largest access_date as shown above).
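That walk can be approximated in plain SQL with a greatest-per-group anti-join (a sketch only; I have not benchmarked it against this table): a row survives only if no newer row exists for the same creative_id, and the inner probe can be answered entirely from creative_id_access_date_index.

```sql
-- Sketch: keep each creative_id's newest row, then sort and limit.
-- Caveat: two rows with the same (creative_id, access_date) would
-- both survive the NOT EXISTS and appear as duplicates.
SELECT s.creative_id, s.access_date AS latest_access_date
FROM scrape_search_results AS s
WHERE NOT EXISTS (
    SELECT 1
    FROM scrape_search_results AS newer
    WHERE newer.creative_id = s.creative_id
      AND newer.access_date > s.access_date
)
ORDER BY s.access_date DESC
LIMIT 20;
```

Whether the optimizer actually stops early after 20 distinct ids, rather than materializing all survivors first, would need to be verified with EXPLAIN on this MySQL version.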

However, MySQL decides to use creative_url_index to answer this query, which I don't understand; it examines over 10,000 rows in the process.

ANALYZE TABLE scrape_search_results;
SELECT ...;
+----+-------------+--------------------+-------+---------------+--------------------+---------+------+-------+---------------------------------+
| id | select_type | table              | type  | possible_keys | key                | key_len | ref  | rows  | Extra                           |
+----+-------------+--------------------+-------+---------------+--------------------+---------+------+-------+---------------------------------+
|  1 | SIMPLE      | ScrapeSearchResult | index | NULL          | creative_url_index | 8       | NULL | 10687 | Using temporary; Using filesort | 
+----+-------------+--------------------+-------+---------------+--------------------+---------+------+-------+---------------------------------+

Is my trouble that I am performing an ORDER BY on the derived-column MAX(access_date)? If so, how can I optimize my query to perform more in-line with my expectations?

+2  A: 

I haven't done this sort of thing in MySQL for a while (I long since switched to PostgreSQL), but typically I would handle this with nested selects to trick the query planner into producing a good plan.

SELECT * FROM 
(SELECT `ScrapeSearchResult`.`creative_id`, 
        MAX(`ScrapeSearchResult`.`access_date`) AS `latest_access_date` 
FROM `scrape_search_results` AS `ScrapeSearchResult` 
WHERE 1 = 1 
GROUP BY `ScrapeSearchResult`.`creative_id`
) AS inner_query
ORDER BY `latest_access_date` DESC 
LIMIT 20;

The success of this will depend purely on the inner query producing a reasonable number of rows, though.

I just looked up the docs for MySQL 5.6 and it looks like this should work ... even in MySQL ;)

Trey
@Trey: Excellent, your suggestion brought the time down from 0.44 sec to 0.17 sec. The new query plan built a temporary table and then queried it. The `creative_id_access_date_index` index was used (still examining 10,687 rows, but with "Using index for group-by" instead of a temporary table and filesort on the derived step), which I believe accounts for the speedup. Any other suggestions to bring down the number of rows considered, for an even greater boost in speed?
Rob Crowell
Didn't want to mess up the comment above; here's the new EXPLAIN output:
1 | PRIMARY | <derived2> | ALL | NULL | NULL | NULL | NULL | 10812 | Using filesort
2 | DERIVED | ScrapeSearchResult | range | NULL | creative_id_access_date_index | 4 | NULL | 10687 | Using index for group-by
Rob Crowell
As long as you have no WHERE condition, it's going to do a full scan. If you can put any sort of condition on the query at all, it will help considerably. My query only speeds up the sort; it doesn't do much to reduce the number of rows scanned.
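For example (a hypothetical restriction; the 7-day window is an assumption you'd tune to how fresh your results need to be), a date cutoff lets MySQL range-scan access_date_index instead of touching every row:

```sql
-- Hypothetical: only consider recent scrapes, so the WHERE clause
-- can be answered with a range scan on access_date_index.
SELECT creative_id, MAX(access_date) AS latest_access_date
FROM scrape_search_results
WHERE access_date > NOW() - INTERVAL 7 DAY
GROUP BY creative_id
ORDER BY latest_access_date DESC
LIMIT 20;
```

This only works if every creative_id you care about is guaranteed to have activity inside the window; otherwise recently inactive ids silently drop out of the results.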
Trey
I don't know how often you are adding/pruning data, but you might want to build a summary table (holding just the results of the inner query) and select from that. You can maintain it in application code or with INSERT/UPDATE triggers. That way all sorting is done with an appropriate index (if you add one).
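A sketch of that summary-table approach (all names here are made up, and this only covers inserts; deletes from the base table would need separate handling):

```sql
-- One row per creative_id, indexed for the final sort.
CREATE TABLE creative_latest_access (
    creative_id int(11) NOT NULL,
    latest_access_date datetime NOT NULL,
    PRIMARY KEY (creative_id),
    KEY latest_access_date_index (latest_access_date)
) ENGINE=MyISAM;

-- Keep it current as scrape results arrive.
CREATE TRIGGER scrape_search_results_ai
AFTER INSERT ON scrape_search_results
FOR EACH ROW
    INSERT INTO creative_latest_access (creative_id, latest_access_date)
    VALUES (NEW.creative_id, NEW.access_date)
    ON DUPLICATE KEY UPDATE
        latest_access_date = GREATEST(latest_access_date, NEW.access_date);

-- The original question then becomes a pure index read:
SELECT creative_id, latest_access_date
FROM creative_latest_access
ORDER BY latest_access_date DESC
LIMIT 20;
```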
Trey
In MySQL, GROUP BY also implies an ORDER BY on the same columns. You may want to eliminate that sort by modifying the inner query this way: `GROUP BY ScrapeSearchResult.creative_id ORDER BY NULL`
ceteras
@ceteras: Interesting, didn't know this about MySQL at all. It made about 0.9 s difference when I explicitly ordered by NULL. Thanks!
Rob Crowell