I'm computing average prices by week over 7 million rows, and it's taking around 30 seconds to get the job done.

This is the simple query:

SELECT AVG(price) AS price, YEARWEEK(FROM_UNIXTIME(timelog)) AS week FROM pricehistory WHERE timelog > $range AND product_id = $id GROUP BY week

The only week whose data actually changes, and is therefore worth re-averaging every time, is the last one, so recalculating the whole period is a waste of resources. I just wanted to know if MySQL has a tool to help with this.
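
To make the idea concrete, here is a rough sketch of the kind of pre-aggregation I have in mind (the table and column names are made up, and I'm leaving the $range filter out for brevity): closed weeks get cached once, and only the current week is averaged live.

-- Hypothetical cache table for closed weeks (all names are illustrative).
CREATE TABLE weekly_price_avg (
    product_id INT NOT NULL,
    week       INT NOT NULL,            -- YEARWEEK() value, e.g. 201052
    avg_price  DECIMAL(10,2) NOT NULL,
    PRIMARY KEY (product_id, week)
);

-- Refresh closed weeks periodically (everything before the current week).
INSERT INTO weekly_price_avg (product_id, week, avg_price)
SELECT product_id,
       YEARWEEK(FROM_UNIXTIME(timelog)) AS week,
       AVG(price)
FROM pricehistory
WHERE YEARWEEK(FROM_UNIXTIME(timelog)) < YEARWEEK(NOW())
GROUP BY product_id, week
ON DUPLICATE KEY UPDATE avg_price = VALUES(avg_price);

-- Read path: cached closed weeks plus a live average for the current week only.
SELECT week, avg_price AS price
FROM weekly_price_avg
WHERE product_id = $id
UNION ALL
SELECT YEARWEEK(FROM_UNIXTIME(timelog)) AS week, AVG(price) AS price
FROM pricehistory
WHERE product_id = $id
  AND YEARWEEK(FROM_UNIXTIME(timelog)) = YEARWEEK(NOW())
GROUP BY week;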

+1  A: 

Have you tried indexing the fields?

I am no DDL expert in MySQL, but in this case I'd say that timelog should have a clustered index, and a non-clustered index should be declared on product_id. It would also be a good idea to add a new field to the table for storing the "week" value and index it as well. It would take slightly more space, but that way you would avoid making the same calculation each time.
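
As a rough sketch of what that could look like (index and column names are just illustrative, and on MyISAM there is no real clustered index, so these end up as plain secondary indexes):

-- Illustrative only: index the lookup fields and precompute the week value.
ALTER TABLE pricehistory
    ADD INDEX idx_timelog (timelog),
    ADD INDEX idx_product_id (product_id),
    ADD COLUMN week INT NOT NULL DEFAULT 0,
    ADD INDEX idx_week (week);

-- Backfill the new column once; application code would set it on every insert.
UPDATE pricehistory
SET week = YEARWEEK(FROM_UNIXTIME(timelog));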

Alexander
In MySQL, only the primary key can be clustered, and mostly in InnoDB tables.
newtover
Hmm, I'm on MyISAM.
caioiglesias
A: 

I would suggest creating a new composite BTREE index on (product_id, timelog) and changing the order of the conditions in the WHERE clause:

SELECT
    AVG(price) AS price,
    YEARWEEK(FROM_UNIXTIME(timelog)) AS week
FROM pricehistory
WHERE product_id = $id AND timelog > $range
GROUP BY week

If you already have a BTREE index on (product_id) only, just extend it to (product_id, timelog).
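
In MySQL that could be done roughly like this (the index names are hypothetical):

-- Drop the old single-column index and add the composite one in its place.
ALTER TABLE pricehistory
    DROP INDEX idx_product_id,
    ADD INDEX idx_product_timelog (product_id, timelog);

With product_id leading, the equality match narrows the index to one product before the timelog range condition is applied.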

newtover
I tried this approach with no significant improvement. A query involving all 7 million rows takes 40-45s to complete.
caioiglesias
Queries involving less data are faster though. Let me test it some more.
caioiglesias