Hi,
I have a simple but large table:
id_tick INTEGER eg: 1622911
price DOUBLE eg: 1.31723
timestamp DATETIME eg: '2010-04-28 09:34:23'
For one month of data, I have 2.3 million rows (150 MB).
My query aims to return the latest price at a given time.
I first set up a SQLite table and used the query:
SELECT max(id_tick), price, timestamp
FROM EURUSD
WHERE timestamp <='2010-04-16 15:22:05'
It runs in 1.6 s.
As I need to run this query several thousand times, 1.6 s is far too long...
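To make the problem reproducible, here is a minimal sketch of the setup in Python's sqlite3 module (the rows are made up; only the schema and the query come from above). One thing I could try is an index on timestamp, so the WHERE clause doesn't have to scan the whole table:

```python
import sqlite3

# In-memory sketch of the table described above; the rows are illustrative.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE EURUSD (
        id_tick   INTEGER,
        price     DOUBLE,
        timestamp DATETIME
    )
""")
conn.executemany(
    "INSERT INTO EURUSD VALUES (?, ?, ?)",
    [
        (1622910, 1.31720, '2010-04-16 15:22:01'),
        (1622911, 1.31723, '2010-04-16 15:22:04'),
        (1622912, 1.31730, '2010-04-16 15:22:09'),
    ],
)

# An index on timestamp should let SQLite locate the matching range
# instead of scanning all 2.3 million rows.
conn.execute("CREATE INDEX idx_eurusd_ts ON EURUSD (timestamp)")

row = conn.execute(
    "SELECT max(id_tick), price, timestamp "
    "FROM EURUSD WHERE timestamp <= '2010-04-16 15:22:05'"
).fetchone()
print(row)  # (1622911, 1.31723, '2010-04-16 15:22:04')
```

(SQLite's bare-column behaviour with max() returns the price and timestamp from the row holding the maximum id_tick, which is what I rely on.)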
I then set up a MySQL table and modified the query (the behaviour of the MAX function with bare columns differs between MySQL and SQLite):
SELECT id_tick, price, timestamp
FROM EURUSD
WHERE id_tick = (SELECT MAX(id_tick)
FROM EURUSD WHERE timestamp <='2010-04-16 15:22:05')
Execution time is even worse at 3.6 s. (I know I can avoid the subquery using ORDER BY and LIMIT 1, but it does not improve the execution time.)
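For reference, the ORDER BY / LIMIT 1 variant I mean is sketched below (again against sqlite3 with made-up rows). Assuming id_tick increases with timestamp, ordering by timestamp instead of id_tick would let a single index on timestamp serve both the filter and the sort:

```python
import sqlite3

# Same illustrative table as in the question; rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE EURUSD (id_tick INTEGER, price DOUBLE, timestamp DATETIME)"
)
conn.executemany(
    "INSERT INTO EURUSD VALUES (?, ?, ?)",
    [
        (1622910, 1.31720, '2010-04-16 15:22:01'),
        (1622911, 1.31723, '2010-04-16 15:22:04'),
        (1622912, 1.31730, '2010-04-16 15:22:09'),
    ],
)
# With this index, the query below can step to the last entry
# <= the cutoff and read exactly one row.
conn.execute("CREATE INDEX idx_ts ON EURUSD (timestamp)")

row = conn.execute(
    "SELECT id_tick, price, timestamp FROM EURUSD "
    "WHERE timestamp <= '2010-04-16 15:22:05' "
    "ORDER BY timestamp DESC LIMIT 1"
).fetchone()
print(row)  # (1622911, 1.31723, '2010-04-16 15:22:04')
```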
I am only using one month of data for now, but I will have to use several years at some point.
My questions are the following:
- is there a way to improve my query?
- given the large dataset, should I use another database engine?
- any tips?
Thanks !