ansaurus

Question

MySQL efficient select / order on a range query with multiple / single column indices

Answer 1

A:

As far as I remember the sql-query-analyzer parses the query from right to left - so the first index he meets is the city-one because it's the rightest. maybe you can flip the indices by changing the position of the in and the between-clause. Do you need all the information from your table? if not you could gain some speed by selecting only the columns you need.

Gambrinus 2009-12-23 12:43:25

Ah. That seems not to work. In the application we define specific fields. I thought it would be easier to read if I did not include those fields.

Ruuts 2009-12-23 12:54:49

Answer 2

A:

I'm now thinking of something totally different. Since the city_ids are a result of a base_city + range it would be possible to use only the date plus a algorithm in the where clause for defining the distance of the base_city -> activity. This takes about 0.009 sec to complete. Downside is the usage of that we sometimes still use the city_ids. Hmm.

SQL_NO_CACHE *
FROM `activities_index` AS idx
WHERE 
ROUND(
((acos(sin((52.220818*pi()/180)) * sin(( idx.lat *pi()/180)) + cos((52.220818*pi()/180)) * cos(( idx.lat *pi()/180)) * cos(( (6.891140 -  idx.lng )*pi()/180 )))) 
*180/pi()) *60*1.1515*1.609344
) < 15 AND idx.date BETWEEN '2009-12-23' AND '2010-1-23'
ORDER BY idx.date
LIMIT 25

Ruuts 2009-12-23 12:57:23

Answer 3

A:

Some interesting information on index mergining. Unfortunately, your query is a perfect example of one of the listed deficiencies (a single range scan).

Whether or not the query in your reply is any better depends a lot on how many rows you have in a given date range, because you definitely won't get any optimization out of that algorithm. However, if the date range can narrow the rows sufficiently, that could be the most effective.

Note: the order of the possible_keys in the EXPLAIN output is not significant. Your wording also makes it sound as if you interpret the EXPLAIN output to say that it is doing a range select using date. It is not. It is doing a range select on city_id (it will scan every row with a city_id value between the min and max values in your IN() clause. The efficiency of doing so will depend greatly on the distribution of your values.

Have you tried running ANALYZE TABLE activities_index to see if the speed of the query and/or the output of EXPLAIN changes. MySQL often tries to predict value distributions based on column type, but actually analyzing the table gives a true distribution to use, which can allow it to better select the best key(s).

Rob Van Dam 2009-12-24 07:15:44

ansaurus

tags:

views:

answers:

MySQL efficient select / order on a range query with multiple / single column indices

related questions