ansaurus

Question

How do I avoid a full table scan with this basic inner join?

Answer 1

+3 A:

I guess you are trying this on empty table (because MySQL thinks it needs to go through one row to do full table scan), what might influence results of scheduler. When you will do it on real table, the EXPLAIN results might vary (and actually did vary in my test).

Michal Čihař 2009-12-23 12:48:21

Your right, I threw some data in there, ran analyze table on both tables, and now the first join is of type range on the meta table.

2009-12-30 17:51:02

Answer 2

A:

If I read what you posted right it goes from index to ref and eq_ref to all.

CREATE INDEX idx_metadata USING BTREE ON `metadata` (fkBlob,start);

Should take it right back.

Don 2009-12-23 12:52:12

Answer 3

+2 A:

The optimizer thinks you query will benefit from swapping the table order (which most probably means that statistics are not up to date).

You can try adding an index on metadata (start, fkBlob):

CREATE INDEX ix_metadata_start_blob ON metadata (start, fkBlob)

and run ANALYZE TABLE on both tables.

This way, the index on start will be used for filtering on metadata which will be made leading.

You can also explicitly force the order of the join:

SELECT  *
FROM    metadata m
STRAIGHT_JOIN
        blobstore b
ON      b.id = m.fkBlob
WHERE   m.start <= '2009-01-01'

, though it's usually not recommended.

Quassnoi 2009-12-23 17:28:30

Answer 4

A:

if the index doesnot take it right use HINTS

select /* INDEX <index_name> */
blah blah blah
from ........

Venkataramesh Kommoju 2009-12-29 11:50:01

Answer 5

A:

In the first example, MySQL used the metadata fk_blob index because it was a covering index--every column you used in the query was present in the index. (This is what "using index" means.) That query still did a full scan, but it scanned every row via the secondary index instead of the primary. As soon as you used start, you lost the covering index and MySQL calculated it was faster to use blobstore as the driving index. (InnoDB's primary index is integrated with the row storage.)

If you want MySQL to continue to use a metadata index as the driving index, make sure there's a single index on it that will be useful for the query. An index on (start, fkBlob) would be best for the second query, but that may not be useful for other queries. The next best index is to replace (fkBlob) with (fkBlob, start). You'll have to balance having too many indexes (which are expensive to maintain) vs having efficient query plans. Test, test, test--and never blindly believe explain on your dev database.

Ken Fox 2009-12-29 16:39:39

ansaurus

tags:

views:

answers:

How do I avoid a full table scan with this basic inner join?

related questions