ansaurus

Question

Answer 1

A:

If a subquery references fields from its containing query, the subquery has to be rerun per every row in the containing query, because the referenced fields may be different in each row. If it's completely self-contained, it can be run a single time before the outer query begins processing.

Dewayne Christensen 2009-12-11 20:19:29

Answer 2

+1 A:

Try to avoid correlated subqueries by using JOIN if you can.

Watch this great video on MySQL performance on youtube. Go to 31:00 minute in. The speaker Jay Pipes talks about avoiding correlated subqueries.

Yada 2009-12-11 22:02:54

interesting link - thank you!

davek 2009-12-14 10:12:39

Answer 3

+1 A:

If the correlated subquery isn't optimized well, then try this query:

select
  t.id
, t.int1
, t.int2
, count(*)
from myTable t
left outer join big_table_with_millions_of_rows b
  on (b.id between t.int1 and t.int2)
where
....
group by t.id

That should optimize much better.

Re your updated question: Right, MySQL is not the most sophisticated RDBMS on the market in terms of optimization. Don't be surprised when MySQL can't optimize corner cases like this.

I'm a fan of MySQL for its ease of use and open source and all those good things, but the truth is that its competitors are far ahead of MySQL in terms of technology. Every RDBMS has some "blind spots" but MySQL's seem to be larger.

Also be sure you're using the latest version of MySQL. They improve the optimizer in every release, so you might get better results with a newer version.

Bill Karwin 2009-12-12 06:00:28

+1 thank you: that brought the execution time down from minutes to several seconds. I'll have to bear that tip in mind in future!

davek 2009-12-14 10:12:00

ansaurus

tags:

views:

answers:

MySql and inline SELECTs

related questions