Hi everyone!

I'd like to know if there's a good way to optimize this kind of DB model:

Here are my tables:

[category]
idCategory
name

[postCategory] (a post can be in more than 1 category)
idCategory
idPost

[post]
idPost
post

[comment]
idComment
idPost
inputDate
comment
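
For reference, the schema above might look like this in SQL. This is only a sketch: the original post doesn't give column types or constraints, so the types here (INT keys, DATETIME dates, a composite primary key on the junction table) are assumptions.

```sql
CREATE TABLE category (
    idCategory INT PRIMARY KEY,
    name       VARCHAR(100)
);

CREATE TABLE post (
    idPost INT PRIMARY KEY,
    post   TEXT
);

-- A post can belong to more than one category.
CREATE TABLE postCategory (
    idCategory INT,
    idPost     INT,
    PRIMARY KEY (idCategory, idPost)
);

CREATE TABLE comment (
    idComment INT PRIMARY KEY,
    idPost    INT,
    inputDate DATETIME,
    comment   TEXT
);
```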

I'm going to have to display all the posts from a specific category within a specific time range (the time comes from the comments). The time range is fixed (1 day, 1 week, 1 month, 1 year). Here's what I came up with:

SELECT DISTINCT post.idPost, post.post
FROM post
INNER JOIN comment ON post.idPost = comment.idPost
INNER JOIN postCategory ON postCategory.idPost = post.idPost
WHERE postCategory.idCategory = <myCategoryId>
    AND comment.inputDate >= <today - time range>
Let's say that I want to support 10k posts and 500k comments... Is there a way to optimize this (besides adding indexes)? Would you use a stored proc, a query with temp tables, add "precalculated" fields somewhere...?

Thanks a lot! :)

A: 

I would compute your <today - time range> portion client side, before even connecting to the database.
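
As an illustration (a sketch, assuming MySQL-style syntax and a one-week range; the category id and cutoff date below are made-up placeholder values), the idea is to avoid making the database evaluate the date arithmetic and instead pass a literal the application computed once:

```sql
-- Instead of an expression evaluated by the database, e.g.:
--   WHERE comment.inputDate >= NOW() - INTERVAL 7 DAY
-- compute the cutoff once, client side, and send it as a plain literal:
SELECT DISTINCT post.idPost, post.post
FROM post
INNER JOIN comment ON post.idPost = comment.idPost
INNER JOIN postCategory ON postCategory.idPost = post.idPost
WHERE postCategory.idCategory = 42                 -- example category id
  AND comment.inputDate >= '2009-02-01 00:00:00';  -- cutoff computed client side
```

A constant literal also tends to play more nicely with query-plan and result caching than a call to a non-deterministic function like NOW().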

Beyond that, it's going to depend on what indexes you have, what load your server has (what it keeps cached in memory), and the amount of data in each table (how many comments per typical post, how many posts per category, etc). In other words, you need to profile. Assuming all that is moot (it's not!), a decent query optimizer should be able to pretty much take care of anything else.

There are a couple of things I'd do differently out of habit, but without knowing more about your system I can't say whether they'd matter here. The main one, though, is that I'd think about volume.

In general, I like to express my queries such that if the joins are done in order, the result set is kept as small as possible for as long as possible. In this case, that would likely mean listing the postCategory join above the comment join, and moving the `= <MyCategoryID>` condition up into the join expression.
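
Concretely, that reordering might look like this (a sketch; `<myCategoryId>` and `<today - time range>` stay placeholders, as in the original query):

```sql
SELECT DISTINCT post.idPost, post.post
FROM post
INNER JOIN postCategory
        ON postCategory.idPost = post.idPost
       AND postCategory.idCategory = <myCategoryId>  -- filter as part of the join
INNER JOIN comment
        ON comment.idPost = post.idPost
WHERE comment.inputDate >= <today - time range>
```

For inner joins a good optimizer will usually pick the same plan either way, but writing the query in the order you want the result set narrowed makes the intent explicit.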

Joel Coehoorn
Mmm ok... Is it a good idea to try the model with dummy data inserted by an automated script or something? Or would I just get different results from real-life data anyway?
mrmuggles
Dummy data is better than no data, especially if you can be reasonably confident that it's somewhat representative of the actual data.
Joel Coehoorn
I'm going to try that way and ask some help if I see a problem. Thanks!
mrmuggles
Well, take anything I say with a grain of salt. My experience is mostly with SQL Server databases. Most of this _should_ apply across systems, but sometimes things don't work as you'd expect.
Joel Coehoorn