views:

53

answers:

3

I've been trying to optimize a query on MySQL that takes around 15-20 seconds to run. My data table has about 10M rows, and the query returns 68,000 records that match 144 "run" values and 35 "name" patterns. Because the query uses two IN-style clauses, my indexes don't seem to be terribly helpful.

Here's the query:

select * from data d
where d.data_type = 'Result'
  and d.run in ('8a7aee1f2a6232b1012a624da9201b92', '8a7aee1f2a6232b1012a625432a314ef',

... [144 runs]

  )
  and (d.name like 'itema[%]' or d.name like 'itemb[%]')

Here's the table definition

CREATE TABLE `data` (
  `data_type` varchar(31) NOT NULL,
  `id` char(32) NOT NULL,
  `entry_time` datetime default NULL,
  `name` varchar(255) NOT NULL,
  `step` int(11) default NULL,
  `value` double NOT NULL,
  `run` char(32) NOT NULL,
  PRIMARY KEY  (`id`),
  KEY `FK2EEFAA8ECCC6F3` (`run`),
  KEY `data2` (`run`,`step`),
  KEY `data3` (`data_type`,`name`(10),`run`),
  CONSTRAINT `FK2EEFAA8ECCC6F3` FOREIGN KEY (`run`) REFERENCES `run_archive` (`id`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8;

Explain tells me the query is using key data3.

id  select_type  table  type   possible_keys                 key    key_len  ref   rows    Extra
1   SIMPLE       d      range  FK2EEFAA8ECCC6F3,data2,data3  data3  223      NULL  113271  Using where

I used to run 144 separate queries (one per run). Doing it as one query is about twice as fast, but still way too slow.

Suggestions for optimizing? Ideas I have are:

  • Finding a magic index that speeds this up

  • Denormalizing data (it'd be easy to get rid of the run, but harder for the name)

  • Splitting up the data among different tables (hard to do with my Java/Hibernate approach)

Or am I just asking the impossible here?

Edit: it turned out the biggest fix was to increase the size of my innodb_buffer_pool. The query dropped to about 1.5 seconds after doing this. I've marked as "answer" a fix that improved it slightly more.
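For anyone hitting the same wall, a minimal sketch of that fix: the buffer pool is sized in my.cnf (the 512M figure below is purely illustrative, not the asker's actual setting; size it to fit your hot working set).

# my.cnf sketch -- 512M is an illustrative value, tune to your data size
[mysqld]
innodb_buffer_pool_size = 512M

You can check the current value from the client with SHOW VARIABLES LIKE 'innodb_buffer_pool_size'; a restart is required for my.cnf changes to take effect on older MySQL versions.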

+1  A: 

Consider splitting the result records away from the data table. I didn't catch what percentage of the table your results are, but it's probably worth benchmarking in a dev copy of your prod database.

Can you FK those run values? If they're reusable, perhaps create a Run table? My guesstimate is that matching 144 strings, even indexed, is slower than if they were int or smallint. Again, benchmarking this (or any) suggestion will prove or disprove the theory.
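A sketch of that idea, assuming you add a surrogate integer key (the seq_id and run_id names below are hypothetical, not from the original schema):

-- Hypothetical migration: give each run an integer surrogate key
alter table run_archive add column seq_id int not null auto_increment unique;
alter table data add column run_id int;
update data d join run_archive r on r.id = d.run set d.run_id = r.seq_id;
-- then index (data_type, run_id, name(10)) and filter on the integer ids

An int key is 4 bytes versus 32 for the char(32) id, so index entries shrink and comparisons get cheaper; whether that outweighs the migration cost is something only a benchmark will tell.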

What does the query plan difference look like when not including your like clause on the name attribute?

p.campbell
Had the data3 key wrong (I'd done a hand-edit). The first field is the data_type.
Will Glass
About 80% of the data is "Result" data_type. Also, I do have a run table: the "run" field is foreign-keyed to the run table, but via the char(32) id. Doing a join seems to be about the same performance. I use a char(32) instead of an integer for portability of data (in case I need to move it between servers).
Will Glass
Replacing the name "like" with an in clause that spells out the 36 possibilities has about the same performance.
Will Glass
A: 

The create statement doesn't work for me; can you post the EXPLAIN output? Also, adding indices on data_type and name should help.

Alternatively, create a view with the 'd.run in...' clause and run your queries against that, if the values for run are fixed.
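A sketch of that suggestion (the view name is made up, and it only helps if the run list really is fixed):

-- Hypothetical view baking in the fixed run list
create view result_runs as
  select * from data d
  where d.data_type = 'Result'
    and d.run in ('8a7aee1f2a6232b1012a624da9201b92', ... /* the 144 run ids */);

Note that a plain MySQL view is just a stored query, not a materialization, so this mainly tidies up the application code rather than changing the execution plan.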

Thilo
posted explain output
Will Glass
A: 

Depending on how selective the condition on run is, it might be better to provide the index

data_type, run, name(10)

The trouble with providing the column used for prefix matching early in the index is that it scatters matching rows across the index, requiring a bigger part of the index to be read from disk.
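Concretely, that index could be added as follows (a sketch; the name data4 is arbitrary, and whether to keep or drop the old data3 is a benchmarking question):

alter table data add key data4 (data_type, run, name(10));

With run ahead of the name prefix, the 144 equality matches each land on a contiguous index range, and the like 'itema[%]' / 'itemb[%]' prefix filter is applied within those ranges instead of scattering reads across the whole index.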

Also, using a smaller datatype for the id of a run will reduce index size and speed up comparisons. This is a constant-factor improvement, but might be worthwhile regardless.

meriton
This helps: running on my laptop, it's 78 seconds with this index vs. 135 for the other index.
Will Glass