ansaurus

Question

Does having a longer string in a SQL Like expression allow hinder or help query executing speed?

Answer 1

+2 A:

All things being equal, longer match strings should run faster since it allows to skip through the test strings with bigger steps and do less matches.

For an example of the algorithms behind sting matching see for example Boyer Moore Algorithm on Wikipedia.

Of course not all things are equal, so I would definitely benchmark it.

A quick check found in the mysql reference docs the following paragraph :

If you use ... LIKE '%string%' and string is longer than three characters, MySQL uses the Turbo Boyer-Moore algorithm to initialize the pattern for the string and then uses this pattern to perform the search more quickly.

Peter Tillemans 2010-05-29 19:43:49

Thanks for the information, though benchmarking wouldn't tell me that they were using that specific algorithm. Though they are likely using something similar.

Allain Lalonde 2010-05-29 19:46:07

I just cross referenced and found mysql does use Boyer-Moore and under which conditions. I updated the answer.

Peter Tillemans 2010-05-29 19:51:13

Fantastic. Thanks.

Allain Lalonde 2010-05-29 19:53:01

Answer 2

+1 A:

No difference whatsoever. Because you've got a % sign at the beginning of your LIKE expression, that completely rules out the use of indexes, which can only be used to match the a prefix of the string.

So it will be a full table scan either way.

In a significant sized database (i.e. one which doesn't fit in ram on your 32G server), IO is the biggest cost by a very large margin, so I'm afraid the string pattern-matching algorithm will not be relevant.

MarkR 2010-05-29 22:34:58

true, but it still burns less CPU cycles which is nice to know in the time of Green IT ;-).

Peter Tillemans 2010-05-30 11:18:42

In which case, it depends which occurs more often in your field, 'p' or '='. It has to compare every character in the string with the first literal character. If it doesn't find it, it can stop. If you have lots of = but few 'p's, then the '%p' expression is better and vice versa.

MarkR 2010-05-30 18:54:00

ansaurus

tags:

views:

answers:

Does having a longer string in a SQL Like expression allow hinder or help query executing speed?

related questions