ansaurus

Question

SQL Server Search Proper Names Full Text Index vs LIKE + SOUNDEX

Answer 1

A:

If you create an index on the first name and last name columns, exact-match searches and prefix searches using LIKE can seek into the index instead of scanning the whole table, which makes them much faster.

(In MySQL, "The index also can be used for LIKE comparisons if the argument to LIKE is a constant string that does not start with a wildcard character." I think MS SQL has a similar rule, but check the MS SQL documentation to be sure.)

To speed up SoundEx searches, store the SoundEx version of the first name and last name in new columns, and create indices on those columns.
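As a rough sketch of the precomputed-key idea, the snippet below stores a Soundex code next to each last name and indexes that column. It uses Python's built-in sqlite3 module as a stand-in for SQL Server. The table `people`, the column `last_name_sx`, and the index name are hypothetical. The `soundex` function is a plain implementation of classic American Soundex, and SQL Server's own SOUNDEX() may differ in edge cases.

```python
import sqlite3

# Consonant groups of classic American Soundex; vowels, H, W and Y carry no digit.
_CODES = {
    letter: digit
    for digit, group in {
        "1": "BFPV", "2": "CGJKQSXZ", "3": "DT", "4": "L", "5": "MN", "6": "R",
    }.items()
    for letter in group
}


def soundex(name: str) -> str:
    """Return the four-character Soundex key for a name (empty string if no letters)."""
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    code = letters[0]
    prev = _CODES.get(letters[0], "")
    for c in letters[1:]:
        digit = _CODES.get(c, "")
        if digit and digit != prev:
            code += digit
        if c not in "HW":  # H and W do not separate equal codes; vowels do
            prev = digit
    return (code + "000")[:4]


# Hypothetical schema: the Soundex key is computed once at insert time and indexed.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE people (id INTEGER PRIMARY KEY, first_name TEXT, "
    "last_name TEXT, last_name_sx TEXT)"
)
conn.execute("CREATE INDEX ix_people_last_sx ON people (last_name_sx)")
for first, last in [("Ann", "Smith"), ("Bob", "Smyth"), ("Cy", "Jones")]:
    conn.execute(
        "INSERT INTO people (first_name, last_name, last_name_sx) VALUES (?, ?, ?)",
        (first, last, soundex(last)),
    )

# The search term is encoded once, then matched with a plain equality on the indexed column.
matches = [
    row[0]
    for row in conn.execute(
        "SELECT last_name FROM people WHERE last_name_sx = ? ORDER BY last_name",
        (soundex("Smithe"),),
    )
]
print(matches)
```

The point of the design is that the expensive part (encoding every stored name) happens at write time, so the search itself is an ordinary indexed equality lookup.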

Ken Bloom 2010-06-01 01:07:26

Answer 2

+3 A:

Depends what your LIKE queries look like.

If you are searching for LIKE '%abc%' then no index seek is possible (at best the whole index is scanned), whereas when searching for LIKE 'abc%' an index seek can be used. Also, if the index(es) on first and last name are not 'covering' for the query being issued, then key lookups (bookmark lookups) will be performed, which can significantly hurt performance.
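To see the difference between the two patterns, here is a minimal sketch that asks a query planner how it would run each query. It uses Python's built-in sqlite3 module as a stand-in, so the plan text is SQLite's and not what SQL Server Management Studio would show. The table and index names are hypothetical. The prefix pattern is expected to produce a SEARCH (a range seek on the index) and the leading-wildcard pattern a SCAN.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE people (id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT)"
)
# SQLite's LIKE is case-insensitive by default, so the index must use NOCASE
# for the planner to turn a prefix LIKE into an index range.
conn.execute("CREATE INDEX ix_people_last ON people (last_name COLLATE NOCASE)")


def plan(sql: str) -> str:
    """Return SQLite's description of how it would execute the query."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return " | ".join(row[-1] for row in rows)  # the detail text is the last column


prefix_plan = plan("SELECT id FROM people WHERE last_name LIKE 'Smi%'")
infix_plan = plan("SELECT id FROM people WHERE last_name LIKE '%mit%'")

print("prefix :", prefix_plan)
print("infix  :", infix_plan)
```

On SQL Server the equivalent check is to compare the actual execution plans of the two queries and look for an Index Seek versus an Index Scan or Table Scan operator.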

Are your indexes rebuilt regularly?

Do you have an example query plan?

Update: A covering index for a query is one which can be used to perform the WHERE criteria and also has all of the columns required to satisfy the rest of the query such as the SELECT column list.

Using Covering Indexes to Improve Query Performance

Update: Even if you create a composite index on (Lastname, Firstname) (since Lastname should be more selective), a lookup into the table's clustered index will still be required for all the other columns (the '*' column list).
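A small sketch of the covering versus non-covering distinction, again using Python's built-in sqlite3 module as a stand-in for SQL Server (SQLite has no INCLUDE clause, and its plan wording differs). The table, columns, and index name are hypothetical. When the query asks only for columns that are in the index, the planner reports a covering index; with `SELECT *` it still uses the index to find rows but has to go back to the table for the remaining columns.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE people (id INTEGER PRIMARY KEY, first_name TEXT, "
    "last_name TEXT, city TEXT)"
)
# Composite index with the (assumed) more selective column first.
conn.execute("CREATE INDEX ix_people_name ON people (last_name, first_name)")


def plan(sql: str) -> str:
    """Return SQLite's description of how it would execute the query."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return " | ".join(row[-1] for row in rows)  # the detail text is the last column


# Every requested column lives in the index, so no table lookup is needed.
covered = plan("SELECT last_name, first_name FROM people WHERE last_name = 'Smith'")
# 'city' is not in the index, so each match needs an extra lookup into the table.
not_covered = plan("SELECT * FROM people WHERE last_name = 'Smith'")

print("covered     :", covered)
print("not covered :", not_covered)
```

Narrowing the SELECT list to the columns actually needed is often the cheapest way to make an existing index covering.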

Mitch Wheat 2010-06-01 01:08:16

Indexes will be rebuilt regularly, probably weekly. I'm adding records at the rate of roughly 5,000 per day. Ha, looks like the current system isn't using "LIKE" at all; evidently it was too slow. So, I'd say 'abc%' ought to be an improvement.

Matthew Talbert 2010-06-01 01:23:00

What do you mean by 'covering'?

Matthew Talbert 2010-06-01 01:23:39

This is really helpful, Mitch. I'm working on getting an example query plan for you. So, should I create a single index that contains all of the columns I'm interested in?

Matthew Talbert 2010-06-01 01:43:29

I've added the query execution plan. Hopefully it's what you need.

Matthew Talbert 2010-06-01 01:50:45

@Matthew Talbert: well, it's a trade-off, and depends on several factors. Wide indexes are generally not a good idea. From SQL Server 2005 onwards, you can use the INCLUDE clause of CREATE INDEX to create a covering index without widening the index key.

Mitch Wheat 2010-06-01 01:51:21

Answer 3

+1 A:

I don't like soundex much. I think newer iterations of the algorithm are better, but you are hashing every word in the English language down to a fairly small set of codes. This tends to generate a ton of false matches over time. I've read that metaphone and its successor, double metaphone, are better, but I don't have direct experience with them.

Mitch's coverage of like is pretty thorough, so I'm not going to repeat it.

Donnie 2010-06-01 01:32:26

Thanks for the info on soundex.

Matthew Talbert 2010-06-01 01:59:31
