ansaurus

Question

Performant techniques for finding similar values in SQL?

Answer 1

+2 A:

You don't mention what DB you're using, but if it's T-SQL, you could use the SOUNDEX and DIFFERENCE functions.
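As a rough sketch of what that could look like (the table dbo.Products and column ProductName are made up for illustration, and the threshold of 3 is only a guess you would need to tune):

```sql
-- Hypothetical table and column names, for illustration only.
-- DIFFERENCE compares the SOUNDEX codes of its two arguments and
-- returns an integer from 0 (least similar) to 4 (most similar).
SELECT p.ProductName,
       SOUNDEX(p.ProductName)              AS NameCode,
       DIFFERENCE(p.ProductName, 'Canine') AS Similarity
FROM dbo.Products AS p
WHERE DIFFERENCE(p.ProductName, 'Canine') >= 3   -- assumed cutoff, tune as needed
ORDER BY Similarity DESC;
```

Keep in mind that wrapping the column in a function like this generally prevents an index seek on it, so on a large table it is worth checking the query plan.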

JP Alioto 2009-07-10 04:39:44

Presently I am using T-SQL, but I didn't mention it in hopes of getting more generalized answers that might work on multiple database platforms. The Soundex/Difference approach looks promising though. I'll give it a spin.

JohnFx 2009-07-10 04:45:56

Answer 2

+1 A:

If you are using SQL Server, you might look into using the SOUNDEX() function, as in:

...
where
   SOUNDEX('searchterm') = SOUNDEX(searchvaluefield)

It is supposed to do phonetic matching on the strings ...

Ron

Some odd examples ... so it seems you could catch plurals by always appending the plural 's' to both sides, since repeated 's' characters get the same code ... :-)

select soundex('Canine'), soundex('Canines')
go

----- ----- 
C550  C552  

1 Row(s) affected


select soundex('Canine'), soundex('Caynyn')
go

----- ----- 
C550  C550  

1 Row(s) affected


select soundex('Canines'), soundex('Caniness')
go

----- ----- 
C552  C552  

1 Row(s) affected

Ron Savage 2009-07-10 04:43:21

Any experience with how well Soundex handles plural versions of words? Sounds like it would do well for "Firemen"/"Fireman" but perhaps not so well for "Canine"/"Canines".

JohnFx 2009-07-10 04:47:57

Answer 3

A:

John, if you are using MS SQL Server, you can take advantage of the Full-Text Indexing service. Full-text search provides some powerful functions that you can use to achieve this.
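For instance, something along these lines might cover the plural case (a sketch only: dbo.Products and ProductName are hypothetical names, and it assumes a full-text index already exists on that column):

```sql
-- Hypothetical table and column; requires a full-text index on ProductName.
-- FORMSOF(INFLECTIONAL, ...) asks full-text search to match inflected
-- forms of the term, such as singular and plural nouns.
SELECT p.ProductName
FROM dbo.Products AS p
WHERE CONTAINS(p.ProductName, 'FORMSOF(INFLECTIONAL, canine)');
```

Because the lookup goes through the full-text index rather than a per-row function call, it tends to be a better fit than SOUNDEX when the table is large, though that depends on your data.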

Kirtan 2009-07-10 04:58:07
