In its enthusiasm to stem tokens into lexemes, the PostgreSQL full-text search engine also reduces proper nouns. For instance:

essais=> select to_tsquery('english', 'bortzmeyer');
 to_tsquery 
------------
 'bortzmey'
(1 row)

essais=> select to_tsquery('english', 'balling');
 to_tsquery 
------------
 'ball'
(1 row)

At least for the first one, I'm sure it is not in the English dictionary! What is the best way to avoid this spurious stemming?

+2  A: 

The point of stemming algorithms is not to reduce every word to its proper stem; the goal is to reduce words that are alike to a common stemmed form. The goal is generally not to produce a word that can be presented to the user: even if 'balling' and 'ball' both produced 'kjebnkkekaa', the algorithm would still be correct, because it still treats 'balling' and 'ball' as generally concerning the same thing.
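That common-form behaviour is what makes matching work. A minimal sketch in psql, assuming a stock PostgreSQL install with the built-in `english` configuration:

```sql
-- 'balling' and 'ball' both stem to the lexeme 'ball', so a query
-- for one matches a document containing the other.
SELECT to_tsvector('english', 'balling') @@ to_tsquery('english', 'ball');
-- returns t
```

The stemmed lexeme is an internal matching key, not a display form.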

Also beware that no stemming algorithm is absolutely perfect; for more information, look up the Porter stemming algorithm.

Jasper Bekkers
+1  A: 

That's due to the Snowball stemmer, as explained here. Basically, you'll want to disable the Snowball stemmer and use just Ispell or one of the other dictionaries, but that would also reduce stemming effectiveness for words not in the dictionaries.
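One way to set this up is a custom text search configuration that maps word tokens to an Ispell dictionary with the `simple` dictionary as a fallback, instead of Snowball. A sketch, assuming an Ispell dictionary has been installed under the (hypothetical) name `english_ispell` with matching dict/affix files:

```sql
-- Assumes english.dict and english.affix are installed in $SHAREDIR/tsearch_data.
CREATE TEXT SEARCH DICTIONARY english_ispell (
    TEMPLATE  = ispell,
    DictFile  = english,
    AffFile   = english,
    StopWords = english
);

-- Start from the built-in configuration, then replace Snowball for plain words.
CREATE TEXT SEARCH CONFIGURATION public.english_nostem (
    COPY = pg_catalog.english
);

ALTER TEXT SEARCH CONFIGURATION english_nostem
    ALTER MAPPING FOR asciiword, asciihword, hword_asciipart
    WITH english_ispell, simple;
```

With `simple` as the fallback, words not found in the Ispell dictionary (such as proper nouns like 'bortzmeyer') pass through lowercased but unstemmed, rather than being mangled by Snowball.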

codelogic