ansaurus

Question

Answer 1

+1 A:

PorterStemmerAnalyzer is composed of series of tokenizers and filters. PorterStemmer is one of the filters to the tokenstream generated. If you want to verify that, try changing the case of the query. QueryParser output will be in the lowercase due to LowerCaseFilter on tokenstream.

Some sample code for custom analyzer can be checked here. This will give you a peek inside an Analyzer.

Shashikant Kore 2010-09-30 06:01:00

Answer 2

+2 A:

The query parser tokenizes it first into two tokens. Porter considers it all as one "word" and so only stems the last portion.

Xodarap 2010-09-30 13:59:27

ansaurus

tags:

views:

answers:

Lucene PorterStemmer question

related questions