ansaurus

Question

How to do partial word searches in Lucene.NET?

Answer 1

A:

it's more a matter of populating your index with partial words in the first place. your analyzer needs to put in the partial keywords into the index as it analyzes (and hopefully weight them lower then full keywords as it does).

lucene index lookup trees work from left to right. if you want to search in the middle of a keyword, you have break it up as you analyze. the problem is that partial keywords will explode your index sizes usually.

people usually use really creative analyzers that break up words in root words (that take off prefixes and suffixes).

get down in to deep into understand lucene. it's good stuff. :-)

Zac Bowling 2009-12-04 05:54:46

Answer 2

A:

Yes, this can be done. But, leading wildcard can result in slow queries. Check the documentation. Also, if you are indexing the entire string (eg. "Dayton, Ohio") as single token, most of the queries will degenerate to leading prefix queries. Using a tokenizer like StandardAnalyzer (which I suppose, you are already doing) will lessen the requirement for leading wildcard.

If you don't want leading prefixes for performance reasons, you can try out indexing ngrams. That way, there will not be any leading wildcard queries. The ngram (assuming only of length 4) tokenizer will create tokens for "Dayton Ohio" as "dayt", "ayto", "yton" and so on.

Shashikant Kore 2009-12-04 06:23:09

Thanks for the response. I'm not too worried about the slow queries yet as I'd like to see it work first before I decide if it's too slow or not. My location list should stay steady at around 4,000 documents so I'm not too worried about it getting any bigger.When you say, "Yes, this can be done." could you elaborate a little more? I thought that the code I displayed above should be doing what I'm expecting, but it's not. Any ideas on what I'm doing wrong?

thinkzig 2009-12-04 13:56:50

Answer 3

+2 A:

Try this:

parser.Parse(query.Keywords.ToLower() + "*")

:)

j3fft 2009-12-04 14:23:53

That did the trick! You had just what I needed./GBT: werd!!!

thinkzig 2009-12-04 14:29:03

ansaurus

tags:

views:

answers:

How to do partial word searches in Lucene.NET?

related questions