Searching hyphenated words with Lucene | ansaurus

tags:

lucene

Q:

Searching hyphenated words with Lucene

Hi

I want Lucene to treat a hyphenated word, e.g. energy-efficient or "energy-efficient", as one single word when searching.

Currently, if the input is energy-efficient, the tokenizer generates terms such as energy, efficient, energy efficient, and energy-efficient.

As a result, Lucene returns pages containing either "energy efficient" or "energy-efficient", but I want it to return only pages containing energy-efficient.

So the question is: how can I modify the StandardTokenizer so that it searches for energy-efficient as one whole word instead of breaking it into separate words?

A:

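Use WhitespaceAnalyzer instead of StandardAnalyzer. It generates tokens by splitting only on whitespace, so energy-efficient stays a single term. Check what else changes with that analyzer, though: for example, it does not lowercase tokens or strip punctuation the way StandardAnalyzer does.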

kaka 2010-09-01 03:13:25
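
To make the difference concrete, here is a minimal sketch in plain Java. It does not use Lucene at all: the class HyphenTokenizationSketch and its helper methods are hypothetical names made up for illustration, and the "hyphen-splitting" helper is only a simplified stand-in for a tokenizer that breaks on hyphens, not Lucene's actual StandardTokenizer logic. It just shows why splitting on whitespace alone keeps energy-efficient as one token.

```java
import java.util.ArrayList;
import java.util.List;

// Plain-Java illustration only: these helpers mimic the two tokenization
// styles and are NOT Lucene classes or Lucene's actual implementation.
public class HyphenTokenizationSketch {

    // Split on the given delimiter pattern and drop empty pieces.
    static List<String> tokenize(String text, String delimiterRegex) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.split(delimiterRegex)) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    // Whitespace-style: break only on whitespace, keep case and punctuation.
    static List<String> whitespaceStyle(String text) {
        return tokenize(text, "\\s+");
    }

    // Simplified stand-in for a tokenizer that breaks on hyphens:
    // lowercase, then break on every run of non-letter, non-digit characters.
    static List<String> hyphenSplittingStyle(String text) {
        return tokenize(text.toLowerCase(), "[^\\p{L}\\p{N}]+");
    }

    public static void main(String[] args) {
        String text = "Energy-efficient lighting, energy efficient homes";
        System.out.println(whitespaceStyle(text));
        System.out.println(hyphenSplittingStyle(text));
    }
}
```

In the whitespace-style output the hyphenated word survives as a single token, but so do its capital letter and the comma attached to "lighting,". That is the kind of side effect to check for before switching analyzers, and the same analyzer should be used for both indexing and querying so the terms line up.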
