lucene.net

Setting wildcard queries as default for QueryParser

When my users enter a term like "word" I would like it be treated as a wildcard query "word*" so all terms beginning "word" are found. Is there a way to tell the QueryParser to automatically create wildcard queries or do I have to parse the query myself? This shouldn't be a problem for simple queries but it may become tricky for more com...

Lucene (.NET) Document stucture and performance suggestions.

Hello, I am indexing about 100M documents that consist of a few string identifiers and a hundred or so numaric terms.. I won't be doing range queries, so I haven't dugg too deep into Numaric Field but I'm not thinking its the right choose here. My problem is that the query performance degrades quickly when I start adding OR criteria...

Different analyzers for each field

Hi, How can I enable different analyzers for each field in a document I'm indexing with Lucene? Example: RAMDirectory dir = new RAMDirectory(); IndexWriter iw = new IndexWriter(dir, new StandardAnalyzer(Lucene.Net.Util.Version.LUCENE_CURRENT), true, IndexWriter.MaxFieldLength.UNLIMITED); Document doc = new Docu...

Lucene.NET performance

I have a website that runs of a third party search provider that is expensive. I am going to roll my own. Is Lucene.NET capable of ~25,000 products (or documents), each with maybe ten attributes used for filtering? I am looking to do a "narrow/drill down" or "faceted search". Does that sound like to much to ask from Lucene.NET? ...

Resources for getting started with Lucene.Net?

I'm building a simple site that allows users to post text content and I want to add it to a search index as it gets posted, so my site search is up to date. From what I can tell Lucene.NET is a good full text search framework. I've found very few examples of how to use it though. Can anyone post some good references for learning about...

Lucene.net create+lock errors in ASP.NET

-Edit- Important: I updated the code to not use obsoluete functions. Now only the NoSuchDirectoryException issue remains Edit: NOTE i can bypass the NoSuchDirectoryException by creating the folder in a winform app and copy it. However i still have a LockObtainFailedException issue if i dont shut down properly. I have an issue with (Luc...

How to index a string like "aaa.bbb.ddd-fff" in Lucene?

Hi, I have to index a lot documents that contain reference numbers like "aaa.bbb.ddd-fff". The structure can change but it's always some arbitrary numbers or characters combined with "/","-","_" or some other delimiter. The users want to be able to search for any of the substrings like "aaa" or "ddd" and also for combinations like "aaa...

Lucene.net 2.9.2 NumericField sort

Can anyone verify if the latest Lucene.net 2.9.2 can sort (and reverse sort) NumericField? I am sort of stumped, expecially with the reverse sort :( ...

Lucene.NET faceted search.

I found a great tutorial on performing a faceted search. http://www.devatwork.nl/articles/lucenenet/faceted-search-and-drill-down-lucenenet/ This article does not explain how to retrieve the narrowed available attributes to filter from (for further drill down). Lets say I am looking for planners that are red. When I perform the facete...

Lucene.NET 2.9 and BitArray/DocIdSet

I found a great example on grabbing facet counts on a base query. It stores the bitarray of the base query to improve the performance each time the a facet gets counted. var genreQuery = new TermQuery(new Term("genre", genre)); var genreQueryFilter = new QueryFilter(genreQuery); BitArray genreBitArray = genreQuer...

SQL Server Search Proper Names Full Text Index vs LIKE + SOUNDEX

I have a database of names of people that has (currently) 35 million rows. I need to know what is the best method for quickly searching these names. The current system (not designed by me), simply has the first and last name columns indexed and uses "LIKE" queries with the additional option of using SOUNDEX (though I'm not sure this is a...

Is there a way I can provide Lucene.NET with a list of predefined relevant terms?

I know I can, during search, specify a "boost factor" to a term as described in http://lucene.apache.org/java/2_4_0/queryparsersyntax.html. My question is: Can I provide Lucene with a predefined table of relevance? For instance, I could say that "chair" and "table" are relevant words with a boost factor of 4 and all subsequent searches...

Lucene search taking TOOO long.

I;m using Lucene.net (2.9.2.2) on a (currently) 70Gig index.. I can do a fairly complicated search and get all the document IDs back in 1 ~ 2 seconds.. But to actually load up all the hits (about 700 thousand in my test queries) takes 5+ minutes. We aren't using lucene for UI, this is a datastore between processes where we have hundreds...

Multiple word Auttosuggest using Lucene.Net

I am currently working on an search application which uses Lucene.Net to index the data from the database to Index file. I have a product catalog which has Name, short and long description, sku and other fields. The data is stored in Index using StandardAnalyzer. I am trying to add auto suggestion for a text field and using TermEnum to ...

Where can I find open source applications that use lucene.net

I am looking for any open source application that uses lucene.net. I am working on a complicated web application and would like to see how others have implemented lucene.net. ...

Lucene: Question of score caculation with PrefixQuery

Hi, I meet some problem with the score caculation with a PrefixQuery. To change score of each document, when add document into index, I have used setBoost to change the boost of the document. Then I create PrefixQuery to search, but the result have not been changed according to the boost. It seems setBoost totally doesn't work for a Pre...

Lucene.Net memory consumption and slow search when too many clauses used

I have a DB having text file attributes and text file primary key IDs and indexed around 1 million text files along with their IDs (primary keys in DB). Now, I am searching at two levels. First is straight forward DB search, where i get primary keys as result (roughly 2 or 3 million IDs) Then i make a Boolean query for instance as foll...

Lucene Boolean Query on Not ANalyzed Fields

Using RavenDB to do a query on Lucene Index. This query parses okay: X:[[a]] AND Y:[[b]] AND Z:[[c]] However this query gives me a parse exception: X:[[a]] AND Y:[[b]] AND Z:[[c]] AND P:[[d]] "Lucene.Net.QueryParsers.ParseException: Cannot parse '( AND )': Encountered \" \"AND" I tried this on complexed index and simple reproduce ...

Handling different non-accented versions of Umlaut characters

The German accented Umlaut characters “ö”, “ä” and “ü” are often replaced with non-accented versions when users type, often for convenience when they do not have the correct keyboard. With most accented characters there is a particular non-accented version that most people use. The accented “è”, for instance, is always replaced with a s...

Lucene.Net/SpellChecker - multi-word/phrase based auto-suggest

Hi I've implemented Lucenet.NET on my site, using it to index my products which are theatre shows, tours and attractions around London. I want to implement a "Did you mean?" feature for when users misspell product names that takes the whole product titles into account and not just single words. For example, If the user typed: Lodnon...