lucene.net

Help needed bubbling up relevant records with most recent date

Hi, I've got 5 records in Lucene index. a.Record 1 contains--tax analysis.Date field value is March 2009 b.Record 2 contains--Senior tax analyst.Date field value is Aug 2009 c.Record 3 contains--Senior tax analyst.Date field value is July 2009 d.Record 4 contains--tax analyst.Date field value is Feb 2009 e.Record 5 contains--Senio...

Lucene / Lucene.NET - Document.SetBoost() values???

I know it takes in a float, but what are some typical values for various levels of boosting within a result? For example: If I wanted to boost a document's weighting by 10% then I should set it 1.1? For 20% then 1.2? What happens if I start setting boosts to values like 75.0? or 500.0? Edit: Fixed Formatting ...

Help needed figuring out reason for maxClauseCount is set to 1024 error

Hi, I've two sets of search indexes. TestIndex (used in our test environment) and ProdIndex(used in PRODUCTION environment). Lucene search query: +date:[20090410184806 TO 20091007184806] works fine for test index but gives this error message for Prod index. "maxClauseCount is set to 1024" If I execute following line just before execut...

Lucene - Searching several terms in different fields

I have a Lucene index which populates from a database. I store/index some fields and then add a FullText field in which I index the contents of all the other fields, so I can do a general search. Now let's say I have a document with the following two fields: fld1 - "Samsung releases a new 22'' LCD screen" fld2 - "Sony Ericsson phone's b...

Lucene query - "Match exactly one of x, y, z"

I have a Lucene index that contains documents that have a "type" field, this field can be one of three values "article", "forum" or "blog". I want the user to be able to search within these types (there is a checkbox for each document type) How do I create a Lucene query dependent on which types the user has selected? A couple of prere...

SQl Server 2008 FILESTREAM, Indexing Services, and FTS

I need to implement a service to search PDFs. Initially I started using SQL Server 2008 FTS, but soon realized that my PDFs would have to be stored in the DB itself. I was then pointed to Indexing Services as well as to the SQL 2008 FILESTREAM data type so that I can store PDFs in the file system. So how do these three (Indexing Servi...

FullTextSession configuration using spring.net

Am using lucene.net for full text search. Am configuring my Nhibernate session programatically using SearchFactory provided by Lucene in the NHibernate.Search library. Am evaluating if spring.net can be introduced in my system. Am not sure if spring.net will configure a FullTextSession for me? Please let me know if you have any suggestio...

Lucene.Net maintaining indexes when using MultiSearcher

In order to keep my index up to date, I need to add / modify my search index every 5 minutes. The way I have it set up is with 2 indexes, one in the Full directory and one in the Incremental directory, and to search them I am using a MultiSearcher. I am now writing a process to maintain the index. What I am doing is passing the last ind...

Lucene.Net alternative sorting

Most of the time (>90%) I will want results to be sorted by the UpdatedOn field. When I do a search with this sort the results take almost 500% longer than a search based on a score sort. Is there some alternate way of indexing that will optimize this kind of sort? I use multiple indexes (MultiSearcher is that matters). ...

How do I delete old documents from Lucene/Lucene.NET

What is the idiomatic way to delete old documents from a Lucene Index? I have a date field (YYYYMMddhhmmss) on all of the documents, and I'd like to remove anything more than a day old (for example). Should I perform a filtered search or enumerate through the IndexReader's documents? I'm sure the question is the same regardless of whi...

Need to know pros and cons of using RAMDirectory

Hi, I need to improve performance of my Lucene search query. Can I use RAMDirectory?Does it optimize performance?Is there any index size limit for this? I would appreciate if someone could list pros and cons of using a RAMDirectory. Thanks. ...

Lucene.Net TermQuery wildcard search

Hi, I have a lucene index I am trying to do a wildcard search. In index i have a character like '234Test2343' I am trying to do the search like %Test%.. My lucene syntax looks like string catalogNumber="test"; Term searchTerm = new Term("FIELD", "*"+catalogNumber+"*"); Query query = new TermQuery(searchTerm); I don't get the results ...

How to index numeric fields and search them by range in Lucene.Net ?

I'm looking for an efficient way to index and search numeric fields in Lucene.Net. Right now my need is for integer values only. I want to search by ranges of values (between x and y, more than x...). Right now I'm indexing the number as is and manually creating clauses for each value in between the two values, but it quickly generates ...

Handling + as a special character in Lucene search

Hi, How do i make sure lucene gives me back relevant search results when my input string contains terms like c++? Lucene seems to ignore ++ characters. Code details: When I execute this line,I get a blank search query. queryField = multiFieldQueryParser.Parse(inpKeywords); keywordsQuery.Add(queryField, BooleanClause.Occur.SHOULD); ...

Search filters with Lucene.NET

I'm using Lucene.Net to create a website to search books, articles, etc, stored as PDFs. I need to be able to filter my search results based on author name, for example. Can this be done with just Lucene? Or do I need a DB to store the filter fields for each document? Also, what's the best way to index my documents? I'll have about 5...

Resolving Lucene Index error

Hi, Why do I get error like this in Lucene and how to resolve it? Could not find file 'C:\Indexes_z3_1.del'. Thanks. ...

Lucene Index and Query Design Question - Searching People

I have recently just started working with Lucene (specifically, Lucene.Net) and have successfully created several indicies and have no problem with any of them. Previously having worked with Endeca, I find that Lucene is lightweight, powerful, and has a much lower learning curve (due mostly to a concise API). However, I have one specif...

Inflectional forms of verbs using DBsight lucene?

I know dbsight allows synonyms and stop words for searching but does this take care of inflectional forms of a verb too e.g. for 'swim' it should find swim, swims, swimming, swam, and swum Link on DBSight Wiki : http://wiki.dbsight.com/index.php?title=User%5Fdictionary ...

Using Lucene.Net, what indexing stategy should I use here?

I'm trying to build a search for our internal support database - each support ticket consists of many emails and I'm trying to work out how best to index it: Should I create a document for each of the emails individually, or Should I concatenate all the emails for a ticket and create a document for each ticket. When searching I want ...

Boosting Multi-Value Fields

I have a set of documents containing scored items that I'd like to index. Our data structure looks like: Document ID Text List<RelatedScore> RelatedScore ID Score My first thought was to add each RelatedScore as a multi-value field using the Boost property of the Field to modify the value of the particular score when searc...