lucene

Optimizing Lucene performance

What are the various ways of optimizing Lucene performance? Shall I use caching API to store my lucene search query so that I save on the overhead of building the query again? ...

Index replication and Load balancing

Hi, Am using Lucene API in my web portal which is going to have 1000s of concurrent users. Our web server will call Lucene API which will be sitting on an app server.We plan to use 2 app servers for load balancing. Given this, what should be our strategy for replicating lucene indexes on the 2nd app server?any tips please? ...

Using RAMDirectory

Hi, When do i use Lucene RAMDirectory?what's the advantage of using it? Can i have some code sample please? Thanks. ...

Recommended title boost?

I have a relatively simple Lucene index, being served by Solr. The index consists of two major fields, title and body, and a few less-important fields. Most search engines give more relevance to results with matches in the title, over the body. I'm going to start providing an index-time boost to the title field. My question is, what...

Reading document in Lucene

My indexed document in Lucene has got multiple cities assigned to it...ie. doc.Add(new Field("city", city1.Trim(), Field.Store.YES, Field.Index.TOKENIZED)); doc.Add(new Field("city", city2.Trim(), Field.Store.YES, Field.Index.TOKENIZED)); etc how do i iterate thru them and read the values after executing the Lucene search query? Thanks...

Lucene analyzer and dots

Am newbie to Lucene. Is there any way I can make Lucene analyzer not ignore dots in the string?? for example,if my search criteria is: "A.B.C.D",Lucene should give me only those documents in the search results which have "A.B.C.D" and not "ABCD".... ...

Lucene and forward slash

In my index there is a value companyName:opel/vauxhall Searching for "opel/vauxhall" returns the correct results. Searching for "opel/*" returns nothing. Currently I'm replacing the '/' with spaces. But that does give my code a bit of a smell. Any better ideas? ...

Search multiple indices at once using Lucene Search

I am using Zend_Search_Lucene to implement site search. I created separate indices for different data types (e.g. one for users, one for posts etc). The results are similarly divided by data type however there is an 'all' option which should show a combination of the different result types. Is it possible to search across the different i...

Problem using Lucene RangeQuery

I'm using Rangequery to get all the documents which have amount between say 0 to 2. When i execute the query, Lucene gives me documents which have amount greater than 2 also...What am i missing here? Here is my code: Term lowerTerm = new Term("amount", minAmount); Term upperTerm = new Term("amount", maxAmount); ...

Why does a Lucene exact match only score 0.4?

I have a document that contains a title and a description. One of the item description contains "Shipping is free with Amazon Prime" among other words. I do a search with lucene for "Shipping is free with Amazon Prime" on the fields title and description using perl Lucene my $analyzer = new Lucene::Analysis::SimpleAnalyzer(); my @field...

DynamicFields in Solr

In my current project i need to index all e-mails and their attachments from multiple mailbox. I will use Solr and I don't know what is the best approach to build my index's structure. My first approach was: <fields> <field name="id" require="true"/> <field name="uid" require="true"/> //A lot of other fields <dynamicField name="attachm...

City name query

Am a newbie to Lucene an working on a city search API using Lucene. If user types in san francisco as search input, then it should give cities with exact match only and not San Jose /San Diego,etc. How should i index city names in Lucene?and which Lucene analyzer and query class do i need to use? ...

Lucene Javadoc package

Does Lucene has separeted jar with javadoc inside ? Everything in contrib section has separated javadoc jars but it look to me that core has none. Am I right ? ...

Pros & cons of full text search engine Lucene, Sphinx, Postgresql full text search, MySQL full text search?

I'm building a Django site and is looking for a search engine. A few candidates: Lucene/Lucene with Compass/Solr Sphinx Postgresql built-in full text search MySQl built-in full text search Selection criteria: result relevance and ranking searching and indexing speed ease of use and ease of integration with Django resource requirem...

Grouping Lucene search results and calculating frequency by category

I am working on a store search API using Lucene. I need to show store search results for each City,State combination with its frequency in brackets....for example: Los Angles,CA (450) Atlanta,GA (212) Boston, MA (78) . . . As of now, my search results return around 7000 Lucene documents, on average, if the user says "Show me all the ...

What are some good resources for learning the Perl port of Lucene?

For learning about the Perl port of Lucene, what are some good documents, resources, or books? As people note on CPAN, the documentation is a little lacking. On Amazon all I find is Lucene and Java, but I understand the port is fairly good. Is buying a Java or Lucene book the best approach? ...

Good Documentation for Learning Clucene?

Is there a good book or tutorial for setting up and using Cluence for beginners? ...

Tags and attributes in Lucene shared across documents

My app needs to keep an index of files in which the files are known by tags and attributes, suggesting a Lucene (Java) document schema like: tags: i s (indexed, stored) attributes: i s content: i fileId: i s (The actual file is looked up by id in sqlite.) However, while a file has only one set of tags/attributes, it may have multiple ...

Lucene SpanQuery - what is it good for?

Can someone explain or provide a link to an explanation of what a SpanQuery is, and what are typical use cases for it? The javadoc is very laconic, and keeps mentioning the concept of "Span", which I'm not quite sure I get. Also, I'm interested in the SpanScorer in the highlighter, and what it does exactly. ...

Lucene IndexSearcher locks index causing IOException when rebuilding

I've learned from reading the available documentation that an IndexSearcher instance should be shared across searches, for optimal performance, and that a new instance must be created in order to load any changes made to the index. This implies that the index is writable (using IndexWriter) after having created an instance of IndexSearch...