lucene

Reusing Lucene Query objects

Are Lucene's Query objects reusable/stateless ? If not, will clone()ing a TermQuery be faster than rebuilding it ? ...

How can I exclude certain URLs in Solr / Lucene

Hi, I have setup new instance of Solr indexing on a website. I want Solr NOT to index certain URL patterns. Is there any way of mentioning such exclude-pattern? Regards, Paras ...

File content via Solr to Lucene???

I have a question regarding searching a complete document. 1 - I have indexed a lot of documents on lucene. 2 - Each document has a single word per line. Suppose 200 words which becomes 200 lines. 3 - I know how to search lucene via Solr but; If suppose that i indexed the document mydoc.txt on lucene containing 200 words along with o...

dismax solr request handler MM , PS and Q.ALT

Hi all , I'm testing the dismax requesthandler , im trying to customize the mm ( Minimum Match ) Parameter ( i already tooked a look at the documentation ) <str name="mm"> 2&lt;-2 3&lt;-70% 5&lt;-50% </str> <int name="ps">100</int> <str name="q.alt">*:*</str> I have 3 Questions : mm , i understood what it does , i want...

What's better? Solr or Lucene?

Possible Duplicate: Search Engine - Lucene or Solr What's better? Solr or Lucene? ...

dismax feat. stopwords , synonyms ect..

Hello all ;) does dismax support all the features like the standard requestHandler ? stopwords ? synonymes ? stemming ? did you hear about "edismax" ? Im using Solr1.4 for my first tests of the stopwords , it doesn't work .. well i think so.. i configured my DisMax to match all terms if count terms=[1,2] Example ( in french ) ...

Zend Search Lucene floating point numbers range search

Hello, I have problem with Zend Search Lucene when searching float numbers. The problem is that when I execute query like avg:[0.15 TO 0.30] I get error message Range query boundary terms must be non-multiple word terms. For parsing the query I use the default parser. I a little desperate now because searching float numbers is main aim ...

Loader constraint violation in hibernate createFullTextQuery

I have a webapplication using Hibernate and Jpa and want to add search via hibernate-search. I use the following hibernate versions: hibernate-core-3.3.2.GA hibernate-annotations-3.4.0.GA hibernate-entitymanager-3.4.0.GA hibernate-search-3.2.1.Final And for lucene: lucene-core-2.9.3 The following code compiles perfectly fine: public Li...

Lucene/Solr Searching problem?

Hi I have a problem that i want to search in the specific locations in the indexed text, let we have a lucene document that contains text as <Cover> This document contains following items 1. Business overview. 2. Risk Factors. 3. Management </Cover> <BusinessOverview> our business is xyz </BusinessOverview> <RiskFactors> we have xyz...

What does Field.Index.NOT_ANALYZED_NO_NORMS mean

I know what does not_analyzed mean. In short the field will not be tokenized by specified Analyzer. However, what does a NO_NORMS means? I see the documentation, but please explain me in plain English. what is index-time field and document boosting and field length normalization ? ...

Split Lucene index files without reindexing

Is there a way to generate separate index files from a single one based on some sort of rule without reindexing the documents again? The original index contains none-stored fields which means I can't read documents and add them to the destination index. One option mentioned in SO is to clone the index into many and then delete the docu...

How to get total match count in Solr/lucene

I have a problem that i want to get total count of matched text in solr. but when i want to perform search using solr i have to set max rows parameter. can anybody explain how i could get the total matched count using solr efficiently? ...

How to get matching string by performing search on Lucene ?

I would like to know how can I get the matching string results by performing search on a Lucene index ? I have tried to install the Perl module Lucene::Search::Highlight and failed over and over again. Is there another I can get the relevant fragments of text that Lucene "thinks" they relevant for my search phrase? ...

Standard analyzer, do not treat one character as a punctuation.

Hi, i am indexing some data in which '@' character should be treated like the way it is. I also need all the features provided by standard Analyzer (eliminating punctuation,trailing s etc), but this removes @ also. Is there any place where i can specify to treat a character as not a punctuation?Or any filter that can be used with white...

Can I run iPad with 200 thousand record with sqlite ?

Current database have 200 thousand record. I want to make iPad application , can I run 200 thousand record with sqlite ? I don't want to use sqlite because searching is too slow over 32000. Any search engine like lucene for iPhone SDK ? if we can run lucene in iPad that will be awesome because current project is base on lucene. Can you g...

How to use Lucene for getting tags for tagcloud?

Hello I'm a Lucene newbie and I have it working where I can create an index from a column of a database table storing tags, and search this index for a particular tag and display the results and their scores. I'm looking at this as an alternative to a MySQL fulltext index which I've heard mixed comments about. But, what I want is to g...

exist xml database - full text index using Lucene

Hi i using Lucene to create index in my xml eXist database. In this time need to get all index (want create tag). I should use xpath query? or need write something in java? ...

Displaying sample text from the Lucene Search Results

Currently, I am using Lucene version 3.0.2 to create a search application that is similar to a dictionary. One of the objects that I want to display is a sort of "example", where Lucene would look for a word in a book and then the sentences where the words were used are displayed. I've been reading the Lucene in Action book and it menti...

SolrNet/Solr - Update vs. Overwrite Document

I am using SolrNet to intreract with a Solr index. I have a daemon application writing to the Solr index with adds/updates/deletes. However with SolrNet an Add with the same unique-key over-writes (replaces) the existing document, instead of appending (combining) them. In Lucene I could do something like this where term is the Lucene te...

N-gram generation form sentence

how to generate ngram of a string like String Input="This is my car." i want to generate Ngram of this input Input Ngram size = 3 Output should come: This is my car This is is my my car This is my is my car give some idea in java, how to implement that or any library is available for it. I am trying to use this NGramTokenizer ...