lucene

Katta in production environment

According to the website Katta is a scalable, failure tolerant, distributed, indexed, data storage. I would like to know if it is ready to be deployed into production environment. Anyone already using it and has advices? Any pitfalls? Recommendations? Testimonials? Please share. Any answer would be greatly appreciated. ...

php mysql fulltext search: lucene, sphinx, or ?

This is admittedly similar to (but not a duplicate of) http://stackoverflow.com/questions/737275/pros-cons-of-full-text-search-engine-lucene-sphinx-postgresql-full-text-search, however what I am looking for are specific, supported, recommendations from the benefit of experience with more than one of the available systems (there seems to ...

Syncing Lucene.net indexes across multiple app servers

Hi, we are designing the search architecture for a corporate web application. We'll be using Lucene.net for this. The indexes will not be big (about 100,000 documents), but the search service must be always up and always be up to date. There will be new documents added to the index all the time and concurrent searches. Since we must hav...

Lucene BooleanQuery

How to use booleanQuery with StandardAnalyzer in Lucene Search? ...

Could you use Lucene as an OODB?

Given that Lucene is a robust document based search engine could it be used as an Object Database for simple applications (E.G., CMS style applications) and if so what do you see the benefits and limitations? I understand the role of the RDBMS (and use them on a daily basis) but watned to explore other technologies/ideas. For example ...

Why Zend Lucene doesn't find results but Luke does for the same fuzzy query.

Hello I am coding search engine using Zend Framework Lucene. I'm trying to make fuzzy query: "name:sxample~" When I put it into Luke - it founds 14 results (all with word "sample"). When I use my php code - $query = 'name:sxample~'; $query = Zend_Search_Lucene_Search_QueryParser::parse($query,'utf-8'); try { $hits = $index->fin...

Pros and cons of using Lucene's MultiSearcher class

Hi, Am using Lucene search API for a .net web application. Can I know the pros and cons of using MultiSearcher ?In what scenarios shall I use it? Thanks for reading! ...

how to achieve pagination in lucene?

Wondering how to achieve pagination in Lucene, as it does not inherently support pagination. I basically need to search for 'top 10 entries' (based on some parameter) then 'next 10 entries' and so on. And at the same time I don't want Lucene to hog memory. Any piece of advice would be appreciated. Thanks in advance. ...

How to do search of part of a word using lucene?

Hi, Its Ershad here.I am working on lucene. Now i am able to search the word.But if i type part of word, i am unable to get the results. Can you pls suggest what to be done. For indexing, i am using the below code writer = new IndexWriter(directory, new StandardAnalyzer(), true); writer.SetUseCompoundFile(true); doc.Add(Field.UnStored...

Solr Query Syntax

Hi all, I just got started looking at using Solr as my search web service. I don't know whether Solr supports these query types: Startswith Exact Match Contain Doesn't Contain In the range Could anyone guide me how to implement those features in Solr? Cheers, Samnang ...

Migrating from Hit/Hits to TopDocs/TopDocCollector

I have existing code that's like: final Term t = /* ... */; final Iterator i = searcher.search( new TermQuery( t ) ).iterator(); while ( i.hasNext() ) { Hit hit = (Hit)i.next(); // "FILE" is the field that recorded the original file indexed File f = new File( hit.get( "FILE" ) ); // ... } It's not clear to me how to re...

Lucene: How to do wildcard search inside a Term?

I've the following lucene index: Document A item = level:(1) item = level:(2) item = level:(3) Document B item = level:(1) item = level:(4) Suppose I want to search for all documents which contain level:(1) AND level:(2) ? The Lucene query could be like: "item:level\:\(1\) AND level\:\(2\)" but is it also possible to do something...

Finding exact match using Lucene search API

Hi, I'm working on a company search API using Lucene. My Lucene company index has got 2 companies: 1.Abigail Adams National Bancorp, Inc. 2.National Bancorp If the user types in National Bancorp, then only company # 2(ie. National Bancorp) should be returned and not #1.....ie. only exact matches should be returned. How do I achieve thi...

How to change default conjunction with Lucene MultiFieldQueryParser

I have some code using Lucene that leaves the default conjunction operator as OR, and I want to change it to AND. Some of the code just uses a plain QueryParser, and that's fine - I can just call setDefaultOperator on those instances. Unfortunately, in one place the code uses a MultiFieldQueryParser, and calls the static "parse" method...

Query in Lucene

The structure of the table "testtable" is id int primary key productid int attributeid int value varchar(250) where productid is the unique id of a product, attributeid is the unique id of attribute of a product e.g. size, quality,height, color and 'value' is the value for the attribute i have to filter a result. I achieve the re...

Using solr for indexing different types of data

Hi. I'm considering the use of Apache solr for indexing data in a new project. The data is made of different, independent types, which means there are for example botanicals animals cars computers to index. Should I be using different indexes for each of the types or does it make more sense to use only one index? How does using ...

Best cross-language analyzer to use with lucene index

Hello, I'm looking for feedback on which analyzer to use with an index that has documents from multiple languages. Currently I am using the simpleanalyzer, as it seems to handle the broadest amount of languages. Most of the documents to be indexed will be english, but there will be the occasional double-byte language indexed as well. A...

Lucene.Net Best Practices

What are the best practices in using Lucene.Net? or where can I find a good lucene.net usage sample? ...

access list of keywords from lucene index

We plan to use lucene as FTI-service. Amongst other things, we want to build a tag-index, based on a tag-attribute of our documents that simply contains space-delimited tags. Now for suggesting tag-completions, it would be great if there was a way to access all unique keywords of a given index. Lucene must be able to do that internally...

nutch field problem

I was using something like: Field notdirectory = new Field("notdirectory","1", Field.Store.NO, Field.Index.UN_TOKENIZED); and queries like "notdirectory:1" can be processed quite well all the time. But recently I've changed the "Field.Store.NO, Field.Index.UN_TOKENIZED" to index a non-numeric string: Field stateField = new Field("st...