lucene

how to get all docs with acts_as_solr

Hi, I'm doing something like this: Item.find_by_solr('name:ab*') and it says it returns 297 results: => #<ActsAsSolr::SearchResults:0xb6516858 @total_pages=1, @solr_data={:docs=>[doc1, doc2, doc3...]}, :max_score=>1.6935261, :total_pages=>1, :total=>297}, @current_page=1> Item.count_by_solr('name:ab*') also returns 297. Yet when ...

Lucene: unstored fields.

Hello, I just wondering whenever exist a way to read the unstored, but indexed field in Lucene index? I need because I have an index and I'm going to iterate over all documents in the index in order to apply some analysis and I need to update those documents later, in order to update I need first delete and when to re-insert the docume...

Ruby alternative for Lucene

Hi, I have heard about Lucene a lot, that it's one of the best search engine library in JAVA, is there any similar(as powerful) library for Ruby. ...

Lucene IndexReader.reopen doesn't seem to work correctly

I have a problem with Lucene 2.4, the situation being as follows: I have to deal with the possibility that there are 2 seperate processes operating on the same Index directory and they need to have the same data. This means that when one Instance adds a Document to the Index, the other application instances shall find the added Document...

Match whole field in Lucene

I'm currently indexing a database with lucene. I was thinking of storing the table id in the index, but I can't find a way to retrieve the documents by that field. I guess some pseudo-code will further clarify the question: document.add("_id", 7, Field.Index.UN_TOKENIZED, Field.Store.YES); // How can I query the document with _id=7 // w...

PHP and MySQL search across lots of columns

I need a way to search through 50+ columns, both varchar and int, in a specific table. I've seen people recommend Lucene and Sphinx, but I have no clue how to go about using those and MySQL's Full-Text won't allow me to combine more than 16 columns or both varchar and int columns. ...

What is the use of Field.Set OmitNorms(true); in lucene

Hi All, I have been suggested to use Field.Set OmitNorms(true); when creating the documents for lucenesearch to sort the result according to the number of hits, but I am not clear of what it does and is it safe. sort the result according to the number of hits means that the document in which search text is found maximum number ...

lucene set boost on fields at search time

Is it possible to adjust the boost of a field with the Query object before running the search? I know the proper way to do it is to change the fields boost during indexing, but it takes about 4 days to make an index and was just wondering if there's a quick hack i can do for now. also i have tried hardcoding in the boost to the search q...

Lucene search results sort by custom order list (unique to each user)

I have authenticated users in my application who have access to a shared database of up to 500,000 items. Each of the users has their own public facing web site and needs the ability to prioritize the items on display (think upvote) on their own site. out of the 500,000 items they may only have up to 200 prioritized items, the order of ...

Lucene for Blackberry OS?

Does anyone know if and where I can get a port of Lucene or a similar library that allows full text searching on Blackberry? Thanks, ...

Updating Lucene index from two different threads in a web application

Hi, I've a .net web application which uses Lucene.net for company search functionality. When registered users add a new company,it is saved to database and also gets indexed in Lucene based company search index in real time. When adding company in Lucene index, how do I handle use case of two or more logged-in users posting a new compa...

Full-text search for static HTML files on CD-Rom via javascript

I will be delivering a set of static HTML pages on CD-Rom; these pages need to be fully viewable with no Internet access whatsoever. I'd like to provide a full-text search (Lucene-like) for the content of those pages, which should "just work" from the CD-Rom with no software installation on the client machine. A search engine implement...

Latest Version of Lucene .Net

Can anybody please inform me, which is the latest version of Lucene .Net.. please be sure Thanking you ...

Why does 'delete document' in lucene 2.4 not work?

Hi I want to delete a document in lucene 2.4 with java. My code is Directory directory = FSDirectory.getDirectory("c:/index"); IndexReader indexReader = IndexReader.open(directory); System.out.println("num="+indexReader.maxDoc()); indexReader.deleteDocuments(new Term("name","1")); System.out.println("num="+indexReader.maxDoc()...

Why does Lucene cause OOM when indexing large files?

Hi, I’m working with Lucene 2.4.0 and the JVM (JDK 1.6.0_07). I’m consistently receiving “OutOfMemoryError: Java heap space”, when trying to index large text files. Example 1: Indexing a 5 MB text file runs out of memory with a 64 MB max. heap size. So I increased the max. heap size to 512 MB. This worked for the 5 MB text file, but...

Searcing for a list of keywords to find out which ones exist or not

I have a large document with various sections. Each section has a list of keywords /phrases of interest. I have a master list of keywords/phrases stored as a String array. How can I use Solr or Lucene to search each section document for all keywords and basically give me which keywords were found ? I cant think of any straightforward wa...

Lucene.net-2.3.2" memory leakages problem

Is "Lucene.net-2.3.2" memory leakages problem solved or not solved? please help me how to solve it. i am facing it.. Thanks in advance.... ...

Is it possible to get a list of node types (or even property types) in Alfresco?

Is there a way that I can call alfresco via webscript and get back a list of node types or list of node properties that are available? So if there are three different node types (cm:content, cm:folder, and cm:foo), I'd get that three returned to me (and not actually nodes that are one of these three types)? Reason why I'm asking is th...

A few Lucene questions

I've been using Zend and need a search. The Zend docs aren't great so I had a couple questions that are easy to answer but not directly obvious. I'm using Lucene to search an SQL database How do I associate the index of my item with the text of that item. So if they search and find the item, how do I get its index returned? As far ...

Lucene Index optimization

Is there a progmmatic way in Lucene to know if the index is optimized or not? Thanks. ...