lucene

Zend_Search_Lucene massive - similar to ZF-5545 issue

EDIT: Solved with a hack for now. Added at line 473: if (isset($this->_termsFreqs[$termId][$docId])) { } This happens only when I'm searching for multiple words, e.g. +word1 +word2 + word3. I get this massive error: Notice: Undefined offset: 2 in C:\wamp\www\project\library\Zend\Search\Lucene\Search\Query\MultiTerm.php on line...

Does Zend Lucene need Java Lucene?

When implementing Zend Lucene, do we need to install Java on our server or not? ...

What is best and most active open source .Net search technology?

I'm trying to decide on an open source search/indexing technology for a .Net project. It seems like the standard out there for Java projects is Lucene, but as far as .Net is concerned, the Lucene.Net project seems to be pretty inactive. Is this still the best option out there? Or are there other viable alternatives? ...

Zend_Search_Lucene - Can't create directory '/data/users_index'.

I have a problem creating an index with Zend_Search_Lucene. Everything works fine on my local machine, so I guess there is just an issue with file permissions on the webserver. Here is how I'm trying to create the index in the controller: $index = Zend_Search_Lucene::create('/data/users_index'); Of course the data directory has permissi...

lucene ignore queries on fields other than default

I have 2 indexes, one for metadata and one for text. I want to be able to remove all field searches in the query and only use the default fields that the user searched, i.e. for "help AND title:carpool" I want only the help part. Ideas? ...

How do I filter my Lucene search results?

Say my requirement is "search for all users by name, who are over 18". If I were using SQL, I might write something like: Select * from [Users] Where ([firstname] like '%' + @searchTerm + '%' OR [lastname] like '%' + @searchTerm + '%') AND [age] >= 18 However, I'm having difficulty translating this into Lucene.NET. This ...

Help with Search Engine Architecture .NET C#

I'm trying to create a search engine for all literature (books, articles, etc), music, and videos relating to a particular spiritual group. When a keyword is entered, I want to display a link to all the PDF articles where the keyword appears, and also all the music files and video files which are tagged with the keyword in question. The...

java lucene class not found problem

I keep getting this error: java.lang.NoClassDefFoundError: org/apache/lucene/index/memory/MemoryIndex Is there any way I can make sure Java/Tomcat can find this class? ...

Choosing a stand-alone full-text search server: Sphinx or SOLR?

I'm looking for a stand-alone full-text search server with the following properties: Must operate as a stand-alone server that can serve search requests from multiple clients Must be able to do "bulk indexing" by indexing the result of an SQL query: say "SELECT id, text_to_index FROM documents;" Must be free software and must run on Li...

How to merge matches from two distinct (not sharded) Lucene Indexes.

I have two separate indexes holding different fields that together contain all the searchable fields for an index. For example the first index holds the indexed text for all documents, and the second holds tags for each and every document. Note the example below is a bit wonky as I've changed the names of the entities. Index1: text ...

Searching and sorting by language

Hi all, I am testing Lucene.NET for our searching requirements, and I've got a couple of questions. We have documents in XML format. Every document contains multi-language text. The number of languages and the languages themselves vary from document to document. See example below: <document>This is a sample document, which is describing a <...

Faster search in Lucene - Is there a way to keep the whole index in RAM?

Is there a way of keeping the index in RAM instead of keeping it on the hard disk? We want to make searching faster. ...

Getting the Doc ID in Lucene

In Lucene, I can do the following: doc.GetField("mycustomfield").StringValue(); This retrieves the value of a column in an index's document. My question: for the same 'doc', is there a way to get the doc ID? Luke displays it, hence there must be a way to figure this out. I need it to delete documents on updates. I scoured the docs b...

Can you read from a lucene index while updating the index

I can't find a straightforward yes or no answer to this! I know I can send multiple reads in parallel, but can I query an index while a separate process/thread is updating it? ...

Zend_Search_Lucene: handling queries

Hi, I am trying to implement a search engine on my site. I am using Zend_Search_Lucene for this. The index is created like this: public function create($config, $create = true) { $this->_config = $config; // create a new index if ($create) { Zend_Search_Lucene_Analysis_Analyzer::setDefault( new Zend_...

Purpose / importance of the file modification date methods in Lucene's Directory?

I've built a Lucene Directory implementation for jdbm, an embedded Java database. Part of the Directory API are two methods related to "file" modification dates: touchFile and fileModified (javadoc). My question is, what is the purpose of these methods? I've searched the entire Lucene core source tree, and found no usage of these meth...

lucene.net combine multiple filters and no search terms

How can I filter across multiple fields in Lucene.Net? On one field I simply do: TermQuery tool = new TermQuery(new Term("Tool", "Nail")); Filter f = new QueryFilter(tool); If I now wanted to add a nail length to the filter, how can I do that? Also, I want the user to be able to do a search with no search term (i.e. by just c...

How to use n-grams approximate matching with Solr?

We have a database of movies and series, and as the data comes from many sources of varying reliability, we'd like to be able to do fuzzy string matching on the titles of episodes. We are using Solr for search in our application, but the default matching mechanisms operate at the word level, which is not good enough for short strings, like ...

Finding the position of search hits from Lucene

With Lucene, what would be the recommended approach for locating matches in search results? More specifically, suppose index documents have a field "fullText" which stores the plain-text content of some document. Furthermore, assume that for one of these documents the content is "The quick brown fox jumps over the lazy dog". Next a sear...

Keyword (OR, AND) search in Lucene

Hi, I am using Lucene in my portal (J2EE based) for indexing and search services. The problem is about the keywords of Lucene. When you use one of them in the search query, you'll get an error. For example: searchTerms = "ik OR jij" This works fine, because it will search for "ik" or "jij" searchTerms = "ik AND jij" This works fi...