zend-lucene

How to call a Zend lucene search function?

I inherited a Zend project devoid of comments and I didn't get to talk to the previous developer. Since I have no Zend experience I'm having some issues :) I'd like to print out some variables inside an function that indexes items from the site using Zend_Search_Lucene because I think something is going wrong here. From what I've read...

Zend: index generation and the pros and cons of Zend_Search_Lucene

I've never came across an app/class like Zend Search Lucene before, as I've always queried my database. Zend_Search_Lucene operates with documents as atomic objects for indexing. A document is divided into named fields, and fields have content that can be searched. A document is represented by the Zend_Search_Lucene_Do...

zend lucene problem with the word "mortgage"

I'm using Porter Stemmer to stem the words, and here's a problem I'm running into: Word "mortgage" is correctly stemmed to "mortgag" Word "mortgagee" is (arguably incorrectly) stemmed to "mortgage" There are approximately 100 documents with the word "mortgage" There is 1 document with word "mortgagee" When I build an index without put...

Zend Lucene - tokenizing swedish chars

I use Zend Lucene to index swedish texts. The problem is that lucene tokenizes words at swedish chars åäö. For example the word "världens" becomes two words "v" and "ldens" in the index. Is there a way to add characters that zend lucene should accept and not tokenize at? ...

locking a lucene folder

I am writing a wrapper around Zend's lucene implementation and wanted to add a function rebuildIndex() which reads all relevant fields from the database and re-creates the index file in a temporary folder. When the operation is finished, I want to replace the original folder with the new one. How can I lock the original lucene folder whi...

Zend Lucene displays blank screen when no results found

When I submit a query to Zend_Lucene with a string that exists in the index, the results are displayed as expected, however when string is not found, I get a blank page with no error messages. Code used as below: require_once 'Zend/Feed.php'; require_once 'Zend/Search/Lucene.php'; $index = Zend_Search_Lucene::open('data/my-index'); $que...

How to optimize indexing of large number of DB records using Zend_Lucene and Zend_Paginator

So I have this cron script that is deployed and ran using Cron on a host and indexes all the records in a database table - the index is later used both for the front end of the site and the backed operations as well. After the operation, the index is about 3-4 MB. The problem is it takes a lot of resources (CPU: 30+ and a good chunk of...

Lucene cakephp Hex number is too big: 0x100000000

Hi, I'm trying to use Zend Lucene with CakePHP. I'm seeing this error now. Notice (8): Hex number is too big: 0x100000000 [APP/vendors/Zend/Search/Lucene/Storage/File.php, line 271] Does anyone know how to fix that? Thanks, Tee ...

performance comparision between Zend Lucene and Java Lucene

Zend Lucene and Java Lucene are built in PHP and java repectively, and PHP language has a higher level than java. Just wondering How big the performance difference among these two, regarding to index building and data searching? Is it much more effective to let java create and rebuild index, and let php use the index? ...

Using Zend Lucene in Cakephp

Hello all I am creating a webapp in Cakephp, and am thinking of implementing a search function in it. I read about Zend Lucene providing the search capabilities for native PHP webapps. I have my web pages all created without using any kind of database functionality. How will I able to add webpages to the indexes? I don't mean the code,...

Best way to perform online re-indexing of tables for Zend_Search_Lucene-powered search

Hi, I'm using Zend_Search_Lucene for full-text search of records in several different tables in my application. I have just implemented this functionality, and currently the index is built upon first use of the search functionality after application deployment. This is obviously not what I would like in production. I'm looking for an e...

Regular expression for Text, Numbers and Decimals

I am trying to get a regular expression for text, decimals and decimal numbers. The expression must match all at once. The main problem is that I had to write an Analyzer for Zend_Search_Lucene in order to be able to search for decimal digits. Right now I can only search for texts and numbers. So I guess it to match decimal numbers to m...