lucene

Synchronizing Lucene indexes across 2 application servers

I have an ASP.NET web application hosted on a web server (IIS 7). It uses Lucene for search functionality. Lucene search requests are served by .NET WCF services sitting on 2 application servers (IIS 7). The 2 application servers are load balanced using "netscaler". Both these servers host a .NET Windows service which updates search indexes ...

Solr: Populate Separate Fields from a Tokenizer

I've created a custom Tokenizer in Solr that looks for named entities. I would like to be able to use this information to populate separate fields within the Lucene/Solr document. As an example, I want to populate a multivalued field called "locations" with all the location names that were extracted from the text. To extract location...

Why does the size of the Lucene index increase if I index the same data?

I implemented Hibernate Search, which is based on Lucene, in my application. Whenever I index the database, the size of the Lucene indexes increases. But the query returns the same number of results every time. Why does the size of the Lucene index increase each time if I index the same data every time? FullTextSession fullTextSession = Search.get...

Improving performance of Location based search using Lucene

Hi, I'm using Lucene for a job search portal built on .NET. I am facing some performance issues in the following use case: when doing a job search, the user can select a job location (for example: Atlanta, GA) and a radial distance (say 50 miles). The time required to return job search results from Lucene is pretty high. FYI,...
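
One common way to cut the cost of a radius search is to narrow the candidates with a latitude/longitude bounding box first and only then compute exact distances. The sketch below is not taken from the question and does not use any Lucene API; the class name `GeoBox`, its method names, and the Earth radius constant are assumptions made for illustration. It assumes the search area is away from the poles and does not cross the 180° meridian.

```java
final class GeoBox {
    // Mean Earth radius in miles (approximate; an assumption of this sketch).
    static final double EARTH_RADIUS_MILES = 3958.8;

    /** Returns {minLat, maxLat, minLon, maxLon} in degrees for a circle around (lat, lon). */
    static double[] boundingBox(double lat, double lon, double radiusMiles) {
        // Search radius expressed as an angle in radians.
        double angular = radiusMiles / EARTH_RADIUS_MILES;
        double dLat = Math.toDegrees(angular);
        // The longitude extent grows with latitude; only valid away from the poles.
        double dLon = Math.toDegrees(
                Math.asin(Math.sin(angular) / Math.cos(Math.toRadians(lat))));
        return new double[] {lat - dLat, lat + dLat, lon - dLon, lon + dLon};
    }

    /** Great-circle distance in miles between two points in degrees (haversine formula). */
    static double haversineMiles(double lat1, double lon1, double lat2, double lon2) {
        double dPhi = Math.toRadians(lat2 - lat1);
        double dLambda = Math.toRadians(lon2 - lon1);
        double a = Math.sin(dPhi / 2) * Math.sin(dPhi / 2)
                + Math.cos(Math.toRadians(lat1)) * Math.cos(Math.toRadians(lat2))
                * Math.sin(dLambda / 2) * Math.sin(dLambda / 2);
        return 2 * EARTH_RADIUS_MILES * Math.atan2(Math.sqrt(a), Math.sqrt(1 - a));
    }
}
```

The four box values could drive two range filters on the indexed latitude and longitude fields, with `haversineMiles` applied only to the hits that survive the box.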

Is excessive use of Lucene good?

In my project, all searching and listing of content depends on Lucene. I am not facing any performance issues. Still, the project is in the development phase and has a long way to go before production. I have to find out the performance issues before the project is completed at a larger scale. Whether the excessive use of Lucene is feasible or no...

Indexing properties files

I need to index a large number of Java properties and manifest files. The data in the files is just key-value pairs. I am thinking of using Lucene for this. However, I do not need any real full-text search capabilities, as the data is quite structured. I only need to search for exact matches of property values, and the property key is a...

How to know when the Lucene index generation process is completed

Hi, I have a .NET Windows service which generates Lucene search indexes every night. I first get all the records from the database and add them to the Lucene index using IndexWriter's AddDocument method, and then call the Optimize method before returning from the method. Since the records fetched are fairly large, indexing takes around 2-3 minutes to...

SpatialQuery for location based search using Lucene

Hi, my Lucene index has latitude and longitude fields indexed as follows: doc.Add(new Field("latitude", latitude.ToString() , Field.Store.YES, Field.Index.UN_TOKENIZED)); doc.Add(new Field("longitude", longitude.ToString(), Field.Store.YES, Field.Index.UN_TOKENIZED)); I want to retrieve a set of documents from this index whose ...

Zend Search Lucene

I have a database that I would like to leverage with Zend_Search_Lucene. However, I am having difficulty creating a "fully searchable" document for Lucene. Each Zend_Search_Lucene document pulls information from two relational database tables (Table_One and Table_Two). Table_One has basic information (id, owner_id, title, description, l...

ConstantScoreRangeQuery in Lucene

How is Lucene's ConstantScoreRangeQuery better than the old RangeQuery? In what case should you still use RangeQuery? ...

Correctly indexing latitude and longitude values in Lucene

Hi, I am working on a "US based nearest city search within a given radius" functionality using the Lucene API. I am indexing each city's lat and long values in Lucene as follows: doc.Add(new Field("latitude", paddedLatitude, Field.Store.YES, Field.Index.UN_TOKENIZED)); doc.Add(new Field("longitude", paddedLongitude, Field.Store.YES, Field.Index.U...
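
The excerpt does not show how `paddedLatitude` and `paddedLongitude` are built. A minimal sketch of one possible padding scheme follows, assuming the goal is that string order in an untokenized field matches numeric order. The class `GeoPad`, its method names, the offsets, and the 6-decimal width are all hypothetical choices for illustration, not the asker's code.

```java
import java.util.Locale;

final class GeoPad {
    // Shift the value so it is never negative, then zero-pad to a fixed width
    // (10 characters, 6 decimals) so that lexicographic order equals numeric order.
    private static String pad(double value, double offset) {
        return String.format(Locale.ROOT, "%010.6f", value + offset);
    }

    /** Latitude in [-90, 90] is shifted into [0, 180]. */
    static String padLatitude(double lat) {
        return pad(lat, 90.0);
    }

    /** Longitude in [-180, 180] is shifted into [0, 360]. */
    static String padLongitude(double lon) {
        return pad(lon, 180.0);
    }
}
```

With this scheme, `GeoPad.padLongitude(-118.0)` sorts before `GeoPad.padLongitude(-84.0)`, which a plain `ToString()` of negative numbers would not guarantee. The same shift has to be applied to the bounds of any range query.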

Lucene Hebrew analyzer

Does anybody know whether one exists? I've been googling this for months... Thanks ...

Why doesn't Lucene remove docs?

Hi, I'm using Lucene.NET 2.3.1 with a MultiSearcher. For testing purposes, I'm indexing a database with about 10000 rows. I have two indexes and I select randomly which to insert each row in. This works correctly, but as Lucene doesn't have update capabilities, I have to test if the row exists (I have an Id field) and then delete it. ...

Multi-Term Wildcard queries in Lucene?

I'm using Zend_Search_Lucene, the PHP port of Java Lucene. I currently have some code that will build a search query based on an array of strings, finding results for which at least one index field matches each of the strings submitted. Simplified, it looks like this: (Note: $words is an array constructed from user input.) $query = new...

Need help regarding Lucene index/query

I want to have a "citystate" field in the Lucene index which will store various city/state values like: Chicago, IL; Boston, MA; San Diego, CA. How do I store these values (should the field be tokenized or non-tokenized?) in Lucene, and how do I generate a query (should it be a PhraseQuery, a TermQuery, or something else?) which gets me all records who...

How to find a match within a single term using Lucene

I am using the Lucene search engine but it only seems to find matches that occur at the beginning of terms. For example: Searching for "one" would match "onematch" or "one day a time" but not "loneranger". The Lucene doc says it doesn't support wildcards at the front of a search string so I am not sure whether Lucene even searches inter...

Lucene - Wildcards in phrases

I am currently attempting to use Lucene to search data populated in an index. I can match on exact phrases by enclosing them in quotes (i.e. "Processing Documents"), but cannot get Lucene to find that phrase by doing any sort of "Processing Document*". The obvious difference being the wildcard at the end. I am currently attemptin...

How to exclude searching specified fields using Zend Search (Lucene)

I've built a search index using the PHP Zend Framework Search (based on Lucene). The search is for a buy/sell website. My search index includes the following fields: item-id (UnIndexed), item-title (Text), item-description (UnStored), item-tags (Text), item-price (Keyword), seller-id (UnIndexed), seller-name (Text). I want the user to sear...

Manage Zend_Search_Lucene Index on a load balanced environment

Each server in the cluster has a search index that is synced from one of the servers every 15 minutes. This was done because appending to an index can't happen on NFS because of flock; see documentation (otherwise the index would be on a shared folder that all servers access). The issue that I'm running into is that if an action is t...

Lucene and access control (visibility of comments)

Imagine this simple scenario for full text search: Articles with Comments. I want to search articles also by text in comments. That alone is fairly simple to implement. Not all comments are visible to all users though. The user who writes a comment can also restrict its visibility to a concrete Role (so a comment has 2 fields: text and role). ...