lucene

Document score effecting the sort order of results, Lucene

Even after passing a sortfield, score of the document is effecting the sort order of the search results. Is there a way to make lucene to ignore the document score when a specific sort field is passed? For ex: DocId Score SortFieldA SortFieldB 1 23.0041 200906030800 Test 2 32.2774 200906020...

How to read a Lucene index?

Hi! I'm working on a project for which I want to build a tag cloud by reading a Lucene index and pruning it down. I didn't set up the Lucene engine, it was someone else in the team, now I just want to read its index. Do you how to do that in Java? ...

How to acces a file under WEB-INF from a Java web application

Hi! Do you have any idea how to access files in WEB-INF/index folder from my application? I'm using OpenCMS for my application and I want to open a Lucene search index (with the help of Lucene IndexReader class) located at WEB-INF/index folder. Lucene jar is stored in WEB-INF/lib folder. ...

Why is my Lucene index getting locked?

I had an issue with my search not return the results I expect. I tried to run Luke on my index, but it said it was locked and I needed to Force Unlock it (I'm not a Jedi/Sith though) I tried to delete the index folder and run my recreate-indicies application but the folder was locked. Using unlocker I've found that there are about 100 ...

Using LinqToLucene With NHibernate and NHibernate.Search

Does anyone already use LinqToLucene with NHibernate.Search? How to getting started ? Thanks. ...

What does NHibernate.Search ContainedIn attribute do?

And how it differs from IndexedEmbedded? P.s. and what's the best source of information about NHibernate.Search? ...

What happened to the Spring Modules project?

I read on Spring web site that the Spring Modules project is now deprecated in favor of Spring Extensions. However, unless I am missing something, Spring Extensions does not support Lucene like Spring Modules did. Am I missing something? ...

Getting lucene to return only unique threads (indexing both threads and posts)

I have a StackOverflow-like system where content is organised into threads, each thread having content of its own (the question body / text), and posts / replies. I'm producing the ability to search this content via Lucene, and if possible I have decided I would like to index individual posts, (it makes the index easier to update, and m...

Need help with location aware search

Hi, I am trying location aware search with spatial example found in http://www.ibm.com/developerworks/java/library/j-spatial/#indexing.approaches. The schema.xml has geohash field ,but this field is not present in any of the .osm files(present in data folder) used to index. Am not able to understand how value is assigned to it , so th...

Syntax for hsin query in solr

I have created a database and have used dataimport for retrieving the data. I tried out the hsin query: http://localhost:8983/solr/select/?q=name:Denver AND _val_:"recip(hsin(0.611941,-1.567437,lat_rad,lng_rad,3963.205),1,1,0)"^100 Though there are records that are retived, when the value of the radius is drastically reduced(7.205), th...

Bizarre error with ASP.NET + Lucene website

ASP.NET newbie here I have coded up an ASP.NET website and running on win'08 (remotely hosted). The application queries 11 very large Lucene indexes (each ~100GB). I open IndexSearchers on Page_load() and keep them open and pass them by ref into the query methods based on user entered keywords. I can RDC in and run the site fine in VS...

Remove Expired Records in solr query syntax

Hi, I'm having a solr query syntax issue (I think) with solr 1.4. I'm trying exclude expired records from a solr query. However if the record doesn't have an expiry record, i would like to bring that record back also. E.g. To only get a list of record that haven't expired i am using this query: expirydate:[NOW/DAY TO *] Then I thoug...

Why are Solr Filters disable bypassed when doing a fuzzy searches?

My Solr setup contains the Filter LowerCaseFilterFactory for indexing and querying, so - the index only contains lowercase words and - every query is supposed to be turned lowercase before used for searching. When doing a non-fuzzy search like "mix:barba", they are working fine: - barba finds Barba and BARBA - Barba finds Barba and BARB...

Lucene: Wildcards are missing from index

Hi - i am building a search index that contains special names - containing ! and ? and & and + and ... I have to tread the following searches different: me & you me + you But whatever i do (did try with queryparser escaping before indexing, escaped it manually, tried different indexers...) - if i check the search index with Luke they ...

NHibernate Search Clustered Lucene Index

We are using NHibernate Search in an application which is going to be clustered. I have been reading up on the approaches for maintaining separate collections, in particular the master/slave configuration and I was wondering how to go about implementing it using MSMQ if indeed there is an implementation for this at this time. The JMS imp...

How to make sure Solr/Lucene won't die with java.lang.OutOfMemoryError?

I'm really puzzled why it keeps dying with java.lang.OutOfMemoryError during indexing even though it has a few GBs of memory. Is there a fundamental reason why it needs manual tweaking of config files / jvm parameters instead of it just figuring out how much memory is available and limiting itself to that? No other programs except Solr ...

Hibernate session not getting created.

Hi all, I am trying to a run an application with Hibernate annotation. I am getting the following exception Initial SessionFactory creation failed.java.lang.NoClassDefFoundError: net.sf.cglib.core.DebuggingClassWriter$1 Exception in thread "main" java.lang.ExceptionInInitializerError at tapes.test.validation.SessionUtil.<clinit>(Se...

where do i have to look in order to understand location based search concepts?

Hi, Inorder to understand - cartessian tiers,how are they contributing in location based search - What is happening internally when we give query to solr like http://localhost:8983/solr/select/?q=name:Minneapolis AND _val_:"recip(hsin(0.78, -1.6, lat_rad, lon_rad, 3963.205), 1, 1, 0)"^100 and other functions like ghhsin(),sqedis...

Lucene.NET: Retrieving all the Terms used in a particular Document

Is there a way to itterate through all of the terms held against a particular document in a Lucene.NET index? Basically I want to be able to retrieve a Document from the Index based on it's ID and then find the frequency with which each Term is used in that Document. Does anyone know a way to do this? I can find the number of Documents...

using hit highlighter in lucene

I have two questions regarding hit highlighter provided with apache lucene: see this function could you explain the use of token stream parameter. I have several large lucene document containing many fields and each field has some strings in it. Now I have found the most relevant document for a particular query. Now this document was f...