I have a number of POJO's, each with a String field that I am indexing and searching using Lucene. However, I seem to be missing an easy way to associate a "found" keyphrase with the (unique) object that it comes from. Does anyone know how to do this?
...
A month ago I adjusted my database schema. I added a column called ordinal.
I rebuilt my model and uploaded my changes.
Everything works fine apart from my instance of sfLucene will not rebuild.
I run
symfony lucene-rebuild frontend
But I get the error once it gets to the Model in question (others are fine):
propel exception: unk...
Hi all, I am looking for a keyword indexing library for java. I found Lucene in google search. I think it is a very popular one but just wondering if it is the best (in terms of speed performance) indexing library (of course, it can be subjective but your opinion should be good enough for a beginner like me)? Is the example in this site ...
Hi all, I am using Luke to view a Lucene index. There is a column named 'Rank'. What is the actual meaning of it? My guess is that the Rank means number of occurrence and the larger Rank number meaning the term is more significant. But I don't understand is that it is a full text search. If I search for 'apple', all the 'apple' index wil...
I have an incremental index on a shared hosting server. I would have used a console application to perform the indexing by the hosting provider does not allow console apps on the server.
I am thinking of using a web service to do the indexing (with some queuing mechanism). Is this a good idea? What are the pros/cons or alternatives?
...
I've got an array with data. For example
$data = array(
'test',
'test2',
'tost',
'testing',
'another',
'search'
);
I wanna do a search on this array. lets say "test~" with lucene. I would for that way want to have the first 5 results in the array, and as feedback i need an array with those 5 items. Is this poss...
Has anybody tried to create a lucene index of Amazon browse nodes? If so, how should one go about doing this? Does it cost anything to crawl the browse nodes yourselfe?
...
Hi,
I have googled a lot and also searched in stackoverflow.com about how to sort search results based on a Field Value in Lucene 3.0.2, but not found any useful data. I'm getting the search results from the index, based on the user query but not able to sort the results based on field like id or date.
I have pasted my code here for se...
Has anyone taken the sfLucenePlugin and made it work in Symfony 1.3 or 1.4?
The plugin requires pake functions and the new config class to be adjusted but looks like it could be done if persistent.
I have seen the Jobeet tutorial for implementing Zend Lucene but am not keen to lose all of the .yml functionality that sfLucene provides.
...
I am interviewing candidates for a position developing an application which relies heavily on Lucene. In addition to the usual questions I ask, I'd like to be able to ask one or two Lucene-specific questions that will give me a rough idea of how familiar they are with the library. The problem is that I have no experience with Lucene my...
http://lucene.apache.org/java/2_3_0/api/org/apache/lucene/misc/SweetSpotSimilarity.html
Implemented as: 1/sqrt( steepness * (abs(x-min) + abs(x-max) - (max-min)) + 1 ) .
This degrades to 1/sqrt(x) when min and max are both 1 and steepness is 0.5
Can anyone explain this formula for me? How steepness is decided and what is exactly ref...
With the NoSQL movement growing based on document-based databases, I've looked at MongoDB lately. I have noticed a striking similarity with how to treat items as "Documents", just like Lucene does (and users of Solr).
So, the question: Why would you want to use NoSQL (MongoDB, Cassandra, CouchDB, etc) over Lucene (or Solr) as your "dat...
I added PDF document to Solr
curl "http://localhost:8983/solr/update/extract?literal.id=doc2&captureAttr=true&defaultField=text&fmap.div=foo_t&capture=div" -F "[email protected]"
and I would like to query it for word "errors"
http://localhost:8983/solr/select/?q=errors&version=2.2&start=0&rows=10&indent...
I am working on a C++ project which has a huge code base and multiple components. I want to create a rich code browser for it which will give a visual studio like experience. I am thinking of an Adobe AIR app with Lucene or Swish-e as the backend text indexer. I have never used either of the two. If you have used one or both of them, can...
Hi all, I am using Lucene to index my documents. In my case, each document is rather in small size but having a large quantity (~2GB). And in each document, there are many repeating words or terms. I am wondering if it is the right way for me to do index using Lucene or what preprocessing I should do on the document before indexing.
Th...
Hi,
I am having a problem doing wildcard searches in lucene syntax using the edismax handler. I have Solr 4.0 nightly build from the trunk.
A general search like 'computer' returns results but 'com*er' doesn't return any results. Similary, a search like 'co?mput?r' returns no results. The only type of wildcard searches working currren...
I am using Solr 1.4.1 (lucene 2.9.3) on windows and am trying to understand ShingleFilter. I wrote the following code and find that if I provide more words than the actual phrase indexed in the field, then the search on that field fails i.e. no score contributed from that field with debugQuery=true.
Here is an example I created to repro...
Hi,
I have a list of 17 million sentences in a text file. Each sentence contains at max 200 characters. Each sentence is also accompanied by one or more annotation(s) with it. I have a list of unique annotations and a list of unique words obtained from the 17 million sentences. I have to create a sparse matrix with the rows as the uniqu...
Hi,
I have just started learning Lucene and would like to use it for indexing a table in an existing database. The way I have been thinking about this so far has been to 1. Create a 'Field' for every column in the table 2. Store all the Fields 3. 'ANALYZE' all the Fields except for the Field with the primary key 3. Each row in the table...
I am attempting query some results using a Boolean Query. However the query does not return any results.
Here is the FilterQuery I am running. This returns no results, even though the field foo contains bar, and the field foo3 contains bar3. And I have triple checked my fields to make sure that the fields do exist in the index.
+(foo...