lucene

Lucene - comparing data in multiple indexes

Is it possible to compare data from multiple Lucene indexes? I would like to get documents that have the same value in similar fields (like first name, last name) across two indexes. Does Lucence support queries that can do this? ...

Creating a search over a mySQL database

I am trying to create a search for my website over a mySQL database. I started down the line of using Sphinx but was hesitant when learning that the index doesn't update in real time. I did see they have an almost real time update but I am concerned this doesn't fit my system well because new content is added to the database on a minute ...

Adding new terms in the Gosen Dictionary

Good day, How do you add new terms in the Gosen's Dictionary? Thanks, Franz ...

Any tutorial or code for Tf Idf in java

Hi All, I am looking for a simple java class that can compute tf-idf calculation. I want to do similarity test on 2 documents. I found so many BIG API who used tf-idf class. I do not want to use a big jar file, just to do my simple test. Please help ! Or atlest if some one can tell me how to find TF? and IDF? I will calculate the results...

Hibernate Search querying ?

Greetings My domain model is as follows class Species { private String name; .. .. List<Family> families; } class Family{ private String name; private String locusId; .. List<Member> members; } class Members{ private String name; private String repTranscript; } I want to use 'Hibernate Search' to execute queries like o...

Faceting with Solr using "string" fields, "text" fields and "copy" fields

Hi, I have a problem with Solr and Faceting and wondering if anyone knows of the fix. I have a work around for it at the minute, however i really want to work out why my query isn't working. Here is my Schema, simplified to make it easier to follow: <fields> <field name="uniqueid" type="string" indexed="true" required="true"/> ...

Full Text Search like Google

I would like to implement full-text-search in my off-line (android) application to search the user generated list of notes. I would like it to behave just like Google (since most people are already used to querying to Google) My initial requirements are: Fast: like Google or as fast as possible, having 100000 documents with 200 hundr...

How to index rows dependent on column values with Hibernate Search / Lucene?

Is it possible to use hibernate search/lucene to index some entity based on values of some fields? For example, let's take the following example: A product has several properties with values. e.g. property names could be color, amount, order-date, price, whatever... PRODUCT ( name description ... ) PROPERTY ( id nam...

Zend Lucene - tokenizing swedish chars

I use Zend Lucene to index swedish texts. The problem is that lucene tokenizes words at swedish chars åäö. For example the word "världens" becomes two words "v" and "ldens" in the index. Is there a way to add characters that zend lucene should accept and not tokenize at? ...

Inverted search: Phrases per document

I have a database full of phrases (80-100 characters), and some longish documents (50-100Kb), and I would like a ranked list of phrases for a given document; rather than the usual output of a search engine, list of documents for a given phrase. I've used MYSQL fulltext indexing before, and looked into lucene, but never used it. They bot...

Digg-like search result ranking with Lucene / Solr ?

I'm using Solr for search. I have documents that have an integer field "popularity". I want to rank results by a combination of normal fulltext search relevance and popularity. It's kinda like search in digg - result ranking is based on the search relevance as well as how many digs a posting has. I don't have any specific ranking a...

Sitecore Lucene indexing - save child field values in parent Lucene doc

I have a Sitecore content structure where any single item can have a number of child items that are used to store enumerable content for lists (obviously a fairly standard approach). I am hoping to index these items but store their index data against the parent doc in Lucene. This should hopefully speed up the search bit, by saving time ...

Lucene with PHP

Can I use Lucene with PHP ? I don't want to use Zend. Can I use in native PHP (not framework) ? ...

Sorting Lucene docs in Luke

Hi, I want to fire up a query in luke for luncene indexed document. I want to sort the results before they are fetched. What is the lucene syntax to sort the data. I don't need java/or-language-specific code to sort; instead raw lucene syntax to sort the data? Can anyone give me a sample example for sorting by 2 fields in descending o...

Lucene SpanNearQuery partial matching

Good day, Given a document {'foo', 'bar', 'baz'}, I want to match using SpanNearQuery with the tokens {'baz', 'extra'} But this fails. How do I go around this? Thanks, Franz Sample test (using lucene 2.9.1) with the following results: givenSingleMatch - PASS givenTwoMatches - PASS givenThreeMatches - PASS givenSingleMatch_andExtra...

Update specific field on SOLR index

I want to using solr for search on articles I have 3 table: Group (id , group name) ArticleBase (id, groupId, some other field) Article(id, articleBaseId, title, date, ...) in solr schema.xml file i just define all article field that mixed with ArticleBase table (for use one index on solr) like this: (id, articleBaseId, groupId, ...

Is it possible to obtain real time search results sorted by frequently updating field with Lucene 3.0 in Java

Consider following assumptions: I have Java 5.0 Web Application for which I'm considering to use Lucene 3.0 for full-text searching There will be more than 1000K Lucene documents, each with 100 words (average) New documents must be searchable just after they are created (real time search) Lucene documents have frequently updating inte...

looking for a db abstraction/substitute that actually works

i am looking for a form of data storage that will answer a few requirements. i realize these requirements are non-standard, and for now i'm using activerecord and ORM solutions like everyone else, but this is my "holy grail" - if you know of anything like this, i would be eternally grateful: pure PHP multiple repositories, preferably f...

Java Lucene English Stemmer?

Hello, I need help indexing and searching english text using Java Lucene over Google App Engine. The only solution I have found so far was the SnowballAnalyzer (in the contrib packages), but it only supports Lucene 3.0, and GAELucene only supports lucene 2.3.1. Just changing jars doesn't really work.. Can anyone help me index my text w...

How do I query on a number in Zend Lucene?

How can I create a Query to search for a number? $query->addTerm(new Zend_Search_Lucene_Index_Term('1', 'id'), null); $query->addTerm(new Zend_Search_Lucene_Index_Term('frank', 'name'), null); $queryP = Zend_Search_Lucene_Search_QueryParser::parse($query); $hits = $ix->find($queryP); echo $queryP; Returns; ...