ansaurus

Question

Lucene: Fastest way to return the document occurance of a phrase?

Answer 1

+2 A:

Typically, writing custom hit collector is the fastest way to count the number of hits using a bitset as illustrated in javadoc of Collector.

Other method is to get TopDocs with number of results specified as one.

TopDocs topDocs = searcher.search(query, filter, 1);

topDocs.totalHits will give you the total number of results. I'm not sure if this is as fast as it involves calculating scores, which is skipped in aforementioned method.

These solutions are applicable for Java. You have to check equivalent technique in Python.

Shashikant Kore 2010-05-09 07:49:13

ansaurus

tags:

views:

answers:

Lucene: Fastest way to return the document occurance of a phrase?

related questions