views:

366

answers:

3

Are there any silver bullets out there for searching medium sized amounts of text data (hundreds of gigabytes)? Don't really care if it's commercial or open source. I should add that I need it to be C++ or C based.

+3  A: 

Apache Lucene is a commonly used indexing engine, not sure if it qualifies as a silver bullet or not. I've seen this combined with Carrot2 for clustering and search of the index.

Ninefingers
Is there a C++ implementation of Carrot2?
Leeks and Leaks
I don't think so, but Lucene is open-source as is Carrot2 - http://sourceforge.net/projects/carrot2/ so you could see how it works/implement a C++ version? I've always used the Java ones - never needed C++.
Ninefingers
A: 

There's also xapian.

A: 

use SPINX http://sphinxsearch.com/

kampre