Are there any silver bullets out there for searching medium sized amounts of text data (hundreds of gigabytes)? Don't really care if it's commercial or open source. I should add that I need it to be C++ or C based.
views:
366answers:
3
+3
A:
Apache Lucene is a commonly used indexing engine, not sure if it qualifies as a silver bullet or not. I've seen this combined with Carrot2 for clustering and search of the index.
Ninefingers
2010-02-21 17:33:07
Is there a C++ implementation of Carrot2?
Leeks and Leaks
2010-02-21 23:47:35
I don't think so, but Lucene is open-source as is Carrot2 - http://sourceforge.net/projects/carrot2/ so you could see how it works/implement a C++ version? I've always used the Java ones - never needed C++.
Ninefingers
2010-02-22 00:58:13