I want to learn more about the full-text search. Recommend please a few good books in which would be well described used algorithms and data structures, which must describe how to write simple search engine. I use C++. Thanks!
+1
A:
Managing Gigabytes: Compressing and Indexing Documents and Images is the best (and probably only) comprehensive book on this subject.
Vijay Mathew
2010-10-12 05:22:29
looks interesting, but gigabytes seems quite small. I'd have expected Terrabytes at least! ;) ...I mean I manage several Terrabytes at home...
Mitch Wheat
2010-10-12 05:27:23
You don't index several terrabytes.
Kirk Broadhurst
2010-10-12 05:38:08
@Mitch The book lays down a good foundation.
Vijay Mathew
2010-10-12 06:00:09
@Kirk Broadhurst: Really? Are you sure?
Mitch Wheat
2010-10-12 06:33:52
Sorry Mitch, not trying to be a smartarse - it's a bit hard to believe. Normally the index is only the tip of the data iceberg, so an index of several terrabytes would imply a crapload of data.
Kirk Broadhurst
2010-10-12 11:28:13
+1
A:
Introduction to Information Retrieval. This book discussed many data structures, algorithms and related things to search engine.
taskinoor
2010-10-12 05:32:04