What is the best document search engine (perhaps Plucene?) to integrate into Perl code that allows customizing and tuning the search algorithm?

+1  A: 

First up, seriously consider Lucene instead of Plucene: it's much faster, and the index files it generates are more compatible with other implementations of Lucene. There's a dependency on the CLucene library, but it's worth it if you can install it.

http://search.cpan.org/dist/Lucene

Actually, KinoSearch is the recommended indexing engine for use with Perl.
jrockway
A: 

Consider using KinoSearch instead. Then, read its tutorial.

cjm
Does KinoSearch support customizing search results?
NoahD
Yes. It supports the standard +field:(value) and -field:(value) constructs, e.g. +site:(stackoverflow.com). Kino rocks.
RET
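For anyone unfamiliar with that syntax, here is a rough sketch of what the + (required) and - (prohibited) field clauses mean. It is a toy Python illustration over an in-memory list, not KinoSearch's query parser or API; the function names, the exact-match rule, and the sample documents are all made up for the example.

```python
import re

# Toy illustration only: this is NOT KinoSearch's query parser.
# It recognizes clauses of the form +field:(value) and -field:(value).
CLAUSE = re.compile(r'([+-])(\w+):\(([^)]*)\)')

def parse_query(query):
    """Split a query into required (+) and prohibited (-) field clauses."""
    required, prohibited = [], []
    for sign, field, value in CLAUSE.findall(query):
        target = required if sign == '+' else prohibited
        target.append((field, value))
    return required, prohibited

def matches(doc, required, prohibited):
    """A document matches if it has every required value and no prohibited one."""
    def has(field, value):
        # Assumption: exact string match on the field, no tokenizing or stemming.
        return doc.get(field) == value
    return (all(has(f, v) for f, v in required)
            and not any(has(f, v) for f, v in prohibited))

def search(docs, query):
    """Return the documents that satisfy every clause in the query."""
    required, prohibited = parse_query(query)
    return [d for d in docs if matches(d, required, prohibited)]

# Hypothetical sample documents.
DOCS = [
    {"title": "Perl search engines", "site": "stackoverflow.com", "lang": "perl"},
    {"title": "Java search engines", "site": "stackoverflow.com", "lang": "java"},
    {"title": "Perl search engines", "site": "example.org", "lang": "perl"},
]

results = search(DOCS, "+site:(stackoverflow.com) -lang:(java)")
```

A real engine would do this against an inverted index with analyzed text rather than by scanning dictionaries, but the include/exclude logic is the same idea.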
+2  A: 

You should also look at Swish-e. It's really fast, really easy to set up, and really flexible. I've used it successfully in lots of projects, and it has lots of CPAN support too.

mpeters
I love SWISH-E and it comes with Perl libraries. I have used it on both Windows and Unix.
meme
+1  A: 

Just to suggest something else: I've enjoyed Sphinx. One of the next releases is supposed to get proper "real time updates", but until then it rebuilds its indexes very quickly.

Ask Bjørn Hansen
A: 

What platform? What resources are available? Is this monolithic, or something that is deployed? Does this have to handle issues like revision control as well?

hpavc
It would be Linux and monolithic.
NoahD