views:

287

answers:

2

I'm looking for a Ruby library or gem (or set of gems) which will not only do spidering, but also collect the data into, say, a database, and allow basic searches on the data (i.e. a typical web search).

I've found several spidering libraries, so that part seems well covered (I was going to try Anemone first), but I can't find anything that will take the spidered data and allow querying on it. For lack of an existing one, I was going to write something myself with Anemone.

Any suggestions?

+1  A: 

That blog post might give you some pointers. Also, look into ferret for the search part.

agregoire
Great stuff; thanks agregoire. I will read up on these.
Pistos
A: 

There are lots of great stuff on github.com

khelll
Yep, already did that search, which is how I found Anemone.
Pistos