I love the Beautiful Soup scraping library in Python. It just works. Is there a close equivalent in Ruby?
Blindly posting Google hits is not the best way to answer a question. If you follow the link you posted, it tells you to use Hpricot instead.
jjnguy
2009-03-12 19:20:11
+3
A:
There's scRUBYt!, Rubyful-soup (no longer maintained), WWW::Mechanize, scrAPI and a few more.
Or you could just use Hpricot or Nokogiri for parsing.
SimonV
2009-03-12 19:24:00
+2
A:
Nokogiri is another HTML/XML parser. It's faster than Hpricot according to these benchmarks. Nokogiri uses libxml2 and is a drop-in replacement for Hpricot. It also has CSS3 selector support, which is pretty nice.
Edit: There's a new benchmark comparing Nokogiri, libxml-ruby, Hpricot and REXML here.
Ruby Toolbox has a category on HTML parsers here.
Jack Chu
2009-03-12 19:25:16
+1
A:
This image from Ruby Toolbox indicates the relative popularity of various parsers:
ski
2009-11-11 21:46:08