I found the HTML Agility Pack useful and easy to use for screen scraping web sites. What's the equivalent library for HTML screen scraping in Java, Ruby, Python?
views:
953answers:
2
+3
A:
BeautifulSoup is the standard Python screen scraping tool.
Recently, however, I used the (incomplete at the moment) pyQuery, which is more or less a rewrite of jQuery into python, and found it to be very useful.
cobbal
2009-06-29 20:59:59
lxml is good too.
Lennart Regebro
2009-06-30 04:40:45
+2
A:
Found what I was looking for: http://stackoverflow.com/questions/2861/options-for-html-scraping
Sajee
2009-06-29 21:13:47