I would like to scrape the search results of this ASP.NET site using Ruby and preferably just using Hpricot (I cannot open an instance of Firefox): http://www.ngosinfo.gov.pk/SearchResults.aspx?name=&foa=0

However, I am having trouble figuring out how to go through each page of results. Basically, I need to simulate clicking on links like these:

<a href="javascript:__doPostBack('ctl00$ContentPlaceHolder1$Pager1$2','')" class="blue_11" id="ctl00_ContentPlaceHolder1_Pager1">2</a>
<a href="javascript:__doPostBack('ctl00$ContentPlaceHolder1$Pager1$3','')" class="blue_11" id="ctl00_ContentPlaceHolder1_Pager1">3</a>

etc.

I tried using Net::HTTP to handle the POST, but while the response contained the correct HTML, there were no search results in it (I'm probably not building the request correctly). In addition, the URL of the page contains no parameter indicating the page number, so it is not possible to force the results that way.
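From what I can tell, `__doPostBack` just fills in two hidden fields (`__EVENTTARGET` and `__EVENTARGUMENT`) and submits the page's single form, so the POST body presumably needs those plus the `__VIEWSTATE` (and `__EVENTVALIDATION`, if the page has one) copied verbatim from the previous response. A sketch of what I believe the request needs; the field names are the standard ASP.NET ones, and the pager target comes from the links above:

```ruby
require 'net/http'
require 'uri'

# Build the form fields for an ASP.NET __doPostBack request. The viewstate
# and event_validation values must be copied verbatim from the hidden
# <input> fields of the page you last fetched.
def postback_params(event_target, viewstate, event_validation = nil)
  params = {
    '__EVENTTARGET'   => event_target, # first argument of __doPostBack
    '__EVENTARGUMENT' => '',           # second argument (empty in these links)
    '__VIEWSTATE'     => viewstate,
  }
  params['__EVENTVALIDATION'] = event_validation if event_validation
  params
end

# Hypothetical flow with Net::HTTP (untested sketch; viewstate would come
# from parsing the first response):
#   uri = URI('http://www.ngosinfo.gov.pk/SearchResults.aspx?name=&foa=0')
#   res = Net::HTTP.post_form(uri,
#           postback_params('ctl00$ContentPlaceHolder1$Pager1$2', viewstate))
```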

Any help would be greatly appreciated.

A: 

If you're just getting started, you might want to check out Nokogiri. It's more lightweight and better-documented than Hpricot (which appears to have been abandoned).

Edit: Jakub Hampl is correct - Mechanize is what you're looking for to interact with websites. It builds on Nokogiri (which parses the HTML and XML).

Drew Johnson
+1  A: 

Even better, check out Mechanize. A good starting point on screen scraping is the railscasts.com episode on Mechanize.
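Since the pager links are `javascript:__doPostBack(...)` hrefs, Mechanize can't click them directly, but you can pull the event target out of the href and set the form's hidden fields yourself. The helper below is plain Ruby; the Mechanize flow in the comments is an untested sketch, and the form lookup is an assumption about the page:

```ruby
# Pull the first __doPostBack argument (the event target) out of a pager href.
def postback_target(href)
  href[/__doPostBack\('([^']+)'/, 1]
end

# Hypothetical Mechanize flow:
#   require 'mechanize'
#   agent = Mechanize.new
#   page  = agent.get('http://www.ngosinfo.gov.pk/SearchResults.aspx?name=&foa=0')
#   form  = page.forms.first
#   form['__EVENTTARGET']   = postback_target(pager_link['href'])
#   form['__EVENTARGUMENT'] = ''
#   page  = form.submit
```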

Jakub Hampl
Unfortunately, Mechanize does not handle JavaScript links. I'm not familiar with ASP.NET at all, but is there a way to manually mimic what the JavaScript is doing?
JillianK
Sorry, I know almost nothing about ASP.NET. Maybe checking out what exactly `__doPostBack` does and working out the URL it posts to would help? Also, what platform are you on? You could always harness a full WebKit instance, though that seems like overkill for something this simple.
Jakub Hampl