Hello, I wan't to write a script to get the source of a website which requires a post login. I need a shell script to do this. I want to parse some information. Any idea which language is the best choice for handling the http request and maybe cookies? Thank you.
+4
A:
Starting with shell programming, curl can do a POST to get you the source of a web page. -- See the -d option.
You can then use shell scripting with grep, sed or other options to parse the results.
More automated/sophisticated/complicated is to write a web scraper program using Perl, Ruby, pick-your-language
Larry K
2010-06-29 20:11:42
A:
Thank you very much.
This is nice too for ruby: http://scrubyt.org/
SurfingCat
2010-06-29 20:33:00