views:

51

answers:

3

I need to put a little project together for myself, and I need some functionality to download a page for offline viewing. Is there a library that will download a given page and its embedded images, and edit the img tags to reflect the local locations of the images.

I know there are a lot of website downloaders out there, but I cant find something that i can use directly in my code.

I have some basic scripts done in python, so Python is very welcome. but pretty much any language will do.

A: 

You're looking for BeautifulSoup.

SLaks
A: 

How about python web crawler? http://code.google.com/p/pywebcrawler/

OR, Anemone (ruby)? http://anemone.rubyforge.org/

geff_chang
+1  A: 

Yes, BeautifulSoup + python urllib module

Dingo