I'm looking to crawl ~100 webpages that share the same structure, but the image I need has a different filename on each page.

The image tag is located at:

#content div.artwork img.artwork

and I need to download the file at that element's src URL.

Any ideas? I have the URLs in a .txt file, and I'm on a Mac OS X box.

A:

I'm not sure how to run a 'selector'-style query from the shell, but a Perl regex might do the job just as well:

for url in `cat urls.txt`; do wget -q -O- "$url"; done | \
  perl -nle 'print $1 if /<img[^>]+class="artwork"[^>]+src="([^"]+)"/'
Maxwell Troy Milton King
What's the best way to feed that wget a .txt file of URLs?
Peter Clark
The above should work if you are using bash; I'm not sure about other shells.
Maxwell Troy Milton King
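On feeding wget the .txt file: wget itself can read a URL list from a file via its standard `-i`/`--input-file` option, which avoids the shell loop (and its word-splitting pitfalls) entirely. A sketch:

```shell
# Fetch every page listed in urls.txt and concatenate them to stdout:
#   -q          silences progress output
#   -O -        writes all fetched pages to stdout
#   -i urls.txt reads the URL list, one URL per line
wget -q -O - -i urls.txt \
  | perl -nle 'print $1 if /<img[^>]+class="artwork"[^>]+src="([^"]+)"/'
```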