ansaurus

Question

regexp in bash (downloading output form regexp)

Answer 1

+3 A:

Why don't you use wget ? It already have that feature :

wget -i --force-html yourfile.html

BatchyX 2010-09-19 17:30:22

+1: Can't get simpler than this.

codaddict 2010-09-19 17:35:34

Answer 2

A:

cut -f 2 -d '"' file-with-addresses.txt

cut is included in all posix shells. This command will split the line using the " as the delimiter and return the second "field". To download using wget Adam Rosenfield's method is fine.

cut -f 2 -d '"' file-with-addresses.txt | xargs wget

adamse 2010-09-19 17:30:25

Answer 3

+2 A:

Here's one way to do that using a combination of sed, xargs, and wget:

sed -n 's/.*<a href="\([^"]*\)">.*/\1/p' input-file | xargs wget

Adam Rosenfield 2010-09-19 17:31:16

Couple tweaks: you might want to change [^"]* to [^"]\+ to ensure the pattern appears at least once, and you might want to use xargs -n 1 so xargs will be called once for each address.

Adam Liss 2010-09-19 17:35:09

ansaurus

tags:

views:

answers:

regexp in bash (downloading output form regexp)

related questions