I am trying to use httrack (http://www.httrack.com/) to download a single page, not the entire site. For example, when using httrack to download www.google.com, it should only download the HTML found under www.google.com, along with all stylesheets, images and JavaScript, and not follow any links to images.google.com, labs.google.com, www.google.com/subdir/, etc.

I tried the -w option but that did not make any difference.

What would be the right command?

EDIT

I tried using httrack "http://www.google.com/" -O "./www.google.com" "http://www.google.com/" -v -s0 --depth=1, but then it doesn't copy any images.

What I basically want is to download just the index file of that domain along with all of its assets, but not the content of any external or internal links.

A: 

The purpose of HTTrack is to follow links. Try setting --ext-depth=0.
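
Untested, but combined with a depth limit the full invocation would presumably look something like this (assuming your build accepts the long-option spellings):

httrack "http://www.google.com/" -O "./www.google.com" -v --depth=1 --ext-depth=0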

Gregory Pakosz
A: 

Looking at the example:

httrack "http://www.all.net/" -O "/tmp/www.all.net" "+*.all.net/*" -v

The last part is a filter (an httrack scan rule using wildcards, not a true regex). Just make a filter that matches everything under the host you want.

httrack "http://www.google.com.au/" -O "/tmp/www.google.com.au" "+*.google.com.au/*" -v ---depth=2 --ext-depth=2

I had to use the localised domain, otherwise I got a redirect page. You should localise to whichever Google domain you get redirected to.
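
If you only want the front page plus its assets, it might be enough to drop the depth to 1 and add -n (--near), which the manual describes as fetching non-HTML files (such as images) located "near" a downloaded page; untested:

httrack "http://www.google.com.au/" -O "/tmp/www.google.com.au" -v --depth=1 --near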

Nazarius Kappertaal
That helped, but was not quite right. Could you please see my edit?
Max
This seems to copy the images and the JS.
Nazarius Kappertaal
A: 

Could you use wget instead of httrack? wget -p will download a single page and all of its “prerequisites” (images, stylesheets).
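
For example, something like this (the extra flags are optional: -k rewrites links for offline viewing, -H allows fetching requisites hosted on other domains, -E adds .html extensions where needed):

wget -p -k -H -E "http://www.google.com/"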

Kevin Reid
wget would be my fallback solution if httrack can't do the job.
Max