ansaurus

Question

Download HTML and Images with WGet without first few lines

Answer 1

A:

In PHP, you could use this function to strip out X lines:

function strip_toplines($string,$lines){
    $string = explode(PHP_EOL,$string);
    foreach($string as $line_num => $line){
        if($line_num>($lines - 1)){
            $output .= $line . PHP_EOL;
        }
    }
    return trim($output);
}

and then this:

strip_toplines(file_get_contents($url),6);

Jamza 2010-03-31 16:04:59

True, but I need to download all the images from the HTML as well.

St. John Johnson 2010-03-31 16:13:19

Answer 2

+1 A:

Devon_C_Miller 2010-03-31 16:18:43

Great find! I didn't even think to look at the robots file. Well, your alternate method gave me some issues (due to anchor links in the file), so instead I'm just bypassing the Robots file with `-e robots=off` Thank you!

St. John Johnson 2010-03-31 16:30:11

ansaurus

tags:

views:

answers:

Download HTML and Images with WGet without first few lines

related questions