tags:

views:

557

answers:

5

Is it possible to write a script in Perl that opens different urls and saves a screenshot of each of them?

+14  A: 

You could use WWW::Mechanize::Firefox to control a firefox instance and dump the rendered page with $mech->content_as_png.

Be aware that setting it up can pose quite a challenge, though.

If all works as expected, you can simply use a script like this to dump images of the desired websites, but you should start Firefox and resize it to the desired width manually (height doesn't matter, WWW::Mechanize::Firefox always dumps the whole page).

use WWW::Mechanize::Firefox;
use Path::Class qw/file/;

my $mech = WWW::Mechanize::Firefox->new(
  bufsize => 10_000_000, # PNGs might become huge
);
$mech->get('http://www.stackoverflow.com/');

my $fh = file( 'test.png' )->open( '> :raw' );
print $fh $mech->content_as_png();
willert
+2  A: 

You could also use Win32::IE::Mechanize to render the web page using IE, and then Win32::Screenshot to capture the page. You'll probably have to do a bit of work to figure out where to take the screenshot, but that shouldn't be too incredibly hard.

This will be a Windows platform only solution, of course, but may suffice.

Robert P
+4  A: 

Use the WWW::Selenium module, for which you'll need to have a Selenium Remote Control session up and running.

The capture_entire_page_screenshot() method should get you up and running.

From WWW::Selenium on CPAN:

$sel->capture_entire_page_screenshot($filename, $kwargs)

Saves the entire contents of the current window canvas to a PNG file...


A typical script:

use strict;
use warnings;
use WWW::Selenium;

my $sel = WWW::Selenium->new( host => "localhost", 
                              port => 4444, 
                              browser => "*iexplore", 
                              browser_url => "http://www.google.com",
                            );

$sel->start;
$sel->open("http://www.google.com");
$sel->capture_entire_page_screenshot("screenshot.png");
$sel->close;
Zaid
+4  A: 

Another approach, which doesn't require the use of a browser, is to use ImageMagick and HTML2PS to convert the image. Be warned however, this isn't trivial, and it's near impossible (last I tried) to get this working on Windows properly.

Once ImageMagick is installed, the simplest approach is to just run a system call to the convert program that ImageMagick installs. If you want a less hackish approach, you can use the PerlMagick ImageMagick API.

There is an excellent discussion on this approach you can find on PerlMonks.

Weegee
A: 

Use a third-party web service API like http://webshotspro.com/ (screenshots) or http://www.thumbalizr.com/ (thumbnails).

Anon Guy