Hi all, I was wondering what's the best way to save all the files that are retrieved when Selenium visits a site. In other words, when Selenium visits http://www.google.com I want to save the HTML, JavaScript (including scripts referenced in src tags), images, and potentially content contained in iframes. How can this be done?

I know getHTMLSource() will return the HTML content in the body of the main frame, but how can this be extended to download the complete set of files needed to render that page again? Thanks in advance!

A: 

Selenium isn't designed for this; you could either:

  1. Use getHtmlSource and parse the resulting HTML for references to external files, which you can then download and store outside of Selenium (a rough sketch of this follows the list).
  2. Use something other than Selenium to download and store an offline version of a website; I'm sure there are plenty of tools that could do this if you search. For example, Wget can perform a recursive download (http://en.wikipedia.org/wiki/Wget#Recursive_download).
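For option 1, a minimal sketch of the parse-and-download step in Python might look like the following. It assumes you already have the page source as a string from Selenium; the commented-out usage at the end assumes the old Selenium RC Python client (selenium.selenium with get_html_source), so adjust it to driver.page_source if you are on WebDriver. It only looks at src attributes on script/img/iframe tags and href on link tags, and it does not recurse into the files it downloads:

    import os
    import urllib.request
    from html.parser import HTMLParser
    from urllib.parse import urljoin, urlparse

    class ResourceCollector(HTMLParser):
        """Collect URLs referenced by script/img/iframe src attributes and link hrefs."""

        def __init__(self):
            super().__init__()
            self.resources = []

        def handle_starttag(self, tag, attrs):
            attrs = dict(attrs)
            if tag in ("script", "img", "iframe") and attrs.get("src"):
                self.resources.append(attrs["src"])
            elif tag == "link" and attrs.get("href"):
                self.resources.append(attrs["href"])

    def save_page(base_url, html, out_dir="saved_page"):
        os.makedirs(out_dir, exist_ok=True)
        # Save the main document first.
        with open(os.path.join(out_dir, "index.html"), "w", encoding="utf-8") as f:
            f.write(html)
        collector = ResourceCollector()
        collector.feed(html)
        for ref in collector.resources:
            url = urljoin(base_url, ref)  # resolve relative references against the page URL
            name = os.path.basename(urlparse(url).path) or "resource"
            try:
                urllib.request.urlretrieve(url, os.path.join(out_dir, name))
            except OSError as exc:
                print("could not fetch %s: %s" % (url, exc))

    # Example usage (assumes a Selenium RC server running on port 4444):
    # from selenium import selenium
    # sel = selenium("localhost", 4444, "*firefox", "http://www.google.com/")
    # sel.start()
    # sel.open("/")
    # save_page("http://www.google.com/", sel.get_html_source())

For option 2, something like "wget -r -l 1 -p -k http://www.google.com/" fetches the page plus its requisites (images, CSS, scripts) and rewrites the links so the saved copy renders locally, although wget will not execute any JavaScript.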

Is there any reason you want to use Selenium? Is this part of your testing strategy, or do you just want a tool that will create an offline copy of a page?

Dave Hunt
The reason we want to use Selenium is that it executes JavaScript, which is essential for reconstructing an entire page (including ad traffic).
Rick