views:

1098

answers:

4

Is there a free tool that can read given webpage and take a screenshot of it?

+1  A: 

use selenium-rc

flybywire
+2  A: 

You can use the createScreenCapture method in awt.Robot. This method allows you to specify which portion of the screen to capture. So, you would still need to determine the coordinates of the window that contains the web page that you want to capture.

JZeeb
A: 

To render the HTML in pure java, you might take a look at Flying Saucer.

https://xhtmlrenderer.dev.java.net/

It renders XML/XHTML/CSS 2.1

I believe it only works on valid XML or XHTML, so if you need to render non-valid HTML, use a tool like neko to clean it up before passing it to flying saucer.

Sam Barnum
+4  A: 

To build on two of the answers above:

Rendering the HTML in Java then saving to an image - A few Java based HTML renders exist, all with different sets of drawbacks. The most common is the one built in. This is quite simple and can only render fairly basic HTML. The most intresting I know of is The Flying Saucer Project. This can render fairly complex XHTML but you will have to convert HTML before you can use it (JTindy may be able to help here). Taking a Swing component and creating a image is fairly simple, you just pass an BufferedImages graphics object and pass it to the Swing components paint method. Then splat that out with ImageIO.
A big advantage to this would be that the renderer would be headless. The disadvantage is that it would not be a perfect rendering and it would lack any plugins.

The second option requires you to start a web browser, work out where it is and then take a screen shot. Optionally you may also wish to strip out all the Firefox/IE/Opera/etc menus leaving you with just image. To get the dimensions of the web browser the simplest option would be to start it full screen. The other option would be to use something like JDICs browser component to include it as part of the Java application. It would then be able to specify where the HTML is being rendered on screen and then simply use Robot to create a screen shot of that area.
The big advantage to this is that it will give a perfect rendering (for a given browser). The two disadvantages is that it would require native code (or at least using a native component) and it could not be headless¹.

1) You could use a virtual frame buffer. But that is outside Java.

mlk
There is a browser in Eclipse. Have you experience with it?
Thorbjørn Ravn Andersen
Other than as a user I have not (the SWT APIs scare me). But a good point, that is likely better then the default built in one and better HTML support than Flying Saucer.
mlk
JTidy has only recently seen recent contributions. You might also try HtmlUnit (.sourceforge.net). +1 for the JDIC browser reference, it's a really ambitious project and needs more support.
Karl the Pagan