views:

414

answers:

1

I'm writing a program (in Java) that needs to extract links from webpages. I'm using htmlParser (http://htmlparser.sourceforge.net/) but I'm only able to extract html links (defined with <a href="...">) and I don't know how to handle javascript code to extract links from... can you help me??

+2  A: 

You can use Rhino with DOM environment, written in JavaScript.
By the way it is written by John Resig.

Eldar Djafarov
I've never played with DOM, but I'll take a look at your link, thanks.
Raffo