What I am trying to accomplish:

  1. HTTP GET the contents of a site (say google.com)
  2. Then have some sort of hook or filter that will catch all resources that this page tries to load (for instance the CSS files, all JavaScript files, all images, all iframes, etc)

The first thing that comes to mind is to parse the downloaded page/code and extract all tags that might link to a resource. However, there are a lot of them and some are tricky, like an image background declared in CSS, for example:

body {background-image:url('paper.gif');} 
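Just to illustrate what I mean by parsing, a naive extraction pass might look something like this (a rough Python sketch; BeautifulSoup and a regex for `url()` are just what came to mind, not a requirement):

    # Rough sketch: naive static extraction of resource URLs.
    # Assumes Python with BeautifulSoup; misses anything JavaScript builds at runtime.
    import re
    import urllib.request
    from urllib.parse import urljoin
    from bs4 import BeautifulSoup

    page_url = "http://google.com"
    html = urllib.request.urlopen(page_url).read().decode("utf-8", errors="replace")
    soup = BeautifulSoup(html, "html.parser")

    resources = set()

    # Tag attributes that commonly point at resources
    for tag, attr in (("img", "src"), ("script", "src"), ("iframe", "src"),
                      ("link", "href"), ("source", "src"), ("embed", "src")):
        for el in soup.find_all(tag):
            if el.get(attr):
                resources.add(urljoin(page_url, el[attr]))

    # url(...) references inside <style> blocks and style="" attributes
    css_text = " ".join(s.get_text() for s in soup.find_all("style"))
    css_text += " " + " ".join(el["style"] for el in soup.find_all(style=True))
    for match in re.findall(r"url\(['\"]?([^'\")]+)['\"]?\)", css_text):
        resources.add(urljoin(page_url, match))

    for url in sorted(resources):
        print(url)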

Also, I need to catch all resources that are intended to be loaded via JavaScript, for instance a JS function that generates a URL and then uses it to load the resource.

For this reason I think having some sort of hook or filter/monitor is what I need.

The programming language does not matter (although something that works on a Unix box would be nice).

UPDATE: This needs to be an automated solution.

Thank you.

+1  A: 

The simplest way to do this would be to write a Fiddler addon.

SLaks
A: 

You can always set up a proxy like Fiddler and look at the traffic; anything apart from the initial call for the page will be the additional resources being requested.
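On a Unix box, mitmproxy is a Fiddler-like alternative. A minimal addon sketch (assuming mitmproxy is installed and the browser is configured to use it as its proxy) would just log each request:

    # log_requests.py -- a minimal mitmproxy addon (sketch).
    # Run with:  mitmdump -s log_requests.py
    # and point the browser at the proxy (default 127.0.0.1:8080).
    from mitmproxy import http

    def request(flow: http.HTTPFlow) -> None:
        # Every resource the page loads -- CSS, JS, images, iframes,
        # XHR/JS-generated requests -- passes through here.
        print(flow.request.pretty_url)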

Oded
Doesn't solve the question of how to automate the fetching of the page, and the interpretation of the JavaScript.
Pekka
Well, the issue is that it's supposed to be an application; it's not that I will run it manually. I need to get the contents of the page and then, having this content, use it. If we are going with the run-it-in-the-browser path, I need something like JS to gather the links.
Alexandru Luchian
+1  A: 

I am assuming you are looking for a fully automated solution.

There are several approaches to parsing the file (in all major scripting languages, wget-based, and others), but none I know of that can actually interpret JavaScript (because that's what this would come down to).

I think the only option you have is to set up a Firefox (or other modern browser) instance on your Unix/Linux box, feed it a URL, and watch/block all outgoing connections it attempts to make. On a client PC, this is the contents of the "Net" tab in Firebug. Whether and to what extent this can be automated without actually rewriting parts of the browser, I don't know. Maybe Selenium RC or one of the other tools from the Selenium suite is a starting point.
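Something along these lines might work (a sketch using the modern Selenium WebDriver Python bindings rather than Selenium RC; it assumes geckodriver is installed and a capturing proxy such as the mitmproxy addon above is already listening on 127.0.0.1:8080):

    # Sketch: drive Firefox through a local capturing proxy with Selenium.
    import time
    from selenium import webdriver

    options = webdriver.FirefoxOptions()
    options.set_preference("network.proxy.type", 1)            # manual proxy config
    options.set_preference("network.proxy.http", "127.0.0.1")
    options.set_preference("network.proxy.http_port", 8080)
    options.set_preference("network.proxy.ssl", "127.0.0.1")
    options.set_preference("network.proxy.ssl_port", 8080)

    driver = webdriver.Firefox(options=options)
    try:
        driver.get("http://google.com")
        time.sleep(10)  # crude wait so JS-triggered requests have time to fire
    finally:
        driver.quit()
    # The list of fetched resources is whatever the proxy logged while the page was open.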

Pekka
I am not a big expert in this, but maybe it's better to use the Gecko or WebKit libraries/engines for this purpose?
Alexandru Luchian
@Heavy Bytes, probably yes (my own knowledge of browser internals doesn't go that deep). However, I'm pretty sure the JavaScript engine is a separate part from the rendering engine, which may cause problems when building an application.
Pekka