A friend of mine wants to collect data from a website. I recommended spidering as a fast way of automating the process. But when I looked at the website, I found it wasn't so simple after all.

First, a login protected by a CAPTCHA thwarts most spidering software. Is there a way I can log in manually and then reuse the session cookie to fetch all the other pages?
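In case it helps frame the question, here is the kind of thing I had in mind: copy the session cookie out of the browser after logging in by hand, and attach it to every automated request. This is only a sketch; the cookie name/value and URL are made-up placeholders:

```python
import urllib.request

# Placeholder: paste the real cookie string from the browser's dev tools
# after logging in manually (name and value below are invented).
SESSION_COOKIE = "ASPSESSIONID=ABC123DEF456"

def make_request(url: str) -> urllib.request.Request:
    """Build a request that presents the manually obtained login cookie."""
    return urllib.request.Request(url, headers={"Cookie": SESSION_COOKIE})

def fetch(url: str) -> bytes:
    """Fetch a page as the logged-in user."""
    with urllib.request.urlopen(make_request(url)) as resp:
        return resp.read()
```

Whether this works presumably depends on the site not tying the session to anything else (IP address, user agent, per-request tokens).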
Secondly, all pages are linked using `<div onclick="window.open('/blahblah.asp?id=123')">`, where the ids are not consecutively incremented. Since the links live in JavaScript handlers rather than `<a href>` tags, this thwarts wget.
Finally, all the data sits in pages that use hidden/shown `<div>`s for navigation.
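If I understand correctly, the show/hide is purely client-side, so the hidden content should still be present in the raw HTML and could be scraped wholesale, ignoring visibility. Something like this with the standard-library parser (the sample markup is invented):

```python
from html.parser import HTMLParser

class DivTextCollector(HTMLParser):
    """Collect the text of every <div>, visible or hidden alike.
    Because the hiding is done client-side (e.g. display:none),
    the data is all present in the downloaded HTML."""
    def __init__(self):
        super().__init__()
        self.depth = 0       # how many <div>s we are currently inside
        self.texts = []      # text fragments found inside divs

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "div" and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth and data.strip():
            self.texts.append(data.strip())

sample = '<div style="display:none">hidden data</div><div>shown data</div>'
parser = DivTextCollector()
parser.feed(sample)
print(parser.texts)
```

That is, the navigation divs would not need to be "clicked" at all.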

Does anyone have an idea for a quick (and dirty) solution to this?