screen-scraping

Copying data across tabs

Hello, I got two different forms in two different tabs. One has data from our system and the other one is an interface of another, external, system in wich we need to copy data into (XML or API integration not an option here) The this is that, having open both forms - in two different tabs - i need a greasemonkey script or something si...

Programmatically login to a website and redirect the user to the logged in page?

Hi, Right now, I have all the employees of my company login to an external website using the company id, username and a password. We are trying to integrate it into an intranet portal which should provide seamless access to this website without requiring the user to enter these credentials. Is there any way of doing this programmatical...

Saving HttpResponse/Request to file system

Here is my scenario. User fills out this large page which is dynamically created based off DB values. Those values can change. When the user fills out the page and hits submit we want to save a copy of the page as html on the server, this way if the text or wording changes, when they go back to view their posted information, it is histor...

What's the best way to access a website from a Windows Service?

I'm struggling to write a Windows Service that accesses a website, logs in using stored credentials, and downloads the HTML to parse it. What do you think is the best way to go about this? ...

Get coordinates of Google Maps markers

I am creating a databse containing the names and coordinates of all bus stops in my local area. I have all the names stored in my database, and now I need to add the coordinates. I am trying to get these of a website that contains them all as placemarks on a Google Map. It seems to me like they are being generated from a local server, an...

Extracting Window Contents

I need to extract window content if this is based on text, or at least the file path associated to that window. To-date, I have considered: 1. win32api 2. 3rd party libraries 3. wrapper classes However, I am not satisfied with the solutions. So any ideas how this can be done in a clean way? ...

Screen-scraping a site with a asp.net form login in C#?

Hi Friends, I've created a web application in asp.net so far. where i've tried to get some data(site scraping) from secure page of a web site.I've used the HttpWebRequest class for this functionality but i haven't accessed the secure page yet. Every time the login pages was scraped not secure page.I have the site user id and password an...

Screen-scraping of a secure page of any site on https:// with asp.net in C#

I've done site scraping of secure page of any site on http by below code: string cookiedata = "fsfsfsdfsfsfsfsfsdf"; NetworkCredential credential = new NetworkCredential("xxx", "xxx"); HttpWebRequest request = HttpWebRequest.Create("https://ysats.com") as HttpWebRequest; //set the user agent so it looks like IE to not...

What must I learn to parse dynamic HTML sites with PHP?

What I must to learn to write php web-site grabber (parser)? It must collect information from other websites, such as as weather forecast, wiki "on this day", some news and other useful and interesting "every day" information! what i must to read for writing m3u player on php? sorry for my bad english ...

How would you protect a database of links from being scraped?

I have a large database of links, which are all sorted in specific ways and are attached to other information, which is valuable (to some people). Currently my setup (which seems to work) simply calls a php file like link.php?id=123, it logs the request with a timestamp into the DB. Before it spits out the link, it checks how many requ...

HTML Agility Pack Screen Scraping XPATH isn't returning data

I'm attempting to write a screen scraper for Digikey that will allow our company to keep accurate track of pricing, part availability and product replacements when a part is discontinued. There seems to be a discrepancy between the XPATH that I'm seeing in Chrome Devtools as well as Firebug on Firefox and what my C# program is seeing. ...

Scraping landing pages of a list of domains

I have a reasonably long list of websites that I want to download the landing (index.html or equivalent) pages for. I am currently using Scrapy (much love to the guys behind it -- this is a fabulous framework). Scrapy is slower on this particular task than I'd like and I am wondering if wget or an other alternative would be faster given ...

How do I send an arrow key in Perl using the Net::Telnet module?

Using the Perl module Net::Telnet, how do you send an arrow key to a telnet session so that it would be the same thing as a user pressing the down key on the keyboard? use Net::Telnet; my $t = new Net::Telnet(); my $down_key=?; #How do you send a down key in a telnet session? t->print($down_key); This list of VT102 codes says that cur...

change label text from a VB6 binary (not source code)

Hi, we have a VB6 binary executable that comes with no source code. And we need to change the label text for that VB6 application from "AAA" to "BBB". Is there any way or tools that can do that? The closest tool I can find right now is microsoft UISpy, it can read all the other elements but not the label. I hope there is a tool that c...

Automated download of website content using ASP.net

Using ASP.net, what methods can I use to do the following: Open up a connection to a given URL to read HTML content Parse the given URL for hyperlinks, and place them in an array Loop through each hyperlink (only 1 level down), opening each one, saving the HTML contents in a table, and move to the next hyperlink until done. If ASP.ne...

Web scraping: how to get scraper implementation from text link?

I'm building a java web media-scraping application for extracting content from a variety of popular websites: youtube, facebook, rapidshare, and so on. The application will include a search capability to find content urls, but should also allow the user to paste a url into the application if they already where the media is. Youtube Down...

Screen capturing Flash

Hey everyone, Is there a way to play a SWF and screen capture it in a headless environment? Thanks! ...

How do I screen scrape a website and get data within div?

How can I screen scrape a website using cURL and show the data within a specific div? ...

Writing Crawler for Screen Scrapping

I want to write crawler for screen scrapping What I want is, I want to get price of particular hotel from a website, like here is website e.g. In the above URL, there is list of hotels and its price. I want to get the price of the beaufort Please Advise how to accomplish this. Thanks ...

How selectorgadget works?

How selectorgadget.com works? Is there any link/page that explain the algorithm behind selectorgadget? thanks ...