crawl

crawling a html page using php?

this website http://courses.westminster.ac.uk/CourseList.aspx which lists over 250 courses in one list, i wanted to get the name of each course and insert that into my mysql database using php, the courses are listed like this: <td> computer science</td> <td> media studeies</td> etc... is thier a way to do that in php, instead of me ...

Googlebot is crawling my site and entering ratings on my rating system

My rating system allows anonymous users to add ratings, but Google's crawler is rating things. How can I ensure that Googlebot won't follow the link? ...

Python web crawling and storing to mysql

Hi, Looking for few days for some simple solution for this, but I think that in this moment I am still on the beginning :) I need good web crawler written in Python to store complete page into mysql database. Small system that I am experimenting uses now PHP Sphider to crawl and store into database. I need something that works almost ex...

How to make "index.php#foo" load properly.

I am new to deep linking, so not sure how to do this. In order for the web site to be crawl able by all crawlers, I am using Hijax method which basically means all my links are tagged as follows: <a href=”index.php?foo=32” onClick=”navigate('index.php#foo=32'); return false”>Link</a> Due to this schema all of the pages on the web ...