Hi, I have a piece of code that fetches data when given an ID. If I give it an ID of 1230, for example, the code fetches the article data with ID 1230 from an external web site and inserts it into a DB.

Now, the problem is that I need to fetch all the articles, let's say from ID 00001 to 99999. If I do a 'for' loop, after 60 seconds the PHP internal time limit stops the loop. If I use some kind of header("Location: code.php?id=00001") or header("Location: code.php?id=".$ID), increment $ID++ and then redirect to the same page, the browser stops me because of its infinite-redirect protection.

Please HELP!

A: 

If your server lets you, this is probably the best solution: just remove the time limit for this script.

set_time_limit(0);
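A rough sketch of how the whole script could look with the limit removed; FetchItem() is just a placeholder for whatever function actually does the fetch and insert:

    set_time_limit(0); // no execution time limit for this script

    for ($id = 1; $id <= 99999; $id++) {
        FetchItem($id); // placeholder for your fetch-and-insert code
    }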
Matchu
A: 

Well, there are several ways you can do this.

The best way to do this is to set up a cron to execute your scraper every X minutes.
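For example, a crontab entry along these lines (the path and the every-5-minutes schedule are just placeholders) would run the scraper regularly:

    */5 * * * * php /path/to/scraper.php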

This being said, you will need to keep track of what ID you're currently at.

So if you set up a function to write to a file, you can do it the following way (there's a sketch below):

--

Open the file (get the current ID).
Start the parser at that ID and run it 60 times.
Insert the data.
Open the file and update it with the new ID.
Close the file and exit.

This will run over the space of a few hours, or however long it takes.
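A rough sketch of that approach, assuming the current ID is kept in a plain text file called lastid.txt and FetchItem() stands in for the real fetch-and-insert function:

    // scraper.php - run from cron every X minutes
    $file = __DIR__ . '/lastid.txt';
    $id = file_exists($file) ? (int) file_get_contents($file) : 0;

    // process the next 60 IDs, then stop and wait for the next cron run
    for ($i = 1; $i <= 60 && $id < 99999; $i++) {
        $id++;
        FetchItem($id);                // placeholder for your fetch-and-insert code
        file_put_contents($file, $id); // remember where we got to
    }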

  1. If you're doing this manually, and you're sitting there refreshing every time the script finishes, then you can use sessions instead of writing the ID to a file:

    session_start();
    $id = (isset($_SESSION['position']) ? $_SESSION['position'] : 0);
    for ($i = $id; $i <= 99999; $i++)
    {
       FetchItem($i); // Or whatever function it is you use!
       // Update the id for the next run.
       $_SESSION['position'] = $i;
    }
    
  2. If you're willing to stretch your server's limits, you can extend the 60 seconds using set_time_limit(120) for 120 seconds, or whatever you prefer.

RobertPitt
...meh. Cron is really only the way to go if he has a hugely busy website that would be taken down by running that script, or if he plans to keep collecting this data continuously rather than all in one go. Really, this script should just be run on his computer rather than a remote host, anyway.
Matchu
Totally agree, scraping takes up too many server resources. If I were scraping, I would set up a cron to run during the hours my sites are least busy!
RobertPitt
I'm running this script on localhost, then I will put the DB online by importing it, so that's not a problem. The problem I have is that I don't know cron. But I think I will do it by setting set_time_limit(0).
Jonathan
A: 

If your server won't let you change the script time limit, just have your script check the database for the last inserted article in your sequence and start from there.
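A minimal sketch of that idea, assuming a PDO connection and an articles table with an id column (all names are placeholders):

    // find where the last run stopped, then continue from the next ID
    $pdo = new PDO('mysql:host=localhost;dbname=mydb', 'user', 'pass');
    $last = (int) $pdo->query('SELECT MAX(id) FROM articles')->fetchColumn();

    for ($id = $last + 1; $id <= 99999; $id++) {
        FetchItem($id); // placeholder for the fetch-and-insert code
    }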

Another approach: use JavaScript ("window.location = ...") instead of a header() to redirect.
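A sketch of that trick: process one ID per request, then have the page send the browser on to the next ID with JavaScript, so each step is a fresh page load rather than an HTTP redirect chain. The code.php?id=... URL is taken from the question; FetchItem() is a placeholder:

    // code.php?id=00001 - process one article per page load, then move on
    $id = isset($_GET['id']) ? (int) $_GET['id'] : 1;
    FetchItem($id); // placeholder for the fetch-and-insert code

    if ($id < 99999) {
        echo '<script>window.location = "code.php?id=' . ($id + 1) . '";</script>';
    } else {
        echo 'Done.';
    }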

Robert
How do I check the last row and then run the code without redirecting to the same page? Doing a loop will give the time limit error.
Jonathan
Sorry, I wasn't very clear. I meant to do the loop and let it run until it times out, then just run it again and let it pick up from where it left off. If that's not practical (because it would take too many runs, or because this process needs to be completely automated), then the JavaScript I mentioned is a better way to go.
Robert