I wrote a Python script to process some data from CSV files. The script takes between 3 and 30 minutes to complete, depending on the size of the CSV.

Now I want to put a web interface on this, so I can upload the CSV data files from anywhere. I wrote a basic HTTP POST upload page and used Python's CGI module, but the script just times out.

The script outputs the HTTP headers at the start, and prints a bit of data as it iterates over each line of the CSV. For example, this print statement fires every 30 seconds or so.

# at the very top, with the imports
print "Content-type: text/html\n\n Processing ... <br />"

# the really long loop.
for currentRecord in csvRecords:
    count = count + 1
    print "On line " + str(count) + " <br />"

I assumed the browser would receive the headers and then wait, since it keeps receiving little bits of data. But what actually seems to happen is that it receives no data at all, and times out with an Error 504 when given a CSV with many lines.

Perhaps there's some caching happening somewhere? From the logs:

[Wed Jan 20 16:59:09 2010] [error] [client ::1] Script timed out before returning headers: datacruncher.py, referer: http://localhost/index.htm
[Wed Jan 20 17:04:09 2010] [warn] [client ::1] Timeout waiting for output from CGI script /Library/WebServer/CGI-Executables/datacruncher.py, referer: http://localhost/index.htm

What's the best way to resolve this? Or is it simply not appropriate to run such scripts from a browser?

Edit: This is a script for my own use. I normally intend to run it on my own computer, but I thought a web-based interface could come in handy while travelling, or, for example, from a phone. Also, there's really nothing to download: the script will most probably e-mail a report at the end.

+5  A: 

I've had this situation before, and I used cron jobs. The HTTP script just writes the job to be performed into a queue (a database table, or a file in a directory), and the cron job reads the queue and executes that job.
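A minimal sketch of this approach, assuming a directory-based queue and a cron-driven worker; every path and name below is illustrative, and process_csv stands in for the existing processing code:

# enqueue.py - called from the CGI upload handler
import os
import time

QUEUE_DIR = '/var/spool/csvjobs'          # hypothetical spool directory

def enqueue(csv_data):
    # The timestamp doubles as a crude job ID.
    job_id = str(int(time.time() * 1000))
    path = os.path.join(QUEUE_DIR, job_id + '.csv')
    tmp = path + '.tmp'
    f = open(tmp, 'wb')
    f.write(csv_data)
    f.close()
    os.rename(tmp, path)                  # atomic, so cron never sees a half-written file
    return job_id

# worker.py - run from cron, e.g. once a minute:
#   * * * * * /usr/bin/python /path/to/worker.py
import glob

def run_pending_jobs():
    for path in glob.glob(os.path.join(QUEUE_DIR, '*.csv')):
        process_csv(path)                 # process_csv is your existing code
        os.remove(path)                   # job done, drop it from the queue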

sharjeel
+1; the user could be notified by e-mail when their job finishes, with a link to download the result
Rubens Farias
You can also make the cron job write its progress to a file periodically, and use Ajax to read that value from the web server and display it in the user's browser.
sharjeel
If you need the job to run as a different user, using a cron job is certainly a good way. Otherwise, it's usually just as easy to start the processing thread directly from the CGI app.
Wim
Wim, could you please share a code snippet for that?
sharjeel
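One way to do what Wim describes is to have the CGI script hand the uploaded file off to a detached child process and return at once; the paths below are illustrative, not Wim's actual code. The subtle point is redirecting the child's output, because Apache keeps the request open for as long as anything holds the CGI script's stdout.

# In the CGI upload handler, after the upload has been saved to csv_path:
import os
import subprocess

def spawn_worker(csv_path):
    # Launch the long-running job as an independent process; send its
    # output to /dev/null so Apache does not wait for it to finish.
    devnull = open(os.devnull, 'wb')
    subprocess.Popen(['/usr/bin/python', '/path/to/datacruncher.py', csv_path],
                     stdout=devnull, stderr=devnull, close_fds=True)

# ...then print a short "job started" page and exit normally.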
+1  A: 

IMHO the best way would be to run an independent script that posts updates somewhere (a flat file, a database, etc.). I don't know how to fork an independent process from Python, so I can't give any code examples.

To show progress on the website, implement an Ajax request to a page that reads those status updates and, for example, shows a nice progress bar.

Add something like setTimeout("refreshProgressBar[...]) or a meta refresh for auto-refreshing.
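A minimal sketch of the display side, assuming the worker appends its progress lines to a flat status file (the file name is illustrative); the meta refresh makes the page poll itself without any JavaScript:

#!/usr/bin/env python
# progress.py - CGI page showing the worker's latest status line.
STATUS_FILE = '/tmp/csvjob.status'        # hypothetical; written by the worker

print "Content-type: text/html\n"
print '<html><head><meta http-equiv="refresh" content="5"></head><body>'
try:
    lines = open(STATUS_FILE).readlines()
    print lines[-1] if lines else 'Queued ...'
except IOError:
    print 'No job is running.'
print '</body></html>'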

dbemerlin
+3  A: 

You'll probably need to do a sys.stdout.flush(): the script isn't actually writing anything to the web server until a buffer's worth of output has accumulated, and that doesn't happen before the timeout.
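Applied to the loop from the question, that looks like this:

import sys

print "Content-type: text/html\n\n Processing ... <br />"
sys.stdout.flush()                        # push the headers out immediately

for currentRecord in csvRecords:
    count = count + 1
    print "On line " + str(count) + " <br />"
    sys.stdout.flush()                    # don't let the output sit in the buffer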

But the proper way to solve this is, as others have suggested, to do the processing in a separate thread/process and show the user an auto-refreshing status page, with a progress bar or some other fancy visual to keep them from getting bored.

Wim
Adding sys.stdout.flush() right after a print statement in the loop seems to resolve the issue.
Pranab
+8  A: 

I would separate the work like this:

  1. A web app URL that accepts the POSTed CSV file. The web app puts the CSV content into an offline queue, for instance a database table. The web app's response is a unique ID for the queued item (use an auto-incremented ID column, for instance). The client must store this ID for part 3.

  2. A stand-alone service app that polls the queue for work and does the processing. Upon completion, it stores the results in another database table, using the unique ID as the key.

  3. A web app URL for fetching processed results, http://server/getresults/uniqueid/. If the processing is finished (i.e. the unique ID is found in the results table), return the results. If not, the response should signal that, for instance with a custom HTTP header, an HTTP status code, or a response body of 'PENDING' or similar.
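A minimal sketch of those three parts, assuming sqlite3 as the queue and pre-created jobs and results tables; all table, column, and function names are illustrative:

# Illustrative sqlite3-backed queue; assumes the tables already exist:
#   jobs(id INTEGER PRIMARY KEY AUTOINCREMENT, csv TEXT, done INTEGER)
#   results(job_id INTEGER, report TEXT)
import sqlite3

DB = '/var/lib/csvjobs.db'                # hypothetical database path

def enqueue(csv_text):
    # Part 1: store the upload and hand back its auto-incremented ID.
    conn = sqlite3.connect(DB)
    cur = conn.execute("INSERT INTO jobs (csv, done) VALUES (?, 0)", (csv_text,))
    conn.commit()
    return cur.lastrowid

def work_once():
    # Part 2: the stand-alone service claims one pending job and runs it.
    conn = sqlite3.connect(DB)
    row = conn.execute("SELECT id, csv FROM jobs WHERE done = 0 LIMIT 1").fetchone()
    if row is None:
        return
    job_id, csv_text = row
    report = process_csv(csv_text)        # process_csv is your existing code
    conn.execute("INSERT INTO results (job_id, report) VALUES (?, ?)",
                 (job_id, report))
    conn.execute("UPDATE jobs SET done = 1 WHERE id = ?", (job_id,))
    conn.commit()

def get_results(job_id):
    # Part 3: None means 'PENDING'; anything else is the finished report.
    conn = sqlite3.connect(DB)
    row = conn.execute("SELECT report FROM results WHERE job_id = ?",
                       (job_id,)).fetchone()
    return row[0] if row else None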

codeape
+2  A: 

See Randal Schwartz's Watching long processes through CGI. The article uses Perl, but the technique does not depend on the language.

Sinan Ünür
+2  A: 

Very similar question here. I suggest spawning off the lengthy process and returning an Ajax-based progress bar to the user. This way the user has the luxury of the web interface and you have the luxury of no timeouts.

whatnick