views: 157

answers: 2

Ok guys, I am having tons of problems getting my working dev server code onto a working production server :). I have a task that goes through and requests urls, collecting and updating data. It takes 30 minutes to run.

I uploaded it to the production server, and when I go to the url for its corresponding .py script, appname.appspot.com/tasks/rrs, after 30 seconds I get google.appengine.runtime.DeadlineExceededError. Is there any way to get around this? Is this a 30 second deadline per page? The script works fine on the development server: I go to the url and the associated .py script runs until completion.

import time
import random
import string
import cPickle
from StringIO import StringIO
try:
    import json
except ImportError:
    import simplejson as json 
import urllib
import pprint
import datetime
import sys
sys.path.append("C:\Program Files (x86)\Google\google_appengine")
sys.path.append("C:\Program Files (x86)\Google\google_appengine\lib\yaml\lib")
sys.path.append("C:\Program Files (x86)\Google\google_appengine\lib\webob")
from google.appengine.api import users
from google.appengine.ext import webapp
from google.appengine.ext.webapp.util import run_wsgi_app
from google.appengine.ext import db
class SR(db.Model):
    name = db.StringProperty()
    title = db.StringProperty()
    url = db.StringProperty()

##request url and returns JSON_data
def overview(page):
    u = urllib.urlopen(page)
    bytes = StringIO(u.read())
    ##print bytes
    u.close()
    try:
        JSON_data = json.load(bytes)
        return JSON_data
    except ValueError, e:
        print e, " Couldn't get .json for %s" % page
        return None

##specific code to parse particular JSON data and append new SR objects to the given url list
def parse_json(JSON_data,lists):
    sr = SR()
    sr.name = None   ##data gathered from JSON_data (placeholder)
    sr.title = None  ##data gathered from JSON_data (placeholder)
    sr.url = None    ##data gathered from JSON_data (placeholder)
    lists.append(sr)
    return lists

## I want to be able to request, let's say, 500 pages without timing out
page = 'someurlpage.com'##starting url
url_list = []
for z in range(0,500):
    page = 'someurlpage.com/%s'%z
    JSON_data = overview(page)##get json data for a given url page
    url_list = parse_json(JSON_data,url_list)##parse the json data and append class objects to a given list
db.put(url_list)##finally add object to gae database
+3  A: 

Yes, App Engine imposes a 30-second deadline. One way around it might be to try/except the DeadlineExceededError and put the rest in a taskqueue.

But you can't make your requests run for a longer period.
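For example, a rough sketch of that try/except approach, reusing the overview() and parse_json() functions from the question (the /tasks/rrs handler, the start parameter, and the page count are assumptions; on older SDK versions the import is google.appengine.api.labs.taskqueue):

from google.appengine.api import taskqueue
from google.appengine.ext import webapp
from google.appengine.runtime import DeadlineExceededError

class RRSHandler(webapp.RequestHandler):
    def get(self):
        ## resume from wherever the previous task stopped
        start = int(self.request.get('start', 0))
        try:
            for z in range(start, 500):
                JSON_data = overview('someurlpage.com/%s' % z)
                db.put(parse_json(JSON_data, []))
                start = z + 1
        except DeadlineExceededError:
            ## out of time: hand the remaining pages to a new task and return
            taskqueue.add(url='/tasks/rrs', params={'start': start})

Each queued task gets its own deadline, so the work is split into chunks that can each finish in time.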

You can also try Bulkupdate

Example:

import bulkupdate

class Todo(db.Model):
    page = db.StringProperty()

class BulkPageParser(bulkupdate.BulkUpdater):
    def get_query(self):
        return Todo.all()

    def handle_entity(self, entity):
        JSON_data = overview(entity.page)
        db.put(parse_json(JSON_data, []))
        entity.delete()

# Put this in your view code:
for i in range(500):
    Todo(page='someurlpage.com/%s' % i).put()

job = BulkPageParser()
job.start()
WoLpH
wtf, so let's assume I have a large list of urls I want to request in a for loop, collect data from, and turn into db.Model instances in a list before putting. What would be the best method to do so?
Put the list of urls in a model and execute the queue with bulkupdate. At least... I think that would be the easiest solution ;)
WoLpH
yes, but I need to repeatedly request urls and update information in a for loop, not just upload the urls. I am looking for example code of someone requesting a lot of urls in a for loop that avoids the timeout error.
Can you edit your question and add some example code to it? Then I'll try to create an example for you.
WoLpH
thx, I have converted my existing code to pseudocode as best as I can; I think it will give a better idea of what I am trying to do
@user291071: I've added an example for you. Let's hope it works like that ;)
WoLpH
awesome, quick question though: what if I don't know all the urls ahead of time? For instance, let's say that as I start I am collecting and adding urls to be visited, and there is no way to get them ahead of time.
Just add them to the `Todo` model. Calling `Todo(page=...).put()` should be enough. After that you can just run the `BulkPageParser()` again.
WoLpH
A: 

Ok, so if I am dynamically adding links as I am parsing the pages, I would add to the Todo queue like so, I believe.

def handle_entity(self, entity):
    JSON_data = overview(entity.page)
    data_gathered, new_links = parse_json(JSON_data, [])  ##as before, returns a list of sr objects, plus now a list of new links/pages to visit
    db.put(data_gathered)
    for link in new_links:
        Todo(page=link).put()
    entity.delete()
@user291071: Correct :)
WoLpH
hey WoLpH, another simple follow up: I have the code implemented so far, but the batch is executing very quickly. How do I change the above code so that only 1 request/batch is executed at a time? I put a 1 second delay in handle_entity, and I want only one url request per second, so I need to limit my batch to 1 request. My current code seems to be doing nothing with my PUT_BATCH_SIZE options.