ansaurus

Question

How to call Twiter's Streaming/Filter Feed with urllib2/httplib?

Answer 1

+1 A:

urllib on App Engine is a thin wrapper around the urlfetch API. You're right about what's happening: Twitter's streaming API never terminates its response, so it times out, and urlfetch throws an exception.

If you use urlfetch directly, you can set the timeout (up to 10 seconds), and set allow_truncated to True so you can get the partial result. The Twitter streaming API really isn't a good match for App Engine, though, because App Engine requests are limited to 30 seconds of execution time, and urlfetch requests can't send back results progressively, or take more than 10 seconds. Using Twitter's 'standard' API would be a better option.

Nick Johnson 2010-03-30 09:37:10

Thanks - that's a great explanation for what's happening.Re: 'standard' API, I assume you mean something like the http://apiwiki.twitter.com/Twitter-REST-API-Method%3A-statuses-public_timeline - but there is no great analog to the filtering capability of the stream, is there?

Simon 2010-03-30 14:39:09

It depends on how you want to filter. The standard 'search' API does most of that. Another alternative would be to deploy a service elsewhere that uses the streaming API, and packages up batches to be sent to your app via HTTP. I've actually written a small tool to do just that, if you're interested.

Nick Johnson 2010-03-30 17:42:33

I would definitely be interested in such a tool if you're up for sharing!

Simon 2010-03-30 17:57:27

Here it is, such as it is: http://github.com/Arachnid/whofoundit/blob/master/feeder.py . It's not very polished, but the basic workflow is clear: It uses a producer/consumer pattern to fetch entries from Twitter and add them to a queue. Dropped connections are automatically retried. The consumer fetches batches from the queue and uploads them to any HTTP endpoint, with a batch every n results or m seconds (whichever comes first).

Nick Johnson 2010-03-30 18:57:58

ansaurus

tags:

views:

answers:

How to call Twiter's Streaming/Filter Feed with urllib2/httplib?

related questions