I've been scouring the Internet looking for a solution to my problem with Python. I'm trying to use a urllib2 connection to read a potentially endless stream of data from an HTTP server. It's part of some interactive communication, so it's important that I can get the data that's available, even if it's not a whole buffer full. There seems to be no way to have read/readline return the available data: they block forever waiting for the entire (endless) stream before returning.

Even if I set the underlying file descriptor to non-blocking using fcntl, the urllib2 file-object still blocks!! In general there seems to be no way to make Python file objects, upon read, return all available data if there is some and block otherwise.
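
Roughly what that attempt looks like (the URL is a placeholder, and it assumes the response object exposes the descriptor via fileno()):

import fcntl
import os
import urllib2

conn = urllib2.urlopen("http://example.com/stream")  # placeholder URL

# Put the underlying descriptor into non-blocking mode.
fd = conn.fileno()  # assumes the response exposes the socket's descriptor
flags = fcntl.fcntl(fd, fcntl.F_GETFL)
fcntl.fcntl(fd, fcntl.F_SETFL, flags | os.O_NONBLOCK)

# conn.read() / conn.readline() still block: the file-object layer keeps
# looping on the socket until it fills its buffer or sees EOF.
data = conn.readline()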

I've seen a few posts from people seeking help with this, but no solutions. What gives? Am I missing something? This seems like such a normal use case to ruin completely! I'm hoping to use urllib2's ability to detect configured proxies and handle chunked encoding, but I can't if it won't cooperate.

Edit: Upon request, here is some example code

Client:

connection = urllib2.urlopen(commandpath)
id = connection.readline()

Now suppose that the server is using chunked transfer encoding, and writes one chunk down the stream and the chunk contains the line, and then waits. The connection is still open, but the client has data waiting in a buffer.

I cannot get read or readline to return the data that I know is waiting in the buffer, because they try to read until the end of the connection. In this case the connection may never close, so they will wait either forever or until an inactivity timeout occurs, severing the connection. Once the connection is severed they will return, but that's obviously not the behavior I want.

+1  A: 

urllib2 operates at the HTTP level, which works with complete documents. I don't think there's a way around that without hacking into the urllib2 source code.

What you can do is use plain sockets (you'll have to talk HTTP yourself in this case) and call sock.recv(maxbytes), which reads only the data that is currently available.
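
A minimal sketch of that approach (host and path are placeholders; it assumes a direct connection with no proxy and no chunked decoding):

import socket

sock = socket.create_connection(("example.com", 80))  # placeholder host
sock.sendall("GET /stream HTTP/1.1\r\nHost: example.com\r\n\r\n")

# recv() returns whatever has arrived (up to 4096 bytes here) and only
# blocks when nothing is available yet. The first calls will also include
# the response headers, since we're talking HTTP by hand.
while True:
    data = sock.recv(4096)
    if not data:
        break  # server closed the connection
    print repr(data)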

Update: you may want to try calling conn.fp._sock.recv(maxbytes) instead of conn.read(maxbytes) on an urllib2 connection.
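
Something along these lines, though fp and _sock are private attributes and how deeply the real socket is nested varies between Python versions, so treat this as a sketch:

import urllib2

conn = urllib2.urlopen("http://example.com/stream")  # placeholder URL

# Reach past the file-object wrappers to the raw socket and ask it for
# whatever bytes are currently available (up to 4096 here).
data = conn.fp._sock.recv(4096)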

Wim
The point of using the urllib2 connection is that urllib2 already supports proxies configured in the environment and chunked encoding, things I'm not too excited about implementing myself. I feel like if I could just kick something in the pants at the lowest level, everything would work...
jdizzle
Right, I wouldn't want to start implementing all of those myself either. Did the `conn.fp._sock.recv(maxbytes)` trick do you any good?
Wim
I actually did end up using conn.fp._sock.fp._sock or something crazy like that. I had to implement a chunked decoder, but that's not actually that difficult. It was the proxy handling that really scared me, and this way I didn't have to deal with it.
jdizzle
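
For reference, a minimal sketch of the kind of chunked decoder mentioned in that last comment; it ignores trailers and chunk extensions, and buf is whatever raw bytes recv() has delivered so far:

def decode_chunks(buf):
    # Decode as many complete chunks as possible from buf.
    # Returns (payload, leftover, done): the decoded bytes, any undecoded
    # tail to hold on to for the next call, and whether the terminating
    # zero-length chunk has been seen.
    payload = []
    while True:
        nl = buf.find("\r\n")
        if nl == -1:
            break  # size line not complete yet
        size = int(buf[:nl].split(";")[0], 16)  # hex size; drop extensions
        if size == 0:
            return "".join(payload), "", True  # final chunk; trailers ignored
        end = nl + 2 + size + 2  # size line CRLF + body + trailing CRLF
        if len(buf) < end:
            break  # chunk body not complete yet
        payload.append(buf[nl + 2:nl + 2 + size])
        buf = buf[end:]
    return "".join(payload), buf, False

The caller just keeps appending each new recv() result to the leftover and calling it again.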