ansaurus

Question

dose python urllib2 will automaticly uncompress gzip data from fetch webpage

Answer 1

+3 A:

This checks if the content is gzipped and decompresses it:

from StringIO import StringIO
import gzip

response = urllib2.urlopen(request)
if response.info().get('Content-Encoding') == 'gzip':
    buf = StringIO( response.read())
    f = gzip.GzipFile(fileobj=buf)
    data = f.read()

ars 2010-10-16 01:21:48

Answer 2

A:

If you are talking about a simple .gz file, no, urllib2 will not decode it, you will get the unchanged .gz file as output.

If you are talking about automatic HTTP-level compression using Content-Encoding: gzip or deflate, then that has to be deliberately requested by the client using an Accept-Encoding header.

urllib2 doesn't set this header, so the response it gets back will not be compressed. You can safely fetch the resource without having to worry about compression (though since compression isn't supported the request may take longer).

bobince 2010-10-16 01:28:21

ansaurus

tags:

views:

answers:

dose python urllib2 will automaticly uncompress gzip data from fetch webpage

related questions