ansaurus

Question

Is there an easy way to request a URL in Python and NOT follow redirects?

Answer 1

+13 A:

Dive Into Python has a good chapter on handling redirects with urllib2. Another solution is httplib, which returns the redirect response as-is instead of following it:

>>> import httplib
>>> conn = httplib.HTTPConnection("www.bogosoft.com")
>>> conn.request("GET", "")
>>> r1 = conn.getresponse()
>>> print r1.status, r1.reason
301 Moved Permanently
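
For readers on Python 3, where httplib was renamed http.client, the same idea can be sketched roughly as follows. The function name fetch_no_redirect and its parameters are illustrative assumptions, not part of the session above; the sketch also reads the Location header so the caller can see where the redirect points.

```python
import http.client  # Python 3 name for the httplib module


def fetch_no_redirect(host, path="/", port=80, timeout=10):
    """Send one GET request and return (status, reason, location).

    http.client never follows redirects on its own, so a 3xx answer
    comes back to the caller unchanged. `location` is None when the
    server sent no Location header. (Hypothetical helper for illustration.)
    """
    conn = http.client.HTTPConnection(host, port, timeout=timeout)
    try:
        conn.request("GET", path)
        response = conn.getresponse()
        response.read()  # drain the body before closing the connection
        return response.status, response.reason, response.getheader("Location")
    finally:
        conn.close()
```

Called as fetch_no_redirect("www.bogosoft.com"), it would hand back the status code, the reason phrase, and the redirect target, if the server sends one.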

olt 2008-09-21 08:33:12

Answer 2

+3 A:

I second olt's pointer to Dive Into Python. Here's an implementation using urllib2 redirect handlers. It is probably more work than it should be.

import sys
import urllib2

class RedirectHandler(urllib2.HTTPRedirectHandler):
    # Raise right away instead of delegating to the base class:
    # the base implementation would fetch the redirect target first.
    def http_error_301(self, req, fp, code, msg, headers):
        raise Exception("Permanent Redirect: %s" % code)

    def http_error_302(self, req, fp, code, msg, headers):
        raise Exception("Temporary Redirect: %s" % code)

def main(script_name, url):
    # Passing the subclass replaces the default redirect handler.
    opener = urllib2.build_opener(RedirectHandler)
    urllib2.install_opener(opener)
    print urllib2.urlopen(url).read()

if __name__ == "__main__":
    main(*sys.argv)
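
On Python 3, urllib2 lives in urllib.request, and a similar effect can be had with a slightly different technique: rather than raising from http_error_301 and http_error_302, the sketch below overrides redirect_request to return None, which leaves the 3xx response to the opener's default error handling. The names NoRedirectHandler and open_no_redirect are hypothetical and chosen only for this illustration.

```python
import urllib.error
import urllib.request  # Python 3 home of the urllib2 machinery


class NoRedirectHandler(urllib.request.HTTPRedirectHandler):
    """Redirect handler that declines to build a follow-up request."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        # Returning None means "not handled here", so the opener falls
        # back to its default error handler, which raises HTTPError.
        return None


def open_no_redirect(url, timeout=10):
    """Return (status, location) for `url` without following redirects."""
    opener = urllib.request.build_opener(NoRedirectHandler)
    try:
        with opener.open(url, timeout=timeout) as response:
            return response.status, None
    except urllib.error.HTTPError as err:
        # Reached for unfollowed 3xx answers as well as 4xx/5xx errors.
        status, location = err.code, err.headers.get("Location")
        err.close()
        return status, location
```

Because the opener is built locally and never installed with install_opener, other callers of urlopen in the same process keep the default redirect behaviour.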

Aaron Maenpaa 2008-09-21 11:31:20

Answer 3

+5 A:

I suppose this would help:

from httplib2 import Http

def get_html(uri, num_redirections=0):  # set to 0 to not follow redirects
    conn = Http()
    return conn.request(uri, redirections=num_redirections)

Ashish 2008-09-21 13:51:30
