ansaurus

Question

How to download a file over http with authorization in python 3.0, working around bugs?

Answer 1

+1 A:

One of the biggest changes in python 3.0 has been string handling. Because of reported by your exception, I would first check by using a byte string:

import urllib.request;
url = b"http://username:password@server/file";
urllib.request.urlretrieve(url, "temp.dat");

However, in this case, string conversion is not the cause of the issue; please see reply from bishanty for a good solution.
As a matter of fact, I think encoding username and password in the url was not a documented method even in previous versions (see for instance basic authentication introduction from fuzzyman).

Roberto Liffredo 2008-12-27 22:03:41

Did you test this? This fails with "unknown url type b'http'", the rest of urllib.request doesn't appear to be ready to handle byte-strings.

Lasse V. Karlsen 2008-12-27 23:09:48

Actually, it was intended to be more a generic suggestion over the generic p3k unicode/byte string "issue". I have now changed it, hope it will be clearer.

Roberto Liffredo 2008-12-28 08:20:32

Answer 2

+11 A:

Direct from the Py3k docs: http://docs.python.org/dev/py3k/library/urllib.request.html#examples

import urllib.request
# Create an OpenerDirector with support for Basic HTTP Authentication...
auth_handler = urllib.request.HTTPBasicAuthHandler()
auth_handler.add_password(realm='PDQ Application',
                          uri='https://mahler:8092/site-updates.py',
                          user='klem',
                          passwd='kadidd!ehopper')
opener = urllib.request.build_opener(auth_handler)
# ...and install it globally so it can be used with urlopen.
urllib.request.install_opener(opener)
urllib.request.urlopen('http://www.example.com/login.html')

jb 2008-12-27 22:04:53

did you mean to post that password? If not, then I suggest deleting the answer and posting a new one with dummy data there. Thanks for the answer though, this looks promising.

Lasse V. Karlsen 2008-12-27 23:11:23

Direct from the Python docs :P

jb 2008-12-27 23:37:16

Klem is probably pretty pissed if that's his real password though :)

jb 2008-12-27 23:38:06

+1: Direct from the docs.

S.Lott 2008-12-28 02:19:14

Answer 3

A:

My advice would be to maintain your 2.* branch as your production branch until you can get the 3.0 stuff sorted.

I am going to wait a while before moving over to Python 3.0. There seems a lot of people in a rush, but I just want everything sorted out, and a decent selection of third-party libraries. This may take a year, it may take 18 months, but the pressure to "upgrade" is really low for me.

Ali A 2008-12-28 01:21:20

Answer 4

A:

Have any of you made this method work?

I had no problem to retrieve zip files from free access website, but had problems with retrieving zip files from password protected sites. Retrieved files are corrupted.

2009-01-13 10:12:18

ansaurus

tags:

views:

answers:

How to download a file over http with authorization in python 3.0, working around bugs?

related questions