ansaurus

Question

Python urllib3 and how to handle cookie support?

Answer 1

+1 A:

You need to set 'Cookie' not 'Set-Cookie', 'Set-Cookie' set by web server.

And Cookies are one of headers, so its nothing wrong with doing that way.

S.Mark 2010-03-11 06:07:54

Yeah that was a typo, but my question still stands, I'd like a better way (i.e. using built-ins).

bigredbob 2010-03-11 06:09:34

Answer 2

+1 A:

You're correct, there's no immediately better way to do this right now. I would be more than happy to accept a patch if you have a congruent improvement.

One thing to keep in mind, urllib3's HTTPConnectionPool is intended to be a "pool of connections" to a specific host, as opposed to a stateful client. In that context, it makes sense to keep the tracking of cookies outside of the actual pool.

shazow (the author of urllib3)

shazow 2010-03-11 07:07:21

If I knew how to patch this I would love to, but I'm not that good. It's good to know I'm not doing it wrong at least.

bigredbob 2010-03-11 08:03:18

Answer 3

A:

Is there not a problem with multiple cookies?

Some servers return multiple Set-Cookie headers, but urllib3 stores the headers in a dict and a dict does not allow multiple entries with the same key.

httplib2 has a similar problem.

Or maybe not: it turns out that the readheaders method of the HTTPMessage class in the httplib package -- which both urllib3 and httplib2 use -- has the following comment:

If multiple header fields with the same name occur, they are combined according to the rules in RFC 2616 sec 4.2:

    Appending each subsequent field-value to the first, each separated
    by a comma. The order in which header fields with the same field-name
    are received is significant to the interpretation of the combined
    field value.

So no headers are lost.

There is, however, a problem if there are commas within a header value. I have not yet figured out what is going on here, but from skimming RFC 2616 ("Hypertext Transfer Protocol -- HTTP/1.1") and RFC 2965 ("HTTP State Management Mechanism") I get the impression that any commas within a header value are supposed to be quoted.

Rod Montgomery 2010-05-18 02:04:48

ansaurus

tags:

views:

answers:

Python urllib3 and how to handle cookie support?

related questions