ansaurus

Question

Pythonic URL Parsing

Answer 1

+6 A:

Since your requirements on what parts you want are different from what urlparse gives you, that's as good as it's going to get. You could, however, replace this:

partCount = len(pathParts) - 1

path = "/"
for i in range(2, partCount):
    path += pathParts[i] + "/"

With this:

path = '/'.join(pathParts[2:-1])

Paolo Bergantino 2009-05-19 04:44:12

Answer 2

+2 A:

I'd be inclined to start out with urlparse. Also, you can use rsplit, and the maxsplit parameter of split and rsplit to simplify things a bit:

_, netloc, path, _, q, _ = urlparse(url)
_, base, path = path.split('/', 2) # 1st component will always be empty
path, file = path.rsplit('/', 1)
if q: file += '?' + q

Laurence Gonsalves 2009-05-19 06:55:31

ansaurus

tags:

views:

answers:

Pythonic URL Parsing

related questions