ansaurus

Question

Answer 1

+2 A:

You don't provide an error message so I can't be sure this is the only error. But, xml.minidom.parse does not take a string. From the docstring for parse:

Parse a file into a DOM by filename or file object.

You should try:

response = urllib2.urlopen(askfor)
doc = parse(response)

since response will behave like a file object. Or you could use the parseString method in minidom instead (and then pass the_page as the argument).

EDIT: to extract the URL, you'll need to do:

url_nodes = doc.getElementsByTagName('url')
url = url_nodes[0]
print url.childNodes[0].data

The result of getElementsByTagName is a list of all nodes matching (just one in this case). url is an Element as you noticed, which contains a child Text node, which contains the data you need.

ars 2010-07-16 02:02:05

That does parse the_page but i cant seem to get an individual tags. using doc.getElementsByTagName("url") returns: [<DOM Element: url at 0x13cbf80>] instead of the data in between.

Ali 2010-07-16 02:28:59

Updated my answer, see above.

ars 2010-07-16 02:49:08

Answer 2

+1 A:

from xml.dom.minidom import parseString
doc = parseString(the_page)

See the documentation for xml.dom.minidom.

Jed Smith 2010-07-16 02:03:14

That does parse the_page but i cant seem to get an individual tags.using doc..getElementsByTagName("url") returns: [<DOM Element: url at 0x13cbf80>] instead of the data.

Ali 2010-07-16 02:28:34

Continue reading the documentation. That object you are getting back has attributes from which you get both (a) get its children and (b) get the data.

Jed Smith 2010-07-16 15:22:55

ansaurus

tags:

views:

answers:

Parsing XML response of bit.ly

related questions