ansaurus

Question

How to get all the info in XML into dictionary with Python

Answer 1

+3 A:

The following links may provide what you're after:

Recipe 573463: Converting XML to Dictionary and back (Python)

Recipe 410469: XML as Dictionary (Python)

Converting XML Nodes into a Dictionary in Python using SAX

Editing XML as a dictionary in python?

Leniel Macaferi 2010-07-10 01:50:52

Answer 2

+3 A:

I usually parse XML using the ElementTree module in the standard library. It does not give you a dictionary; instead you get a much more useful DOM-like tree structure, in which you can iterate over each element to reach its children.

from xml.etree import ElementTree as ET

xml = ET.parse("<path-to-xml-file>")
root_element = xml.getroot()

for child in root_element:
   ...
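
As a minimal sketch of that iteration, the snippet below parses a small in-memory document with ET.fromstring instead of reading a file. The sample XML and the variable names are assumptions made up for illustration; they are not the XML from the question.

```python
from xml.etree import ElementTree as ET

# Hypothetical sample document, not the XML from the question
SAMPLE = "<A><B><C>blah</C><C>blah</C></B><B><C>blah</C></B></A>"

root_element = ET.fromstring(SAMPLE)

# Iterating over an element yields its direct children in document order
child_tags = [child.tag for child in root_element]

# findall() with a simple path reaches deeper levels without nested loops
c_texts = [c.text for c in root_element.findall("B/C")]

print(child_tags)
print(c_texts)
```

Each child is itself an element, so the same loop can be applied to it to walk further down the tree.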

If there is a specific need to parse it into a dictionary, instead of getting the information you need from a DOM tree, a recursive function to build one from the root node would be something like:

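def xml_dict(node, path="", dic=None):
    # Flatten an element tree into a dict keyed by dotted, numbered paths
    if dic is None:
        dic = {}
    name_prefix = path + ("." if path else "") + node.tag
    # Collect the indexes already used by earlier siblings with the same tag
    numbers = set()
    for similar_name in dic.keys():
        if similar_name.startswith(name_prefix):
            suffix = similar_name[len(name_prefix):].split(".")[0]
            # Skip keys of longer tags that merely share this prefix
            if suffix.isdigit():
                numbers.add(int(suffix))
    if not numbers:
        numbers.add(0)
    index = max(numbers) + 1
    name = name_prefix + str(index)
    # The node's own text, then the children's tails joined by "<...>"
    dic[name] = (node.text or "") + "<...>".join(childnode.tail
                                         if childnode.tail is not None else
                                         "" for childnode in node)
    for childnode in node:
        xml_dict(childnode, name, dic)
    return dic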

For the XML you list above, this yields the following dictionary:

{'A1': '\n \n <...>\n',
 'A1.B1': '\n  \n  <...>\n ',
 'A1.B1.C1': '"blah"',
 'A1.B1.C2': '"blah"',
 'A1.B2': '\n  \n  <...>\n ',
 'A1.B2.C1': '"blah"',
 'A1.B2.C2': '"blah"'}

(I find the DOM form more useful.)
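
If a nested dictionary is preferred over flat dotted keys, one alternative is to group children by tag into lists. This is only a sketch under assumptions and is not the function above: the name element_to_dict, the "@attributes" and "#text" keys, and the sample XML are all made up for illustration, and text that follows a child element (its tail) is dropped.

```python
from xml.etree import ElementTree as ET


def element_to_dict(node):
    """Return a nested structure for node, grouping child elements by tag."""
    text = (node.text or "").strip()
    children = list(node)
    # A leaf without attributes collapses to its text
    if not children and not node.attrib:
        return text
    result = {}
    if node.attrib:
        result["@attributes"] = dict(node.attrib)  # hypothetical key name
    if text:
        result["#text"] = text  # hypothetical key name
    for child in children:
        # Repeated tags end up in the same list, in document order
        result.setdefault(child.tag, []).append(element_to_dict(child))
    return result


# Hypothetical sample document, not the XML from the question
SAMPLE = '<A><B><C>"blah"</C><C>"blah"</C></B><B><C>"blah"</C></B></A>'
root = ET.fromstring(SAMPLE)
nested = {root.tag: element_to_dict(root)}
print(nested)
```

Every child tag maps to a list even when it occurs once, so lookups such as nested["A"]["B"][0]["C"][1] have the same shape regardless of how many siblings exist.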

jsbueno 2010-07-10 02:12:33

Answer 3

A:

I usually use the lxml.objectify library for quick XML parsing.

With your XML string, you can do:

from lxml import objectify
root = objectify.fromstring(xml_string)

And then get individual elements using a dictionary interface:

value = root["A"][0]["B"][0]["C"][0]

Or, if you prefer:

value = root.A[0].B[0].C[0]

Lior 2010-07-10 07:36:49

Answer 4

A:

Check out the answers to http://stackoverflow.com/questions/3106480/really-simple-way-to-deal-with-xml-in-python; you will probably find one that directly suits your needs.

Nas Banov 2010-07-10 07:55:13
