ansaurus

Question

Can ElementTree be told to preserve the order of attributes?

Answer 1

+2 A:

Nope. ElementTree uses a dictionary to store attribute values, so it's inherently unordered.
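For readers on a newer interpreter, here is a minimal sketch of that dictionary storage. It assumes Python 3.8 or later, where plain dicts keep insertion order and, as far as I know, the serializer no longer sorts attributes. The rest of this answer describes the 2010-era library, which behaves differently.

```python
# Sketch only: assumes Python 3.8+, not the 2010-era ElementTree discussed here.
import xml.etree.ElementTree as ET

root = ET.fromstring('<a b="c" d="e" f="g" j="k" h="i"/>')

# Attributes live in an ordinary dict on the element.
print(type(root.attrib))

# On 3.7+ a dict remembers insertion order, so this lists document order.
print(list(root.attrib))

# On 3.8+ the serializer writes attributes in that same dict order.
serialized = ET.tostring(root, encoding="unicode")
print(serialized)
```

On older interpreters the same script may still print the attributes sorted, which is the behaviour the rest of this answer works around.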

Even DOM doesn't guarantee you attribute ordering, and DOM exposes a lot more detail of the XML infoset than ElementTree does. (There are some DOMs that do offer it as a feature, but it's not standard.)

Can it be fixed? Maybe. Here's a stab at it that swaps the dictionary built during parsing for an ordered one (I used odict, as linked from PEP 372).

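# Python 2 code; relies on private ElementTree internals.
import StringIO
from xml.etree import ElementTree
import odict

class OrderedXMLTreeBuilder(ElementTree.XMLTreeBuilder):
    def _start_list(self, tag, attrib_in):
        # Same as the stock handler, but collects attributes in an odict.
        fixname = self._fixname
        tag = fixname(tag)
        attrib = odict.odict()
        if attrib_in:
            # expat passes attributes as a flat [name, value, name, value, ...] list
            for i in range(0, len(attrib_in), 2):
                attrib[fixname(attrib_in[i])] = self._fixtext(attrib_in[i+1])
        return self._target.start(tag, attrib)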

>>> xmlf= StringIO.StringIO('<a b="c" d="e" f="g" j="k" h="i"/>')

>>> tree= ElementTree.ElementTree()
>>> root= tree.parse(xmlf, OrderedXMLTreeBuilder())
>>> root.attrib
odict.odict([('b', 'c'), ('d', 'e'), ('f', 'g'), ('j', 'k'), ('h', 'i')])

Looks potentially promising.

>>> s= StringIO.StringIO()
>>> tree.write(s)
>>> s.getvalue()
'<a b="c" d="e" f="g" h="i" j="k" />'

Bah, the serialiser outputs them in canonical order.

This looks like the line to blame, in ElementTree._write:

            items.sort() # lexical order

Subclassing or monkey-patching that is going to be annoying as it's right in the middle of a big method.

Unless you did something nasty like subclass odict and hack items to return a special subclass of list that ignores calls to sort(). Nah, probably that's even worse and I should go to bed before I come up with anything more horrible than that.
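For the curious, the "nasty" idea can be sketched on its own, without ElementTree. The class names are hypothetical, `collections.OrderedDict` stands in for odict, and whether the old `_write` method would actually accept such an object is untested here.

```python
# Sketch of the "ignore sort()" trick; class names are hypothetical.
from collections import OrderedDict

class UnsortableList(list):
    """A list whose sort() is a deliberate no-op."""
    def sort(self, *args, **kwargs):
        pass  # keep whatever order the items already have

class StubbornOrderedDict(OrderedDict):
    """An ordered mapping whose items() hands back an UnsortableList."""
    def items(self):
        return UnsortableList((key, self[key]) for key in self)

attrib = StubbornOrderedDict([("b", "c"), ("j", "k"), ("h", "i")])
items = attrib.items()
items.sort()  # the call the old serializer made; here it changes nothing
print(items)
```

Note that this only defeats an in-place `items.sort()`. Code that calls `sorted(items)` builds a fresh, ordinary list and sorts it regardless.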

bobince 2010-04-30 01:16:37

Answer 2

+1 A:

Wrong question. Should be: "Where do I find a diff gadget that works sensibly with XML files?"

Answer: Google is your friend. First result for search on "xml diff" => this. There are a few more possibles.
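This is not the tool linked above. If a dedicated XML diff tool is not at hand, one possible standard-library approach is to canonicalize both documents before comparing them. It is a minimal sketch that assumes Python 3.8 or later, where `xml.etree.ElementTree.canonicalize()` is available.

```python
# Sketch only: assumes Python 3.8+, where ElementTree.canonicalize() exists.
import xml.etree.ElementTree as ET

left = '<a b="c" d="e" f="g" j="k" h="i"/>'
right = '<a h="i" j="k" f="g" d="e" b="c"/>'

# Canonical XML writes attributes in a fixed order, so the attribute
# order of the input no longer affects the text being compared.
same = ET.canonicalize(left) == ET.canonicalize(right)
print(same)
```

The canonical strings can also be fed to an ordinary text diff, which then reports only real differences.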

John Machin 2010-04-30 02:01:24

Always happy to see an alternate solution. Thanks.

dmckee 2010-04-30 02:14:39

Answer 3

+1 A:

From section 3.1 of the XML recommendation:

Note that the order of attribute specifications in a start-tag or empty-element tag is not significant.

Any system that relies on the order of attributes in an XML element is going to break.
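In practice that means consumers should compare attributes as a mapping, not as a sequence. Here is a minimal sketch; `same_element` is a hypothetical helper, not part of ElementTree, and it ignores tail text for brevity.

```python
# Sketch only; same_element is a hypothetical helper, not part of ElementTree.
import xml.etree.ElementTree as ET

def same_element(x, y):
    """Compare two elements, treating attribute order as insignificant."""
    if x.tag != y.tag or x.attrib != y.attrib:  # dict equality ignores order
        return False
    if (x.text or "").strip() != (y.text or "").strip():
        return False
    if len(x) != len(y):
        return False
    # Child order IS significant in XML, so children are compared pairwise.
    return all(same_element(cx, cy) for cx, cy in zip(x, y))

one = ET.fromstring('<a b="c" d="e"><x p="1" q="2"/></a>')
two = ET.fromstring('<a d="e" b="c"><x q="2" p="1"/></a>')
print(same_element(one, two))
```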

Robert Rossney 2010-05-01 08:09:20
