ansaurus

Question

How to prevent XMLSerializer.serializeToString() from re-ordering attributes?

Answer 1

A:

If this matters, the bug isn't in re-ordering the attributes, but in it mattering. Let it order them however it wants, and fix the bug.

Edit:

Wait a minute. Why is this being put into a repository? If it's output rather than source, then its value in a repository is as a non-editted resource rather than as source, and its stored as a convenience. Otherwise, why are you letting a computer process change it?

This is analogous to putting a binary into a repository, with the same reasons why that's often bad, and the same reasons for making exceptions.

Jon Hanna 2010-08-23 22:11:34

SCMs are typically line-oriented, and unfortunately, AFAIK there aren't any SCM systems that "understand" XML to the point of being able to ignore things that don't matter in XML. For that matter, defining whether two XML files are "the same" can depend on the semantics of the data.

Jim Garrison 2010-08-24 02:30:46

@Jim, well then XML is not the output to use.

Jon Hanna 2010-08-24 07:31:38

This isn't a bug, merely an issue of convenience: the re-ordering of attributes obscures other, meaningful changes that are being applied to the data. Whether or not XML is the correct output, or whether it belongs in a repository or not are outside the scope of the question.

David Eyk 2010-08-24 13:52:07

Answer 2

A:

I've taken the direction @Tomalak suggested and am "fixing" the order server-side. Thankfully, the original order was alphabetical, and the order produced by XMLSerializer is reverse alphabetical. My server-side XML tool, lxml, maintains document attribute order, so reversing the order is simple:

xmls = json.loads(self.data['xmls'])
out = []
for xml in xmls:
    # DOM adds an XHTML namespace... silly DOM.
    xml = xml.replace('xmlns="http://www.w3.org/1999/xhtml"', '')
    tree = ET.fromstring(xml)
    for el in tree.xpath('//*'):
        attrs = dict(el.attrib)
        keys = el.attrib.keys()  # el.attrib preserves attribute order
        keys.reverse()  # But the browser DOM has reversed that order.
        # Put them back in the order we want.
        el.attrib.clear()
        for k in keys:
            el.attrib[k] = attrs[k]
    out.append(ET.tostring(tree, encoding=unicode))

My line-based diffs are useful again!

David Eyk 2010-08-26 13:47:01

ansaurus

tags:

views:

answers:

How to prevent XMLSerializer.serializeToString() from re-ordering attributes?

related questions