ansaurus

Question

Order of XML attributes after DOM processing

Answer 1

+1 A:

You really shouldn't need to keep any sort of order. As far as I know, no schema takes attribute order into account when validating an XML document either. It sounds like whatever is processing XML on the other end isn't using a proper DOM to parse the results.

I suppose one option would be to manually build up the document using string building, but I strongly recommend against that.

Soviut 2009-04-07 15:45:25

Answer 2

+7 A:

Look at section 3.1 of the XML recommendation. It says, "Note that the order of attribute specifications in a start-tag or empty-element tag is not significant."

If a piece of software requires attributes on an XML element to appear in a specific order, that software is not processing XML, it's processing text that looks superficially like XML. It needs to be fixed.

If it can't be fixed, and you have to produce files that conform to its requirements, you can't reliably use standard XML tools to produce those files. For instance, you might try (as you suggest) to use XSLT to produce attributes in a defined order, e.g.:

<test>
   <xsl:attribute name="foo"/>
   <xsl:attribute name="bar"/>
   <xsl:attribute name="baz"/>
</test>

only to find that the XSLT processor emits this:

<test bar="" baz="" foo=""/>

because the DOM that the processor is using orders attributes alphabetically by tag name. (That's common but not universal behavior among XML DOMs.)

But I want to emphasize something. If a piece of software violates the XML recommendation in one respect, it probably violates it in other respects. If it breaks when you feed it attributes in the wrong order, it probably also breaks if you delimit attributes with single quotes, or if the attribute values contain character entities, or any of a dozen other things that the XML recommendation says that an XML document can do that the author of this software probably didn't think about.

Robert Rossney 2009-04-07 18:07:16

Answer 3

+3 A:

It's not possible to over-emphasize what Robert Rossney just said, but I'll try. ;-)

The benefit of International Standards is that, when everybody follows them, life is good. All our software gets along peacefully.

XML has to be one of the most important standards we have. It's the basis of "old web" stuff like SOAP, and still 'web 2.0' stuff like RSS and Atom. It's because of clear standards that XML is able to interoperate between different platforms.

If we give up on XML, little by little, we'll get into a situation where a producer of XML will not be able to assume that a consumer of XML will be able to consumer their content. This would have a disasterous affect on the industry.

We should push back very forcefully, on anyone who writes code that does not process XML according to the standard. I understand that, in these economic times, there is a reluctance to offend customers and business partners by saying "no". But in this case, I think it's worth it. We would be in much worse financial shape if we had to hand-craft XML for each business partner.

So, don't "enable" companies who do not understand XML. Send them the standard, with the appropriate lines highlighted. They need to stop thinking that XML is just text with angle brackets in it. It simply does not behave like text with angle brackets in it.

It's not like there's an excuse for this. Even the smallest embedded devices can have full-featured XML parser implementations in them. I have not yet heard a good reason for not being able to parse standard XML, even if one can't afford a fully-featured DOM implementation.

John Saunders 2009-04-07 18:27:40

Answer 4

A:

Robert Rossney said it well: if you're relying on the ordering of attributes, you're not really processing XML, but rather, something that looks like XML.

I can think of at least two reasons why you might care about attribute ordering. There may be others, but at least for these two I can suggest alternatives:

1) You're using multiple instances of attributes with the same name:

<foo myAttribute="a" myAttribute="b" myAttribute="c"/>

This is just plain invalid XML; a DOM processor will probably drop all but one of these values -- if it processes the document at all. Instead of this, you want to use child elements:

2) You're assuming that some sort of distinction applies to the attribute(s) that come first. Make this explicit, either through other attributes or through child elements. For example:

<foo attr1="a" attr2="b" attr3="c" theMostImportantAttribute="attr1" />

Dan Breslau 2009-04-07 18:32:21

Answer 5

A:

Sorry to say, but the answer is more subtle than "No you can't" or "Why do you need to do this in the first place ?".

The short answer is "DOM will not allow you to do that, but SAX will".

This is because DOM does not care about the attribute order, since it's meaningless as far as the standard is concerned, and by the time the XSL gets hold of the input stream, the info is already lost. Most XSL engine will actually gracefully preserve the input stream attribute order (e.g. Xalan-C (except in one case) or Xalan-J (always)). Especially if you use .

Cases where the attribute order is not kept, best of my knowledge, are. - If the input stream is a DOM - Xalan-C: if you insert your result-tree tags literally (e.g.

Here is one example with SAX, for the record (inhibiting DTD nagging as well).

    SAXParserFactory spf = SAXParserFactoryImpl.newInstance();
    spf.setNamespaceAware(true);
    spf.setValidating(false);
    spf.setFeature("http://xml.org/sax/features/validation", false);
    spf.setFeature("http://apache.org/xml/features/nonvalidating/load-dtd-grammar", false);
    spf.setFeature("http://apache.org/xml/features/nonvalidating/load-external-dtd", false);
    SAXParser sp = spf.newSAXParser() ;
    Source src = new SAXSource ( sp.getXMLReader(), new InputSource( input.getAbsolutePath() ) ) ;
    String resultFileName = input.getAbsolutePath().replaceAll(".xml$", ".cooked.xml" ) ;
    Result result = new StreamResult( new File (resultFileName) ) ;
    TransformerFactory tf = TransformerFactory.newInstance();
    Source xsltSource = new StreamSource( new File ( COOKER_XSL ) );
    xsl = tf.newTransformer( xsltSource ) ;
    xsl.setParameter( "srcDocumentName", input.getName() ) ;
    xsl.setParameter( "srcDocumentPath", input.getAbsolutePath() ) ;

    xsl.transform(src, result );

I'd also like to point out, at the intention of many naysayers that there are cases where attribute order does matter.
Regression testing is an obvious case. Whoever has been called to optimise not-so-well written XSL knows that you usually want to make sure that "new" result trees are similar or identical to the "old" ones. And when the result tree are around one million lines, XML diff tools prove too unwieldy... In these cases, preserving attribute order is of great help.

Hope this helps ;-)

Alain Pannetier 2010-09-16 15:31:51

Answer 6

+1 A:

XML Canonicalisation results in a consistent attribute ordering, primarily to allow one to check a signature over some or all of the XML, though there are other potential uses. This may suit your purposes.

Jon Hanna 2010-09-16 15:36:26

Any way to flag this as a necro?

Qwerky 2010-09-16 15:41:50

@Qwerky Any reason why that would matter? I see nothing in the FAQ to suggest that hitting old questions is frowned upon, do see something in the badges to suggest it is acceptable, and nothing on this page that offers C18N as a potential answer.

Jon Hanna 2010-09-16 21:35:29

@Jon It makes a bit of a mockery of the sort by recent activity feature, which I use to track active threads.

Qwerky 2010-09-17 09:07:43

Though the problem no longer applies to my current situation I do thank your answer. In a near future it could be useful

Fernando Miguélez 2010-09-19 04:13:10

ansaurus

tags:

views:

answers:

Order of XML attributes after DOM processing

related questions