ansaurus

Question

Which XML style is better when handling it with Python's ElementTree?

Answer 1

+2 A:

You're both right, but I would pick #1 where possible, except for the text content:

#1 is much more succinct and human-readable, thus less error-prone.
Complete extensibility: YAGNI. YAGNI is not always true but if you're confident that you won't need extensibility, don't sacrifice other benefits for the sake of extensibility.
#1 is still pretty extensible. You can always add more attributes or child elements. The only way it isn't extensible is if you later discover you need multiple values for name, or info (or the text content value)... since you can't have multiple attributes with the same name on an element (nor multiple text content nodes without something in between). However you can still extend those by various techniques, e.g. space-separated values in an attribute, or adding child elements as an alternative to an attribute.
I would make the "value" into an attribute or a child element rather than using the text content. If you ever have to add a child element, and you have that text content there, you will end up with mixed content (text as a sibling of an element), which gets messy to process.

Update: further reading

A few good articles on the XML elements-vs-attributes debate, including when to use each:

Principles of XML design: When to use elements versus attributes - well-integrated article by Uche Ogbuji
SGML/XML Elements versus Attributes - with many links to other commentary on this old debate
Elements or Attributes?

See also this SO question (but I think the above give more profitable reading).

LarsH 2010-10-01 06:14:33

Good answer, and if you could tell me if there is a difference in handling those in lxml or not I'd gladly accept your answer.

Makis 2010-10-01 10:20:05

@Makis, sorry, I don't know anything about lxml. I'm just speaking from an XML perspective.

LarsH 2010-10-01 15:18:45

Answer 2

A:

Remember the critical limitations on XML attributes:

Attribute names must be XML names.
An element can have only one attribute with a given name.
The ordering of attributes is not significant.

In other words, attributes represent key/value pairs. If you can represent it in Python as a dictionary whose keys are XML names and whose values are strings, you can represent it in XML as a set of attributes, no matter what "it" is.

If you can't - if, for instance, ordering is significant, or you need a value to include child elements - then you shouldn't use attributes.

Robert Rossney 2010-10-01 18:16:11

ansaurus

tags:

views:

answers:

Which XML style is better when handling it with Python's ElementTree?

Update: further reading

related questions