ansaurus

Question

Answer 1

+4 A:

Do you have any control over the creation of the XML file? The content of any element that holds XML-like tags or markup characters ('<', '&', etc.) should be encoded to avoid this problem. You can do this with any of the following:

a CDATA section
Base64 or some other encoding (which doesn't include xml reserved characters)
Entity encoding ('<' becomes '&lt;')
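
As a rough illustration of the three options, here is a minimal sketch using Python's standard library. The element name `node` and the sample payload are made up for the example, and the CDATA variant assumes the payload never contains "]]>".

```python
import base64
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Hypothetical payload: HTML that would break a naive XML document.
html = '<b>bold</b> & more'

# 1. CDATA section (assumes the payload does not contain "]]>").
cdata_doc = '<node><![CDATA[' + html + ']]></node>'

# 2. Base64: the encoded text contains no XML reserved characters.
b64_doc = '<node>' + base64.b64encode(html.encode('utf-8')).decode('ascii') + '</node>'

# 3. Entity encoding: escape() rewrites '&', '<' and '>' as entities.
entity_doc = '<node>' + escape(html) + '</node>'

# All three documents parse, and the original payload can be recovered.
cdata_text = ET.fromstring(cdata_doc).text
b64_text = base64.b64decode(ET.fromstring(b64_doc).text).decode('utf-8')
entity_text = ET.fromstring(entity_doc).text
```

With CDATA and entity encoding the parser hands back the original string directly; Base64 needs an explicit decode step on the reading side.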

If you can't make these changes, and ElementTree can't ignore tags not included in the XML schema, then you will have to pre-process the file. Of course, you're out of luck if the schema's tag names overlap with HTML's, since a pre-processor then has no way to tell the two apart.
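
One possible pre-processing step, sketched under some strong assumptions: the HTML lives in a single known element (called `description` here, a made-up name), that element has no attributes, and it is never nested inside itself. The helper `wrap_in_cdata` is hypothetical, not part of ElementTree.

```python
import re
import xml.etree.ElementTree as ET

def wrap_in_cdata(xml_text, tag):
    """Wrap the body of every <tag>...</tag> in a CDATA section.

    Assumes the tag carries no attributes and is not nested in itself.
    """
    pattern = re.compile(
        r'(<{0}>)(.*?)(</{0}>)'.format(re.escape(tag)), re.DOTALL
    )

    def repl(match):
        # A literal "]]>" would end the CDATA section early, so split it
        # across two adjacent CDATA sections.
        body = match.group(2).replace(']]>', ']]]]><![CDATA[>')
        return match.group(1) + '<![CDATA[' + body + ']]>' + match.group(3)

    return pattern.sub(repl, xml_text)

# Not well-formed XML as it stands: <br> is unclosed and '&' is bare.
raw = '<item><description><p>Hello <br> world & co</p></description></item>'

fixed = wrap_in_cdata(raw, 'description')
root = ET.fromstring(fixed)
description_html = root.find('description').text
```

A regular expression is a blunt tool for this; it only holds up while the wrapping element is simple and its name never appears inside the embedded HTML.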

Dana the Sane 2009-07-06 18:22:37

Using a CDATA section solved the problem. Thanks!

Rafael Almeida 2009-07-06 18:30:10

Answer 2

+1 A:

Characters like "<" and "&" are illegal in the text content of XML elements unless they are escaped.

"<" will generate an error because the parser interprets it as the start of a new element.

"&" will generate an error because the parser interprets it as the start of an entity reference.

Some text, like JavaScript code, contains a lot of "<" or "&" characters. To avoid errors, script code can be defined as CDATA.

Everything inside a CDATA section is treated as plain character data: the parser does not interpret it as markup.

A CDATA section starts with "<![CDATA[" and ends with "]]>".
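
To see the difference in practice, here is a small sketch with Python's ElementTree. The `script` element and the JavaScript snippets are invented for the example, and `parses` is just a helper defined here.

```python
import xml.etree.ElementTree as ET

def parses(doc):
    """Return True if doc is well-formed XML, False on a parse error."""
    try:
        ET.fromstring(doc)
        return True
    except ET.ParseError:
        return False

# A bare "<" or "&" in element text makes the document ill-formed.
bare_lt = '<script>if (a < b) run();</script>'
bare_amp = '<script>if (a && b) run();</script>'

# The same kind of code inside a CDATA section parses fine.
in_cdata = '<script><![CDATA[if (a < b && b < c) run();]]></script>'

results = [parses(bare_lt), parses(bare_amp), parses(in_cdata)]
script_text = ET.fromstring(in_cdata).text
```

After parsing, the CDATA markers are gone and `script_text` holds only the code that was inside them.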

More information on: http://www.w3schools.com/xmL/xml_cdata.asp

Hope this helps!

ylebre 2009-07-06 18:25:28


HTML inside node using ElementTree
