ansaurus

Question

Is there any use for putting a string that contains enters (char 10 or 13) in a xml CDATA section?

Answer 1

A:

I could be way off base on this, but I seem to remember it being a good recommendation to put Javascript code inside CDATA tags. In fact see the selected answer for this stack overflow question as it does a decent job on answering why: http://stackoverflow.com/questions/66837/javascript-cdata-tags#66865

Jordan S. Jones 2009-05-29 09:15:28

clearly not using javascript here so I don't see how that's relevant

Jonathan Fingland 2009-05-29 09:17:19

Yup, now that you mention it, whenever I write XHTML I always put javascript and css in CDATA tags - it just makes your life easier when you need to use ampersands freely.

Elijah 2009-05-29 09:18:05

It's not javascript, but basic text.

2009-05-29 09:23:57

The original poster didn't define whether it was javascript or not. My answer simply dictates when it would be advisable to put a "string" with CR\LFs in a CDATA tag.

Jordan S. Jones 2009-05-29 09:29:34

Answer 2

A:

I know it sounds obvious, but if you are embedding a plain ascii text file and you want to preserve the manual formatting of the file verbatim. That would be a useful case.

Other cases that I have encountered are outputting metadata from images and I have no control over their formatting.

Elijah 2009-05-29 09:16:44

Answer 3

+1 A:

In XML, CDATA preserves whitespace, ordinary text does not.

anon 2009-05-29 09:18:40

Answer 4

A:

Putting text inside a CDATA section should ensure that any parser ignores it, so the code above might be used to ensure correct formatting regardless what a parser is told to do with whitespace.

I supposed that it effectively says that the line breaks are meaningful in that section, and not just incidental. Not sure why you would only put a CDATA section in if there were linebreaks present though, so I would guess it's just a workaround rather than a by-design thing in the code given.

Brabster 2009-05-29 09:19:10

I suspect a work-around too, but unfortunatly, the code commit doesn't document why this is written the way it is (whats new :-) ). So I hoped that somebody would recognize a similar case.

2009-05-29 09:34:41

Answer 5

A:

I would say it depends entirely on whether your XML parse strips whitespace and control characters. I'm fairly certain the System.Xml ones in .NET don't, nor MSXML or Xerces but there are options to do it.

Chris S 2009-05-29 09:20:22

Ok. In my testcase I used `root.getNodeValue()` to retrieve the string. So it depends on the xml implementation I use if I would get "mytext\n\n\n" back or "mytext"!? I hope this stripping of whitespace is not the default. I'll go check.

2009-05-29 09:29:03

In .net, there are properites in the Xml readers that control whitespace preservation (e.g. XmlDocument.PreserveWhitespace)

Jason Williams 2009-10-11 17:50:18

Answer 6

A:

Since CDATA sections allow you to put arbitrary data inside an XML document without having to understand anything about how the XML objects are going to handle it, they're frequently used by people who don't understand how the XML objects work. Generally speaking, when I see someone creating CDATA in their XML I start from the assumption that they don't really know what they're doing unless they've included a good explanation. (And more often than not, that good explanation reveals that they didn't know what they were doing.)

The original developer is probably confusing the DOM's handling of text nodes that contain whitespace with its handling of text nodes that contain only whitespace. DOMs frequently normalize whitespace-only text nodes, which can be a problem in XML like:

<xsl:value-of select="foo"/>
<xsl:text>    </xsl:text>
<xsl:value-of select="bar"/>

If the DOM normalizes the four spaces in that second element down to one space, that changes the functionality of that transform, which is an unambiguously bad thing.

But there's a reason you don't see XSLT that looks like this:

<xsl:value-of select="foo"/>
<xsl:text><![CDATA[    ]]>/xsl:text>
<xsl:value-of select="bar"/>

And that's that XSLT processors are written by people who understand how the XML objects work, and who know that in their specific case, it's important to tell the DOM to preserve whitespace in whitespace-only text nodes.

Robert Rossney 2009-10-11 17:43:26

ansaurus

tags:

views:

answers:

Is there any use for putting a string that contains enters (char 10 or 13) in a xml CDATA section?

related questions