ansaurus

Question

How to generate an *exact* copy of an XML document with resolved entities

Answer 1

+1 A:

Is it possible for you to read in the xml template as a string? And with the string do something like

string s = "<title>&title;</title>";
s = s.replace("&title;", "Stack Overflow Madness");
SaveXml(s);

2009-10-29 06:40:22

Unfortunately, I can't. The entity resolution is much more complex than simply replacing with something else. So I will have to use org.xml.sax.EntityResolver to do it.

2009-10-29 06:44:07

Answer 2

+1 A:

You almost certainly cannot do this using any XML parser I've heard of, and certainly the Sun XML parsers cannot do it. They will happily discard details that have no significance as far as the meaning of the XML is concerned. For example,

<title>Stack Overflow Madness</title>

and

<title >Stack Overflow Madness</title >

are indistinguishable from the perspective of the XML syntax, and the Sun parsers (rightly) treat them as identical.

I think your choices are to do the replacement treating the XML as text (as @Wololo suggests) or relax your requirements.

By the way, you can probably use an XmlEntityResolver independently of the XML parser. Or create a class that does the same thing. This may mean that String.replace... is not the answer, but you should be able to implement an ad-hoc expander that iterates over the characters in a character buffer, expanding them into a second one.

Stephen C 2009-10-29 06:51:36

Big +1 on this. Perhaps if you (OP) were to explain **why** you need to preserve exact XML, someone would be able to suggest a better approach.

ChssPly76 2009-10-29 06:57:40

There is no reason really. It would just be nicer if it were possible, and I didn't know if it were/weren't possible with a Java XML parser, so I asked.

2009-10-29 07:14:15

ansaurus

tags:

views:

answers:

How to generate an exact copy of an XML document with resolved entities

related questions