ansaurus

Question

Deciding on when to use XmlDocument vs XmlReader

Answer 1

A:

There is a size threshold at which XmlDocument becomes slower, and eventually unusable. But the actual value of the threshold will depend on your application and XML content, so there are no hard and fast rules.

If your XML file can contain large lists (say tens of thousands of elements), you should definitely be using XmlReader.

Joe 2009-10-01 16:43:27

Answer 2

+6 A:

Braveyard 2009-10-01 16:45:52

I'd be interested to see if the `XmlDocument` performance changes any if you use `/*/child` instead of `//child` as your XPath pattern.

Robert Rossney 2009-10-01 18:50:32

You should not use `new XmlTextReader()` as of .NET 2.0. Use `XmlReader.Create` instead.

John Saunders 2009-10-01 20:39:20

Answer 3

+1 A:

XmlDocument is an in-memory representation of the entire XML document. Therefore if your document is large, then it will consume much more memory than if you had read it using XmlReader.

This is assuming that when you use XmlReader you read and process the elements one-by-one then discard it. If you use XmlReader and construct another intermediary structure in memory then you have the same problem, and you're defeating the purpose of it.

Google for "SAX versus DOM" to read more about the difference between the two models of processing XML.

DSO 2009-10-01 20:08:32

Answer 4

+8 A:

I've generally looked at it not from a fastest perspective, but rather from a memory utilization perspective. All of the implementations have been fast enough for the usage scenarios I've used them in (typical enterprise integration). However, where I've fallen down, and sometimes spectacularly, is not taking into account the general size of the xml I'm working with. :) If you think about it up front you can save yourself some grief.

Xml tends to bloat when loaded into memory, at least with a DOM reader like XmlDocument or XPathDocument. Something like 10:1? Exact amount is hard to quantify, but if its 1MB on disk it will be 10MB in memory, or more, for example.

A process using any reader that loads the whole document into memory in its entirety (XmlDocument/XPathDocument) can suffer from large object heap fragmentation, which can ultimately lead to OutOfMemoryExceptions (even with available memory) resulting in an unavailable service/process.

Since objects that are greater than 85K in size end up on the large object heap, and you've got a 10:1 size explosion with a DOM reader, you can see it doesn't take much before your xml documents are being allocated from the large object heap.

XmlDocument is very easy to use. Its only real drawback is that it loads the whole xml document into memory to process. Its seductively simple to use.

XmlReader is a stream based reader so will keep your process memory utilization generally flatter but is more difficult to use.

XPathDocument tends to be a faster, read-only version of XmlDocument, but still suffers from memory 'bloat'.

Zach Bonham 2009-10-01 20:35:10

ansaurus

tags:

views:

answers:

Deciding on when to use XmlDocument vs XmlReader

related questions