
I wrote a custom XML reader because I needed something that would not read ahead from the source stream. I wanted the ability to have an object read its data from the stream without negatively affecting the stream for the parent object. That way, the stream can be passed down the object tree.

It's a minimal implementation, meant only to serve the purpose of the project that uses it (right now). It works well enough, except for one method -- ReadString. That method reads the current element's content as a string, stopping when the matching end element is reached; it determines this by counting nesting levels. Along the way, it reads from the stream character by character, appending to a StringBuilder to build the resulting string.

For a collection element, this can take a long time. I'm sure there is much that can be done to better implement this, so this is where my continuing education begins once again. I could really use some help/guidance. Some notes about methods it calls:

Read - returns the next byte in the stream, or -1 at end of stream.

ReadUntilChar - calls Read until the specified character or -1 is reached, appending everything read to the result with a StringBuilder.
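
For context, here is a guessed sketch of what those two helpers might look like. Only their contracts (next byte or -1; read until a character) come from the descriptions above; the class name and everything else is hypothetical, with `m_stream` borrowed from the posted code.

```csharp
using System.IO;
using System.Text;

// Hypothetical shapes for the two helper methods described above.
class MinimalXmlReader {
    private readonly Stream m_stream;
    public MinimalXmlReader(Stream stream) { m_stream = stream; }

    // Returns the next byte in the stream, or -1 at end of stream.
    public int Read() {
        return m_stream.ReadByte();
    }

    // Calls Read until the specified character (which is consumed) or -1 is
    // reached, appending everything before it to a StringBuilder.
    public string ReadUntilChar(char c) {
        var sb = new StringBuilder();
        int b;
        while ((b = Read()) != -1 && (char)b != c) {
            sb.Append((char)b);
        }
        return sb.ToString();
    }
}
```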

Without further ado, here is my two-legged turtle. Constants have been replaced with the actual values.

public string ReadString() {
    int level = 0;
    long originalPosition = m_stream.Position;
    StringBuilder sb = new StringBuilder();
    int read; // int, not sbyte: Read() returns -1 at end of stream, and a narrowing cast would mangle bytes >= 0x80
    try {
        // We are already within the element that contains the string.
        // Read until we reach an end element when the level == 0.
        // We want to leave the reader positioned at the end element.
        do {
            sb.Append(ReadUntilChar('<'));
            if((read = Read()) == '/') {
                // End element
                if(level == 0) {
                    // End element for the element in context, the string is complete.
                    // Replace the two bytes of the end element read.
                    m_stream.Seek(-2, System.IO.SeekOrigin.Current);
                    break;
                } else {
                    // End element for a child element.
                    // Add the two bytes read to the resulting string and continue.
                    sb.Append('<');
                    sb.Append('/');
                    level--;
                }
            } else {
                // Start element
                level++;
                sb.Append('<');
                sb.Append((char)read);
            }
        } while(read != -1);

        return sb.ToString().Trim();
    } catch {
        // Return to the original position that we started at.
        m_stream.Seek(originalPosition, System.IO.SeekOrigin.Begin);
        throw;
    }
}
+1  A: 

Your implementation assumes the Stream is seekable. If it is known to be seekable, why do any of this yourself? Just create an XmlReader at your position, consume the data, ditch the reader, and seek the Stream back to where you started.
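
A minimal sketch of that idea, assuming a seekable stream (the method name and settings here are illustrative, not part of the original suggestion):

```csharp
using System.IO;
using System.Xml;

// Remember the position, let a throwaway XmlReader consume the element (it
// will read ahead and buffer internally), then seek the stream back so the
// parent object sees it exactly as before.
static string ReadElementAsString(Stream stream) {
    long start = stream.Position;
    try {
        var settings = new XmlReaderSettings {
            ConformanceLevel = ConformanceLevel.Fragment,
            CloseInput = false // leave the underlying stream open
        };
        using (XmlReader reader = XmlReader.Create(stream, settings)) {
            reader.MoveToContent();
            return reader.ReadInnerXml(); // markup inside the current element
        }
    } finally {
        // Undo the reader's buffering by rewinding the seekable stream.
        stream.Seek(start, SeekOrigin.Begin);
    }
}
```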

How large is the XML? You may find that throwing the data into a DOM (XmlDocument / XDocument / etc.) is a viable way of getting a reader that does what you need without requiring lots of rework. In the case of XmlDocument, an XmlNodeReader would suffice, for example (it would also provide XPath support if you want to use non-trivial queries).
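
The DOM approach might look like this (sample data and names are illustrative): load the XML into an XmlDocument, then wrap any node in an XmlNodeReader, which walks the in-memory tree rather than the original stream.

```csharp
using System.Xml;

// Load a fragment into a DOM and read part of it back through an XmlNodeReader;
// the original stream is never disturbed.
static string ReadChildContent(string xml, string xpath) {
    var doc = new XmlDocument();
    doc.LoadXml(xml);

    XmlNode node = doc.SelectSingleNode(xpath); // XPath queries come for free
    using (var reader = new XmlNodeReader(node)) {
        reader.MoveToContent();
        return reader.ReadElementContentAsString();
    }
}
```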

Marc Gravell
In the case of a file stream, would I be losing performance gains by doing the repeated seeking backwards by possibly large amounts?
oakskc
@oakskc - there is only one way to find that out; try it... I wouldn't *expect* this to be *horribly* expensive, though.
Marc Gravell
A: 

Why not use an existing one, like this one?

RC
I'm trying to avoid loading a while DOM. Partly for resource reasons and partly to force myself into a different way of doing it.
oakskc
+3  A: 

Right off the bat, you should be using a profiler for performance optimization if you haven't already (I'd recommend SlimTune if you're on a budget). Without one you're just taking slightly educated stabs in the dark.

Once you've profiled the parser you should have a good idea of where the ReadString() method is spending all its time, which will make your optimizing much easier.

One suggestion I'd make at the algorithm level is to scan the stream first, and then build the contents out: instead of consuming each character as you see it, mark where you find the <, >, and </ tokens. Once you have those positions you can pull the data out of the stream in blocks rather than throwing characters into a StringBuilder one at a time. This will optimize away a significant number of StringBuilder.Append calls, which may improve your performance (this is where profiling would help).
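
A rough sketch of that scan-then-copy idea, assuming a seekable stream of single-byte characters (names and details are illustrative, not taken from the question's code):

```csharp
using System.Collections.Generic;
using System.IO;
using System.Text;

// First pass: record where each '<' occurs so block boundaries are known.
static List<long> MarkTagStarts(Stream stream) {
    var marks = new List<long>();
    int b;
    while ((b = stream.ReadByte()) != -1) {
        if (b == '<') marks.Add(stream.Position - 1); // position of the '<'
    }
    return marks;
}

// Second pass: one bulk Read per block instead of one Append per byte.
static string CopyBlock(Stream stream, long start, long end) {
    var buffer = new byte[end - start];
    stream.Seek(start, SeekOrigin.Begin);
    stream.Read(buffer, 0, buffer.Length);
    return Encoding.ASCII.GetString(buffer);
}
```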

You may find this analysis useful for optimizing string operations, if they prove to be the source of the slowness.

But really, profile.

ShZ
+1: for profiling advice. @oakskc: Most science isn't about thinking, it's about measurement and observation (then you think). The ancient philosophers "proved" that a horse must have one foot on the ground at all times, even when galloping. In the late 19th century, a running horse was captured on camera and the images were slowed down, and, through observation, the opposite was proved. You can't solve performance problems by ONLY thinking about them :)
Binary Worrier
+1 for recommending SlimTune. Profilers are WAY out of my budget, but SlimTune showed me that Stream.Length was a costly operation in my code. Once I stopped calling that repeatedly (my stream length doesn't change during this), it helped a lot.
oakskc
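
For reference, the Stream.Length fix from the last comment can be sketched like this (the method and names are illustrative, not the original code):

```csharp
using System.IO;

// Hoist the Stream.Length call out of the loop: on some streams (FileStream
// in particular) the Length property can involve an OS call each time.
static int CountRemainingBytes(Stream stream) {
    long length = stream.Length; // read once, up front
    int count = 0;
    while (stream.Position < length) {
        stream.ReadByte();
        count++;
    }
    return count;
}
```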