ansaurus

Question

slow construction of tree structure from XML

Answer 1

A:

Have you tried profiling this ? I think that may be more instructive than looking at the code. It's quite often that a bottleneck shows up that you'd normally never expect. A simple profile (that you can do trivially in code) is to time the DOM parsing vs. your tree building.

For more in-depth profiling, JProfiler is available as an evaluation copy. Others may be able to recommend something more appropriate.

Brian Agnew 2009-07-16 11:40:18

I've only benchmarked the larger program that is using it, and it shows that this process is a bottleneck

Robert 2009-07-16 11:44:23

So I'd certainly look at the DOM parsing vs. your tree building

Brian Agnew 2009-07-16 11:45:57

Creating docBuilderFactory... Done [3ms]Creating docBuilder... Done [21ms]parsing file... Done [5646ms]getDocumentElement... Done [1ms]creating DomTree... Done [17076ms]

Robert 2009-07-16 11:56:37

If you're loading a 100Mb doc in, then your memory may be an issue. Try increasing the VM max memory size using -Xmx512m (to allocate up to 512m, or use whatever figure you can)

Brian Agnew 2009-07-16 12:11:04

actually its already set to -Xms2g -Xmx2g

Robert 2009-07-16 12:15:11

Ah. Just removed that from my answer. Thx

Brian Agnew 2009-07-16 12:29:27

Answer 2

+2 A:

If you're parsing a large XML, you don't use DOM, you use SAX, a pull parser such as XPP3 or anything else.

The problem is that you won't have an "XML tree" in memory which might be convenient, you only get events and deal with them accordingly. However it will be memory wise, and you can map to elements to your data structures.

John Doe 2009-07-16 11:51:33

do you have an example?

Robert 2009-07-16 12:07:09

ansaurus

tags:

views:

answers:

slow construction of tree structure from XML

related questions