tags:

views: 91

answers: 4

I have a large XML file (1 GB). I need to run many queries against this file (using XPath, for example). The results are small parts of the XML. I want the queries to be as fast as possible, but the 1 GB file is probably too large to fit in working memory.

The XML looks something like this:

<all>
  <record>
      <id>1</id>
      ... lots of fields. (Very different fields per record, including (sometimes) subrecords,
      so mapping onto a relational database would be hard.)
  </record>
  <record>
      <id>2</id>
      ... lots of fields.
  </record>
  .. lots and lots and lots of records
</all>

I need random access, selecting records using, for instance, the id as a key (the id is the most important key, but other fields might be used as keys too). I don't know the queries in advance; they arrive and have to be executed as soon as possible, in real time rather than in batches. SAX does not look very promising, because I don't want to reread the entire file for every query. But DOM doesn't look very promising either, because the file is very large, and the overhead of the additional structure almost certainly means it will not fit in working memory.

Which Java library or approach would be best suited to this problem?

A: 

Piccolo is a small, extremely fast XML parser for Java. It implements the SAX 1, SAX 2.0.1, and JAXP 1.1 (SAX parsing only) interfaces as a non-validating parser. It is available under the Apache License.
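
A minimal sketch of plugging Piccolo in as the SAX driver; the driver class name com.bluecast.xml.Piccolo is taken from memory of Piccolo's documentation and should be verified against the distribution, and the file name is a placeholder:

import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;
import org.xml.sax.helpers.XMLReaderFactory;

public class PiccoloSketch {
    public static void main(String[] args) throws Exception {
        // Ask SAX for Piccolo's driver explicitly; the class name is an assumption.
        XMLReader reader = XMLReaderFactory.createXMLReader("com.bluecast.xml.Piccolo");
        reader.setContentHandler(new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName, String qName,
                                     org.xml.sax.Attributes atts) {
                System.out.println("element: " + qName);
            }
        });
        reader.parse("records.xml"); // placeholder file name
    }
}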

venJava
The last release of Piccolo is from 2004 and there are open bug reports that are several years old, so I would not recommend using it.
Jörn Horstmann
+3  A: 

When handling XML you generally have two approaches: streaming (SAX) or loading the entire document into memory (various DOM implementations).

If you can pre-establish a set of queries to be processed in bulk, you could write a program that uses SAX to stream the file, looking for matches (a rough sketch of that approach is shown below). If the queries arrive at random intervals (i.e., a typical database application), then you will need to either load the entire document into memory or preprocess the XML document into a database of some kind.
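
A minimal sketch of the streaming approach, assuming the <all>/<record>/<id> layout from the question and that <id> appears only directly under <record>; the file name and target id are placeholders, and note that every query rescans the whole file:

import java.io.File;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

// Streams the file with SAX and reports records whose <id> matches a target value.
public class RecordFinder extends DefaultHandler {
    private final String targetId;
    private final StringBuilder text = new StringBuilder();
    private boolean inRecord;
    private String currentId;

    RecordFinder(String targetId) { this.targetId = targetId; }

    @Override
    public void startElement(String uri, String local, String qName, Attributes atts) {
        if ("record".equals(qName)) { inRecord = true; currentId = null; }
        text.setLength(0); // start collecting the text of the current element
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        text.append(ch, start, length);
    }

    @Override
    public void endElement(String uri, String local, String qName) {
        if (inRecord && "id".equals(qName)) {
            currentId = text.toString().trim();
        } else if ("record".equals(qName)) {
            if (targetId.equals(currentId)) {
                System.out.println("found record with id " + currentId);
            }
            inRecord = false;
        }
    }

    public static void main(String[] args) throws Exception {
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new File("records.xml"), new RecordFinder("2"));
    }
}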

A better description of what you're trying to accomplish might help get better answers.

Jim Garrison
+1 for the better description for better answers ...
Xavier Combelle
A: 

Depending on the application, an XML-oriented database such as eXist (http://exist.sourceforge.net/) could be interesting.
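
A minimal sketch of that route, assuming the document has already been loaded into eXist and that its REST interface is reachable at the URL below with a _query parameter; both the endpoint and the parameter name are assumptions to check against the eXist documentation:

import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URL;
import java.net.URLEncoder;

public class ExistQuerySketch {
    public static void main(String[] args) throws Exception {
        // XPath over the stored document; endpoint and parameter name are assumptions.
        String xpath = "/all/record[id = '2']";
        URL url = new URL("http://localhost:8080/exist/rest/db/records.xml?_query="
                + URLEncoder.encode(xpath, "UTF-8"));
        HttpURLConnection conn = (HttpURLConnection) url.openConnection();
        try (BufferedReader in = new BufferedReader(
                new InputStreamReader(conn.getInputStream(), "UTF-8"))) {
            String line;
            while ((line = in.readLine()) != null) {
                System.out.println(line); // the matching <record> fragments
            }
        }
    }
}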

Xavier Combelle
+1  A: 

vtd-xml is the best fit for your use case. http://vtd-xml.sourceforge.net/
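
A minimal sketch of what that could look like, assuming the VTD-XML classes VTDGen, VTDNav and AutoPilot behave as I recall (parse the file once, keep the resulting index around, and answer XPath queries against it); the file name and the XPath are placeholders:

import com.ximpleware.AutoPilot;
import com.ximpleware.VTDGen;
import com.ximpleware.VTDNav;

public class VtdLookupSketch {
    public static void main(String[] args) throws Exception {
        VTDGen gen = new VTDGen();
        // Parse once; the in-memory index can then serve many queries.
        if (!gen.parseFile("records.xml", false)) { // false = no namespace awareness
            throw new RuntimeException("parse failed");
        }
        VTDNav nav = gen.getNav();
        AutoPilot ap = new AutoPilot(nav);
        ap.selectXPath("/all/record[id = '2']"); // placeholder query
        while (ap.evalXPath() != -1) {
            nav.push(); // save the cursor before navigating into the match
            if (nav.toElement(VTDNav.FIRST_CHILD, "id")) {
                int token = nav.getText();
                if (token != -1) {
                    System.out.println("matched record id = " + nav.toString(token));
                }
            }
            nav.pop(); // restore the cursor for the next match
        }
    }
}

Note that, as far as I know, standard VTD-XML still keeps the whole document plus its location index in memory, so check that the 1 GB file fits; there is an extended edition aimed at larger files.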

Pangea
This looks promising. I'll look into it, and if it suits my needs I'll mark the question as answered.
Jan