ansaurus

Question

Java: Using XPath to loop over nodes and extract specific subnode values

Answer 1

A:

Yes, you can do it like this:

// Assumes 'doc' is an org.w3c.dom.Document that has already been parsed.
// Needs javax.xml.xpath.* and org.w3c.dom.Node / org.w3c.dom.NodeList.
XPath xpath = XPathFactory.newInstance().newXPath();
XPathExpression productExpr = xpath.compile("/products/product");
XPathExpression titleExpr = xpath.compile("title"); // relative: finds 'title' within a 'product'

NodeList products = (NodeList) productExpr.evaluate(doc, XPathConstants.NODESET);
for (int i = 0; i < products.getLength(); i++) {
    Node product = products.item(i);
    // Evaluate the relative expression with the current 'product' as the context node.
    NodeList titles = (NodeList) titleExpr.evaluate(product, XPathConstants.NODESET);
    if (titles.getLength() > 0) { // guard against a 'product' without a 'title'
        System.out.println("TITLE: " + titles.item(0).getTextContent());
    }
}

This example extracts the 'title' value. You can extract 'image' the same way: compile a second relative expression, "image", and evaluate it against each 'product' node.
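For completeness, here is a minimal self-contained sketch of the same idea that reads both 'title' and 'image'. The class name ProductXPathDemo, the extract helper and the sample XML are made up for illustration; only the element names come from the question. It uses XPathConstants.STRING instead of NODESET, which returns an empty string when the child element is missing, so no null check is needed.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpression;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class ProductXPathDemo {

    // Hypothetical sample input; only the element names follow the question.
    static final String SAMPLE_XML =
        "<products>"
        + "<product><title>Lamp</title><image>lamp.png</image></product>"
        + "<product><title>Chair</title><image>chair.png</image></product>"
        + "</products>";

    /** Returns one "title|image" string per 'product' element. */
    static List<String> extract(String xml) throws Exception {
        // Build the DOM document that the XPath expressions run against.
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));

        XPath xpath = XPathFactory.newInstance().newXPath();
        XPathExpression productExpr = xpath.compile("/products/product");
        // Relative expressions, evaluated with each 'product' as the context node.
        XPathExpression titleExpr = xpath.compile("title");
        XPathExpression imageExpr = xpath.compile("image");

        NodeList products = (NodeList) productExpr.evaluate(doc, XPathConstants.NODESET);
        List<String> rows = new ArrayList<>();
        for (int i = 0; i < products.getLength(); i++) {
            Node product = products.item(i);
            // STRING yields the text of the first match, or "" if there is none.
            String title = (String) titleExpr.evaluate(product, XPathConstants.STRING);
            String image = (String) imageExpr.evaluate(product, XPathConstants.STRING);
            rows.add(title + "|" + image);
        }
        return rows;
    }

    public static void main(String[] args) throws Exception {
        for (String row : extract(SAMPLE_XML)) {
            System.out.println(row);
        }
    }
}
```

Compiling the relative expressions once, outside the loop, avoids recompiling them for every product.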

Gopi 2010-10-22 11:58:04

Answer 2

A:

I'm not a big fan of this approach because you have to build a document (which might be expensive) before you can apply XPaths to it.

I've found VTD-XML a lot more efficient when it comes to applying XPaths to documents, because it does not build a DOM object tree; it indexes the raw bytes of the document instead. Here is some sample code:

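// Needs com.ximpleware.*; the XPath and navigation calls below throw checked exceptions.
final VTDGen vg = new VTDGen();
if (!vg.parseFile("file.xml", false)) { // false = not namespace-aware; returns false on failure
    throw new IllegalStateException("Could not parse file.xml");
}
final VTDNav vn = vg.getNav();
final AutoPilot ap = new AutoPilot(vn);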

ap.selectXPath("/products/product");
while (ap.evalXPath() != -1) {
    System.out.println("PRODUCT:");

    // you could either apply another xpath or simply get the first child
    if (vn.toElement(VTDNav.FIRST_CHILD, "title")) {
        int val = vn.getText();
        if (val != -1) {
            System.out.println("Title: " + vn.toNormalizedString(val));
        }
        vn.toElement(VTDNav.PARENT);
    }
    if (vn.toElement(VTDNav.FIRST_CHILD, "image")) {
        int val = vn.getText();
        if (val != -1) {
            System.out.println("Image: " + vn.toNormalizedString(val));
        }
        vn.toElement(VTDNav.PARENT);
    }
}

Also see this post on Faster XPaths with VTD-XML.

dogbane 2010-10-22 12:49:55
