ansaurus

Question

Is there a way to compute an XPath from a known value in a web page ?

Answer 1

+1 A:

Since XPath can select the n'th child element (ie /parentelement/child_element[2]), if you can figure out where in the tree the element is, then you should be able to generate an XPath back to it.

Thanatos 2009-05-23 03:59:35

Answer 2

A:

You didn't specify a language which makes this question difficult to answer. The python lxml module can do it

>>> a  = etree.Element("a")
>>> b  = etree.SubElement(a, "b")
>>> c  = etree.SubElement(a, "c")
>>> d1 = etree.SubElement(c, "d")
>>> d2 = etree.SubElement(c, "d")

>>> tree = etree.ElementTree(c)
>>> print(tree.getpath(d2))
/c/d[2]
>>> tree.xpath(tree.getpath(d2)) == [d2]
True

Even if you aren't using python you may find what you need in the module source code

SpliFF 2009-05-23 04:01:14

Answer 3

+1 A:

Here's a simple JS function to do it. It uses only previousSibling, nodeType, and parentNode, so it should be portable to other languages. However, the result is not readable (for humans), and it will not be particularly robust in the face of page changes.

In my experience, XPath is more useful if written by hand. However, you can certainly make an algorithm that will generate prettier (if probably slower) results.

function getXPath(node)
{
    if(node == document)
    return "/";
    var xpath = "";
    while (node != null && node.nodeType != Node.DOCUMENT_NODE)
    {
    print(node.nodeType);
    var pos = 1, prev;
    while ((prev = node.previousSibling) != null)
    {
        node = prev
        pos++;
    }
    xpath = "/node()[" + pos + "]" + xpath;
    node = node.parentNode;
    }
    return xpath;
}

Matthew Flaschen 2009-05-23 07:40:24

ansaurus

tags:

views:

answers:

Is there a way to compute an XPath from a known value in a web page ?

related questions