ansaurus

Question

Need help making Jython (dom4j) script more graceful :)

Answer 1

+2 A:

How about this (I don't claim to know much about Python, by the way, but this looks like an obvious first step):

for path in ('//xhtml:h1', '//xhtml:title'):
    elemHolder = dom.createXPath(path)
    elemHolder.namespaceURIs = map
    elem = elemHolder.selectSingleNode(dom)
    if elem is not None:
        return (elem.localName, elem.text)

return (None, "Page does not contain h1 or title tag")

Chris Jester-Young 2008-10-23 20:21:07

I got the concept and tweaked it to work. Cheers mate

Eef 2008-10-23 21:37:15

Answer 2

A:

That looks like it would work perfectly, only other thing is. I will be passing the value to a database and depending what was found its put in the appropriate column.

If its a H1 tag it will put it in the H1 column and if its a title tag it will get put in the title column.

Is there a way to detemine what tag was found also? Does this make sense?

Eef 2008-10-23 20:35:27

Yes, I've now made the function return a tuple, the first element of which is the tag name, and the second element of which is the result.

Chris Jester-Young 2008-10-23 20:40:30

ansaurus

tags:

views:

answers:

Need help making Jython (dom4j) script more graceful :)

related questions