ansaurus

Question

Finding the parent tag of a text string with ElementTree/lxml

Answer 1

+1 A:

This is a simple way to do it with ElementTree. It does require that your HTML input is valid XML (so I have added the appropriate end tags to your HTML):

import elementtree.ElementTree as ET

html = """<html>
<head>
</head>
<body>
<div>
<p>TEXT STRING HERE ......</p> 
</div>
</body>
</html>"""

for e in ET.fromstring(html).getiterator():
    if e.text.find('TEXT STRING HERE') != -1:
        print "Found string %r, element = %r" % (e.text, e)

mhawke 2009-06-22 01:19:18

ansaurus

tags:

views:

answers:

Finding the parent tag of a text string with ElementTree/lxml

related questions