ansaurus

Question

Select specific child elements with BeautifulSoup

Answer 1

+1 A:

You are more flexible with find, and to get what you want you just need to run:

node = p.find('div', text="Content I Want")

But since it might not be how you want to get there, following options might suit you better:

xml = """<div id="top"><div>Content</div><div><div>Content I Want</div></div></div>"""
from BeautifulSoup import BeautifulSoup
p = BeautifulSoup(xml)

# returns a list of texts
print p.div.div.findNextSibling().div.contents
# returns a list of texts
print p.div.div.findNextSibling().div(text=True)
# join (and strip) the values
print ''.join(s.strip() for s in p.div.div.findNextSibling().div(text=True))

van 2009-10-15 11:34:56

ansaurus

tags:

views:

answers:

Select specific child elements with BeautifulSoup

related questions