ansaurus

Question

How to get a html elements with python lxml

Answer 1

+1 A:

Why dont you just fetch what you want in each step?

links = [el.text for el in html.xpath('//td[@class="test"][position() = 1]/b/a')]
smalls = [el.text for el in html.xpath('//td[@class="test"][position() = 4]/small')]
print zip(links, smalls) 
# => [('aaa', 'ddd'), ('eee', 'hhh')]

THC4k 2010-05-11 01:20:10

Answer 2

+2 A:

If you do el.text_content() you'll strip all the tag stuff from each element, i.e.:

result = [el.text_content() for el in result]

Ian Bicking 2010-05-11 02:13:07

ansaurus

tags:

views:

answers:

How to get a html elements with python lxml

related questions