ansaurus

Question

Difference between attributes and style tags in lxml

Answer 1

A:

Using the CSS API really isn't the right approach. If you want to find all b elements, do:

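from lxml import html

# Read the whole file at once; no need to split it into lines first
with open(r'c:\myfile.htm', 'r') as f:
    strHTM = f.read()
newHTM = html.fromstring(strHTM)
# './/b' searches all descendants; a bare 'b' matches only direct children of the root
bElements = newHTM.findall('.//b')
for b in bElements:
    print(b.text_content())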

Martin v. Löwis 2009-09-29 05:59:08

This is where I started, and it does not work. As near as I can figure, it is because newHTM is a class, but now I am lost. I am not sure why I decided to operate on each in newHTM, but that was the key.

Burch Kealey 2009-09-29 16:23:19

What do you mean, "it does not work"? It works fine for me.

Martin v. Löwis 2009-09-29 17:17:18

Well, I am wrong: both newHTM and the each in newHTM are the same type of object, so that is not it.

PyNEwbie 2009-09-29 17:25:46

Well, I would edit this but I can't: fromString should be fromstring, and your list is named differently in the loop. But when I run this code on my htm fragment, bElements has a length of 0.

PyNEwbie 2009-09-29 17:31:30
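
One plausible explanation for the empty result reported above is that findall with a bare tag name only matches direct children of the element it is called on, while the b elements sit deeper in the tree. The sketch below illustrates the difference on a made-up document (SAMPLE is a hypothetical stand-in, not the poster's file); whether this is what happened with the original fragment is an assumption.

```python
from lxml import html

# Hypothetical sample document: the b elements are nested inside body/p,
# so they are not direct children of the root html element.
SAMPLE = "<html><body><p>Some <b>bold</b> text and <b>more</b>.</p></body></html>"

root = html.fromstring(SAMPLE)

direct_children = root.findall('b')      # looks only at children of the root
all_descendants = root.findall('.//b')   # looks at any depth below the root
via_iter = list(root.iter('b'))          # iter() also walks the whole tree

print(len(direct_children), len(all_descendants), len(via_iter))
for b in all_descendants:
    print(b.text_content())
```

If the bare-name search comes back empty while the './/b' search does not, the elements are simply nested, and either './/b' or iter('b') should be a reasonable replacement.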
