ansaurus

Question

Answer 1

+1 A:

You can cut out the strings:

from lxml.html import HtmlComment # or similar
no_comments=[element for element in element_list if not isinstance(element, HtmlComment)]

Matthew Flaschen 2010-09-04 22:23:58

Didn't work my list still included comments Humm, but it might work earlier the elements in element_list, if they are comments are the comments - does that make sense? An element that is a comment is , an element that is not a comment is <Element br at 12b9928>

PyNEwbie 2010-09-04 22:37:22

But it does work here elements=[e for e in theTree.cssselect('text')[0].iter()) if not isinstance(e,HtmlComment)]

PyNEwbie 2010-09-04 22:41:56

ansaurus

tags:

views:

answers:

How to access comments using lxml

related questions