ansaurus

Question

Using Python lxml.html how can I find images within link tags?

Answer 1

+1 A:

Just modify your CSS selector:

for img in doc.cssselect('a img'):

You can also use an XPath expression (the leading // matches links anywhere in the document, not only direct children of doc):

for img in doc.xpath('//a//img'):
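
For anyone who wants to try this end to end, here is a minimal sketch of the XPath variant. The sample HTML and the names SAMPLE and alts are invented for illustration and are not from the question; it assumes the document has been parsed with lxml.html.fromstring.

```python
import lxml.html

# Hypothetical sample markup, invented for this illustration.
SAMPLE = """
<html><body>
  <a href="/one"><img src="one.png" alt="First"></a>
  <a href="/two">Plain text link</a>
  <p><a href="/three"><span><img src="three.png" alt="Third"></span></a></p>
  <img src="loose.png" alt="Not in a link">
</body></html>
"""

doc = lxml.html.fromstring(SAMPLE)

# '//a//img' selects every img that has an a element as an ancestor,
# at any depth, wherever the link sits in the document.
alts = [img.get('alt') for img in doc.xpath('//a//img')]
print(alts)
```

The image outside any link and the text-only link are both skipped. doc.cssselect('a img') should select the same elements, but on recent lxml versions it needs the separate cssselect package to be installed.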

mikerobi 2010-10-31 00:52:02

Does that also pick up links where there is no img?

Wizzard 2010-10-31 01:21:08

No. Based on your question, it seemed all you wanted was the alt text: no image, no alt text.

mikerobi 2010-10-31 01:23:39

Answer 2

+1 A:

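# '//a' matches links anywhere in the document, not only direct children of doc
for link in doc.xpath('//a'):
    # './/img' finds the first image at any depth inside the link
    img = link.find('.//img')
    if img is not None:
        print('%s: %s' % (img.get('alt'), link.get('href')))
    else:
        print('%s: %s' % (link.text_content(), link.get('href')))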
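
The same idea can be wrapped in a function that returns the pairs instead of printing them, which makes it easier to check. This is only a sketch: link_labels, SAMPLE and pairs are hypothetical names, the sample HTML is made up, and stripping whitespace from the link text is my own addition.

```python
import lxml.html


def link_labels(doc):
    """Return (label, href) pairs for every link in doc.

    The label is the alt text of the first image inside the link,
    or the link's text content when it contains no image.
    """
    pairs = []
    for link in doc.xpath('//a'):
        img = link.find('.//img')  # first img at any depth, or None
        if img is not None:
            label = img.get('alt')
        else:
            label = link.text_content().strip()
        pairs.append((label, link.get('href')))
    return pairs


# Hypothetical sample markup, invented for this illustration.
SAMPLE = """
<html><body>
  <a href="/one"><img src="one.png" alt="First"></a>
  <a href="/two">Plain text link</a>
</body></html>
"""

doc = lxml.html.fromstring(SAMPLE)
pairs = link_labels(doc)
for label, href in pairs:
    print('%s: %s' % (label, href))
```

Note that img.get('alt') returns None when an image has no alt attribute, so a caller may want to supply a default there.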

dusan 2010-10-31 01:15:16
