ansaurus

Question

grabbing a substring while scraping with Python2.6

Answer 1

+2 A:

I imagine upc_code is the list you're showing us, and the local_links one has nothing to do with your question, right? You don't mention it anywhere else in your code.

So I'm not certain what upc_text would be in your loop's body given that upc is a ul Tag -- upc.contents is going to be a list of li tags (presumably), and I don't see how upc.contents.contents can work -- what are you seeing as a result of that code? I would have expected an exception!
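To make the expected exception concrete, here is a minimal sketch. It assumes the newer bs4 package rather than the BeautifulSoup 3 module the question probably used, and the markup is a hypothetical stand-in for the page being scraped. The point is only that contents is a plain Python list, so asking that list for .contents in turn should fail.

```python
from bs4 import BeautifulSoup  # assumption: bs4 is installed (BeautifulSoup 3 used a different import)

# Hypothetical markup; the question's real HTML is not shown in this answer.
html = "<ul><li><strong>UPC:</strong> 012345678905</li></ul>"
upc = BeautifulSoup(html, "html.parser").find("ul")

children = upc.contents          # a plain Python list of child nodes
try:
    children.contents            # a list has no .contents attribute
    error_name = None
except AttributeError as exc:
    error_name = type(exc).__name__

print(error_name)
```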

Anyway, the way I'd write the loop would be something like:

for upc in upc_code:
    listitems = upc.findAll('li')
    for anitem in listitems:
        print anitem.contents[1]

since you appear to want the second child of each list item (the first one is the strong tag, the second one the navigable string you want).

If it's not the second child of each list item that you want, please clarify; for example, you could identify the strong and get its next sibling, if that suits you better -- just make the body of the nested loop into

print anitem.find('strong').nextSibling
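For comparison, here is a small self-contained sketch that runs both variants over the same input. It assumes the bs4 package, where find_all and next_sibling are the current spellings of findAll and nextSibling, and it uses made-up markup and UPC values, since the real page is not shown. The strip() calls are an addition to drop the whitespace around the text.

```python
from bs4 import BeautifulSoup  # assumption: bs4 is installed

# Hypothetical markup standing in for the scraped page.
html = """
<ul>
  <li><strong>UPC:</strong> 012345678905</li>
  <li><strong>UPC:</strong> 036000291452</li>
</ul>
"""
soup = BeautifulSoup(html, "html.parser")
upc_code = soup.find_all("ul")

by_index = []    # second child of each li
by_sibling = []  # whatever follows the strong tag
for upc in upc_code:
    for anitem in upc.find_all("li"):
        by_index.append(anitem.contents[1].strip())
        by_sibling.append(anitem.find("strong").next_sibling.strip())

print(by_index)
print(by_sibling)
```

On markup shaped like this the two lists should match. The sibling-based version is likely the more robust choice if other nodes might appear before the strong tag, since it does not depend on a fixed child position.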

Alex Martelli 2010-05-17 00:32:43

You are right, I hadn't changed that when I posted. The upc.contents.contents didn't work. Cheers!

Diego 2010-05-28 02:46:31
