ansaurus

Question

Cleaning up and removing tags with BeautifulSoup

Answer 1

A:

This will do it for this EXACT html. Obviously this isn't tolerant of any deviation, so you'll want to add quite a lot of bounds checking and null checking, but here's the nuts and bolts to get your data into plain text.

items = soup.findAll(id="info")
print items[0].span.b.contents[0]
print items[0].contents[3].strip()
print items[0].contents[5].strip().split(":", 1)[1]

Peter Lyons 2010-07-01 00:42:23

Thanks, Peter, this is exactly what I needed!

Parker 2010-07-01 11:37:03

ansaurus

tags:

views:

answers:

Cleaning up and removing tags with BeautifulSoup

related questions