ansaurus

Question

python beautifulsoup adding extra end tags

Answer 1

+1 A:

How about searching directly for each tag instead of trying to traverse into the table?

   for td in soup.find("td"):
        ...

its not unusual to find the tbody tag nested within a table automatically when its not in the code. Either you can code for it or just jump straight to the tr or td tag.

ebt 2010-08-17 17:25:08

That's a good thought and I tried that. When I run the code above it returns the whole table not each individual td. I think BS is breaking on this pages horrible html ... bot sure what to do about it though

bababa 2010-08-17 17:39:51

2 things, check the version your using. If you're using 3.1 switch back to 3.0 (http://www.crummy.com/software/BeautifulSoup/3.1-problems.html) else try lxml, IMHO its a better general parser than Soup.

ebt 2010-08-17 21:40:04

ansaurus

tags:

views:

answers:

python beautifulsoup adding extra end tags

related questions