I'm scraping an HTML page, then using xml.dom.minidom.parseString() to create a DOM object.
However, the HTML page contains a '&'. I can use cgi.escape to convert this into &amp;
but it also converts all my HTML <> tags into &lt;&gt;
which makes parseString() unhappy.
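
Here is a minimal sketch that reproduces what I'm seeing. The page fragment and the `try_parse` helper are made up for illustration, and `html.escape(..., quote=False)` is used as a stand-in for `cgi.escape`, which is not available in recent Python versions.

```python
import html
from xml.dom import minidom
from xml.parsers.expat import ExpatError

# Hypothetical page fragment; the real scraped page is much larger.
page = "<html><body><p>Tom & Jerry</p></body></html>"


def try_parse(text):
    """Return the parsed DOM, or None if minidom rejects the text."""
    try:
        return minidom.parseString(text)
    except ExpatError:
        return None


# 1. The bare '&' makes the raw page ill-formed XML, so parsing fails.
raw_dom = try_parse(page)

# 2. Escaping the whole page fixes the '&' but also turns every
#    '<' and '>' into '&lt;' and '&gt;', so there are no tags left to parse.
escaped = html.escape(page, quote=False)
escaped_dom = try_parse(escaped)
```

Both attempts fail to parse: the first because of the bare ampersand, the second because the markup itself has been escaped away.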
How do I go about this? I would rather not just hack it and do a straight replace of the "&"s.
Thanks