On this data:
<row Id="37501" PostId="135577" Text="...uses though."/>
I'm getting an error from Python's SAX parser (`xml.sax`):
xml.sax._exceptions.SAXParseException:
comments.xml:29776:332: reference to invalid character number
I trimmed the example; column 332 of line 29776 points to "".
Is the parser correct in rejecting this character?
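
For reference, here is a minimal sketch that should reproduce the same class of error without the full comments.xml. The offending reference did not survive in the trimmed sample above, so the `&#x10;` used here is only an assumed stand-in (a control character outside the range XML 1.0 allows), and the `Id` value and the `try_parse` helper are hypothetical names chosen for illustration:

```python
import xml.sax


def try_parse(doc: bytes):
    """Return None if doc parses cleanly, else the parser's error message."""
    try:
        # A bare ContentHandler ignores all events; we only care about errors.
        xml.sax.parseString(doc, xml.sax.ContentHandler())
        return None
    except xml.sax.SAXParseException as exc:
        return exc.getMessage()


# ASSUMPTION: &#x10; stands in for the character lost from the original row.
# U+0010 is a control character that XML 1.0 does not permit, even when
# written as a numeric character reference.
bad = b'<row Id="1" Text="uses though.&#x10;"/>'

# A line feed (U+000A) is one of the few control characters XML 1.0 allows.
ok = b'<row Id="1" Text="uses though.&#x0A;"/>'

bad_error = try_parse(bad)
ok_error = try_parse(ok)

print("disallowed reference:", bad_error)
print("allowed reference:", ok_error)
```

If the real character is a different one, swapping it into `bad` should show whether the parser treats it the same way.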