xml-parse

Parsing XML With Single Quotes?

I am currently running into a problem where an element is coming back from my XML file with a single quote in it. This causes xml_parse to break it up into multiple chunks. For example, "Get Wired, You're Hired!" is interpreted as 'Get Wired, You' being one object, the single quote being a second, and 're Hired!' being a third. What I w...
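The splitting described in this excerpt is typical of event-based parsers, which are allowed to deliver one text node in several pieces. A minimal sketch of the usual workaround, written in Python with the standard `xml.parsers.expat` module rather than PHP's `xml_parse`: buffer every chunk and join them when the element closes. The `<title>` element name and the `collect_text` helper are assumptions made for illustration, not taken from the question.

```python
import xml.parsers.expat


def collect_text(xml_data, tag):
    """Return the full text of each <tag> element, joining any chunks."""
    results = []
    chunks = []
    inside = False

    def start(name, attrs):
        nonlocal inside
        if name == tag:
            inside = True
            chunks.clear()

    def chars(data):
        # The parser may call this several times for a single text node.
        if inside:
            chunks.append(data)

    def end(name):
        nonlocal inside
        if name == tag:
            results.append("".join(chunks))
            inside = False

    parser = xml.parsers.expat.ParserCreate()
    parser.StartElementHandler = start
    parser.CharacterDataHandler = chars
    parser.EndElementHandler = end
    parser.Parse(xml_data, True)
    return results


# Hypothetical sample document built around the title from the question.
titles = collect_text(
    "<items><title>Get Wired, You're Hired!</title></items>", "title"
)
```

Because the text is only assembled in the end-element handler, the result does not depend on how many pieces the parser chooses to deliver.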

Are line breaks in XML attribute values valid?

I realise that it's not elegant or desired, but is it valid for an attribute value in an XML element to span multiple lines? e.g. <some-xml-element value="this value goes over.... multiple lines!" /> Yeah, I realise there are better ways of writing that. I would personally write it like: <some-xml-element> <value>this value goes over.....
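A literal line break inside an attribute value is well-formed XML, but a conforming parser normalizes it to a space before handing the value to the application. The sketch below checks this with Python's standard `xml.etree.ElementTree`; the element name comes from the excerpt, while the exact attribute text is an assumed stand-in for the elided original.

```python
import xml.etree.ElementTree as ET

# A raw newline in the attribute value is accepted by the parser...
doc = '<some-xml-element value="this value goes over\nmultiple lines!" />'
value = ET.fromstring(doc).get("value")
# ...but attribute-value normalization replaces it with a single space.

# A character reference is not normalized, so the newline survives.
escaped_doc = '<some-xml-element value="this value goes over&#10;multiple lines!" />'
escaped_value = ET.fromstring(escaped_doc).get("value")
```

If the line break itself matters, writing it as `&#10;` or moving the text into element content are the two options that keep it intact.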

Umlaut in Java SAX Parser

Hello everybody! I am currently having trouble with German umlaut values in an XML document I received. It displays / saves the value as a "ü" instead of a "ü". The XML encoding is set to UTF-8, which should be capable of displaying umlauts. Also, I couldn't find any option to set a locale on the SAX parser. Is there any other way I c...
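One common cause of this kind of symptom, offered here as an assumption since the excerpt does not show how the file is read, is that the UTF-8 bytes are decoded with a single-byte charset somewhere before or after parsing. The sketch below illustrates the mechanism in Python rather than Java; the `<name>` element and the sample value are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, encoded as UTF-8 as its declaration says.
xml_bytes = (
    '<?xml version="1.0" encoding="UTF-8"?><name>Müller</name>'
).encode("utf-8")

# Decoding the same bytes as Latin-1 turns each umlaut into two characters.
garbled = xml_bytes.decode("latin-1")

# Passing the raw bytes to the parser lets it apply the declared encoding.
name = ET.fromstring(xml_bytes).text
```

The same idea applies to a SAX parser: give it a byte stream rather than an already decoded character stream, and check the encoding used when the value is printed or saved.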

BioPython: extracting sequence IDs from a Blast output file

Hi, I have a BLAST output file in XML format. It contains 22 query sequences with 50 hits reported for each sequence, and I want to extract all 50x22 hits. This is the code I currently have, but it only extracts the 50 hits from the first query. from Bio.Blast import NCBIXML blast_records = NCBIXML.parse(result_handle) blast_record = bl...