I'm using Python 2.x [not negotiable] to read XML documents [created by others] that allow the content of many elements to contain characters that are not valid XML characters by escaping them using the _xHHHH_ convention e.g. ASCII BEL aka U+0007 is represented by the 7-character sequence u"_x0007_". Neither the functionality that allows representation of any old character in the document nor the manner of escaping is negotiable. I'm parsing the documents using cElementTree or lxml [semi-negotiable].

Here is my best attempt at unescaping the parser output as efficiently as possible:

import re

def unescape(s,
             subber=re.compile(r'_x[0-9A-Fa-f]{4}_').sub,
             repl=lambda mobj: unichr(int(mobj.group(0)[2:6], 16)),
             ):
    if "_" in s:
        return subber(repl, s)
    return s
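For reference, here is a self-contained, version-tolerant copy of the same function; the `unichr`/`chr` shim is my addition so it runs on Python 3 as well, everything else mirrors the code above:

```python
import re

try:
    unichr  # Python 2
except NameError:
    unichr = chr  # Python 3: chr already returns a text character

def unescape(s,
             # pre-bound defaults avoid attribute lookups on every call
             subber=re.compile(r'_x[0-9A-Fa-f]{4}_').sub,
             repl=lambda mobj: unichr(int(mobj.group(0)[2:6], 16)),
             ):
    # cheap pre-check: most element text contains no underscore at all
    if "_" in s:
        return subber(repl, s)
    return s

print(repr(unescape(u"bell_x0007_bell")))  # 'bell\x07bell'
```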

The above is biased by the observation that "_" occurs at very low frequency in typical text, and by a better-than-doubling of speed from avoiding the regex apparatus where possible.
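That speed claim can be checked with `timeit`; a rough harness (the sample text is made up, and `chr` stands in for `unichr` on Python 2):

```python
import re
import timeit

subber = re.compile(r'_x[0-9A-Fa-f]{4}_').sub
repl = lambda m: chr(int(m.group(0)[2:6], 16))  # unichr on Python 2

def with_check(s):
    # skip the regex machinery entirely when no "_" is present
    if "_" in s:
        return subber(repl, s)
    return s

def without_check(s):
    return subber(repl, s)

sample = "typical element text with no escapes at all " * 10

t1 = timeit.timeit(lambda: with_check(sample), number=100000)
t2 = timeit.timeit(lambda: without_check(sample), number=100000)
print("pre-check: %.3fs   regex always: %.3fs" % (t1, t2))
```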

The question: Any better ideas out there?

+1  A: 

You might as well check for `'_x'` rather than just `'_'`; it won't matter much, but surely the two-character sequence is even rarer than the single underscore. Apart from such details, you do seem to be making the best of a bad situation!
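For comparison, the suggested variant looks like this (again with `chr` standing in for Python 2's `unichr`); note that the substring form of `in` it relies on was only introduced in Python 2.3:

```python
import re

subber = re.compile(r'_x[0-9A-Fa-f]{4}_').sub
repl = lambda m: chr(int(m.group(0)[2:6], 16))  # unichr on Python 2

def unescape_x(s):
    # two-character pre-check: "_x" is rarer than "_" alone,
    # so this skips the regex for strings like "snake_case"
    if "_x" in s:
        return subber(repl, s)
    return s

print(repr(unescape_x("a_x0007_b")))   # 'a\x07b'
print(repr(unescape_x("snake_case")))  # 'snake_case' (regex never runs)
```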

Alex Martelli
Checking for '_x' is slightly slower (Python 2.6) and doesn't work on Pythons earlier than 2.3.
John Machin
As for Python 2.2 and earlier, you're right @john -- I was kind of assuming `x >= 3` (is ANYbody still stuck with Python 2.2....?! If so, I'm DEEPLY sorry...!!!). As for relative speed, it depends on how many isolated `'_'` you get and how much slower checking with a regex can be (they tend to be effing fast in my experience, but the original poster says otherwise) -- URL to any specific benchmark pls?
Alex Martelli
A: 
xml.sax.saxutils.unescape() 

(maybe with htmlentitydefs module)
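For the record, `xml.sax.saxutils.unescape()` only rewrites the standard XML entities (`&amp;`, `&lt;`, `&gt;`), plus any extra mapping you pass in; it does not know about the `_xHHHH_` convention:

```python
from xml.sax import saxutils

print(saxutils.unescape("&lt;tag&gt; &amp; text"))  # <tag> & text
print(saxutils.unescape("_x0007_"))                 # unchanged: _x0007_
# the optional entities mapping would need one literal entry per
# possible _xHHHH_ sequence, so it cannot handle arbitrary escapes
```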

abafei
John Machin