ansaurus

Question

From escaped html -> to regular html? - Python

Answer 1

+5 A:

I think you want xml.sax.saxutils.unescape from the Python standard library.

E.g.:

>>> from xml.sax import saxutils as su
>>> s = '&lt;foo&gt;bar&lt;/foo&gt;'
>>> su.unescape(s)
'<foo>bar</foo>'

Alex Martelli 2010-03-19 04:31:55

@Alex thank you!! this was nice and simple! :D

RadiantHex 2010-03-19 18:10:20

Answer 2

+1 A:

You could try the urllib module?

It has a method unquote() that might suit your needs.

Edit: on second thought, (and more reading of your question) you might just want to just use string.replace()

Like so:

string.replace('&lt;','<')
string.replace('&gt;','>')

George Edison 2010-03-19 04:33:33

Why would you bother with coding the different replace steps (for lt, gt, amp) when the saxutils.unescape wraps them all up for you?-) Plus, remember: the replace call doesn't alter the string, it builds a new string. Your code snippet, as given, is a slow no-op!-)

Alex Martelli 2010-03-19 20:29:46

ansaurus

tags:

views:

answers:

From escaped html -> to regular html? - Python

related questions