ansaurus

Question

Twitter double encode entity references?

Answer 1

A:

It looks like it's taking the HTML code, and sticking that inside of an XML field, so when you use your XML parser on the XML, you get valid HTML.

FryGuy 2009-06-24 00:29:39

Andrew Medico 2009-06-24 00:31:07

Justin Niessner 2009-06-24 01:01:43

Answer 2

+1 A:

It's double coded because the text property is quasi HTML Encoded text (looks like they're only encoding < and > so that you don't start/end a new html element in your tweet). Therefore, before the XML parses it for communication across the wire, you'd have:

xml entity ref test &lt; & '

That string then gets encoded again (so that when it is decoded, it is still the proper HTML Encoded text) which turns it in to the:

xml entity ref test &amp;lt; &amp; '

That you are getting back.

Justin Niessner 2009-06-24 00:31:32

ansaurus

tags:

views:

answers:

Twitter double encode entity references?

related questions