Hi,
Long version:
Those familiar to the standardization nightmare of the RSS-family, may know that RSS does not provide you with information if for example the "description" element contains just plain text or html or xhtml.
I currently use the ROME-API to convert from various RSS versions to Atom 1.0. The Rome-API will happily parse the RSS and later output an Atom feed. Atom fortunately has a means to declare a summary to contain text, html or xhtml.
Example. RSS:
<item>
<link>http://www.schwarzwaelder-bote.de/wm?catId=79039&amp;artId=14737088&amp;rss=true</link>
<title>Analyse: Winter reißt Löcher in Straßen und Kassen</title>
<description><img src="http://www.schwarzwaelder-bote.de/cms_images/swol/dpa-InfoLine_rs-images/20100306/1192a_24128948.thumbnail.jpg" alt="Schlagloch" title="" border="0">&nbsp;& ;nbsp;&nbsp;Berlin (dpa) - Von Schnee und Eis befreit sind Deutschlands Straßen, und jetzt geht es ans große Aufräumen....</description>
</item>
becomes: ATOM:
<entry>
<title>Analyse: Winter reißt Löcher in Straßen und Kassen</title>
<link rel="alternate" href="http://www.schwarzwaelder-bote.de/wm?catId=79039&amp;artId=14737088&amp;rss=true" />
<author>
<name />
</author>
<id>http://www.schwarzwaelder-bote.de/wm?catId=79039&amp;artId=14737088&amp;rss=true</id>
<summary type="text"><img src="http://www.schwarzwaelder-bote.de/cms_images/swol/dpa-InfoLine_rs-images/20100306/1192a_24128948.thumbnail.jpg" alt="Schlagloch" title="" border="0">&nbs p;&nbsp;&nbsp;Berlin (dpa) - Von Schnee und Eis befreit sind Deutschlands Straßen, und jetzt geht es ans große Aufräumen....</summary>
</entry>
The problem is type="text"
which tells feed-readers like firefox to render the content of the summary as text --> you get to see all the html-source.
Short version: How do I detect that the content of the description element is (X)HTML so I can set the correct type attribute?