ansaurus

Question

Managed (.net) library with html-tidy like functionality?

Answer 1

+1 A:

Saw your post when starting to post the same question...

Have you found anything?

I noticed that there's a Java port, JTidy (http://jtidy.sourceforge.net/).

And I'm sure you've seen the one that calls out to HtmlTidy (http://schneegans.de/asp.net/tidy/), but I haven't found any that are managed/"modern".

Rob 2010-05-24 03:26:58

I haven't found anything. In the meantime I'm using a few regex-based hacks to "hopefully" turn the HTML into valid XHTML - nothing fancy enough to parse the above example of mine. If the result parses as XHTML (which is usually the case, since most HTML is actually syntactically pretty clean), I use LINQ to XML to extract the elements and attributes that are in a whitelisted, known-safe set and trash the rest. That works well enough for now, particularly since browsers generate pretty parseable markup, so TinyMCE and CKEditor end up sending fairly clean things over the wire.

Eamon Nerbonne 2010-05-24 12:35:19
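
For readers who want to see the shape of the whitelist approach described above, here is a minimal sketch. It is written in Python with the standard library's ElementTree as a stand-in for LINQ to XML, and it is not the code from the comment. The `ALLOWED` table, the `SAFE_URL_PREFIXES` tuple and the `sanitize` function are all hypothetical names chosen for illustration, and the whitelist contents are an assumption. It skips the regex pre-pass entirely and simply gives up when the input does not parse as XML.

```python
import html
import xml.etree.ElementTree as ET

# Hypothetical whitelist: element name -> attributes allowed on that element.
ALLOWED = {
    "p": set(), "b": set(), "i": set(), "em": set(), "strong": set(),
    "ul": set(), "ol": set(), "li": set(), "br": set(),
    "a": {"href"},
}

# Assumption: only these URL schemes are considered safe in href values.
SAFE_URL_PREFIXES = ("http://", "https://", "mailto:")


def _clean(parent):
    """Remove non-whitelisted children and attributes, recursively, in place."""
    prev = None
    for child in list(parent):
        if child.tag not in ALLOWED:
            # Drop the element and its subtree, but keep the text after it.
            tail = child.tail or ""
            if prev is None:
                parent.text = (parent.text or "") + tail
            else:
                prev.tail = (prev.tail or "") + tail
            parent.remove(child)
            continue
        allowed_attrs = ALLOWED[child.tag]
        for name, value in list(child.attrib.items()):
            unsafe_url = name == "href" and not (
                value.strip().lower().startswith(SAFE_URL_PREFIXES)
            )
            if name not in allowed_attrs or unsafe_url:
                del child.attrib[name]
        _clean(child)
        prev = child


def sanitize(fragment):
    """Return the whitelisted subset of an XHTML fragment, or None if it
    cannot be parsed as XML."""
    try:
        # Wrap in a dummy root so fragments with several top-level nodes parse.
        root = ET.fromstring("<div>" + fragment + "</div>")
    except ET.ParseError:
        return None
    _clean(root)
    # Serialize the wrapper's content without the wrapper itself.
    leading_text = html.escape(root.text or "", quote=False)
    return leading_text + "".join(
        ET.tostring(child, encoding="unicode") for child in root
    )


if __name__ == "__main__":
    dirty = ('<p onclick="x()">Hi <script>alert(1)</script>there '
             '<a href="http://example.com" style="color:red">link</a></p>')
    print(sanitize(dirty))
```

Anything outside the whitelist is dropped together with its subtree, which matches "trash the rest"; unwrapping unknown elements while keeping their children would be a different policy. Input that is not well-formed returns None, so the caller can decide whether to reject it or run a repair step first.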

@Eamon: thanks for the info!

Rob 2010-05-24 21:56:00
