ansaurus

Question

Can this be improved? Scrubbing of dangerous html tags.

Answer 1

+1 A:

Yes, I can already see you're missing onmousedown, onmouseup, onchange, onsubmit, etc. This is part of why you should use whitelisting for both tags and attributes. Even if you had a perfect blacklist now (very unlikely), new tags and attributes are added fairly often.

See "Why use a whitelist for HTML sanitizing?"

Matthew Flaschen 2010-06-10 22:56:22

See edit.

chobo2 2010-06-10 23:13:11

Answer 2

+1 A:

That code is dangerous -- you should be whitelisting elements, not blacklisting them.

In other words, make a small list of tags and attributes you want to allow, and don't let any others through.

EDIT: I'm not familiar with HTML Agility Pack, but I see no reason why it wouldn't work for this. Since I don't know the framework, I'll give you pseudo-code for what you need to do.

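// Pseudo-code: SelectAllNodes, Tag, Parent and ReplaceNode are placeholder
// names and may not match actual HTML Agility Pack members.
doc.LoadHtml(html);

// Whitelist: the only tags allowed through.
var validTags = new List<string> { "b", "i", "u", "strong", "em" };

var nodes = doc.DocumentNode.SelectAllNodes();
foreach (HtmlNode node in nodes)
{
    // Replace any tag that is not on the whitelist with just its inner HTML.
    if (!validTags.Contains(node.Tag.ToLower()))
        node.Parent.ReplaceNode(node, node.InnerHtml);
}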

Basically, for each tag, if it's not contained in the whitelist, replace the tag with just its inner HTML. Again, I don't know your framework, so I can't really give you specifics, sorry. Hopefully this gets you started in the right direction.
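For anyone wanting something closer to real code, here is a minimal sketch of the same idea, extended to attributes. It is not tested against the asker's project. It assumes the HtmlAgilityPack NuGet package and, from my reading of that library, the members Descendants(), Name, ParentNode, RemoveChild(node, keepGrandChildren), Attributes and HtmlEntity.DeEntitize; check them against the version you have. The class name WhitelistSanitizer, the Allowed table and the IsSafeUrl helper are made up for illustration.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using HtmlAgilityPack; // assumption: the HtmlAgilityPack package is referenced

public static class WhitelistSanitizer
{
    // Hypothetical whitelist: tag name -> attribute names allowed on that tag.
    private static readonly Dictionary<string, string[]> Allowed =
        new Dictionary<string, string[]>(StringComparer.OrdinalIgnoreCase)
        {
            { "b", new string[0] },
            { "i", new string[0] },
            { "u", new string[0] },
            { "strong", new string[0] },
            { "em", new string[0] },
            { "a", new[] { "href" } },
        };

    public static string Sanitize(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        // Snapshot the descendants first, because the loop mutates the tree.
        foreach (HtmlNode node in doc.DocumentNode.Descendants().ToList())
        {
            if (node.NodeType == HtmlNodeType.Comment)
            {
                node.Remove(); // comments are never needed in user content
                continue;
            }
            if (node.NodeType != HtmlNodeType.Element)
                continue; // plain text is left alone

            string[] allowedAttributes;
            if (!Allowed.TryGetValue(node.Name, out allowedAttributes))
            {
                // Not whitelisted: drop the tag but keep its children in place.
                node.ParentNode.RemoveChild(node, true);
                continue;
            }

            // Whitelisted tag: strip every attribute that is not allowed on it.
            foreach (HtmlAttribute attribute in node.Attributes.ToList())
            {
                bool allowed = allowedAttributes.Contains(
                    attribute.Name, StringComparer.OrdinalIgnoreCase);
                bool isHref = string.Equals(
                    attribute.Name, "href", StringComparison.OrdinalIgnoreCase);
                if (!allowed || (isHref && !IsSafeUrl(attribute.Value)))
                    attribute.Remove();
            }
        }

        // The tree is edited in place, so serializing the root "puts the
        // nodes back together".
        return doc.DocumentNode.OuterHtml;
    }

    // Hypothetical helper: accept only absolute http(s) links, which rejects
    // javascript: URLs (and, as a side effect, relative links).
    private static bool IsSafeUrl(string value)
    {
        string url = HtmlEntity.DeEntitize(value ?? string.Empty).Trim();
        return url.StartsWith("http://", StringComparison.OrdinalIgnoreCase)
            || url.StartsWith("https://", StringComparison.OrdinalIgnoreCase);
    }
}
```

Calling WhitelistSanitizer.Sanitize(userHtml) should return markup containing only the listed tags, with href as the only surviving attribute. One caveat with unwrapping: the text inside a removed script or style element is kept as plain text, so a production sanitizer would probably delete those elements together with their contents instead.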

zildjohn01 2010-06-10 22:56:45

See edit.

chobo2 2010-06-10 23:14:05

Hi. So how about attributes on tags, or classes? What happens if you allow links, for example <a href="">hi</a>; how would that look in the list of valid tags? Also, you get all the nodes, but once you're done checking them, do you put the nodes back together? Right now they are all separate, as returned by SelectAllNodes().

chobo2 2010-06-11 17:57:26

What framework do you use? HTML Agility Pack does not seem to have SelectAllNodes.

chobo2 2010-06-19 01:04:16
