I am currently in a project with php frontend. We're pretty concerned about security, because we'll have quite a lot of users and are an attractive target for hackers. Our users are able to submit html formatted content that is visible to other users later. This is a big problem because we're vulnerable for the whole set of XSS attacks. We're filtering as good as we can , but the variety of attack vectors is pretty big.
So, I'm searching for php based html sanitizing/filtering solutions. Commercial solutions are fine (even preferred). Currently we're using a modified html purifier, but we're not satisfied with the results.
Does anyone know good libraries/tools that are capable of filtering malicious parts of html?
Nice to have is for example html5 awareness, which will become a security nightmare once it's available "in the wild".
Update: We're doing a in depth configuration of htmlpurifier. Looks like the older framework we used before was just not configuring it at all. Now the results look much better.