ansaurus

Question

How do I filter chat messages by normalizing letter forms?

Answer 1

+1 A:

I think your best bet is to use an OCR (optical character recognition) engine. After all, that's precisely what you're after: A best effort to parse the letters into readable A-Z characters. (Remember to print the chat-messages onto an image using the same font as used in your chat-client.)

Two Java-OCR libraries:

aioobe 2010-10-11 09:16:11

Answer 2

A:

The correct solution is not to install idiotic "profanity filters" (which I assume are behind this request). If the community cannot police itself at all in that regard, moderate it manually and ban offenders, or shut it down. Having to wrestle with the Scunthorpe problem will offend your users much more than some swearing kids.

Michael Borgwardt 2010-10-11 09:19:20

Possibly, but it is possible to offend users by filtering, and parents of users by not filtering. In any case the filtering is being done already and this is not really an answer to the question posed. Understanding the shape of letter forms will lead to an understanding of the intent behind the message and ultimately less messages being blocked.

izb 2010-10-11 10:21:26

ansaurus

tags:

views:

answers:

How do I filter chat messages by normalizing letter forms?

related questions