Hi,
Any idea (example) of spam filter implementation in C#? I would like to use a database to store unwanted words, to train my spam filter.
Thanks.
Hi,
Any idea (example) of spam filter implementation in C#? I would like to use a database to store unwanted words, to train my spam filter.
Thanks.
Patterns are more important than words (barring "#FF0000", you can be pretty much 100% confident that anything sent as an HTML mail will "#FF0000" in it is not worth reading). Take a look at http://en.wikipedia.org/wiki/Bayesian_spam_filtering and the references it has for one approach (IIRC, one of the first experiments with technique found that after training the filter, "#FF0000" was the expression most likely to indicate Spam, see I told you so).
You may want to check out Paul Graham spam filtering article You can also have a look at C# implementation of spam filter using Naive Bayes Classifier