spam-filtering

Trained Spam Machine Learning Classifier/Model

I have a list of about 17 million sentences. I need to classify each sentence as spam/ham/unsure. Are there trained models available on the internet to which I could just feed my data as a "test" set, so that the system would classify each sentence as spam/ham? Note: The sentences aren't e-mails. ...

How does the Gmail spam filter work?

Hi, I'm always surprised by the high quality of the Gmail spam filter. Over the last year, it filtered 99.95% of the spam and blocked only one mail by mistake. By comparison, every other mail service I have used makes at least one mistake for every 50 mails. How does Gmail reach this level of quality internally? Is it based on customers feed...

How to create a custom spam database

Hi, I'm working on a project where I need to create a spam database and accept submissions from users. Accepting the submissions is easy, but I'm trying to figure out how to weight them. Let's say the database consists of words, and I get the following submissions: * 137x "banana" * 22x "apple" * 1x "exploding mouse" No...

How does Outlook 2007 decide if it should redirect an e-mail to the Junk E-mail folder?

My Question: How does Outlook 2007 decide if it should redirect an e-mail to the Junk E-mail folder? Background Story: When I send a simple HTML e-mail from my domain mailbox, it's being redirected to the Junk E-mail folder. Here are two sub-questions: Does Outlook 2007 care if the sender's outbound mailbox's reverse PTR is ...

reCAPTCHA or anti-spam filters?

For protecting my website from spam and malicious registrations/posts/etc, is it better to use something like reCAPTCHA or an anti-spam script? I was planning on implementing reCAPTCHA for user registrations on my site, using the CodeIgniter reCAPTCHA library (here). However, the CI forum on this library (here) has a lot of negative vi...

Automatic spam filtering or flagging for Django or Python?

I'm working on a Django-based site that consists mostly of user-generated content: reviews, comments, tweet-like posts, etc. I'm concerned about spam. Are there any spam filters available for Django/Python? If not, what types of algorithms can be used for automatic spam filtering or flagging? On a more general note, does anyone k...

Mail goes to the Junk folder in Hotmail

I am able to send an email successfully to my Gmail account, and the mail goes to the INBOX, but when I send it to Hotmail it ends up in the Junk/Spam folder. I am using System.Net.Mail to send email. ...

What is the best way to filter spam with JavaScript?

I have recently been inspired to write spam filters in JavaScript, Greasemonkey-style, for several websites I use that are prone to spam (especially in comments). In considering how to go about this, I realize I have several options, each with pros and cons. My goal for this question is to expand on this list I have create...