I recently retrained my spam filters (sa-learn) by the simple expedient of pulling down about three weeks worth of mail and hand sorting it. It came out something like this:
Ham: 960 messages ~6 megs
Spam: 4100 messages ~120 megs
Out of the ham there was about 50 I actually kept to read. The rest was mailing list threads I wasn't particularly interested in.
Realize this is *after* my mail is filtered through pobox.com's RBL-based filtering. That knocks out between 5000 to 10,000 messages a month (6700 in the last 30 days).
I always assume I get an uncommonly large amount of spam. My address is over six years old and is posted all over the Internet via mailing lists and Perl documentation. This is why I usually scoff when someone suggests "just use foo.com's built-in filtering" whenever my email toolchain has a hiccup.
Do other folks see this much crap?
PS If you ever need to hand filter a large amount of email, sorting by subject helps a lot.
When will you answer your email? (Score:1)
Clever :-)
Meanwhile, when do you plan on reading (and answering) those 50 messages? (I sent you an email, but I don't know if it got through...)
Re:When will you answer your email? (Score:2)
-Dom
Re:When will you answer your email? (Score:2)
Re:When will you answer your email? (Score:1)
Re: Some spam numbers (Score:2)
Yep. Since January 1st I've got about 12,000 emails in my caughtspam folder. I really need to implement some kind of SMTP-time blocking mechanism.
Re: Some spam numbers. (Score:2)
Re: (Score:1)
I don't think 960 ham messages in three weeks is a common figure. I probably get a fifth of that.
Going by your numbers, though, you have a spam:ham ratio of 10:1, and I'd say that yes, that's pretty common. It's certainly close to what I'm getting.
Re: (Score:2)
Re: (Score:1)
Here's my stats (Score:1)
/home/cjcollier/Maildir/.backup.2004.01/cur: 957
/home/cjcollier/Maildir/.backup.2004.02/cur: 616
/home/cjcollier/Maildir/.backup.2004.03/cur: 489
/home/cjcollier/Maildir/.backup.2004.04/cur: 324
/home/cjcollier/Maildir/.backup.2004.05/cur: 417
/home/cjcollier/Maildir/.backup.2004.06/cur: 563
/home/cjcollier/Maildir/.backup.2004.09/cur: 1872
/home/cjcollier/Maildir/.backup.2004.10/cur: 595
/home/cjcollier/Maildir
200 spam messages a day (Score:2)
I fear that, when I go on holiday for a few weeks, my normal mailbox will just overflow. The sheer bulk of it is just too much for even the multi-megabytes mailbox I have at my disposal.