Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

use Perl Log In

Log In

[ Create a new account ]

davorg (18)

davorg
  dave@dave.org.uk
http://dave.org.uk/
Yahoo! ID: daveorguk (Add User, Send Message)

Hacker, author, trainer

Technorati Profile [technorati.com]

Journal of davorg (18)

Sunday April 25, 2004
05:24 AM

Bayesian

[ #18474 ]

I was wondering why the amount of spam in my inbox had increased drastically over the last month or so. Further investigation shows that for some reason the bayesian filters had stopped working.

So yesterday I spent some time sorting all that out. Overnight I retrained it from large piles of spam and ham and now everything seems to work fine again. It's good to see entries in my spam folder saying things like

pts rule name              description
---- ---------------------- --------------------------------------------------
5.4 BAYES_99               BODY: Bayesian spam probability is 99 to 100%
                            [score: 1.0000]

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • Interesting. I've also noticed a large number of false negatives, and have been wondering whether keyword flooding has been spoiling the data. After you retrained, have your non-spams been scoring differently?