Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • by Matts (1087) on 2003.01.19 13:10 (#16150) Journal
    One thing worth pointing out is that the testing for CRM114 isn't very well done - the figures are merely real time figures for his personal email. As all well and good as that is, the other project's test data was based on split training/validation corpuses in lab conditions. And that produces different results.

    So everyone seems pretty happy with the CRM114 results, most of the other Bayesian projects are getting equivalent results. That plus CRM114's slowness (16 times slower than every other bayesian project) will in my opinion mean that it won't find all that much success.

    PS: Please don't take this as bias because I'm affiliated with spamassassin - I'm all for any solution that works better, but in my personal testing CRM114 wasn't any better and was significantly slower than everything else.