Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

use Perl Log In

Log In

[ Create a new account ]

demerphq (2831)

demerphq
  (email not shown publicly)
http://www.perlm ... l?node_id=108447

Perlmonk. Perl5 Regex Hacker. Telecoms Billing Specialist. Canadian living in Germany.

Journal of demerphq (2831)

Friday March 25, 2005
08:49 AM

Hacking the core....

[ #23840 ]

Hacking the core has got be one of the most interesting things I've done. Not being much of C programmer (im one of those Pascal weenies) its been a non stop rollercoaster ride of learning, head banging and even more learning.

Anyway, I finally was able to achieve a dream of mine and add Trie matching to Perls regex engine. Raphael applied it to 5.9.2 as patch 24044 on March 18, 2005. A moment I doubt I will ever forget. :-)

Since then ive been plugging away at the second phase of my regex plans which is to add Aho-Corasick matching support. I released a patch for it just the other day but sofar it hasnt worked out as well as the plain Trie patch. It seems to have problems building on some folks machines, and seems to add an unnacceptable overhead to some regexes that involve normal Tries and not Aho-Corasick enhanced ones.

I have to say the slowdown is at the point totally inexplicable, as I would expect the code from the second patch to be in fact slightly more efficient. Theres weirdness afoot that i really dont understand.

Anyway, it feels good to have contributed this, especially as it should eventually result in performance improvements in things like SpamAssassin which would probably be a boon to many folks out there. And doing a little bit to fight spam makes me feel good. :-)

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.