Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • 570 stop words!? That's insane. I thought it was odd when some other search software I was using had "furthermore" as a stop word. Aren't stop words supposed to be common enough to be considered noise?

    Other words that MySQL thinks we'll never need to search for include "unfortunately", "willing", "afterwards" (but not "afterward"?), "corresponding", "associated", "known", "second", "unlikely", "better", and "immediate". It seems like a pretty random selection.
    • I think it's quite overdone too, but OTOH, these words don't really tell very much at all about the content of a given bit of text, in the general sense. Were I looking for TorgoX's journal entry in question, the chances of me looking for it by searching for "obviously" are pretty slight. The only journal entry I might think of looking for by that word is this one here. :-D