Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • It seems like the data structure will become dependent on which sentence you start with first, and that you won't be able to find variants of words which come under the control of
    earlier-processed counterparts.

    By the way, where is the NLP community in perl? There seems to be some action around Ted Pedersen's Wordnet similarity modules and Dragomir Radev has text relationship analyzer work in Clair.

    But there doesn't seem to be other ongoing NLP development in perl.

  • It may have some dependence on which text I start with; I may need to look at this in more detail. My assumption is that for a given word row (discounting NULs for the moment) the words will look like this (picking an arbitrary set of words)

    Precedense precedence order ordering preceedence

    So the entire structure is hanging on the first word, "Precedense". I should link both "precedence and "preceedence" to that as a fuzzy match. When I find "order", I will link it to "Precedense" as a variant.

    ...and, ye