Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • It may have some dependence on which text I start with; I may need to look at this in more detail. My assumption is that for a given word row (discounting NULs for the moment) the words will look like this (picking an arbitrary set of words)

    Precedense precedence order ordering preceedence

    So the entire structure is hanging on the first word, "Precedense". I should link both "precedence and "preceedence" to that as a fuzzy match. When I find "order", I will link it to "Precedense" as a variant.

    ...and, yes. The next step is one I left out of my overview. When I find "ordering", before adding it as a variant in its own right, I compare it to each of the existing variants. So I have a structure that looks like:

    Precedense:
       FUZZY_MATCH precedence preceedence
       VARIANT order

    order:
       FUZZY_MATCH ordering

    I think that retains all the information I need, yes?

    Re NLP: I did a double-take before I realized that you meant this [wikipedia.org] and not this [wikipedia.org]. :) I am not aware of a community around it, no.