Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • Recipe? (Score:2, Insightful)

    This would make a nice Cookbook recipe...

    --
    (darren)
    • D'oh, good point. I can't believe I didn't think of that. Thanks, applied. :-)

      --Nat

      • There's another module HTML::TableExtract to parse HTML tables. I have used this, and it is pretty nice. I haven't looked at HTML::TableContentParser, so can't really compare, yet.

        Also, look at WWW::Mechanize, which is really awesome for scraping web content. There is WWW::Mechanize::Shell, for writing quick scripts to this kinda stuff.

        Just some more info for you to chew on while you write that cookbook entry.

        /prakash

        • by gnat (29) on 2003.05.13 16:40 (#20072) Journal
          I spent a long time looking for data with column headings for HTML::TableExtract to work on. I finally found some census data [census.gov], but after half an hour of trying, I couldn't make H::TE grok the nested table headings. I finally gave up and just documented HTML::TableContentParser. Sorry!

          --Nat