Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • Perhaps HTML Tidy [w3.org] can help? The last version I tried (I haven't upgraded in a while) did do basic conversion from presentational markup, FONT tags and friends, into CSS — even though somewhat buggy.

    I really ought to look into the upgrades.

    • Tidy [sourceforge.net] is pretty good at fixing bad HTML, as is libxml2 [xmlsoft.org], however they are no good at at de DreamMangling a page, for that you have to do it by hand. However, that job is now done!

      Both tidylib and libxml2 can be used from within Perl, so they are also great for on the fly HTML cleaning too. If only it was possible to on the fly upgrade IE users to Firefox or Opera.....

      --
      -- "It's not magic, it's work..."