Slash Boxes
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
More | Login | Reply
Loading... please wait.
  • So does this have the same functionality as the on-line W3C validator service (without having to go to the web)? I believe their validator is just a perl script that calls tidy somehow. []
    • Hrmm, i don't think so.

      Check the source page []. No mention of Tidy. Also, it says "OpenSP is the SGML and XML parser used by the service". So i assume it parses the output from that.

    • I believe the W3C Validator does not use tidylib. It uses
      use File::Spec          qw();
      use HTML::Parser   3.25 qw(); # Need 3.25 for $p->ignore_elements.
      .. and some other good stuff. The source [] is available. I would be very interested to know the similarities and differences between tidylib and this validation. [I could go look it up!]
      • The W3C validator requires an installation of OpenSP [], which is a fairly heavyweight requirement.

        I'm not sure quite what tidylib does, but I'm going to give it a play and see what it does. If it's faster than onsgmls, then I'm all for it!

        Your other option for validation is to get libxml2 (in its perl form XML::LibXML) set up. The disadvantage (which it shares with OpenSP) is that it requires you to have all the catalogs for html/xhtml set up correctly. I'm assuming that tidylib has all that sort of st

        • There are CGI tidy interfaces out there, so you can see what tidy reports on. tidy also does cleanup on the HTML, and prettifies it for you, although HTML::Lint doesn't support that.