Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • tidy (Score:2, Informative)

    Tidy [sourceforge.net] has an option to clean up Word HTML [sourceforge.net] which might be handy, especially now there are Perl bindings [rcn.com].
  • The more popular XML is getting, the more it's becoming like HTML. RSS is the most end-user XML application, and validity of generated RSS is so bad reasonable numbers of people seem to have started writing non-XML parsers to read it and accept anything...
    • Well it doesn't help when you have something like XHTML, which is supposed to be a gateway drug to XML somehow, except that people write their XHTML in non-validating editors, and so the vast majority of XHTML out there isn't XHTML at all, and if it's not XML then it really is pointless to bother. Which, is why I support the "XHTML considered harmful" gang.

      If more people would use XSLT then that would improve the situation a lot, since it can only output valid XML (in most situations).

      These people who are
      • These people who are outputting bad RSS ... what tools are they using to create it?

        Probably Perl, or PHP or ASP. And not using tools, just using strings.