Stories, comments, journals, and other submissions on use Perl; are Copyright 1998-2006, their respective owners.
I have been impressed too (Score:3, Interesting)
So impressed, in fact, that I stole a couple of its methods and added them to XML::Twig.
mirod
Re:I have been impressed too (Score:1)
TreeBuilder is great, as is its close relative XML::TreeBuilder [cpan.org]. However, they both use HTML::Entities as found in HTML::Parser [cpan.org], and can't deal with some valid utf-8 HTML entities because of underlying problems with Perl (or so I'm told). When 5.8 goes final, I for one will upgrade just to get the maximum out of these great modules.
Both make good efforts at dealing with dirty HTML and XML, where faster tools die...
Neither tool is perfect, but both are pretty good, and Sean answers his email.
-- "It's not magic, it's work..."