NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.
All the Perl that's Practical to Extract and Report
Stories, comments, journals, and other submissions on use Perl; are Copyright 1998-2006, their respective owners.
Excellent idea (Score:2)
If that's possible, I would be totally happy to include CSS selectors in HTML::TreeBuilder::XPath (and actually even in XML::XPathEngine). I would love the module to auto-detect which query language is used, but I don't think that's possible, as the syntax overlap.
mirod
Re: (Score:1)
Re: (Score:1)
Re: (Score:1)
Looks like CSS::SAC on CPAN is not updated for a long time (the last update is September 2004) and it's not a bad thing to have a separate, pure perl (and independent of any CPAN module) would not be a bad thing, though.
Re: (Score:1)
Re: (Score:1)
Re: (Score:2)
I hadn't looked at this at all, but I see that your HTML::Selector::XPath is indeed most of what's needed. Nice job.
I have to thing about it, but at the very least I will add something in the docs about using HTML::Selector::XPath in order to use CSS selectors on XML/HTML modules.
mirod
amazing (Score:1)
I've been scraping HTML for a while (since sitescooper), and XPath is definitely the right way to do it, I think.