Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • You might want to try XML::Liberal and XML::Feed for your feed parsing needs.
    • XML::Feed is just an API-unification wrapper around XML::RSS and XML::Atom::Feed, innit? I'm using XML::Atom::Syndication::Feed instead, for reasons I can't quite remember just now, but, uh, I'm sure they were good ones at the time.

      XML::Liberal might help, though I assume it has the same issue XML::Twig (which I saw someone recommend as an alternative) has in that it's not RSS-specific, and this was supposed to be a quick and dirty hack to pull title, date, content, and link out of feeds... and I didn't want to have to hunt down all the potential variations of where the content (especially) can be hidden. (XML::Atom::Syndication::Feed already doesn't count summaries as content, which isn't entirely unreasonable, but which does mean one more bit of exception coding in the shoulda-beena-quick-hack, which I haven't gotten around to actually coding, and undoubtedly shouldn't today.) I kind of don't want to throw out entire feeds, but on the other hand I don't want to actually do any *work* to fix them...