Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

use Perl Log In

Log In

[ Create a new account ]

Journal of LTjake (4001)

Thursday July 10, 2003
09:13 AM

XML::RSS ... broken.

[ #13370 ]

After reading Mark's rant on stripping HTML from RSS feeds, i decided to test out my XML::RSS-based aggregator.

Simply adding an <em></em> tag around some text in the <description></description> tag proved interesting. It _ATE_ the text!

This:

<description>I am <em>really not</em> a jerk.</description>

Becomes:

I am a jerk.

when i output it.

Since XML::RSS uses XML::Parser... is it a problem with that? I noticed a callback for Start (called when an XML start tag is found) ... perhaps it's being lost there?

*sigh*

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • XML::RSS is broken in many ways, though some of it's old bugs have been ironed out. You can help to fix it though, see Perl-RSS [sourceforge.net] where new memebers and ideas are always welcome.

    In an ideal world it will be rebuilt from the ground-up, but I don't know when that will happen, I just look after the web site...

    --
    -- "It's not magic, it's work..."
  • It's actually doing the right thing which is not always the same as DWIM.

    Strictly speaking, RSS 0.9x/2.x doesn't allow for anything but a core set of elements. RSS 1.0 should, in theory, via namespaces but it's not something that XML::RSS supports except for a few specific RSS modules. Everything else is ignored.

    The artist formerly known as Prince (or whatever it's being called this week) will, in theory, just work but people are still picking at the details. [1]

    On the other hand, you can wrap your custo
    • My primary concern was having potentially malicious html milling around the description tag -- and, I guess, I don't have to worry about that -- so it's sort of working right. :)