All the Perl that's Practical to Extract and Report

  • Mark Pilgrim wrote an essay [xml.com] about getting the character set right for XML over HTTP. Unfortunately, even though XML makes dealing with character sets a bit more explicit, there are still enough areas of pain to be a bother, particularly when you find out that every character in an XML document is a Unicode code point regardless of the source input encoding, except that some code points are specifically barred or restricted. The restricted ones include the C1 controls U+0080 to U+009F (discouraged in XML 1.0, and allowed in XML 1.1 only as character references), which is what you're looking at. Gah.

    -Dom
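
    For what it's worth, here is a rough Python sketch of how code points in that range tend to show up. It assumes the usual culprit, which the comment above doesn't actually confirm: Windows-1252 bytes decoded as ISO-8859-1. The helper name find_c1_controls and the sample bytes are made up for illustration.

```python
def find_c1_controls(text):
    """Return (index, 'U+XXXX') pairs for code points in U+0080..U+009F."""
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(text)
        if 0x80 <= ord(ch) <= 0x9F
    ]


# Hypothetical sample: curly quotes as Windows-1252 bytes (0x93 and 0x94).
raw = b"\x93quoted\x94"

# Decoded as ISO-8859-1, each byte maps straight to the same code point,
# so the quotes land in the C1 control range.
as_latin1 = raw.decode("iso-8859-1")

# Decoded as Windows-1252, the same bytes become U+201C and U+201D.
as_cp1252 = raw.decode("cp1252")

print(find_c1_controls(as_latin1))  # [(0, 'U+0093'), (7, 'U+0094')]
print(find_c1_controls(as_cp1252))  # []
```

    If the check comes back non-empty, the declared encoding is probably worth a second look before blaming the XML parser.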