NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.
All the Perl that's Practical to Extract and Report
Stories, comments, journals, and other submissions on use Perl; are Copyright 1998-2006, their respective owners.
Preserving Entities (Score:2)
Will bite you in the end. Trust me. I've just come from a project where we've used hack upon hack upon hack upon hack to ensure that entities get preserved in one state or another. But the trouble is that you've effectively got several layers of character encoding. In our case, we ended up with stuff in the database which contained & et al. Well, in some tables we did. In others we had UTF-8. And the search engine saw character references and turned them into latin-1. Sometimes. So you neve
Re:Preserving Entities (Score:2)
We own the data that we are serving up through this web app. So it's fully normalized by the time it's parsed in this pipeline. The problem is more about keeping the entities that are in there from being converted into UTF-8.
Re:Preserving Entities (Score:2)
I'm still curious about the need for character references rather than UTF-8 bytes though. Which browsers were giving you trouble?
-Dom
Re:Preserving Entities (Score:1)
Re:Preserving Entities (Score:2)
Browsers (Score:2)
Where are all these bad browsers?
Re:Browsers (Score:2)
It's been a while since I looked at AxKit, but I think the (XML) output that is serialized to the browser is properly re-escaped. This is the correct behavior according to the XML character model.
For some reason, this system goes to extreme measures to do as little work as possible in each transaction. That includes passing around raw XML text instead of higher level data structures, and making as f
Re:Browsers (Score:2)
Re:Browsers (Score:2)
The tool chain is old enough that there are likely lots of XM
Re:Browsers (Score:2)
-Dom
Re:Browsers (Score:2)
The problem came about because the standard version of expat was munged, and linking in a new module with a clean expat broke one or the other of the dependencies. And expat was munged in the first place explicitly to avoid re-escaping previously escaped entities on output once they had been parsed into unicode characters. (A performance optimization to do as little work as possible, and work at the lowest layer possible, to enable high throu
I would like to have this option (Score:2)
Have you thought about sending a patch back to expat? In some cases I would love to have that option, especially from Perl where XML::Parser sometimes behaves in a really funny way [perlmonks.org]
mirod
Re: Reasons to hate C, #2752 (Score:1)