The initial problem is that the use.perl.org pages declare iso-8859-1
as their charset, so form data also has to be sent as iso-8859-1. Maybe
a browser shouldn't accept any non-latin1 characters when entering or
pasting data into form fields, but at least gecko-based browsers
don't refuse them. To do something with non-latin1 characters,
gecko-based browsers on Unix systems seem to use this heuristic:
* codepoints below 256 are fine
* if a character maps to the 0x80-0x9f range of win1252, then it is
sent as that win1252 byte (try LATIN CAPITAL LETTER S WITH CARON for a test)
* every other codepoint is sent as a numerical HTML entity
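The heuristic above can be sketched roughly as follows. This is only my reading of the observed behaviour, not code taken from gecko; the function name encode_form_value is made up for illustration, and I am assuming that characters covered by the win1252 0x80-0x9f range go out as their single win1252 byte.

```python
def encode_form_value(text: str) -> bytes:
    """Approximate the observed form encoding for an iso-8859-1 page.

    Hypothetical helper, written from the heuristic described above.
    """
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp < 256:
            # Codepoints below 256 are sent unchanged as one byte.
            out.append(cp)
            continue
        try:
            b = ch.encode("cp1252")
        except UnicodeEncodeError:
            b = b""
        if len(b) == 1 and 0x80 <= b[0] <= 0x9F:
            # Assumption: the character is sent as its win1252 byte.
            out += b
        else:
            # Everything else becomes a numerical HTML entity.
            out += f"&#{cp};".encode("ascii")
    return bytes(out)


if __name__ == "__main__":
    # U+0160 is LATIN CAPITAL LETTER S WITH CARON.
    print(encode_form_value("\u0160"))
    print(encode_form_value("\u00e9"))
    print(encode_form_value("\u4e2d"))
```

With this sketch, LATIN CAPITAL LETTER S WITH CARON ends up as the single byte 0x8a, which an iso-8859-1 reader would take for a control character; that is exactly why the heuristic causes trouble on the receiving side.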
About pod2text: no, *pod2text* does not use man, but *perldoc* uses
pod2man by default. The plan was to fix pod2text encoding issues (there
are still some, but they are fixable, in contrast to pod2man) and then
to use something like Pod::Text::Overstrike or Pod::Text::Termcap
instead of Pod::Man.
I have just created and uploaded
Pod-Perldoc-ToTextTermcap-0.00_50.tar.gz to CPAN. Just install it and
set
export PERLDOC=-MPod::Perldoc::ToTextTermcap
or
export PERLDOC=-MPod::Perldoc::ToTextOverstrike
and perldoc will use the new renderer. It looks somewhat different
from man output, but at least bold and underline are rendered (unlike
with stock Pod::Perldoc::ToText).