Last modified: 2010-05-15 14:36:07 UTC
ISO-8859-1 reserves the code range 0x80-0x9F for control codes, which are
forbidden in HTML.
However, it seems that the vast majority of browsers treat ISO-8859-1 as
Windows-1252, which assigns printable characters to most of these code points,
and people do use these characters on ISO-8859-1 wikipedias.
It would be wise to take this into account when converting to UTF-8 (i.e.
convert these bytes to the proper Unicode code points for the printable
characters, rather than to the Unicode code points for control codes, which are
disallowed in HTML).
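
A minimal sketch of the conversion described above, in Python for illustration
only. This is not the converter used for the 1.5 upgrade, and the function name
latin1_as_cp1252_to_utf8 is made up for this example. It assumes the input is
raw bytes labelled ISO-8859-1 and decodes each byte as Windows-1252, falling
back to the ISO-8859-1 mapping for the five bytes that Windows-1252 leaves
unassigned (0x81, 0x8D, 0x8F, 0x90, 0x9D).

```python
def latin1_as_cp1252_to_utf8(data: bytes) -> bytes:
    """Convert bytes labelled ISO-8859-1 to UTF-8, reading 0x80-0x9F as Windows-1252."""
    chars = []
    for byte in data:
        raw = bytes([byte])
        try:
            # Windows-1252 maps most of 0x80-0x9F to printable characters
            # (for example 0x93 -> U+201C, 0x80 -> U+20AC).
            chars.append(raw.decode("cp1252"))
        except UnicodeDecodeError:
            # Bytes unassigned in Windows-1252 keep their ISO-8859-1 meaning
            # (a C1 control code); a real converter might drop or flag them.
            chars.append(raw.decode("latin-1"))
    return "".join(chars).encode("utf-8")


if __name__ == "__main__":
    sample = b"\x93quoted\x94 caf\xe9"
    print(latin1_as_cp1252_to_utf8(sample).decode("utf-8"))
```

Outside 0x80-0x9F the two encodings agree, so ordinary ISO-8859-1 text comes out
the same as it would with a plain ISO-8859-1 to UTF-8 conversion.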
Already taken into account in UTF-8 converter for 1.5 upgrade.