Last modified: 2010-05-15 14:36:07 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T3881, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 1881 - windows-1252
Product: MediaWiki
Classification: Unclassified
General/Unknown (Other open bugs)
PC All
: Normal normal with 1 vote (vote)
: ---
Assigned To: Nobody - You can work on this!
Depends on:
Blocks: 1679
  Show dependency treegraph
Reported: 2005-04-13 00:33 UTC by peter green
Modified: 2010-05-15 14:36 UTC (History)
0 users

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Description peter green 2005-04-13 00:33:22 UTC
iso-8859-1 reserves the code space 0x80-0x9F for control codes that are
forbidden in html

however it seems that the vast majority of browsers treat iso-8859-1 as
windows-1252 which assigns printable charactors to theese code points. and
people do use theese charactors on iso-8859-1 wikipedias.

it would be wise to take this into account when making conversions to utf-8 (ie
convert them to the proper unicode code points for those printable charactors
rather than unicode code points for control codes that are disallowed in html)
Comment 1 Brion Vibber 2005-06-21 21:37:08 UTC
Already taken into account in UTF-8 converter for 1.5 upgrade.

Note You need to log in before you can comment on or make changes to this bug.