Last modified: 2008-05-08 06:31:05 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T15615, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 13615 - update support to Unicode 5.1.0
update support to Unicode 5.1.0
Status: RESOLVED FIXED
Product: MediaWiki
Classification: Unclassified
Internationalization (Other open bugs)
unspecified
All All
: Normal enhancement (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2008-04-05 07:22 UTC by Denis Jacquerye
Modified: 2008-05-08 06:31 UTC (History)
0 users

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Denis Jacquerye 2008-04-05 07:22:42 UTC
Unicode 5.1.0 is out http://www.unicode.org/versions/Unicode5.1.0/
Please update Unicode support to that version.

For example the characters Ɑ <U+2C6D> has been added as the upper case of ɑ <U+0251>, which would be useful for automatic titling.
Comment 1 Brion Vibber 2008-05-08 06:31:05 UTC
r34417: Normalization data files updated to Unicode 5.1.0; passes the automated tests.

Seem to have long since lost the script I originally used to generate the Utf8Case.php mapping file, which appears not to have been updated since 2002 or so.  :) 
Made a new one and moved it into the UtfNormal sub-library.

Note a couple limitations:
* Case mapping (still) uses only the 1:1 simple mappings. Any full or locale-specific mappings are ignored.
* These case mappings are not used anyway when the PHP mbstring extension is available; mbstring's case conversion functions are used instead, with whatever version of Unicode support and whatever complex mapping support they may or may not have.
* The generated Utf8Case.php file is not used directly -- you must also regenerate the serialized version in the 'serialized' directory after updating it to a new Unicode version.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links