Last modified: 2014-02-12 23:39:54 UTC

Wikimedia Bugzilla is closed!

Wikimedia has migrated from Bugzilla to Phabricator. Bug reports should be created and updated in Wikimedia Phabricator instead. Please create an account in Phabricator and add your Bugzilla email address to it.
Wikimedia Bugzilla is read-only. If you try to edit or create any bug report in Bugzilla you will be shown an intentional error message.
In order to access the Phabricator task corresponding to a Bugzilla report, just remove "static-" from its URL.
You could still run searches in Bugzilla or access your list of votes but bug reports will obviously not be up-to-date in Bugzilla.
Bug 45476 - Enable Unicode normalization for Malayalam on Wikimedia Wikis
Enable Unicode normalization for Malayalam on Wikimedia Wikis
Status: NEW
Product: Wikimedia
Classification: Unclassified
Site requests (Other open bugs)
All All
: Normal enhancement with 4 votes (vote)
: ---
Assigned To: Nobody - You can work on this!
: i18n, performance, shell
Depends on:
Blocks: 56295
  Show dependency treegraph
Reported: 2013-02-27 03:51 UTC by praveenp
Modified: 2014-02-12 23:39 UTC (History)
11 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Description praveenp 2013-02-27 03:51:27 UTC
Please see Bug 22371.

Normalization is enabled on All Malayalam Language wikies. But now Malayalam grew outside those wikies (mainly to commons, wiki data etc) and old defacto characters are not supported by many applications including various webkit browsers (Chrome, Chromium), and many mobile applications. Please Enable normalization in All Wikimedia Wikies.

(Probably Arabic is also need this.)
Comment 1 Sam Reed (reedy) 2013-02-28 00:15:07 UTC

And where is the consensus that this should be done?
Comment 2 Andre Klapper 2013-02-28 00:26:16 UTC
Longer version of last comment:
For any configuration change, we require a local consensus. As this request is cross-wiki, this requires discussing the matter on Meta, probably (somebody correct me if this is the wrong place), in order to confirm that this change is wanted by the community.

For more information about how to request these kinds of changes, please see . Thanks!
Comment 3 praveenp 2013-02-28 04:43:09 UTC
Here the consensus for original bug:

But at that time fix was limited to Malayalam Language wikis because of some performance issues (If I remember correctly). And as you can see in Mediawiki, If your wiki language is Malayalam, the normalization is enabled by default. 

And also joiner based combinations are always problematic (Bug 45111).
Comment 4 Santhosh Thottingal 2013-02-28 05:00:02 UTC
Normalization works only when content language is ml, or ar. It does not get triggered based on user interface language. includes/WebRequest.php normalizeUnicode method calls normalize on $wgContLang.
Comment 5 praveenp 2013-02-28 05:10:37 UTC
However, normalization in non-ml/non-ar wikis is also possible. Check
Comment 6 Santhosh Thottingal 2013-02-28 11:18:37 UTC
$wgAllUnicodeFixes = true; is the setting required to get this normalization irrespective of the content language.$wgAllUnicodeFixes

Enabling this means, every normalize call will do Unicode normalization fix for Malayalam and Arabic. Documentation hints performance impact, but I don't know how much.
Comment 9 Siebrand Mazeland 2013-10-29 07:48:49 UTC
Andre, Sam, Ori, what's going on with this issue?

Note You need to log in before you can comment on or make changes to this bug.