Last modified: 2010-01-04 08:30:30 UTC

Wikimedia Bugzilla is closed!

Wikimedia has migrated from Bugzilla to Phabricator. Bug reports should be created and updated in Wikimedia Phabricator instead. Please create an account in Phabricator and add your Bugzilla email address to it.
Wikimedia Bugzilla is read-only. If you try to edit or create any bug report in Bugzilla you will be shown an intentional error message.
In order to access the Phabricator task corresponding to a Bugzilla report, just remove "static-" from its URL.
You could still run searches in Bugzilla or access your list of votes but bug reports will obviously not be up-to-date in Bugzilla.
Bug 11162 - Malayalam language characters don't work well with mediawiki
Malayalam language characters don't work well with mediawiki
Product: MediaWiki
Classification: Unclassified
General/Unknown (Other open bugs)
All All
: Normal normal with 2 votes (vote)
: ---
Assigned To: Nobody - You can work on this!
Depends on:
  Show dependency treegraph
Reported: 2007-09-03 11:02 UTC by とある白い猫
Modified: 2010-01-04 08:30 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---

chillu (Malayalam) (18.03 KB, image/png)
2007-09-05 14:16 UTC, Sadik Khalid
From (208.73 KB, image/jpeg)
2008-03-21 01:51 UTC, とある白い猫

Description とある白い猫 2007-09-03 11:02:23 UTC
I asked the following task (3 username renames - a swap) from the local bcrat:

1. ഉപയോക്താവ്:കമ്പ്യൂട്ടര് -> ഉപയോക്താവ്:കമ്പ്യൂട്ടര്‍ temp
2. ഉപയോക്താവ്:WOPR -> ഉപയോക്താവ്:കമ്പ്യൂട്ടര്‍
3. ഉപയോക്താവ്:കമ്പ്യൂട്ടര്‍ temp -> ഉപയോക്താവ്:WOPR

Step 1 failes with the error: The username "കമ്പ്യൂട്ടര്‍ temp" is invalid

Related local wiki discussion:
Comment 1 Sadik Khalid 2007-09-05 14:16:20 UTC
Created attachment 4079 [details]
chillu (Malayalam)

Malayalam language characters don't work well with mediawiki (chillu problem)
Comment 2 Sadik Khalid 2007-09-05 14:19:08 UTC
Look at the above attachment the last character make problems
Comment 4 Brion Vibber 2008-03-05 00:07:17 UTC
The username in comment #3 has a U+200D Zero-Width Joiner control character at the end. This is in a blacklisted control character range, and is not currently allowed in usernames. It also looks totally incorrect, seeing as how it comes at the end of a name, not really a valid place for one even if it was allowed.

Remove that last character and it should be accepted (confirmed on a local installation.)
Comment 6 Jacob 2008-03-22 18:45:14 UTC
Could you elaborate why is "U+200D Zero-Width Joiner control character at the end" blacklisted? Many Malayalam words need this at the end to represent their meaning. 

I can't see how we can say mediawiki software is fully unicode compliant without this support. So I would very much like to hear about the technical concerns.
Comment 7 Brion Vibber 2008-03-24 18:53:17 UTC
Well, the fact that it's invisible and hard to cut-n-paste makes it a bit tricky to manage. :P

We've generally forbidden most magic invisible chars from usernames for security (spoofing etc) purposes.
Comment 8 とある白い猫 2008-03-26 10:29:54 UTC
Right and thats a good practice on all usernames that are not in Malaylam. In the case of Malaylam that is a different story.

Perhaps a solution is to let bureaucrats rename users to these 'invisible' characters while banning users from creating accounts with such characters. That way vandals won't be able to abuse this and good users would benefit from it.

I believe the validity check (weather a username is valid or not) used by new username creation and username rename is conducted by the same block of code.
Comment 9 Jacob 2008-04-02 17:58:44 UTC
Please hold from making any fixes. This seems is part of a much bigger issue with Malayalam Unicode. I am withdrawing my vote for now.
Comment 10 Tim Starling 2010-01-04 08:30:30 UTC
Fixed in r60599.

Note You need to log in before you can comment on or make changes to this bug.