Last modified: 2010-04-06 19:34:55 UTC

Wikimedia Bugzilla is closed!

Wikimedia has migrated from Bugzilla to Phabricator. Bug reports should be created and updated in Wikimedia Phabricator instead. Please create an account in Phabricator and add your Bugzilla email address to it.
Wikimedia Bugzilla is read-only. If you try to edit or create any bug report in Bugzilla you will be shown an intentional error message.
In order to access the Phabricator task corresponding to a Bugzilla report, just remove "static-" from its URL.
You could still run searches in Bugzilla or access your list of votes but bug reports will obviously not be up-to-date in Bugzilla.
Bug 3696 - Unicode Control Characters should be restricted in title text (RLM, LRM, RLO, LRO, . . .)
Unicode Control Characters should be restricted in title text (RLM, LRM, RLO,...
Status: RESOLVED FIXED
Product: MediaWiki
Classification: Unclassified
General/Unknown (Other open bugs)
unspecified
All All
: High normal with 3 votes (vote)
: ---
Assigned To: Nobody - You can work on this!
:
: 3819 3888 5735 5736 7414 7939 8312 (view as bug list)
Depends on:
Blocks: rtl unicode
  Show dependency treegraph
 
Reported: 2005-10-13 10:30 UTC by Tietew
Modified: 2010-04-06 19:34 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments
screen dump (137.86 KB, image/jpeg)
2005-11-08 18:52 UTC, lɛʁi לערי ריינהארט
Details

Description Tietew 2005-10-13 10:30:58 UTC
Unicode Control Characters such as "RIGHT-TO-LEFT OVERRIDE" (U+202E)
or *all unprintable characters* should be restricted in title text.

A title including control characters breaks RC, history, contributions, etc.
A username including control characters breaks page text after the signature!
And they are hard to be linked.

... in Japanese Wikipedia, a username with U+202E is used as vandalism.
Comment 1 lɛʁi לערי ריינהארט 2005-10-13 11:27:36 UTC
compare with

bug 1524: usernames should use unicode whitelist
Comment 2 Tietew 2005-11-05 13:18:57 UTC
*** Bug 3888 has been marked as a duplicate of this bug. ***
Comment 3 lɛʁi לערי ריינהארט 2005-11-08 18:52:21 UTC
Created attachment 1052 [details]
screen dump

added new screen dump:

This gets a real mess. This happened because 
http://yi.wikipedia.org/wiki/%E2%80%AEtest was edited:
http://yi.wikipedia.org/w/index.php?title=%E2%80%AEtest&action=history
Comment 4 lɛʁi לערי ריינהארט 2005-12-11 17:57:13 UTC
Hallo!

Last week I found
http://uncyclopedia.org/wiki/User:%C2%AD%C2%AD%C2%AD%C2%AD
containing more "Unicode Character SOFT HYPHEN - U+00AD"'s.

http://yi.wiktionary.org/wiki/category:bugzilla/02042 contains more tests. Some
of them relating to Unicode whitespace characters (see bug 02042).

regards reinhardt [[user:gangleri]]
Comment 5 Brion Vibber 2006-04-27 20:09:10 UTC
*** Bug 5736 has been marked as a duplicate of this bug. ***
Comment 6 Brion Vibber 2006-04-27 20:09:25 UTC
*** Bug 5735 has been marked as a duplicate of this bug. ***
Comment 7 Pablo Saratxaga 2006-05-28 21:30:17 UTC
final patch on bug #6100 has fixes for that problem in recentchanges
Comment 8 Brion Vibber 2006-11-16 15:35:14 UTC
*** Bug 7939 has been marked as a duplicate of this bug. ***
Comment 9 Brion Vibber 2006-12-04 19:14:44 UTC
*** Bug 7414 has been marked as a duplicate of this bug. ***
Comment 10 Brion Vibber 2006-12-18 23:22:39 UTC
*** Bug 8312 has been marked as a duplicate of this bug. ***
Comment 11 Brion Vibber 2006-12-23 00:44:58 UTC
As of r18513, the LRM and RLM marks are stripped from titles on normalization.
This will avoid creation of broken links and broken titles from cut-n-paste
from the list pages where sometimes those marks creep in.

Running cleanup on live wikis for titles where this has crept in.
Such pages can be found with prefix search on 'Broken/'.
Comment 12 lɛʁi לערי ריינהארט 2008-03-13 06:13:44 UTC
https://bugzilla.wikimedia.org/show_bug.cgi?id=7414#c6

should have been posted here

----

the actual report is part of a more general one:
bug 4185 feature request: provide a notification for irregular Unicode characters
Comment 13 lɛʁi לערי ריינהארט 2008-03-13 06:15:38 UTC
*** Bug 3819 has been marked as a duplicate of this bug. ***
Comment 14 Ilmari Karonen 2010-04-06 19:34:55 UTC
I think r44000 fixed this back in 2008.  Closing.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links