Last modified: 2010-04-06 19:34:55 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T5696, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 3696 - Unicode Control Characters should be restricted in title text (RLM, LRM, RLO, LRO, . . .)
Unicode Control Characters should be restricted in title text (RLM, LRM, RLO,...
Status: RESOLVED FIXED
Product: MediaWiki
Classification: Unclassified
General/Unknown (Other open bugs)
unspecified
All All
: High normal with 3 votes (vote)
: ---
Assigned To: Nobody - You can work on this!
:
: 3819 3888 5735 5736 7414 7939 8312 (view as bug list)
Depends on:
Blocks: rtl unicode
  Show dependency treegraph
 
Reported: 2005-10-13 10:30 UTC by Tietew
Modified: 2010-04-06 19:34 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments
screen dump (137.86 KB, image/jpeg)
2005-11-08 18:52 UTC, lɛʁi לערי ריינהארט
Details

Description Tietew 2005-10-13 10:30:58 UTC
Unicode Control Characters such as "RIGHT-TO-LEFT OVERRIDE" (U+202E)
or *all unprintable characters* should be restricted in title text.

A title including control characters breaks RC, history, contributions, etc.
A username including control characters breaks page text after the signature!
And they are hard to be linked.

... in Japanese Wikipedia, a username with U+202E is used as vandalism.
Comment 1 lɛʁi לערי ריינהארט 2005-10-13 11:27:36 UTC
compare with

bug 1524: usernames should use unicode whitelist
Comment 2 Tietew 2005-11-05 13:18:57 UTC
*** Bug 3888 has been marked as a duplicate of this bug. ***
Comment 3 lɛʁi לערי ריינהארט 2005-11-08 18:52:21 UTC
Created attachment 1052 [details]
screen dump

added new screen dump:

This gets a real mess. This happened because 
http://yi.wikipedia.org/wiki/%E2%80%AEtest was edited:
http://yi.wikipedia.org/w/index.php?title=%E2%80%AEtest&action=history
Comment 4 lɛʁi לערי ריינהארט 2005-12-11 17:57:13 UTC
Hallo!

Last week I found
http://uncyclopedia.org/wiki/User:%C2%AD%C2%AD%C2%AD%C2%AD
containing more "Unicode Character SOFT HYPHEN - U+00AD"'s.

http://yi.wiktionary.org/wiki/category:bugzilla/02042 contains more tests. Some
of them relating to Unicode whitespace characters (see bug 02042).

regards reinhardt [[user:gangleri]]
Comment 5 Brion Vibber 2006-04-27 20:09:10 UTC
*** Bug 5736 has been marked as a duplicate of this bug. ***
Comment 6 Brion Vibber 2006-04-27 20:09:25 UTC
*** Bug 5735 has been marked as a duplicate of this bug. ***
Comment 7 Pablo Saratxaga 2006-05-28 21:30:17 UTC
final patch on bug #6100 has fixes for that problem in recentchanges
Comment 8 Brion Vibber 2006-11-16 15:35:14 UTC
*** Bug 7939 has been marked as a duplicate of this bug. ***
Comment 9 Brion Vibber 2006-12-04 19:14:44 UTC
*** Bug 7414 has been marked as a duplicate of this bug. ***
Comment 10 Brion Vibber 2006-12-18 23:22:39 UTC
*** Bug 8312 has been marked as a duplicate of this bug. ***
Comment 11 Brion Vibber 2006-12-23 00:44:58 UTC
As of r18513, the LRM and RLM marks are stripped from titles on normalization.
This will avoid creation of broken links and broken titles from cut-n-paste
from the list pages where sometimes those marks creep in.

Running cleanup on live wikis for titles where this has crept in.
Such pages can be found with prefix search on 'Broken/'.
Comment 12 lɛʁi לערי ריינהארט 2008-03-13 06:13:44 UTC
https://bugzilla.wikimedia.org/show_bug.cgi?id=7414#c6

should have been posted here

----

the actual report is part of a more general one:
bug 4185 feature request: provide a notification for irregular Unicode characters
Comment 13 lɛʁi לערי ריינהארט 2008-03-13 06:15:38 UTC
*** Bug 3819 has been marked as a duplicate of this bug. ***
Comment 14 Ilmari Karonen 2010-04-06 19:34:55 UTC
I think r44000 fixed this back in 2008.  Closing.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links