Last modified: 2013-04-08 22:46:40 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T29913, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 27913 - URI schemes should be normalized to lowercase
URI schemes should be normalized to lowercase
Status: RESOLVED WORKSFORME
Product: MediaWiki
Classification: Unclassified
Parser (Other open bugs)
unspecified
All All
: Normal normal (vote)
: ---
Assigned To: Nobody - You can work on this!
aklapper-moreinfo
: need-unittest
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2011-03-07 10:43 UTC by Dirk Beetstra
Modified: 2013-04-08 22:46 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Dirk Beetstra 2011-03-07 10:43:52 UTC
At the moment, '[http://www.example.com]' is recognised as an external link by the wikimedia software, while '[HTTP://www.example.com]' is not, while my browser recognises both (even '[HtTp://www.example.com]').  I think that the protocol-recognition should be caseINsensitive.

Note: I noticed this, since the linkwatcher (LiWa3, see e.g freenode #wikipedia-en-spam) that I am operating does recognise uppercase HTTP as a protocol, and found an edit (http://en.wikipedia.org/?diff=417579535) where a blacklisted url (see http://meta.wikimedia.org/wiki/Spam-blacklist) was added to a page (but it is not a 'clickable link').  In fact, changing the capitalisation to lowercase disallows the edit.  Therefore, this should be propagated into the blacklist code as well, if necessary.
Comment 1 Antoine "hashar" Musso (WMF) 2011-03-07 15:06:41 UTC
RFC 3986 [1] states that schemes are case-insensitive but canonical form is lowercase.  The parser should normalize the scheme part to use lower case. Depending on Spam Blacklist hook point, it might or might not need an adaptation to take care of upper case schemes.

[1] http://tools.ietf.org/html/rfc3986#section-3.1
Comment 2 Andre Klapper 2013-04-08 12:12:24 UTC
(In reply to comment #0)
> At the moment, '[http://www.example.com]' is recognised as an external link
> by the wikimedia software, while '[HTTP://www.example.com]' is not

Cannot reproduce on test2 with 1.22wmf1 - both seem to render as external links (likely also because [ is used insteadof [[ ?).
How can I see this problem?
Comment 3 Dirk Beetstra 2013-04-08 12:22:16 UTC
Hmm, it seems that it has changed somewhere automagically then.  This was a problem 2 years ago ...

By the way, the single brackets are necessary for an external link, double brackets are for wikilinks.

I am changing the status of the bug .. seems 'resolved'
Comment 4 Antoine "hashar" Musso (WMF) 2013-04-08 17:32:02 UTC
bug 34939 was a duplicate of this bug and got fixed with https://gerrit.wikimedia.org/r/#/c/15224/

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links