Last modified: 2014-09-23 19:47:27 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T30519, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 28519 - idea: use the Interlanguage extension as a universal redirect center
idea: use the Interlanguage extension as a universal redirect center
Status: NEW
Product: MediaWiki extensions
Classification: Unclassified
Interlanguage (Other open bugs)
unspecified
All All
: Normal enhancement with 1 vote (vote)
: ---
Assigned To: Nobody - You can work on this!
: i18n
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2011-04-12 21:28 UTC by Amir E. Aharoni
Modified: 2014-09-23 19:47 UTC (History)
10 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Amir E. Aharoni 2011-04-12 21:28:42 UTC
MediaWiki has a very good feature: it gives pages readable URLs that match the page's titles. Unfortunately it doesn't work well with titles that include characters that need to be URL-encoded. In Latin-script wikis this is a relatively small problem, but in wikis which are not Latin, such as Russian, Arabic and many others, this turns nearly every URL into a long string of percent signs and character numbers.

Some Wikipedias try to cope with it. In the Malayalam Wikipedia is is common to create redirects in ASCII, but every such redirect must be created manually, which is a bit tedious. There's also the Shortify tool ( http://defn.me/s/ ), which makes the URL short, but unreadable.

Maybe the Interlanguage extension can be modified so that typing something like
http://mul.wikipedia.org/wiki/Barack_Obama:ar will redirect to http://ar.wikipedia.org/wiki/باراك_أوباما. It wont make the name very short, but it will certainly be better than http://ar.wikipedia.org/wiki/%D8%A8%D8%A7%D8%B1%D8%A7%D9%83_%D8%A3%D9%88%D8%A8%D8%A7%D9%85%D8%A7 for sending by email or instant messaging.

("mul" is the ISO 639 code for "multiple languages". There was no decision to give the future Interlanguage wiki this name; i just made it up. The domain can have any other name. Another proposal is to call it data.wikimedia.org. It's not very important)
Comment 1 Bawolff (Brian Wolff) 2011-04-13 03:53:13 UTC
To ask the stupid question, why not send http://ar.wikipedia.org/wiki/باراك_أوباما in an email. 99% of modern programs will convert that to the correct percent encoding.

This personally seems kind of icky to me. If we did have the ability to resolve interlanguage names like this, and wanted to do something like this, why wouldn't we just make http://ar.wikipedia.org/wiki/<magic name in some other lang> automatically redirect to the correct page?
Comment 2 Amir E. Aharoni 2011-04-13 06:02:54 UTC
(In reply to comment #1)
> To ask the stupid question, why not send
> http://ar.wikipedia.org/wiki/باراك_أوباما in an email. 99% of modern programs
> will convert that to the correct percent encoding.

It will correct it to the right percent encoding, but it will be very long. Just look at my first comment. And that's a relatively short name; it could be much longer, for example, like http://ar.wikipedia.org/wiki/%D8%A7%D9%84%D8%A3%D9%85%D9%8A%D8%B1%D8%A9_%D9%85%D8%A7%D8%B1%D8%AC%D8%B1%D9%8A%D8%AA_%D9%83%D9%88%D9%86%D8%AA%D9%8A%D8%B3%D8%A9_%D8%B3%D9%86%D9%88%D8%AF%D9%88%D9%86 . It usually goes through, but it's terribly ugly. And in an IM program, like, GMail chat, sending such a link fills the whole screen with percent signs and digits.

> This personally seems kind of icky to me. If we did have the ability to resolve
> interlanguage names like this, and wanted to do something like this, why
> wouldn't we just make http://ar.wikipedia.org/wiki/<magic name in some other
> lang> automatically redirect to the correct page?

Also possible.
Comment 3 Nikola Smolenski 2011-04-13 07:08:43 UTC
If I understand what you want correctly, you would like all wikis to have links in Latin alphabet, not necessarily short, but still shorter than the current ones. It is possible (srwiki already partly works that way and it could be improved), but I don't think that the Interlanguage Extension could help. For starters, there are no guarantees that Barack Obama on mul would always continue to point to باراك_أوباما on ar. Then, this may work for names, but articles about animals, or scientific terms, or whatever is not similar in most languages will be very dissimilar in English and the target language, thus negating the point of the Latin link.
Comment 4 Amir E. Aharoni 2011-04-13 07:19:45 UTC
The goal is not so much to make the URL short as to make it readable, assuming that anyone that is able to read the string "wikipedia.org" is able to read the string "Barack_Obama". A lot of people are able to read both "Barack_Obama" and "باراك_أوباما"; some people are able to read only one of these strings; but i'm quite sure that "%D8%A8%D8%A7%D8%B1%D8%A7%D9%83_%D8%A3%D9%88%D8%A8%D8%A7%D9%85%D8%A7" is equally gibberish to everyone.

It doesn't matter what the article is about and what the final name in the target language is. An Interlanguage wiki page is supposed to be a hub that points to the different names of the same thing in different languages, whether it's people, building materials, animals or towns. The name in the target language may change, but then the Interlanguage wiki will probably be updated.

What i suggest is that the Interlanguage extension will simply redirect the reader to the article that is currently listed at the same page as the target article in the requested language. If there's no article about this in that language yet, then the extension can offer the reader to create an article in his language.
Comment 5 Krinkle 2011-04-13 14:55:16 UTC
(In reply to comment #2)
> (In reply to comment #1)
> > To ask the stupid question, why not send
> > http://ar.wikipedia.org/wiki/باراك_أوباما in an email. 99% of modern programs
> > will convert that to the correct percent encoding.
> 
> It will correct it to the right percent encoding, but it will be very long.
> Just look at my first comment. And that's a relatively short name; it could be
> much longer, for example, like
> http://ar.wikipedia.org/wiki/%D8%A7%D9%84%D8%A3%D9%85%D9%8A%D8%B1%D8%A9_%D9%85%D8%A7%D8%B1%D8%AC%D8%B1%D9%8A%D8%AA_%D9%83%D9%88%D9%86%D8%AA%D9%8A%D8%B3%D8%A9_%D8%B3%D9%86%D9%88%D8%AF%D9%88%D9%86
> . It usually goes through, but it's terribly ugly. And in an IM program, like,
> GMail chat, sending such a link fills the whole screen with percent signs and
> digits.

I'm not sure this is correct.

All of these can redirect to the "ugly encoded" version (depending on your browser) [1]:

* http://ar.wikipedia.org/wiki/باراك_أوباما 
* http://mul.wikipedia.org/wiki/Barack_Obama:ar
* http://ar.wikipedia.org/wiki/Barack_Obama

So the ugly-ness factor of native-title urls seems irrelevant. The point is the url that you're sharing communication to others, which neither the second or third one is what will be in your browser addressbar.


--
Krinkle


[1]: For the record, accessing http://ar.wikipedia.org/wiki/باراك_أوباما  (not a redirect) in Safari 5 or Firefox 4 does *not* redirect to a url encoded version, neither does MediaWiki link to urlencoded versions from inline links. So I could copy/paste http://ar.wikipedia.org/wiki/باراك_أوباما  right from the browser to here.
Comment 6 Amir E. Aharoni 2011-04-13 21:10:30 UTC
(In reply to comment #5)
> (In reply to comment #2)
> * http://ar.wikipedia.org/wiki/باراك_أوباما 
> * http://mul.wikipedia.org/wiki/Barack_Obama:ar
> * http://ar.wikipedia.org/wiki/Barack_Obama
> 
> So the ugly-ness factor of native-title urls seems irrelevant. The point is the
> url that you're sharing communication to others, which neither the second or
> third one is what will be in your browser addressbar.

The link to http://mul.wikipedia.org/wiki/Barack_Obama:ar can appear somewhere on the page.

http://ar.wikipedia.org/wiki/%D8%A8%D8%A7%D8%B1%D8%A7%D9%83_%D8%A3%D9%88%D8%A8%D8%A7%D9%85%D8%A7

> [1]: For the record, accessing http://ar.wikipedia.org/wiki/باراك_أوباما  (not
> a redirect) in Safari 5 or Firefox 4 does *not* redirect to a url encoded
> version, neither does MediaWiki link to urlencoded versions from inline links.
> So I could copy/paste http://ar.wikipedia.org/wiki/باراك_أوباما  right from the
> browser to here.

Firefox 4 on XP shows it nicely in the address bar, but URD-encodes it when it's copied and pasted...
Comment 7 Nemo 2012-08-26 21:43:41 UTC
Doesn't shorturl extension already fix this issue?

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links