Last modified: 2011-04-30 01:16:45 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T19742, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 17742 - zh-tw page names use many synonyms for URLs, e.g., <a href=BIG>LARGE</a>
zh-tw page names use many synonyms for URLs, e.g., <a href=BIG>LARGE</a>
Status: RESOLVED FIXED
Product: MediaWiki
Classification: Unclassified
Internationalization (Other open bugs)
1.14.x
All All
: Normal normal (vote)
: ---
Assigned To: Philip Tzou
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2009-03-01 19:05 UTC by Dan Jacobson
Modified: 2011-04-30 01:16 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Dan Jacobson 2009-03-01 19:05:01 UTC
For my zh-tw wiki, I notice many places where the URL is different
than the page name, e.g. the URL is 最近更改 but the pagename is 近期變動.

It is so frustrating to have one called e.g., "big", and the other e.g.,
"large", when they should be called the same name.

$ GET http://radioscanningtw.jidanni.org/index.php?title=首頁 |
perl -MURI::Escape -nwle 'next unless /變動/; s/\s+/ /g;print uri_unescape($_);'

<li id="n-recentchanges"><a href="/index.php?title=特殊:最近更改"
title="列出該網站中的最近修改 [r]" accesskey="r">近期變動</a></li>

What a mess! Why can't they be unified?

$ egrep -n "'(最近更改|近期變動)'" MessagesZh_tw.php MessagesZh_hant.php
MessagesZh_tw.php:729:'prefs-rc'                 => '近期變動',
MessagesZh_tw.php:795:'recentchanges'                     => '近期變動',
MessagesZh_hant.php:85:	'Recentchanges'             => array( '最近更改' ),
MessagesZh_hant.php:1102:'prefs-rc'                  => '最近更改',
MessagesZh_hant.php:1298:'recentchanges'                     => '最近更改',

This is just the tip of the iceberg. There are many cases like this.

Another example:
$ GET http://radioscanningtw.jidanni.org/index.php?title=特殊:特殊頁面 |
 perl -MURI::Escape -nwle 'next unless /最舊/; s/\s+/ /g;print uri_unescape($_);'
</ul></td><td width='10%'></td><td width='30%' valign='top'><ul><li><a
 href="/index.php?title=特殊:最早頁面" title="特殊:最早頁面">最舊頁面</a></li> 

Here the two strings, 最早頁面 vs. 最舊頁面, we see are coming from:

$ egrep -n "最(早|舊)頁" MessagesZh_tw.php MessagesZh_hant.php
MessagesZh_tw.php:1057:'ancientpages'            => '最舊頁面',
MessagesZh_hant.php:115:	'Ancientpages'              => array( '最早頁面' ),
MessagesZh_hant.php:1636:'ancientpages'            => '最舊頁面',

Anyway, for all the above cases, if you must pick one choice to unify
upon, it should be the URL one, as many people have hardwired URLs
into their wikitext, e.g., and as often is the case looking at
http://taizhongbus.jidanni.org/index.php?title=特殊:最近更改
we have to keep changing links every time a new version of mediawiki
comes out.

P.S., the above is all seen even after installing
http://www.mediawiki.org/wiki/Special:Code/MediaWiki/47908
which thank you very much for.
Comment 1 Niklas Laxström 2009-03-02 08:06:33 UTC
Few comments:
1) your bug reports are difficult to understand
2) your attitude feels very angry. Neutral reports are best, because they don't call emotions to mess up the real thing.
3) this is yet again the same issue caused by r47908, to which Tim Starling already suggested a solution: translate them to zh_tw.
4) the links will not stop working, don't worry too much. On the other hand, it would be completely wrong to show wrong language to user just because it is wrong in the url
5) you are giving too much weight to what happens in the browsers url bar
Comment 2 Dan Jacobson 2009-03-05 09:57:00 UTC
OK, here's the whole list.

As you can see, many of the pairs are maddeningly close!
Some even differ by e.g, 連 vs. 鏈 !

I could step forward and volunteer to submit a patch, changing many of
the items in column 3 ("RENDERED") to become the same as column 2
("URL").

(One might instead say why not do the opposite, make column 2 a copy of
column 3. However doing that will break many people's links,
bookmarks, etc. -- a disaster.)

However I am not sure if you really want me to send such a diff!
You might say "well isn't that rolling back the refinements we've made
in those names over the years, just to match their earliest forms,
still to be found in the URLs, (which we don't want to tamper with, as
it breaks links, bookmarks, etc.)"

Well, OK, we can live with the schizophrenia, but at least you must
admit some of the pairs are maddeningly close. Making users think
MediaWiki is running a sloppy ship.

Note, the same problem probably exists for MessagesZh_hans.php, here I
only examine MessagesZh_hant.php.


               VARIABLE=>          URL|RENDERED
              Userlogin=>    用戶登入|登入/建立新賬號
             Userlogout=>    用戶登出|登出
          CreateAccount=>    創建帳戶|建立新賬號
    Recentchangeslinked=>    連出更改|鏈出更改
              Newimages=>    新建檔案|新建圖像畫廊
        Listgrouprights=>    群組權限|用戶群組權限
             Statistics=>    統計信息|統計
     Uncategorizedpages=>  未歸類頁面|待分類頁面
Uncategorizedcategories=>  未歸類分類|待分類類別
    Uncategorizedimages=>  未歸類檔案|待分類檔案
 Uncategorizedtemplates=>  未歸類模板|待分類模版
       Unusedcategories=>  未使用分類|未使用的分類
           Unusedimages=>  未使用檔案|未使用圖像
       Wantedcategories=>    待撰分類|需要的分類
    Mostlinkedtemplates=>最多連結模板|最多連結模版
        Fewestrevisions=>最少修訂頁面|最少修訂的頁面
           Ancientpages=>    最早頁面|最舊頁面
           Deadendpages=>    斷連頁面|斷鏈頁面
        Protectedtitles=>  已保護標題|已保護的標題
            Prefixindex=>    前綴索引|所有頁面之前綴
            Ipblocklist=>    封禁列表|被封IP地址列表
              Emailuser=>    電郵用戶|E-mail該用戶
           Confirmemail=>確認電子郵件|確認郵箱位址
          Whatlinkshere=>    連入頁面|鏈入頁面
               Movepage=>    移動頁面|
                 Export=>    導出頁面|匯出頁面
                Version=>    版本資訊|版本
            Allmessages=>    所有訊息|系統界面
                 Import=>    導入頁面|匯入頁面
             Userrights=>    用戶權限|用戶權限管理
             MIMEsearch=>     MIME搜尋|MIME 搜索
        Unusedtemplates=>  未使用模板|未使用的模板
                 Mypage=>  我的用戶頁|我的頁面
                 Mytalk=>  我的討論頁|我的對話頁
        Mycontributions=>    我的貢獻|
             Listadmins=>  管理員列表|
               Listbots=>  機器人列表|
                 Search=>        搜尋|搜索
              Resetpass=>    修改密碼|更改密碼
           MergeHistory=>    合併歷史|合併頁面歷史
        Invalidateemail=>不可識別的電郵地址|取消電郵確認

I made the above list with:

function printt($a,$b,$c){
  printf("%23s=>%s%s|%s\n",$a,str_pad(NULL, 12-(strlen($b)*2/3), " ",STR_PAD_LEFT),$b,$c);}
$d='/var/lib/mediawiki/languages/messages';
include("$d/MessagesZh_hant.php");
printt("VARIABLE","URL","RENDERED");
foreach($specialPageAliases as $key=>$value){
  $v2=$messages[strtolower($key)];
  if($v2!=$value[0]){
    printt($key,$value[0],$v2);}}
Comment 3 Siebrand Mazeland 2009-03-08 13:13:22 UTC
I suggest you start contributing to the zh-tw localisation on http://translatewiki.net instead of ordering others to fix alleged issues. There is a fall back chain which substitutes non-localised messages. Whenever the message is localised, it will be used instead of the message from the fallback.

There are plenty of possibilities to talk to the currently active zh translators.

Closed as WONTFIX.
Comment 4 Shinjiman 2009-08-21 14:36:25 UTC
The issue mentioned in comment 0 has been fixed on r55412.
For comment 2, that is the text message which is specified to the Taiwanese language, it might be different to the Traditional Chinese language. Thus if the text is going to be changed, one of the way is to retranslate in Translatewiki.net or provide a unified diff as a patch file.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links