Last modified: 2013-10-11 16:47:40 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T57565, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 55565 - $wgCategoryCollation setting makes categories contain characters from wrong alphabets
$wgCategoryCollation setting makes categories contain characters from wrong a...
Status: RESOLVED FIXED
Product: Wikimedia
Classification: Unclassified
General/Unknown (Other open bugs)
unspecified
All All
: Highest major with 6 votes (vote)
: ---
Assigned To: Nobody - You can work on this!
: code-update-regression, shell
Depends on:
Blocks: 29782
  Show dependency treegraph
 
Reported: 2013-10-10 15:48 UTC by Calak
Modified: 2013-10-11 16:47 UTC (History)
17 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Calak 2013-10-10 15:48:10 UTC
Hello. All Wikimedia projects which use wgCategoryCollation setting for collations has been failed. These sites show unrelated header characters on categories.

Examples:

On fa.wiki:
http://fa.wikipedia.org/wiki/رده:استان‌های_ایران

On ckb.wiki
http://ckb.wikipedia.org/wiki/پۆل:ئەلفوبێی_کوردی

On pt.wiki:
http://pt.wikipedia.org/wiki/Categoria:História
Comment 2 Nemo 2013-10-10 18:36:25 UTC
17.55 < Reedy> The only thing that have changed are some DB hosts, and a PHP package upgrade
17.55 < Reedy> (according to SAL)
17.55 < apergos> in the last 3 hours? 
17.55 < Reedy> He said over 3 hours
Comment 3 Bartosz Dziewoński 2013-10-10 18:39:45 UTC
Per bug 46036 comment 1, it looks like someone just needs to run some maintenance scripts on the cluster after package upgrades.
Comment 4 Bawolff (Brian Wolff) 2013-10-10 19:25:08 UTC
(In reply to comment #3)
> Per bug 46036 comment 1, it looks like someone just needs to run some
> maintenance scripts on the cluster after package upgrades.


Specificly updateCollation.php --force after php upgrade unless you make sure to compile against same version of icu library.
Comment 5 Greg Grossmeier 2013-10-10 19:29:15 UTC
Reedy tried this on plwikivoyage, but it doesn't appear to have helped.
Comment 6 Andrey Kiselev 2013-10-10 19:36:49 UTC
Same strange things on 
https://www.mediawiki.org/wiki/Category:Extensions_by_category
Comment 7 Bartosz Dziewoński 2013-10-10 19:57:38 UTC
(This is being actively worked on by ops. Turned out to be not that easy, my understanding is that something got messed up during the package upgrading.)
Comment 8 Bawolff (Brian Wolff) 2013-10-10 22:13:03 UTC
(In reply to comment #5)
> Reedy tried this on plwikivoyage, but it doesn't appear to have helped.

Did he use the --force option?

Also there is a memcached key that should be deleted dbname:first-letters:collation-name or something. (Alternatively bump IcuCollation::FIRST_LETTER_VERSION.)

(We should really be including icu version in that cache key)
Comment 9 Daniel Zahn 2013-10-10 22:30:48 UTC
pasting this here from http://paste.debian.net/plain/55306 because that may time out after 24 hours:

this was from /var/log/apt/history.log on mw1020

Start-Date: 2013-10-10  12:18:23
Commandline: /usr/bin/apt-get -q -y -o DPkg::Options::=--force-confold install php5-mysql
Install: libicu42:amd64 (4.2.1-3ubuntu0.10.04.1, automatic)

Upgrade: libapache2-mod-php5:amd64 (5.3.10-1ubuntu3.6+wmf1, 5.3.10-1ubuntu3.8+wmf1), php5-curl:amd64 (5.3.10-1ubuntu3.6+wmf1, 5.3.10-1ubuntu3.8+wmf1), php5-xmlrpc:amd64 (5.3.10-1ubuntu3.6+wmf1, 5.3.10-1ubuntu3.8+wmf1), php5-intl:amd64 (5.3.10-1ubuntu3.6+wmf1, 5.3.10-1ubuntu3.8+wmf1), php5-mysql:amd64 (5.3.10-1ubuntu3.6+wmf1, 5.3.10-1ubuntu3.8+wmf1), php5-cli:amd64 (5.3.10-1ubuntu3.6+wmf1, 5.3.10-1ubuntu3.8+wmf1), php5-common:amd64 (5.3.10-1ubuntu3.6+wmf1, 5.3.10-1ubuntu3.8+wmf1)
End-Date: 2013-10-10  12:18:35
Comment 10 Faidon Liambotis 2013-10-11 01:28:37 UTC
This was the result of a new version of PHP being rolled out across the fleet that was badly built (built under an unclean environment, having the wrong version of libicu installed). We now have 5.3.10-1ubuntu3.8+wmf2, this should be fixed.
Comment 11 [no longer active user] 2013-10-11 16:06:07 UTC
Still not perfect, https://fa.wikipedia.org/wiki/%D8%B1%D8%AF%D9%87:%D8%B5%D9%88%D8%B1%D8%AA_%D9%81%D9%84%DA%A9%DB%8C_%D8%A8%D8%B1%D9%87 seems all digits type on first of page title is being converted to Arabic digits. We shouldn't see '1' '2' '3' (Arabic Digits) and we should see '۱' '۲' '۳' (Persian Digits) instead. Reproducible on all categories and also on ckbwiki https://ckb.wikipedia.org/w/index.php?title=%D9%BE%DB%86%D9%84:%DA%95%DB%86%DA%98%DB%95%DA%A9%D8%A7%D9%86%DB%8C_%D8%B3%D8%A7%DA%B5&action=edit&redlink=1 that are using Arabic-Indic digits.
Comment 12 Bartosz Dziewoński 2013-10-11 16:13:09 UTC
Are you sure it wasn't like this before?

Can you file a separate bug? It's almost certainly not related (unless the package upgrade changed ICU's behavior for this case).
Comment 13 Calak 2013-10-11 16:25:34 UTC
Yes, we are sure. Please reopen this bug.
Comment 14 Bartosz Dziewoński 2013-10-11 16:47:40 UTC
Filed that as bug 55630.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links