Last modified: 2014-02-12 23:40:00 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T6537, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 4537 - Categories created using Tamil words not recognised in stats
Categories created using Tamil words not recognised in stats
Status: NEW
Product: Analytics
Classification: Unclassified
Wikistats (Other open bugs)
All All
: Low normal
: ---
Assigned To: Nobody - You can work on this!
: analytics
Depends on:
Blocks: 32578
  Show dependency treegraph
Reported: 2006-01-09 05:21 UTC by Sundar
Modified: 2014-02-12 23:40 UTC (History)
9 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---

Modified LanguageCodes.csv to fix category counts (10.62 KB, text/plain)
2006-02-03 09:01 UTC, Sundar

Description Sundar 2006-01-09 05:21:26 UTC - this
page seems to recognise the categories created using [[Category:some category]]
tag and NOT the ones created using [[பகுப்பு:some category]] tag. "பகுப்பு" is the
Tamil word for "category" and it is a valid namespace in Tamil wikipedia.

Comment 1 lɛʁi לערי ריינהארט 2006-01-09 10:21:17 UTC
If the configuration of the underlying tool is wrong this can be a consequence of
== Bug [Bugzilla] 321607:   "'copy and paste cuts off text (Tamil/Hindi scripts)''"

a) copy "பகுப்பு" with apostrophes and remove them after pasting them;
b) use &#nnnn; character encoding in the configuration; you can convert the
characters at and will get


regards reinhardt [[user:gangleri]]
Comment 2 Sundar 2006-01-09 10:26:45 UTC
This doesn't seem to be related to the other problem. Because we always have
"பகுப்பு" enclossed by [[ and : characters. Also, we're able to see that the
categories are created successfully, but only the statistics don't reflect that.
Thanks Reinhardt.
Comment 3 lɛʁi לערי ריינהארט 2006-01-09 11:04:25 UTC
Hi Sundar!

comment 1 referes to the configuration of the underlying tool for

I assume that in that configuration the last character is missing.

regards reinhardt [[user:gangleri]]
Comment 4 Sundar 2006-01-09 11:12:53 UTC
Oh, I got your point now. Thanks.

Comment 5 Sundar 2006-02-03 09:01:14 UTC
Created attachment 1351 [details]
Modified LanguageCodes.csv to fix category counts

Thanks to Reinhart and Natkeeran, I fixed the entry for Tamil in
LanguageCodes.csv as indicated at
Please update the file with the diff
Also, will the default categories marked with the English tag "Category" come
under the stats?

Comment 6 Brion Vibber 2009-03-30 21:43:55 UTC
Another old one that may be obsolete.
Comment 7 John Mark Vandenberg 2010-09-03 01:21:04 UTC
Comment on attachment 1351 [details]
Modified LanguageCodes.csv to fix category counts

The attached file is not a patch.
Comment 8 Erik Zachte 2010-09-09 20:01:57 UTC
Wikistats harvests language specific tags by parsing language specific config files (php). So just supplying an updated tag file won't do it. It will be overwritten. I will hunt bug in php parser code.
Comment 9 Sundar 2010-09-19 04:05:53 UTC
Thanks Erik.
Comment 10 Mark A. Hershberger 2011-05-03 17:38:26 UTC
Lowering priority since this bug has sat around for 2.5 years.
Comment 11 Andre Klapper 2012-11-10 16:08:03 UTC
Categories like
பகுப்பு:ஐக்கிய அமெரிக்கக் குடியரசுத் தலைவர்கள்
exist but are not listed on while other categories like
பகுப்பு:சிங்கள தலைவர்கள்
are. Obviously they both use பகுப்பு: instead of Category:.
Comment 12 Andre Klapper 2012-12-03 13:59:58 UTC
[mass-moving wikistats reports from Wikimedia→Statistics to Analytics→Wikistats to have stats issues under one Bugzilla product (see bug 42088) - sorry for the bugspam!]

Note You need to log in before you can comment on or make changes to this bug.