Last modified: 2014-11-18 18:07:32 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T8928, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 6928 - Double character support in category pages
Status: RESOLVED DUPLICATE of bug 164
Product: MediaWiki
Classification: Unclassified
Component: Categories
Version: unspecified
Hardware: All All
Importance: Normal enhancement with 2 votes
Target Milestone: ---
Assigned To: Nobody - You can work on this!

Reported: 2006-08-05 12:08 UTC by Tisza Gergő
Modified: 2014-11-18 18:07 UTC (History)

Description Tisza Gergő 2006-08-05 12:08:01 UTC
In some languages, certain double characters (digraphs) are treated as a single
letter. For example, in Hungarian, the word "cselló" starts with the double
letter "cs". (For more examples, see
[[Latin_alphabet#Collating_sequence_with_extensions]].) This means that on
huwiki category pages like [[hu:Kategória:Vonós hangszerek]], "cselló" should
not be grouped together with words starting with the letter "c", but should
have its own "cs" section.

This doesn't apply to foreign words (e.g. "CSS" should be put in the "c"
section), and therefore cannot be decided automatically. An easy way to handle
it would be to use a special character in the category sort keys: e.g.
[[Category:XXX|cselló]] would have the same effect as now, but
[[Category:XXX|cs,elló]] would create a section for words starting with "cs" in
the category page, and put the page there.

Another use for this would be a more flexible categorization of numbers; see
[[Category:Stargate_SG-1_episodes]], where "A" is used for the 10th season.
Using the above markup, a "10" section could be created by using
[[Category:Stargate_SG-1_episodes|10,{{PAGENAME}}]].
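
The proposed marker can be sketched roughly as follows. This is only an
illustration of the idea in this report, not MediaWiki code: the function name
`section_heading` and the rule "everything before the first comma becomes the
section heading" are assumptions made for the example.

```python
def section_heading(sort_key: str, marker: str = ",") -> str:
    """Return the section a category page would file this sort key under.

    Hypothetical rule: if the marker occurs in the key, the text before it
    is the heading; otherwise the first character is used, as happens now.
    """
    head, sep, _rest = sort_key.partition(marker)
    if sep and head:
        # "cs,elló" -> "Cs", "10,Example" -> "10"
        return head.capitalize()
    # No marker: fall back to the first character of the key.
    return sort_key[:1].upper()


for key in ["cselló", "cs,elló", "CSS", "10,Example"]:
    print(key, "->", section_heading(key))
```

One weakness of this sketch is that an ordinary sort key which happens to
contain a comma (for example "Smith, John") would be filed under "Smith", so a
real implementation would need a less common marker or an escape rule.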

Comment 1 Aryeh Gregor (not reading bugmail, please e-mail directly) 2006-09-04 15:20:08 UTC

*** This bug has been marked as a duplicate of 164 ***

Comment 2 Tisza Gergő 2006-09-07 00:11:00 UTC
Reopening, this has nothing to do with collation, and - as explained above -
requires additional information beyond the category name to be handled
correctly. It cannot be handled without introducing new markup.

Comment 3 Aryeh Gregor (not reading bugmail, please e-mail directly) 2006-09-07 01:32:37 UTC
It has to do with nothing but collation.  It requires no additional information
beyond a user-provided sort key, which would then be evaluated in a
locale-specific manner.  No new markup need be added.  The kind of collation
support added in bug 164 would allow things like "cs," being interpreted as its
own letter, or some better convention.  Many languages have similar conventions,
many of which you kindly linked to at
[[Latin_alphabet#Collating_sequence_with_extensions]], and that's what bug 164
is about.
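
As a rough illustration of what "evaluated in a locale-specific manner" could
mean, the sketch below splits a sort key into Hungarian letters and ranks them
by their position in the Hungarian alphabet. It is a simplified assumption, not
the behaviour of MediaWiki or of bug 164: the names `HU_ALPHABET` and
`collation_key` are made up for the example, and real Hungarian collation
treats accented vowels more subtly than a flat alphabet position.

```python
# The 44-letter Hungarian alphabet, in dictionary order.
HU_ALPHABET = [
    "a", "á", "b", "c", "cs", "d", "dz", "dzs", "e", "é", "f", "g", "gy",
    "h", "i", "í", "j", "k", "l", "ly", "m", "n", "ny", "o", "ó", "ö", "ő",
    "p", "q", "r", "s", "sz", "t", "ty", "u", "ú", "ü", "ű", "v", "w", "x",
    "y", "z", "zs",
]
RANK = {letter: i for i, letter in enumerate(HU_ALPHABET)}


def collation_key(word: str) -> list[int]:
    """Turn a word into a list of letter ranks, matching the longest letter first."""
    text = word.lower()
    key = []
    i = 0
    while i < len(text):
        for size in (3, 2, 1):
            chunk = text[i:i + size]
            if chunk in RANK:
                key.append(RANK[chunk])
                i += len(chunk)
                break
        else:
            # Unknown character: rank it after every alphabet letter.
            key.append(len(HU_ALPHABET) + ord(text[i]))
            i += 1
    return key


words = ["cukor", "cselló", "cement", "dal"]
print(sorted(words, key=collation_key))
# → ['cement', 'cukor', 'cselló', 'dal']
```

Greedy matching like this would also read the start of "CSS" as the letter
"cs", which is the case the reporter raises; the sort key supplied by the user
would still have to signal which reading is intended.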

(For the time being, I may as well note that if you replace all "cs" with "c{s"
in sort keys, similar to what you suggest as the new markup required, it will
sort in the "c" section but after all pages starting with a normal "c", which is
at least half correct.)
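
The workaround relies on "{" (U+007B) coming directly after "z" (U+007A) in
code-point order. A minimal sketch, assuming sort keys are compared in plain
code-point order (which is how Python compares strings); the helper name
`escape_digraph` is made up for the example:

```python
def escape_digraph(sort_key: str, digraph: str = "cs") -> str:
    """Insert "{" after the first letter of every occurrence of the digraph."""
    return sort_key.replace(digraph, digraph[0] + "{" + digraph[1:])


keys = ["cukor", "cselló", "cement", "dal"]
print(sorted(escape_digraph(k) for k in keys))
# → ['cement', 'cukor', 'c{selló', 'dal']
```

This only places the key after "c" followed by an unaccented ASCII letter. A
key such as "cél" would still sort after "c{s", because "é" (U+00E9) has a
higher code point than "{".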

Comment 4 Tisza Gergő 2006-09-07 09:34:35 UTC
Sorry. I must have misunderstood bug 164 then.

*** This bug has been marked as a duplicate of 164 ***
