Last modified: 2014-04-08 06:09:21 UTC
Hello. Is it possible to change the alphabetical order used in categories for all Kurdish projects (ku.wikipedia, ku.wiktionary, ku.wikiquote and ku.wikibooks)?
===============================================================================
The Kurdish alphabet (Kurmanci) uses Latin letters (31 letters). Alphabetical order:
Aa · Bb · Cc · Çç · Dd · Ee · Êê · Ff · Gg · Hh · Ii · Îî · Jj · Kk · Ll · Mm · Nn · Oo · Pp · Qq · Rr · Ss · Şş · Tt · Uu · Ûû · Vv · Ww · Xx · Yy · Zz
That is the 26 letters of the English alphabet + 5 letters with diacritics (Çç, Êê, Îî, Şş, Ûû).
http://en.wikipedia.org/wiki/Kurdish_alphabets#Hawar_alphabet
===============================================================================
The problem is that the letters with diacritics (Çç, Êê, Îî, Şş, Ûû) do not follow the Kurdish alphabetical order (see above) in the categories: they are placed at the end. For example here (Ç, Î, Û placed at the end): http://ku.wikipedia.org/wiki/Kategor%C3%AE:Dewlet%C3%AAn_Asyay%C3%AA
More generally, can all sorted listings follow the Kurdish alphabetical order? Can we do something?
Sorry if I'm not at the right place, but I do not know who else to ask. Thank you in advance.
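For illustration only, the requested order can be sketched in a few lines of Python. This is not how MediaWiki sorts categories; the sample titles and the names `KURDISH_ALPHABET`, `RANK` and `kurdish_key` are hypothetical, and characters outside the 31-letter alphabet are simply assumed to sort last.

```python
# Kurmanji (Hawar) alphabet in the order given in the report above.
KURDISH_ALPHABET = "abcçdeêfghiîjklmnopqrsştuûvwxyz"

# Map each letter to its position in the alphabet.
RANK = {letter: index for index, letter in enumerate(KURDISH_ALPHABET)}


def kurdish_key(title):
    """Return a sort key ranking each character by its Kurdish alphabet position.

    Assumption: characters that are not in the alphabet all sort last.
    """
    return [RANK.get(ch, len(KURDISH_ALPHABET)) for ch in title.lower()]


# Hypothetical sample titles, chosen only to exercise the accented letters.
titles = ["Zaxo", "Şam", "Sûriye", "Çîn", "Cizîr", "Îran", "Irak", "Dubai"]

# Plain code point order: titles starting with Ç, Î, Ş land after Z.
print(sorted(titles))

# Kurdish order: Ç follows C, Î follows I, Ş follows S.
print(sorted(titles, key=kurdish_key))
```

The first call reproduces the behaviour described in the report (accented letters at the end); the second gives the order being asked for.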
This is the right place. ---- At first glance I didn't see Kurdish on the list of collations supported by ICU that I found on Google, which would be a problem. But maybe I just missed it. Will have to investigate further.
Yup, it seems to be unsupported: trying to use the uca-ku collation gives the same results as uca-default: http://users.v-lo.krakow.pl/~matmarex/testwiki-ku/index.php?title=Kategor%C3%AE:Test (So there are no separate headings for the letters with diacritical marks, and e.g. C and Ç are considered the same letter when sorting. This isn't visible on that page right now, since if there's a conflict, the diacritical version is placed after the default one.) ghybu, does that seem like an improvement over the current state?
No, I don't see any improvement.
I mean, is the category sorting on the testwiki I linked better than the one currently visible on ku.wikipedia? (Also, feel free to create or modify pages there to test the behavior.)
Yes, it is much better, this is what I wanted. But, it's necessary to also have sections (Ç, Ê, Î, Ş, Û); for example to use this type of template: http://ku.wikipedia.org/wiki/%C5%9Eablon:TOC_Kategor%C3%AE
(In reply to comment #5)
> Yes, it is much better, this is what I wanted. But, it's necessary to also have sections (Ç, Ê, Î, Ş, Û); for example to use this type of template:
> http://ku.wikipedia.org/wiki/%C5%9Eablon:TOC_Kategor%C3%AE
In the test wiki linked above, the accents are secondary differences (used as tie breakers), whereas in Kurdish they should be considered different letters. (Look at the section for C in that link, where I just added more examples.) Thus it can't have separate section headers, as they aren't sorted separately. Comment 3 is asking whether, even though the behaviour is wrong, it is more or less wrong than the current behaviour on the ku wikis.
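The distinction between the two behaviours can be sketched as follows. This is a simplified model, not the real Unicode Collation Algorithm: `tiebreak_key` treats diacritics as tie breakers only, while `letter_key` treats each accented letter as a letter of its own. All function names and sample titles here are hypothetical.

```python
import unicodedata
from itertools import groupby

ALPHABET = "abcçdeêfghiîjklmnopqrsştuûvwxyz"
RANK = {ch: i for i, ch in enumerate(ALPHABET)}


def strip_marks(text):
    """Remove combining diacritics, so that 'ç' becomes 'c'."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def tiebreak_key(title):
    """Diacritics as secondary differences: base letters are compared first."""
    lowered = title.lower()
    return (strip_marks(lowered), lowered)


def letter_key(title):
    """Diacritics as primary differences: 'ç' is a separate letter after 'c'."""
    return [RANK.get(ch, len(ALPHABET)) for ch in title.lower()]


def sections(titles, key, heading):
    """Sort titles, then group consecutive titles that share a heading."""
    ordered = sorted(titles, key=key)
    return [(h, list(group)) for h, group in groupby(ordered, key=heading)]


# Hypothetical sample titles.
titles = ["Cem", "Çem", "Cil", "Çar"]

# Test-wiki behaviour: one shared "C" section, accents only break ties.
print(sections(titles, tiebreak_key, lambda t: strip_marks(t[0]).upper()))

# Requested behaviour: separate "C" and "Ç" sections.
print(sections(titles, letter_key, lambda t: t[0].upper()))
```

With the tie-breaker key, "Çar" sorts before "Cem" inside a single C section, which is why no separate Ç heading can exist; with the per-letter key, all C titles come first and the Ç titles get their own section.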
I have also done a test; the behavior is not good. The current version of ku.wiki is better. I also think separate sections are required. Thank you for trying!
A solution was found here: bug 30287 and bug 50311. Can we do the same?
(In reply to comment #8)
> A solution was found here: bug 30287 an bug 50311
>
> Can we do the same?
Not really. fa is in the list of collations supported by CLDR at http://www.unicode.org/repos/cldr/trunk/common/collation/ . ku is not. I think the next step in this bug would be to get CLDR to add ku as a collation ( http://cldr.unicode.org/index/cldr-spec/collation-guidelines ). [Or, I suppose, making PHP's intl bindings to the ICU library suck a little less so we could make our own collation.]
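As a rough idea of what a Kurdish tailoring would have to express, the sketch below generates rules in ICU's collation rule syntax (`&` resets at a letter, `<` introduces a primary difference, `<<<` a tertiary, i.e. case, difference). It only builds a string and does not call ICU; the helper name `tailoring_rules` is hypothetical, and whatever CLDR eventually ships for ku may well be written differently.

```python
# Hawar alphabet from the original report; each accented letter
# directly follows its unaccented base letter.
ALPHABET = "abcçdeêfghiîjklmnopqrsştuûvwxyz"
BASIC_LATIN = set("abcdefghijklmnopqrstuvwxyz")


def tailoring_rules(alphabet):
    """Build ICU-style rules placing each extra letter after its predecessor."""
    rules = []
    for previous, letter in zip(alphabet, alphabet[1:]):
        if letter not in BASIC_LATIN:
            # '<' makes the accented letter a separate letter (primary
            # difference); '<<<' orders its upper-case form by case only.
            rules.append(f"&{previous} < {letter} <<< {letter.upper()}")
    return " ".join(rules)


print(tailoring_rules(ALPHABET))
```

Making the accented letters primary differences is what would allow separate section headings, in contrast to the default ordering where they only act as tie breakers.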
(In reply to comment #0) Although little used and not included in the alphabet, the letters "Ḧḧ" and "Ẍẍ" must be integrated in sorting. We now have 26 letters like the English language + 7 diacritical letters: Aa · Bb · Cc · Çç · Dd · Ee · Êê · Ff · Gg · Hh · Ḧḧ · Ii · Îî · Jj · Kk · Ll · Mm · Nn · Oo · Pp · Qq · Rr · Ss · Şş · Tt · Uu · Ûû · Vv · Ww · Xx · Ẍẍ · Yy · Zz (In reply to comment #9) I made a request for the Kurdish alphabet.
I made a request to CLDR; it seems that the problem is fixed: http://unicode.org/cldr/trac/ticket/6527
(In reply to ghybu.w from comment #11)
> I made a request for the CLDR, it seems that the problem is fixed:
> http://unicode.org/cldr/trac/ticket/6527
Cool, thanks. (For reference, upstream revisions are http://unicode.org/cldr/trac/changeset/9765 http://unicode.org/cldr/trac/changeset/9761). So, open question: does PHP's intl extension use ICU data in the "seed" directory? If so, this is now pending us upgrading to CLDR 25, which will happen at ?? (Probably not for a little while, not sure when.)