Last modified: 2014-04-08 06:09:21 UTC
Hello. Is it possible to change alphabetical order in the categories for all Kurdish projects (ku.wikipedia, ku.wiktionary, ku.wikiquote and ku.wikibooks)?
Kurdish alphabet (Kurmanci) uses Latin letters (31 letters).Alphabetical order:
Aa · Bb · Cc · Çç · Dd · Ee · Êê · Ff · Gg · Hh · Ii · Îî · Jj · Kk · Ll · Mm · Nn · Oo · Pp · Qq · Rr · Ss · Şş · Tt · Uu · Ûû · Vv · Ww · Xx · Yy · Zz
26 letters like the English language + 5 diacritical letters (Çç, Êê, Îî, Şş, Ûû)
The problem is that the diacritic letters (Çç, Êê, Îî, Şş, Ûû) do not follow the Kurdish alphabetical order (see above) in the categories: they are placed at the end.
For example here (Ç, Î, Û placed at the end): http://ku.wikipedia.org/wiki/Kategor%C3%AE:Dewlet%C3%AAn_Asyay%C3%AA
More generally, all the classifications can it be done by Kurdish alphabetical order? Can we do something?
Sorry if I'm not at the right place but I do not know who else to ask.
Thank you in advance.
This is the right place.
At first glance I didnt see kurdish on the list of supported collations for icu that I found on google, which would be a problem. But maybe i just missed it. Will have to investigate further.
Yup, it seems to be unsupported, trying to use uca-ku collation gives the same results as if it was uca-default: http://users.v-lo.krakow.pl/~matmarex/testwiki-ku/index.php?title=Kategor%C3%AE:Test
(So no separete headings for the letters with diacritic marks, and e.g. C and Ç are considered the same letter when sorting - this isn't visible on that page right now, since if there's a conflict, the diacritical version is placed after the default one.)
ghybu, does that seems like an improvement over the current state?
No, I don't see any improvement.
I mean, is the category sorting on the testwiki I linked better than the one currently visible on ku.wikipedia?
(Also, feel free to create or modify pages there to test the behavior.)
Yes, it is much better, this is what I wanted. But, it's necessary to also have sections (Ç, Ê, Î, Ş, Û); for example to use this type of template: http://ku.wikipedia.org/wiki/%C5%9Eablon:TOC_Kategor%C3%AE
(In reply to comment #5)
> Yes, it is much better, this is what I wanted. But, it's necessary to also
> sections (Ç, Ê, Î, Ş, Û); for example to use this type of template:
In the test wiki linked above, the accents are secondary differences (used as tie breakers) where in kurdish they should be considered different letters. (Look at the section for c in that link where I just added more examples). Thus it can't have separate section headers as they arent sorted separately
Comment 3 is asking if even though the behaviour is wrong, is it more or less wrong than the current behaviour on ku wikis.
I have also done test, the behavior is not good. The current version of ku.wiki is better. I also think separate sections are required. Thank you for trying!
A solution was found here: bug 30287 an bug 50311
Can we do the same?
(In reply to comment #8)
> A solution was found here: bug 30287 an bug 50311
> Can we do the same?
Not really. fa is in the list of supported collations by cldr at http://www.unicode.org/repos/cldr/trunk/common/collation/ . ku is not. I think the next step in this bug would be get cldr to add ku as a collation ( http://cldr.unicode.org/index/cldr-spec/collation-guidelines )
[Or i suppose making php's intl bindings to the icu library suck a little less so we could make our own collation]
(In reply to comment #0)
Although little used and not included in the alphabet, the letters "Ḧḧ" and "Ẍẍ" must be integrated in sorting, now we have (26 letters like the English language + 7 diacritical letters):
Aa · Bb · Cc · Çç · Dd · Ee · Êê · Ff · Gg · Hh · Ḧḧ · Ii · Îî · Jj · Kk · Ll · Mm · Nn · Oo · Pp · Qq · Rr · Ss · Şş · Tt · Uu · Ûû · Vv · Ww · Xx · Ẍẍ · Yy · Zz
(In reply to comment #9)
I made a request for Kurdish alphabet.
I made a request for the CLDR, it seems that the problem is fixed: http://unicode.org/cldr/trac/ticket/6527
(In reply to ghybu.w from comment #11)
> I made a request for the CLDR, it seems that the problem is fixed:
Cool, thanks. (For reference, upstream revisions are http://unicode.org/cldr/trac/changeset/9765 http://unicode.org/cldr/trac/changeset/9761).
So open question: Does php intl's extension use ICU data in the "seed" directory
If so, this is now pending us upgrading to CLDR 25, which will happen at ?? (Probably not for a little while, not sure when)