Last modified: 2013-10-18 15:49:20 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T45547, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 43547 - MediaWiki needs a fictitious variant for English for easier variant development work
MediaWiki needs a fictitious variant for English for easier variant developme...
Status: REOPENED
Product: MediaWiki
Classification: Unclassified
Language converter (Other open bugs)
1.21.x
All All
: Low enhancement with 1 vote (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks: 41716
  Show dependency treegraph
 
Reported: 2013-01-01 00:24 UTC by Bartosz Dziewoński
Modified: 2013-10-18 15:49 UTC (History)
13 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Bartosz Dziewoński 2013-01-01 00:24:18 UTC
Let's start 2013 with a fun bug.

It would be very useful when working on the skins or other bits of the interface.

Currently, to test variants handling, you have to switch the entire wiki to a language that uses variant conversion, and unfortunately it seems like all of them use non-Latin alphabets, which makes working with them considerably harder if know them than if they were in a latin script at least.

With this, you would just set $wgUsePigLatinVariant (or something similar) to true and be able to test.

Of course we could use piratey or lolcat language instead of pig latin, but it seems the nicest to me (and easier to implement).
Comment 1 MZMcBride 2013-01-01 00:28:35 UTC
Please don't mark bugs as resolved (particularly resolved/wontfix) without a comment. Re-opening for now.
Comment 2 Dereckson 2013-01-01 00:38:48 UTC
If we want to explore this option, I would recommend to take a look to two small applications: Jive and valspeak, as there are very easy to implement.

I have to note these applications could be in violation with modern expectation of decency and politeness.

For example, the jive application [1] is built with the following code:

#include <stdio.h>

char *yylex();

int main()
{
        char *line;
        while(line = yylex()){
                printf("%s", line);
        }
        return 0;
}

And this lex list:
http://stuff.mit.edu/afs/sipb/user/rfrench/src/jive/jive.l

The valspeak lex list:
https://groups.google.com/group/net.sources/msg/be14e2cfcdf7eb06?dmode=source

[1] https://groups.google.com/group/net.sources/tree/browse_frm/month/1986-10/e5cbd0d14f430065?rnum=81&_done=/group/net.sources/browse_frm/month/1986-10?&pli=1

Strangely, the jive application reaches FreeBSD port, but not the valspeak.
Comment 3 Nemo 2013-01-01 00:40:51 UTC
(In reply to comment #1)
> Please don't mark bugs as resolved (particularly resolved/wontfix) without a
> comment. 

Comment: looks like what you want is rather bug 38486.
Comment 4 Bartosz Dziewoński 2013-01-01 00:47:22 UTC
(In reply to comment #3)
> Comment: looks like what you want is rather bug 38486.

While slightly similar, this doesn't seem like the same thing to me (although solutions to one of them could solve the other as well, in some cases). 

This bug is from the perspective of a skin developer, that one is from an extension developer. I went through some pain to implement interface for variants in CologneBlue when I was rewirting it, and I've been sitting on this idea since.

Well, I'm not sure. I think we should just try implementing something ;) (I could work on this pig Latin variant if there was a reasonable chance of it getting merged).

[Reopening for now; maybe it's a dupe, but certainly not a wontfix, is it?]
Comment 5 Liangent 2013-03-23 13:18:17 UTC
The language converter works on chars or substrings instead of (space-delimited) words, so I don't think it's easy to adapt a word-based conversion here (correct me if I'm wrong). Since this is not a serious variant development, maybe we can choose l33t or something similar instead?
Comment 6 Liangent 2013-03-29 17:24:43 UTC
Hey, just a converter between UPPERCASE and lowercase is enough.
Comment 7 Helder 2013-03-29 17:42:22 UTC
See also bug 26121 comment 12.
Comment 8 Bawolff (Brian Wolff) 2013-04-19 07:27:57 UTC
Btw, In my experiance serbian is best for testing things when your eyes are used to a latin script
Comment 9 Daniel Friesen 2013-04-19 07:53:26 UTC
The converter works abstractly on text not individual characters. So I'd think it should be completely fine to do word based conversions.

Heck, I'm already doing word based conversions. I hacked the variant system with a custom language to make it so that character names in a certain animanga wiki can vary by what version of the series you've been accustomed to (The original romaji, manga translation, subtitled translations, dub translations, and translations of alternate dubs can have different translations for the same character). Though a recent upgrade may have broke some of the code I used to hackily load the language variants in to mw in the first place.

Note that we have another bug contemplating the idea of implementing US, Canada, UK, etc... English as variants.
Comment 10 Liangent 2013-04-19 07:58:24 UTC
(In reply to comment #9)
> The converter works abstractly on text not individual characters. So I'd
> think
> it should be completely fine to do word based conversions.
> 
> Heck, I'm already doing word based conversions. I hacked the variant system
> with a custom language to make it so that character names in a certain
> animanga
> wiki can vary by what version of the series you've been accustomed to (The
> original romaji, manga translation, subtitled translations, dub translations,
> and translations of alternate dubs can have different translations for the
> same
> character). Though a recent upgrade may have broke some of the code I used to
> hackily load the language variants in to mw in the first place.
> 
> Note that we have another bug contemplating the idea of implementing US,
> Canada, UK, etc... English as variants.

Yeah you have to override LanguageConverter::translate() then, and can't make use of the default conversion table holder (ReplacementArray).
Comment 11 Gerrit Notification Bot 2013-07-04 21:58:56 UTC
Change 72053 had a related patch set uploaded by Liangent:
(bug 43547) New language variant en-x-piglatin DO NOT MERGE

https://gerrit.wikimedia.org/r/72053
Comment 12 John Mark Vandenberg 2013-07-28 01:59:16 UTC
(In reply to comment #9)
> ..
> Note that we have another bug contemplating the idea of implementing US,
> Canada, UK, etc... English as variants.

bug 31015?  It would be useful to start that set of variants, with just the basic transforms; the implementation of a viable set of conversions could then develop over time.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links