Last modified: 2013-07-04 10:35:00 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T45332, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 43332 - Add non-English wiki support to Parsoid
Add non-English wiki support to Parsoid
Status: RESOLVED FIXED
Product: Parsoid
Classification: Unclassified
General (Other open bugs)
unspecified
All All
: Normal normal
: ---
Assigned To: Mark Holmquist
: i18n
Depends on: 41716 41151 41206 42160
Blocks: 45749
  Show dependency treegraph
 
Reported: 2012-12-21 22:14 UTC by Gabriel Wicke
Modified: 2013-07-04 10:35 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Gabriel Wicke 2012-12-21 22:14:31 UTC
Tasks:
* try to pull config options (namespace names mainly) from the MW API when accessing the wiki first
* our link t[r]ail regexp needs to be developed further / tested in other languages; it is a negative char match rather than the language-specific positive regexps, so can potentially be made to work across languages.
* language variants are further down on the list (see bug 41716).
Comment 1 Mark Holmquist 2012-12-21 22:19:32 UTC
I'll see what I can do about the config options, definitely. Are there any other options that we need besides namespace names?
Comment 3 Gabriel Wicke 2012-12-22 00:21:23 UTC
The actual GET request URL (displayed in the sandbox at the bottom after executing the sample query):

https://en.wikipedia.org//w/api.php?action=query&meta=siteinfo&format=json&siprop=namespaces|namespacealiases|specialpagealiases
Comment 4 Gabriel Wicke 2012-12-22 00:43:08 UTC
Oh, fun- my vague memory that image / file options and magic words are localized too turned out to be true:

Look for 'img_' in

https://fr.wikipedia.org/w/api.php?action=query&meta=siteinfo&format=json&siprop=general%7Cnamespaces%7Cnamespacealiases%7Cspecialpagealiases%7Cmagicwords%7Cinterwikimap%7Cdbrepllag%7Cstatistics%7Cusergroups%7Cextensions%7Cfileextensions%7Crightsinfo%7Clanguages%7Cskins%7Cextensiontags%7Cfunctionhooks%7Cshowhooks%7Cvariables

That output has more interesting things about registered extensions, parser functions etc- probably a good idea to simply wrap the entire JSON object with a WikiConfig object which provides convenient accessor methods for the info we need. That way we would only need to update those accessors if the underlying JSON structure changes.
Comment 5 Mark Holmquist 2013-01-03 23:21:12 UTC
First attempt at some progress: https://gerrit.wikimedia.org/r/42197
Comment 6 Mark Holmquist 2013-01-08 01:23:55 UTC
The patch is now updated (rebased), and has addressed most of the issues raised in review.
Comment 7 Mark Holmquist 2013-01-10 00:21:33 UTC
Some work done in https://gerrit.wikimedia.org/r/42798
Comment 8 Mark Holmquist 2013-01-15 22:06:32 UTC
More steps taken in https://gerrit.wikimedia.org/r/43972

The next step is magic words, which could take rather a bit of work.
Comment 9 Mark Holmquist 2013-01-17 00:40:57 UTC
https://gerrit.wikimedia.org/r/44353

That patch is a bit of work on using magic words from the remote wiki config. It's still WIP, but the feature is coming!
Comment 10 Gabriel Wicke 2013-06-12 00:43:12 UTC
This is now pretty far along and used in production. Please report further issues separately.
Comment 11 Andre Klapper 2013-07-04 10:35:00 UTC
[Parsoid component reorg by merging JS/General and General. See bug 50685 for more information. Filter bugmail on this comment. parsoidreorg20130704]

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links