Last modified: 2014-01-30 09:15:17 UTC
It would be really usefull to be able to import XML dictionary, glossary data
that are open content into wiktionary.
It would be equally usefull to be able to export XML data about wiktionary content.
This would enhance the open character of wiktionary and it would open up the
"proprietary" nature of our content.
XML is a standard for how to format data. Just like CSV or other formats, it
does not provide information about the meaning of data. "import xml" is a very
There are at least two existing XML definitions for things like a glossary. The
people from GEMET produce there thesaurus in XML. This would cover a lot of the
ground that is required. There propably also is a XML standard for dictionary
content. I will learn that from the GEMET people I trus.
For thesaurus data exchange I would propose the Zthes standard which is based on some ISO
It seems to be most wiedly used including ORACLE (?) and our projects www.geometa.info and
www.integis.ch/tsaurus (see e.g. http://www.geometa.info/report/geometa-info-6-3.html).
There are several standards there is one for thesaurus use, one for glossary use
and several for dictionary use. Important in the selection is to find that it
fits what Wikionary will require. A thesaurus typically does not include
ethymology or translations.
Why not just export the data in the simplest XML format we can think of that
includes all the data. If people want it in some other standard format, that's
what XSLT was designed for. Confirming to some other standard straight from
wiktionary will just make it harder to implement competing standards.
Import failed: XML import parse failure at line 184308, col 84 (byte 12778764; ""): Invalid document end
-- Served by srv136 in 5.508 secs. --
Import failed: XML import parse failure at line 190401, col 2 (byte 13262462; ""): Invalid document end
-- Served by srv125 in 6.480 secs. --
Whoops, sorry. That last comment was intended for bugzilla:9911.
Stale, no real activity for four and a half years. We already have data dumps in XML format to export content. Importing data in XML format is best done by a bot script.
Don't use REMIND.
*** Bug 21460 has been marked as a duplicate of this bug. ***
I'm going to close this bug again as it is too vague. Besides, in the current day, things like this should be done by bots (for import) and scripts operating on database dumps (for export).