Last modified: 2012-04-16 09:24:34 UTC
I tried loading "esqiki-latest-pages-articles.xml.bz2" into a MySQL database using mwdumper. The command I use is: java -client -classpath mwdumper.jar;mysql-connector-java-3.1.12-bin.jar org.mediawiki.dumper.Dumper "--output=mysql://127.0.0.1/wikidb?user=<user>&password=<password>" "--format=sql:1.5" ".\eswiki-latest-pages-articles.xml.bz2" (obviously, typing my username and password in the fields <user> and <password>) And everything seems to go well (with the status report after every 1000 pages), but when I reach the 436.000 pages I get: ... 435.000 pages (532,589/sec), 435.000 revs (532,589/sec) 436.000 pages (531,89/sec), 436.000 revs (531,89/sec) Exception in thread "main" java.io.IOException: java.sql.SQLException: Duplicate entry '0-' for key 2 at org.mediawiki.importer.XmlDumpReader.readDump(Unknown Source) at org.mediawiki.dumper.Dumper.main(Unknown Source) Caused by: org.xml.sax.SAXException: java.sql.SQLException: Duplicate entry '0-' for key 2 at org.mediawiki.importer.XmlDumpReader.endElement(Unknown Source) ... As advised in you page, the database is expecting utf8-encoding text. Also I tried with an older version of the spanish wikipedia I had (eswiki-20080507-pages-articles.xml.bz2) and I got the same error, this time after the line: 438.000 pages (527,591/sec), 438.000 revs (527,591/sec) The program crashes, so I have no way to load the sapanish wikipedi files. Thank you very much for your help.
*** This bug has been marked as a duplicate of bug 14379 ***