Last modified: 2012-04-16 09:24:34 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T16958, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 14958 - SQLException while loading spanish dump
SQLException while loading spanish dump
Status: RESOLVED DUPLICATE of bug 14379
Product: Utilities
Classification: Unclassified
mwdumper (Other open bugs)
unspecified
PC Windows XP
: Normal blocker (vote)
: ---
Assigned To: Brion Vibber
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2008-07-28 11:31 UTC by Jesus
Modified: 2012-04-16 09:24 UTC (History)
1 user (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Jesus 2008-07-28 11:31:35 UTC
I tried loading "esqiki-latest-pages-articles.xml.bz2" into a MySQL database using mwdumper. The command I use is:

java -client -classpath mwdumper.jar;mysql-connector-java-3.1.12-bin.jar org.mediawiki.dumper.Dumper "--output=mysql://127.0.0.1/wikidb?user=<user>&password=<password>" "--format=sql:1.5" ".\eswiki-latest-pages-articles.xml.bz2"

(obviously, typing my username and password in the fields <user> and <password>)

And everything seems to go well (with the status report after every 1000 pages), but when I reach the 436.000 pages I get:

...
435.000 pages (532,589/sec), 435.000 revs (532,589/sec)
436.000 pages (531,89/sec), 436.000 revs (531,89/sec)
Exception in thread "main" java.io.IOException: java.sql.SQLException: Duplicate entry '0-' for key 2
    at org.mediawiki.importer.XmlDumpReader.readDump(Unknown Source)
    at org.mediawiki.dumper.Dumper.main(Unknown Source)
Caused by: org.xml.sax.SAXException: java.sql.SQLException: Duplicate entry '0-' for key 2
    at org.mediawiki.importer.XmlDumpReader.endElement(Unknown Source)
...

As advised in you page, the database is expecting utf8-encoding text. Also I tried with an older version of the spanish wikipedia I had (eswiki-20080507-pages-articles.xml.bz2) and I got the same error, this time after the line:

438.000 pages (527,591/sec), 438.000 revs (527,591/sec) 

The program crashes, so I have no way to load the sapanish wikipedi files. 

Thank you very much for your help.
Comment 1 Max Semenik 2008-08-22 09:16:53 UTC

*** This bug has been marked as a duplicate of bug 14379 ***

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links