Last modified: 2014-08-04 15:51:03 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T26559, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 24559 - mwdumper crashes with mysql syntax error
mwdumper crashes with mysql syntax error
Status: NEW
Product: Utilities
Classification: Unclassified
mwdumper (Other open bugs)
unspecified
PC Linux
: Normal critical (vote)
: ---
Assigned To: Brion Vibber
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2010-07-27 16:54 UTC by Vasile Ceteras
Modified: 2014-08-04 15:51 UTC (History)
1 user (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Vasile Ceteras 2010-07-27 16:54:07 UTC
I run mwdumper on enwiki-20100622-pages-articles.xml, it crashes after 310.000 pages with:

ERROR 1064 (42000) at line 6034: You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near ''{{Infobox CanadianMP | name= Dean Allison\n| image = \n| term_start=October 4, ' at line 1



java -jar mwdumper.jar --format=sql:1.5 enwiki-20100622-pages-articles.xml --filter=latest | mysql -u user -ppassword -D wikidb --default-character-set=utf8 -f

Please advise, I couldn't find a workaround for this.

Thank you!

Vasile Ceteras
Comment 1 Brion Vibber 2010-07-27 18:24:27 UTC
Sounds like a bad character escaping. Adding to my list to test when I get an updated data set downloaded on my big test comp...
Comment 2 Umherirrender 2010-07-31 16:44:29 UTC
The jar from http://download.wikimedia.org/tools/ is out of sync with trunk. Try to load the source from trunk and run with that.

See r12972 for the escaping fix.
Comment 3 Andre Klapper 2012-10-09 15:43:21 UTC
Vasile Ceteras: Is this still an issue if you load the source from trunk?
Comment 4 Vasile Ceteras 2012-10-09 22:09:38 UTC
I'm really sorry, but I'm not a Java developer, so I couldn't compile the source. 
I've got the svn checkout, but ant can't compile it.


I'm running Centos 6, 64bit.

svn checkout http://svn.wikimedia.org/svnroot/mediawiki/trunk/mwdumper/
...
Checked out revision 115751.

cd mwdumper 
ant

...
compile:
    [javac] Compiling 39 source files to /home/vceteras/Downloads/mwdumper/mwdumper/bin
    [javac] /home/vceteras/Downloads/mwdumper/mwdumper/src/org/mediawiki/dumper/gui/DumperGui.java:253: annotations are not supported in -source 1.4
    [javac] (use -source 5 or higher to enable annotations)
... 

I edit build.xml at line 19: source="1.5"' and line 20 target="1.5" .

Now I get this at the end: "[javac] 100 errors" .
Lots of packages do not exist, and two "cannot find symbol" errors.

I apologise for the late answer, I've been terribly busy and couldn't get any of my co-workers to help either.

It would be really great if someone could compile mwdumper for everyone who wants to use it without learning java.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links