Last modified: 2011-11-29 03:20:57 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T12693, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 10693 - Database dumps, smaller - pages-articles.xml should be avalaible in .7z as well
Database dumps, smaller - pages-articles.xml should be avalaible in .7z as well
Status: RESOLVED WONTFIX
Product: Datasets
Classification: Unclassified
General/Unknown (Other open bugs)
unspecified
All All
: Lowest enhancement (vote)
: ---
Assigned To: Brion Vibber
http://download.wikimedia.org/enwiki/...
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2007-07-25 08:53 UTC by adam
Modified: 2011-11-29 03:20 UTC (History)
1 user (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description adam 2007-07-25 08:53:12 UTC
pages-articles.xml.bz2  "Articles, templates, image descriptions, and primary meta-pages" ("the archive most mirror sites will probably want") is currently only avaliable in .bz2
It should be made available in .7z as well

pages-meta-history.xml.7z is 3.2 GB the .bz2 is 5 gigs
It would appear that .7z is 64% of the size of .bz2
So pages-articles.xml.bz2 could be aprox 1.7 gigs, instead of 2.7 gigs. This may even reduce  some server load/bandwith.
Comment 1 Brion Vibber 2007-07-25 13:57:30 UTC
The extreme size savings for the history dumps are due to better compression of the multiple-revision runs. Current-version dumps compress to about the same size with .bz2 and .7z, but p7zip takes much much longer to run.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links