Last modified: 2013-05-31 01:21:43 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. Logging in is not possible, and apart from displaying bug reports and their history, links might be broken. See T17017, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 15017 - Wikimedia static HTML dumps broken
Status: NEW
Product: Datasets
Classification: Unclassified
Component: General/Unknown
Version: unspecified
Hardware: All All
Importance: Normal normal with 4 votes
Target Milestone: ---
Assigned To: Nobody - You can work on this!
http://dumps.wikimedia.org/other/stat...
Duplicates: 16186
Reported: 2008-08-02 23:02 UTC by 555
Modified: 2013-05-31 01:21 UTC (History)

Description 555 2008-08-02 23:02:41 UTC
Since the developer team has some exciting news (a donation for future usage [1] and new boxes have been ordered [2]), I'm finally comfortable adding this request: please make static HTML dump files for non-Wikipedia projects.

The biggest wikis from those projects have useful content, sometimes more useful than some small Wikipedias that have static dumps. Moreover, an HTML dump is one of the resources to spread the word about some Wikimedia projects :)

Best regards,
[[:m:User:555]]

[1] http://lists.wikimedia.org/pipermail/foundation-l/2008-July/044905.html

[2] http://lists.wikimedia.org/pipermail/wikitech-l/2008-August/038869.html
Comment 1 Siebrand Mazeland 2008-08-11 00:04:50 UTC
Assigning to brion. He should be able to process, or re-assign.
Comment 2 Brion Vibber 2008-08-11 00:59:51 UTC
Assigning to Tim, as he's in charge of static HTML dumps at present.
Comment 3 p858snake 2009-07-29 07:02:08 UTC
Changing Component: WikiBugs → Downloads
Comment 4 Sam Reed (reedy) 2011-10-25 18:00:50 UTC
*** Bug 16186 has been marked as a duplicate of this bug. ***
Comment 5 Hydriz Scholz 2012-06-18 13:05:06 UTC
Do note that static HTML dumps are no longer running at the moment (the last time was in 2008), but moving bug to the Datasets product nevertheless.
Comment 6 Hydriz Scholz 2012-06-18 13:28:06 UTC
Assigning back to Nobody. Ariel isn't involved in the static HTML dumps.
Comment 7 MZMcBride 2012-09-08 22:37:48 UTC
Changing the bug summary from "Static HTML dumps for non-Wikipedia projects" to "Wikimedia static HTML dumps broken". This more accurately reflects the current status.

Related mailing list thread: <http://lists.wikimedia.org/pipermail/wikitech-l/2011-December/056752.html>.

What's the status of this bug? Can the dumps please be updated? What's needed to make that happen? Is an RT ticket needed?
Comment 8 jeremyb 2012-09-09 22:25:38 UTC
-easy

Not something a typical user can work on. (at least unless there's a list of known issues with the code that need fixing before this is restarted)
Comment 9 Nemo 2012-09-17 22:09:50 UTC
It would be useful if someone identified the bugs which actually block this request, among https://bugzilla.wikimedia.org/buglist.cgi?query_format=advanced&component=DumpHTML&resolution=---&product=MediaWiki%20extensions

"Currently, the extension is not really usable without fixing/tweaking the Mediawiki code."
http://www.kiwix.org/index.php/Mediawiki_DumpHTML_extension_improvement#2_-_Revamping_and_fixing_bugs
Comment 10 MZMcBride 2013-02-02 23:59:02 UTC
Restoring the "shell" keyword. Until a shell user tries to re-generate these dumps, it'll be impossible to know what the issues are (if any).
Comment 11 Nemo 2013-02-03 07:27:28 UTC
(In reply to comment #10)
> Restoring the "shell" keyword. Until a shell user tries to re-generate these
> dumps, it'll be impossible to know what the issues are (if any).

Seriously? Kelson is the de facto maintainer of dumpHTML as he runs it for Kiwix, and he says (like everyone else) that it's broken, see comment 9.
Also, this bug is 5+ years old. Nowadays, it should probably be repurposed to ask for ZIM dumps, which make much more sense; we already have many, but it would be nice if they were produced regularly on WMF servers without the laborious steps Kelson currently has to go through.
Comment 12 Kelson [Emmanuel Engelhart] 2013-02-03 09:44:58 UTC
We will rewrite/fix the DumpHTML extension in the coming months. We have a project funded by a grant from Wikimedia France:
http://www.kiwix.org/index.php/Mediawiki_DumpHTML_extension_improvement
