Last modified: 2011-01-25 01:05:28 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T23809, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 21809 - commons fails to purge large djvu files
commons fails to purge large djvu files
Status: RESOLVED FIXED
Product: MediaWiki
Classification: Unclassified
Uploading (Other open bugs)
1.16.x
All All
: Normal normal (vote)
: ---
Assigned To: Tomasz Finc
http://commons.wikimedia.org/wiki/Fil...
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2009-12-10 07:51 UTC by Philippe Elie
Modified: 2011-01-25 01:05 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Philippe Elie 2009-12-10 07:51:56 UTC
Since a few days trying to purge large djvu file fail, so the text layer of djvu file is not accessible. After a successful purge, creating a page should show the text layer here: http://fr.wikisource.org/wiki/Livre:Croiset_-_Histoire_de_la_litt%C3%A9rature_grecque,_t4.djvu
Comment 1 ThomasV 2009-12-10 08:23:41 UTC
The bug can be seen for the following files:

http://commons.wikimedia.org/wiki/File:Burnouf_-_Lotus_de_la_bonne_loi.djvu
http://commons.wikimedia.org/wiki/File:Croiset_-_Histoire_de_la_litt%C3%A9rature_grecque,_t4.djvu
http://commons.wikimedia.org/wiki/File:Michaud_-_Biographie_universelle_ancienne_et_moderne_-_1843_-_Tome_10.djvu

the first two of them were uploaded recently, and the djvu text layer has not been successfully extracted, 
because of this bug; or maybe it is the text layer extraction that causes the bug.

the last file was uploaded a long time ago, and at that time the file could be purged, so 
the djvu text was successfully extracted; it is thus still available in the metadata.

I tested the first file on my machine, with a recent mediawiki install and it worked fine:
the file can be purged and the text layer is correctly extracted.
Comment 2 Lars Aronsson 2010-01-06 18:16:28 UTC
Also happened to http://commons.wikimedia.org/wiki/File:Uppslagsbok_f%C3%B6r_alla_1910.djvu 
Comment 3 Platonides 2010-01-06 19:01:42 UTC
Probably the same issue: http://en.wikisource.org/wiki/User:Billinghurst
Comment 4 Lars Aronsson 2010-01-07 20:15:54 UTC
When I try to "purge" the large djvu file from Commons, I get an HTTP 500 internal server error response after exactly 30 seconds. Why is purge taking so long? It should just remove old stuff (supposedly a quick operation), and then schedule a queued job for reindexing (a slower operation, depending on the job queue length).
Comment 5 ThomasV 2010-01-19 17:34:56 UTC
Purge takes time because of the djvu text layer extraction.

The bug should be fixed in r61258. 
Comment 6 ThomasV 2010-03-03 14:36:56 UTC
Reopening this bug because the fix is not live.
Comment 7 Bryan Tong Minh 2010-04-11 22:02:29 UTC
Fix has been deployed, but purging still doesn't work.
Comment 8 Philippe Elie 2010-04-12 06:15:02 UTC
Purging work fine now, Bryan, what File: fails to purge for you ?
Comment 9 Bryan Tong Minh 2010-04-12 09:22:15 UTC
I got a 403 "Wikimedia has an error" error page trying to purge <http://commons.wikimedia.org/wiki/File:Uppslagsbok_f%C3%B6r_alla_1910.djvu>. Presumably it times out, because it takes a long while to load the page.
Comment 10 ThomasV 2010-04-12 09:34:11 UTC
it works for me.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links