Last modified: 2014-05-17 00:52:08 UTC
Currently on Commons and on the various projects, there are images that have description pages but their images have disappeared. Because the meta data (MIME type, file size, directory, etc.) is stored in the database, it's not easily possible to find which images have this issue. (Without a definitive list of broken images, it also makes it very difficult to know whether this is a growing problem somehow related to WMF's servers or not.) It would be nice if someone were to write a script that checked each image in a database to ensure that it exists and that it is not 0 bytes. Example (though this will likely be deleted by an admin at some point): <http://commons.wikimedia.org/wiki/Image:Hatogayacity_Fire_Department.jpg>
Is this still a issue that needs to be investigated? Since you are a TS wizz, Can this be done with the recent DB metadata improvements?
Bumping this. Still an issue?
Hello?
Are we talking about: 1) Running cleanupImages.php? Would probably need shell. 2) Running a database query? It would be something like "SELECT page_title FROM page WHERE NOT EXISTS (SELECT img_name FROM image WHERE img_name = page_title) AND NOT EXISTS (SELECT img_name FROM commonswiki_p.image WHERE img_name = page_title) AND page_namespace = 6 AND page_is_redirect = 0 LIMIT 1000;" 3) Fixing bug 32551?
(In reply to TeleComNasSprVen from comment #4) > Are we talking about: > > 1) Running cleanupImages.php? Would probably need shell. > 2) Running a database query? It would be something like "SELECT page_title > FROM page WHERE NOT EXISTS (SELECT img_name FROM image WHERE img_name = > page_title) AND NOT EXISTS (SELECT img_name FROM commonswiki_p.image WHERE > img_name = page_title) AND page_namespace = 6 AND page_is_redirect = 0 LIMIT > 1000;" > 3) Fixing bug 32551? The bug summary is "Write and run script to find non-existent images on Wikimedia wikis". We're probably talking about that.