Last modified: 2014-05-17 00:52:08 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from the display of bug reports and their history, links might be broken. See T17889, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 15889 - Write and run script to find non-existent images on Wikimedia wikis
Status: NEW
Product: Wikimedia
Classification: Unclassified
Component: Site requests
Version: unspecified
Hardware: All All
Importance: Low enhancement
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Depends on:
Blocks: 16660
Reported: 2008-10-07 21:17 UTC by MZMcBride
Modified: 2014-05-17 00:52 UTC (History)

Description MZMcBride 2008-10-07 21:17:14 UTC
Currently on Commons and on the various projects, there are images that have description pages but whose files have disappeared. Because the metadata (MIME type, file size, directory, etc.) is stored in the database and is still there after a file goes missing, it's not easy to find which images have this issue. (Without a definitive list of broken images, it is also very difficult to know whether this is a growing problem somehow related to WMF's servers.)

It would be nice if someone were to write a script that checked each image listed in a wiki's database to ensure that the file exists and that it is not 0 bytes. Example (though this will likely be deleted by an admin at some point): <http://commons.wikimedia.org/wiki/Image:Hatogayacity_Fire_Department.jpg>
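
A rough sketch of what such a check could look like, in Python. It is only an illustration, not an existing maintenance script: the function names are hypothetical, the list of file names is assumed to come from the image table, and the on-disk layout is assumed to be the usual hashed upload directory (first hex digit of the MD5 of the file name, then the first two digits, then the name).

```python
import hashlib
import os
from typing import Iterable, Iterator, Tuple


def hashed_path(root: str, name: str) -> str:
    """Build the assumed path <root>/<h[0]>/<h[0:2]>/<name>, where h is the
    MD5 hex digest of the file name. This layout is an assumption."""
    digest = hashlib.md5(name.encode("utf-8")).hexdigest()
    return os.path.join(root, digest[0], digest[:2], name)


def find_broken_files(root: str, names: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (name, reason) for every file that is missing or 0 bytes."""
    for name in names:
        try:
            size = os.path.getsize(hashed_path(root, name))
        except OSError:
            # The database knows the file, but nothing is on disk.
            yield name, "missing"
            continue
        if size == 0:
            yield name, "empty"
```

Running something like this against real storage would probably need shell access; the names passed in would be the img_name values for the wiki being checked.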
Comment 1 p858snake 2012-01-16 10:46:49 UTC
Is this still an issue that needs to be investigated?

Since you are a TS whiz, can this be done with the recent DB metadata improvements?
Comment 2 Alex Monk 2012-06-10 17:54:09 UTC
Bumping this. Still an issue?
Comment 3 Alex Monk 2012-07-26 01:10:37 UTC
Hello?
Comment 4 TeleComNasSprVen 2014-05-14 07:48:29 UTC
Are we talking about:

1) Running cleanupImages.php? Would probably need shell.
2) Running a database query? It would be something like "SELECT page_title FROM page WHERE NOT EXISTS (SELECT img_name FROM image WHERE img_name = page_title) AND NOT EXISTS (SELECT img_name FROM commonswiki_p.image WHERE img_name = page_title) AND page_namespace = 6 AND page_is_redirect = 0 LIMIT 1000;"
3) Fixing bug 32551?
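
To make option 2 concrete, here is a small self-contained sketch that runs the same query against toy tables in SQLite. The table contents are made up, the function name is hypothetical, and the real replicas are not SQLite, so treat it only as a way to see what the query selects. Going by the description above, the affected files still have their metadata rows, so a query like this would likely catch a different case: File: pages with no image row at all.

```python
import sqlite3

# The query from option 2, reformatted but otherwise unchanged.
QUERY = """
SELECT page_title FROM page
WHERE NOT EXISTS (SELECT img_name FROM image WHERE img_name = page_title)
  AND NOT EXISTS (SELECT img_name FROM commonswiki_p.image
                  WHERE img_name = page_title)
  AND page_namespace = 6 AND page_is_redirect = 0
LIMIT 1000;
"""


def file_pages_without_image_row():
    """Run QUERY on in-memory toy data and return the matching titles."""
    conn = sqlite3.connect(":memory:")
    # A second in-memory database stands in for the Commons replica.
    conn.execute("ATTACH DATABASE ':memory:' AS commonswiki_p")
    conn.execute("CREATE TABLE page (page_title TEXT, page_namespace INTEGER,"
                 " page_is_redirect INTEGER)")
    conn.execute("CREATE TABLE image (img_name TEXT)")
    conn.execute("CREATE TABLE commonswiki_p.image (img_name TEXT)")
    conn.executemany("INSERT INTO page VALUES (?, ?, ?)", [
        ("Local.jpg", 6, 0),      # has a local image row
        ("OnCommons.jpg", 6, 0),  # has a Commons image row
        ("Orphan.jpg", 6, 0),     # no image row anywhere
        ("Redirect.jpg", 6, 1),   # redirect, excluded by the query
        ("Article", 0, 0),        # not in the File namespace
    ])
    conn.execute("INSERT INTO image VALUES ('Local.jpg')")
    conn.execute("INSERT INTO commonswiki_p.image VALUES ('OnCommons.jpg')")
    titles = [row[0] for row in conn.execute(QUERY)]
    conn.close()
    return titles
```

With the toy rows above, only Orphan.jpg is returned.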
Comment 5 MZMcBride 2014-05-17 00:52:08 UTC
(In reply to TeleComNasSprVen from comment #4)
> Are we talking about:
> 
> 1) Running cleanupImages.php? Would probably need shell.
> 2) Running a database query? It would be something like "SELECT page_title
> FROM page WHERE NOT EXISTS (SELECT img_name FROM image WHERE img_name =
> page_title) AND NOT EXISTS (SELECT img_name FROM commonswiki_p.image WHERE
> img_name = page_title) AND page_namespace = 6 AND page_is_redirect = 0 LIMIT
> 1000;"
> 3) Fixing bug 32551?

The bug summary is "Write and run script to find non-existent images on Wikimedia wikis". We're probably talking about that.
