Last modified: 2014-10-09 19:03:55 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T32961, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 30961 - Run refreshImageMetadata.php
Run refreshImageMetadata.php
Status: NEW
Product: Wikimedia
Classification: Unclassified
Site requests (Other open bugs)
unspecified
All All
: Normal enhancement (vote)
: ---
Assigned To: Nobody - You can work on this!
: shell
Depends on: 31740
Blocks: 29782 29876 31391
  Show dependency treegraph
 
Reported: 2011-09-18 02:32 UTC by Bawolff (Brian Wolff)
Modified: 2014-10-09 19:03 UTC (History)
10 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Bawolff (Brian Wolff) 2011-09-18 02:32:23 UTC
1.18 added a lot of new image metadata properties (Exif and friends) that are now extracted that weren't previously. Please run refreshImageMetadata.php after the 1.18 deployment, which will go through every image and re-extract all the metadata.


Also, the script outputs the name of any image that had a larger img_metadata field before the refresh relative to after it. This could indicate a bug in new img_metadata stuff since almost all of the old properties are still extracted, so the img_metadata field should either stay the same or increase in size. (There is one exception to this, but it should be very rare). It'd be great if stdout of that script could be redirected to a file and posted somewhere, so I could investigate any cases of the metadata decreasing in size.

Cheers.
Comment 1 Mark A. Hershberger 2011-09-20 17:04:06 UTC
Adding Reedy as CC since he has a clue what needs to be done here and says it needs some planning.
Comment 2 Tim Starling 2011-10-15 18:28:56 UTC
With --force ?
Comment 3 Bawolff (Brian Wolff) 2011-10-15 19:58:54 UTC
(In reply to comment #2)
> With --force ?

That's not necessary. Various version fields were incremented in 1.18 so the script will recognize the "old" metadata values as needing updating, which are the only one's that need to be updated.
Comment 4 Tim Starling 2011-10-15 20:40:47 UTC
It's in progress.
Comment 5 Derk-Jan Hartman 2011-10-23 09:03:24 UTC
Still running ? Finished ? Halted ?
Comment 6 Bawolff (Brian Wolff) 2011-10-23 18:21:49 UTC
(In reply to comment #5)
> Still running ? Finished ? Halted ?

I believe its halted pending me fixing bug 31740 (which i plan to soon)
Comment 7 Tim Starling 2011-10-25 06:03:03 UTC
Yes, it was stopped for that bug, and I've now backported and deployed the bugfix and restarted the script.
Comment 8 Tim Starling 2011-10-25 22:19:38 UTC
It looks like it just crashed due to OOM, while doing commonswiki.
Comment 9 Tim Starling 2011-10-26 04:55:30 UTC
Core dumping indicates that the bulk of the leaked memory is in image metadata structures, such as XMP. It's possible that it is leaking LocalFile objects.
Comment 10 Thehelpfulone 2012-05-10 00:04:15 UTC
Has this been run successfully?
Comment 11 Sam Reed (reedy) 2012-05-10 00:26:26 UTC
(In reply to comment #10)
> Has this been run successfully?

I imagine not, as no one has addressed the memory leaks
Comment 12 Krinkle 2012-05-11 03:31:49 UTC
Was scheduled for 1.18 deployment. Adding to 1.20wmf3 deployment now.
Comment 13 Sam Reed (reedy) 2012-05-11 03:32:34 UTC
(In reply to comment #12)
> Was scheduled for 1.18 deployment. Adding to 1.20wmf3 deployment now.

There's no point. The script needs fixing first
Comment 14 Krinkle 2012-05-11 03:33:36 UTC
(In reply to comment #13)
> (In reply to comment #12)
> > Was scheduled for 1.18 deployment. Adding to 1.20wmf3 deployment now.
> 
> There's no point. The script needs fixing first

Aha, I didn't read the previous comment right. Makes sense now.
Comment 15 Krinkle 2012-07-20 07:36:27 UTC
(In reply to comment #6)
> (In reply to comment #5)
> > Still running ? Finished ? Halted ?
> 
> I believe its halted pending me fixing bug 31740 (which i plan to soon)

That bug has been resolved as fixed in the mean time.

(In reply to comment #9)
> Core dumping indicates that the bulk of the leaked memory is in image metadata
> structures, such as XMP. It's possible that it is leaking LocalFile objects.

(In reply to comment #13)
> (In reply to comment #12)
> > Was scheduled for 1.18 deployment. Adding to 1.20wmf3 deployment now.
> 
> There's no point. The script needs fixing first

What has to be fixed still? Can a new bug be opened please as blocker to this one. Other wise, how about another run?
Comment 16 Bawolff (Brian Wolff) 2012-07-20 11:58:12 UTC
> What has to be fixed still? Can a new bug be opened please as blocker to this
> one. Other wise, how about another run?

The script appearently has memory leaks. Personally I'm not sure how or why that happens in a garbage collected language like php (furthermore one needs a test wiki with more than 10 files on it to really reproduce that issue), so fixing is a bit beyond me.
Comment 17 Andre Klapper 2013-04-25 17:25:51 UTC
(In reply to comment #15)
> > The script needs fixing first
> 
> What has to be fixed still? Can a new bug be opened please as blocker to this
> one. Other wise, how about another run?

Could somebody please specify in a new bug report what's need to fix (any pointers etc) in the script, and make it block this report?


High priority for 20 months & no changes for 9 months => resetting to normal.
Comment 18 Bawolff (Brian Wolff) 2013-04-25 20:22:27 UTC
High priority was probably an exageration, and its especially not now. As people notice a file without metadata, they can purge it if it bothers them.

As for fixing the bug. Im not really sure how or why there is a mem leak in the script or how to debug it.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links