Last modified: 2014-05-23 16:18:51 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T20871, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 18871 - Include at least some EXIF metadata in resized pictures
Include at least some EXIF metadata in resized pictures
Status: REOPENED
Product: MediaWiki
Classification: Unclassified
File management (Other open bugs)
unspecified
All All
: Low enhancement with 2 votes (vote)
: ---
Assigned To: Nobody - You can work on this!
http://commons.wikimedia.org/wiki/COM...
:
Depends on:
Blocks: exif
  Show dependency treegraph
 
Reported: 2009-05-22 05:51 UTC by folengo
Modified: 2014-05-23 16:18 UTC (History)
8 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description folengo 2009-05-22 05:51:38 UTC
A discussion took place on Wikimedia Commons' Village Pump on 19 May 2009 on how to best respond to the pressure from photographers wanting their names to be credited on article pages (1).

It was suggested that instead of (or in addition to) crediting photographers on article pages, we should make our best efforts to keep the copyright EXIF metadata when available. 

It was understood that the underlying reason for excluding EXIF data from resized pictures and thumbnails until now, was the concern that sometimes cameras add an overwhelmingly heavy amount of EXIF metadata.

Our conclusion is that some sort of compromise has to be reached between these two concerns, by including at least some EXIF metatada, if not all of them.

A) Perhaps, really small thumbnails like those used in categories, (that means 120px or smaller) might be allowed to remain void of metadata, while larger thumbnails or resized pictures would compulsorily include at least the most useful metadata.

B) The most useful metadata which should be included in most resized pictures and thumbnails should be :
**ImageDescription, 
**Copyright, 
**DateTimeOriginal, 
**DateTime, 
**GPSLatitudeRef, 
**GPSLatitude, 
**GPSLongitudeRef, 
**GPSLongitude, 
**GPSAltitudeRef, 
**GPSAltitude, 
**IPTC:Credit, 
**IPTC:CopyrightNotice, and 
**IPTC:Byline.

C) The message at the bottom of description pages (is it a [[Mediawiki:]] message ? I have not been able to find which) should be reworded with a less ambiguous wording. When a user reads "This file contains additional information (...) ", he remains clueless on whether that means the original file, or the resized 800px preview present on the description page, or both.

(1) [[COM:VP#Crediting photographers in article pages]]
Comment 1 folengo 2009-05-22 05:54:06 UTC
Erratum : the link to the discussion is [[:Commons:COM:VP#Crediting photographers in article pages]] .
Comment 3 folengo 2009-05-22 05:57:59 UTC
It was also suggested to set UserComment and IPTC:SpecialInstructions to "For more information, see <url of file description page>".
Comment 4 Brion Vibber 2009-05-26 21:11:10 UTC
See also bug 3361 ("Image author, description, and copyright data saved in EXIF fields") and bug 657 ("Pull copyright metadata from files on upload") for related issues.
Comment 5 Brion Vibber 2009-06-22 22:16:09 UTC
Agreed, whitelisting metadata to pass through to resized output would be handy here...

PHP's exif support doesn't appear to include writing metadata, though, so we might need to add new code to copy the info or find a way to do it via ImageMagick. The same would be needed to implement bug 3361, which would add new metadata to files based on info from the wiki.
Comment 6 folengo 2009-08-17 09:46:36 UTC
A similar discussion (same request from flickr users to the flickr developers) took place at http://www.flickr.com/photos/x180/3196541234/

Comment 7 Derk-Jan Hartman 2009-08-17 11:23:59 UTC
See also bug 19791

(In reply to comment #2)
> Again (full URL) : 
> http://commons.wikimedia.org/wiki/COM:VP#Crediting_photographers_in_article_pages
> 

A patch for this is in bug 19791
Comment 8 Guillaume Paumier 2009-12-31 22:57:42 UTC

*** This bug has been marked as a duplicate of bug 19791 ***
Comment 9 folengo 2010-04-09 08:26:35 UTC
This bug has been marked as "RESOLVED FIXED", as a duplicate of bug 19791 .

However, I checked with file

http://upload.wikimedia.org/wikipedia/commons/thumb/f/fd/Glorious_First_of_June%2C_1_June_1794_%28Monument_to_the_Republic%29_2010-03-23_05.jpg/640px-Glorious_First_of_June%2C_1_June_1794_%28Monument_to_the_Republic%29_2010-03-23_05.jpg

which is a jpg file. And I don't see anything in the Exif of this 640px
thumbnail.

Perhaps the patch provided  at bug 19791 is wonderful for png files, but I am
afraid it does not solve the problem for jpg files.

May I know which field of the Exif metadata is supposed to be filled ?
Comment 10 Raimond Spekking 2010-04-09 08:33:36 UTC
(In reply to comment #9)

> Perhaps the patch provided  at bug 19791 is wonderful for png files, but I am
> afraid it does not solve the problem for jpg files.
> 
> May I know which field of the Exif metadata is supposed to be filled ?

The patch from bug 19791 added the URl of the file source (http://commons.wikimedia.org/wiki/File:Foo.jpg) into the comment field, not into EXIF. It's a kind of workaround until we can have EXIF in the thumbs.

Hint: Works only for thumbs generated from 2009-04-09, not for already cached thumbs.

Therefore the current bug should be furthermore keep open.
Comment 11 folengo 2010-04-09 16:12:29 UTC
I uploaded the file, and the corresponding thumbnail at

http://upload.wikimedia.org/wikipedia/commons/thumb/f/fd/Glorious_First_of_June%2C_1_June_1794_%28Monument_to_the_Republic%29_2010-03-23_05.jpg/640px-Glorious_First_of_June%2C_1_June_1794_%28Monument_to_the_Republic%29_2010-03-23_05.jpg

was generated in 2010. However I don't see anything in the "comment" field either.

My picture viewing software is [[IrfanView]], and I checked the "comment" button close to the "IPTC info" button, at the bottom in the "image information" menu. 

I aslo have the Fxlf extension installed on Firefox, and that detects nothing.
Comment 12 Raimond Spekking 2010-04-09 16:21:02 UTC
(In reply to comment #11)
> I uploaded the file, and the corresponding thumbnail at
> 
> http://upload.wikimedia.org/wikipedia/commons/thumb/f/fd/Glorious_First_of_June%2C_1_June_1794_%28Monument_to_the_Republic%29_2010-03-23_05.jpg/640px-Glorious_First_of_June%2C_1_June_1794_%28Monument_to_the_Republic%29_2010-03-23_05.jpg
> 
> was generated in 2010. However I don't see anything in the "comment" field
> either.
> 
> My picture viewing software is [[IrfanView]], and I checked the "comment"
> button close to the "IPTC info" button, at the bottom in the "image
> information" menu. 
> 
> I aslo have the Fxlf extension installed on Firefox, and that detects nothing.

As I wrote: Works only for thumbs generated since 2009-04-09, not for already cached thumbs. Yours was generated 2010-03-25.
Comment 13 folengo 2010-04-10 00:34:47 UTC
Sorry, perhaps my English reading capacity is not perfect, but according to Merriam Webster dictionary, http://www.merriam-webster.com/dictionary/since , "since" means "from a definite past time until now" and I would have thought that when you say "since 2009-04-09" you mean between April 2009 and now. Now we are in April 2010 aren't we ? My file was uploaded in March 2010. Isn't March 2010 somewhere between April 2009 and April 2010 ?

Is my file too old or too new ? 

Conversely, could you provide another example of existing file whose thumbnails are filled with the new comment ?
Comment 14 Bawolff (Brian Wolff) 2010-04-10 19:12:28 UTC
I just tried it on http://upload.wikimedia.org/wikipedia/commons/thumb/4/46/Torbj%C3%B8rn_R%C3%B8e_Isaksen_-_2010-04-10_at_11-08-52_(1).jpg/120px-Torbj%C3%B8rn_R%C3%B8e_Isaksen_-_2010-04-10_at_11-08-52_(1).jpg

Here's the comment field according to exiftool:

Comment: File source: http://commons.wikimedia.org/wiki/File:TorbjJPEG3%B8rn_RJPEG3%B8e_Isaksen_-_2010-04-10_at_11-08-52_(1).jpg

Thus it does put the url in it, on the downside it seems to totally screw over non-ascii characters, replacing %C with JPEG.

Is %C replaced with the file type somewhere along the line?
Comment 16 Raimond Spekking 2010-04-10 19:20:03 UTC
(In reply to comment #13)
> Sorry, perhaps my English reading capacity is not perfect, but according to
> Merriam Webster dictionary, http://www.merriam-webster.com/dictionary/since ,
> "since" means "from a definite past time until now" and I would have thought
> that when you say "since 2009-04-09" you mean between April 2009 and now. Now
> we are in April 2010 aren't we ? My file was uploaded in March 2010. Isn't
> March 2010 somewhere between April 2009 and April 2010 ?


My apologies. I wrote the wrong year :-( Please read it as "2010-04-09". On this date (yesterday) the software on the WMF servers were updated. And all thumbs generated from this date on have the comment.
Comment 17 Derk-Jan Hartman 2010-04-11 00:09:52 UTC
for the imagemagick issue, i have opened a new bug 23148
Comment 18 folengo 2010-04-11 14:19:44 UTC
In reply to comment #16 : you're welcome.

I think this new comment metadata is a great step forward. Too bad, though, that this metadata is less popular than Exif or IPTC (the Exif viewer add-on in Firefox can't display it https://addons.mozilla.org/fr/firefox/addon/3905 ).
Comment 19 Bawolff (Brian Wolff) 2012-11-22 23:55:13 UTC
>
>C) The message at the bottom of description pages (is it a [[Mediawiki:]]
>message ? I have not been able to find which) should be reworded with a less
>ambiguous wording. When a user reads "This file contains additional information
>(...) ", he remains clueless on whether that means the original file, or the
>resized 800px preview present on the description page, or both.

This part of the original bug has been ignored. Do you have an alternative sentence that would be more clear - its a very easy fix to change the message if one has an alternative to change it to.

-----
The comment feature seems recently to have stopped working. I filed bug 42368 for that.
Comment 20 Jean-Fred 2013-02-07 12:49:03 UTC
Bumping!

I agree with comment 18. The source in the comment is good (though I had never noticed it… neither Firefox nor Nautilus mentionned it − I had to open the file in GIMP), but it does not replace keeping some EXIF (the ones listed in comment 1 seem reasonable).
Comment 21 Lokal_Profil 2013-02-07 14:11:18 UTC
I just had a similar request from a museum who are planning a large image donation where they had spent a significant effort making sure that the metadata was also included in the exif tags.

For generating their own thumbnails they use a script which first generates the jpg thumbnail with imagemagick and then calls ExifTool to copy the exif data across with 
exiftool –all= -tagsfromfile SOURCE.TIF –all:all –overwrite_original TARGET.JPG

Would something similar be doable here? We might not want to copy all of the fields (or for all sizes of thumbnails) but the ones mentioned above would be good.
Comment 22 Bawolff (Brian Wolff) 2013-02-07 15:45:17 UTC
(In reply to comment #21)
> I just had a similar request from a museum who are planning a large image
> donation where they had spent a significant effort making sure that the
> metadata was also included in the exif tags.
> 
> For generating their own thumbnails they use a script which first generates
> the
> jpg thumbnail with imagemagick and then calls ExifTool to copy the exif data
> across with 
> exiftool –all= -tagsfromfile SOURCE.TIF –all:all –overwrite_original
> TARGET.JPG
> 
> Would something similar be doable here? We might not want to copy all of the
> fields (or for all sizes of thumbnails) but the ones mentioned above would be
> good.

post processing with either exiftool or exiv2 is a potential solution to this bug (And probably the best way forward. I don't think we want to write our own metadata writer). Would require making sure such a program is available on the server (which might already be the case).

ImageMagick also has -caption and -title options we can use without resorting to using another binary. It should probably be investigated what sort of metadata these options produce in the image (The docs are very vague, but testing that should be easy). -label Should also be investigated.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links