Last modified: 2014-11-17 10:36:45 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in and, except for displaying bug reports and their history, links might be broken. See T27624, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 25624 - Making license and author information api accessible
Product: MediaWiki
Classification: Unclassified
Component: File management
Hardware: All All
Importance: Normal enhancement with 2 votes
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Duplicates: 8298 24101 42407
Depends on: 26741 31257
Reported: 2010-10-23 15:11 UTC by Derk-Jan Hartman
Modified: 2014-11-17 10:36 UTC (History)

Description Derk-Jan Hartman 2010-10-23 15:11:37 UTC
There are several projects out there, from stockphoto to mwEmbed, that would really benefit from license and author information of images being accessible in some way in the database (and perhaps also media type [document, image, video, audio], date, artists, location, etc.).

1: licensekey -> licenseMessage
2: scraping the information from the existing messages.

My idea was to create a separate table, then introduce some new type of keyword or function and add it to the templates so that the information gets wrapped by it; whenever a description page is parsed, the table would then be updated with the new values. I think we'd want to continue to use our current license templates, so for that we need to create some key-value system that is probably a bit smarter than the current license template names, or perhaps we can create a table of all license templates that describes the basic nature of a license (pd, attribution required, share alike, cc-0). That would make searching easier, given the multitude of licenses that we have. Preferably, there should be some consistency in license keys among the different wiki projects, which would simplify searching multiple wiki file repos...
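
A minimal sketch of the "table of all license templates that describes the basic nature of a license" idea, written in Python for illustration. The license keys, the `LicenseTraits` fields and the `find_licenses` helper are all assumptions made up for this example; nothing like this is specified in the report.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LicenseTraits:
    """Basic nature of a license, independent of the template name."""
    public_domain: bool
    attribution_required: bool
    share_alike: bool


# Hypothetical, hand-written entries; a real table would be filled
# from the wiki's own license templates.
LICENSE_TRAITS = {
    "cc-zero": LicenseTraits(True, False, False),
    "pd-old": LicenseTraits(True, False, False),
    "cc-by-3.0": LicenseTraits(False, True, False),
    "cc-by-sa-3.0": LicenseTraits(False, True, True),
}


def find_licenses(**wanted):
    """Return the sorted license keys whose traits match every keyword given."""
    return sorted(
        key
        for key, traits in LICENSE_TRAITS.items()
        if all(getattr(traits, name) == value for name, value in wanted.items())
    )


print(find_licenses(share_alike=True))
print(find_licenses(public_domain=True))
```

Searching by trait instead of by template name is what would let the same query work across wikis, provided the keys are kept consistent between them.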
Comment 1 Bryan Tong Minh 2010-10-27 20:13:57 UTC
I have been thinking about this, and there are multiple options to indicate the author and license:

1) Introduce a new parser tag that will indicate the author, license etc. E.g. {{#author:Bryan}}
2) Create magic to allow extracting certain template parameters and storing them into a database. I'm thinking of course of the parameters of {{Information}}. I believe we discussed this last year in Paris.
3) Add a special page/dedicated interface for storing and changing the author and license information outside of the page text and get them with a specific parser function into the text.
4) Install SMW

An argument for 1) and 2) is that they easily fit into our current system. We just need to change the {{Information}} template in 1) to get the data propagated to the database. As for 2) there would be no action required at all (except rerendering all pages).
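
To illustrate what option 2) would involve, here is a rough Python sketch that pulls the named parameters out of an {{Information}} call in wikitext. It is a naive scanner written for this example only, not MediaWiki's parser: `extract_template_params` is a hypothetical helper, and it only tracks `{{ }}` and `[[ ]]` nesting so that pipes inside links or nested templates do not split a parameter.

```python
import re


def extract_template_params(wikitext, template_name):
    """Return the named parameters of the first {{template_name ...}} call."""
    match = re.search(r"\{\{\s*" + re.escape(template_name), wikitext, re.IGNORECASE)
    if match is None:
        return {}
    depth = 0
    parts = []      # finished top-level segments: template name, then parameters
    current = []    # characters of the segment being read
    i = match.start()
    while i < len(wikitext):
        pair = wikitext[i:i + 2]
        if pair in ("{{", "[["):
            depth += 1
            if depth > 1:           # keep nested openers as part of the value
                current.append(pair)
            i += 2
        elif pair in ("}}", "]]"):
            depth -= 1
            if depth == 0:          # end of the outer template call
                parts.append("".join(current))
                break
            current.append(pair)
            i += 2
        elif wikitext[i] == "|" and depth == 1:
            parts.append("".join(current))
            current = []
            i += 1
        else:
            current.append(wikitext[i])
            i += 1
    params = {}
    for part in parts[1:]:          # parts[0] is the template name itself
        if "=" in part:
            key, value = part.split("=", 1)
            params[key.strip().lower()] = value.strip()
    return params


sample = (
    "{{Information\n"
    "|Description=A canal in [[Amsterdam|the city]]\n"
    "|Author={{User:Example/credit}}\n"
    "|Date=2010-10-23\n"
    "}}\n"
    "[[Category:Canals]]"
)
print(extract_template_params(sample, "Information"))
```

Note that the extracted values are still raw wikitext (the author here is itself a template call), which is one reason storing them usefully takes more than just re-rendering the pages.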

A point that was previously raised was that license and author data are actually not associated with page text, but with the image itself. Therefore 3) would make sense, as we are linking the meta data to the image and not to the page text. The downside is that we need to migrate from the old to the new system.

I don't know how and if SMW can solve this problem, but I have heard that it can. I don't think though that we will get SMW in the near- to mid-term future, so 4) can be removed.

An important requirement to think about when implementing this feature is that images can have multiple authors and licenses, and all of them need to be saved.
Another point to think about is whether we also want to store source and date, and other properties. We just need to watch out that we are not rewriting SMW for this purpose though.
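
The multiple-authors-and-licenses requirement suggests one row per value rather than one column per property. A small sketch of that, using Python's built-in sqlite3 with an in-memory database; the table name `file_props`, its columns and the helper functions are assumptions for illustration and not an actual MediaWiki schema.

```python
import sqlite3


def make_store():
    """Create an in-memory store with one row per (file, property, value)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE file_props ("
        " file_name TEXT NOT NULL,"
        " prop_name TEXT NOT NULL,"
        " prop_value TEXT NOT NULL,"
        " PRIMARY KEY (file_name, prop_name, prop_value))"
    )
    return conn


def set_props(conn, file_name, props):
    """Replace all stored properties of one file, as a reparse would."""
    with conn:  # commit both statements together
        conn.execute("DELETE FROM file_props WHERE file_name = ?", (file_name,))
        conn.executemany(
            "INSERT INTO file_props VALUES (?, ?, ?)",
            [
                (file_name, name, value)
                for name, values in props.items()
                for value in values
            ],
        )


def get_props(conn, file_name, prop_name):
    """Return all values of one property for one file, sorted."""
    rows = conn.execute(
        "SELECT prop_value FROM file_props"
        " WHERE file_name = ? AND prop_name = ? ORDER BY prop_value",
        (file_name, prop_name),
    )
    return [row[0] for row in rows]


conn = make_store()
set_props(conn, "Example.jpg", {
    "author": ["Alice", "Bob"],
    "license": ["cc-by-sa-3.0", "gfdl"],
})
print(get_props(conn, "Example.jpg", "license"))
```

Because properties are plain name/value rows, adding source or date later needs no schema change, which is also where the risk of drifting towards a general property store like SMW comes from.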
Comment 2 Krinkle 2010-10-29 12:37:27 UTC
Number 3 sounds good to me as well. Just like the pagename/filename isn't inside the wikitext, things like author and license information (and categories too actually, but that's a different story) make sense to be stored separately.

Another advantage of keeping it outside of wikitext is that the history of the page text will stay cleaner, and changing or renaming a license is a lot easier since they'd be stored as IDs instead of template links.
I'd guess things like licenses would then be stored in the License:-namespace or in [MediaWiki:License-<id>-descr] and [MediaWiki:License-<id>-content], which may or may not contain a template.
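
As a rough Python illustration of the ID-based idea: files would store only a license ID, and the display text would be looked up from the message pages named above. The `license_message_titles` helper, the example ID `cc-by-sa-3.0` and the file names are assumptions for this sketch.

```python
def license_message_titles(license_id):
    """Return the message page titles that would hold a license's texts."""
    return {
        "descr": f"MediaWiki:License-{license_id}-descr",
        "content": f"MediaWiki:License-{license_id}-content",
    }


# Hypothetical stored data: files reference license IDs, never template names.
FILE_LICENSES = {
    "Example.jpg": ["cc-by-sa-3.0"],
    "Other.png": ["cc-by-sa-3.0"],
}

for file_name, license_ids in FILE_LICENSES.items():
    for license_id in license_ids:
        print(file_name, license_message_titles(license_id)["descr"])
```

Rewording a license would then mean editing the message pages once, without touching the stored ID or the wikitext of any file page.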

This is also related to the UsabilityInitiative, I think. One of their ideas (not sure if it's chosen as "the plan") was to separate this out, turning the license into a dropdown when editing (just like when uploading), so that licenses not in the dropdown can't be used (i.e. they need to be added to it first, which makes sense).
Comment 3 Krinkle 2010-12-07 15:06:42 UTC
I did a little brainstorming last weekend on the details of a few things.

Wrote it up here:

Comment 4 Bryan Tong Minh 2011-01-14 20:45:51 UTC
Krinkle, Roan and I have been discussing this here in Amsterdam; we have made some changes to the page mentioned in comment #3 and are implementing this in the license-work branch.

I'll assign this to myself, but feel free to work on it.
Comment 5 Brion Vibber 2011-11-29 21:22:25 UTC
*** Bug 24101 has been marked as a duplicate of this bug. ***
Comment 6 Krinkle 2011-12-30 19:32:24 UTC
*** Bug 8298 has been marked as a duplicate of this bug. ***
Comment 7 Brad Jorsch 2012-11-25 23:23:47 UTC
*** Bug 42407 has been marked as a duplicate of this bug. ***
Comment 8 Jean-Fred 2014-01-21 13:36:36 UTC
With the deployment of the CommonsMetadata extension <>, license and authors are available through the API (though they are not “accessible in some way in the database”). This could be closed as fixed then?
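
For reference, a hedged Python sketch of reading license and author through the API once CommonsMetadata is deployed. It assumes the `extmetadata` value of `iiprop` in `prop=imageinfo` and the `LicenseShortName` and `Artist` fields; the sample response below is hand-written to mimic the response shape and is not real output, and both function names are made up for this example.

```python
from urllib.parse import urlencode

API_ENDPOINT = "https://commons.wikimedia.org/w/api.php"


def build_extmetadata_url(file_title):
    """Build a query URL asking for the extended metadata of one file."""
    params = {
        "action": "query",
        "prop": "imageinfo",
        "iiprop": "extmetadata",
        "titles": file_title,
        "format": "json",
    }
    return API_ENDPOINT + "?" + urlencode(params)


def license_and_author(response):
    """Map each page title in a decoded JSON response to its license and author."""
    result = {}
    for page in response.get("query", {}).get("pages", {}).values():
        for info in page.get("imageinfo", []):
            meta = info.get("extmetadata", {})
            result[page.get("title")] = {
                "license": meta.get("LicenseShortName", {}).get("value"),
                "author": meta.get("Artist", {}).get("value"),
            }
    return result


# Hand-written sample in the assumed response shape (illustrative values only).
sample_response = {
    "query": {
        "pages": {
            "1": {
                "title": "File:Example.jpg",
                "imageinfo": [
                    {
                        "extmetadata": {
                            "LicenseShortName": {"value": "CC BY-SA 3.0"},
                            "Artist": {"value": "Example author"},
                        }
                    }
                ]
            }
        }
    }
}

print(build_extmetadata_url("File:Example.jpg"))
print(license_and_author(sample_response))
```

The values come back as rendered text scraped from the description page, so this covers API access but not the structured database storage discussed earlier in this report.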
Comment 9 Andre Klapper 2014-02-26 16:46:39 UTC
(In reply to Jean-Fred from comment #8)
> With the deployment of the CommonsMetadata extension
> <>, license and
> authors are available through the API (though they are not “accessible in
> some way in the database”). This could be closed as fixed then?

Good catch.
I'm assuming this covers the request and am setting WORKSFORME. Please comment/reopen and elaborate on what is missing if CommonsMetadata does not cover this request.
