Last modified: 2013-09-29 16:19:28 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T3378, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 1378 - add DOI URI support
add DOI URI support
Status: RESOLVED WONTFIX
Product: MediaWiki
Classification: Unclassified
Parser (Other open bugs)
unspecified
All All
: Low enhancement with 2 votes (vote)
: Future release
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks: 29473
  Show dependency treegraph
 
Reported: 2005-01-21 17:22 UTC by mark_sweep
Modified: 2013-09-29 16:19 UTC (History)
7 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description mark_sweep 2005-01-21 17:22:03 UTC
Consider adding support for Digital Object Identifiers in wikitext.  This would
be done by supporting the "doi:" URI scheme in addition to "http:", which is
already treated specially.

For example, a string "doi:10.1000/186" in wikitext would be transformed into
"<a href="http://dx.doi.org/10.1000/186">doi:10.1000/186</a>" in HTML.

A few WP articles use DOIs in references -- e.g.
[http://en.wikipedia.org/wiki/Kurtosis] -- by using standard external links.

Here is an overview of the DOI URI representation:
[http://www.doi.org/factsheets/DOIIdentifierSpecs.html]

The specification of DOI names is here:
[http://www.doi.org/handbook_2000/appendix_1.html#A1-4]. As far as specs go,
this one is a bit vague.  I'm not sure I understand how to recognize the end of
a DOI.  It says a DOI consists of "legal *graphic* characters of Unicode 2.0 or
greater" (emphasis added).  Not sure if the graphic characters include
whitespace, or even if "graphic character" is a meaningful term defined in
Unicode.  I've never seen DOIs with embedded whitespace.  Also, a DOI might have
to be URL encoded when it's converted to a http: URI.  I don't have any test
cases for that, sorry.
Comment 1 T. Gries 2005-01-22 17:18:36 UTC
(In reply to comment #0)
> Consider adding support for Digital Object Identifiers in wikitext.  This would
> be done by supporting the "doi:" URI scheme in addition to "http:", which is
> already treated specially.
> 
> For example, a string "doi:10.1000/186" in wikitext would be transformed into
> "<a href="http://dx.doi.org/10.1000/186">doi:10.1000/186</a>" in HTML.
> 
> A few WP articles use DOIs in references -- e.g.
> [http://en.wikipedia.org/wiki/Kurtosis] -- by using standard external links.
> 
> Here is an overview of the DOI URI representation:
> [http://www.doi.org/factsheets/DOIIdentifierSpecs.html]
> 
> The specification of DOI names is here:
> [http://www.doi.org/handbook_2000/appendix_1.html#A1-4]. As far as specs go,
> this one is a bit vague.  I'm not sure I understand how to recognize the end of
> a DOI.  It says a DOI consists of "legal *graphic* characters of Unicode 2.0 or
> greater" (emphasis added).  Not sure if the graphic characters include
> whitespace, or even if "graphic character" is a meaningful term defined in
> Unicode.  I've never seen DOIs with embedded whitespace.  Also, a DOI might have
> to be URL encoded when it's converted to a http: URI.  I don't have any test
> cases for that, sorry.

I like your idea, It is something from the ideas of Interwiki, TinyURl and
Magic-ISBN number and could be implemented. I took over to implement this,
because I am interested in this.
Comment 2 Antoine "hashar" Musso (WMF) 2005-01-22 17:22:04 UTC
Just add it to the ./maintenance/interwiki.sql . This way
someone can just [[doi:10.1000/186]].
Comment 3 mark_sweep 2005-01-22 17:31:55 UTC
No, not "[[doi:10.1000/186]]" -- it's not an internal link.  Simply
"doi:10.1000/186" and also perhaps "[doi:10.1000/186 title]", just like "http:"
is treated specially.  In fact, you can match on "doi:10." to detect the
beginning of a DOI.
Comment 4 Zigger 2005-01-22 17:36:02 UTC
en.wikipedia already has  http://en.wikipedia.org/wiki/Template:Doi , but there
are few uses of it.

See 
http://en.wikipedia.org/wiki/Wikipedia:Template_messages/Links#External_links 
for other linkages using templates.
Comment 5 T. Gries 2005-01-22 17:41:04 UTC
(In reply to comment #4)
> en.wikipedia already has  http://en.wikipedia.org/wiki/Template:Doi , but there
> are few uses of it.
> 
> See 
> http://en.wikipedia.org/wiki/Wikipedia:Template_messages/Links#External_links 
> for other linkages using templates.

I want to 
(In reply to comment #4)
> en.wikipedia already has  http://en.wikipedia.org/wiki/Template:Doi , but there
> are few uses of it.
> 
> See 
> http://en.wikipedia.org/wiki/Wikipedia:Template_messages/Links#External_links 
> for other linkages using templates.

I was unclear.

I wanted to propose to have "magic "DOI <number> such we already have for ISBN
<number> , so that a magic link is created if someone write DOI 10.1000/186 in
the text. Not everyone knows yet of these identifiers, mainly scientists,
engineers and people working in research fields.
Comment 6 Rowan Collins [IMSoP] 2005-08-28 18:38:07 UTC
If there is an existing URI scheme for these, no horrible magic like ISBN and
RFC (which break all sorts) is needed, nor even any hacking of the code - the
URI prefix can just be added to $wgUrlProtocols, as bug 431 has been fixed.
However, see bug 3133, comment 5 for a set of criteria I think need considering
before adding a new link-prefix to either the default or the Wikimedia
configuration.

[Re-assigning to Product="Wikimedia web sites" as this is now a configuration,
rather than coding, issue]
Comment 7 Dario Taraborelli 2012-06-02 11:42:18 UTC
A typical DOI has this form:

doi:10.1371/journal.pone.0006872

Using a colon as a separator is the canonical way of representing DOIs. A space (or a colon followed by a space) is more rare and not supported AFAIK by any citation style, see for instance Chicago: http://bit.ly/K4INu6 or APA: http://www.loyola.edu/library/ref/apastyle.htm#journal

Also , DOI prefixes always start with a "10." string and to minimize performance issues they could be parsed only within <ref> tags.
Comment 8 Nemo 2012-07-26 13:13:22 UTC
(In reply to comment #2)
> Just add it to the ./maintenance/interwiki.sql . This way
> someone can just [[doi:10.1000/186]].

This has been done in the meanwhile, FYI (on Wikimedia projects).
Comment 9 Brion Vibber 2013-09-29 16:19:28 UTC
Going through old bugs...

I'm going to go ahead and close this out -- interwikis and templates do a better job at this than URL links, as inserting direct <a href>s pointing at 'doi:something' would be useless for 99% of users.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links