Last modified: 2011-12-22 01:05:08 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T18880, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 16880 - Interwiki links should not be handled like local/static pages
Interwiki links should not be handled like local/static pages
Status: NEW
Product: MediaWiki extensions
Classification: Unclassified
DumpHTML (Other open bugs)
unspecified
All All
: Normal enhancement (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2009-01-04 12:49 UTC by Kelson [Emmanuel Engelhart]
Modified: 2011-12-22 01:05 UTC (History)
1 user (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Kelson [Emmanuel Engelhart] 2009-01-04 12:49:08 UTC
Interwiki links are handled like local links and not like external links. Consequently, the HTML static pages belong dead local URLs.

What I do:
I run dumpHTML.php on the "Wikipedia" article (http://en.wikipedia.org/wiki/Wikipedia).

What I get:
The interwikis issue from the "Template:Wikipedia" (http://en.wikipedia.org/wiki/Template:Wikipedias), included in the "Wikipedia" article, are represented rewritten local URLs :

-------
(...)
<tr style="">
<td class="navbox-group" style="">750,000+</td>
<td style="border-left: 2px solid rgb(253, 253, 253); padding: 0px; text-align: left; width: 100%;" class="navbox-list navbox-even">
<div style="padding: 0em 0.25em;"><span style="white-space: nowrap;"><a href="../../../../articles/g/e/r/German_Wikipedia_58de.html" title="German Wikipedia">German</a> <a href="../../../../../de/index.html" class="extiw" title="de:">de:</a></span></div>
</td>
</tr>
(...)
-------

What I want:

-------
(...)
</tr>
<tr style="">
<td class="navbox-group" style="">750,000+</td>
<td style="border-left: 2px solid rgb(253, 253, 253); padding: 0px; text-align: left; width: 100%;" class="navbox-list navbox-even">
<div style="padding: 0em 0.25em;"><span style="white-space: nowrap;"><a href="../../../../articles/g/e/r/German_Wikipedia_58de.html" title="German Wikipedia">German</a> <a href="http://de.wikipedia.org" class="extiw" title="de:">de:</a></span></div>
</td>
</tr>
(...)
-------

The difference:
* In the first case : <a href="../../../../../de/index.html" class="extiw" title="de:">de:</a>
* In the second case : <a href="http://de.wikipedia.org" class="extiw" title="de:">de:</a>

Remark:
The issue comes IMO from the "GetFullURL" hook in dumpHTML.inc, removing it (and the useless onGetFullURL()) seems to resolve the issue.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links