Last modified: 2010-05-15 14:36:30 UTC

Bug 4461 - <span id="characters that should be encoded"> and [[#chara...]] breaks
Product: MediaWiki
Classification: Unclassified
Component: Parser
Hardware: All Linux
Importance: Normal blocker
Assigned To: Nobody - You can work on this!
Depends on:
Blocks: 4515
Reported: 2006-01-03 06:59 UTC by Ævar Arnfjörð Bjarmason
Modified: 2010-05-15 14:36 UTC (History)


Description Ævar Arnfjörð Bjarmason 2006-01-03 06:59:58 UTC
The HTML spec mandates that id="" values be encoded. We follow this in TOC
headers but not in ids specified manually, e.g. with <span>. As a result,
manually specified links within a page break if they contain characters that
should be encoded.

== multibæt ==
* <span id="bæt">byte</span>
* [[#bæt|backlink]]
The inline TOC links will work, but the manually specified backlink will not.

TOC links are encoded as a special case in the parser (see $canonized_headline);
this should be moved into a general encoding routine, e.g. in Sanitizer.
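
To illustrate the idea of a single shared encoding routine, here is a minimal Python sketch. It is not MediaWiki's actual code: the function name `escape_id` is hypothetical, and the scheme (spaces to underscores, then percent-encoding with `%` swapped for `.`) is only assumed to resemble what the parser does for headline anchors. The point is that the id attribute and the link fragment must pass through the same function, so they always match.

```python
from urllib.parse import quote


def escape_id(raw_id: str) -> str:
    """Hypothetical shared routine for encoding anchor ids.

    Assumed scheme (not confirmed by the bug report): spaces become
    underscores, other unsafe characters are percent-encoded as UTF-8
    bytes, and '%' is then replaced by '.'.
    """
    with_underscores = raw_id.strip().replace(" ", "_")
    # quote() percent-encodes everything except letters, digits, "_.-~"
    # and the characters listed in `safe` (here only the colon).
    percent_encoded = quote(with_underscores, safe=":")
    return percent_encoded.replace("%", ".")


# Both the manually specified id and the link pointing at it are
# encoded by the same function, so the fragment matches the anchor.
span_html = '<span id="{}">byte</span>'.format(escape_id("bæt"))
link_html = '<a href="#{}">backlink</a>'.format(escape_id("bæt"))
```

With one routine used in both places, a manually specified `<span id="bæt">` and a `[[#bæt|backlink]]` would resolve to the same encoded string, just as the TOC links already do.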
Comment 1 Brion Vibber 2006-01-03 21:00:03 UTC
Well, if you specify something manually, it's... manual...
Comment 2 Ævar Arnfjörð Bjarmason 2006-01-07 08:51:11 UTC

This bug has a parsertest called 'Sanitizer: Escaping of spaces, multibyte
characters, colons & other stuff in id=""'
