Last modified: 2010-05-15 14:36:30 UTC
The HTML spec mandates that id="" be encoded, we follow this in TOC headers but
not in id's specified e.g. with <span>, as a result manually specified links
within a page break if they contain characters that should be encoded.
== multibæt ==
* <span id="bæt">byte</span>
The inline TOC links will work but not the manually specified backlink
TOC links are encoded as a special case in the parser (see $canonized_headline),
this needs to be put into some general encoding routine in Sanitizer or something.
Well, if you specify something manually, it's... manual...
FIXED in CVS HEAD
This bug has a parsertest called 'Sanitizer: Escaping of spaces, multibyte
characters, colons & other stuff in id=""'