Last modified: 2010-05-15 14:36:30 UTC
The HTML spec mandates that id="" be encoded, we follow this in TOC headers but not in id's specified e.g. with <span>, as a result manually specified links within a page break if they contain characters that should be encoded. Testcase: """ __FORCETOC__ == multibæt == * <span id="bæt">byte</span> * [[#bæt|backlink]] """ The inline TOC links will work but not the manually specified backlink TOC links are encoded as a special case in the parser (see $canonized_headline), this needs to be put into some general encoding routine in Sanitizer or something.
Well, if you specify something manually, it's... manual...
FIXED in CVS HEAD This bug has a parsertest called 'Sanitizer: Escaping of spaces, multibyte characters, colons & other stuff in id=""'