Last modified: 2004-09-03 19:41:12 UTC
BUG MIGRATED FROM SOURCEFORGE
http://sourceforge.net/tracker/index.php?func=detail&aid=957818&group_id=34373&atid=411192

Originally submitted by Roger Persson (rogper) 2004-05-21 07:00

By coincidence I noticed that the greater-than (>) character in URLs is rendered wrongly if it occurs as the last character of the URL.

Example:
Check this extra semicolon http://sample.link/<hello> in the end
Check this http://sample.link/<hello> strange thing

Result:
http://sample.link/<hello>;
http://sample.link/<hello>

------------------------- Additional comments ------------------------
Date: 2004-05-28 09:35
Sender: SF user vibber

The HTML output is:
http:// sample.link/<hello>;

It looks like the HTML stripping is being done before external links, so the < and > have become "&lt;" and "&gt;". Semicolons are actually legal in links; the _final_ punctuation (not followed by linkable chars) is stripped, but the bits in the middle are considered fair game for belonging to a link, so the link extends up to the "&gt" but does not include the final ";" (or the other ";" that follows, which is extraneous).

Correct behavior would be to have the link cover "http://sample.link/", then cut off at the <. This will require parsing for external links before stripping HTML; perhaps another placeholder step would be useful here (it might also help the longstanding URL-within-URL bug).

Bug is present in both 1.2 and current 1.3.

-------------------------------------------------
Date: 2004-09-03 05:07
Sender: SF user timwi

This bug has been migrated to MediaZilla:
http://bugzilla.wikipedia.org/show_bug.cgi?id=289
Please leave additional comments or attachments there.
*** This bug has been marked as a duplicate of 289 ***
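
The ordering proposed in the comment above (find external links first, park them behind placeholders, then escape HTML) can be sketched roughly as follows. This is an illustration in Python, not the actual parser code: the names URL_RE, TRAILING_PUNCT and render, the NUL-delimited placeholder format, and the exact set of characters that end a URL are all assumptions made for the example.

```python
import html
import re

# Assumed URL pattern: a scheme followed by any run of characters that are
# not whitespace, angle brackets, or double quotes. A raw "<" ends the URL.
URL_RE = re.compile(r'https?://[^\s<>"]+')

# Assumed set of punctuation that is dropped when it ends a URL.
TRAILING_PUNCT = ".,;:!?"


def render(text):
    """Link bare URLs in raw text, then HTML-escape everything else."""
    links = []
    # NUL bytes delimit placeholders, so remove any from the input first.
    text = text.replace("\x00", "")

    def stash(match):
        url = match.group(0)
        core = url.rstrip(TRAILING_PUNCT)  # final punctuation is not linked
        tail = url[len(core):]
        links.append(core)
        return "\x00%d\x00%s" % (len(links) - 1, tail)

    # Step 1: replace each link with a placeholder while the text is raw.
    with_placeholders = URL_RE.sub(stash, text)

    # Step 2: escape the remaining markup; placeholders pass through intact.
    escaped = html.escape(with_placeholders, quote=False)

    # Step 3: swap the placeholders for real anchors.
    def restore(match):
        url = html.escape(links[int(match.group(1))], quote=True)
        return '<a href="%s">%s</a>' % (url, url)

    return re.sub(r"\x00(\d+)\x00", restore, escaped)


print(render("Check this http://sample.link/<hello> strange thing"))
```

With this ordering the link covers only "http://sample.link/" and the "<hello>" that follows is escaped as ordinary text, which is the behavior described as correct above. Because the URL is matched before any entity is introduced, no ";" from an escaped entity can be mistaken for part of the link.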