Last modified: 2012-12-19 08:32:24 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T23767, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 21767 - Non-printable characters (e.g. Unicode control characters) in wikitext page source
Non-printable characters (e.g. Unicode control characters) in wikitext page s...
Status: NEW
Product: MediaWiki
Classification: Unclassified
Page editing (Other open bugs)
unspecified
All All
: Low normal with 1 vote (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2009-12-06 00:59 UTC by Wolfram Schmied
Modified: 2012-12-19 08:32 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Wolfram Schmied 2009-12-06 00:59:09 UTC
See

http://en.wikisource.org/w/index.php?title=Index_talk:1965_FBI_monograph_on_Nation_of_Islam.djvu&oldid=1678252#Non-printables_in_OCR_output

I don't know how much of a security concern that is, but I think DELs and somesuch do not belong in source code, and should be stripped automatically.
Comment 1 Dan Wolff 2012-12-19 08:32:24 UTC
Yeah, it seems like invisible characters are allowed in wikitext for no good reason. E.g. here[1] I'm cleaning up a left-over LTR character, since my script gave weird results because of it.

[1] https://sv.wiktionary.org/w/index.php?title=papperstidning&diff=1496883&oldid=1035418

See also Bug 3696.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links