Last modified: 2006-05-03 17:23:57 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T7256, the corresponding Phabricator task for complete and up-to-date bug report information.

Bug 5256 - Magic words (predefined templates) {{UTF-8DECODE:}} {{UTF-8ENCODE:}} (optional {{UTF-8ENCODE8:}} {{UTF-8ENCODE16:}})


Summary:	Magic words (predefined templates) {{UTF-8DECODE:}} {{UTF-8ENCODE:}} (optiona...

Status:	RESOLVED INVALID

Product:	MediaWiki
Classification:	Unclassified
Component:	Parser (Other open bugs)
Version:	unspecified
Hardware:	All All

Importance:	Normal enhancement (vote)
Target Milestone:	---
Assigned To:	Nobody - You can work on this!

URL:	http://epov.org/wd-gemet/index.php?ti...
Whiteboard:
Keywords:

Depends on:
Blocks:
	Show dependency tree / graph

Reported:	2006-03-15 11:33 UTC by lɛʁi לערי ריינהארט
Modified:	2006-05-03 17:23 UTC (History)
CC List:	1 user (show)

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Description lɛʁi לערי ריינהארט 2006-03-15 11:33:41 UTC

Hallo!

There is a lot of code in &html_entity; &#nnnn; &#xnnn; %xx%yy%zz notation
around there.
There are arguments for using either UTF-8 characters or &html_entity; &#nnnn;
&#xnnn; %xx%yy%zz notation depending on available fonts, objectives (traying to
read / debug / trace the code) etc.

*request*
New prediefined templates {{UTF-8DECODE:}} {{UTF-8ENCODE:}} should act like
{{SUBST:}} and change the page source. {{SUBSTUTF-8DECODE:}}
{{SUBSTUTF-8ENCODE:}} would be "horrible" magic words.

*reason*
Such endoding decoding is very painfull. With preview, copy and paste characters
can be lost. UTF-8DECODE requires external tools and copying %xx%yy%zz notation
from the browsers url should belong to the past.

In the example from the url
{{UTF-8DECODE:&#1502;&#1513;&#1514;&#1502;&#1513; &#1494;&#1492;
&#1491;&#1493;&#1489;&#1512; '''[[:Category:User
he|&#1506;&#1489;&#1512;&#1497;&#1514;]]''' &#1499;&#1502;&#1506;&#1496;
&#1499;'''[[:Category:User he-4|&#1513;&#1508;&#1514; &#1488;&#1501;]]'''.}}
would give
משתמש זה דובר '''[[:Category:User he|עברית]]''' כמעט כ'''[[:Category:User
he-4|שפת אם]]'''.
which displays here in MediaZilla according to the bidirectional algorithm.

{{UTF-8ENCODE:}} should do the oposite. It might be that a encoding using HTLM
entities should be supported for better readability (by default or as an
option). Then UTF-8ENCODE should encode non-entities either in decimal notation
or in hexadecimal notation.

Please comment if variants {{UTF-8ENCODE8:}} {{UTF-8ENCODE16:}}
{{UTF-8ENCODEENTITY:}} are required or not.

best regards reinhardt [[user:gangleri]]

Comment 1 Brion Vibber 2006-03-15 20:26:10 UTC

What in the hell?

Comment 2 lɛʁi לערי ריינהארט 2006-03-24 16:32:41 UTC

I assume that for {{UTF-8DECODE:}}

& should never be decoded as &amp;
*but*
&amp; should be escaped as &amp;amp;

Wikimedia Bugzilla is closed!

Search

Personal tools

Navigation

Links