Last modified: 2013-10-29 05:09:23 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T27623, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 25623 - automatic unicode conversion for Malayalam makes it difficult to link to external sites using old unicode sequences; want a tag to supress conversion
automatic unicode conversion for Malayalam makes it difficult to link to exte...
Status: RESOLVED WONTFIX
Product: MediaWiki
Classification: Unclassified
Internationalization (Other open bugs)
unspecified
All All
: Normal enhancement (vote)
: ---
Assigned To: Nobody - You can work on this!
: i18n
Depends on:
Blocks: 56295
  Show dependency treegraph
 
Reported: 2010-10-23 14:39 UTC by praveenp
Modified: 2013-10-29 05:09 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description praveenp 2010-10-23 14:39:43 UTC
Unicode equivalence fix for Language Malayalam (https://bugzilla.wikimedia.org/show_bug.cgi?id=22371) helps a lot in internal linking and searching. But it cause some problem in external linking. For example Kerala Governments encyclopedia project "Sarvavijnjana kosham" uses mixed versions of unicode and so sometimes linking there get not work properly. If possible please include a feature similar to <nowiki> to suppress automatic unicode version conversion.
Comment 1 Bawolff (Brian Wolff) 2010-10-23 18:22:17 UTC
(In reply to comment #0)
> Unicode equivalence fix for Language Malayalam
> (https://bugzilla.wikimedia.org/show_bug.cgi?id=22371) helps a lot in internal
> linking and searching. But it cause some problem in external linking. For
> example Kerala Governments encyclopedia project "Sarvavijnjana kosham" uses
> mixed versions of unicode and so sometimes linking there get not work properly.
> If possible please include a feature similar to <nowiki> to suppress automatic
> unicode version conversion.

This is not a fix, but as a work around you can try % encoding your urls. For example a url that contained the old way of writing (hopefully i get these characters right) MALAYALAM LETTER CHILLU LL  (aka U+D33 U+D4D U+200D): http://example.com/ള്‍ Can be written in the wiki as http://example.com/%E0%B4%B3%E0%B5%8D%E2%80%8D and that should not be automatically converted to the new form (aka 'MALAYALAM LETTER CHILLU LL' (U+0D7E) which has the % encoding form of http://example.com/E0%B5%BE )


Note, I'm changing the component from Database to internationalization as I believe that internationalization is a better component for unicode normalization issues.
Comment 2 Niklas Laxström 2013-05-29 17:49:53 UTC
I will recommend using the url encoding workaround which works universally.

The fix for this bug would be very complicated and even have security implications.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links