Last modified: 2012-10-29 18:09:48 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T35127, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 33127 - Double-clicking a word selects several non-Latin words
Status: RESOLVED FIXED
Product: VisualEditor
Classification: Unclassified
Component: General
Version: unspecified
Hardware: All All
Importance: Normal normal
Target Milestone: VE-deploy-2012-10-15
Assigned To: Nobody - You can work on this!
Keywords: i18n
Depends on:
Blocks: ve-multi-lingual 33128 33146
Reported: 2011-12-14 19:53 UTC by Amir E. Aharoni
Modified: 2012-10-29 18:09 UTC (History)
CC List: 8 users

Description Amir E. Aharoni 2011-12-14 19:53:58 UTC
In most web browsers and text editors, double-clicking a word selects just that word. In the Visual Editor this works for Latin words, but not for words in other scripts. I tried:

* Russian: Мой дядя самых честных правил
* Devanagari: कोणीही घडवू शकेल असा हा मुक्त ज्ञानकोश आहे
* Georgian: რომლის განმარტებითაც იგი ბავშვების

In all cases double-clicking a word selected the whole sentence.

I didn't check the code, but my guess is that the word boundary algorithm only understands the Latin script.
Comment 1 Amir E. Aharoni 2011-12-14 19:58:31 UTC
Bug 33128 may be related.
Comment 2 Trevor Parscal 2011-12-14 21:15:12 UTC
The word-boundary detection is done using a regular expression; currently it's:

/([ \-\t\r\n\f])/g

We will need this to be more sophisticated for different scripts, and also need some help from people more familiar with how word breaks can be programmatically detected in these scripts.
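
As a rough illustration of what "more sophisticated" could look like, the sketch below finds the word around a given offset using Unicode property escapes instead of a fixed list of separator characters. It is not VisualEditor's code: the name `wordRangeAt` and the choice of letters, combining marks and digits as word characters are assumptions made for this example, and it does nothing useful for scripts written without spaces between words.

```javascript
// Hypothetical helper, not part of VisualEditor.
// Treats runs of letters (\p{L}), combining marks (\p{M}) and digits (\p{N})
// as words; every other character counts as a boundary.
const WORD = /[\p{L}\p{M}\p{N}]+/gu;

// Returns { start, end, word } for the word containing `offset`
// (a UTF-16 code unit index), or null if the offset is not inside a word.
function wordRangeAt(text, offset) {
  for (const match of text.matchAll(WORD)) {
    const start = match.index;
    const end = start + match[0].length;
    if (offset >= start && offset < end) {
      return { start, end, word: match[0] };
    }
    if (start > offset) break; // matches arrive in order, so stop early
  }
  return null;
}

// Sample strings taken from the bug description.
console.log(wordRangeAt('Мой дядя самых честных правил', 5));
console.log(wordRangeAt('कोणीही घडवू शकेल', 8));
console.log(wordRangeAt('რომლის განმარტებითაც იგი', 0));
```

Offset 5 in the Russian sample falls inside "дядя", so only that word is returned rather than the whole sentence. Including `\p{M}` matters for Devanagari, where vowel signs are combining marks and would otherwise split a word.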
Comment 3 Amir E. Aharoni 2011-12-14 21:18:08 UTC
I am curious - is it really impossible to use the host browser's word-boundary algorithm? Or better yet, to use the host browser's behavior for double-clicking a word?

Adding Santhosh, who may have advanced knowledge about word boundaries.
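
One way to lean on the host environment's own word-boundary rules, in the spirit of the question above, is the `Intl.Segmenter` API found in current JavaScript engines. It did not exist when this bug was filed, so this is only a sketch: `hostWordAt` is a hypothetical name, and nothing here reflects how VisualEditor handles double-clicks.

```javascript
// Hypothetical helper; assumes the engine provides Intl.Segmenter.
function hostWordAt(text, offset, locale) {
  const segmenter = new Intl.Segmenter(locale, { granularity: 'word' });
  // containing() returns the segment covering the given code unit index,
  // or undefined when the index is out of range.
  const seg = segmenter.segment(text).containing(offset);
  if (!seg || !seg.isWordLike) return null; // whitespace or punctuation
  return {
    start: seg.index,
    end: seg.index + seg.segment.length,
    word: seg.segment,
  };
}

// Sample strings taken from the bug description.
console.log(hostWordAt('Мой дядя самых честных правил', 5, 'ru'));
console.log(hostWordAt('რომლის განმარტებითაც იგი', 0, 'ka'));
```

For scripts that do not separate words with spaces, such as Chinese, the result depends on the segmentation data the engine ships, so it can differ between environments.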
Comment 4 Liangent 2011-12-15 04:48:47 UTC
(In reply to comment #2)
> We will need this to be more sophisticated for different scripts, and also need
> some help from people more familiar with how word breaks can be
> programmatically detected in these scripts.

I wish this could be implemented for Chinese as well (just a wish). Few programs can do this, since there is no clear boundary between words, though some exist, including open-source ones.
Comment 5 James Forrester 2012-06-22 18:57:25 UTC
Fixed in the new version of VE by using the native implementation (which is much easier than re-implementing it, as discussed!).
Comment 6 James Forrester 2012-10-15 22:57:12 UTC
Mass-moving old VisualEditor tickets to the VE product. Search for this message to mass-delete bugmail.
Comment 7 James Forrester 2012-10-29 18:09:48 UTC
Noting bugs closed in the 2012-10-15 release.
