Last modified: 2012-10-29 18:09:48 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T35127, the corresponding Phabricator task for complete and up-to-date bug report information.

Bug 33127 - Double-clicking a word selects several non-Latin words


Summary:	Double-clicking a word selects several non-Latin words

Status:	RESOLVED FIXED

Product:	VisualEditor
Classification:	Unclassified
Component:	General (Other open bugs)
Version:	unspecified
Hardware:	All All

Importance:	Normal normal
Target Milestone:	VE-deploy-2012-10-15
Assigned To:	Nobody - You can work on this!

URL:
Whiteboard:
Keywords:	i18n

Depends on:
Blocks:	ve-multi-lingual 33128 33146
	Show dependency tree / graph

Reported:	2011-12-14 19:53 UTC by Amir E. Aharoni
Modified:	2012-10-29 18:09 UTC (History)
CC List:	8 users (show)

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Description Amir E. Aharoni 2011-12-14 19:53:58 UTC

In most web browsers and text editors double-clicking a word usually selects one word. In the Visual Editor this works for Latin words, but not for words in other scripts - i tried:

* Russian: Мой дядя самых честных правил
* Devanagari: कोणीही घडवू शकेल असा हा मुक्त ज्ञानकोश आहे
* Georgian: რომლის განმარტებითაც იგი ბავშვების

In all cases double-clicking a word selected the whole sentence.

I didn't check the code, but my guess is that the word boundary algorithm only understand the Latin script.

Comment 1 Amir E. Aharoni 2011-12-14 19:58:31 UTC

Bug 33128 may be related.

Comment 2 Trevor Parscal 2011-12-14 21:15:12 UTC

The word-boundary detection is done using a regular expression, currently it's:

/([ \-\t\r\n\f])/g

We will need this to be more sophisticated for different scripts, and also need some help from people more familiar with how word breaks can be programmatically detected in these scripts.

Comment 3 Amir E. Aharoni 2011-12-14 21:18:08 UTC

I am curious - is it really impossible to use the host browser's word-boundary algorithm? Or better yet, to use the host browser's behavior for double-clicking a word?

Adding Santhosh, who may have advanced knowledge about word boundaries.

Comment 4 Liangent 2011-12-15 04:48:47 UTC

(In reply to comment #2)
> We will need this to be more sophisticated for different scripts, and also need
> some help from people more familiar with how word breaks can be
> programmatically detected in these scripts.

I wish that could be implemented on Chinese as well (just wish). Few softwares (but there're still some even open-source ones) can do this since there's no clear boundary.

Comment 5 James Forrester 2012-06-22 18:57:25 UTC

Fixed in new version of VE through using native implementation (which is much easier than re-implementing, as discussed!).

Comment 6 James Forrester 2012-10-15 22:57:12 UTC

Mass-moving old VisualEditor tickets to the VE product. Search for this message to mass-delete bugmail.

Comment 7 James Forrester 2012-10-29 18:09:48 UTC

Noting bugs closed in the 2012-10-15 release.

Note You need to log in before you can comment on or make changes to this bug.

Wikimedia Bugzilla is closed!

Search

Personal tools

Navigation

Links