Last modified: 2012-12-30 21:11:22 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T42821, the corresponding Phabricator task for complete and up-to-date bug report information.

Bug 40821 - PostgreSQL searches do not treat Unicode full width characters as their normal counterparts


Summary:	PostgreSQL searches do not treat Unicode full width characters as their norma...

Status:	NEW

Product:	MediaWiki
Classification:	Unclassified
Component:	Search (Other open bugs)
Version:	1.21.x
Hardware:	All All

Importance:	Normal enhancement (vote)
Target Milestone:	---
Assigned To:	Nobody - You can work on this!

URL:
Whiteboard:
Keywords:

Depends on:
Blocks:	postgres
	Show dependency tree / graph

Reported:	2012-10-06 17:06 UTC by Tim Landscheidt
Modified:	2012-12-30 21:11 UTC (History)
CC List:	2 users (show)

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Description Tim Landscheidt 2012-10-06 17:06:25 UTC

The search engines for MySQL and SQLite treat "ＡＺ" (that's #xff21 and #xff3a) as "AZ" (cf. [[Halfwidth and fullwidth forms]]), PostgreSQL does not and thus fails testFullWidth().

One idea would be to TRANSLATE() them in ts2_page_text() and ts2_page_title() and use a similar technique in SearchPostgres::parseQuery().  If so, we need to describe in the release notes how to regenerate the tsvectors after an update or detect if ts2_page_text() or ts2_page_title() has changed and then regenerate them ourselves (I prefer the former).

Of course, another imaginable approach would be try to push this normalization into a text search configuration for to_tsvector(), but I don't know whether this is even possible.

Wikimedia Bugzilla is closed!

Search

Personal tools

Navigation

Links