Last modified: 2009-08-06 19:58:15 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T21882, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 19882 - Case insensitivity in the search box does not function for Titles with Mixed Case
Status: RESOLVED FIXED
Product: MediaWiki
Classification: Unclassified
Component: Search
Version: unspecified
Hardware: All All
Importance: High normal
Target Milestone: ---
Assigned To: Brion Vibber
URL: http://en.wikipedia.org/wiki/WP:MIXED...
Depends on:
Blocks: 20097
Reported: 2009-07-22 19:56 UTC by xenocidic
Modified: 2009-08-06 19:58 UTC (History)

Description xenocidic 2009-07-22 19:56:57 UTC
Case insensitivity doesn't work for Titles with Mixed Case (see [[WP:MIXEDCAPS]]).

We had a bot creating thousands upon thousands of redirects from the lowercase versions of mixed-case article titles (see http://en.wikipedia.org/wiki/User:BOTijo).

This is obviously inferior to simply fixing case insensitivity in the search box so that it finds the mixed-case article automagically.

We've revoked the bot's authorization in hopes this bug can be fixed.
Comment 1 Brion Vibber 2009-07-24 16:28:09 UTC
"Go" searches for titles with mixed capitalization should have worked just fine on Wikipedia for the last year or two, thanks to the TitleKey extension; there's no need to create redirects for that.

Can you provide some sample searches that fail? We might have had a regression in functionality or a break in indexing.
Comment 2 xenocidic 2009-07-27 18:03:11 UTC
See any Page with Mixed Caps, e.g.

"Congolese Union of Republicans" (this is a new page I just found, might not be around when you get to this)

Type in search box: "congolese union of republicans" = Does not get you to the article.
Comment 3 Brad Jorsch 2009-08-06 12:17:21 UTC
(In reply to comment #1)
> "Go" searches to titles with mixed capitalization should work just fine on
> Wikipedia for the last year or two thanks to the TitleKey extension; there's no
> need to create redirects for that.

[[en:Special:Version]] doesn't list TitleKey at this time.
Comment 4 Robert Stojnic 2009-08-06 12:33:06 UTC
We are using lucene as a prefix backend on en.wp at the moment. 
Comment 5 Le Chat 2009-08-06 15:01:55 UTC
Has TitleKey recently been removed? I'm sure this used to work fine. Will it be restored?
Comment 6 Brad Jorsch 2009-08-06 16:19:18 UTC
(In reply to comment #4)
> We are using lucene as a prefix backend on en.wp at the moment. 

Lucene doesn't seem to use the SearchGetNearMatch hook, which AFAICT is what is needed to affect the "Go" button.
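
Editor's illustration of the near-match idea discussed above: a "Go" search can resolve a differently cased term if every title is also stored under a case-folded key. This is a minimal sketch in plain PHP, not TitleKey's actual code. The function names (titleKey, buildTitleIndex, nearMatch) and the in-memory array standing in for an index table are assumptions made for the example, and it assumes the mbstring extension is available.

```php
<?php
// Hypothetical sketch: these functions are illustrative, not MediaWiki or
// TitleKey APIs. An in-memory array stands in for a real index table.

/** Fold a title or search term to a case-insensitive lookup key. */
function titleKey(string $text): string {
    // mb_strtolower also folds non-ASCII letters; UTF-8 input is assumed.
    return mb_strtolower(trim($text), 'UTF-8');
}

/** Build a map of folded key => canonical title from existing page titles. */
function buildTitleIndex(array $titles): array {
    $index = [];
    foreach ($titles as $title) {
        $key = titleKey($title);
        // Titles differing only by case collide on one key; this sketch
        // keeps the first one seen. A real index needs a tie-break rule.
        if (!isset($index[$key])) {
            $index[$key] = $title;
        }
    }
    return $index;
}

/** Return the canonical title for a search term, or null if none matches. */
function nearMatch(array $index, string $term): ?string {
    return $index[titleKey($term)] ?? null;
}

$index = buildTitleIndex(['Congolese Union of Republicans', 'IPod']);
// Prints: Congolese Union of Republicans
echo nearMatch($index, 'congolese union of republicans'), "\n";
```

With an index like this, the lowercase redirects mentioned in the description become unnecessary, since the lookup itself ignores case. The index does have to be rebuilt or updated whenever pages are created or renamed.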
Comment 7 Robert Stojnic 2009-08-06 16:25:56 UTC
It would be good if they both could co-exist. lucene.php should be loaded after TitleKey in CommonSettings.php, and $wmgUseTitleKey = false should be removed from lucene.php. Also, the TitleKey index might need rebuilding to cover the past couple of months.
Comment 8 Brion Vibber 2009-08-06 16:26:48 UTC
I think I found our problem:

} elseif ( in_array( $wgDBname, array( 'enwiki' ) ) ) {
        # Big RAM pool 1, via LVS
        $wgLuceneHost = '10.2.1.11';
        $wgLuceneSearchVersion = 2.1;
        $wgEnableLucenePrefixSearch = true;
        $wgLucenePrefixHost = '10.0.3.8'; #search8
        $wmgUseTitleKey = false;
}

For some mysterious reason the Lucene configuration disables TitleKey on enwiki. Ouch! Removing this...
Comment 9 Brion Vibber 2009-08-06 16:30:10 UTC
Ok, TitleKey is reenabled and I'm rebuilding the index.
Comment 10 Brion Vibber 2009-08-06 17:40:56 UTC
Ok, me & Robert worked out the compat issue between TitleKey and MWSearch; should now be fixed with the adjustment from r54533.

TitleKey is still on to handle the "go" search, but no longer interferes with MWSearch's Lucene prefix search when it's enabled as long as we load them in the right order.
Comment 11 Brion Vibber 2009-08-06 18:56:26 UTC
Ok, TitleKey index rebuild is now done and we have the best of both worlds. :) Case-insensitive match on 'go' searches works and we have the more advanced drop-down ajax search with the Lucene backend.
