Last modified: 2013-03-26 11:25:29 UTC
Do a search for 'commodity' at en.wikipedia.org.
The first two results are:
The article for 'Commode' doesn't even have the word 'commodity' in it, but it's
rated at 100% relevancy (the page hasn't been edited for nearly a month).
'Commodity' should come up first and be rated at 100% relevancy.
Should this be filed against Lucene?
It looks like a stemming issue.
Fixed in Lucene Search 2. Original words are indexes alongside with stemmed words.
The queries are rewritten to have original words with higher boost, and stemmed words with lower boost. This ensures that the unstemmed words are preferred.
[Merging "MediaWiki extensions/Lucene Search" into "Wikimedia/lucene-search2", see bug 46542. You can filter bugmail for: search-component-merge-20130326 ]