Last modified: 2014-02-07 22:35:52 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T27404, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 25404 - lucene search for simple text misses some results
lucene search for simple text misses some results
Status: RESOLVED WONTFIX
Product: Wikimedia
Classification: Unclassified
lucene-search-2 (Other open bugs)
unspecified
All All
: High major (vote)
: ---
Assigned To: Rob Lanphier
http://el.wiktionary.org
:
Depends on: 28605
Blocks:
  Show dependency treegraph
 
Reported: 2010-10-03 18:39 UTC by Ariel T. Glenn
Modified: 2014-02-07 22:35 UTC (History)
6 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Ariel T. Glenn 2010-10-03 18:39:09 UTC
I searched for the word πράγμα on el.wiktionary.org (no other preferences, main namespace, no punctuation or anything).  It returned a list of very few results, not including for example 
http://el.wiktionary.org/wiki/%CE%B8%CE%B7%CF%83%CE%B1%CF%85%CF%81%CF%8C%CF%82
(θησαυρός) which contains the word in the 6th definition: 
ένα πρόσωπο ή πράγμα που διαθέτει σε μεγάλο βαθμό κάτι πολύτιμο 

Note that this is not part of a template parameter or anything, it's just plain text.  It's not even in bold or italics or anything.

Also the history of the page indicates that the word has been in the article for over three years, so it can't be an "indices haven't been rebuilt since then" sort of issue. 

Any thoughts?  

I ran into a similar issue on officewiki a while back and dismissed it then as a fluke; I now don't think it was.
Comment 1 Robert Stojnic 2010-10-03 18:50:18 UTC
The page is up-to-date in the index, and the word is not in a template, so in theory it should work. 

Must be some kind of article parsing issue.
Comment 2 Mark A. Hershberger 2011-04-14 19:02:24 UTC
Just reproduced.
Comment 3 Mark A. Hershberger 2011-04-14 19:18:01 UTC
See also Bug 25586
Comment 4 Mark A. Hershberger 2011-04-22 04:14:59 UTC
seems to stil be occurring after update was run (bug #28605).  Could you confirm, Ariel?
Comment 5 Mark A. Hershberger 2011-06-16 16:31:09 UTC
It is still happening ....
Comment 6 Andre Klapper 2013-03-26 11:20:07 UTC
[Merging "MediaWiki extensions/Lucene Search" into "Wikimedia/lucene-search2", see bug 46542. You can filter bugmail for: search-component-merge-20130326 ]
Comment 7 Andre Klapper 2013-05-17 12:55:58 UTC
Bug is still valid.

Going to http://el.wiktionary.org/w/index.php?search= , entering πράγμα into the search field, making the results display ALL results, θησαυρός is missing, and http://el.wiktionary.org/wiki/%CE%B8%CE%B7%CF%83%CE%B1%CF%85%CF%81%CF%8C%CF%82 still includes πράγμα in the sixth definition.
Comment 8 Dan Garry 2014-02-07 22:35:52 UTC
This bug really shouldn't exist in CirrusSearch, as it has much better support for languages other than English. As Lucene is reaching the end of its life and we'll soon be migrating fully over to CirrusSearch, I'm changing this to RESOLVED WONTFIX.

If this bug does still exist in CirrusSearch, feel free to re-file it with the new test case under MediaWiki Extensions -> CirrusSearch.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links