Last modified: 2013-03-26 11:24:48 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T15792, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 13792 - Deleted items contaminating search results
Status: RESOLVED FIXED
Product: Wikimedia
Classification: Unclassified
Component: lucene-search-2
Version: unspecified
Hardware: All All
Importance: Normal normal
Target Milestone: ---
Assigned To: Nobody - You can work on this!
URL: http://en.wikipedia.org/w/api.php?act...
 
Reported: 2008-04-19 13:04 UTC by MER-C
Modified: 2013-03-26 11:24 UTC

Description MER-C 2008-04-19 13:04:23 UTC
On opening the above URL (you might have to reload the page a few times to get the text to appear - this might be another bug) you'll find that the last five results, being:

[[Image:3 accelerators 17-59. PILCURE-MBT,MBTS,ZMBT,F,CBS,NS,MOR,DCBS,TMT,ZDMC,ZDC,ZDBC,SDBC,ZDBzC-.pdf]]

[[Image:Minnesota Educational Computing Consortium Quick Reference Guide for BASIC Language Version 3.1 MECC TIMESHARE SYSTEM Rev. 2 slash 78.pdf]]

[[Image:Partial unilateral ureteropelvic obstruction in neonatal pigs - Effect of acute inhibition of angiotensin II AT1-receptors on GFR and sodium handling.pdf]]

[[Image:Strategy for improving genetic aspects of fertility and hatchability in breeding lines of White Leghorns, and choosing hens for second cycle of production.pdf]]

[[Image:Buddha's teachings in a NUTSHELL(Explains why Buddha did not answer Questions pertaining to eternal God, NON-SOUL theory (anatta) and his basic teachings.pdf]]

... as well as various others, are deleted and have been so for quite a long time. Deleted items should not appear in search results. 

The user interface search (URL: http://en.wikipedia.org/w/index.php?title=Special:Search&limit=500&offset=7000&ns6=1&redirs=1&search=.pdf ; once again, you might have to reload multiple times) is somewhat better behaved in that it doesn't show these deleted items, but when you view the page source you find comments such as:

<!-- missing page Image:3 accelerators 17-59. PILCURE-MBT,MBTS,ZMBT,F,CBS,NS,MOR,DCBS,TMT,ZDMC,ZDC,ZDBC,SDBC,ZDBzC-.pdf-->
<!-- missing page Image:Minnesota Educational Computing Consortium Quick Reference Guide for BASIC Language Version 3.1 MECC TIMESHARE SYSTEM Rev. 2 slash 78.pdf-->
<!-- missing page Image:Partial unilateral ureteropelvic obstruction in neonatal pigs - Effect of acute inhibition of angiotensin II AT1-receptors on GFR and sodium handling.pdf-->
<!-- missing page Image:Strategy for improving genetic aspects of fertility and hatchability in breeding lines of White Leghorns, and choosing hens for second cycle of production.pdf-->
<!-- missing page Image:Buddha's teachings in a NUTSHELL(Explains why Buddha did not answer Questions pertaining to eternal God, NON-SOUL theory (anatta) and his basic teachings.pdf-->
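
For anyone wanting to count how many stale entries a results page hides, here is a minimal sketch that pulls the titles out of comments of the form quoted above. It assumes the page source has already been fetched into a string; the function name `missing_titles` and the sample titles are made up for illustration and are not part of MediaWiki.

```python
import re
from typing import List

# Matches the HTML comments quoted above: "<!-- missing page TITLE-->".
# The title is captured lazily, so hyphens inside a title are kept.
MISSING_PAGE = re.compile(r"<!--\s*missing page (.*?)\s*-->", re.DOTALL)


def missing_titles(page_source: str) -> List[str]:
    """Return the titles named in 'missing page' comments, in page order."""
    return [m.group(1).strip() for m in MISSING_PAGE.finditer(page_source)]


# Hypothetical page source: one live result and two hidden stale entries.
sample = (
    "<li>Some live result</li>\n"
    "<!-- missing page Image:Example one.pdf-->\n"
    "<!-- missing page Image:Example two - draft-.pdf-->\n"
)
print(missing_titles(sample))
```

Each title returned this way is an entry that is still in the search index but no longer has a page behind it.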
Comment 1 MER-C 2008-04-19 13:09:03 UTC
Stupid long PDF names. The files are:

http://en.wikipedia.org/wiki/Image:3 accelerators 17-59.PILCURE-MBT,MBTS,ZMBT,F,CBS,NS,MOR,DCBS,TMT,ZDMC,ZDC,ZDBC,SDBC,ZDBzC-.pdf
http://en.wikipedia.org/wiki/Image:Minnesota Educational Computing Consortium Quick Reference Guide for BASIC Language Version 3.1 MECC TIMESHARE SYSTEM Rev. 2 slash 78.pdf
http://en.wikipedia.org/wiki/Image:Partial unilateral ureteropelvic obstruction in neonatal pigs - Effect of acute inhibition of angiotensin II AT1-receptors on GFR and sodium handling.pdf
http://en.wikipedia.org/wiki/Image:Strategy for improving genetic aspects of fertility and hatchability in breeding lines of White Leghorns, and choosing hens for second cycle of production.pdf
http://en.wikipedia.org/wiki/Image:Buddha's teachings in a NUTSHELL(Explains why Buddha did not answer Questions pertaining to eternal God, NON-SOUL theory (anatta) and his basic teachings.pdf
Comment 2 MER-C 2008-04-19 13:15:24 UTC
Disregard the above comment, Bugzilla is being annoying. Someone better give the Bugzilla devs a kick: https://bugzilla.mozilla.org/show_bug.cgi?id=40896 .
Comment 3 Robert Stojnic 2008-04-19 13:33:36 UTC
This has been fixed with r32742, so newly deleted files won't show up in search results. However, since this is an old bug, the search index is full of old entries and needs a rebuild. We will shortly be updating the whole search backend, which should fix this fully.
Comment 4 Bryan Tong Minh 2008-04-19 14:29:26 UTC
Sorry for the previous mail, forgot to click the assign option.

Assigning to self.
Comment 5 Bryan Tong Minh 2008-04-19 17:26:08 UTC
Fixed in r33608. Broken titles are now silently skipped in API search results.
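
To illustrate the idea of silently skipping (this is not the code from r33608, which is not shown here), a rough sketch follows. It assumes a "broken" hit is one whose title can no longer be resolved to a usable page, and it leaves that check to a caller-supplied predicate. The names `skip_unusable`, `live_pages` and `stale_hits` are hypothetical.

```python
from typing import Callable, Iterable, List


def skip_unusable(hits: Iterable[str], is_usable: Callable[[str], bool]) -> List[str]:
    """Keep only the hits the predicate accepts; drop the rest without raising."""
    return [title for title in hits if is_usable(title)]


# Hypothetical data standing in for a stale search index and the live page set.
live_pages = {"Image:Kept.pdf"}
stale_hits = ["Image:Kept.pdf", "Image:Deleted long ago.pdf"]

print(skip_unusable(stale_hits, live_pages.__contains__))
```

Filtering at output time hides stale entries from the caller but does not remove them from the index, so a page of results can come back shorter than the requested limit until the index is rebuilt.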
Comment 6 Andre Klapper 2013-03-26 11:24:48 UTC
[Merging "MediaWiki extensions/Lucene Search" into "Wikimedia/lucene-search2", see bug 46542. You can filter bugmail for: search-component-merge-20130326 ]
