Last modified: 2013-10-29 21:12:39 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T22173, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 20173 - Lucene Search update script fails while downloading DTD
Lucene Search update script fails while downloading DTD
Status: RESOLVED WONTFIX
Product: Wikimedia
Classification: Unclassified
lucene-search-2 (Other open bugs)
unspecified
All Linux
: Normal major (vote)
: ---
Assigned To: Nobody - You can work on this!
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2009-08-11 08:42 UTC by Jeen
Modified: 2013-10-29 21:12 UTC (History)
4 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Jeen 2009-08-11 08:42:53 UTC
I am experiencing a problem with the Lucene Search (2.1) update script. This is a major issue as it means my search index does not get updated at all.

The environment is a Linux 2.6.x system running Java 1.6.0_14-b08, MySQL 5.0.45, mediawiki 1.13.2.

I get the following error message:

java.io.IOException: Server returned HTTP response code: 503 for URL: http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd
        at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1305)
        at org.apache.xerces.impl.XMLEntityManager.setupCurrentEntity(Unknown Source)
        at org.apache.xerces.impl.XMLEntityManager.startEntity(Unknown Source)
        at org.apache.xerces.impl.XMLEntityManager.startDTDEntity(Unknown Source)
        at org.apache.xerces.impl.XMLDTDScannerImpl.setInputSource(Unknown Source)
        at org.apache.xerces.impl.XMLDocumentScannerImpl$DTDDispatcher.dispatch(Unknown Source)
        at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
        at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
        at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
        at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
        at org.apache.xerces.jaxp.SAXParserImpl.parse(Unknown Source)
        at javax.xml.parsers.SAXParser.parse(SAXParser.java:198)
        at org.wikimedia.lsearch.oai.OAIParser.parse(OAIParser.java:64)
        at org.wikimedia.lsearch.oai.OAIHarvester.read(OAIHarvester.java:64)
        at org.wikimedia.lsearch.oai.OAIHarvester.getRecords(OAIHarvester.java:44)
        at org.wikimedia.lsearch.oai.IncrementalUpdater.main(IncrementalUpdater.java:191)
555  [main] WARN  org.wikimedia.lsearch.oai.IncrementalUpdater  - Retry later: error while processing update for wikidb : Server returned HTTP response code: 503 for URL: http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd

When retrieving this same DTD using wget on the same machine, however, it succeeds downloading without a problem.
Comment 1 Bugmeister Bot 2011-08-19 19:12:40 UTC
Unassigning default assignments. http://article.gmane.org/gmane.science.linguistics.wikipedia.technical/54734
Comment 2 Andre Klapper 2013-03-26 11:20:24 UTC
[Merging "MediaWiki extensions/Lucene Search" into "Wikimedia/lucene-search2", see bug 46542. You can filter bugmail for: search-component-merge-20130326 ]
Comment 3 Chad H. 2013-10-29 18:59:20 UTC
lsearchd has reached its end of life and will not be improved further, marking this WONTFIX as a result.
Comment 4 Jeen 2013-10-29 20:06:18 UTC
So, what's the alternative then?
Comment 5 Chad H. 2013-10-29 21:11:30 UTC
If you're looking for a Lucene-based search for MediaWiki, I suggest taking a look at the new CirrusSearch extension we're working on. It's backed by Elasticsearch, rather than our home-grown lsearchd.
Comment 6 Jeen 2013-10-29 21:12:39 UTC
Thanks, will do.(In reply to comment #5)
> If you're looking for a Lucene-based search for MediaWiki, I suggest taking a
> look at the new CirrusSearch extension we're working on. It's backed by
> Elasticsearch, rather than our home-grown lsearchd.

Thanks, will do.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links