Last modified: 2008-09-13 00:48:23 UTC
Have received vague reports of nostalgia.wikipedia.org showing up unexpectedly in regular Google search results. (This holds a copy of Wikipedia's database from early 2002, displayed in the old-style 'Nostalgia' skin, and was put up for one of Wikipedia's anniversary celebrations a few years ago.) The nostalgia site appears to be served out of the primary document root, so it gets the regular robots.txt. We should possibly give it a custom docroot with a robots.txt containing a blanket Disallow, which would phase it out of general web search indexes.
We should just redirect robots.txt to extract2.php and have them edited via the web.
Hmmm, sounds kind of scary but would probably work fine. :)
Will robots follow redirects for robots.txt? I guess we should proxy the request to extract2.php.
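A minimal sketch of the internal-proxy approach suggested above, assuming Apache with mod_rewrite and mod_proxy enabled; the rewrite target path and query parameter are illustrative, not the actual production configuration:

```apache
# Hypothetical sketch: serve /robots.txt by proxying internally to
# extract2.php rather than issuing an HTTP redirect, since crawlers
# are not guaranteed to follow redirects for robots.txt.
RewriteEngine On
# [P] proxies the request internally; the client still sees /robots.txt.
RewriteRule ^/robots\.txt$ /w/extract2.php?title=Robots.txt [P,L]
```

With this in place, the file can be edited via the web (as a wiki page rendered by extract2.php) while crawlers keep fetching a plain /robots.txt URL.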
Done. The nostalgia site's robots.txt now contains:

User-agent: *
Disallow: /