Last modified: 2012-03-16 08:23:21 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T30469, the corresponding Phabricator task, for complete and up-to-date bug report information.

Bug 28469 - Make SVN Documentation be indexed by Google


Summary:	Make SVN Documentation be indexed by Google

Status:	RESOLVED FIXED

Product:	Wikimedia
Classification:	Unclassified
Component:	Subversion
Version:	unspecified
Hardware:	All All

Importance:	Normal enhancement with 1 vote (vote)
Target Milestone:	---
Assigned To:	Antoine "hashar" Musso (WMF)

URL:
Whiteboard:
Keywords:	platformeng

Depends on:
Blocks:

Reported:	2011-04-09 14:05 UTC by Sam Reed (reedy)
Modified:	2012-03-16 08:23 UTC (History)
CC List:	5 users (show)

See Also:
Web browser:	---
Mobile Platform:	---
Assignee Huggle Beta Tester:	---


Description Sam Reed (reedy) 2011-04-09 14:05:56 UTC

http://www.google.pl/search?&q=site:svn.wikimedia.org/doc/

It's seemingly not indexed; it would be nice to correct this.

Comment 1 Bawolff (Brian Wolff) 2011-04-09 23:23:14 UTC

Appears that's true for all of svn.wikimedia.org ( http://svn.wikimedia.org/robots.txt ). Is there any particular reason to disallow Google from looking at ViewVC? It's not like our source code is secret.

Comment 2 Sam Reed (reedy) 2011-04-09 23:24:07 UTC

Wonder if it's just to prevent them crawling ViewVC etc...

Comment 3 Krinkle 2011-04-11 13:44:20 UTC

Aside from all revisions of all files in ViewVC being a problem (not sure if ViewVC has implemented nofollow/noindex; we could fix that via robots.txt on that path), the /doc/ system currently uses frames, which end up ugly via search engines (i.e. navigation missing).

Comment 4 Bugmeister Bot 2011-08-19 19:12:42 UTC

Unassigning default assignments. http://article.gmane.org/gmane.science.linguistics.wikipedia.technical/54734

Comment 5 Antoine "hashar" Musso (WMF) 2012-02-28 11:32:54 UTC

Looks like we have to edit /srv/org/wikimedia/svn/robots.txt on formey and add:

Allow: /doc/*

ccing Sam and Chad since they have access there.

Comment 6 Bawolff (Brian Wolff) 2012-02-28 15:40:30 UTC

Might make sense to allow indexing of http://svn.wikimedia.org/users.php while we're at it.

Comment 7 Sam Reed (reedy) 2012-02-28 20:28:08 UTC

(In reply to comment #5)
> Looks like we have to edit /srv/org/wikimedia/svn/robots.txt on formey and add:
> 
> Allow: /doc/*
> 
> ccing Sam and Chad since they have access there.

I unfortunately don't have svnadm, so Chad or Ops would need to deal with it

Somewhat surprised this isn't in puppet, oh well.

Comment 8 Antoine "hashar" Musso (WMF) 2012-02-29 14:22:42 UTC

robots.txt content should be:

User-Agent: *
Allow: /doc/*
Disallow: /
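
To see how a crawler might read the three lines above, here is a minimal Python sketch. It assumes Google-style matching: `*` in a path is a wildcard, the longest matching pattern wins, and `Allow` wins a tie. The helper names (`parse_rules`, `pattern_to_regex`, `is_allowed`) and the sample paths other than /users.php are hypothetical, chosen only for illustration; real crawlers may differ in details.

```python
import re

# The robots.txt content proposed in the comment above.
ROBOTS_TXT = """\
User-Agent: *
Allow: /doc/*
Disallow: /
"""


def parse_rules(robots_txt):
    """Collect (directive, pattern) pairs from Allow/Disallow lines.

    Simplification: assumes a single 'User-Agent: *' group, as in the
    file above, so User-Agent lines are simply skipped.
    """
    rules = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        if field.lower() in ("allow", "disallow") and value:
            rules.append((field.lower(), value))
    return rules


def pattern_to_regex(pattern):
    """'*' matches any run of characters; a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Longest matching pattern wins; on a tie Allow wins; no match means allowed."""
    best = None  # (pattern length, is_allow)
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    rules = parse_rules(ROBOTS_TXT)
    # Hypothetical sample paths, except /users.php which the thread mentions.
    for path in ("/doc/html/index.html", "/viewvc/mediawiki/", "/users.php"):
        print(path, "allowed" if is_allowed(rules, path) else "blocked")
```

Under these assumptions only paths below /doc/ are crawlable; everything else, including ViewVC and /users.php, stays blocked. The standard library's `urllib.robotparser` is deliberately not used here, since it may not interpret `*` as a wildcard.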

Comment 9 Chad H. 2012-02-29 14:27:52 UTC

(In reply to comment #6)
> Might make sense to allow indexing of http://svn.wikimedia.org/users.php while
> we're at it.

I suppose some people will want USERINFO moved over to git as well?

Comment 10 Antoine "hashar" Musso (WMF) 2012-02-29 14:53:11 UTC

Instead just disallow viewvc

https://gerrit.wikimedia.org/r/2888

Comment 11 Antoine "hashar" Musso (WMF) 2012-03-01 13:37:09 UTC

(In reply to comment #9)
> (In reply to comment #6)
> > Might make sense to allow indexing of http://svn.wikimedia.org/users.php while
> > we're at it.
> 
> I suppose some people will want USERINFO moved over to git as well?

Made that bug 34851.

Comment 12 Antoine "hashar" Musso (WMF) 2012-03-08 10:23:12 UTC

Change deployed by ops. http://svn.wikimedia.org/robots.txt shows the new content:

---------------------------------------------------
# THIS FILE IS MANAGED BY PUPPET
#
# puppet:///files/svn/docroot/robots.txt
# https://svn.wikimedia.org/robots.txt
#
User-Agent: *
Allow: /doc/*
Disallow: /

---------------------------------------------------

Will have to wait for Google to come around now.

Comment 13 Antoine "hashar" Musso (WMF) 2012-03-16 08:23:21 UTC

Googlebot came to svn and indexed the doc content :-)

http://www.google.pl/search?&q=site:svn.wikimedia.org/doc/
