Last modified: 2012-03-16 08:23:21 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T30469, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 28469 - Make SVN Documentation be indexed by Google
Make SVN Documentation be indexed by Google
Status: RESOLVED FIXED
Product: Wikimedia
Classification: Unclassified
Subversion (Other open bugs)
unspecified
All All
: Normal enhancement with 1 vote (vote)
: ---
Assigned To: Antoine "hashar" Musso (WMF)
: platformeng
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2011-04-09 14:05 UTC by Sam Reed (reedy)
Modified: 2012-03-16 08:23 UTC (History)
5 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Sam Reed (reedy) 2011-04-09 14:05:56 UTC
http://www.google.pl/search?&q=site:svn.wikimedia.org/doc/

It's seemingly not, be nice to correct this
Comment 1 Bawolff (Brian Wolff) 2011-04-09 23:23:14 UTC
Appears thats true for all of svn.wikimedia.org ( http://svn.wikimedia.org/robots.txt ) Is there any particular reason to disallow google looking at viewvc? Its not like our source code is secret.
Comment 2 Sam Reed (reedy) 2011-04-09 23:24:07 UTC
Wonder if it's just to prevent them crawling ViewVC etc...
Comment 3 Krinkle 2011-04-11 13:44:20 UTC
Aside from all revisions of all files in viewvc being a problem (not sure if viewvc has implemented nofollow/noindex, we could fix via robots.txt on that path).

currently this /doc/ system uses frames, which end up ugly via search engines (ie. navigation missing)
Comment 4 Bugmeister Bot 2011-08-19 19:12:42 UTC
Unassigning default assignments. http://article.gmane.org/gmane.science.linguistics.wikipedia.technical/54734
Comment 5 Antoine "hashar" Musso (WMF) 2012-02-28 11:32:54 UTC
Looks like we have to edit /srv/org/wikimedia/svn/robots.txt on formey and add:

Allow: /doc/*

ccing Sam and Chad since they have access there.
Comment 6 Bawolff (Brian Wolff) 2012-02-28 15:40:30 UTC
Might make sense to allow indexing of http://svn.wikimedia.org/users.php while we're at it.
Comment 7 Sam Reed (reedy) 2012-02-28 20:28:08 UTC
(In reply to comment #5)
> Looks like we have to edit /srv/org/wikimedia/svn/robots.txt on formey and add:
> 
> Allow: /doc/*
> 
> ccing Sam and Chad since they have access there.

I unfortunately don't have svnadm, so Chad or Ops would need to deal with it

Somewhat suprised this isn't in puppet, oh well.
Comment 8 Antoine "hashar" Musso (WMF) 2012-02-29 14:22:42 UTC
robots.txt content should be:

User-Agent: *
Allow: /doc/*
Disallow: /
Comment 9 Chad H. 2012-02-29 14:27:52 UTC
(In reply to comment #6)
> Might make sense to allow indexing of http://svn.wikimedia.org/users.php while
> we're at it.

I suppose some people will want USERINFO moved over to git as well?
Comment 10 Antoine "hashar" Musso (WMF) 2012-02-29 14:53:11 UTC
Instead just disallow viewvc

https://gerrit.wikimedia.org/r/2888
Comment 11 Antoine "hashar" Musso (WMF) 2012-03-01 13:37:09 UTC
(In reply to comment #9)
> (In reply to comment #6)
> > Might make sense to allow indexing of http://svn.wikimedia.org/users.php while
> > we're at it.
> 
> I suppose some people will want USERINFO moved over to git as well?

Made that bug 34851.
Comment 12 Antoine "hashar" Musso (WMF) 2012-03-08 10:23:12 UTC
Changed deployed by ops http://svn.wikimedia.org/robots.txt show the new content:

---------------------------------------------------
# THIS FILE IS MANAGED BY PUPPET
#
# puppet:///files/svn/docroot/robots.txt
# https://svn.wikimedia.org/robots.txt
#
User-Agent: *
Allow: /doc/*
Disallow: /

---------------------------------------------------

Will have to wait for google to come around now.
Comment 13 Antoine "hashar" Musso (WMF) 2012-03-16 08:23:21 UTC
Google bot came on svn and index the doc content :-)

http://www.google.pl/search?&q=site:svn.wikimedia.org/doc/

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links