Last modified: 2013-03-25 15:02:58 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from the display of bug reports and their history, links might be broken. See T18742, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 16742 - Different Priorities in Sitemap
Status: NEW
Product: MediaWiki
Classification: Unclassified
Component: Maintenance scripts
Version: 1.13.x
Hardware: All All
Importance: Low enhancement with 1 vote
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Duplicates: 17019
Depends on:
Blocks:
Reported: 2008-12-21 12:32 UTC by DaSch
Modified: 2013-03-25 15:02 UTC (History)
5 users (show)

Description DaSch 2008-12-21 12:32:17 UTC
When using generateSitemap, all pages in one namespace get the same priority. I think it would be useful to change this, because Google is complaining about it and it's not really good for other bots using the sitemap.

So I think the priority should depend on the last time the page was touched. Pages that were changed a short time ago should have a higher priority than pages that haven't been edited for a long time.
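
The idea above can be sketched as a small function that maps a page's last-touched time to a sitemap priority. This is only an illustration, not code from generateSitemap: the function name, the 30-day half-life and the 0.1 floor are assumptions chosen for the example. The sitemap protocol itself only requires that the priority lie between 0.0 and 1.0.

```python
from datetime import datetime, timedelta, timezone


def priority_from_age(last_touched, now, half_life_days=30.0, floor=0.1):
    """Map a page's last-touched time to a sitemap priority in [floor, 1.0].

    Hypothetical helper: the priority halves every `half_life_days`
    (an assumed value) and never drops below `floor`.
    """
    # Age in days; timestamps in the future are treated as "just edited".
    age_days = max((now - last_touched).total_seconds() / 86400.0, 0.0)
    decayed = 0.5 ** (age_days / half_life_days)
    # Sitemap priorities are conventionally written with one decimal place.
    return round(max(floor, decayed), 1)


if __name__ == "__main__":
    now = datetime(2009, 1, 1, tzinfo=timezone.utc)
    for days in (0, 30, 365):
        print(days, priority_from_age(now - timedelta(days=days), now))
```

With a scheme like this, pages in the same namespace no longer share one value, which is what the report asks for. How fast the priority should decay is a tuning choice.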
Comment 1 Chad H. 2009-02-11 20:14:03 UTC
*** Bug 17019 has been marked as a duplicate of this bug. ***
Comment 2 Subfader 2009-02-12 19:22:36 UTC
Agreed. Maybe include page views as a factor, although generating the sitemap shouldn't cause too much DB load. I run the sitemap script every 2 hours to feed Google :) Well, I have ~20 new articles per day though.

For me it only happens in the main namespace, and I fear Google ignores the complete XML if a warning is printed.
I submitted the following namespaces to google:
MAIN 21051 (with warnings)
CATEGORIES 6833
IMAGES 2040
HELP 24
PROJECT 17

Maybe give a page 1.0 if there was a lot of editing activity lately, independently of the page views.

But to get rid of the warnings, first of all it might be good to set high priorities for recently changed articles.

http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=71936
Comment 3 DaSch 2009-02-13 01:32:25 UTC
Well, I update the sitemap only once a day, at night. Google does not scan that often, I think.

From my point of view, any solution with different priorities would be good. But I think once there is one factor that is used to generate different priorities, it's not that difficult to implement others.
Comment 4 DaSch 2009-03-05 15:36:00 UTC
I made some changes to generateSitemap and added a random priority generator, the best solution I could create myself:

http://www.mediawiki.org/wiki/User:DaSch/generateSitemap.php
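
The linked script is not reproduced here. As a rough illustration of what a random priority generator could look like, here is a minimal sketch; the function names, the 0.1 to 1.0 range and the example URL are assumptions for the example, not taken from that script.

```python
import random
from xml.sax.saxutils import escape


def random_priority(rng, low=0.1, high=1.0):
    """Return a random sitemap priority in [low, high], one decimal place.

    Hypothetical helper; the range is an assumed choice.
    """
    return round(rng.uniform(low, high), 1)


def url_entry(loc, lastmod, priority):
    """Build one <url> element of a sitemap as a string."""
    return (
        "<url>"
        f"<loc>{escape(loc)}</loc>"
        f"<lastmod>{lastmod}</lastmod>"
        f"<priority>{priority:.1f}</priority>"
        "</url>"
    )


if __name__ == "__main__":
    rng = random.Random()
    print(url_entry("http://example.org/wiki/Main_Page",
                    "2009-03-05", random_priority(rng)))
```

Random values vary the priorities within a namespace but carry no information about which pages matter, so this is a stopgap compared with priorities derived from edit time or page views.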
Comment 5 Subfader 2009-03-06 20:01:08 UTC
This works fine for now. Warnings are gone after first download of the new sitemap.
Comment 6 DaSch 2009-03-06 21:07:17 UTC
Thx :)
