Last modified: 2013-06-18 14:45:10 UTC

Wikimedia Bugzilla is closed!

Wikimedia has migrated from Bugzilla to Phabricator. Bug reports should be created and updated in Wikimedia Phabricator instead. Please create an account in Phabricator and add your Bugzilla email address to it.
Wikimedia Bugzilla is read-only. If you try to edit or create any bug report in Bugzilla you will be shown an intentional error message.
In order to access the Phabricator task corresponding to a Bugzilla report, just remove "static-" from its URL.
You could still run searches in Bugzilla or access your list of votes but bug reports will obviously not be up-to-date in Bugzilla.
Bug 12862 - Don't regenerate sitemap files if nothing changed
Don't regenerate sitemap files if nothing changed
Product: MediaWiki
Classification: Unclassified
Maintenance scripts (Other open bugs)
All All
: Low minor (vote)
: ---
Assigned To: Nobody - You can work on this!
Depends on:
  Show dependency treegraph
Reported: 2008-02-01 15:38 UTC by Sergey Chernyshev
Modified: 2013-06-18 14:45 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Description Sergey Chernyshev 2008-02-01 15:38:03 UTC
It's not necessary to regenerate sitemap files if it didn't actually change.

This might not make sense for Wikipedia where something changes for sure within a day, but might be quite useful for smaller sites.

One simple solution (considering current code structure) would be to generate files into temporary folder simultaneously calculating last modified timestamps and only moving new files in place of old ones if they actually have new entries (calculated timespamps are newer then old file's timestamps). Might also be a good idea to change file's timestamp to match calculated last modified timestamp.

Index sitemap should also be created using these last modified timestamps.
Comment 1 Dan Jacobson 2008-08-21 19:20:45 UTC
All I know is I use a cronjob to generate the sitemaps every month.

Well, indeed for small wikis, if no page at all in a namespace
changes, then that sitemap.gz for that namespace does not need to be

Anyway, Google etc. will still see the date of the individual files are the
same as last time.

So though the idea is worthy, the savings aren't very big.
Comment 2 Antoine "hashar" Musso (WMF) 2013-02-18 10:24:29 UTC
On a small wiki the sitemap is fast to produce, I don't think there is any point in saving a few CPU cycles by adding some more complexity.

Note You need to log in before you can comment on or make changes to this bug.