Last modified: 2012-12-03 17:44:28 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T27564, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 25564 - Page view anomaly starting September 27-28
Page view anomaly starting September 27-28
Status: RESOLVED FIXED
Product: Datasets
Classification: Unclassified
Webstatscollector (Other open bugs)
unspecified
All All
: Normal enhancement (vote)
: ---
Assigned To: Ryan Kaldari
http://lists.wikimedia.org/pipermail/...
:
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2010-10-18 19:38 UTC by Rob Lanphier
Modified: 2012-12-03 17:44 UTC (History)
11 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Rob Lanphier 2010-10-18 19:38:56 UTC
A number of people have noted a problem with the page view stats here, starting September 27 or so:
http://stats.wikimedia.org/EN/TablesPageViewsMonthly.htm

Several people at WMF are investigating.  This is our public tracking area for this.
Comment 1 Rob Lanphier 2010-10-18 19:42:55 UTC
This issue is currently being tracked here:
http://rt.wikimedia.org/Ticket/Display.html?id=328

(though there's no particular reason why this is confidential, so we may just move here.)

Here's some comments from our previous discussion:

Nimish: "We've changed the way banners are loaded from the centralnotice interface several times this month, and while we were testing some of these different methods we'd run blank banners on enwiki. This would mean that there were upto 3 extra resource requests per pageview during tests and during the actual fundraisers.  Other than that, none of the logging infrastructure has changed."

He later added:  " Hm...so the requests looked like php requests: bannerImpression.php?return=js&[...tracking parameters here...]

Would the script recognize these? I'm not sure of anything else we've done that would skew things otherwise."
Comment 2 Erik Zachte 2010-10-19 01:25:23 UTC
As posted on foundation-l (I'll update here from now on)

The following two charts show daily page views as counted by our squid log
post-processing server.
Same data, with different time scales,  one of 13, one for 34 months.

http://tinyurl.com/2w9pcub
http://tinyurl.com/3ytachh

As you can see there's been an instantaneous *reported* dramatic page view
increase on 9/27-9/28.

Mind you we have no idea yet whether these are actual extra messages
received or some internal artifact.

Incidentally the earlier server congestion on locke from mid November to mid
July also shows clearly.

It may be useful to generate this chart with regular intervals.
Comment 3 Nemo 2010-10-19 06:58:26 UTC
This doesn't seem to affect statistics on individual pages, e.g. http://stats.grok.se/en/201009/Main_Page or http://stats.grok.se/en/201009/Wiki .
Comment 4 Roan Kattouw 2010-10-19 16:13:32 UTC
As Domas mentioned on IRC, the cause is glaringly obvious: CentralNotice uses a special page to load JS from: http://en.wikipedia.org/wiki/Special:BannerController?283-4 . This page is therefore getting lots and lots of hits on every wiki: see http://stats.grok.se/en/201010/Special%3ABannerController and http://stats.grok.se/en/201009/Special%3ABannerController .
Comment 5 Rob Lanphier 2010-10-19 17:11:15 UTC
Assigning to Kaldari, who has agreed to fix this if someone will definitively confirm that changing this URL: 
http://en.wikipedia.org/wiki/Special:BannerController 

...to this:
http://en.wikipedia.org/w/index.php?title=Special:BannerController 
 
...will fix the problem.
Comment 6 Rob Lanphier 2010-10-19 18:36:27 UTC
Update: Roan confirmed that changing the call should fix the problem.  The issue is that /a/webstats/bin/filter needs to be able to recognize a page as something *not* to log.  The current (undocumented?) convention for doing that is to use w/index.php?title=Special:Foo instead of wiki/Special:Foo .  The source code for /a/webstats/bin/filter is (I believe):
http://svn.wikimedia.org/viewvc/mediawiki/trunk/webstatscollector/filter.c?view=markup

...which based on my skimming, seems right (parse_url returns false if there's no "/wiki" part).

So, the ball is currently in Ryan K's court to fix this.  Since this is a fix that affects quite a few production scripts, it could be a bit before we get it rolled out, but we've not locked down on a plan yet.
Comment 7 Ryan Kaldari 2010-10-23 00:06:57 UTC
I have a fix for this checked in now (r75181 and r75222). I'll try to get it tested and pushed out on Monday if possible. This should fix the stats issue on all wikis except for meta. I'm going to do the fix for meta separately since it is less critical and more difficult to implement.

To explain, the stats on meta are currently extra inflated while a centralNotice campaign is actually running. To fix it, however, I'll need Tomasz or someone else to change a config setting at the same time that we deploy a code fix. If the first fix goes smoothly, I'll try to fix meta immediately afterward.
Comment 8 Tomasz Finc 2010-10-23 13:54:50 UTC
Just pending code review and then we can get this out.
Comment 9 Ryan Kaldari 2010-10-25 21:57:41 UTC
Still working on an issue with Roan. We can probably get this pushed out after I hear back from him.
Comment 10 Ryan Kaldari 2010-10-27 16:01:25 UTC
The code is ready, but we missed our window for deploying it. We'll have to wait until after the All Staff Meeting :(
Comment 11 Tomasz Finc 2010-10-27 16:20:38 UTC
Tentatively scheduled for 11/1 afternoon PDT.
Comment 12 Tomasz Finc 2010-11-02 00:10:39 UTC
Deployed
Comment 13 Nimish Gautam 2010-11-02 00:35:55 UTC
Checked, the new URLS as they appear on test.wikipedia are filtered correctly by the fliter on locke.
Comment 14 Ryan Kaldari 2010-11-02 01:02:48 UTC
I've checked in the 2nd part of the fix, for fixing the stats on meta. I have to coordinate with Tomasz for deployment since it requires changing a centralNotice config var first on the live servers.
Comment 15 Andre Klapper 2012-12-03 13:59:34 UTC
[mass-moving wikistats reports from Wikimedia→Statistics to Analytics→Wikistats to have stats issues under one Bugzilla product (see bug 42088) - sorry for the bugspam!]

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links