Last modified: 2012-12-03 17:44:32 UTC
Raw access to apache/squid http access log files would be nice. This would allow
individual enthusiasts (including me) to run various statistics on those logs files
for the purpose of boosting communal participation.
For example, I am interested in knowing which articles in English are most accessed
from Bulgaria and which of them are missing so that I can spend some effort improving
them. To the best of my understanding, this information is not available on any of
the reports automatically generated by Wikipedia.
I also believe that many other legitimate uses of the raw log files would be found,
including academical ones, which could regard Wikipedia as a mini-Intenet of sorts,
for which both the full contents (the SQL article dump), the change history, and the
access logs are known. Non of this is available for the real Internet, which may make
Wikipedia a valuable playground for the evaluation of PageRank-like relevancy metrics
Finally, I believe that downloading compressed logs should not place undue burden on
Thank you in advance for considering this suggestion and keep up the good work.
I am sorry, bugzilla was giving me errors so this bug was created three times.
*** This bug has been marked as a duplicate of 3028 ***
[mass-moving wikistats reports from Wikimedia→Statistics to Analytics→Wikistats to have stats issues under one Bugzilla product (see bug 42088) - sorry for the bugspam!]