Last modified: 2012-12-03 17:44:32 UTC

Wikimedia Bugzilla is closed!

Wikimedia has migrated from Bugzilla to Phabricator. Bug reports should be created and updated in Wikimedia Phabricator instead. Please create an account in Phabricator and add your Bugzilla email address to it.
Wikimedia Bugzilla is read-only. If you try to edit or create any bug report in Bugzilla you will be shown an intentional error message.
In order to access the Phabricator task corresponding to a Bugzilla report, just remove "static-" from its URL.
You could still run searches in Bugzilla or access your list of votes but bug reports will obviously not be up-to-date in Bugzilla.
Bug 3030 - Raw access to apache/squid logs would be nice
Raw access to apache/squid logs would be nice
Status: RESOLVED DUPLICATE of bug 3028
Product: Datasets
Classification: Unclassified
Webstatscollector (Other open bugs)
PC Linux
: Normal enhancement (vote)
: ---
Assigned To: Nobody - You can work on this!
Depends on:
  Show dependency treegraph
Reported: 2005-08-03 15:59 UTC by Philip Stoev
Modified: 2012-12-03 17:44 UTC (History)
2 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Description Philip Stoev 2005-08-03 15:59:07 UTC
Raw access to apache/squid http access log files would be nice. This would allow 
individual enthusiasts (including me) to run various statistics on those logs files 
for the purpose of boosting communal participation.

For example, I am interested in knowing which articles in English are most accessed 
from Bulgaria and which of them are missing so that I can spend some effort improving 
them. To the best of my understanding, this information is not available on any of 
the reports automatically generated by Wikipedia.

I also believe that many other legitimate uses of the raw log files would be found, 
including academical ones, which could regard Wikipedia as a mini-Intenet of sorts, 
for which both the full contents (the SQL article dump), the change history, and the 
access logs are known. Non of this is available for the real Internet, which may make 
Wikipedia a valuable playground for the evaluation of PageRank-like relevancy metrics 
and such.

Finally, I believe that downloading compressed logs should not place undue burden on 
Wikipedia's servers.

Thank you in advance for considering this suggestion and keep up the good work.
Comment 1 Philip Stoev 2005-08-03 17:16:04 UTC
I am sorry, bugzilla was giving me errors so this bug was created three times.

*** This bug has been marked as a duplicate of 3028 ***
Comment 2 Andre Klapper 2012-12-03 13:59:44 UTC
[mass-moving wikistats reports from Wikimedia→Statistics to Analytics→Wikistats to have stats issues under one Bugzilla product (see bug 42088) - sorry for the bugspam!]

Note You need to log in before you can comment on or make changes to this bug.