Last modified: 2012-12-03 17:44:32 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T5030, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 3030 - Raw access to apache/squid logs would be nice
Status: RESOLVED DUPLICATE of bug 3028
Product: Datasets
Classification: Unclassified
Component: Webstatscollector
Hardware: PC Linux
Importance: Normal enhancement
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Reported: 2005-08-03 15:59 UTC by Philip Stoev
Modified: 2012-12-03 17:44 UTC (History)

Description Philip Stoev 2005-08-03 15:59:07 UTC
Raw access to the apache/squid HTTP access log files would be nice. This would allow
individual enthusiasts (including me) to run various statistics on those log files
for the purpose of boosting communal participation.

For example, I am interested in knowing which articles in English are most accessed
from Bulgaria and which of them are missing, so that I can spend some effort improving
them. To the best of my understanding, this information is not available in any of
the reports automatically generated by Wikipedia.

I also believe that many other legitimate uses of the raw log files would be found,
including academic ones, which could regard Wikipedia as a mini-Internet of sorts,
for which the full contents (the SQL article dump), the change history, and the
access logs are all known. None of this is available for the real Internet, which may make
Wikipedia a valuable playground for the evaluation of PageRank-like relevancy metrics
and such.

Finally, I believe that downloading compressed logs should not place an undue burden on
Wikipedia's servers.

Thank you in advance for considering this suggestion and keep up the good work.
Comment 1 Philip Stoev 2005-08-03 17:16:04 UTC
I am sorry. Bugzilla was giving me errors, so this bug was created three times.

*** This bug has been marked as a duplicate of 3028 ***
Comment 2 Andre Klapper 2012-12-03 13:59:44 UTC
[mass-moving wikistats reports from Wikimedia→Statistics to Analytics→Wikistats to have stats issues under one Bugzilla product (see bug 42088) - sorry for the bugspam!]
