Last modified: 2012-08-02 08:05:27 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and kept for historical purposes. It is not possible to log in, and apart from displaying bug reports and their history, links might be broken. See T39046, the corresponding Phabricator task, for complete and up-to-date bug report information.
Bug 37046 - fix apache monitoring in nagios
Status: RESOLVED FIXED
Product: Wikimedia Labs
Classification: Unclassified
Component: deployment-prep (beta)
Version: unspecified
Hardware: All All
Importance: Normal normal
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Keywords: ops
Depends on:
Blocks: 37081
Reported: 2012-05-23 09:24 UTC by Antoine "hashar" Musso (WMF)
Modified: 2012-08-02 08:05 UTC
CC List: 6 users

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Description Antoine "hashar" Musso (WMF) 2012-05-23 09:24:47 UTC
The apaches::monitoring class uses check_http_wikipedia, which in turn uses the hardcoded en.wikipedia.org URL. The end result is a 403 Forbidden error, since that virtual host is not available in beta.

Example of output from bot:

<labs-nagios-wm>
PROBLEM HTTP is now: WARNING on deployment-apache23 i-00000270 output: HTTP WARNING: HTTP/1.1 403 Forbidden - 366 bytes in 0.006 second response time


We need to alter apaches::monitoring to use a different monitoring service for the deployment-prep project, namely check_http_url.
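
To illustrate the idea (this is not the actual Puppet change), here is a minimal Python sketch. It assumes that the HTTP check reports 4xx responses as WARNING and 5xx responses as CRITICAL, which matches the 403 output above, and that everything else counts as OK. The function names `state_for_status` and `pick_check` are hypothetical; only the check names and the project name come from this report.

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL = 0, 1, 2


def state_for_status(status_code: int) -> int:
    """Map an HTTP status code to a Nagios state.

    Assumption: 5xx is CRITICAL, 4xx is WARNING, everything else is OK.
    """
    if status_code >= 500:
        return CRITICAL
    if status_code >= 400:
        return WARNING
    return OK


def pick_check(project: str) -> str:
    """Hypothetical selector for the check command of a given project.

    Beta (deployment-prep) does not serve the en.wikipedia.org virtual
    host, so it gets the URL-based check instead of the hardcoded one.
    """
    if project == "deployment-prep":
        return "check_http_url"
    return "check_http_wikipedia"
```

With this sketch, `state_for_status(403)` yields WARNING, which is the state the bot reported, and `pick_check("deployment-prep")` selects check_http_url.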
Comment 1 Peter Bena 2012-05-23 11:21:05 UTC
Which service is supposed to be checked? Why doesn't the standard check work? Can you write a nagios module to use?
Comment 2 Mark A. Hershberger 2012-05-23 11:32:38 UTC
(In reply to comment #1)
> Which service is supposed to be checked? Why doesn't the standard check work?
> Can you write a nagios module to use?

It is better to use the puppet classes since those are better maintained.
Comment 3 Antoine "hashar" Musso (WMF) 2012-05-23 13:15:21 UTC
(In reply to comment #1)
> Which service is supposed to be checked?

That Apache serves a web page.

> Why doesn't the standard check work?

Reread my first comment. The check is hardcoded to use en.wikipedia.org which is not available on beta.

> Can you write a nagios module to use?

There is already the check_http_url plugin.
Comment 4 Antoine "hashar" Musso (WMF) 2012-05-23 13:15:50 UTC
I forgot to post the change number:

https://gerrit.wikimedia.org/r/8575/
Comment 5 Antoine "hashar" Musso (WMF) 2012-05-23 19:24:02 UTC
Correct URL https://gerrit.wikimedia.org/r/8575

Asked Faidon or Daniel to review the change.
Comment 6 Faidon Liambotis 2012-05-29 22:05:33 UTC
I placed a review there on May 25th.
Comment 7 Antoine "hashar" Musso (WMF) 2012-05-31 12:56:43 UTC
Gerrit change #8575 was applied. It implements a new apache::monitoring::labs class.

I had to fix the nagios name with https://gerrit.wikimedia.org/r/#/c/9391/

The apache check is still in warning (403) on nagios:

http://nagios.wmflabs.org/cgi-bin/nagios3/status.cgi?hostgroup=deployment-prep&style=detail

Maybe nagios needs to be reloaded?
Comment 8 Antoine "hashar" Musso (WMF) 2012-05-31 13:02:46 UTC
So nagios on labs does not use the puppet classes! According to Ryan, it runs a semantic search, gets back a JSON object for all instances, and then creates monitors based on which puppet classes each instance uses.
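
As a rough illustration of that flow (not the actual labs code), the following Python sketch turns a JSON object describing instances into a list of monitors. The JSON shape, the `puppet_classes` key, the `CHECKS_BY_CLASS` mapping, and the `deployment-sql` instance are all assumptions made up for the example; only the class and check names come from this report.

```python
import json
from typing import List, Tuple

# Hypothetical mapping from puppet class name to nagios check command.
CHECKS_BY_CLASS = {
    "apaches::monitoring": "check_http_wikipedia",
    "apache::monitoring::labs": "check_http_url",
}


def monitors_from_instances(payload: str) -> List[Tuple[str, str]]:
    """Return (instance, check) pairs for every known puppet class.

    Assumes the payload is a JSON object keyed by instance name, where
    each value lists the puppet classes applied to that instance.
    """
    instances = json.loads(payload)
    monitors = []
    for name, info in sorted(instances.items()):
        for puppet_class in info.get("puppet_classes", []):
            check = CHECKS_BY_CLASS.get(puppet_class)
            if check is not None:
                monitors.append((name, check))
    return monitors


# Made-up sample data in the assumed shape.
sample = json.dumps({
    "deployment-apache23": {"puppet_classes": ["apache::monitoring::labs"]},
    "deployment-sql": {"puppet_classes": []},
})
```

In this sketch a monitor only appears once the instance lists the relevant puppet class in the search result, so applying a change in puppet alone would not be enough to change what nagios checks.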
Comment 9 Antoine "hashar" Musso (WMF) 2012-08-02 08:05:27 UTC
Closing this one; it was apparently fixed some weeks ago.
