Last modified: 2011-03-13 18:06:24 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T12998, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 10998 - Configure Nagios to report to toolserver IRC for toolserver outages
Configure Nagios to report to toolserver IRC for toolserver outages
Status: RESOLVED WONTFIX
Product: Wikimedia
Classification: Unclassified
General/Unknown (Other open bugs)
unspecified
All All
: Lowest enhancement with 1 vote (vote)
: ---
Assigned To: Nobody - You can work on this!
http://nagios.wikimedia.org
: shell
Depends on:
Blocks:
  Show dependency treegraph
 
Reported: 2007-08-19 23:26 UTC by Matt
Modified: 2011-03-13 18:06 UTC (History)
3 users (show)

See Also:
Web browser: ---
Mobile Platform: ---
Assignee Huggle Beta Tester: ---


Attachments

Description Matt 2007-08-19 23:26:00 UTC
It would be incredibly useful to add a second contact to nagios to report to #wikimedia-toolserver on freenode when any of the services in the toolserver groups status changes.

The groups to monitor:
- toolserver_database
- ts_ext_store


It would really help us to see that there was a server problem etc and know that it's not an issue in our scripts etc.

I wasn't sure where to ask, but since it is regarding nagios which is part of the main Wikimedia setup rather than the toolserver setup, this seemed like the most appropriate place to request.

Thanks,
Matt
Comment 1 Matt 2007-08-25 04:39:15 UTC
Any comments, any idea when/if this will be implemented?
Comment 2 Casey Brown 2007-08-25 04:44:06 UTC
Ask River (flyingparchment).  Contact the ts-admins {at] wikimedia.org.
Comment 3 Matt 2007-08-25 04:47:57 UTC
I was under the impression that nagios was a part of the main cluster, and the toolserver staff wouldn't be able to do it without asking a main dev anyway.
Comment 4 Casey Brown 2007-08-25 04:49:50 UTC
(In reply to comment #3)
> I was under the impression that nagios was a part of the main cluster, and the
> toolserver staff wouldn't be able to do it without asking a main dev anyway.
> 

River is both a toolserver and Wikimedia server root admin.  Therefore he would be the best to contact about this, being able to configure it properly on both ends and explain if or if not it is needed.
Comment 5 Matt 2007-08-25 10:46:17 UTC
I have talked to River, and they said to talk to Jeluf. Also, it's only the nagios end that needs configuring.
Comment 6 JeLuF 2007-08-29 15:51:45 UTC
nagios is not talking to irc. It's some strange script that does this. I don't see any well-maintainable way to tell nagios-wm which events it should report where. 

I'd recommend to install nagios on one of the toolservers.
Comment 7 Danny B. 2007-08-30 03:31:00 UTC
I've connected nagios-wm-echo bot to #wikimedia-tech and #wikimedia-toolserver which echoes all toolserver-related nagios-wm messages from #wikimedia-tech to #wikimedia-toolserver.

Note You need to log in before you can comment on or make changes to this bug.


Navigation
Links