Last modified: 2014-10-23 14:25:48 UTC
We need to set up a process nanny alert for EL processes running on hafnium and vanadium. Just recently processes on halfnium died, they were not restarted and we lost some of EL graphite metrics.
Update: there are process alarms set up on vanadium. The only ones missing were so on hafnium.
Pertaining patchset: https://gerrit.wikimedia.org/r/#/c/143258/