[Gluster-infra] [Bug 1564372] Setup Nagios server

bugzilla at redhat.com bugzilla at redhat.com
Wed Sep 26 11:24:14 UTC 2018


https://bugzilla.redhat.com/show_bug.cgi?id=1564372



--- Comment #5 from M. Scherer <mscherer at redhat.com> ---
So:

All servers managed by ansible are now monitored for ping/ssh (which did permit
to see that our freebsd hosts blocked ping, because i got paged for that as
soon as I deployed). Aka, all but gerrit prod.


I have added smtp port on supercolony, and vhost checking for a couple of web
site, see ansible repo for details. 

For now, and while I do clean the roles and stuff, I am the only one receiving
alerts, but we will need a plan for the future, I did discuss with nigel on
irc.

Notes for myself (and people that care), here the list of things to do:
- investigate more nrpe (like, security impact on having it opened on the nated
IP of the cage)
- add munin/nagios connexion
- add check of process:
   - cron
   - custom process

- add custom check (gerrit, jenkins server being offline, etc)

- refine httpd check (like more than "http 200")

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug https://bugzilla.redhat.com/token.cgi?t=OE9iSEFxEC&a=cc_unsubscribe


More information about the Gluster-infra mailing list