While debugging my check_disk problem after the 15.10 upgrade, I noticed that I had forgotten to enable external commands. They are handy when you want to re-schedule a check to see whether your changes took effect. Again, this is something that is easily activated. If you see an error like the following, you will want to make some changes:
Error: Could not stat() command file '/var/lib/nagios3/rw/nagios.cmd'!
The external command file may be missing, Nagios may not be running, and/or Nagios may not be checking external commands. An error occurred while attempting to commit your command for processing.
First stop the Nagios service with systemctl, service, or the init script, whichever your distribution prefers. Then, as root, edit the configuration file /etc/nagios3/nagios.cfg and make sure the variable check_external_commands is set to 1, that is, the line reads check_external_commands=1.
Afterwards, update the permissions on the external command directory with the following:
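If you want to verify the setting without opening an editor, a small shell check can do it. This is only a sketch: the function name external_commands_enabled is made up for this example, it assumes the usual key=value layout of nagios.cfg, and if the directive appears more than once it simply looks at the last uncommented one.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the given Nagios config enables external commands.
# Assumes one "key=value" directive per line; lines starting with '#' are ignored.
external_commands_enabled() {
    cfg="$1"
    # Extract the numeric value of the last uncommented check_external_commands line.
    value=$(sed -n 's/^[[:space:]]*check_external_commands[[:space:]]*=[[:space:]]*\([0-9]*\).*/\1/p' "$cfg" 2>/dev/null | tail -n 1)
    [ "$value" = "1" ]
}

# Usage: the path is the one from this post; adjust it for your distribution.
if external_commands_enabled /etc/nagios3/nagios.cfg; then
    echo "external commands are enabled"
else
    echo "set check_external_commands=1 in nagios.cfg"
fi
```

A missing or unreadable file is treated the same as a disabled setting, so the check errs on the side of telling you to look at the config.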
dpkg-statoverride --update --add nagios www-data 2710 /var/lib/nagios3/rw
dpkg-statoverride --update --add nagios nagios 751 /var/lib/nagios3
Then start Nagios again. Et voilà, you can use external commands.
Another day, another foo, this time in the check_disk plugin for Nagios on Ubuntu. After updating to 15.10, my disk space check suddenly failed with this:
DISK CRITICAL - /sys/kernel/debug/tracing is not accessible: Permission denied
It seemed a little odd, especially since I could access that path without problems before. So something has changed, and the workaround is actually fairly easy. As root, edit the file /etc/nagios-plugins/config/disk.cfg and change the command for check_all_disks. You need to add -A -i '/sys' to the command line: -A explicitly selects all paths, and -i ignores every path or partition matching the given regular expression, here everything under /sys. Your command for check_all_disks will then look like this:
# 'check_all_disks' command definition
define command{
        command_name    check_all_disks
        command_line    /usr/lib/nagios/plugins/check_disk -w '$ARG1$' -c '$ARG2$' -e -A -i '/sys'
        }
Restart Nagios and all is good. After I fixed it this way, I found that it is actually filed as bug 1516451 in Ubuntu's Launchpad.
After revamping the site a bit to support multiple projects and some other stuff, I finally found the time to upload some projects. The first is the check_process_runtime plugin for Nagios.
check_process_runtime has a simple purpose: find processes that run for too long. You can give the script the name of the process or part of the name. It then decides whether a process triggers a warning or a critical state, based on how long it has been running since it started. This is a Nagios plugin that will only run on Linux. It uses Linux-specific features, so it will not work on other Unix systems or on Windows!
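To give a rough idea of the principle, here is a minimal sketch in shell. It is not the plugin's actual code: the function name runtime_status, the thresholds of 3600 and 7200 seconds, and the default process name are all made up for illustration. It assumes the procps version of ps found on Linux, whose etimes field prints the elapsed run time in seconds, and unlike the plugin it only matches the exact command name.

```shell
#!/bin/sh
# Hypothetical helper: map an elapsed run time (in seconds) to a Nagios exit code.
# 0 = OK, 1 = WARNING, 2 = CRITICAL, following the Nagios plugin conventions.
runtime_status() {
    elapsed="$1"; warn="$2"; crit="$3"
    if [ "$elapsed" -ge "$crit" ]; then
        return 2
    elif [ "$elapsed" -ge "$warn" ]; then
        return 1
    fi
    return 0
}

# Usage: look at every process whose command name is exactly "$name"
# and remember the worst status. The thresholds are example values.
name="${1:-sleep}"
worst=0
for secs in $(ps -C "$name" -o etimes= 2>/dev/null); do
    rc=0
    runtime_status "$secs" 3600 7200 || rc=$?
    if [ "$rc" -gt "$worst" ]; then
        worst=$rc
    fi
done
echo "worst status for $name: $worst"
```

Keeping the threshold comparison in its own function makes it easy to try out with plain numbers before pointing it at real processes.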
But, again, I talk too much. The download is readily available here.
Y’all have a great night, and please play with the plugin and tell me about any problems, questions, etc. you have.