check dell openmanage

INFORMATION ABOUT THIS PLUGIN

This plugin checks the status of objects monitored by Dell OpenManage via SNMP and returns OK, WARNING, CRITICAL or UNKNOWN. When the result is not OK, it also returns text describing which subsystem failed. The plugin is designed to use little bandwidth and to run quickly. Questions, comments and criticisms are appreciated.

As featured in Linux Journal issue 182

http://www.linuxjournal.com/issue/182

This program is written and maintained by:

Jason Ellison - infotek(at)gmail.com

http://www.jasonellison.net/

It is based on check_snmp_temperature.pl plugin by:

William Leibzon - william(at)leibzon.org

SETUP NOTES

Copy check_dell_openmanage*.pl to the libexec/ folder of your Nagios installation and rename it to check_dell_openmanage.pl.
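
The copy-and-rename step can be sketched as a small shell helper. This is only an illustration: the function name install_plugin is hypothetical, and the libexec/ path depends on how Nagios was installed (/usr/local/nagios/libexec is a common default, assumed here).

```shell
# Hypothetical helper: copy the downloaded plugin file into libexec/
# under the fixed name check_dell_openmanage.pl and make it executable.
install_plugin() {
    src_dir=$1      # directory holding check_dell_openmanage*.pl
    libexec_dir=$2  # e.g. /usr/local/nagios/libexec (assumed path)

    # Expand the glob; if several files match, the first one is used.
    set -- "$src_dir"/check_dell_openmanage*.pl
    if [ ! -f "$1" ]; then
        echo "no check_dell_openmanage*.pl found in $src_dir" >&2
        return 1
    fi

    cp "$1" "$libexec_dir/check_dell_openmanage.pl" &&
        chmod 755 "$libexec_dir/check_dell_openmanage.pl"
}

# Usage sketch (both paths are assumptions):
#   install_plugin ~/Downloads /usr/local/nagios/libexec
```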

Dell OpenManage must be installed on the server you wish to monitor, and SNMP must be enabled on that server and allow SNMP queries. The Nagios server that runs the plugin must have the Perl Net::SNMP module installed:

perl -MCPAN -e shell
cpan> install Net::SNMP
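
Since a missing Net::SNMP is the most common problem users report, it can be worth confirming that Perl is able to load the module before running the plugin. A minimal sketch follows; the function name perl_module_available is hypothetical.

```shell
# Hypothetical helper: succeeds (exit status 0) if Perl can load the
# named module, fails otherwise (including when perl itself is missing).
perl_module_available() {
    perl -M"$1" -e 1 >/dev/null 2>&1
}

if perl_module_available Net::SNMP; then
    echo "Net::SNMP is installed"
else
    echo "Net::SNMP is missing; install it from CPAN as shown above"
fi
```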

Alert thresholds, such as minimum and maximum fan speeds, are set in Dell OpenManage on the monitored host; check them there.

To do a test run using SNMPv1 try:

./check_dell_openmanage.pl -H 192.168.1.250 -C public -T test

This will report which OIDs your system supports.
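
When running the plugin by hand, the result can also be read from its exit status. The sketch below assumes the plugin follows the standard Nagios plugin convention (0 = OK, 1 = WARNING, 2 = CRITICAL, 3 = UNKNOWN); the function name nagios_state_name is hypothetical.

```shell
# Hypothetical helper: translate a plugin exit status into the state
# name used by Nagios. In this sketch, anything outside 0-2 is reported
# as UNKNOWN.
nagios_state_name() {
    case "$1" in
        0) echo "OK" ;;
        1) echo "WARNING" ;;
        2) echo "CRITICAL" ;;
        *) echo "UNKNOWN" ;;
    esac
}

# Usage sketch (host and community string are the example values above):
#   ./check_dell_openmanage.pl -H 192.168.1.250 -C public -T dellom
#   rc=$?
#   echo "state: $(nagios_state_name "$rc")"
```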

OIDs Checked by System Type

System Types

dellom monitors the following OIDs:

  • systemStateChassisStatus .1.3.6.1.4.1.674.10892.1.200.10.1.4.1
  • systemStatePowerSupplyStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.9.1
  • systemStateVoltageStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.12.1
  • systemStateCoolingDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.21.1
  • systemStateTemperatureStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.24.1
  • systemStateMemoryDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.27.1
  • systemStateChassisIntrusionStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.30.1
  • systemStateEventLogStatus .1.3.6.1.4.1.674.10892.1.200.10.1.41.1

dellom_storage tests all of the OIDs that dellom tests, plus the storage OID:

  • systemStateChassisStatus .1.3.6.1.4.1.674.10892.1.200.10.1.4.1
  • systemStatePowerSupplyStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.9.1
  • systemStateVoltageStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.12.1
  • systemStateCoolingDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.21.1
  • systemStateTemperatureStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.24.1
  • systemStateMemoryDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.27.1
  • systemStateChassisIntrusionStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.30.1
  • systemStateEventLogStatus .1.3.6.1.4.1.674.10892.1.200.10.1.41.1
  • StorageManagement-MIB::agentGlobalSystemStatus .1.3.6.1.4.1.674.10893.1.20.110.13.0

blade monitors the following OIDs:

  • systemStateChassisStatus .1.3.6.1.4.1.674.10892.1.200.10.1.4.1
  • systemStateVoltageStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.12.1
  • systemStateTemperatureStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.24.1
  • systemStateMemoryDeviceStatusCombined .1.3.6.1.4.1.674.10892.1.200.10.1.27.1
  • systemStateEventLogStatus .1.3.6.1.4.1.674.10892.1.200.10.1.41.1
  • StorageManagement-MIB::agentGlobalSystemStatus .1.3.6.1.4.1.674.10893.1.20.110.13.0

global monitors:

  • systemStateGlobalSystemStatus .1.3.6.1.4.1.674.10892.1.200.10.1.2.1

chassis monitors:

  • systemStateChassisStatus .1.3.6.1.4.1.674.10892.1.200.10.1.4.1

custom is meant to be customised by advanced users.
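
Each of the status OIDs listed above returns an integer from the Dell status enumeration: other(1), unknown(2), ok(3), nonCritical(4), critical(5), nonRecoverable(6). The sketch below shows one plausible way to translate such a value into a Nagios return code. The mapping and the function name dell_status_to_nagios are illustrative assumptions, not necessarily what the plugin does internally.

```shell
# Hypothetical helper: map a Dell status value to a Nagios return code
# (0 = OK, 1 = WARNING, 2 = CRITICAL, 3 = UNKNOWN). Assumed mapping.
dell_status_to_nagios() {
    case "$1" in
        3)   echo 0 ;;  # ok                       -> OK
        4)   echo 1 ;;  # nonCritical              -> WARNING
        5|6) echo 2 ;;  # critical, nonRecoverable -> CRITICAL
        *)   echo 3 ;;  # other, unknown, anything else -> UNKNOWN
    esac
}

# Usage sketch, assuming the Net-SNMP command line tools are installed
# (-Oqv prints only the value; host and community are example values):
#   value=$(snmpget -v1 -c public -Oqv 192.168.1.250 \
#           .1.3.6.1.4.1.674.10892.1.200.10.1.2.1)
#   dell_status_to_nagios "$value"
```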

TODO

  • Fix bugs and make more user friendly.

REVISION

ver 1.3

  • If non-numeric codes are returned, the text is simply added to the statusinfo.
  • This allows machine information (Dell model number and service tag) to be added to the output.

ver 1.2

  • major rewrite. simplified the way new systems are defined.
  • added system type "test" which can be used to easily generate new system definitions.

ver 1.1

  • formatting of text output
  • add blade system type... blades apparently do not support systemStatePowerSupplyStatusCombined, systemStateCoolingDeviceStatusCombined or systemStateChassisIntrusionStatusCombined

ver 1.0

  • while in verbose mode report which OID failed in a more readable manner.
  • add "global", "chassis", and "custom" system type.
  • removed system type pe2950.

ver 0.9

  • added type dellom_storage, as this name is more accurate: the plugin works with all PowerEdge servers it has been tested with. left pe2950 type in for compatibility.
  • remove min max int options from help text as they are no longer relevant

ver 0.8

  • check that the perl environment has "Net::SNMP"; complain if it is not found.
  • missing "Net::SNMP" is the most common issue users report.

ver 0.7

  • removed ucdavis definition. Added note about SNMPv3 dependencies

Links

Dell OpenManage Server Administrator Version 5.4 SNMP Reference Guide

http://support.dell.com/support/edocs/software/svradmin/5.4/en/snmp/pdf/om_54_snmp_ref_gd.pdf

Dell OpenManage Server Administrator Version 1.0 to 6.2 SNMP Reference Guide

http://support.dell.com/support/edocs/software/svradmin/6.3/en/SNMP/1.0-6.2/PDF/SNMP.pdf

Preparing the Dell OpenManage server for monitoring

Linux

http://www.ubergeek.co.uk/blog/2008/05/dell-openmanage-on-linux-debian/

Windows

http://docs.google.com/View?docID=dg2fzp3d_4d2pgb3cp