yzl
93958d0fb0
|
1 year ago | |
---|---|---|
.. | ||
README.md | 1 year ago | |
template_server_hp_ilo_snmp.yaml | 1 year ago |
README.md
HP iLO by SNMP
Overview
for HP iLO adapters that support SNMP get. Or via operating system, using SNMP HP subagent
Requirements
Zabbix version: 7.0 and higher.
Tested versions
This template has been tested on:
- iLo4, HP Proliant G9
Configuration
Zabbix should be configured according to the instructions in the Templates out of the box section.
Setup
Refer to the vendor documentation.
Macros used
Name | Description | Default |
---|---|---|
{$HEALTH_CRIT_STATUS} | 4 |
|
{$HEALTH_WARN_STATUS} | 3 |
|
{$PSU_CRIT_STATUS} | 4 |
|
{$PSU_WARN_STATUS} | 3 |
|
{$FAN_CRIT_STATUS} | 4 |
|
{$FAN_WARN_STATUS} | 3 |
|
{$DISK_ARRAY_CRIT_STATUS} | 4 |
|
{$DISK_ARRAY_WARN_STATUS} | 3 |
|
{$DISK_ARRAY_CACHE_CRIT_STATUS:"cacheModCriticalFailure"} | 8 |
|
{$DISK_ARRAY_CACHE_WARN_STATUS:"invalid"} | 2 |
|
{$DISK_ARRAY_CACHE_WARN_STATUS:"cacheModDegradedFailsafeSpeed"} | 7 |
|
{$DISK_ARRAY_CACHE_WARN_STATUS:"cacheReadCacheNotMapped"} | 9 |
|
{$DISK_ARRAY_CACHE_WARN_STATUS:"cacheModFlashMemNotAttached"} | 6 |
|
{$DISK_ARRAY_CACHE_OK_STATUS:"enabled"} | 3 |
|
{$DISK_ARRAY_CACHE_BATTERY_CRIT_STATUS:"failed"} | 4 |
|
{$DISK_ARRAY_CACHE_BATTERY_CRIT_STATUS:"capacitorFailed"} | 7 |
|
{$DISK_ARRAY_CACHE_BATTERY_WARN_STATUS:"degraded"} | 5 |
|
{$DISK_ARRAY_CACHE_BATTERY_WARN_STATUS:"notPresent"} | 6 |
|
{$VDISK_CRIT_STATUS} | 3 |
|
{$VDISK_OK_STATUS} | 2 |
|
{$DISK_WARN_STATUS} | 4 |
|
{$DISK_FAIL_STATUS} | 3 |
|
{$DISK_SMART_FAIL_STATUS:"replaceDrive"} | 3 |
|
{$DISK_SMART_FAIL_STATUS:"replaceDriveSSDWearOut"} | 4 |
|
{$SNMP.TIMEOUT} | 5m |
|
{$ICMP_LOSS_WARN} | 20 |
|
{$ICMP_RESPONSE_TIME_WARN} | 0.15 |
Items
Name | Description | Type | Key and additional info |
---|---|---|---|
HP iLO: System temperature status | MIB: CPQHLTH-MIB This value specifies the overall condition of the system's thermal environment. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. |
SNMP agent | sensor.temp.status[cpqHeThermalCondition.0] Preprocessing
|
HP iLO: Overall system health status | MIB: CPQHLTH-MIB The overall condition. This object represents the overall status of the server information represented by this MIB. |
SNMP agent | system.status[cpqHeMibCondition.0] Preprocessing
|
HP iLO: Hardware model name | MIB: CPQSINFO-MIB The machine product name.The name of the machine used in this system. |
SNMP agent | system.hw.model Preprocessing
|
HP iLO: Hardware serial number | MIB: CPQSINFO-MIB The serial number of the physical system unit. The string will be empty if the system does not report the serial number function. |
SNMP agent | system.hw.serialnumber Preprocessing
|
HP iLO: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | system.net.uptime[sysUpTime.0] Preprocessing
|
HP iLO: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | system.hw.uptime[hrSystemUptime.0] Preprocessing
|
HP iLO: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items |
SNMP trap | snmptrap.fallback |
HP iLO: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | system.location[sysLocation.0] Preprocessing
|
HP iLO: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | system.contact[sysContact.0] Preprocessing
|
HP iLO: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | system.objectid[sysObjectID.0] Preprocessing
|
HP iLO: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | system.name Preprocessing
|
HP iLO: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | system.descr[sysDescr.0] Preprocessing
|
HP iLO: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible value: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] |
HP iLO: ICMP ping | Simple check | icmpping | |
HP iLO: ICMP loss | Simple check | icmppingloss | |
HP iLO: ICMP response time | Simple check | icmppingsec |
Triggers
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HP iLO: System status is in critical state | Please check the device for errors |
count(/HP iLO by SNMP/system.status[cpqHeMibCondition.0],#1,"eq","{$HEALTH_CRIT_STATUS}")=1 |
High | |
HP iLO: System status is in warning state | Please check the device for warnings |
count(/HP iLO by SNMP/system.status[cpqHeMibCondition.0],#1,"eq","{$HEALTH_WARN_STATUS}")=1 |
Warning | Depends on:
|
HP iLO: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/HP iLO by SNMP/system.hw.serialnumber,#1)<>last(/HP iLO by SNMP/system.hw.serialnumber,#2) and length(last(/HP iLO by SNMP/system.hw.serialnumber))>0 |
Info | Manual close: Yes |
HP iLO: Host has been restarted | Uptime is less than 10 minutes. |
(last(/HP iLO by SNMP/system.hw.uptime[hrSystemUptime.0])>0 and last(/HP iLO by SNMP/system.hw.uptime[hrSystemUptime.0])<10m) or (last(/HP iLO by SNMP/system.hw.uptime[hrSystemUptime.0])=0 and last(/HP iLO by SNMP/system.net.uptime[sysUpTime.0])<10m) |
Warning | Manual close: Yes Depends on:
|
HP iLO: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/HP iLO by SNMP/system.name,#1)<>last(/HP iLO by SNMP/system.name,#2) and length(last(/HP iLO by SNMP/system.name))>0 |
Info | Manual close: Yes |
HP iLO: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/HP iLO by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |
Warning | Depends on:
|
HP iLO: Unavailable by ICMP ping | Last three attempts returned timeout. Please check device connectivity. |
max(/HP iLO by SNMP/icmpping,#3)=0 |
High | |
HP iLO: High ICMP ping loss | min(/HP iLO by SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/HP iLO by SNMP/icmppingloss,5m)<100 |
Warning | Depends on:
|
|
HP iLO: High ICMP ping response time | avg(/HP iLO by SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} |
Warning | Depends on:
|
LLD rule Temperature Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable |
SNMP agent | tempDescr.discovery Preprocessing
|
Item prototypes for Temperature Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: {#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor location | MIB: CPQHLTH-MIB This specifies the location of the temperature sensor present in the system. |
SNMP agent | sensor.temp.locale[cpqHeTemperatureLocale.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Temperature Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 1 |
Info | Manual close: Yes |
{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 3 |
Warning | |
{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 4 |
High |
LLD rule Temperature Discovery Ambient
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery Ambient | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ambient(11) and 0.1 index filter |
SNMP agent | tempDescr.discovery.ambient Preprocessing
|
Item prototypes for Temperature Discovery Ambient
Name | Description | Type | Key and additional info |
---|---|---|---|
Ambient: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Ambient |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.Ambient.{#SNMPINDEX}] Preprocessing
|
Ambient: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Temperature Discovery Ambient
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Ambient: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 1 |
Info | Manual close: Yes |
Ambient: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 3 |
Warning | |
Ambient: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 4 |
High |
LLD rule Temperature Discovery CPU
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery CPU | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with cpu(6) filter |
SNMP agent | tempDescr.discovery.cpu Preprocessing
|
Item prototypes for Temperature Discovery CPU
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: CPU-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.CPU.{#SNMPINDEX}] Preprocessing
|
CPU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Temperature Discovery CPU
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
CPU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 1 |
Info | Manual close: Yes |
CPU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 3 |
Warning | |
CPU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 4 |
High |
LLD rule Temperature Discovery Memory
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery Memory | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with memory(7) filter |
SNMP agent | tempDescr.discovery.memory Preprocessing
|
Item prototypes for Temperature Discovery Memory
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Memory-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.Memory.{#SNMPINDEX}] Preprocessing
|
Memory-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Temperature Discovery Memory
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Memory-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 1 |
Info | Manual close: Yes |
Memory-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 3 |
Warning | |
Memory-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 4 |
High |
LLD rule Temperature Discovery PSU
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery PSU | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with powerSupply(10) filter |
SNMP agent | tempDescr.discovery.psu Preprocessing
|
Item prototypes for Temperature Discovery PSU
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: PSU-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.PSU.{#SNMPINDEX}] Preprocessing
|
PSU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Temperature Discovery PSU
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
PSU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 1 |
Info | Manual close: Yes |
PSU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 3 |
Warning | |
PSU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 4 |
High |
LLD rule Temperature Discovery I/O
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery I/O | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ioBoard(5) filter |
SNMP agent | tempDescr.discovery.io Preprocessing
|
Item prototypes for Temperature Discovery I/O
Name | Description | Type | Key and additional info |
---|---|---|---|
I/O-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: I/O-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius."I/O.{#SNMPINDEX}"] Preprocessing
|
I/O-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"] Preprocessing
|
Trigger prototypes for Temperature Discovery I/O
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
I/O-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 1 |
Info | Manual close: Yes |
I/O-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 3 |
Warning | |
I/O-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 4 |
High |
LLD rule Temperature Discovery System
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery System | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with system(3) filter |
SNMP agent | tempDescr.discovery.system Preprocessing
|
Item prototypes for Temperature Discovery System
Name | Description | Type | Key and additional info |
---|---|---|---|
System-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: System-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.System.{#SNMPINDEX}] Preprocessing
|
System-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Temperature Discovery System
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
System-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 1 |
Info | Manual close: Yes |
System-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 3 |
Warning | |
System-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 4 |
High |
LLD rule PSU Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU Discovery | CPQHLTH-MIB::cpqHeFltTolPowerSupplyStatus |
SNMP agent | psu.discovery Preprocessing
|
Item prototypes for PSU Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Chassis {#CHASSIS_NUM}, bay {#BAY_NUM}: Power supply status | MIB: CPQHLTH-MIB The condition of the power supply. This value will be one of the following: other(1) The status could not be determined or not present. ok(2) The power supply is operating normally. degraded(3) A temperature sensor, fan or other power supply component is outside of normal operating range. failed(4) A power supply component detects a condition that could permanently damage the system. |
SNMP agent | sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for PSU Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Chassis {#CHASSIS_NUM}, bay {#BAY_NUM}: Power supply is in critical state | Please check the power supply unit for errors |
count(/HP iLO by SNMP/sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}],#1,"eq","{$PSU_CRIT_STATUS}")=1 |
Average | |
Chassis {#CHASSIS_NUM}, bay {#BAY_NUM}: Power supply is in warning state | Please check the power supply unit for errors |
count(/HP iLO by SNMP/sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}],#1,"eq","{$PSU_WARN_STATUS}")=1 |
Warning | Depends on:
|
LLD rule FAN Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN Discovery | CPQHLTH-MIB::cpqHeFltTolFanCondition |
SNMP agent | fan.discovery Preprocessing
|
Item prototypes for FAN Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan {#SNMPINDEX}: Fan status | MIB: CPQHLTH-MIB The condition of the fan. This value will be one of the following: other(1) Fan status detection is not supported by this system or driver. ok(2) The fan is operating properly. degraded(2) A redundant fan is not operating properly. failed(4) A non-redundant fan is not operating properly. |
SNMP agent | sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for FAN Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Fan {#SNMPINDEX}: Fan is in critical state | Please check the fan unit |
count(/HP iLO by SNMP/sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}],#1,"eq","{$FAN_CRIT_STATUS}")=1 |
Average | |
Fan {#SNMPINDEX}: Fan is in warning state | Please check the fan unit |
count(/HP iLO by SNMP/sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}],#1,"eq","{$FAN_WARN_STATUS}")=1 |
Warning | Depends on:
|
LLD rule Physical Disk Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical Disk Discovery | Scanning table of physical drive entries CPQIDA-MIB::cpqDaPhyDrvTable. |
SNMP agent | physicalDisk.discovery Preprocessing
|
Item prototypes for Physical Disk Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISK_LOCATION}: Physical disk status | MIB: CPQIDA-MIB Physical Drive Status. This shows the status of the physical drive. The following values are valid for the physical drive status: other (1) Indicates that the instrument agent does not recognize the drive. You may need to upgrade your instrument agent and/or driver software. ok (2) Indicates the drive is functioning properly. failed (3) Indicates that the drive is no longer operating and should be replaced. predictiveFailure(4) Indicates that the drive has a predictive failure error and should be replaced. |
SNMP agent | system.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. status | MIB: CPQIDA-MIB Physical Drive S.M.A.R.T Status.The following values are defined: other(1) The agent is unable to determine if the status of S.M.A.R.T predictive failure monitoring for this drive. ok(2) Indicates the drive is functioning properly. replaceDrive(3) Indicates that the drive has a S.M.A.R.T predictive failure error and should be replaced. |
SNMP agent | system.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk serial number | MIB: CPQIDA-MIB Physical Drive Serial Number. This is the serial number assigned to the physical drive. This value is based upon the serial number as returned by the SCSI inquiry command but may have been modified due to space limitations. This can be used for identification purposes. |
SNMP agent | system.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk model name | MIB: CPQIDA-MIB Physical Drive Model.This is a text description of the physical drive. The text that appears depends upon who manufactured the drive and the drive type. If a drive fails, note the model to identify the type of drive necessary for replacement. If a model number is not present, you may not have properly initialized the drive array to which the physical drive is attached for monitoring. |
SNMP agent | system.hw.physicaldisk.model[cpqDaPhyDrvModel.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk media type | MIB: CPQIDA-MIB Drive Array Physical Drive Media Type.The following values are defined: other(1) The instrument agent is unable to determine the physical drive's media type. rotatingPlatters(2) The physical drive media is composed of rotating platters. solidState(3) The physical drive media is composed of solid state electronics. |
SNMP agent | system.hw.physicaldisk.media_type[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Disk size | MIB: CPQIDA-MIB Physical Drive Size in MB. This is the size of the physical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. This field is only applicable for controllers which support SCSI drives, and therefore is not supported by the IDA or IDA-2 controllers. The field will contain 0xFFFFFFFF if the drive capacity cannot be calculated or if the controller does not support SCSI drives. |
SNMP agent | system.hw.physicaldisk.size[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Physical Disk Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISK_LOCATION}: Physical disk failed | Please check physical disk for warnings or errors |
count(/HP iLO by SNMP/system.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}],#1,"eq","{$DISK_FAIL_STATUS}")=1 |
High | |
{#DISK_LOCATION}: Physical disk is in warning state | Please check physical disk for warnings or errors |
count(/HP iLO by SNMP/system.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}],#1,"eq","{$DISK_WARN_STATUS}")=1 |
Warning | Depends on:
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. failed | Disk probably requires replacement. |
count(/HP iLO by SNMP/system.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}],#1,"eq","{$DISK_SMART_FAIL_STATUS:\"replaceDrive\"}")=1 or count(/HP iLO by SNMP/system.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}],#1,"eq","{$DISK_SMART_FAIL_STATUS:\"replaceDriveSSDWearOut\"}")=1 |
High | Depends on:
|
{#DISK_LOCATION}: Disk has been replaced | Disk serial number has changed. Acknowledge to close the problem manually. |
last(/HP iLO by SNMP/system.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#1)<>last(/HP iLO by SNMP/system.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#2) and length(last(/HP iLO by SNMP/system.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}]))>0 |
Info | Manual close: Yes |
LLD rule Virtual Disk Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual Disk Discovery | CPQIDA-MIB::cpqDaLogDrvTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Item prototypes for Virtual Disk Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Status | Logical Drive Status. |
SNMP agent | system.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Layout type | Logical Drive Fault Tolerance. This shows the fault tolerance mode of the logical drive. |
SNMP agent | system.hw.virtualdisk.layout[cpqDaLogDrvFaultTol.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Disk size | Logical Drive Size. This is the size of the logical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. |
SNMP agent | system.hw.virtualdisk.size[cpqDaLogDrvSize.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Virtual Disk Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk failed | Please check virtual disk for warnings or errors |
count(/HP iLO by SNMP/system.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}],#1,"eq","{$VDISK_CRIT_STATUS}")=1 |
High | |
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk is not in OK state | Please check virtual disk for warnings or errors |
count(/HP iLO by SNMP/system.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}],#1,"ne","{$VDISK_OK_STATUS}")=1 |
Warning | Depends on:
|
LLD rule Array Controller Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Array Controller Discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaCntlrTable |
SNMP agent | array.discovery Preprocessing
|
Item prototypes for Array Controller Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller status | MIB: CPQIDA-MIB This value represents the overall condition of this controller, and any associated logical drives,physical drives, and array accelerators. |
SNMP agent | system.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}] Preprocessing
|
{#CNTLR_LOCATION}: Disk array controller model | MIB: CPQIDA-MIB Array Controller Model. The type of controller card. |
SNMP agent | system.hw.diskarray.model[cpqDaCntlrModel.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Array Controller Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller is in critical state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CRIT_STATUS}")=1 |
High | |
{#CNTLR_LOCATION}: Disk array controller is in warning state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_WARN_STATUS}")=1 |
Average | Depends on:
|
LLD rule Array Controller Cache Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Array Controller Cache Discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaAccelTable |
SNMP agent | array.cache.discovery Preprocessing
|
Item prototypes for Array Controller Cache Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
#{#CACHE_CNTRL_INDEX}: Disk array cache controller status | MIB: CPQIDA-MIB Cache Module/Operations Status. This describes the status of the cache module and/or cache operations. Note that for some controller models, a cache module board that physically attaches to the controller or chipset may not be an available option. The status can be: Other (1) Indicates that the instrument agent does not recognize the status of the cache module. You may need to upgrade the instrument agent. Invalid (2) Indicates that a cache module board has not been installed in this system or is present but not configured. Enabled (3) Indicates that cache operations are currently configured and enabled for at least one logical drive. Temporarily Disabled (4) Indicates that cache operations have been temporarily disabled. View the cache module board error code object to determine why the write cache operations have been temporarily disabled. Permanently Disabled (5) Indicates that cache operations have been permanently disabled. View the cache module board error code object to determine why the write cache operations have been disabled. Cache Module Flash Memory Not Attached (6) Indicates that the flash memory component of the flash backed cache module is not attached. This status will be set when the flash memory is not attached and the Supercap is attached. This value is only used on flash backed cache modules that support removable flash memory. Cache Module Degraded Failsafe Speed (7) Indicates that the cache module board is currently degraded and operating at a failsafe speed. View variables cpqDaCacheMemoryDataWidth and cpqDaCacheMemoryTransferRate to obtain the cache module board`s current memory data width and memory transfer rate. Cache Module Critical Failure (8) Indicates that the cache module board has encountered a critical failure. The controller is currently operating in Zero Memory Raid mode. Read Cache Could Not Be Mapped (9) Indicates that the read cache memory in a split cache configuration could not be mapped by the operating system and as a result is not available. This status may be caused by virtual space limitations in certain operating systems and is only applicable to B-Series controllers. |
SNMP agent | system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}] Preprocessing
|
#{#CACHE_CNTRL_INDEX}: Disk array cache controller battery status | MIB: CPQIDA-MIB Cache Module Board Backup Power Status. This monitors the status of each backup power source on the board. The backup power source can only recharge when the system has power applied. The type of backup power source used is indicated by cpqDaAccelBackupPowerSource. The following values are valid: Other (1) Indicates that the instrument agent does not recognize backup power status. You may need to update your software. Ok (2) The backup power source is fully charged. Recharging (3) The array controller has one or more cache module backup power sources that are recharging. Cache module operations such as Battery/Flash Backed Write Cache, Expansion, Extension and Migration are temporarily suspended until the backup power source is fully charged. Cache module operations will automatically resume when charging is complete. Failed (4) The battery pack is below the sufficient voltage level and has not recharged in 36 hours. Your Cache Module board needs to be serviced. Degraded (5) The battery is still operating, however, one of the batteries in the pack has failed to recharge properly. Your Cache Module board should be serviced as soon as possible. NotPresent (6) A backup power source is not present on the cache module board. Some controllers do not have backup power sources. Capacitor Failed (7) The flash backed cache module capacitor is below the sufficient voltage level and has not recharged in 10 minutes. Your Cache Module board needs to be serviced. |
SNMP agent | system.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Array Controller Cache Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
#{#CACHE_CNTRL_INDEX}: Disk array cache controller is in critical state! | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_CRIT_STATUS:\"cacheModCriticalFailure\"}")=1 |
Average | |
#{#CACHE_CNTRL_INDEX}: Disk array cache controller is in warning state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_WARN_STATUS:\"cacheModDegradedFailsafeSpeed\"}")=1 or count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_WARN_STATUS:\"cacheReadCacheNotMapped\"}")=1 or count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_WARN_STATUS:\"cacheModFlashMemNotAttached\"}")=1 |
Warning | Depends on:
|
#{#CACHE_CNTRL_INDEX}: Disk array cache controller is not in optimal state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"ne","{$DISK_ARRAY_CACHE_OK_STATUS:\"enabled\"}")=1 and last(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK_ARRAY_CACHE_WARN_STATUS:"invalid"} |
Warning | Depends on:
|
#{#CACHE_CNTRL_INDEX}: Disk array cache controller battery is in critical state! | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_BATTERY_CRIT_STATUS:\"failed\"}")=1 or count(/HP iLO by SNMP/system.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_BATTERY_CRIT_STATUS:\"capacitorFailed\"}")=1 |
Average | |
#{#CACHE_CNTRL_INDEX}: Disk array cache controller battery is in warning state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_BATTERY_WARN_STATUS:\"degraded\"}")=1 |
Warning | Depends on:
|
Feedback
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums