24 KiB
Intel_Qlogic Infiniband by SNMP
Overview
The Intel® 12200 is a 36-port, 40Gbps switch based on InfiniBand* architecture that cost-effectively supports a cluster of up to 36 servers, or provides an edge switch option for a larger fabric. This fixed-configuration switch is a member of the 12000 series, which delivers an exceptional set of high-speed networking features and functions.
Requirements
Zabbix version: 7.0 and higher.
Tested versions
This template has been tested on:
- Intel_Qlogic Infiniband
Configuration
Zabbix should be configured according to the instructions in the Templates out of the box section.
Setup
Refer to the vendor documentation.
Macros used
Name | Description | Default |
---|---|---|
{$TEMP_CRIT} | 60 |
|
{$TEMP_CRIT_LOW} | 5 |
|
{$TEMP_WARN} | 50 |
|
{$TEMP_CRIT_STATUS} | 3 |
|
{$TEMP_WARN_STATUS} | 2 |
|
{$PSU_CRIT_STATUS} | 3 |
|
{$PSU_WARN_STATUS} | 4 |
|
{$FAN_CRIT_STATUS} | 3 |
|
{$SNMP.TIMEOUT} | 5m |
|
{$ICMP_LOSS_WARN} | 20 |
|
{$ICMP_RESPONSE_TIME_WARN} | 0.15 |
|
{$IF.ERRORS.WARN} | 2 |
|
{$IF.UTIL.MAX} | 90 |
|
{$IFCONTROL} | 1 |
|
{$NET.IF.IFNAME.MATCHES} | ^.*$ |
|
{$NET.IF.IFNAME.NOT_MATCHES} | Filter out loopbacks, nulls, docker veth links and docker0 bridge by default |
Macro too long. Please see the template. |
{$NET.IF.IFOPERSTATUS.MATCHES} | ^.*$ |
|
{$NET.IF.IFOPERSTATUS.NOT_MATCHES} | Ignore notPresent(6) |
^6$ |
{$NET.IF.IFADMINSTATUS.MATCHES} | Ignore notPresent(6) |
^.* |
{$NET.IF.IFADMINSTATUS.NOT_MATCHES} | Ignore down(2) administrative status |
^2$ |
{$NET.IF.IFDESCR.MATCHES} | .* |
|
{$NET.IF.IFDESCR.NOT_MATCHES} | CHANGE_IF_NEEDED |
|
{$NET.IF.IFALIAS.MATCHES} | .* |
|
{$NET.IF.IFALIAS.NOT_MATCHES} | CHANGE_IF_NEEDED |
|
{$NET.IF.IFTYPE.MATCHES} | .* |
|
{$NET.IF.IFTYPE.NOT_MATCHES} | CHANGE_IF_NEEDED |
Items
Name | Description | Type | Key and additional info |
---|---|---|---|
Intel_Qlogic Infiniband: Hardware model name | MIB: ICS-CHASSIS-MIB |
SNMP agent | system.hw.model Preprocessing
|
Intel_Qlogic Infiniband: Firmware version | MIB: ICS-CHASSIS-MIB |
SNMP agent | system.hw.firmware Preprocessing
|
Intel_Qlogic Infiniband: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | system.net.uptime[sysUpTime.0] Preprocessing
|
Intel_Qlogic Infiniband: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | system.hw.uptime[hrSystemUptime.0] Preprocessing
|
Intel_Qlogic Infiniband: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items |
SNMP trap | snmptrap.fallback |
Intel_Qlogic Infiniband: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | system.location[sysLocation.0] Preprocessing
|
Intel_Qlogic Infiniband: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | system.contact[sysContact.0] Preprocessing
|
Intel_Qlogic Infiniband: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | system.objectid[sysObjectID.0] Preprocessing
|
Intel_Qlogic Infiniband: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | system.name Preprocessing
|
Intel_Qlogic Infiniband: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | system.descr[sysDescr.0] Preprocessing
|
Intel_Qlogic Infiniband: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible value: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] |
Intel_Qlogic Infiniband: ICMP ping | Simple check | icmpping | |
Intel_Qlogic Infiniband: ICMP loss | Simple check | icmppingloss | |
Intel_Qlogic Infiniband: ICMP response time | Simple check | icmppingsec |
Triggers
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Intel_Qlogic Infiniband: Firmware has changed | Firmware version has changed. Acknowledge to close the problem manually. |
last(/Intel_Qlogic Infiniband by SNMP/system.hw.firmware,#1)<>last(/Intel_Qlogic Infiniband by SNMP/system.hw.firmware,#2) and length(last(/Intel_Qlogic Infiniband by SNMP/system.hw.firmware))>0 |
Info | Manual close: Yes |
Intel_Qlogic Infiniband: Host has been restarted | Uptime is less than 10 minutes. |
(last(/Intel_Qlogic Infiniband by SNMP/system.hw.uptime[hrSystemUptime.0])>0 and last(/Intel_Qlogic Infiniband by SNMP/system.hw.uptime[hrSystemUptime.0])<10m) or (last(/Intel_Qlogic Infiniband by SNMP/system.hw.uptime[hrSystemUptime.0])=0 and last(/Intel_Qlogic Infiniband by SNMP/system.net.uptime[sysUpTime.0])<10m) |
Warning | Manual close: Yes Depends on:
|
Intel_Qlogic Infiniband: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/Intel_Qlogic Infiniband by SNMP/system.name,#1)<>last(/Intel_Qlogic Infiniband by SNMP/system.name,#2) and length(last(/Intel_Qlogic Infiniband by SNMP/system.name))>0 |
Info | Manual close: Yes |
Intel_Qlogic Infiniband: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/Intel_Qlogic Infiniband by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |
Warning | Depends on:
|
Intel_Qlogic Infiniband: Unavailable by ICMP ping | Last three attempts returned timeout. Please check device connectivity. |
max(/Intel_Qlogic Infiniband by SNMP/icmpping,#3)=0 |
High | |
Intel_Qlogic Infiniband: High ICMP ping loss | min(/Intel_Qlogic Infiniband by SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/Intel_Qlogic Infiniband by SNMP/icmppingloss,5m)<100 |
Warning | Depends on:
|
|
Intel_Qlogic Infiniband: High ICMP ping response time | avg(/Intel_Qlogic Infiniband by SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} |
Warning | Depends on:
|
LLD rule Temperature Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery | Discovering sensor's table with temperature filter |
SNMP agent | temp.discovery |
Item prototypes for Temperature Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SENSOR_INFO}: Temperature | MIB: ICS-CHASSIS-MIB The current value read from the sensor. |
SNMP agent | sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}] |
{#SENSOR_INFO}: Temperature status | MIB: ICS-CHASSIS-MIB The operational status of the sensor. |
SNMP agent | sensor.temp.status[icsChassisSensorSlotOperStatus.{#SNMPINDEX}] |
Trigger prototypes for Temperature Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SENSOR_INFO}: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Intel_Qlogic Infiniband by SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)>{$TEMP_WARN:"{#SENSOR_INFO}"} or last(/Intel_Qlogic Infiniband by SNMP/sensor.temp.status[icsChassisSensorSlotOperStatus.{#SNMPINDEX}])={$TEMP_WARN_STATUS} |
Warning | Depends on:
|
{#SENSOR_INFO}: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Intel_Qlogic Infiniband by SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"{#SENSOR_INFO}"} or last(/Intel_Qlogic Infiniband by SNMP/sensor.temp.status[icsChassisSensorSlotOperStatus.{#SNMPINDEX}])={$TEMP_CRIT_STATUS} |
High | |
{#SENSOR_INFO}: Temperature is too low | avg(/Intel_Qlogic Infiniband by SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"{#SENSOR_INFO}"} |
Average |
LLD rule Unit Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Unit Discovery | SNMP agent | unit.discovery |
Item prototypes for Unit Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
{#ENT_NAME}: Hardware serial number | MIB: ICS-CHASSIS-MIB The serial number of the FRU. If not available, this value is a zero-length string. |
SNMP agent | system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Unit Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#ENT_NAME}: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/Intel_Qlogic Infiniband by SNMP/system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}],#1)<>last(/Intel_Qlogic Infiniband by SNMP/system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}],#2) and length(last(/Intel_Qlogic Infiniband by SNMP/system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}]))>0 |
Info | Manual close: Yes |
LLD rule PSU Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU Discovery | A textual description of the power supply, that can be assigned by the administrator. |
SNMP agent | psu.discovery |
Item prototypes for PSU Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPVALUE}: Power supply status | MIB: ICS-CHASSIS-MIB Actual status of the power supply: (1) unknown: status not known. (2) disabled: power supply is disabled. (3) failed - power supply is unable to supply power due to failure. (4) warning - power supply is supplying power, but an output or sensor is bad or warning. (5) standby - power supply believed usable,but not supplying power. (6) engaged - power supply is supplying power. (7) redundant - power supply is supplying power, but not needed. (8) notPresent - power supply is supplying power is not present. |
SNMP agent | sensor.psu.status[icsChassisPowerSupplyEntry.{#SNMPINDEX}] |
Trigger prototypes for PSU Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPVALUE}: Power supply is in critical state | Please check the power supply unit for errors |
count(/Intel_Qlogic Infiniband by SNMP/sensor.psu.status[icsChassisPowerSupplyEntry.{#SNMPINDEX}],#1,"eq","{$PSU_CRIT_STATUS}")=1 |
Average | |
{#SNMPVALUE}: Power supply is in warning state | Please check the power supply unit for errors |
count(/Intel_Qlogic Infiniband by SNMP/sensor.psu.status[icsChassisPowerSupplyEntry.{#SNMPINDEX}],#1,"eq","{$PSU_WARN_STATUS}")=1 |
Warning | Depends on:
|
LLD rule FAN Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN Discovery | icsChassisFanDescription of icsChassisFanTable |
SNMP agent | fan.discovery |
Item prototypes for FAN Discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPVALUE}: Fan status | MIB: ICS-CHASSIS-MIB The operational status of the fan unit. |
SNMP agent | sensor.fan.status[icsChassisFanOperStatus.{#SNMPINDEX}] |
Trigger prototypes for FAN Discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPVALUE}: Fan is in critical state | Please check the fan unit |
count(/Intel_Qlogic Infiniband by SNMP/sensor.fan.status[icsChassisFanOperStatus.{#SNMPINDEX}],#1,"eq","{$FAN_CRIT_STATUS}")=1 |
Average |
LLD rule Network interfaces discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interfaces discovery | Discovering interfaces from IF-MIB. |
SNMP agent | net.if.discovery |
Item prototypes for Network interfaces discovery
Name | Description | Type | Key and additional info |
---|---|---|---|
Interface {#IFNAME}({#IFALIAS}): Operational status | MIB: IF-MIB The current operational state of the interface. - The testing(3) state indicates that no operational packet scan be passed - If ifAdminStatus is down(2) then ifOperStatus should be down(2) - If ifAdminStatus is changed to up(1) then ifOperStatus should change to up(1) if the interface is ready to transmit and receive network traffic - It should change todormant(5) if the interface is waiting for external actions (such as a serial line waiting for an incoming connection) - It should remain in the down(2) state if and only if there is a fault that prevents it from going to the up(1) state - It should remain in the notPresent(6) state if the interface has missing(typically, hardware) components. |
SNMP agent | net.if.status[ifOperStatus.{#SNMPINDEX}] |
Interface {#IFNAME}({#IFALIAS}): Bits received | MIB: IF-MIB The total number of octets received on the interface, including framing characters. This object is a 64-bit version of ifInOctets. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | net.if.in[ifHCInOctets.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Bits sent | MIB: IF-MIB The total number of octets transmitted out of the interface, including framing characters. This object is a 64-bit version of ifOutOctets.Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | net.if.out[ifHCOutOctets.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Inbound packets with errors | MIB: IF-MIB For packet-oriented interfaces, the number of inbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of inbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | net.if.in.errors[ifInErrors.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Outbound packets with errors | MIB: IF-MIB For packet-oriented interfaces, the number of outbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of outbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | net.if.out.errors[ifOutErrors.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Outbound packets discarded | MIB: IF-MIB The number of outbound packets which were chosen to be discarded even though no errors had been detected to prevent their being deliverable to a higher-layer protocol. One possible reason for discarding such a packet could be to free up buffer space. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | net.if.out.discards[ifOutDiscards.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Inbound packets discarded | MIB: IF-MIB The number of inbound packets which were chosen to be discarded even though no errors had been detected to prevent their being deliverable to a higher-layer protocol. One possible reason for discarding such a packet could be to free up buffer space. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | net.if.in.discards[ifInDiscards.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Interface type | MIB: IF-MIB The type of interface. Additional values for ifType are assigned by the Internet Assigned Numbers Authority (IANA), through updating the syntax of the IANAifType textual convention. |
SNMP agent | net.if.type[ifType.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Speed | MIB: IF-MIB An estimate of the interface's current bandwidth in units of 1,000,000 bits per second. If this object reports a value of |
SNMP agent | net.if.speed[ifHighSpeed.{#SNMPINDEX}] Preprocessing
|
Trigger prototypes for Network interfaces discovery
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Interface {#IFNAME}({#IFALIAS}): Link down | This trigger expression works as follows: |
{$IFCONTROL:"{#IFNAME}"}=1 and last(/Intel_Qlogic Infiniband by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])=2 and (last(/Intel_Qlogic Infiniband by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#1)<>last(/Intel_Qlogic Infiniband by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#2)) |
Average | Manual close: Yes |
Interface {#IFNAME}({#IFALIAS}): High bandwidth usage | The utilization of the network interface is close to its estimated maximum bandwidth. |
(avg(/Intel_Qlogic Infiniband by SNMP/net.if.in[ifHCInOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}]) or avg(/Intel_Qlogic Infiniband by SNMP/net.if.out[ifHCOutOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])) and last(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 |
Warning | Manual close: Yes Depends on:
|
Interface {#IFNAME}({#IFALIAS}): High error rate | It recovers when it is below 80% of the |
min(/Intel_Qlogic Infiniband by SNMP/net.if.in.errors[ifInErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} or min(/Intel_Qlogic Infiniband by SNMP/net.if.out.errors[ifOutErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} |
Warning | Manual close: Yes Depends on:
|
Interface {#IFNAME}({#IFALIAS}): Ethernet has changed to lower speed than it was before | This Ethernet connection has transitioned down from its known maximum speed. This might be a sign of autonegotiation issues. Acknowledge to close the problem manually. |
change(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])<0 and last(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 and ( last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=6 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=7 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=11 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=62 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=69 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=117 ) and (last(/Intel_Qlogic Infiniband by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])<>2) |
Info | Manual close: Yes Depends on:
|
Feedback
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums