You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

24 KiB

Intel_Qlogic Infiniband by SNMP

Overview

The Intel® 12200 is a 36-port, 40Gbps switch based on InfiniBand* architecture that cost-effectively supports a cluster of up to 36 servers, or provides an edge switch option for a larger fabric. This fixed-configuration switch is a member of the 12000 series, which delivers an exceptional set of high-speed networking features and functions.

Requirements

Zabbix version: 7.0 and higher.

Tested versions

This template has been tested on:

  • Intel_Qlogic Infiniband

Configuration

Zabbix should be configured according to the instructions in the Templates out of the box section.

Setup

Refer to the vendor documentation.

Macros used

Name Description Default
{$TEMP_CRIT} 60
{$TEMP_CRIT_LOW} 5
{$TEMP_WARN} 50
{$TEMP_CRIT_STATUS} 3
{$TEMP_WARN_STATUS} 2
{$PSU_CRIT_STATUS} 3
{$PSU_WARN_STATUS} 4
{$FAN_CRIT_STATUS} 3
{$SNMP.TIMEOUT} 5m
{$ICMP_LOSS_WARN} 20
{$ICMP_RESPONSE_TIME_WARN} 0.15
{$IF.ERRORS.WARN} 2
{$IF.UTIL.MAX} 90
{$IFCONTROL} 1
{$NET.IF.IFNAME.MATCHES} ^.*$
{$NET.IF.IFNAME.NOT_MATCHES}

Filter out loopbacks, nulls, docker veth links and docker0 bridge by default

Macro too long. Please see the template.
{$NET.IF.IFOPERSTATUS.MATCHES} ^.*$
{$NET.IF.IFOPERSTATUS.NOT_MATCHES}

Ignore notPresent(6)

^6$
{$NET.IF.IFADMINSTATUS.MATCHES}

Ignore notPresent(6)

^.*
{$NET.IF.IFADMINSTATUS.NOT_MATCHES}

Ignore down(2) administrative status

^2$
{$NET.IF.IFDESCR.MATCHES} .*
{$NET.IF.IFDESCR.NOT_MATCHES} CHANGE_IF_NEEDED
{$NET.IF.IFALIAS.MATCHES} .*
{$NET.IF.IFALIAS.NOT_MATCHES} CHANGE_IF_NEEDED
{$NET.IF.IFTYPE.MATCHES} .*
{$NET.IF.IFTYPE.NOT_MATCHES} CHANGE_IF_NEEDED

Items

Name Description Type Key and additional info
Intel_Qlogic Infiniband: Hardware model name

MIB: ICS-CHASSIS-MIB

SNMP agent system.hw.model

Preprocessing

  • Regular expression: (.+) - Firmware \1

  • Discard unchanged with heartbeat: 1d

Intel_Qlogic Infiniband: Firmware version

MIB: ICS-CHASSIS-MIB

SNMP agent system.hw.firmware

Preprocessing

  • Regular expression: Firmware Version: ([0-9.]+), \1

  • Discard unchanged with heartbeat: 1d

Intel_Qlogic Infiniband: Uptime (network)

MIB: SNMPv2-MIB

The time (in hundredths of a second) since the network management portion of the system was last re-initialized.

SNMP agent system.net.uptime[sysUpTime.0]

Preprocessing

  • Custom multiplier: 0.01

Intel_Qlogic Infiniband: Uptime (hardware)

MIB: HOST-RESOURCES-MIB

The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system.

SNMP agent system.hw.uptime[hrSystemUptime.0]

Preprocessing

  • Check for not supported value

    Custom on fail: Set value to: 0

  • Custom multiplier: 0.01

Intel_Qlogic Infiniband: SNMP traps (fallback)

The item is used to collect all SNMP traps unmatched by other snmptrap items

SNMP trap snmptrap.fallback
Intel_Qlogic Infiniband: System location

MIB: SNMPv2-MIB

The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string.

SNMP agent system.location[sysLocation.0]

Preprocessing

  • Discard unchanged with heartbeat: 12h

Intel_Qlogic Infiniband: System contact details

MIB: SNMPv2-MIB

The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string.

SNMP agent system.contact[sysContact.0]

Preprocessing

  • Discard unchanged with heartbeat: 12h

Intel_Qlogic Infiniband: System object ID

MIB: SNMPv2-MIB

The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determiningwhat kind of box' is being managed. For example, if vendorFlintstones, Inc.' was assigned the subtree1.3.6.1.4.1.4242, it could assign the identifier 1.3.6.1.4.1.4242.1.1 to its `Fred Router'.

SNMP agent system.objectid[sysObjectID.0]

Preprocessing

  • Discard unchanged with heartbeat: 12h

Intel_Qlogic Infiniband: System name

MIB: SNMPv2-MIB

An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string.

SNMP agent system.name

Preprocessing

  • Discard unchanged with heartbeat: 12h

Intel_Qlogic Infiniband: System description

MIB: SNMPv2-MIB

A textual description of the entity. This value should

include the full name and version identification of the system's hardware type, software operating-system, and

networking software.

SNMP agent system.descr[sysDescr.0]

Preprocessing

  • Discard unchanged with heartbeat: 12h

Intel_Qlogic Infiniband: SNMP agent availability

Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list.

Possible value:

0 - not available

1 - available

2 - unknown

Zabbix internal zabbix[host,snmp,available]
Intel_Qlogic Infiniband: ICMP ping Simple check icmpping
Intel_Qlogic Infiniband: ICMP loss Simple check icmppingloss
Intel_Qlogic Infiniband: ICMP response time Simple check icmppingsec

Triggers

Name Description Expression Severity Dependencies and additional info
Intel_Qlogic Infiniband: Firmware has changed

Firmware version has changed. Acknowledge to close the problem manually.

last(/Intel_Qlogic Infiniband by SNMP/system.hw.firmware,#1)<>last(/Intel_Qlogic Infiniband by SNMP/system.hw.firmware,#2) and length(last(/Intel_Qlogic Infiniband by SNMP/system.hw.firmware))>0 Info Manual close: Yes
Intel_Qlogic Infiniband: Host has been restarted

Uptime is less than 10 minutes.

(last(/Intel_Qlogic Infiniband by SNMP/system.hw.uptime[hrSystemUptime.0])>0 and last(/Intel_Qlogic Infiniband by SNMP/system.hw.uptime[hrSystemUptime.0])<10m) or (last(/Intel_Qlogic Infiniband by SNMP/system.hw.uptime[hrSystemUptime.0])=0 and last(/Intel_Qlogic Infiniband by SNMP/system.net.uptime[sysUpTime.0])<10m) Warning Manual close: Yes
Depends on:
  • Intel_Qlogic Infiniband: No SNMP data collection
Intel_Qlogic Infiniband: System name has changed

The name of the system has changed. Acknowledge to close the problem manually.

last(/Intel_Qlogic Infiniband by SNMP/system.name,#1)<>last(/Intel_Qlogic Infiniband by SNMP/system.name,#2) and length(last(/Intel_Qlogic Infiniband by SNMP/system.name))>0 Info Manual close: Yes
Intel_Qlogic Infiniband: No SNMP data collection

SNMP is not available for polling. Please check device connectivity and SNMP settings.

max(/Intel_Qlogic Infiniband by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 Warning Depends on:
  • Intel_Qlogic Infiniband: Unavailable by ICMP ping
Intel_Qlogic Infiniband: Unavailable by ICMP ping

Last three attempts returned timeout. Please check device connectivity.

max(/Intel_Qlogic Infiniband by SNMP/icmpping,#3)=0 High
Intel_Qlogic Infiniband: High ICMP ping loss min(/Intel_Qlogic Infiniband by SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/Intel_Qlogic Infiniband by SNMP/icmppingloss,5m)<100 Warning Depends on:
  • Intel_Qlogic Infiniband: Unavailable by ICMP ping
Intel_Qlogic Infiniband: High ICMP ping response time avg(/Intel_Qlogic Infiniband by SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} Warning Depends on:
  • Intel_Qlogic Infiniband: High ICMP ping loss
  • Intel_Qlogic Infiniband: Unavailable by ICMP ping

LLD rule Temperature Discovery

Name Description Type Key and additional info
Temperature Discovery

Discovering sensor's table with temperature filter

SNMP agent temp.discovery

Item prototypes for Temperature Discovery

Name Description Type Key and additional info
{#SENSOR_INFO}: Temperature

MIB: ICS-CHASSIS-MIB

The current value read from the sensor.

SNMP agent sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}]
{#SENSOR_INFO}: Temperature status

MIB: ICS-CHASSIS-MIB

The operational status of the sensor.

SNMP agent sensor.temp.status[icsChassisSensorSlotOperStatus.{#SNMPINDEX}]

Trigger prototypes for Temperature Discovery

Name Description Expression Severity Dependencies and additional info
{#SENSOR_INFO}: Temperature is above warning threshold

This trigger uses temperature sensor values as well as temperature sensor status if available

avg(/Intel_Qlogic Infiniband by SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)>{$TEMP_WARN:"{#SENSOR_INFO}"} or last(/Intel_Qlogic Infiniband by SNMP/sensor.temp.status[icsChassisSensorSlotOperStatus.{#SNMPINDEX}])={$TEMP_WARN_STATUS} Warning Depends on:
  • {#SENSOR_INFO}: Temperature is above critical threshold
{#SENSOR_INFO}: Temperature is above critical threshold

This trigger uses temperature sensor values as well as temperature sensor status if available

avg(/Intel_Qlogic Infiniband by SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"{#SENSOR_INFO}"} or last(/Intel_Qlogic Infiniband by SNMP/sensor.temp.status[icsChassisSensorSlotOperStatus.{#SNMPINDEX}])={$TEMP_CRIT_STATUS} High
{#SENSOR_INFO}: Temperature is too low avg(/Intel_Qlogic Infiniband by SNMP/sensor.temp.value[icsChassisSensorSlotValue.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"{#SENSOR_INFO}"} Average

LLD rule Unit Discovery

Name Description Type Key and additional info
Unit Discovery SNMP agent unit.discovery

Item prototypes for Unit Discovery

Name Description Type Key and additional info
{#ENT_NAME}: Hardware serial number

MIB: ICS-CHASSIS-MIB

The serial number of the FRU. If not available, this value is a zero-length string.

SNMP agent system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Trigger prototypes for Unit Discovery

Name Description Expression Severity Dependencies and additional info
{#ENT_NAME}: Device has been replaced

Device serial number has changed. Acknowledge to close the problem manually.

last(/Intel_Qlogic Infiniband by SNMP/system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}],#1)<>last(/Intel_Qlogic Infiniband by SNMP/system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}],#2) and length(last(/Intel_Qlogic Infiniband by SNMP/system.hw.serialnumber[icsChassisSystemUnitFruSerialNumber.{#SNMPINDEX}]))>0 Info Manual close: Yes

LLD rule PSU Discovery

Name Description Type Key and additional info
PSU Discovery

A textual description of the power supply, that can be assigned by the administrator.

SNMP agent psu.discovery

Item prototypes for PSU Discovery

Name Description Type Key and additional info
{#SNMPVALUE}: Power supply status

MIB: ICS-CHASSIS-MIB

Actual status of the power supply:

(1) unknown: status not known.

(2) disabled: power supply is disabled.

(3) failed - power supply is unable to supply power due to failure.

(4) warning - power supply is supplying power, but an output or sensor is bad or warning.

(5) standby - power supply believed usable,but not supplying power.

(6) engaged - power supply is supplying power.

(7) redundant - power supply is supplying power, but not needed.

(8) notPresent - power supply is supplying power is not present.

SNMP agent sensor.psu.status[icsChassisPowerSupplyEntry.{#SNMPINDEX}]

Trigger prototypes for PSU Discovery

Name Description Expression Severity Dependencies and additional info
{#SNMPVALUE}: Power supply is in critical state

Please check the power supply unit for errors

count(/Intel_Qlogic Infiniband by SNMP/sensor.psu.status[icsChassisPowerSupplyEntry.{#SNMPINDEX}],#1,"eq","{$PSU_CRIT_STATUS}")=1 Average
{#SNMPVALUE}: Power supply is in warning state

Please check the power supply unit for errors

count(/Intel_Qlogic Infiniband by SNMP/sensor.psu.status[icsChassisPowerSupplyEntry.{#SNMPINDEX}],#1,"eq","{$PSU_WARN_STATUS}")=1 Warning Depends on:
  • {#SNMPVALUE}: Power supply is in critical state

LLD rule FAN Discovery

Name Description Type Key and additional info
FAN Discovery

icsChassisFanDescription of icsChassisFanTable

SNMP agent fan.discovery

Item prototypes for FAN Discovery

Name Description Type Key and additional info
{#SNMPVALUE}: Fan status

MIB: ICS-CHASSIS-MIB

The operational status of the fan unit.

SNMP agent sensor.fan.status[icsChassisFanOperStatus.{#SNMPINDEX}]

Trigger prototypes for FAN Discovery

Name Description Expression Severity Dependencies and additional info
{#SNMPVALUE}: Fan is in critical state

Please check the fan unit

count(/Intel_Qlogic Infiniband by SNMP/sensor.fan.status[icsChassisFanOperStatus.{#SNMPINDEX}],#1,"eq","{$FAN_CRIT_STATUS}")=1 Average

LLD rule Network interfaces discovery

Name Description Type Key and additional info
Network interfaces discovery

Discovering interfaces from IF-MIB.

SNMP agent net.if.discovery

Item prototypes for Network interfaces discovery

Name Description Type Key and additional info
Interface {#IFNAME}({#IFALIAS}): Operational status

MIB: IF-MIB

The current operational state of the interface.

- The testing(3) state indicates that no operational packet scan be passed

- If ifAdminStatus is down(2) then ifOperStatus should be down(2)

- If ifAdminStatus is changed to up(1) then ifOperStatus should change to up(1) if the interface is ready to transmit and receive network traffic

- It should change todormant(5) if the interface is waiting for external actions (such as a serial line waiting for an incoming connection)

- It should remain in the down(2) state if and only if there is a fault that prevents it from going to the up(1) state

- It should remain in the notPresent(6) state if the interface has missing(typically, hardware) components.

SNMP agent net.if.status[ifOperStatus.{#SNMPINDEX}]
Interface {#IFNAME}({#IFALIAS}): Bits received

MIB: IF-MIB

The total number of octets received on the interface, including framing characters. This object is a 64-bit version of ifInOctets. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.in[ifHCInOctets.{#SNMPINDEX}]

Preprocessing

  • Change per second
  • Custom multiplier: 8

Interface {#IFNAME}({#IFALIAS}): Bits sent

MIB: IF-MIB

The total number of octets transmitted out of the interface, including framing characters. This object is a 64-bit version of ifOutOctets.Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.out[ifHCOutOctets.{#SNMPINDEX}]

Preprocessing

  • Change per second
  • Custom multiplier: 8

Interface {#IFNAME}({#IFALIAS}): Inbound packets with errors

MIB: IF-MIB

For packet-oriented interfaces, the number of inbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of inbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.in.errors[ifInErrors.{#SNMPINDEX}]

Preprocessing

  • Change per second
Interface {#IFNAME}({#IFALIAS}): Outbound packets with errors

MIB: IF-MIB

For packet-oriented interfaces, the number of outbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of outbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.out.errors[ifOutErrors.{#SNMPINDEX}]

Preprocessing

  • Change per second
Interface {#IFNAME}({#IFALIAS}): Outbound packets discarded

MIB: IF-MIB

The number of outbound packets which were chosen to be discarded

even though no errors had been detected to prevent their being deliverable to a higher-layer protocol.

One possible reason for discarding such a packet could be to free up buffer space.

Discontinuities in the value of this counter can occur at re-initialization of the management system,

and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.out.discards[ifOutDiscards.{#SNMPINDEX}]

Preprocessing

  • Change per second
Interface {#IFNAME}({#IFALIAS}): Inbound packets discarded

MIB: IF-MIB

The number of inbound packets which were chosen to be discarded

even though no errors had been detected to prevent their being deliverable to a higher-layer protocol.

One possible reason for discarding such a packet could be to free up buffer space.

Discontinuities in the value of this counter can occur at re-initialization of the management system,

and at other times as indicated by the value of ifCounterDiscontinuityTime.

SNMP agent net.if.in.discards[ifInDiscards.{#SNMPINDEX}]

Preprocessing

  • Change per second
Interface {#IFNAME}({#IFALIAS}): Interface type

MIB: IF-MIB

The type of interface.

Additional values for ifType are assigned by the Internet Assigned Numbers Authority (IANA),

through updating the syntax of the IANAifType textual convention.

SNMP agent net.if.type[ifType.{#SNMPINDEX}]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Interface {#IFNAME}({#IFALIAS}): Speed

MIB: IF-MIB

An estimate of the interface's current bandwidth in units of 1,000,000 bits per second. If this object reports a value of n' then the speed of the interface is somewhere in the range of n-500,000' to`n+499,999'. For interfaces which do not vary in bandwidth or for those where no accurate estimation can be made, this object should contain the nominal bandwidth. For a sub-layer which has no concept of bandwidth, this object should be zero.

SNMP agent net.if.speed[ifHighSpeed.{#SNMPINDEX}]

Preprocessing

  • Custom multiplier: 1000000

  • Discard unchanged with heartbeat: 1h

Trigger prototypes for Network interfaces discovery

Name Description Expression Severity Dependencies and additional info
Interface {#IFNAME}({#IFALIAS}): Link down

This trigger expression works as follows:
1. It can be triggered if the operations status is down.
2. {$IFCONTROL:"{#IFNAME}"}=1 - a user can redefine context macro to value - 0. That marks this interface as not important. No new trigger will be fired if this interface is down.
3. {TEMPLATE_NAME:METRIC.diff()}=1 - the trigger fires only if the operational status was up to (1) sometime before (so, do not fire for the 'eternal off' interfaces.)

WARNING: if closed manually - it will not fire again on the next poll, because of .diff.

{$IFCONTROL:"{#IFNAME}"}=1 and last(/Intel_Qlogic Infiniband by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])=2 and (last(/Intel_Qlogic Infiniband by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#1)<>last(/Intel_Qlogic Infiniband by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}],#2)) Average Manual close: Yes
Interface {#IFNAME}({#IFALIAS}): High bandwidth usage

The utilization of the network interface is close to its estimated maximum bandwidth.

(avg(/Intel_Qlogic Infiniband by SNMP/net.if.in[ifHCInOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}]) or avg(/Intel_Qlogic Infiniband by SNMP/net.if.out[ifHCOutOctets.{#SNMPINDEX}],15m)>({$IF.UTIL.MAX:"{#IFNAME}"}/100)*last(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])) and last(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 Warning Manual close: Yes
Depends on:
  • Interface {#IFNAME}({#IFALIAS}): Link down
Interface {#IFNAME}({#IFALIAS}): High error rate

It recovers when it is below 80% of the {$IF.ERRORS.WARN:"{#IFNAME}"} threshold.

min(/Intel_Qlogic Infiniband by SNMP/net.if.in.errors[ifInErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} or min(/Intel_Qlogic Infiniband by SNMP/net.if.out.errors[ifOutErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} Warning Manual close: Yes
Depends on:
  • Interface {#IFNAME}({#IFALIAS}): Link down
Interface {#IFNAME}({#IFALIAS}): Ethernet has changed to lower speed than it was before

This Ethernet connection has transitioned down from its known maximum speed. This might be a sign of autonegotiation issues. Acknowledge to close the problem manually.

change(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])<0 and last(/Intel_Qlogic Infiniband by SNMP/net.if.speed[ifHighSpeed.{#SNMPINDEX}])>0 and ( last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=6 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=7 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=11 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=62 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=69 or last(/Intel_Qlogic Infiniband by SNMP/net.if.type[ifType.{#SNMPINDEX}])=117 ) and (last(/Intel_Qlogic Infiniband by SNMP/net.if.status[ifOperStatus.{#SNMPINDEX}])<>2) Info Manual close: Yes
Depends on:
  • Interface {#IFNAME}({#IFALIAS}): Link down

Feedback

Please report any issues with the template at https://support.zabbix.com

You can also provide feedback, discuss the template, or ask for help at ZABBIX forums