You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

75 KiB

MSSQL by ODBC

Overview

This template is designed for the effortless deployment of MSSQL monitoring by Zabbix via ODBC and doesn't require any external scripts.

Requirements

Zabbix version: 7.0 and higher.

Tested versions

This template has been tested on:

  • Microsoft SQL, version 2019, 2022

Configuration

Zabbix should be configured according to the instructions in the Templates out of the box section.

Setup

  1. Create an MSSQL user for monitoring. For example, zbx_monitor. View Server State and View Any Definition permissions should be granted to the user. Grant this user read permissions to the sysjobschedules, sysjobhistory, sysjobs tables. For example, using T-SQL commands: GRANT SELECT ON OBJECT::msdb.dbo.sysjobs TO zbx_monitor GRANT SELECT ON OBJECT::msdb.dbo.sysjobservers TO zbx_monitor GRANT SELECT ON OBJECT::msdb.dbo.sysjobactivity TO zbx_monitor GRANT EXECUTE ON OBJECT::msdb.dbo.agent_datetime TO zbx_monitor For more information, see MSSQL documentation: Create a database user GRANT Server Permissions Configure a User to Create and Manage SQL Server Agent Jobs
  2. Set the username and password in host macros ({$MSSQL.USER} and {$MSSQL.PASSWORD}). Do not forget to install Microsoft ODBC driver on Zabbix server or Zabbix proxy. See Microsoft documentation for instructions: https://docs.microsoft.com/en-us/sql/connect/odbc/linux-mac/installing-the-microsoft-odbc-driver-for-sql-server?view=sql-server-ver16. Note! Credentials in the odbc.ini do not work for MSSQL.

For named instance set the value of {$MSSQL.INSTANCE} macro as MSSQL$instance name. In case if MSSQL was installed using default configuration do not change {$MSSQL.INSTANCE} macro value.

The "Service's TCP port state" item uses {HOST.CONN} and {$MSSQL.PORT} macros to check the availability of MSSQL instance. If your instance uses a non-default TCP port, set the port in your section of odbc.ini in the line Server = IP or FQDN name, port.

Macros used

Name Description Default
{$MSSQL.DSN}

System data source name.

<Put your DSN here>
{$MSSQL.USER}

MSSQL username.

<Put your username here>
{$MSSQL.PASSWORD}

MSSQL user password.

<Put your password here>
{$MSSQL.INSTANCE}

The instance name for the default instance is SQLServer. For named instance set the macro value as MSSQL$instance name.

SQLServer
{$MSSQL.PORT}

MSSQL TCP port.

1433
{$MSSQL.DBNAME.MATCHES}

This macro is used in database discovery. It can be overridden on a host or linked template level.

.*
{$MSSQL.DBNAME.NOT_MATCHES}

This macro is used in database discovery. It can be overridden on a host or linked template level.

master|tempdb|model|msdb
{$MSSQL.WORK_FILES.MAX}

The maximum number of work files created per second - for the trigger expression.

20
{$MSSQL.WORK_TABLES.MAX}

The maximum number of work tables created per second - for the trigger expression.

20
{$MSSQL.WORKTABLES_FROM_CACHE_RATIO.MIN.CRIT}

The minimum percentage of the worktables from cache ratio - for the High trigger expression.

90
{$MSSQL.BUFFER_CACHE_RATIO.MIN.CRIT}

The minimum % buffer cache hit ratio - for the High trigger expression.

30
{$MSSQL.BUFFER_CACHE_RATIO.MIN.WARN}

The minimum % buffer cache hit ratio - for the Warning trigger expression.

50
{$MSSQL.FREE_LIST_STALLS.MAX}

The maximum free list stalls per second - for the trigger expression.

2
{$MSSQL.LAZY_WRITES.MAX}

The maximum lazy writes per second - for the trigger expression.

20
{$MSSQL.PAGE_LIFE_EXPECTANCY.MIN}

The minimum page life expectancy - for the trigger expression.

300
{$MSSQL.PAGE_READS.MAX}

The maximum page reads per second - for the trigger expression.

90
{$MSSQL.PAGE_WRITES.MAX}

The maximum page writes per second - for the trigger expression.

90
{$MSSQL.AVERAGE_WAIT_TIME.MAX}

The maximum average wait time in ms - for the trigger expression.

500
{$MSSQL.LOCK_REQUESTS.MAX}

The maximum lock requests per second - for the trigger expression.

1000
{$MSSQL.LOCK_TIMEOUTS.MAX}

The maximum lock timeouts per second - for the trigger expression.

1
{$MSSQL.DEADLOCKS.MAX}

The maximum deadlocks per second - for the trigger expression.

1
{$MSSQL.LOG_FLUSH_WAITS.MAX}

The maximum log flush waits per second - for the trigger expression.

1
{$MSSQL.LOG_FLUSH_WAIT_TIME.MAX}

The maximum log flush wait time in ms - for the trigger expression.

1
{$MSSQL.PERCENT_LOG_USED.MAX}

The maximum percentage of log used - for the trigger expression.

80
{$MSSQL.PERCENT_COMPILATIONS.MAX}

The maximum percentage of Transact-SQL compilations - for the trigger expression.

10
{$MSSQL.PERCENT_RECOMPILATIONS.MAX}

The maximum percentage of Transact-SQL recompilations - for the trigger expression.

10
{$MSSQL.PERCENT_READAHEAD.MAX}

The maximum percentage of pages read/sec in anticipation of use - for the trigger expression.

20
{$MSSQL.BACKUP_DIFF.CRIT}

The maximum days without a differential backup - for the High trigger expression.

6d
{$MSSQL.BACKUP_DIFF.WARN}

The maximum days without a differential backup - for the Warning trigger expression.

3d
{$MSSQL.BACKUP_FULL.CRIT}

The maximum days without a full backup - for the High trigger expression.

10d
{$MSSQL.BACKUP_FULL.WARN}

The maximum days without a full backup - for the Warning trigger expression.

9d
{$MSSQL.BACKUP_LOG.CRIT}

The maximum days without a log backup - for the High trigger expression.

8h
{$MSSQL.BACKUP_LOG.WARN}

The maximum days without a log backup - for the Warning trigger expression.

4h
{$MSSQL.JOB.MATCHES}

This macro is used in job discovery. It can be overridden on a host or linked template level.

.*
{$MSSQL.JOB.NOT_MATCHES}

This macro is used in job discovery. It can be overridden on a host or linked template level.

CHANGE_IF_NEEDED
{$MSSQL.BACKUP_DURATION.WARN}

The maximum job duration - for the Warning trigger expression.

1h

Items

Name Description Type Key and additional info
MSSQL: Service's TCP port state

Test the availability of MS SQL Server on a TCP port.

Simple check net.tcp.service[tcp,{HOST.CONN},{$MSSQL.PORT}]

Preprocessing

  • Discard unchanged with heartbeat: 10m

MSSQL: Get last backup

The item gets information about backup processes.

Database monitor db.odbc.get[get_last_backup,"{$MSSQL.DSN}"]
MSSQL: Get job status

The item gets sql agent job status.

Database monitor db.odbc.get[get_job_status,"{$MSSQL.DSN}"]
MSSQL: Get performance counters

The item gets server global status information.

Database monitor db.odbc.get[get_status_variables,"{$MSSQL.DSN}"]
MSSQL: Version

MS SQL Server version.

Dependent item mssql.version

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Discard unchanged with heartbeat: 1d

MSSQL: Uptime

MS SQL Server uptime in 'N days, hh:mm:ss' format.

Dependent item mssql.uptime

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Get Access Methods counters

The item gets server information about access methods.

Dependent item mssql.access_methods.raw

Preprocessing

  • JSON Path: $[?(@.object_name=='{$MSSQL.INSTANCE}:Access Methods')]

    Custom on fail: Discard value

MSSQL: Forwarded records per second

Number of records per second fetched through forwarded record pointers.

Dependent item mssql.forwarded_records_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Full scans per second

Number of unrestricted full scans per second. These can be either base-table or full-index scans. Values greater than 1 or 2 indicate that there are table / Index page scans. If that is combined with high CPU, this counter requires further investigation, otherwise, if the full scans are on small tables, it can be ignored.

Dependent item mssql.full_scans_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Full Scans/sec')].cntr_value.first()

  • Change per second
MSSQL: Index searches per second

Number of index searches per second. These are used to start a range scan, reposition a range scan, revalidate a scan point, fetch a single index record, and search down the index to locate where to insert a new row.

Dependent item mssql.index_searches_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Page splits per second

Number of page splits per second that occur as the result of overflowing index pages.

Dependent item mssql.page_splits_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Page Splits/sec')].cntr_value.first()

  • Change per second
MSSQL: Work files created per second

Number of work files created per second. For example, work files can be used to store temporary results for hash joins and hash aggregates.

Dependent item mssql.workfiles_created_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Work tables created per second

Number of work tables created per second. For example, work tables can be used to store temporary results for query spool, lob variables, XML variables, and cursors.

Dependent item mssql.worktables_created_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Table lock escalations per second

Number of times locks on a table were escalated to the TABLE or HoBT granularity.

Dependent item mssql.table_lock_escalations.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Worktables from cache ratio

Percentage of work tables created where the initial two pages of the work table were not allocated but were immediately available from the work table cache.

Dependent item mssql.worktables_from_cache_ratio

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Get Buffer Manager counters

The item gets server information about buffer pool.

Dependent item mssql.buffer_manager.raw

Preprocessing

  • JSON Path: $[?(@.object_name=='{$MSSQL.INSTANCE}:Buffer Manager')]

    Custom on fail: Discard value

MSSQL: Buffer cache hit ratio

Indicates the percentage of pages found in the buffer cache without having to read from disk. The ratio is the total number of cache hits divided by the total number of cache lookups over the last few thousand page accesses. After a long period of time, the ratio changes very little. Since reading from the cache is much less expensive than reading from the disk, a higher value is preferred for this item. To increase the buffer cache hit ratio, consider increasing the amount of memory available to SQL Server or using the buffer pool extension feature.

Dependent item mssql.buffer_cache_hit_ratio

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Checkpoint pages per second

Indicates the number of pages flushed to disk per second by a checkpoint or other operation which required all dirty pages to be flushed.

Dependent item mssql.checkpoint_pages_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Database pages

Indicates the number of pages in the buffer pool with database content.

Dependent item mssql.database_pages

Preprocessing

  • JSON Path: $[?(@.counter_name=='Database pages')].cntr_value.first()

MSSQL: Free list stalls per second

Indicates the number of requests per second that had to wait for a free page.

Dependent item mssql.free_list_stalls_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Lazy writes per second

Indicates the number of buffers written per second by the buffer manager's lazy writer. The lazy writer is a system process that flushes out batches of dirty, aged buffers (buffers that contain changes that must be written back to disk before the buffer can be reused for a different page) and makes them available to user processes. The lazy writer eliminates the need to perform frequent checkpoints in order to create available buffers.

Dependent item mssql.lazy_writes_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Lazy writes/sec')].cntr_value.first()

  • Change per second
MSSQL: Page life expectancy

Indicates the number of seconds a page will stay in the buffer pool without references.

Dependent item mssql.page_life_expectancy

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Page lookups per second

Indicates the number of requests per second to find a page in the buffer pool.

Dependent item mssql.page_lookups_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Page lookups/sec')].cntr_value.first()

  • Change per second
MSSQL: Page reads per second

Indicates the number of physical database page reads that are issued per second. This statistic displays the total number of physical page reads across all databases. Because physical I/O is expensive, you may be able to minimize the cost, either by using a larger data cache, intelligent indexes, and more efficient queries, or by changing the database design.

Dependent item mssql.page_reads_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Page reads/sec')].cntr_value.first()

  • Change per second
MSSQL: Page writes per second

Indicates the number of physical database page writes that are issued per second.

Dependent item mssql.page_writes_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Page writes/sec')].cntr_value.first()

  • Change per second
MSSQL: Read-ahead pages per second

Indicates the number of pages read per second in anticipation of use.

Dependent item mssql.readahead_pages_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Target pages

The optimal number of pages in the buffer pool.

Dependent item mssql.target_pages

Preprocessing

  • JSON Path: $[?(@.counter_name=='Target pages')].cntr_value.first()

MSSQL: Get DB counters

The item gets summary information about databases.

Dependent item mssql.db_info.raw

Preprocessing

  • JSON Path: The text is too long. Please see the template.

    Custom on fail: Discard value

MSSQL: Total data file size

Total size of all data files.

Dependent item mssql.data_files_size

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL: Total log file size

Total size of all the transaction log files.

Dependent item mssql.log_files_size

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL: Total log file used size

The cumulative used size of all the log files in the database.

Dependent item mssql.log_files_used_size

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL: Total transactions per second

Total number of transactions started for all databases per second.

Dependent item mssql.transactions_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Transactions/sec')].cntr_value.first()

  • Change per second
MSSQL: Get General Statistics counters

The item gets general statistic information.

Dependent item mssql.general_statistics.raw

Preprocessing

  • JSON Path: $[?(@.object_name=='{$MSSQL.INSTANCE}:General Statistics')]

    Custom on fail: Discard value

MSSQL: Logins per second

Total number of logins started per second. This does not include pooled connections. Any value over 2 may indicate insufficient connection pooling.

Dependent item mssql.logins_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Logins/sec')].cntr_value.first()

  • Change per second
MSSQL: Logouts per second

Total number of logout operations started per second. Any value over 2 may indicate insufficient connection pooling.

Dependent item mssql.logouts_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Logouts/sec')].cntr_value.first()

  • Change per second
MSSQL: Number of blocked processes

Number of currently blocked processes.

Dependent item mssql.processes_blocked

Preprocessing

  • JSON Path: $[?(@.counter_name=='Processes blocked')].cntr_value.first()

MSSQL: Number users connected

Number of users connected to MS SQL Server.

Dependent item mssql.user_connections

Preprocessing

  • JSON Path: $[?(@.counter_name=='User Connections')].cntr_value.first()

MSSQL: Average latch wait time

Average latch wait time (in milliseconds) for latch requests that had to wait.

Calculated mssql.average_latch_wait_time
MSSQL: Get Latches counters

The item gets server information about latches.

Dependent item mssql.latches_info.raw

Preprocessing

  • JSON Path: $[?(@.object_name=='{$MSSQL.INSTANCE}:Latches')]

    Custom on fail: Discard value

MSSQL: Average latch wait time raw

Average latch wait time (in milliseconds) for latch requests that had to wait.

Dependent item mssql.average_latch_wait_time_raw

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Average latch wait time base

For internal use only.

Dependent item mssql.average_latch_wait_time_base

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Latch waits per second

The number of latch requests that could not be granted immediately. Latches are lightweight means of holding a very transient server resource, such as an address in memory.

Dependent item mssql.latch_waits_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Latch Waits/sec')].cntr_value.first()

  • Change per second
MSSQL: Total latch wait Time

Total latch wait time (in milliseconds) for latch requests in the last second. This value should stay stable compared to the number of latch waits per second.

Dependent item mssql.total_latch_wait_time

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Total average wait time

The average wait time, in milliseconds, for each lock request that had to wait.

Calculated mssql.average_wait_time
MSSQL: Get Locks counters

The item gets server information about locks.

Dependent item mssql.locks_info.raw

Preprocessing

  • JSON Path: The text is too long. Please see the template.

    Custom on fail: Discard value

MSSQL: Total average wait time raw

Average amount of wait time (in milliseconds) for each lock request that resulted in a wait. Information for all locks.

Dependent item mssql.average_wait_time_raw

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Total average wait time base

For internal use only.

Dependent item mssql.average_wait_time_base

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Total lock requests per second

Number of new locks and lock conversions per second requested from the lock manager.

Dependent item mssql.lock_requests_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Lock Requests/sec')].cntr_value.first()

  • Change per second
MSSQL: Total lock requests per second that timed out

Number of timed out lock requests per second, including requests for NOWAIT locks.

Dependent item mssql.lock_timeouts_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Lock Timeouts/sec')].cntr_value.first()

  • Change per second
MSSQL: Total lock requests per second that required waiting

Number of lock requests per second that required the caller to wait.

Dependent item mssql.lock_waits_sec.rate

Preprocessing

  • JSON Path: $[?(@.counter_name=='Lock Waits/sec')].cntr_value.first()

  • Change per second
MSSQL: Lock wait time

Average of total wait time (in milliseconds) for locks in the last second.

Dependent item mssql.lock_wait_time

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Total lock requests per second that have deadlocks

Number of lock requests per second that resulted in a deadlock.

Dependent item mssql.number_deadlocks_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Get Memory counters

The item gets memory information.

Dependent item mssql.mem_manager.raw

Preprocessing

  • JSON Path: $[?(@.object_name=='{$MSSQL.INSTANCE}:Memory Manager')]

    Custom on fail: Discard value

MSSQL: Granted Workspace Memory

Specifies the total amount of memory currently granted to executing processes, such as hash, sort, bulk copy, and index creation operations.

Dependent item mssql.granted_workspace_memory

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL: Maximum workspace memory

Indicates the maximum amount of memory available for executing processes, such as hash, sort, bulk copy, and index creation operations.

Dependent item mssql.maximum_workspace_memory

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL: Memory grants outstanding

Specifies the total number of processes that have successfully acquired a workspace memory grant.

Dependent item mssql.memory_grants_outstanding

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Memory grants pending

Specifies the total number of processes waiting for a workspace memory grant.

Dependent item mssql.memory_grants_pending

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Target server memory

Indicates the ideal amount of memory the server can consume.

Dependent item mssql.target_server_memory

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL: Total server memory

Specifies the amount of memory the server has committed using the memory manager.

Dependent item mssql.total_server_memory

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL: Get Cache counters

The item gets server information about cache.

Dependent item mssql.cache_info.raw

Preprocessing

  • JSON Path: The text is too long. Please see the template.

    Custom on fail: Discard value

MSSQL: Cache hit ratio

Ratio between cache hits and lookups.

Dependent item mssql.cache_hit_ratio

Preprocessing

  • JSON Path: $[?(@.counter_name=='CacheHitRatio')].cntr_value.first()

MSSQL: Cache object counts

Number of cache objects in the cache.

Dependent item mssql.cache_object_counts

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Cache objects in use

Number of cache objects in use.

Dependent item mssql.cache_objects_in_use

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL: Cache pages

Number of 8-kilobyte (KB) pages used by cache objects.

Dependent item mssql.cache_pages

Preprocessing

  • JSON Path: $[?(@.counter_name=='Cache Pages')].cntr_value.first()

MSSQL: Get SQL Errors counters

The item gets SQL errors information.

Dependent item mssql.sql_errors.raw

Preprocessing

  • JSON Path: $[?(@.object_name=='{$MSSQL.INSTANCE}:SQL Errors')]

    Custom on fail: Discard value

MSSQL: Errors per second (DB offline errors)

Number of errors per second.

Dependent item mssql.offline_errors_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Errors per second (Info errors)

Number of errors per second.

Dependent item mssql.info_errors_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Errors per second (Kill connection errors)

Number of errors per second.

Dependent item mssql.kill_connection_errors_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Errors per second (User errors)

Number of errors per second.

Dependent item mssql.user_errors_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Total errors per second

Number of errors per second.

Dependent item mssql.errors_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Get SQL Statistics counters

The item gets SQL statistic information.

Dependent item mssql.sql_statistics.raw

Preprocessing

  • JSON Path: $[?(@.object_name=='{$MSSQL.INSTANCE}:SQL Statistics')]

    Custom on fail: Discard value

MSSQL: Auto-param attempts per second

Number of auto-parameterization attempts per second. The total should be the sum of the failed, safe, and unsafe auto-parameterizations. Auto-parameterization occurs when an instance of SQL Server tries to parameterize a Transact-SQL request by replacing some literals with parameters to me reuse of the resulting cached execution plan across multiple similar-looking requests possible. Note that auto-parameterizations are also known as simple parameterizations in the newer versions of SQL Server. This counter does not include forced parameterizations.

Dependent item mssql.autoparam_attempts_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Batch requests per second

Number of Transact-SQL command batches received per second. This statistic is affected by all constraints (such as I/O, number of users, cache size, complexity of requests, and so on). High batch requests mean good throughput.

Dependent item mssql.batch_requests_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Percent of Adhoc queries running

The ratio of SQL compilations per second to Batch requests per second in percentage.

Calculated mssql.percent_of_adhoc_queries
MSSQL: Percent of Recompiled Transact-SQL Objects

The ratio of SQL re-compilations per second to SQL compilations per second in percentage.

Calculated mssql.percent_recompilations_to_compilations
MSSQL: Full scans to Index searches ratio

The ratio of Full scans per second to Index searches per second. The threshold recommendation is strictly for OLTP workloads.

Calculated mssql.scan_to_search
MSSQL: Failed auto-params per second

Number of failed auto-parameterization attempts per second. This number should be small. Note that auto-parameterizations are also known as simple parameterizations in the newer versions of SQL Server.

Dependent item mssql.failed_autoparams_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Safe auto-params per second

Number of safe auto-parameterization attempts per second. Safe refers to a determination that a cached execution plan can be shared between different similar-looking Transact-SQL statements. SQL Server makes many auto-parameterization attempts some of which turn out to be safe and others fail. Note that auto-parameterizations are also known as simple parameterizations in the newer versions of SQL Server. This does not include forced parameterizations.

Dependent item mssql.safe_autoparams_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: SQL compilations per second

Number of SQL compilations per second. Indicates the number of times the compile code path is entered. Includes runs caused by statement-level recompilations in SQL Server. After SQL Server user activity is stable, this value reaches a steady state.

Dependent item mssql.sql_compilations_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: SQL re-compilations per second

Number of statement recompiles per second. Counts the number of times statement recompiles are triggered. Generally, you want the recompiles to be low.

Dependent item mssql.sql_recompilations_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Unsafe auto-params per second

Number of unsafe auto-parameterization attempts per second. For example, the query has some characteristics that prevent the cached plan from being shared. These are designated as unsafe. This does not count the number of forced parameterizations.

Dependent item mssql.unsafe_autoparams_sec.rate

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL: Total transactions number

The number of currently active transactions of all types.

Dependent item mssql.transactions

Preprocessing

  • JSON Path: The text is too long. Please see the template.

Triggers

Name Description Expression Severity Dependencies and additional info
MSSQL: Service is unavailable

The TCP port of the MS SQL Server service is currently unavailable.

last(/MSSQL by ODBC/net.tcp.service[tcp,{HOST.CONN},{$MSSQL.PORT}])=0 Disaster
MSSQL: Version has changed

MSSQL version has changed. Acknowledge to close the problem manually.

last(/MSSQL by ODBC/mssql.version,#1)<>last(/MSSQL by ODBC/mssql.version,#2) and length(last(/MSSQL by ODBC/mssql.version))>0 Info Manual close: Yes
MSSQL: Service has been restarted

Uptime is less than 10 minutes.

last(/MSSQL by ODBC/mssql.uptime)<10m Info Manual close: Yes
MSSQL: Failed to fetch info data

Zabbix has not received any data for items for the last 30 minutes.

nodata(/MSSQL by ODBC/mssql.uptime,30m)=1 Info Depends on:
  • MSSQL: Service is unavailable
MSSQL: Too frequently using pointers

Rows with varchar columns can experience expansion when varchar values are updated with a longer string. In the case where the row cannot fit in the existing page, the row migrates and access to the row will traverse a pointer. This only happens on heaps (tables without clustered indexes). Evaluate clustered index for heap tables. In cases where clustered indexes cannot be used, drop non-clustered indexes, build a clustered index to reorg pages and rows, drop the clustered index, then recreate non-clustered indexes.

last(/MSSQL by ODBC/mssql.forwarded_records_sec.rate) * 100 > 10 * last(/MSSQL by ODBC/mssql.batch_requests_sec.rate) Warning
MSSQL: Number of work files created per second is high

Too many work files created per second to store temporary results for hash joins and hash aggregates.

min(/MSSQL by ODBC/mssql.workfiles_created_sec.rate,5m)>{$MSSQL.WORK_FILES.MAX} Average
MSSQL: Number of work tables created per second is high

Too many work tables created per second to store temporary results for query spool, lob variables, XML variables, and cursors.

min(/MSSQL by ODBC/mssql.worktables_created_sec.rate,5m)>{$MSSQL.WORK_TABLES.MAX} Average
MSSQL: Percentage of work tables available from the work table cache is low

A value less than 90% may indicate insufficient memory, since execution plans are being dropped, or on 32-bit systems, may indicate the need for an upgrade to a 64-bit system

max(/MSSQL by ODBC/mssql.worktables_from_cache_ratio,5m)<{$MSSQL.WORKTABLES_FROM_CACHE_RATIO.MIN.CRIT} High
MSSQL: Percentage of the buffer cache efficiency is low

Too low buffer cache hit ratio.

max(/MSSQL by ODBC/mssql.buffer_cache_hit_ratio,5m)<{$MSSQL.BUFFER_CACHE_RATIO.MIN.CRIT} High
MSSQL: Percentage of the buffer cache efficiency is low

Low buffer cache hit ratio.

max(/MSSQL by ODBC/mssql.buffer_cache_hit_ratio,5m)<{$MSSQL.BUFFER_CACHE_RATIO.MIN.WARN} Warning Depends on:
  • MSSQL: Percentage of the buffer cache efficiency is low
MSSQL: Number of rps waiting for a free page is high

Some requests have to wait for a free page.

min(/MSSQL by ODBC/mssql.free_list_stalls_sec.rate,5m)>{$MSSQL.FREE_LIST_STALLS.MAX} Warning
MSSQL: Number of buffers written per second by the lazy writer is high

The number of buffers written per second by the buffer manager's lazy writer exceeds the threshold.

min(/MSSQL by ODBC/mssql.lazy_writes_sec.rate,5m)>{$MSSQL.LAZY_WRITES.MAX} Warning
MSSQL: Page life expectancy is low

The page stays in the buffer pool without references of less time than the threshold value.

max(/MSSQL by ODBC/mssql.page_life_expectancy,15m)<{$MSSQL.PAGE_LIFE_EXPECTANCY.MIN} High
MSSQL: Number of physical database page reads per second is high

The physical database page reads are issued too frequently.

min(/MSSQL by ODBC/mssql.page_reads_sec.rate,5m)>{$MSSQL.PAGE_READS.MAX} Warning
MSSQL: Number of physical database page writes per second is high

The physical database page writes are issued too frequently.

min(/MSSQL by ODBC/mssql.page_writes_sec.rate,5m)>{$MSSQL.PAGE_WRITES.MAX} Warning
MSSQL: Too many physical reads occurring

If this value makes up even a sizeable minority of the total Page Reads/sec (say, greater than 20% of the total page reads), you may have too many physical reads occurring.

last(/MSSQL by ODBC/mssql.readahead_pages_sec.rate) > {$MSSQL.PERCENT_READAHEAD.MAX} / 100 * last(/MSSQL by ODBC/mssql.page_reads_sec.rate) Warning
MSSQL: Total average wait time for locks is high

An average wait time longer than 500ms may indicate excessive blocking. This value should generally correlate to 'Lock Waits/sec' and move up or down with it accordingly.

min(/MSSQL by ODBC/mssql.average_wait_time,5m)>{$MSSQL.AVERAGE_WAIT_TIME.MAX} Warning
MSSQL: Total number of locks per second is high

Number of new locks and lock conversions per second requested from the lock manager is high.

min(/MSSQL by ODBC/mssql.lock_requests_sec.rate,5m)>{$MSSQL.LOCK_REQUESTS.MAX} Warning
MSSQL: Total lock requests per second that timed out is high

The total number of timed out lock requests per second, including requests for NOWAIT locks, is high.

min(/MSSQL by ODBC/mssql.lock_timeouts_sec.rate,5m)>{$MSSQL.LOCK_TIMEOUTS.MAX} Warning
MSSQL: Some blocking is occurring for 5m

Values greater than zero indicate at least some blocking is occurring, while a value of zero can quickly eliminate blocking as a potential root-cause problem.

min(/MSSQL by ODBC/mssql.lock_waits_sec.rate,5m)>0 Average
MSSQL: Number of deadlock is high

Too many deadlocks are occurring currently.

min(/MSSQL by ODBC/mssql.number_deadlocks_sec.rate,5m)>{$MSSQL.DEADLOCKS.MAX} Average
MSSQL: Percent of adhoc queries running is high

The lower this value is the better. High values often indicate excessive adhoc querying and should be as low as possible. If excessive adhoc querying is happening, try rewriting the queries as procedures or invoke the queries using sp_executeSQL. When rewriting isn't possible, consider using a plan guide or setting the database to parameterization forced mode.

min(/MSSQL by ODBC/mssql.percent_of_adhoc_queries,15m) > {$MSSQL.PERCENT_COMPILATIONS.MAX} Warning
MSSQL: Percent of times statement recompiles is high

This number should be at or near zero, since recompiles can cause deadlocks and exclusive compile locks. This counter's value should follow in proportion to "Batch Requests/sec" and "SQL Compilations/sec".

min(/MSSQL by ODBC/mssql.percent_recompilations_to_compilations,15m) > {$MSSQL.PERCENT_RECOMPILATIONS.MAX} Warning
MSSQL: Number of index and table scans exceeds index searches in the last 15m

Index searches are preferable to index and table scans. For OLTP applications, optimize for more index searches and less scans (preferably, 1 full scan for every 1000 index searches). Index and table scans are expensive I/O operations.

min(/MSSQL by ODBC/mssql.scan_to_search,15m) > 0.001 Warning

LLD rule Database discovery

Name Description Type Key and additional info
Database discovery

Scanning databases in DBMS.

Database monitor db.odbc.discovery[dbname,"{$MSSQL.DSN}"]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Item prototypes for Database discovery

Name Description Type Key and additional info
MSSQL DB '{#DBNAME}': Get performance counters

The item gets server status information for {#DBNAME}.

Dependent item mssql.db.perf_raw["{#DBNAME}"]

Preprocessing

  • JSON Path: The text is too long. Please see the template.

    Custom on fail: Discard value

MSSQL DB '{#DBNAME}': Get last backup

The item gets information about backup processes for {#DBNAME}.

Dependent item mssql.backup.raw["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.dbname=='{#DBNAME}')]

    Custom on fail: Discard value

MSSQL DB '{#DBNAME}': State

0 = ONLINE

1 = RESTORING

2 = RECOVERING | SQL Server 2008 and later

3 = RECOVERY_PENDING | SQL Server 2008 and later

4 = SUSPECT

5 = EMERGENCY | SQL Server 2008 and later

6 = OFFLINE | SQL Server 2008 and later

7 = COPYING | Azure SQL Database Active Geo-Replication

10 = OFFLINE_SECONDARY | Azure SQL Database Active Geo-Replication

Dependent item mssql.db.state["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.counter_name=='State')].cntr_value.first()

  • Discard unchanged with heartbeat: 15m

MSSQL DB '{#DBNAME}': Active transactions

Number of active transactions for the database.

Dependent item mssql.db.active_transactions["{#DBNAME}"]

Preprocessing

  • JSON Path: The text is too long. Please see the template.

MSSQL DB '{#DBNAME}': Data file size

Cumulative size of all the data files in the database including any automatic growth. Monitoring this counter is useful, for example, for determining the correct size of tempdb.

Dependent item mssql.db.data_files_size["{#DBNAME}"]

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL DB '{#DBNAME}': Log bytes flushed per second

Total number of log bytes flushed per second. Useful for determining trends and utilization of the transaction log.

Dependent item mssql.db.log_bytes_flushed_sec.rate["{#DBNAME}"]

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL DB '{#DBNAME}': Log file size

Cumulative size of all the transaction log files in the database.

Dependent item mssql.db.log_files_size["{#DBNAME}"]

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL DB '{#DBNAME}': Log file used size

Cumulative used size of all the log files in the database.

Dependent item mssql.db.log_files_used_size["{#DBNAME}"]

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Custom multiplier: 1024

MSSQL DB '{#DBNAME}': Log flushes per second

Number of log flushes per second.

Dependent item mssql.db.log_flushes_sec.rate["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.counter_name=='Log Flushes/sec')].cntr_value.first()

  • Change per second
MSSQL DB '{#DBNAME}': Log flush waits per second

Number of commits per second waiting for the log flush.

Dependent item mssql.db.log_flush_waits_sec.rate["{#DBNAME}"]

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL DB '{#DBNAME}': Log flush wait time

Total wait time (in milliseconds) to flush the log. On an AlwaysOn secondary database, this value indicates the wait time for log records to be hardened to disk.

Dependent item mssql.db.log_flush_wait_time["{#DBNAME}"]

Preprocessing

  • JSON Path: The text is too long. Please see the template.

  • Change per second
MSSQL DB '{#DBNAME}': Log growths

Total number of times the transaction log for the database has been expanded.

Dependent item mssql.db.log_growths["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.counter_name=='Log Growths')].cntr_value.first()

MSSQL DB '{#DBNAME}': Log shrinks

Total number of times the transaction log for the database has been shrunk.

Dependent item mssql.db.log_shrinks["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.counter_name=='Log Shrinks')].cntr_value.first()

MSSQL DB '{#DBNAME}': Log truncations

Number of times the transaction log has been shrunk.

Dependent item mssql.db.log_truncations["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.counter_name=='Log Truncations')].cntr_value.first()

MSSQL DB '{#DBNAME}': Percent log used

Percentage of space in the log that is in use.

Dependent item mssql.db.percent_log_used["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.counter_name=='Percent Log Used')].cntr_value.first()

MSSQL DB '{#DBNAME}': Transactions per second

Number of transactions started for the database per second.

Dependent item mssql.db.transactions_sec.rate["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.counter_name=='Transactions/sec')].cntr_value.first()

  • Change per second
MSSQL DB '{#DBNAME}': Last diff backup duration

Duration of the last differential backup.

Dependent item mssql.backup.diff.duration["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.type=='I')].duration.first()

    Custom on fail: Set value to: 0

MSSQL DB '{#DBNAME}': Last diff backup (time ago)

The amount of time since the last differential backup.

Dependent item mssql.backup.diff["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.type=='I')].timesincelastbackup.first()

    Custom on fail: Set value to: 0

MSSQL DB '{#DBNAME}': Last full backup duration

Duration of the last full backup.

Dependent item mssql.backup.full.duration["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.type=='D')].duration.first()

    Custom on fail: Set value to: 0

MSSQL DB '{#DBNAME}': Last full backup (time ago)

The amount of time since the last full backup.

Dependent item mssql.backup.full["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.type=='D')].timesincelastbackup.first()

    Custom on fail: Set value to: 0

MSSQL DB '{#DBNAME}': Last log backup duration

Duration of the last log backup.

Dependent item mssql.backup.log.duration["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.type=='L')].duration.first()

    Custom on fail: Set value to: 0

MSSQL DB '{#DBNAME}': Last log backup

The amount of time since the last log backup.

Dependent item mssql.backup.log["{#DBNAME}"]

Preprocessing

  • JSON Path: $[?(@.type=='L')].timesincelastbackup.first()

    Custom on fail: Set value to: 0

Trigger prototypes for Database discovery

Name Description Expression Severity Dependencies and additional info
MSSQL DB '{#DBNAME}': State is {ITEM.VALUE}

The DB has a non-working state.

last(/MSSQL by ODBC/mssql.db.state["{#DBNAME}"])>1 High
MSSQL DB '{#DBNAME}': Number of commits waiting for the log flush is high

Too many commits are waiting for the log flush.

min(/MSSQL by ODBC/mssql.db.log_flush_waits_sec.rate["{#DBNAME}"],5m)>{$MSSQL.LOG_FLUSH_WAITS.MAX:"{#DBNAME}"} Warning
MSSQL DB '{#DBNAME}': Total wait time to flush the log is high

The wait time to flush the log is too long.

min(/MSSQL by ODBC/mssql.db.log_flush_wait_time["{#DBNAME}"],5m)>{$MSSQL.LOG_FLUSH_WAIT_TIME.MAX:"{#DBNAME}"} Warning
MSSQL DB '{#DBNAME}': Percent of log using is high

There's not enough space left in the log.

min(/MSSQL by ODBC/mssql.db.percent_log_used["{#DBNAME}"],5m)>{$MSSQL.PERCENT_LOG_USED.MAX:"{#DBNAME}"} Warning
MSSQL DB '{#DBNAME}': Diff backup is old

The differential backup has not been executed for a long time.

last(/MSSQL by ODBC/mssql.backup.diff["{#DBNAME}"])>{$MSSQL.BACKUP_DIFF.CRIT:"{#DBNAME}"} High Manual close: Yes
MSSQL DB '{#DBNAME}': Diff backup is old

The differential backup has not been executed for a long time.

last(/MSSQL by ODBC/mssql.backup.diff["{#DBNAME}"])>{$MSSQL.BACKUP_DIFF.WARN:"{#DBNAME}"} Warning Manual close: Yes
Depends on:
  • MSSQL DB '{#DBNAME}': Diff backup is old
MSSQL DB '{#DBNAME}': Full backup is old

The full backup has not been executed for a long time.

last(/MSSQL by ODBC/mssql.backup.full["{#DBNAME}"])>{$MSSQL.BACKUP_FULL.CRIT:"{#DBNAME}"} High Manual close: Yes
MSSQL DB '{#DBNAME}': Full backup is old

The full backup has not been executed for a long time.

last(/MSSQL by ODBC/mssql.backup.full["{#DBNAME}"])>{$MSSQL.BACKUP_FULL.WARN:"{#DBNAME}"} Warning Manual close: Yes
Depends on:
  • MSSQL DB '{#DBNAME}': Full backup is old
MSSQL DB '{#DBNAME}': Log backup is old

The log backup has not been executed for a long time.

last(/MSSQL by ODBC/mssql.backup.log["{#DBNAME}"])>{$MSSQL.BACKUP_LOG.CRIT:"{#DBNAME}"} High Manual close: Yes
MSSQL DB '{#DBNAME}': Log backup is old

The log backup has not been executed for a long time.

last(/MSSQL by ODBC/mssql.backup.log["{#DBNAME}"])>{$MSSQL.BACKUP_LOG.WARN:"{#DBNAME}"} Warning Manual close: Yes
Depends on:
  • MSSQL DB '{#DBNAME}': Log backup is old

LLD rule Availability groups discovery

Name Description Type Key and additional info
Availability groups discovery

Discovery of the existing availability groups.

Database monitor db.odbc.discovery[availability_groups,"{$MSSQL.DSN}"]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Item prototypes for Availability groups discovery

Name Description Type Key and additional info
MSSQL AG '{#GROUP_NAME}': Get replica states

Getting replica states - name, primary and secondary health, synchronization health.

Database monitor db.odbc.get[{#GROUP_NAME}_replica_states,"{MSSQL.DSN}"]<p>**Preprocessing**</p><ul><li><p>JSON Path: `.first()`

Custom on fail: Discard value

MSSQL AG '{#GROUP_NAME}': Primary replica recovery health

Indicates the recovery health of the primary replica:

0 = In progress

1 = Online

2 = Unavailable

Dependent item mssql.primary_recovery_health["{#GROUP_NAME}"]

Preprocessing

  • JSON Path: $.primary_recovery_health

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}': Primary replica name

Name of the server instance that is hosting the current primary replica.

Dependent item mssql.primary_replica["{#GROUP_NAME}"]

Preprocessing

  • JSON Path: $.primary_replica

  • Discard unchanged with heartbeat: 3h

MSSQL AG '{#GROUP_NAME}': Secondary replica recovery health

Indicates the recovery health of a secondary replica:

0 = In progress

1 = Online

2 = Unavailable

Dependent item mssql.secondary_recovery_health["{#GROUP_NAME}"]

Preprocessing

  • JSON Path: $.secondary_recovery_health

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}': Synchronization health

Reflects a rollup of the synchronization_health of all availability replicas in the availability group:

0: Not healthy. None of the availability replicas have a healthy synchronization.

1: Partially healthy. The synchronization of some, but not all, availability replicas is healthy.

2: Healthy. The synchronization of every availability replica is healthy.

Dependent item mssql.synchronization_health["{#GROUP_NAME}"]

Preprocessing

  • JSON Path: $.synchronization_health

  • Discard unchanged with heartbeat: 1h

Trigger prototypes for Availability groups discovery

Name Description Expression Severity Dependencies and additional info
MSSQL AG '{#GROUP_NAME}': Primary replica recovery health in progress

The primary replica is in the synchronization process.

last(/MSSQL by ODBC/mssql.primary_recovery_health["{#GROUP_NAME}"])=0 Warning
MSSQL AG '{#GROUP_NAME}': Secondary replica recovery health in progress

The secondary replica is in the synchronization process.

last(/MSSQL by ODBC/mssql.secondary_recovery_health["{#GROUP_NAME}"])=0 Warning
MSSQL AG '{#GROUP_NAME}': All replicas unhealthy

None of the availability replicas have a healthy synchronization.

last(/MSSQL by ODBC/mssql.synchronization_health["{#GROUP_NAME}"])=0 Disaster
MSSQL AG '{#GROUP_NAME}': Some replicas unhealthy

The synchronization health of some, but not all, availability replicas is healthy.

last(/MSSQL by ODBC/mssql.synchronization_health["{#GROUP_NAME}"])=1 High

LLD rule Local database discovery

Name Description Type Key and additional info
Local database discovery

Discovery of the local availability databases.

Database monitor db.odbc.discovery[local_db,"{$MSSQL.DSN}"]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Item prototypes for Local database discovery

Name Description Type Key and additional info
MSSQL AG '{#GROUP_NAME}' Local DB '{#DBNAME}': Get local DB states

Getting the states of the local availability database.

Database monitor db.odbc.get["{#GROUP_NAME}_{#DBNAME}_local_db.states","{MSSQL.DSN}"]<p>**Preprocessing**</p><ul><li><p>JSON Path: `.first()`

Custom on fail: Discard value

MSSQL AG '{#GROUP_NAME}' Local DB '{#DBNAME}': State

0 = Online

1 = Restoring

2 = Recovering

3 = Recovery pending

4 = Suspect

5 = Emergency

6 = Offline

Dependent item mssql.local_db.state["{#DBNAME}"]

Preprocessing

  • JSON Path: $.database_state

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}' Local DB '{#DBNAME}': Suspended

Database state:

0 = Resumed

1 = Suspended

Dependent item mssql.local_db.is_suspended["{#DBNAME}"]

Preprocessing

  • JSON Path: $.is_suspended

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}' Local DB '{#DBNAME}': Synchronization health

Reflects the intersection of the synchronization state of a database that is joined to the availability group on the availability replica and the availability mode of the availability replica (synchronous-commit or asynchronous-commit mode):

0 = Not healthy. The synchronization_state of the database is 0 (NOT SYNCHRONIZING).

1 = Partially healthy. A database on a synchronous-commit availability replica is considered partially healthy if synchronization_state is 1 (SYNCHRONIZING).

2 = Healthy. A database on an synchronous-commit availability replica is considered healthy if synchronization_state is 2 (SYNCHRONIZED), and a database on an asynchronous-commit availability replica is considered healthy if synchronization_state is 1 (SYNCHRONIZING).

Dependent item mssql.local_db.synchronization_health["{#DBNAME}"]

Preprocessing

  • JSON Path: $.synchronization_health

  • Discard unchanged with heartbeat: 1h

Trigger prototypes for Local database discovery

Name Description Expression Severity Dependencies and additional info
MSSQL AG '{#GROUP_NAME}' Local DB '{#DBNAME}': "{#DBNAME}" is {ITEM.VALUE}

The local availability database has a non-working state.

last(/MSSQL by ODBC/mssql.local_db.state["{#DBNAME}"])>0 Warning
MSSQL AG '{#GROUP_NAME}' Local DB '{#DBNAME}': "{#DBNAME}" is Not healthy

The synchronization state of the local availability database is NOT SYNCHRONIZING.

last(/MSSQL by ODBC/mssql.local_db.synchronization_health["{#DBNAME}"])=0 High
MSSQL AG '{#GROUP_NAME}' Local DB '{#DBNAME}': "{#DBNAME}" is Partially healthy

A database on a synchronous-commit availability replica is considered partially healthy if synchronization state is SYNCHRONIZING.

last(/MSSQL by ODBC/mssql.local_db.synchronization_health["{#DBNAME}"])=1 Average

LLD rule Non-local database discovery

Name Description Type Key and additional info
Non-local database discovery

Discovery of the non-local (not local to the SQL Server instance) availability databases.

Database monitor db.odbc.discovery[non-local_db,"{$MSSQL.DSN}"]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Item prototypes for Non-local database discovery

Name Description Type Key and additional info
MSSQL AG '{#GROUP_NAME}' Non-Local DB '{#REPLICA_NAME}{#DBNAME}': Get non-local DB states

Getting the states of the non-local availability database.

Database monitor db.odbc.get["{#GROUP_NAME}{#REPLICA_NAME}{#DBNAME}_non-local_db.states","{MSSQL.DSN}"]<p>**Preprocessing**</p><ul><li><p>JSON Path: `.first()`

Custom on fail: Discard value

MSSQL AG '{#GROUP_NAME}' Non-Local DB '{#REPLICA_NAME}{#DBNAME}': Log queue size

Amount of the log records of the primary database that has not been sent to the secondary databases.

Dependent item mssql.non-local_db.log_send_queue_size["{#GROUP_NAME}{#REPLICA_NAME}{#DBNAME}"]

Preprocessing

  • JSON Path: $.log_send_queue_size

  • Custom multiplier: 1024

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}' Non-Local DB '{#REPLICA_NAME}{#DBNAME}': Redo log queue size

Amount of log records in the log files of the secondary replica that has not yet been redone.

Dependent item mssql.non-local_db.redo_queue_size["{#GROUP_NAME}{#REPLICA_NAME}{#DBNAME}"]

Preprocessing

  • JSON Path: $.redo_queue_size

  • Custom multiplier: 1024

  • Discard unchanged with heartbeat: 1h

Trigger prototypes for Non-local database discovery

Name Description Expression Severity Dependencies and additional info
MSSQL AG '{#GROUP_NAME}' Non-Local DB '{#REPLICA_NAME}{#DBNAME}': Log queue size is growing

The log records of the primary database are not sent to the secondary databases.

last(/MSSQL by ODBC/mssql.non-local_db.log_send_queue_size["{#GROUP_NAME}*{#REPLICA_NAME}*{#DBNAME}"],#1)>last(/MSSQL by ODBC/mssql.non-local_db.log_send_queue_size["{#GROUP_NAME}*{#REPLICA_NAME}*{#DBNAME}"],#2) and last(/MSSQL by ODBC/mssql.non-local_db.log_send_queue_size["{#GROUP_NAME}*{#REPLICA_NAME}*{#DBNAME}"],#2)>last(/MSSQL by ODBC/mssql.non-local_db.log_send_queue_size["{#GROUP_NAME}*{#REPLICA_NAME}*{#DBNAME}"],#3) High
MSSQL AG '{#GROUP_NAME}' Non-Local DB '{#REPLICA_NAME}{#DBNAME}': Redo log queue size is growing

The log records in the log files of the secondary replica have not yet been redone.

last(/MSSQL by ODBC/mssql.non-local_db.redo_queue_size["{#GROUP_NAME}*{#REPLICA_NAME}*{#DBNAME}"],#1)>last(/MSSQL by ODBC/mssql.non-local_db.redo_queue_size["{#GROUP_NAME}*{#REPLICA_NAME}*{#DBNAME}"],#2) and last(/MSSQL by ODBC/mssql.non-local_db.redo_queue_size["{#GROUP_NAME}*{#REPLICA_NAME}*{#DBNAME}"],#2)>last(/MSSQL by ODBC/mssql.non-local_db.redo_queue_size["{#GROUP_NAME}*{#REPLICA_NAME}*{#DBNAME}"],#3) High

LLD rule Replication discovery

Name Description Type Key and additional info
Replication discovery

Discovery of the database replicas.

Database monitor db.odbc.discovery[replicas,"{$MSSQL.DSN}"]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Item prototypes for Replication discovery

Name Description Type Key and additional info
MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': Get the replica state

Getting the database replica states.

Database monitor db.odbc.get["{#GROUP_NAME}_{#REPLICA_NAME}_replica.state","{MSSQL.DSN}"]<p>**Preprocessing**</p><ul><li><p>JSON Path: `.first()`

Custom on fail: Discard value

MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': Connected state

Whether a secondary replica is currently connected to the primary replica:

0 : Disconnected. The response of an availability replica to the DISCONNECTED state depends on its role:

On the primary replica, if a secondary replica is disconnected, its secondary databases are marked as NOT SYNCHRONIZED on the primary replica, which waits for the secondary to reconnect;

On a secondary replica, upon detecting that it is disconnected, the secondary replica attempts to reconnect to the primary replica.

1 : Connected. Each primary replica tracks the connection state for every secondary replica in the same availability group. Secondary replicas track the connection state of only the primary replica.

Dependent item mssql.replica.connected_state["{#GROUP_NAME}_{#REPLICA_NAME}"]

Preprocessing

  • JSON Path: $.connected_state

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': Is local

Whether the replica is local:

0 = Indicates a remote secondary replica in an availability group whose primary replica is hosted by the local server instance. This value occurs only on the primary replica location.

1 = Indicates a local replica. On secondary replicas, this is the only available value for the availability group to which the replica belongs.

Dependent item mssql.replica.is_local["{#GROUP_NAME}_{#REPLICA_NAME}"]

Preprocessing

  • JSON Path: $.is_local

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': Join state

0 = Not joined

1 = Joined, standalone instance

2 = Joined, failover cluster instance

Dependent item mssql.replica.join_state["{#GROUP_NAME}_{#REPLICA_NAME}"]

Preprocessing

  • JSON Path: $.join_state

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': Operational state

Current operational state of the replica:

0 = Pending failover

1 = Pending

2 = Online

3 = Offline

4 = Failed

5 = Failed, no quorum

6 = Not local

Dependent item mssql.replica.operational_state["{#GROUP_NAME}_{#REPLICA_NAME}"]

Preprocessing

  • JSON Path: $.operational_state

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': Recovery health

Rollup of the database_state column of the sys.dm_hadr_database_replica_states dynamic management view:

0 : In progress. At least one joined database has a database state other than ONLINE

(database_state is not 0).

1 : Online. All the joined databases have a database state of ONLINE (database_state is 0).

Dependent item mssql.replica.recovery_health["{#GROUP_NAME}_{#REPLICA_NAME}"]

Preprocessing

  • JSON Path: $.recovery_health

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': Role

Current Always On availability groups role of a local replica or a connected remote replica:

0 = Resolving

1 = Primary

2 = Secondary

Dependent item mssql.replica.role["{#GROUP_NAME}_{#REPLICA_NAME}"]

Preprocessing

  • JSON Path: $.role

  • Discard unchanged with heartbeat: 1h

MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': Sync health

Reflects a rollup of the database synchronization state (synchronization_state)of all joined availability databases (also known as replicas) and the availability mode of the replica (synchronous-commit or asynchronous-commit mode). The rollup will reflect the least healthy accumulated state of the databases on the replica:

0 : Not healthy. At least one joined database is in the NOT SYNCHRONIZING state.

1 : Partially healthy. Some replicas are not in the target synchronization state: synchronous-commit replicas should be synchronized, and asynchronous-commit replicas should be synchronizing.

2 : Healthy. All replicas are in the target synchronization state: synchronous-commit replicas are synchronized, and asynchronous-commit replicas are synchronizing.

Dependent item mssql.replica.synchronization_health["{#GROUP_NAME}_{#REPLICA_NAME}"]

Preprocessing

  • JSON Path: $.synchronization_health

  • Discard unchanged with heartbeat: 1h

Trigger prototypes for Replication discovery

Name Description Expression Severity Dependencies and additional info
MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': {#REPLICA_NAME} is disconnected

The response of an availability replica to the DISCONNECTED state depends on its role:
On the primary replica, if a secondary replica is disconnected, its secondary databases are marked as NOT SYNCHRONIZED on the primary replica, which waits for the secondary to reconnect; On a secondary replica, upon detecting that it is disconnected, the secondary replica attempts to reconnect to the primary replica.

last(/MSSQL by ODBC/mssql.replica.connected_state["{#GROUP_NAME}_{#REPLICA_NAME}"])=0 and last(/MSSQL by ODBC/mssql.replica.role["{#GROUP_NAME}_{#REPLICA_NAME}"])=2 Warning
MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': {#REPLICA_NAME} is {ITEM.VALUE}

The operational state of the replica in a given availability group is "Pending" or "Offline".

last(/MSSQL by ODBC/mssql.replica.operational_state["{#GROUP_NAME}_{#REPLICA_NAME}"])=0 or last(/MSSQL by ODBC/mssql.replica.operational_state["{#GROUP_NAME}_{#REPLICA_NAME}"])=1 or last(/MSSQL by ODBC/mssql.replica.operational_state["{#GROUP_NAME}_{#REPLICA_NAME}"])=3 Warning
MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': {#REPLICA_NAME} is {ITEM.VALUE}

The operational state of the replica in a given availability group is "Failed".

last(/MSSQL by ODBC/mssql.replica.operational_state["{#GROUP_NAME}_{#REPLICA_NAME}"])=4 Average
MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': {#REPLICA_NAME} is {ITEM.VALUE}

The operational state of the replica in a given availability group is "Failed, no quorum".

last(/MSSQL by ODBC/mssql.replica.operational_state["{#GROUP_NAME}_{#REPLICA_NAME}"])=5 High
MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': {#REPLICA_NAME} Recovery in progress

At least one joined database has a database state other than ONLINE.

last(/MSSQL by ODBC/mssql.replica.recovery_health["{#GROUP_NAME}_{#REPLICA_NAME}"])=0 Info
MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': {#REPLICA_NAME} is Not healthy

At least one joined database is in the NOT SYNCHRONIZING state.

last(/MSSQL by ODBC/mssql.replica.synchronization_health["{#GROUP_NAME}_{#REPLICA_NAME}"])=0 Average
MSSQL AG '{#GROUP_NAME}' Replica '{#REPLICA_NAME}': {#REPLICA_NAME} is Partially healthy

Some replicas are not in the target synchronization state: synchronous-commit replicas should be synchronized, and asynchronous-commit replicas should be synchronizing.

last(/MSSQL by ODBC/mssql.replica.synchronization_health["{#GROUP_NAME}_{#REPLICA_NAME}"])=1 Warning

LLD rule Mirroring discovery

Name Description Type Key and additional info
Mirroring discovery

To see the row for a database other than master or tempdb, you must

either be the database owner or have at least ALTER ANY DATABASE or VIEW ANY

DATABASE server-level permission or CREATE DATABASE permission in the master

database. To see non-NULL values on a mirror database, you must be a member

of the sysadmin fixed server role.

Database monitor db.odbc.discovery[mirrors,"{$MSSQL.DSN}"]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Item prototypes for Mirroring discovery

Name Description Type Key and additional info
MSSQL Mirroring '{#DBNAME}': Get the mirror state

Getting mirrors state

Database monitor db.odbc.get["{#DBNAME}_mirroring_state","{MSSQL.DSN}"]<p>**Preprocessing**</p><ul><li><p>JSON Path: `.first()`

Custom on fail: Discard value

MSSQL Mirroring '{#DBNAME}': Role

Current role of the local database plays in the database mirroring session.

1 = Principal

2 = Mirror

Dependent item mssql.mirroring.role["{#DBNAME}"]

Preprocessing

  • JSON Path: $.mirroring_role

  • Discard unchanged with heartbeat: 1h

MSSQL Mirroring '{#DBNAME}': Role sequence

The number of times that mirroring partners have switched the principal and mirror roles due to a failover or forced service.

Dependent item mssql.mirroring.role_sequence["{#DBNAME}"]

Preprocessing

  • JSON Path: $.mirroring_role_sequence

  • Simple change
MSSQL Mirroring '{#DBNAME}': State

State of the mirror database and of the database mirroring session.

0 = Suspended

1 = Disconnected from the other partner

2 = Synchronizing

3 = Pending Failover

4 = Synchronized

5 = The partners are not synchronized. Failover is not possible now.

6 = The partners are synchronized. Failover is potentially possible. For information about the requirements for the failover, see Database Mirroring Operating Modes.

Dependent item mssql.mirroring.state["{#DBNAME}"]

Preprocessing

  • JSON Path: $.mirroring_state

  • Discard unchanged with heartbeat: 1h

MSSQL Mirroring '{#DBNAME}': Witness state

State of the witness in the database mirroring session of the database:

0 = Unknown

1 = Connected

2 = Disconnected

Dependent item mssql.mirroring.witness_state["{#DBNAME}"]

Preprocessing

  • JSON Path: $.mirroring_witness_state

  • Discard unchanged with heartbeat: 1h

MSSQL Mirroring '{#DBNAME}': Safety level

Safety setting for updates on the mirror database:

0 = Unknown state

1 = Off [asynchronous]

2 = Full [synchronous]

Dependent item mssql.mirroring.safety_level["{#DBNAME}"]

Preprocessing

  • JSON Path: $.mirroring_safety_level

  • Discard unchanged with heartbeat: 1h

Trigger prototypes for Mirroring discovery

Name Description Expression Severity Dependencies and additional info
MSSQL Mirroring '{#DBNAME}': "{#DBNAME}" is {ITEM.VALUE}

The state of the mirror database and of the database mirroring session is "Suspended", "Disconnected from the other partner", or "Synchronizing".

last(/MSSQL by ODBC/mssql.mirroring.state["{#DBNAME}"])>=0 and last(/MSSQL by ODBC/mssql.mirroring.state["{#DBNAME}"])<=2 Info
MSSQL Mirroring '{#DBNAME}': "{#DBNAME}" is {ITEM.VALUE}

The state of the mirror database and of the database mirroring session is "Pending Failover".

last(/MSSQL by ODBC/mssql.mirroring.state["{#DBNAME}"])=3 Warning
MSSQL Mirroring '{#DBNAME}': "{#DBNAME}" is {ITEM.VALUE}

The state of the mirror database and of the database mirroring session is "Not synchronized". The partners are not synchronized. A failover is not possible now.

last(/MSSQL by ODBC/mssql.mirroring.state["{#DBNAME}"])=5 High
MSSQL Mirroring '{#DBNAME}': "{#DBNAME}" Witness is disconnected

The state of the witness in the database mirroring session of the database is "Disconnected".

last(/MSSQL by ODBC/mssql.mirroring.witness_state["{#DBNAME}"])=2 Warning

LLD rule Job discovery

Name Description Type Key and additional info
Job discovery

Scanning jobs in DBMS.

Database monitor db.odbc.discovery[jobname,"{$MSSQL.DSN}"]

Preprocessing

  • Discard unchanged with heartbeat: 1d

Item prototypes for Job discovery

Name Description Type Key and additional info
MSSQL Job '{#JOBNAME}': Get job status

The item gets sql agent job {#JOBNAME} status.

Dependent item mssql.job.status_raw["{#JOBNAME}"]

Preprocessing

  • JSON Path: $[?(@.JobName=='{#JOBNAME}')].first()

    Custom on fail: Discard value

MSSQL Job '{#JOBNAME}': Enabled

The job state possible values:

0 = Disabled

1 = Enabled

Dependent item mssql.job.enabled["{#JOBNAME}"]

Preprocessing

  • JSON Path: $.Enabled

  • Discard unchanged with heartbeat: 15m

MSSQL Job '{#JOBNAME}': Last run date-time

The last date-time of the job run.

Dependent item mssql.job.lastrundatetime["{#JOBNAME}"]

Preprocessing

  • JSON Path: $.LastRunDateTime

  • Discard unchanged with heartbeat: 1d

MSSQL Job '{#JOBNAME}': Next run date-time

The next date-time of the job run.

Dependent item mssql.job.nextrundatetime["{#JOBNAME}"]

Preprocessing

  • JSON Path: $.NextRunDateTime

  • Discard unchanged with heartbeat: 15m

MSSQL Job '{#JOBNAME}': Last run status message

The informational message about the last run of the job.

Dependent item mssql.job.lastrunstatusmessage["{#JOBNAME}"]

Preprocessing

  • JSON Path: $.LastRunStatusMessage

  • Discard unchanged with heartbeat: 15m

MSSQL Job '{#JOBNAME}': Run status

The job status possible values:

0 ⇒ Failed

1 ⇒ Succeeded

2 ⇒ Retry

3 ⇒ Canceled

4 ⇒ Running

Dependent item mssql.job.runstatus["{#JOBNAME}"]

Preprocessing

  • JSON Path: $.RunStatus

  • Discard unchanged with heartbeat: 15m

MSSQL Job '{#JOBNAME}': Run duration

Duration of the last run job.

Dependent item mssql.job.run_duration["{#JOBNAME}"]

Preprocessing

  • JSON Path: $.RunDuration

  • Discard unchanged with heartbeat: 15m

Trigger prototypes for Job discovery

Name Description Expression Severity Dependencies and additional info
MSSQL Job '{#JOBNAME}': Failed to run

The last run of the job has failed.

last(/MSSQL by ODBC/mssql.job.runstatus["{#JOBNAME}"])=0 Warning Manual close: Yes
MSSQL Job '{#JOBNAME}': Job duration is high

The job is taking too long.

last(/MSSQL by ODBC/mssql.job.run_duration["{#JOBNAME}"])>{$MSSQL.BACKUP_DURATION.WARN:"{#JOBNAME}"} Warning Manual close: Yes

Feedback

Please report any issues with the template at https://support.zabbix.com

You can also provide feedback, discuss the template, or ask for help at ZABBIX forums