Agent G2 - Active Directory Free System Page Table Entries DotNet v4

Description

Monitors Active Directory Free System Page Table Entries Counters

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Active Directory Free System Page Table Entries DotNet v4 | AD_Free_System_Page_Table_Entries | FreeSystemPageTableEntries | NULL | Free System Page Table Entries is the number of page table entries not currently in use by the system. This counter displays the last observed value only; it is not an average.

Agent G2 - Active Directory Performance Counters DotNet v4

Description

Monitors AD Performance data

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Active Directory Performance Counters DotNet v4 | DRAInboundObjectsPersec | DRAInboundObjectsPersec | NULL | The number of objects received (per second) through inbound replication from replication partners.
| DSServerBindsPersec | DSServerBindsPersec | NULL | Shows the number of DC-to-DC binds per second that are serviced by this DC.
| DRAInboundBytesTotalPersec | DRAInboundBytesTotalPersec | Bytes per second | The sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication.
| DRAInboundObjectsAppliedPersec | DRAInboundObjectsAppliedPersec | NULL | This counter excludes changes that are received but not applied (for example, when the update is already made). It indicates how many replication updates are occurring on the server as a result of changes generated on other servers.
| ABClientSessions | ABClientSessions | NULL | AB Client Sessions is the number of connected Address Book client sessions.
| LDAPClientSessions | LDAPClientSessions | NULL | The number of sessions of connected LDAP clients. Lack of activity points to network problems.
| DSDirectoryReadsPersec | DSDirectoryReadsPersec | NULL | Shows the number of directory reads per second.
| DRAPendingReplicationSynchronizations | DRAPendingReplicationSynchronizations | NULL | The number of directory synchronizations that are queued for this server and not yet processed. This counter helps in determining replication backlog: the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication.
| NTLMAuthentications | NTLMAuthentications | NULL | The number of NTLM authentications (per second) serviced by this domain controller.
| DSDirectoryWritesPersec | DSDirectoryWritesPersec | NULL | Shows the number of directory writes per second.
| LDAPActiveThreads | LDAPActiveThreads | NULL | LDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local directory service.
| KerberosAuthentications | KerberosAuthentications | NULL | The number of times per second that clients use a client ticket to authenticate to this domain controller. A lack of activity can indicate network problems that are preventing authentication requests from succeeding.
| DRAOutboundBytesTotalPersec | DRAOutboundBytesTotalPersec | Bytes per second | The sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication.
| LDAPWritesPersec | LDAPWritesPersec | NULL | Shows the rate at which LDAP clients perform write operations.
| DSNotifyQueueSize | DSNotifyQueueSize | NULL | The number of pending update notifications that have been queued, but not yet transmitted to clients.
| DSClientBindsPersec | DSClientBindsPersec | NULL | Shows the number of Ntdsapi.dll binds per second serviced by this DC.
| LDAPUDPoperationsPersec | LDAPUDPoperationsPersec | NULL | Shows the number of UDP operations that the LDAP server is processing per second.
| LDAPBindTime | LDAPBindTime | Milliseconds | Shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems.
| LDAPSearchesPersec | LDAPSearchesPersec | NULL | The number of search operations per second performed by LDAP clients. A lack of activity points to network problems.
| DRAOutboundObjectsPersec | DRAOutboundObjectsPersec | NULL | The number of objects sent (per second) through outbound replication to replication partners.
| DRAInboundObjectUpdatesRemaininginPacket | DRAInboundObjectUpdatesRemaininginPacket | NULL | Indicates whether the monitored server is receiving changes but taking a long time to apply them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade).

Agent G2 - AD Database Monitoring - v2

Description

Monitors AD database metrics such as DBFileSizeGrowth, DBFile_DiskUsage, DiskHealthStatus, and FreeDiskSpace.

Note: The previous template version, “Agent G2 - AD Database Monitoring”, has a bug at the monitor level. We recommend using the latest template (Agent G2 - AD Database Monitoring - v2).

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - AD Database Custom Monitor - v2 | AD_Database_DBFileSizeGrowth | AD Database DBFile Size Growth | MB | Monitors the growth of the Active Directory database file size, calculated as the delta of the database file size from the previous poll to the current poll.
| AD_Database_FreeDiskSpace | AD Database FreeDiskSpace | MB | Monitors the free disk space in MB for the drives that hold the Active Directory database file or log file.
| AD_Database_DiskHealthStatus | AD Database Disk Health Status | null | Monitors the disk health status of the drive that holds the Active Directory DB file. Possible states: 0 - Healthy, 1 - Warning, 2 - Unhealthy, 3 - Unknown.
| AD_Database_DBFile_DiskUsage | AD Database DBFile DiskUsage | % | Monitors the disk usage percentage of the drive that holds the Active Directory DB file.
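
The growth metric and the health status codes described above can be illustrated with a minimal Python sketch. This is not the agent's implementation: the function names, the handling of the first poll (no previous value), and the fallback for unlisted status codes are all assumptions made for illustration.

```python
from typing import Optional

# State codes as listed for AD_Database_DiskHealthStatus.
DISK_HEALTH_STATES = {0: "Healthy", 1: "Warning", 2: "Unhealthy", 3: "Unknown"}


def db_file_size_growth_mb(previous_mb: Optional[float], current_mb: float) -> Optional[float]:
    """Return the size delta in MB between the previous and current poll.

    Assumption: on the first poll there is no previous value, so no
    growth can be reported and None is returned.
    """
    if previous_mb is None:
        return None
    return current_mb - previous_mb


def describe_disk_health(code: int) -> str:
    """Map a status code to its label.

    Assumption: codes outside 0-3 are treated as "Unknown".
    """
    return DISK_HEALTH_STATES.get(code, "Unknown")


if __name__ == "__main__":
    print(db_file_size_growth_mb(1024.0, 1030.5))
    print(describe_disk_health(2))
```

A negative result from the growth function would mean the database file shrank between polls, for example after an offline defragmentation.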

Agent G2 - AD Performance Counters DotNet v4

Description

Agent G2 - AD Performance Counters DotNet v4

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - AD Performance Counters DotNet v4 | directory.services.ldap.successful.binds.per.sec | Directory Services LDAPSuccessfulbindspersec | NULL | Number of LDAP binds per second.
| directory.services.dra.outbound.values.dns.only.per.sec | Directory Services DRAOutboundvaluesdnsonlypersec | NULL | Number of object property values containing Distinguished Names sent to outbound replication partners. DN-values, such as group or distribution list memberships, are generally more expensive to read than other kinds of values.
| directory.services.dra.inbound.values.dns.only.per.sec | Directory Services DRAInboundvaluesdnsonlypersec | NULL | Number of object property values received from inbound replication partners that are Distinguished Names; i.e., that reference other objects. DN-values, such as group or distribution list memberships, are generally more expensive to apply than other kinds of values.
| directory.services.threads.in.use | Directory Services Threadsinuse | NULL | DS Threads in Use is the current number of threads in use by the directory service (different than the number of threads in the directory service process). Threads in Use is the number of threads currently servicing client API calls and can be used to in
| directory.services.dra.inbound.full.sync.objects.remaining | Directory Services DRAInboundfullsyncobjectsremaining | NULL | Number of objects remaining until the full sync completes (when set).

Agent G2 - Backup Symantec Exec-11-Performance Counters DotNet v4

Description

Template for Symantec Backup Exec. Monitors total bytes, total directories, and total files. Also performs event log monitoring.

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Backup Symantec Exec-11-Performance Counters DotNet v4 | TotalDirectories | TotalDirectories | NULL | The total number of directories that have been backed up since the Backup Exec Engine Service last started.
| TotalFiles | TotalFiles | NULL | The total number of files that have been backed up since the Backup Exec Engine Service last started.
| TotalBytes | TotalBytes | NULL | The total number of bytes that have been backed up since the Backup Exec Engine Service last started.

Agent G2 - Backup-Symantec Exec-12.5-Performance Counters DotNet v4

Description

12.5_Symantec_Backup_Exec. Monitors total bytes, total directories and total files.

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Backup-Symantec Exec-12.5-Performance Counters DotNet v4 | TotalBytes | TotalBytes | NULL | The total number of bytes that have been backed up since the Backup Exec Engine Service last started.
| FailedJobs | FailedJobs | NULL | The number of jobs that have failed since the Backup Exec Engine Service last started.
| TotalDirectories | TotalDirectories | NULL | The total number of directories that have been backed up since the Backup Exec Engine Service last started.
| TotalFiles | TotalFiles | NULL | The total number of files that have been backed up since the Backup Exec Engine Service last started.

Agent G2 - Backup-Veritas-Job Performance Counters DotNet v4

Description

Monitors AbortedJobs, ActiveJobCount, BackupDeviceWaitTime, InUseSkippedObjects, MountTime, TotalExchangeMailboxes, and TotalSQLServerDatabases.

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Backup-Veritas-Job Performance Counters DotNet v4 | BackupDeviceWaitTime | BackupDeviceWaitTime | Seconds | The total time (in seconds) all backup jobs have spent waiting for a storage device since the Backup Exec Engine Service last started.
| AbortedJobs | AbortedJobs | NULL | The number of jobs that have been aborted since the Backup Exec Engine Service last started.
| MountTime | MountTime | Seconds | The total time (in seconds) all jobs have spent waiting for media to be mounted in a storage device since the Backup Exec Engine Service last started.
| InUseSkippedObjects | InUseSkippedObjects | NULL | The number of objects that have been skipped because they were in use during backup since the Backup Exec Engine Service last started.
| FailedJobs | FailedJobs | NULL | The number of jobs that have failed since the Backup Exec Engine Service last started.
| ActiveJobCount | ActiveJobCount | NULL | The number of jobs currently active (running or pending) in the Backup Exec Engine Service.

Agent G2 - Blackberry 501 Performance Counters DotNet v4

Description

Monitor Blackberry 501 Performance Counters

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Blackberry 501 Performance Counters DotNet v4 | MessagesQueuedForDelivery | MessagesQueuedForDelivery | NULL | Blackberry Agent displays queued messages for delivery
| MessagesExpired | MessagesExpired | NULL | Blackberry Agent displays expired messages
| MessagesSent | MessagesSent | NULL | Blackberry Agent displays sent messages
| MessagesReceived | MessagesReceived | NULL | Blackberry Agent displays received messages
| MessagesFiltered | MessagesFiltered | NULL | Blackberry Agent displays filtered messages

Agent G2 - Blackberry Enterprise Server

Description

Template for BlackBerry enterprise server. Monitors messages expired, messages filtered, messages queued for delivery, messages received and messages sent. Also performs event log monitoring.

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Blackberry Enterprise Server | MessagesQueuedForDelivery | MessagesQueuedForDelivery | NULL | Blackberry Agent displays queued messages for delivery
| MessagesExpired | MessagesExpired | NULL | Blackberry Agent displays expired messages
| MessagesSent | MessagesSent | NULL | Blackberry Agent displays sent messages
| MessagesReceived | MessagesReceived | NULL | Blackberry Agent displays received messages
| MessagesFiltered | MessagesFiltered | NULL | Blackberry Agent displays filtered messages

Agent G2 - Blackberry Performance Counters DotNet v4

Description

Monitors Blackberry Performance Counters

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Blackberry Performance Counters DotNet v4 | MessagesQueuedForDelivery | MessagesQueuedForDelivery | NULL | Blackberry Agent displays queued messages for delivery
| MessagesExpired | MessagesExpired | NULL | Blackberry Agent displays expired messages
| MessagesSent | MessagesSent | NULL | Blackberry Agent displays sent messages
| MessagesReceived | MessagesReceived | NULL | Blackberry Agent displays received messages
| MessagesFiltered | MessagesFiltered | NULL | Blackberry Agent displays filtered messages

Agent G2 - Cisco Unity Performance Counters DotNet v4

Description

Monitors Cisco Unity Performance data

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Cisco Unity Performance Counters DotNet v4 | IncomingCallsExternalCurrent | IncomingCallsExternalCurrent | NULL | The current number of incoming calls from external callers.
| PortsIdleCurrent | PortsIdleCurrent | NULL | The current number of integration ports that are not in use by the Cisco Unity Connection server.
| TTSSessionsPersec | TTSSessionsPersec | NULL | The number of active TTS voice sessions per second.
| MessageStoresOfflineCurrent | MessageStoresOfflineCurrent | NULL | The current total of Cisco Unity Message Stores that are offline.
| PortsUsedCurrent | PortsUsedCurrent | NULL | The current number of integration ports that are in use by the Cisco Unity Connection server.
| AverageAuthenticationTime | AverageAuthenticationTime | NULL | Average time taken to authenticate.
| TTSSessionDurationAverage | TTSSessionDurationAverage | NULL | The average duration of all TTS sessions in seconds.
| MessageStoresOnlineCurrent | MessageStoresOnlineCurrent | NULL | The current total of Cisco Unity Message Stores that are online.
| PortsLockedCount | PortsLockedCount | NULL | The current count of the ports that no longer respond or are otherwise unusable by Cisco Unity Connection.
| UnityMTAMessageCountCurrent | UnityMTAMessageCountCurrent | NULL | The number of messages currently queued in the MTA (Message Transfer Agent).
| IncomingCallsDurationAverage | IncomingCallsDurationAverage | NULL | The average duration in seconds of all incoming calls to the Cisco Unity Connection server.
| DirectoryResynchronizationDurationAverage | DirectoryResynchronizationDurationAverage | NULL | The average duration of information directory synchronization, in seconds.
| PortsIdleDurationAverage | PortsIdleDurationAverage | NULL | The average time in seconds that any port remains idle between incoming calls to the Cisco Unity Connection server.
| OutgoingCallsDurationAverage | OutgoingCallsDurationAverage | NULL | The average duration in seconds of all outgoing calls from the Cisco Unity Connection server.

Agent G2 - Citrix Broker Agent DotNet v4

Description

Monitors Citrix Broker Agent role performance Counters

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Citrix Broker Agent DotNet v4 | citrix.broker.agent.total.sessions | CitrixBrokerAgent TotalSessions | NULL | Total Number of Sessions
| citrix.broker.agent.num.of.registrations | CitrixBrokerAgent NumberofRegistrations | NULL | Total Number of Registrations
| citrix.broker.agent.total.app.sessions | CitrixBrokerAgent TotalAppSessions | NULL | Total Number of Seamless App Sessions
| citrix.broker.agent.total.notifications | CitrixBrokerAgent TotalNotifications | NULL | Total Number of Notifications
| citrix.broker.agent.num.of.deregistrations | CitrixBrokerAgent NumberofDeregistrations | NULL | Total Number of DeRegistrations
| citrix.broker.agent.total.desktops.session | CitrixBrokerAgent TotalDesktopsSession | NULL | Total Number of Desktop Sessions

Agent G2 - Citrix Broker Service DotNet v4

Description

Monitors Citrix Broker Service role performance counters

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Citrix Broker Service DotNet v4 | citrix.broker.service.hard.registrations.per.sec | CitrixBrokerService HardRegistrationsPerSec | NULL | Hard Registrations/sec is the rate at which virtual desktop agents hard-register with Citrix Broker Service.

Agent G2 - Citrix Licensing

Description

Template for Citrix Licensing

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Citrix Licensing | Total_License | Total_License | NULL | The sum of the total license count of the Citrix concurrent licenses with the common name available in the Citrix licensing server.
| License_Usage | License_Usage | NULL | The sum of the license usage number of the Citrix concurrent licenses with the common name that are currently in use.
| License_Used_Percentage | License_Used_Percentage | NULL | The percentage of license usage of the Citrix concurrent licenses with the common name in the Citrix licensing server.
| License_SA_Expiry_in_days | License_SA_Expiry_in_days | NULL | The number of days remaining until the Subscription Advantage (SA) expires, for all the possible licenses of the Citrix license server.
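
As a rough illustration of how License_Used_Percentage relates to License_Usage and Total_License, the following Python sketch computes a percentage from the two counts. The function name and the choice to report 0.0 when no licenses are installed are assumptions for illustration, not documented agent behavior.

```python
def license_used_percentage(license_usage: int, total_license: int) -> float:
    """Return license usage as a percentage of the total license count.

    Assumption: when no licenses are installed (total <= 0) the result
    is reported as 0.0 rather than raising an error.
    """
    if total_license <= 0:
        return 0.0
    return 100.0 * license_usage / total_license


if __name__ == "__main__":
    # 45 of 60 concurrent licenses checked out.
    print(license_used_percentage(45, 60))
```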

Agent G2 - Citrix Licensing Performance Counters

Description

Monitors Citrix Licensing Performance data

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Citrix Licensing Performance Counters | Total_License | Total_License | NULL | The sum of the total license count of the Citrix concurrent licenses with the common name available in the Citrix licensing server.
| License_Usage | License_Usage | NULL | The sum of the license usage number of the Citrix concurrent licenses with the common name that are currently in use.
| License_Used_Percentage | License_Used_Percentage | NULL | The percentage of license usage of the Citrix concurrent licenses with the common name in the Citrix licensing server.
| License_SA_Expiry_in_days | License_SA_Expiry_in_days | NULL | The number of days remaining until the Subscription Advantage (SA) expires, for all the possible licenses of the Citrix license server.

Agent G2 - Citrix Performance Counters DotNet v4

Description

These performance counters should be used to monitor the key performance metrics of the Citrix infrastructure, application servers, and virtual desktops.

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Citrix Performance Counters DotNet v4 | citrix.percent.processor.time | Citrix Percent Processor Time | % | Percent Processor Time is the percentage of elapsed time that the processor spends executing a non-idle thread. It is calculated by measuring the time that the idle thread is active in the sample interval and subtracting that time from the interval duration. (Each processor has an idle thread that consumes cycles when no other threads are ready to run.) This counter is the primary indicator of processor activity, and displays the average percentage of busy time observed during the sample interval. It is calculated by monitoring the time that the service is inactive and subtracting that value from 100 percent.
| citrix.logicaldisk.avg.disksecpertransfer | Citrix LogicalDisk Avg DiskSecPerTransfer | MS | The Average Disk Second counters show the average time in seconds of a transfer from or to a disk.
| citrix.logicaldisk.currentdiskqueuelength | Citrix LogicalDisk CurrentDiskQueueLength | NULL | Current disk queue length provides a primary measure of disk congestion. It is an indication of the number of transactions that are waiting to be processed.
| citrix.logicaldisk.avg.disksecperwrite | Citrix LogicalDisk Avg DiskSecPerWrite | MS | The Average Disk Second counters show the average time in seconds of a write to a disk.
| citrix.logicaldisk.percent.disktime | Citrix LogicalDisk Percent DiskTime | % | Percent Disk Time marks how busy the disk is.
| citrix.network.interface.bytestotalpersec | Citrix Network Interface BytesTotalPerSec | NULL | Bytes Total/sec shows the rate at which the network adaptor is processing data bytes. This counter includes all application and file data, in addition to protocol information, such as packet headers.
| citrix.system.processor.queuelength | Citirx System Processor Queue Length | NULL | Processor queue length is the number of threads in the processor queue. Unlike the disk counters, this counter shows ready threads only, not threads that are running. There is a single queue for processor time even on computers with multiple processors. Therefore, if a computer has multiple processors, you need to divide this value by the number of processors servicing the workload. A sustained processor queue of less than ten threads per processor is normally acceptable, depending on the workload.
| citrix.paging.file.percentusage | Citrix Paging File Percent Usage | NULL | The percentage of the Page File instance in use.
| citrix.logicaldisk.percent.freespace | Citrix LogicalDisk Percent FreeSpace | % | Percent Free Space is the percentage of total usable space on the selected logical disk drive that is free.
| citrix.memory.available.bytes | Citrix Memory Available Bytes | % | Available memory indicates the amount of memory that is left after nonpaged pool allocations, paged pool allocations, process working sets, and the file system cache have all taken their piece.
| citrix.logicaldisk.avg.disksecperread | Citrix LogicalDisk Avg DiskSecPerRead | MS | The Average Disk Second counters show the average time in seconds of a read from a disk.
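
The per-processor normalization described for citrix.system.processor.queuelength can be sketched in Python as follows. The function names are hypothetical, and the default limit of ten threads per processor is taken from the guideline in the table; whether a queue is actually acceptable depends on the workload.

```python
def queue_length_per_processor(queue_length: int, processor_count: int) -> float:
    """Divide the system-wide processor queue length by the processor count."""
    if processor_count < 1:
        raise ValueError("processor_count must be at least 1")
    return queue_length / processor_count


def is_queue_acceptable(queue_length: int, processor_count: int,
                        limit_per_processor: float = 10.0) -> bool:
    """Apply the 'less than ten threads per processor' guideline.

    Assumption: the value passed in is a sustained reading, not a
    momentary spike.
    """
    return queue_length_per_processor(queue_length, processor_count) < limit_per_processor


if __name__ == "__main__":
    # 24 ready threads on a 4-processor server is 6 per processor.
    print(queue_length_per_processor(24, 4), is_queue_acceptable(24, 4))
```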

Agent G2 - Citrix XenApp 7.5 DotNet v4

Description

Monitor Citrix XenApp Server 7.5 version performance counters.

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Citrix XenApp 7.5 DotNet v4 | Citrix_Conf_Logging_Database_State | Citrix_Conf_Logging_Database_State | NULL | Database Connected indicates whether this service is in contact with its database (1 is connected; 0 is not connected).

Agent G2 - Citrix XenApp 7.6 DotNet v4

Description

Monitor Citrix XenApp Server 7.6 version performance counters.

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Citrix XenApp 7.6 DotNet v4 | citrix.conf.loggingdatabase.avgtransactiontime | Citirx Conf Logging Database Avg Transaction Time | NULL | The time on average, in seconds, taken to execute a database transaction. A baseline needs to be established in the environment in order to accurately establish threshold values.
| citrix.monitor.database.connected | Citrix Monitor Database Connected | NULL | Database Connected indicates whether this service is in contact with its database (1 is connected; 0 is not connected).
| citrix.conf.loggingdatabase.transactionerrorspersec | Citrix Conf Logging Database Transaction Errors Per sec | NULL | The rate at which database transactions are failing.
| citrix.env.test.database.connected | Citrix Env Test Database Connected | NULL | Database Connected indicates whether this service is in contact with its database (1 is connected; 0 is not connected).

Agent G2 - Citrix XenApp Advanced Performance Check

Description

XenApp Advanced Monitoring

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Citrix XenApp Advanced Performance Check | XADisconnectedSessions | XADisconnectedSessions | Count | Lists sessions in the disconnected state. This helps identify whether there are any sporadic disconnects from the server.
| XAOfflineServers | XAOfflineServers | Count | A server can be taken offline either for maintenance or because the server is unstable and has been removed from the available servers. These servers will not service any user requests.
| XAServerLoad | XAServerLoad | NULL | Displays the load on each Citrix XenApp server. These load values can identify issues with your servers in addition to determining which server is the least or most loaded in your farm. Trending this value helps identify peak usage of the servers. A value greater than 9000 indicates a heavily loaded server.
| XAServerApplication | XAServerApplication | Count | Number of applications hosted on each server. This helps you decide whether the load is distributed evenly across all your servers.
| XASessions | XASessions | Count | Number of active sessions per server.
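
To show how XAServerLoad readings might be interpreted, here is a small Python sketch that picks out the least and most loaded servers and flags those above the 9000 mark mentioned in the table. The function name, the server names, and the input shape (a mapping of server name to load value) are hypothetical.

```python
from typing import Dict, List, Union

# "A value greater than 9000 indicates a heavily loaded server."
HEAVY_LOAD_THRESHOLD = 9000


def summarize_server_load(loads: Dict[str, int]) -> Dict[str, Union[str, List[str]]]:
    """Summarize per-server load values collected for a farm."""
    if not loads:
        raise ValueError("loads must contain at least one server")
    return {
        "least_loaded": min(loads, key=loads.get),
        "most_loaded": max(loads, key=loads.get),
        "heavily_loaded": sorted(
            name for name, load in loads.items() if load > HEAVY_LOAD_THRESHOLD
        ),
    }


if __name__ == "__main__":
    # Hypothetical farm with three servers.
    print(summarize_server_load({"xa01": 1200, "xa02": 9500, "xa03": 4300}))
```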

Agent G2 - Citrix XenApp Performance Check DotNet v4

Description

XenApp Standard Monitoring

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Citrix XenApp Performance Check DotNet v4 | LicenseServerConnectionFailure | LicenseServerConnectionFailure | NULL | The number of minutes that the XenApp server has been disconnected from the License Server. The time returned should be less than 30 minutes. Ideally the returned time should be zero.
| WorkItemQueueExecutingCount | WorkItemQueueExecutingCount | NULL | The number of work items that are currently being executed.
| DataStorewritesPersec | DataStorewritesPersec | NULL | The number of times data was written to the data store per second.
| ZoneElectionsTriggered | ZoneElectionsTriggered | NULL | The number of times a server triggers a zone election.
| STATicketTimeoutCount | STATicketTimeoutCount | NULL | The total number of ticket time-outs that occur during the lifetime of the STA.
| AverageLicenseCheckInResponseTimems | AverageLicenseCheckInResponseTimems | NULL | The average response time for a license check-in operation in milliseconds.
| LatencySessionDeviation | LatencySessionDeviation | NULL | The difference between the minimum and the maximum session latency values. This value should be as low as possible.
| BytesReceivedPersec | Bytes Received Per sec | NULL | The data rate of incoming Independent Management Architecture (IMA) network traffic.
| NetworkConnections | NetworkConnections | NULL | The number of active network IMA connections to IMA servers.
| LocalHostCachewritesPersec | LocalHostCachewritesPersec | NULL | The number of times data was written to the IMA local host cache per second.
| ResolutionWorkItemQueueExecutingCount | ResolutionWorkItemQueueExecutingCount | NULL | The number of work items that are currently being executed.
| WorkItemQueuePendingCount | WorkItemQueuePendingCount | NULL | The number of work items that are not yet ready to be executed.
| NumberofbusyXMLthreads | NumberofbusyXMLthreads | NULL | The number of busy threads.
| DataStorereadsPersec | DataStorereadsPersec | NULL | The number of times data was read from the data store per second.
| CPUEntitlement | CPUEntitlement | NULL | The percentage of CPU resource that Citrix CPU Utilization Management makes available to a user at a given time.
| CPUUsage | CPUUsage | NULL | The percentage of CPU resource consumed by a user at a given time, averaged over a few seconds.
| STAPeakTicketRequestRate | STAPeakTicketRequestRate | NULL | The maximum rate of ticket generation requests per second during the lifetime of the STA.
| LongtermCPUUsage | LongtermCPUUsage | NULL | The percentage of CPU resource consumed by a user, averaged over a longer period than the CPU Usage counter.
| ApplicationResolutionsPersec | ApplicationResolutionsPersec | NULL | The number of resolutions completed per second.
| DynamicStorewritesPersec | DynamicStorewritesPersec | NULL | The number of times data was written to the dynamic store per second.
| ApplicationEnumerationsPersec | ApplicationEnumerationsPersec | NULL | Enumeration is the process in which a client transmits data to locate servers on the network and retrieves information about the server farm's published applications. During enumeration the XenApp Plug-in for Hosted Apps communicates with the Citrix XML Service or the ICA browser, depending on the browsing protocol selected in the plug-in. This monitor provides the number of application enumerations per second.
| DataStoreConnectionFailure | DataStoreConnectionFailure | NULL | The number of minutes that the XenApp server has been disconnected from the data store. This value should be zero at all times.
| LastRecordedLicenseCheckOutResponseTimems | LastRecordedLicenseCheckOutResponseTimems | NULL | The last recorded license check-out response time in milliseconds.
| WorkItemQueueReadyCount | WorkItemQueueReadyCount | NULL | The number of work items that are ready to be executed.
| STAPeakAllRequestRate | STAPeakAllRequestRate | NULL | Secure Ticket Authority (STA) is responsible for issuing session tickets in response to connection requests for published resources on XenApp. These session tickets form the basis of authentication and authorization for access to published resources.
| LatencySessionAverage | LatencySessionAverage | NULL | The average client latency over the lifetime of a session.
| LatencyLastRecorded | LatencyLastRecorded | NULL | The last recorded latency value of the session.
| ICARoundtripLatencyMedian | ICARoundtripLatencyMedian | NULL | The median time of ICA roundtrip latency for all sessions on the server.
| BytesSentPersec | BytesSentPersec | NULL | The data rate of outgoing IMA network traffic.
| STAPeakTicketRefreshRate | STAPeakTicketRefreshRate | NULL | The maximum rate of refresh requests per second during the lifetime of the STA.
| ResolutionWorkItemQueueReadyCount | ResolutionWorkItemQueueReadyCount | NULL | The number of work items that are ready to be executed.
| CPUReservation | CPUReservation | NULL | The percentage of total computer CPU resource reserved for a user, should that user require it.
| CPUShares | CPUShares | NULL | The proportion of CPU resource assigned to a user.
| LocalHostCachereadsPersec | LocalHostCachereadsPersec | NULL | The number of times data was read from the IMA local host cache per second.
| MaximumnumberofXMLthreads | MaximumnumberofXMLthreads | NULL | The maximum number of threads allocated to service Web-based sessions since the server restarted.
| AverageLicenseCheckOutResponseTimems | AverageLicenseCheckOutResponseTimems | NULL | The average response time for a license check-out operation in milliseconds.
| ApplicationResolutionsFailedPersec | ApplicationResolutionsFailedPersec | NULL | The number of application resolutions failed per second.
| DynamicStorereadsPersec | DynamicStorereadsPersec | NULL | The number of times data was read from the dynamic store per second.
| STAPeakDataRequestRate | STAPeakDataRequestRate | NULL | The maximum rate of data requests per second during the lifetime of the STA.
| ZoneElectionsWon | ZoneElectionsWon | NULL | The number of times a server wins a zone election.
| ApplicationResolutionTimems | ApplicationResolutionTimems | ms | The time in milliseconds that a resolution took to complete. A baseline would be needed in order to establish increases during peak logon times before an accurate threshold can be defined.
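
LatencySessionDeviation is described above as the difference between the minimum and maximum session latency values. A minimal Python sketch of that arithmetic follows; the function name and the sample values are hypothetical, and the counter itself is supplied by Citrix rather than computed this way by the agent.

```python
from typing import Sequence


def latency_session_deviation(samples_ms: Sequence[float]) -> float:
    """Return the spread between the highest and lowest latency sample."""
    if not samples_ms:
        raise ValueError("at least one latency sample is required")
    return max(samples_ms) - min(samples_ms)


if __name__ == "__main__":
    # Hypothetical latency readings for one session, in milliseconds.
    print(latency_session_deviation([40, 55, 48]))
```

A single sample yields a deviation of zero, which matches the guidance that the value should be as low as possible for a stable session.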

Agent G2 - Citrix XenDesktop Advanced Performance Check

Description

Citrix XenDesktop advanced monitoring based on performance counters.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Citrix XenDesktop Advanced Performance CheckDesktopsWithRegistrationStateAsAgentErrorDesktopsWithRegistrationStateAsAgentErrorNULLLists all Virtual Desktops which have not registered with the controller due to Agent Error. The state of being in communication is referred to as the VDA being registered with a controller. If communication fails for any reason the VDA is said to have failed to register with a controller and it will not be possible for DDC to broker a connection to the VDM in question; the VDM becomes a wasted resource.
DesktopsUnRegisteredDesktopsUnRegisteredNULLLists all Virtual Desktops in the Unregistered state. The state of being in communication is referred to as the VDA being registered with a controller. If communication fails for any reason the VDA is said to have failed to register with a controller and it will not be possible for DDC to broker a connection to the VDM in question; the VDM becomes a wasted resource.
DesktopsWithUnknownPowerStateDesktopsWithUnknownPowerStateNULLThe Virtual desktop power state and issuing of power commands to the VM depends on the DDC services being able to communicate with the hypervisor that hosts the VM. The unknown power state is a result of the DDC never having received a notification of the power state of the VM from the hypervisor (hence state unknown).
DesktopsNeverRegisteredDesktopsNeverRegisteredNULLVirtual desktop machine which never registered with the Desktop Controllers. An unregistered desktop cannot be used by an end-user.
TotalDesktopsTotalDesktopsNULLThe total number of desktops in a given Desktop Group.
BrokerHypervisorAlertSeverityRedBrokerHypervisorAlertSeverityRedNULLLists the current alert objects reported by the hypervisors that the controller is monitoring.
DesktopsFacingICALatencyDesktopsFacingICALatencyNULLDesktop Receiver on the client PCs communicates with the Virtual Desktop Agent (VDA) on the Virtual Desktops using the ICA (Independent Computing Architecture) protocol. High latency on this communication channel results in slow or frozen sessions.
DesktopsDisconnectedDesktopsDisconnectedNULLNumber of Virtual Desktops whose summary state is Disconnected.
DesktopsFacingHighProfileLoadTimeDesktopsFacingHighProfileLoadTimeNULLDesktop Receiver on the client PCs communicates with the Virtual Desktop Agent (VDA) on the Virtual Desktops using the ICA (Independent Computing Architecture) protocol. Profile load time is directly affected by the profile size. A high profile load time results in users experiencing logon delays.
DesktopsAvailableDesktopsAvailableNULLThe number of desktops available in a desktop group.
DesktopGroupsUsageDesktopGroupsUsageNULL% of virtual desktop machines used within a group.
DesktopsInUseDesktopsInUseNULLThe number of desktops currently in use in a given Desktop Group.
ActiveSessionsActiveSessionsNULLThe number of Desktop sessions currently streamed.
DesktopsinMaintenanceModeDesktopsinMaintenanceModeNULLPutting a desktop in maintenance mode temporarily stops connections to the desktop so that maintenance tasks can be carried out. A user trying to connect to a desktop in maintenance mode will receive a message telling them the desktop is currently unavailable and to try reconnecting. XenDesktop has no control over desktops in maintenance mode. No user can log on to a desktop in this state. If a user is already logged on, maintenance mode takes effect as soon as they log off.
DesktopsWithImageOutOfDateDesktopsWithImageOutOfDateNULLShows Desktop Groups that have desktops with out-of-date images.
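
The DesktopGroupsUsage percentage can be related to the DesktopsInUse and TotalDesktops counts from the same table. The sketch below shows one plausible way to compute it; the function name and the exact formula are assumptions for illustration, and the monitor may calculate the value differently.

```python
def desktop_group_usage(desktops_in_use, total_desktops):
    """Percentage of desktops in a Desktop Group that are currently in use.

    Hypothetical helper: assumes usage = in use / total * 100.
    """
    if total_desktops <= 0:
        # An empty group has no meaningful usage; report 0 rather than divide by zero.
        return 0.0
    return 100.0 * desktops_in_use / total_desktops


# Example with made-up counts: 45 of 60 desktops in use.
print(f"{desktop_group_usage(45, 60):.1f}% of the group is in use")
```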

Agent G2 - Citrix XenDesktop Performance Counters

Description

Monitors the XenDesktop system, power status, available system count, Delivery Group desktops available count, and unregistered count, along with Controller details such as state, services, and licensing details. Applicable to XenApp/XenDesktop 7.x versions.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Citrix XenDesktop Performance Countersbroker.desktop.group.desktops.availableBrokerDesktopGroup DesktopsAvailable%Number of available desktops
broker.desktop.group.desktops.disconnectedBrokerDesktopGroup DesktopsDisconnected%Number of disconnected desktops
broker.catalog.available.countBrokerCatalog AvailableCountNULLNumber of available systems
broker.desktop.group.desktops.unregisteredBrokerDesktopGroup DesktopsUnregistered%Number of unregistered desktops
broker.desktop.group.desktops.never.registeredBrokerDesktopGroup Desktops NeverRegistered%Number of desktops never registered
broker.desktop.group.desktops.preparingBrokerDesktopGroup DesktopsPreparing%Number of desktops in preparing state

Agent G2 - Citrix XenDesktop Status and Performance Check DotNet v4

Description

Applicable on XenDesktop - Desktop Delivery Controller

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Citrix XenDesktop Status and Performance Check DotNet v4RegistrationRequestsPersecRegistrationRequestsPersecNULLRegistration Requests/sec is the rate at which Citrix Broker Service receives registration requests from virtual desktops.
RegistrationAvgRequestTimeRegistrationAvgRequestTimeNULLRegistration Avg. Request Time is the average time, in seconds, taken to process a virtual desktop registration request in Citrix Broker Service. This delay is a one-time daily expense.
BrokeredSessionsBrokeredSessionsNULLBrokered Sessions is the number of virtual desktop sessions brokered by the Citrix Broker Service.

Agent G2 - CPU - Run Queue Monitor DotNet v4

Description

Number of threads in the queue waiting for processor time. The threshold for this metric is based on the number of processors on the system. The ideal value ranges from one to three threads per processor.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - CPU - Run Queue Monitor DotNet v4cpu.run.queue.monitorCPU Run Queue MonitorNULLNumber of threads in the queue waiting for processor time. The threshold for this metric is based on the number of processors on the system. The ideal value ranges from one to three threads per processor.
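
Because the run queue threshold scales with the processor count, it can help to see the arithmetic spelled out. The following minimal sketch assumes the upper bound of three threads per processor from the description; the function names are hypothetical and do not reflect the agent's implementation.

```python
import os


def run_queue_threshold(processor_count, threads_per_processor=3):
    """Upper bound of the ideal run queue length for a given processor count."""
    return processor_count * threads_per_processor


def is_run_queue_high(queue_length, processor_count, threads_per_processor=3):
    """True when the observed queue length exceeds the ideal upper bound."""
    return queue_length > run_queue_threshold(processor_count, threads_per_processor)


# os.cpu_count() may return None, so fall back to a single processor.
cpus = os.cpu_count() or 1
print(f"{cpus} processors: alert above {run_queue_threshold(cpus)} queued threads")
```

For example, on an 8-processor system this rule yields a threshold of 24 queued threads.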

Agent G2 - DB-Oracle DotNet v4

Description

Monitors Oracle Performance Counters

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - DB-Oracle DotNet v4Oracle_UserRollbacksOracle User RollbacksCountValidates the number of user rollbacks.
Oracle_DataFileDiskWritesOracle Data File Disk WritesCountValidates the number of data file disk writes to the database.
Oracle_CacheInvalidationsOracle Cache InvalidationsCountValidates how many cache invalidations occurred on a particular database.
Oracle_TableSpaceFreeOracle Table Space FreeMBValidates the free tablespace available.
Oracle_UsersCommitOracle Users CommitCountValidates the number of user commits.
Oracle_TableSpaceAllocatedOracle Table Space AllocatedMBValidates the size allocated for the tablespace by the database.
Oracle_SessionsOracle SessionsCountValidates how many sessions are currently on a particular database.
Oracle_DataFileDiskReadsOracle Data File Disk ReadsCountValidates the number of data file disk reads by the database.
Oracle_LibraryCacheReloadsOracle Library Cache ReloadsCountValidates the number of library cache reloads by the database.
Oracle_DataFilelesizeAllocatedOracle Data File size AllocatedMBValidates the data file size allocated for the database.
Oracle_LibraryCacheGetsOracle Library Cache GetsCountValidates the number of library cache gets by the database.
Oracle_LongRunningQueriesOracle Long Running QueriesCountValidates how many long-running queries are on a particular database.
Oracle_TablescanBlocksOracle Table scan BlocksCountValidates the number of table scan blocks by the database.
Oracle_BlockingLockQueriesOracle Blocking Lock QueriesCountValidates how many blocking lock queries are on a particular database.
oracle_processesOracle ProcessesCountValidates how many processes are on a particular database.

Agent G2 - Dell Hardware Health - WMI DotNet v4

Description

Monitors the Dell hardware health parameters like CPU Status, Memory status, Fan status, Fan reading, Temperature status, Temperature reading, Power consumption watts sensor status, Power consumption watts sensor reading, Power consumption amps sensor status, Power consumption amps sensor reading, voltage status and voltage reading.

Prerequisites

Validated on Dell PowerEdge R710, Microsoft Windows Server 2008 R2 Standard Edition Service Pack 1, 64-bit.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Dell Hardware Health - WMI DotNet v4dell.voltage.sensor.readingVoltage ReadingNULLDell Voltage sensor current reading in millivolts.
dell.temperature.readingTemperature ReadingNULLDell Temperature sensor current reading in centigrade.
dell.power.consumption.ampssensor.readingPower Consumption Amps Sensor ReadingNULLDell Power consumption amps sensor current reading.

Agent G2 - DFS NameSpace Replication Performance Counters DotNet v4

Description

DFS Monitoring - Namespace and Replication Check

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - DFS NameSpace Replication Performance Counters DotNet v4dfsnamespaceserviceapirequests.requestsprocessedDFSNamespaceServiceAPIRequests RequestsProcessedNULLRequests Processed shows the number of requests to one API that were processed by the DFS Namespace service.
dfsreplicatedfolders.deletedspaceinuseDFSReplicatedFolders DeletedSpaceInUseNULLDeleted Space in Use shows the total size (in bytes) of the deleted files and folders currently in the Conflict and Deleted folder used by the DFS Replication service. The DFS Replication service detects remote deletes from its sending partner and moves the file or folder to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicatedfolders.deletedfilesgeneratedDFSReplicatedFolders DeletedFilesGeneratedNULLDeleted Files Generated shows the number of replicated deleted files and folders that were moved to the Conflict and Deleted folder after they were deleted from a replicated folder on a sending member. The DFS Replication service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicationconnections.totalfilesreceivedDFSReplicationConnections TotalFilesReceivedNULLTotal Files Received shows the number of files that were received on the connection.
dfsreplicatedfolders.conflictfoldercleanupscompletedDFSReplicatedFolders ConflictFolderCleanupsCompletedNULLConflict Folder Cleanups Completed shows the number of times conflict loser files and folders in the Conflict and Deleted folder were deleted by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicationconnections.rdcbytesreceivedDFSReplicationConnections RDCBytesReceivedNULLRDC Bytes Received shows the bytes that were received on this connection while replicating files using remote differential compression (RDC). This is the actual bytes received over the network without the networking protocol overhead.
dfsreplicatedfolders.stagingspaceinuseDFSReplicatedFolders StagingSpaceInUseNULLStaging Space In Use shows the total size (in bytes) of the files and folders currently in the staging folder used by the DFS Replication service. This counter will fluctuate as staging space is reclaimed. The DFS Replication service stages files and folders in the staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota.
dfsreplicationconnections.sizeoffilesreceivedDFSReplicationConnections SizeofFilesReceivedNULLSize of Files Received shows the uncompressed size (in bytes) of the files received on this connection. This is the number of bytes that would have been received had DFS Replication compression not been used.
dfsnamespaceservicereferrals.requestsprocessedpersecDFSNamespaceServiceReferrals RequestsProcessedPersecNULLRequests Per Sec. shows the number of referral requests per second that were processed by the DFS Namespace service.
dfsreplicationservicevolumes.usnjournalrecordsreadDFSReplicationServiceVolumes USNJournalRecordsReadNULLUSN Journal Records Read shows the number of update sequence number (USN) journal records that were read by the DFS Replication service.
dfsreplicatedfolders.deletedbytesgeneratedDFSReplicatedFolders DeletedBytesGeneratedNULLDeleted Bytes Generated shows the total size (in bytes) of replicated deleted files and folders that were moved to the Conflict and Deleted folder after they were deleted from a replicated folder on a sending member. The DFS Replication service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicatedfolders.stagingfilescleanedupDFSReplicatedFolders StagingFilesCleanedupNULLStaging Files Cleaned up shows the number of files and folders that were cleaned up from the staging folder by the DFS Replication service. The DFS Replication service stages files and folders in the staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota.
dfsreplicationconnections.rdcsizeoffilesreceivedDFSReplicationConnections RDCSizeofFilesReceivedNULLRDC Size of Files Received shows the uncompressed size (in bytes) of files received with remote differential compression (RDC) for this connection. This is the number of bytes that would have been received had neither compression nor RDC been used. This is not the actual number of bytes received over the network.
dfsreplicatedfolders.conflictspaceinuseDFSReplicatedFolders ConflictSpaceInUseNULLConflict Space in Use shows the total size (in bytes) of the conflict loser files and folders currently in the Conflict and Deleted folder used by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicatedfolders.rdcbytesreceivedDFSReplicatedFolders RDCBytesReceivedNULLRDC Bytes Received shows the number of bytes that were received in replicating files using remote differential compression (RDC) for this replicated folder. This is the actual bytes received over the network without the networking protocol overhead.
dfsreplicationconnections.compressedsizeoffilesreceivedDFSReplicationConnections CompressedSizeofFilesReceivedNULLCompressed Size of Files Received shows the compressed size of files (in bytes) received on the connection.
dfsreplicatedfolders.conflictbytesgeneratedDFSReplicatedFolders ConflictBytesGeneratedNULLConflict Bytes Generated shows the total size (in bytes) of the files and folders in this replicated folder that were moved to the Conflict and Deleted folder by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicatedfolders.rdcnumberoffilesreceivedDFSReplicatedFolders RDCNumberofFilesReceivedNULLRDC Number of Files Received shows the number of files that were received for this replicated folder.
dfsreplicatedfolders.conflictfilescleanedupDFSReplicatedFolders ConflictFilesCleanedupNULLConflict Files Cleaned up shows the number of conflict loser files and folders that were deleted from the Conflict and Deleted folder by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicatedfolders.fileinstallssucceededDFSReplicatedFolders FileInstallsSucceededNULLFile Installs Succeeded shows the number of files that were successfully received from sending members and installed locally on this server. The DFS Replication service replicates staged files into the staging folder, uncompresses them in the Installing folder, and renames them to the target location. The second and third steps of this process are known as installing the file.
dfsreplicatedfolders.conflictbytescleanedupDFSReplicatedFolders ConflictBytesCleanedupNULLConflict Bytes Cleaned up shows the total size (in bytes) of the conflict loser files and folders that were deleted from the Conflict and Deleted folder by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicatedfolders.stagingbytescleanedupDFSReplicatedFolders StagingBytesCleanedupNULLStaging Bytes Cleaned up shows the total size (in bytes) of the files and folders that were cleaned up from the staging folder by the DFS Replication service. The DFS Replication service stages files and folders in the staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota.
dfsreplicationservicevolumes.databasecommitsDFSReplicationServiceVolumes DatabaseCommitsNULLDatabase Commits shows the number of database commit operations performed by the DFS Replication service. This counter indicates how intensive the DFS Replication service is from a database perspective.
dfsreplicatedfolders.fileinstallsretriedDFSReplicatedFolders FileInstallsRetriedNULLFile Installs Retried shows the number of file installs that are being retried due to sharing violations or other errors encountered when installing the files. The DFS Replication service replicates staged files into the staging folder, uncompresses them in the Installing folder, and renames them to the target location. The second and third steps of this process are known as installing the file.
dfsreplicatedfolders.stagingfilesgeneratedDFSReplicatedFolders StagingFilesGeneratedNULLStaging Files Generated shows the number of times replicated files and folders were staged by the DFS Replication service. The DFS Replication service stages files and folders in a staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota.
dfsreplicatedfolders.totalfilesreceivedDFSReplicatedFolders TotalFilesReceivedNULLTotal Files Received shows the number of files that were received by this replicated folder.
dfsnamespaceservicereferrals.requestsprocessedDFSNamespaceServiceReferrals RequestsProcessedNULLRequests Processed shows the number of referral requests that were processed by the DFS Namespace service.
dfsreplicatedfolders.deletedbytescleanedupDFSReplicatedFolders DeletedBytesCleanedupNULLDeleted Bytes Cleaned up shows the total size (in bytes) of replicated deleted files and folders that were cleaned up from the Conflict and Deleted folder by the DFS Replication service. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicatedfolders.rdccompressedsizeoffilesreceivedDFSReplicatedFolders RDCCompressedSizeofFilesReceivedNULLRDC Compressed Size of Files Received shows the compressed size (in bytes) of the files received with remote differential compression (RDC) for this replicated folder. This is the number of bytes that would have been received had RDC not been used. This is not the actual bytes received over the network.
dfsreplicationconnections.bytesreceivedpersecondDFSReplicationConnections BytesReceivedPerSecondNULLBytes Received Per Second shows an estimate of the average number of bytes that were received each second over the past 30 seconds.
dfsreplicationservicevolumes.usnjournalunreadpercentageDFSReplicationServiceVolumes USNJournalUnreadPercentageNULLUSN Journal Unread Percentage shows the percent of the update sequence number (USN) journal that has not yet been read and processed by the DFS Replication service. A journal wrap will occur if this counter reaches 100.
dfsreplicatedfolders.compressedsizeoffilesreceivedDFSReplicatedFolders CompressedSizeofFilesReceivedNULLCompressed Size of Files Received shows the compressed size of files (in bytes) received for this replicated folder.
dfsreplicatedfolders.deletedfilescleanedupDFSReplicatedFolders DeletedFilesCleanedupNULLDeleted Files Cleaned up shows the number of replicated deleted files and folders that were cleaned up from the Conflict and Deleted folder by the DFS Replication service. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsnamespaceservicereferrals.requestsfailedDFSNamespaceServiceReferrals RequestsFailedNULLRequests Failed shows the number of referral requests that were failed by the DFS Namespace service.
dfsreplicatedfolders.updatesdroppedDFSReplicatedFolders UpdatesDroppedNULLUpdates Dropped shows the number of redundant file replication update records that were ignored by the DFS Replication service because they did not change the replicated file or folder. For example, dropped updates can occur when access control lists (ACLs) are overwritten with identical ACLs on a file or folder.
dfsreplicationconnections.rdccompressedsizeoffilesreceivedDFSReplicationConnections RDCCompressedSizeofFilesReceivedNULLRDC Compressed Size of Files Received shows the compressed size (in bytes) of files received with remote differential compression (RDC) for this connection.
dfsnamespaceserviceapirequests.requestsfailedDFSNamespaceServiceAPIRequests RequestsFailedNULLRequests Failed shows the number of requests to one API that were failed by the DFS Namespace service.
dfsreplicatedfolders.bandwidthsavingsusingdfsreplicationDFSReplicatedFolders BandwidthSavingsUsingDFSReplicationNULLBandwidth Savings Using DFS Replication shows the percentage of bandwidth that was saved by the DFS Replication service for this replicated folder using a combination of remote differential compression (RDC) and other compression technologies that minimize network bandwidth. For example, a value of 20 indicates that the DFS Replication service used 20% less bandwidth than it would have used if it had transmitted the entire files uncompressed over the network.
dfsnamespaceservicereferrals.avgresponsetimeDFSNamespaceServiceReferrals AvgResponseTimeNULLAvg Response Time shows the average response time to the referral requests that were processed by the DFS Namespace service.
dfsreplicationconnections.rdcnumberoffilesreceivedDFSReplicationConnections RDCNumberofFilesReceivedNULLRDC Number of Files Received shows the number of files that were received on this connection.
dfsreplicationconnections.bandwidthsavingsusingdfsreplicationDFSReplicationServiceVolumes BandwidthSavingsUsingDFSReplicationNULLBandwidth Savings Using DFS Replication shows the percentage of bandwidth that was saved by the DFS Replication service for this connection using a combination of remote differential compression (RDC) and other compression technologies that minimize network bandwidth use. For example, a value of 20 indicates that the DFS Replication service used 20% less bandwidth than it would have used if it had transmitted the entire files uncompressed over the network.
dfsnamespaceserviceapirequests.requestsprocessedpersecDFSNamespaceServiceAPIRequests RequestsProcessedPersecNULLRequests Per Sec. Rate shows the number of API requests per second that were processed by the DFS Namespace service.
dfsreplicationconnections.totalbytesreceivedDFSReplicationConnections TotalBytesReceivedNULLTotal Bytes Received shows the total number of bytes received on the connection. The bytes received value includes file data and replication metadata.
dfsreplicatedfolders.conflictfilesgeneratedDFSReplicatedFolders ConflictFilesGeneratedNULLConflict Files Generated shows the number of files and folders in this replicated folder that were moved to the Conflict and Deleted folder by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota.
dfsreplicationservicevolumes.usnjournalrecordsacceptedDFSReplicationServiceVolumes USNJournalRecordsAcceptedNULLUSN Journal Records Accepted shows the number of update sequence number (USN) journal records that were processed by the DFS Replication service. The DFS Replication service processes all USN journal records for replicated content on a volume and ignores records for non-replicated files and folders on the volume.
dfsnamespaceserviceapirequests.avgresponsetimeDFSNamespaceServiceAPIRequests AvgResponseTimeNULLAvg Response Time shows the average response time to the requests to one API that were processed by the DFS Namespace service.
dfsnamespace.foldercountDFSNamespace FolderCountNULLFolder count shows the number of DFS folders or links in a namespace.
dfsreplicatedfolders.sizeoffilesreceivedDFSReplicatedFolders SizeofFilesReceivedNULLSize of Files Received shows the uncompressed size (in bytes) of the files received for this replicated folder. This is the number of bytes that would have been received had DFS Replication compression not been used.
dfsreplicatedfolders.rdcsizeoffilesreceivedDFSReplicatedFolders RDCSizeofFilesReceivedNULLRDC Size of Files Received shows the uncompressed size (in bytes) of the files received with remote differential compression (RDC) for this replicated folder. This is the number of bytes that would have been received had neither compression nor RDC been used. This is not the actual bytes received over the network.
dfsreplicationservicevolumes.databaselookupsDFSReplicationServiceVolumes DatabaseLookupsNULLDatabase Lookups shows the number of database search operations performed by the DFS Replication service. This counter indicates how intensive the DFS Replication service is from a database perspective.
dfsreplicatedfolders.stagingbytesgeneratedDFSReplicatedFolders StagingBytesGeneratedNULLStaging Bytes Generated shows the total size (in bytes) of replicated files and folders in the staging folder created by the DFS Replication service since the last restart; it is a monotonically increasing counter. The DFS Replication service stages files and folders in the staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota.
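
The Bandwidth Savings Using DFS Replication descriptions in the table above give the example that a value of 20 means 20% less bandwidth than sending the files uncompressed. The sketch below works through that arithmetic. The function and parameter names are hypothetical, and the formula is an assumption inferred from the description rather than the counter's documented calculation.

```python
def bandwidth_savings_percent(uncompressed_bytes, bytes_on_wire):
    """Percentage of bandwidth saved relative to sending files uncompressed.

    Hypothetical helper: assumes savings = (uncompressed - sent) / uncompressed.
    """
    if uncompressed_bytes <= 0:
        # Nothing was replicated, so there is nothing to save.
        return 0.0
    return 100.0 * (uncompressed_bytes - bytes_on_wire) / uncompressed_bytes


# Made-up figures: 1000 bytes of file data, 800 bytes actually transmitted.
print(f"Savings: {bandwidth_savings_percent(1000, 800):.0f}%")
```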

Agent G2 - Disk Performance Monitoring DotNet v4

Description

Disk Performance Monitoring

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Disk Performance Monitoring DotNet v4AverageDiskWriteQueueLengthAverageDiskWriteQueueLengthNULLAvg. Disk Write Queue Length is the average number of write requests that were queued for the selected disk during the sample interval.
AvgDisksecPerReadAvgDisksecPerReadNULLAvg. Disk sec/Read is the average time, in seconds, of a read of data from the disk.
PercentDiskTimePercentDiskTimeNULL% Disk Time is the percentage of elapsed time that the selected disk drive was busy servicing read or write requests.
AverageDiskReadQueueLengthAverageDiskReadQueueLengthNULLAvg. Disk Read Queue Length is the average number of read requests that were queued for the selected disk during the sample interval.
AvgDisksecPerWriteAvgDisksecPerWriteNULLAvg. Disk sec/Write is the average time, in seconds, of a write of data to the disk.
AvgDiskBytesPerTransferAvgDiskBytesPerTransferNULLAvg. Disk Bytes/Transfer is the average number of bytes transferred to or from the disk during write or read operations. The disk is efficient if it transfers large amounts of data relatively quickly.
AverageDiskQueueLengthAverageDiskQueueLengthNULLAvg. Disk Queue Length is the average number of both read and write requests that were queued for the selected disk during the sample interval.

Agent G2 - DNS Performance Counters DotNet v4

Description

Monitors DNS performance counters

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - DNS Performance Counters DotNet v4dns.dynamic.update.requests.receivedDNS DynamicUpdateRequestsReceivedNULLDynamic Update Received is the total number of dynamic update requests received by the DNS server.
dns.total.queries.received.per.secDNS TotalQueriesReceivedPerSecNULLTotal Query Received/sec is the average number of queries received by DNS server in each second.
dns.dynamic.update.requests.empty.persecDNS DynamicUpdateRequestsEmptyPerSecNULLDynamic Update NoOperation/sec is the average number of No-operation/Empty dynamic update requests received by the DNS server in each second.
dns.nb.stat.memory.usedDNS NbStatMemoryUsedNULLNbstat Memory is the total Nbstat memory used by DNS server.
dns.udp.queries.received.per.secDNS UDPQueriesReceivedPerSecNULLUDP Query Received/sec is the average number of UDP queries received by DNS server in each second.
dns.zone.transfers.successDNS ZoneTransfersSuccessNULLZone Transfer Success is the total number of successful zone transfers of the master DNS server.
dns.tcp.queries.received.per.secDNS TCPQueriesReceivedPerSecNULLTCP Query Received/sec is the average number of TCP queries received by DNS server in each second.
dns.caching.memory.usedDNS CachingMemoryUsedNULLCaching Memory is the total caching memory used by DNS server.
dns.secure.updates.failedDNS SecureUpdatesFailedNULLSecure Update Failure is the total number of secure updates failed of the DNS server.
dns.tcp.message.memory.usedDNS TCPMessageMemoryUsedNULLTCP Message Memory is the total TCP message memory used by DNS server.
dns.database.node.memory.usedDNS DatabaseNodeMemoryUsedNULLDatabase Node Memory is the total database node memory used by DNS server.
dns.dynamic.update.requests.rejectedDNS DynamicUpdateRequestsRejectedNULLDynamic Update Rejected is the total number of dynamic updates rejected by the DNS server.
dns.dynamic.updates.written.to.databaseDNS DynamicUpdateswrittentodatabaseNULLDynamic Update Written to Database is the total number of dynamic updates written to the database by the DNS server.
dns.secure.update.requests.receivedDNS SecureUpdateRequestsReceivedNULLSecure Update Received is the total number of secure update requests received by the DNS server.
dns.udp.responses.sent.per.secDNS UDPResponsesSentPerSecNULLUDP Response Sent/sec is the average number of UDP responses sent by the DNS server in each second.
dns.udp.message.memory.usedDNS UDPMessageMemoryUsedNULLUDP Message Memory is the total UDP message memory used by DNS server.
dns.record.flow.memory.usedDNS RecordFlowMemoryUsedNULLRecord Flow Memory is the total record flow memory used by DNS server.
dns.tcp.responses.sent.per.secDNS TCPResponsesSentPerSecNULLTCP Response Sent/sec is the average number of TCP responses sent by the DNS server in each second.
dns.zone.transfers.failedDNS ZoneTransfersFailedNULLZone Transfer Failure is the total number of failed zone transfers of the master DNS server.
dns.total.responses.sent.per.secDNS TotalResponsesSentPerSecNULLTotal Response Sent/sec is the average number of responses sent by the DNS server in each second.
dns.dynamic.update.timeoutsDNS DynamicUpdateTimeoutsNULLDynamic Update TimeOuts is the total number of dynamic update timeouts of the DNS server.

Agent G2 - DNS Recursive Performance Counters DotNet v4

Description

Monitors DNS Recursive Class WMI Performance data

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - DNS Recursive Performance Counters DotNet v4DNS_RecursiveTimeOutPerSecDNS_RecursiveTimeOutPerSecNULLProvides the average number of recursive query sending timeouts in each second.
DNS_RecursiveQueryFailurePerSecDNS_RecursiveQueryFailurePerSecNULLProvides the average number of recursive query failures in each second.
DNS_RecursiveQueriesPerSecDNS_RecursiveQueriesPerSecNULLProvides the average number of recursive queries received by DNS server in each second.

Agent G2 - Fujitsu PRIMERGY Health - Windows

Description

Monitor the following metrics for Fujitsu PRIMERGY Health on Windows servers: fujitsu_primergy_host_raidController_healthState, fujitsu_primergy_host_raidController_primaryStatus, fujitsu_primergy_diskDrive_healthState, fujitsu_primergy_diskDrive_primaryStatus, fujitsu_primergy_temperature_healthState, fujitsu_primergy_temperature_currentState, fujitsu_primergy_temperature_currentReading, fujitsu_primergy_voltage_healthState, fujitsu_primergy_voltage_currentState, fujitsu_primergy_voltage_currentReading, fujitsu_primergy_powerSupply_healthState, fujitsu_primergy_powerConsumptionSensor_healthState, fujitsu_primergy_powerConsumptionSensor_currentState, fujitsu_primergy_fan_healthState, fujitsu_primergy_fan_sensor_healthState, fujitsu_primergy_fan_sensor_currentState, fujitsu_primergy_fan_sensor_currentReading, fujitsu_primergy_managementController_healthState, fujitsu_primergy_processor_healthState, fujitsu_primergy_memory_healthState, fujitsu_primergy_cache_memory_healthState, fujitsu_primergy_physical_memory_healthState

Note: This template will be applicable only for the Agent version 14.0.0 or later.

Prerequisites

This template will be applicable only for the Agent version 14.0.0 or later.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Fujitsu PRIMERGY Health - Windowsfujitsu_primergy_host_raidController_healthStatefujitsu_primergy_host_raidController_healthStatenullMonitors Fujitsu PGY Raid Controller Current Health State. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_host_raidController_primaryStatusfujitsu_primergy_host_raidController_primaryStatusnullMonitors Fujitsu PGY Host Raid Controller PrimaryStatus, high-level status value intended to align with Red-Yellow-Green type representation of status. Supported values: 0: Unknown, 2: Warning - Warning; 3: Error - Critical; 1: OK - Ok
fujitsu_primergy_diskDrive_primaryStatusfujitsu_primergy_diskDrive_primaryStatusnullMonitors Fujitsu PGY DiskDrive Primary Status, high-level status value intended to align with Red-Yellow-Green type representation of status. Supported values: 0: Unknown, 2: Warning - Warning; 3: Error - Critical; 1: OK - Ok
fujitsu_primergy_temperature_healthStatefujitsu_primergy_temperature_healthStatenullMonitors Fujitsu PGY Temperature HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_temperature_currentStatefujitsu_primergy_temperature_currentStatenullCurrent state indicated by the Sensor. Lower Critical, Upper Critical, Critical - Critical; Unknown, Upper Non-Critical, Non-Critical - Warning; Normal - Ok
fujitsu_primergy_temperature_currentReadingfujitsu_primergy_temperature_currentReadingCCurrent value indicated by the Sensor.
fujitsu_primergy_voltage_healthStatefujitsu_primergy_voltage_healthStatenullMonitors Fujitsu PGY Voltage HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_voltage_currentStatefujitsu_primergy_voltage_currentStatenullCurrent state indicated by the Sensor. Lower Critical, Upper Critical, Critical - Critical; Unknown, Upper Non-Critical, Non-Critical - Warning; Normal - Ok
fujitsu_primergy_voltage_currentReadingfujitsu_primergy_voltage_currentReadingvCurrent value indicated by the Sensor
fujitsu_primergy_powerSupply_healthStatefujitsu_primergy_powerSupply_healthStatenullMonitors Fujitsu PGY PowerSupply HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_powerConsumptionSensor_healthStatefujitsu_primergy_powerConsumptionSensor_healthStatenullMonitors Fujitsu PGY Consumption Sensor HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_powerConsumptionSensor_currentStatefujitsu_primergy_powerConsumptionSensor_currentStatenullCurrent state indicated by the Sensor. Lower Critical, Upper Critical, Critical - Critical; Unknown, Upper Non-Critical, Non-Critical - Warning; Normal - Ok
fujitsu_primergy_fan_healthStatefujitsu_primergy_fan_healthStatenullMonitors Fujitsu PGY Fan HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_fan_sensor_healthStatefujitsu_primergy_fan_sensor_healthStatenullMonitors Fujitsu PGY Fan Sensor HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_fan_sensor_currentStatefujitsu_primergy_fan_sensor_currentStatenullCurrent state indicated by the Sensor. Lower Critical, Upper Critical, Critical - Critical; Unknown, Upper Non-Critical, Non-Critical - Warning; Normal - Ok
fujitsu_primergy_fan_sensor_currentReadingfujitsu_primergy_fan_sensor_currentReadingrpmCurrent value indicated by the Sensor
fujitsu_primergy_managementController_healthStatefujitsu_primergy_managementController_healthStatenullMonitors Fujitsu PGY Management Controller HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_processor_healthStatefujitsu_primergy_processor_healthStatenullMonitors Fujitsu PGY Processor HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_memory_healthStatefujitsu_primergy_memory_healthStatenullMonitors Fujitsu PGY Memory HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
fujitsu_primergy_physical_memory_healthStatefujitsu_primergy_physical_memory_healthStatenullMonitors Fujitsu PGY Physical Memory HealthState. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok
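
The healthState values listed in the table above can be read as a lookup from raw value to label and alert state. The following Python sketch is illustrative only: the names `HEALTH_STATES` and `classify_health_state` are hypothetical, and the grouping into Warning, Critical, and Ok assumes that the semicolons in the "Supported values" text separate the alert states.

```python
from typing import Tuple

# Raw healthState value -> (label, alert state), transcribed from the
# "Supported values" text. Assumption: semicolons separate the alert states,
# so 0 and 10 map to Warning, 15 through 30 to Critical, and 5 to Ok.
HEALTH_STATES = {
    0: ("Unknown", "Warning"),
    5: ("OK", "Ok"),
    10: ("Degraded / Warning", "Warning"),
    15: ("Predictive Failure", "Critical"),
    20: ("Major failure", "Critical"),
    25: ("Critical failure", "Critical"),
    30: ("Non-functional", "Critical"),
}


def classify_health_state(value: int) -> Tuple[str, str]:
    """Return (label, alert state) for a raw healthState value."""
    try:
        return HEALTH_STATES[value]
    except KeyError:
        # Values outside the documented set are rejected rather than guessed.
        raise ValueError(f"unsupported healthState value: {value}") from None
```

For example, `classify_health_state(20)` returns the label "Major failure" with the alert state "Critical".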

Agent G2 - Fujitsu PRIMERGY Health - Linux

Description

Template for Linux environment to monitor Fujitsu PRIMERGY Health Monitoring metrics like: Fujitsu PGY Host Raid Controller HealthState, Fujitsu PGY Host Raid Controller PrimaryStatus, Fujitsu PGY DiskDrive HealthState, Fujitsu PGY DiskDrive PrimaryStatus, Fujitsu PGY Temperature HealthState, Fujitsu PGY Temperature CurrentState, Fujitsu PGY Temperature CurrentReading, Fujitsu PGY Voltage HealthState, Fujitsu PGY Voltage CurrentState, Fujitsu PGY Voltage CurrentReading, Fujitsu PGY PowerSupply HealthState, Fujitsu PGY Power Consumption Sensor HealthState, Fujitsu PGY Power Consumption Sensor CurrentState, Fujitsu PGY Fan HealthState, Fujitsu PGY Fan Sensor HealthState, Fujitsu PGY Fan Sensor CurrentState, Fujitsu PGY Fan Sensor CurrentReading, Fujitsu PGY Management Controller HealthState, Fujitsu PGY Processor HealthState, Fujitsu PGY Memory HealthState, Fujitsu PGY Cache Memory HealthState, Fujitsu PGY Physical Memory HealthState.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Fujitsu PRIMERGY Health - Linuxfujitsu_pgy_management_controller_health_statePGY Management Controller Health StateNULLPGY Management Controller Health State
fujitsu_pgy_temperature_health_statePGY Temperature Health StateNULLPGY Temperature Health State
fujitsu_pgy_powerConsumption_sensor_health_statePGY Power Consumption Health StateNULLPGY Power Consumption Health State
fujitsu_pgy_voltage_health_statePGY Voltage Health StateNULLPGY Voltage Health State
fujitsu_pgy_diskdrive_health_statePGY Disk Drive Health StateNULLPGY Disk Drive Health State
fujitsu_pgy_temperature_current_statePGY Temperature Current StateNULLPGY Temperature Current State
fujitsu_pgy_fan_sensor_health_statePGY Fan Sensor Health StateNULLPGY Fan Sensor Health State
fujitsu_pgy_powersupply_health_statePGY Power Supply Health StateNULLPGY Power Supply Health State
fujitsu_pgy_cache_memory_health_statePGY Cache Memory Health StateNULLPGY Cache Memory Health State
fujitsu_pgy_diskdrive_primary_statusPGY Disk Drive Primary StatusNULLPGY Disk Drive Primary Status
fujitsu_pgy_voltage_current_valuePGY Voltage Current ValuevPGY Voltage Current Value
fujitsu_pgy_memory_health_statePGY Memory Health StateNULLPGY Memory Health State
fujitsu_pgy_fan_sensor_current_statePGY Fan Sensor Current StateNULLPGY Fan Sensor Current State
fujitsu_pgy_voltage_current_statePGY Voltage Current StateNULLPGY Voltage Current State
fujitsu_pgy_temperature_valuePGY Temperature ValueNULLPGY Temperature Value
fujitsu_pgy_fan_sensor_current_valuePGY Fan Sensor Current ValuerpmPGY Fan Sensor Current Value
fujitsu_pgy_powerConsumption_sensor_current_statePGY Power Consumption Current StateNULLPGY Power Consumption Current State
fujitsu_pgy_physical_memory_health_statePGY Physical Memory Health StateNULLPGY Physical Memory Health State
fujitsu_pgy_host_raidcontroller_health_statePGY Host Raid Controller Health StateNULLPGY Host Raid Controller Health State
fujitsu_pgy_host_raidcontroller_primary_statusPGY Host Raid Controller Primary StatusNULLPGY Host Raid Controller Primary Status
fujitsu_pgy_fan_health_statePGY Fan Health StateNULLPGY Fan Health State
fujitsu_pgy_processor_health_statePGY Processor Health StateNULLPGY Processor Health State

Agent G2 - Fujitsu PRIMERGY Health - Windows - DotNet v4

Description

Template for Windows environment (for .NET v4 or later) to monitor Fujitsu PRIMERGY Health Monitoring metrics like: Fujitsu PGY Host Raid Controller HealthState, Fujitsu PGY Host Raid Controller PrimaryStatus, Fujitsu PGY DiskDrive HealthState, Fujitsu PGY DiskDrive PrimaryStatus, Fujitsu PGY Temperature HealthState, Fujitsu PGY Temperature CurrentState, Fujitsu PGY Temperature CurrentReading, Fujitsu PGY Voltage HealthState, Fujitsu PGY Voltage CurrentState, Fujitsu PGY Voltage CurrentReading, Fujitsu PGY PowerSupply HealthState, Fujitsu PGY Power Consumption Sensor HealthState, Fujitsu PGY Power Consumption Sensor CurrentState, Fujitsu PGY Fan HealthState, Fujitsu PGY Fan Sensor HealthState, Fujitsu PGY Fan Sensor CurrentState, Fujitsu PGY Fan Sensor CurrentReading, Fujitsu PGY Management Controller HealthState, Fujitsu PGY Processor HealthState, Fujitsu PGY Memory HealthState, Fujitsu PGY Cache Memory HealthState, Fujitsu PGY Physical Memory HealthState.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Fujitsu PRIMERGY Health - Windows - DotNet v4fujitsu.pgy.temperature.currentreadingFujitsu PGY Temperature CurrentReadingNULLCurrent value indicated by the Sensor
fujitsu.pgy.voltage.currentreadingFujitsu PGY Voltage CurrentReadingNULLCurrent value indicated by the Sensor
fujitsu.pgy.fan.sensor.currentreadingFujitsu PGY Fan Sensor CurrentReadingNULLCurrent value indicated by the Sensor

Agent G2 - HyperV processorratio

Description

HyperV processorratio monitor Template

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - HyperV processorratioHyperv_VirtualtoLogicalProcessorRatioHyperv_VirtualtoLogicalProcessorRatioNULLShows the ratio of Hyper-V virtual processors to logical processors.
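
As a rough illustration of the ratio this metric reports, the Python sketch below divides a virtual processor count by a logical processor count. The function name and the way the two counts are obtained are assumptions; the template's actual collection logic is not described here.

```python
def virtual_to_logical_ratio(virtual_processors: int, logical_processors: int) -> float:
    """Ratio of virtual processors to logical processors (hypothetical helper)."""
    if logical_processors <= 0:
        # A host always has at least one logical processor; guard against bad input.
        raise ValueError("logical_processors must be positive")
    return virtual_processors / logical_processors
```

A host with 16 virtual processors assigned across its guests and 8 logical processors would yield a ratio of 2.0.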

Agent G2 - K8s ApiServer Requests Advanced Metrics

Description

Monitors Apiserver Request Total and Apiserver Request Duration Seconds Bucket (verb = get, put, post) Metrics.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - K8s ApiServer Requests Advanced Metricsapiserver.post.requests.duration.secondsKube apiserver Post Requests Duration SecondsSecondsTime taken for POST API responses.
apiserver.get.requests.duration.secondsKube apiserver Get Requests Duration SecondsSecondsTime taken for GET API responses.
apiserver.requests.total.success.rateKube apiserver Requests Success RatePercentagePercentage of successful API requests.
apiserver.put.requests.duration.secondsKube apiserver Put Requests Duration SecondsSecondsTime taken for PUT API responses.
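
A success rate of this kind can be sketched as the share of requests with a successful response code. The Python example below is a minimal sketch under stated assumptions: the function name is hypothetical, the input is a plain mapping of HTTP status code to request count, and "success" is assumed to mean a 2xx code. The template may define success differently.

```python
from typing import Mapping


def success_rate(requests_by_code: Mapping[str, float]) -> float:
    """Percentage of requests whose HTTP status code is 2xx (assumed definition)."""
    total = sum(requests_by_code.values())
    if total == 0:
        # Assumed convention: report 0.0 when no requests were observed.
        return 0.0
    succeeded = sum(
        count for code, count in requests_by_code.items() if code.startswith("2")
    )
    return 100.0 * succeeded / total
```

With 90 requests returning 200, 5 returning 201, and 5 returning 500, the function returns 95.0.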

Agent G2 - Kubernetes Pods Monitor

Description

Monitors the pods of Kubernetes workload kinds such as Deployment, DaemonSet, ReplicaSet, and StatefulSet.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - k8spodsmonitorkubernetes_statefulset_pods_running_percentageStatefulSet Pods Running PercentagePercentagePercentage of the desired StatefulSet pods that are running.
kubernetes_daemonset_pods_running_percentageDaemonset Pods Running PercentagePercentagePercentage of the desired DaemonSet pods that are running.
kubernetes_deployment_pods_running_percentageDeployment Pods Running PercentagePercentagePercentage of the desired Deployment pods that are running.
kubernetes_daemonset_pods_runningDaemonset Pods RunningCountThe number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready.
kubernetes_daemonset_pods_desiredDaemonset Pods DesiredCountThe number of nodes that should be running the daemon pod.
kubernetes_statefulset_pods_desiredStatefulSet Pods DesiredCountNumber of desired pods for a StatefulSet.
kubernetes_deployment_pods_runningDeployment Pods RunningCountThe number of ready replicas per deployment.
kubernetes_replicaset_pods_desiredReplicaset Pods DesiredCountNumber of desired pods for a ReplicaSet.
kubernetes_replicaset_pods_runningReplicaset Pods RunningCountThe number of ready replicas per ReplicaSet.
kubernetes_deployment_pods_desiredDeployment Pods DesiredCountNumber of desired pods for a deployment.
kubernetes_replicaset_pods_running_percentageReplicaset Pods Running PercentagePercentagePercentage of the desired ReplicaSet pods that are running.
kubernetes_statefulset_pods_runningStatefulSet Pods RunningCountThe number of ready replicas per StatefulSet.
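
The percentage metrics in the table above relate the running and desired counts of the same workload kind. The Python sketch below shows that relationship; the function name is hypothetical, and returning 0.0 when no pods are desired is an assumed convention, not documented behavior.

```python
def pods_running_percentage(running: int, desired: int) -> float:
    """Running pods as a percentage of desired pods for one workload."""
    if running < 0 or desired < 0:
        raise ValueError("pod counts must be non-negative")
    if desired == 0:
        # Assumed convention: avoid division by zero when nothing is desired.
        return 0.0
    return 100.0 * running / desired
```

For a Deployment with 3 ready replicas out of 4 desired, the result is 75.0.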

Agent G2 - Linux - ActiveMQ Monitors

Description

Monitors ActiveMQ application metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - ActiveMQ Monitorsactivemq.broker.memory.prctActiveMQ-BrokerMemoryPercentUsageNULLThe percent of memory limit used.
activemq.broker.temp.prctActiveMQ-BrokerTempPercentUsageNULLThe space used by the store for temporary messages.
activemq.broker.store.prctActiveMQ-BrokerStorePercentUsageNULLThe space used by the Message Store.
activemq.jvm.gc.collection_countActiveMQ-JVM.GC.collection_countNULLNumber of garbage objects collected
activemq.queue.consumer_countActiveMQ-ConsumerCountNULLNumber of consumers subscribed to this destination
activemq.queue.inflight_countActiveMQ-InFlightCountNULLNumber of messages sent to a destination that have not yet received an acknowledgement.
activemq.queue.producer_countActiveMQ-ProducerCountNULLNumber of producers
activemq.queue.enqueue_countActiveMQ-EnqueueCountNULLNumber of messages that have been sent to the destination since the last restart.
activemq.queue.dequeue_countActiveMQ-DequeueCountNULLNumber of messages that have been acknowledged (and removed) from the destination since last restart.
activemq.queue.sizeActiveMQ-QueueSizeNULLThe number of messages that currently reside in the queue. Potentially dispatched but unacknowledged.
activemq.jvm.threadsActiveMQ-JVM.ThreadsNULLNumber of threads.
activemq.jvm.uptimeActiveMQ-UptimeNULLUptime of the server
activemq.jvm.mem.non_heap_committedActiveMQ-JVM.Mem.non_heap_committedNULLNon-heap memory committed (in MB) for the server
activemq.jvm.open.fdsActiveMQ-JVM.OpenFDsNULLNumber of Open file descriptors of the server
activemq.jvm.mem.heap_usedActiveMQ-JVM.Mem.heap_usedNULLHeap memory usage (in MB) of the server
activemq.queue.avg_enqueuetimeActiveMQ-AverageEnqueueTimeNULLThe average amount of time (ms) that messages remained enqueued, that is, the average time consumers take to successfully process messages.
activemq.queue.memory.prctActiveMQ-QueueMemoryPercentUsageNULLThe percentage of the memory limit used by queues.
activemq.jvm.mem.non_heap_usedActiveMQ-JVM.Mem.non_heap_usedNULLNon-heap memory usage (in MB) of the server
activemq.queue.dispatch_countActiveMQ-DispatchCountNULLNumber of messages that have been dispatched (Dequeue + Inflight).
activemq.queue.expired_countActiveMQ-ExpiredCountNULLNumber of messages that were expired.
activemq.jvm.gc.collection_timeActiveMQ-JVM.GC.collection_timeNULLTime taken for collection of the garbage objects.
activemq.queue.max_enqueuetimeActiveMQ-MaxEnqueueTimeNULLThe maximum amount of time that messages remained enqueued.
activemq.jvm.mem.heap_committedActiveMQ-JVM.Mem.heap_committedNULLHeap memory committed (in MB) for the server
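
The dispatch count in the table above is described as the sum of the dequeue and in-flight counts. The Python sketch below expresses that relationship for a single queue; the `QueueCounters` class is hypothetical and is not part of ActiveMQ or the agent.

```python
from dataclasses import dataclass


@dataclass
class QueueCounters:
    """Hypothetical snapshot of two per-queue counters."""

    dequeue_count: int   # messages acknowledged and removed since last restart
    inflight_count: int  # messages sent but not yet acknowledged

    @property
    def dispatch_count(self) -> int:
        # Dispatched = Dequeue + Inflight, as stated in the metric description.
        return self.dequeue_count + self.inflight_count
```

A queue with 120 dequeued and 7 in-flight messages therefore has a dispatch count of 127.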

Agent G2 - Linux - ActiveMQ Performance Check

Description

Monitors ActiveMQ application metrics using MBeans exposed via the JMX console. Refer to the OpsRamp documentation for how to enable JMX on your application.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - ActiveMQ Performance Checkactivemq.broker.memory.prctActiveMQ-BrokerMemoryPercentUsageNULLThe percent of memory limit used.
activemq.broker.temp.prctActiveMQ-BrokerTempPercentUsageNULLThe space used by the store for temporary messages.
activemq.broker.store.prctActiveMQ-BrokerStorePercentUsageNULLThe space used by the Message Store.
activemq.jvm.gc.collection_countActiveMQ-JVM.GC.collection_countNULLNumber of garbage objects collected
activemq.queue.consumer_countActiveMQ-ConsumerCountNULLNumber of consumers subscribed to this destination
activemq.queue.inflight_countActiveMQ-InFlightCountNULLNumber of messages sent to a destination that have not yet received an acknowledgement.
activemq.queue.producer_countActiveMQ-ProducerCountNULLNumber of producers
activemq.queue.enqueue_countActiveMQ-EnqueueCountNULLNumber of messages that have been sent to the destination since the last restart.
activemq.queue.dequeue_countActiveMQ-DequeueCountNULLNumber of messages that have been acknowledged (and removed) from the destination since last restart.
activemq.queue.sizeActiveMQ-QueueSizeNULLThe number of messages that currently reside in the queue. Potentially dispatched but unacknowledged.
activemq.jvm.threadsActiveMQ-JVM.ThreadsNULLNumber of threads.
activemq.jvm.uptimeActiveMQ-UptimeNULLUptime of the server
activemq.jvm.mem.non_heap_committedActiveMQ-JVM.Mem.non_heap_committedNULLNon-heap memory committed (in MB) for the server
activemq.jvm.open.fdsActiveMQ-JVM.OpenFDsNULLNumber of Open file descriptors of the server
activemq.jvm.mem.heap_usedActiveMQ-JVM.Mem.heap_usedNULLHeap memory usage (in MB) of the server
activemq.queue.avg_enqueuetimeActiveMQ-AverageEnqueueTimeNULLThe average amount of time (ms) that messages remained enqueued, that is, the average time consumers take to successfully process messages.
activemq.queue.memory.prctActiveMQ-QueueMemoryPercentUsageNULLThe percentage of the memory limit used by queues.
activemq.jvm.mem.non_heap_usedActiveMQ-JVM.Mem.non_heap_usedNULLNon-heap memory usage (in MB) of the server
activemq.queue.dispatch_countActiveMQ-DispatchCountNULLNumber of messages that have been dispatched (Dequeue + Inflight).
activemq.queue.expired_countActiveMQ-ExpiredCountNULLNumber of messages that were expired.
activemq.jvm.gc.collection_timeActiveMQ-JVM.GC.collection_timeNULLTime taken for collection of the garbage objects.
activemq.queue.max_enqueuetimeActiveMQ-MaxEnqueueTimeNULLThe maximum amount of time that messages remained enqueued.
activemq.jvm.mem.heap_committedActiveMQ-JVM.Mem.heap_committedNULLHeap memory committed (in MB) for the server

Agent G2 - Linux - Apache CouchDB Monitors

Description

Monitors CouchDB application metrics using CouchDB's API, which is normally accessible at http://127.0.0.1:5984/.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Apache CouchDB Monitorscouchdb.httpd_status_codes.412CouchDB-Httpd412StatusNULLNumber of HTTP 412 Precondition Failed responses
couchdb.httpd_status_codes.404CouchDB-Httpd404StatusNULLNumber of HTTP 404 Not Found responses
couchdb.httpd.temporary_view_readsCouchDB-HttpTemporaryViewReadsNULLNumber of temporary view reads
couchdb.request_timeCouchDB-RequestTimemsLength of a request inside CouchDB without MochiWeb
couchdb.httpd_status_codes.202CouchDB-Httpd202StatusNULLNumber of HTTP 202 Accepted responses
couchdb.disk_sizeCouchDB-DiskSizeNULLTotal disk size for all databases
couchdb.database_readsCouchDB-DatabaseReadsNULLNumber of times a document was read from a database
couchdb.databases_openCouchDB-OpenDatabasesNULLNumber of open databases
couchdb.httpd_request_methods_deleteCouchDB-DELETERequestsNULLNumber of HTTP DELETE requests
couchdb.httpd_status_codes.403CouchDB-Httpd403StatusNULLNumber of HTTP 403 Forbidden responses
couchdb.httpd_status_codes.201CouchDB-Httpd201StatusNULLNumber of HTTP 201 Created responses
couchdb.httpd.clients_requesting_changesCouchDB-HttpClientsRequestingChangesNULLNumber of clients for continuous _changes
couchdb.database_writesCouchDB-DatabaseWritesNULLNumber of times a database was changed
couchdb.open_fdsCouchDB-OpenFDsNULLNumber of file descriptors CouchDB has open
couchdb.httpd_status_codes.405CouchDB-Httpd405StatusNULLNumber of HTTP 405 Method Not Allowed responses
couchdb.httpd_request_methods_headCouchDB-HEADRequestsNULLNumber of HTTP HEAD requests
couchdb.httpd.requestsCouchDB-HttpRequestsNULLNumber of HTTP requests
couchdb.httpd_status_codes.301CouchDB-Httpd301StatusNULLNumber of HTTP 301 Moved Permanently responses
couchdb.httpd_request_method_postCouchDB-POSTRequestsNULLNumber of HTTP POST requests
couchdb.cache_hitsCouchDB-CacheHitsNULLNumber of authentication cache hits
couchdb.httpd_request_methods_getCouchDB-GETRequestsNULLNumber of HTTP GET requests
couchdb.httpd_status_codes.400CouchDB-Httpd400StatusNULLNumber of HTTP 400 Bad Request responses
couchdb.httpd.bulk_requestsCouchDB-HttpBulkRequestsNULLNumber of bulk requests
couchdb.doc_countCouchDB-DocCountNULLNumber of documents in all databases
couchdb.cache_missesCouchDB-CacheMissesNULLNumber of authentication cache misses
couchdb.httpd_status_codes.304CouchDB-Httpd304StatusNULLNumber of HTTP 304 Not Modified responses
couchdb.httpd.view_readsCouchDB-HttpViewReadsNULLNumber of view reads
couchdb.httpd_status_codes.401CouchDB-Httpd401StatusNULLNumber of HTTP 401 Unauthorized responses
couchdb.httpd_status_codes.200CouchDB-Httpd200StatusNULLNumber of HTTP 200 OK responses
couchdb.httpd_status_codes.409CouchDB-Httpd409StatusNULLNumber of HTTP 409 Conflict responses
couchdb.httpd_request_methods_putCouchDB-PUTRequestsNULLNumber of HTTP PUT requests
couchdb.httpd_request_methods_copyCouchDB-COPYRequestsNULLNumber of HTTP COPY requests
couchdb.httpd_status_codes.500CouchDB-Httpd500StatusNULLNumber of HTTP 500 Internal Server Error responses
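
Many of the metric names in the table above look like dotted paths into a nested statistics document. The Python sketch below shows one way such names could be derived from a JSON response. It rests on several assumptions: the `_stats` path (the statistics endpoint differs between CouchDB versions), the `current` value key, and the helper names `fetch_stats` and `flatten_stats` are all illustrative and do not describe how the agent collects data.

```python
import json
from typing import Any, Dict
from urllib.request import urlopen

BASE_URL = "http://127.0.0.1:5984/"  # default address from the template description
STATS_PATH = "_stats"                # assumption: varies by CouchDB version


def fetch_stats(base_url: str = BASE_URL, path: str = STATS_PATH,
                timeout: float = 5.0) -> Dict[str, Any]:
    """Fetch the statistics document as parsed JSON."""
    url = base_url.rstrip("/") + "/" + path
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def flatten_stats(stats: Dict[str, Dict[str, Dict[str, Any]]],
                  value_key: str = "current") -> Dict[str, Any]:
    """Turn {group: {name: {value_key: v}}} into {"couchdb.group.name": v}."""
    flat: Dict[str, Any] = {}
    for group, entries in stats.items():
        for name, entry in entries.items():
            value = entry.get(value_key)
            if value is not None:  # skip statistics that have no value yet
                flat[f"couchdb.{group}.{name}"] = value
    return flat
```

Calling `flatten_stats(fetch_stats())` against a reachable server would then produce a flat mapping keyed by names in the style of `couchdb.httpd_status_codes.404`, provided the response has the assumed shape.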

Agent G2 - Linux - Apache Httpd Monitors

Description

Monitoring Template for Apache application. Monitors Apache busy workers, bytes per request, bytes per second, etc.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Apache Httpd Monitorsapache.performance.idle_workersApache-IdleWorkersNULLProvides the number of idle workers
apache.net.request_per_secApache-RequestsPerSecNULLProvides the number of requests made per second
apache.net.bytes_per_secApache-BytesPerSecNULLProvides the number of bytes transferred per second
apache.net.request_per_requestApache-BytesPerRequestNULLProvides the number of bytes transferred per request
apache.performance.cpu_loadApache-CPULoadNULLProvides the CPU Load of the apache service
apache.performance.scoreboardApache-ScoreBoardNULLProvides the scoreboard metrics
apache.performance.uptimeApache-UptimeNULLChecks the uptime of the Apache service
apache.net.total_kbytesApache-TotalkBytesNULLProvides the number of total kbytes
apache.performance.total_accessesApache-TotalAccessesNULLProvides the total number of accesses made.
apache.performance.open_slotsApache-OpenSlotsNULLProvides the number of open slots
apache.performance.busy_workersApache-BusyWorkersNULLProvides the number of busy workers

Agent G2 - Linux - Apache Pulsar Bookkeeper Monitoring

Description

Monitors Apache Pulsar Bookkeeper metrics for components such as Server, Journal, and Storage.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Pulsar Bookkeeper Monitorpulsar_bookie_server_statusPulsar Bookie Server StatusNULLThe server status of the bookie server. 1: the bookie is running in writable mode. 0: the bookie is running in read-only mode.
pulsar_bookie_ledgers_countPulsar Bookie Ledgers CountNULLThe total number of ledgers stored in the bookie.
pulsar_bookie_write_cache_sizePulsar Bookie Write Cache SizebytesThe bookie write cache size.
pulsar_bookie_journal_journal_sync_countPulsar Bookie Journal Journal Sync CountNULLThe total number of journal fsync operations happening at the bookie. The success label is used to distinguish successes and failures.
pulsar_bookie_journal_journal_cb_queue_sizePulsar Bookie Journal Journal Cb Queue SizeNULLThe total number of callbacks pending in the callback queue.
pulsar_bookie_entries_countPulsar Bookie Entries CountNULLThe total number of entries stored in the bookie.
pulsar_bookie_journal_journal_force_write_queue_sizePulsar Bookie Journal Journal Force Write Queue SizeNULLThe total number of force write (fsync) requests pending in the force-write queue.
pulsar_bookie_flushPulsar Bookie FlushNULLThe table flush latency of bookie memory.
pulsar_bookie_read_bytes_countPulsar Bookie Read Bytes CountbytesThe total number of bytes read from the bookie.
pulsar_bookie_write_bytes_countPulsar Bookie Write Bytes CountbytesThe total number of bytes written to the bookie.
pulsar_bookie_throttled_write_requests_countPulsar Bookie Throttled Write Requests CountNULLThe number of write requests to be throttled.
pulsar_bookie_ledger_writable_dirsPulsar Bookie Ledger Writable DirsNULLThe number of writable directories in the bookie.
pulsar_bookie_journal_journal_queue_sizePulsar Bookie Journal Journal Queue SizeNULLThe total number of requests pending in the journal queue.
pulsar_bookie_read_cache_sizePulsar Bookie Read Cache SizebytesThe bookie read cache size.
pulsar_bookkeeper_server_add_entry_countPulsar Bookkeeper Server Add Entry CountNULLThe total number of ADD_ENTRY requests received at the bookie. The success label is used to distinguish successes and failures.
pulsar_bookkeeper_server_read_entry_countPulsar Bookkeeper Server Read Entry CountNULLThe total number of READ_ENTRY requests received at the bookie. The success label is used to distinguish successes and failures.
pulsar_bookie_deleted_ledger_countPulsar Bookie Deleted Ledger CountNULLThe total number of ledgers deleted since the bookie has started.

Agent G2 - Linux - Apache Pulsar Broker Monitoring

Description

Monitors Apache Pulsar Broker metrics for components like Namespace, Topic, Lookup, Connection and Jetty.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Pulsar Broker Monitorpulsar_ml_cache_pool_usedPulsar Ml Cache Pool UsedNULLThe total used memory of chunk lists in direct arena.
pulsar_broker_lookup_sumPulsar Broker Lookup SumNULLTotal latency of all lookup operations.
pulsar_connection_created_total_countPulsar Connection Created Total CountNULLThe total number of connections.
pulsar_broker_lookup_failures_countPulsar Broker Lookup Failures CountNULLThe number of lookup failures.
pulsar_ml_cache_pool_allocatedPulsar Ml Cache Pool AllocatedNULLThe total allocated memory of chunk lists in direct arena.
pulsar_ml_cache_pool_active_allocations_normalPulsar Ml Cache Pool Active Allocations NormalNULLThe number of currently active normal allocations in direct arena.
pulsar_broker_lookup_answers_countPulsar Broker Lookup Answers CountNULLThe number of lookup responses (i.e. not redirected requests).
pulsar_storage_write_ratePulsar Storage Write RateNULLThe total message batches (entries) written to the storage for this namespace (message batches / second).
pulsar_jetty_async_requests_waiting_maxPulsar Jetty Async Requests Waiting MaxNULLMaximum number of waiting async requests.
pulsar_jetty_dispatched_activePulsar Jetty Dispatched ActiveNULLNumber of dispatches currently active.
pulsar_jetty_requests_activePulsar Jetty Requests ActiveNULLNumber of requests currently active.
pulsar_connection_create_fail_countPulsar Connection Create Fail CountNULLThe number of failed connections.
pulsar_jetty_dispatched_active_maxPulsar Jetty Dispatched Active MaxNULLMaximum number of active dispatches being handled.
pulsar_ml_cache_pool_active_allocationsPulsar Ml Cache Pool Active AllocationsNULLThe number of currently active allocations in direct arena.
pulsar_throughput_outPulsar Throughput OutNULLThe total throughput of the namespace going out from this broker (bytes/second).
pulsar_broker_throttled_connectionsPulsar Broker Throttled ConnectionsNULLThe number of throttled connections.
pulsar_ml_countPulsar Ml CountNULLThe number of currently opened managed ledgers.
pulsar_ml_cache_misses_throughputPulsar Ml Cache Misses ThroughputNULLThe amount of data not retrieved from the cache on the broker side (in bytes/s).
pulsar_ml_cache_pool_active_allocations_hugePulsar Ml Cache Pool Active Allocations HugeNULLThe number of currently active huge allocations in direct arena.
pulsar_rate_inPulsar Rate InNULLThe total message rate of the namespace coming into this broker (messages/second).
pulsar_throughput_inPulsar Throughput InNULLThe total throughput of the namespace coming into this broker (bytes/second).
pulsar_active_connectionsPulsar Active ConnectionsNULLThe number of active connections.
pulsar_storage_logical_sizePulsar Storage Logical SizebytesThe storage size of topics in the namespace owned by the broker without replicas.
pulsar_ml_cache_pool_active_allocations_smallPulsar Ml Cache Pool Active Allocations SmallNULLThe number of currently active small allocations in direct arena.
pulsar_broker_lookup_pending_requestsPulsar Broker Lookup Pending RequestsNULLThe number of pending lookups in broker. When it is up to the threshold, new requests are rejected.
pulsar_jetty_requests_active_maxPulsar Jetty Requests Active MaxNULLMaximum number of requests that have been active at once.
pulsar_subscriptions_countPulsar Subscriptions CountNULLThe number of Pulsar subscriptions of the namespace served by this broker.
pulsar_producers_countPulsar Producers CountNULLThe number of active producers of the namespace connected to this broker.
pulsar_jetty_stats_secondsPulsar Jetty Stats SecondssecondsTime in seconds stats have been collected for.
pulsar_ml_cache_hits_throughputPulsar Ml Cache Hits ThroughputNULLThe amount of data retrieved from the cache on the broker side (in bytes/s).
pulsar_ml_cache_hits_ratePulsar Ml Cache Hits RateNULLThe number of cache hits per second on the broker side.
pulsar_ml_cache_used_sizePulsar Ml Cache Used SizeNULLThe size in bytes used to store entry payloads.
pulsar_broker_throttled_connections_global_limitPulsar Broker Throttled Connections Global LimitNULLThe number of throttled connections because of per-connection limit.
pulsar_consumers_countPulsar Consumers CountNULLThe number of active consumers of the namespace connected to this broker.
pulsar_ml_cache_misses_ratePulsar Ml Cache Misses RateNULLThe number of cache misses per second on the broker side.
pulsar_broker_lookup_countPulsar Broker Lookup CountNULLNumber of samples of the latency of all lookup operations.
pulsar_broker_lookup_redirects_countPulsar Broker Lookup Redirects CountNULLThe number of lookup redirected requests.
pulsar_connection_create_success_countPulsar Connection Create Success CountNULLThe number of successfully created connections.
pulsar_jetty_async_requests_waitingPulsar Jetty Async Requests WaitingNULLCurrently waiting async requests.
pulsar_storage_read_ratePulsar Storage Read RateNULLThe total message batches (entries) read from the storage for this namespace (message batches / second).
pulsar_jetty_request_time_max_secondsPulsar Jetty Request Time Max SecondssecondsMaximum time spent handling requests.
pulsar_topics_countPulsar Topics CountNULLThe number of Pulsar topics of the namespace owned by this broker.
pulsar_connection_closed_total_countPulsar Connection Closed Total CountNULLThe total number of closed connections.
pulsar_broker_topic_load_pending_requestsPulsar Broker Topic Load Pending RequestsNULLThe load of pending topic operations.
pulsar_rate_outPulsar Rate OutNULLThe total message rate of the namespace going out from this broker (messages/second).
pulsar_jetty_dispatched_time_maxPulsar Jetty Dispatched Time MaxsecondsMaximum time spent in dispatch handling.
pulsar_storage_sizePulsar Storage SizebytesThe total storage size of the topics in this namespace owned by this broker.
pulsar_ml_cache_evictionsPulsar Ml Cache EvictionsNULLThe number of cache evictions during the last minute.
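
The lookup and cache counters in this table are often combined into derived values. As a minimal sketch (the function names and sample numbers are illustrative assumptions, not part of the monitor), a mean lookup latency can be taken as pulsar_broker_lookup_sum divided by pulsar_broker_lookup_count, and a cache hit ratio as hits over hits plus misses:

```python
def mean_lookup_latency(lookup_sum, lookup_count):
    """Average latency per lookup; None when no lookups were sampled."""
    if lookup_count <= 0:
        return None
    return lookup_sum / lookup_count


def cache_hit_ratio(hits_rate, misses_rate):
    """Fraction of cache reads served from the cache; None when idle."""
    total = hits_rate + misses_rate
    if total <= 0:
        return None
    return hits_rate / total


# Hypothetical sample values keyed by the metric names in the table.
sample = {
    "pulsar_broker_lookup_sum": 250.0,
    "pulsar_broker_lookup_count": 100,
    "pulsar_ml_cache_hits_rate": 90.0,
    "pulsar_ml_cache_misses_rate": 10.0,
}

print(mean_lookup_latency(sample["pulsar_broker_lookup_sum"],
                          sample["pulsar_broker_lookup_count"]))  # 2.5
print(cache_hit_ratio(sample["pulsar_ml_cache_hits_rate"],
                      sample["pulsar_ml_cache_misses_rate"]))     # 0.9
```

Returning None rather than 0 for an idle broker keeps "no traffic" distinguishable from "every read missed".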

Agent G2 - Linux - Apache Spark Monitors

Description

Monitors Spark metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Apache Spark Monitorsspark.coresSpark CoresNULLThe number of CPUs available for all workers
spark.memorySpark MemoryNULLCalculates the total memory available on spark master
spark.workersSpark WorkersNULLThe number of workers connected to the master
spark.applications.activeSpark Applications ActiveNULLThe number of applications waiting or running
spark.memory.usedSpark Memory UsedNULLCalculates the memory used by the applications on spark master
spark.drivers.activeSpark Drivers ActiveNULLThe number of drivers available
spark.applications.completedSpark Applications CompletedNULLThe number of applications completed
spark.cores.usedSpark Cores UsedNULLThe number of CPUs used for all applications

Agent G2 - Linux - Apache Spark Performance Check

Description

Monitors Spark metrics using Spark REST API

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Apache Spark Performance Checkspark.coresSpark CoresNULLThe number of CPUs available for all workers
spark.memory.usedSpark Memory UsedNULLCalculates the memory used by the applications on spark master
spark.cores.usedSpark Cores UsedNULLThe number of CPUs used for all applications
spark.memorySpark MemoryNULLCalculates the total memory available on spark master
spark.drivers.activeSpark Drivers ActiveNULLThe number of drivers available
spark.applications.completedSpark Applications CompletedNULLThe number of applications completed
spark.applications.activeSpark Applications ActiveNULLThe number of applications waiting or running
spark.workersSpark WorkersNULLThe number of workers connected to the master
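
How this monitor queries the Spark REST API is not documented here. The sketch below only illustrates how the eight metrics could be derived from a JSON status document, assuming a payload shaped like the one the Spark standalone master serves on its /json/ page (keys such as cores, coresused, memory, memoryused, workers, activeapps, completedapps, activedrivers). The function name and sample payload are hypothetical.

```python
def spark_master_metrics(state):
    """Map an assumed master status payload (a dict) to the metric names above."""
    return {
        "spark.cores": state.get("cores", 0),
        "spark.cores.used": state.get("coresused", 0),
        "spark.memory": state.get("memory", 0),
        "spark.memory.used": state.get("memoryused", 0),
        # List-valued fields are reduced to counts.
        "spark.workers": len(state.get("workers", [])),
        "spark.applications.active": len(state.get("activeapps", [])),
        "spark.applications.completed": len(state.get("completedapps", [])),
        "spark.drivers.active": len(state.get("activedrivers", [])),
    }


# Hypothetical payload; a real one would be fetched over HTTP and parsed as JSON.
sample_state = {
    "cores": 16,
    "coresused": 4,
    "memory": 32768,
    "memoryused": 8192,
    "workers": [{"id": "worker-1"}, {"id": "worker-2"}],
    "activeapps": [{"id": "app-1"}],
    "completedapps": [],
    "activedrivers": [],
}

metrics = spark_master_metrics(sample_state)
print(metrics["spark.workers"], metrics["spark.cores.used"])  # 2 4
```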

Agent G2 - Linux - Apache Status Check - Agent

Description

Checks the server-status page and monitors metrics from that page.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Apache Status Check - Agentapache.net.request_per_requestApache-BytesPerRequestNULLProvides the number of bytes transferred per request
apache.performance.idle_workersApache-IdleWorkersNULLProvides the number of idle workers
apache.performance.cpu_loadApache-CPULoadNULLProvides the CPU Load of the apache service
apache.performance.uptimeApache-UptimeNULLChecks the uptime of the Apache service
apache.net.bytes_per_secApache-BytesPerSecNULLProvides the number of bytes transferred per second
apache.performance.open_slotsApache-OpenSlotsNULLProvides the number of open slots
apache.net.total_kbytesApache-TotalkBytesNULLProvides the number of total kbytes
apache.performance.total_accessesApache-TotalAccessesNULLProvides the total number of accesses made
apache.net.request_per_secApache-RequestsPerSecNULLProvides the number of requests made per second
apache.performance.busy_workersApache-BusyWorkersNULLProvides the number of busy workers
apache.performance.scoreboardApache-ScoreBoardNULLProvides the scoreboard metrics
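
To illustrate where these values come from, here is a minimal sketch that parses the machine-readable form of the mod_status page (the output of server-status?auto, which is a list of "Key: value" lines). The sample text and function names are assumptions for illustration; the monitor's own parsing logic is not described in this document.

```python
def parse_server_status(text):
    """Turn 'Key: value' lines from server-status?auto into a dict of strings."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:  # skip lines without a colon
            fields[key.strip()] = value.strip()
    return fields


def scoreboard_counts(scoreboard):
    """Count slot states: '.' is an open slot, '_' is an idle worker."""
    open_slots = scoreboard.count(".")
    idle = scoreboard.count("_")
    return {
        "open_slots": open_slots,
        "idle_workers": idle,
        "busy_workers": len(scoreboard) - open_slots - idle,
    }


# Hypothetical sample of server-status?auto output.
SAMPLE = """Total Accesses: 120
Total kBytes: 340
CPULoad: .0125
Uptime: 3600
ReqPerSec: .0333
BytesPerSec: 96.7
BytesPerReq: 2901.3
BusyWorkers: 2
IdleWorkers: 3
Scoreboard: _W_K_....."""

status = parse_server_status(SAMPLE)
print(status["Total Accesses"])                  # 120
print(scoreboard_counts(status["Scoreboard"]))
```

In this sample the scoreboard counts agree with the BusyWorkers and IdleWorkers fields, which is a quick sanity check on the parse.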

Agent G2 - Linux - Apache Tomcat Monitors

Description

Monitors Tomcat application metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Apache Tomcat Monitorstomcat.req.processor.error_countTomcat-ReqProcessorErrorCountNULLErrors per second on all the request processors running on the Apache Tomcat
tomcat.active_sessions.countTomcat-ActiveSessionsNULLNumber of active sessions to the server
tomcat.jvm.mem.non_heap_committedTomcat-JVM.Mem.non_heap_committedNULLNon-heap memory committed (in MB) for the server
tomcat.req.processor.processing_timeTomcat-ReqProcessorProcessingTimeNULLThe amount of processing time taken per second
tomcat.servlet.error_countTomcat-ServletErrorCountNULLNumber of erroneous requests received by the servlet per second
tomcat.jvm.uptimeTomcat-UptimeNULLUptime of the server
tomcat.jvm.mem.heap_usedTomcat-JVM.Mem.heap_usedNULLHeap memory usage (in MB) of the server
tomcat.cache.hits_countTomcat-CacheHitsCountNULLNumber of times the cache was hit per second
tomcat.cache.access_countTomcat-CacheAccessCountNULLNumber of times the cache was accessed per second
tomcat.servlet.request_countTomcat-ServletRequestCountNULLNumber of requests served by the servlet per second
tomcat.jvm.gc.collection_timeTomcat-JVM.GC.collection_timeNULLTime taken for collection of the garbage objects
tomcat.servlet.processing_timeTomcat-ServletProcessingTimeNULLThe amount of processing time taken per second
tomcat.jvm.mem.heap_committedTomcat-JVM.Mem.heap_committedNULLHeap memory committed (in MB) for the server
tomcat.threads.busyTomcat-ThreadsBusyNULLNumber of busy threads
tomcat.jvm.mem.non_heap_usedTomcat-JVM.Mem.non_heap_usedNULLNon-heap memory usage (in MB) of the server
tomcat.req.processor.request_countTomcat-ReqProcessorRequestCountNULLRequests per second on all the request processors running on the Apache Tomcat
tomcat.threads.countTomcat-ThreadCountNULLNumber of threads created
tomcat.jvm.threads.countTomcat-JVM.ThreadsNULLNumber of threads
tomcat.jvm.gc.collection_countTomcat-JVM.GC.collection_countNULLNumber of garbage objects collected
tomcat.jvm.open_fds_countTomcat-JVM.OpenFDsNULLNumber of Open file descriptors of the server
tomcat.req.processor.mbytes_sentTomcat-ReqProcessorDataSentNULLMBytes sent per second by all the request processors running on the Apache Tomcat
tomcat.req.processor.mbytes_receivedTomcat-ReqProcessorDataReceivedNULLMBytes received per second by all the request processors running on the Apache Tomcat
tomcat.jsp.reload_countTomcat-JspReloadCountNULLNumber of times JSPs were reloaded on all the applications per second
tomcat.jsp.countTomcat-JspCountNULLNumber of times JSPs were accessed on all the applications per second

Agent G2 - Linux - Apache Tomcat Performance Check

Description

Monitors tomcat application metrics. This monitor uses MBeans exposed via the JMX console. Please refer to Visatar documentation on how to enable JMX on your application.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Apache Tomcat Performance Checktomcat.req.processor.error_countTomcat-ReqProcessorErrorCountNULLErrors per second on all the request processors running on the Apache Tomcat
tomcat.cache.hits_countTomcat-CacheHitsCountNULLNumber of times the cache was hit per second
tomcat.jvm.gc.collection_timeTomcat-JVM.GC.collection_timeNULLTime taken for collection of the garbage objects
tomcat.jvm.uptimeTomcat-UptimeNULLUptime of the server
tomcat.jvm.mem.heap_committedTomcat-JVM.Mem.heap_committedNULLHeap memory committed (in MB) for the server
tomcat.req.processor.request_countTomcat-ReqProcessorRequestCountNULLRequests per second on all the request processors running on the Apache Tomcat
tomcat.jsp.reload_countTomcat-JspReloadCountNULLNumber of times JSPs were reloaded on all the applications per second
tomcat.req.processor.mbytes_sentTomcat-ReqProcessorDataSentNULLMBytes sent per second by all the request processors running on the Apache Tomcat
tomcat.jvm.open_fds_countTomcat-JVM.OpenFDsNULLNumber of Open file descriptors of the server
tomcat.jvm.mem.non_heap_usedTomcat-JVM.Mem.non_heap_usedNULLNon-heap memory usage (in MB) of the server
tomcat.jvm.mem.non_heap_committedTomcat-JVM.Mem.non_heap_committedNULLNon-heap memory committed (in MB) for the server
tomcat.cache.access_countTomcat-CacheAccessCountNULLNumber of times the cache was accessed per second
tomcat.jsp.countTomcat-JspCountNULLNumber of times JSPs were accessed on all the applications per second
tomcat.servlet.request_countTomcat-ServletRequestCountNULLNumber of requests served by the servlet per second
tomcat.jvm.mem.heap_usedTomcat-JVM.Mem.heap_usedNULLHeap memory usage (in MB) of the server
tomcat.servlet.processing_timeTomcat-ServletProcessingTimeNULLThe amount of processing time taken per second
tomcat.threads.countTomcat-ThreadCountNULLNumber of threads created
tomcat.req.processor.processing_timeTomcat-ReqProcessorProcessingTimeNULLThe amount of processing time taken per second
tomcat.req.processor.mbytes_receivedTomcat-ReqProcessorDataReceivedNULLMBytes received per second by all the request processors running on the Apache Tomcat
tomcat.jvm.threads.countTomcat-JVM.ThreadsNULLNumber of threads
tomcat.jvm.gc.collection_countTomcat-JVM.GC.collection_countNULLNumber of garbage objects collected
tomcat.threads.busyTomcat-ThreadsBusyNULLNumber of busy threads
tomcat.active_sessions.countTomcat-ActiveSessionsNULLNumber of active sessions to the server
tomcat.servlet.error_countTomcat-ServletErrorCountNULLNumber of erroneous requests received by the servlet per second

Agent G2 - Linux - Apache ZooKeeper Monitors

Description

Monitors ZooKeeper metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Apache ZooKeeper Monitorszookeeper.packets_receivedZooKeeper Total Packets ReceivedNULLThe number of packets received
zookeeper.connectionsZooKeeper Total ConnectionsNULLThe total count of client connections
zookeeper.nodesZooKeeper Total Node CountNULLThe number of znodes in the ZooKeeper namespace
zookeeper.packets_sentZooKeeper Total Packets sentNULLThe number of packets sent
zookeeper.node.countZooKeeper Total Node CountNULLCounts the number of zookeeper nodes
zookeeper.outstanding_requestsZooKeeper OutStanding RequestsNULLThe number of queued requests when the server is under load and is receiving more sustained requests than it can process
zookeeper.latency.maxZooKeeper Max Request LatencyNULLThe amount of time it takes for the server to respond to a client request
zookeeper.latency.minZooKeeper Min Request LatencyNULLThe amount of time it takes for the server to respond to a client request
zookeeper.zxid.countZooKeeper Zxid CountNULLZxid Count
zookeeper.max.client.cnxns.perhostZooKeeper Max client connections perhostNULLCounts the maximum number of client connections per host on the ZooKeeper server
zookeeper.zxid.epochZooKeeper Zxid EpochNULLZxid Epoch
zookeeper.latency.avgZooKeeper Average Request LatencyNULLThe amount of time it takes for the server to respond to a client request
zookeeper.packets.sentZooKeeper Total Packets sentNULLCalculates the number of packets sent from the server
zookeeper.bytes_receivedZooKeeper Total Bytes ReceivedNULLThe number of bytes received
zookeeper.outstanding.requestsZooKeeper Out Standing RequestsNULLCounts the number of outstanding client requests on the zookeeper server
zookeeper.packets.receivedZooKeeper Total Packets ReceivedNULLCalculates the number of packets received from the server
zookeeper.bytes_sentZooKeeper Total Bytes SentNULLThe number of bytes sent
zookeeper.request.latencyZooKeeper Request LatencymsAverage latency time for client requests on the zookeeper server in milliseconds
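
Many of these values correspond to fields reported by ZooKeeper's mntr four-letter-word command, which prints tab-separated key/value lines. Whether this monitor uses mntr is an assumption; the sketch below only shows how such output could be parsed, with a hypothetical sample and function name.

```python
def parse_mntr(text):
    """Parse tab-separated 'key<TAB>value' lines into a dict.

    Numeric values become int or float; anything else stays a string.
    """
    stats = {}
    for line in text.splitlines():
        parts = line.split("\t", 1)
        if len(parts) != 2:
            continue  # ignore malformed or empty lines
        key, value = parts[0].strip(), parts[1].strip()
        try:
            stats[key] = int(value)
        except ValueError:
            try:
                stats[key] = float(value)
            except ValueError:
                stats[key] = value
    return stats


# Hypothetical mntr output.
SAMPLE = "\n".join([
    "zk_version\t3.8.0",
    "zk_avg_latency\t1",
    "zk_max_latency\t12",
    "zk_min_latency\t0",
    "zk_packets_received\t70",
    "zk_packets_sent\t69",
    "zk_outstanding_requests\t0",
    "zk_znode_count\t4",
])

stats = parse_mntr(SAMPLE)
print(stats["zk_max_latency"], stats["zk_znode_count"])  # 12 4
```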

Agent G2 - Linux - Cassandra2 Monitors

Description

Monitors Cassandra version 2.x metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Cassandra2 Monitorcassandra.tasks_completedCassandra Request Completed TasksNULLApproximate number of tasks thread pool has completed execution per second on path - request
cassandra.bloom_filter_false_ratioCassandra Bloom Filter False RatioNULLThe ratio of Bloom filter false positives to total checks
cassandra.jvm.open_fdsCassandra JVM OpenFDsNULLNumber of Open file descriptors of the server
cassandra.compression_ratioCassandra Compression RatioNULLThe compression ratio for all SSTables in a column family
cassandra.write.request_timeoutsCassandra Write Request TimeoutsNULLCount of write requests not acknowledged within configurable timeout window
cassandra.commitlog.tasks_completedCassandra Commitlog Completed TasksNULLApproximate number of completed tasks per second
cassandra.dropped.messagesCassandra Dropped MessagesNULLNumber of dropped messages for each verb per second.
cassandra.compaction.tasks_completedCassandra Compaction Completed TasksNULLEstimated number of completed compaction tasks per second.
cassandra.bloom_filter_false_positivesCassandra Cassandra Bloom Filter False PositivesNULLThe number of Bloom filter false positives
cassandra.compaction.tasks_pendingCassandra Compaction Pending TasksNULLEstimated number of pending compaction tasks.
cassandra.streaming.bytes_outgoingCassandra Data SentNULLOutgoing data per second in megabytes
cassandra.load_countCassandra Disk Space UsedNULLDisk space used on a node
cassandra.internal.pending_tasksCassandra Internal Pending TasksNULLApproximate number of pending tasks thread pool has on path - internal
cassandra.cache.requestsCassandra Requests CountNULLThe number of requests to a cache
cassandra.jvm.mem_heap_usedCassandra JVM Mem heap_usedNULLHeap memory usage (in MB) of the server
cassandra.memtable_live_data_sizeCassandra Memtable Live Data SizeNULLSize of data stored in memtable
cassandra.tasks_activeCassandra Request Active TasksNULLApproximate number of tasks thread pool is actively executing on path - request
cassandra.compaction_completedCassandra Compactions CompletedNULLNumber of compactions completed per second
cassandra.capacityCassandra CapacityNULLThe capacity of the caches, such as the key cache and row cache
cassandra.jvm.threadsCassandra JVM ThreadsNULLNumber of threads
cassandra.cache.sizeCassandra SizeNULLSize of cache
cassandra.live_disk_space_used.countCassandra Live Disk Space Used CountNULLDisk space used by "live" SSTables (only counts non-obsolete files).
cassandra.max_row_sizeCassandra Max Row SizeNULLSize of the largest compacted row
cassandra.jvm.gc_collection_countCassandra JVM GC collection_countNULLNumber of garbage objects collected
cassandra.read.requestsCassandra Read RequestsNULLNumber of read requests
cassandra.tasks_pendingCassandra Request Pending TasksNULLApproximate number of pending tasks thread pool has on path - request
cassandra.internal.currently_blocked_tasksCassandra Internal Current Blocked TasksNULLNumber of currently blocked tasks on path - internal
cassandra.read.request_latencyCassandra Read Request LatencymsRead latency for all client requests
cassandra.jvm.mem_non_heap_committedCassandra JVM Mem non_heap_committedNULLNon-heap memory committed (in MB) for the server
cassandra.live_ss_table_countCassandra Live SS Table CountNULLNumber of "live" (non-obsolete) SSTables
cassandra.total_disk_space_used.countCassandra Total Disk Space Used CountNULLDisk space used by a column family
cassandra.cache.hitrateCassandra Cache Hit RateNULLCache hit rate.
cassandra.exceptions_countCassandra Data ExceptionsNULLThe number of exceptions thrown
cassandra.internal.tasks_completedCassandra Internal Completed TasksNULLApproximate number of tasks thread pool has completed execution per second on path - internal
cassandra.tasks_currently_blockedCassandra Request Current Blocked TasksNULLNumber of currently blocked tasks on path - request
cassandra.memtable_switch_count.countCassandra Memtable Switch CountNULLNumber of times a full memtable has been switched out for an empty one due to flushing
cassandra.jvm.uptimeCassandra UptimeNULLUptime of the server
cassandra.memtable_columns_countCassandra Memtable Columns CountNULLNumber of columns in memtable
cassandra.write.request_unavailablesCassandra Write Request UnavailablesNULLCount of write requests for which the required number of nodes was unavailable
cassandra.compaction_bytes_written.countCassandra Compacted BytesNULLCompacted bytes size
cassandra.write.request_latencyCassandra Write Request LatencymsWrite latency for all client requests
cassandra.internal.tasks_activeCassandra Internal Active TasksNULLApproximate number of tasks thread pool is actively executing on path - internal
cassandra.tasks_blockedCassandra Request Blocked TasksNULLNumber of blocked tasks per second on path - request
cassandra.read.request_unavailablesCassandra Read Request UnavailablesNULLCount of read requests for which the required number of nodes was unavailable
cassandra.commitlog.tasks_pendingCassandra Commitlog Pending TasksNULLApproximate number of pending tasks
cassandra.mean_row_sizeCassandra Mean Row SizeNULLAverage size of compacted rows
cassandra.write.requestsCassandra Write RequestsNULLNumber of write requests
cassandra.internal.tasks_blockedCassandra Internal Blocked TasksNULLNumber of blocked tasks per second on path - internal
cassandra.read.request_timeoutsCassandra Read Request TimeoutsNULLCount of read requests not acknowledged within configurable timeout window
cassandra.streaming.bytes_incomingCassandra Data ReceivedNULLIncoming data per second in megabytes
cassandra.min_row_sizeCassandra Min Row SizeNULLSize of the smallest compacted row
cassandra.connection.timeoutsCassandra Cassandra Connection TimeoutsNULLNumber of timeouts that occurred for this node per second
cassandra.bloom_filter_disk_space_usedCassandra Bloom Filter Disk Space UsedNULLDisk space used by the Bloom filters
cassandra.commitlog.total_sizeCassandra Commitlog SizeNULLCurrent data size of all commit log segments in megabytes
cassandra.jvm.mem_heap_committedCassandra JVM Mem heap_committedNULLHeap memory committed (in MB) for the server
cassandra.streaming.active_outboundsCassandra Active Outbound StreamsNULLCurrently active outbound streams.
cassandra.cache.hits_countCassandra Hits CountNULLThe number of hits to a cache
cassandra.jvm.gc_collection_timeCassandra JVM GC collection_timeNULLTime taken for collection of the garbage objects.
cassandra.jvm.mem_non_heap_usedCassandra JVM Mem non_heap_usedNULLNon-heap memory usage (in MB) of the server

Agent G2 - Linux - ClickHouse

Description

Monitors both ClickHouse application server and cluster devices

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - ClickHouseclickhouse_jemalloc_retainedJemalloc RetainedbyteThe amount of memory in virtual memory mappings that were retained rather than being returned to the operating system.
clickhouse_ReplicasMaxInsertsInQueueReplicas Max Inserts In QueueNULLThis metric will be renamed in a future minor release.
clickhouse_S3WriteRequestsCount_per_secS3 Write Requests Countrequests/secNumber of POST, DELETE, PUT and PATCH requests to S3 storage.
clickhouse_MergeTreeDataWriterUncompressedBytes_per_secMerge Tree Data Writer Uncompressed Bytesbytes/secUncompressed bytes (for columns as they stored in memory) INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterCompressedBytes_per_secMerge Tree Data Writer Compressed Bytesbytes/secCompressed bytes (for columns as they stored on disk) INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterRows_per_secMerge Tree Data Writer Rowsrows/secNumber of rows INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterBlocks_per_secMerge Tree Data Writer Blocksblocks/secNumber of blocks INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterBlocksAlreadySorted_per_secMerge Tree Data Writer Blocks Already Sortedblocks/secNumber of blocks that were already sorted by primary key or index of MergeTree tables.
clickhouse_MergeTreeDataWriterBlocksAverageSize_bytesMerge Tree Data Writer Blocks Average SizebytesAverage size of blocks INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterBlocksCompressed_bytesMerge Tree Data Writer Blocks CompressedbytesCompressed size of blocks INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterBlocksUncompressed_bytesMerge Tree Data Writer Blocks UncompressedbytesUncompressed size of blocks INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterRows_compressed_bytesMerge Tree Data Writer Rows CompressedbytesSize of rows INSERTed to MergeTree tables, compressed.
clickhouse_MergeTreeDataWriterMarksMerge Tree Data Writer MarksmarksNumber of marks INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterMaxMarkSize_bytesMerge Tree Data Writer Max Mark SizebytesMaximum size of a mark INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterRowsPerMarkMerge Tree Data Writer Rows Per MarkrowsNumber of rows INSERTed per mark to MergeTree tables.
clickhouse_MergeTreeDataWriterCompressedBytesPerMark_bytesMerge Tree Data Writer Compressed Bytes Per MarkbytesCompressed size of marks INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterUncompressedBytesPerMark_bytesMerge Tree Data Writer Uncompressed Bytes Per MarkbytesUncompressed size of marks INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterRowsPerByteMerge Tree Data Writer Rows Per Byterows/byteNumber of rows INSERTed per byte to MergeTree tables.
clickhouse_MergeTreeDataWriterCompressionRatioMerge Tree Data Writer Compression RatioratioCompression ratio for rows INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterAverageCompressedBytesPerByteMerge Tree Data Writer Average Compressed Bytes Per Bytebytes/byteAverage compressed size of a byte INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterCompressedRows_bytesMerge Tree Data Writer Compressed RowsrowsNumber of rows INSERTed to MergeTree tables, compressed.
clickhouse_MergeTreeDataWriterUncompressedRows_bytesMerge Tree Data Writer Uncompressed RowsrowsNumber of rows INSERTed to MergeTree tables, uncompressed.
clickhouse_MergeTreeDataWriterFinalCompressedBytes_bytesMerge Tree Data Writer Final Compressed BytesbytesCompressed size of final data INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterFinalUncompressedBytes_bytesMerge Tree Data Writer Final Uncompressed BytesbytesUncompressed size of final data INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterDiskWritePerformance_bytes_per_secMerge Tree Data Writer Disk Write Performancebytes/secDisk write speed for rows INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterRowsPerDiskWriteMerge Tree Data Writer Rows Per Disk WriterowsNumber of rows INSERTed per disk write to MergeTree tables.
clickhouse_MergeTreeDataWriterWritePerformance_rows_per_secMerge Tree Data Writer Write Performancerows/secWrite performance of rows INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterElapsedMerge Tree Data Writer ElapsedsecondsTime taken to write data to MergeTree tables.
clickhouse_MergeTreeDataWriterRows_before_markMerge Tree Data Writer Rows Before MarkrowsNumber of rows INSERTed to MergeTree tables before a mark.
clickhouse_MergeTreeDataWriterRows_before_sampleMerge Tree Data Writer Rows Before SamplerowsNumber of rows INSERTed to MergeTree tables before a sample.
clickhouse_MergeTreeDataWriterMergeElapsedMerge Tree Data Writer Merge ElapsedsecondsTime taken to perform merge operations on MergeTree tables.
clickhouse_MergeTreeDataWriterFinalMergeElapsedMerge Tree Data Writer Final Merge ElapsedsecondsTime taken to perform final merge operations on MergeTree tables.
clickhouse_MergeTreeDataWriterPartsMergedMerge Tree Data Writer Parts MergedpartsNumber of parts merged during MergeTree operations.
clickhouse_MergeTreeDataWriterFinalPartsMerge Tree Data Writer Final PartspartsNumber of final parts after merge operations on MergeTree tables.
clickhouse_MergeTreeDataWriterPartMutationMerge Tree Data Writer Part MutationNULLInformation about part mutations during MergeTree operations.
clickhouse_MergeTreeDataWriterPartMutationAddMerge Tree Data Writer Part Mutation AddNULLInformation about added part mutations during MergeTree operations.
clickhouse_MergeTreeDataWriterPartMutationRemoveMerge Tree Data Writer Part Mutation RemoveNULLInformation about removed part mutations during MergeTree operations.
clickhouse_MergeTreeDataWriterPartMutationClearMerge Tree Data Writer Part Mutation ClearNULLInformation about cleared part mutations during MergeTree operations.
clickhouse_MergeTreeDataWriterPartMutationAlterMerge Tree Data Writer Part Mutation AlterNULLInformation about altered part mutations during MergeTree operations.
clickhouse_MergeTreeDataWriterTableRowsInsertedMerge Tree Data Writer Table Rows InsertedrowsNumber of rows INSERTed to MergeTree tables.
clickhouse_MergeTreeDataWriterTableRowsDeletedMerge Tree Data Writer Table Rows DeletedrowsNumber of rows DELETed from MergeTree tables.
clickhouse_MergeTreeDataWriterTableRowsUpdatedMerge Tree Data Writer Table Rows UpdatedrowsNumber of rows UPDATed in MergeTree tables.
clickhouse_MergeTreeDataWriterTableTTLRowsMerge Tree Data Writer Table TTL RowsrowsNumber of rows with expired TTL settings in MergeTree tables.
clickhouse_MergeTreeDataWriterPartMutationsMerge Tree Data Writer Part MutationsmutationsNumber of part mutations during MergeTree operations.
clickhouse_MergeTreeDataWriterZooKeeperCheckMerge Tree Data Writer ZooKeeper CheckNULLInformation about ZooKeeper checks during MergeTree operations.
clickhouse_MergeTreeDataWriterTableTTLCheckMerge Tree Data Writer Table TTL CheckNULLInformation about TTL checks during MergeTree operations.
clickhouse_MergeTreeDataWriterSettingsMerge Tree Data Writer SettingsNULLInformation about settings used during MergeTree operations.

Agent G2 - Linux - ContainerD_v2

Description

Agent G2 - Linux - ContainerD_v2

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - ContainerD_v2containerd_cpu_throttling_throttledTimeCPU Throttled TimePercentagecpu throttled time
containerd_memory_usage_limitMemory Usage LimitMegabyteslimit of memory usage
containerd_memory_swap_limitSwap Usage LimitMegabyteslimit of swap usage
containerd_memory_usage_failcntMemory Usage fail RateNULLrate of number of times that the cgroup limit was exceeded
containerd_blkio_sectors_recursiveBlkIO SectorsBytesnumber of sectors transferred to/from disk by the group
containerd_memory_kernel_tcp_usageContinaerD Kernel TCP UsageMegabytescurrent tcp buf memory allocation
containerd_hugetlb_failcntHugeTLB fail RateNULLRate of allocation failure due to HugeTLB limit
containerd_cpu_usage_totalCPU Total UsagePercentagetotal Cpu usage of container with respect to host system
containerd_memory_dirtyMemory DirtyMegabytesbytes that are waiting to get written back to the disk
containerd_cpu_usage_total_over_limitCPU Total Usage Over LimitPercentagecontainer total cpu usage with respect to limit. (if limit is not set the metric is not sent)
containerd_memory_kernel_usageContinaerD Kernel UsageMegabytescurrent kernel memory allocation
containerd_memory_rssMemory RSSMegabytesbytes of anonymous and swap cache memory (includes transparent hugepages)
containerd_containers_stoppedStopped ContainersCountTotal number of Stopped Containers
containerd_blkio_wait_time_recursiveBlkIO Wait TimeBytesTotal amount of time the IOs for this cgroup spent waiting in the scheduler queues for service
containerd_memory_cacheMemory CacheMegabytesbytes of page cache memory
containerd_containers_runningRunning ContainersCountTotal number of running containers
containerd_memory_rss_hugeMemory RSS HugeMegabytesbytes of anonymous transparent hugepages
containerd_cpu_usage_userCPU User UsagePercentageuser Cpu usage of container with respect to host system
containerd_memory_usage_over_limitMemory Usage Over LimitPercentageMemory Usage percentage with respect to limit(if limit is not set then node total memory is used)
containerd_memory_kernel_maxKernel MaxMegabytesmax kernel memory usage recorded
containerd_container_uptimeContainer UptimeSecondsUptime of the Current Container
containerd_cpu_usage_systemCPU System UsagePercentagesystem Cpu usage of container with respect to host system
containerd_proc_open_fdsnumber of open fdCountNumber of open file descriptors
containerd_memory_usage_maxMemory Usage MaxMegabytesshow max memory usage recorded
containerd_blkio_queued_recursiveBlkIO QueuedBytesTotal number of requests queued up at any given instant for the cgroup
containerd_memory_swap_maxSwap Usage MaxMegabytesshow max swap usage recorded
containerd_memory_kernel_tcp_failcntKernel TCP fail rateNULLrate of number of tcp buf memory usage hits limits
containerd_hugetlb_usageContianerD HugeTLB usageBytescurrent usage for "hugepagesize" hugetlb
containerd_blkio_merged_recursiveBlkIO MergedBytesTotal number of bios/requests merged into requests belonging to this cgroup
containerd_blkio_service_time_recursiveBlkIO Service TimeBytesTotal amount of time between request dispatch and request completion for the IOs
containerd_blkio_serviced_recursiveBlkIO ServicedBytesNumber of IOs (bio) issued to the disk by the group
containerd_blkio_service_bytes_recursiveBlkIO Service BytesBytesNumber of bytes transferred to/from the disk
containerd_memory_swap_failcntSwap Usage fail RateNULLrate of number of times that the cgroup swap limit was exceeded
containerd_memory_kernel_failcntKernel fail countNULLrate at which kernel memory usage hits the limit
containerd_image_sizeImage SizeBytesImage sizes of different container images
containerd_memory_swap_usageContinaerD Swap UsageMegabytesswap Usage in Bytes
containerd_hugetlb_maxHugeTLB max usageBytesmax "hugepagesize" hugetlb usage recorded
containerd_memory_usageMemory UsageMegabytesmemory Usage in Bytes
containerd_blkio_time_recursiveBlkIO TimeBytesdisk time allocated to cgroup per device in milliseconds
containerd_memory_kernel_tcp_limitKernel TCP LimitMegabytesshow hard limit for tcp buf memory
containerd_memory_kernel_limitKernel LimitMegabyteshard limit for kernel memory
containerd_memory_kernel_tcp_maxKernel TCP MaxMegabytesmax tcp buf memory usage recorded

Agent G2 - Linux - Couchbase Monitors

Description

Applicable on Couchbase servers

Prerequisites

NULL

Supported Metric

| Monitor Name | Metric Name | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| Agent G2 - Linux - ContainerD_v2 | containerd_memory_usage_failcnt | Memory Usage fail Rate | NULL | rate of number of times that the cgroup limit was exceeded |
| | containerd_blkio_sectors_recursive | BlkIO Sectors | Bytes | number of sectors transferred to/from disk by the group |
| | containerd_memory_kernel_tcp_usage | ContinaerD Kernel TCP Usage | Megabytes | current tcp buf memory allocation |
| | containerd_hugetlb_failcnt | HugeTLB fail Rate | NULL | Rate of allocation failure due to HugeTLB limit |
| | containerd_cpu_usage_total | CPU Total Usage | Percentage | total CPU usage of container with respect to host system |
| | containerd_memory_dirty | Memory Dirty | Megabytes | bytes that are waiting to get written back to the disk |
| | containerd_cpu_usage_total_over_limit | CPU Total Usage Over Limit | Percentage | container total cpu usage with respect to limit. (if limit is not set the metric is not sent) |
| | containerd_memory_kernel_usage | ContinaerD Kernel Usage | Megabytes | current kernel memory allocation |
| | containerd_memory_rss | Memory RSS | Megabytes | bytes of anonymous and swap cache memory (includes transparent hugepages) |
| | containerd_containers_stopped | Stopped Containers | Count | Total number of Stopped Containers |
| | containerd_blkio_wait_time_recursive | BlkIO Wait Time | Bytes | Total amount of time the IOs for this cgroup spent waiting in the scheduler queues for service |
| | containerd_memory_cache | Memory Cache | Megabytes | bytes of page cache memory |
| | containerd_containers_running | Running Containers | Count | Total number of running containers |
| | containerd_memory_rss_huge | Memory RSS Huge | Megabytes | bytes of anonymous transparent hugepages |
| | containerd_cpu_usage_user | CPU User Usage | Percentage | user CPU usage of container with respect to host system |
| | containerd_memory_usage_over_limit | Memory Usage Over Limit | Percentage | Memory Usage percentage with respect to limit (if limit is not set then node total memory is used) |
| | containerd_memory_kernel_max | Kernel Max | Megabytes | max kernel memory usage recorded |
| | containerd_container_uptime | Container Uptime | Seconds | Uptime of the Current Container |
| | containerd_cpu_usage_system | CPU System Usage | Percentage | system CPU usage of container with respect to host system |
| | containerd_proc_open_fds | number of open fd | Count | Number of open file descriptors |
| | containerd_memory_usage_max | Memory Usage Max | Megabytes | show max memory usage recorded |
| | containerd_blkio_queued_recursive | BlkIO Queued | Bytes | Total number of requests queued up at any given instant for the cgroup |
| | containerd_memory_swap_max | Swap Usage Max | Megabytes | show max swap usage recorded |
| | containerd_memory_kernel_tcp_failcnt | Kernel TCP fail rate | NULL | rate at which tcp buf memory usage hits the limit |
| | containerd_hugetlb_usage | ContianerD HugeTLB usage | Bytes | current usage for "hugepagesize" hugetlb |
| | containerd_blkio_merged_recursive | BlkIO Merged | Bytes | Total number of bios/requests merged into requests belonging to this cgroup |
| | containerd_blkio_service_time_recursive | BlkIO Service Time | Bytes | Total amount of time between request dispatch and request completion for the IOs |
| | containerd_blkio_serviced_recursive | BlkIO Serviced | Bytes | Number of IOs (bio) issued to the disk by the group |
| | containerd_blkio_service_bytes_recursive | BlkIO Service Bytes | Bytes | Number of bytes transferred to/from the disk |
| | containerd_memory_swap_failcnt | Swap Usage fail Rate | NULL | rate of number of times that the cgroup swap limit was exceeded |
| | containerd_memory_kernel_failcnt | Kernel fail count | NULL | rate at which kernel memory usage hits the limit |
| | containerd_image_size | Image Size | Bytes | Image sizes of different container images |
| | containerd_memory_swap_usage | ContinaerD Swap Usage | Megabytes | swap Usage in Bytes |
| | containerd_hugetlb_max | HugeTLB max usage | Bytes | max "hugepagesize" hugetlb usage recorded |
| | containerd_memory_usage | Memory Usage | Megabytes | memory Usage in Bytes |
| | containerd_blkio_time_recursive | BlkIO Time | Bytes | disk time allocated to cgroup per device in milliseconds |
| | containerd_memory_kernel_tcp_limit | Kernel TCP Limit | Megabytes | show hard limit for tcp buf memory |
| | containerd_memory_kernel_limit | Kernel Limit | Megabytes | hard limit for kernel memory |
| | containerd_memory_kernel_tcp_max | Kernel TCP Max | Megabytes | max tcp buf memory usage recorded |
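
The two "over limit" metrics listed for containerd describe different behavior when no limit is configured: the memory metric falls back to the node's total memory, while the CPU metric is not sent at all. The following is a minimal sketch of that logic, not the agent's actual implementation; the function names and parameters are hypothetical and chosen only for illustration.

```python
def memory_usage_over_limit(usage_bytes, limit_bytes, node_total_bytes):
    """Memory usage as a percentage of the effective limit.

    Hypothetical helper: when no limit is set (None or 0), the node's
    total memory is used as the denominator, as the metric description states.
    """
    effective_limit = limit_bytes if limit_bytes else node_total_bytes
    if effective_limit <= 0:
        raise ValueError("effective memory limit must be positive")
    return 100.0 * usage_bytes / effective_limit


def cpu_usage_over_limit(usage_cores, limit_cores):
    """CPU usage as a percentage of the configured limit.

    Hypothetical helper: returns None when no limit is set, mirroring
    "if limit is not set the metric is not sent".
    """
    if not limit_cores:
        return None
    return 100.0 * usage_cores / limit_cores


if __name__ == "__main__":
    print(memory_usage_over_limit(256, 1024, 4096))  # limit set
    print(memory_usage_over_limit(1024, None, 4096))  # falls back to node total
    print(cpu_usage_over_limit(0.5, None))            # metric skipped
```

A caller would simply skip publishing the CPU value when the helper returns None.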

Agent G2 - Linux - CRI-O Monitoring

Description

Monitoring of OCI-based implementation of Kubernetes Container Runtime Interface

Prerequisites

NULL

Supported Metric

| Monitor Name | Metric Name | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| Agent G2 - Linux - CRI-O Monitoring | crio_operations_latency_microseconds | Operations Latency Microseconds | NULL | Latency in microseconds of CRI-O operations. Broken down by operation type |
| | crio_memory_working_set | Memory Working Set | Bytes | The amount of working set memory in bytes. |
| | crio_image_pulls_failures | Image Pulls Failures | NULL | Failed image pulls by image name and their error category. |
| | crio_operations_errors | Operations Errors | NULL | Cumulative number of CRI-O operation errors by operation type. |
| | crio_mem_resident | Mem Resident | Bytes | Resident memory size in bytes. |
| | crio_inodes_used | Inodes Used | NULL | represents the inodes used by the images. (This may not equal InodesCapacity - InodesAvailable because the underlying filesystem may also be used for purposes other than storing images.) |
| | crio_cpu_usage_core | CPU Usage | Nanoseconds | Cumulative CPU usage (sum across all cores) since object creation. |
| | crio_cpu_time | Cpu Time | NULL | Total user and system CPU time spent in seconds. |
| | crio_operations_latency_microseconds_count | Operations Latency Microseconds Count | Microseconds | Latency in microseconds of CRI-O operations. Broken down by operation type. count value |
| | crio_operations_latency_microseconds_sum | Operations Latency Microseconds Sum | Microseconds | Latency in microseconds of CRI-O operations. Broken down by operation type. sum value |
| | crio_mem_virtual | Mem Virtual | Bytes | Virtual memory size in bytes. |
| | crio_filesystem_used | Filesystem Used | Bytes | represents the bytes used for images on the filesystem. (This may differ from the total bytes used on the filesystem and may not equal CapacityBytes - AvailableBytes.) |
| | crio_operations | Operations Count | NULL | Cumulative number of CRI-O operations by operation type. |
| | crio_process_open_fds | Process Open Fds | NULL | Number of open file descriptors. |
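
The crio_operations_latency_microseconds_sum and crio_operations_latency_microseconds_count metrics can be combined into an average latency per operation. The sketch below assumes both values are cumulative (as is conventional for Prometheus-style sum/count pairs), so the average over a polling interval is the ratio of their deltas. The function name and sample values are hypothetical.

```python
def average_latency_us(prev_sum, prev_count, curr_sum, curr_count):
    """Average latency (microseconds) of operations completed between two samples.

    Assumes *_sum and *_count are cumulative. Returns None when no
    operations completed in the interval (or the counters were reset).
    """
    delta_count = curr_count - prev_count
    if delta_count <= 0:
        return None
    return (curr_sum - prev_sum) / delta_count


if __name__ == "__main__":
    # Three operations completed, adding 600 microseconds in total.
    print(average_latency_us(1000.0, 10, 1600.0, 13))
```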

Agent G2 - Linux - Docker Host Monitoring Template

Description

Docker Host Monitoring Template

Prerequisites

NULL

Supported Metric

| Monitor Name | Metric Name | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| Agent G2 - Linux - Docker Host Monitoring Template | docker.cpu | Docker Cpu | NULL | CPU usage of the Docker Container |
| | docker.container.states | Docker Container states | NULL | The state of Container |
| | docker.mem.cache.95percentile | Cache Size 95percentile | NULL | 95th percentile value of docker.mem.cache |
| | docker.mem.inactive_file | Inactive Cache Memory | NULL | The amount of "inactive" cache memory. Inactive memory may be reclaimed first when the system needs memory |
| | docker.cpu.shares | Shares of CPU | NULL | Shares of CPU usage allocated to the container |
| | docker.image.virtual_size | Image Virtual Size | NULL | Size of all layers of the image on disk |
| | docker.mem.rss.95percentile | RSS Memory 95percentile | NULL | 95th percentile value of docker.mem.rss |
| | docker.mem.sw_limit | Swap Memory Limit | NULL | The swap + memory limit for the container, if set |
| | kubernetes.network.tx_bytes | Network Tx Bytes | NULL | The amount of bytes per second transmitted |
| | docker.cpu.system | CPU System | NULL | The percent of time the CPU is executing system calls on behalf of processes of this container, unnormalized |
| | docker.cpu.throttled | CPU Throttled | NULL | Number of times the cgroup has been throttled |
| | docker.image.size | Image Size | NULL | Size of all layers of the image on disk |
| | docker.mem.in_use | Memory In Use | NULL | The fraction of used memory to available memory, if the limit is set |
| | docker.io.write_bytes | IO Write Bytes | NULL | Bytes written per second to disk by the processes of the container |
| | docker.mem.active_file | Active Cache Memory | NULL | The amount of "active" cache memory. Active memory is reclaimed by the system only after "inactive" has been reclaimed |
| | docker.containers.running | Containers Running by Image | NULL | The number of containers running on this host tagged by image |
| | docker.mem.mapped_file | Memory Mapped by Process | NULL | The amount of memory mapped by the processes in the control group |
| | kubernetes.memory.usage | Memory Usage | NULL | The amount of memory used |
| | docker.mem.cache | Cache Size | NULL | The amount of memory that is being used to cache data from disk (e.g. memory contents that can be associated precisely with a block on a block device) |
| | docker.cpu.usage | CPU Usage | NULL | The percent of CPU time obtained by this container |
| | docker.mem.sw_in_use | Swap Memory In Use | NULL | The fraction of used swap + memory to available swap + memory, if the limit is set |
| | docker.mem.in_use.95percentile | Memory In Use 95percentile | NULL | 95th percentile of docker.mem.in_use |
| | docker.images.intermediate | Images intermediate | NULL | The number of intermediate images, which are intermediate layers that make up other images |
| | docker.containers.running_total | Docker Container Running Total | NULL | The total number of containers running on this host |
| | docker.mem.active_anon | Active RSS Memory | NULL | The amount of "active" RSS memory. Active memory is not swapped to disk |
| | docker.io.read_bytes.95percentile | IO Read Bytes 95percentile | NULL | 95th percentile of docker.io.read_bytes |
| | docker.mem.sw_limit.95percentile | Swap Memory Limit 95percentile | NULL | 95th percentile of docker.mem.sw_limit. Ordinarily this value will not change |
| | kubernetes.network.rx_bytes | Network Rx Bytes | NULL | The amount of bytes per second received |
| | docker.container.size_rootfs | Root Filesystem Size | NULL | Total size of all the files in the container |
| | kubernetes.memory.limits | Memory Limits | NULL | The limit of memory set |
| | docker.mem.inactive_anon | Inactive RSS Memory | NULL | The amount of "inactive" RSS memory. Inactive memory is swapped to disk when necessary |
| | docker.mem.pgpgout | Pages Uncharged Rate | NULL | The rate at which pages are "uncharged" (removed from the accounting) of a cgroup |
| | docker.mem.rss | RSS Memory | NULL | The amount of non-cache memory that belongs to the container's processes. Used for stacks, heaps, etc. |
| | docker.io.read_bytes | IO Read Bytes | NULL | Bytes read per second from disk by the processes of the container |
| | docker.mem.pgpgin | Pages Charged Rate | NULL | The rate at which pages are "charged" (added to the accounting) of a cgroup |
| | docker.container.size_rw.95percentile | Total Files Size 95Percentile | NULL | 95th percentile of docker.container.size_rw |
| | docker.mem.pgfault | Memory Page Faults | NULL | The rate that processes in the container trigger page faults by accessing a nonexistent or protected part of their virtual address space. Usually a page fault of this type results in a segmentation fault |
| | docker.container.size_rw | Total Files Size | NULL | Total size of all the files in the container which have been created or changed by processes running in the container |
| | docker.images.available | Images Available | NULL | The number of top-level images |
| | docker.containers.stopped | Containers Stopped by Image | NULL | The number of containers stopped on this host tagged by image |
| | docker.memory | Docker Memory | NULL | Memory usage of the Docker Container |
| | docker.cpu.system.95percentile | CPU System 95Percentile | NULL | 95th percentile of docker.cpu.system |
| | docker.mem.limit | Memory Limit | NULL | The memory limit for the container, if set |
| | docker.io.write_bytes.95percentile | IO Write Bytes 95percentile | NULL | 95th percentile of docker.io.write_bytes |
| | docker.mem.pgmajfault | Memory Page Faults Virtual | NULL | The rate that processes in the container trigger page faults by accessing a part of their virtual address space that was swapped out or corresponded to a mapped file. Usually a page fault of this type results in fetching data from disk instead of from memory |
| | docker.cpu.user.95percentile | CPU User 95Percentile | NULL | 95th percentile of docker.cpu.user |
| | docker.cpu.user | CPU User | NULL | The percent of time the CPU is under direct control of processes of this container, unnormalized |
| | docker.mem.limit.95percentile | Memory Limit 95percentile | NULL | 95th percentile of docker.mem.limit. Ordinarily this value will not change |
| | docker.disk.io | Docker Disk Iops | NULL | Disk IOPS of the Docker Container |
| | docker.container.size_rootfs.95percentile | Root Filesystem Size 95Percentile | NULL | 95th percentile of docker.container.size_rootfs |
| | docker.mem.sw_in_use.95percentile | Swap Memory In Use 95percentile | NULL | 95th percentile of docker.mem.sw_in_use |
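
Several of the Docker host metrics are 95th percentile variants of a base metric (for example, docker.mem.rss.95percentile for docker.mem.rss). The document does not say how the percentile is computed, so the following is only an illustrative sketch using the nearest-rank method over a window of samples; the function name and the sample window are assumptions.

```python
import math


def percentile_nearest_rank(samples, pct):
    """Nearest-rank percentile of a non-empty list of numeric samples.

    Illustrative only: other percentile definitions (for example, linear
    interpolation) give slightly different values on small windows.
    """
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in the range (0, 100]")
    ordered = sorted(samples)
    # Rank is the smallest 1-based position covering pct percent of the samples.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]


if __name__ == "__main__":
    rss_window = list(range(1, 21))  # hypothetical docker.mem.rss samples
    print(percentile_nearest_rank(rss_window, 95))
```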

Agent G2 - Linux - Docker Monitoring Template

Description

Template to monitor docker host metrics and containers.

Prerequisites

NULL

Supported Metric

| Monitor Name | Metric Name | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| Agent G2 - Linux - Docker Monitoring Template | docker.containers.running.total | Docker Total Containers | NULL | |
| | docker.cpu | Docker Cpu | NULL | CPU usage of the Docker Container |
| | docker.memory | Docker Memory | NULL | Memory usage of the Docker Container |
| | docker.network | Docker Network | NULL | Network usage of the Docker Container |
| | docker.disk.io | Docker Disk Iops | NULL | Disk IOPS of the Docker Container |

Agent G2 - Linux - Docker Monitors

Description

Docker Monitoring Template

Prerequisites

NULL

Supported Metric

| Monitor Name | Metric Name | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| Agent G2 - Linux - Docker Monitors | docker.containers.running.total | Docker Total Containers | NULL | |
| | docker.disk.io | Docker Disk Iops | NULL | Disk IOPS of the Docker Container |
| | docker.cpu | Docker Cpu | NULL | CPU usage of the Docker Container |
| | docker.network | Docker Network | NULL | Network usage of the Docker Container |
| | docker.memory | Docker Memory | NULL | Memory usage of the Docker Container |

Agent G2 - Linux - Elasticsearch Monitors

Description

Monitors various performance metrics on the Elasticsearch servers

Prerequisites

NULL

Supported Metric

| Monitor Name | Metric Name | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| Agent G2 - Linux - Elasticsearch Monitors | es.flush.time | ElasticSearch Flush time | NULL | |
| | es.jvm.gc.collection_time | ESJVM GC collection_time | NULL | |
| | es.merges.ops | ElasticSearch Merges ops | NULL | |
| | es.search.fetch.ops | ElasticSearch Search fetch ops | NULL | |
| | es.cache.field.size | ElasticSearch Cache field size | NULL | |
| | es.jvm.gc.par_new.collection_time | ESJVM GC par_new collection_time | NULL | |
| | es.cluster.initializing_shards | ElasticSearch Cluster Initializing shards | NULL | |
| | es.search.query.time | ElasticSearch Search query time | NULL | |
| | es.process.openfds | ElasticSearch Process OpenFDs | NULL | |
| | es.jvm.gc.copy.count | ESJVM GC copy count | NULL | |
| | es.cluster.unassigned_shards | ElasticSearch Cluster Unassigned shards | NULL | |
| | es.docs.deleted | ElasticSearch Docs deleted | NULL | |
| | es.refresh.time | ElasticSearch Refresh time | NULL | |
| | es.search.fetch.time | ElasticSearch Search fetch time | NULL | |
| | es.failure.domains | ElasticSearch Failure Domains | NULL | |
| | es.jvm.gc.par_new.count | ESJVM GC par_new count | NULL | |
| | es.merged.docs.size | ElasticSearch Merged docs size | NULL | |
| | es.cache.filter.size | ElasticSearch Cache filter size | NULL | |
| | es.search.query.ops | ElasticSearch Search query ops | NULL | |
| | es.cluster.shards.active.primary | ElasticSearch Cluster Active primary shards | NULL | |
| | es.jvm.mem.heap_committed | ESJVM Mem heap_committed | NULL | |
| | es.cache.filter.evictions | ElasticSearch Cache filter evictions | NULL | |
| | es.cluster.nodes | ElasticSearch Cluster Nodes | NULL | |
| | es.docs.total | ElasticSearch Docs total | NULL | |
| | es.cache.field.evictions | ElasticSearch Cache field evictions | NULL | |
| | es.cluster.data_nodes | ElasticSearch Cluster Data nodes | NULL | |
| | es.jvm.mem.non_heap_used | ESJVM Mem non_heap_used | NULL | |
| | es.Store.size | ElasticSearch Store size | NULL | |
| | es.cluster.relocating_shards | ElasticSearch Cluster Relocating shards | NULL | |
| | es.jvm.gc.concurrent_mark_sweep.collection_time | ESJVM GC concurrent_mark_sweep collection_time | NULL | |
| | es.cache.filter.count | ElasticSearch Cache filter count | NULL | |
| | es.cluster.shards.active | ElasticSearch Cluster Active shards | NULL | |
| | es.jvm.mem.non_heap_committed | ESJVM Mem non_heap_committed | NULL | |
| | es.jvm.gc.copy.collection_time | ESJVM GC copy collection_time | NULL | |
| | es.merges.time | ElasticSearch Merges time | NULL | |
| | es.refresh.ops | ElasticSearch Refresh ops | NULL | |
| | es.indexing.index.time | ElasticSearch Indexing index time | NULL | |
| | es.indexing.index.count | ElasticSearch Indexing index count | NULL | |
| | es..master.eligible.nodes | ElasticSearch Master Eligible Nodes | NULL | |
| | es.jvm.gc.collection_count | ESJVM GC collection_count | NULL | |
| | es.flush.ops | ElasticSearch Flush ops | NULL | |
| | es.merged.docs.count | ElasticSearch Merged docs count | NULL | |
| | es.jvm.gc.concurrent_mark_sweep.count | ESJVM GC concurrent_mark_sweep count | NULL | |
| | es.jvm.mem.heap_used | ESJVM Mem heap_used | NULL | |
| | es.jvm.threads | ESJVM Threads | NULL | |

Agent G2 - Linux - etcd - v2

Description

Agent G2 - Linux - etcd - v2

Prerequisites

NULL

Supported Metric

| Monitor Name | Metric Name | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| Agent G2 - Linux - etcd - v2 | etcd_debugging_snap_save_marshalling_duration_seconds_bucket | Snap Save Marshalling Duration | NULL | The marshalling cost distributions of save called by snapshot. |
| | etcd_go_threads | Go Threads | NULL | Number of OS threads created. |
| | etcd_network_peer_sent_bytes_total_per_sec | Network Peer Sent Bytes Total | NULL | The total number of bytes sent to peers per sec |
| | etcd_go_memstats_stack_inuse_bytes | Go Memstats Stack Inuse Bytes | NULL | Number of bytes in use by the stack allocator. |
| | etcd_debugging_store_watchers | Store Watchers | NULL | Count of currently active watchers. |
| | etcd_debugging_mvcc_keys_total | Mvcc Keys | NULL | Total number of keys. |
| | etcd_go_memstats_heap_idle_bytes | Go Memstats Heap Idle Bytes | NULL | Number of heap bytes waiting to be used. |
| | etcd_grpc_server_handled_total | Grpc Server Handled Total | NULL | Total number of RPCs completed on the server, regardless of success or failure. |
| | etcd_grpc_proxy_events_coalescing_total | Grpc Proxy Events Coalescing | NULL | Total number of events coalescing |
| | etcd_disk_wal_fsync_duration_seconds_bucket | Disk Wal Fsync Duration | NULL | The latency distributions of fsync called by wal. |
| | etcd_network_peer_round_trip_time_seconds | Network Peer Round Trip Time Seconds | NULL | Round-Trip-Time histogram between peers |
| | etcd_server_proposals_failed_total | Server Proposals Failed | NULL | The total number of failed proposals seen. |
| | etcd_debugging_store_expires_total_per_sec | Store Expires | NULL | Total number of expired keys per sec |
| | etcd_grpc_proxy_cache_hits_total | Grpc Proxy Cache Hits | NULL | Total number of cache hits |
| | etcd_debugging_mvcc_db_total_size_in_bytes | Mvcc Db Size In Bytes | NULL | Total size of the underlying database in bytes. |
| | etcd_go_memstats_mspan_sys_bytes | Go Memstats Mspan Sys Bytes | NULL | Number of bytes used for mspan structures obtained from system. |
| | etcd_debugging_snap_save_total_duration_seconds_bucket | Snap Save Duration | NULL | The total latency distributions of save called by snapshot. |
| | etcd_debugging_mvcc_delete_total | Mvcc Delete | NULL | Total number of deletes seen by this member. |
| | etcd_go_memstats_gc_cpu_fraction | Go Memstats Gc Cpu Fraction | NULL | The fraction of this program's available CPU time used by the GC since the program started. |
| | etcd_debugging_mvcc_db_compaction_total_duration_milliseconds_bucket | Mvcc Db Compaction Duration | NULL | Bucketed histogram of db compaction total duration. |
| | etcd_network_client_grpc_sent_bytes_total_per_sec | Network Client Grpc Sent Bytes | NULL | The total number of bytes sent to grpc clients per sec |
| | etcd_go_memstats_heap_alloc_bytes | Go Memstats Heap Alloc Bytes | NULL | Number of heap bytes allocated and still in use. |
| | etcd_network_client_grpc_sent_bytes_total | Network Client Grpc Sent Bytes | NULL | The total number of bytes sent to grpc clients. |
| | etcd_debugging_store_expires_total | Store Expires | NULL | Total number of expired keys. |
| | etcd_debugging_mvcc_db_compaction_keys_total | Mvcc Db Compaction Keys | NULL | Total number of db keys compacted. |
| | etcd_debugging_server_lease_expired_total | Server Lease Expired | NULL | The total number of expired leases. |
| | etcd_network_client_grpc_received_bytes_total_per_sec | Network Client Grpc Received Bytes | NULL | The total number of bytes received from grpc clients per sec |
| | etcd_debugging_store_writes_total | Store Writes | NULL | Total number of writes (e.g. set/compareAndDelete) seen by this member. |
| | etcd_go_memstats_buck_hash_sys_bytes | Go Memstats Buck Hash Sys Bytes | NULL | Number of bytes used by the profiling bucket hash table. |
| | etcd_server_leader_changes_seen_total_per_sec | Server Leader Changes Seen | NULL | The number of leader changes seen per sec |
| | etcd_network_peer_received_bytes_total_per_sed | Network Peer Received Bytes Total | Bytes Per Sec | The total number of bytes received from peers per sec |
| | etcd_server_proposals_applied_total | Server Proposals Applied | NULL | The total number of consensus proposals applied. |
| | etcd_grpc_proxy_watchers_coalescing_total | Grpc Proxy Watchers Coalescing | NULL | Total number of current watchers coalescing |
| | etcd_server_is_leader | Server Is Leader | NULL | Whether or not this member is a leader. 1 if is, 0 otherwise. |
| | etcd_process_start_time_seconds | Process Start Time Seconds | NULL | Start time of the process since unix epoch in seconds. |
| | etcd_debugging_mvcc_events_total | Mvcc Events | NULL | Total number of events sent by this member. |
| | etcd_debugging_mvcc_txn_total | Mvcc Txn | NULL | Total number of txns seen by this member. |
| | etcd_debugging_store_watch_requests_total | Store Watch Requests | NULL | Total number of incoming watch requests (new or reestablished). |
| | etcd_go_memstats_lookups_total_per_sec | Go Memstats Lookups Total | NULL | Total number of pointer lookups per sec |
| | etcd_server_version | Server Version | NULL | Which version is running. 1 for 'server_version' label with current version |
| | etcd_grpc_server_started_total | Grpc Server Started Total | NULL | Total number of RPCs started on the server. |
| | etcd_go_memstats_alloc_bytes | Go Memstats Alloc Bytes | NULL | Number of bytes allocated and still in use. |
| | etcd_disk_backend_commit_duration_seconds_bucket | Disk Backend Commit Duration | NULL | The latency distributions of commit called by backend. |
| | etcd_debugging_store_reads_total_per_sec | Store Reads | NULL | Total number of reads action by (get/getRecursive), local to this member per sec |
| | etcd_process_open_fds | Process Open Fds | NULL | Number of open file descriptors. |
| | etcd_disk_wal_fsync_duration_seconds_count | Disk Wal Fsync Duration Seconds Count | NULL | The latency distributions of fsync called by wal. |
| | etcd_go_memstats_heap_inuse_bytes | Go Memstats Heap Inuse Bytes | NULL | Number of heap bytes that are in use. |
| | etcd_go_memstats_next_gc_bytes | Go Memstats Next Gc Bytes | NULL | Number of heap bytes when next garbage collection will take place. |
| | etcd_debugging_mvcc_range_total_per_sec | Mvcc Range | NULL | Total number of ranges seen by this member per sec |
| | etcd_process_cpu_seconds_total_per_sec | Process Cpu Seconds Total | NULL | Total user and system CPU time spent per sec |
| | etcd_debugging_mvcc_db_compaction_keys_total_per_sec | Mvcc Db Compaction Keys | NULL | Total number of db keys compacted per sec |
| | etcd_go_memstats_heap_released_bytes | Go Memstats Heap Released Bytes | NULL | Number of heap bytes released to OS. |
| | etcd_server_leader_changes_seen_total | Server Leader Changes Seen | NULL | The number of leader changes seen. |
| | etcd_debugging_mvcc_put_total | Mvcc Put | NULL | Total number of puts seen by this member. |
| | etcd_process_resident_memory_bytes | Process Resident Memory Bytes | NULL | Resident memory size in bytes. |
| | etcd_debugging_mvcc_slow_watcher_total | Mvcc Slow Watcher | NULL | Total number of unsynced slow watchers. |
| | etcd_debugging_mvcc_index_compaction_pause_duration_milliseconds_bucket | Mvcc Index Compaction Pause Duration | NULL | Bucketed histogram of index compaction pause duration. |
| | etcd_debugging_mvcc_db_compaction_pause_duration_milliseconds_bucket | Mvcc Db Compaction Pause Duration | NULL | Bucketed histogram of db compaction pause duration. |
| | etcd_grpc_proxy_cache_keys_total | Grpc Proxy Cache Keys | NULL | Total number of keys/ranges cached |
| | etcd_go_memstats_alloc_bytes_total_per_sec | Go Memstats Alloc Bytes Total | NULL | Total number of bytes allocated, even if freed per sec |
| | etcd_go_memstats_mcache_inuse_bytes | Go Memstats Mcache Inuse Bytes | NULL | Number of bytes in use by mcache structures. |
| | etcd_debugging_store_reads_total | Store Reads | NULL | Total number of reads action by (get/getRecursive), local to this member. |
| | etcd_go_memstats_heap_objects | Go Memstats Heap Objects | NULL | Number of allocated objects. |
| | etcd_server_has_leader | Server Has Leader | NULL | Whether or not a leader exists. 1 is existence, 0 is not. |
| | etcd_debugging_mvcc_watch_stream_total | Mvcc Watch Stream | NULL | Total number of watch streams. |
| | etcd_go_memstats_sys_bytes | Go Memstats Sys Bytes | NULL | Number of bytes obtained from system. |
| | etcd_network_peer_received_bytes_total_per_sec | Network Peer Received Bytes Total | NULL | The total number of bytes received from peers per sec |
| | etcd_debugging_mvcc_put_total_per_sec | Mvcc Put | NULL | Total number of puts seen by this member per sec |
| | etcd_go_memstats_mspan_inuse_bytes | Go Memstats Mspan Inuse Bytes | NULL | Number of bytes in use by mspan structures. |
| | etcd_debugging_mvcc_range_total | Mvcc Range | NULL | Total number of ranges seen by this member. |
| | etcd_go_memstats_other_sys_bytes | Go Memstats Other Sys Bytes | NULL | Number of bytes used for other system allocations. |
| | etcd_debugging_store_writes_total_per_sec | Store Writes | NULL | Total number of writes (e.g. set/compareAndDelete) seen by this member per sec |
| | etcd_network_client_grpc_received_bytes_total | Network Client Grpc Received Bytes | NULL | The total number of bytes received from grpc clients. |
| | etcd_go_info | Go Info | NULL | Information about the Go runtime. |
| | etcd_debugging_mvcc_slow_watcher_total_per_sec | Mvcc Slow Watcher | NULL | Total number of unsynced slow watchers per sec |
| | etcd_debugging_store_watch_requests_total_per_sec | Store Watch Requests | NULL | Total number of incoming watch requests (new or reestablished) per sec |
| | etcd_debugging_snap_save_marshalling_duration_seconds_count | Snap Save Marshalling Duration | NULL | The marshalling cost distributions of save called by snapshot. |
| | etcd_debugging_snap_save_total_duration_seconds_count | Snap Save Duration | NULL | The total latency distributions of save called by snapshot. |
| | etcd_disk_backend_commit_duration_seconds_count | Disk Backend Commit Duration | NULL | The latency distributions of commit called by backend. |
| | etcd_debugging_store_reads_bytes_total | Store Reads Bytes | NULL | Total number of bytes read out for reads action by (get/getRecursive), local to this member |
| | etcd_debugging_store_reads_bytes_total_per_sec | Store Reads Bytes | NULL | Total number of bytes read out for reads action by (get/getRecursive), local to this member per sec |
| | etcd_debugging_store_writes_bytes_total | Store Writes Bytes | NULL | Total number of bytes written out for writes action by (set/compareAndDelete), local to this member |
| | etcd_debugging_store_writes_bytes_total_per_sec | Store Writes Bytes | NULL | Total number of bytes written out for writes action by (set/compareAndDelete), local to this member per sec |
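
Many etcd metrics appear in pairs: a cumulative `_total` value and a `_per_sec` variant (for example, etcd_debugging_mvcc_put_total and etcd_debugging_mvcc_put_total_per_sec). The document does not describe how the per-second values are derived; a common approach, sketched here as an assumption, is to divide the increase between two samples by the elapsed time and to treat a decrease as a counter reset. The function name and sample values are hypothetical.

```python
def per_second_rate(prev_value, prev_time, curr_value, curr_time):
    """Per-second rate of a cumulative counter between two samples.

    Assumption: a drop in the counter means the process restarted and the
    counter began again from zero, so the current value is the increase.
    """
    elapsed = curr_time - prev_time
    if elapsed <= 0:
        raise ValueError("samples must be in increasing time order")
    delta = curr_value - prev_value
    if delta < 0:
        delta = curr_value  # counter reset since the previous sample
    return delta / elapsed


if __name__ == "__main__":
    # 600 puts over a 60-second polling interval.
    print(per_second_rate(1000, 0.0, 1600, 60.0))
```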

Agent G2 - Linux - Hadoop JobTracker Service Monitors

Description

Monitors Hadoop Job tracker metrics

Prerequisites

NULL

Supported Metric

| Monitor Name | Metric Name | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| Agent G2 - Linux - Hadoop JobTracker Service Monitors | hadoop.jobtracker.reduce.slots.used | Hadoop JobTracker Reduce Slots Used | NULL | The Number of currently occupied/reserved reduce slots |
| | hadoop.jobtracker.rpc.latency | Hadoop JobTracker RPC Latency | NULL | Calculates the average time spent by an RPC request in the queue |
| | hadoop.jobtracker.nodes.alive | Hadoop JobTracker Alive Nodes | NULL | The Total Number of Alive Nodes in the cluster |
| | hadoop.jobtracker.nodes.total | Hadoop JobTracker Total Nodes | NULL | The Total Number of Nodes in the cluster |
| | hadoop.jobtracker.map.slots.used | Hadoop JobTracker Map Slots Used | NULL | The Number of currently occupied/reserved map slots |
| | hadoop.jobtracker.nodes.black.listed | Hadoop JobTracker Black Listed | NULL | The Number of BlackListed trackers in the cluster |
| | hadoop.jobtracker.nodes.gray.listed | Hadoop JobTracker Gray Listed | NULL | The Number of Graylisted trackers in the cluster |
| | hadoop.jobtracker.map.slots.total | Hadoop JobTracker Map Slots | NULL | The Total Number of Map Slots in the cluster |
| | hadoop.jobtracker.jobs | Hadoop JobTracker Total Jobs | NULL | The Total Number of jobs in the cluster |
| | hadoop.jobtracker.reduce.slots.total | Hadoop JobTracker Reduce Slots | NULL | The Total Number of Reduce Slots in the cluster |
| | hadoop.jobtracker.failures | Hadoop JobTracker Failure Nodes | NULL | The Number of Decommissioned trackers in the cluster |
| | hadoop.jobtracker.dir.failures | Hadoop JobTracker Dir Failures | NULL | The Number of Failure directories in the cluster |
| | hadoop.jobtracker.nodes.dead | Hadoop JobTracker Dead Nodes | NULL | The Total Number of Dead Nodes in the Cluster |

Agent G2 - Linux - HAProxy Monitors

Description

Monitors HAProxy application metrics

Prerequisites

NULL

Supported Metric

| Monitor Name | Metric Name | Metric Display Name | Unit | Description |
| --- | --- | --- | --- | --- |
| Agent G2 - Linux - HAProxy Monitors | haproxy.http.errors_3xx | HAProxy 3xx HTTP Error | NULL | Number of http error responses with 3xx code |
| | haproxy.session_current | HAProxy Sessions Active | NULL | Current number of concurrent connections. |
| | haproxy.errors_resp_rate | HAProxy Response Errors | NULL | The rate of response errors. |
| | haproxy.requests_queue | HAProxy Queued Requests | NULL | Number of requests in the server queue. |
| | haproxy.http.errors_1xx | HAProxy 1xx HTTP Error | NULL | Number of http error responses with 1xx code |
| | haproxy.http.errors_5xx | HAProxy 5xx HTTP Error | NULL | Number of http error responses with 5xx code |
| | haproxy.denied_resp_rate | HAProxy Denied Responses | NULL | The rate of denied responses. |
| | haproxy.warning.retr_rate | HAProxy Warn Retries | NULL | The rate of retries (warning). |
| | haproxy.warning.redis_rate | HAProxy Warn Redis patches | NULL | The rate of redispatches (warning). |
| | haproxy.servers_backup | HAProxy Backup Servers | NULL | Number of current backup servers (backend). Validates against total backup servers. |
| | haproxy.mbytes_in_rate | HAProxy Data Received | NULL | The rate at which the data is received per server in MB. |
| | haproxy.http.errors_2xx | HAProxy 2xx HTTP Error | NULL | Number of http error responses with 2xx code |
| | haproxy.http.errors_4xx | HAProxy 4xx HTTP Error | NULL | Number of http error responses with 4xx code |
| | haproxy.errors_con_rate | HAProxy Connection Errors | NULL | The rate of connection errors. |
| | haproxy.requests_rate | HAProxy Requests | NULL | The rate of received HTTP requests. |
| | haproxy.denied_req_rate | HAProxy Denied Requests | NULL | The rate of denied requests. |
| | haproxy.errors_req_rate | HAProxy Request Errors | NULL | The rate of request errors. |
| | haproxy.mbytes_out_rate | HAProxy Data Sent | NULL | The rate at which the data is sent per server in MB. |
| | haproxy.session_rate | HAProxy Sessions | NULL | Number of sessions per second. |
| | haproxy.lastchk_time | HAProxy Last Health Check Time | NULL | Time in ms taken to finish the last health check. |
| | haproxy.servers_active | HAProxy Active Servers | NULL | Number of current active servers (backend). Validates against total active servers. |

Agent G2 - Linux - HAProxy Performance Statistics

Description

Monitors HAProxy stats module

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - HAProxy Performance Statisticshaproxy.denied_resp_rateHAProxy Denied ResponsesNULLThe rate of denied responses.
haproxy.servers_backupHAProxy Backup ServersNULLNumber of current backup servers (backend). Validates against total backup servers.
haproxy.warning.redis_rateHAProxy Warn RedispatchesNULLThe rate of redispatches (warning).
haproxy.warning.retr_rateHAProxy Warn RetriesNULLThe rate of retries (warning).
haproxy.lastchk_timeHAProxy Last Health Check TimeNULLTime in ms taken to finish the last health check.
haproxy.denied_req_rateHAProxy Denied RequestsNULLThe rate of denied requests.
haproxy.http.errors_4xxHAProxy 4xx HTTP ErrorNULLNumber of http error responses with 4xx code
haproxy.requests_rateHAProxy RequestsNULLThe rate of received HTTP requests.
haproxy.errors_resp_rateHAProxy Response ErrorsNULLThe rate of response errors.
haproxy.errors_req_rateHAProxy Request ErrorsNULLThe rate of request errors.
haproxy.http.errors_1xxHAProxy 1xx HTTP ErrorNULLNumber of http error responses with 1xx code
haproxy.servers_activeHAProxy Active ServersNULLNumber of current active servers (backend). Validates against total active servers.
haproxy.errors_con_rateHAProxy Connection ErrorsNULLThe rate of connection errors.
haproxy.mbytes_out_rateHAProxy Data SentNULLThe rate at which the data is sent per server in MB.
haproxy.http.errors_2xxHAProxy 2xx HTTP ErrorNULLNumber of http error responses with 2xx code
haproxy.mbytes_in_rateHAProxy Data ReceivedNULLThe rate at which the data is received per server in MB.
haproxy.http.errors_3xxHAProxy 3xx HTTP ErrorNULLNumber of http error responses with 3xx code
haproxy.http.errors_5xxHAProxy 5xx HTTP ErrorNULLNumber of http error responses with 5xx code
haproxy.session_rateHAProxy SessionsNULLNumber of sessions per second.
haproxy.requests_queueHAProxy Queued RequestsNULLNumber of requests in the server queue.
haproxy.session_currentHAProxy Sessions ActiveNULLCurrent number of concurrent connections.
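
As a rough illustration of where values such as active servers, backup servers, retries, and redispatches can come from, the sketch below parses a small sample shaped like HAProxy's CSV stats export. The column names (scur, econ, eresp, wretr, wredis, act, bck) follow that export format; the sample rows, the function names, and the idea that the agent reads the data this way are assumptions made for illustration only.

```python
import csv
import io

# Hypothetical sample in the shape of HAProxy's CSV stats export,
# trimmed to a handful of columns for illustration.
SAMPLE_STATS = (
    "# pxname,svname,qcur,scur,econ,eresp,wretr,wredis,act,bck\n"
    "web,srv1,0,12,1,0,3,1,1,0\n"
    "web,srv2,0,8,0,0,0,0,1,0\n"
    "web,BACKEND,2,20,1,0,3,1,2,1\n"
)


def parse_stats(csv_text):
    """Return one dict per row, keyed by the column names in the header."""
    # The header line starts with "# ", which is stripped before parsing.
    header_and_rows = csv_text.lstrip("# ")
    return list(csv.DictReader(io.StringIO(header_and_rows)))


def backend_server_counts(rows, proxy_name):
    """Return (active, backup) server counts from the proxy's BACKEND row."""
    for row in rows:
        if row["pxname"] == proxy_name and row["svname"] == "BACKEND":
            return int(row["act"]), int(row["bck"])
    return None


rows = parse_stats(SAMPLE_STATS)
print(backend_server_counts(rows, "web"))
```

The BACKEND row aggregates the servers of a proxy, which is why the sketch reads the active and backup counts from it rather than summing the per-server rows.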

Agent G2 - Linux - HBase Monitors

Description

Monitors HBase application metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - HBase Monitorshbase.dead.region.serversHBase Dead Region ServersNULLThe number of dead region servers.
hbase.live.region.serversHBase Live Region ServersNULLThe number of online region servers.
hbase.average.loadHBase Average LoadNULLAverage number of regions served by each region server.
hbase.cluster.requestsHBase Cluster RequestsNULLThe total number of requests from all region servers to a cluster.

Agent G2 - Linux - HBase Performance Check

Description

Monitors HBase application metrics via MBeans exposed by the JMX Console

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - HBase Performance Checkhbase.live.region.serversHBase Live Region ServersNULLThe number of online region servers.
hbase.dead.region.serversHBase Dead Region ServersNULLThe number of dead region servers.
hbase.cluster.requestsHBase Cluster RequestsNULLThe total number of requests from all region servers to a cluster.
hbase.average.loadHBase Average LoadNULLAverage number of regions served by each region server.

Agent G2 - Linux - HDFS Datanode Template

Description

Monitors HDFS Datanodes

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - HDFS Datanode Templatehdfs.datanode.num_blocks_failed_to_uncacheHDFS Datanode NumBlocksFailedToUncacheNULLThe number of failed blocks to remove from cache.
hdfs.datanode.last_volume_failure_dateHDFS Datanode LastVolumeFailureDateNULLThe date/time of the last volume failure in milliseconds since epoch.
hdfs.datanode.cache_usedHDFS Datanode Cache UsedNULLCache used in bytes.
hdfs.datanode.num_blocks_cachedHDFS Datanode NumBlocksCachedNULLThe number of blocks cached.
hdfs.datanode.cache_capacityHDFS Datanode Cache CapacityNULLCache capacity in bytes.
hdfs.datanode.dfs_remaining_percentHDFS Datanode Dfs Remaining PercentNULLThe remaining disk space left in Percent.
hdfs.datanode.estimated_capacity_lost_totalHDFS Datanode EstimatedCapacityLostTotalNULLThe estimated capacity lost in bytes.
hdfs.datanode.num_blocks_failed_to_cacheHDFS Datanode NumBlocksFailedToCacheNULLThe number of blocks that failed to cache.
hdfs.datanode.process_cpu_loadHDFS Datanode Process CpuLoadNULLThe CPU Load of the Process.
hdfs.datanode.dfs_capacityHDFS Datanode Dfs CapacityNULLDisk capacity in bytes.
hdfs.datanode.num_failed_volumesHDFS Datanode NumFailedVolumesNULLNumber of failed volumes.
hdfs.datanode.dfs_used_percentHDFS Datanode Dfs Used PercentNULLDisk usage in Percent.
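
The percentage metrics above can be converted back to absolute sizes when a byte figure is easier to reason about. The following is a minimal sketch that assumes hdfs.datanode.dfs_capacity is reported in bytes and hdfs.datanode.dfs_remaining_percent as a value between 0 and 100; the function name is hypothetical and not part of the monitor.

```python
def remaining_bytes(dfs_capacity: int, dfs_remaining_percent: float) -> float:
    """Estimate remaining disk space in bytes from capacity and remaining percent."""
    return dfs_capacity * dfs_remaining_percent / 100.0


# Example: a datanode with 1000 bytes of capacity and 25% remaining.
print(remaining_bytes(1000, 25.0))
```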

Agent G2 - Linux - HDFS Namenode Template

Description

Monitors HDFS Namenodes

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - HDFS Namenode Templatehdfs.namenode.capacity_totalHDFS Namenode CapacityTotalNULLTotal disk capacity in bytes.
hdfs.namenode.num_live_data_nodesHDFS Namenode NumLiveDataNodesNULLTotal number of live data nodes.
hdfs.namenode.pending_deletion_blocksHDFS Namenode PendingDeletionBlocksNULLNumber of pending deletion blocks.
hdfs.namenode.files_totalHDFS Namenode FilesTotalNULLTotal number of files.
hdfs.namenode.volume_failures_totalHDFS Namenode VolumeFailuresTotalNULLTotal volume failures.
hdfs.namenode.under_replicated_blocksHDFS Namenode UnderReplicatedBlocksNULLNumber of under replicated blocks.
hdfs.namenode.scheduled_replication_blocksHDFS Namenode ScheduledReplicationBlocksNULLNumber of blocks scheduled for replication.
hdfs.namenode.num_stale_data_nodesHDFS Namenode NumStaleDataNodesNULLNumber of stale data nodes.
hdfs.namenode.nondfs_used_percentHDFS Namenode NonDfsUsedPercentNULLTotal space used by NonDfs in Percentage.
hdfs.namenode.blocks_totalHDFS Namenode BlocksTotalNULLTotal number of blocks.
hdfs.namenode.num_decom_dead_data_nodesHDFS Namenode NumDecomDeadDataNodesNULLNumber of decommissioning dead data nodes.
hdfs.namenode.num_failed_data_nodesHDFS Namenode NumFailedDataNodesNULLTotal number of failed data nodes.
hdfs.namenode.estimated_capacity_lost_totalHDFS Namenode EstimatedCapacityLostTotalNULLEstimated capacity lost in bytes.
hdfs.namenode.num_stale_storagesHDFS Namenode NumStaleStoragesNULLNumber of stale storages.
hdfs.namenode.total_loadHDFS Namenode TotalLoadNULLTotal load on the file system.
hdfs.namenode.missing_blocksHDFS Namenode MissingBlocksNULLNumber of missing blocks.
hdfs.namenode.corrupt_blocksHDFS Namenode CorruptBlocksNULLNumber of corrupt blocks.
hdfs.namenode.num_dead_data_nodesHDFS Namenode NumDeadDataNodesNULLTotal number of dead data nodes.
hdfs.namenode.num_decom_live_data_nodesHDFS Namenode NumDecomLiveDataNodesNULLNumber of decommissioning live data nodes.
hdfs.namenode.max_objectsHDFS Namenode MaxObjectsNULLMaximum number of files HDFS supports.
hdfs.namenode.pending_replication_blocksHDFS Namenode PendingReplicationBlocksNULLNumber of blocks pending replication.
hdfs.namenode.num_decommissioning_data_nodesHDFS Namenode NumDecommissioningDataNodesNULLNumber of decommissioning data nodes.
hdfs.namenode.capacity_used_percentHDFS Namenode CapacityUsedPercentNULLDisk usage in Percent.
hdfs.namenode.capacity_remaining_percentHDFS Namenode CapacityRemainingPercentNULLRemaining disk space left in Percent.

Agent G2 - Linux - IPTables Monitors

Description

Monitoring Template for IP Tables application. Monitors chain bandwidth, close connections, established connections, filter failures, etc.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - IPTables Monitorsiptables.chain_bandwidthIPTables-ChainBandwidthNULLCaptures traffic flowing through IPTables that matches a given chain.
iptables.mangle_rulesIPTables-MangleRulesNULLChecks a given table for a specific number of rules. If the number of rules in that table is less than what is specified in the argument it throws an alert.
iptables.nat_rulesIPTables-NatRulesNULLChecks a given table for a specific number of rules. If the number of rules in that table is less than what is specified in the argument it throws an alert.
iptables.established_connectionsIPTables-ESTABLISHEDConnectionsNULLProvides the number of active ESTABLISHED connections.
iptables.icmp_connectionsIPTables-ICMPConnectionsNULLProvides the number of active ICMP connections.
iptables.udp_connectionsIPTables-UDPConnectionsNULLProvides the number of active UDP connections.
iptables.tcp_connectionsIPTables-TCPConnectionsNULLProvides the number of active TCP connections.
iptables.syn_connectionsIPTables-SYNConnectionsNULLProvides the number of active SYN connections.
iptables.close_connectionsIPTables-CLOSEConnectionsNULLProvides the number of active CLOSE connections.
iptables.filter_rulesIPTables-FilterRulesNULLChecks a given table for a specific number of rules. If the number of rules in that table is less than what is specified in the argument it throws an alert.
iptables.time_wait_connectionsIPTables-TIME_WAITConnectionsNULLProvides the number of active TIME_WAIT connections.
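
The rule-count checks described for iptables.filter_rules, iptables.nat_rules, and iptables.mangle_rules can be sketched as follows. This is an illustration under stated assumptions, not the monitor's implementation: it assumes the rules are available as text in the format printed by `iptables -t <table> -S`, where appended rules start with `-A`, and the function names and sample listing are hypothetical.

```python
# Hypothetical listing in the format printed by `iptables -t filter -S`.
SAMPLE_LISTING = (
    "-P INPUT ACCEPT\n"
    "-P FORWARD ACCEPT\n"
    "-P OUTPUT ACCEPT\n"
    "-A INPUT -p tcp --dport 22 -j ACCEPT\n"
    "-A INPUT -j DROP\n"
)


def count_rules(rules_listing: str) -> int:
    """Count appended rules, ignoring policy (-P) and chain (-N) lines."""
    return sum(1 for line in rules_listing.splitlines() if line.startswith("-A "))


def rules_alert(rules_listing: str, expected_minimum: int) -> bool:
    """Return True when the table holds fewer rules than the expected minimum."""
    return count_rules(rules_listing) < expected_minimum


print(count_rules(SAMPLE_LISTING))
print(rules_alert(SAMPLE_LISTING, 3))
```

Whether the monitor counts policy lines as rules is not stated in the table, so the threshold passed as the argument should be chosen with that in mind.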

Agent G2 - Linux - K3S ApiServer

Description

Template for monitoring K3S through Kubernetes API server

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - K3S ApiServerapiserver.dropped.requests.total.countKube apiserver Dropped Requests Total CountNULLMonotonic count of requests dropped with Try-again-later response.
apiserver.http.requests.totalKube apiserver HTTP Requests TotalNULLTotal number of HTTP requests made.
apiserver.audit.event.totalKube apiserver Audit Event TotalNULLCounter of audit events generated and sent to the audit backend.
apiserver.authenticated.user.requestsKube apiserver Authenticated User RequestsNULLCounter of authenticated requests broken out by username.
apiserver.request.duration.seconds.bucketKube apiserver Request Duration Seconds BuckethistogramResponse latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope, and component.
apiserver.go.threads.totalKube apiserver Go Threads TotalNULLNumber of OS threads created.
apiserver.inflight.requestsKube apiserver Inflight RequestsNULLMaximal number of currently used inflight request limit of this apiserver per request kind in the last second.
apiserver.request.countKube apiserver Request CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.http.requests.total.countKube apiserver HTTP Requests Total CountNULLTotal number of HTTP requests made.
apiserver.go.goroutinesKube apiserver GoroutinesNULLNumber of goroutines that currently exist.
apiserver.request.count.countKube apiserver Request Count CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.rest.client.requests.total.countKube apiserver Rest Client Requests Total CountNULLNumber of HTTP requests, partitioned by status code, method, and host.
apiserver.APIServiceRegistrationController.depthKube apiserver APIService Registration Controller DepthNULLCurrent depth of workqueue: APIServiceRegistrationController.
apiserver.etcd.object.countsKube apiserver ETCD Object CountsNULLNumber of stored objects at the time of the last check split by kind.
apiserver.authenticated.user.requests.countKube apiserver Authenticated User Requests CountNULLCounter of authenticated requests broken out by username.

Agent G2 - Linux - K3S CoreDNS

Description

Kubernetes CoreDNS

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - K3S CoreDNScoredns.response_size.bytes.sumResponse Size Bytes SumNULLSize of the returned response in bytes.
coredns.query.countQuery countNULLTotal query count.
coredns.request_duration.seconds.sumRequest Duration Seconds SumNULLDuration to process each query.
coredns.panicsTotal PanicsNULLTotal number of panics.
coredns.request_duration.seconds.countRequest Duration Seconds CountNULLDuration per upstream interaction.
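
If coredns.request_duration.seconds.sum and coredns.request_duration.seconds.count follow the usual Prometheus histogram convention (a running total of observed seconds and a running number of observations), a mean request duration over an interval can be derived from two samples. The sketch below assumes that convention; the function name and the sample values are hypothetical.

```python
def mean_request_duration(prev_sum, prev_count, curr_sum, curr_count):
    """Mean seconds per request between two samples of the sum and count series."""
    requests = curr_count - prev_count
    if requests <= 0:
        # No new requests (or a counter reset) in the interval.
        return 0.0
    return (curr_sum - prev_sum) / requests


# Example: 30 new requests took 6 seconds in total between two samples.
print(mean_request_duration(10.0, 100, 16.0, 130))
```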

Agent G2 - Linux - K3S Kube State

Description

Template for monitoring K3S using Kube State

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - K3S Kube Statekubernetes_state.container.cpu_limitContainer Cpu LimitNULLThe limit on cpu cores to be used by a container
kubernetes_state.resourcequota.pods.usedResourcequota Pods UsedNULLObserved number of pods used for a resource quota
kubernetes_state.replicaset.replicas_desiredReplicaset Replicas DesiredNULLNumber of desired pods for a ReplicaSet
kubernetes_state.node.cpu_capacityNode Cpu CapacityNULLThe total CPU resources of the node.
kubernetes_state.deployment.replicas_desiredDeployment Replicas DesiredNULLThe number of desired replicas per deployment.
kubernetes_state.resourcequota.services.loadbalancers.limitResourcequota Services Loadbalancers LimitNULLHard limit of the number of loadbalancers for a resource quota
kubernetes_state.node.memory_capacityNode Memory CapacityNULLThe total memory resources of the node.
kubernetes_state.daemonset.readyDaemonset ReadyNULLThe number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready.
kubernetes_state.replicaset.replicasReplicaset ReplicasNULLThe number of replicas per ReplicaSet.
kubernetes_state.resourcequota.services.nodeports.limitResourcequota Services Nodeports LimitNULLHard limit of the number of node ports for a resource quota
kubernetes_state.container.cpu_requestedContainer Cpu RequestedNULLThe number of requested cpu cores by a container
kubernetes_state.resourcequota.requests.cpu.limitResourcequota Requests Cpu LimitNULLHard limit on the total of CPU core requested for a resource quota
kubernetes_state.resourcequota.requests.storage.limitResourcequota Requests Storage LimitNULLHard limit on the total of storage bytes requested for a resource quota
kubernetes_state.resourcequota.limits.memory.usedResourcequota Limits Memory UsedNULLObserved sum of limits for memory bytes for a resource quota
kubernetes_state.node.pods_capacityNode Pods CapacityNULLThe total pod resources of the node.
kubernetes_state.deployment.replicasDeployment ReplicasNULLThe number of replicas per deployment.
kubernetes_state.deployment.replicas_availableDeployment Replicas AvailableNULLThe number of available replicas per deployment.
kubernetes_state.resourcequota.requests.memory.usedResourcequota Requests Memory UsedNULLObserved sum of memory bytes requested for a resource quota
kubernetes_state.resourcequota.persistentvolumeclaims.limitResourcequota Persistentvolumeclaims LimitNULLHard limit of the number of PVC for a resource quota
kubernetes_state.container.memory_requestedContainer Memory RequestedNULLThe number of requested memory bytes by a container
kubernetes_state.resourcequota.services.usedResourcequota Services UsedNULLObserved number of services used for a resource quota
kubernetes_state.resourcequota.limits.memory.limitResourcequota Limits Memory LimitNULLHard limit on the sum of memory bytes limits for a resource quota
kubernetes_state.deployment.replicas_unavailableDeployment Replicas UnavailableNULLThe number of unavailable replicas per deployment.
kubernetes_state.node.memory_allocatableNode Memory AllocatableNULLThe memory resources of a node that are available for scheduling.
kubernetes_state.resourcequota.pods.limitResourcequota Pods LimitNULLHard limit of the number of pods for a resource quota
kubernetes_state.container.memory_limitContainer Memory LimitNULLThe limit on memory to be used by a container
kubernetes_state.deployment.rollingupdate.max_unavailableDeployment Rollingupdate Max UnavailableNULLMaximum number of unavailable replicas during a rolling update of a deployment.
kubernetes_state.resourcequota.persistentvolumeclaims.usedResourcequota Persistentvolumeclaims UsedNULLObserved number of persistent volume claims used for a resource quota
kubernetes_state.daemonset.misscheduledDaemonset MisscheduledNULLThe number of nodes that are running a daemon pod but are not supposed to.
kubernetes_state.resourcequota.services.loadbalancers.usedResourcequota Services Loadbalancers UsedNULLObserved number of loadbalancers used for a resource quota
kubernetes_state.resourcequota.requests.cpu.usedResourcequota Requests Cpu UsedNULLObserved sum of CPU cores requested for a resource quota
kubernetes_state.resourcequota.services.nodeports.usedResourcequota Services Nodeports UsedNULLObserved number of node ports used for a resource quota
kubernetes_state.resourcequota.requests.storage.usedResourcequota Requests Storage UsedNULLObserved sum of storage bytes requested for a resource quota
kubernetes_state.node.cpu_allocatableNode Cpu AllocatableNULLThe CPU resources of a node that are available for scheduling.
kubernetes_state.resourcequota.requests.memory.limitResourcequota Requests Memory LimitNULLHard limit on the total of memory bytes requested for a resource quota
kubernetes_state.container.restartsContainer RestartsNULLThe number of restarts per container
kubernetes_state.replicaset.fully_labeled_replicasReplicaset Fully Labeled ReplicasNULLThe number of fully labeled replicas per ReplicaSet.
kubernetes_state.deployment.replicas_updatedDeployment Replicas UpdatedNULLThe number of updated replicas per deployment.
kubernetes_state.resourcequota.limits.cpu.limitResourcequota Limits Cpu LimitNULLHard limit on the sum of CPU core limits for a resource quota
kubernetes_state.resourcequota.services.limitResourcequota Services LimitNULLHard limit of the number of services for a resource quota
kubernetes_state.resourcequota.limits.cpu.usedResourcequota Limits Cpu UsedNULLObserved sum of limits for CPU cores for a resource quota
kubernetes_state.replicaset.replicas_readyReplicaset Replicas ReadyNULLThe number of ready replicas per ReplicaSet
kubernetes_state.node.pods_allocatableNode Pods AllocatableNULLThe pod resources of a node that are available for scheduling.
kubernetes_state.daemonset.desiredDaemonset DesiredNULLThe number of nodes that should be running the daemon pod.
kubernetes_state.daemonset.scheduledDaemonset ScheduledNULLThe number of nodes that are running at least one daemon pod and are supposed to.

Agent G2 - Linux - K3S Master Agent

Description

Agent G2 - Linux - K3S Master Agent

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - K3S Master Agent - K8scorednscoredns.request_duration.seconds.sumRequest Duration Seconds SumNULLDuration to process each query.
coredns.panicsTotal PanicsNULLTotal number of panics.
coredns.query.countQuery countNULLTotal query count.
coredns.response_size.bytes.sumResponse Size Bytes SumNULLSize of the returned response in bytes.
coredns.request_duration.seconds.countRequest Duration Seconds CountNULLDuration per upstream interaction
Agent G2 - Linux - K3S Master Agent - K8sApiServerapiserver.http.requests.totalKube apiserver HTTP Requests TotalNULLTotal number of HTTP requests made.
apiserver.request.duration.seconds.bucketKube apiserver Request Duration Seconds BuckethistogramResponse latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope and component.
apiserver.authenticated.user.requestsKube apiserver Authenticated User RequestsNULLCounter of authenticated requests broken out by username.
apiserver.rest.client.requests.totalKube apiserver Rest Client Requests TotalNULLNumber of HTTP requests, partitioned by status code, method, and host.
apiserver.dropped.requests.totalKube apiserver Dropped Requests TotalNULLAccumulated number of requests dropped with Try-again-later response
apiserver.request.count.countKube apiserver Request Count CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.go.goroutinesKube apiserver GoroutinesNULLNumber of goroutines that currently exist.
apiserver.request.countKube apiserver Request CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.etcd.object.countsKube apiserver ETCD Object CountsNULLNumber of stored objects at the time of last check split by kind.
apiserver.APIServiceRegistrationController.depthKube apiserver APIService Registration Controller DepthNULLCurrent depth of workqueue: APIServiceRegistrationController
apiserver.audit.event.totalKube apiserver Audit Event TotalNULLCounter of audit events generated and sent to the audit backend.
apiserver.go.threads.totalKube apiserver Go Threads TotalNULLNumber of OS threads created.
apiserver.inflight.requestsKube apiserver Inflight RequestsNULLMaximal number of currently used inflight request limit of this apiserver per request kind in last second.
apiserver.rest.client.requests.total.countKube apiserver Rest Client Requests Total CountNULLNumber of HTTP requests, partitioned by status code, method, and host.
apiserver.dropped.requests.total.countKube apiserver Dropped Requests Total CountNULLMonotonic count of requests dropped with Try-again-later response
apiserver.authenticated.user.requests.countKube apiserver Authenticated User Requests CountNULLCounter of authenticated requests broken out by username.
apiserver.http.requests.total.countKube apiserver HTTP Requests Total CountNULLTotal number of HTTP requests made.

Agent G2 - Linux - Kafka Consumer Monitors

Description

Agent G2 - Linux - Kafka Consumer Monitors

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kafka Consumer Monitorskafka.broker.offsetsKafka Broker OffsetsNULLBroker offsets
kafka.consumer.offsetsKafka Consumer OffsetsNULLConsumer offsets
kafka.consumer.lagKafka Consumer Lag%Lag in the consumer data
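
A common way to relate the three metrics above is to treat consumer lag as the distance between the broker offset and the consumer offset for a partition. The sketch below assumes that relationship; the table does not state how the monitor computes kafka.consumer.lag, and the function name is hypothetical.

```python
def consumer_lag(broker_offset: int, consumer_offset: int) -> int:
    """Messages the consumer still has to read, never reported below zero."""
    return max(broker_offset - consumer_offset, 0)


# Example: the broker is at offset 1500 and the consumer has committed 1420.
print(consumer_lag(1500, 1420))
```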

Agent G2 - Linux - Kafka Monitors

Description

Monitors performance related metrics via MBeans exposed by the JMX console

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kafka Monitorskafka.jvm.threadsKafka JVM ThreadsNULLNumber of threads.
kafka.metrics.fetch.requestsKafka Fetch RequestsNULLRequest rate
kafka.jvm.uptimeKafka UptimeNULLUptime of the server
kafka.metrics.produce_remote_timeKafka Producer Remote TimemsTime the request waits for the follower
kafka.metrics.offset_commit_resp_queue_timeKafka Offset Commit Response Queue TimemsTime the request waits in the response queue
kafka.metrics.controlled_shutdown.requestsKafka Controlled Shutdown RequestsNULLRequest rate
kafka.producer.requests_delayedKafka Producer Delayed RequestsNULLRequests delayed in the producer purgatory
kafka.fetch.requests_waitingKafka Fetch Purgatory SizeNULLRequests waiting in the fetch purgatory
kafka.metrics.fetch_follower.resp_queue_timeKafka Fetch Follower Response Queue TimemsTime the request waits in the response queue
kafka.metrics.controlled_shutdown.total_timeKafka Controlled Shutdown Total TimemsRequest total time
kafka.channel.queue_size_requestKafka Request Queue SizeNULL
kafka.metrics.controlled_shutdown.resp_send_timeKafka Controlled Shutdown Response Send TimemsTime to send the response
kafka.controller.active_controller_countKafka Active Controller CountNULLWhether the controller is active on the broker
kafka.metrics.leader_isr.local_timeKafka Leader And Isr Local TimemsTime the request is being processed at the leader
kafka.metrics.offset_commit.remote_timeKafka Offset Commit Remote TimemsTime the request waits for the follower
kafka.metrics.fetch_follower.local_timeKafka Fetch Follower Local TimemsTime the request is being processed at the leader
kafka.channel.queue_size_responseKafka Response Queue SizeNULL
kafka.metrics.metadata.req_queue_timeKafka Metadata Request Queue TimemsTime the request waits in the request queue
kafka.metrics.controlled_shutdown.req_queue_timeKafka Controlled Shutdown Request Queue TimemsTime the request waits in the request queue
kafka.metrics.leader_isr.remote_timeKafka Leader And Isr Remote TimemsTime the request waits for the follower
kafka.metrics.update_metadata.requestsKafka Update Metadata RequestsNULLRequest rate
kafka.metrics.fetch_consumer.resp_send_timeKafka Fetch Consumer Response Send TimemsTime to send the response
kafka.metrics.update_metadata.remote_timeKafka Update Metadata Remote TimemsTime the request waits for the follower
kafka.producer.requests_waitingKafka Producer Purgatory SizeNULLRequests waiting in the producer purgatory
kafka.metrics.fetch_consumer.total_timeKafka Fetch Consumer Total TimemsRequest total time
kafka.metrics.offsets.req_queue_timeKafka Offsets Request Queue TimemsTime the request waits in the request queue
kafka.metrics.fetch_consumer.req_queue_timeKafka Fetch Consumer Request Queue TimemsTime the request waits in the request queue
kafka.metrics.offsets.resp_queue_timeKafka Offsets Response Queue TimemsTime the request waits in the response queue
kafka.fetch.requests_delayedKafka Fetch Delayed RequestsNULLRequests delayed in the fetch purgatory
kafka.metrics.stop_replica_total_timeKafka Stop Replica Total TimemsRequest total time
kafka.log.flush_rateKafka LogFlush Rate And TimeNULLLog flush rate and time

Agent G2 - Linux - Kong Monitoring - V2

Description

Monitors Kong application metrics such as Connections Accepted, Connections Active, Connections Handled, Connections Reading, Connections Waiting, Connections Writing, Connections Total, Database Reachable, Db Entities Total, Db Entity Count Errors, Enterprise License Errors, Memory Lua Shared Dict Bytes, Memory Lua Shared Dict Total Bytes, Memory Workers Lua Vms Bytes, Nginx Http Current Connections, Nginx Metric Errors Total, and Nginx Timers.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kong Monitoring - V2kong_nginx_timersKong Nginx TimersNULLNumber of nginx timers
kong_db_entity_count_errorsKong Db Entity Count ErrorsNULLErrors during entity count collection
kong_memory_lua_shared_dict_bytesKong Memory Lua Shared Dict BytesbytesAllocated slabs in bytes in a shared_dict
kong_enterprise_license_errorsKong Enterprise License ErrorsNULLErrors when collecting license info
kong_memory_workers_lua_vms_bytesKong Memory Workers Lua Vms BytesbytesAllocated bytes in worker Lua VM
kong_connections_totalKong Connections TotalRequestsTotal number of client requests.
kong_connections_handledConnections HandledNULLTotal number of handled connections. (Same as accepts unless resource limits were reached).
kong_memory_lua_shared_dict_total_bytesKong Memory Lua Shared Dict Total BytesbytesTotal capacity in bytes of a shared_dict
kong_nginx_http_current_connectionsKong Nginx Http Current ConnectionsConnectionsNumber of HTTP connections
kong_db_entities_totalKong Db Entities TotalNULLTotal number of Kong db entities
kong_nginx_metric_errors_totalKong Nginx Metric Errors TotalNULLNumber of nginx-lua-prometheus errors
kong_connections_readingConnections ReadingNULLCurrent number of connections where Kong is reading the request header.
kong_connections_activeConnections ActiveNULLCurrent number of active client connections including Waiting connections.
kong_connections_acceptedConnections AcceptedNULLTotal number of accepted client connections.
kong_connections_waitingConnections WaitingNULLCurrent number of idle client connections waiting for a request.
kong_connections_writingConnections WritingNULLCurrent number of connections where nginx is writing the response back to the client.

Agent G2 - Linux - Kubernetes ApiServer

Description

Template for monitoring Kubernetes through Kubernetes API server

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes ApiServerapiserver.authenticated.user.requests.countKube apiserver Authenticated User Requests CountNULLCounter of authenticated requests broken out by username.
apiserver.go.goroutinesKube apiserver GoroutinesNULLNumber of goroutines that currently exist.
apiserver.audit.event.totalKube apiserver Audit Event TotalNULLCounter of audit events generated and sent to the audit backend.
apiserver.inflight.requestsKube apiserver Inflight RequestsNULLMaximal number of currently used inflight request limit of this apiserver per request kind in last second.
apiserver.http.requests.totalKube apiserver HTTP Requests TotalNULLTotal number of HTTP requests made.
apiserver.rest.client.requests.total.countKube apiserver Rest Client Requests Total CountNULLNumber of HTTP requests, partitioned by status code, method, and host.
apiserver.dropped.requests.total.countKube apiserver Dropped Requests Total CountNULLMonotonic count of requests dropped with Try-again-later response
apiserver.rest.client.requests.totalKube apiserver Rest Client Requests TotalNULLNumber of HTTP requests, partitioned by status code, method, and host.
apiserver.etcd.object.countsKube apiserver ETCD Object CountsNULLNumber of stored objects at the time of last check split by kind.
apiserver.request.duration.seconds.bucketKube apiserver Request Duration Seconds BuckethistogramResponse latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope and component.
apiserver.request.count.countKube apiserver Request Count CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.APIServiceRegistrationController.depthKube apiserver APIService Registration Controller DepthNULLCurrent depth of workqueue: APIServiceRegistrationController
apiserver.authenticated.user.requestsKube apiserver Authenticated User RequestsNULLCounter of authenticated requests broken out by username.
apiserver.request.countKube apiserver Request CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.dropped.requests.totalKube apiserver Dropped Requests TotalNULLAccumulated number of requests dropped with Try-again-later response
apiserver.go.threads.totalKube apiserver Go Threads TotalNULLNumber of OS threads created.
apiserver.http.requests.total.countKube apiserver HTTP Requests Total CountNULLTotal number of HTTP requests made.

Agent G2 - Linux - Kubernetes Controller

Description

Template for monitoring the default Kubernetes Controller

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes Controllercontroller.go.goroutinesKube Controller Go GoroutinesNULLNumber of goroutines that currently exist.
controller.workqueue.nodes.evictionsKube Controller Node Collector Evictions NumberNULLNumber of Node evictions that happened since current instance of NodeController started.
controller.workqueue.work_longest_durationKube Controller Workqueue Longest Running Processor SecondsNULLHow many seconds has the longest running processor for workqueue been running.
controller.workqueue.work_duration.sumKube Controller Workqueue Work Duration Seconds SumNULLHow long in seconds processing an item from workqueue takes.
controller.rate_limiter.useKube Controller Node Lifecycle Controller Rate Limiter UseNULLA metric measuring the saturation of the rate limiter for node_lifecycle_controller.
controller.workqueue.depthKube Controller Workqueue DepthNULLCurrent depth of workqueue.
controller.process.open_fdsKube Controller Process Open FdsNULLNumber of open file descriptors.
controller.workqueue.nodes.countKube Controller Registered NodesNULLNumber of registered Nodes per zones.
controller.workqueue.queue_duration.countKube Controller Workqueue Queue Duration Seconds CountNULLTotal time in seconds an item stays in the workqueue before being requested.
controller.workqueue.nodes.unhealthyKube Controller Node Collector Unhealthy Nodes in ZoneNULLNumber of not Ready Nodes per zones.
controller.process.max_fdsKube Controller Process Max FdsNULLMaximum number of open file descriptors.
controller.workqueue.retriesKube Controller Workqueue Retries TotalNULLTotal number of retries handled by workqueue.
controller.workqueue.work_duration.countKube Controller Workqueue Work Duration Seconds CountNULLTotal time in seconds processing an item from workqueue takes.
controller.workqueue.addsKube Controller Workqueue Adds TotalNULLTotal number of adds handled by workqueue.
controller.workqueue.queue_duration.sumKube Controller Workqueue Queue Duration Seconds SumNULLHow long in seconds an item stays in workqueue before being requested.
controller.workqueue.work_unfinished_durationKube Controller Workqueue Unfinished Work SecondsNULLHow many seconds of work has been done that is in progress and hasn't been observed by work_duration. Large values indicate stuck threads.
controller.threadsKube Controller Os ThreadsNULLNumber of OS threads created.

Agent G2 - Linux - Kubernetes CoreDNS Monitoring

Description

Kubernetes CoreDNS

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes CoreDNS Monitoringcoredns.request_duration.seconds.countRequest Duration Seconds CountNULLDuration per upstream interaction
coredns.request_duration.seconds.sumRequest Duration Seconds SumNULLDuration to process each query.
coredns.response_size.bytes.sumResponse Size Bytes SumNULLSize of the returned response in bytes.
coredns.panicsTotal PanicsNULLTotal number of panics.
coredns.query.countQuery countNULLTotal query count.

Agent G2 - Linux - Kubernetes DNS Monitoring

Description

Agent G2 - Linux - Kubernetes DNS Monitoring

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes DNS Monitoringkubedns.request_duration.seconds.sumRequest Duration Seconds SumNULLTime (in seconds) each request took to resolve.
kubedns.response_size.bytes.countResponse Size Bytes CountNULLNumber of responses on which the kubedns.response_size.bytes.sum metric is evaluated.
kubedns.request_duration.seconds.countRequest Duration Seconds CountNULLNumber of requests on which the kubedns.request_duration.seconds.sum metric is evaluated.
kubedns.error_countError CountNULLNumber of DNS requests resulting in an error.
kubedns.request_countRequest CountNULLTotal number of DNS requests made.
kubedns.response_size.bytes.sumResponse Size Bytes SumNULLSize of the returned response in bytes.
kubedns.cachemiss_countCachemiss CountNULLNumber of DNS cache misses (from the start of the process).
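
The paired .sum and .count metrics in the table above can be combined into a mean value per request. The following is a minimal sketch, not part of the template: it assumes both metrics are cumulative since process start and that two consecutive samples are already available as plain numbers. The function name and the sample values are hypothetical.

```python
def mean_per_event(sum_prev, sum_curr, count_prev, count_curr):
    """Mean value per event between two polls of a cumulative sum/count pair.

    Returns None when no new events were counted or a counter was reset
    (assumption: a reset shows up as a value that went backwards).
    """
    delta_count = count_curr - count_prev
    delta_sum = sum_curr - sum_prev
    if delta_count <= 0 or delta_sum < 0:
        return None
    return delta_sum / delta_count


# Hypothetical samples of kubedns.request_duration.seconds.sum and .count
avg_seconds = mean_per_event(12.0, 12.5, 4000, 4100)
print(avg_seconds)
```

The same arithmetic would apply to kubedns.response_size.bytes.sum and kubedns.response_size.bytes.count, giving a mean response size in bytes.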

Agent G2 - Linux - Kubernetes Kubelet

Description

Template for monitoring Kubernetes metrics from Kubelet for each Node

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes Kubeletkube_memory_requestsMemory RequestsNULLThe requested memory
kube_cpu_cfs_periodsCpu Cfs PeriodsNULLNumber of elapsed enforcement period intervals
kube_kubelet_volume_stats_inodes_usedKubelet Volume Stats Inodes UsedNULLThe number of used inodes in the volume
kube_node_memory_usageNode Memory UsageNULLMemory usage of node (Plotted in Megabytes)
kube_memory_sw_limitMemory Sw LimitNULLMemory swap limit for the container.
kube_io_write_bytesIo Write BytesNULLThe amount of bytes written to the disk
kube_node_cpu_allocatableNode Cpu AllocatableNULLCpu allocatable of node
kube_cpu_cfs_throttled_periodsCpu Cfs Throttled PeriodsNULLNumber of throttled period intervals
kube_memory_swapMemory SwapNULLContainer swap usage in bytes.
kube_kubelet_container_log_filesystem_used_bytesKubelet Container Log Filesystem Used BytesNULLBytes used by the container's logs on the filesystem (requires kubernetes 1.14+)
kube_cpu_usage_totalCpu Usage TotalNULLCpu time consumed in seconds.
kube_network_rx_bytesNetwork Rx BytesNULLThe amount of bytes per second received
kube_node_memory_capacityNode Memory CapacityNULLMemory capacity of node (Plotted in Megabytes)
kube_containers_restartsContainers RestartsNULLThe number of times the container has been restarted
kube_node_memory_allocatableNode Memory AllocatableNULLMemory allocatable of node
kube_memory_limitsMemory LimitsNULLMemory limit for the container.
kube_node_cpu_usageNode Cpu UsageNULLCpu usage of node (Plotted in Millicores)
kube_filesystem_usage_pctFilesystem Usage PctNULLNumber of megabytes that can be consumed by the container on this filesystem.
kube_kubelet_volume_stats_capacity_bytesKubelet Volume Stats Capacity BytesNULLThe capacity in bytes of the volume
kube_network_tx_bytesNetwork Tx BytesNULLThe amount of bytes per second transmitted
kube_runtime_memory_rssRuntime Memory RssNULLSize of runtime RSS in megabytes
kube_kubelet_memory_rssKubelet Memory RssNULLSize of kubelet RSS in megabytes
kube_cpu_cfs_throttled_secondsCpu Cfs Throttled SecondsNULLTotal time duration the container has been throttled
kube_runtime_cpu_usageRuntime Cpu UsageNULLThe number of cores used by the runtime
kube_kubelet_cpu_usageKubelet Cpu UsageNULLThe number of cores used by kubelet
kube_network_rx_droppedNetwork Rx DroppedNULLThe amount of rx packets dropped per second
kube_io_read_bytesIo Read BytesNULLThe amount of bytes read from the disk
kube_pods_runningPods RunningNULLThe number of running pods
kube_filesystem_usageFilesystem UsageNULLNumber of megabytes that are consumed by the container on this filesystem.
kube_node_ephemeral_storage_capacityNode Ephemeral Storage CapacityMBEphemeral storage capacity of node
kube_network_rx_errorsNetwork Rx ErrorsNULLThe amount of rx errors per second
kube_kubelet_evictionsKubelet EvictionsNULLThe number of pods that have been evicted from the kubelet (ALPHA in kubernetes v1.16)
kube_rest_client_latencyRest Client LatencyNULLAvg Request latency in seconds. Broken down by verb and URL since last poll.
kube_network_tx_droppedNetwork Tx DroppedNULLThe amount of tx packets dropped per second
kube_apiserver_certificate_expirationApiserver Certificate ExpirationNULLAvg Distribution of the remaining lifetime on the certificate used to authenticate a request since last poll.
kube_kubelet_volume_stats_available_bytesKubelet Volume Stats Available BytesNULLThe number of available bytes in the volume
kube_cpu_requestsCpu RequestsNULLThe requested cpu cores
kube_cpu_user_totalCpu User TotalNULLUser cpu time consumed in seconds.
kube_kubelet_volume_stats_inodes_freeKubelet Volume Stats Inodes FreeNULLThe number of free inodes in the volume
kube_kubelet_runtime_errorsKubelet Runtime ErrorsNULLThe number of runtime operations errors
kube_memory_working_setMemory Working SetNULLCurrent working set in megabytes - this is what the OOM killer is watching for
kube_node_cpu_capacityNode Cpu CapacityNULLCpu capacity of Node (Plotted in Millicores)
kube_memory_usageMemory UsageNULLCurrent memory usage in bytes including all memory regardless of when it was accessed
kube_kubelet_volume_stats_inodesKubelet Volume Stats InodesNULLThe maximum number of inodes in the volume
kube_memory_rssMemory RssNULLSize of RSS in bytes
kube_node_ephemeral_storage_allocatableNode Ephemeral Storage AllocatableMBEphemeral storage allocatable of node
kube_network_tx_errorsNetwork Tx ErrorsNULLThe amount of tx errors per second
kube_node_memory_usage_percentageNode Memory Usage PercentageNULLMemory usage percentage of node
kube_kubelet_runtime_operationsKubelet Runtime OperationsNULLThe number of runtime operations
kube_cpu_limitsCpu LimitsNULLThe limit of cpu cores set
kube_rest_client_requestsRest Client RequestsNULLThe number of HTTP requests
kube_memory_cacheMemory CacheNULLNumber of bytes of page cache memory.
kube_kubelet_volume_stats_used_bytesKubelet Volume Stats Used BytesNULLThe number of used bytes in the volume
kube_containers_runningContainers RunningNULLThe number of running containers
kube_cpu_system_totalCpu System TotalNULLSystem cpu time consumed in seconds.
kube_ephemeral_storage_usageEphemeral Storage UsageNULLEphemeral storage usage of the POD
kube_kubelet_network_plugin_latencyKubelet Network Plugin LatencyNULLAvg Latency in seconds of network plugin operations. Broken down by operation type since last poll.
kube_cpu_load_10s_avgCpu Load 10S AvgNULLContainer cpu load average over the last 10 seconds
kube_node_cpu_usage_percentageNode Cpu Usage PercentageNULLCpu usage percentage of node
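
One common way to read kube_cpu_cfs_throttled_periods alongside kube_cpu_cfs_periods is as a throttling ratio. The sketch below is illustrative only: it assumes both values cover the same polling interval, and the function name and sample numbers are hypothetical rather than anything the template provides.

```python
def throttled_ratio(throttled_periods, total_periods):
    """Fraction of CFS enforcement periods in which the container was throttled.

    Returns 0.0 when no enforcement periods elapsed in the interval.
    """
    if total_periods <= 0:
        return 0.0
    return throttled_periods / total_periods


# Hypothetical interval: 25 throttled periods out of 200 elapsed periods
ratio = throttled_ratio(25, 200)
print(f"{ratio:.1%} of periods throttled")
```

A ratio that stays high over many intervals suggests the container's CPU limit (kube_cpu_limits) is being reached regularly.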

Agent G2 - Linux - Kubernetes Master Agent

Description

Agent G2 - Linux - Kubernetes Master Agent

Prerequisites

NULL

Supported Metrics

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes Master Agent - K8sSchedulerscheduler.binding.duration.secondsKube Scheduler Binding Duration Seconds SumNULLBinding duration in seconds sum
scheduler.threadsKube Scheduler OS ThreadsNULLNumber of OS threads created
scheduler.gc_duration_seconds.sumKube Scheduler Go GC Duration Seconds SumNULLA summary of the GC invocation durations
scheduler.gc_duration_seconds.countKube Scheduler Go GC Duration Seconds CountNULLA summary of the GC invocation durations
scheduler.binding.latency.sumKube Scheduler Binding Latency Microseconds SumNULLBinding latency in microseconds sum
scheduler.scheduling.algorithm.predicate_duration.countKube Scheduler Scheduling Algorithm Predicate Evaluation CountNULLScheduling algorithm predicate evaluation duration
scheduler.client.http.requests_duration.countKube Scheduler Rest Client Request Latency Seconds CountNULLTotal request latency in seconds. Broken down by verb and URL
scheduler.scheduling.scheduling_latency.countKube Scheduler Scheduling Latency Seconds CountNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.volume_scheduling_duration.countKube Scheduler Volume Scheduling Duration Seconds CountNULLVolume scheduling stage latency count
scheduler.scheduling.scheduling_duration.quantileKube Scheduler Scheduling Duration SecondsNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.scheduling.algorithm_latency.sumKube Scheduler Scheduling Algorithm Latency Microseconds SumNULLScheduling algorithm latency in microseconds sum
scheduler.scheduling.algorithm.preemption_duration.countKube Scheduler Scheduling Algorithm Preemption Evaluation CountNULLScheduling algorithm preemption evaluation duration
scheduler.pod_preemption.attemptsKube Scheduler Total Preemption AttemptsNULLTotal preemption attempts in the cluster till now
scheduler.schedule_attempts.totalKube Scheduler Schedule Attempts TotalNULLNumber of attempts to schedule pods, by the result. 'unschedulable' means a pod could not be scheduled, while 'error' means an internal scheduler problem
scheduler.scheduling.algorithm.priority_duration.countKube Scheduler Scheduling Algorithm Priority Evaluation CountNULLScheduling algorithm priority evaluation duration
scheduler.scheduling.algorithm.priority_duration.sumKube Scheduler Scheduling Algorithm Priority Evaluation SumNULLScheduling algorithm priority evaluation duration
scheduler.scheduling.scheduling_latency.quantileKube Scheduler Scheduling Latency SecondsNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.binding.latency.countKube Scheduler Binding Latency Microseconds CountNULLTotal Binding latency in microseconds count
scheduler.e2e.scheduling_duration.sumKube Scheduler E2E Scheduling Duration Seconds SumNULLE2e scheduling latency in seconds (scheduling algorithm + binding)
scheduler.cache.lookupsKube Scheduler Equiv Cache Lookups TotalNULLTotal number of equivalence cache lookups, by whether or not a cache entry was found
scheduler.go.goroutinesKube Scheduler Go GoroutinesNULLNumber of goroutines that currently exist
scheduler.scheduling.scheduling_duration.countKube Scheduler Scheduling Duration Seconds CountNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.scheduling.algorithm_duration.sumKube Scheduler Scheduling Algorithm Duration Seconds SumNULLScheduling algorithm latency in seconds sum
scheduler.gc_duration_seconds.quantileKube Scheduler Go GC Duration SecondsNULLA summary of the GC invocation durations
scheduler.e2e.scheduling_duration.countKube Scheduler E2E Scheduling Duration Seconds CountNULLTotal E2e scheduling latency in seconds (scheduling algorithm + binding)
scheduler.client.http.requestsKube Scheduler Rest Client Requests TotalNULLNumber of HTTP requests, partitioned by status code, method, and host
scheduler.process.open_fdsKube Scheduler Process Open FdsNULLNumber of open file descriptors
scheduler.client.http.requests_duration.sumKube Scheduler Rest Client Request Latency Seconds SumNULLRequest latency in seconds sum. Broken down by verb and URL
scheduler.pod_preemption.victimsKube Scheduler Pod Preemption VictimsNULLNumber of selected preemption victims
scheduler.process.max_fdsKube Scheduler Process Max FdsNULLMaximum number of open file descriptors
scheduler.scheduling.scheduling_duration.sumKube Scheduler Scheduling Duration Seconds SumNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.volume_scheduling_duration.sumKube Scheduler Volume Scheduling Duration Seconds SumNULLVolume scheduling stage latency sum
scheduler.e2e.scheduling_latency.countKube Scheduler E2E Scheduling Latency Microseconds CountNULLTotal E2e scheduling latency in microseconds (scheduling algorithm + binding)
scheduler.scheduling.algorithm_duration.countKube Scheduler Scheduling Algorithm Duration Seconds CountNULLTotal Scheduling algorithm latency in seconds count
scheduler.scheduling.algorithm_latency.countKube Scheduler Scheduling Algorithm Latency Microseconds CountNULLTotal Scheduling algorithm latency in microseconds count
scheduler.e2e.scheduling_latency.sumKube Scheduler E2E Scheduling Latency Microseconds SumNULLE2e scheduling latency in microseconds (scheduling algorithm + binding)
scheduler.scheduling.algorithm.predicate_duration.sumKube Scheduler Scheduling Algorithm Predicate Evaluation SumNULLScheduling algorithm predicate evaluation duration
scheduler.scheduling.algorithm.preemption_duration.sumKube Scheduler Scheduling Algorithm Preemption Evaluation SumNULLScheduling algorithm preemption evaluation duration
scheduler.scheduling.scheduling_latency.sumKube Scheduler Scheduling Latency Seconds SumNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.binding.duration.countKube Scheduler Binding Duration Seconds CountNULLTotal Binding duration in seconds count
Agent G2 - Linux - Kubernetes Master Agent - K8sApiServerapiserver.go.goroutinesKube apiserver GoroutinesNULLNumber of goroutines that currently exist.
apiserver.authenticated.user.requests.countKube apiserver Authenticated User Requests CountNULLCounter of authenticated requests broken out by username.
apiserver.rest.client.requests.totalKube apiserver Rest Client Requests TotalNULLNumber of HTTP requests, partitioned by status code, method, and host.
apiserver.rest.client.requests.total.countKube apiserver Rest Client Requests Total CountNULLNumber of HTTP requests, partitioned by status code, method, and host.
apiserver.request.count.countKube apiserver Request Count CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.APIServiceRegistrationController.depthKube apiserver APIService Registration Controller DepthNULLCurrent depth of workqueue: APIServiceRegistrationController
apiserver.audit.event.totalKube apiserver Audit Event TotalNULLCounter of audit events generated and sent to the audit backend.
apiserver.go.threads.totalKube apiserver Go Threads TotalNULLNumber of OS threads created.
apiserver.http.requests.total.countKube apiserver HTTP Requests Total CountNULLTotal number of HTTP requests made.
apiserver.http.requests.totalKube apiserver HTTP Requests TotalNULLTotal number of HTTP requests made.
apiserver.inflight.requestsKube apiserver Inflight RequestsNULLMaximal number of currently used inflight request limit of this apiserver per request kind in the last second.
apiserver.request.countKube apiserver Request CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.etcd.object.countsKube apiserver ETCD Object CountsNULLNumber of stored objects at the time of the last check split by kind.
apiserver.dropped.requests.totalKube apiserver Dropped Requests TotalNULLAccumulated number of requests dropped with Try-again-later response
apiserver.dropped.requests.total.countKube apiserver Dropped Requests Total CountNULLMonotonic count of requests dropped with Try-again-later response
apiserver.authenticated.user.requestsKube apiserver Authenticated User RequestsNULLCounter of authenticated requests broken out by username.
apiserver.request.duration.seconds.bucketKube apiserver Request Duration Seconds BuckethistogramResponse latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope, and component.
Agent G2 - Linux - Kubernetes Master Agent - K8scorednscoredns.request_duration.seconds.countRequest Duration Seconds CountNULLDuration per upstream interaction
coredns.request_duration.seconds.sumRequest Duration Seconds SumNULLDuration to process each query.
coredns.panicsTotal PanicsNULLTotal number of panics.
coredns.query.countQuery countNULLTotal query count.
coredns.response_size.bytes.sumResponse Size Bytes SumNULLSize of the returned response in bytes.
Agent G2 - Linux - Kubernetes Master Agent - K8sControllercontroller.workqueue.work_duration.sumKube Controller Workqueue Work Duration Seconds SumNULLHow long in seconds processing an item from workqueue takes
controller.process.open_fdsKube Controller Process Open FdsNULLNumber of open file descriptors
controller.go.goroutinesKube Controller Go GoroutinesNULLNumber of goroutines that currently exist
controller.workqueue.work_duration.countKube Controller Workqueue Work Duration Seconds CountNULLTotal time in seconds processing an item from workqueue takes
controller.workqueue.queue_duration.sumKube Controller Workqueue Queue Duration Seconds SumNULLHow long in seconds an item stays in workqueue before being requested.
controller.workqueue.nodes.countKube Controller Registered NodesNULLNumber of registered Nodes per zone.
controller.threadsKube Controller Os ThreadsNULLNumber of OS threads created.
controller.workqueue.retriesKube Controller Workqueue Retries TotalNULLTotal number of retries handled by workqueue.
controller.workqueue.work_unfinished_durationKube Controller Workqueue Unfinished Work SecondsNULLHow many seconds of work has been done that is in progress and hasn't been observed by work_duration. Large values indicate stuck threads.
controller.workqueue.work_longest_durationKube Controller Workqueue Longest Running Processor SecondsNULLHow many seconds has the longest running processor for workqueue been running.
controller.workqueue.depthKube Controller Workqueue DepthNULLCurrent depth of workqueue.
controller.workqueue.nodes.unhealthyKube Controller Node Collector Unhealthy Nodes in ZoneNULLNumber of not Ready Nodes per zone.
controller.rate_limiter.useKube Controller Node Lifecycle Controller Rate Limiter UseNULLA metric measuring the saturation of the rate limiter for node_lifecycle_controller.
controller.workqueue.addsKube Controller Workqueue Adds TotalNULLTotal number of adds handled by workqueue.
controller.process.max_fdsKube Controller Process Max FdsNULLMaximum number of open file descriptors.
controller.workqueue.queue_duration.countKube Controller Workqueue Queue Duration Seconds CountNULLTotal time in seconds an item stays in workqueue before being requested.
controller.workqueue.nodes.evictionsKube Controller Node Collector Evictions NumberNULLNumber of Node evictions that happened since current instance of NodeController started.

Agent G2 - Linux - Kubernetes Monitoring Template

Description

Agent G2 - Linux - Kubernetes Monitoring Template

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes Monitoring Template - K8scorednscoredns.response_size.bytes.sumResponse Size Bytes SumNULLSize of the returned response in bytes.
coredns.request_duration.seconds.sumRequest Duration Seconds SumNULLDuration to process each query.
coredns.panicsTotal PanicsNULLTotal number of panics.
coredns.query.countQuery countNULLTotal query count.
coredns.request_duration.seconds.countRequest Duration Seconds CountNULLDuration per upstream interaction
Agent G2 - Linux - Kubernetes Monitoring Template - K8sstatekubernetes_state.resourcequota.requests.cpu.limitResourcequota Requests Cpu LimitNULLHard limit on the total of CPU core requested for a resource quota
kubernetes_state.resourcequota.requests.memory.limitResourcequota Requests Memory LimitNULLHard limit on the total of memory bytes requested for a resource quota
kubernetes_state.daemonset.desiredDaemonset DesiredNULLThe number of nodes that should be running the daemon pod.
kubernetes_state.node.cpu_capacityNode Cpu CapacityNULLThe total CPU resources of the node.
kubernetes_state.container.restartsContainer RestartsNULLThe number of restarts per container
kubernetes_state.deployment.replicas_unavailableDeployment Replicas UnavailableNULLThe number of unavailable replicas per deployment.
kubernetes_state.daemonset.scheduledDaemonset ScheduledNULLThe number of nodes that are running at least one daemon pod and are supposed to.
kubernetes_state.replicaset.fully_labeled_replicasReplicaset Fully Labeled ReplicasNULLThe number of fully labeled replicas per ReplicaSet.
kubernetes_state.node.pods_capacityNode Pods CapacityNULLThe total pod resources of the node.
kubernetes_state.daemonset.readyDaemonset ReadyNULLThe number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready.
kubernetes_state.resourcequota.persistentvolumeclaims.usedResourcequota Persistentvolumeclaims UsedNULLObserved number of persistent volume claims used for a resource quota
kubernetes_state.replicaset.replicasReplicaset ReplicasNULLThe number of replicas per ReplicaSet.
kubernetes_state.deployment.replicas_desiredDeployment Replicas DesiredNULLThe number of desired replicas per deployment.
kubernetes_state.container.cpu_limitContainer Cpu LimitNULLThe limit on cpu cores to be used by a container
kubernetes_state.resourcequota.services.loadbalancers.limitResourcequota Services Loadbalancers LimitNULLHard limit of the number of loadbalancers for a resource quota
kubernetes_state.resourcequota.limits.memory.usedResourcequota Limits Memory UsedNULLObserved sum of limits for memory bytes for a resource quota
kubernetes_state.container.memory_limitContainer Memory LimitNULLThe limit on memory to be used by a container
kubernetes_state.deployment.replicasDeployment ReplicasNULLThe number of replicas per deployment.
kubernetes_state.resourcequota.services.usedResourcequota Services UsedNULLObserved number of services used for a resource quota
kubernetes_state.deployment.replicas_availableDeployment Replicas AvailableNULLThe number of available replicas per deployment.
kubernetes_state.resourcequota.services.loadbalancers.usedResourcequota Services Loadbalancers UsedNULLObserved number of loadbalancers used for a resource quota
kubernetes_state.resourcequota.limits.cpu.limitResourcequota Limits Cpu LimitNULLHard limit on the sum of CPU core limits for a resource quota
kubernetes_state.resourcequota.limits.cpu.usedResourcequota Limits Cpu UsedNULLObserved sum of limits for CPU cores for a resource quota
kubernetes_state.replicaset.replicas_desiredReplicaset Replicas DesiredNULLNumber of desired pods for a ReplicaSet
kubernetes_state.resourcequota.services.limitResourcequota Services LimitNULLHard limit of the number of services for a resource quota
kubernetes_state.daemonset.misscheduledDaemonset MisscheduledNULLThe number of nodes running a daemon pod but are not supposed to.
kubernetes_state.resourcequota.requests.cpu.usedResourcequota Requests Cpu UsedNULLObserved sum of CPU cores requested for a resource quota
kubernetes_state.resourcequota.requests.storage.limitResourcequota Requests Storage LimitNULLHard limit on the total of storage bytes requested for a resource quota
kubernetes_state.resourcequota.pods.usedResourcequota Pods UsedNULLObserved number of pods used for a resource quota
kubernetes_state.node.memory_allocatableNode Memory AllocatableNULLThe memory resources of a node that are available for scheduling.
kubernetes_state.resourcequota.services.nodeports.usedResourcequota Services Nodeports UsedNULLObserved number of node ports used for a resource quota
kubernetes_state.resourcequota.limits.memory.limitResourcequota Limits Memory LimitNULLHard limit on the sum of memory bytes limits for a resource quota
kubernetes_state.node.pods_allocatableNode Pods AllocatableNULLThe pod resources of a node that are available for scheduling.
kubernetes_state.deployment.rollingupdate.max_unavailableDeployment Rollingupdate Max UnavailableNULLMaximum number of unavailable replicas during a rolling update of a deployment.
kubernetes_state.container.memory_requestedContainer Memory RequestedNULLThe number of requested memory bytes by a container
kubernetes_state.resourcequota.requests.memory.usedResourcequota Requests Memory UsedNULLObserved sum of memory bytes requested for a resource quota
kubernetes_state.deployment.replicas_updatedDeployment Replicas UpdatedNULLThe number of updated replicas per deployment.
kubernetes_state.resourcequota.requests.storage.usedResourcequota Requests Storage UsedNULLObserved sum of storage bytes requested for a resource quota
kubernetes_state.replicaset.replicas_readyReplicaset Replicas ReadyNULLThe number of ready replicas per ReplicaSet
kubernetes_state.resourcequota.persistentvolumeclaims.limitResourcequota Persistentvolumeclaims LimitNULLHard limit of the number of PVC for a resource quota
kubernetes_state.container.cpu_requestedContainer Cpu RequestedNULLThe number of requested cpu cores by a container
kubernetes_state.resourcequota.pods.limitResourcequota Pods LimitNULLHard limit of the number of pods for a resource quota
kubernetes_state.resourcequota.services.nodeports.limitResourcequota Services Nodeports LimitNULLHard limit of the number of node ports for a resource quota
kubernetes_state.node.memory_capacityNode Memory CapacityNULLThe total memory resources of the node.
kubernetes_state.node.cpu_allocatableNode Cpu AllocatableNULLThe CPU resources of a node that are available for scheduling.
Agent G2 - Linux - Kubernetes Monitoring Template - K8sdnskubedns.response_size.bytes.countResponse Size Bytes CountNULLNumber of responses on which the kubedns.response_size.bytes.sum metric is evaluated.
kubedns.request_countRequest CountNULLTotal number of DNS requests made.
kubedns.request_duration.seconds.sumRequest Duration Seconds SumNULLTime (in seconds) each request took to resolve.
kubedns.request_duration.seconds.countRequest Duration Seconds CountNULLNumber of requests on which the kubedns.request_duration.seconds.sum metric is evaluated.
kubedns.cachemiss_countCachemiss CountNULLNumber of DNS cache misses (from the start of the process).
kubedns.error_countError CountNULLNumber of DNS requests resulting in an error.
kubedns.response_size.bytes.sumResponse Size Bytes SumNULLSize of the returned response in bytes.

Agent G2 - Linux - Kubernetes Metric Server - v2

Description

Template for monitoring Kubernetes using Kubernetes Metric Server. The earlier template “Agent G2 - Linux - Kubernetes Metric Server” does not currently support customizing the namespace. This v2 template monitors the “kube-system” namespace by default, and customers can specify a different namespace if needed.

Prerequisites

No Prerequisites.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Kubernetes Metric Server Monitor - v2metrics_server.authenticated_user_requestsAuthenticated User RequestsnullCounter of authenticated requests broken out by username.
metrics_server.go_gc_duration_seconds_countGo GC Duration Seconds CountnullA summary of the GC invocation durations.
metrics_server.go_gc_duration_seconds_quantileGo GC Duration Seconds QuantileSecondsA summary of the GC invocation durations.
metrics_server.go_gc_duration_seconds_sumGo GC Duration Seconds SumnullA summary of the GC invocation durations.
metrics_server.go_goroutinesGo GoroutinesnullNumber of goroutines that currently exist.
metrics_server.kubelet_summary_request_duration_countKubelet Summary Request Duration CountnullThe Kubelet summary request latencies in seconds.
metrics_server.kubelet_summary_request_duration_sumKubelet Summary Request Duration SumnullThe Kubelet summary request latencies in seconds.
metrics_server.kubelet_summary_scrapes_totalKubelet Summary Scrapes TotalnullTotal number of attempted Summary API scrapes done by Metrics Server.
metrics_server.manager_tick_duration_countManager Tick Duration CountnullThe total time spent collecting and storing metrics in seconds.
metrics_server.manager_tick_duration_sumManager Tick Duration SumnullThe total time spent collecting and storing metrics in seconds.
metrics_server.process_cpu_seconds_totalProcess Cpu Seconds TotalnullTotal user and system CPU time spent in seconds.
metrics_server.process_max_fdsProcess Max FdsnullMaximum number of open file descriptors.
metrics_server.process_open_fdsProcess Open FdsnullNumber of open file descriptors.
metrics_server.scraper_duration_countScraper Duration CountnullTime spent scraping sources in seconds.
metrics_server.scraper_duration_sumScraper Duration SumnullTime spent scraping sources in seconds.
metrics_server.scraper_last_timeScraper Last TimenullLast time metrics-server performed a scrape since unix epoch in seconds.

Agent G2 - Linux - Kubernetes Scheduler

Description

Template for monitoring default Kubernetes Scheduler

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes Schedulerscheduler.schedule_attempts.totalKube Scheduler Schedule Attempts TotalNULLNumber of attempts to schedule pods, by the result. 'unschedulable' means a pod could not be scheduled, while 'error' means an internal scheduler problem
scheduler.scheduling.algorithm.priority_duration.countKube Scheduler Scheduling Algorithm Priority Evaluation CountNULLScheduling algorithm priority evaluation duration
scheduler.client.http.requests_duration.countKube Scheduler Rest Client Request Latency Seconds CountNULLTotal request latency in seconds. Broken down by verb and URL
scheduler.gc_duration_seconds.countKube Scheduler Go GC Duration Seconds CountNULLA summary of the GC invocation durations
scheduler.scheduling.scheduling_latency.sumKube Scheduler Scheduling Latency Seconds SumNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.pod_preemption.attemptsKube Scheduler Total Preemption AttemptsNULLTotal preemption attempts in the cluster till now
scheduler.client.http.requests_duration.sumKube Scheduler Rest Client Request Latency Seconds SumNULLRequest latency in seconds sum. Broken down by verb and URL
scheduler.e2e.scheduling_latency.sumKube Scheduler E2E Scheduling Latency Microseconds SumNULLE2e scheduling latency in microseconds (scheduling algorithm + binding)
scheduler.scheduling.algorithm.preemption_duration.countKube Scheduler Scheduling Algorithm Preemption Evaluation CountNULLScheduling algorithm preemption evaluation duration
scheduler.scheduling.algorithm.predicate_duration.countKube Scheduler Scheduling Algorithm Predicate Evaluation CountNULLScheduling algorithm predicate evaluation duration
scheduler.scheduling.algorithm.predicate_duration.sumKube Scheduler Scheduling Algorithm Predicate Evaluation SumNULLScheduling algorithm predicate evaluation duration
scheduler.pod_preemption.victimsKube Scheduler Pod Preemption VictimsNULLNumber of selected preemption victims
scheduler.binding.duration.countKube Scheduler Binding Duration Seconds CountNULLTotal Binding duration in seconds count
scheduler.scheduling.algorithm.priority_duration.sumKube Scheduler Scheduling Algorithm Priority Evaluation SumNULLScheduling algorithm priority evaluation duration
scheduler.binding.duration.secondsKube Scheduler Binding Duration Seconds SumNULLBinding duration in seconds sum
scheduler.volume_scheduling_duration.countKube Scheduler Volume Scheduling Duration Seconds CountNULLVolume scheduling stage latency count
scheduler.e2e.scheduling_latency.countKube Scheduler E2E Scheduling Latency Microseconds CountNULLTotal E2e scheduling latency in microseconds (scheduling algorithm + binding)
scheduler.binding.latency.countKube Scheduler Binding Latency Microseconds CountNULLTotal Binding latency in microseconds count
scheduler.volume_scheduling_duration.sumKube Scheduler Volume Scheduling Duration Seconds SumNULLVolume scheduling stage latency sum
scheduler.scheduling.scheduling_duration.quantileKube Scheduler Scheduling Duration SecondsNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.threadsKube Scheduler OS ThreadsNULLNumber of OS threads created
scheduler.process.open_fdsKube Scheduler Process Open FdsNULLNumber of open file descriptors
scheduler.go.goroutinesKube Scheduler Go GoroutinesNULLNumber of goroutines that currently exist
scheduler.process.max_fdsKube Scheduler Process Max FdsNULLMaximum number of open file descriptors
scheduler.binding.latency.sumKube Scheduler Binding Latency Microseconds SumNULLBinding latency in microseconds sum
scheduler.scheduling.algorithm_latency.sumKube Scheduler Scheduling Algorithm Latency Microseconds SumNULLScheduling algorithm latency in microseconds sum
scheduler.e2e.scheduling_duration.sumKube Scheduler E2E Scheduling Duration Seconds SumNULLE2e scheduling latency in seconds (scheduling algorithm + binding)
scheduler.scheduling.algorithm_latency.countKube Scheduler Scheduling Algorithm Latency Microseconds CountNULLTotal Scheduling algorithm latency in microseconds count
scheduler.scheduling.scheduling_duration.sumKube Scheduler Scheduling Duration Seconds SumNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.client.http.requestsKube Scheduler Rest Client Requests TotalNULLNumber of HTTP requests, partitioned by status code, method, and host
scheduler.e2e.scheduling_duration.countKube Scheduler E2E Scheduling Duration Seconds CountNULLTotal E2e scheduling latency in seconds (scheduling algorithm + binding)
scheduler.scheduling.algorithm.preemption_duration.sumKube Scheduler Scheduling Algorithm Preemption Evaluation SumNULLScheduling algorithm preemption evaluation duration
scheduler.cache.lookupsKube Scheduler Equiv Cache Lookups TotalNULLTotal number of equivalence cache lookups, by whether or not a cache entry was found
scheduler.gc_duration_seconds.sumKube Scheduler Go GC Duration Seconds SumNULLA summary of the GC invocation durations
scheduler.scheduling.algorithm_duration.countKube Scheduler Scheduling Algorithm Duration Seconds CountNULLTotal Scheduling algorithm latency in seconds count
scheduler.scheduling.scheduling_latency.countKube Scheduler Scheduling Latency Seconds CountNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.scheduling.algorithm_duration.sumKube Scheduler Scheduling Algorithm Duration Seconds SumNULLScheduling algorithm latency in seconds sum
scheduler.scheduling.scheduling_duration.countKube Scheduler Scheduling Duration Seconds CountNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.scheduling.scheduling_latency.quantileKube Scheduler Scheduling Latency SecondsNULLScheduling latency in seconds split by sub-parts of the scheduling operation
scheduler.gc_duration_seconds.quantileKube Scheduler Go GC Duration SecondsNULLA summary of the GC invocation durations
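
Many of the scheduler metrics above are exposed as `.sum` and `.count` pairs (for example `scheduler.e2e.scheduling_duration.sum` and `scheduler.e2e.scheduling_duration.count`). A mean value per observation over an interval can be derived by dividing the change in the sum by the change in the count. The following is a minimal sketch, assuming both values are cumulative and that two consecutive samples are available; the function name and the sample values are hypothetical and are not part of the template.

```python
from typing import Optional


def mean_per_observation(
    prev_sum: float, prev_count: float, curr_sum: float, curr_count: float
) -> Optional[float]:
    """Mean value per observation between two samples of a sum/count pair.

    Assumes both series are cumulative. Returns None when there were no new
    observations or when a counter reset makes the deltas unusable.
    """
    delta_sum = curr_sum - prev_sum
    delta_count = curr_count - prev_count
    if delta_count <= 0 or delta_sum < 0:
        return None
    return delta_sum / delta_count


# Hypothetical samples of an e2e scheduling duration sum (seconds) and count.
average = mean_per_observation(10.0, 100, 12.5, 150)
print(average)
```

Returning `None` rather than zero keeps "no new observations" distinguishable from a genuinely zero latency.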

Agent G2 - Linux - Kubernetes State

Description

Template for monitoring Kubernetes using Kube State

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes Statekubernetes_state.node.memory_capacityNode Memory CapacityNULLThe total memory resources of the node.
kubernetes_state.resourcequota.limits.memory.limitResourcequota Limits Memory LimitNULLHard limit on the sum of memory bytes limits for a resource quota
kubernetes_state.node.cpu_capacityNode Cpu CapacityNULLThe total CPU resources of the node.
kubernetes_state.container.memory_limitContainer Memory LimitNULLThe limit on memory to be used by a container
kubernetes_state.daemonset.scheduledDaemonset ScheduledNULLThe number of nodes running at least one daemon pod and are supposed to.
kubernetes_state.deployment.replicas_availableDeployment Replicas AvailableNULLThe number of available replicas per deployment.
kubernetes_state.node.memory_allocatableNode Memory AllocatableNULLThe memory resources of a node that are available for scheduling.
kubernetes_state.resourcequota.persistentvolumeclaims.usedResourcequota Persistentvolumeclaims UsedNULLObserved number of persistent volume claims used for a resource quota
kubernetes_state.deployment.rollingupdate.max_unavailableDeployment Rollingupdate Max UnavailableNULLMaximum number of unavailable replicas during a rolling update of a deployment.
kubernetes_state.deployment.replicas_desiredDeployment Replicas DesiredNULLThe number of desired replicas per deployment.
kubernetes_state.resourcequota.services.loadbalancers.limitResourcequota Services Loadbalancers LimitNULLHard limit of the number of loadbalancers for a resource quota
kubernetes_state.resourcequota.pods.limitResourcequota Pods LimitNULLHard limit of the number of pods for a resource quota
kubernetes_state.resourcequota.services.limitResourcequota Services LimitNULLHard limit of the number of services for a resource quota
kubernetes_state.resourcequota.services.usedResourcequota Services UsedNULLObserved number of services used for a resource quota
kubernetes_state.resourcequota.requests.cpu.usedResourcequota Requests Cpu UsedNULLObserved sum of CPU cores requested for a resource quota
kubernetes_state.resourcequota.limits.memory.usedResourcequota Limits Memory UsedNULLObserved sum of limits for memory bytes for a resource quota
kubernetes_state.resourcequota.persistentvolumeclaims.limitResourcequota Persistentvolumeclaims LimitNULLHard limit of the number of PVC for a resource quota
kubernetes_state.resourcequota.pods.usedResourcequota Pods UsedNULLObserved number of pods used for a resource quota
kubernetes_state.resourcequota.requests.memory.limitResourcequota Requests Memory LimitNULLHard limit on the total of memory bytes requested for a resource quota
kubernetes_state.resourcequota.requests.storage.limitResourcequota Requests Storage LimitNULLHard limit on the total of storage bytes requested for a resource quota
kubernetes_state.daemonset.misscheduledDaemonset MisscheduledNULLThe number of nodes running a daemon pod but are not supposed to.
kubernetes_state.deployment.replicas_unavailableDeployment Replicas UnavailableNULLThe number of unavailable replicas per deployment.
kubernetes_state.resourcequota.limits.cpu.limitResourcequota Limits Cpu LimitNULLHard limit on the sum of CPU core limits for a resource quota
kubernetes_state.daemonset.readyDaemonset ReadyNULLThe number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready.
kubernetes_state.node.pods_capacityNode Pods CapacityNULLThe total pod resources of the node.
kubernetes_state.replicaset.fully_labeled_replicasReplicaset Fully Labeled ReplicasNULLThe number of fully labeled replicas per ReplicaSet.
kubernetes_state.deployment.replicasDeployment ReplicasNULLThe number of replicas per deployment.
kubernetes_state.resourcequota.requests.cpu.limitResourcequota Requests Cpu LimitNULLHard limit on the total of CPU cores requested for a resource quota
kubernetes_state.replicaset.replicas_readyReplicaset Replicas ReadyNULLThe number of ready replicas per ReplicaSet
kubernetes_state.container.memory_requestedContainer Memory RequestedNULLThe number of requested memory bytes by a container
kubernetes_state.node.cpu_allocatableNode Cpu AllocatableNULLThe CPU resources of a node that are available for scheduling.
kubernetes_state.replicaset.replicas_desiredReplicaset Replicas DesiredNULLNumber of desired pods for a ReplicaSet
kubernetes_state.container.cpu_limitContainer Cpu LimitNULLThe limit on cpu cores to be used by a container
kubernetes_state.replicaset.replicasReplicaset ReplicasNULLThe number of replicas per ReplicaSet.
kubernetes_state.resourcequota.services.nodeports.usedResourcequota Services Nodeports UsedNULLObserved number of node ports used for a resource quota
kubernetes_state.container.cpu_requestedContainer Cpu RequestedNULLThe number of requested cpu cores by a container
kubernetes_state.resourcequota.services.loadbalancers.usedResourcequota Services Loadbalancers UsedNULLObserved number of loadbalancers used for a resource quota
kubernetes_state.resourcequota.limits.cpu.usedResourcequota Limits Cpu UsedNULLObserved sum of limits for CPU cores for a resource quota
kubernetes_state.resourcequota.requests.memory.usedResourcequota Requests Memory UsedNULLObserved sum of memory bytes requested for a resource quota
kubernetes_state.daemonset.desiredDaemonset DesiredNULLThe number of nodes that should be running the daemon pod.
kubernetes_state.container.restartsContainer RestartsNULLThe number of restarts per container
kubernetes_state.resourcequota.services.nodeports.limitResourcequota Services Nodeports LimitNULLHard limit of the number of node ports for a resource quota
kubernetes_state.deployment.replicas_updatedDeployment Replicas UpdatedNULLThe number of updated replicas per deployment.
kubernetes_state.resourcequota.requests.storage.usedResourcequota Requests Storage UsedNULLObserved sum of storage bytes requested for a resource quota

Agent G2 - Linux - Kubernetes State Monitoring

Description

Monitors to collect kube-state-metrics from Kubernetes cluster.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Kubernetes State Monitoringkubernetes_state.resourcequota.limits.memory.usedResourcequota Limits Memory UsedNULLObserved sum of limits for memory bytes for a resource quota
kubernetes_state.deployment.replicas_availableDeployment Replicas AvailableNULLThe number of available replicas per deployment.
kubernetes_state.deployment.replicas_unavailableDeployment Replicas UnavailableNULLThe number of unavailable replicas per deployment.
kubernetes_state.resourcequota.persistentvolumeclaims.limitResourcequota Persistentvolumeclaims LimitNULLHard limit of the number of PVC for a resource quota
kubernetes_state.resourcequota.limits.cpu.limitResourcequota Limits Cpu LimitNULLHard limit on the sum of CPU core limits for a resource quota
kubernetes_state.resourcequota.services.limitResourcequota Services LimitNULLHard limit of the number of services for a resource quota
kubernetes_state.resourcequota.requests.storage.limitResourcequota Requests Storage LimitNULLHard limit on the total of storage bytes requested for a resource quota
kubernetes_state.daemonset.misscheduledDaemonset MisscheduledNULLThe number of nodes running a daemon pod but are not supposed to.
kubernetes_state.node.cpu_capacityNode Cpu CapacityNULLThe total CPU resources of the node.
kubernetes_state.resourcequota.services.loadbalancers.limitResourcequota Services Loadbalancers LimitNULLHard limit of the number of loadbalancers for a resource quota
kubernetes_state.container.memory_requestedContainer Memory RequestedNULLThe number of requested memory bytes by a container
kubernetes_state.deployment.replicasDeployment ReplicasNULLThe number of replicas per deployment.
kubernetes_state.resourcequota.requests.memory.limitResourcequota Requests Memory LimitNULLHard limit on the total of memory bytes requested for a resource quota
kubernetes_state.resourcequota.requests.memory.usedResourcequota Requests Memory UsedNULLObserved sum of memory bytes requested for a resource quota
kubernetes_state.replicaset.replicasReplicaset ReplicasNULLThe number of replicas per ReplicaSet.
kubernetes_state.deployment.rollingupdate.max_unavailableDeployment Rollingupdate Max UnavailableNULLMaximum number of unavailable replicas during a rolling update of a deployment.
kubernetes_state.deployment.replicas_updatedDeployment Replicas UpdatedNULLThe number of updated replicas per deployment.
kubernetes_state.node.pods_allocatableNode Pods AllocatableNULLThe pod resources of a node that are available for scheduling.
kubernetes_state.resourcequota.limits.cpu.usedResourcequota Limits Cpu UsedNULLObserved sum of limits for CPU cores for a resource quota
kubernetes_state.resourcequota.services.usedResourcequota Services UsedNULLObserved number of services used for a resource quota
kubernetes_state.daemonset.desiredDaemonset DesiredNULLThe number of nodes that should be running the daemon pod.
kubernetes_state.daemonset.scheduledDaemonset ScheduledNULLThe number of nodes running at least one daemon pod and are supposed to.
kubernetes_state.resourcequota.services.nodeports.usedResourcequota Services Nodeports UsedNULLObserved number of node ports used for a resource quota
kubernetes_state.replicaset.fully_labeled_replicasReplicaset Fully Labeled ReplicasNULLThe number of fully labeled replicas per ReplicaSet.
kubernetes_state.resourcequota.requests.cpu.limitResourcequota Requests Cpu LimitNULLHard limit on the total of CPU cores requested for a resource quota
kubernetes_state.resourcequota.services.nodeports.limitResourcequota Services Nodeports LimitNULLHard limit of the number of node ports for a resource quota
kubernetes_state.node.memory_capacityNode Memory CapacityNULLThe total memory resources of the node.
kubernetes_state.daemonset.readyDaemonset ReadyNULLThe number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready.
kubernetes_state.deployment.replicas_desiredDeployment Replicas DesiredNULLThe number of desired replicas per deployment.
kubernetes_state.replicaset.replicas_desiredReplicaset Replicas DesiredNULLNumber of desired pods for a ReplicaSet
kubernetes_state.resourcequota.requests.cpu.usedResourcequota Requests Cpu UsedNULLObserved sum of CPU cores requested for a resource quota
kubernetes_state.resourcequota.services.loadbalancers.usedResourcequota Services Loadbalancers UsedNULLObserved number of loadbalancers used for a resource quota
kubernetes_state.container.restartsContainer RestartsNULLThe number of restarts per container
kubernetes_state.resourcequota.requests.storage.usedResourcequota Requests Storage UsedNULLObserved sum of storage bytes requested for a resource quota
kubernetes_state.node.pods_capacityNode Pods CapacityNULLThe total pod resources of the node.
kubernetes_state.resourcequota.pods.limitResourcequota Pods LimitNULLHard limit of the number of pods for a resource quota
kubernetes_state.node.memory_allocatableNode Memory AllocatableNULLThe memory resources of a node that are available for scheduling.
kubernetes_state.node.cpu_allocatableNode Cpu AllocatableNULLThe CPU resources of a node that are available for scheduling.
kubernetes_state.replicaset.replicas_readyReplicaset Replicas ReadyNULLThe number of ready replicas per ReplicaSet
kubernetes_state.container.memory_limitContainer Memory LimitNULLThe limit on memory to be used by a container
kubernetes_state.resourcequota.pods.usedResourcequota Pods UsedNULLObserved number of pods used for a resource quota
kubernetes_state.container.cpu_limitContainer Cpu LimitNULLThe limit on cpu cores to be used by a container
kubernetes_state.resourcequota.limits.memory.limitResourcequota Limits Memory LimitNULLHard limit on the sum of memory bytes limits for a resource quota
kubernetes_state.container.cpu_requestedContainer Cpu RequestedNULLThe number of requested cpu cores by a container
kubernetes_state.resourcequota.persistentvolumeclaims.usedResourcequota Persistentvolumeclaims UsedNULLObserved number of persistent volume claims used for a resource quota

Agent G2 - Linux - KVM Monitoring Template

Description

Agent G2 - Linux - KVM Monitoring Template

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - KVM Monitoring Templatekvm.diskKvm DiskNULL
kvm.disk.iopsKvm Disk IopsNULL
kvm.networkKvm NetworkNULL
kvm.domain.statesKvm Domain StatesNULL
kvm.memoryKvm MemoryNULL
kvm.cpuKvm CpuNULL
kvm.total.domainsKvm Total DomainsNULL

Agent G2 - Linux - KVM Monitors

Description

Linux KVM Monitoring Template

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - KVM Monitorskvm.diskKvm DiskNULL
kvm.disk.iopsKvm Disk IopsNULL
kvm.networkKvm NetworkNULL
kvm.domain.statesKvm Domain StatesNULL
kvm.memoryKvm MemoryNULL
kvm.cpuKvm CpuNULL
kvm.total.domainsKvm Total DomainsNULL

Agent G2 - Linux - Lighttpd Monitors

Description

Monitors Lighttpd application metrics from the server-status module.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Lighttpd Monitorslighttpd.busy_serversLighttpd Busy ServersNULLProvides the number of busy servers
lighttpd.uptimeLighttpd UptimeNULLChecks the uptime of the lighttpd service
lighttpd.bytes_per_requestLighttpd-BytesPerRequestNULLProvides the number of bytes transferred per request
lighttpd.bytes_per_requestLighttpd Bytes per requestNULLProvides the number of bytes transferred per request
lighttpd.idle_serversLighttpd Idle ServersNULLProvides the number of idle servers
lighttpd.open.slotsLighttpd Open SlotsNULLProvides the number of open slots
lighttpd.total.accessesLighttpd Total AccessesNULLProvides the total number of accesses made
lighttpd.bytes_rateLighttpd Bytes RateNULLProvides the number of bytes transferred per second
lighttpd.requests_rateLighttpd Requests RateNULLProvides the number of requests made per second
lighttpd.total.kbytesLighttpd Total kBytesNULLProvides the number of total kbytes

Agent G2 - Linux - Memcache Monitors

Description

Monitors Memcache application metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Memcache Monitorsmemcache.curr_connectionsMemcache-CurrentConnectionsNULLNumber of open connections
memcache.memory_usedMemcache-MemoryUsedNULLTotal memory used by the server engine
memcache.uptimeMemcache-UptimeNULLNumber of minutes this server has been running
memcache.tempoom_rateMemcache-TempOOMPersecNULLNumber of temporary out-of-memory errors sent to clients per second.
memcache.gets_rateMemcache-GetsPersecNULLCumulative number of get requests for node per second
memcache.sets_rateMemcache-SetsPersecNULLCumulative number of set requests for node per second
memcache.avg_item_sizeMemcache-AvgItemSizeNULLAverage size of an item
memcache.misses_rateMemcache-MissesPersecNULLNumber of items that have been requested but not found per second.
memcache.bytes_written_rateMemcache-WrittenBytesPersecNULLAverage data written by this server in a second from the network in MB
memcache.fill_percentMemcache-FillPercentNULLPercentage of bytes used by this server
memcache.ratio.resident.itemMemcache-ResidentItemRatioNULLPercentage of items that are resident (in RAM)
memcache.cas_hits_rateMemcache-CASHitsPersecNULLNumber of successful CAS operations per second.
memcache.evictions_rateMemcache-EvictionsPersecNULLNumber of valid items removed per second, from cache to free memory for new items.
memcache.bytes_read_rateMemcache-ReadBytesPersecNULLAverage data read by this server in a second from the network in MB.
memcache.get_hit_percentMemcache-GetHitPercentNULLPercentage of requested keys that were found. A higher value indicates better cache performance.
memcache.connections_rateMemcache-ConnectionsPersecNULLAverage number of connections per second
memcache.ops_rateMemcache-OpsPersecNULLNumber of total operations for node per second.
memcache.hits_rateMemcache-HitsPersecNULLNumber of keys that have been requested and found per second.
memcache.disk_reads_rateMemcache-DiskReadsPersecNULLNumber of items fetched from disk.
memcache.cas_badval_rateMemcache-CASBadvalPersecNULLNumber of CAS operations per second that failed to modify a value due to a bad CAS id
memcache.curr_itemsMemcache-CurrentItemsNULLCurrent number of items stored by the server.
memcache.ratio.cache.missMemcache-CacheMissRatioNULLPercentage number of items fetched from disk against total requests.
memcache.cas_misses_rateMemcache-CASMissesPersecNULLNumber of CAS operations per second against missing keys.
memcache.delete_hits_rateMemcache-DeletesPersecNULLNumber of successful deletions per second
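
The hit percentage described above relates to the hit and miss rates in the same table. The following is a minimal sketch of that relationship, assuming the percentage is the share of hits among all get lookups (hits plus misses); how the agent actually computes `memcache.get_hit_percent` is not stated here, and the function name and sample values are hypothetical.

```python
from typing import Optional


def get_hit_percent(hits_rate: float, misses_rate: float) -> Optional[float]:
    """Share of lookups that were hits, as a percentage.

    Assumes hits_rate and misses_rate cover the same interval. Returns None
    when there were no lookups, since the ratio is undefined in that case.
    """
    total = hits_rate + misses_rate
    if total <= 0:
        return None
    return 100.0 * hits_rate / total


# Hypothetical values for memcache.hits_rate and memcache.misses_rate.
print(get_hit_percent(90.0, 10.0))
```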

Agent G2 - Linux - Memcached Performance Check

Description

Monitors Memcache application metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Memcached Performance Checkmemcache.curr_connectionsMemcache-CurrentConnectionsNULLNumber of open connections
memcache.memory_usedMemcache-MemoryUsedNULLTotal memory used by the server engine
memcache.uptimeMemcache-UptimeNULLNumber of minutes this server has been running
memcache.tempoom_rateMemcache-TempOOMPersecNULLNumber of temporary out-of-memory errors sent to clients per second.
memcache.gets_rateMemcache-GetsPersecNULLCumulative number of get requests for node per second
memcache.sets_rateMemcache-SetsPersecNULLCumulative number of set requests for node per second
memcache.avg_item_sizeMemcache-AvgItemSizeNULLAverage size of an item
memcache.misses_rateMemcache-MissesPersecNULLNumber of items that have been requested but not found per second.
memcache.bytes_written_rateMemcache-WrittenBytesPersecNULLAverage data written by this server in a second from the network in MB
memcache.fill_percentMemcache-FillPercentNULLPercentage of bytes used by this server
memcache.ratio.resident.itemMemcache-ResidentItemRatioNULLPercentage of items that are resident (in RAM)
memcache.cas_hits_rateMemcache-CASHitsPersecNULLNumber of successful CAS operations per second.
memcache.evictions_rateMemcache-EvictionsPersecNULLNumber of valid items removed per second, from cache to free memory for new items.
memcache.bytes_read_rateMemcache-ReadBytesPersecNULLAverage data read by this server in a second from the network in MB.
memcache.get_hit_percentMemcache-GetHitPercentNULLPercentage of requested keys that were found. A higher value indicates better cache performance.
memcache.connections_rateMemcache-ConnectionsPersecNULLAverage number of connections per second
memcache.ops_rateMemcache-OpsPersecNULLNumber of total operations for node per second.
memcache.hits_rateMemcache-HitsPersecNULLNumber of keys that have been requested and found per second.
memcache.disk_reads_rateMemcache-DiskReadsPersecNULLNumber of items fetched from disk.
memcache.cas_badval_rateMemcache-CASBadvalPersecNULLNumber of CAS operations per second that failed to modify a value due to a bad CAS id
memcache.curr_itemsMemcache-CurrentItemsNULLCurrent number of items stored by the server.
memcache.ratio.cache.missMemcache-CacheMissRatioNULLPercentage number of items fetched from disk against total requests.
memcache.cas_misses_rateMemcache-CASMissesPersecNULLNumber of CAS operations per second against missing keys.
memcache.delete_hits_rateMemcache-DeletesPersecNULLNumber of successful deletions per second

Agent G2 - Linux - Memory Statistics

Description

Agent G2 - Linux - Memory Statistics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Memory StatisticsPhysicalMemoryPhysical MemoryMBPhysical memory information
memory.stats.swapping.rateVirtual Memory Swapping RateKB/sRate of swapping the entire process from physical memory to disk and vice-versa.
memory.stats.swap.util.rateSwap Memory Usage RateNULLRate of swap memory utilization.
memory.stats.physical.util.ratePhysical Memory Usage RateNULLRate of physical memory utilization.
SwapMemorySwap MemoryMBSwap memory information.
memory.stats.paging.rateVirtual Memory Paging RateKB/sRate of swapping least used process memory blocks from physical memory to disk and vice-versa.

Agent G2 - Linux - Mesos Agent Monitoring Template

Description

Agent G2 - Linux - Mesos Agent Monitoring Template

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Mesos Agent Monitoring Templatemesos.slave.tasks_runningTasks runningNULLNumber of running tasks
mesos.slave.invalid_framework_messagesInvalid framework messagesNULLNumber of invalid framework messages
mesos.slave.frameworks_activeFrameworks activeNULLNumber of active frameworks
mesos.slave.disk_percentAllocated disk space percentNULLPercentage of allocated disk space
mesos.slave.tasks_lostTasks lostNULLNumber of lost tasks
mesos.slave.executors_terminatedExecutors terminatedNULLNumber of executors terminated
mesos.slave.gpus_percentAllocated GPUs percentNULLPercentage of allocated GPUs
mesos.slave.cpus_usedNumber of CPUs usedNULLNumber of allocated CPUs
mesos.slave.tasks_startingTasks startingNULLNumber of starting tasks
mesos.slave.valid_framework_messagesValid framework messagesNULLNumber of valid framework messages
mesos.state.task.cpuTask cpuNULLTask cpu
mesos.slave.invalid_status_updatesInvalid status updatesNULLNumber of invalid status updates
mesos.slave.valid_status_updatesMesos slave valid_status_updatesNULLNumber of valid status updates
mesos.stats.system.load_15minMesos stats system load_15minNULLLoad average for the past 15 minutes
mesos.slave.disk_totalDisk space totalNULLDisk space
mesos.slave.mem_totalMemory totalNULLTotal memory
mesos.slave.recovery_errorsRecovery errorsNULLNumber of errors encountered during slave recovery
mesos.stats.uptime_secsUptime secondsNULLSlave uptime
mesos.slave.executors_runningExecutors runningNULLNumber of executors running
mesos.slave.mem_percentAllocated memory percentNULLPercentage of allocated memory
mesos.stats.system.mem_total_bytesMesos stats system mem_total_bytesNULLTotal memory
mesos.slave.disk_usedAllocated disk spaceNULLAllocated disk space
mesos.slave.executors_terminatingExecutors terminatingNULLNumber of executors terminating
mesos.slave.tasks_finishedTasks finishedNULLNumber of finished tasks
mesos.slave.gpus_usedNumber of GPUs usedNULLNumber of allocated GPUs
mesos.slave.cpus_totalCPUs totalNULLNumber of CPUs
mesos.slave.mem_usedAllocated memoryNULLUsed memory
mesos.stats.system.load_1minMesos stats system load_1minNULLLoad average for the past 1 minute
mesos.slave.tasks_stagingTasks stagingNULLNumber of staging tasks
mesos.state.task.memTask memNULLTask memory
mesos.slave.executors_registeringExecutors registeringNULLNumber of executors registering
mesos.slave.tasks_failedTasks failedNULLNumber of failed tasks
mesos.slave.cpus_percentAllocated CPUs percentNULLPercentage of allocated CPUs
mesos.stats.system.mem_free_bytesMesos stats system mem_free_bytesNULLFree memory
mesos.stats.system.load_5minMesos stats system load_5minNULLLoad average for the past 5 minutes
mesos.state.task.diskTask diskNULLTask disk
mesos.stats.system.cpus_totalNumber of CPUsNULLNumber of CPUs
mesos.slave.gpus_totalGPUs totalNULLNumber of GPUs
mesos.slave.tasks_killedTasks killedNULLNumber of killed tasks

Agent G2 - Linux - Mesos Master Monitoring Template

Description

Mesos Master Monitoring Template

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Mesos Master Monitoring Templatemarathon.queue.offers.reject.lastMarathon Queue Offers Reject LastNULLSummary of unused offers for all last offers
mesos.role.diskMesos Role DiskNULLMesos Role Disk
marathon.queue.offers.unusedMarathon Queue Offers UnusedNULLThe number of unused offers for this launch attempt
dcos.health.admin.router.agentAdmin router agent service healthNULLAdmin Router Agent
mesos.cluster.slaves_unreachableAgents unreachableNULLNumber of unreachable agents. Unreachable agents are periodically garbage collected from the registry, which will cause this value to decrease.
dcos.health.telegraf.socketTelegraf socket healthNULLTelegraf Socket
marathon.instancesMarathon InstancesNULLNumber of instances of a given application
marathon.queue.sizeMarathon Queue SizeNULLNumber of app offer queues
dcos.health.component.package.managerComponent Package Manager (Pkgpanda) service healthNULLDC/OS Component Package Manager (Pkgpanda)
mesos.stats.electedElected as masterNULLWhether this is the elected master
mesos.cluster.mem_percentAllocated memory percentNULLPercentage of allocated memory
mesos.registrar.state_store_ms.p9999Registrar state_store_ms.p9999NULL99.99th percentile registry write latency in ms
marathon.backoffSecondsMarathon Backoff SecondsNULLTask backoff period
dcos.health.telegrafTelegraf service healthNULLTelegraf
mesos.cluster.mem_usedAllocated memoryNULLAllocated memory in MB
mesos.cluster.frameworks_activeFrameworks activeNULLNumber of active frameworks
dcos.health.signal.timerSignal Timer service healthNULLDC/OS Signal Timer
marathon.queue.countMarathon Queue CountNULLNumber of instances left to launch
dcos.health.signalSignal service healthNULLDC/OS Signal
dcos.health.marathonMarathon service healthNULLMarathon
mesos.cluster.invalid_status_updatesInvalid status updatesNULLNumber of invalid status updates
mesos.registrar.state_store_ms.p999Registrar state_store_ms.p999NULL99.9th percentile registry write latency in ms
dcos.health.log.agentAgent Log service healthNULLDC/OS Log Agent
dcos.health.diagnostics.agent.socketDiagnostics Agent socket healthNULLDC/OS Diagnostics Agent Socket
mesos.cluster.disk_usedAllocated disk spaceNULLAllocated disk space in MB
mesos.cluster.gpus_usedNumber of GPUs usedNULLNumber of allocated GPUs
mesos.cluster.tasks_errorInvalid tasksNULLNumber of tasks that were invalid
mesos.registrar.state_store_ms.maxMax registry write latencyNULLMaximum registry write latency in ms
dcos.health.authenticationAuthentication service healthNULLDC/OS Authentication (OAuth)
mesos.cluster.event_queue_http_requestsEvent queue HTTP requestsNULLNumber of HTTP requests in the event queue
marathon.queue.offers.reject.launchMarathon Queue Offers Reject LaunchNULLSummary of unused offers for the launch attempt
mesos.cluster.event_queue_messagesEvent queue messagesNULLNumber of messages in the event queue
mesos.registrar.state_store_ms.p50Registrar state_store_ms.p50NULLMedian registry write latency in ms
mesos.cluster.frameworks_disconnectedFrameworks disconnectedNULLNumber of disconnected frameworks
mesos.cluster.slaves_inactiveAgents inactiveNULLNumber of inactive agents
mesos.stats.system.load_5minMesos stats system load_5minNULLLoad average for the past 5 minutes
mesos.cluster.tasks_failedFailed tasksNULLNumber of failed tasks
mesos.cluster.cpus_totalCPUs totalNULLNumber of CPUs
dcos.health.mesos.masterMesos Master service healthNULLMesos Master
dcos.health.mesos.agent.publicMesos Public Agent service healthNULLMesos Agent Public
mesos.stats.system.load_15minMesos stats system load_15minNULLLoad average for the past 15 minutes
mesos.cluster.valid_status_update_acknowledgementsValid status update acknowledgement messagesNULLNumber of valid status update acknowledgement messages
mesos.registrar.state_store_ms.countRegistry write countNULLRegistry write count
mesos.cluster.slaves_connectedAgents connectedNULLNumber of connected agents
mesos.cluster.tasks_finishedTasks finishedNULLNumber of finished tasks
dcos.health.mesos.agentMesos Agent service healthNULLMesos Agent
dcos.health.mesos.dnsMesos DNS service healthNULLMesos DNS
mesos.cluster.frameworks_inactiveFrameworks inactiveNULLNumber of inactive frameworks
mesos.cluster.gpus_totalGPUs totalNULLNumber of GPUs
mesos.registrar.state_store_msRegistry write latencyNULLRegistry write latency in ms
mesos.framework.memMesos Framework MemoryNULLMesos Framework Memory
marathon.tasksHealthyMarathon Tasks HealthyNULLNumber of healthy tasks for a given application
dcos.health.logrotate.agent.timerLogrotate TimerNULLLogrotate Timer
dcos.health.net.watchdogNet Watchdog service healthNULLDC/OS Net Watchdog
dcos.health.admin.router.masterAdmin router master service healthNULLAdmin Router Master
dcos.health.jobsJobs (Metronome) service healthNULLDC/OS Jobs (Metronome)
dcos.health.resolv.timerGenerate resolv.conf Timer service healthNULLGenerate resolv.conf Timer
mesos.cluster.tasks_startingTasks startingNULLNumber of starting tasks
marathon.deploymentsMarathon DeploymentsNULLNumber of running or pending deployments
dcos.health.package.managerPackage Manager (Cosmos) service healthNULLDC/OS Package Manager (Cosmos)
dcos.health.log.agent.socketAgent Log socket healthNULLDC/OS Log Socket
dcos.health.checks.timerChecks Timer service healthNULLDC/OS Checks Timer
mesos.cluster.cpus_usedNumber of CPUs usedNULLNumber of allocated CPUs
dcos.health.diagnostics.agentDiagnostics Agent service healthNULLDC/OS Diagnostics Agent
marathon.taskRateLimitMarathon Task Rate LimitNULLThe task rate limit for a given application
mesos.cluster.recovery_slave_removalsAgents removalsNULLNumber of agents removed for various reasons, including maintenance.
mesos.stats.system.cpus_totalNumber of CPUsNULLNumber of CPUs
marathon.tasksStagedMarathon Tasks StagedNULLNumber of tasks staged for a given application
mesos.role.cpuMesos Role CPUNULLMesos Role CPU
marathon.queue.offers.processedMarathon Queue Offers ProcessedNULLThe number of processed offers for this launch attempt
mesos.cluster.tasks_runningTasks runningNULLNumber of running tasks
mesos.cluster.event_queue_dispatchesDispatches in the event queueNULLNumber of dispatches in the event queue
dcos.health.log.masterMaster Log service healthNULLDC/OS Log Master
dcos.health.poststart.checksPoststart Checks service healthNULLDC/OS Poststart Checks
mesos.stats.system.mem_free_bytesMesos stats system mem_free_bytesNULLFree memory
mesos.stats.uptime_secsUptime secondsNULLSlave uptime
marathon.diskMarathon DISKNULLConfigured DISKs for each instance of a given application
mesos.registrar.log.recoveredRegistrar log recoveredNULLWhether the replicated log for the registrar has caught up with the other masters in the cluster. A cluster is operational as long as a quorum of "recovered" masters is available in the cluster.
mesos.registrar.queued_operationsMesos Registrar queued_operationsNULLMesos Registrar queued_operations
dcos.health.logrotate.masterMaster Logrotate service healthNULLLogrotate Mesos Master
mesos.cluster.mem_totalMemory totalNULLMemory in MB
marathon.cpusMarathon CPUsNULLConfigured CPUs for each instance of a given application
mesos.cluster.cpus_percentAllocated CPUs percentNULLPercentage of allocated CPUs
mesos.cluster.outstanding_offersOutstanding resource offersNULLNumber of outstanding resource offers
mesos.cluster.slaves_disconnectedAgents disconnectedNULLNumber of disconnected agents
mesos.stats.system.mem_total_bytesMesos stats system mem_total_bytesNULLTotal memory
mesos.cluster.slave_shutdowns_scheduledSlave shutdowns scheduledNULLNumber of slaves which have failed their health check and are scheduled to be removed
mesos.registrar.state_store_ms.p99Registrar state_store_ms.p99NULL99th percentile registry write latency in ms
mesos.cluster.gpus_percentAllocated GPUs percentNULLPercentage of allocated GPUs
marathon.backoffFactorMarathon Backoff FactorNULLBackoff time multiplication factor for each consecutive failed task launch.
dcos.health.historyHistory service healthNULLDC/OS History
dcos.health.checks.apiChecks API service healthNULLDC/OS Checks API
mesos.cluster.tasks_lostTasks lostNULLNumber of lost tasks
dcos.health.resolvGenerate resolv.conf service healthNULLGenerate resolv.conf
mesos.cluster.valid_framework_to_executor_messagesValid framework to executor messagesNULLNumber of valid framework to executor messages
dcos.health.netNet service healthNULLDC/OS Net
mesos.framework.diskMesos Framework DiskNULLMesos Framework Disk
marathon.tasksUnhealthyMarathon Tasks UnhealthyNULLNumber of unhealthy tasks for a given application
mesos.cluster.valid_status_updatesValid status update messagesNULLNumber of valid status update messages
mesos.framework.cpuMesos Framework CPUNULLMesos Framework CPU
mesos.cluster.frameworks_connectedFrameworks connectedNULLNumber of connected frameworks
dcos.health.log.master.socketMaster Log socket healthNULLDC/OS Log Socket
mesos.cluster.invalid_status_update_acknowledgementsNumber of invalid operation status update acknowledgementsNULLNumber of invalid operation status update acknowledgements
mesos.cluster.invalid_framework_to_executor_messagesInvalid framework to executor messagesNULLNumber of invalid framework to executor messages
dcos.health.gc.timerDocker GC TimerNULLDocker GC Timer
mesos.cluster.tasks_killedTasks killedNULLNumber of killed tasks
mesos.stats.system.load_1minMesos stats system load_1minNULLLoad average for the past 1 minute
dcos.health.exhibitorExhibitor service healthNULLExhibitor
mesos.cluster.slave_shutdowns_canceledSlave shutdowns canceledNULLNumber of cancelled slave shutdowns
mesos.role.memMesos Role MemoryNULLMesos Role Memory
mesos.cluster.slave_registrationsSlave registrationsNULLNumber of agent registrations
mesos.cluster.disk_percentAllocated disk space percentNULLPercentage of allocated disk space
mesos.registrar.state_store_ms.p95Registrar state_store_ms.p95NULL95th percentile registry write latency in ms
dcos.health.logrotate.master.timerLogrotate TimerNULLLogrotate Timer
dcos.health.rexrayREX-Ray service healthNULLREX-Ray
mesos.cluster.dropped_messagesDropped messagesNULLNumber of dropped messages
marathon.queue.delayMarathon Queue DelayNULLWait before the next launch attempt
marathon.memMarathon MemoryNULLConfigured memory for each instance of a given application
dcos.health.checks.api.socketChecks API socket healthNULLDC/OS Checks API Socket
mesos.cluster.slave_removalsSlave removalsNULLNumber of agents removed for various reasons, including maintenance
marathon.tasksRunningMarathon Tasks RunningNULLNumber of tasks running for a given application
mesos.registrar.state_fetch_msRegistry read latencyNULLRegistry read latency in ms
mesos.registrar.state_store_ms.p90Registrar state_store_ms.p90NULL90th percentile registry write latency in ms
mesos.cluster.tasks_stagingTasks stagingNULLNumber of staging tasks
mesos.registrar.registry_size_bytesMesos Registrar registry_size_bytesNULLMesos Registrar registry_size_bytes
mesos.cluster.disk_totalDisk space totalNULLDisk space in MB
mesos.cluster.slaves_activeAgents activeNULLNumber of active agents
dcos.health.logrotate.agentAgent Logrotate service healthNULLLogrotate Mesos Agent
mesos.cluster.slave_reregistrationsSlave reregistrationsNULLNumber of agent re-registrations
mesos.registrar.state_store_ms.minMin registry write latencyNULLMinimum registry write latency in ms
dcos.health.gcDocker GCNULLDocker GC
marathon.appsMarathon Applications CountNULLNumber of applications
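The percentage metrics in the table above (mesos.cluster.cpus_percent, mesos.cluster.gpus_percent, mesos.cluster.disk_percent) relate the corresponding "used" and "total" metrics. The following is a minimal sketch of that relationship, not the agent's implementation. The sample values are hypothetical, and the 0-100 scale is an assumption, because the table does not state whether the reported value is a fraction or a percentage.

```python
def allocation_percent(used: float, total: float) -> float:
    """Return used/total on a 0-100 scale, or 0.0 when total is not positive."""
    if total <= 0:
        return 0.0
    return 100.0 * used / total


# Hypothetical sample values keyed by metric names from the table above.
sample = {
    "mesos.cluster.cpus_used": 6.0,
    "mesos.cluster.cpus_total": 24.0,
    "mesos.cluster.disk_used": 512.0,    # MB
    "mesos.cluster.disk_total": 2048.0,  # MB
}

cpu_pct = allocation_percent(
    sample["mesos.cluster.cpus_used"], sample["mesos.cluster.cpus_total"]
)
disk_pct = allocation_percent(
    sample["mesos.cluster.disk_used"], sample["mesos.cluster.disk_total"]
)
print(f"CPU allocated: {cpu_pct:.1f}%, disk allocated: {disk_pct:.1f}%")
```

Guarding against a zero total avoids a division error on clusters that report no resources of a given kind, for example clusters without GPUs.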

Agent G2 - Linux - MongoDB API Based Status and Performance Check

Description

Monitors MongoDB using the MongoDB API.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MongoDB API Based Status and Performance Checkmongo.network.requestsMongoDB Network RequestsNULLNetwork Requests per second
mongo.btreeMongoDB BTreeNULLProvides Index Counters details as Accesses, Hits, Misses, Resets, Miss_ratio
mongo.heap.usageMongoDB Heap UsageNULLHeap Usage
mongo.assertsMongoDB AssertsNULLProvides the Mongo DB asserts
mongo.sizeMongoDB SizeNULLProvides the size of individual databases in Gigabytes
mongo.journaled.statusMongoDB Journaled StatusNULLProvides the Mongo DB journaled status
mongo.memoryMongoDB MemoryNULLMongo DB resident, virtual, mapped memory
mongo.index.miss.ratioMongoDB Index Miss RatioNULLProvides the Mongo DB index miss ratio
mongo.pagefaultsMongoDB Page FaultsNULLPage Faults
mongo.current.queueMongoDB Current QueueNULLCurrent readers and writers in Queue
mongo.lock.percentMongoDB Lock PercentNULLLock percentage
mongo.index.sizeMongoDB Index SizeNULLProvides the Index size of databases in KBs
mongo.journal.commitsMongoDB Journal Commits In WLNULLProvides the number of journal commits that occurred in the write lock
mongo.replication.lagMongoDB Replication LagNULLChecks the replication lag
mongo.op.countersMongoDB OP CountersNULLProvides opcounts for operations like insert, update, query, delete, getmore, command
mongo.cursorsMongoDB CursorsNULLProvides the Mongo DB cursors
mongo.trafficMongoDB TrafficNULLMongoDB current available traffic
mongo.active.clientsMongoDB Active ClientsNULLCurrent active, read and write clients
mongo.connectionsMongoDB ConnectionsNULLMongo DB current and available connections
mongo.avg.flushMongoDB Background Avg FlushNULLProvides the background average flush of dbs
mongo.uptimeMongoDB UptimeNULLUptime in minutes

Agent G2 - Linux - MongoDB Shard (mongos) Performance Check

Description

Monitors the mongos metrics. This requires the python-pymongo package to be installed. Check the OpsRamp documentation for more information.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MongoDB Shard Mongos Performance Checkmongos.chunksMongos ChunksNULLTotal number of mongos chunks
mongos.chunks.balanceMongos Chunks BalanceNULLProvides Mongos chunks balance between database and collections
mongos.shardsMongos ShardsNULLTotal number of mongos shards
mongos.collectionsMongos CollectionsNULLTotal number of mongos sharded collections

Agent G2 - Linux - MongoDB Shard(Mongos) Monitors

Description

Monitors the mongos metrics. This requires the python-pymongo package to be installed. Check the OpsRamp documentation for more information.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MongoDB Shard Mongos Monitorsmongos.chunks.balanceMongos Chunks BalanceNULLProvides Mongos chunks balance between database and collections
mongos.shardsMongos ShardsNULLTotal number of mongos shards
mongos.chunksMongos ChunksNULLTotal number of mongos chunks
mongos.collectionsMongos CollectionsNULLTotal number of mongos sharded collections

Agent G2 - Linux - MongoDB Status and Performance Check

Description

Monitoring Template for Mongo database application. This monitor requires the python-pymongo package to be installed on the server. Check OpsRamp documentation for more information.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MongoDB Status and Performance Checkmongodb.networkrequestsMongoDB-NetworkRequestsNULLNetwork Requests per second
mongodb.sizeMongoDB-SizeNULLProvides the size of individual databases in Gigabytes
mongodb.activeclientsMongoDB-ActiveClientsNULLCurrent active, read and write clients
mongodb.pagefaultsMongoDB-PageFaultsNULLPage Faults
mongodb.replicaprimaryMongoDB-ReplicaPrimaryNULLCheck if the primary server of a replica set has changed
mongodb.btreeMongoDB-BTreeNULLProvides Index Counters details as Accesses, Hits, Resets, Miss_ratio
mongodb.replicationlagMongoDB-ReplicationLagNULLChecks the replication lag
mongodb.opcountersMongoDB-OPCountersNULLProvides opcounts for operations like insert, update, query, delete, getmore, command
mongodb.lockMongoDB-LockNULLLock percentage
mongodb.assertsMongoDB-AssertsNULLThe number of regular, warning, msg, user and rollovers asserts raised since this process started
mongodb.cursorsMongoDB-CursorsNULLThe number of cursors that the server is maintaining for clients and that have timed out since this server was started
mongodb.uptimeMongoDB-UptimeNULLUptime in minutes
mongodb.heapusageMongoDB-HeapUsageNULLHeap Usage
mongodb.connectionsMongoDB-ConnectionsNULLMongoDB current and available connections
mongodb.replicationstateMongoDB-ReplicationStateNULLThe replication state is one of the following: StartingPhase1(0), Primary(1), Secondary(2), Recovering(3), Fatal Error(4), StartingPhase2(5), Arbiter(7), Down(8), Not_Running_with_Replication(-1)
mongodb.indexsizeMongoDB-IndexSizeNULLProvides the Index size of databases in KBs
mongodb.background_avg_flushMongoDB-BackgroundAvgFlushNULLmongod writes to and flushes (fsyncs) the journal immediately. Data files are flushed only occasionally and in the background. By default these flushes occur every 60 seconds.
mongodb.memoryMongoDB-MemoryNULLMongoDB resident, virtual, mapped memory
mongodb.currentqueueMongoDB-CurrentQueueNULLCurrent readers and writers in Queue
mongodb.journaled_statusMongoDB-JournaledStatusNULLThe average amount of data in megabytes written to the recovery log in the last four seconds is the JournaledMB and the data written to the databases datafiles in the last four seconds is writeToDataFilesMB.
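The numeric codes listed for mongodb.replicationstate can be translated into readable names when processing collected values. The following is a minimal sketch using only the codes given in the table above. The helper name and the handling of unlisted codes are assumptions made for illustration.

```python
# Codes and names as listed in the mongodb.replicationstate description.
REPLICATION_STATES = {
    0: "StartingPhase1",
    1: "Primary",
    2: "Secondary",
    3: "Recovering",
    4: "Fatal Error",
    5: "StartingPhase2",
    7: "Arbiter",
    8: "Down",
    -1: "Not_Running_with_Replication",
}


def describe_replication_state(code: int) -> str:
    """Return the state name for a code, or a placeholder for unlisted codes."""
    return REPLICATION_STATES.get(code, f"Unknown({code})")


for code in (1, 2, 8, 6):
    print(code, describe_replication_state(code))
```

Code 6 does not appear in the table, so the sketch reports it as unknown rather than guessing a name.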

Agent G2 - Linux - MongoDB(API) Monitors

Description

Monitors MongoDB by collecting metrics through the MongoDB API.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MongoDB API Monitorsmongo.index.sizeMongoDB Index SizeKBProvides the Index size of databases in KBs
mongo.memoryMongoDB MemoryNULLMongo DB resident, virtual, mapped memory
mongo.sizeMongoDB SizeGBProvides the size of individual databases in Gigabytes
mongo.current.queueMongoDB Current QueueNULLCurrent readers and writers in Queue
mongo.heap.usageMongoDB Heap UsageNULLHeap Usage
mongo.index.miss.ratioMongoDB Index Miss RatioNULLProvides the Mongo DB index miss ratio
mongo.assertsMongoDB AssertsNULLProvides the Mongo DB asserts
mongo.network.requestsMongoDB Network RequestsNULLNetwork Requests per second
mongo.connectionsMongoDB ConnectionsNULLMongo DB current and available connections
mongo.trafficMongoDB TrafficNULLMongoDB current available traffic
mongo.uptimeMongoDB UptimeNULLUptime in minutes
mongo.active.clientsMongoDB Active ClientsNULLCurrent active, read and write clients
mongo.journaled.statusMongoDB Journaled StatusNULLProvides the Mongo DB journaled status
mongo.lock.percentMongoDB Lock PercentNULLLock percentage
mongo.avg.flushMongoDB Background Avg FlushNULLProvides the background average flush of databases
mongo.pagefaultsMongoDB Page FaultsNULLPage Faults
mongo.replication.lagMongoDB Replication LagNULLChecks the replication lag
mongo.journal.commitsMongoDB Journal Commits In WLNULLProvides the number of journal commits that occurred in the write lock
mongo.btreeMongoDB BTreeNULLProvides Index Counters details as Accesses, Hits, Misses, Resets, Miss_ratio
mongo.cursorsMongoDB CursorsNULLProvides the Mongo DB cursors
mongo.op.countersMongoDB OP CountersNULLProvides opcounts for operations like insert, update, query, delete, getmore, command

Agent G2 - Linux - Mongos Monitors

Description

Monitors the mongos metrics.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Mongos Monitorsmongos.chunksMongos ChunksNULLTotal number of mongos chunks
mongos.chunks.balanceMongos Chunks BalanceNULLProvides Mongos chunks balance between database and collections
mongos.shardsMongos ShardsNULLTotal number of mongos shards
mongos.collectionsMongos CollectionsNULLTotal number of mongos sharded collections

Agent G2 - Linux - MySQL Advanced Monitors

Description

It monitors the mysql.Innodb_log_waits, mysql.long_running_procs, and mysqlindex_usage metrics.

Prerequisites

Provide database names as input arguments separated by commas when applying the template at the device level.

Syntax: DBName1,DBName2
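As an illustration of the comma-separated syntax, the following sketch splits such an argument into individual database names. The function name is hypothetical, and whether the agent itself tolerates surrounding whitespace or empty entries is not stated here, so that leniency is an assumption.

```python
from typing import List


def parse_db_names(arg: str) -> List[str]:
    """Split a 'DBName1,DBName2' argument into a list of database names."""
    names = [name.strip() for name in arg.split(",")]
    # Drop empty entries produced by stray or trailing commas.
    return [name for name in names if name]


print(parse_db_names("DBName1,DBName2"))
```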

Parameters

Name | Description | Default value
MySQL DBname | Dbnames to collect the metric data | NA
MySQL IPAddress | IPAddress of the server where mysql is running | 127.0.0.1
MySQL Password | Password of the given username | NA
MySQL Port | Port on which mysql is running | 3306
MySQL Username | Username to connect to mysql | NA

Note: All field attributes are mandatory. Please use default values wherever applicable.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MySQL Advanced Monitorsmysql.Innodb_log_waitsMysql Innodb log waitsNULLThe number of times that the log buffer was too small, and a wait was required for it to be flushed before continuing.
mysql.long_running_procsMysql long running processNULLTotal number of processes running longer than 1 minute in a specific database.
mysqlindex_usageMysql Index UsageNULLThe sum of index lengths for the tables within the specified database.
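The three metrics above can be approximated with standard MySQL queries. The sketch below pairs each metric with a candidate query. These queries are an assumption about how such values could be gathered, not the queries the agent actually runs. Excluding sleeping connections and the `%s` placeholder style (as used by common Python MySQL drivers) are also assumptions.

```python
from typing import List, Sequence, Tuple

# Candidate queries; the agent's real queries are not documented here.
LOG_WAITS_SQL = "SHOW GLOBAL STATUS LIKE 'Innodb_log_waits'"
LONG_RUNNING_SQL = (
    "SELECT COUNT(*) FROM information_schema.PROCESSLIST "
    "WHERE DB = %s AND COMMAND <> 'Sleep' AND TIME > %s"
)
INDEX_USAGE_SQL = (
    "SELECT COALESCE(SUM(INDEX_LENGTH), 0) "
    "FROM information_schema.TABLES WHERE TABLE_SCHEMA = %s"
)

Check = Tuple[str, str, tuple]


def build_checks(db_names: Sequence[str], threshold_secs: int = 60) -> List[Check]:
    """Return (metric name, query, parameters) for each check to execute."""
    checks: List[Check] = [("mysql.Innodb_log_waits", LOG_WAITS_SQL, ())]
    for db in db_names:
        checks.append(("mysql.long_running_procs", LONG_RUNNING_SQL, (db, threshold_secs)))
        checks.append(("mysqlindex_usage", INDEX_USAGE_SQL, (db,)))
    return checks


for metric, query, params in build_checks(["DBName1", "DBName2"]):
    print(metric, params)
```

Passing the database name as a query parameter rather than formatting it into the SQL string avoids quoting problems with unusual database names.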

Agent G2 - Linux - MySQL Global Performance Statistics

Description

Monitoring template for MySQL application. Monitors command statistics, select statistics, slave status, threads, etc.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MySQL Global Performance Statisticsmysql.trafficMySQL-TrafficNULL
mysql.threads_connectedMySQL-ThreadsConnectedNULLThe number of currently open connections.
mysql.command_statisticsMySQL-CommandStatisticsNULL
mysql.select_statisticsMySQL-SelectStatisticsNULL
mysql.slave_statusMySQL-SlaveStatusNULLIf a database has replication configured, this monitor checks the status of the slave instance as returned by the command SHOW SLAVE STATUS. It also checks the number of seconds the slave is behind the master and whether the IO and SQL threads are running on the slave.
mysql.threadsMySQL-ThreadsNULLCached threads - The number of threads in the thread cache. Connected threads - The number of currently open connections. Running threads - The number of threads that are not sleeping.
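The checks described for mysql.slave_status map onto three fields of the SHOW SLAVE STATUS output: Slave_IO_Running, Slave_SQL_Running, and Seconds_Behind_Master. The following is a minimal sketch of how a result row could be evaluated. The function name, the returned labels, and the 300-second lag threshold are hypothetical and are not taken from the monitor.

```python
from typing import Mapping, Optional


def evaluate_slave_status(row: Optional[Mapping[str, object]],
                          max_lag_secs: int = 300) -> str:
    """Classify one SHOW SLAVE STATUS row as ok, warning, critical, or not_configured."""
    if not row:
        # An empty result means replication is not configured on this server.
        return "not_configured"
    io_ok = row.get("Slave_IO_Running") == "Yes"
    sql_ok = row.get("Slave_SQL_Running") == "Yes"
    lag = row.get("Seconds_Behind_Master")
    # Seconds_Behind_Master is NULL (None) when replication is not running.
    if not (io_ok and sql_ok) or lag is None:
        return "critical"
    if int(lag) > max_lag_secs:
        return "warning"
    return "ok"


healthy = {"Slave_IO_Running": "Yes", "Slave_SQL_Running": "Yes", "Seconds_Behind_Master": 4}
print(evaluate_slave_status(healthy))
```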

Agent G2 - Linux - MySQL InnoDB Statistics

Description

Monitoring template for MySQL InnoDB application statistics. Monitors InnoDB buffer pool hit rate, buffer pool pages data, buffer pool pages dirty, etc.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MySQL InnoDB Statisticsmysql.inno.db_buffer_pool_pages_freeMySQL-InnoDBBufferPoolPagesFreeNULLThe number of free pages.
mysql.inno.db_data_readMySQL-InnoDBDataReadNULLThe amount of data read since the server was started.
mysql.inno.db_buffer_pool_pages_flushedMySQL-InnoDBBufferPoolPagesFlushedNULLThe number of buffer pool page-flush requests.
mysql.inno.db_log_flushed_uptoMySQL-InnoDBLogFlushedUptoNULLLog flushed up to.
mysql.mutex_spin_waitsMySQL-MutexSpinWaitsNULLSemaphores-Mutex Spin Waits.
mysql.inno.db_data_writesMySQL-InnoDBDataWritesNULLThe total number of data writes.
mysql.inno.db_buffer_pool_hit_rateMySQL-InnoDBBufferPoolHitRateNULLBuffer Pool hit rate.
mysql.inno.db_data_writtenMySQL-InnoDBDataWrittenNULLThe amount of data written so far, in bytes.
mysql.inno.db_row_lock_timeMySQL-InnoDBRowLockTimeNULLThe total time spent in acquiring row locks, in milliseconds.
mysql.log_sequence_numberMySQL-LogSequenceNumberNULLLog Sequence number.
mysql.queries_inside_innodb_coreMySQL-QueriesInsideInnoDBCoreNULLRow Operations - Queries inside InnoDB core.
mysql.inno.dbio_capacityMySQL-InnoDBIOCapacityNULL
mysql.inno.db_transaction_history_lengthMySQL-InnoDBTransactionHistoryLengthNULL
mysql.inno.db_flush_log_at_trx_commitMySQL-InnoDBFlushLogAtTRXCommitNULL
mysql.inno.db_buffer_pool_pages_miscMySQL-InnoDBBufferPoolPagesMiscNULLThe number of pages that are busy because they have been allocated for administrative overhead such as row locks or the adaptive hash index.
mysql.inno.db_queries_queueMySQL-InnoDBQueriesQueueNULLRow Operations - Queries in queue.
mysql.inno.db_row_lock_waitsMySQL-InnoDBRowLockWaitsNULLThe number of times a row lock had to be waited for.
mysql.inno.db_data_readsMySQL-InnoDBDataReadsNULLThe total number of data reads.
mysql.open_read_views_inside_innodb_coreMySQL-OpenReadViewsInsideInnoDBCoreNULLRow Operations - Read views open inside InnoDB.
mysql.mutex_spin_roundsMySQL-MutexSpinRoundsNULLSemaphores-Mutex Spin Rounds.
mysql.inno.db_buffer_pool_pages_dataMySQL-InnoDBBufferPoolPagesDataNULLThe number of pages containing data (dirty or clean).
mysql.inno.db_buffer_pool_pages_dirtyMySQL-InnoDBBufferPoolPagesDirtyNULLThe number of pages currently dirty.
mysql.inno.db_log_last_check_pointMySQL-InnoDBLogLastCheckPointNULLLog Last check point.
mysql.inno.db_buffer_pool_read_requestsMySQL-InnoDBBufferPoolReadRequestsNULLThe number of logical read requests InnoDB has done.
mysql.inno.db_non_hash_searchesMySQL-InnoDBNonHashSearchesNULLNon-Hash Searches.
mysql.inno.db_hash_searchesMySQL-InnoDBHashSearchesNULLHash Searches.
mysql.inno.db_force_recoveryMySQL-InnoDBForceRecoveryNULLCrash recovery mode, the possible values are 0-6: Normal Startup(0), Server Starts even if it detects a corrupt page(1), Prevent Master Thread from running(2), No Transaction Rollbacks after crash recovery(3), Prevent insert buffer merge Operations(4), Ignores undo logs-InnoDB treats incomplete Transactions as Committed(5), Ignores Redo log roll-forward in connection with recovery.(6).
mysql.inno.db_buffer_pool_sizeMySQL-InnoDBBufferPoolSizeNULLThe size in bytes of the memory buffer InnoDB uses to cache data and indexes of its tables. The default value is 8MB. The larger you set this value, the less disk I/O is needed to access data in tables.
mysql.inno.db_data_fsyncsMySQL-InnoDBDataFSyncsNULLThe number of fsync operations so far.
mysql.inno.db_buffer_pool_readsMySQL-InnoDBBufferPoolReadsNULLThe number of logical reads that InnoDB could not satisfy from the buffer pool, and had to read directly from the disk.
mysql.inno.db_log_file_sizeMySQL-InnoDBLogFileSizeNULLThe size in bytes of each log file in a log group. The combined size of log files must be less than 4GB. The default value is 5MB. Sensible values range from 1MB to 1/N-th of the size of the buffer pool, where N is the number of log files in the group.
mysql.mutex_os_waitsMySQL-MutexOSWaitsNULLSemaphores-Mutex OS Waits.
mysql.inno.db_buffer_pool_pages_totalMySQL-InnoDBBufferPoolPagesTotalNULLThe total size of the buffer pool, in pages.
mysql.inno.db_log_buffer_sizeMySQL-InnoDBLogBufferSizeNULLThe size in bytes of the buffer that InnoDB uses to write to the log files on disk. The default value is 1MB for the built-in InnoDB, 8MB for InnoDB Plugin. Sensible values range.
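The buffer pool hit rate is commonly derived from two counters in the table above: logical read requests (mysql.inno.db_buffer_pool_read_requests) and reads that had to go to disk (mysql.inno.db_buffer_pool_reads). The sketch below shows that widely used formula. Whether the monitor computes mysql.inno.db_buffer_pool_hit_rate in exactly this way is an assumption, and the sample counter values are hypothetical.

```python
def buffer_pool_hit_rate(read_requests: int, disk_reads: int) -> float:
    """Percentage of logical read requests served from the buffer pool."""
    if read_requests <= 0:
        # No requests yet, so nothing has missed; reporting 100 is a convention choice.
        return 100.0
    return 100.0 * (read_requests - disk_reads) / read_requests


# Hypothetical counter values.
rate = buffer_pool_hit_rate(read_requests=1_000_000, disk_reads=2_500)
print(f"Buffer pool hit rate: {rate:.2f}%")
```

Both counters are cumulative since server start, so computing the rate from the difference between two samples gives a better picture of recent behavior than a single reading.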

Agent G2 - Linux - MySQL Monitors

Description

Monitoring template for MySQL application. Monitors command statistics, select statistics, slave status, threads, etc.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MySQL Monitorsmysql.trafficMySQL-TrafficNULL
mysql.threads_connectedMySQL-ThreadsConnectedNULLThe number of currently open connections.
mysql.command_statisticsMySQL-CommandStatisticsNULL
mysql.select_statisticsMySQL-SelectStatisticsNULL
mysql.slave_statusMySQL-SlaveStatusNULLIf a database has replication configured, this monitor checks the status of the slave instance as returned by the command SHOW SLAVE STATUS. It also checks the number of seconds the slave is behind the master and whether the IO and SQL threads are running on the slave.
mysql.threadsMySQL-ThreadsNULLCached threads - The number of threads in the thread cache. Connected threads - The number of currently open connections. Running threads - The number of threads that are not sleeping.

Agent G2 - Linux - MySQL Variable Statistics

Description

Monitoring template for MySQL application statistics. Monitors aborted clients, aborted connects, command statistics, foreign key checks, log warnings, etc.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - MySQL Variable Statisticsmysql.threads_createdMySQL-ThreadsCreatedNULLThe number of threads created to handle connections.
mysql.query_cache_free_memoryMySQL-QueryCacheFreeMemoryNULLThe amount of free memory for the query cache.
mysql.max_heap_table_sizeMySQL-MaxHeapTableSizeNULLThis variable sets the maximum size to which user-created MEMORY tables are permitted to grow. The value of the variable is used to calculate MEMORY table MAX_ROWS values.
mysql.threads_cachedMySQL-ThreadsCachedNULLThe number of threads in the thread cache.
mysql.trafficMySQL-TrafficNULL
mysql.long_query_timeMySQL-LongQueryTimeNULLIf a query takes longer than this many seconds, the server increments the Slow_queries status variable.
mysql.tmp_table_sizeMySQL-TMPTableSizeNULLThe maximum size of internal in-memory temporary tables. If an in-memory temporary table exceeds the limit, MySQL automatically converts it to an on-disk MyISAM table.
mysql.queriesMySQL-QueriesNULL
mysql.aborted_connectsMySQL-AbortedConnectsNULLThe number of failed attempts to connect to the MySQL server.
mysql.query_cache_sizeMySQL-QueryCacheSizeNULLMax amount of data that can be stored in the cache.
mysql.select_statisticsMySQL-SelectStatisticsNULL
mysql.threads_runningMySQL-ThreadsRunningNULLThe number of threads that are not sleeping.
mysql.query_cache_total_blocksMySQL-QueryCacheTotalBlocksNULLThe total number of blocks in the query cache.
mysql.query_cache_free_blocksMySQL-QueryCacheFreeBlocksNULLThe number of free memory blocks in the query cache.
mysql.max_used_connectionsMySQL-MaxUsedConnectionsNULLMax number of connections that have been in use simultaneously since the server started.
mysql.query_cache_low_memory_purnesMySQL-QueryCacheLowMemoryPurnesNULLThe number of queries that were deleted from the query cache because of low memory.
mysql.slave_statusMySQL-SlaveStatusNULLIf a database has replication configured, this monitor checks the status of the slave instance as returned by the command SHOW SLAVE STATUS. It also checks the number of seconds the slave is behind the master and whether the IO and SQL threads are running on the slave.
mysql.thread_cache_sizeMySQL-ThreadCacheSizeNULLHow many threads the server should cache for reuse. When a client disconnects, the client's threads are put in the cache if there are fewer than this monitor value.
mysql.query_cache_not_cachedMySQL-QueryCacheNotCachedNULLThe number of noncached queries.
mysql.query_cache_insertsMySQL-QueryCacheInsertsNULLThe number of queries added to the query cache.
mysql.table_definition_cacheMySQL-TableDefinitionCacheNULLThe number of table definitions (from .frm files) that can be stored in the definition cache.
mysql.log_warningsMySQL-LogWarningsNULLWhether to produce additional warning messages. It is enabled (1) by default and can be disabled by setting it to 0.
mysql.aborted_clientsMySQL-AbortedClientsNULLThe number of connections that were aborted because the client died without closing the connection properly.
mysql.query_cache_queries_in_cacheMySQL-QueryCacheQueriesInCacheNULLThe number of queries registered in the query cache.
mysql.threads_connectedMySQL-ThreadsConnectedNULLThe number of currently open connections.
mysql.foreign_key_checksMySQL-ForeignKeyChecksNULLBy default foreign key constraints for InnoDB tables are checked and the value will be 1. If set to 0, they are ignored. Disabling foreign key checking can be useful for reloading InnoDB tables in an order different from that required by their parent/child relationships.
mysql.query_cache_hitsMySQL-QueryCacheHitsNULLThe number of query cache hits.
mysql.open_table_cacheMySQL-OpenTableCacheNULLThe number of open tables for all threads.
mysql.max_user_connectionsMySQL-MaxUserConnectionsNULLMax number of simultaneous connections permitted to any given MySQL user. Default value is 0 (no limit).
mysql.command_statisticsMySQL-CommandStatisticsNULL
mysql.thread_stack_sizeMySQL-ThreadStackSizeNULLThe stack size for each thread. Many of the limits detected by the crash-me test are dependent on this value.

Agent G2 - Linux - NFS Stats Monitors

Description

Agent G2 - Linux - NFS Stats Monitors

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - NFS Stats Monitorsnfs.lookup.opsNFS Lookup OperationsNULLThe number of lookup operations per second issued to the filesystem.
nfs.lookup.exeNFS Lookup Op Request Execution TimeNULLThe duration from the time that the NFS client issues the RPC lookup request to its kernel until the RPC request is completed. This includes the RTT time.
nfs.server.write.mbytesNFS Server Written DataNULLThe number of megabytes written to the NFS server by the NFS client via an NFS WRITE request.
nfs.read.retransmissionsNFS Read RetransmissionsNULLThe number of retransmissions.
nfs.write.rttNFS Write Op Request RTT TimeNULLThe duration from the time that client kernel sends the RPC write request until the time it receives the reply.
nfs.read.opsNFS Read OperationsNULLThe number of read operations per second issued to the filesystem.
nfs.lookup.retransmissionsNFS Lookup RetransmissionsNULLThe number of retransmissions.
nfs.lookup.kbytesNFS Lookup Op DataNULLThe amount of lookup data, in kB, per second.
nfs.write.opsNFS Write OperationsNULLThe number of write operations per second issued to the filesystem.
nfs.directory.lookupsNFS Directory LookupsNULLThe number of name lookups performed in directories.
nfs.write.kbytesNFS Write Op DataNULLThe amount of data, in kB, written per second.
nfs.app.write.mbytesNFS Application Written DataNULLThe number of megabytes written by applications using the NFS mounted filesystem with the write(2) system call.
nfs.app.write.direct.mbytesNFS Application Written (O_DIRECT) DataNULLThe number of megabytes written to files opened with the O_DIRECT flag.
nfs.backlogNFS BacklogNULLThe length of the backlog queue.
nfs.read.kbytesNFS Read Op DataNULLThe amount of data, in kB, read per second.
nfs.server.read.mbytesNFS Server Read DataNULLThe number of megabytes read from the NFS server by the NFS client via an NFS READ request.
nfs.lookup.rttNFS Lookup Op Request RTT TimeNULLThe duration from the time that client kernel sends the RPC lookup request until the time it receives the reply.
nfs.open.operationsNFS Open OperationsNULLThe number of times files or directories have been opened.
nfs.write.sizeNFS Write Op Data SizeNULLThe average number of kB written per operation.
nfs.write.exeNFS Write Op Request Execution TimeNULLThe duration from the time that the NFS client issues the RPC write request to its kernel until the RPC request is completed. This includes the RTT time.
nfs.read.sizeNFS Read Op Data SizeNULLThe average number of kB read per operation.
nfs.app.read.mbytesNFS Application Read DataNULLThe number of megabytes read by applications using the NFS mounted filesystem with the read(2) system call.
nfs.opsNFS OperationsNULLThe number of operations per second issued to the filesystem.
nfs.ageNFS Mount AgeNULLThe duration of time that the share was mounted in minutes.
nfs.lookup.sizeNFS Lookup Op Data SizeNULLThe average data size, in kB, of each lookup operation.
nfs.rpc.badxidsNFS RPC Bad XIDsNULLThe number of unmatchable XIDs that have been received.
nfs.read.rttNFS Read Op Request RTT TimeNULLThe duration from the time that client kernel sends the RPC read request until the time it receives the reply.
nfs.app.read.direct.mbytesNFS Application Read (O_DIRECT) DataNULLThe number of megabytes read from files opened with the O_DIRECT flag.
nfs.read.exeNFS Read Op Request Execution TimeNULLThe duration from the time that the NFS client issues the RPC read request to its kernel until the RPC request is completed. This includes the RTT time.
nfs.write.retransmissionsNFS Write RetransmissionsNULLThe number of retransmissions.

Agent G2 - Linux Network Interface

Description

Monitors the performance and status of network interfaces on Linux systems. It captures and analyzes ten essential metrics, including traffic, packet rates, errors, discards, operational states, and output queue lengths.

Prerequisites

NA

Supported Metrics

Metric NameMetric Display NameMetric DescriptionUnit
Agent G2 - Linux Network Interfacesystem_linux_network_interface_trafficInSystem Linux Network Interface TrafficInMonitors the incoming network traffic on a Linux network interface. It indicates the amount of data received by the interface.Kbps
system_linux_network_interface_trafficOutSystem Linux Network Interface TrafficOutMonitors the outgoing network traffic on a Linux network interface. It indicates the amount of data transmitted by the interface.Kbps
system_linux_network_interface_packetsInSystem Linux Network Interface PacketsInMonitors the incoming network packets (number of packets) on a Linux network interface. It counts the number of packets received by the interface.packets/sec
system_linux_network_interface_packetsOutSystem Linux Network Interface PacketsOutMonitors the outgoing network packets (number of packets) on a Linux network interface. It counts the number of packets transmitted by the interface.packets/sec
system_linux_network_interface_errorsInSystem Linux Network Interface ErrorsInMonitors the incoming network errors (number of errors) on a Linux network interface. It counts the number of errors encountered while receiving data.Errors per Sec
system_linux_network_interface_errorsOutSystem Linux Network Interface ErrorsOutMonitors the outgoing network errors (number of errors) on a Linux network interface. It counts the number of errors encountered while transmitting data.Errors per Sec
system_linux_network_interface_discardsInSystem Linux Network Interface DiscardsInMonitors the incoming discarded network packets (number of discarded packets) on a Linux network interface. It counts the number of packets that were discarded upon receipt.packets/sec
system_linux_network_interface_discardsOutSystem Linux Network Interface DiscardsOutMonitors the outgoing discarded network packets (number of discarded packets) on a Linux network interface. It counts the number of packets that were discarded upon transmission.packets/sec
system_linux_network_interface_operationalStateSystem Linux Network Interface OperationalStateMonitors operational state of a Linux network interface. It typically provides information such as whether the interface is up or down.NA
system_linux_network_interface_outputQueueLengthSystem Linux Network Interface OutputQueueLengthMonitors the length of the output queue on a Linux network interface. It indicates the number of packets queued up for transmission.count
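
The traffic, packet, error, and discard metrics above are rates, which are usually derived from two samples of cumulative interface counters. The following is a minimal sketch of that calculation, assuming the counters come from text in the /proc/net/dev format and that Kbps means kilobits (1000 bits) per second. The function names and result keys are hypothetical and do not describe the agent's actual implementation.

```python
def parse_net_dev(text):
    """Return per-interface cumulative counters from /proc/net/dev-formatted text."""
    counters = {}
    for line in text.splitlines():
        if ":" not in line:
            continue  # the two header lines contain no colon
        name, data = line.split(":", 1)
        f = [int(v) for v in data.split()]
        # Receive columns come first (0-7), transmit columns follow (8-15).
        counters[name.strip()] = {
            "rx_bytes": f[0], "rx_packets": f[1], "rx_errs": f[2], "rx_drop": f[3],
            "tx_bytes": f[8], "tx_packets": f[9], "tx_errs": f[10], "tx_drop": f[11],
        }
    return counters


def interface_rates(prev, curr, interval_s):
    """Convert two counter samples taken interval_s seconds apart into rates."""
    def per_sec(key):
        return (curr[key] - prev[key]) / interval_s

    return {
        "trafficIn_kbps": per_sec("rx_bytes") * 8 / 1000,   # assumption: 1 Kb = 1000 bits
        "trafficOut_kbps": per_sec("tx_bytes") * 8 / 1000,
        "packetsIn_per_sec": per_sec("rx_packets"),
        "packetsOut_per_sec": per_sec("tx_packets"),
        "errorsIn_per_sec": per_sec("rx_errs"),
        "errorsOut_per_sec": per_sec("tx_errs"),
        "discardsIn_per_sec": per_sec("rx_drop"),
        "discardsOut_per_sec": per_sec("tx_drop"),
    }
```

On a live system, the two samples would be taken by reading /proc/net/dev twice, a known number of seconds apart, and passing the same interface's entries to interface_rates.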

Agent G2 - Linux NFS Mount Point Monitoring - v5

Description

Monitors the availability, accessibility, and utilization of Linux NFS mount points. Version 5 of the template adds the ability to exclude specific users, supplied as input, when checking mount point writability. Users can also choose between monitoring all mounts and monitoring only permanent mount points (that is, only NFS mounts present in /etc/fstab).

Pre-Requisites: NFS must be installed and one or more NFS mount points must be available on the target device. The agent must be installed as root.

Template Usage Guidelines

When applying this template to a device, provide the input parameters in one of the following two formats only (case-insensitive):

Format 1: MountType:All;ExcludedUsers:nobody,nfsnobody,anon
(Monitors both temporary and permanent mount points. This is the default value.)

Format 2: MountType:Permanent;ExcludedUsers:nobody,nfsnobody,anon
(Monitors only permanent mounts, which are those listed in the /etc/fstab file.)
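
To illustrate how the two input formats break down, here is a minimal sketch of a parser for the parameter string. The function name and the returned dictionary layout are hypothetical. The sketch only shows the key:value;key:value structure and the case-insensitive matching described above, not how the template itself processes the input.

```python
def parse_template_input(raw):
    """Parse 'MountType:<All|Permanent>;ExcludedUsers:<user1,user2,...>'."""
    # Defaults mirror Format 1 (all mounts); an empty exclusion list is an assumption.
    settings = {"mount_type": "all", "excluded_users": []}
    for part in raw.split(";"):
        if not part.strip():
            continue
        key, _, value = part.partition(":")
        key, value = key.strip().lower(), value.strip()
        if key == "mounttype":
            if value.lower() not in ("all", "permanent"):
                raise ValueError(f"unsupported MountType: {value!r}")
            settings["mount_type"] = value.lower()
        elif key == "excludedusers":
            settings["excluded_users"] = [u.strip() for u in value.split(",") if u.strip()]
        else:
            raise ValueError(f"unknown parameter: {key!r}")
    return settings
```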

Prerequisites

NFS must be installed and at least one NFS mount point must be available on the target device. The agent must be installed as root.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux NFS Mount Point Custom Monitor - v5system_linux_nfs_mountpoint_utilizationSystem Linux NFS Mountpoint UtilizationPercentageMonitors the NFS Mount points utilization.
system_linux_nfs_mountpoint_accessibilitySystem Linux NFS Mountpoint Accessibility-Monitors the accessibility of NFS mount points. An accessible mount point (marked as "0" in the output) typically has both read and write access; an inaccessible mount point (marked as "1" in the output) is usually not writable.
system_linux_nfs_mountpoint_availabilitySystem Linux NFS Mountpoint Availability-Monitors the availability of NFS mount points. Availability is typically determined by comparing the current list of NFS shares with the list stored from the previous state. If a mount point was present in the previous state but is not in the current state, it is marked as "1" (not available). If it is present in both states, it is marked as "0" (available).
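
The availability rule described above can be sketched as a comparison of two lists. This is an illustration only: the function name is hypothetical, and mount points that appear for the first time in the current state are left out of the result because the description does not say how they are reported.

```python
def availability_status(previous_mounts, current_mounts):
    """Map each previously seen mount point to 0 (available) or 1 (not available)."""
    current = set(current_mounts)
    # Present in both states -> 0; present before but missing now -> 1.
    return {mount: (0 if mount in current else 1) for mount in previous_mounts}
```

For example, with a previous state of /mnt/data and /mnt/backup and a current state containing only /mnt/data, /mnt/backup is reported as 1 and /mnt/data as 0.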

Agent G2 - Linux NFS Mount Point Monitoring - v4

Description

Monitors the availability, accessibility, and utilization of Linux NFS mount points. Version 4 of the template adds monitoring of user-specific mount accessibility. Users can also choose between monitoring all mounts and monitoring only permanent mount points (that is, only NFS mounts present in /etc/fstab).

Template Usage Guidelines:

When applying this template to a device, provide the input parameter in one of the following two formats only (case-insensitive).

  • Format 1: All (Monitors both temporary and permanent mount points)

  • Format 2: Permanent (Monitors only permanent mounts, which are those listed in the /etc/fstab file)
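
As an illustration of what "permanent" means here, the following sketch lists the NFS mount points declared in fstab-formatted text. It assumes the standard fstab column order (device, mount point, filesystem type, options, dump, pass) and treats the types nfs and nfs4 as NFS mounts. The function name is hypothetical and the template's own selection logic may differ.

```python
def permanent_nfs_mounts(fstab_text):
    """Return mount points of NFS entries found in fstab-formatted text."""
    mounts = []
    for line in fstab_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split()
        # fields[1] is the mount point, fields[2] is the filesystem type
        if len(fields) >= 3 and fields[2] in ("nfs", "nfs4"):
            mounts.append(fields[1])
    return mounts
```

On a target device, the text would come from reading /etc/fstab.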

Prerequisites

NFS must be installed and NFS mount points must be available on the target device.

Supported Metric

Monitor NameMonitor DescriptionMetric NameMetric Display NameUnitMetric Description
Agent G2 - Linux NFS Mount Point Custom Monitor - v4Monitors the Linux NFS mount points availability, accessibility, and utilizationsystem_linux_nfs_mountpoint_utilizationSystem Linux NFS Mountpoint UtilizationPercentageMonitors the NFS Mount points utilization.
system_linux_nfs_mountpoint_accessibilitySystem Linux NFS Mountpoint AccessibilityMonitors the accessibility of NFS Mount points. An accessible NFS mount point typically means both read and write access (marked as “0” in output), and inaccessibility (marked as “1” in output) often means it’s not writable.
system_linux_nfs_mountpoint_availabilitySystem Linux NFS Mountpoint AvailabilityMonitors the availability of NFS Mount points. The availability status of mount points is often determined by checking whether the NFS mount point is present in the current list of NFS shares compared to the list stored in the previous state. If a mount point was present in the previous state but is not in the current state, it is marked as “1” (not available). If it is present in both states, it is marked as “0” (available).

Agent G2 - Linux - Nginx Monitors

Description

Monitors Nginx application metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Nginx Monitorsnginx.connections.writingNginx Writing ConnectionsNULLNginx reads the request body, processes the request or writes response to a client.
nginx.perconnections_requestsNginx Requests per connectionRequests / connAverage number of requests per connection.
nginx.connections.readingNginx Reading ConnectionsNULLNginx reads the request header.
nginx.connections.activeNginx ConnectionsActive ConnectionsNumber of all open connections including connections to backends.
nginx.requests.handledNginx Handled RequestsrequestsThe number of requests served by connections handled by Nginx.
nginx.requests_rateNginx Requests Raterequests / secAverage number of requests per second.
nginx.connections_rateNginx Connections RateConnections / secAverage number of connections per second.
nginx.response.timeNginx Response TimeSecondsTime taken for Nginx request-response.
nginx.connections.waitingNginx Waiting ConnectionsNULLThe number of keep-alive connections, calculated as active - (reading + writing).

Agent G2 - Linux - NGINX Perfomance Check

Description

Monitors Nginx application metrics. This requires Nginx to be compiled with the Stub Status module.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - NGINX Perfomance Checknginx.requests.handledNginx Handled RequestsNULLThe number of requests served by connections handled by Nginx.
nginx.connections.readingNginx Reading ConnectionsNULLNginx reads the request header.
nginx.response.timeNginx Response TimeNULLTime taken for Nginx request-response.
nginx.requests_rateNginx Requests RateNULLAverage number of requests per second.
nginx.connections.writingNginx Writing ConnectionsNULLNginx reads the request body, processes the request or writes response to a client.
nginx.perconnections_requestsNginx Requests per connectionNULLAverage number of requests per connection.
nginx.connections.waitingNginx Waiting ConnectionsNULLThe number of keep-alive connections, calculated as active - (reading + writing).
nginx.connections_rateNginx Connections RateNULLAverage number of connections per second.
nginx.connections.activeNginx ConnectionsNULLNumber of all open connections including connections to backends.
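
Because this template relies on the Stub Status module, the connection metrics can be related back to the plain-text status page that the module serves. The sketch below parses that text and derives the waiting count and the requests-per-connection ratio. The function name and result keys are hypothetical, and dividing requests by handled connections is an assumption about how the per-connection average is computed.

```python
import re


def parse_stub_status(text):
    """Extract counters from the text served by the nginx stub_status module."""
    active = int(re.search(r"Active connections:\s*(\d+)", text).group(1))
    accepts, handled, requests = map(
        int, re.search(r"requests\s+(\d+)\s+(\d+)\s+(\d+)", text).groups()
    )
    reading, writing, waiting = map(
        int,
        re.search(r"Reading:\s*(\d+)\s+Writing:\s*(\d+)\s+Waiting:\s*(\d+)", text).groups(),
    )
    return {
        "active": active,
        "accepts": accepts,
        "handled": handled,
        "requests": requests,
        "reading": reading,
        "writing": writing,
        "waiting": waiting,
        # Waiting connections as described above: active - (reading + writing)
        "waiting_derived": active - (reading + writing),
        # Assumption: average requests per connection = requests / handled
        "requests_per_connection": requests / handled if handled else 0.0,
    }
```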

Agent G2 - Linux - NTP Monitoring Offset

Description

Monitors the NTP offset in seconds.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - NTP Monitoring Offsetntp.offsetNtp OffsetNULLThe time difference between the local clock and the NTP reference clock

Agent G2 - Linux NTP Statistics

Description

Monitors the NTP offset and jitter values of the system peer only (the remote NTP server marked with a leading *) on Linux systems.

Prerequisites

NTP must be installed, and root privileges are required to execute the script.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux NTP Statisticssystem_linux_NTP_offsetSystem Linux NTP OffsetmillisecondsMonitors the time offset between the system clock and the Network Time Protocol (NTP) reference, indicating how much the system clock is ahead or behind the NTP reference.
system_linux_NTP_jitterSystem Linux NTP JittermillisecondsMonitors the jitter in the Network Time Protocol (NTP) synchronization, which reflects the variability in timekeeping accuracy. It measures the short-term fluctuations in the time offset between the system clock and the NTP reference.
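
To show what "system peer" refers to, the sketch below picks the line marked with a leading * out of peer-listing text and reads its offset and jitter. It assumes the listing follows the column layout printed by ntpq -p (remote, refid, st, t, when, poll, reach, delay, offset, jitter), with offset and jitter in milliseconds. The function name is hypothetical and the monitor's actual script may collect these values differently.

```python
def system_peer_stats(ntpq_output):
    """Return offset and jitter of the system peer, or None if no peer is selected."""
    for line in ntpq_output.splitlines():
        if line.startswith("*"):
            fields = line[1:].split()
            # The last two columns are offset and jitter.
            return {
                "peer": fields[0],
                "offset_ms": float(fields[-2]),
                "jitter_ms": float(fields[-1]),
            }
    return None
```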

Agent G2 - Linux - Nvidia GPU Monitoring

Description

Monitors Nvidia GPU metrics such as GPU utilization, power usage, memory usage, and temperature for each GPU instance.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Nvidia Gpu Monitornvidia_dcgm_fb_mem_usedNvidia Dcgm Framebuffer Memory UsedPercentageFramebuffer Memory Used
nvidia_dcgm_memory_tempNvidia Dcgm Memory TempCelsiusMemory temperature
nvidia_dcgm_gpu_utilNvidia Dcgm Gpu UtilPercentageGPU utilization
nvidia_dcgm_mem_copy_utilNvidia Dcgm Mem UtilPercentageMemory utilization
nvidia_dcgm_gpu_tempNvidia Dcgm Gpu TempCelsiusGPU temperature
nvidia_dcgm_power_usageNvidia Dcgm Power UsageWattsPower draw
nvidia_dcgm_mem_clockNvidia Dcgm Mem Clock FreqMegahertzMemory clock frequency

Agent G2 - Linux - OKD ApiServer

Description

Template for monitoring OKD through Kubernetes API server

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - OKD ApiServerapiserver.rest.client.requests.total.countKube apiserver Rest Client Requests Total CountNULLNumber of HTTP requests, partitioned by status code, method, and host.
apiserver.dropped.requests.total.countKube apiserver Dropped Requests Total CountNULLMonotonic count of requests dropped with Try-again-later response.
apiserver.request.duration.seconds.bucketKube apiserver Request Duration Seconds BuckethistogramResponse latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope, and component.
apiserver.request.countKube apiserver Request CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.APIServiceRegistrationController.depthKube apiserver APIService Registration Controller DepthNULLCurrent depth of workqueue: APIServiceRegistrationController.
apiserver.rest.client.requests.totalKube apiserver Rest Client Requests TotalNULLNumber of HTTP requests, partitioned by status code, method, and host.
apiserver.audit.event.totalKube apiserver Audit Event TotalNULLCounter of audit events generated and sent to the audit backend.
apiserver.go.goroutinesKube apiserver GoroutinesNULLNumber of goroutines that currently exist.
apiserver.inflight.requestsKube apiserver Inflight RequestsNULLMaximal number of currently used inflight request limit of this apiserver per request kind in the last second.
apiserver.go.threads.totalKube apiserver Go Threads TotalNULLNumber of OS threads created.
apiserver.authenticated.user.requestsKube apiserver Authenticated User RequestsNULLCounter of authenticated requests broken out by username.
apiserver.etcd.object.countsKube apiserver ETCD Object CountsNULLNumber of stored objects at the time of the last check split by kind.
apiserver.http.requests.total.countKube apiserver HTTP Requests Total CountNULLTotal number of HTTP requests made.
apiserver.request.count.countKube apiserver Request Count CountNULLCounter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code.
apiserver.authenticated.user.requests.countKube apiserver Authenticated User Requests CountNULLCounter of authenticated requests broken out by username.
apiserver.http.requests.totalKube apiserver HTTP Requests TotalNULLTotal number of HTTP requests made.
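
The apiserver.request.duration.seconds.bucket metric is a histogram, so a latency percentile has to be estimated from bucket counts rather than read directly. The sketch below shows one common way to do that, assuming Prometheus-style cumulative buckets (each count includes all observations at or below the bucket's upper bound, and the last bucket is unbounded) and linear interpolation within a bucket. The function name and input layout are hypothetical.

```python
import math


def histogram_quantile(q, buckets):
    """Estimate the q-quantile from (upper_bound, cumulative_count) pairs.

    buckets must be sorted by upper bound and end with a math.inf bucket.
    """
    total = buckets[-1][1]
    if total == 0:
        return math.nan
    rank = q * total
    prev_bound, prev_count = 0.0, 0
    for bound, count in buckets:
        if count >= rank:
            if math.isinf(bound) or count == prev_count:
                # No finite upper bound or an empty bucket: fall back to the lower bound.
                return prev_bound
            # Linear interpolation inside the bucket that contains the rank.
            fraction = (rank - prev_count) / (count - prev_count)
            return prev_bound + (bound - prev_bound) * fraction
        prev_bound, prev_count = bound, count
    return prev_bound
```

The estimate is only as precise as the bucket boundaries allow, so results should be read as approximate.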

Agent G2 - Linux - OKD CoreDNS

Description

Kubernetes CoreDNS

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - OKD CoreDNScoredns.request_duration.seconds.sumRequest Duration Seconds SumNULLDuration to process each query.
coredns.query.countQuery countNULLTotal query count.
coredns.response_size.bytes.sumResponse Size Bytes SumNULLSize of the returned response in bytes.
coredns.request_duration.seconds.countRequest Duration Seconds CountNULLDuration per upstream interaction.
coredns.panicsTotal PanicsNULLTotal number of panics.

Agent G2 - Linux - OKD Kube Controller

Description

Template for monitoring OKD using Kube Controller

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - OKD Kube Controllercontroller.workqueue.work_unfinished_durationKube Controller Workqueue Unfinished Work SecondsNULLHow many seconds of work has been done that is in progress and hasn't been observed by work_duration. Large values indicate stuck threads.
controller.go.goroutinesKube Controller Go GoroutinesNULLNumber of goroutines that currently exist.
controller.workqueue.addsKube Controller Workqueue Adds TotalNULLTotal number of adds handled by workqueue.
controller.process.open_fdsKube Controller Process Open FdsNULLNumber of open file descriptors.
controller.workqueue.retriesKube Controller Workqueue Retries TotalNULLTotal number of retries handled by workqueue.
controller.workqueue.nodes.countKube Controller Registered NodesNULLNumber of registered Nodes per zone.
controller.workqueue.work_duration.sumKube Controller Workqueue Work Duration Seconds SumNULLHow long in seconds processing an item from workqueue takes.
controller.workqueue.depthKube Controller Workqueue DepthNULLCurrent depth of workqueue.
controller.workqueue.work_duration.countKube Controller Workqueue Work Duration Seconds CountNULLTotal time, in seconds, spent processing items from the workqueue.
controller.workqueue.nodes.evictionsKube Controller Node Collector Evictions NumberNULLNumber of Node evictions that happened since the current instance of NodeController started.
controller.threadsKube Controller OS ThreadsNULLNumber of OS threads created.
controller.workqueue.work_longest_durationKube Controller Workqueue Longest Running Processor SecondsNULLHow many seconds has the longest running processor for workqueue been running.
controller.rate_limiter.useKube Controller Node Lifecycle Controller Rate Limiter UseNULLA metric measuring the saturation of the rate limiter for node_lifecycle_controller.
controller.process.max_fdsKube Controller Process Max FdsNULLMaximum number of open file descriptors.
controller.workqueue.queue_duration.sumKube Controller Workqueue Queue Duration Seconds SumNULLHow long in seconds an item stays in workqueue before being requested.
controller.workqueue.queue_duration.countKube Controller Workqueue Queue Duration Seconds CountNULLTotal time, in seconds, that items stay in the workqueue before being requested.
controller.workqueue.nodes.unhealthyKube Controller Node Collector Unhealthy Nodes in ZoneNULLNumber of not Ready Nodes per zone.

Agent G2 - Linux - Postfix Monitors

Description

Monitors postfix queues

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Postfix Monitorspostfix.queues.deferredPostfix Queues DeferredNULLNumber of mails in the Postfix deferred queue.
postfix.queues.bouncePostfix Queues BounceNULLNumber of mails in the Postfix bounce queue.
postfix.queues.maildropPostfix Queues MaildropNULLNumber of mails in the Postfix maildrop queue.
postfix.queues.incomingPostfix Queues IncomingNULLNumber of mails in the Postfix incoming queue.
postfix.queues.activePostfix Queues ActiveNULLNumber of mails in the Postfix active queue.

Agent G2 - Linux - OKD Kube Scheduler

Description

Template for monitoring OKD using Kube Scheduler

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - OKD Kube Schedulerscheduler.client.http.requests_duration.countKube Scheduler Rest Client Request Latency Seconds CountNULLTotal request latency in seconds. Broken down by verb and URL.
scheduler.binding.latency.countKube Scheduler Binding Latency Microseconds CountNULLTotal Binding latency in microseconds count.
scheduler.scheduling.scheduling_latency.countKube Scheduler Scheduling Latency Seconds CountNULLScheduling latency in seconds split by sub-parts of the scheduling operation.
scheduler.scheduling.scheduling_duration.quantileKube Scheduler Scheduling Duration SecondsNULLScheduling latency in seconds split by sub-parts of the scheduling operation.
scheduler.volume_scheduling_duration.sumKube Scheduler Volume Scheduling Duration Seconds SumNULLVolume scheduling stage latency sum.
scheduler.volume_scheduling_duration.countKube Scheduler Volume Scheduling Duration Seconds CountNULLVolume scheduling stage latency count.
scheduler.go.goroutinesKube Scheduler Go GoroutinesNULLNumber of goroutines that currently exist.
scheduler.pod_preemption.victimsKube Scheduler Pod Preemption VictimsNULLNumber of selected preemption victims.
scheduler.scheduling.algorithm.preemption_duration.sumKube Scheduler Scheduling Algorithm Preemption Evaluation SumNULLScheduling algorithm preemption evaluation duration.
scheduler.gc_duration_seconds.sumKube Scheduler Go GC Duration Seconds SumNULLA summary of the GC invocation durations.
scheduler.client.http.requestsKube Scheduler Rest Client Requests TotalNULLNumber of HTTP requests, partitioned by status code, method, and host.
scheduler.e2e.scheduling_latency.sumKube Scheduler E2E Scheduling Latency Microseconds SumNULLE2e scheduling latency in microseconds (scheduling algorithm + binding).
scheduler.scheduling.scheduling_duration.sumKube Scheduler Scheduling Duration Seconds SumNULLScheduling latency in seconds split by sub-parts of the scheduling operation.
scheduler.e2e.scheduling_duration.sumKube Scheduler E2E Scheduling Duration Seconds SumNULLE2e scheduling latency in seconds (scheduling algorithm + binding).
scheduler.binding.latency.sumKube Scheduler Binding Latency Microseconds SumNULLBinding latency in microseconds sum.
scheduler.scheduling.scheduling_duration.countKube Scheduler Scheduling Duration Seconds CountNULLScheduling latency in seconds split by sub-parts of the scheduling operation.
scheduler.scheduling.scheduling_latency.quantileKube Scheduler Scheduling Latency SecondsNULLScheduling latency in seconds split by sub-parts of the scheduling operation.
scheduler.e2e.scheduling_duration.countKube Scheduler E2E Scheduling Duration Seconds CountNULLTotal E2e scheduling latency in seconds (scheduling algorithm + binding).
scheduler.binding.duration.countKube Scheduler Binding Duration Seconds CountNULLTotal Binding duration in seconds count.
scheduler.threadsKube Scheduler OS ThreadsNULLNumber of OS threads created.
scheduler.binding.duration.secondsKube Scheduler Binding Duration Seconds SumNULLBinding duration in seconds sum.
scheduler.pod_preemption.attemptsKube Scheduler Total Preemption AttemptsNULLTotal preemption attempts in the cluster till now.
scheduler.process.open_fdsKube Scheduler Process Open FdsNULLNumber of open file descriptors.
scheduler.scheduling.scheduling_latency.sumKube Scheduler Scheduling Latency Seconds SumNULLScheduling latency in seconds split by sub-parts of the scheduling operation.
scheduler.client.http.requests_duration.sumKube Scheduler Rest Client Request Latency Seconds SumNULLRequest latency in seconds sum. Broken down by verb and URL.
scheduler.schedule_attempts.totalKube Scheduler Schedule Attempts TotalNULLNumber of attempts to schedule pods, by the result. 'unschedulable' means a pod could not be scheduled, while 'error' means an internal scheduler problem.
scheduler.gc_duration_seconds.quantileKube Scheduler Go GC Duration SecondsNULLA summary of the GC invocation durations.
scheduler.scheduling.algorithm.preemption_duration.countKube Scheduler Scheduling Algorithm Preemption Evaluation CountNULLScheduling algorithm preemption evaluation duration.
scheduler.scheduling.algorithm.predicate_duration.sumKube Scheduler Scheduling Algorithm Predicate Evaluation SumNULLScheduling algorithm predicate evaluation duration.
scheduler.scheduling.algorithm_duration.sumKube Scheduler Scheduling Algorithm Duration Seconds SumNULLScheduling algorithm latency in seconds sum.
scheduler.e2e.scheduling_latency.countKube Scheduler E2E Scheduling Latency Microseconds CountNULLTotal E2e scheduling latency in microseconds (scheduling algorithm + binding).
scheduler.scheduling.algorithm.predicate_duration.countKube Scheduler Scheduling Algorithm Predicate Evaluation CountNULLScheduling algorithm predicate evaluation duration.
scheduler.scheduling.algorithm_duration.countKube Scheduler Scheduling Algorithm Duration Seconds CountNULLTotal Scheduling algorithm latency in seconds count.
scheduler.scheduling.algorithm.priority_duration.sumKube Scheduler Scheduling Algorithm Priority Evaluation SumNULLScheduling algorithm priority evaluation duration.
scheduler.scheduling.algorithm_latency.sumKube Scheduler Scheduling Algorithm Latency Microseconds SumNULLScheduling algorithm latency in microseconds sum.
scheduler.gc_duration_seconds.countKube Scheduler Go GC Duration Seconds CountNULLA summary of the GC invocation durations.
scheduler.scheduling.algorithm_latency.countKube Scheduler Scheduling Algorithm Latency Microseconds CountNULLTotal Scheduling algorithm latency in microseconds count.
scheduler.cache.lookupsKube Scheduler Equiv Cache Lookups TotalNULLTotal number of equivalence cache lookups, by whether or not a cache entry was found.
scheduler.process.max_fdsKube Scheduler Process Max FdsNULLMaximum number of open file descriptors.

Agent G2 - Linux - OKD Kube State

Description

Template for monitoring OKD using Kube State

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - OKD Kube Statekubernetes_state.node.cpu_allocatableNode Cpu AllocatableNULLThe CPU resources of a node that are available for scheduling.
kubernetes_state.resourcequota.requests.storage.usedResourcequota Requests Storage UsedNULLObserved sum of storage bytes requested for a resource quota.
kubernetes_state.resourcequota.persistentvolumeclaims.limitResourcequota Persistentvolumeclaims LimitNULLHard limit of the number of PVC for a resource quota.
kubernetes_state.node.cpu_capacityNode Cpu CapacityNULLThe total CPU resources of the node.
kubernetes_state.replicaset.replicas_desiredReplicaset Replicas DesiredNULLNumber of desired pods for a ReplicaSet.
kubernetes_state.resourcequota.pods.limitResourcequota Pods LimitNULLHard limit of the number of pods for a resource quota.
kubernetes_state.replicaset.replicas_readyReplicaset Replicas ReadyNULLThe number of ready replicas per ReplicaSet.
kubernetes_state.deployment.replicas_unavailableDeployment Replicas UnavailableNULLThe number of unavailable replicas per deployment.
kubernetes_state.resourcequota.requests.memory.limitResourcequota Requests Memory LimitNULLHard limit on the total of memory bytes requested for a resource quota.
kubernetes_state.resourcequota.limits.cpu.limitResourcequota Limits Cpu LimitNULLHard limit on the sum of CPU core limits for a resource quota.
kubernetes_state.resourcequota.requests.memory.usedResourcequota Requests Memory UsedNULLObserved sum of memory bytes requested for a resource quota.
kubernetes_state.replicaset.replicasReplicaset ReplicasNULLThe number of replicas per ReplicaSet.
kubernetes_state.resourcequota.limits.memory.limitResourcequota Limits Memory LimitNULLHard limit on the sum of memory bytes limits for a resource quota.
kubernetes_state.resourcequota.requests.cpu.usedResourcequota Requests Cpu UsedNULLObserved sum of CPU cores requested for a resource quota.
kubernetes_state.resourcequota.services.nodeports.limitResourcequota Services Nodeports LimitNULLHard limit of the number of node ports for a resource quota.
kubernetes_state.daemonset.scheduledDaemonset ScheduledNULLThe number of nodes that are running at least one daemon pod and are supposed to.
kubernetes_state.deployment.replicas_updatedDeployment Replicas UpdatedNULLThe number of updated replicas per deployment.
kubernetes_state.container.memory_requestedContainer Memory RequestedNULLThe number of requested memory bytes by a container.
kubernetes_state.container.cpu_limitContainer Cpu LimitNULLThe limit on cpu cores to be used by a container.
kubernetes_state.container.restartsContainer RestartsNULLThe number of restarts per container.
kubernetes_state.resourcequota.pods.usedResourcequota Pods UsedNULLObserved number of pods used for a resource quota.
kubernetes_state.resourcequota.persistentvolumeclaims.usedResourcequota Persistentvolumeclaims UsedNULLObserved number of persistent volume claims used for a resource quota.
kubernetes_state.resourcequota.limits.memory.usedResourcequota Limits Memory UsedNULLObserved sum of limits for memory bytes for a resource quota.
kubernetes_state.deployment.replicas_availableDeployment Replicas AvailableNULLThe number of available replicas per deployment.
kubernetes_state.daemonset.misscheduledDaemonset MisscheduledNULLThe number of nodes that are running a daemon pod but are not supposed to.
kubernetes_state.resourcequota.requests.cpu.limitResourcequota Requests Cpu LimitNULLHard limit on the total of CPU cores requested for a resource quota.
kubernetes_state.resourcequota.limits.cpu.usedResourcequota Limits Cpu UsedNULLObserved sum of limits for CPU cores for a resource quota.
kubernetes_state.resourcequota.services.loadbalancers.limitResourcequota Services Loadbalancers LimitNULLHard limit of the number of loadbalancers for a resource quota.
kubernetes_state.container.memory_limitContainer Memory LimitNULLThe limit on memory to be used by a container.
kubernetes_state.resourcequota.services.nodeports.usedResourcequota Services Nodeports UsedNULLObserved number of node ports used for a resource quota.
kubernetes_state.resourcequota.services.limitResourcequota Services LimitNULLHard limit of the number of services for a resource quota.
kubernetes_state.node.pods_capacityNode Pods CapacityNULLThe total pod resources of the node.
kubernetes_state.node.memory_allocatableNode Memory AllocatableNULLThe memory resources of a node that are available for scheduling.
kubernetes_state.container.cpu_requestedContainer Cpu RequestedNULLThe number of requested cpu cores by a container.
kubernetes_state.deployment.rollingupdate.max_unavailableDeployment Rollingupdate Max UnavailableNULLMaximum number of unavailable replicas during a rolling update of a deployment.
kubernetes_state.daemonset.readyDaemonset ReadyNULLThe number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready.
kubernetes_state.deployment.replicas_desiredDeployment Replicas DesiredNULLThe number of desired replicas per deployment.
kubernetes_state.resourcequota.services.usedResourcequota Services UsedNULLObserved number of services used for a resource quota.
kubernetes_state.daemonset.desiredDaemonset DesiredNULLThe number of nodes that should be running the daemon pod.
kubernetes_state.resourcequota.services.loadbalancers.usedResourcequota Services Loadbalancers UsedNULLObserved number of loadbalancers used for a resource quota.
kubernetes_state.node.pods_allocatableNode Pods AllocatableNULLThe pod resources of a node that are available for scheduling.
kubernetes_state.resourcequota.requests.storage.limitResourcequota Requests Storage LimitNULLHard limit on the total of storage bytes requested for a resource quota.
kubernetes_state.replicaset.fully_labeled_replicasReplicaset Fully Labeled ReplicasNULLThe number of fully labeled replicas per ReplicaSet.
kubernetes_state.node.memory_capacityNode Memory CapacityNULLThe total memory resources of the node.
kubernetes_state.deployment.replicasDeployment ReplicasNULLThe number of replicas per deployment.

Agent G2 - Linux - OS DISKIOPS Template

Description

Agent G2 - Linux - OS DISKIOPS Template

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - OS DISKIOPS TemplateDISKIOPSDISKIOPSNULLMonitors disk read throughput in MBps, disk write throughput in MBps, and disk busy percentage.

Agent G2 - Linux Postfix Statistics

Description

Monitors the counts and sizes of specific mail queue types - namely, incoming, active, maildrop, deferred, and bounce queues.

Prerequisites

Postfix must be installed, and root privileges are required to execute the script.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux Postfix Statisticssystem_linux_postfix_active_mailqueue_sizeSystem Linux Postfix Active Mailqueue SizeBytesMonitors the size (in bytes) of active emails in the Postfix mail queue.
system_linux_postfix_deferred_mailqueue_sizeSystem Linux Postfix Deferred Mailqueue SizeBytesMonitors the size (in bytes) of deferred emails in the Postfix mail queue.
system_linux_postfix_maildrop_mailqueue_sizeSystem Linux Postfix Maildrop Mailqueue SizeBytesMonitors the size (in bytes) of emails in the Postfix mail queue marked for maildrop.
system_linux_postfix_bounce_mailqueue_sizeSystem Linux Postfix Bounce Mailqueue SizeBytesMonitors the size (in bytes) of bounced emails in the Postfix mail queue.
system_linux_postfix_incoming_mailqueue_sizeSystem Linux Postfix Incoming Mailqueue SizeBytesMonitors the size (in bytes) of incoming emails in the Postfix mail queue.
system_linux_postfix_deferred_mailqueue_countSystem Linux Postfix Deferred Mailqueue CountCountMonitors the count of deferred emails in the Postfix mail queue.
system_linux_postfix_bounce_mailqueue_countSystem Linux Postfix Bounce Mailqueue CountCountMonitors the count of bounced emails in the Postfix mail queue.
system_linux_postfix_maildrop_mailqueue_countSystem Linux Postfix Maildrop Mailqueue CountCountMonitors the count of emails in the Postfix mail queue marked for maildrop.
system_linux_postfix_incoming_mailqueue_countSystem Linux Postfix Incoming Mailqueue CountCountMonitors the count of incoming emails in the Postfix mail queue.
system_linux_postfix_active_mailqueue_countsystem_linux_postfix_active_mailqueue_countCountMonitors the count of active emails in the Postfix mail queue.

Agent G2 - Linux - PostgresDB Aliveness Check Status

Description

Monitors PostgreSQL database aliveness (db_aliveness). Supports PostgreSQL 11 or later; this template was validated on PostgreSQL 11 and 12.

Prerequisites

  • Agent installed on the target machine.
  • Create a Postgres environment file and provide the file path as input parameter while applying the template.
  • Provide the env path as <env path> for a single instance, or as <env path1>,<env path2>,<env path3> for multiple instances.
  • Along with setting up the Postgres environment, this env file should contain other parameters like PGDATABASE, PGARCHIVEDIR, PGDATADIR, PGWALDIR, PGPORT, SERVICENAME.
  • Agent must have permission to access the provided .env file

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - PostgresDB Aliveness Check Status Monitorpostgres_check_dbalivePostgres Check DBAliveNULLTo monitor the aliveness of the postgres database

Agent G2 - Linux - PostgresDB Aliveness Check Status - v2

Description

To monitor the aliveness of the postgres database

Template Usage Guidelines:

  • Create a Postgres environment file and provide the file path as input parameter while applying the template.
  • We need to provide env path for single instance - <env path> & for multiple instances - <env path1>,<env path2>,<env path3>
  • Along with setting up postgres environment, this env file should contain other parameters like PGDATABASE,PGARCHIVEDIR,PGDATADIR,PGWALDIR,PGPORT,SERVICENAME.
  • Agent must have permission to access the provided .env file.
  • Sample pg_new_5432.env file data:

export PATH=$PATH:/usr/pgsql-12/bin/
PGDATABASE=postgres
PGPORT=5432
PGARCHIVEDIR=/var/lib/pgsql/backups_main/archive/
PGDATADIR=/var/lib/pgsql/12/data
PGWALDIR=/var/lib/pgsql/12/data/pg_wal
SERVICENAME=postgresql-12

Prerequisite

Agent must be installed on the target machine.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - PostgresDB Aliveness Check Status Monitor - v2postgres_check_dbalivePostgres Check DBAliveTo monitor the aliveness of the postgres database

Agent G2 - Linux - PostgreSQL Monitors

Description

Agent G2 - Linux - PostgreSQL Monitors

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - PostgreSQL Monitorspostgres.dbstat.commitPostgreSQL CommitsNULLThe total number of commits for this database since it was created or reset.
postgres.dbstat.readsPostgreSQL ReadsNULLReports information from the pg_stat_database view: the total number of disk blocks read.
postgres.conn.idlePostgreSQL Connections IdleNULLChecks the number of connections in the idle state.
postgres.db.sizePostgreSQL Database SizeMBTotal size of all databases.
postgres.locks.not.grantedPostgreSQL Locks Not GrantedNULLChecks the total number of locks that are not granted on one or more databases.
postgres.conn.waitingPostgreSQL Connections WaitingNULLChecks the number of connections in the waiting state.
postgres.dbstat.tup.insertedPostgreSQL Tup InsertedNULLReports information from the pg_stat_database view: the total number of tuples inserted.
postgres.autovac.freezePostgreSQL Autovac Freeze%Provides the percentage of current transactions to the max freeze number.
postgres.backendsPostgreSQL Backends%Checks the number of connections to the database and compares it with the max_connections value set in the PostgreSQL configuration file.
postgres.dbstat.tup.returnedPostgreSQL Tup ReturnedNULLReports information from the pg_stat_database view: the total number of tuples returned.
postgres.dbstat.hitsPostgreSQL HitsNULLReports information from the pg_stat_database view: the total number of buffer hits.
postgres.tnxage.runningPostgreSQL Tnxage RunningNULLChecks the number and duration of "running transaction" queries on one or more databases
postgres.locks.grantedPostgreSQL Locks GrantedNULLChecks the total number of locks that are granted on one or more databases.
postgres.bloatPostgreSQL BloatNULLChecks the amount of bloat in tables and indexes
postgres.dbstat.rollbackPostgreSQL RollbacksNULLThe total number of rollbacks for this database since it was created or reset.
postgres.dbstat.tup.deletedPostgreSQL Tup DeletedNULLReports information from the pg_stat_database view: the total number of tuples deleted.
postgres.conn.totalPostgreSQL Connections TotalNULLChecks the total number of connections.
postgres.conn.runningPostgreSQL Connections RunningNULLChecks the number of connections in the running state.
postgres.pingPostgreSQL PingNULLProvides the Ping response time of PostgreSQL database.
postgres.dbstat.tup.fetchedPostgreSQL Tup FetchedNULLReports information from the pg_stat_database view: the total number of tuples fetched.
postgres.conn.idle.tnxPostgreSQL Connections Idle TnxNULLChecks the number of connections in the "idle in transaction" state.
postgres.locks.allPostgreSQL LocksNULLChecks the total number of locks on one or more databases
postgres.tnxage.idle.tnxPostgreSQL Tnxage IdleTnxNULLChecks the number and duration of "idle in transaction" queries on one or more databases
postgres.dbstat.tup.updatedPostgreSQL Tup UpdatedNULLReports information from the pg_stat_database view: the total number of tuples updated.
postgres.walPostgreSQL WalNULLChecks how many WAL files exist in the pg_xlog directory

Agent G2 - Linux PostgresDB Aliveness Check Status - v3

Description:

Monitors the PostgreSQL database aliveness status. The metric value is 1 if the PostgreSQL database is alive; otherwise, the value is 0.

Prerequisites:

  1. The Agent must be installed as Root on the target machine, with Agent version 14.0 or later.
  2. Create a PostgreSQL environment file and provide the file path as an input parameter when applying the template.
  3. For a single instance, provide the environment path as <env path>. For multiple instances, provide the environment paths as <env path1>, <env path2>, <env path3>.
  4. This env file should contain other parameters such as PGDATADIR, PGPORT, SERVICENAME, SERVICELEVEL, POSTGRESUSER, and POSTGRESPATH. The last three parameters (SERVICELEVEL, POSTGRESUSER, POSTGRESPATH) are necessary only for checking the status of PostgreSQL using pg_ctl; otherwise, they are optional.
  5. The agent must have execute permissions on the provided .env file.

The enhancements introduced in Version 3 (V3) of the script provide users with the flexibility to check the PostgreSQL aliveness status using either systemctl, pg_isready commands, or a combination of pg_isready and pg_ctl commands by giving respective parameters in env file as input.

Template Usage Guidelines: When assigning the template to a device, users need to pass specific input parameters:

For a single instance, provide the environment path as <env path>. For multiple instances, provide the environment paths as <env path1>, <env path2>, <env path3>.

Example:
Single Input Parameter:
/root/pg_new_5432.env

Multiple Input Parameters: /var/lib/pgsql/pg_new_5432.env,/var/lib/pgsql/pg_new_5433.env

  • Create a Postgres environment file and provide the file path as input parameter while applying the template.
  • We need to provide env path for single instance - <env path> & for multiple instances - <env path1>,<env path2>,<env path3>
  • Along with setting up postgres environment, this env file should contain other parameters like PGDATABASE,PGARCHIVEDIR,PGDATADIR,PGWALDIR,PGPORT,SERVICENAME.
  • Agent must have permission to access the provided .env file.

Example content of the .env file:

Sample content when checking PostgreSQL status with the pg_ctl command:

export PATH=$PATH:/usr/pgsql-12/bin/        
PGPORT=5432         
PGDATADIR=/var/lib/pgsql/12/data            
SERVICENAME=postgresql-12           
SERVICELEVEL=pgctl          
POSTGRESUSER=postgres           
POSTGRESPATH=/usr/pgsql-12/bin/

Sample content when checking PostgreSQL status without pg_ctl:

export PATH=$PATH:/usr/pgsql-12/bin/        
PGPORT=5432         
PGDATADIR=/var/lib/pgsql/12/data            
SERVICENAME=postgresql-12           
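
As a rough illustration of how an env file like the samples above might be consumed, the sketch below parses `KEY=VALUE` lines, splits the comma-separated input parameter, and maps a `pg_isready` exit code to the 1/0 metric value. The helper names (`parse_env`, `split_env_paths`, `is_alive`) are hypothetical, and the agent's real script is not shown in this document; the systemctl and pg_ctl paths are omitted here.

```python
import subprocess
from typing import Dict, List


def parse_env(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, tolerating a leading 'export' and blank lines."""
    params = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if line.startswith("export "):
            line = line[len("export "):]
        key, sep, value = line.partition("=")
        if sep:
            params[key.strip()] = value.strip()
    return params


def split_env_paths(arg: str) -> List[str]:
    """Split the template input '<env path1>,<env path2>' into a list of paths."""
    return [p.strip() for p in arg.split(",") if p.strip()]


def is_alive(params: Dict[str, str], timeout: float = 10.0) -> int:
    """Return 1 if pg_isready exits with 0 (accepting connections), else 0.

    Assumes pg_isready is already on PATH; this sketch does not apply the
    PATH line from the env file.
    """
    try:
        result = subprocess.run(
            ["pg_isready", "-p", params.get("PGPORT", "5432")],
            capture_output=True,
            timeout=timeout,
        )
    except (OSError, subprocess.TimeoutExpired):
        return 0
    return 1 if result.returncode == 0 else 0
```

For multiple instances, each path returned by `split_env_paths` would be read and checked separately, giving one aliveness value per instance.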

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux PostgresDB Aliveness Check Status Monitor - v3postgres_check_dbalivePostgres Check DBAlive-To monitor the aliveness of the postgres database

Agent G2 - Linux OS Performance Monitoring - Advanced

Description

Template to monitor Linux OS advanced performance metrics related to OS resource parameters, real memory (page outs and scan rate), and total and individual swap utilization. It has been validated on the following Linux flavors: RHEL 7.9, CentOS 7, Ubuntu 20.04, and openSUSE Linux.
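
To make "total and individual swap utilization" concrete, here is a small sketch that computes both from text in the `/proc/swaps` layout (Filename, Type, Size, Used, Priority). Whether the agent reads `/proc/swaps` is an assumption; the function name `swap_utilization` is hypothetical.

```python
from typing import Dict, Tuple


def swap_utilization(proc_swaps_text: str) -> Tuple[float, Dict[str, float]]:
    """Return (total_pct, {swap_area: pct}) from /proc/swaps-style text."""
    per_area = {}
    total_size = 0
    total_used = 0
    # The first line is the column header, so skip it.
    for line in proc_swaps_text.splitlines()[1:]:
        fields = line.split()
        if len(fields) < 4:
            continue
        name, size, used = fields[0], int(fields[2]), int(fields[3])
        per_area[name] = 100.0 * used / size if size else 0.0
        total_size += size
        total_used += used
    total_pct = 100.0 * total_used / total_size if total_size else 0.0
    return total_pct, per_area
```

On a live system the input could come from `open("/proc/swaps").read()`; a host without swap yields a total of 0.0 and an empty per-area mapping.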

Prerequisites

Applicable on devices running OpsRamp Agent (v7.0.0 or above).

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux OS Resource Parameterssystem_linux_openFileDescriptors_UsedCountSystem Linux OpenFileDescriptors Used CountcountCurrent number of Open File Descriptors
system_linux_messageQueueIDs_UtilizationSystem Linux MessageQueueIDs Utilization%Used percentage of current message queue ID's
system_linux_Semaphores_UtilizationSystem Linux Semaphores Utilization%Semaphore arrays or sets used percentage
system_linux_loggedInUsers_PctSystem Linux Logged In Users Pct%Current number of logged in users percentage
system_linux_sharedMemoryIDs_UtilizationSystem Linux SharedMemoryIDs Utilization%Used percentage of shared memory ID's
system_linux_messageQueueIDs_UsedCountSystem Linux MessageQueueIDs Used CountcountCurrent number of message queue ID’s in use
system_linux_openFileDescriptors_UtilizationSystem Linux OpenFileDescriptors Utilization%Linux Open File Descriptors Used Percentage
system_linux_runningProcesses_PctSystem Linux Running Processes Pct%Current running processes percentage
system_linux_Semaphores_UsedCountSystem Linux Semaphores Used CountcountCurrent number of semaphore arrays in use
system_linux_sharedMemoryIDs_UsedCountSystem Linux SharedMemoryIDs Used CountcountCurrent number of shared memory ID’s in use
system_linux_loggedInUsers_CountSystem Linux LoggedInUsers CountcountCurrent number of logged in users
system_linux_runningProcesses_CountSystem Linux RunningProcesses CountcountCurrent number of running processes
Agent G2 - Linux Real Memory Statssystem_linux_RealMemory_PageOuts_KiloBytesPerSecSystem Linux Real Memory PageOuts KiloBytesPerSecKBpsMemory pages page out rate in Kilo Bytes per second.
system_linux_RealMemory_PageOuts_PagesPerSecSystem Linux Real Memory PageOuts PagesPerSecpsecMemory page out rate in pages per second.
system_linux_RealMemory_Scan_RateSystem Linux Real Memory Scan RatepsecNumber of pages scanned (directly) per second. It will collect data for last 10 min (i.e time configured in /etc/cron.d/sysstat file). Prerequisite: sysstat package should be installed and sar -B command should respond
Agent G2 - Linux Swap Memory Utilizationsystem_linux_swapMemory_UtilizationSystem Linux Swap Memory Utilization%Swap memory utilization in percent.
system_linux_individual_SwapArea_UtilizationSystem Linux Individual Swap Area Utilization%Individual swap area utilization in percent.

Agent G2 - Linux OS Performance Monitoring - Advanced - V2

Description

Template to monitor Linux OS advanced performance metrics related to OS resource parameters, real memory (page ins, page outs, swap in, swap out, and scan rate), and total and individual swap utilization. It has been validated on the following Linux flavors: RHEL 7.9, CentOS 7, Ubuntu 20.04, and openSUSE Linux.

Prerequisites

Applicable on devices running OpsRamp Agent (v7.0.0 or above).

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux OS Resource Parameterssystem_linux_openFileDescriptors_UsedCountSystem Linux OpenFileDescriptors Used CountcountCurrent number of Open File Descriptors
system_linux_messageQueueIDs_UtilizationSystem Linux MessageQueueIDs Utilization%Used percentage of current message queue ID's
system_linux_Semaphores_UtilizationSystem Linux Semaphores Utilization%Semaphore arrays or sets used percentage
system_linux_loggedInUsers_PctSystem Linux Logged In Users Pct%Current number of logged in users percentage
system_linux_sharedMemoryIDs_UtilizationSystem Linux SharedMemoryIDs Utilization%Used percentage of shared memory ID's
system_linux_messageQueueIDs_UsedCountSystem Linux MessageQueueIDs Used CountcountCurrent number of message queue ID’s in use
system_linux_openFileDescriptors_UtilizationSystem Linux OpenFileDescriptors Utilization%Linux Open File Descriptors Used Percentage
system_linux_runningProcesses_PctSystem Linux Running Processes Pct%Current running processes percentage
system_linux_Semaphores_UsedCountSystem Linux Semaphores Used CountcountCurrent number of semaphore arrays in use
system_linux_sharedMemoryIDs_UsedCountSystem Linux SharedMemoryIDs Used CountcountCurrent number of shared memory ID’s in use
system_linux_loggedInUsers_CountSystem Linux LoggedInUsers CountcountCurrent number of logged in users
system_linux_runningProcesses_CountSystem Linux RunningProcesses CountcountCurrent number of running processes
Agent G2 - Linux Real Memory Stats - V2system_linux_RealMemory_PageOuts_KiloBytesPerSecSystem Linux Real Memory PageOuts KiloBytesPerSecKBpsMemory pages page out rate in Kilo Bytes per second.
system_linux_RealMemory_PageOuts_PagesPerSecSystem Linux Real Memory PageOuts PagesPerSecpsecMemory page out rate in pages per second.
system_linux_RealMemory_Scan_RateSystem Linux Real Memory Scan RatepsecNumber of pages scanned (directly) per second. It will collect data for last 10 min (i.e time configured in /etc/cron.d/sysstat file). Prerequisite: sysstat package should be installed and sar -B command should respond
system_linux_RealMemory_SwapOuts_KiloBytesPerSecSystem Linux Real Memory SwapOuts KiloBytesPerSecKBpsSwap out rate in Kilo Bytes per second.
system_linux_RealMemory_PageIns_PagesPerSecSystem Linux Real Memory PageIns PagesPerSecpsecMemory pages page in rate in Pages per second.
system_linux_RealMemory_PageIns_KiloBytesPerSecSystem Linux Real Memory PageIns KiloBytesPerSecKBpsMemory pages page in rate in Kilo Bytes per second.
system_linux_RealMemory_SwapIns_KiloBytesPerSecSystem Linux Real Memory SwapIns KiloBytesPerSecKBpsSwap in rate in Kilo Bytes per second.
Agent G2 - Linux Swap Memory Utilizationsystem_linux_swapMemory_UtilizationSystem Linux Swap Memory Utilization%Swap memory utilization in percent.
system_linux_individual_SwapArea_UtilizationSystem Linux Individual Swap Area Utilization%Individual swap area utilization in percent.

Agent G2 - Linux OS Performance Monitoring - Advanced - v3

Description:

Monitors data related to OS resource parameters such as open FDs, logged-in users, context switches, processes created, running processes, page faults, semaphores, shared memory IDs, message queue IDs, TCP connection states, swap memory, real memory statistics, and disk statistics. The “ss” command must be available on the system to collect TCP connection data, and util-linux package version 2.19 or higher with a kernel version above 2.6 is required to collect disk statistics.
Note: When the server is rebooted or the disk counter’s valid range is exceeded, disk metrics can show negative values because they are calculated as rates.
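
The note about negative disk values follows from how a rate is derived from two samples of a cumulative counter. The sketch below is an assumption about the general technique, not the agent's code; `counter_rate` and `rate_or_none` are hypothetical names.

```python
from typing import Optional


def counter_rate(previous: int, current: int, interval_sec: float) -> float:
    """Rate per second between two samples of a cumulative counter."""
    return (current - previous) / interval_sec


def rate_or_none(previous: int, current: int, interval_sec: float) -> Optional[float]:
    """Same rate, but None when the counter went backwards (reboot or wrap)."""
    rate = counter_rate(previous, current, interval_sec)
    return rate if rate >= 0 else None


# Normal case: the counter grew by 600 over 60 seconds.
print(counter_rate(1000, 1600, 60))   # 10.0
# After a reboot the counter restarts near zero, so the difference is negative.
print(counter_rate(5000, 200, 60))    # -80.0
```

A consumer of these metrics could treat negative samples the way `rate_or_none` does and discard them rather than alert on them.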

Prerequisites

Ensure that the “ss” command is available on the system to collect TCP connection data, and that util-linux package version 2.19 or higher and a kernel version above 2.6 are available to collect disk statistics.
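
As an illustration of why “ss” is needed for the TCP connection states metric, the sketch below counts connections per state from `ss -tan` output, where the first column of each row is the state. The function names are hypothetical and the agent's own parsing is not documented here.

```python
import subprocess
from collections import Counter


def count_tcp_states(ss_output: str) -> Counter:
    """Count TCP connections per state from `ss -tan` style output."""
    counts = Counter()
    # The first line is the column header (State, Recv-Q, Send-Q, ...).
    for line in ss_output.splitlines()[1:]:
        fields = line.split()
        if fields:
            counts[fields[0]] += 1
    return counts


def current_tcp_states() -> Counter:
    """Run `ss -tan` (TCP, all sockets, numeric addresses) and count states."""
    result = subprocess.run(["ss", "-tan"], capture_output=True, text=True, check=True)
    return count_tcp_states(result.stdout)
```

Each key of the returned counter (for example LISTEN or ESTAB) would correspond to one reported state.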

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux Swap Memory Utilizationsystem_linux_swapMemory_UtilizationSystem Linux Swap Memory Utilization%Swap memory utilization in percent.
system_linux_individual_SwapArea_UtilizationSystem Linux Individual Swap Area Utilization%Individual swap area utilization in percent.
Agent G2 - Linux Real Memory Stats - V2system_linux_RealMemory_Scan_RateSystem Linux Real Memory Scan RatepsecNumber of pages scanned (directly) per second. It will collect data for last 10 min (i.e time configured in /etc/cron.d/sysstat file). Prerequisite: sysstat package should be installed and sar -B command should respond
system_linux_RealMemory_PageOuts_PagesPerSecSystem Linux Real Memory PageOuts PagesPerSecpsecMemory page out rate in pages per second.
system_linux_RealMemory_PageOuts_KiloBytesPerSecSystem Linux Real Memory PageOuts KiloBytesPerSecKBpsMemory pages page out rate in Kilo Bytes per second.
system_linux_RealMemory_PageIns_KiloBytesPerSecSystem Linux Real Memory PageIns KiloBytesPerSecKBpsMemory pages page in rate in Kilo Bytes per second.
system_linux_RealMemory_PageIns_PagesPerSecSystem Linux Real Memory PageIns PagesPerSecpsecMemory pages page in rate in Pages per second.
system_linux_RealMemory_SwapIns_KiloBytesPerSecSystem Linux Real Memory SwapIns KiloBytesPerSecKBpsSwap in rate in Kilo Bytes per second.
system_linux_RealMemory_SwapOuts_KiloBytesPerSecSystem Linux Real Memory SwapOuts KiloBytesPerSecKBpsSwap out rate in Kilo Bytes per second.
Agent G2 - Linux TCP Connection States Monitorsystem_linux_tcp_connection_statesSystem Linux TCP Connection StatescountMonitors the count of TCP connections in various states on the Linux system.
Agent G2 - Linux Disk Statisticssystem_linux_disk_IOQueueLengthSystem Linux Disk IOQueueLengthnullMonitors the length of the I/O queue for disk operations.
system_linux_disk_averageLatencySystem Linux Disk AverageLatencymicrosecMonitors the average response time of the disk subsystem to process I/O requests.
system_linux_disk_averageReadRequestSizeSystem Linux Disk AverageReadRequestSizeKBMonitors the average size of read requests issued to the disk.
system_linux_disk_averageRequestSizeSystem Linux Disk AverageRequestSizeKBMonitors the average size of all read and write requests issued to the disk.
system_linux_disk_averageWriteRequestSizeSystem Linux Disk AverageWriteRequestSizeKBMonitors the average size of write requests issued to the disk.
system_linux_disk_timeUtilizationSystem Linux Disk TimeUtilization%Monitors the utilization of the disk, measured as the percentage of time the disk is actively processing I/O operations.
system_linux_disk_readOperationsRateSystem Linux Disk ReadOperationsRatepsecMonitors the rate of read operations issued to the disk per second.
system_linux_disk_readThroughputSystem Linux Disk ReadThroughputKBpsMonitors the rate of data read from the disk per second.
system_linux_disk_averageReadWaitTimeSystem Linux Disk AverageReadWaitTimemicrosecMonitors the average wait time for read requests.
system_linux_disk_averageRequestWaitTimeSystem Linux Disk AverageRequestWaitTimemicrosecMonitors the time a disk operation (read and write) waits in the I/O queue before being processed.
system_linux_disk_writeOperationsRateSystem Linux Disk WriteOperationsRatepsecMonitors the rate of write operations issued to the disk per second.
system_linux_disk_writeThroughputSystem Linux Disk WriteThroughputKBpsMonitors the rate of data written to the disk per second.
system_linux_disk_averageWriteWaitTimeSystem Linux Disk AverageWriteWaitTimemicrosecMonitors the average wait time for write requests.
Agent G2 - Linux OS Resource Parameters - v2system_linux_openFileDescriptors_UtilizationSystem Linux OpenFileDescriptors Utilization%Linux Open File Descriptors Used Percentage
system_linux_openFileDescriptors_UsedCountSystem Linux OpenFileDescriptors Used CountcountCurrent number of Open File Descriptors
system_linux_loggedInUsers_PctSystem Linux Logged In Users Pct%Current number of logged in users percentage
system_linux_loggedInUsers_CountSystem Linux LoggedInUsers CountcountCurrent number of logged in users
system_linux_runningProcesses_PctSystem Linux Running Processes Pct%Current running processes percentage
system_linux_runningProcesses_CountSystem Linux RunningProcesses CountcountCurrent number of running processes
system_linux_Semaphores_UtilizationSystem Linux Semaphores Utilization%Semaphore arrays or sets used percentage
system_linux_Semaphores_UsedCountSystem Linux Semaphores Used CountcountCurrent number of semaphore arrays in use
system_linux_messageQueueIDs_UtilizationSystem Linux MessageQueueIDs Utilization%Used percentage of current message queue ID's
system_linux_messageQueueIDs_UsedCountSystem Linux MessageQueueIDs Used CountcountCurrent number of message queue ID's in use
system_linux_sharedMemoryIDs_UtilizationSystem Linux SharedMemoryIDs Utilization%Used percentage of shared memory ID's
system_linux_sharedMemoryIDs_UsedCountSystem Linux SharedMemoryIDs Used CountcountCurrent number of shared memory ID's in use
system_linux_createdProcessesPerSecSystem Linux CreatedProcesses PerSecCount per secProvides the count of processes created or managed per second on the Linux system since the system was rebooted.
system_linux_pageFaultsPerSecSystem Linux PageFaults PerSecCount per secMonitors the count of page faults per second on the Linux system.
system_linux_kernelContextSwitchesPerSecSystem Linux KernelContextSwitches PerSecCount per secMonitors the count of context switches per second on the Linux system.

Agent G2 - Linux Process Statistics Monitoring

Description

Monitors the process statistics of the given process list. Provide the process list in the custom arguments configuration parameter.

Prerequisites

NULL

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux Process Statistics Monitorsystem.process.stats.cpuSystem Process Stats Cpu Usage%Monitors Cpu Usage of each process of Linux Device.
system.process.stats.countSystem Process Stats Instance CountcountMonitors Instance count of each process of Linux Device.
system.process.stats.threadsSystem Process Stats Thread CountcountMonitors thread count of each process of Linux Device.
system.process.stats.open.fdsSystem Process Stats Open Fd countcountMonitors open fd count of each process of Linux Device.
system.process.stats.memorySystem Process Stats Memory Usage%Monitors Memory Usage of each process of Linux Device.

Agent G2 - Linux Process Statistics Monitoring - v2

Description

Agent G2 - Linux Process Statistics Monitoring - v2

Prerequisites

NULL

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux Process Statistics Monitor -v2system.process.stats.cpuSystem Process Stats Cpu Usage%Monitors Cpu Usage of each process of Linux Device.
system.process.stats.countSystem Process Stats Instance CountcountMonitors Instance count of each process of Linux Device.
system.process.stats.threadsSystem Process Stats Thread CountcountMonitors thread count of each process of Linux Device.
system.process.stats.open.fdsSystem Process Stats Open Fd countcountMonitors open fd count of each process of Linux Device.
system.process.stats.memorySystem Process Stats Memory Usage%Monitors Memory Usage of each process of Linux Device.

Agent G2 - Linux Veritas Cluster Monitoring

Description

Template to monitor Veritas Linux cluster parameters such as cluster node state, service group state, resource state, and service group failover status. This template needs to be assigned on all cluster nodes. It was tested with a two-node Veritas Linux cluster (version “Veritas_InfoScale_7.3.1”) running on CentOS 7.

Prerequisites

NULL

Monitor NameMetric NameMetric Display NameUnitDescription
G2 - Linux Veritas Cluster Monitorsystem_linux_veritas_cluster_resource_stateSystem Linux Veritas Cluster Resource StateSystem Linux Veritas Cluster Resource State - Below are the possible states of the resource:
OFFLINE:0
ONLINE:1
FAULTED:2
PARTIAL:3
STARTING:4
STOPPING:5
MIGRATING:6
OFFLINE|FAULTED:7
OFFLINE|STARTING:8
PARTIAL|FAULTED:9
PARTIAL|STARTING:10
PARTIAL|STOPPING:11
ONLINE|STOPPING:12
system_linux_veritas_cluster_group_online_statusSystem Linux Veritas Cluster Group Online StatusSystem Linux Veritas Cluster Group Online Status - Below are the possible values :
0 - Service group online on cluster node.
1 - Service group not online on any cluster node.
system_linux_veritas_cluster_resource_online_statusSystem Linux Veritas Cluster Resource Online StatusSystem Linux Veritas Cluster Resource Online Status - Below are the possible values: 0 - Resource state is online on any cluster node, 1 - Resource state is not online on any cluster node
system_linux_veritas_cluster_node_stateSystem Linux Veritas Cluster Node StateSystem Linux Veritas Cluster Node State - Below are the possible states:
RUNNING : 0
ADMIN_WAIT : 1
CURRENT_DISCOVER_WAIT : 2
CURRENT_PEER_WAIT : 3
EXITING : 4
EXITED : 5
EXITING_FORCIBLY : 6
FAULTED : 7
INITING : 8
LEAVING : 9
LOCAL_BUILD : 10
REMOTE_BUILD : 11
STALE_ADMIN_WAIT : 12
STALE_DISCOVER_WAIT : 13
STALE_PEER_WAIT : 14
UNKNOWN : 15
system_linux_veritas_cluster_group_stateSystem Linux Veritas Cluster Group StateSystem Linux Veritas Cluster Group State - Below are the possible values :
OFFLINE : 0
ONLINE : 1
FAULTED : 2
PARTIAL : 3
STARTING : 4
STOPPING : 5
MIGRATING : 6
OFFLINE|FAULTED : 7
OFFLINE|STARTING : 8
PARTIAL|FAULTED : 9
PARTIAL|STARTING : 10
PARTIAL|STOPPING : 11
ONLINE|STOPPING : 12
G2 - Linux Veritas Cluster Group Failover Monitorsystem_linux_veritas_cluster_group_failover_statusSystem Linux Veritas Cluster Group Failover StatusSystem Linux Veritas Cluster Group Failover Status - Below are the possible values:
0 - No change.
1 - Cluster group change from one node to another due to failover.
2 - The specific cluster group is not online on any cluster node.
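
When consuming these metrics, the numeric values can be translated with a simple lookup. The sketch below encodes the group and resource state table listed above and a possible reading of the failover status values. The function names and the way the previous and current nodes are obtained are assumptions, not the agent's implementation.

```python
from typing import Optional

# State-to-code mapping copied from the group/resource state lists above.
STATE_CODES = {
    "OFFLINE": 0, "ONLINE": 1, "FAULTED": 2, "PARTIAL": 3,
    "STARTING": 4, "STOPPING": 5, "MIGRATING": 6,
    "OFFLINE|FAULTED": 7, "OFFLINE|STARTING": 8,
    "PARTIAL|FAULTED": 9, "PARTIAL|STARTING": 10,
    "PARTIAL|STOPPING": 11, "ONLINE|STOPPING": 12,
}


def state_code(state: str) -> Optional[int]:
    """Map a state string to its metric value; None if the state is unknown."""
    normalized = "|".join(part.strip().upper() for part in state.split("|"))
    return STATE_CODES.get(normalized)


def failover_status(previous_node: Optional[str], current_node: Optional[str]) -> int:
    """Hypothetical derivation of the failover status values 0, 1 and 2."""
    if current_node is None:
        return 2  # group is not online on any cluster node
    if previous_node is not None and previous_node != current_node:
        return 1  # group moved from one node to another
    return 0      # no change
```

The reverse mapping (code to state name) is useful for alert text and can be built with `{v: k for k, v in STATE_CODES.items()}`.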

Agent G2 - Logfile Monitoring

Description

The Logfile monitor is used to validate whether the given input string is found or not in the specified input logfile. It sends an alert based on the check type (yes or no), and regex patterns are also allowed in the input string. We have added support for alert tokens in G2 logfile monitoring. In the global template, we have included all tokens, and customers can choose the required tokens based on their requirements.
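
The check-type behavior can be sketched as follows, assuming that "yes" means alert when the pattern is found and "no" means alert when it is absent; the document does not spell out this mapping, and `should_alert` is a hypothetical name.

```python
import re


def should_alert(log_text: str, pattern: str, check_type: str) -> bool:
    """Decide whether to alert, given a search pattern and a check type.

    Assumption: check_type "yes" alerts when the pattern is found,
    and "no" alerts when the pattern is not found.
    """
    regex = re.compile(pattern)
    found = any(regex.search(line) for line in log_text.splitlines())
    if check_type == "yes":
        return found
    if check_type == "no":
        return not found
    raise ValueError("check_type must be 'yes' or 'no'")
```

Because the input string is treated as a regular expression, literal characters such as `.` or `[` need escaping, for example with `re.escape`, when a plain-text match is intended.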

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 Logfile Monitorsystem_logfile_search_monitorsystem_logfile_search_monitorValidates whether the given input string is found in the given input logfile and sends alerts accordingly.

Agent G2 - Microsoft Active Directory 2003 - Performance Counters

Description

Template for AD 2003 Servers.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Active Directory 2003 - Performance CountersDSServerBindsPersecDSServerBindsPersecShows the number of DC-to-DC binds per second that are serviced by this DC.
LDAPClientSessionsLDAPClientSessionsThe number of sessions of connected LDAP clients. Lack of activity points to network problems.
DRAOutboundObjectsPersecDRAOutboundObjectsPersecThe number of objects sent (per second) through outbound replication to replication partners.
ABClientSessionsABClientSessionsAB Client Sessions is the number of connected Address Book client sessions.
DRAInboundObjectsPersecDRAInboundObjectsPersecThe number of objects received (per second) through inbound replication from replication partners.
DRAInboundObjectsAppliedPersecDRAInboundObjectsAppliedPersecThis counter excludes changes that are received but not applied (for example, when the update is already made) and also how many replication updates are occurring on the server as a result of changes generated on other servers.
KerberosAuthenticationsKerberosAuthenticationsThe number of times per second that clients use a client ticket to this domain controller to authenticate to this domain controller. A lack of activity can indicate network problems that are preventing authentication requests from succeeding.
DRAInboundObjectUpdatesRemaininginPacketDRAInboundObjectUpdatesRemaininginPacketThis counter tells you whether the monitored server is receiving changes, but is taking a long time applying them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade).
NTLMAuthenticationsNTLMAuthenticationsThe number of NTLM authentications (per second) serviced by this domain controller
LDAPSearchesPersecLDAPSearchesPersecThe number of search operations per second performed by LDAP clients. A lack of activity points to network problems.
DSDirectoryReadsPersecDSDirectoryReadsPersecShows the number of directory reads per second.
DSDirectoryWritesPersecDSDirectoryWritesPersecShows the number of directory writes per second.
LDAPActiveThreadsLDAPActiveThreadsLDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local directory service.
LDAPWritesPersecLDAPWritesPersecShows the rate at which LDAP clients perform write operations.
DRAOutboundBytesTotalPersecDRAOutboundBytesTotalPersecBytes per secondIt is the sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication.
DSNotifyQueueSizeDSNotifyQueueSizeThe number of pending update notifications that have been queued, but not yet transmitted to clients.
DRAInboundBytesTotalPersecDRAInboundBytesTotalPersecBytes per secondIt is the sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication.
LDAPUDPoperationsPersecLDAPUDPoperationsPersecShows the number of UDP operations that the LDAP server is processing per second.
LDAPBindTimeLDAPBindTimeMillisecondsThis counter shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems.
DSClientBindsPersecDSClientBindsPersecShows the number of Ntdsapi.dll binds per second serviced by this DC.
DRAPendingReplicationSynchronizationsDRAPendingReplicationSynchronizationsThe number of directory synchronizations that are queued for this server that are not yet processed. This counter helps in determining replication backlog - the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication.

Agent G2 - Microsoft Active Directory 2008 - Performance Counters

Description

Template for AD 2008 Servers.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Active Directory 2008 - Performance CountersLDAPWritesPersecLDAPWritesPersecShows the rate at which LDAP clients perform write operations.
NTLMAuthenticationsNTLMAuthenticationsThe number of NTLM authentications (per second) serviced by this domain controller
LDAPClientSessionsLDAPClientSessionsThe number of sessions of connected LDAP clients. Lack of activity points to network problems.
ABClientSessionsABClientSessionsAB Client Sessions is the number of connected Address Book client sessions.
DSClientBindsPersecDSClientBindsPersecShows the number of Ntdsapi.dll binds per second serviced by this DC.
LDAPActiveThreadsLDAPActiveThreadsLDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local directory service.
LDAPUDPoperationsPersecLDAPUDPoperationsPersecShows the number of UDP operations that the LDAP server is processing per second.
LDAPBindTimeLDAPBindTimeMillisecondsThis counter shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems.
DRAOutboundObjectsPersecDRAOutboundObjectsPersecThe number of objects sent (per second) through outbound replication to replication partners.
DRAInboundObjectsPersecDRAInboundObjectsPersecThe number of objects received (per second) through inbound replication from replication partners.
DRAPendingReplicationSynchronizationsDRAPendingReplicationSynchronizationsThe number of directory synchronizations that are queued for this server that are not yet processed. This counter helps in determining replication backlog - the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication.
DSNotifyQueueSizeDSNotifyQueueSizeThe number of pending update notifications that have been queued, but not yet transmitted to clients.
DSServerBindsPersecDSServerBindsPersecShows the number of DC-to-DC binds per second that are serviced by this DC.
DSDirectoryReadsPersecDSDirectoryReadsPersecShows the number of directory reads per second.
DRAInboundObjectUpdatesRemaininginPacketDRAInboundObjectUpdatesRemaininginPacketThis counter tells you whether the monitored server is receiving changes, but is taking a long time applying them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade).
KerberosAuthenticationsKerberosAuthenticationsThe number of times per second that clients use a ticket to authenticate to this domain controller. A lack of activity can indicate network problems that are preventing authentication requests from succeeding.
DRAInboundObjectsAppliedPersecDRAInboundObjectsAppliedPersecShows the rate at which replication updates received from replication partners are applied by the local directory service. This counter excludes changes that are received but not applied (for example, when the update is already made). It indicates how many replication updates are occurring on the server as a result of changes generated on other servers.
LDAPSearchesPersecLDAPSearchesPersecThe number of search operations per second performed by LDAP clients. A lack of activity points to network problems.
DSDirectoryWritesPersecDSDirectoryWritesPersecShows the number of directory writes per second.
DRAOutboundBytesTotalPersecDRAOutboundBytesTotalPersecBytes per secondIt is the sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication.
DRAInboundBytesTotalPersecDRAInboundBytesTotalPersecBytes per secondIt is the sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication.

Agent G2 - Microsoft Active Directory 2012 DotNet v4 - Performance Counters

Description

Monitors Microsoft Active Directory Performance Counters

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Active Directory 2012 DotNet v4 - Performance CountersLDAPWritesPersecLDAPWritesPersecShows the rate at which LDAP clients perform write operations.
DRAOutboundBytesTotalPersecDRAOutboundBytesTotalPersecBytes per secondIt is the sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication.
DSClientBindsPersecDSClientBindsPersecShows the number of Ntdsapi.dll binds per second serviced by this DC.
LDAPClientSessionsLDAPClientSessionsThe number of sessions of connected LDAP clients. Lack of activity points to network problems.
DRAInboundObjectsPersecDRAInboundObjectsPersecThe number of objects received (per second) through inbound replication from replication partners.
DSDirectoryReadsPersecDSDirectoryReadsPersecShows the number of directory reads per second.
LDAPUDPoperationsPersecLDAPUDPoperationsPersecShows the number of UDP operations that the LDAP server is processing per second.
LDAPActiveThreadsLDAPActiveThreadsLDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local directory service.
LDAPSearchesPersecLDAPSearchesPersecThe number of search operations per second performed by LDAP clients. A lack of activity points to network problems.
LDAPBindTimeLDAPBindTimeMillisecondsThis counter shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems.
DSDirectoryWritesPersecDSDirectoryWritesPersecShows the number of directory writes per second.
DRAPendingReplicationSynchronizationsDRAPendingReplicationSynchronizationsThe number of directory synchronizations that are queued for this server that are not yet processed. This counter helps in determining replication backlog - the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication.
ABClientSessionsABClientSessionsAB Client Sessions is the number of connected Address Book client sessions.
DSServerBindsPersecDSServerBindsPersecShows the number of DC-to-DC binds per second that are serviced by this DC.
DRAInboundBytesTotalPersecDRAInboundBytesTotalPersecBytes per secondIt is the sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication.
DSNotifyQueueSizeDSNotifyQueueSizeThe number of pending update notifications that have been queued, but not yet transmitted to clients.
DRAInboundObjectsAppliedPersecDRAInboundObjectsAppliedPersecShows the rate at which replication updates received from replication partners are applied by the local directory service. This counter excludes changes that are received but not applied (for example, when the update is already made). It indicates how many replication updates are occurring on the server as a result of changes generated on other servers.
DRAInboundObjectUpdatesRemaininginPacketDRAInboundObjectUpdatesRemaininginPacketThis counter tells you whether the monitored server is receiving changes, but is taking a long time applying them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade).
DRAOutboundObjectsPersecDRAOutboundObjectsPersecThe number of objects sent (per second) through outbound replication to replication partners.

Agent G2 - Microsoft Active Directory 2019 DotNet v4- Performance Counters

Description

Monitors Microsoft Active Directory Performance Counters

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Active Directory 2019 DotNet v4- Performance CountersDSNotifyQueueSizeDSNotifyQueueSizeThe number of pending update notifications that have been queued, but not yet transmitted to clients.
DSServerBindsPersecDSServerBindsPersecShows the number of DC-to-DC binds per second that are serviced by this DC.
DRAOutboundBytesTotalPersecDRAOutboundBytesTotalPersecBytes per secondIt is the sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication.
DRAPendingReplicationSynchronizationsDRAPendingReplicationSynchronizationsThe number of directory synchronizations that are queued for this server that are not yet processed. This counter helps in determining replication backlog - the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication.
DRAInboundObjectsAppliedPersecDRAInboundObjectsAppliedPersecShows the rate at which replication updates received from replication partners are applied by the local directory service. This counter excludes changes that are received but not applied (for example, when the update is already made). It indicates how many replication updates are occurring on the server as a result of changes generated on other servers.
DSDirectoryWritesPersecDSDirectoryWritesPersecShows the number of directory writes per second.
DRAInboundBytesTotalPersecDRAInboundBytesTotalPersecBytes per secondIt is the sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication.
DSDirectoryReadsPersecDSDirectoryReadsPersecShows the number of directory reads per second.
LDAPBindTimeLDAPBindTimeMillisecondsThis counter shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems.
LDAPClientSessionsLDAPClientSessionsThe number of sessions of connected LDAP clients. Lack of activity points to network problems.
DRAOutboundObjectsPersecDRAOutboundObjectsPersecThe number of objects sent (per second) through outbound replication to replication partners.
LDAPWritesPersecLDAPWritesPersecShows the rate at which LDAP clients perform write operations.
DRAInboundObjectUpdatesRemaininginPacketDRAInboundObjectUpdatesRemaininginPacketThis counter tells you whether the monitored server is receiving changes, but is taking a long time applying them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade).
LDAPSearchesPersecLDAPSearchesPersecThe number of search operations per second performed by LDAP clients. A lack of activity points to network problems.
ABClientSessionsABClientSessionsAB Client Sessions is the number of connected Address Book client sessions.
DSClientBindsPersecDSClientBindsPersecShows the number of Ntdsapi.dll binds per second serviced by this DC.
LDAPActiveThreadsLDAPActiveThreadsLDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local directory service.
DRAInboundObjectsPersecDRAInboundObjectsPersecThe number of objects received (per second) through inbound replication from replication partners.
LDAPUDPoperationsPersecLDAPUDPoperationsPersecShows the number of UDP operations that the LDAP server is processing per second.

Agent G2 - Microsoft DotNet Performance Counters DotNet v4

Description

Monitors Microsoft DotNet Performance Counters

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft DotNet Performance Counters DotNet v4netclr.numberof.current.physicalthreadsNETCLR Number of Current Physical ThreadsShows the number of native OS threads created and owned by the CLR to act as underlying threads for .NET thread objects. This counter's value does not include the threads used by the CLR in its internal operations; it is a subset of the threads in the OS process.
aspnet.requests.currentASPNET Requests CurrentShows the current number of requests, including those that are queued, currently executing, or waiting to be written to the client. Under the ASP.NET process model, when this counter exceeds the requestQueueLimit defined in the processModel configuration section, ASP.NET will begin rejecting requests.
aspnet.request.wait.timeASPNET Request Wait TimeMSShows the number of milliseconds the most recent request was waiting in the queue.
netclr.current.queue.lengthNETCLR Current Queue LengthShows the total number of threads currently waiting to acquire some managed lock in the application. This counter is not an average over time; it displays the last observed value.
netclr.numberofexceps.thrown.persecNETCLR Number of Exceps Thrown Per secShows the number of exceptions thrown per second. These include both .NET exceptions and unmanaged exceptions that are converted into .NET exceptions; for example, a null pointer reference exception in unmanaged code is re-thrown in managed code as a .NET System.NullReferenceException. This counter includes both handled and unhandled exceptions.
webservice.get.requests.persecWeb Service Get Requests Per secShows the rate at which HTTP requests using the GET method are made. GET requests are the most common HTTP requests.
netclr.numberof.current.logicalthreadsNETCLR Number of Current Logical ThreadsShows the number of current .NET thread objects in the application. A .NET thread object is created either by new System.Threading.Thread or when an unmanaged thread enters the managed environment. This counter maintains the count of both running and stopped threads. This counter is not an average over time; it just displays the last observed value.
webservice.current.connectionsWeb Service Current ConnectionsShows the current number of connections established with the Web service.
netclr.contention.rate.persecNETCLR Contention Rate Per secShows the rate at which threads in the runtime attempt to acquire a managed lock unsuccessfully. Managed locks can be acquired in several ways: by the "lock" statement in C#, by calling System.Threading.Monitor.Enter, or by using the MethodImplOptions.Synchronized custom attribute.
webservice.post.requests.persecWeb Service Post Requests Per secShows the rate at which HTTP requests using the POST method are made.
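
The aspnet.requests.current description states that ASP.NET begins rejecting requests once the counter exceeds the requestQueueLimit from the processModel configuration section. A minimal sketch of that comparison follows, assuming both numbers are already available to the caller; the function names are hypothetical and the limit shown in the usage lines is only an illustrative value, not a recommendation.

```python
def is_rejecting(requests_current: int, request_queue_limit: int) -> bool:
    """True when the counter exceeds the configured requestQueueLimit."""
    return requests_current > request_queue_limit


def queue_headroom(requests_current: int, request_queue_limit: int) -> int:
    """Requests that can still arrive before the limit is reached.

    A negative result means the limit is already exceeded.
    """
    return request_queue_limit - requests_current


if __name__ == "__main__":
    limit = 5000  # illustrative value; read the real one from configuration
    print(is_rejecting(4200, limit), queue_headroom(4200, limit))
```

Tracking the headroom as well as the yes/no result makes it possible to alert before requests are actually rejected.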

Agent G2 - Microsoft Exchange 2007 - Server Role - Client Access Servers (CAS) DotNet v4

Description

Applicable on Exchange Servers with the CAS role

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2007 - Server Role - Client Access Servers DotNet v4SyncCommandsPersecSync Commands Per secSync Commands/sec is the number of Sync commands that are processed per second. Clients use this command to synchronize items within a folder.
DownloadTasksCompletedDownload Tasks CompletedDownload Tasks Completed is the number of OAB download tasks completed.
OWARequestsPersecOWA Requests Per secRequests/sec is the number of requests handled by Outlook Web App per second.
DownloadTaskQueuedDownload Task QueuedDownload Task Queued is one (1) if a task is queued for execution, otherwise zero (0).
AverageSearchTimeAverage Search TimeAverage Search Time is the average time that elapsed while waiting for a search to complete.
CurrentConnectionsCurrent ConnectionsThe number of active connections to the WWW service.
AverageTimetoProcessaFreeBusyRequestAverage Time to Process a Free Busy RequestAverage Time to Process a Free Busy Request is the average time to process a free busy request in seconds. One request may contain multiple mailboxes. Free busy responses do not have meeting suggestions.
AverageResponseTimeAverage Response TimeAverage Response Time is the average time (in milliseconds) that elapsed between the beginning and end of an OEH or ASPX request.
ActiveSyncRequestsPersecActiveSync Requests Per secShows the number of HTTP requests that are received from the client via ASP.NET per second.
AvailabilityRequestssecAvailability Requests per secAvailability Requests per second is the number of requests serviced per second. The request can be only for free busy or include suggestions. One request may contain multiple mailboxes.

Agent G2 - Microsoft Exchange 2007 - Server Role - Client Access Servers ASPNET (CAS ASPNET) DotNet v4

Description

Applicable on Exchange Servers with CAS ASPNET role

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2007 - Server Role - Client Access Servers ASPNET DotNet v4RequestWaitTimeRequest Wait TimeShows the number of milliseconds the most recent request was waiting in the queue.
WorkerProcessRestartsWorker Process RestartsNumber of times a worker process has restarted on the machine.
RequestsCurrentRequests CurrentThe current number of requests both executing and queued.
ApplicationRestartsApplication RestartsNumber of times the application has been restarted during the web server lifetime.
RequestsInApplicationQueueRequests In Application QueueThe number of requests in the application request queue.

Agent G2 - Microsoft Exchange 2007 - Server Role - Edge Transport Server (ETS) DotNet v4

Description

Applicable on Exchange Servers with the ETS role

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2007 - Server Role - Edge Transport Server DotNet v4LogThreadsWaitingLog Threads WaitingExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks.
IODatabaseReadsPersecIO Database Reads Per secI/O Database Reads/sec is the rate of database read operations completed.
RetryRemoteDeliveryQueueLengthRetry Remote Delivery Queue LengthRetry Remote Delivery Queue Length is the number of messages in retry in the remote delivery queues.
SubmissionQueueLengthSubmission Queue LengthThe number of items in the submission queue when sample was taken.
LargestDeliveryQueueLengthLargest Delivery Queue LengthThe number of items in the largest delivery queue.
AggregateDeliveryQueueLengthAllQueuesAggregate Delivery Queue Length All QueuesThe number of messages queued for aggregate delivery.
VersionbucketsallocatedVersion Buckets AllocatedIt provides the total number of version buckets allocated.
ActiveRemoteDeliveryQueueLengthActive Remote Delivery Queue LengthThe number of messages queued for remote delivery.
LogRecordStallsPersecLog Record Stalls Per secExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second.
PoisonQueueLengthPoison Queue LengthThe number of messages in the poison message queue.
IODatabaseWritesPersecIO Database Writes Per secI/O Database Writes/sec is the rate of database write operations completed.
DatabaseCachePercentHitDatabase Cache Percent Hit%Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the percentage of database file page requests that were fulfilled by the database cache without causing a file operation. If this percentage is too low, the database cache size may be too small. This value should be over 90% for companies with majority online mode clients. It should be over 99% for companies with majority cached mode clients. If the hit ratio is less than these numbers, the database cache may be insufficient.
UnreachableQueueLengthUnreachable Queue LengthThe number of messages in Unreachable Queue when sample was taken.
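
The thresholds quoted in the table above for LogThreadsWaiting, LogRecordStallsPersec, and DatabaseCachePercentHit can be checked mechanically. The sketch below assumes the counter samples have already been collected into lists; the function names and the cached_mode_majority flag are hypothetical helpers introduced only for illustration.

```python
from statistics import mean
from typing import Sequence


def log_threads_waiting_ok(samples: Sequence[float]) -> bool:
    """Average should be less than 10. Raises StatisticsError if empty."""
    return mean(samples) < 10


def log_record_stalls_ok(samples: Sequence[float]) -> bool:
    """Average below 10 per second and no spike above 100 per second."""
    return mean(samples) < 10 and max(samples) <= 100


def cache_hit_ok(percent_hit: float, cached_mode_majority: bool) -> bool:
    """Over 90% for mostly online mode clients, over 99% for cached mode."""
    minimum = 99.0 if cached_mode_majority else 90.0
    return percent_hit > minimum


if __name__ == "__main__":
    # Hypothetical samples for one polling window.
    print(log_threads_waiting_ok([2, 4, 12]))
    print(log_record_stalls_ok([0, 1, 20]))
    print(cache_hit_ok(95.0, cached_mode_majority=True))
```

Checking the average and the maximum separately matters for the stall counter: a window can have a low average and still contain a spike above 100 per second.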

Agent G2 - Microsoft Exchange 2007 - Server Role - HUB Transport Servers (HTS) DotNet v4

Description

Applicable on Exchange Servers with the HTS role

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2007 - Server Role - HUB Transport Servers DotNet v4ActiveNonSmtpDeliveryQueueLengthActive Non-SMTP Delivery Queue LengthThe number of messages queued for delivery to a Non-SMTP transport. The value format is a 4-byte integer.
RetryNonSmtpDeliveryQueueLengthRetry Non-SMTP Delivery Queue LengthMeasures the number of messages currently in the retry non-SMTP delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent attempt to send the message may be successful. A queue length < 200 can be considered as OK.
ActiveMailboxDeliveryQueueLengthActive Mailbox Delivery Queue LengthThe number of messages queued for delivery to an active mailbox. The value format is a 4-byte integer.
MessagesCompletedDeliveryPerSecondMessages Completed Delivery Per SecondThis monitor is useful in assessing the efficiency and efficacy of the current design. It also provides insight into the interaction between different transport components, including the information store interface. It shows the number of messages that are delivered per second.
RetryMailboxDeliveryQueueLengthRetry Mailbox Delivery Queue LengthMeasures the number of messages currently in the retry mailbox delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent attempt to send the message may be successful. A queue length < 200 can be considered as OK.
MessagesQueuedforDeliveryPerSecondMessages Queued for Delivery Per SecondThis monitor is useful in assessing the efficiency and efficacy of the current design. It also provides insight into the interaction between different transport components, including the information store interface. It shows the number of messages that have been queued for delivery per second.
MessagesSubmittedPerSecondMessages Submitted Per SecondThis monitor is useful in assessing the efficiency and efficacy of the current design. It also provides insight into the interaction between different transport components, including the information store interface. It shows the number of messages that have been queued in the Submission queue per second.
SubmissionQueueLengthSubmission Queue LengthThe number of items in the submission queue when the sample was taken.
IODatabaseWritesPersecIO Database Writes Per secI/O Database Writes/sec is the rate of database write operations completed.
LogRecordStallsPersecLog Record Stalls Per secExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second.
AggregateDeliveryQueueLengthAllQueuesAggregate Delivery Queue Length All QueuesThe number of messages queued for aggregate delivery.
IODatabaseReadsPersecIO Database Reads Per secI/O Database Reads/sec is the rate of database read operations completed.
PoisonQueueLengthPoison Queue LengthThe number of messages in the poison message queue.
LargestDeliveryQueueLengthLargest Delivery Queue LengthThe number of items in the largest delivery queue.
LogThreadsWaitingLog Threads WaitingExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks.
ActiveRemoteDeliveryQueueLengthActive Remote Delivery Queue LengthThe number of messages queued for remote delivery.
UnreachableQueueLengthUnreachable Queue LengthThe number of messages in Unreachable Queue when the sample was taken.
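
Both retry queue descriptions above state that a queue length < 200 can be considered OK. A minimal sketch of that check follows, assuming the latest value of each retry queue metric is held in a dictionary keyed by metric name; the "REVIEW" label and the function name are assumptions, since the source only defines the OK side of the threshold.

```python
from typing import Dict

RETRY_QUEUE_OK_BELOW = 200  # from the metric descriptions: "< 200 ... OK"


def retry_queue_status(queue_lengths: Dict[str, int]) -> Dict[str, str]:
    """Label each retry queue 'OK' when its length is below 200."""
    return {
        name: "OK" if length < RETRY_QUEUE_OK_BELOW else "REVIEW"
        for name, length in queue_lengths.items()
    }


if __name__ == "__main__":
    # Hypothetical readings for the two retry queue metrics.
    readings = {
        "RetryMailboxDeliveryQueueLength": 12,
        "RetryNonSmtpDeliveryQueueLength": 240,
    }
    for name, status in retry_queue_status(readings).items():
        print(name, status)
```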

Agent G2 - Microsoft Exchange 2007 - Server Role - Mailbox Servers (MBS) DotNet v4

Description

Applicable on Exchange Servers with the MBS role

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2007 - Server Role - Mailbox Servers DotNet v4IODatabaseWritesAverageLatencyIO Database Writes Average LatencyExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the average length of time, in milliseconds, per database write operation.
DatabaseCachePercentHitDatabase Cache Percent Hit%Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the percentage of database file page requests that were fulfilled by the database cache without causing a file operation. If this percentage is too low, the database cache size may be too small. This value should be over 90% for companies with majority online mode clients. It should be over 99% for companies with majority cached mode clients. If the hit ratio is less than these numbers, the database cache may be insufficient.
RPCLatencyaveragemsecRPC Latency Average msecExchange communicates with Hub Transport servers via RPC. This counter is useful in isolating and determining issues involving the interface between the Microsoft Exchange Information Store service on the Mailbox server and Hub Transport servers. This monitor shows the average latency, in milliseconds, of RPC requests. The average is calculated over all RPCs since exrpc32 was loaded. The value should be less than 100 ms at all times.
MessagesDeliveredPersecMessages Delivered Per secExchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the rate at which messages are delivered to all recipients. It indicates the current message delivery rate to the store.
MessagesQueuedForSubmissionISMailboxMessages Queued For Submission IS MailboxMailbox servers depend on Hub Transport servers for message delivery. This monitor shows the current number of submitted messages that are not yet processed by the transport layer. This value should be below 50 at all times. A higher value for more than 15 minutes may indicate that there are connectivity issues to the transport servers or that backpressure is occurring.
DirectoryAccessLDAPReadsPersecDirectory Access LDAP Reads Per secExchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the current rate at which Lightweight Directory Access Protocol (LDAP) reads occur while processing requests for the client.
IODatabaseReadsAverageLatencyIO Database Reads Average LatencyExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the average length of time, in milliseconds, per database read operation.
RPCClientBackoffPersecRPC Client Backoff Per secExchange throttles RPC clients to prevent individual clients from overusing server resources. This monitor shows the rate at which the server notifies the client to back off. Higher values may indicate that the server is under a higher load, resulting in an increase in overall averaged RPC latencies and causing client throttling to occur. This can also occur when certain client user actions are being performed. Depending on what the client is doing and the rate at which RPC operations are occurring, it may be normal to see backoffs occurring.
RPCNumofSlowPacketsRPC Number of Slow PacketsWhen you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the number of RPC packets in the past 1,024 packets that have latencies longer than 2 seconds. This value should be less than 1 on average, and should be less than 3 at all times.
ClientRPCsFailedServerTooBusyPersecClient RPCs Failed Server Too Busy Per secExchange throttles RPC clients to prevent individual clients from overusing server resources. This monitor shows the client-reported rate of failed RPCs (since the store was started) due to the Server Too Busy RPC error. This value should be 0 at all times. Higher values may indicate RPC threads are exhausted or client throttling is occurring for clients running versions of Outlook earlier than Microsoft Office Outlook 2007.
DirectoryAccessLDAPSearchesPersecDirectory Access LDAP Searches Per secExchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the current rate at which LDAP searches occur while processing requests for the client.
LogRecordStallsPersecLog Record Stalls Per secExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second.
ReplicationReceiveQueueSizeReplication Receive Queue SizeThis monitor shows the number of replication messages waiting to be processed. This value should be less than 100 at all times. This value should return to a minimum value between replication intervals.
MessagesQueuedForSubmissionISPublicMessages Queued For Submission IS PublicMailbox servers depend on Hub Transport servers for message delivery. This monitor shows the current number of submitted messages that are not yet processed by the transport layer. This value should be below 20 at all times.
UserCountUser CountThis monitor shows the number of users connected to the information store. It can be used to determine the current user load.
DatabaseCacheSizeMBDatabase Cache Size MBExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the amount of system memory, in megabytes, used by the database cache manager to hold commonly used information from the database files to prevent file operations. If the database cache size seems too small for optimal performance and there is little available memory on the system (check the value of Memory/Available Bytes), adding more memory to the system may increase performance. If there is ample memory on the system and the database cache size is not growing beyond a certain point, the database cache size may be capped at an artificially low limit. Increasing this limit may increase performance.
RPCRequestsoutstandingRPC Requests outstandingRPC Requests outstanding is the current number of outstanding RPC requests.
RPCAveragedLatencyRPC Averaged LatencyMillisecondsWhen you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the RPC latency, in milliseconds, averaged for all operations in the last 1,024 packets. This value should not be higher than 25 ms on average. To determine if certain protocols are causing overall RPC latencies, monitor MSExchangeIS Client (*)\RPC Average Latency to separate latencies based on client protocol. Cross-reference MSExchangeIS\RPC Client Backoff/sec to ensure higher latencies are not causing client throttling.
RPCRequestsRPC RequestsRPC Requests is the number of client requests that are currently being processed by the information store.
RPCOperationsPersecRPC Operations Per secWhen you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the current number of RPC operations that are occurring per second. Should closely correspond to historical baselines. Values much higher than expected indicate that the workload has changed, while values much lower than expected indicate a bottleneck preventing client requests from reaching the server.
LogThreadsWaitingLog Threads WaitingExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks.
DatabasePageFaultStallsPersecDatabase Page Fault Stalls Per secExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the rate at which database file page requests require the database cache manager to allocate a new page from the database cache. This should be 0 at all times. A non-zero value indicates that the database cannot flush dirty pages to the database file fast enough to free pages for new page allocations.
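
Several rows above pair an average limit with a spike limit, for example LogRecordStallsPersec (average below 10 per second, spikes not higher than 100 per second). The following is a minimal sketch of how such a pair could be evaluated over collected samples. The function name and the sample values are hypothetical and are not part of the agent.

```python
from statistics import mean

def check_average_and_spike(samples, avg_limit, spike_limit):
    """Return which limits a list of counter samples violates.

    "average" is reported when the mean is not below avg_limit;
    "spike" is reported when any sample is higher than spike_limit.
    """
    findings = []
    if not samples:
        return findings  # nothing collected, nothing to report
    if mean(samples) >= avg_limit:
        findings.append("average")
    if max(samples) > spike_limit:
        findings.append("spike")
    return findings

# Hypothetical LogRecordStallsPersec samples with the limits quoted above.
stalls_per_sec = [0, 2, 4, 120, 3]
print(check_average_and_spike(stalls_per_sec, avg_limit=10, spike_limit=100))
```

The same helper would apply to any metric documented with both an average and a maximum value.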

Agent G2 - Microsoft Exchange 2007 - Server Role - Mailbox Servers replication (MBSR) DotNet v4

Description

Applicable on Exchange Servers with the MBSR role

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Microsoft Exchange 2007 - Server Role - Mailbox Servers replication DotNet v4 | ReplayQueueLength | Replay Queue Length | | This monitor indicates issues involving the replication engine and replication partners. These issues can be local or remote. This monitor shows the number of transaction log files waiting to be replayed into the passive copy. It indicates the current replay queue length. Higher values cause longer store mount times when a handoff, failover, or activation is performed.
| CopyQueueLength | Copy Queue Length | | This monitor indicates issues involving the replication engine and replication partners. These issues can be local or remote. This monitor shows the number of transaction log files waiting to be copied to the passive copy log file folder. A copy is not considered complete until it has been checked for corruption. This value should be less than 10 at all times for cluster continuous replication (CCR). It should be less than 1 at all times for local continuous replication (LCR).
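
The CopyQueueLength limit depends on the replication type. A minimal sketch of that rule follows. The dictionary keys and the function name are illustrative assumptions; only the two limits come from the description above.

```python
# Limits quoted in the CopyQueueLength description; the keys are illustrative.
COPY_QUEUE_LIMITS = {"CCR": 10, "LCR": 1}

def copy_queue_ok(copy_queue_length, replication_type):
    """True when the queue length is strictly below the limit for the type."""
    return copy_queue_length < COPY_QUEUE_LIMITS[replication_type]

print(copy_queue_ok(3, "CCR"))
print(copy_queue_ok(1, "LCR"))
```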

Agent G2 - Microsoft Exchange 2007 - Server Role - Unified Messaging servers (UMS) DotNet v4

Description

Applicable on Exchange Servers with the UMS role

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Microsoft Exchange 2007 - Server Role - Unified Messaging servers DotNet v4 | OperationsoverSixSeconds | OperationsoverSixSeconds | NULL | Shows the number of all Unified Messaging operations that took more than six seconds to complete. This is the time during which a caller was waiting for Unified Messaging to respond.
| MailboxServerAccessFailures | MailboxServerAccessFailures | NULL | Shows the number of times the system failed to access a Mailbox server. This value should be 0 at all times. A non-zero value indicates that Unified Messaging is having problems with MAPI connectivity to Mailbox servers.
| CallsDisconnectedbyCallersDuringUMAudioHourglass | CallsDisconnectedbyCallersDuringUMAudioHourglass | NULL | Shows the number of calls during which the caller disconnected while Unified Messaging was playing the audio hourglass tones. This value should be 0 at all times. A non-zero value suggests excessive latency between a Unified Messaging server and the targeted domain controller.

Agent G2 - Microsoft Exchange 2007 General DotNet v4

Description

Monitors Microsoft Exchange 2007 General Performance Counters

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Microsoft Exchange 2007 General DotNet v4 | ConnectionAttemptsPersec | Connection Attempts Per Second | | The rate, per second, at which connections to the WWW service have been attempted since the service started.
| NumberBytesinallHeaps | Number Bytes in all Heaps | | Shows the sum of four other counters: Gen 0 Heap Size, Gen 1 Heap Size, Gen 2 Heap Size, and the Large Object Heap Size. This counter indicates the current memory allocated in bytes on the GC Heaps.
| NumberofExcepsThrownPersec | Number of Exceptions Thrown Per Second | | Displays the number of exceptions thrown per second. These include both .NET exceptions and unmanaged exceptions that get converted into .NET exceptions.
| PercentTimeinGC | Percent Time in GC | % | % Time in GC is the percentage of elapsed time that was spent performing a garbage collection (GC) since the last GC cycle. This counter is usually an indicator of the work done by the Garbage Collector on behalf of the application to collect and compact memory. This counter is updated only at the end of every GC, and the counter value reflects the last observed value; it is not an average.
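
NumberBytesinallHeaps is described as the sum of four heap sizes. The sketch below reproduces that arithmetic and adds each heap's share of the total. The function name and the input values are hypothetical.

```python
def heap_breakdown(gen0, gen1, gen2, large_object_heap):
    """Return (total_bytes, percent_share_per_heap) for the four GC heaps."""
    parts = {
        "gen0": gen0,
        "gen1": gen1,
        "gen2": gen2,
        "large_object_heap": large_object_heap,
    }
    total = sum(parts.values())
    # Guard against division by zero when all heaps are empty.
    shares = {name: (size * 100 / total if total else 0.0)
              for name, size in parts.items()}
    return total, shares

# Hypothetical heap sizes in bytes.
total, shares = heap_breakdown(100, 200, 300, 400)
print(total, shares)
```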

Agent G2 - Microsoft Exchange 2010 AD Access DC DotNet v4

Description

Monitor Microsoft Exchange 2010 AD Access DC performance counters

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Microsoft Exchange 2010 AD Access DC DotNet v4 | LDAPReadTime | LDAP Read Time | | Shows the time (in ms) to send an LDAP read request and receive a response.
| LDAPSearchTime | LDAP Search Time | Milliseconds | Shows the time (in ms) to send an LDAP search request and receive a response. Should be below 50 ms on average. Spikes (maximum values) shouldn't be higher than 100 ms.

Agent G2 - Microsoft Exchange 2010 DotNet v4

Description

Monitor Microsoft Exchange 2010 performance counters

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2010 DotNet v4ReceiveQueueSizeMailBoxReceive Queue Size MailboxMaximum allowed number of messages in the mailbox receive queue.
LDAPSearchesTimedOutLDAP Searches Timed OutLDAP Searches timed out per minute is the number of LDAP searches that returned LDAP_TIMEOUT during the last minute.
RPCAveragedLatencyRPC Averaged LatencyMillisecondsWhen you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the RPC latency, in milliseconds, averaged for all operations in the last 1,024 packets. This value should not be higher than 25 ms on average. To determine if certain protocols are causing overall RPC latencies, monitor MSExchangeIS Client (*)\RPC Average Latency to separate latencies based on client protocol. Cross-reference MSExchangeIS\RPC Client Backoff/sec to ensure higher latencies are not causing client throttling.
ActiveNonSmtpDeliveryQueueLengthActive Non-SMTP Delivery Queue LengthThe number of messages queued for delivery to a Non-SMTP transport. The value format is a 4-byte integer.
MessagesQueuedForSubmissionMailBoxMessages Queued For Submission MailboxMessages Queued For Submission is the current number of submitted messages which are not yet processed by transport.
PollingDelayPolling DelaySecondsPolling delay is the latency, in seconds, between when the most recent MAPI event was created and when it was polled.
LDAPSearchTimeLDAP Search TimeMillisecondsShows the time (in ms) to send an LDAP search request and receive a response. Should be below 50 ms on average. Spikes (maximum values) shouldn't be higher than 100 ms.
ClientLogonsMailBoxClient Logons MailboxThe number of client logons (including system processes). The average number of logons per client depends on the type and version of the client.
MessagesSentPersecMailBoxMessages Sent Persec MailboxMessages Sent/sec is the rate that messages are sent to the transport.
VersionbucketsallocatedVersion Buckets AllocatedIt provides the total number of version buckets allocated.
SubmissionQueueLengthSubmission Queue LengthThe number of items in the submission queue when sample was taken.
AverageDeliveryTimePublicFolderAverage Delivery Time Public FolderMillisecondsAverage Delivery Time is the average time in milliseconds between the submission of a message to the public store and the delivery to all local recipients (recipients on the same server) for the last 10 messages.
UnreachableQueueLengthUnreachable Queue LengthThe number of messages in Unreachable Queue when sample was taken.
MessagesDeliveredPersecMailBoxMessages Delivered Persec MailboxMessages Delivered/sec is the rate that messages are delivered to all recipients.
AverageDeliveryTimeMailBoxAverage Delivery Time MailboxMillisecondsAverage Delivery Time is the average time in milliseconds between the submission of a message to the mailbox store and the delivery to all local recipients (recipients on the same server) for the last 10 messages.
RPCRequestsRPC RequestsRPC Requests is the number of client requests that are currently being processed by the information store.
MessagesDeliveredPersecPublicFolderMessages Delivered Persec Public FolderMessages Delivered/sec is the rate that messages are delivered to all recipients.
ClientLogonsPublicFolderClient Logons Public FolderClient Logons is the number of clients (including system processes) currently logged on.
RetryRemoteDeliveryQueueLengthRetry Remote Delivery Queue LengthRetry Remote Delivery Queue Length is the number of messages in retry in the remote delivery queues.
LogGenerationCheckpointDepthLog Generation Checkpoint DepthLog Generation Checkpoint Depth represents the amount of work, in count of log files, that will need to be redone or undone to the database file(s) if the process crashes.
LargestDeliveryQueueLengthLargest Delivery Queue LengthThe number of items in the largest delivery queue.
IODatabaseReadsPersecI/O Database Reads PersecI/O Database Reads/sec is the rate of database read operations completed.
ActiveMailboxDeliveryQueueLengthActive Mailbox Delivery Queue LengthThe number of messages queued for delivery to an active mailbox. The value format is a 4-byte integer.
SendQueueSizeMailboxSend Queue Size MailboxIndicates the number of messages in the mailbox store's send queue.
PercentageofFailedEventDispatchersPercentage of Failed Event Dispatchers%It is the percentage of Event Dispatchers that are in failure mode.
PoisonQueueLengthPoison Queue LengthThe number of messages in the poison message queue.
LogBytesWritePersecLog Bytes Write PersecBytes per secondLog Bytes Write per second is the rate bytes are written to the log.
RetryMailboxDeliveryQueueLengthRetry Mailbox Delivery Queue LengthMeasures the number of messages currently in the retry mailbox delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent attempt to send the message may be successful. A queue length < 200 can be considered as OK.
SendQueueSizePublicFolderSend Queue Size Public FolderThe number of messages in the public folder store's send queue. The value should be less than 500.
RetryNonSmtpDeliveryQueueLengthRetry Non-SMTP Delivery Queue LengthMeasures the number of messages currently in the retry non-SMTP delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent attempt to send the message may be successful. A queue length < 200 can be considered as OK.
ReceiveQueueSizePublicFolderReceive Queue Size Public FolderIndicates the number of public folder replication messages waiting to be processed.
MessagesSentPersecPublicFolderMessages Sent Persec Public FolderMessages Sent/sec is the rate that messages are sent to the transport.
MessagesQueuedForSubmissionPublicFolderMessages Queued For Submission Public FolderMessages Queued For Submission is the current number of submitted messages which are not yet processed by transport.
MessageRecipientsDeliveredPersecMailBoxMessage Recipients Delivered Persec MailboxShows the rate at which messages are delivered to all recipients. Indicates current message delivery rate to the store.
AggregateDeliveryQueueLengthAllQueuesAggregate Delivery Queue Length All QueuesThe number of messages queued for aggregate delivery.
ActiveRemoteDeliveryQueueLengthActive Remote Delivery Queue LengthThe number of messages queued for remote delivery.
ClientRPCsSuccededClient RPCs SucceededThe client-reported total number of successful RPCs (since the store was started).
ClientLatency10secRPCClient Latency 10 sec RPCMillisecondsThe client-reported number of successful RPCs with latencies > 10 seconds.
ActiveClientLogonsPublicFolderActive Client Logons Public FolderActive Client Logons is the number of clients that performed any action within the last ten minute time interval.
IODatabaseWritesPersecI/O Database Writes PersecI/O Database Writes/sec is the rate of database write operations completed.
ActiveClientLogonsMailBoxActive Client Logons MailboxThe number of logons that have been active within the last ten minute time interval.
MessageRecipientsDeliveredPersecPublicFolderMessage Recipients Delivered Persec Public FolderMessage Recipients Delivered/sec is the rate that recipients receive messages.
LongRunningLDAPOperationsLong Running LDAP OperationsShows the number of LDAP operations on this domain controller per minute that took longer than the specified threshold. (The default threshold is 15 seconds.) Should be less than 50 at all times.
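
A few rows in this table state a simple upper bound (retry queue lengths below 200, public folder send queue below 500, long running LDAP operations below 50). As a rough sketch, one polled snapshot could be checked against those bounds as shown here. The snapshot values and the function name are hypothetical; the metric names and limits are taken from the table above.

```python
# Upper bounds quoted in the table above; a value is acceptable only if it
# is strictly below its bound.
UPPER_BOUNDS = {
    "RetryMailboxDeliveryQueueLength": 200,
    "RetryNonSmtpDeliveryQueueLength": 200,
    "SendQueueSizePublicFolder": 500,
    "LongRunningLDAPOperations": 50,
}

def breaches(snapshot):
    """Return the sorted names of metrics that are not below their bound."""
    return sorted(
        name for name, value in snapshot.items()
        if name in UPPER_BOUNDS and value >= UPPER_BOUNDS[name]
    )

# Hypothetical snapshot of one polling cycle.
snapshot = {
    "RetryMailboxDeliveryQueueLength": 12,
    "SendQueueSizePublicFolder": 640,
    "LongRunningLDAPOperations": 50,
    "UserCount": 900,  # no documented bound, so it is ignored
}
print(breaches(snapshot))
```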

Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Client Access Servers (CAS)

Description

Applicable on Exchange Servers with the Client Access servers (CAS) role

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Client Access Servers | DownloadTasksCompleted | Download Tasks Completed | NULL | Download Tasks Completed is the number of OAB download tasks completed.
| ActiveSyncRequestsPersec | ActiveSync Requests Per Second | NULL | Shows the number of HTTP requests that are received from the client via ASP.NET per second.
| AverageSearchTime | Average Search Time | NULL | Average Search Time is the average time that elapsed while waiting for a search to complete.
| CurrentConnections | Current Connections | NULL | The number of active connections to the WWW service.
| DownloadTaskQueued | Download Task Queued | NULL | Download Task Queued is one (1) if a task is queued for execution, otherwise zero (0).
| SyncCommandsPersec | Sync Commands Per Second | NULL | Sync Commands/sec is the number of Sync commands that are processed per second. Clients use this command to synchronize items within a folder.
| AvailabilityRequestssec | Availability Requests Per Second | NULL | Availability Requests per second is the number of requests serviced per second. The request can be only for free busy or include suggestions. One request may contain multiple mailboxes.
| OWARequestsPersec | OWA Requests Per Second | NULL | Requests/sec is the number of requests handled by Outlook Web App per second.
| AverageTimetoProcessaFreeBusyRequest | Average Time to Process a Free Busy Request | NULL | Average Time to Process a Free Busy Request is the average time to process a free busy request in seconds. One request may contain multiple mailboxes. Free busy responses do not have meeting suggestions.
| AverageResponseTime | Average Response Time | NULL | Average Response Time is the average time (in milliseconds) that elapsed between the beginning and end of an OEH or ASPX request.

Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Client Access Servers ASPNET (CAS ASPNET)

Description

Applicable on Exchange Servers with the Client Access servers ASPNET (CAS ASPNET) role

Prerequisites

NULL

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Client Access Servers ASPNET | ApplicationRestarts | Application Restarts | NULL | Number of times the application has been restarted during the web server lifetime.
| RequestWaitTime | Request Wait Time | NULL | The number of milliseconds the most recent request was waiting in the queue.
| RequestsInApplicationQueue | Requests In Application Queue | NULL | The current number of requests, including those that are queued, currently executing, or waiting to be written to the client. Under the ASP.NET process model, when this counter exceeds the requestQueueLimit defined in the processModel configuration section, ASP.NET will begin rejecting requests.
| WorkerProcessRestarts | Worker Process Restarts | NULL | Number of times a worker process has restarted on the machine.
| RequestsCurrent | Requests Current | NULL | The current number of requests, both executing and queued.

Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Edge Transport Server (ETS)

Description

Applicable on Exchange Servers with the Edge Transport servers (ETS) role

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Edge Transport ServerLargestDeliveryQueueLengthLargest Delivery Queue LengthNULLThe number of items in the largest delivery queue.
IODatabaseReadsPersecI/O Database Reads Per SecondNULLI/O Database Reads/sec is the rate of database read operations completed.
UnreachableQueueLengthUnreachable Queue LengthNULLThe number of messages in Unreachable Queue when sample was taken.
SubmissionQueueLengthSubmission Queue LengthNULLThe number of items in the submission queue when sample was taken.
IODatabaseWritesPersecI/O Database Writes Per SecondNULLI/O Database Writes/sec is the rate of database write operations completed.
LogThreadsWaitingLog Threads WaitingNULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks.
DatabaseCachePercentHitDatabase Cache Percent Hit%Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the percentage of database file page requests that were fulfilled by the database cache without causing a file operation. If this percentage is too low, the database cache size may be too small. This value should be over 90% for companies with majority online mode clients. It should be over 99% for companies with majority cached mode clients. If the hit ratio is less than these numbers, the database cache may be insufficient.
LogRecordStallsPersecLog Record Stalls Per SecondNULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second.
ActiveRemoteDeliveryQueueLengthActive Remote Delivery Queue LengthNULLThe number of messages queued for remote delivery.
VersionbucketsallocatedVersion Buckets AllocatedNULLIt provides the total number of version buckets allocated.
AggregateDeliveryQueueLengthAllQueuesAggregate Delivery Queue Length All QueuesNULLThe number of messages queued for aggregate delivery.
RetryRemoteDeliveryQueueLengthRetry Remote Delivery Queue LengthNULLRetry Remote Delivery Queue Length is the number of messages in retry in the remote delivery queues.
PoisonQueueLengthPoison Queue LengthNULLThe number of messages in the poison message queue.
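
The DatabaseCachePercentHit target depends on how most clients connect: over 90% for majority online mode clients, over 99% for majority cached mode clients. A minimal sketch of that rule follows. The mode labels and the function name are illustrative assumptions.

```python
# Minimum hit percentages quoted in the DatabaseCachePercentHit description;
# the "online" and "cached" keys are illustrative labels.
MIN_CACHE_HIT_PERCENT = {"online": 90.0, "cached": 99.0}

def cache_hit_sufficient(percent_hit, majority_client_mode):
    """True when the hit percentage is over the minimum for the client mode."""
    return percent_hit > MIN_CACHE_HIT_PERCENT[majority_client_mode]

print(cache_hit_sufficient(95.0, "online"))
print(cache_hit_sufficient(95.0, "cached"))
```

A result of False suggests, per the description above, that the database cache may be insufficient.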

Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - HUB Transport Servers (HTS)

Description

Applicable on Exchange Servers with the HUB Transport servers (HTS) role

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - HUB Transport ServersIODatabaseReadsPersecI/O Database Reads Per SecondNULLI/O Database Reads/sec is the rate of database read operations completed.
MessagesSubmittedPerSecondMessages Submitted Per SecondNULLThis monitor is useful in assessing the efficiency and efficacy of the current design. It also provides insight into the interaction between different transport components, including the information store interface. It shows the number of messages that have been queued in the Submission queue per second.
SubmissionQueueLengthSubmission Queue LengthNULLThe number of items in the submission queue when sample was taken.
UnreachableQueueLengthUnreachable Queue LengthNULLThe number of messages in Unreachable Queue when sample was taken.
ActiveMailboxDeliveryQueueLengthActive Mailbox Delivery Queue LengthNULLThe number of messages queued for delivery to an active mailbox. The value format is a 4-byte integer.
LogThreadsWaitingLog Threads WaitingNULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks.
MessagesCompletedDeliveryPerSecondMessages Completed Delivery Per SecondNULLThis monitor is useful in assessing the efficiency and efficacy of the current design. It also provides insight into the interaction between different transport components, including the information store interface. It shows the number of messages that are delivered per second.
AggregateDeliveryQueueLengthAllQueuesAggregate Delivery Queue Length All QueuesNULLThe number of messages queued for aggregate delivery.
RetryMailboxDeliveryQueueLengthRetry Mailbox Delivery Queue LengthNULLMeasures the number of messages currently in the retry mailbox delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent attempt to send the message may be successful. A queue length < 200 can be considered as OK.
RetryNonSmtpDeliveryQueueLengthRetry Non-SMTP Delivery Queue LengthNULLMeasures the number of messages currently in the retry non-SMTP delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent attempt to send the message may be successful. A queue length < 200 can be considered as OK.
VersionbucketsallocatedVersion Buckets AllocatedNULLIt provides the total number of version buckets allocated.
ActiveNonSmtpDeliveryQueueLengthActive Non-SMTP Delivery Queue LengthNULLThe number of messages queued for delivery to a Non-SMTP transport. The value format is a 4-byte integer.
IODatabaseWritesPersecI/O Database Writes Per SecondNULLI/O Database Writes/sec is the rate of database write operations completed.
PoisonQueueLengthPoison Queue LengthNULLThe number of messages in the poison message queue.
MessagesQueuedforDeliveryPerSecondMessages Queued for Delivery Per SecondNULLThis monitor is useful in assessing the efficiency and efficacy of the current design. It also provides insight into the interaction between different transport components, including the information store interface. It shows the number of messages that have been queued for delivery per second.
LargestDeliveryQueueLengthLargest Delivery Queue LengthNULLThe number of items in the largest delivery queue.
ActiveRemoteDeliveryQueueLengthActive Remote Delivery Queue LengthNULLThe number of messages queued for remote delivery.
LogRecordStallsPersecLog Record Stalls Per SecondNULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second.
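
The LogThreadsWaiting description notes that regular spikes concurrent with log record stall spikes point to a transaction log disk bottleneck. One possible way to look for such overlap is sketched below. It assumes both counters were sampled at the same instants and that the caller chooses what counts as a spike (here, the number of log spindles and the 100 per second spike limit). The function name and the sample data are hypothetical.

```python
def concurrent_spikes(threads_waiting, record_stalls, threads_limit, stalls_limit):
    """Return sample indices where both aligned series exceed their limits."""
    return [
        index
        for index, (threads, stalls) in enumerate(zip(threads_waiting, record_stalls))
        if threads > threads_limit and stalls > stalls_limit
    ]

# Hypothetical aligned samples for a server with 4 log spindles.
log_threads_waiting = [2, 12, 3, 15]
log_record_stalls = [0, 150, 200, 5]
print(concurrent_spikes(log_threads_waiting, log_record_stalls,
                        threads_limit=4, stalls_limit=100))
```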

Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Mailbox Servers (MBS)

Description

Applicable on Exchange Servers with the Mailbox Servers (MBS) role

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Mailbox ServersRPCLatencyaveragemsecRPC Latency Average (ms)NULLExchange communicates with Hub Transport servers via RPC. This counter is useful in isolating and determining issues involving the interface between the Microsoft Exchange Information Store service on the Mailbox server and Hub Transport servers. This monitor shows the average latency, in milliseconds, of RPC requests. The average is calculated over all RPCs since exrpc32 was loaded. The value should be less than 100 ms at all times.
UserCountUser CountNULLThis monitor shows the number of users connected to the information store. It can be used to determine current user load.
ReplicationReceiveQueueSizeReplication Receive Queue SizeNULLThis monitor shows the number of replication messages waiting to be processed. This value should be less than 100 at all times. This value should return to a minimum value between replication intervals.
LogThreadsWaitingLog Threads WaitingNULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks.
RPCAveragedLatencyRPC Averaged LatencyMillisecondsWhen you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the RPC latency, in milliseconds, averaged for all operations in the last 1,024 packets. This value should not be higher than 25 ms on average. To determine if certain protocols are causing overall RPC latencies, monitor MSExchangeIS Client (*)\RPC Average Latency to separate latencies based on client protocol. Cross-reference MSExchangeIS\RPC Client Backoff/sec to ensure higher latencies are not causing client throttling.
RPCRequestsoutstandingRPC Requests OutstandingNULLRPC Requests outstanding is the current number of outstanding RPC requests
MessagesQueuedForSubmissionISMailboxMessages Queued For Submission (IS Mailbox)NULLMailbox servers depend on Hub Transport servers for message delivery. This monitor shows the current number of submitted messages that are not yet processed by the transport layer. This value should be below 50 at all times. A higher value for more than 15 minutes may indicate that there are connectivity issues to the transport servers or that backpressure is occurring.
RPCNumofSlowPacketsRPC Number of Slow PacketsNULLWhen you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the number of RPC packets in the past 1,024 packets that have latencies longer than 2 seconds. This value should be less than 1 on average, and should be less than 3 at all times.
RPCRequestsRPC RequestsNULLRPC Requests is the number of client requests that are currently being processed by the information store.
DatabaseCacheSizeMBDatabase Cache Size (MB)NULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the amount of system memory, in megabytes, used by the database cache manager to hold commonly used information from the database files to prevent file operations. If the database cache size seems too small for optimal performance and there is little available memory on the system (check the value of Memory/Available Bytes), adding more memory to the system may increase performance. If there is ample memory on the system and the database cache size is not growing beyond a certain point, the database cache size may be capped at an artificially low limit. Increasing this limit may increase performance.
DirectoryAccessLDAPSearchesPersecDirectory Access LDAP Searches PersecNULLExchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the current rate that the LDAP searches occur while processing requests for the client.
RPCOperationsPersecRPC Operations PersecNULLWhen you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the current number of RPC operations that are occurring per second. Should closely correspond to historical baselines. Values much higher than expected indicate that the workload has changed, while values much lower than expected indicate a bottleneck preventing client requests from reaching the server.
RPCClientBackoffPersecRPC Client Backoff PersecNULLExchange throttles RPC clients to prevent individual clients from overusing server resources. This monitor shows the rate that the server notifies the client to back off. It indicates the rate at which client backoffs are occurring. Higher values may indicate that the server may be incurring a higher load resulting in an increase in overall averaged RPC latencies, causing client throttling to occur. This can also occur when certain client user actions are being performed. Depending on what the client is doing and the rate at which RPC operations are occurring, it may be normal to see backoffs occurring.
LogRecordStallsPersecLog Record Stalls PersecNULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second.
DatabasePageFaultStallsPersecDatabase Page Fault Stalls PersecNULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the rate of database file page requests that require the database cache manager to allocate a new page from the database cache. This should be 0 at all times. A non-zero value indicates that the database is not able to flush dirty pages to the database file fast enough to free pages for new page allocations.
DatabaseCachePercentHitDatabase Cache Percent Hit%Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the percentage of database file page requests that were fulfilled by the database cache without causing a file operation. If this percentage is too low, the database cache size may be too small. This value should be over 90% for companies with majority online mode clients. It should be over 99% for companies with majority cached mode clients. If the hit ratio is less than these numbers, the database cache may be insufficient.
MessagesDeliveredPersecMessages Delivered PersecNULLExchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the rate that messages are delivered to all recipients. It indicates current message delivery rate to the store.
IODatabaseReadsAverageLatencyIO Database Reads Average LatencyNULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the average length of time, in milliseconds, per database read operation.
MessagesQueuedForSubmissionISPublicMessages Queued For Submission (IS Public)NULLMailbox servers depend on Hub Transport servers for message delivery. This monitor shows the current number of submitted messages that are not yet processed by the transport layer. This value should be below 20 at all times.
DirectoryAccessLDAPReadsPersecDirectory Access LDAP Reads PersecNULLExchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the current rate that the Lightweight Directory Access Protocol (LDAP) reads occur while processing requests for the client.
ClientRPCsFailedServerTooBusyPersecClient RPCs Failed Server Too Busy PersecNULLExchange throttles RPC clients to prevent individual clients from overusing server resources. This monitor shows the client-reported rate of failed RPCs (since the store was started) due to the Server Too Busy RPC error. This value should be 0 at all times. Higher values may indicate RPC threads are exhausted or client throttling is occurring for clients running versions of Outlook earlier than Microsoft Office Outlook 2007.
IODatabaseWritesAverageLatencyIO Database Writes Average LatencyNULLExchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the average length of time, in milliseconds, per database write operation.
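
The guidance values quoted in the descriptions above can be collected into a simple check. The following is a minimal sketch and not part of the template: the `THRESHOLDS` table, the `check` function, and the sample readings are hypothetical, and only the limits themselves are taken from the descriptions.

```python
import operator

# Limits quoted in the metric descriptions above.
# The table layout and the function below are illustrative assumptions.
THRESHOLDS = {
    "LogRecordStallsPersec": (operator.lt, 10),          # average below 10 per second
    "DatabasePageFaultStallsPersec": (operator.eq, 0),   # should be 0 at all times
    "DatabaseCachePercentHit": (operator.gt, 90),        # over 90% (online mode clients)
    "MessagesQueuedForSubmissionISPublic": (operator.lt, 20),
    "ClientRPCsFailedServerTooBusyPersec": (operator.eq, 0),
}


def check(readings):
    """Return the sorted names of metrics whose reading violates its limit."""
    violations = []
    for name, value in readings.items():
        rule = THRESHOLDS.get(name)
        if rule is None:
            continue  # no guidance value is given for this metric
        compare, limit = rule
        if not compare(value, limit):
            violations.append(name)
    return sorted(violations)


# Hypothetical readings, for illustration only.
sample = {
    "LogRecordStallsPersec": 12,
    "DatabasePageFaultStallsPersec": 0,
    "DatabaseCachePercentHit": 99.5,
}
print(check(sample))
```

For sites with mostly cached mode clients, the cache hit limit would be 99 rather than 90, as the description notes.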

Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Mailbox Servers Replication (MBSR)

Description

Applicable on Exchange Servers with the Mailbox Servers Replication (MBSR) role

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Mailbox Servers ReplicationReplayQueueLengthReplay Queue LengthNULLThis monitor indicates issues involving the replication engine and replication partners. These issues can be local or remote. This monitor shows the number of transaction log files waiting to be replayed into the passive copy. It indicates the current replay queue length. Higher values cause longer store mount times when a handoff, failover, or activation is performed.
CopyQueueLengthCopy Queue LengthNULLThis monitor indicates issues involving the replication engine and replication partners. These issues can be local or remote. This monitor shows the number of transaction log files waiting to be copied to the passive copy log file folder. A copy is not considered complete until it has been checked for corruption. This value should be less than 10 at all times for cluster continuous replication (CCR). It should be less than 1 at all times for local continuous replication (LCR).
CallsDisconnectedbyCallersDuringUMAudioHourglassCalls Disconnected by Callers During UM Audio HourglassNULLShows the number of calls during which the caller disconnected while Unified Messaging was playing the audio hourglass tones. This value should be 0 at all times. A non-zero value suggests excessive latency between a Unified Messaging server and the targeted domain controller.
OperationsoverSixSecondsOperations over Six SecondsNULLShows the number of all Unified Messaging operations that took more than six seconds to complete. This is the time during which a caller was waiting for Unified Messaging to respond.
MailboxServerAccessFailuresMailbox Server Access FailuresNULLShows the number of times the system did not access a Mailbox server. This value should be 0 at all times. A non-zero value indicates that Unified Messaging is having problems with MAPI connectivity to Mailbox servers.

Agent G2 - Linux - RabbitMQ Monitors

Description

Monitors RabbitMQ metrics using the rabbitmq-management plugin, which provides an HTTP-based API.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - RabbitMQ Monitorsrabbitmq.node.disk.freeRabbitMQ Free DiskMBThe free disk of the rabbitmq node in MB.
rabbitmq.queue.consumers.activeRabbitMQ Active ConsumersNULLNumber of active consumers. An active consumer is one which could immediately receive any messages sent to the queue.
rabbitmq.node.mem.usedRabbitMQ Node Memory UtilizationMBThe memory used by the rabbitmq node in MB.
rabbitmq.node.fd.usedRabbitMQ OpenFDsNULLNumber of open file descriptors used.
rabbitmq.queue.consumersRabbitMQ ConsumersNULLNumber of consumers.
rabbitmq.queue.memoryRabbitMQ MemoryMBMemory consumed by the Erlang process associated with the queue, including stack, heap, and internal structures.
rabbitmq.node.uptimeRabbitMQ UptimeNULLUptime of the RabbitMQ server.
rabbitmq.queue.messagesRabbitMQ MessagesNULLSum of ready and unacknowledged messages (queue depth).
rabbitmq.node.sockets.usedRabbitMQ Sockets UsedNULLNumber of sockets used.
rabbitmq.queue.messages.unacknowledgedRabbitMQ Messages UnacknowledgedNULLNumber of messages delivered to clients but not yet acknowledged.
rabbitmq.objects.overviewRabbitMQ Overview ObjectsNULLOverview of all objects.
rabbitmq.queue.messages.readyRabbitMQ Messages ReadyNULLNumber of messages ready to be delivered to clients.
rabbitmq.node.proc.usedRabbitMQ Erlang Processes UsedNULLNumber of Erlang processes used.
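
As a rough illustration of how these metrics relate to the management plugin's HTTP API, the sketch below maps fields from the JSON objects returned by the `/api/nodes` and `/api/queues` endpoints to the metric names in the table. The mapping itself, the `extract` helper, and the sample payloads are assumptions made for illustration; the template's actual collection logic is not documented here.

```python
import json

# Hypothetical mapping from template metric names to fields of the
# node and queue objects returned by the management API.
NODE_FIELDS = {
    "rabbitmq.node.disk.free": "disk_free",
    "rabbitmq.node.mem.used": "mem_used",
    "rabbitmq.node.fd.used": "fd_used",
    "rabbitmq.node.sockets.used": "sockets_used",
    "rabbitmq.node.proc.used": "proc_used",
}
QUEUE_FIELDS = {
    "rabbitmq.queue.messages": "messages",
    "rabbitmq.queue.messages.ready": "messages_ready",
    "rabbitmq.queue.messages.unacknowledged": "messages_unacknowledged",
    "rabbitmq.queue.consumers": "consumers",
}
# The API reports these in bytes; the table lists them in MB.
MB_METRICS = {"rabbitmq.node.disk.free", "rabbitmq.node.mem.used"}
BYTES_PER_MB = 1024 * 1024


def extract(obj, fields):
    """Pick the mapped fields out of one API object, converting bytes to MB."""
    metrics = {}
    for metric, field in fields.items():
        if field not in obj:
            continue  # field absent in this payload
        value = obj[field]
        if metric in MB_METRICS:
            value = value / BYTES_PER_MB
        metrics[metric] = value
    return metrics


# Made-up payloads shaped like single entries of /api/queues and /api/nodes.
sample_queue = json.loads(
    '{"name": "orders", "messages": 7, "messages_ready": 5,'
    ' "messages_unacknowledged": 2, "consumers": 3}'
)
sample_node = {"disk_free": 2097152, "fd_used": 40}

print(extract(sample_queue, QUEUE_FIELDS))
print(extract(sample_node, NODE_FIELDS))
```

In the sample queue, `messages` equals `messages_ready` plus `messages_unacknowledged`, which matches the "Sum of ready and unacknowledged messages" description above.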

Agent G2 - Linux - RedisDB Monitors

Description

Monitors RedisDB application metrics

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - RedisDB Monitorsredis.clients.biggest.inputbufRedis Clients Biggest InputBufNULLBiggest input buffer among current client connections.
redis.mem.fragmentation_ratioRedis-Mem.fragmentation_ratioNULLRatio between rss memory used and total memory used.
redis.mem.peakRedis Peak MemoryMBPeak memory consumed by Redis (in Mbytes).
redis.aof.last.rewritetimeRedis AOF Last Rewrite Op TimeNULLDuration of the last AOF rewrite operation in seconds.
redis.replica.sync_left_bytesRedis-Replica.sync_left_bytesNULLNumber of bytes left before SYNCing is complete
redis.rdb.last_bgsave_timeRedis-Rdb.last_bgsave_timeNULLDuration of the last RDB save operation in seconds.
redis.key.hits_rateRedis Key Hit RateNULLThe rate of successful lookup of keys in the main dictionary.
redis.clients.longest.outputlistRedis Clients Longest OutputListNULLLongest output list among current client connections.
redis.rdb.last.bgsavetimeRedis RDB Last Save Op TimeNULLDuration of the last RDB save operation in seconds.
redis.net.rejectedRedis Connections RejectedNULLNumber of connections rejected because of maxclients limit.
redis.clients.biggest_input_bufRedis-Clients.biggest_input_bufNULLBiggest input buffer among current client connections.
redis.uptimeRedis UptimeNULLChecks the uptime of the Redis service.
redis.aof.sizeRedis AOF SizeMBAOF current file size in MB.
redis.pubsub.patternsRedis Pubsub PatternsNULLGlobal number of pub/sub pattern with client subscriptions.
redis.stats.keyspace_hitsRedis Keyspace HitsNULLNumber of successful lookup of keys in the main dictionary per second.
redis.clients.longest_output_listRedis-Clients.longest_output_listNULLLongest output list among current client connections.
redis.key.misses_rateRedis Key Miss RateNULLThe rate of failed lookup of keys in the main dictionary.
redis.net.commandsRedis CommandsNULLThe number of commands processed by the server
redis.replica.syncleftbytesRedis Replication Bytes LagNULLNumber of bytes left before SYNCing is complete
redis.keyspace.missesRedis Keyspace MissesNULLNumber of failed lookup of keys in the main dictionary per second.
redis.keys.evictedRedis Evicted KeysNULLNumber of evicted keys due to maxmemory limit.
redis.perf.latest_fork_usecRedis Latest Fork Op TimeNULLDuration of the latest fork operation in microseconds.
redis.net.clientsRedis Client ConnectionsNULLNumber of client connections (excluding connections from slaves).
redis.changesRedis ChangesNULLNumber of changes since the last save.
redis.keys.totalRedis Total KeysNULLProvides the total number of keys from all the DBs.
redis.mem.rssRedis RSS MemoryMBNumber of bytes that Redis allocated as seen by the operating system (a.k.a resident set size). This is the number reported by tools such as top and ps.
redis.rdb.changesRedis RDB ChangesNULLNumber of changes since the last dump.
redis.pubsub.channelsRedis Pubsub ChannelsNULLGlobal number of pub/sub channels with client subscriptions.
redis.replica.last_io_seconds_agoRedis-Replica.last_io_seconds_agoNULLNumber of seconds since the last interaction with master.
redis.clients.blockedRedis Clients BlockedNULLNumber of clients pending on a blocking call.
redis.mem.luaRedis LUA MemoryMBNumber of Mbytes used by the Lua engine.
redis.mem.usedRedis Used MemoryMBTotal number of Mbytes allocated by Redis using its allocator (either standard libc, jemalloc, or an alternative allocator such as tcmalloc).
redis.replica.last.io.SecondsagoRedis Replication LagNULLNumber of seconds since the last interaction with master.
redis.connectionsRedis ConnectionsNULLThe number of connections accepted by the server per second.
redis.mem.fragmentation.ratioRedis Memory Fragmentation RatioNULLRatio between rss memory used and total memory used.
redis.keys.expiredRedis Expired KeysNULLTotal number of key expiration events.
redis.aof.last_rewrite_timeRedis-Aof.last_rewrite_timeNULLDuration of the last AOF rewrite operation in seconds.
redis.net.slavesRedis Connected SlavesNULLNumber of connected slaves.
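
Several of these values can be derived from the text returned by the Redis `INFO` command. The sketch below parses `INFO`-style output and computes the fragmentation ratio as described above (RSS memory divided by memory used), along with MB conversions. The helper names and the sample text are hypothetical, and how the template itself gathers these values is an assumption.

```python
BYTES_PER_MB = 1024 * 1024


def parse_info(text):
    """Parse `key:value` lines of INFO output, skipping blanks and `#` section headers."""
    info = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition(":")
        info[key] = value
    return info


def memory_metrics(info):
    """Derive memory figures from the used_memory and used_memory_rss fields."""
    used = int(info["used_memory"])
    rss = int(info["used_memory_rss"])
    return {
        "used_mb": used / BYTES_PER_MB,
        "rss_mb": rss / BYTES_PER_MB,
        # Ratio between RSS memory and memory allocated by Redis.
        "fragmentation_ratio": rss / used if used else 0.0,
    }


# Made-up INFO excerpt, for illustration only.
SAMPLE = """# Memory
used_memory:2097152
used_memory_rss:3145728

# Clients
connected_clients:4
"""

print(memory_metrics(parse_info(SAMPLE)))
```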

Agent G2 - Linux - Riak DB Monitors

Description

Agent G2 - Linux - Riak DB Monitors

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Riak DB Monitorsriak.node_get_fsm_objsize_meanRiak Get Fsm Mean Object SizeNULLMean object size encountered by this node within the last minute
riak.redir_requestsRiak Redirected RequestsNULLAverage number of requests this node has redirected to other nodes for coordination per second
riak.pipe_vnodes.running.countRiak Running Pipe VnodesNULLNumber of pipe virtual node queues running in the last minute
riak.vnode_index_writesRiak Vnode Index WritesNULLAverage number of vnode index write operations performed per second
riak.node_get_fsm_objsize_95Riak Get Fsm 95 Object SizeNULL95th percentile object size encountered by this node within the last minute
riak.node_get_fsm_siblings_100Riak Get Fsm 100 SiblingsNULL100th percentile of siblings encountered during all GET operations by this node within the last minute
riak.node_getsRiak Node GETsNULLAverage number of GETs coordinated by this node per second, including GETs to non-local vnodes
riak.node_get_fsm_objsize_100Riak Get Fsm 100 Object SizeNULL100th percentile object size encountered by this node within the last minute
riak.vnode_index_readsRiak Vnode Index ReadsNULLAverage number of vnode index read operations performed per second
riak.ring_num_partitionsRiak Ring PartitionsNULLNumber of partitions in the ring
riak.pipeline.activeRiak Active PipelinesNULLThe number of pipelines active in the last 60 seconds
riak.vnode.putsRiak Vnode PUTsNULLAverage number of PUTs coordinated by local vnodes per second
riak.node_get_fsm_objsize_medianRiak Get Fsm Median Object SizeNULLMedian object size encountered by this node within the last minute
riak.node_put_fsm_time_95Riak Put Fsm 95 TimeNULL95th percentile time between reception of client PUT request and subsequent response to client
riak.sys.process_countRiak Process CountNULLNumber of Erlang processes
riak.node_get_fsm_time_100Riak Get Fsm 100 TimeNULL100th percentile time between reception of client GET request and subsequent response to client
riak.node_get_fsm_time_meanRiak Get Fsm Mean TimeNULLMean time between reception of client GET request and subsequent response to client
riak.node_get_fsm_siblings_meanRiak Get Fsm Mean SiblingsNULLMean number of siblings encountered during all GET operations by this node within the last minute
riak.vnode.getsRiak Vnode GETsNULLAverage number of GETs coordinated by local vnodes per second
riak.node_get_fsm_siblings_medianRiak Get Fsm Median SiblingsNULLMedian number of siblings encountered during all GET operations by this node within the last minute
riak.node_get_fsm_siblings_95Riak Get Fsm 95 SiblingsNULL95th percentile of siblings encountered during all GET operations by this node within the last minute
riak.precommit.failsRiak Pre-commit FailuresNULLNumber of pre commit hook failures
riak.node_get_fsm_time_medianRiak Get Fsm Median TimeNULLMedian time between reception of client GET request and subsequent response to client
riak.pipeline.errorsRiak Pipeline ErrorsNULLThe average number of pipeline creation errors per second
riak.kv_vnodes.running.countRiak Running KV VnodesNULLNumber of key/value virtual node queues running in the last minute
riak.postcommit.failsRiak Post-commit FailuresNULLNumber of post commit hook failures
riak.pbc_activeRiak Active PB ConnectionsNULLNumber of currently active protocol buffer connections
riak.node_put_fsm_time_medianRiak Put Fsm Median TimeNULLMedian time between reception of client PUT request and subsequent response to client
riak.vnode_index_deletesRiak Vnode Index DeletesNULLAverage number of vnode index delete operations performed per second
riak.node_put_fsm_time_meanRiak Put Fsm Mean TimeNULLMean time between reception of client PUT request and subsequent response to client
riak.node_put_fsm_time_100Riak Put Fsm 100 TimeNULL100th percentile time between reception of client PUT request and subsequent response to client
riak.node.putsRiak Node PUTsNULLAverage number of PUTs coordinated by this node per second, including PUTs to non-local vnodes
riak.pipeline.createsRiak Pipeline CreationsNULLThe average number of pipelines created per second
riak.node_get_fsm_time_95Riak Get Fsm 95 TimeNULL95th percentile time between reception of client GET request and subsequent response to client
riak.pbc_connectsRiak New PB ConnectionsNULLNumber of new protocol buffer connections established during the last minute
riak.memory.processes_usedRiak Memory UsedNULLTotal amount of memory used by Erlang processes
riak.read_repairsRiak Read RepairsNULLAverage number of read repair operations this node has coordinated per second

Agent G2 - Linux - RiakDB Monitors

Description

It monitors DB parameters like Post-commit Failures, Pre-commit Failures, Node GETs, Vnode GETs, Memory Used, Get Fsm 100 Object Size, Get Fsm 100 Siblings, Get Fsm 100 Time, Get Fsm 95 Object Size, Get Fsm 95 Siblings, Get Fsm 95 Time, Get Fsm Mean Object Size, Get Fsm Mean Siblings, Get Fsm Mean Time, Get Fsm Median Object Size, Get Fsm Median Siblings, Get Fsm Median Time, Put Fsm 100 Time, Put Fsm 95 Time, Put Fsm Mean Time, Put Fsm Median Time, Active PB Connections, New PB Connections, Active Pipelines, Pipeline Creations, Pipeline Errors, Node PUTs, Vnode PUTs, Read Repairs, Redirected Requests, Ring Partitions, Running KV Vnodes, Running Pipe Vnodes, Process Count, Vnode Index Deletes, Vnode Index Reads, Vnode Index Writes

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - RiakDB Monitorsriak.postcommit.failsRiak Post-commit FailuresNULLNumber of post commit hook failures
riak.pbc_activeRiak Active PB ConnectionsNULLNumber of currently active protocol buffer connections
riak.vnode_index_deletesRiak Vnode Index DeletesNULLAverage number of vnode index delete operations performed per second
riak.node_get_fsm_time_meanRiak Get Fsm Mean TimeNULLMean time between reception of client GET request and subsequent response to client
riak.node_get_fsm_objsize_medianRiak Get Fsm Median Object SizeNULLMedian object size encountered by this node within the last minute
riak.vnode_index_readsRiak Vnode Index ReadsNULLAverage number of vnode index read operations performed per second
riak.node_get_fsm_time_95Riak Get Fsm 95 TimeNULL95th percentile time between reception of client GET request and subsequent response to client
riak.node_put_fsm_time_95Riak Put Fsm 95 TimeNULL95th percentile time between reception of client PUT request and subsequent response to client
riak.pipeline.activeRiak Active PipelinesNULLThe number of pipelines active in the last 60 seconds
riak.pipe_vnodes.running.countRiak Running Pipe VnodesNULLNumber of pipe virtual node queues running in the last minute
riak.ring_num_partitionsRiak Ring PartitionsNULLNumber of partitions in the ring
riak.memory.processes_usedRiak Memory UsedNULLTotal amount of memory used by Erlang processes
riak.node_get_fsm_siblings_100Riak Get Fsm 100 SiblingsNULL100th percentile of siblings encountered during all GET operations by this node within the last minute
riak.node_get_fsm_objsize_meanRiak Get Fsm Mean Object SizeNULLMean object size encountered by this node within the last minute
riak.node_get_fsm_time_medianRiak Get Fsm Median TimeNULLMedian time between reception of client GET request and subsequent response to client
riak.pbc_connectsRiak New PB ConnectionsNULLNumber of new protocol buffer connections established during the last minute
riak.vnode.putsRiak Vnode PUTsNULLAverage number of PUTs coordinated by local vnodes per second
riak.node_put_fsm_time_meanRiak Put Fsm Mean TimeNULLMean time between reception of client PUT request and subsequent response to client
riak.redir_requestsRiak Redirected RequestsNULLAverage number of requests this node has redirected to other nodes for coordination per second
riak.node_put_fsm_time_medianRiak Put Fsm Median TimeNULLMedian time between reception of client PUT request and subsequent response to client
riak.node.getsRiak Node GETsNULLAverage number of GETs coordinated by this node per second, including GETs to non-local vnodes
riak.node_get_fsm_objsize_95Riak Get Fsm 95 Object SizeNULL95th percentile object size encountered by this node within the last minute
riak.kv_vnodes.running.countRiak Running KV VnodesNULLNumber of key/value virtual node queues running in the last minute
riak.node_get_fsm_siblings_medianRiak Get Fsm Median SiblingsNULLMedian number of siblings encountered during all GET operations by this node within the last minute
riak.read_repairsRiak Read RepairsNULLAverage number of read repair operations this node has coordinated per second
riak.node_put_fsm_time_100Riak Put Fsm 100 TimeNULL100th percentile time between reception of client PUT request and subsequent response to client
riak.sys.process_countRiak Process CountNULLNumber of Erlang processes
riak.vnode.getsRiak Vnode GETsNULLAverage number of GETs coordinated by local vnodes per second
riak.node_get_fsm_siblings_meanRiak Get Fsm Mean SiblingsNULLMean number of siblings encountered during all GET operations by this node within the last minute
riak.pipeline.errorsRiak Pipeline ErrorsNULLThe average number of pipeline creation errors per second
riak.precommit.failsRiak Pre-commit FailuresNULLNumber of pre commit hook failures
riak.pipeline.createsRiak Pipeline CreationsNULLThe average number of pipelines created per second
riak.node.putsRiak Node PUTsNULLAverage number of PUTs coordinated by this node per second, including PUTs to non-local vnodes
riak.vnode_index_writesRiak Vnode Index WritesNULLAverage number of vnode index write operations performed per second
riak.node_get_fsm_siblings_95Riak Get Fsm 95 SiblingsNULL95th percentile of siblings encountered during all GET operations by this node within the last minute
riak.node_get_fsm_objsize_100Riak Get Fsm 100 Object SizeNULL100th percentile object size encountered by this node within the last minute
riak.node_get_fsm_time_100Riak Get Fsm 100 TimeNULL100th percentile time between reception of client GET request and subsequent response to client

Agent G2 - Linux Services Monitoring

Description

Monitors the overall state of each service. The state can be active, reloading, inactive, failed, activating, or deactivating.

Template Usage Guidelines:

When applying this template to a device, provide the custom script arguments in one of the following two formats only.

  • Format 1: all (case-insensitive). If “all” is specified as the custom script argument, the script monitors all available services and also reports alert tokens that provide additional information, including the total count of services and the count of services in each available state.

  • Format 2: tuned,opsramp.*, systemd (provide service names without the .service extension, as shown here). Specify comma-separated service names or service regex patterns for a more focused monitoring approach. The script filters and retrieves status information for the listed services only.

Note: We strongly recommend specifying only particular service names in the input parameters (that is, Format 2), because monitoring all services may significantly increase the system load, especially in environments with a large number of services.
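
The two argument formats can be illustrated with a small sketch. This is not the template's script: the function names, the use of full-string regex matching, and the sample service states are all assumptions made for illustration.

```python
import re
from collections import Counter


def select_services(argument, states):
    """Filter a {service name: state} mapping according to the script argument.

    Format 1: "all" (case-insensitive) keeps every service.
    Format 2: comma-separated names or regex patterns, without ".service".
    """
    if argument.strip().lower() == "all":
        return dict(states)
    patterns = [p.strip() for p in argument.split(",") if p.strip()]
    return {
        name: state
        for name, state in states.items()
        # Assumption: a pattern must match the whole service name.
        if any(re.fullmatch(p, name) for p in patterns)
    }


def count_by_state(states):
    """Count services per state, similar to the counts reported for Format 1."""
    return dict(Counter(states.values()))


# Hypothetical service states, for illustration only.
sample = {
    "tuned": "active",
    "opsramp-agent": "active",
    "opsramp-shield": "failed",
    "sshd": "active",
    "cups": "inactive",
}

print(select_services("tuned,opsramp.*, systemd", sample))
print(count_by_state(select_services("ALL", sample)))
```

Names in the argument that match no known service (here `systemd`) simply select nothing in this sketch; how the real script reports such names is not stated.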

Prerequisites

This template works only on systems that use systemd as the default init system.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux Services Monitorsystem_linux_services_statusSystem Linux Services Status-Monitors the overall state of each service. The state can be active, reloading, inactive, failed, activating, or deactivating.

Agent G2 - Linux - Squid Monitors

Description

Monitoring template for squid proxy application

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Squid Monitorssquid.server.errorsSquid Server ErrorsNULLRate of server requests that resulted in error
squid.fqdn.cache.entriesSquid FQDN Cache EntriesNULLNumber of FQDN cache entries
squid.icp.packets.receivedSquid ICP Packets ReceivedNULLRate of ICP packets received
squid.icp.traffic.inSquid ICP Traffic InNULLAmount of ICP data received
squid.server.traffic.inSquid Server Traffic InNULLRate of traffic read by server for all protocols
squid.icp.packets.sentSquid ICP Packets SentNULLRate of ICP packets sent
squid.request.failure.ratioSquid Request Failure Ratio%The average ratio between the number of failed and successful requests
squid.service.timesSquid Service TimesNULLThe median of various service time (or response time) distributions
squid.server.traffic.outSquid Server Traffic OutNULLRate of traffic written to origin servers for all protocols
squid.page.faultsSquid Page FaultsNULLPage faults with physical i/o
squid.icp.replies.queuedSquid ICP Replies QueuedNULLThis counter shows how many times an ICP message was queued for re-transmission
squid.server.requestsSquid Server RequestsNULLRate of server-side requests to HTTP servers
squid.unlink.requestsSquid Unlink RequestsNULLRate of unlink requests
squid.client.http.requestsSquid Client Http RequestsNULLThe rate of the number of HTTP requests.
squid.client.http.traffic.outSquid Client Http Traffic OutNULLRate of response data traffic sent to clients
squid.aborted.requestsSquid Aborted RequestsNULLRate of aborted requests
squid.requests.disk.hit.ratioSquid Requests Disk Hit Ratio%These values represent the percentage of plain cache hits served from disk
squid.cache.hit.ratioSquid Cache Hit Ratio%The percentage of HTTP requests that result in a cache hit.
squid.clientsSquid ClientsNULLNumber of clients accessing the cache. Here a client means an IP address; Squid assumes that each client has a unique IP address
squid.client.http.errorsSquid Client Http ErrorsNULLThe rate of the number of HTTP errors.
squid.client.http.traffic.inSquid Client Http Traffic InNULLRate of request data traffic received from clients
squid.fqdn.cache.statisticsSquid FQDN Cache StatisticsNULLThe FQDN cache is similar to the IP cache, except that it stores address-to-hostname lookups
squid.ip.cache.entriesSquid IP Cache EntriesNULLNumber of IP cache entries
squid.icp.messages.sentSquid ICP Messages SentNULLThe total number of ICP messages sent since Squid was started.
squid.icp.messages.receivedSquid ICP Messages ReceivedNULLThe total number of ICP messages received since Squid was started.
squid.cache.byte.hit.ratioSquid Cache Byte Hit Ratio%Squid calculates byte hit ratio by comparing the number of bytes received from origin servers (or neighbors) to the number of bytes sent to clients
squid.select.loopsSquid Select LoopsNULLRate of select loops called
squid.ip.cache.statisticsSquid IP Cache StatisticsNULLThe IP cache contains cached results of hostname-to-address lookups.
squid.request.memory.hit.ratioSquid Request Memory Hit Ratio%These values represent the percentage of all cache hits that were served from memory
squid.icp.traffic.outSquid ICP Traffic OutNULLAmount of ICP data sent
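
The hit ratio metrics above are percentages derived from counters. The sketch below shows one common reading of those descriptions: request hit ratio as hits over requests, and byte hit ratio as the share of bytes sent to clients that did not have to be fetched from origin servers or neighbors. The function names and sample numbers are hypothetical, and the exact formulas used by Squid or by this template are not stated in this document.

```python
def hit_ratio(hits, requests):
    """Percentage of requests that resulted in a cache hit."""
    if requests == 0:
        return 0.0
    return 100.0 * hits / requests


def byte_hit_ratio(bytes_from_servers, bytes_to_clients):
    """Percentage of client-bound bytes that were served without fetching them.

    Assumption: bytes saved = bytes sent to clients minus bytes received
    from origin servers or neighbors.
    """
    if bytes_to_clients == 0:
        return 0.0
    return 100.0 * (bytes_to_clients - bytes_from_servers) / bytes_to_clients


# Made-up counter values, for illustration only.
print(hit_ratio(hits=25, requests=100))
print(byte_hit_ratio(bytes_from_servers=600, bytes_to_clients=1000))
```

Under this reading the byte hit ratio can go negative when more bytes are received than sent, for example when clients abort large downloads that the cache continues to fetch.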

Agent G2 - Linux - Varnish Monitors

Description

Varnish application monitoring

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - Varnish Monitorsvarnish.uptimeVarnish UptimeNULLUptime in minutes
varnish.client.conns.acceptedVarnish Client Connections AcceptedNULLAverage number of client connections accepted per second
varnish.rate.esi.errorsVarnish ESI ErrorsNULLRate of ESI parsing errors
varnish.rate.hdrbytesVarnish Header SizeNULLRate of total header size in Kbytes
varnish.cache.hitsVarnish Cache HitsNULLCache hit rate
varnish.struct.objectsVarnish ObjectsNULLNumber of objects
varnish.backend.conns.success.ratioVarnish Backend Connections Success RatioNULLSuccessful backend server connections ratio
varnish.client.connections.ratioVarnish Client Connections RatioNULLClient accepted connections to received requests ratio
varnish.client.conns.droppedVarnish Client Connections DroppedNULLAverage number of client connections dropped per second
varnish.lru.objects.movedVarnish LRU Moved ObjectsNULLNumber of LRU moved objects
varnish.work.thread.ratioVarnish Work Thread RatioNULLWorking threads to created threads ratio
varnish.rate.sessionsVarnish SessionsNULLRate of total sessions
varnish.backend.connectionsVarnish Backend ConnectionsNULLRate of backend connections success
varnish.rate.bodybytesVarnish Body SizeNULLRate of total body size in Kbytes
varnish.abandoned.work.requestsVarnish Dropped Work RequestsNULLAverage number of abandoned work requests per second
varnish.client.requestsVarnish Client RequestsNULLAverage number of client requests received per second
varnish.cache.hit.ratioVarnish Cache Hit RatioNULLCache hit ratio
varnish.expired.objectsVarnish Expired ObjectsNULLNumber of expired objects
varnish.worker.threadsVarnish Worker ThreadsNULLNumber of worker threads
varnish.rate.requestsVarnish RequestsNULLRate of total received requests
varnish.rate.fetchesVarnish FetchesNULLRate of total fetches
varnish.backend.recycleVarnish Backend Connection RecyclesNULLRate of backend connection recycles
varnish.cache.miss.ratioVarnish Cache Miss RatioNULLCache miss ratio
varnish.backend.reqVarnish Backend RequestsNULLRate of backend requests made
varnish.lru.objects.nukedVarnish LRU Nuked ObjectsNULLNumber of LRU nuked objects
varnish.backend.failVarnish Backend Connections FailedNULLRate of backend connection failures
varnish.worker.threads.failedVarnish Failed Worker ThreadsNULLAverage number of failures per second, when creating worker threads
varnish.backend.busyVarnish Backend Connections BusyNULLRate of backend connections busy
varnish.backend.toolateVarnish Backend Connections ClosedNULLRate of backend connections closed
varnish.cache.missesVarnish Cache MissesNULLCache miss rate
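
Many of the metrics above are rates or ratios, which are typically derived from cumulative counters sampled at two points in time. The following sketch shows that calculation under stated assumptions: the counter names `cache_hit`, `cache_miss`, and `client_req` are taken from `varnishstat`, while the helper functions, the sampling interval, and the sample values are hypothetical and do not describe the template's own implementation.

```python
def per_second(previous, current, interval_seconds):
    """Convert two snapshots of cumulative counters into per-second rates."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    rates = {}
    for name, value in current.items():
        if name not in previous:
            continue  # no earlier sample to compare against
        delta = value - previous[name]
        if delta < 0:
            continue  # counter went backwards, e.g. after a restart
        rates[name] = delta / interval_seconds
    return rates


def cache_hit_ratio(rates):
    """Share of cache lookups that were hits during the interval."""
    total = rates["cache_hit"] + rates["cache_miss"]
    return rates["cache_hit"] / total if total else 0.0


# Made-up snapshots taken 60 seconds apart, for illustration only.
before = {"cache_hit": 1000, "cache_miss": 200, "client_req": 1200}
after = {"cache_hit": 1600, "cache_miss": 260, "client_req": 1860}

rates = per_second(before, after, 60)
print(rates)
print(cache_hit_ratio(rates))
```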

Agent G2 - Linux - WebLogic Monitors

Description

Agent G2 - Linux - WebLogic Monitors

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - WebLogic Monitorsweblogic_ejp_transactions_committed_countTransactions Committed Count RatePer SecondEJB Transactions Committed Count Rate
weblogic_ejp_access_countAccess Count RatePer SecondEJBPool Access Count Rate
weblogic_jdbc_ds_curr_capacityCurr CapacityNULLJDBC DataSource Curr Capacity
weblogic_jta_transaction_rolled_back_countJTARuntime Transaction Rolled Back Count RatePer SecondJTARuntime Transaction Rolled Back Count Rate
weblogic_thread_countThread CountNULLThread Count Of The Server
weblogic_gc_collection_timeCollection TimeNULLTime Taken For Collection Of The Garbage Objects
weblogic_ejp_beans_in_use_current_countBeans In Use Current CountNULLEJBPool Beans In Use Current Count
weblogic_jdbc_ds_active_connections_current_countActive Connections Current CountNULLJDBC DataSource Active Connections Current Count
weblogic_ejp_transactions_rolled_back_countTransactions Rolled Back Count RatePer SecondEJB Transactions Rolled Back Count Rate
weblogic_jdbc_ds_leaked_connection_countLeaked Connection CountPer SecondJDBC DataSource Leaked Connection Count
weblogic_ejp_pooled_beans_current_countPooled Beans Current CountNULLEJBPool Pooled Beans Current Count
weblogic_ejp_destroyed_countDestroyed Count RatePer SecondEJBPool Destroyed Count Rate
weblogic_open_file_descriptor_countOpen File Descriptor CountNULLNumber Of Open File Descriptors Of The Server
weblogic_jdbc_ds_connections_countConnections Count RatePer SecondJDBC DataSource Connections Count Rate
weblogic_non_heap_memory_usage_usedNon Heap Memory Usage UsedMBNon Heap Memory Usage Of The Server
weblogic_thread_pool_standby_thread_countStandby Thread CountNULLStandby Thread Count
weblogic_ejp_transactions_timed_out_countTransactions Timed Out Count RatePer SecondEJB Transactions Timed Out Count Rate
weblogic_gc_collection_countCollection CountNULLNumber Of Garbage Objects Collected
weblogic_heap_memory_usage_committedHeap Memory Usage CommittedMBHeap Memory Committed For The Server
weblogic_uptimeWebLogic UptimeMinutesWebLogic Uptime
weblogic_ejp_waiter_current_countWaiter Current CountNULLEJBPool Waiter Current Count
weblogic_total_started_thread_countTotal Started Thread CountNULLTotal Started Thread Count
weblogic_thread_pool_hogging_thread_countHogging Thread CountNULLHogging Thread Count
weblogic_jdbc_ds_waiting_for_connection_successWaiting For Connection Success RatePer SecondJDBC DataSource Waiting For Connection Success Rate
weblogic_jdbc_ds_waiting_for_connectionWaiting For Connection RatePer SecondJDBC DataSource Waiting For Connection Rate
weblogic_jdbc_ds_num_availableNum AvailableNULLJDBC DataSource Num Available
weblogic_jdbc_ds_prep_stmt_cache_current_sizePrep Stmt Cache Current SizeNULLJDBC DataSource Prep Stmt Cache Current Size
weblogic_thread_pool_execute_thread_idle_countExecute Thread Idle CountNULLExecute Thread Idle Count
weblogic_ejp_miss_countMiss Count RatePer SecondEJBPool Miss Count Rate
weblogic_loaded_class_countLoaded Class CountNULLLoaded Class Count
weblogic_jta_transaction_countJTARuntime Transaction Count RatePer SecondJTARuntime Transaction Count Rate
weblogic_jdbc_ds_waiting_for_connection_failureWaiting For Connection Failure RatePer SecondJDBC DataSource Waiting For Connection Failure Rate
weblogic_jta_transaction_abandoned_countJTARuntime Transaction Abandoned Count RatePer SecondJTARuntime Transaction Abandoned Count Rate
weblogic_unloaded_class_countUnloaded Class CountNULLUnloaded Class Count
weblogic_jdbc_ds_waiting_for_connection_current_countWaiting For Connection Current CountNULLJDBC DataSource Waiting For Connection Current Count
weblogic_jta_transaction_committed_countJTARuntime Transaction Committed Count RatePer SecondJTARuntime Transaction Committed Count Rate
weblogic_heap_memory_usage_usedHeap Memory Usage UsedMBHeap Memory Usage Of The Server
weblogic_non_heap_memory_usage_committedNon Heap Memory Usage CommittedMBNon Heap Memory Committed For The Server

Agent G2 - Linux - WebSphere Monitors

Description

Agent G2 - Linux - WebSphere Monitors

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux - WebSphere Monitorsibm_was_jdbc_FaultCountJDBC Fault CountNULLThe total number of faults, such as timeouts, in the connection pool.
ibm_was_servlet_session_LiveCountServlet Session Live CountNULLThe number of local sessions that are currently cached in memory from the time at which this metric is enabled.
ibm_was_jdbc_CreateCountJDBC Create CountNULLThe total number of managed connections that were created since pool creation.
ibm_was_servlet_session_NoRoomForNewSessionCountServlet Session No Room For New Session CountNULLApplies only to session in memory with AllowOverflow=false. The number of times that a request for a new session cannot be handled because it exceeds the maximum session count.
ibm_was_thread_pools_ActiveTimeThread Pools Active TimemsThe average time in milliseconds the threads are in active state
ibm_was_jvm_ProcessCpuUsageJVM Process Cpu Usage GaugeNULLThe CPU Usage (in percent) of the Java virtual machine.
ibm_was_thread_pools_ClearedThreadHangCountThread Pools Cleared Thread Hang CountNULLThe number of thread hangs cleared
ibm_was_servlet_session_LifeTimeServlet Session Life TimemsThe average session life time in milliseconds (time invalidated - time created)
ibm_was_servlet_session_ActiveCountServlet Session Active CountNULLThe number of concurrently active sessions. A session is active if the WebSphere Application Server is currently processing a request that uses that session.
ibm_was_thread_pools_DeclaredThreadHungCountThread Pools Declaredthread Hung CountNULLThe number of threads declared hung
ibm_was_thread_pools_ActiveCountThread Pools Active CountNULLThe number of concurrently active threads
ibm_was_servlet_session_CreateCountServlet Session Create CountNULLThe number of sessions that were created
ibm_was_servlet_session_CacheDiscardCountServlet Session Cache Discard CountNULLThe number of session objects that have been forced out of the cache. A least recently used (LRU) algorithm removes old entries to make room for new sessions and cache misses. Applicable only for persistent sessions.
ibm_was_jdbc_ReturnCountJDBC Return CountNULLThe total number of managed connections that were returned since pool creation.
ibm_was_jvm_UsedMemoryJVM Used Memory GaugeKBThe used memory in the JVM run time
ibm_was_jdbc_PrepStmtCacheDiscardCountJDBC Prep Stmt Cache Discard CountNULLThe total number of statements that are discarded by the least recently used (LRU) algorithm of the statement cache.
ibm_was_jdbc_FreePoolSizeJDBC Free Pool SizeNULLThe number of managed connections that are in the free pool.
ibm_was_jvm_UpTimeJVM Up Time GaugeSecondsThe amount of time that the JVM is running
ibm_was_jdbc_CloseCountJDBC Close CountNULLThe total number of managed connections that were destroyed since pool creation.
ibm_was_thread_pools_PoolSizeThread Pools Pool SizeNULLThe average number of threads in pool
ibm_was_servlet_session_InvalidateCountServlet Session Invalidate CountNULLThe number of sessions that were invalidated
ibm_was_servlet_session_ExternalWriteTimeServlet Session External Write TimemsThe time (milliseconds) taken to write the session data to the persistent store. Applicable only for (serialized) persistent sessions. Similar to External Read Time.
ibm_was_jdbc_WaitTimeJDBC Wait TimeNULLThe average waiting time in milliseconds until a connection is granted if a connection is not currently available.
ibm_was_thread_pools_CreateCountThread Pools Create CountNULLThe total number of threads created
ibm_was_jdbc_AllocateCountJDBC Allocate CountNULLThe total number of managed connections that were allocated since pool creation.
ibm_was_jdbc_JDBCTimeJDBC TimeNULLThe average time in milliseconds spent running in the JDBC driver that includes time that is spent in the JDBC driver, network, and database.
ibm_was_servlet_session_TimeSinceLastActivatedServlet Session Time Since Last ActivatedmsThe time difference in milliseconds between previous and current access time stamps. Does not include session time out.
ibm_was_jdbc_PercentUsedJDBC Percent UsedNULLThe percent of the pool that is in use.
ibm_was_thread_pools_DestroyCountThread Pools Destroy CountNULLThe total number of threads destroyed
ibm_was_jdbc_WaitingThreadCountJDBC Waiting Thread CountNULLThe number of threads that are currently waiting for a connection.
ibm_was_thread_pools_PercentMaxedThread Pools Percent MaxedNULLThe average percent of the time that all threads are in use
ibm_was_servlet_session_ExternalReadSizeServlet Session External Read SizeBytesSize of the session data read from persistent store. Applicable only for (serialized) persistent sessions
ibm_was_thread_pools_ConcurrentHungThreadCountThread Pools Concurrent Hung Thread CountNULLThe number of concurrently hung threads
ibm_was_jvm_FreeMemoryJVM Free Memory GaugeKBThe free memory in the JVM run time
ibm_was_jdbc_UseTimeJDBC Use TimeNULLThe average time in milliseconds that a connection is in use.
ibm_was_servlet_session_ExternalWriteSizeServlet Session External Write SizeBytesThe size of the session data written to persistent store. Applicable only for (serialized) persistent sessions. Similar to External Read Size.
ibm_was_servlet_session_SessionObjectSizeServlet Session Session Object SizeBytesThe size in bytes of (the serializable attributes of) in-memory sessions. Only session objects that contain at least one serializable attribute object are counted. A session can contain some attributes that are serializable and some that are not. The size in bytes is at a session level.
ibm_was_jdbc_PoolSizeJDBC Pool SizeNULLThe size of the connection pool.
ibm_was_jdbc_PercentMaxedJDBC Percent MaxedNULLThe percent of the time that all connections are in use.
ibm_was_servlet_session_TimeoutInvalidationCountServlet Session Timeout Invalidation CountNULLThe number of sessions that are invalidated by timeout.
ibm_was_servlet_session_ExternalReadTimeServlet Session External Read TimemsThe time (ms) taken in reading the session data from the persistent store. For multirow sessions, the metrics are for the attribute; for single row sessions, the metrics are for the entire session. Applicable only for persistent sessions. When using a JMS persistent store, only available if replicated data is serialized.
ibm_was_jdbc_ConnectionHandleCountJDBC Connection Handle CountNULLThe number of connections that are in use. Can include multiple connections that are shared from a single managed connection.
ibm_was_jdbc_ManagedConnectionCountJDBC Managed Connection CountNULLThe total number of managed connections in the free, shared, and unshared pools.
ibm_was_servlet_session_AffinityBreakCountServlet Session Affinity Break CountNULLThe number of requests that are received for sessions that were last accessed from another web application. This value can indicate failover processing or a corrupt plug-in configuration.
ibm_was_jvm_HeapSizeJVM Heap Size GaugeKBThe total memory in the JVM run time
ibm_was_servlet_session_ActivateNonExistSessionCountServlet Session Activate Non Exist Session CountNULLThe number of requests for a session that no longer exists, presumably because the session timed out. Use this counter to help determine if the timeout is too short.

Agent G2 - Linux Directory Monitor - v2

Description

Agent G2 - Linux Directory Monitor

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Linux Directory Monitor - v2system_directory_SizeSystem Directory SizeGBSystem Directory Size: To monitor the total size of the given directory/file path
system_directory_file_ReadOnlyStatusSystem Directory File ReadOnlyStatusNULLSystem Directory File ReadOnlyStatus - To monitor the Filtered File(s) Read Accessibility. 0 - Read Only, 1 - Other than Read Only
system_directory_FileCountSystem Directory File CountcountSystem Directory File Count: To monitor the file count in a given directory based on given regex pattern
system_directory_FileSizeSystem Directory File SizeKBSystem Directory File Size: To monitor the size of each file in a given directory path, filtering files based on the given regex pattern.
system_directory_FileSizeTotalSystem Directory FileSizeTotalMBSystem Directory FileSizeTotal: To monitor the total size of filtered files in a given directory based on given regex pattern.
system_directory_AvailabilitySystem Directory AvailabilityNULLSystem Directory Availability: To monitor the availability of given Directory/File path. Below are the possible states: 1 - Available, 0 - Not Available
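
The file count, total size, and availability metrics above can be illustrated with a minimal sketch. It is not the agent's implementation; the function names `summarize_directory` and `directory_availability` are hypothetical, and the sketch assumes a non-recursive scan with the regex applied to file names only.

```python
import os
import re


def summarize_directory(path, pattern):
    """Return (file_count, total_bytes) for files in `path` matching `pattern`.

    Assumption: only regular files directly inside `path` are considered
    (no recursion), and the regex is matched against the file name.
    """
    regex = re.compile(pattern)
    file_count = 0
    total_bytes = 0
    with os.scandir(path) as entries:
        for entry in entries:
            if entry.is_file() and regex.search(entry.name):
                file_count += 1
                total_bytes += entry.stat().st_size
    return file_count, total_bytes


def directory_availability(path):
    """Return 1 if the directory or file path exists, otherwise 0."""
    return 1 if os.path.exists(path) else 0
```

For example, `summarize_directory("/var/log", r"\.log$")` would count the `.log` files directly under `/var/log` and sum their sizes in bytes; converting to KB, MB, or GB is left to the caller.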

Agent G2 - Linux Network Monitoring

Description

Agent G2 - Linux Network Monitoring

Prerequisites

No prerequisite

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux Network Monitorsystem.network.interface.discards.inSystem Network In discardspsecMonitors Network in discards of each interface for Linux Devices
system.network.interface.discards.outSystem Network Out DiscardspsecMonitors network Out Discards of each interface for Linux Devices
system.network.interface.errors.inSystem Network In ErrorsErrors per SecMonitors network in errors of each interface for Linux Devices
system.network.interface.errors.outSystem Network Out ErrorsErrors per SecMonitors network out errors of each interface for Linux Devices
system.network.interface.packets.inSystem Network In packetspackets/secMonitors in Packets of each interface for Linux Devices
system.network.interface.packets.outSystem Network out packetspackets/secMonitors Out packets of each interface for Linux Devices
system.network.interface.traffic.inSystem Network In trafficKbpsMonitors In traffic of each interface for Linux Devices
system.network.interface.traffic.outSystem Network Out TrafficKbpsMonitors Out traffic of each interface for Linux Devices
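
Per-second metrics such as discards, errors, packets, and traffic are typically derived from cumulative interface counters sampled at two points in time. The sketch below shows that calculation under stated assumptions: the function names are hypothetical, Kbps is taken to mean kilobits per second (1 kilobit = 1000 bits), and a counter that goes backwards is treated as a reset and reported as 0. The agent's actual behavior may differ.

```python
def per_second_rate(prev_count, curr_count, interval_seconds):
    """Return the per-second rate between two cumulative counter samples."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    delta = curr_count - prev_count
    if delta < 0:
        # Assumption: a negative delta means the counter was reset or wrapped.
        return 0.0
    return delta / interval_seconds


def traffic_kbps(prev_bytes, curr_bytes, interval_seconds):
    """Convert a byte-counter delta into kilobits per second."""
    bytes_per_second = per_second_rate(prev_bytes, curr_bytes, interval_seconds)
    return bytes_per_second * 8 / 1000
```

With a 10-second interval and 125,000 bytes received between samples, `traffic_kbps(0, 125000, 10)` evaluates to 100.0.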

Agent G2 - Linux NFS Mount Point Monitoring

Description

Template to monitor the availability, accessibility, and utilization of Linux NFS mount points. This template needs to be assigned to all Linux NFS client machines. It has been tested on CentOS 7 with NFS mount points available.

Prerequisites

NFS mount points (related to an NFS file system) must be available on the target device.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux NFS Mount Point Custom Monitorsystem_linux_nfs_mountpoint_accessibilitySystem Linux NFS Mountpoint AccessibilityNULLSystem Linux NFS Mountpoint Accessibility - Below are the possible values: 1 - Read/Write access not available, 0 - Read/Write access available
system_linux_nfs_mountpoint_utilizationSystem Linux NFS Mountpoint Utilization%System Linux NFS Mountpoint Utilization
system_linux_nfs_mountpoint_availabilitySystem Linux NFS Mountpoint AvailabilityNULLSystem Linux NFS Mountpoint Availability - Below are the possible values: 1 - NFS Mount Point Not Available, 0 - NFS Mount Point Available
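
Note that these two metrics use 0 for the healthy state and 1 for the failing state. The sketch below shows one way such values could be produced; it is an illustration only, with hypothetical function names. It assumes availability means "the path is currently a mount point" and accessibility means "the current process can read and write the path". It does not verify that the file system type is NFS, and a hung NFS server could cause these calls to block.

```python
import os


def nfs_mountpoint_availability(path):
    """Return 0 if `path` is currently a mount point, otherwise 1."""
    return 0 if os.path.ismount(path) else 1


def nfs_mountpoint_accessibility(path):
    """Return 0 if `path` is readable and writable by this process, otherwise 1."""
    return 0 if os.access(path, os.R_OK | os.W_OK) else 1
```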

Agent G2 - Linux NFS Mount Point Monitoring - V2

Description

Template to monitor the availability, accessibility, and utilization of Linux NFS mount points. In version 2, the mount point name is set as the component in both graphs and alerts, whereas in version 1 the component was the file system name. This template needs to be assigned to all Linux NFS client machines.

Prerequisites

NFS mount points (related to an NFS file system) must be available on the target device.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux NFS Mount Point Custom Monitor-V2system_linux_nfs_mountpoint_accessibilitySystem Linux NFS Mountpoint AccessibilitySystem Linux NFS Mountpoint Accessibility - Below are the possible values: 1 - Read/Write access not available, 0 - Read/Write access available
system_linux_nfs_mountpoint_utilizationSystem Linux NFS Mountpoint Utilization%System Linux NFS Mountpoint Utilization
system_linux_nfs_mountpoint_availabilitySystem Linux NFS Mountpoint AvailabilitySystem Linux NFS Mountpoint Availability - Below are the possible values: 1 - NFS Mount Point Not Available, 0 - NFS Mount Point Available

Agent G2 - Linux NFS Mount Point Monitoring - v3

Description

Template to monitor the availability, accessibility, and utilization of Linux NFS mount points. This template needs to be assigned to all Linux NFS client machines.

Prerequisites

NFS mount points (related to an NFS file system) must be available on the target device.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux NFS Mount Point Custom Monitor - v3system_linux_nfs_mountpoint_accessibilitySystem Linux NFS Mountpoint AccessibilitySystem Linux NFS Mountpoint Accessibility - Below are the possible values: 1 - Read/Write access not available, 0 - Read/Write access available
system_linux_nfs_mountpoint_utilizationSystem Linux NFS Mountpoint Utilization%System Linux NFS Mountpoint Utilization
system_linux_nfs_mountpoint_availabilitySystem Linux NFS Mountpoint AvailabilitySystem Linux NFS Mountpoint Availability - Below are the possible values: 1 - NFS Mount Point Not Available, 0 - NFS Mount Point Available

Agent G2 - Linux OS File Systems Monitoring

Description

Template to monitor Linux OS advanced performance metrics related to file systems (space, inode utilization, and change in mount status of a file system). It has been validated on the following Linux flavors: RHEL 7.9, CentOS 7, Ubuntu 20.04, openSUSE Linux.

Prerequisites

  1. Applicable on devices running OpsRamp Agent (v7.0.0 or above)

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux File Systemssystem_linux_fileSystem_Inodes_Usage_NumberSystem Linux FileSystem Inodes Usage NumbercountFile system Inodes usage number.
system_linux_fileSystem_Inodes_DeltaSystem Linux FileSystem Inodes DeltacountFile system Inodes usage delta.
system_linux_fileSystem_Space_UsedInMBSystem Linux FileSystem Space Used In MBMBFile system space usage in MB
system_linux_fileSystem_Space_DeltaInKBSystem Linux FileSystem Space Delta In KBKBFile system space usage(KB) delta
system_linux_fileSystem_Inodes_UtilizationSystem Linux FileSystem Inodes Utilization%File system Inodes utilization percent.
Agent G2 - Linux File Systems Mount Change Detectionsystem_linux_fileSystem_Mount_ChangeDetectionSystem Linux Filesystem Mount Change DetectionFile system mount point change detection. It detects whether any file system mount point has been removed or any new mount point has been added to the system. Below are the possible values: 0 - Available, 1 - Newly Added, 2 - Removed
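
The mount change detection values (0 - Available, 1 - Newly Added, 2 - Removed) can be understood as a comparison between the mount points seen at the previous poll and those seen at the current poll. The following is a minimal sketch of that comparison, not the agent's code; `detect_mount_changes` is a hypothetical name, and how the agent stores the previous state is not described here.

```python
def detect_mount_changes(previous, current):
    """Classify mount points by comparing two polls.

    `previous` and `current` are sets of mount point paths.
    Returns a dict mapping each mount point to:
      0 - Available (present in both polls)
      1 - Newly Added (present only in the current poll)
      2 - Removed (present only in the previous poll)
    """
    states = {}
    for mount_point in current:
        states[mount_point] = 0 if mount_point in previous else 1
    for mount_point in previous - current:
        states[mount_point] = 2
    return states
```

For instance, `detect_mount_changes({"/", "/data"}, {"/", "/backup"})` returns `{"/": 0, "/backup": 1, "/data": 2}` (key order may vary).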

Agent G2 - Linux OS File Systems Monitoring - V2

Description

Template to monitor Linux OS advanced performance metrics related to file systems (network and local), covering space, inode utilization, and change in mount status of a file system. It has been validated on the following Linux flavors: RHEL 7.9, CentOS 7, Ubuntu 20.04, openSUSE Linux.

Prerequisites

  1. Applicable on devices running OpsRamp Agent (v7.0.0 or above)

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux File Systems - V2system_linux_fileSystem_Inodes_Usage_NumberSystem Linux FileSystem Inodes Usage NumbercountFile system Inodes usage number.
system_linux_fileSystem_Inodes_DeltaSystem Linux FileSystem Inodes DeltacountFile system Inodes usage delta.
system_linux_fileSystem_Space_UsedInMBSystem Linux FileSystem Space Used In MBMBFile system space usage in MB
system_linux_fileSystem_Space_DeltaInKBSystem Linux FileSystem Space Delta In KBKBFile system space usage(KB) delta
system_linux_fileSystem_Inodes_UtilizationSystem Linux FileSystem Inodes Utilization%File system Inodes utilization percent.
Agent G2 - Linux File Systems Mount Change Detection - V2system_linux_fileSystem_Mount_ChangeDetectionSystem Linux Filesystem Mount Change DetectionFile system mount point change detection. It detects whether any file system mount point has been removed or any new mount point has been added to the system. Below are the possible values: 0 - Available, 1 - Newly Added, 2 - Removed

Agent G2 - Linux OS File Systems Monitoring - v3

Description

Template to monitor Linux OS advanced performance metrics related to file systems (network and local), covering space, inode utilization, and change in mount status of a file system. It has been validated on the following Linux flavors: AlmaLinux release 8.6 (Sky Tiger), RHEL 9.0 (Plow), RHEL 7.9 (Maipo), CentOS 8, Ubuntu 20.04, openSUSE, SUSE SLES 15 SP2.

Prerequisites

  1. Applicable on devices running OpsRamp Agent (v7.0.0 or above)

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux File Systems - V2system_linux_fileSystem_Inodes_Usage_NumberSystem Linux FileSystem Inodes Usage NumbercountFile system Inodes usage number.
system_linux_fileSystem_Inodes_DeltaSystem Linux FileSystem Inodes DeltacountFile system Inodes usage delta.
system_linux_fileSystem_Space_UsedInMBSystem Linux FileSystem Space Used In MBMBFile system space usage in MB
system_linux_fileSystem_Space_DeltaInKBSystem Linux FileSystem Space Delta In KBKBFile system space usage(KB) delta
system_linux_fileSystem_Inodes_UtilizationSystem Linux FileSystem Inodes Utilization%File system Inodes utilization percent.
Agent G2 - Linux File Systems Mount Change Detection - V3system_linux_fileSystem_Mount_ChangeDetectionSystem Linux Filesystem Mount Change DetectionFile system mount point change detection. It detects whether any file system mount point has been removed or any new mount point has been added to the system. Below are the possible values: 0 - Available, 1 - Newly Added, 2 - Removed

Agent G2 - Linux OS File Systems Monitoring - v4

Description

Agent G2 - Linux OS File Systems Monitoring - v4

Prerequisites

  1. Applicable on devices running OpsRamp Agent (v7.0.0 or above)

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux File Systems - v4system_linux_fileSystem_Inodes_Usage_NumberSystem Linux FileSystem Inodes Usage NumbercountFile system Inodes usage number.
system_linux_fileSystem_Inodes_DeltaSystem Linux FileSystem Inodes DeltacountFile system Inodes usage delta.
system_linux_fileSystem_Space_UsedInMBSystem Linux FileSystem Space Used In MBMBFile system space usage in MB
system_linux_fileSystem_Space_DeltaInKBSystem Linux FileSystem Space Delta In KBKBFile system space usage(KB) delta
system_linux_fileSystem_Inodes_UtilizationSystem Linux FileSystem Inodes Utilization%File system Inodes utilization percent.
Agent G2 - Linux File Systems Mount Change Detection - v4system_linux_fileSystem_Mount_ChangeDetectionSystem Linux Filesystem Mount Change DetectionFile system mount point change detection. It detects whether any file system mount point has been removed or any new mount point has been added to the system. Below are the possible values: 0 - Available, 1 - Newly Added, 2 - Removed

Agent G2 - Linux OS Mount Point Monitoring

Description

Template to monitor Linux OS advanced performance metrics related to mount points (space, inode utilization, and availability status of a mount point). It has been validated on the following Linux flavors: RHEL 7.9, CentOS 7, Ubuntu 20.04, openSUSE Linux.

Prerequisites

  1. Applicable on devices running OpsRamp Agent (v7.0.0 or above)

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux Mount Pointssystem_linux_mountpoint_Space_DeltaInKBSystem Linux Mount Point Space Delta In KBKBMount point space usage(KB) delta
system_linux_mountpoint_Inodes_Usage_NumberSystem Linux Mount Point Inodes Usage NumbercountMount point Inodes usage number.
system_linux_mountpoint_Availability_StatusSystem Linux Mount Point Availability StatusAvailability status of mount point. These are possible values: 0 - Not Available, 1 - Available
system_linux_mountpoint_Space_UsedInMBSystem Linux Mount Point Space Used In MBMBMount point usage in MB
system_linux_mountpoint_Inodes_UtilizationSystem Linux Mount Point Inodes Utilization%Mount point Inodes utilization percent.
system_linux_mountpoint_Inodes_DeltaSystem Linux Mount Point Inodes DeltacountMount point Inodes usage delta.
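
As an illustration of how inode utilization and the delta metrics relate to raw file system counters, here is a minimal sketch built on Python's `os.statvfs`. The function names are hypothetical, and the formulas are assumptions: utilization is taken as used divided by total, and a delta is taken as the difference between two consecutive polls. The agent may compute these differently.

```python
import os


def utilization_percent(total, free):
    """Return the used share of `total` as a percentage (0.0 if total is 0)."""
    if total <= 0:
        # Some file systems report zero total inodes; avoid dividing by zero.
        return 0.0
    return 100.0 * (total - free) / total


def inode_utilization(path):
    """Return inode utilization in percent for the file system holding `path`."""
    stats = os.statvfs(path)
    return utilization_percent(stats.f_files, stats.f_ffree)


def usage_delta(previous, current):
    """Return the change in a usage value between two consecutive polls."""
    return current - previous
```

For example, a mount point with 1000 inodes of which 250 are free gives `utilization_percent(1000, 250) == 75.0`.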

Agent G2 - Linux OS Mount Point Monitoring - V2

Description

Template to monitor Linux OS advanced performance metrics related to mount points (space, inode utilization, and availability status of a mount point). It has been validated on the following Linux flavors: RHEL 7.9, CentOS 7, Ubuntu 20.04, openSUSE Linux.

Prerequisites

  1. Applicable on devices running OpsRamp Agent (v7.0.0 or above)

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Linux Mount Points - V2system_linux_mountpoint_Space_DeltaInKBSystem Linux Mount Point Space Delta In KBKBMount point space usage(KB) delta
system_linux_mountpoint_Inodes_Usage_NumberSystem Linux Mount Point Inodes Usage NumbercountMount point Inodes usage number.
system_linux_mountpoint_Availability_StatusSystem Linux Mount Point Availability StatusAvailability status of mount point. These are possible values: 0 - Not Available, 1 - Available
system_linux_mountpoint_Space_UsedInMBSystem Linux Mount Point Space Used In MBMBMount point usage in MB
system_linux_mountpoint_Inodes_UtilizationSystem Linux Mount Point Inodes Utilization%Mount point Inodes utilization percent.
system_linux_mountpoint_Inodes_DeltaSystem Linux Mount Point Inodes DeltacountMount point Inodes usage delta.

Agent G2 - Linux OS Performance Monitoring

Description

Agent G2 - Linux OS Performance Monitoring

Prerequisites

No prerequisite

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - CPU Monitorsystem.cpu.usage.utilizationSystem CPU Utilization%The percentage of elapsed time that the processor spends executing a non-idle thread (this does not include CPU steal time)
Agent G2 - Disk Monitorsystem.disk.usage.freespaceSystem FreeDisk UsageGBMonitors the Free Space usage in GB
system.disk.usage.usedspaceSystem Disk UsedSpaceGBMonitors disk used space in GB
system.disk.usage.utilizationSystem Disk Utilization%Monitors disk utilization in percentage
Agent G2 - Memory Monitorsystem.memory.usage.usedspaceSystem Memory Used SpaceGBPhysical and virtual memory usage in GB
system.memory.usage.utilizationSystem Memory Utilization%Physical and virtual memory usage in percentage.
Agent G2 - Uptime Monitorsystem.os.uptimeSystem UptimemTime elapsed since last reboot in minutes
Agent G2 - Linux CPU Load Monitorsystem.cpu.loadSystem CPU LoadMonitors the system load over the last 1, 5, and 15 minutes. It reports the load average per CPU core.
Agent G2 - Linux CPU Statistics Monitorsystem.cpu.usage.statsSystem CPU Usage Statistics%Monitors the percentage of CPU time spent in various program spaces. User - the processor time spent running user space processes; System - the amount of time that the CPU spent running the kernel; IOWait - the time the CPU spends idle while waiting for an I/O operation to complete; Idle - the time the processor spends idle; Steal - the time a virtual CPU has spent waiting for the hypervisor to service another virtual CPU running on a different virtual machine; Kernel Time; Total Time.
Agent G2 - Linux Disk Inode Utilization Monitorsystem.disk.inode.utilizationSystem Disk Inode Utilization%This monitor is to collect DISK Inode metrics for all physical disks in a server.
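
The CPU usage statistics (User, System, IOWait, Idle, Steal) are percentages of CPU time per category. On Linux these are commonly derived from two samples of cumulative tick counters, such as those exposed in `/proc/stat`. The sketch below shows only the percentage arithmetic; `cpu_percentages` is a hypothetical name, the sample dictionaries are made-up inputs, and the agent's actual data source and category set are not confirmed here.

```python
def cpu_percentages(prev, curr):
    """Return the percentage of CPU time per category between two samples.

    `prev` and `curr` map category names (for example "user", "system",
    "iowait", "idle", "steal") to cumulative tick counts.
    """
    deltas = {name: curr[name] - prev[name] for name in curr}
    total = sum(deltas.values())
    if total <= 0:
        # No time elapsed between samples (or counters reset): report zeros.
        return {name: 0.0 for name in deltas}
    return {name: 100.0 * ticks / total for name, ticks in deltas.items()}
```

Given deltas of 50 user, 25 system, 5 iowait, 100 idle, and 20 steal ticks (200 in total), the result is 25.0% user, 12.5% system, 2.5% iowait, 50.0% idle, and 10.0% steal.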

Agent G2 - Microsoft Windows IIS App Pool State

Description

Monitors IIS App Pool States for the given app pool names by fetching the App Pool Names from Internet Information Services (IIS) Manager > Application Pools > Column ‘Name’. By default, it will monitor all the available App Pools because the default value is ‘all’. Otherwise, it will monitor the given exact app pool names separated by commas (avoid spaces for each app pool name). Status of the application pool (1 - Uninitialized, 2 - Initialized, 3 - Running, 4 - Disabling, 5 - Disabled, 6 - Shutdown Pending, 7 - Delete Pending).

Prerequisites

No prerequisite

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Windows IIS App Pool Statemicrosoft_windows_iis_appPool_stateMicrosoft Windows IIS App Pool StateStateMonitors IIS App Pool States for the given app pool names by fetching the App Pool Names from Internet Information Services (IIS) Manager > Application Pools > Column 'Name'. By default, it will monitor all the available App Pools because the default value is 'all.' Otherwise, it will monitor the given exact app pool names separated by commas (avoid spaces for each app pool name). Status of the application pool (1 - Uninitialized, 2 - Initialized, 3 - Running, 4 - Disabling, 5 - Disabled, 6 - Shutdown Pending, 7 - Delete Pending).
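
When consuming this metric, the numeric state can be translated back to its label, and the app pool name parameter can be interpreted as either `all` or a comma-separated list. The sketch below illustrates both; the function names are hypothetical, the state labels are copied from the description above, and trimming stray whitespace around names is a tolerance added here rather than documented agent behavior.

```python
# State codes as listed in the monitor description.
APP_POOL_STATES = {
    1: "Uninitialized",
    2: "Initialized",
    3: "Running",
    4: "Disabling",
    5: "Disabled",
    6: "Shutdown Pending",
    7: "Delete Pending",
}


def describe_app_pool_state(code):
    """Return the label for a state code, or "Unknown" if unrecognized."""
    return APP_POOL_STATES.get(code, "Unknown")


def parse_app_pool_names(value):
    """Return None for 'all' (monitor every pool), else a list of exact names."""
    if value.strip().lower() == "all":
        return None
    names = [name.strip() for name in value.split(",")]
    return [name for name in names if name]
```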

Agent G2 - Microsoft Active Directory Domain Controller Performance and Availability

Description

Monitors Active Directory metrics like Global Catalog Bind Time, Global Catalog Search Time, DNS Servers Bind Time, Lost Object Count, SYSVol Share availability.

Prerequisites

No prerequisite

Supported Metric

Monitor NameMonitor DescriptionMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft AD DNS Server Bind TimeMonitors a network connectivity test to the specified DNS server IPs.
Example: 127.0.0.1 127.0.0.3
microsoft_AD_DNS_server_bindTimeMicrosoft AD DNS Server BindTimemsMonitors Microsoft Active Directory network connectivity tests to the specified DNS server IPs.
Agent G2 - Microsoft AD Global Catalog Bind TimeMonitors Active Directory Global Catalog bind time in milliseconds. The user needs to provide the AD Domain Controller IP and its port, separated by a colon.
Example: 127.0.0.1:3268
microsoft_AD_global_catalog_bindTimeMicrosoft AD Global Catalog BindTimemsMonitors Microsoft Active Directory Global Catalog Bind Time in milliseconds.
Agent G2 - Microsoft AD Global Catalog Search TimeMonitors Active Directory Global Catalog search time in milliseconds using the given Domain Controller IPs and their associated filter. It measures the time it takes to perform the search operation. The user needs to provide the Domain Controller IP:port and the LDAP filter name.
Example: DomainControllerIP_and_Port: 127.0.0.1:3268 AD_LDAP_Filter: organization
microsoft_AD_global_catalog_searchTimeMicrosoft AD Global Catalog SearchTimemsMonitors Microsoft Active Directory Global Catalog Search Time in milliseconds.
Agent G2 - Microsoft AD LostObjectCount_SYSVOLShareAvailabilityMonitors the count of deleted Active Directory objects and the availability of the Active Directory SYSVOL share.microsoft_AD_lost_object_countMicrosoft AD Lost Object CountcountMonitors the count of deleted Active Directory objects.
Agent G2 - Microsoft AD LostObjectCount_SYSVOLShareAvailabilityMonitors the count of deleted Active Directory objects and the availability of the Active Directory SYSVOL share.microsoft_AD_SYSVOL_share_availabilityMicrosoft AD SYSVOL Share AvailabilitynullMonitors whether the Microsoft Active Directory SYSVOL share is available.
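
The bind time monitors take an `IP:port` value such as `127.0.0.1:3268` and report a duration in milliseconds. As a rough illustration of parsing that input and timing a connection, the sketch below measures only the TCP connect time. This is a simplification: a real LDAP bind also involves protocol negotiation and authentication, so the agent's reported value will differ. The function names are hypothetical.

```python
import socket
import time


def parse_endpoint(text):
    """Split an 'IP:port' string into (host, port)."""
    host, _, port = text.rpartition(":")
    if not host or not port.isdigit():
        raise ValueError(f"expected IP:port, got {text!r}")
    return host, int(port)


def tcp_connect_time_ms(endpoint, timeout=5.0):
    """Return the time in milliseconds taken to open a TCP connection.

    Assumption: this stands in for a bind-time probe; it does not perform
    an LDAP bind or any authentication.
    """
    host, port = parse_endpoint(endpoint)
    start = time.perf_counter()
    with socket.create_connection((host, port), timeout=timeout):
        pass
    return (time.perf_counter() - start) * 1000.0
```

A call such as `tcp_connect_time_ms("127.0.0.1:3268")` raises `OSError` if the port is unreachable, which a caller could map to an alert.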

Agent G2 - Microsoft Active Directory Domain Controller Performance and Availability - v2

Description

Monitors Active Directory metrics like Global Catalog Bind Time, Global Catalog Search Time, DNS Servers Bind Time, Lost Object Count, SYSVol Share availability.

Prerequisites

No Prerequisites

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft AD DNS Server Bind Timemicrosoft_AD_DNS_server_bindTimeMicrosoft AD DNS Server BindTimemsMonitors Microsoft Active Directory network connectivity tests to the specified DNS server IPs.
Agent G2 - Microsoft AD Global Catalog Bind Timemicrosoft_AD_global_catalog_bindTimeMicrosoft AD Global Catalog BindTimemsMonitors Microsoft Active Directory Global Catalog Bind Time in milliseconds
Agent G2 - Microsoft AD Global Catalog Search Timemicrosoft_AD_global_catalog_searchTimeMicrosoft AD Global Catalog SearchTimemsMonitors Microsoft Active Directory Global Catalog Search Time in milliseconds
Agent G2 - Microsoft AD LostObjectCount_SYSVOLShareAvailabilitymicrosoft_AD_lost_object_countMicrosoft AD Lost Object CountcountMonitors the count of deleted Active Directory objects.
Agent G2 - Microsoft AD LostObjectCount_SYSVOLShareAvailabilitymicrosoft_AD_SYSVOL_share_availabilityMicrosoft AD SYSVOL Share AvailabilitynullMonitors whether the Microsoft Active Directory SYSVOL share is available.
Agent G2 - Microsoft AD BindTimemicrosoft_AD_bindTimeMicrosoft AD BindTimemsMonitors Active Directory BindTime for Domain Controller, Infrastructure Master, Domain Naming Master, Primary Domain Controller Emulator (PDC), Relative ID master (RID), Schema Master (SCH) in milliseconds.
Agent G2 - Microsoft AD LastSuccessful Synchronization Timemicrosoft_AD_lastSuccessfulSyncTimeMicrosoft AD LastSuccessfulSync TimemMonitors Microsoft Active Directory last successful synchronization time in minutes.

Agent G2 - Microsoft Windows OS Counters - RSE - v3

Description

Monitors OS-related counters such as disk and memory. This version adds support for these additional metrics: system_windows_Memory_TotalInstalledRAM, system_windows_Memory_UsedRAM, system_windows_Memory_CommitCharge, system_windows_Memory_PageFileInstalled

Prerequisites

No Prerequisites

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
System Windows LogicalDisk Monitorsystem_windows_logicaldisk_UsedSpaceInMegaBytesSystem Windows LogicalDisk UsedSpace In MegaBytesMBSpace on the selected logical disk drive that is in use, in MB
system_windows_logicaldisk_DiskReadPerSecondSystem Windows LogicalDisk DiskRead PerSecondropsDisk Reads/sec is the rate of read operations on the disk.
system_windows_logicaldisk_AvgDiskSecPerReadSystem Windows LogicalDisk AvgDiskSec PerReadnullAvg. Disk sec/Read is the average time, in seconds, of a read of data from the disk.
system_windows_logicaldisk_AvgDiskQueueLengthSystem Windows LogicalDisk AvgDiskQueueLengthnullAvg. Disk Queue Length is the average number of both read and write requests that were queued for the selected disk during the sample interval.
system_windows_logicaldisk_FreeSpaceInPercentSystem Windows LogicalDisk FreeSpace In Percent%% Free Space is the percentage of total usable space on the selected logical disk drive that was free.
system_windows_logicaldisk_FreeSpaceInMBSystem Windows LogicalDisk FreeSpace InMBMBFree Megabytes displays the unallocated space on the disk drive in megabytes. One megabyte is equal to 1,048,576 bytes.
system_windows_logicaldisk_DiskWriteBytesPerSecondSystem Windows Logicaldisk DiskWriteBytes PerSecondBpsDisk Write Bytes/sec is the rate at which bytes are transferred to the disk during write operations.
system_windows_logicaldisk_DiskWritePerSecondSystem windows LogicalDisk DiskWrite PerSecondwopsDisk Writes/sec is the rate of write operations on the disk.
system_windows_logicaldisk_DiskReadBytesPerSecondSystem Windows LogicalDisk DiskReadBytes PerSecondBpsDisk Read Bytes/sec is the rate at which bytes are transferred from the disk during read operations.
system_windows_logicaldisk_AvgDiskSecPerWriteSystem Windows LogicalDisk AvgDiskSec PerWritenullAvg. Disk sec/Write is the average time, in seconds, of a write of data to the disk.
System Windows Processor Monitormicrosoft_windows_Processor_PercentPrivilegedTimeMicrosoft Windows Processor PercentPrivilegedTime%% Privileged Time is the percentage of elapsed time that the process threads spent executing code in privileged mode. When a Windows system service is called, the service will often run in privileged mode to gain access to system-private data. Such data is protected from access by threads executing in user mode. Calls to the system can be explicit or implicit, such as page faults or interrupts. Unlike some early operating systems, Windows uses process boundaries for subsystem protection in addition to the traditional protection of user and privileged modes. Some work done by Windows on behalf of the application might appear in other subsystem processes in addition to the privileged time in the process.
system_windows_processorDPCrateSystem Windows Processor DPCratenullDPC Rate is the rate at which deferred procedure calls (DPCs) were added to the processor's DPC queues between the timer ticks of the processor clock. DPCs are interrupts that run at a lower priority than standard interrupts. Each processor has its own DPC queue. This counter measures the rate that DPCs were added to the queue, not the number of DPCs in the queue. This counter displays the last observed value only; it is not an average.
system_windows_processor_percentprocessortimeSystem Windows Processor PercentProcessorTime%% Processor Time is the percentage of elapsed time that the processor spends to execute a non-Idle thread. It is calculated by measuring the percentage of time that the processor spends executing the idle thread and then subtracting that value from 100%. (Each processor has an idle thread that consumes cycles when no other threads are ready to run). This counter is the primary indicator of processor activity, and displays the average percentage of busy time observed during the sample interval. It should be noted that the accounting calculation of whether the processor is idle is performed at an internal sampling interval of the system clock (10ms). On today's fast processors, % Processor Time can therefore underestimate the processor utilization as the processor may be spending a lot of time servicing threads between the system clock sampling interval. Workload based timer applications are one example of applications which are more likely to be measured inaccurately as timers are signaled just after the sample is taken.
system_windows_processorInterruptsPerSecSystem Windows ProcessorInterrupts PerSecpsecInterrupts/sec is the average rate, in incidents per second, at which the processor received and serviced hardware interrupts. It does not include deferred procedure calls (DPCs), which are counted separately. This value is an indirect indicator of the activity of devices that generate interrupts, such as the system clock, the mouse, disk drivers, data communication lines, network interface cards, and other peripheral devices. These devices normally interrupt the processor when they have completed a task or require attention. Normal thread execution is suspended. The system clock typically interrupts the processor every 10 milliseconds, creating a background of interrupt activity. This counter displays the difference between the values observed in the last two samples, divided by the duration of the sample interval.
microsoft_windows_ProcessorInformation_PercentC1TimeMicrosoft Windows ProcessorInformation PercentC1Time%% C1 Time is the percentage of time the processor spends in the C1 low-power idle state. % C1 Time is a subset of the total processor idle time. C1 low-power idle state enables the processor to maintain its entire context and quickly return to the running state. Not all systems support the % C1 state
microsoft_windows_ASP_InMemoryTemplatesCachedMicrosoft Windows ASP InMemoryTemplatesCachedcountThe number of compiled templates cached in memory.
System Windows PhysicalDisk Monitorsystem_windows_PhysicalDisk_writesPerSecSystem Windows PhysicalDisk Writes PerSecpsecDisk Writes/sec is the rate of write operations on the disk.
system_windows_PhysicalDisk_avgDiskSecPerReadSystem Windows PhysicalDisk AvgDiskSec PerReadnullAvg. Disk sec/Read is the average time, in seconds, of a read of data from the disk.
system_windows_PhysicalDisk_writeBytesPerSecSystem Windows PhysicalDisk WriteBytes PerSecBpsDisk Write Bytes/sec is the rate at which bytes are transferred to the disk during write operations.
System_Windows_PhysicalDisk_readBytesPerSecSystem Windows PhysicalDisk ReadBytes PerSecBpsDisk Read Bytes/sec is the rate at which bytes are transferred from the disk during read operations.
System_Windows_PhysicalDisk_avgDiskSecPerWriteSystem Windows PhysicalDisk AvgDiskSec PerWritenullAvg. Disk sec/Write is the average time, in seconds, of a write of data to the disk.
system_windows_PhysicalDisk_readsPerSecSystem Windows PhysicalDisk ReadsPerSecpsecDisk Reads/sec is the rate of read operations on the disk.
System_Windows_PhysicalDisk_avgDiskQueueLengthSystem Windows PhysicalDisk avgDiskQueueLengthnullAvg. Disk Queue Length is the average number of both read and write requests that were queued for the selected disk during the sample interval.
microsoft_windows_PhysicalDisk_AvgDiskBytesPerReadMicrosoft Windows PhysicalDisk AvgDiskBytesPerReadnullAvg. Disk Bytes/Read is the average number of bytes transferred from the disk during read operations.
microsoft_windows_PagingFile_PercentUsageMicrosoft Windows PagingFile PercentUsage%The amount of the Page File instance in use in percent
System Windows Server Monitorsystem_windows_server_logonsPerSecSystem Windows Server LogonsPerSecpsecLogon/sec is the rate of all server logons.
system_windows_contextSwitchesPerSecSystem Windows ContextSwitchesPerSecpsecContext Switches/sec is the combined rate at which all processors on the computer are switched from one thread to another. Context switches occur when a running thread voluntarily relinquishes the processor, is preempted by a higher priority ready thread, or switches between user-mode and privileged (kernel) mode to use an Executive or subsystem service. It is the sum of Thread Context Switches/sec for all threads running on all processors in the computer and is measured in numbers of switches. There are context switch counters on the System and Thread objects. This counter displays the difference between the values observed in the last two samples, divided by the duration of the sample interval.
system_windows_bytesTransmittedPerSecSystem Windows BytesTransmitted PerSecpsecThe number of bytes the server has sent on the network. Indicates how busy the server is.
microsoft_windows_System_ProcessorQueueLengthMicrosoft Windows System ProcessorQueueLengthnullProcessor Queue Length is the number of threads in the processor queue. Unlike the disk counters, this counter shows ready threads only, not threads that are running. There is a single queue for processor time even on computers with multiple processors. Therefore, if a computer has multiple processors, you need to divide this value by the number of processors servicing the workload. A sustained processor queue of less than 10 threads per processor is normally acceptable, depending on the workload.
system_windows_bytesReceivedPerSecSystem Windows BytesReceived PerSecpsecThe number of bytes the server has received from the network. Indicates how busy the server is.
Agent G2 - Windows Memory Custom Monitor - v3System_Windows_Pages_Output_PerSecSystem_Windows_Pages_Output_PerSecpsec
System_Windows_Memory_AvailableMbytesSystem_Windows_Memory_AvailableMbytesMBCalculates the Memory Available in Mega Bytes of the System
System_Windows_Memory_CommitLimitSystem_Windows_Memory_CommitLimitBytesCalculates the physical memory space reserved on the disk paging files
System_Windows_Memory_CommittedBytesSystem_Windows_Memory_CommittedBytesBytesCalculates the Total Number of Committed Bytes of the System
System_Windows_Memory_CommittedBytes_InuseSystem_Windows_Memory_CommittedBytes_InuseBytesCalculates the Percentage of the Committed Bytes In use of the System
System_Windows_Memory_PageFaults_PerSecSystem_Windows_Memory_PageFaults_PerSecpsecCalculates the Total Number of Memory Page Faults Per Second of the System
System_Windows_Memory_Pages_PerSecSystem_Windows_Memory_Pages_PerSecpsecCalculates the Total Number of Memory pages per second of the System
system_windows_rawdata_pagesIn_PerSecondsystem_windows_rawdata_pagesIn_PerSecondnull
system_windows_Memory_TotalInstalledRAMSystem Windows Memory TotalInstalledRAMGBTotal PhysicalMemory(RAM) in GB
system_windows_Memory_UsedRAMSystem Windows Memory UsedRAMGBUsed PhysicalMemory(RAM) in GB
system_windows_Memory_CommitChargeSystem Windows Memory CommitChargeGBCommit Charge is the amount of committed virtual memory. Committed memory is the physical memory which has space reserved on the disk paging file(s). There can be one or more paging files on each physical drive. This counter displays the last observed value only; it is not an average.
system_windows_Memory_PageFileInstalledSystem Windows Memory PageFileInstalledGBPage File Installed (Commit Limit) is the amount of virtual memory that can be committed without having to extend the paging file(s). Committed memory is the physical memory which has space reserved on the disk paging files. There can be one paging file on each logical drive. If the paging file(s) are expanded, this limit increases accordingly. This counter displays the last observed value only; it is not an average.
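
Two of the rows above describe small calculations: the Processor Queue Length value is divided by the number of processors before it is compared with the guideline of 10 threads per processor, and Commit Charge can be read against Page File Installed (Commit Limit). A minimal Python sketch of both follows. The function names are hypothetical, and treating the commit ratio as a percentage is an assumption based on the usual Windows definition of % Committed Bytes In Use, not something the template documentation states.

```python
def queue_per_processor(queue_length: float, processor_count: int) -> float:
    """Normalize Processor Queue Length by the number of processors."""
    if processor_count < 1:
        raise ValueError("processor_count must be at least 1")
    return queue_length / processor_count


def queue_is_acceptable(queue_length: float, processor_count: int,
                        limit_per_processor: float = 10.0) -> bool:
    """Apply the 'less than 10 threads per processor' guideline from the table."""
    return queue_per_processor(queue_length, processor_count) < limit_per_processor


def commit_usage_percent(commit_charge_gb: float, commit_limit_gb: float) -> float:
    """Assumed ratio: committed virtual memory as a share of the commit limit."""
    if commit_limit_gb <= 0:
        raise ValueError("commit_limit_gb must be positive")
    return 100.0 * commit_charge_gb / commit_limit_gb


if __name__ == "__main__":
    # Illustrative values only, not measurements.
    print(queue_per_processor(24, 8))
    print(queue_is_acceptable(24, 8))
    print(commit_usage_percent(12.0, 48.0))
```

The threshold is kept as a parameter because the table notes that the acceptable queue length depends on the workload.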

Agent G2 - Microsoft Windows OS Counters - RSE - v2

Description

Monitors OS-related counters such as disk and memory.

Prerequisites

No Prerequisites

Agent G2 - MSSQL Database Availability Status - v2

Description

Monitors the MSSQL instance connection time in milliseconds and the database instance status along with the database status. If the instance is running but the databases associated with it are offline, the instance is considered Down. Conversely, if the instance is running and the associated database is up, the instance is considered Up. The default value is ‘all’, which monitors all databases. To monitor particular databases, provide the database names as a comma-separated list. Example: master,msdb,temp.
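
The database selection and the Up/Down rule described above can be sketched in Python as follows. This is only an illustration: the function names are hypothetical, and the sketch assumes that a single offline database among the selected ones is enough to report Down, which the description does not spell out.

```python
from typing import Dict, List, Optional


def parse_database_filter(raw: str = "all") -> Optional[List[str]]:
    """Return None for 'all' (monitor every database), else the listed names."""
    value = (raw or "all").strip()
    if value.lower() == "all":
        return None
    return [name.strip() for name in value.split(",") if name.strip()]


def instance_state(instance_running: bool,
                   database_online: Dict[str, bool],
                   selected: Optional[List[str]] = None) -> str:
    """Report 'Up' only when the instance runs and every selected database is online."""
    if not instance_running:
        return "Down"
    names = selected if selected is not None else list(database_online)
    # Assumption: a database missing from the status map counts as offline.
    all_online = all(database_online.get(name, False) for name in names)
    return "Up" if all_online else "Down"


if __name__ == "__main__":
    status = {"master": True, "msdb": False}  # illustrative values
    print(instance_state(True, status, parse_database_filter("all")))
    print(instance_state(True, status, parse_database_filter("master")))
```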

Prerequisites

Agent G2 - MSSQL Database Availability Status - v2

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - MSSQL Database Availability Status - v2 | mssql_database_availability_status | MSSQL Database Availability Status | Status | MSSQL Database Availability Status
Agent G2 - MSSQL Database Availability Status - v2 | mssql_database_connection_time | MSSQL Database Connection Time | ms | Monitors MSSQL Database Instance Connection Time in milliseconds

Agent G2 - MSSQL Database Data and Log Files Freespace and Utilization

Description

Monitors MSSQL Database Log & Data file Freespace and utilization.

Prerequisites

Requires Agent version 14.0.0 or later. MSSQL database credentials must be added to the device. When assigning the template, the user needs to provide the MSSQL Server name (if there are multiple instances, provide it in this format: ServerName InstanceName) and the MSSQL instance name (instance names are available in services.msc), and specify whether the authentication type is Windows or SQL.

Template Usage Guidelines:

  • Requires Agent version 14.0.0 or later.
  • MSSQL database credentials must be added to the device.
  • When assigning the template, the user needs to provide the MSSQL Server name (if there are multiple instances, provide it in this format: ServerName InstanceName) and the MSSQL instance name (instance names are available in services.msc), and specify whether the authentication type is Windows or SQL.

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - MSSQL Database Data and Log Files Freespace and Utilization | mssql_database_logFilesFreeSpace | MSSQL Database LogFileFreeSpace | % | Monitors MSSQL database LogFiles free space
Agent G2 - MSSQL Database Data and Log Files Freespace and Utilization | mssql_database_dataFilesFreeSpace | MSSQL Database DataFilesFreeSpace | % | Monitors MSSQL datafiles free space regardless of auto-growth
Agent G2 - MSSQL Database Data and Log Files Freespace and Utilization | mssql_database_ldf_file_utilization_percentage | MSSQL Database ldf File Utilization in Percentage | % | Monitors MSSQL .ldf files utilization in percentage
Agent G2 - MSSQL Database Data and Log Files Freespace and Utilization | mssql_database_ndf_file_utilization_percentage | MSSQL Database ndf File Utilization Percentage | % | Monitors MSSQL .ndf files utilization in percentage
Agent G2 - MSSQL Database Data and Log Files Freespace and Utilization | mssql_database_mdf_file_utilization_percentage | MSSQL Database mdf File Utilization Percentage | % | Monitors MSSQL .mdf files utilization in percentage

Agent G2 - MSSQL Database Agent Job Status

Description

Monitors the latest MSSQL Agent job status (last 15 minutes only). Possible status values: 0 : Failed, 1 : Successful, 2 : Retry, 3 : Cancelled, 4 : InProgress.
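
When alerts or reports consume this metric, the numeric values listed above can be translated back into labels. A minimal Python sketch, assuming the metric arrives as an integer; the names `JOB_STATUS` and `describe_job_status` are hypothetical and not part of the template.

```python
# Status codes as listed in the template description.
JOB_STATUS = {
    0: "Failed",
    1: "Successful",
    2: "Retry",
    3: "Cancelled",
    4: "InProgress",
}


def describe_job_status(code: int) -> str:
    """Translate a job status code into its label; unknown codes are flagged."""
    return JOB_STATUS.get(code, f"Unknown ({code})")


if __name__ == "__main__":
    for code in (0, 1, 4, 9):
        print(code, describe_job_status(code))
```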

Prerequisites

Requires Agent version 14.0.0 or later. MSSQL database credentials must be added to the device. When assigning the template, the user needs to provide the MSSQL Server name (if there are multiple instances, provide it in this format: ServerName InstanceName) and the MSSQL instance name (instance names are available in services.msc), and specify whether the authentication type is Windows or SQL.

Template Usage Guidelines:

  • Requires Agent version 14.0.0 or later.
  • MSSQL database credentials must be added to the device.
  • When assigning the template, the user needs to provide the MSSQL Server name (if there are multiple instances, provide it in this format: ServerName InstanceName) and the MSSQL instance name (instance names are available in services.msc), and specify whether the authentication type is Windows or SQL.

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - MSSQL Database Agent Job Status | mssql_database_agentJobStatus | MSSQL Database AgentJobStatus |  | MSSQL DB Agent Jobs Status: collects the latest MSSQL Agent job status (last 15 minutes only). Possible status values: 0 : Failed, 1 : Successful, 2 : Retry, 3 : Cancelled, 4 : InProgress

Agent G2 - Microsoft Windows MSSQL Performance and Statistics

Description:

Monitors various MSSQL performance statistics for versions 2000 to 2022. Ensure that the required WMI classes are available on the system.

Template Usage Guidelines:

When applying this template to a device, users must provide an input parameter specifying the MSSQL version. The default value is “MSSQL2022”.

Example 1: MSSQL2022

This monitors the MSSQL 2022 statistics for given metrics.

Example 2: MSSQL2019

This monitors the MSSQL 2019 statistics for given metrics.

The namespaces associated with each MSSQL version are listed below.

  • “MSSQL2000” namespace = “root\Microsoft\SqlServer\ComputerManagement”
  • “MSSQL2005” namespace = “root\Microsoft\SqlServer\ComputerManagement”
  • “MSSQL2008” namespace = “root\Microsoft\SqlServer\ComputerManagement10”
  • “MSSQL2012” namespace = “root\Microsoft\SqlServer\ComputerManagement11”
  • “MSSQL2014” namespace = “root\Microsoft\SqlServer\ComputerManagement12”
  • “MSSQL2016” namespace = “root\Microsoft\SqlServer\ComputerManagement13”
  • “MSSQL2017” namespace = “root\Microsoft\SqlServer\ComputerManagement14”
  • “MSSQL2019” namespace = “root\Microsoft\SqlServer\ComputerManagement15”
  • “MSSQL2022” namespace = “root\Microsoft\SqlServer\ComputerManagement16”
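
The version-to-namespace mapping above amounts to a simple lookup. The Python sketch below illustrates it with the values taken from the list; the function name `namespace_for` and the error handling are hypothetical, and the template's own implementation may differ.

```python
BASE = "root\\Microsoft\\SqlServer\\"

# Suffixes copied from the namespace list in this section.
NAMESPACE_SUFFIX = {
    "MSSQL2000": "ComputerManagement",
    "MSSQL2005": "ComputerManagement",
    "MSSQL2008": "ComputerManagement10",
    "MSSQL2012": "ComputerManagement11",
    "MSSQL2014": "ComputerManagement12",
    "MSSQL2016": "ComputerManagement13",
    "MSSQL2017": "ComputerManagement14",
    "MSSQL2019": "ComputerManagement15",
    "MSSQL2022": "ComputerManagement16",
}


def namespace_for(version: str = "MSSQL2022") -> str:
    """Return the WMI namespace for a version parameter such as 'MSSQL2019'."""
    key = version.strip().upper()
    if key not in NAMESPACE_SUFFIX:
        raise ValueError(f"unsupported version parameter: {version!r}")
    return BASE + NAMESPACE_SUFFIX[key]


if __name__ == "__main__":
    print(namespace_for())             # default parameter value
    print(namespace_for("MSSQL2019"))
```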

To validate which namespaces exist on the device and provide the correct MSSQL version:

  1. Click Start and search for Computer Management.
  2. Under Services and Applications, select WMI Control.
  3. Right-click WMI Control, select Properties, and open the Security tab to view the namespaces under root\Microsoft\SqlServer.
    Please find the attached screenshot for your reference.

Prerequisites

Ensure that the required WMI classes are available on the system.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Microsoft Windows MSSQL Performance and Statisticsmicrosoft_windows_MSSQL_latchWaitsPerSecMicrosoft Windows MSSQL LatchWaits PerSecCount per secMonitors the number of latch requests that could not be granted immediately and had to wait before being granted.
microsoft_windows_MSSQL_averageLatchWaitTimemsMicrosoft Windows MSSQL AverageLatchWaitTimemsmsMonitors the average latch wait time (milliseconds) for latch requests that had to wait.
microsoft_windows_MSSQL_dataFilesSizeKBMicrosoft Windows MSSQL DataFilesSizeKBKBMonitors the cumulative size of all the data files in the database.
microsoft_windows_MSSQL_logFilesUsedSizeKBMicrosoft Windows MSSQL LogFilesUsedSizeKBKBMonitors the cumulative used size of all the log files in the database.
microsoft_windows_MSSQL_logFlushesPersecMicrosoft Windows MSSQL LogFlushesPersecCount per secMonitors the total number of log flushes.
microsoft_windows_MSSQL_transactionsPersecMicrosoft Windows MSSQL TransactionsPersecCount per secMonitors the total number of transactions started for the database
microsoft_windows_MSSQL_lockTimeoutsPersecMicrosoft Windows MSSQL LockTimeoutsPersecCount per secMonitors the number of lock requests that timed out. This includes requests for NOWAIT locks.
microsoft_windows_MSSQL_numberofDeadlocksPersecMicrosoft Windows MSSQL NumberofDeadlocksPersecCount per secMonitors the number of lock requests that resulted in a deadlock.
microsoft_windows_MSSQL_averageWaitTimemsMicrosoft Windows MSSQL AverageWaitTimemsmsMonitors the total average amount of wait time (milliseconds) for each lock request that resulted in a wait.
microsoft_windows_MSSQL_lockWaitsPersecMicrosoft Windows MSSQL LockWaitsPersecCount per secMonitors the total number of lock requests that could not be satisfied immediately and required the caller to wait before being granted the lock.
microsoft_windows_MSSQL_pagelifeexpectancyMicrosoft Windows MSSQL PagelifeexpectancysMonitors the number of seconds a page will stay in the buffer pool without references
microsoft_windows_MSSQL_lazywritesPersecMicrosoft Windows MSSQL LazywritesPersecCount per secMonitors the number of buffers written by buffer manager's lazy writer.
microsoft_windows_MSSQL_pagereadsPersecMicrosoft Windows MSSQL PagereadsPersecCount per secMonitors the number of physical database page reads issued.
microsoft_windows_MSSQL_pagewritesPersecMicrosoft Windows MSSQL PagewritesPersecCount per secMonitors the number of physical database page writes issued.
microsoft_windows_MSSQL_checkpointpagesPersecMicrosoft Windows MSSQL CheckpointpagesPersecCount per secMonitors the number of pages flushed by checkpoint or other operations that require all dirty pages to be flushed.
microsoft_windows_MSSQL_connectionMemoryKBMicrosoft Windows MSSQL ConnectionMemoryKBKBMonitors the total amount of dynamic memory the server is using for maintaining connections.
microsoft_windows_MSSQL_databaseCacheMemoryKBMicrosoft Windows MSSQL DatabaseCacheMemoryKBKBMonitors the amount of memory the server is currently using for the database cache.
microsoft_windows_MSSQL_freeMemoryKBMicrosoft Windows MSSQL FreeMemoryKBKBMonitors the amount of memory the server is currently not using.
microsoft_windows_MSSQL_lockMemoryKBMicrosoft Windows MSSQL LockMemoryKBKBMonitors the total amount of dynamic memory the server is using for locks.
microsoft_windows_MSSQL_memoryGrantsPendingMicrosoft Windows MSSQL MemoryGrantsPendingcountMonitors the current number of processes waiting for a workspace memory grant.
microsoft_windows_MSSQL_optimizerMemoryKBMicrosoft Windows MSSQL OptimizerMemoryKBKBMonitors the total amount of dynamic memory the server is using for query optimization.
microsoft_windows_MSSQL_totalServerMemoryKBMicrosoft Windows MSSQL TotalServerMemoryKBKBMonitors the total amount of dynamic memory the server is currently consuming.
microsoft_windows_MSSQL_SQLCacheMemoryKBMicrosoft Windows MSSQL SQLCacheMemoryKBKBMonitors the total amount of dynamic memory the server is using for the dynamic SQL cache.
microsoft_windows_MSSQL_forwardedRecordsPersecMicrosoft Windows MSSQL ForwardedRecordsPersecCount per secMonitors the number of records fetched through forwarded record pointers.
microsoft_windows_MSSQL_pageSplitsPersecMicrosoft Windows MSSQL PageSplitsPersecCount per secMonitors the number of page splits per second that occur as a result of overflowing index pages.
microsoft_windows_MSSQL_fullScansPersecMicrosoft Windows MSSQL FullScansPersecCount per secMonitors the number of unrestricted full scans. These can either be base table or full index scans.
microsoft_windows_MSSQL_processesBlockedMicrosoft Windows MSSQL ProcessesBlockedcountMonitors the number of currently blocked processes.
microsoft_windows_MSSQL_userConnectionsMicrosoft Windows MSSQL UserConnectionscountMonitors the number of users connected to the system.
microsoft_windows_MSSQL_loginsPersecMicrosoft Windows MSSQL LoginsPersecCount per secMonitors the total number of logins started per second.
microsoft_windows_MSSQL_logoutsPersecMicrosoft Windows MSSQL LogoutsPersecCount per secMonitors the total number of logouts started per second.
microsoft_windows_MSSQL_longestTransactionRunningTimeMicrosoft Windows MSSQL LongestTransactionRunningTimesMonitors the longest running time of any transaction in seconds.
microsoft_windows_MSSQL_receiveIPerOsPersecMicrosoft Windows MSSQL ReceiveIPerOsPersecCount per secMonitors the number of transport receive I/Os per second. Note that a transport receive I/O may contain more than one message fragment.
microsoft_windows_MSSQL_sendIPerOsPersecMicrosoft Windows MSSQL SendIPerOsPersecCount per secMonitors the number of transport send I/Os per second. Note that a transport send I/O may contain more than one message fragment.
microsoft_windows_MSSQL_openConnectionCountMicrosoft Windows MSSQL OpenConnectionCountCountMonitors the total number of transport connections currently open.
microsoft_windows_MSSQL_SQLCompilationsPersecMicrosoft Windows MSSQL SQLCompilationsPersecCount per secMonitors the number of SQL compilations.
microsoft_windows_MSSQL_SQLReCompilationsPersecMicrosoft Windows MSSQL SQLReCompilationsPersecCount per secMonitors the number of SQL re-compiles.
microsoft_windows_MSSQL_batchRequestsPersecMicrosoft Windows MSSQL BatchRequestsPersecCount per secMonitors the number of SQL batch requests received by server.
microsoft_windows_MSSQL_taskLimitReachedMicrosoft Windows MSSQL TaskLimitReachedcountMonitors the total number of times the activated task limit on a queue has been reached.
microsoft_windows_MSSQL_tasksAbortedPersecMicrosoft Windows MSSQL TasksAbortedPersecCount per secMonitors the number of activated tasks that are being aborted per second.
microsoft_windows_MSSQL_SQLRECEIVEsPersecMicrosoft Windows MSSQL SQLRECEIVEsPersecCount per secMonitors the number of SQL RECEIVE commands processed by the Broker per second.
microsoft_windows_MSSQL_BrokerTransactionRollbacksMicrosoft Windows MSSQL BrokerTransactionRollbackscountMonitors the number of Service Broker related transactions that have rolled back.
microsoft_windows_MSSQL_SQLSENDsPersecMicrosoft Windows MSSQL SQLSENDsPersecCount per secMonitors the number of SQL SEND commands processed by the Broker per second.
microsoft_windows_MSSQL_logBytesFlushedPersecMicrosoft Windows MSSQL LogBytesFlushedPersecBpsMonitors the total number of log bytes flushed.
microsoft_windows_MSSQL_percentLogUsedMicrosoft Windows MSSQL PercentLogUsed%Monitors the percent of space in the log that is in use.
microsoft_windows_MSSQL_logFilesSizeKBMicrosoft Windows MSSQL LogFilesSizeKBKBMonitors the cumulative size of all the log files in the database.
microsoft_windows_MSSQL_logFlushWaitsPersecMicrosoft Windows MSSQL LogFlushWaitsPersecCount per secMonitors the number of commits waiting on log flush.
microsoft_windows_MSSQL_logGrowthsMicrosoft Windows MSSQL LogGrowthscountMonitors the total number of log growths for this database.
microsoft_windows_MSSQL_logShrinksMicrosoft Windows MSSQL LogShrinkscountMonitors the total number of log shrinks for this database.
microsoft_windows_MSSQL_cacheHitRatioPercentageMicrosoft Windows MSSQL CacheHitRatio Percentage%Monitors the percentage of cache hits relative to cache lookups.
microsoft_windows_MSSQL_bufferCacheHitRatioPercentageMicrosoft Windows MSSQL Buffercachehitratio Percentage%Monitors the percentage of pages that were found in the buffer pool without having to incur a read from disk.

Agent G2 - Windows CertStore Certificates Expiry - IssuedTo_SN as component

Description

Monitors the expiry (in days) of certificates that are available in Certificate Manager and have not yet expired. This template includes “Issued To and Serial Number” in the alert subject.

Template Usage Guidelines:

The script receives its parameters from the custom monitor at runtime.

The Params field in the custom monitor holds the certificate store paths and the excluded thumbprints, or the keyword “all”.

Formulate your path params in the format specified below and encode the string. The string has two parts separated by a semicolon ( ; ).

The certificate store paths are separated by commas (,) and appear to the left of the semicolon separator.

The thumbprints to exclude from the monitor are separated by commas (,) and appear to the right of the semicolon separator.

The error or warning alerts contain certificate information specifying the respective certificate’s Issuer, Subject, and Serial Number.

Case 1: Giving specific paths as input and excluding a few certificates from those paths (using their thumbprints)

Example String Before Encoding:

cert:\LocalMachine\Root,cert:\LocalMachine\AuthRoot;a8985d3a65e5e5c4b2d7d66d40c6dd2fb19c5436,df3c24f9bfd666761b268073fe06d1cc8d4f82a4

Example String After Encoding: (use a tool like Notepad++ or an online resource like https://www.base64encode.org/ to Base64-encode your params string)

Y2VydDpcTG9jYWxNYWNoaW5lXFJvb3QsY2VydDpcTG9jYWxNYWNoaW5lXEF1dGhSb290O+KAjmE4OTg1ZDNhNjVlNWU1YzRiMmQ3ZDY2ZDQwYzZkZDJmYjE5YzU0MzYs4oCOZGYzYzI0ZjliZmQ2NjY3NjFiMjY4MDczZmUwNmQxY2M4ZDRmODJhNA==
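
The params string can also be built and encoded with a short script instead of an online tool. The Python sketch below assumes the string is Base64-encoded from its UTF-8 bytes, which is consistent with the sample above; the helper names are hypothetical. Thumbprints copied from the Windows certificate dialog can carry an invisible left-to-right mark (U+200E), and the sample encoded string above appears to contain such marks when decoded, so the sketch keeps only hexadecimal characters in each thumbprint.

```python
import base64
import string


def build_params(store_paths, excluded_thumbprints):
    """Join store paths and excluded thumbprints as 'path1,path2;thumb1,thumb2'."""
    paths = ",".join(path.strip() for path in store_paths)
    # Keep hex digits only, dropping spaces and invisible marks such as U+200E.
    thumbs = ",".join(
        "".join(ch for ch in thumb if ch in string.hexdigits)
        for thumb in excluded_thumbprints
    )
    return f"{paths};{thumbs}"


def encode_params(params: str) -> str:
    """Base64-encode the params string (assumption: UTF-8 bytes)."""
    return base64.b64encode(params.encode("utf-8")).decode("ascii")


if __name__ == "__main__":
    params = build_params(
        [r"cert:\LocalMachine\Root", r"cert:\LocalMachine\AuthRoot"],
        [
            "a8985d3a65e5e5c4b2d7d66d40c6dd2fb19c5436",
            "df3c24f9bfd666761b268073fe06d1cc8d4f82a4",
        ],
    )
    print(params)
    print(encode_params(params))
```

Decoding the result with `base64.b64decode` is a quick way to confirm that the string holds exactly the intended paths and thumbprints before pasting it into the Params field.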

Case 2: Monitoring all the certificates from all folders of the Current User account and Local Machine account

To monitor all the certificates, the user needs to give the input as “all”.

Prerequisite

PowerShell version 3 or later.

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Windows CertStore Certificates Expiry - IssuedTo_SN as component | windows_certStore_certificatesExpiryInDays | Windows CertStore CertificatesExpiry InDays | Days | To monitor certificate(s) expiry (in days) which are available in Certificate Manager.

Agent G2 - Windows Memory Monitoring

Description

Agent G2 - Windows Service Monitoring v2.0

Prerequisites

No prerequisite

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Windows Memory Custom Monitor | system.windows.memory.available | System Windows Memory Available | MB | Available MBytes is the amount of physical memory, in Megabytes, immediately available for allocation to a process or for system use. It is equal to the sum of memory assigned to the standby (cached), free and zero page lists.

Agent G2 - Windows Network Interface Monitoring

Description

Agent G2 - Windows Network Interface Monitoring

Prerequisites

No prerequisite

Supported Metric

Monitor Name | Metric Name | Metric Display Name | Unit | Description
Agent G2 - Windows Network Interface Custom Monitor | system.windows.network.interface.packets.persec | System Windows Network Interface Packets Per Sec | packets/sec | Packets/sec is the rate at which packets are sent and received on the network interface.
Agent G2 - Windows Network Interface Custom Monitor | system.windows.network.interface.traffic.in | System Windows Network Interface Traffic In | bps | Bytes Received/sec is the rate at which bytes are received over each network adapter, including framing characters. Network Interface\Bytes Received/sec is a subset of Network Interface\Bytes Total/sec.
Agent G2 - Windows Network Interface Custom Monitor | system.windows.network.interface.traffic.out | System Windows Network Interface Traffic Out | bps | Bytes Sent/sec is the rate at which bytes are sent over each network adapter, including framing characters. Network Interface\Bytes Sent/sec is a subset of Network Interface\Bytes Total/sec.
Agent G2 - Windows Network Interface Custom Monitor | system.windows.network.interface.traffic.total | System Windows Network Interface Traffic Total | bps | Bytes Total/sec is the rate at which bytes are sent and received over each network adapter, including framing characters. Network Interface\Bytes Total/sec is a sum of Network Interface\Bytes Received/sec and Network Interface\Bytes Sent/sec.

Agent G2 - Windows Network Interface Monitoring - v3

Description

Monitors Windows Network Interface metrics. This version of the template adds support for new metrics and changes the metric names from the existing dotted notation to underscore format.

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Windows Network Interface Custom Monitor - v3system_windows_network_interface_statusSystem Windows Network Interface StatusnullNetConnectionStatus is a string indicating the state of the network adapter's connection to the network. The value of the property is to be interpreted as follows: 0 - Disconnected 1 - Connecting 2 - Connected 3 - Disconnecting 4 - Hardware not present 5 - Hardware disabled 6 - Hardware malfunction 7 - Media disconnected 8 - Authenticating 9 - Authentication succeeded 10 - Authentication failed 11 - Invalid Address 12 - Credentials Required Other (13–65535)
system_windows_network_interface_currentBandwidthSystem Windows Network Interface CurrentBandwidthbpsCurrent Bandwidth is an estimate of the current bandwidth of the network interface in bits per second (BPS). For interfaces that do not vary in bandwidth or for those where no accurate estimation can be made, this value is the nominal bandwidth.
system_windows_network_interface_packetsReceivedPersecSystem Windows Network Interface PacketsReceivedPersecpsecPackets Received/sec is the rate at which packets are received on the network interface.
system_windows_network_interface_packetsSentPersecSystem Windows Network Interface PacketsSentPersecpsecPackets Sent/sec is the rate at which packets are sent on the network interface.
system_windows_network_interface_packetsReceivedNonUnicastPersecSystem Windows Network Interface PacketsReceivedNonUnicastPersecpsecPackets Received Non-Unicast/sec is the rate at which non-unicast (subnet broadcast or subnet multicast) packets are delivered to a higher-layer protocol.
system_windows_network_interface_packetsReceivedUnicastPersecSystem Windows Network Interface PacketsReceivedUnicastPersecpsecPackets Received Unicast/sec is the rate at which (subnet) unicast packets are delivered to a higher-layer protocol.
system_windows_network_interface_packetsSentNonUnicastPersecSystem Windows Network Interface PacketsSentNonUnicastPersecpsecPackets Sent Non-Unicast/sec is the rate at which packets are requested to be transmitted to non-unicast (subnet broadcast or subnet multicast) addresses by higher-level protocols. The rate includes the packets that were discarded or not sent.
system_windows_network_interface_packetsSentUnicastPersecSystem Windows Network Interface PacketsSentUnicastPersecpsecPackets Sent Unicast/sec is the rate at which packets are requested to be transmitted to subnet-unicast addresses by higher-level protocols. The rate includes the packets that were discarded or not sent.
system_windows_network_interface_offloadedConnectionsSystem Windows Network Interface OffloadedConnectionscountOffloaded Connections is the number of TCP connections (over both IPv4 and IPv6) that are currently handled by the TCP chimney offload capable network adapter.
system_windows_network_interface_packetsOutboundErrorsSystem Windows Network Interface PacketsOutboundErrorscountPackets Outbound Errors is the number of outbound packets that could not be transmitted because of errors.
system_windows_network_interface_packetsReceivedErrorsSystem Windows Network Interface PacketsReceivedErrorscountPackets Received Errors is the number of inbound packets that contained errors preventing them from being deliverable to a higher-layer protocol.
system_windows_network_interface_outputQueueLengthSystem Windows Network Interface OutputQueueLengthcountOutput Queue Length is the length of the output packet queue (in packets). If this is longer than two, there are delays and the bottleneck should be found and eliminated, if possible. Since the requests are queued by the Network Driver Interface Specification (NDIS) in this implementation, this will always be 0.
system_windows_network_interface_packetsReceivedDiscardedSystem Windows Network Interface PacketsReceivedDiscardedcountPackets Received Discarded is the number of inbound packets that were chosen to be discarded even though no errors had been detected to prevent their delivery to a higher-layer protocol. One possible reason for discarding packets could be to free up buffer space.
system_windows_network_interface_packetsOutboundDiscardedSystem Windows Network Interface PacketsOutboundDiscardedcountPackets Outbound Discarded is the number of outbound packets that were chosen to be discarded even though no errors had been detected to prevent transmission. One possible reason for discarding packets could be to free up buffer space.
system_windows_network_interface_utilizationSystem Windows Network Interface Utilization%Network interface utilization, as a percentage.
system_windows_network_interface_trafficInSystem Windows Network Interface TrafficInbpsBytes Received/sec is the rate at which bytes are received over each network adapter, including framing characters. Network Interface\Bytes Received/sec is a subset of Network Interface\Bytes Total/sec.
system_windows_network_interface_trafficOutSystem Windows Network Interface TrafficOutbpsBytes Sent/sec is the rate at which bytes are sent over each network adapter, including framing characters. Network Interface\Bytes Sent/sec is a subset of Network Interface\Bytes Total/sec.
system_windows_network_interface_trafficTotalSystem Windows Network Interface TrafficTotalbpsBytes Total/sec is the rate at which bytes are sent and received over each network adapter, including framing characters. Network Interface\Bytes Total/sec is a sum of Network Interface\Bytes Received/sec and Network Interface\Bytes Sent/sec.
system_windows_network_interface_packetsPersecSystem Windows Network Interface PacketsPersecpackets/secPackets/sec is the rate at which packets are sent and received on the network interface.
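
The table above states that Bytes Total/sec is the sum of Bytes Received/sec and Bytes Sent/sec, and that utilization is reported as a percentage, but it does not give the utilization formula. The sketch below shows one common way to relate these counters, assuming utilization is total traffic in bits per second divided by Current Bandwidth. The function names are hypothetical and the formula is an assumption, not the agent's confirmed calculation.

```python
def bytes_total_per_sec(bytes_received_per_sec: float, bytes_sent_per_sec: float) -> float:
    """Bytes Total/sec is the sum of Bytes Received/sec and Bytes Sent/sec."""
    return bytes_received_per_sec + bytes_sent_per_sec


def utilization_percent(total_bytes_per_sec: float, current_bandwidth_bps: float) -> float:
    """Assumed formula: (bytes/sec * 8 bits) as a percentage of Current Bandwidth."""
    if current_bandwidth_bps <= 0:
        # No usable bandwidth estimate; report 0 rather than divide by zero.
        return 0.0
    return total_bytes_per_sec * 8 * 100.0 / current_bandwidth_bps


# Example: 7.5 MB/s received and 5 MB/s sent on a 1 Gbps interface.
total = bytes_total_per_sec(7_500_000, 5_000_000)
print(utilization_percent(total, 1_000_000_000))
```

The counters are passed in explicitly as bytes per second, so the same helper can be used regardless of how the values were collected.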

Agent G2 - Windows Network Interface Monitoring - v2

Description

Agent G2 - Windows Network Interface Monitoring - v2

Prerequisites

NULL

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Windows Network Interface Monitoring - v2system.windows.network.interface.packets.received.persecSystem Windows Network Interface Packets Received Per Secpackets/secPackets Received/sec is the rate at which packets are received on the network interface.
system.windows.network.interface.packets.sent.persecSystem Windows Network Interface Packets Sent Per Secpackets/secPackets Sent/sec is the rate at which packets are sent on the network interface.
system.windows.network.interface.traffic.outSystem Windows Network Interface Traffic OutbpsBytes Sent/sec is the rate at which bytes are sent over each network adapter, including framing characters. Network Interface\Bytes Sent/sec is a subset of Network Interface\Bytes Total/sec.
system.windows.network.interface.packets.persecSystem Windows Network Interface Packets Per Secpackets/secPackets/sec is the rate at which packets are sent and received on the network interface.
system.windows.network.interface.traffic.inSystem Windows Network Interface Traffic InbpsBytes Received/sec is the rate at which bytes are received over each network adapter, including framing characters. Network Interface\Bytes Received/sec is a subset of Network Interface\Bytes Total/sec.
system.windows.network.interface.traffic.totalSystem Windows Network Interface Traffic TotalbpsBytes Total/sec is the rate at which bytes are sent and received over each network adapter, including framing characters. Network Interface\Bytes Total/sec is a sum of Network Interface\Bytes Received/sec and Network Interface\Bytes Sent/sec.

Agent G2 - Windows Registry Monitoring

Description

Agent G2 - Windows Registry Monitoring

Prerequisites

No prerequisite

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Windows Registry Custom Monitorsystem.windows.registry.quota.usageSystem Windows Registry Quota Usage%% Registry Quota In Use is the percentage of the Total Registry Quota Allowed that is currently being used by the system. This counter displays the current percentage value only; it is not an average.

Agent G2 - Windows Services Monitoring

Description

Agent G2 - Windows Services Monitoring

Prerequisites

Provide the service names, separated by commas, as input parameters when applying the template at the device level.
Example: opsramp-agent,opsramp-shield,power.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Windows Services Custom Monitorsystem.windows.service.statusSystem Windows Service StatusNULLGives the current status of the given service name(s). In the graph, 1 = Running and 0 = Stopped.

Agent G2 - Windows Service Monitoring v2.0

Description

Monitors Windows services, with regex support.

Prerequisites

Provide service names, regular-expression patterns, or both as comma-separated input arguments when applying the template at the device level.

Syntax: serviceName1,serviceName2,regexPattern1,regexPattern2.

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Windows Services Custom Monitor v2.0System_Windows_Service_Status_ExtSystem Windows Service Status ExtNULLGives the current status of the services that match the given service name(s) or regex pattern(s). Possible values: Stopped - 0, Running - 1, Start Pending - 2, Stop Pending - 3, Continue Pending - 4, Pause Pending - 5, Paused - 6, Unknown - 7
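
When consuming System_Windows_Service_Status_Ext values in a script or report, it can help to translate the numeric codes back into state names. The following is a minimal sketch of that mapping, using the values listed in the table above. The names ServiceStatus and describe_status are hypothetical, and treating out-of-range values as Unknown is an assumption.

```python
from enum import IntEnum


class ServiceStatus(IntEnum):
    """Numeric values as listed for System_Windows_Service_Status_Ext."""
    STOPPED = 0
    RUNNING = 1
    START_PENDING = 2
    STOP_PENDING = 3
    CONTINUE_PENDING = 4
    PAUSE_PENDING = 5
    PAUSED = 6
    UNKNOWN = 7


def describe_status(value: int) -> str:
    """Return a readable state name for a metric value."""
    try:
        status = ServiceStatus(value)
    except ValueError:
        # Assumption: values outside 0-7 are treated as Unknown.
        status = ServiceStatus.UNKNOWN
    return status.name.replace("_", " ").title()


for code in (0, 1, 5):
    print(code, describe_status(code))
```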

Agent G2 - Windows Service Monitoring - v3

Description

Monitors the status of Windows services.

Template Usage Guidelines:

  • When assigning this template on the device, users need to pass specific input parameters. These parameters should be provided as one or more service names (not the service display names) or service name regex patterns.
  • To provide multiple service names or service name regex patterns, separate them with commas.
  • SAMPLE CUSTOM SCRIPT ARGUMENTS:
    Example 1(With regex): ^opsramp,agent$,Power
    Example 2(Without Regex): Netlogon,Dnscache,RpcEptMapper
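
To illustrate how a comma-separated argument string like the samples above could select services, here is a minimal sketch. It assumes each argument is applied as a case-insensitive regular-expression search against the service name; the agent's actual matching rules are not documented here, and the function names and the service list are hypothetical.

```python
import re


def parse_arguments(raw: str) -> list[str]:
    """Split the comma-separated argument string and drop empty entries."""
    return [part.strip() for part in raw.split(",") if part.strip()]


def select_services(raw_arguments: str, service_names: list[str]) -> list[str]:
    """Return the service names matched by at least one argument.

    Assumption: every argument is treated as a regex and matched with
    re.search, ignoring case. A plain name such as Netlogon is also a
    valid regex, so both sample styles go through the same path.
    """
    patterns = [re.compile(p, re.IGNORECASE) for p in parse_arguments(raw_arguments)]
    return [name for name in service_names if any(p.search(name) for p in patterns)]


# Hypothetical list of service names (not display names) on a device.
services = ["opsramp-agent", "opsramp-shield", "Power", "Dnscache", "Netlogon"]

print(select_services("^opsramp,agent$,Power", services))
print(select_services("Netlogon,Dnscache,RpcEptMapper", services))
```

Because the arguments are split on commas, a pattern that itself contains a comma (for example a quantifier such as {1,2}) would be broken apart in this sketch.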

Prerequisites

No prerequisite

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Windows Services Custom Monitor - v3System_Windows_Service_Status_ExtSystem Windows Service Status ExtGives the current status of the services that match the given service name(s) or regex pattern(s). Possible values: Stopped - 0, Running - 1, Start Pending - 2, Stop Pending - 3, Continue Pending - 4, Pause Pending - 5, Paused - 6, Unknown - 7

Agent G2 - Windows Mountpoint Monitoring

Description

Agent G2 - Windows Mountpoint Monitoring

Prerequisites

No prerequisite

Supported Metric

Monitor NameMetric NameMetric Display NameUnitDescription
Agent G2 - Windows Mountpoint Custom Monitorsystem.windows.mountpoint.disk.freespaceSystem Windows MountPoint Disk FreeSpaceMBShows the free space of mounted disks in MB.
system.windows.mountpoint.disk.usageSystem Windows MountPoint Disk Usage%Shows the space utilization of mounted disks as a percentage.
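
The two mount point metrics are related through the total capacity of the mounted disk, which is not itself exposed as a metric in the table above. The sketch below shows how usage could be derived from free space, assuming usage is used space divided by total capacity. The function name and the total_mb parameter are hypothetical.

```python
def mountpoint_usage_percent(total_mb: float, free_mb: float) -> float:
    """Assumed relation: usage % = (total - free) / total * 100."""
    if total_mb <= 0:
        raise ValueError("total_mb must be positive")
    used_mb = total_mb - free_mb
    return used_mb / total_mb * 100.0


# Example: a 1000 MB mounted volume with 250 MB free.
print(mountpoint_usage_percent(1000, 250))
```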