Supported Versions
Target device version: 9.8
REST API version: 9.8

Introduction

ONTAP evolved from NetApp’s original operating system and is NetApp’s most popular and widely deployed operating system.

It can currently run on the following platforms: FAS, AFF, ONTAP Select, Cloud Volumes ONTAP, and NetApp Private Storage.

These platforms run on hardware or in a virtual machine, either on premises or in the cloud.

NetApp ONTAP Architecture:

Cluster, Cluster Nodes and Disks

An ONTAP cluster can be as small as a single-node cluster. A single-node cluster is not redundant: if the node fails, access to the data is lost.

For redundancy, add a second node to the existing cluster to form an HA-Pair. The two nodes in an HA-Pair share access to the same disks. An HA-Pair has an HA-Interconnect, which mirrors the NVRAM between the two nodes. If one of the nodes fails, the other node takes over the aggregates of the failed node, so no data is lost. A separate network, the Cluster Interconnect, carries heartbeat, configuration management, and volume data traffic between the cluster nodes.

A cluster can be scaled out by adding one or more HA-Pairs to it. Each HA-Pair must be connected to the Cluster Interconnect, and each HA-Pair has its own HA-Interconnect.

The maximum number of HA-Pairs in a cluster depends on the protocols in use:

  • NAS : 12 HA-Pairs (24 Nodes)
  • SAN : 6 HA-Pairs (12 Nodes)

Aggregates, Volumes and SVMs

Client data and configuration data in ONTAP are stored in volumes, and a volume is always part of an aggregate. An aggregate is a collection of disks. Each node has exactly one root aggregate but can have multiple data aggregates. If a node fails, the surviving node takes over the aggregates of the failed node by default. In a single HA-Pair, the aggregates are given back automatically once the failed node is up and running again.

Every node has an important volume called the root volume, typically named vol0, which is located in the root aggregate. This node volume contains cluster configuration and log files.

A cluster always runs storage virtual machines (SVMs). An SVM is a logical entity of one of four types:

  1. System Type (Cluster SVM) : There is only one SVM of this type in the cluster. It is used to manage the Cluster Interconnect interfaces.

  2. Node Type (Node SVM) : Each node in the cluster is represented as a node SVM.

  3. Admin Type (Cluster Management SVM) : You usually connect to this SVM to manage the cluster. It represents the cluster as a whole.

  4. Data Type : Depending on the number of nodes and the protocols in use, a cluster can have up to 1000 SVMs of this type. Data SVMs present data to NAS and SAN clients. The data of an SVM is stored in volumes, and all volumes that belong to a data SVM are part of that SVM’s namespace. A data SVM has exactly one root volume, a very small volume of 1GB that is used to manage the data volumes. The root volume should never contain data; the only information it provides is the junction paths of the connected data volumes. In a NAS environment, volumes have to be mounted in the namespace, otherwise clients cannot access them.

LUNs and QTrees

A LUN is the logical representation of storage in a storage area network (SAN). In ONTAP, a LUN is located within a volume. Optionally, it can be located within a Qtree in a volume.

A Qtree is a logically defined file system that can exist as a special subdirectory of the root directory within a FlexVol volume or a FlexGroup volume.

Resource Hierarchy:

NetApp Cluster
— Cluster Nodes
    — Aggregates
    — Storage Ports
    — Ethernet Ports
      — Logical Interfaces(LIF’s)
    — FCP Adapters
      — FC Interfaces
— Storage Virtual Machine/SVM
    — Storage Volumes
        — Storage Luns
        — QTrees
— Disks
    — Shelves
Metro Cluster
    — Metro Cluster Nodes
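
As a rough illustration, the hierarchy above can be written as a nested mapping and walked to find the parent of any resource type. This is only a sketch: the nesting is read from the indentation of the list above, and the names `HIERARCHY`, `paths`, and `parent_of` are hypothetical helpers, not part of any OpsRamp or NetApp API.

```python
# Resource hierarchy from the list above: each key is a resource type,
# each value is a dict of its child resource types (empty dict = leaf).
HIERARCHY = {
    "NetApp Cluster": {
        "Cluster Nodes": {
            "Aggregates": {},
            "Storage Ports": {},
            "Ethernet Ports": {"Logical Interfaces (LIFs)": {}},
            "FCP Adapters": {"FC Interfaces": {}},
        },
        "Storage Virtual Machine/SVM": {
            "Storage Volumes": {"Storage LUNs": {}, "QTrees": {}},
        },
        "Disks": {"Shelves": {}},
    },
    "Metro Cluster": {"Metro Cluster Nodes": {}},
}


def paths(tree, prefix=()):
    """Yield every resource type as a tuple of names from the root down."""
    for name, children in tree.items():
        current = prefix + (name,)
        yield current
        yield from paths(children, current)


def parent_of(resource, tree=HIERARCHY):
    """Return the parent resource type, or None for a top-level type."""
    for path in paths(tree):
        if path[-1] == resource:
            return path[-2] if len(path) > 1 else None
    raise KeyError(resource)


if __name__ == "__main__":
    for path in paths(HIERARCHY):
        # Indent each resource type by its depth in the hierarchy.
        print("    " * (len(path) - 1) + path[-1])
```

For example, `parent_of("Storage LUNs")` returns `"Storage Volumes"`, which matches the list above.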

Prerequisites

Applications built on sdk 2.0 require OpsRamp Classic Gateway 12.0.1 or above, or the OpsRamp next-gen gateway.

To check the TLS status, connect to the cluster over SSH and run the following commands:

ssh username@hostname
system services web show

If TLS is not enabled, enable it: tls.enable on

Example output:

External Web Services: true
               Status: online
   HTTP Protocol Port: 80
  HTTPS Protocol Port: 443
        TLSv1 Enabled: true
        SSLv3 Enabled: true
        SSLv2 Enabled: true
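
If you capture this output in a script, a small parser can confirm the TLS flag before you configure the integration. The following is a minimal sketch that assumes the output keeps the `Key: value` layout shown above; the function names are hypothetical, and the sample text is copied from the example output.

```python
SAMPLE_OUTPUT = """\
External Web Services: true
               Status: online
   HTTP Protocol Port: 80
  HTTPS Protocol Port: 443
        TLSv1 Enabled: true
        SSLv3 Enabled: true
        SSLv2 Enabled: true
"""


def parse_web_services(output: str) -> dict:
    """Parse 'Key: value' lines into a dict of stripped strings."""
    settings = {}
    for line in output.splitlines():
        key, sep, value = line.partition(":")
        if sep:  # skip lines that contain no colon
            settings[key.strip()] = value.strip()
    return settings


def tls_enabled(settings: dict) -> bool:
    """Return True when the 'TLSv1 Enabled' field is reported as true."""
    return settings.get("TLSv1 Enabled", "").lower() == "true"


if __name__ == "__main__":
    parsed = parse_web_services(SAMPLE_OUTPUT)
    print("TLS enabled:", tls_enabled(parsed))
    print("HTTPS port:", parsed.get("HTTPS Protocol Port"))
```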

Application Migration

  1. As a prerequisite, check the gateway version: classic gateway 12.0.1 and above.
    Notes:

    • Follow these steps only when you migrate from sdk 1.0 to sdk 2.0.
    • These steps are not required for a first-time installation.
      • The application name should be the same.
      • Disable all installed integration configurations for that application. This ensures that no discovery and monitoring requests are processed.
      • Install the sdk 2.0 application and add the configuration to it.
        Note: Refer to the Configure and install the integration and View the NetApp ONTAP details sections of this document.
  2. Once all discoveries are completed with the sdk 2.0 application, follow any one of these approaches.

  • Uninstall the sdk 1.0 adaptor application directly through the uninstall API, with skipDeleteResources=true in the POST request.

End-Point: https://{{host}}/api/v2/tenants/{tenantId}/integrations/installed/{installedIntgId}

    {
        "uninstallReason": "Test",
        "skipDeleteResources": true
    }

(OR)

  • Delete the configurations one by one through the Delete adaptor config API, with the request parameter skipDeleteResources=true.

End-Point: https://{{host}}/api/v2/tenants/{tenantId}/integrations/installed/config/{configId}?skipDeleteResources=true

  • Finally, uninstall the adaptor application through the API, with skipDeleteResources=true in the POST request.

End-Point: https://{{host}}/api/v2/tenants/{tenantId}/integrations/installed/{installedIntgId}

    {
        "uninstallReason": "Test",
        "skipDeleteResources": true
    }
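
The uninstall call described above could be prepared as follows. This is a sketch under stated assumptions, not a definitive client: the POST method follows the wording "post request" in this section, while the bearer-token `Authorization` header, the function names, and the default uninstall reason are assumptions for illustration. The code only builds the request and does not send it.

```python
import json
import urllib.request


def build_uninstall_request(host, tenant_id, installed_intg_id, token,
                            reason="Migration to sdk 2.0"):
    """Build (but do not send) the uninstall request for an installed app."""
    url = (f"https://{host}/api/v2/tenants/{tenant_id}"
           f"/integrations/installed/{installed_intg_id}")
    # skipDeleteResources=true keeps the already discovered resources.
    body = json.dumps({
        "uninstallReason": reason,
        "skipDeleteResources": True,
    }).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",  # assumption: taken from "post request" in the text
        headers={
            "Authorization": f"Bearer {token}",  # assumption: token auth
            "Content-Type": "application/json",
        },
    )


def config_delete_url(host, tenant_id, config_id):
    """Return the end-point for deleting a single configuration."""
    return (f"https://{host}/api/v2/tenants/{tenant_id}"
            f"/integrations/installed/config/{config_id}"
            "?skipDeleteResources=true")


if __name__ == "__main__":
    req = build_uninstall_request("api.example.com", "client_1", "INTG-1", "TOKEN")
    print(req.get_method(), req.full_url)
    print(config_delete_url("api.example.com", "client_1", "CONFIG-1"))
```

To actually send the request, pass it to `urllib.request.urlopen` once the host, identifiers, and credentials have been verified for your environment.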

Default monitoring configurations

The NETAPP-ONTAP application has default Global Device Management Policies, Global Templates, Global Monitors, and Global Metrics in OpsRamp. You can customize these default monitoring configurations for your business use cases by cloning the respective Global Templates and Global Device Management Policies. OpsRamp recommends doing this before installing the application to avoid noisy alerts and data.

  1. Default Global Device Management Policies available

    OpsRamp has a Global Device Management Policy for each Native Type of NETAPP-ONTAP. You can find these Device Management Policies at Setup -> Resources -> Device Management Policies by searching with the suggested names in the global scope. Each Device Management Policy follows this naming convention:

    {appName nativeType - version}

    Ex : netapp-ontap NetApp Cluster - 1 (i.e., appName = netapp-ontap, nativeType = NetApp Cluster, version = 1)

  2. Default Global Templates available

    OpsRamp has a Global Template for each Native Type of NETAPP-ONTAP. You can find these templates at Setup -> Monitoring -> Templates by searching with the suggested names in the global scope. Each template follows this naming convention:

    {appName nativeType 'Template' - version}

    Ex : netapp-ontap NetApp Cluster Template - 1 (i.e., appName = netapp-ontap, nativeType = NetApp Cluster, version = 1)

  3. Default Global Monitors available

    OpsRamp has a Global Monitor for each Native Type that has monitoring support. You can find these monitors at Setup -> Monitoring -> Monitors by searching with the suggested names in the global scope. Each monitor follows this naming convention:

    {monitorKey appName nativeType - version}

    Ex : NetApp Cluster Monitor netapp-ontap NetApp Cluster 1 (i.e., monitorKey = NetApp Cluster Monitor, appName = netapp-ontap, nativeType = NetApp Cluster, version = 1)
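
When you search for these defaults, it can help to generate the expected names from the conventions above. The sketch below covers the policy and template conventions only and reproduces the two examples given; the function names are hypothetical and are not part of OpsRamp.

```python
def policy_name(app_name: str, native_type: str, version: int) -> str:
    """Build a name following {appName nativeType - version}."""
    return f"{app_name} {native_type} - {version}"


def template_name(app_name: str, native_type: str, version: int) -> str:
    """Build a name following {appName nativeType 'Template' - version}."""
    return f"{app_name} {native_type} Template - {version}"


if __name__ == "__main__":
    print(policy_name("netapp-ontap", "NetApp Cluster", 1))
    print(template_name("netapp-ontap", "NetApp Cluster", 1))
```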

Configure and install the integration

  1. From All Clients, select a client.
  2. Go to Setup > Integrations and Apps > Integrations.
  3. Click Manage Apps.
    Notes:
    • If there are already installed applications, it will redirect to the INSTALLED APPS page where all the installed applications are displayed.
    • If there are no installed applications, it will navigate to the ADD APP page.
  1. Click + ADD on the INSTALLED APPS page. The ADD APP page displays all the available applications along with the newly created application with the version.
    Note: You can also find the application by using the search option or the All Categories option.
  1. Click ADD in the netapp-ontap-sdkappv2 application.
  2. Select an existing registered profile, and click Next.
  3. In the Configurations page, click + ADD. The Add Configuration page appears.
  4. Enter the following BASIC INFORMATION:

Object Name | Description
Name | Enter the name for the integration.
IP Address/Host Name | IP address or host name of the target.
Port | Port. Example: You can enter 8080 as a port value.
Credentials | Select the credentials from the drop-down list. Note: Click + Add to create a credential.

Notes:

  • By default the Is Secure checkbox is selected.
  • App Failure Notifications: If turned on, you are notified in case of an application failure, that is, a Connectivity Exception or an Authentication Exception.
  1. In the RESOURCE TYPE section, select:

    • ALL: All the existing and future resources will be discovered.
    • SELECT: You can select one or multiple resources to be discovered.
  2. In the DISCOVERY SCHEDULE section, select Recurrence Pattern to add one of the following patterns:

    • Minutes
    • Hourly
    • Daily
    • Weekly
    • Monthly
  3. Click ADD.
    The configuration is saved and displayed on the page.


After saving the configuration, the resources are discovered and monitoring is done as specified in the configuration profile.
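
To keep track of the values entered in the Add Configuration page, for example in a deployment checklist or a script, they can be modelled as a small record. This is an illustrative sketch only: the class, its field names, and the validation rules are hypothetical and do not represent an OpsRamp API. The default for Is Secure follows the note above; the default for App Failure Notifications is an assumption.

```python
from dataclasses import dataclass

# Recurrence patterns listed in the DISCOVERY SCHEDULE step.
RECURRENCE_PATTERNS = ("Minutes", "Hourly", "Daily", "Weekly", "Monthly")


@dataclass
class OntapConfiguration:
    name: str
    host: str                      # IP address or host name of the target
    port: int
    credential_name: str
    recurrence_pattern: str
    is_secure: bool = True         # selected by default, per the note above
    app_failure_notifications: bool = False  # assumption: off by default
    resource_types: tuple = ()     # empty tuple stands for ALL

    def validate(self) -> list:
        """Return a list of human-readable problems (empty when valid)."""
        problems = []
        if not self.name:
            problems.append("name is required")
        if not self.host:
            problems.append("host is required")
        if not 1 <= self.port <= 65535:
            problems.append("port must be between 1 and 65535")
        if self.recurrence_pattern not in RECURRENCE_PATTERNS:
            problems.append("unknown recurrence pattern")
        return problems


if __name__ == "__main__":
    cfg = OntapConfiguration("ontap-prod", "10.0.0.5", 443, "ontap-admin", "Daily")
    print(cfg.validate() or "configuration looks complete")
```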


View the NetApp ONTAP details

The discovered resource(s) are displayed in Infrastructure > Resources > Storage > NetApp Cluster, with Native Resource Type as NetApp Cluster.


View resource metrics

To confirm NetApp-ONTAP monitoring, review the following:

  • Metric graphs: A graph is plotted for each metric that is enabled in the configuration.
  • Alerts: Alerts are generated for the metrics that are configured for the integration.

Supported Metrics

Metric NameMetric Display NameUnitsUnits
netapp_cluster_ReadOpsNetApp Cluster Read OpssNumber of read I/O operations performed on the cluster
netapp_cluster_WriteOpsNetApp Cluster Write OpssNumber of write I/O operations performed on a cluster
netapp_cluster_OtherOpsNetApp Cluster Other OpssNumber of other operations performed on cluster. Other I/O operations can be metadata operations, such as directory lookups and so on
netapp_cluster_TotalOpsNetApp Cluster Total OpssPerformance metric aggregated over all types of I/O operations
netapp_cluster_ReadLatencyNetApp Cluster Read LatencymicrosecLatency for all read operations in the system in microseconds
netapp_cluster_WriteLatencyNetApp Cluster Write LatencymicrosecLatency for all write operations in the system in microseconds
netapp_cluster_OtherLatencyNetApp Cluster Other LatencymicrosecLatency for all other operations in the system in microseconds
netapp_cluster_TotalLatencyNetApp Cluster Total LatencymicrosecLatency for all total operations in the system in microseconds
netapp_cluster_ReadThroughputNetApp Cluster Read ThroughPutBpsRead Throughput for all read I/O operations
netapp_cluster_WriteThroughputNetApp Cluster Write ThroughPutBpsWriteThroughput for all write I/O operations
netapp_cluster_OtherThroughputNetApp Cluster Other ThroughPutBpsOther Throughput for all other I/O operations. Other I/O operations can be metadata operations, such as directory lookups
netapp_cluster_ClusterPeerStatusNetApp Cluster Peer Cluster StatusReturns the state of the peer cluster. Possible states are available, partial, unavailable, pending, unidentified
netapp_cluster_system_SizeAvailableNetapp Cluster System Size AvailableGBSystem Available size.Sum of the available size of all the aggregates in a cluster
netapp_cluster_system_UtilizationNetapp Cluster System Utilization%System Utilization
netapp_cluster_metrocluster_ConfigurationStateNetApp Cluster MetroCluster Configuration StateIndicates the state of the local cluster configuration.Possible states are configuration_error, configured, not_configured, not_reachable, partially_configured, unknown
netapp_cluster_metrocluster_IsPartnerClusterReachableNetApp Cluster MetroCluster Partner Cluster ReachabilitySpecifies whether the partner cluster is reachable from the local cluster.
netapp_cluster_snapmirror_StateNetApp Cluster SnapMirror StateSnapMirror state.Possible states are broken_off, paused, snapmirrored, uninitialized, in_sync, out_of_sync, synchronizing
netapp_cluster_snapmirror_LagTimeNetApp Cluster SnapMirror LagTimemsTime since the exported Snapshot copy was created
netapp_cluster_hardware_HealthStatusNetApp Cluster Hardware Health StatusReturns the status of the Hardware Components of the Cluster. Possible states are unknown(0), unreachable(1), degraded(2), ok-with-suppressed(3) and ok(4)
netapp_node_UpTimeNetApp Cluster Node UpTimesTotal time in seconds that the node has been up
netapp_node_ControllerOverTemperatureNetApp Cluster Node Controller Over TemperatureNode Controller Over Temperature. Possible values are over, normal
netapp_node_StateNetApp Cluster Node StateState of the Node. Possible states are up, booting, down, taken_over, waiting_for_giveback, degraded, unknown
netapp_node_FrusStateNetApp Cluster Node Frus StateState of List of FRUs on the node.Possible values are ok(1), error(0)
netapp_node_ServiceProcessorStateNetApp Cluster Node ServiceProcessor StateState of the service processor on the node.Possible states are online, offline, degraded, rebooting, unknown, updating, node_offline, sp_daemon_offline
netapp_node_ServiceProcessorLinkStatusNetApp Cluster Node ServiceProcessor Link StatusService Processor Link status on the Node.Possible states are up, down, disabled, unknown
netapp_node_CpuUtilizationNetApp Cluster Node Cpu Utilization%Node CPU Utilization
netapp_node_HaPortStateNetApp Cluster Node HA Port StateNode HA Port State.Possible states are down, initialized, armed, active, reserved
netapp_node_TakeOverStateNetApp Cluster Node TakeOver StateRepresents the state of the node that is taking over storage from its HA partner
netapp_node_GiveBackStateNetApp Cluster Node GiveBack StateRepresents the state of the node that is giving storage back to its HA partner
netapp_aggregate_StateNetApp Cluster Aggregate Stateoperational State of the Aggregate. Possible states are online, onlining, offline, offlining, relocating, unmounted, restricted, inconsistent, failed, unknown
netapp_aggregate_SizeUsedNetApp Cluster Aggregate Size UsedGBSpace used or reserved. Includes volume guarantees and aggregate metadata
netapp_aggregate_SizeTotalNetApp Cluster Aggregate Size TotalGBTotal usable space not including WAFL reserve and aggregate Snapshot copy reserve
netapp_aggregate_SizeAvailableNetApp Cluster Aggregate Size AvailableGBSpace available on the Aggregate
netapp_aggregate_PercentUsedCapacityNetApp Cluster Aggregate Percentage Used Capacity%Percentage used Capacity of an Aggregate
netapp_aggregate_SpaceFullThresholdPercentNetApp Cluster Aggregate SpaceFull Threshold percent%Spacefull Threshold Percentage of an Aggregate
netapp_aggregate_InactiveUserDataNetApp Cluster Aggregate Inactive User DataGBThe size that is physically used in the block storage and has a cold temperature.only supported if the aggregate is either attached to a cloud store or can be attached to a cloud store.
netapp_aggregate_MirrorStateNetApp Cluster Aggregate Mirror stateAggregate Mirror State. Possible states are unmirrored, normal, degraded, resynchronizing, failed
netapp_aggregate_DiskCountNetApp Cluster Aggregate Disk CountcountNumber of disks used in the aggregate. This includes parity disks, but excludes disks in the hybrid cache
netapp_aggregate_RaidSizeNetApp Cluster Aggregate Raid SizecountSpecifies the maximum number of disks that can be included in a RAID group.
netapp_aggregate_ReadThroughputNetApp Cluster Aggregate Read ThroughPutBpsRead Throughput for all read I/O operations
netapp_aggregate_WriteThroughputNetApp Cluster Aggregate Write ThroughPutBpsWriteThroughput for all write I/O operations
netapp_aggregate_TotalThroughputNetApp Cluster Aggregate Other ThroughPutBpsOther Throughput for all other I/O operations. Other I/O operations can be metadata operations, such as directory lookups
netapp_aggregate_OtherThroughputNetApp Cluster Aggregate Total ThroughPutBpsTotal Throughput for all types of I/O operations
netapp_aggregate_ReadOpsNetApp Cluster Aggregate Read OpssNumber of read I/O operations performed on an aggregate
netapp_aggregate_WriteOpsNetApp Cluster Aggregate Write OpssNumber of write I/O operations performed on an aggregate
netapp_aggregate_TotalOpsNetApp Cluster Aggregate Other OpssNumber of other operations performed on an aggregate. Other I/O operations can be metadata operations, such as directory lookups and so on
netapp_aggregate_OtherOpsNetApp Cluster Aggregate Total OpssPerformance metric aggregated over all types of I/O operations of an aggregate
netapp_aggregate_ReadLatencyNetApp Cluster Aggregate Read LatencymicrosecLatency for all read operations of an aggregate in microseconds
netapp_aggregate_WriteLatencyNetApp Cluster Aggregate Write LatencymicrosecLatency for all write operations of an aggregate in microseconds
netapp_aggregate_TotalLatencyNetApp Cluster Aggregate Other LatencymicrosecLatency for all other operations of an aggregate in microseconds
netapp_aggregate_OtherLatencyNetApp Cluster Aggregate Total LatencymicrosecLatency for all total operations of an aggregate in microseconds
netapp_aggregate_AvgReadThroughputNetApp Cluster Aggregate Average Read ThroughPutBpsDisplays the incremental average read throughput over the last 1 hour with 15 seconds sampling
netapp_aggregate_AvgWriteThroughputNetApp Cluster Aggregate Average Write ThroughPutBpsDisplays the incremental average write throughput over the last 1 hour with 15 seconds sampling
netapp_aggregate_AverageThroughputNetApp Cluster Aggregate Average ThroughPutBpsDisplays the incremental average throughput over the last 1 hour with 15 seconds sampling
netapp_aggregate_AvgOtherThroughputNetApp Cluster Aggregate Average Other ThroughPutBpsDisplays the incremental average other throughput over the last 1 hour with 15 seconds sampling
netapp_aggregate_AvgReadLatencyNetApp Cluster Aggregate Average Read LatencymicrosecDisplays the incremental average for read latency over the last 1 hour with 15 seconds sampling
netapp_aggregate_AvgWriteLatencyNetApp Cluster Aggregate Average Write LatencymicrosecDisplays the incremental average for write latency over the last 1 hour with 15 seconds sampling
netapp_aggregate_AverageLatencyNetApp Cluster Aggregate Average LatencymicrosecDisplays the incremental average for latency over the last 1 hour with 15 seconds sampling
netapp_aggregate_AvgOtherLatencyNetApp Cluster Aggregate Average Other LatencymicrosecDisplays the incremental average for other latencyover the last 1 hour with 15 seconds sampling
netapp_aggregate_AvgReadOpsNetApp Cluster Aggregate Average Read OpssDisplays the incremental changes in read IOPs over the last 1 hour with 15 seconds sampling
netapp_aggregate_AvgWriteOpsNetApp Cluster Aggregate Average Write OpssDisplays the incremental changes in write IOPs over the last 1 hour with 15 seconds sampling
netapp_aggregate_AvgOpsNetApp Cluster Aggregate Average OpssDisplays the incremental changes in IOPs over the last 1 hour with 15 seconds sampling
netapp_aggregate_AvgOtherOpsNetApp Cluster Aggregate Average Other OpssDisplays the incremental changes in other IOPs over the last 1 hour with 15 seconds sampling
netapp_aggregate_SnapshotReservePercentNetApp Cluster Aggregate Snapshot Reserve percentage%NetApp Cluster Aggregate Snapshot Reserve percentage
netapp_disk_StateNetApp Cluster Disk StateNetApp Cluster Disk State
netapp_disk_SizeAvailableNetApp Cluster Disk Size AvailableGBNetApp Cluster Disk Size Available
netapp_disk_RevolutionsPerMinuteNetApp Cluster Disk Revolutions Per MinuterpmNetApp Cluster Disk Revolutions Per Minute
netapp_fcp_StateNetApp Cluster FC Port StateFC Port State.Possible states are startup, link_not_connected, online, link_disconnected, offlined_by_user, offlined_by_system, node_offline, unknown
netapp_fcp_ReadThroughPutNetApp Cluster FC Port Read ThroughPutBpsRead Throughput for all read I/O operations of an FC Port
netapp_fcp_WriteThroughPutNetApp Cluster FC Port Write ThroughPutBpsWriteThroughput for all write I/O operations of an FC Port
netapp_fcp_TotalThroughPutNetApp Cluster FC Port Total ThroughPutBpsTotal Throughput for all types of I/O operations of an FC Port
netapp_fcp_ReadLatencyNetApp Cluster FC Port Read LatencymicrosecLatency for all read operations in microseconds
netapp_fcp_WriteLatencyNetApp Cluster FC Port Write LatencymicrosecLatency for all write operations system in microseconds
netapp_fcp_TotalLatencyNetApp Cluster FC Port Total LatencymicrosecLatency for all total operations in microseconds
netapp_fcp_OtherLatencyNetApp Cluster FC Port Other LatencymicrosecLatency for all other operations in microseconds
netapp_fcp_ReadOpsNetApp Cluster FC Port Read OpssNumber of read I/O operations performed on the FC Port
netapp_fcp_WriteOpsNetApp Cluster FC Port Write OpssNumber of write I/O operations performed on a FC Port
netapp_fcp_TotalOpsNetApp Cluster FC Port Total OpssPerformance metric aggregated over all types of I/O operations of an FC Port
netapp_fcp_OtherOpsNetApp Cluster FC Port Other OpssNumber of other operations performed on cluster. Other I/O operations can be metadata operations, such as directory lookups and so on
netapp_fcp_IsAdministrativeUpNetApp Cluster FC Port Administratice StatusNumber of other operations performed on cluster. Other I/O operations can be metadata operations, such as directory lookups and so on
netapp_ethernet_port_StateNetApp Cluster Net Port StateNet-Port Status. Possible states are up, down
netapp_ethernet_port_ReadThroughPutNetApp Cluster Net Port Read ThroughPutBpsRead Throughput for all read I/O operations
netapp_ethernet_port_WriteThroughPutNetApp Cluster Net Port Write ThroughPutBpsWriteThroughput for all write I/O operations
netapp_ethernet_port_TotalThroughPutNetApp Cluster Net Port Total ThroughPutBpsTotal Throughput for all types of I/O operations
netapp_ethernet_port_LinkDownCountNetApp Cluster Net Port Link Down CountcountThe number of link state changes from up to down seen on the device
netapp_ethernet_port_ReceivedPacketsNetApp Cluster Net Port Received PacketscountNumber of packets received
netapp_ethernet_port_TransmittedPacketsNetApp Cluster Net Port Transmitted PacketscountNumber of packets transmitted
netapp_ethernet_port_IsAdministrativeUpNetApp Cluster Net Port Administrative StatusAdministrative status of the Port
netapp_fc_interface_StateNetApp Cluster FC Interface StateThe current operational state of the FC interface. The state is set to down if the interface is not enabled.If the node hosting the port is down or unavailable, no state value is returned.
netapp_fc_interface_ReadThroughPutNetApp Cluster FC Interface Read ThroughPutBpsRead Throughput for all read I/O operations of an FC Interface
netapp_fc_interface_WriteThroughPutNetApp Cluster FC Interface Write ThroughPutBpsWriteThroughput for all write I/O operations of an FC Interface
netapp_fc_interface_TotalThroughPutNetApp Cluster FC Interface Total ThroughPutBpsTotal Throughput for all types of I/O operations of an FC Interface
netapp_fc_interface_ReadLatencyNetApp Cluster FC Interface Read LatencymicrosecLatency for all read operations in microseconds
netapp_fc_interface_WriteLatencyNetApp Cluster FC Interface Write LatencymicrosecLatency for all write operations system in microseconds
netapp_fc_interface_TotalLatencyNetApp Cluster FC Interface Total LatencymicrosecLatency for all total operations in microseconds
netapp_fc_interface_OtherLatencyNetApp Cluster FC Interface Other LatencymicrosecLatency for all other operations in microseconds
netapp_fc_interface_ReadOpsNetApp Cluster FC Interface Read OpssNumber of read I/O operations performed on the FC Port
netapp_fc_interface_WriteOpsNetApp Cluster FC Interface Write OpssNumber of write I/O operations performed on a FC Port
netapp_fc_interface_TotalOpsNetApp Cluster FC Interface Total OpssPerformance metric aggregated over all types of I/O operations of an FC Port
netapp_fc_interface_OtherOpsNetApp Cluster FC Interface Other OpssNumber of other operations performed on cluster. Other I/O operations can be metadata operations, such as directory lookups and so on
netapp_fc_interface_IsAdministrativeUpNetApp Cluster FC Interface Administratice StatusThe administrative state of the FC interface. The FC interface can be disabled to block all FC communication with the SVM through this interface
netapp_lif_interface_ReadThroughPutNetApp Cluster Interface Read ThroughPutBpsRead Throughput for all read I/O operations of an Interface
netapp_lif_interface_WriteThroughPutNetApp Cluster Interface Write ThroughPutBpsWriteThroughput for all write I/O operations of an Interface
netapp_lif_interface_TotalThroughPutNetApp Cluster Interface Total ThroughPutBpsTotal Throughput for all types of I/O operations of an Interface
netapp_lif_interface_StateNetApp Cluster Interface StateReturns Interface state. Possible states are up, down
netapp_vserver_StateNetApp Cluster vServer StatevServer State.Possible states are starting, running, stopping, stopped, deleting
netapp_vserver_nfsv3_ReadOpsNetApp vServer NFSV3 Read OpssNumber of read I/O operations performed
netapp_vserver_nfsv3_WriteOpsNetApp vServer NFSV3 Write OpssNumber of write I/O operations performed
netapp_vserver_nfsv3_TotalOpsNetApp vServer NFSV3 Total OpssPerformance metric aggregated over all types of I/O operations
netapp_vserver_nfsv3_OtherOpsNetApp vServer NFSV3 Other OpssNumber of other operations performed. Other I/O operations can be metadata operations, such as directory lookups and so on
netapp_vserver_cifs_ReadThroughPutNetApp vServer CIFS Read ThroughPutBpsRead Throughput for all read I/O operation
netapp_vserver_cifs_WriteThroughPutNetApp vServer CIFS Write ThroughPutBpsWriteThroughput for all write I/O operations
netapp_vserver_cifs_TotalThroughPutNetApp vServer CIFS Total ThroughPutBpsTotal Throughput for all types of I/O operations
netapp_vserver_cifs_ReadLatencyNetApp vServer CIFS Read LatencymicrosecLatency for all read operations in microseconds
netapp_vserver_cifs_WriteLatencyNetApp vServer CIFS Write LatencymicrosecLatency for all write operations system in microseconds
netapp_vserver_cifs_TotalLatencyNetApp vServer CIFS Total LatencymicrosecLatency for all total operations in microseconds
netapp_vserver_cifs_OtherLatencyNetApp vServer CIFS Other LatencymicrosecLatency for all other operations in microseconds
netapp_vserver_cifs_ReadOpsNetApp vServer CIFS Read OpssNumber of read I/O operations performed
netapp_vserver_cifs_WriteOpsNetApp vServer CIFS Write OpssNumber of write I/O operations performed
netapp_vserver_cifs_TotalOpsNetApp vServer CIFS Total OpssPerformance metric aggregated over all types of I/O operations
netapp_vserver_cifs_OtherOpsNetApp vServer CIFS Other OpssNumber of other operations performed. Other I/O operations can be metadata operations, such as directory lookups and so on
netapp_vserver_iscsi_ReadThroughputNetApp vServer ISCSI Read ThroughPutBpsRead Throughput for all read I/O operation
netapp_vserver_iscsi_WriteThroughputNetApp vServer ISCSI Write ThroughPutBpsWriteThroughput for all write I/O operations
netapp_vserver_iscsi_TotalThroughputNetApp vServer ISCSI Total ThroughPutBpsTotal Throughput for all types of I/O operations
netapp_vserver_iscsi_ReadOpsNetApp vServer ISCSI Read LatencysLatency for all read operations in microseconds
netapp_vserver_iscsi_WriteOpsNetApp vServer ISCSI Write LatencysLatency for all write operations system in microseconds
netapp_vserver_iscsi_TotalOpsNetApp vServer ISCSI Total LatencysLatency for all total operations in microseconds
netapp_vserver_iscsi_OtherOpsNetApp vServer ISCSI Other LatencysLatency for all other operations in microseconds
netapp_vserver_iscsi_OtherOpsNetApp vServer ISCSI Other LatencysLatency for all other operations in microseconds
netapp_vserver_iscsi_ReadLatencyNetApp vServer ISCSI Read OpsmicrosecNumber of read I/O operations performed
netapp_vserver_iscsi_WriteLatencyNetApp vServer ISCSI Write OpsmicrosecNumber of write I/O operations performed
netapp_vserver_iscsi_TotalLatencyNetApp vServer ISCSI Total OpsmicrosecPerformance metric aggregated over all types of I/O operations
netapp_vserver_iscsi_OtherLatencyNetApp vServer ISCSI Other OpsmicrosecNumber of other operations performed. Other I/O operations can be metadata operations, such as directory lookups and so on
netapp_volume_FilesTotalNetApp Cluster Volume Files TotalcountThe maximum number of files (inodes) for user-visible data allowed on the volume
netapp_volume_FilesUsedNetApp Cluster Volume Files UsedcountNumber of user-visible files (inodes) used
netapp_volume_SizeTotal | NetApp Cluster Volume Size Total | GB | Total provisioned size. The default size is equal to the minimum size of 20MB, in bytes.
netapp_volume_SizeAvailable | NetApp Cluster Volume Size Available | GB | The available space in the volume, in bytes.
netapp_volume_SizeUsed | NetApp Cluster Volume Size Used | GB | The virtual space used (includes volume reserves) before storage efficiency, in bytes.
netapp_volume_SnapshotReserveSizeUsed | NetApp Cluster Volume Snapshot Reserve Size Used | GB | The total space used by Snapshot copies in the volume, in bytes.
netapp_volume_PercentageSnapshotsReserved | NetApp Cluster Volume Percentage Snapshot Reserved | % | The space that has been set aside as a reserve for Snapshot copy usage, in percent.
netapp_volume_Footprint | NetApp Cluster Volume FootPrint | GB | Data used for this volume in the aggregate, in bytes.
netapp_volume_Metadata | NetApp Cluster Volume MetaData | GB | Space used by the volume metadata in the aggregate, in bytes.
netapp_volume_LogicalSpaceUsedByAfs | NetApp Cluster Volume Logical Space USed By AFS | GB | The virtual space used by AFS alone (includes volume reserves) and along with storage efficiency, in bytes.
netapp_volume_CompressionState | NetApp Cluster Volume Compression State | - | Compression can be enabled or disabled. Possible states are inline, background, both, none and mixed.
netapp_volume_DeduplicationState | NetApp Cluster Volume Deduplication State | - | Deduplication can be enabled or disabled. Possible states are inline, background, both, none and mixed.
netapp_volume_CrossVolumeDeduplicationState | NetApp Cluster Volume Cross Voume Deduplication State | - | Cross-volume deduplication can be enabled or disabled. It can be enabled only when deduplication is enabled. Possible states are inline, background, both, none and mixed.
netapp_volume_CompactionState | NetApp Cluster Volume Compaction State | - | Compaction can be enabled or disabled. Possible states are inline, none and mixed.
netapp_volume_QuotaState | NetApp Cluster Volume Quota State | - | Quota state of the volume. Possible states are corrupt, initializing, mixed, off, on and resizing.
netapp_volume_GrowThresholdPercentage | NetApp Cluster Volume Grow Threshold | % | Used space threshold, in percent, for the automatic growth of the volume. When the amount of used space in the volume becomes greater than this threshold, the volume automatically grows unless it has reached the maximum size.
netapp_volume_ShrinkThresholdPercentage | NetApp Cluster Volume Shrink Threshold | % | Used space threshold, in percent, for the automatic shrinkage of the volume. When the amount of used space in the volume drops below this threshold, the volume automatically shrinks unless it has reached the minimum size.
netapp_volume_State | NetApp Cluster Volume State | - | State of the volume. Possible states are error, mixed, offline and online.
netapp_volume_OtherLatency | NetApp Cluster Volume Other Latency | microsec | Latency for all other operations on the volume. Other operations include metadata and directory lookups.
netapp_volume_TotalLatency | NetApp Cluster Volume Total Latency | microsec | Latency for all operations on the volume.
netapp_volume_ReadLatency | NetApp Cluster Volume Read Latency | microsec | Time taken for read operations on the volume.
netapp_volume_WriteLatency | NetApp Cluster Volume Write Latency | microsec | Time taken for write operations on the volume.
netapp_volume_OtherOps | NetApp Cluster Volume Other Ops | s | Number of other operations performed by the volume.
netapp_volume_TotalOps | NetApp Cluster Volume Total Ops | s | Number of operations performed by the volume.
netapp_volume_ReadOps | NetApp Cluster Volume Read Ops | s | Number of read operations performed by the volume.
netapp_volume_WriteOps | NetApp Cluster Volume Write Ops | s | Number of write operations performed by the volume.
netapp_volume_OtherThroughput | NetApp Cluster Volume Other ThroughPut | Bps | Throughput for all other I/O operations. Other I/O operations can be metadata operations, such as directory lookups.
netapp_volume_TotalThroughput | NetApp Cluster Volume Total ThroughPut | Bps | Total throughput for all I/O operations on the volume.
netapp_volume_ReadThroughput | NetApp Cluster Volume Read ThroughPut | Bps | Throughput for read I/O operations on the volume.
netapp_volume_WriteThroughput | NetApp Cluster Volume Write ThroughPut | Bps | Throughput for write I/O operations on the volume.
netapp_volume_InodeUtilization | NetApp Cluster Volume Inode Utilization | % | Percentage of volume inode utilization.
netapp_volume_PercentageSizeUsed | NetApp Cluster Volume Utilization | % | Percentage of the volume size that is used. This field is valid only when the volume is online.
netapp_volume_SnapshotCount | Netapp Cluster Volume Snapshot Count | count | Number of Snapshot copies in the volume.
netapp_volume_TooOldSnapshot | Netapp Cluster Volume Too Old Snapshot | Days | The age beyond which a Snapshot copy is considered too old for the volume.
netapp_volume_QuotaToalFilesUsed | NetApp Cluster Volume Quota Total Files Used | count | Total files used.
netapp_volume_QuotaSpaceHardLimit | NetApp Cluster Volume Quota Space Hard Limit | Bytes | Space hard limit, in bytes.
netapp_volume_QuotaSpaceHardLimitPercentageUsed | NetApp Cluster Volume Quota Space Hard Limit Percntage Used | % | Total space used as a percentage of the space hard limit.
netapp_volume_QuotaToalSpaceUsed | NetApp Cluster Volume Quota Total Space Used | GB | Total space used.
netapp_volume_AvgLatency | NetApp Cluster Volume Average Latency | microsec | NetApp Cluster Volume Average Latency
netapp_volume_AvgReadLatency | NetApp Cluster Volume Average Read Latency | microsec | NetApp Cluster Volume Average Read Latency
netapp_volume_AvgWriteLatency | NetApp Cluster Volume Average Write Latency | microsec | NetApp Cluster Volume Average Write Latency
netapp_volume_AvgOtherLatency | NetApp Cluster Volume Average Other Latency | microsec | NetApp Cluster Volume Average Other Latency
netapp_volume_AvgOtherOps | NetApp Cluster Volume Average Other Ops | s | NetApp Cluster Volume Average Other Ops
netapp_volume_AvgOps | NetApp Cluster Volume Average Total Ops | s | NetApp Cluster Volume Average Total Ops
netapp_volume_AvgReadOps | NetApp Cluster Volume Average Read Ops | s | NetApp Cluster Volume Average Read Ops
netapp_volume_AvgWriteOps | NetApp Cluster Volume Average Write Ops | s | NetApp Cluster Volume Average Write Ops
netapp_volume_AvgOtherThroughput | NetApp Cluster Volume Average Other ThroughPut | Bps | NetApp Cluster Volume Average Other ThroughPut
netapp_volume_AvgThroughput | NetApp Cluster Volume Average ThroughPut | Bps | NetApp Cluster Volume Average ThroughPut
netapp_volume_AvgReadThroughput | NetApp Cluster Volume Average Read ThroughPut | Bps | NetApp Cluster Volume Average Read ThroughPut
netapp_volume_AvgWriteThroughput | NetApp Cluster Volume Average Write ThroughPut | Bps | NetApp Cluster Volume Average Write ThroughPut
netapp_volume_SpaceFullThresholdPercent | NetApp Cluster Volume SpaceFull Threshold Percentage | % | NetApp Cluster Volume SpaceFull Threshold Percentage
netapp_qtree_ReadOps | NetApp Volume Qtree Read Ops | s | Number of read I/O operations performed.
netapp_qtree_WriteOps | NetApp Volume Qtree Write Ops | s | Number of write I/O operations performed.
netapp_qtree_OtherOps | NetApp Volume Qtree Other Ops | s | Number of other operations performed. Other I/O operations can be metadata operations, such as directory lookups.
netapp_qtree_TotalOps | NetApp Volume Qtree Total Ops | s | Performance metric aggregated over all types of I/O operations.
netapp_lun_SizeTotal | NetApp Cluster Lun Size Total | GB | The total provisioned size of the LUN.
netapp_lun_SizeUsed | NetApp Cluster Lun Size Used | GB | NetApp Cluster Lun Size Used
netapp_lun_State | NetApp Cluster LUN State | - | The state of the LUN. Normal states for a LUN are online and offline. Other states indicate errors. Possible states are foreign_lun_error, nvfail, offline, online and space_error.
netapp_lun_OtherLatency | NetApp Cluster LUN Other Latency | microsec | Latency for all other operations on the LUN. Other operations include metadata and directory lookups.
netapp_lun_TotalLatency | NetApp Cluster LUN Total Latency | microsec | Latency for all operations on the LUN.
netapp_lun_ReadLatency | NetApp Cluster LUN Read Latency | microsec | Latency for all read operations on the LUN.
netapp_lun_WriteLatency | NetApp Cluster LUN Write Latency | microsec | Latency for all write operations on the LUN.
netapp_lun_OtherOps | NetApp Cluster LUN Other Ops | s | Other operations per second.
netapp_lun_TotalOps | NetApp Cluster LUN Total Ops | s | Total operations per second.
netapp_lun_ReadOps | NetApp Cluster LUN Read Ops | s | Read operations per second.
netapp_lun_WriteOps | NetApp Cluster LUN Write Ops | s | Write operations per second.
netapp_lun_OtherThroughput | NetApp Cluster LUN Other ThroughPut | Bps | Throughput for all other I/O operations. Other I/O operations can be metadata operations, such as directory lookups.
netapp_lun_TotalThroughput | NetApp Cluster LUN Total ThroughPut | Bps | Total throughput in bytes per second.
netapp_lun_ReadThroughput | NetApp Cluster LUN Read ThroughPut | Bps | Read bytes per second.
netapp_lun_WriteThroughput | NetApp Cluster LUN Write ThroughPut | Bps | Write bytes per second.
netapp_lun_AvgLatency | NetApp Cluster LUN Average Latency | microsec | Average latency for all operations on the LUN.
netapp_lun_AvgReadLatency | NetApp Cluster LUN Average Read Latency | microsec | Average latency for all read operations on the LUN.
netapp_lun_AvgWriteLatency | NetApp Cluster LUN Average Write Latency | microsec | Average latency for all write operations on the LUN.
netapp_lun_AvgotherLatency | NetApp Cluster LUN Average Other Latency | microsec | Average latency for all other operations performed on the LUN. Other operations include metadata and directory lookups.
netapp_lun_AvgOtherOps | NetApp Cluster LUN Average Other Ops | s | Average other operations per second.
netapp_lun_AvgOps | NetApp Cluster LUN Average Total Ops | s | Average total operations per second.
netapp_lun_AvgReadOps | NetApp Cluster LUN Average Read Ops | s | Average read operations per second.
netapp_lun_AvgWriteOps | NetApp Cluster LUN Average Write Ops | s | Average write operations per second.
netapp_lun_AvgOtherThroughput | NetApp Cluster LUN Average Other ThroughPut | Bps | Average throughput for all other I/O operations, in bytes per second.
netapp_lun_AvgThroughput | NetApp Cluster LUN Average Total ThroughPut | Bps | Average total throughput in bytes per second.
netapp_lun_AvgReadThroughput | NetApp Cluster LUN Average Read ThroughPut | Bps | Average read bytes per second.
netapp_lun_AvgWriteThroughput | NetApp Cluster LUN Average Write ThroughPut | Bps | Average write bytes per second.
netapp_lun_SizePercentageUsed | NetApp Cluster LUN Utilization | % | LUN Utilization
netapp_port_Speed | NetApp Cluster Storage Port Speed | Gbps | NetApp Cluster Storage Port Speed
netapp_port_Status | NetApp Cluster Storage Port Status | - | Storage port status. Possible statuses are error(0), online(1) and offline(2).
netapp_shelf_State | NetApp Hardware Shelf State | - | Returns the status of the shelf. Possible states are unknown(0), error(1) and ok(2).
netapp_shelf_PortStatus | NetApp Hardware Shelf Port Status | - | Returns the status of the ports associated with the shelf. Possible states are disconnected(0), error(1) and connected(2).
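
The port and shelf status metrics above are reported as numeric codes. As a minimal sketch, assuming a consumer receives the raw integer value together with the metric name, the codes listed in the table could be decoded like this. The function name `decode_status` and the lookup table are hypothetical helpers for illustration, not part of the application.

```python
from enum import IntEnum


class PortStatus(IntEnum):
    """Codes listed for netapp_port_Status."""
    ERROR = 0
    ONLINE = 1
    OFFLINE = 2


class ShelfState(IntEnum):
    """Codes listed for netapp_shelf_State."""
    UNKNOWN = 0
    ERROR = 1
    OK = 2


class ShelfPortStatus(IntEnum):
    """Codes listed for netapp_shelf_PortStatus."""
    DISCONNECTED = 0
    ERROR = 1
    CONNECTED = 2


# Hypothetical lookup from metric name to the enum that describes its codes.
_DECODERS = {
    "netapp_port_Status": PortStatus,
    "netapp_shelf_State": ShelfState,
    "netapp_shelf_PortStatus": ShelfPortStatus,
}


def decode_status(metric_name: str, value: int) -> str:
    """Return the lowercase state label for a numeric status value.

    Raises KeyError for a metric that has no code table and
    ValueError for a code that is not listed for that metric.
    """
    return _DECODERS[metric_name](value).name.lower()
```

For example, `decode_status("netapp_port_Status", 1)` returns `"online"`, and a code outside the documented range raises `ValueError` rather than being silently mapped.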

Risks, Limitations & Assumptions

  • The application can send Critical/Recovery failure notifications for the following two cases when the user enables App Failure Notifications in the configuration:
    • Connectivity Exception
    • Authentication Exception
  • The application does not send duplicate or repeated failure alert notifications until the existing critical alert is recovered.
  • Metrics are used to monitor resources and to generate alerts when threshold values are breached.
  • The application cannot control monitoring pause/resume actions based on the above alerts.
  • A single NetApp cluster may contain thousands of resources, so performance issues may occur during monitoring.
  • The API rate limit is 20 (at any point in time, at most 20 API calls can be made to the cluster).
  • Macro replacement is not supported (that is, the subject and description of threshold breach alerts cannot be customized).
  • This application supports only the Classic Gateway. The Cluster Gateway is not supported.
  • Component-level thresholds can be configured at each resource level.
  • Showing the activity log and applied time is not supported.
  • The option to get the latest snapshot metric is not supported.
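
The limit of 20 simultaneous API calls noted above is the kind of constraint a client usually enforces with a counting semaphore. The following is a minimal sketch of that idea, not the application's actual implementation. `fetch_one` is a hypothetical callable standing in for a single API request, and `MAX_CONCURRENT_CALLS`, `call_with_limit` and `fetch_all` are illustrative names.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

# Assumption: the documented limit of 20 is a cap on in-flight calls.
MAX_CONCURRENT_CALLS = 20

# One slot per permitted in-flight call to the cluster.
_slots = threading.BoundedSemaphore(MAX_CONCURRENT_CALLS)


def call_with_limit(fn, *args, **kwargs):
    """Run fn while holding one of the call slots."""
    with _slots:
        return fn(*args, **kwargs)


def fetch_all(fetch_one, resource_ids, workers=64):
    """Fetch every resource, never exceeding the concurrent call cap.

    The pool may have more worker threads than slots; the semaphore,
    not the pool size, is what bounds the number of in-flight calls.
    Results are returned in the same order as resource_ids.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(
            pool.map(lambda rid: call_with_limit(fetch_one, rid), resource_ids)
        )
```

Keeping the semaphore separate from the thread pool means any other code path that talks to the same cluster can share the cap by going through `call_with_limit`.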

References

ONTAP API Response: https://library.netapp.com/ecmdocs/ECMLP2874708/html/index.html