NetApp OnTap (Remote) extension

  • Latest Dynatrace
  • Extension
  • Published Oct 27, 2025

Collect NetApp OnTap metrics via OnTap 9.6+ API to monitor your clusters.

Overview

Collect NetApp OnTap metrics via the OnTap 9.6+ REST API and use Davis AI to monitor your clusters.

The NetApp OnTap (Remote) extension allows you to collect, view, and analyze metrics from your NetApp OnTap clusters both on the cluster level and for each of your nodes and storage virtual machines (SVMs).

Use cases

  • Collect and analyze metrics from your NetApp OnTap clusters in context of your hosts, applications, and services.
  • Gain additional insight by using charting and dashboarding capabilities.
  • Use Davis AI to generate baselines and alert you on anomaly detection in designated metrics.

Requirements

  • NetApp OnTap version 9.6+ with a reachable REST API
  • The following metric event configurations enabled:
    • OnTap Cluster monitoring unavailable
    • OnTap FRU in error state
    • High Temperature on OnTap Node
  • Active VMware extension (version 3.5.1+) in your environment
  • Active NetApp OnTap (Remote) extension in your environment
  • An OnTap user with http application access, assigned a REST role with at least read-only access to the following API paths:
    • /api/cluster
    • /api/svm/svms
    • /api/storage/cluster
    • /api/storage/aggregates
    • /api/storage/disks
    • /api/storage/volumes
    • /api/snapmirror/relationships
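
Before activating the extension, it can help to confirm that the monitoring user can actually read each of these paths. The following is a minimal sketch, not part of the extension: the helper names `build_probe_urls` and `probe` are hypothetical, and it assumes the user can authenticate with HTTP basic authentication. A 200 status suggests the path is readable, while 401 or 403 suggests missing access.

```python
import base64
import urllib.error
import urllib.request

# API paths listed in the requirements above.
REQUIRED_API_PATHS = [
    "/api/cluster",
    "/api/svm/svms",
    "/api/storage/cluster",
    "/api/storage/aggregates",
    "/api/storage/disks",
    "/api/storage/volumes",
    "/api/snapmirror/relationships",
]


def build_probe_urls(base_url: str) -> list[str]:
    """Join the cluster base URL with every required API path."""
    root = base_url.rstrip("/")
    return [root + path for path in REQUIRED_API_PATHS]


def probe(url: str, username: str, password: str, timeout: float = 10.0) -> int:
    """Send one GET request and return the HTTP status code.

    Assumption: the user authenticates with HTTP basic authentication.
    """
    token = base64.b64encode(f"{username}:{password}".encode()).decode()
    request = urllib.request.Request(
        url,
        headers={"Authorization": f"Basic {token}", "Accept": "application/json"},
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 401/403 responses arrive here; return the code instead of raising.
        return err.code


if __name__ == "__main__":
    # Prints the URLs only; call probe() with real credentials to test access.
    for url in build_probe_urls("https://ontap-prod/"):
        print(url)
```

Running the probe from the ActiveGate host that will execute the monitoring configuration also checks network reachability from the right place.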

Compatibility information

  • This extension requires a connection to the NetApp OnTap API and, thus, supports OnTap only from version 9.6+.
  • A NetApp OnTap Overview dashboard is part of the extension. It includes links to the various entities detected by the extension.

Activation and setup

  1. Select the desired ActiveGate group that will run the monitoring configuration. Each monitoring configuration can have one or more OnTap clusters configured.
  2. Configure a NetApp OnTap Extension Endpoint for each of the chosen clusters:
    • OnTap REST API URL: enter the URL (including protocol) of your OnTap API address, for example <https://ontap-prod/>.
    • Cluster name: enter the name of your cluster entity (by default, cluster name uses the detected hostname).
    • Username: enter the username used for the API access.
    • Password: enter the password used for the API access (see Requirements for the list of required permissions).
    • Proxy
      • Address: enter the address of your proxy, for example <http://proxy.example.com:8080>.
      • Proxy username: enter the username used for your proxy.
      • Proxy password: enter the password used for your proxy.
    • Verify SSL certificate
    • Frequency: frequency of metric collection, by default set to once per minute. You can configure it to increase the timeframe between metric collection in large clusters where collecting all requested data would take longer than one minute.
    • Log level: set at the monitoring configuration level and will apply to all endpoints. By default, set to INFO. We recommend using DEBUG logging only when investigating issues with support.
  3. Enable the desired feature sets (refer to the Details tab for what metrics are associated with which feature sets).
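
The endpoint fields above can be sanity-checked before they are entered. The sketch below is illustrative only: `EndpointSettings` and `validation_errors` are hypothetical names, not the extension's actual configuration schema, and the checks simply mirror the guidance in the steps (a URL that includes the protocol, non-empty credentials, and a collection frequency of at least one minute).

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class EndpointSettings:
    """Hypothetical container mirroring the endpoint fields described above."""
    api_url: str
    username: str
    password: str
    cluster_name: str = ""      # empty means "use the detected hostname"
    verify_ssl: bool = True
    frequency_minutes: int = 1  # default collection frequency


def validation_errors(settings: EndpointSettings) -> list[str]:
    """Return human-readable problems; an empty list means no problem was found."""
    errors = []
    parsed = urlparse(settings.api_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        errors.append("OnTap REST API URL must include the protocol, e.g. https://ontap-prod/")
    if not settings.username:
        errors.append("Username is required")
    if not settings.password:
        errors.append("Password is required")
    if settings.frequency_minutes < 1:
        errors.append("Frequency must be at least one minute")
    return errors


if __name__ == "__main__":
    candidate = EndpointSettings(api_url="ontap-prod", username="monitor", password="")
    for problem in validation_errors(candidate):
        print(problem)
```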

Details

  • Metrics are associated with different feature sets that can be enabled or disabled as needed. By default, the extension collects metrics once per minute.
  • Starting with version 2.3.2 of the NetApp OnTap (Remote) extension, rules are included to link this extension's netapp_ontap:volume entity to the vmware:datastore entity from the VMware remote monitoring extension. The link uses the "same as" relationship and is based on the volume's name matching the datastore's NAS remote path property.

Licensing and cost

The extension ingests metrics and events. The details of license consumption will depend on which licensing model you are using. For more information about licensing costs, see Dynatrace classic licensing or the Dynatrace Platform Subscription (DPS) depending on your license model.

License consumption is based on the number of metric data points ingested. You can calculate the approximate annual data points ingested by using the following formula:

(16 + (4 x nodes) + (1 x frus) + (1 x svms) + (2 x disks) + (5 x aggregates) + (20 x volumes) + (4 x volumes of SVMs with a QoS policy) + (3 x snapmirror relationships)) x 60 min x 24 h x 365 days data points/year

The above formula assumes that all feature sets are enabled. You'll need to adjust the formula if you reconfigured the frequency of metric collection.

In the classic licensing model, metric ingestion consumes Davis Data Units (DDUs) at the rate of 0.001 DDUs per metric data point.

Multiply the annual data points from the above formula by 0.001 to estimate annual DDU usage.
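
The estimate above can be worked through in a few lines of Python. This is a sketch under stated assumptions: the function and parameter names are hypothetical, the entity counts in the example are made up for illustration, and the `interval_minutes` adjustment assumes that data points scale inversely with the configured collection frequency.

```python
MINUTES_PER_YEAR = 60 * 24 * 365
DDU_PER_DATA_POINT = 0.001  # classic licensing rate stated above


def annual_data_points(nodes, frus, svms, disks, aggregates, volumes,
                       qos_volumes, snapmirror_relationships,
                       interval_minutes=1):
    """Approximate data points per year with all feature sets enabled.

    qos_volumes is the number of volumes of SVMs with a QoS policy.
    Assumption: a longer collection interval reduces data points proportionally.
    """
    per_collection = (
        16
        + 4 * nodes
        + 1 * frus
        + 1 * svms
        + 2 * disks
        + 5 * aggregates
        + 20 * volumes
        + 4 * qos_volumes
        + 3 * snapmirror_relationships
    )
    return per_collection * MINUTES_PER_YEAR / interval_minutes


def annual_ddus(data_points):
    """Convert annual data points to an annual DDU estimate."""
    return data_points * DDU_PER_DATA_POINT


if __name__ == "__main__":
    # Illustrative cluster sizes only.
    points = annual_data_points(nodes=2, frus=10, svms=4, disks=24,
                                aggregates=2, volumes=50, qos_volumes=10,
                                snapmirror_relationships=5)
    print(f"data points/year: {points:,.0f}")
    print(f"DDUs/year: {annual_ddus(points):,.1f}")
```

For this example, each collection yields 1,151 data points, so volumes dominate the total; disabling feature sets that are not needed is the most direct way to lower the estimate.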

This extension additionally reports log events in two situations:

  • When a cluster node restart is detected.
  • When the extension cannot connect to the configured cluster API endpoint.
    • Each minute will have another event reported until the issue is resolved and a successful connection occurs.

For these log events, license consumption is based on the size (in bytes) of data ingested & processed, retained, and queried. To learn more about the dimensions affecting license consumption, see Logs powered by Grail (DPS).

For information about log record ingestion in the classic licensing model, see Davis Data Units (DDUs).

Feature sets

When activating your extension using a monitoring configuration, you can limit monitoring to one of the feature sets. To work properly, the extension has to collect at least one metric after the activation.

In highly segmented networks, feature sets can reflect the segments of your environment. Then, when you create a monitoring configuration, you can select a feature set and a corresponding ActiveGate group that can connect to this particular segment.

All metrics that aren't categorized into any feature set are considered to be the default and are always reported.

A metric inherits the feature set of a subgroup, which in turn inherits the feature set of a group. Also, the feature set defined on the metric level overrides the feature set defined on the subgroup level, which in turn overrides the feature set defined on the group level.
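
The inheritance and override rules can be read as "the most specific definition wins". The following sketch illustrates that reading; the function name and arguments are hypothetical and do not reflect how the extension framework is implemented.

```python
from typing import Optional


def effective_feature_set(metric: Optional[str] = None,
                          subgroup: Optional[str] = None,
                          group: Optional[str] = None) -> str:
    """Resolve the feature set for one metric.

    A value defined on the metric overrides the subgroup, which overrides
    the group. A metric with no feature set at any level falls into
    "default" and is always reported.
    """
    for candidate in (metric, subgroup, group):
        if candidate is not None:
            return candidate
    return "default"


if __name__ == "__main__":
    print(effective_feature_set(group="volumes"))                # inherited from the group
    print(effective_feature_set(metric="qos", group="volumes"))  # metric-level override
    print(effective_feature_set())                               # uncategorized metric
```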

nodes

| Metric name | Metric key | Description |
| --- | --- | --- |
| Node uptime | netapp.ontap.node.uptime | How long the node reports it has been running |
| Over temperature | netapp.ontap.node.over_temperature | Specifies whether the hardware is currently operating outside of its recommended temperature range (0 = "normal", 1 = "over"). |
| Node membership | netapp.ontap.node.membership | Membership status of the cluster node |
| Node processor utilization | netapp.ontap.node.processor_utilization | Average CPU utilization for the node |

frus

| Metric name | Metric key | Description |
| --- | --- | --- |
| FRU state | netapp.ontap.node.fru.state | State of the field replaceable unit (100% for OK, 0% for ERROR) |

default

| Metric name | Metric key | Description |
| --- | --- | --- |
| Cluster availability | netapp.ontap.cluster.availability | Connectivity to the configured OnTap cluster URL as detected by the extension |

clusters

| Metric name | Metric key | Description |
| --- | --- | --- |
| Cluster IOPS (other) | netapp.ontap.cluster.iops_other.count | The cluster's number of I/O operations observed at the storage object (other) |
| Cluster IOPS (read) | netapp.ontap.cluster.iops_read.count | The cluster's number of I/O operations observed at the storage object (read) |
| Cluster IOPS (total) | netapp.ontap.cluster.iops_total.count | The cluster's number of I/O operations observed at the storage object (total) |
| Cluster IOPS (write) | netapp.ontap.cluster.iops_write.count | The cluster's number of I/O operations observed at the storage object (write) |
| Cluster throughput (other) | netapp.ontap.cluster.throughput_other.count | The cluster's rate of throughput bytes observed at the storage object (other) |
| Cluster throughput (read) | netapp.ontap.cluster.throughput_read.count | The cluster's rate of throughput bytes observed at the storage object (read) |
| Cluster throughput (total) | netapp.ontap.cluster.throughput_total.count | The cluster's rate of throughput bytes observed at the storage object (total) |
| Cluster throughput (write) | netapp.ontap.cluster.throughput_write.count | The cluster's rate of throughput bytes observed at the storage object (write) |
| Cluster latency (other) | netapp.ontap.cluster.latency_other.count | The cluster's raw latency in microseconds observed at the storage object (other) |
| Cluster latency (read) | netapp.ontap.cluster.latency_read.count | The cluster's raw latency in microseconds observed at the storage object (read) |
| Cluster latency (total) | netapp.ontap.cluster.latency_total.count | The cluster's raw latency in microseconds observed at the storage object (total) |
| Cluster latency (write) | netapp.ontap.cluster.latency_write.count | The cluster's raw latency in microseconds observed at the storage object (write) |
| Cluster block storage size | netapp.ontap.cluster.block_storage_size | The size of the cluster's block storage |
| Cluster block storage used | netapp.ontap.cluster.block_storage_used | Amount of block storage on the cluster in use |
| Cluster block storage used percentage | netapp.ontap.cluster.block_storage_used_percentage | The percentage of the cluster's block storage that is currently in use |

disks

| Metric name | Metric key | Description |
| --- | --- | --- |
| Rated life used | netapp.ontap.disk.rated_life_used_percentage | Percentage of rated life used |
| Disk state | netapp.ontap.disk.state | Current disk state: broken, copy, maintenance, partner, pending, present, reconstructing, removed, spare, unfail, or zeroing |

aggregates

| Metric name | Metric key | Description |
| --- | --- | --- |
| Aggregate state | netapp.ontap.aggregate.state | Current aggregate state: online, onlining, offline, offlining, relocating, unmounted, restricted, inconsistent, failed, or unknown |
| Aggregate block storage used | netapp.ontap.aggregate.block_storage_used | Space used or reserved in bytes. Includes volume guarantees and aggregate metadata. |
| Aggregate block storage available | netapp.ontap.aggregate.block_storage_available | Space available in bytes |
| Aggregate block storage size | netapp.ontap.aggregate.block_storage_size | Total usable space in bytes, not including WAFL reserve and aggregate Snapshot copy reserve. |
| Aggregate block storage used percentage | netapp.ontap.aggregate.block_storage_used_percent | Percentage of block storage used |

qos

| Metric name | Metric key | Description |
| --- | --- | --- |
| Volume QOS minimum throughput (IOPS) | netapp.ontap.volume.qos.min_throughput_iops | The minimum throughput in IOPS (volumes) |
| Volume QOS maximum throughput (IOPS) | netapp.ontap.volume.qos.max_throughput_iops | The maximum throughput in IOPS (volumes) |
| Volume QOS maximum throughput (Mbps) | netapp.ontap.volume.qos.max_throughput_mbps | The maximum throughput in Mbps (volumes) |
| Volume QOS minimum throughput (Mbps) | netapp.ontap.volume.qos.min_throughput_mbps | The minimum throughput in Mbps (volumes) |
| SVM QOS minimum throughput (IOPS) | netapp.ontap.svm.qos.min_throughput_iops | The minimum throughput in IOPS (svms) |
| SVM QOS maximum throughput (IOPS) | netapp.ontap.svm.qos.max_throughput_iops | The maximum throughput in IOPS (svms) |
| | netapp.ontap.svm.qos.max_throughput_mbps | |
| SVM QOS minimum throughput (Mbps) | netapp.ontap.svm.qos.min_throughput_mbps | The minimum throughput in Mbps (svms) |

snapmirror-relationships

| Metric name | Metric key | Description |
| --- | --- | --- |
| Lag time | netapp.ontap.snapmirror.relationship.lag_time | The time since the exported snapshot was created |
| Relationship state | netapp.ontap.snapmirror.relationship.state | The state of the relationship |
| Relationship health | netapp.ontap.snapmirror.relationship.health | Is the relationship healthy? |

svms

| Metric name | Metric key | Description |
| --- | --- | --- |
| SVM state | netapp.ontap.svm.state | Current SVM state: starting, running, stopping, stopped, or deleting |

volumes

| Metric name | Metric key | Description |
| --- | --- | --- |
| Volume state | netapp.ontap.volume.state | Volume state: error, mixed, offline, or online |
| Volume throughput (other) | netapp.ontap.volume.throughput.other.count | The volume's rate of throughput bytes observed at the storage object (other) |
| Volume throughput (read) | netapp.ontap.volume.throughput.read.count | The volume's rate of throughput bytes observed at the storage object (read) |
| Volume throughput (write) | netapp.ontap.volume.throughput.write.count | The volume's rate of throughput bytes observed at the storage object (write) |
| Volume throughput (total) | netapp.ontap.volume.throughput.total.count | The volume's rate of throughput bytes observed at the storage object (total) |
| Volume IOPS (other) | netapp.ontap.volume.iops.other.count | The volume's number of I/O operations observed at the storage object (other) |
| Volume IOPS (read) | netapp.ontap.volume.iops.read.count | The volume's number of I/O operations observed at the storage object (read) |
| Volume IOPS (write) | netapp.ontap.volume.iops.write.count | The volume's number of I/O operations observed at the storage object (write) |
| Volume IOPS (total) | netapp.ontap.volume.iops.total.count | The volume's number of I/O operations observed at the storage object (total) |
| Volume latency (total) | netapp.ontap.volume.latency.total | The volume's raw latency in microseconds observed at the storage object (total) |
| Volume latency (read) | netapp.ontap.volume.latency.read | The volume's raw latency in microseconds observed at the storage object (read) |
| Volume latency (write) | netapp.ontap.volume.latency.write | The volume's raw latency in microseconds observed at the storage object (write) |
| Volume latency (other) | netapp.ontap.volume.latency.other | The volume's raw latency in microseconds observed at the storage object (other) |
| Volume size | netapp.ontap.volume.size | Total provisioned size |
| Volume space available | netapp.ontap.volume.available | The available space |
| Volume space used | netapp.ontap.volume.used | Volume space used (including data and metadata) |
| Volume space used percentage | netapp.ontap.volume.used_percent | Percentage of volume space used (including data and metadata) |
| | netapp.ontap.volume.files.maxiumum | |
| Files (inodes) | netapp.ontap.volume.files.used | Number of files (inodes) used for user-visible data permitted on the volume. |
| Files (inodes) used percentage | netapp.ontap.volume.files.used_percentage | Percentage of the maximum number of files used on the volume. |

storage-pools

| Metric name | Metric key | Description |
| --- | --- | --- |
| Storage pool total capacity | netapp.ontap.pool.total_capacity | Total size of the flash pool, in bytes. |
| Storage pool usable capacity | netapp.ontap.pool.usable_capacity | Remaining usable capacity in the flash pool, in bytes. |
| Storage pool used capacity | netapp.ontap.pool.used_capacity | Used capacity in the flash pool, in bytes. |
| Storage pool total capacity | netapp.ontap.pool.used_percentage | Percentage of capacity used in the flash pool. |

luns

| Metric name | Metric key | Description |
| --- | --- | --- |
| LUN state | netapp.ontap.lun.state | The state of the LUN. Normal states for a LUN are online and offline. Other states indicate errors |
| LUN container state | netapp.ontap.lun.container_state | The state of the volume and aggregate that contain the LUN. LUNs are only available when their containers are available |
| LUN enabled state | netapp.ontap.lun.enabled | The enabled state of the LUN. LUNs can be disabled to prevent access to the LUN. 1 = enabled, 0 = disabled |
| LUN space used | netapp.ontap.lun.used | The amount of space consumed by the main data stream of the LUN |
| LUN size | netapp.ontap.lun.size | The total provisioned size of the LUN |
| LUN space used percentage | netapp.ontap.lun.used_percentage | Space used in the LUN as a percentage |

Related tags

  • Storage
  • Python
  • NetApp
  • Infrastructure Observability