Enhanced insights for Hadoop HDFS and Yarn services
1 of 6Overview Dashboard
Get started
Overview
Hadoop monitoring in Dynatrace provides a high-level overview of the main Hadoop components within your cluster. These enhanced insights provides additional metrics directly from the HDFS and YARN services.
Activate this extension in your Dynatrace environment from the in-product Hub and simply select which OneAgents to enable this on.
Use cases
The extension enables insights into the overall health of Hadoop HDFS and YARN services.
Compatibility information
Hadoop version 2.4.1+
Linux OS
For full Hadoop visibility, OneAgent must be installed on all machines running the following Hadoop processes: NameNode, ResourceManager, NodeManager, DataNode, and MRAppMaster
Details
This extension provides extra Hadoop metrics through the use of JMX queries.
HDFS - Hadoop Distributed File System
Improved visibility into the health of your HDFS NameNodes and DataNodes
YARN - Yet Another Resource Negotiator
Improved visibility into the health of your YARN NodeManagers, ResourceManagers and MRAppMaster services
Feature sets
When activating your extension using monitoring configuration, you can limit monitoring to one of the feature sets. To work properly the extension has to collect at least one metric after the activation.
In highly segmented networks, feature sets can reflect the segments of your environment. Then, when you create a monitoring configuration, you can select a feature set and a corresponding ActiveGate group that can connect to this particular segment.
All metrics that aren't categorized into any feature set are considered to be the default and are always reported.
A metric inherits the feature set of a subgroup, which in turn inherits the feature set of a group. Also, the feature set defined on the metric level overrides the feature set defined on the subgroup level, which in turn overrides the feature set defined on the group level.
Metric name
Metric key
Description
FilesTotal
hadoop.hdfs.namenode.FilesTotal
Current number of files and directories
PendingReplicationBlocks
hadoop.hdfs.namenode.PendingReplicationBlocks
Current number of blocks pending to be replicated
UnderReplicatedBlocks
hadoop.hdfs.namenode.UnderReplicatedBlocks
Current number of blocks under replicated
ScheduledReplicationBlocks
hadoop.hdfs.namenode.ScheduledReplicationBlocks
Current number of blocks scheduled for replications
NumLiveDataNodes
hadoop.hdfs.namenode.NumLiveDataNodes
Number of datanodes which are currently live
NumDeadDataNodes
hadoop.hdfs.namenode.NumDeadDataNodes
Number of datanodes which are currently dead
NumDecomLiveDataNodes
hadoop.hdfs.namenode.NumDecomLiveDataNodes
Number of datanodes which have been decommissioned and are now live
NumDecomDeadDataNodes
hadoop.hdfs.namenode.NumDecomDeadDataNodes
Number of datanodes which have been decommissioned and are now dead
VolumeFailuresTotal
hadoop.hdfs.namenode.VolumeFailuresTotal
Total number of volume failures across all Datanodes
EstimatedCapacityLostTotal
hadoop.hdfs.namenode.EstimatedCapacityLostTotal
An estimate of the total capacity lost due to volume failures
NumDecommissioningDataNodes
hadoop.hdfs.namenode.NumDecommissioningDataNodes
Number of datanodes in decommissioning state
NumStaleDataNodes
hadoop.hdfs.namenode.NumStaleDataNodes
Number of datanodes marked stale due to delayed hearbeat.
Metric name
Metric key
Description
CapacityTotal
hadoop.hdfs.namenode.CapacityTotal
Current raw capacity of DataNodes in bytes
CapacityUsed
hadoop.hdfs.namenode.CapacityUsed
Current used capacity across all DataNodes in bytes
CapacityRemaining
hadoop.hdfs.namenode.CapacityRemaining
Current remaining capacity in bytes
TotalLoad
hadoop.hdfs.namenode.TotalLoad
Current number of connections
BlocksTotal
hadoop.hdfs.namenode.BlocksTotal
Current number of allocated blocks in the system
PendingDeletionBlocks
hadoop.hdfs.namenode.PendingDeletionBlocks
Current number of blocks pending deletion
CorruptBlocks
hadoop.hdfs.namenode.CorruptBlocks
Current number of blocks with corrupt replicas.
CapacityUsedNonDFS
hadoop.hdfs.namenode.CapacityUsedNonDFS
Current space used by DataNodes for non DFS purposes in bytes