Amazon EMR (Elastic MapReduce) monitoring

  • How-to guide
  • 12-min read
  • Published Oct 15, 2020

Dynatrace ingests metrics for multiple preselected namespaces, including Amazon EMR. You can view metrics for each service instance, split metrics into multiple dimensions, and create custom charts that you can pin to your dashboards.

Prerequisites

To enable monitoring for this service, you need

  • ActiveGate version 1.181+, as follows:

    • For Dynatrace SaaS deployments, you need an Environment ActiveGate or a Multi-environment ActiveGate.

    • For Dynatrace Managed deployments, you can use any kind of ActiveGate.

      For role-based access (whether in a SaaS or Managed deployment), you need an Environment ActiveGate installed on an Amazon EC2 host.

  • Dynatrace version 1.182+

  • An updated AWS monitoring policy to include the additional AWS services.
    To update the AWS IAM policy, use the JSON below, containing the monitoring policy (permissions) for all supporting services.

{
"Version": "2012-10-17",
"Statement": [
{
"Sid": "VisualEditor0",
"Effect": "Allow",
"Action": [
"acm-pca:ListCertificateAuthorities",
"apigateway:GET",
"apprunner:ListServices",
"appstream:DescribeFleets",
"appsync:ListGraphqlApis",
"athena:ListWorkGroups",
"autoscaling:DescribeAutoScalingGroups",
"cloudformation:ListStackResources",
"cloudfront:ListDistributions",
"cloudhsm:DescribeClusters",
"cloudsearch:DescribeDomains",
"cloudwatch:GetMetricData",
"cloudwatch:GetMetricStatistics",
"cloudwatch:ListMetrics",
"codebuild:ListProjects",
"datasync:ListTasks",
"dax:DescribeClusters",
"directconnect:DescribeConnections",
"dms:DescribeReplicationInstances",
"dynamodb:ListTables",
"dynamodb:ListTagsOfResource",
"ec2:DescribeAvailabilityZones",
"ec2:DescribeInstances",
"ec2:DescribeNatGateways",
"ec2:DescribeSpotFleetRequests",
"ec2:DescribeTransitGateways",
"ec2:DescribeVolumes",
"ec2:DescribeVpnConnections",
"ecs:ListClusters",
"eks:ListClusters",
"elasticache:DescribeCacheClusters",
"elasticbeanstalk:DescribeEnvironmentResources",
"elasticbeanstalk:DescribeEnvironments",
"elasticfilesystem:DescribeFileSystems",
"elasticloadbalancing:DescribeInstanceHealth",
"elasticloadbalancing:DescribeListeners",
"elasticloadbalancing:DescribeLoadBalancers",
"elasticloadbalancing:DescribeRules",
"elasticloadbalancing:DescribeTags",
"elasticloadbalancing:DescribeTargetHealth",
"elasticmapreduce:ListClusters",
"elastictranscoder:ListPipelines",
"es:ListDomainNames",
"events:ListEventBuses",
"firehose:ListDeliveryStreams",
"fsx:DescribeFileSystems",
"gamelift:ListFleets",
"glue:GetJobs",
"inspector:ListAssessmentTemplates",
"kafka:ListClusters",
"kinesis:ListStreams",
"kinesisanalytics:ListApplications",
"kinesisvideo:ListStreams",
"lambda:ListFunctions",
"lambda:ListTags",
"lex:GetBots",
"logs:DescribeLogGroups",
"mediaconnect:ListFlows",
"mediaconvert:DescribeEndpoints",
"mediapackage-vod:ListPackagingConfigurations",
"mediapackage:ListChannels",
"mediatailor:ListPlaybackConfigurations",
"opsworks:DescribeStacks",
"qldb:ListLedgers",
"rds:DescribeDBClusters",
"rds:DescribeDBInstances",
"rds:DescribeEvents",
"rds:ListTagsForResource",
"redshift:DescribeClusters",
"robomaker:ListSimulationJobs",
"route53:ListHostedZones",
"route53resolver:ListResolverEndpoints",
"s3:ListAllMyBuckets",
"sagemaker:ListEndpoints",
"sns:ListTopics",
"sqs:ListQueues",
"storagegateway:ListGateways",
"sts:GetCallerIdentity",
"swf:ListDomains",
"tag:GetResources",
"tag:GetTagKeys",
"transfer:ListServers",
"workmail:ListOrganizations",
"workspaces:DescribeWorkspaces"
],
"Resource": "*"
}
]
}

If you don't want to add permissions to all services, and just select permissions for certain services, consult the table below. The table contains a set of permissions that are required for All AWS cloud services and, for each supporting service, a list of optional permissions specific to that service.

Permissions required for AWS monitoring integration:
  • "cloudwatch:GetMetricData"
  • "cloudwatch:GetMetricStatistics"
  • "cloudwatch:ListMetrics"
  • "sts:GetCallerIdentity"
  • "tag:GetResources"
  • "tag:GetTagKeys"
  • "ec2:DescribeAvailabilityZones"
NamePermissions
All monitored Amazon services requiredcloudwatch:GetMetricData,
cloudwatch:GetMetricStatistics,
cloudwatch:ListMetrics,
sts:GetCallerIdentity,
tag:GetResources,
tag:GetTagKeys,
ec2:DescribeAvailabilityZones
AWS Certificate Manager Private Certificate Authorityacm-pca:ListCertificateAuthorities
Amazon MQ
Amazon API Gatewayapigateway:GET
AWS App Runnerapprunner:ListServices
Amazon AppStreamappstream:DescribeFleets
AWS AppSyncappsync:ListGraphqlApis
Amazon Athenaathena:ListWorkGroups
Amazon Aurorards:DescribeDBClusters
Amazon EC2 Auto Scalingautoscaling:DescribeAutoScalingGroups
Amazon EC2 Auto Scaling (built-in)autoscaling:DescribeAutoScalingGroups
AWS Billing
Amazon Keyspaces
AWS Chatbot
Amazon CloudFrontcloudfront:ListDistributions
AWS CloudHSMcloudhsm:DescribeClusters
Amazon CloudSearchcloudsearch:DescribeDomains
AWS CodeBuildcodebuild:ListProjects
Amazon Cognito
Amazon Connect
Amazon Elastic Kubernetes Service (EKS)eks:ListClusters
AWS DataSyncdatasync:ListTasks
Amazon DynamoDB Accelerator (DAX)dax:DescribeClusters
AWS Database Migration Service (AWS DMS)dms:DescribeReplicationInstances
Amazon DocumentDBrds:DescribeDBClusters
AWS Direct Connectdirectconnect:DescribeConnections
Amazon DynamoDBdynamodb:ListTables
Amazon DynamoDB (built-in)dynamodb:ListTables,
dynamodb:ListTagsOfResource
Amazon EBSec2:DescribeVolumes
Amazon EBS (built-in)ec2:DescribeVolumes
Amazon EC2 API
Amazon EC2 (built-in)ec2:DescribeInstances
Amazon EC2 Spot Fleetec2:DescribeSpotFleetRequests
Amazon Elastic Container Service (ECS)ecs:ListClusters
Amazon ECS Container Insightsecs:ListClusters
Amazon ElastiCache (EC)elasticache:DescribeCacheClusters
AWS Elastic Beanstalkelasticbeanstalk:DescribeEnvironments
Amazon Elastic File System (EFS)elasticfilesystem:DescribeFileSystems
Amazon Elastic Inference
Amazon Elastic Map Reduce (EMR)elasticmapreduce:ListClusters
Amazon Elasticsearch Service (ES)es:ListDomainNames
Amazon Elastic Transcoderelastictranscoder:ListPipelines
Amazon Elastic Load Balancer (ELB) (built-in)elasticloadbalancing:DescribeInstanceHealth,
elasticloadbalancing:DescribeListeners,
elasticloadbalancing:DescribeLoadBalancers,
elasticloadbalancing:DescribeRules,
elasticloadbalancing:DescribeTags,
elasticloadbalancing:DescribeTargetHealth
Amazon EventBridgeevents:ListEventBuses
Amazon FSxfsx:DescribeFileSystems
Amazon GameLiftgamelift:ListFleets
AWS Glueglue:GetJobs
Amazon Inspectorinspector:ListAssessmentTemplates
AWS Internet of Things (IoT)
AWS IoT Analytics
Amazon Managed Streaming for Kafkakafka:ListClusters
Amazon Kinesis Data Analyticskinesisanalytics:ListApplications
Amazon Data Firehosefirehose:ListDeliveryStreams
Amazon Kinesis Data Streamskinesis:ListStreams
Amazon Kinesis Video Streamskinesisvideo:ListStreams
AWS Lambdalambda:ListFunctions
AWS Lambda (built-in)lambda:ListFunctions,
lambda:ListTags
Amazon Lexlex:GetBots
Amazon Application and Network Load Balancer (built-in)elasticloadbalancing:DescribeInstanceHealth,
elasticloadbalancing:DescribeListeners,
elasticloadbalancing:DescribeLoadBalancers,
elasticloadbalancing:DescribeRules,
elasticloadbalancing:DescribeTags,
elasticloadbalancing:DescribeTargetHealth
Amazon CloudWatch Logslogs:DescribeLogGroups
AWS Elemental MediaConnectmediaconnect:ListFlows
AWS Elemental MediaConvertmediaconvert:DescribeEndpoints
AWS Elemental MediaPackage Livemediapackage:ListChannels
AWS Elemental MediaPackage Video on Demandmediapackage-vod:ListPackagingConfigurations
AWS Elemental MediaTailormediatailor:ListPlaybackConfigurations
Amazon VPC NAT Gatewaysec2:DescribeNatGateways
Amazon Neptunerds:DescribeDBClusters
AWS OpsWorksopsworks:DescribeStacks
Amazon Polly
Amazon QLDBqldb:ListLedgers
Amazon RDSrds:DescribeDBInstances
Amazon RDS (built-in)rds:DescribeDBInstances,
rds:DescribeEvents,
rds:ListTagsForResource
Amazon Redshiftredshift:DescribeClusters
Amazon Rekognition
AWS RoboMakerrobomaker:ListSimulationJobs
Amazon Route 53route53:ListHostedZones
Amazon Route 53 Resolverroute53resolver:ListResolverEndpoints
Amazon S3s3:ListAllMyBuckets
Amazon S3 (built-in)s3:ListAllMyBuckets
Amazon SageMaker Batch Transform Jobs
Amazon SageMaker Endpoint Instancessagemaker:ListEndpoints
Amazon SageMaker Endpointssagemaker:ListEndpoints
Amazon SageMaker Ground Truth
Amazon SageMaker Processing Jobs
Amazon SageMaker Training Jobs
AWS Service Catalog
Amazon Simple Email Service (SES)
Amazon Simple Notification Service (SNS)sns:ListTopics
Amazon Simple Queue Service (SQS)sqs:ListQueues
AWS Systems Manager - Run Command
AWS Step Functions
AWS Storage Gatewaystoragegateway:ListGateways
Amazon SWFswf:ListDomains
Amazon Textract
AWS IoT Things Graph
AWS Transfer Familytransfer:ListServers
AWS Transit Gatewayec2:DescribeTransitGateways
Amazon Translate
AWS Trusted Advisor
AWS API Usage
AWS Site-to-Site VPNec2:DescribeVpnConnections
AWS WAF Classic
AWS WAF
Amazon WorkMailworkmail:ListOrganizations
Amazon WorkSpacesworkspaces:DescribeWorkspaces

Example of JSON policy for one single service.

{
"Version": "2012-10-17",
"Statement": [
{
"Sid": "VisualEditor0",
"Effect": "Allow",
"Action": [
"apigateway:GET",
"cloudwatch:GetMetricData",
"cloudwatch:GetMetricStatistics",
"cloudwatch:ListMetrics",
"sts:GetCallerIdentity",
"tag:GetResources",
"tag:GetTagKeys",
"ec2:DescribeAvailabilityZones"
],
"Resource": "*"
}
]
}

In this example, from the complete list of permissions you need to select

  • "apigateway:GET" for Amazon API Gateway
  • "cloudwatch:GetMetricData", "cloudwatch:GetMetricStatistics", "cloudwatch:ListMetrics", "sts:GetCallerIdentity", "tag:GetResources", "tag:GetTagKeys", and "ec2:DescribeAvailabilityZones" for All AWS cloud services.

Enable monitoring

To learn how to enable service monitoring, see Enable service monitoring.

View service metrics

You can view the service metrics in your Dynatrace environment either on the custom device overview page or on your Dashboards page.

View metrics on the custom device overview page

To access the custom device overview page

  1. Go to Technologies & Processes Classic.
  2. Filter by service name and select the relevant custom device group.
  3. Once you select the custom device group, you're on the custom device group overview page.
  4. The custom device group overview page lists all instances (custom devices) belonging to the group. Select an instance to view the custom device overview page.

View metrics on your dashboard

You can also view metrics in the Dynatrace web UI on dashboards. There is no preset dashboard available for this service, but you can create your own dashboard.

To check the availability of preset dashboards for each AWS service, see the list below.

AWS servicePreset dashboard
Amazon EC2 Auto Scaling (built-in)Not applicable
AWS Lambda (built-in)Not applicable
Amazon Application and Network Load Balancer (built-in)Not applicable
Amazon DynamoDB (built-in)Not applicable
Amazon EBS (built-in)Not applicable
Amazon EC2 (built-in)Not applicable
Amazon Elastic Load Balancer (ELB) (built-in)Not applicable
Amazon RDS (built-in)Not applicable
Amazon S3 (built-in)Not applicable
AWS Certificate Manager Private Certificate AuthorityNot applicable
All monitored Amazon servicesNot applicable
Amazon API GatewayNot applicable
AWS App RunnerNot applicable
Amazon AppStreamApplicable
AWS AppSyncApplicable
Amazon AthenaApplicable
Amazon AuroraNot applicable
Amazon EC2 Auto ScalingApplicable
AWS BillingApplicable
Amazon KeyspacesApplicable
AWS ChatbotApplicable
Amazon CloudFrontNot applicable
AWS CloudHSMApplicable
Amazon CloudSearchApplicable
AWS CodeBuildApplicable
Amazon CognitoNot applicable
Amazon ConnectApplicable
AWS DataSyncApplicable
Amazon DynamoDB Accelerator (DAX)Applicable
AWS Database Migration Service (AWS DMS)Applicable
Amazon DocumentDBApplicable
AWS Direct ConnectApplicable
Amazon DynamoDBNot applicable
Amazon EBSNot applicable
Amazon EC2 Spot FleetNot applicable
Amazon EC2 APIApplicable
Amazon Elastic Container Service (ECS)Not applicable
Amazon ECS Container InsightsApplicable
Amazon Elastic File System (EFS)Not applicable
Amazon Elastic Kubernetes Service (EKS)Applicable
Amazon ElastiCache (EC)Not applicable
AWS Elastic BeanstalkApplicable
Amazon Elastic InferenceApplicable
Amazon Elastic TranscoderApplicable
Amazon Elastic Map Reduce (EMR)Not applicable
Amazon Elasticsearch Service (ES)Not applicable
Amazon EventBridgeApplicable
Amazon FSxApplicable
Amazon GameLiftApplicable
AWS GlueNot applicable
Amazon InspectorApplicable
AWS Internet of Things (IoT)Not applicable
AWS IoT Things GraphApplicable
AWS IoT AnalyticsApplicable
Amazon Managed Streaming for KafkaApplicable
Amazon Kinesis Data AnalyticsNot applicable
Amazon Data FirehoseNot applicable
Amazon Kinesis Data StreamsNot applicable
Amazon Kinesis Video StreamsNot applicable
AWS LambdaNot applicable
Amazon LexApplicable
Amazon CloudWatch LogsApplicable
AWS Elemental MediaTailorApplicable
AWS Elemental MediaConnectApplicable
AWS Elemental MediaConvertApplicable
AWS Elemental MediaPackage LiveApplicable
AWS Elemental MediaPackage Video on DemandApplicable
Amazon MQApplicable
Amazon VPC NAT GatewaysNot applicable
Amazon NeptuneApplicable
AWS OpsWorksApplicable
Amazon PollyApplicable
Amazon QLDBApplicable
Amazon RDSNot applicable
Amazon RedshiftNot applicable
Amazon RekognitionApplicable
AWS RoboMakerApplicable
Amazon Route 53Applicable
Amazon Route 53 ResolverApplicable
Amazon S3Not applicable
Amazon SageMaker Batch Transform JobsNot applicable
Amazon SageMaker EndpointsNot applicable
Amazon SageMaker Endpoint InstancesNot applicable
Amazon SageMaker Ground TruthNot applicable
Amazon SageMaker Processing JobsNot applicable
Amazon SageMaker Training JobsNot applicable
AWS Service CatalogApplicable
Amazon Simple Email Service (SES)Not applicable
Amazon Simple Notification Service (SNS)Not applicable
Amazon Simple Queue Service (SQS)Not applicable
AWS Systems Manager - Run CommandApplicable
AWS Step FunctionsApplicable
AWS Storage GatewayApplicable
Amazon SWFApplicable
Amazon TextractApplicable
AWS Transfer FamilyApplicable
AWS Transit GatewayApplicable
Amazon TranslateApplicable
AWS Trusted AdvisorApplicable
AWS API UsageApplicable
AWS Site-to-Site VPNApplicable
AWS WAF ClassicApplicable
AWS WAFApplicable
Amazon WorkMailApplicable
Amazon WorkSpacesApplicable

Available metrics

JobFlowId is the main dimension.

NameDescriptionUnitStatisticsDimensionsRecommended
AppsCompletedThe number of applications submitted to YARN that have completedCountSumJobFlowId, JobId
AppsCompletedCountSumJobFlowId
AppsFailedThe number of applications submitted to YARN that have failed to completeCountSumJobFlowId, JobId
AppsFailedCountSumJobFlowId
AppsKilledThe number of applications submitted to YARN that have been killedCountSumJobFlowId, JobId
AppsKilledCountSumJobFlowId
AppsPendingThe number of applications submitted to YARN that are in a Pending stateCountSumJobFlowId, JobId
AppsPendingCountSumJobFlowId
AppsRunningThe number of applications submitted to YARN that are runningCountSumJobFlowId, JobIdApplicable
AppsRunningCountSumJobFlowIdApplicable
AppsSubmittedThe number of applications submitted to YARNCountSumJobFlowId, JobId
AppsSubmittedCountSumJobFlowId
BackupFailedShows if the last backup failed. Set to 0 by default and updated to 1 if the previous backup attempt failed. This metric is only reported for HBase clusters.CountSumJobFlowId, JobId
BackupFailedCountSumJobFlowId
CapacityRemainingGBThe amount of remaining HDFS disk capacityCountSumJobFlowId, JobId
CapacityRemainingGBCountSumJobFlowId
ContainerAllocatedThe number of resource containers allocated by the resource managerCountSumJobFlowId, JobId
ContainerAllocatedCountSumJobFlowId
ContainerPendingRatioThe ratio (in numbers) of pending containers to containers allocated (ContainerPendingRatio = ContainerPending / ContainerAllocated). If ContainerAllocated = 0, then ContainerPendingRatio = ContainerPending.CountSumJobFlowId, JobId
ContainerPendingRatioCountSumJobFlowId
ContainerPendingThe number of containers in the queue that have not yet been allocatedCountSumJobFlowId, JobId
ContainerPendingCountSumJobFlowId
ContainerReservedThe number of containers reservedCountSumJobFlowId, JobId
ContainerReservedCountSumJobFlowId
CoreNodesPendingThe number of core nodes waiting to be assigned (pending requests)CountSumJobFlowId, JobId
CoreNodesPendingCountSumJobFlowId
CoreNodesRequestedCountSumJobFlowId, JobId
CoreNodesRequestedCountSumJobFlowId
CoreNodesRunningThe number of working core nodesCountSumJobFlowId, JobId
CoreNodesRunningCountSumJobFlowId
CoreUnitsRequestedCountSumJobFlowId, JobId
CoreUnitsRequestedCountSumJobFlowId
CoreUnitsRunningCountSumJobFlowId, JobId
CoreUnitsRunningCountSumJobFlowId
CoreVCPURequestedCountSumJobFlowId, JobId
CoreVCPURequestedCountSumJobFlowId
CoreVCPURunningCountSumJobFlowId, JobId
CoreVCPURunningCountSumJobFlowId
CorruptBlocksThe number of blocks that HDFS reports as corruptedCountSumJobFlowId, JobId
CorruptBlocksCountSumJobFlowId
DfsPendingReplicationBlocksThe status of block replication: blocks being replicated, age of replication requests, and unsuccessful replication requestsCountSumJobFlowId, JobId
DfsPendingReplicationBlocksCountSumJobFlowId
HDFSBytesReadThe number of bytes read from HDFSCountSumJobFlowId, JobId
HDFSBytesReadCountSumJobFlowId
HDFSBytesWrittenThe number of bytes written to HDFSCountSumJobFlowId, JobId
HDFSBytesWrittenCountSumJobFlowId
HDFSUtilizationThe percentage of HDFS storage currently usedPercentAverageJobFlowId, JobIdApplicable
HDFSUtilizationPercentAverageJobFlowIdApplicable
HbaseBackupFailedShows if the last backup failed. Set to 0 by default and updated to 1 if the previous backup attempt failed. This metric is only reported for HBase clusters.CountMinimumJobFlowId, JobId
HbaseBackupFailedCountMinimumJobFlowId
IsIdleIndicates that a cluster is no longer performing work, but is still alive and accruing charges. Set to 1 if no tasks are running and no jobs are running, and to 0 otherwise. This value is checked at five-minute intervals and a value of 1 indicates only that the cluster was idle when checked, not that it was idle for the entire five minutes.CountMinimumJobFlowId, JobIdApplicable
IsIdleCountMinimumJobFlowIdApplicable
JobsFailedThe number of jobs in the cluster that have failedCountSumJobFlowId, JobId
JobsFailedCountSumJobFlowId
JobsRunningThe number of jobs in the cluster that are currently runningCountSumJobFlowId, JobId
JobsRunningCountSumJobFlowId
LiveDataNodesThe percentage of data nodes that are receiving work from HadoopCountSumJobFlowId, JobId
LiveDataNodesCountSumJobFlowId
LiveTaskTrackersThe percentage of task trackers that are functionalPercentAverageJobFlowId, JobId
LiveTaskTrackersPercentAverageJobFlowId
MRActiveNodesThe number of nodes presently running MapReduce tasks or jobs. Equivalent to YARN metric mapred.resourcemanager.NoOfActiveNodesCountSumJobFlowId, JobId
MRActiveNodesCountSumJobFlowId
MRDecommissionedNodesThe number of nodes allocated to MapReduce applications that have been marked in a Decommissioned stateCountSumJobFlowId, JobId
MRDecommissionedNodesCountSumJobFlowId
MRLostNodesThe number of nodes allocated to MapReduce that have been marked in a Lost stateCountSumJobFlowId, JobId
MRLostNodesCountSumJobFlowId
MRRebootedNodesThe number of nodes available to MapReduce that have been rebooted and marked in a Rebooted stateCountSumJobFlowId, JobId
MRRebootedNodesCountSumJobFlowId
MRTotalNodesThe number of nodes presently available to MapReduce jobsCountSumJobFlowId, JobId
MRTotalNodesCountSumJobFlowId
MRUnhealthyNodesThe number of nodes available to MapReduce jobs marked in an Unhealthy stateCountSumJobFlowId, JobId
MRUnhealthyNodesCountSumJobFlowId
MapSlotsOpenThe unused map task capacity. This is calculated as the maximum number of map tasks for a given cluster, less the total number of map tasks currently running in that cluster.CountSumJobFlowId, JobId
MapSlotsOpenCountSumJobFlowId
MapTasksRemainingThe number of remaining map tasks for each jobCountSumJobFlowId, JobId
MapTasksRemainingCountSumJobFlowId
MapTasksRunningThe number of running map tasks for each jobCountSumJobFlowId, JobIdApplicable
MapTasksRunningCountSumJobFlowIdApplicable
MemoryAllocatedMBThe amount of memory allocated to the clusterCountSumJobFlowId, JobId
MemoryAllocatedMBCountSumJobFlowId
MemoryAvailableMBThe amount of memory available for allocationCountSumJobFlowId, JobId
MemoryAvailableMBCountSumJobFlowId
MemoryReservedMBThe amount of memory reserved for allocationCountSumJobFlowId, JobId
MemoryReservedMBCountSumJobFlowId
MemoryTotalMBThe total amount of memory in the clusterCountSumJobFlowId, JobId
MemoryTotalMBCountSumJobFlowId
MissingBlocksThe number of blocks in which HDFS has no replicas. These might be corrupt blocks.CountSumJobFlowId, JobId
MissingBlocksCountSumJobFlowId
MostRecentBackupDurationThe amount of time it took the previous backup to complete. This metric is set regardless of whether the last completed backup succeeded or failed. While the backup is ongoing, this metric returns the number of minutes after the backup started. This metric is only reported for HBase clusters.CountSumJobFlowId, JobId
MostRecentBackupDurationCountSumJobFlowId
PendingDeletionBlocksThe number of blocks marked for deletionCountSumJobFlowId, JobId
PendingDeletionBlocksCountSumJobFlowId
ReduceSlotsOpenUnused reduce task capacity. This is calculated as the maximum reduce task capacity for a given cluster, less the number of reduce tasks currently running in that cluster.CountSumJobFlowId, JobId
ReduceSlotsOpenCountSumJobFlowId
ReduceTasksRemainingThe number of remaining reduce tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated.CountSumJobFlowId
ReduceTasksRunningThe number of running reduce tasks for each job. If you have a scheduler installed and multiple jobs running, multiple graphs are generated.CountSumJobFlowId, JobId
ReduceTasksRunningCountSumJobFlowId
RemainingMapTasksPerSlotThe ratio of the total map tasks remaining to the total map slots available in the clusterPercentAverageJobFlowId, JobId
RemainingMapTasksPerSlotPercentAverageJobFlowId
S3BytesReadThe number of bytes read from Amazon S3. This metric aggregates MapReduce jobs only, and does not apply for other workloads on EMR.CountSumJobFlowId, JobId
S3BytesReadCountSumJobFlowId
S3BytesWrittenThe number of bytes written to Amazon S3. This metric aggregates MapReduce jobs only, and does not apply for other workloads on EMR.CountSumJobFlowId, JobId
S3BytesWrittenCountSumJobFlowId
TaskNodesPendingThe number of task nodes waiting to be assigned (pending requests)CountSumJobFlowId, JobId
TaskNodesPendingCountSumJobFlowId
TaskNodesRequestedCountSumJobFlowId, JobId
TaskNodesRequestedCountSumJobFlowId
TaskNodesRunningThe number of working task nodesCountSumJobFlowId, JobId
TaskNodesRunningCountSumJobFlowId
TaskUnitsRequestedCountSumJobFlowId, JobId
TaskUnitsRequestedCountSumJobFlowId
TaskUnitsRunningCountSumJobFlowId, JobId
TaskUnitsRunningCountSumJobFlowId
TaskVCPURequestedCountSumJobFlowId, JobId
TaskVCPURequestedCountSumJobFlowId
TaskVCPURunningCountSumJobFlowId, JobId
TaskVCPURunningCountSumJobFlowId
TimeSinceLastSuccessfulBackupThe number of elapsed minutes after the last successful HBase backup started on your cluster. This metric is only reported for HBase clusters.CountSumJobFlowId, JobId
TimeSinceLastSuccessfulBackupCountSumJobFlowId
TotalLoadThe total number of concurrent data transfersCountSumJobFlowId, JobId
TotalLoadCountSumJobFlowId
TotalNodesRequestedCountSumJobFlowId, JobId
TotalNodesRequestedCountSumJobFlowId
TotalNodesRunningCountSumJobFlowId, JobId
TotalNodesRunningCountSumJobFlowId
TotalUnitsRequestedCountSumJobFlowId, JobId
TotalUnitsRequestedCountSumJobFlowId
TotalUnitsRunningCountSumJobFlowId, JobId
TotalUnitsRunningCountSumJobFlowId
TotalVCPURequestedCountSumJobFlowId
TotalVCPURunningCountSumJobFlowId, JobId
TotalVCPURunningCountSumJobFlowId
UnderReplicatedBlocksThe number of blocks that need to be replicated one or more timesCountSumJobFlowId, JobId
UnderReplicatedBlocksCountSumJobFlowId
YARNMemoryAvailablePercentagehe percentage of remaining memory available to YARN (YARNMemoryAvailablePercentage = MemoryAvailableMB / MemoryTotalMB)PercentAverageJobFlowId, JobId
YARNMemoryAvailablePercentagePercentAverageJobFlowId
Related tags
Infrastructure Observability