Amazon MSK (Kafka) monitoring

  • How-to guide
  • 14-min read

Dynatrace ingests metrics for multiple preselected namespaces, including Amazon MSK (Kafka). You can view metrics for each service instance, split metrics into multiple dimensions, and create custom charts that you can pin to your dashboards.

Prerequisites

To enable monitoring for this service, you need

  • ActiveGate version 1.197+

    • For Dynatrace SaaS deployments, you need an Environment ActiveGate or a Multi-environment ActiveGate.

    • For Dynatrace Managed deployments, you can use any kind of ActiveGate.

      For role-based access (whether in a SaaS or Managed deployment), you need an Environment ActiveGate installed on an Amazon EC2 host.

  • Dynatrace version 1.203+

  • An updated AWS monitoring policy to include the additional AWS services.
    To update the AWS IAM policy, use the JSON below, containing the monitoring policy (permissions) for all supporting services.

{
"Version": "2012-10-17",
"Statement": [
{
"Sid": "VisualEditor0",
"Effect": "Allow",
"Action": [
"acm-pca:ListCertificateAuthorities",
"apigateway:GET",
"apprunner:ListServices",
"appstream:DescribeFleets",
"appsync:ListGraphqlApis",
"athena:ListWorkGroups",
"autoscaling:DescribeAutoScalingGroups",
"cloudformation:ListStackResources",
"cloudfront:ListDistributions",
"cloudhsm:DescribeClusters",
"cloudsearch:DescribeDomains",
"cloudwatch:GetMetricData",
"cloudwatch:GetMetricStatistics",
"cloudwatch:ListMetrics",
"codebuild:ListProjects",
"datasync:ListTasks",
"dax:DescribeClusters",
"directconnect:DescribeConnections",
"dms:DescribeReplicationInstances",
"dynamodb:ListTables",
"dynamodb:ListTagsOfResource",
"ec2:DescribeAvailabilityZones",
"ec2:DescribeInstances",
"ec2:DescribeNatGateways",
"ec2:DescribeSpotFleetRequests",
"ec2:DescribeTransitGateways",
"ec2:DescribeVolumes",
"ec2:DescribeVpnConnections",
"ecs:ListClusters",
"eks:ListClusters",
"elasticache:DescribeCacheClusters",
"elasticbeanstalk:DescribeEnvironmentResources",
"elasticbeanstalk:DescribeEnvironments",
"elasticfilesystem:DescribeFileSystems",
"elasticloadbalancing:DescribeInstanceHealth",
"elasticloadbalancing:DescribeListeners",
"elasticloadbalancing:DescribeLoadBalancers",
"elasticloadbalancing:DescribeRules",
"elasticloadbalancing:DescribeTags",
"elasticloadbalancing:DescribeTargetHealth",
"elasticmapreduce:ListClusters",
"elastictranscoder:ListPipelines",
"es:ListDomainNames",
"events:ListEventBuses",
"firehose:ListDeliveryStreams",
"fsx:DescribeFileSystems",
"gamelift:ListFleets",
"glue:GetJobs",
"inspector:ListAssessmentTemplates",
"kafka:ListClusters",
"kinesis:ListStreams",
"kinesisanalytics:ListApplications",
"kinesisvideo:ListStreams",
"lambda:ListFunctions",
"lambda:ListTags",
"lex:GetBots",
"logs:DescribeLogGroups",
"mediaconnect:ListFlows",
"mediaconvert:DescribeEndpoints",
"mediapackage-vod:ListPackagingConfigurations",
"mediapackage:ListChannels",
"mediatailor:ListPlaybackConfigurations",
"opsworks:DescribeStacks",
"qldb:ListLedgers",
"rds:DescribeDBClusters",
"rds:DescribeDBInstances",
"rds:DescribeEvents",
"rds:ListTagsForResource",
"redshift:DescribeClusters",
"robomaker:ListSimulationJobs",
"route53:ListHostedZones",
"route53resolver:ListResolverEndpoints",
"s3:ListAllMyBuckets",
"sagemaker:ListEndpoints",
"sns:ListTopics",
"sqs:ListQueues",
"storagegateway:ListGateways",
"sts:GetCallerIdentity",
"swf:ListDomains",
"tag:GetResources",
"tag:GetTagKeys",
"transfer:ListServers",
"workmail:ListOrganizations",
"workspaces:DescribeWorkspaces"
],
"Resource": "*"
}
]
}

If you don't want to add permissions to all services, and just select permissions for certain services, consult the table below. The table contains a set of permissions that are required for All AWS cloud services and, for each supporting service, a list of optional permissions specific to that service.

Permissions required for AWS monitoring integration:
  • "cloudwatch:GetMetricData"
  • "cloudwatch:GetMetricStatistics"
  • "cloudwatch:ListMetrics"
  • "sts:GetCallerIdentity"
  • "tag:GetResources"
  • "tag:GetTagKeys"
  • "ec2:DescribeAvailabilityZones"
NamePermissions
All monitored Amazon services requiredcloudwatch:GetMetricData,
cloudwatch:GetMetricStatistics,
cloudwatch:ListMetrics,
sts:GetCallerIdentity,
tag:GetResources,
tag:GetTagKeys,
ec2:DescribeAvailabilityZones
AWS Certificate Manager Private Certificate Authorityacm-pca:ListCertificateAuthorities
Amazon MQ
Amazon API Gatewayapigateway:GET
AWS App Runnerapprunner:ListServices
Amazon AppStreamappstream:DescribeFleets
AWS AppSyncappsync:ListGraphqlApis
Amazon Athenaathena:ListWorkGroups
Amazon Aurorards:DescribeDBClusters
Amazon EC2 Auto Scalingautoscaling:DescribeAutoScalingGroups
Amazon EC2 Auto Scaling (built-in)autoscaling:DescribeAutoScalingGroups
AWS Billing
Amazon Keyspaces
AWS Chatbot
Amazon CloudFrontcloudfront:ListDistributions
AWS CloudHSMcloudhsm:DescribeClusters
Amazon CloudSearchcloudsearch:DescribeDomains
AWS CodeBuildcodebuild:ListProjects
Amazon Cognito
Amazon Connect
Amazon Elastic Kubernetes Service (EKS)eks:ListClusters
AWS DataSyncdatasync:ListTasks
Amazon DynamoDB Accelerator (DAX)dax:DescribeClusters
AWS Database Migration Service (AWS DMS)dms:DescribeReplicationInstances
Amazon DocumentDBrds:DescribeDBClusters
AWS Direct Connectdirectconnect:DescribeConnections
Amazon DynamoDBdynamodb:ListTables
Amazon DynamoDB (built-in)dynamodb:ListTables,
dynamodb:ListTagsOfResource
Amazon EBSec2:DescribeVolumes
Amazon EBS (built-in)ec2:DescribeVolumes
Amazon EC2 API
Amazon EC2 (built-in)ec2:DescribeInstances
Amazon EC2 Spot Fleetec2:DescribeSpotFleetRequests
Amazon Elastic Container Service (ECS)ecs:ListClusters
Amazon ECS Container Insightsecs:ListClusters
Amazon ElastiCache (EC)elasticache:DescribeCacheClusters
AWS Elastic Beanstalkelasticbeanstalk:DescribeEnvironments
Amazon Elastic File System (EFS)elasticfilesystem:DescribeFileSystems
Amazon Elastic Inference
Amazon Elastic Map Reduce (EMR)elasticmapreduce:ListClusters
Amazon Elasticsearch Service (ES)es:ListDomainNames
Amazon Elastic Transcoderelastictranscoder:ListPipelines
Amazon Elastic Load Balancer (ELB) (built-in)elasticloadbalancing:DescribeInstanceHealth,
elasticloadbalancing:DescribeListeners,
elasticloadbalancing:DescribeLoadBalancers,
elasticloadbalancing:DescribeRules,
elasticloadbalancing:DescribeTags,
elasticloadbalancing:DescribeTargetHealth
Amazon EventBridgeevents:ListEventBuses
Amazon FSxfsx:DescribeFileSystems
Amazon GameLiftgamelift:ListFleets
AWS Glueglue:GetJobs
Amazon Inspectorinspector:ListAssessmentTemplates
AWS Internet of Things (IoT)
AWS IoT Analytics
Amazon Managed Streaming for Kafkakafka:ListClusters
Amazon Kinesis Data Analyticskinesisanalytics:ListApplications
Amazon Data Firehosefirehose:ListDeliveryStreams
Amazon Kinesis Data Streamskinesis:ListStreams
Amazon Kinesis Video Streamskinesisvideo:ListStreams
AWS Lambdalambda:ListFunctions
AWS Lambda (built-in)lambda:ListFunctions,
lambda:ListTags
Amazon Lexlex:GetBots
Amazon Application and Network Load Balancer (built-in)elasticloadbalancing:DescribeInstanceHealth,
elasticloadbalancing:DescribeListeners,
elasticloadbalancing:DescribeLoadBalancers,
elasticloadbalancing:DescribeRules,
elasticloadbalancing:DescribeTags,
elasticloadbalancing:DescribeTargetHealth
Amazon CloudWatch Logslogs:DescribeLogGroups
AWS Elemental MediaConnectmediaconnect:ListFlows
AWS Elemental MediaConvertmediaconvert:DescribeEndpoints
AWS Elemental MediaPackage Livemediapackage:ListChannels
AWS Elemental MediaPackage Video on Demandmediapackage-vod:ListPackagingConfigurations
AWS Elemental MediaTailormediatailor:ListPlaybackConfigurations
Amazon VPC NAT Gatewaysec2:DescribeNatGateways
Amazon Neptunerds:DescribeDBClusters
AWS OpsWorksopsworks:DescribeStacks
Amazon Polly
Amazon QLDBqldb:ListLedgers
Amazon RDSrds:DescribeDBInstances
Amazon RDS (built-in)rds:DescribeDBInstances,
rds:DescribeEvents,
rds:ListTagsForResource
Amazon Redshiftredshift:DescribeClusters
Amazon Rekognition
AWS RoboMakerrobomaker:ListSimulationJobs
Amazon Route 53route53:ListHostedZones
Amazon Route 53 Resolverroute53resolver:ListResolverEndpoints
Amazon S3s3:ListAllMyBuckets
Amazon S3 (built-in)s3:ListAllMyBuckets
Amazon SageMaker Batch Transform Jobs
Amazon SageMaker Endpoint Instancessagemaker:ListEndpoints
Amazon SageMaker Endpointssagemaker:ListEndpoints
Amazon SageMaker Ground Truth
Amazon SageMaker Processing Jobs
Amazon SageMaker Training Jobs
AWS Service Catalog
Amazon Simple Email Service (SES)
Amazon Simple Notification Service (SNS)sns:ListTopics
Amazon Simple Queue Service (SQS)sqs:ListQueues
AWS Systems Manager - Run Command
AWS Step Functions
AWS Storage Gatewaystoragegateway:ListGateways
Amazon SWFswf:ListDomains
Amazon Textract
AWS IoT Things Graph
AWS Transfer Familytransfer:ListServers
AWS Transit Gatewayec2:DescribeTransitGateways
Amazon Translate
AWS Trusted Advisor
AWS API Usage
AWS Site-to-Site VPNec2:DescribeVpnConnections
AWS WAF Classic
AWS WAF
Amazon WorkMailworkmail:ListOrganizations
Amazon WorkSpacesworkspaces:DescribeWorkspaces

See the example of JSON policy for one single service below.

{
"Version": "2012-10-17",
"Statement": [
{
"Sid": "VisualEditor0",
"Effect": "Allow",
"Action": [
"apigateway:GET",
"cloudwatch:GetMetricData",
"cloudwatch:GetMetricStatistics",
"cloudwatch:ListMetrics",
"sts:GetCallerIdentity",
"tag:GetResources",
"tag:GetTagKeys",
"ec2:DescribeAvailabilityZones"
],
"Resource": "*"
}
]
}

In this example, from the complete list of permissions you need to select

  • "apigateway:GET" for Amazon API Gateway
  • "cloudwatch:GetMetricData", "cloudwatch:GetMetricStatistics", "cloudwatch:ListMetrics", "sts:GetCallerIdentity", "tag:GetResources", "tag:GetTagKeys", and "ec2:DescribeAvailabilityZones" for All AWS cloud services.
EndpointService
autoscaling.<REGION>.amazonaws.comAmazon EC2 Auto Scaling (built-in), Amazon EC2 Auto Scaling
lambda.<REGION>.amazonaws.comAWS Lambda (built-in), AWS Lambda
elasticloadbalancing.<REGION>.amazonaws.comAmazon Application and Network Load Balancer (built-in), Amazon Elastic Load Balancer (ELB) (built-in)
dynamodb.<REGION>.amazonaws.comAmazon DynamoDB (built-in), Amazon DynamoDB
ec2.<REGION>.amazonaws.comAmazon EBS (built-in), Amazon EC2 (built-in), Amazon EBS, Amazon EC2 Spot Fleet, Amazon VPC NAT Gateways, AWS Transit Gateway, AWS Site-to-Site VPN
rds.<REGION>.amazonaws.comAmazon RDS (built-in), Amazon Aurora, Amazon DocumentDB, Amazon Neptune, Amazon RDS
s3.<REGION>.amazonaws.comAmazon S3 (built-in)
acm-pca.<REGION>.amazonaws.comAWS Certificate Manager Private Certificate Authority
apigateway.<REGION>.amazonaws.comAmazon API Gateway
apprunner.<REGION>.amazonaws.comAWS App Runner
appstream2.<REGION>.amazonaws.comAmazon AppStream
appsync.<REGION>.amazonaws.comAWS AppSync
athena.<REGION>.amazonaws.comAmazon Athena
cloudfront.amazonaws.comAmazon CloudFront
cloudhsmv2.<REGION>.amazonaws.comAWS CloudHSM
cloudsearch.<REGION>.amazonaws.comAmazon CloudSearch
codebuild.<REGION>.amazonaws.comAWS CodeBuild
datasync.<REGION>.amazonaws.comAWS DataSync
dax.<REGION>.amazonaws.comAmazon DynamoDB Accelerator (DAX)
dms.<REGION>.amazonaws.comAWS Database Migration Service (AWS DMS)
directconnect.<REGION>.amazonaws.comAWS Direct Connect
ecs.<REGION>.amazonaws.comAmazon Elastic Container Service (ECS), Amazon ECS Container Insights
elasticfilesystem.<REGION>.amazonaws.comAmazon Elastic File System (EFS)
eks.<REGION>.amazonaws.comAmazon Elastic Kubernetes Service (EKS)
elasticache.<REGION>.amazonaws.comAmazon ElastiCache (EC)
elasticbeanstalk.<REGION>.amazonaws.comAWS Elastic Beanstalk
elastictranscoder.<REGION>.amazonaws.comAmazon Elastic Transcoder
es.<REGION>.amazonaws.comAmazon Elasticsearch Service (ES)
events.<REGION>.amazonaws.comAmazon EventBridge
fsx.<REGION>.amazonaws.comAmazon FSx
gamelift.<REGION>.amazonaws.comAmazon GameLift
glue.<REGION>.amazonaws.comAWS Glue
inspector.<REGION>.amazonaws.comAmazon Inspector
kafka.<REGION>.amazonaws.comAmazon Managed Streaming for Kafka
models.lex.<REGION>.amazonaws.comAmazon Lex
logs.<REGION>.amazonaws.comAmazon CloudWatch Logs
api.mediatailor.<REGION>.amazonaws.comAWS Elemental MediaTailor
mediaconnect.<REGION>.amazonaws.comAWS Elemental MediaConnect
mediapackage.<REGION>.amazonaws.comAWS Elemental MediaPackage Live
mediapackage-vod.<REGION>.amazonaws.comAWS Elemental MediaPackage Video on Demand
opsworks.<REGION>.amazonaws.comAWS OpsWorks
qldb.<REGION>.amazonaws.comAmazon QLDB
redshift.<REGION>.amazonaws.comAmazon Redshift
robomaker.<REGION>.amazonaws.comAWS RoboMaker
route53.amazonaws.comAmazon Route 53
route53resolver.<REGION>.amazonaws.comAmazon Route 53 Resolver
api.sagemaker.<REGION>.amazonaws.comAmazon SageMaker Endpoints, Amazon SageMaker Endpoint Instances
sns.<REGION>.amazonaws.comAmazon Simple Notification Service (SNS)
sqs.<REGION>.amazonaws.comAmazon Simple Queue Service (SQS)
storagegateway.<REGION>.amazonaws.comAWS Storage Gateway
swf.<REGION>.amazonaws.comAmazon SWF
transfer.<REGION>.amazonaws.comAWS Transfer Family
workmail.<REGION>.amazonaws.comAmazon WorkMail
workspaces.<REGION>.amazonaws.comAmazon WorkSpaces

Enable monitoring

To learn how to enable service monitoring, see Enable service monitoring.

View service metrics

You can view the service metrics in your Dynatrace environment either on the custom device overview page or on your Dashboards page.

View metrics on the custom device overview page

To access the custom device overview page

  1. Go to Technologies & Processes Classic.
  2. Filter by service name and select the relevant custom device group.
  3. Once you select the custom device group, you're on the custom device group overview page.
  4. The custom device group overview page lists all instances (custom devices) belonging to the group. Select an instance to view the custom device overview page.

View metrics on your dashboard

After you add the service to monitoring, a preset dashboard containing all recommended metrics is automatically listed on your Dashboards page. To look for specific dashboards, filter by Preset and then by Name.

AWS presets

For existing monitored services, you might need to resave your credentials for the preset dashboard to appear on the Dashboards page. To resave your credentials, go to Settings > Cloud and virtualization > AWS, select the desired AWS instance, and then select Save.

You can't make changes on a preset dashboard directly, but you can clone and edit it. To clone a dashboard, open the browse menu () and select Clone.

To remove a dashboard from the dashboards page, you can hide it. To hide a dashboard, open the browse menu () and select Hide.

Hiding a dashboard doesn't affect other users.

Clone hide AWS

To check the availability of preset dashboards for each AWS service, see the list below.

AWS servicePreset dashboard
Amazon EC2 Auto Scaling (built-in)Not applicable
AWS Lambda (built-in)Not applicable
Amazon Application and Network Load Balancer (built-in)Not applicable
Amazon DynamoDB (built-in)Not applicable
Amazon EBS (built-in)Not applicable
Amazon EC2 (built-in)Not applicable
Amazon Elastic Load Balancer (ELB) (built-in)Not applicable
Amazon RDS (built-in)Not applicable
Amazon S3 (built-in)Not applicable
AWS Certificate Manager Private Certificate AuthorityNot applicable
All monitored Amazon servicesNot applicable
Amazon API GatewayNot applicable
AWS App RunnerNot applicable
Amazon AppStreamApplicable
AWS AppSyncApplicable
Amazon AthenaApplicable
Amazon AuroraNot applicable
Amazon EC2 Auto ScalingApplicable
AWS BillingApplicable
Amazon KeyspacesApplicable
AWS ChatbotApplicable
Amazon CloudFrontNot applicable
AWS CloudHSMApplicable
Amazon CloudSearchApplicable
AWS CodeBuildApplicable
Amazon CognitoNot applicable
Amazon ConnectApplicable
AWS DataSyncApplicable
Amazon DynamoDB Accelerator (DAX)Applicable
AWS Database Migration Service (AWS DMS)Applicable
Amazon DocumentDBApplicable
AWS Direct ConnectApplicable
Amazon DynamoDBNot applicable
Amazon EBSNot applicable
Amazon EC2 Spot FleetNot applicable
Amazon EC2 APIApplicable
Amazon Elastic Container Service (ECS)Not applicable
Amazon ECS Container InsightsApplicable
Amazon Elastic File System (EFS)Not applicable
Amazon Elastic Kubernetes Service (EKS)Applicable
Amazon ElastiCache (EC)Not applicable
AWS Elastic BeanstalkApplicable
Amazon Elastic InferenceApplicable
Amazon Elastic TranscoderApplicable
Amazon Elastic Map Reduce (EMR)Not applicable
Amazon Elasticsearch Service (ES)Not applicable
Amazon EventBridgeApplicable
Amazon FSxApplicable
Amazon GameLiftApplicable
AWS GlueNot applicable
Amazon InspectorApplicable
AWS Internet of Things (IoT)Not applicable
AWS IoT Things GraphApplicable
AWS IoT AnalyticsApplicable
Amazon Managed Streaming for KafkaApplicable
Amazon Kinesis Data AnalyticsNot applicable
Amazon Data FirehoseNot applicable
Amazon Kinesis Data StreamsNot applicable
Amazon Kinesis Video StreamsNot applicable
AWS LambdaNot applicable
Amazon LexApplicable
Amazon CloudWatch LogsApplicable
AWS Elemental MediaTailorApplicable
AWS Elemental MediaConnectApplicable
AWS Elemental MediaConvertApplicable
AWS Elemental MediaPackage LiveApplicable
AWS Elemental MediaPackage Video on DemandApplicable
Amazon MQApplicable
Amazon VPC NAT GatewaysNot applicable
Amazon NeptuneApplicable
AWS OpsWorksApplicable
Amazon PollyApplicable
Amazon QLDBApplicable
Amazon RDSNot applicable
Amazon RedshiftNot applicable
Amazon RekognitionApplicable
AWS RoboMakerApplicable
Amazon Route 53Applicable
Amazon Route 53 ResolverApplicable
Amazon S3Not applicable
Amazon SageMaker Batch Transform JobsNot applicable
Amazon SageMaker EndpointsNot applicable
Amazon SageMaker Endpoint InstancesNot applicable
Amazon SageMaker Ground TruthNot applicable
Amazon SageMaker Processing JobsNot applicable
Amazon SageMaker Training JobsNot applicable
AWS Service CatalogApplicable
Amazon Simple Email Service (SES)Not applicable
Amazon Simple Notification Service (SNS)Not applicable
Amazon Simple Queue Service (SQS)Not applicable
AWS Systems Manager - Run CommandApplicable
AWS Step FunctionsApplicable
AWS Storage GatewayApplicable
Amazon SWFApplicable
Amazon TextractApplicable
AWS Transfer FamilyApplicable
AWS Transit GatewayApplicable
Amazon TranslateApplicable
AWS Trusted AdvisorApplicable
AWS API UsageApplicable
AWS Site-to-Site VPNApplicable
AWS WAF ClassicApplicable
AWS WAFApplicable
Amazon WorkMailApplicable
Amazon WorkSpacesApplicable

Msk

Available metrics

Cluster Name is the main dimension.

NameDescriptionUnitStatisticsDimensionsRecommended
ActiveControllerCountOnly one controller per cluster should be active at any given time.CountMultiCluster NameApplicable
ActiveControllerCountCountSumCluster NameApplicable
BytesInPerSecThe number of bytes per second received from clientsBytes/SecondMultiCluster Name, Broker ID
BytesInPerSecBytes/SecondMultiCluster Name, Broker ID, Topic
BytesInPerSecBytes/SecondSumCluster Name, Broker ID
BytesInPerSecBytes/SecondSumCluster Name, Broker ID, Topic
BytesOutPerSecThe number of bytes per second sent to clientsBytes/SecondMultiCluster Name, Broker ID
BytesOutPerSecBytes/SecondMultiCluster Name, Broker ID, Topic
BytesOutPerSecBytes/SecondSumCluster Name, Broker ID
BytesOutPerSecBytes/SecondSumCluster Name, Broker ID, Topic
CPUCreditBalanceThe number of earned creditsCountMultiCluster Name, Broker ID
CPUCreditBalanceCountSumCluster Name, Broker ID
CPUCreditUsageThe number of used creditsCountMultiCluster Name, Broker ID
CPUCreditUsageCountSumCluster Name, Broker ID
CpuIdleThe percentage of CPU idle timePercentMultiCluster Name, Broker IDApplicable
CpuIdlePercentSumCluster Name, Broker IDApplicable
CpuSystemThe percentage of CPU in kernel spacePercentMultiCluster Name, Broker IDApplicable
CpuSystemPercentSumCluster Name, Broker IDApplicable
CpuUserThe percentage of CPU in user spacePercentMultiCluster Name, Broker IDApplicable
CpuUserPercentSumCluster Name, Broker IDApplicable
FetchConsumerLocalTimeMsMeanThe mean time in milliseconds that the consumer request is processed at the leaderMillisecondsMultiCluster Name, Broker ID
FetchConsumerLocalTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchConsumerRequestQueueTimeMsMeanThe mean time in milliseconds that the consumer request waits in the request queueMillisecondsMultiCluster Name, Broker ID
FetchConsumerRequestQueueTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchConsumerResponseQueueTimeMsMeanThe mean time in milliseconds that the consumer request waits in the response queueMillisecondsMultiCluster Name, Broker ID
FetchConsumerResponseQueueTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchConsumerResponseSendTimeMsMeanMillisecondsMultiCluster Name, Broker ID
FetchConsumerResponseSendTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchConsumerTotalTimeMsMeanThe mean total time in milliseconds that consumers spend on fetching data from the brokerMillisecondsMultiCluster Name, Broker ID
FetchConsumerTotalTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchFollowerLocalTimeMsMeanThe mean time in milliseconds that the follower request is processed at the leaderMillisecondsMultiCluster Name, Broker ID
FetchFollowerLocalTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchFollowerRequestQueueTimeMsMeanThe mean time in milliseconds that the follower request waits in the request queueMillisecondsMultiCluster Name, Broker ID
FetchFollowerRequestQueueTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchFollowerResponseQueueTimeMsMeanThe mean time in milliseconds that the follower request waits in the response queueMillisecondsMultiCluster Name, Broker ID
FetchFollowerResponseQueueTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchFollowerResponseSendTimeMsMeanThe mean time in milliseconds for the follower to send a responseMillisecondsMultiCluster Name, Broker ID
FetchFollowerResponseSendTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchFollowerTotalTimeMsMeanThe mean total time in milliseconds that followers spend on fetching data from the brokerMillisecondsMultiCluster Name, Broker ID
FetchFollowerTotalTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchMessageConversionsPerSecThe number of fetch message conversions per second for the brokerCount/SecondMultiCluster Name, Broker ID
FetchMessageConversionsPerSecCount/SecondMultiCluster Name, Broker ID, Topic
FetchMessageConversionsPerSecCount/SecondSumCluster Name, Broker ID
FetchMessageConversionsPerSecCount/SecondSumCluster Name, Broker ID, Topic
FetchMessageConversionsTimeMsMeanThe mean total time in milliseconds that messages being fetched spend convertingMillisecondsMultiCluster Name, Broker ID
FetchMessageConversionsTimeMsMeanMillisecondsSumCluster Name, Broker ID
FetchThrottleByteRateThe number of throttled bytes per secondBytes/SecondMultiCluster Name, Broker ID
FetchThrottleByteRateBytes/SecondSumCluster Name, Broker ID
FetchThrottleQueueSizeThe number of messages in the throttle queueCountMultiCluster Name, Broker ID
FetchThrottleQueueSizeCountSumCluster Name, Broker ID
FetchThrottleTimeThe average fetch throttle time in millisecondsMillisecondsMultiCluster Name, Broker ID
FetchThrottleTimeMillisecondsSumCluster Name, Broker ID
GlobalPartitionCountTotal number of partitions across all brokers in the clusterCountMultiCluster NameApplicable
GlobalPartitionCountCountSumCluster NameApplicable
GlobalTopicCountTotal number of topics across all brokers in the clusterCountMultiCluster NameApplicable
GlobalTopicCountCountSumCluster NameApplicable
KafkaAppLogsDiskUsedThe percentage of disk space used for application logsPercentMultiCluster Name, Broker IDApplicable
KafkaAppLogsDiskUsedPercentSumCluster Name, Broker IDApplicable
KafkaDataLogsDiskUsedThe percentage of disk space used for data logsPercentMultiCluster Name, Broker IDApplicable
KafkaDataLogsDiskUsedPercentSumCluster Name, Broker IDApplicable
LeaderCountThe number of leader replicasCountMultiCluster Name, Broker ID
LeaderCountCountSumCluster Name, Broker ID
MemoryBufferedThe size in bytes of buffered memory for the brokerBytesMultiCluster Name, Broker IDApplicable
MemoryBufferedBytesSumCluster Name, Broker IDApplicable
MemoryCachedThe size in bytes of cached memory for the brokerBytesMultiCluster Name, Broker IDApplicable
MemoryCachedBytesSumCluster Name, Broker IDApplicable
MemoryFreeThe size in bytes of memory that is free and available for the brokerBytesMultiCluster Name, Broker IDApplicable
MemoryFreeBytesSumCluster Name, Broker IDApplicable
MemoryUsedThe size in bytes of memory that is in use for the brokerBytesMultiCluster Name, Broker IDApplicable
MemoryUsedBytesSumCluster Name, Broker IDApplicable
MessagesInPerSecThe number of incoming messages per second for the brokerCount/SecondMultiCluster Name, Broker ID
MaxOffsetLagThe maximum offset lag across all partitions in a topicCountMultiCluster Name, Consumer Group, Topic
MaxOffsetLagThe maximum offset lag across all partitions in a topicCountSumCluster Name, Consumer Group, Topic
MessagesInPerSecCount/SecondMultiCluster Name, Broker ID, Topic
MessagesInPerSecCount/SecondSumCluster Name, Broker ID
MessagesInPerSecCount/SecondSumCluster Name, Broker ID, Topic
NetworkProcessorAvgIdlePercentThe average percentage of the time the network processors are idlePercentMultiCluster Name, Broker ID
NetworkProcessorAvgIdlePercentPercentSumCluster Name, Broker ID
NetworkRxDroppedThe number of dropped receive packagesCountMultiCluster Name, Broker IDApplicable
NetworkRxDroppedCountSumCluster Name, Broker IDApplicable
NetworkRxErrorsThe number of network receive errors for the brokerCountMultiCluster Name, Broker IDApplicable
NetworkRxErrorsCountSumCluster Name, Broker IDApplicable
NetworkRxPacketsThe number of packets received by the brokerCountMultiCluster Name, Broker IDApplicable
NetworkRxPacketsCountSumCluster Name, Broker IDApplicable
NetworkTxDroppedThe number of dropped transmit packagesCountMultiCluster Name, Broker IDApplicable
NetworkTxDroppedCountSumCluster Name, Broker IDApplicable
NetworkTxErrorsThe number of network transmit errors for the brokerCountMultiCluster Name, Broker IDApplicable
NetworkTxErrorsCountSumCluster Name, Broker IDApplicable
NetworkTxPacketsThe number of packets transmitted by the brokerCountMultiCluster Name, Broker IDApplicable
NetworkTxPacketsCountSumCluster Name, Broker IDApplicable
OfflinePartitionsCountTotal number of partitions that are offline in the clusterCountMultiCluster NameApplicable
OfflinePartitionsCountCountSumCluster NameApplicable
PartitionCountThe number of partitions for the brokerCountMultiCluster Name, Broker ID
PartitionCountCountSumCluster Name, Broker ID
ProduceLocalTimeMsMeanThe mean time in milliseconds for the follower to send a responseMillisecondsMultiCluster Name, Broker ID
ProduceLocalTimeMsMeanMillisecondsSumCluster Name, Broker ID
ProduceMessageConversionsPerSecThe number of produce message conversions per second for the brokerCount/SecondMultiCluster Name, Broker ID
ProduceMessageConversionsPerSecCount/SecondMultiCluster Name, Broker ID, Topic
ProduceMessageConversionsPerSecCount/SecondSumCluster Name, Broker ID
ProduceMessageConversionsPerSecCount/SecondSumCluster Name, Broker ID, Topic
ProduceMessageConversionsTimeMsMeanThe mean time in milliseconds spent on message format conversionsMillisecondsMultiCluster Name, Broker ID
ProduceMessageConversionsTimeMsMeanMillisecondsSumCluster Name, Broker ID
ProduceRequestQueueTimeMsMeanThe mean time in milliseconds that request messages spend in the queueMillisecondsMultiCluster Name, Broker ID
ProduceRequestQueueTimeMsMeanMillisecondsSumCluster Name, Broker ID
ProduceResponseQueueTimeMsMeanThe mean time in milliseconds that response messages spend in the queueMillisecondsMultiCluster Name, Broker ID
ProduceResponseQueueTimeMsMeanMillisecondsSumCluster Name, Broker ID
ProduceResponseSendTimeMsMeanThe mean time in milliseconds spent on sending response messagesMillisecondsMultiCluster Name, Broker ID
ProduceResponseSendTimeMsMeanMillisecondsSumCluster Name, Broker ID
ProduceThrottleByteRateThe number of throttled bytes per secondBytes/SecondMultiCluster Name, Broker ID
ProduceThrottleByteRateBytes/SecondSumCluster Name, Broker ID
ProduceThrottleQueueSizeThe number of messages in the throttle queueCountMultiCluster Name, Broker ID
ProduceThrottleQueueSizeCountSumCluster Name, Broker ID
ProduceThrottleTimeThe average produce throttle time in millisecondsMillisecondsMultiCluster Name, Broker ID
ProduceThrottleTimeMillisecondsSumCluster Name, Broker ID
ProduceTotalTimeMsMeanThe mean produce time in millisecondsMillisecondsMultiCluster Name, Broker ID
ProduceTotalTimeMsMeanMillisecondsSumCluster Name, Broker ID
RequestBytesMeanThe mean number of request bytes for the brokerBytesMultiCluster Name, Broker ID
RequestBytesMeanBytesSumCluster Name, Broker ID
RequestExemptFromThrottleTimeThe average time in milliseconds spent in broker network and I/O threads to process requests that are exempt from throttlingMillisecondsMultiCluster Name, Broker ID
RequestExemptFromThrottleTimeMillisecondsSumCluster Name, Broker ID
RequestHandlerAvgIdlePercentThe average percentage of the time the request handler threads are idlePercentMultiCluster Name, Broker ID
RequestHandlerAvgIdlePercentPercentSumCluster Name, Broker ID
RequestThrottleQueueSizeThe number of messages in the throttle queueCountMultiCluster Name, Broker ID
RequestThrottleQueueSizeCountSumCluster Name, Broker ID
RequestThrottleTimeThe average request throttle time in millisecondsMillisecondsMultiCluster Name, Broker ID
RequestThrottleTimeMillisecondsSumCluster Name, Broker ID
RequestTimeThe average time in milliseconds spent in broker network and I/O threads to process requestsMillisecondsMultiCluster Name, Broker ID
RequestTimeMillisecondsSumCluster Name, Broker ID
RootDiskUsedThe percentage of the root disk used by the brokerPercentMultiCluster Name, Broker IDApplicable
RootDiskUsedPercentSumCluster Name, Broker IDApplicable
SumOffsetLagThe aggregated offset lag for all the partitions in a topicCountMultiCluster Name, Consumer Group, Topic
SwapFreeThe size in bytes of swap memory that is available for the brokerBytesMultiCluster Name, Broker IDApplicable
SwapFreeBytesSumCluster Name, Broker IDApplicable
SwapUsedThe size in bytes of swap memory that is in use for the brokerBytesMultiCluster Name, Broker IDApplicable
SwapUsedBytesSumCluster Name, Broker IDApplicable
UnderMinIsrPartitionCountThe number of under minIsr partitions for the brokerCountMultiCluster Name, Broker ID
UnderMinIsrPartitionCountCountSumCluster Name, Broker ID
UnderReplicatedPartitionsThe number of under-replicated partitions for the brokerCountMultiCluster Name, Broker ID
UnderReplicatedPartitionsCountSumCluster Name, Broker ID
ZooKeeperRequestLatencyMsMeanMean latency in milliseconds for ZooKeeper requests from brokerMillisecondsMultiCluster Name, Broker IDApplicable
ZooKeeperRequestLatencyMsMeanMillisecondsMultiCluster NameApplicable
ZooKeeperRequestLatencyMsMeanMillisecondsSumCluster Name, Broker IDApplicable
ZooKeeperRequestLatencyMsMeanMillisecondsSumCluster NameApplicable
ZooKeeperSessionStateConnection status of broker's ZooKeeper session which may be one of the following: NOT_CONNECTED: 0.0, ASSOCIATING: 0.1, CONNECTING: 0.5, CONNECTEDREADONLY: 0.8, CONNECTED: 1.0, CLOSED: 5.0, AUTH_FAILED: 10.0.CountMultiCluster Name, Broker IDApplicable
ZooKeeperSessionStateCountMultiCluster NameApplicable
ZooKeeperSessionStateCountSumCluster Name, Broker ID
ZooKeeperSessionStateCountSumCluster Name
Related tags
Infrastructure Observability