Set up metric events for alerting

To configure metric events for alerting:

  1. Go to Settings > Cloud and virtualization > AWS.
  2. Under Metric events for alerting, select Manage alerting rules.
  3. On the Metric events for alerting page, you can create, enable/disable, and configure recommended alerting rules.

For an overview of all recommended alerting rules for all supporting services, see the list below.

Service name, followed by its recommended alerting rules:

Amazon MQ
  • Amazon MQ store percent usage (Static threshold: above 95 %)
  • Amazon MQ temp percent usage (Static threshold: above 95 %)
  • Amazon MQ memory usage (by topic) (Static threshold: above 95 %)
  • Amazon MQ memory usage (by queue) (Static threshold: above 95 %)
  • Amazon MQ CPU utilization (Static threshold: above 95 %)
  • Amazon MQ heap usage (Static threshold: above 95 %)
  • Amazon MQ job scheduler store percent usage (Static threshold: above 95 %)
  • Amazon RabbitMQ CPU utilization (Static threshold: above 95 %)

AWS App Runner
  • AWS App Runner CPU utilization (by instance/service ID) (Static threshold: above 95 %)

Amazon AppStream
  • Amazon AppStream capacity utilization (Static threshold: above 95 %)

Amazon Aurora
  • Amazon Aurora CPU utilization (average, by region) (Static threshold: above 95 %)
  • Amazon Aurora CPU utilization (average, by region/engine) (Static threshold: above 95 %)
  • Amazon Aurora CPU utilization (average, by region/database class) (Static threshold: above 95 %)
  • Amazon Aurora CPU utilization (average) (Static threshold: above 95 %)
  • Amazon Aurora CPU utilization (average, by role) (Static threshold: above 95 %)
  • Amazon Aurora CPU utilization (maximum, by region) (Static threshold: above 95 %)
  • Amazon Aurora CPU utilization (maximum, by region/engine) (Static threshold: above 95 %)
  • Amazon Aurora CPU utilization (maximum, by region/database class) (Static threshold: above 95 %)
  • Amazon Aurora CPU utilization (maximum) (Static threshold: above 95 %)
  • Amazon Aurora CPU utilization (maximum, by role) (Static threshold: above 95 %)

Amazon Keyspaces
  • Amazon Keyspaces account provisioned read capacity utilization (by region) (Static threshold: above 95 %)
  • Amazon Keyspaces account provisioned write capacity utilization (by region) (Static threshold: above 95 %)
  • Amazon Keyspaces max provisioned table read capacity utilization (by region) (Static threshold: above 95 %)
  • Amazon Keyspaces max provisioned table write capacity utilization (by region) (Static threshold: above 95 %)

Amazon CloudFront
  • Amazon CloudFront total error rate (by region) (Static threshold: above 5 %)
  • Amazon CloudFront 4xx error rate (by region) (Static threshold: above 5 %)
  • Amazon CloudFront 5xx error rate (by region) (Static threshold: above 5 %)

Amazon CloudSearch
  • Amazon CloudSearch index utilization (Static threshold: above 95 %)

AWS CodeBuild
  • AWS CodeBuild CPU utilized percent (Static threshold: above 95 %)
  • AWS CodeBuild CPU utilized percent (by build id/build number) (Static threshold: above 95 %)
  • AWS CodeBuild memory utilized percent (Static threshold: above 95 %)
  • AWS CodeBuild memory utilized percent (by build id/build number) (Static threshold: above 95 %)

Amazon Connect
  • Amazon Connect the percentage of the concurrent calls service quota (by metric group) (Static threshold: above 95 %)

Amazon Elastic Kubernetes Service (EKS)
  • Amazon EKS Node CPU utilization (by instance id/node name) (Static threshold: above 95 %)
  • Amazon EKS Node memory utilization (by instance id/node name) (Static threshold: above 95 %)
  • Amazon EKS Pod CPU utilization over pod limit (by namespace) (Static threshold: above 95 %)
  • Amazon EKS Pod CPU utilization over pod limit (by namespace/pod name) (Static threshold: above 95 %)
  • Amazon EKS Pod CPU utilization over pod limit (Static threshold: above 95 %)
  • Amazon EKS Node filesystem utilization (by instance id/node name) (Static threshold: above 95 %)
  • Amazon EKS Pod memory utilization (by namespace) (Static threshold: above 95 %)
  • Amazon EKS Pod memory utilization (by namespace/pod name) (Static threshold: above 95 %)
  • Amazon EKS Service Pod memory utilization (Static threshold: above 95 %)
  • Amazon EKS Pod CPU utilization (by namespace) (Static threshold: above 95 %)
  • Amazon EKS Pod CPU utilization (by namespace/pod name) (Static threshold: above 95 %)
  • Amazon EKS Pod CPU utilization (Static threshold: above 95 %)
  • Amazon EKS Pod CPU reserved capacity (by namespace/pod name) (Static threshold: above 95 %)
  • Amazon EKS Pod CPU reserved capacity (Static threshold: above 95 %)
  • Amazon EKS Pod memory utilization over pod limit (by namespace) (Static threshold: above 95 %)
  • Amazon EKS Pod memory utilization over pod limit (by namespace/pod name) (Static threshold: above 95 %)
  • Amazon EKS Pod memory utilization over pod limit (Static threshold: above 95 %)
  • Amazon EKS Pod memory reserved capacity (by namespace/pod name) (Static threshold: above 95 %)
  • Amazon EKS Pod memory reserved capacity (Static threshold: above 95 %)
  • Amazon EKS Node CPU reserved capacity (by instance id/node name) (Static threshold: above 95 %)
  • Amazon EKS Node CPU reserved capacity (Static threshold: above 95 %)
  • Amazon EKS Node memory reserved capacity (by instance id/node name) (Static threshold: above 95 %)
  • Amazon EKS Node memory reserved capacity (Static threshold: above 95 %)
  • Amazon EKS Node CPU utilization (Static threshold: above 95 %)
  • Amazon EKS Node memory utilization (Static threshold: above 95 %)
  • Amazon EKS Node filesystem utilization (Static threshold: above 95 %)

Amazon DynamoDB Accelerator (DAX)
  • Amazon DynamoDB Accelerator CPU utilization (Static threshold: above 95 %)
  • Amazon DynamoDB Accelerator CPU utilization (by node id) (Static threshold: above 95 %)
  • Amazon DynamoDB Accelerator CPU utilization (by region) (Static threshold: above 95 %)

AWS Database Migration Service
  • AWS Database Migration Service CPU utilization (by replication task identifier) (Static threshold: above 95 %)
  • AWS Database Migration Service CPU utilization (Static threshold: above 95 %)
  • AWS Database Migration Service CPU utilization (by replication instance/external resource id) (Static threshold: above 95 %)
  • AWS Database Migration Service CPU utilization (by region) (Static threshold: above 95 %)
  • AWS Database Migration Service CPU utilization (by region/instance class) (Static threshold: above 95 %)

Amazon DocumentDB
  • Amazon DocumentDB CPU utilization (by region/DB instance identifier) (Static threshold: above 95 %)
  • Amazon DocumentDB CPU utilization (by role) (Static threshold: above 95 %)
  • Amazon DocumentDB CPU utilization (Static threshold: above 95 %)

Amazon Elastic Container Service (ECS)
  • Amazon ECS CPU reservation (Static threshold: above 95 %)
  • Amazon ECS CPU utilization (Static threshold: above 95 %)
  • Amazon ECS CPU utilization (by service name) (Static threshold: above 95 %)
  • Amazon ECS Memory reservation (Static threshold: above 95 %)
  • Amazon ECS Memory utilization (Static threshold: above 95 %)
  • Amazon ECS Memory utilization (by service name) (Static threshold: above 95 %)

Amazon ECS ContainerInsights
  • Amazon ECS ContainerInsights instance memory utilization (by container instance id/instance id) (Static threshold: above 95 %)
  • Amazon ECS ContainerInsights instance memory utilization (Static threshold: above 95 %)
  • Amazon ECS ContainerInsights instance memory reserved capacity (by container instance id/instance id) (Static threshold: above 95 %)
  • Amazon ECS ContainerInsights instance memory reserved capacity (Static threshold: above 95 %)
  • Amazon ECS ContainerInsights instance CPU utilization (by container instance id/instance id) (Static threshold: above 95 %)
  • Amazon ECS ContainerInsights instance CPU utilization (Static threshold: above 95 %)
  • Amazon ECS ContainerInsights instance filesystem utilization (by container instance id/instance id) (Static threshold: above 95 %)
  • Amazon ECS ContainerInsights instance filesystem utilization (Static threshold: above 95 %)
  • Amazon ECS ContainerInsights instance CPU reserved capacity (by container instance id/instance id) (Static threshold: above 95 %)
  • Amazon ECS ContainerInsights instance CPU reserved capacity (Static threshold: above 95 %)
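Every rule in the list above follows the same pattern: a rule name followed by "(Static threshold: above N %)". As a rough illustration of what such an entry means, the following Python sketch parses one entry and checks a single metric value against its threshold. The names StaticThresholdRule, parse_rule, and is_violated are hypothetical and are not part of any product API. The sketch compares a single sample only; how the monitoring platform actually evaluates samples over time is not described here and is not modeled.

```python
import re
from dataclasses import dataclass

# Matches one list entry, for example:
#   "Amazon MQ memory usage (by topic) (Static threshold: above 95 %)"
# Only the "above" form is handled because it is the only one in the list.
RULE_PATTERN = re.compile(
    r"^(?P<name>.+?)\s*\(Static threshold: above (?P<value>\d+(?:\.\d+)?) %\)$"
)


@dataclass(frozen=True)
class StaticThresholdRule:
    """Hypothetical representation of one recommended alerting rule."""
    name: str
    threshold_percent: float

    def is_violated(self, value_percent: float) -> bool:
        # "above N %" is read here as strictly greater than N (an assumption).
        return value_percent > self.threshold_percent


def parse_rule(line: str) -> StaticThresholdRule:
    """Parse one entry, tolerating a leading bullet and a trailing comma."""
    cleaned = line.strip().lstrip("• ").rstrip(",").strip()
    match = RULE_PATTERN.match(cleaned)
    if match is None:
        raise ValueError(f"not a rule entry: {line!r}")
    return StaticThresholdRule(match["name"], float(match["value"]))


if __name__ == "__main__":
    rule = parse_rule("Amazon CloudFront 5xx error rate (by region) (Static threshold: above 5 %)")
    print(rule.name, rule.threshold_percent, rule.is_violated(7.5))
```

Service name lines such as "Amazon MQ" do not match the pattern and raise ValueError, which makes it easy to tell service headings apart from rule entries when reading the list programmatically.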

The number of recommended alerting rules available to you depends on how many supporting services you monitor.
To add recommended alerting rules for a new supporting service, you first need to add the new service to monitoring.

  1. Go to Settings > Cloud and virtualization > AWS.
  2. On the AWS overview page, select the edit button (pencil icon) for the AWS instance you want to edit.
  3. Select Manage services and Add service, choose the service name from the list, and select Add service.
  4. Select Save changes.

Note that not all supporting services have their own predefined alerting rules.

  1. Create and enable alerting rules.

    To enable recommended alerting rules, you first need to create them. You can create alerting rules and enable them automatically, or (if you clear Automatically enable created rules) create them and enable them manually after making any configuration changes.

    For example, you can create and automatically enable a first batch of alerts. When you start monitoring new services, you can create alerts for these new services without automatically enabling them (because you want to configure them first).

  2. Configure alerting rules. How you edit rules depends on whether you chose to automatically enable alerts.

    • If you chose to automatically enable alerts when creating them, go to Adjust recommended alerting rules, expand Enabled recommended alerting rules, and select any rule. This takes you to Edit custom event for alerting, where you can change the configuration rules for that specific service.

    • If you didn't choose to automatically enable alerts when creating them, go to Enable recommended alerting rules, expand Disabled recommended alerting rules, and select any of the disabled rules. This takes you to the same Edit custom event for alerting page.

  3. Disable alerting rules.

    You can disable all alerting rules, or disable or delete them selectively.

    • To disable all alerting rules, go to Adjust recommended alerting rules and select Disable all enabled recommended alerting rules.
    • To disable or delete alerting rules selectively, go to Adjust recommended alerting rules and select Metric events. On the Metric events page, you can disable an alert by turning it off in the On/Off column, or you can delete it by selecting x in the Delete column.

    If you disable any or all of the alerting rules, you can always re-enable them.

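The create, enable, disable, and delete steps above can be summarized as a small state model. The following Python sketch is purely illustrative: RuleSet and its methods are hypothetical names, it keeps everything in memory, and it does not call any product API. It assumes that creating a rule that already exists leaves its current state unchanged, which the steps above do not specify.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable


@dataclass
class RuleSet:
    """Hypothetical in-memory model of recommended alerting rules."""
    # Maps rule name to True (enabled) or False (disabled).
    enabled: Dict[str, bool] = field(default_factory=dict)

    def create(self, names: Iterable[str], auto_enable: bool = True) -> None:
        # Mirrors the "Automatically enable created rules" choice.
        for name in names:
            # Assumption: an existing rule keeps its current state.
            self.enabled.setdefault(name, auto_enable)

    def set_enabled(self, name: str, value: bool) -> None:
        # A rule must be created before it can be enabled or disabled.
        if name not in self.enabled:
            raise KeyError(f"rule has not been created: {name}")
        self.enabled[name] = value

    def disable_all(self) -> None:
        for name in self.enabled:
            self.enabled[name] = False

    def delete(self, name: str) -> None:
        del self.enabled[name]


if __name__ == "__main__":
    rules = RuleSet()
    # First batch: create and enable immediately.
    rules.create(["Amazon MQ CPU utilization"], auto_enable=True)
    # Later batch: create disabled, configure, then enable manually.
    rules.create(["Amazon ECS CPU utilization"], auto_enable=False)
    rules.set_enabled("Amazon ECS CPU utilization", True)
    print(rules.enabled)
```

Disabling keeps the rule in the set so that it can be re-enabled later, whereas deleting removes it and it must be created again.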