Set up metric events for alerting

To configure metric events for alerting

Go to Settings > Cloud and virtualization > AWS.
Under Metric events for alerting, select Manage alerting rules.
On the Metric events for alerting page, you can create, enable/disable, and configure recommended alerting rules.

For an overview of all recommended alerting rules for all supporting services, see the list below.

Name

Alerting rules

Amazon MQ

Amazon MQ store percent usage (Static threshold: above 95 %),
Amazon MQ temp percent usage (Static threshold: above 95 %),
Amazon MQ memory usage (by topic) (Static threshold: above 95 %),
Amazon MQ memory usage (by queue) (Static threshold: above 95 %),
Amazon MQ CPU utilization (Static threshold: above 95 %),
Amazon MQ heap usage (Static threshold: above 95 %),
Amazon MQ job scheduler store percent usage (Static threshold: above 95 %),
Amazon RabbitMQ CPU utilization (Static threshold: above 95 %)

AWS App Runner

AWS App Runner CPU utilization (by instance/service ID) (Static threshold: above 95 %)

Amazon AppStream

Amazon AppStream capacity utilization (Static threshold: above 95 %)

Amazon Aurora

Amazon Aurora CPU utilization (average, by region) (Static threshold: above 95 %),
Amazon Aurora CPU utilization (average, by region/engine) (Static threshold: above 95 %),
Amazon Aurora CPU utilization (average, by region/database class) (Static threshold: above 95 %),
Amazon Aurora CPU utilization (average) (Static threshold: above 95 %),
Amazon Aurora CPU utilization (average, by role) (Static threshold: above 95 %),
Amazon Aurora CPU utilization (maximum, by region) (Static threshold: above 95 %),
Amazon Aurora CPU utilization (maximum, by region/engine) (Static threshold: above 95 %),
Amazon Aurora CPU utilization (maximum, by region/database class) (Static threshold: above 95 %),
Amazon Aurora CPU utilization (maximum) (Static threshold: above 95 %),
Amazon Aurora CPU utilization (maximum, by role) (Static threshold: above 95 %)

Amazon Keyspaces

Amazon Keyspaces account provisioned read capacity utilization (by region) (Static threshold: above 95 %),
Amazon Keyspaces account provisioned write capacity utilization (by region) (Static threshold: above 95 %),
Amazon Keyspaces max provisioned table read capacity utilization (by region) (Static threshold: above 95 %),
Amazon Keyspaces max provisioned table write capacity utilization (by region) (Static threshold: above 95 %)

Amazon CloudFront

Amazon CloudFront total error rate (by region) (Static threshold: above 5 %),
Amazon CloudFront 4xx error rate (by region) (Static threshold: above 5 %),
Amazon CloudFront 5xx error rate (by region) (Static threshold: above 5 %)

Amazon CloudSearch

Amazon CloudSearch index utilization (Static threshold: above 95 %)

AWS CodeBuild

AWS CodeBuild CPU utilized percent (Static threshold: above 95 %),
AWS CodeBuild CPU utilized percent (by build id/build number) (Static threshold: above 95 %),
AWS CodeBuild memory utilized percent (Static threshold: above 95 %),
AWS CodeBuild memory utilized percent (by build id/build number) (Static threshold: above 95 %)

Amazon Connect

Amazon Connect the percentage of the concurrent calls service quota (by metric group) (Static threshold: above 95 %)

Amazon Elastic Kubernetes Service (EKS)

Amazon EKS Node CPU utilization (by instance id/node name) (Static threshold: above 95 %),
Amazon EKS Node memory utilization (by instance id/node name) (Static threshold: above 95 %),
Amazon EKS Pod CPU utilization over pod limit (by namespace) (Static threshold: above 95 %),
Amazon EKS Pod CPU utilization over pod limit (by namespace/pod name) (Static threshold: above 95 %),
Amazon EKS Pod CPU utilization over pod limit (Static threshold: above 95 %),
Amazon EKS Node filesystem utilization (by instance id/node name) (Static threshold: above 95 %),
Amazon EKS Pod memory utilization (by namespace) (Static threshold: above 95 %),
Amazon EKS Pod memory utilization (by namespace/pod name) (Static threshold: above 95 %),
Amazon EKS Service Pod memory utilization (Static threshold: above 95 %),
Amazon EKS Pod CPU utilization (by namespace) (Static threshold: above 95 %),
Amazon EKS Pod CPU utilization (by namespace/pod name) (Static threshold: above 95 %),
Amazon EKS Pod CPU utilization (Static threshold: above 95 %),
Amazon EKS Pod CPU reserved capacity (by namespace/pod name) (Static threshold: above 95 %),
Amazon EKS Pod CPU reserved capacity (Static threshold: above 95 %),
Amazon EKS Pod memory utilization over pod limit (by namespace) (Static threshold: above 95 %),
Amazon EKS Pod memory utilization over pod limit (by namespace/pod name) (Static threshold: above 95 %),
Amazon EKS Pod memory utilization over pod limit (Static threshold: above 95 %),
Amazon EKS Pod memory reserved capacity (by namespace/pod name) (Static threshold: above 95 %),
Amazon EKS Pod memory reserved capacity (Static threshold: above 95 %),
Amazon EKS Node CPU reserved capacity (by instance id/node name) (Static threshold: above 95 %),
Amazon EKS Node CPU reserved capacity (Static threshold: above 95 %),
Amazon EKS Node memory reserved capacity (by instance id/node name) (Static threshold: above 95 %),
Amazon EKS Node memory reserved capacity (Static threshold: above 95 %),
Amazon EKS Node CPU utilization (Static threshold: above 95 %),
Amazon EKS Node memory utilization (Static threshold: above 95 %),
Amazon EKS Node filesystem utilization (Static threshold: above 95 %)

Amazon DynamoDB Accelerator (DAX)

Amazon DynamoDB Accelerator CPU utilization (Static threshold: above 95 %),
Amazon DynamoDB Accelerator CPU utilization (by node id) (Static threshold: above 95 %),
Amazon DynamoDB Accelerator CPU utilization (by region) (Static threshold: above 95 %)

AWS Database Migration Service

AWS Database Migration Service CPU utilization (by replication task identifier) (Static threshold: above 95 %),
AWS Database Migration Service CPU utilization (Static threshold: above 95 %),
AWS Database Migration Service CPU utilization (by replication instance/external resource id) (Static threshold: above 95 %),
AWS Database Migration Service CPU utilization (by region) (Static threshold: above 95 %),
AWS Database Migration Service CPU utilization (by region/instance class) (Static threshold: above 95 %)

Amazon DocumentDB

Amazon DocumentDB CPU utilization (by region/DB instance identifier) (Static threshold: above 95 %),
Amazon DocumentDB CPU utilization (by role) (Static threshold: above 95 %),
Amazon DocumentDB CPU utilization (Static threshold: above 95 %)

Amazon Elastic Container Service (ECS)

Amazon ECS CPU reservation (Static threshold: above 95 %),
Amazon ECS CPU utilization (Static threshold: above 95 %),
Amazon ECS CPU utilization (by service name) (Static threshold: above 95 %),
Amazon ECS Memory reservation (Static threshold: above 95 %),
Amazon ECS Memory utilization (Static threshold: above 95 %),
Amazon ECS Memory utilization (by service name) (Static threshold: above 95 %)

Amazon ECS ContainerInsights

Amazon ECS ContainerInsights instance memory utilization (by container instance id/instance id) (Static threshold: above 95 %),
Amazon ECS ContainerInsights instance memory utilization (Static threshold: above 95 %),
Amazon ECS ContainerInsights instance memory reserved capacity (by container instance id/instance id) (Static threshold: above 95 %),
Amazon ECS ContainerInsights instance memory reserved capacity (Static threshold: above 95 %),
Amazon ECS ContainerInsights instance CPU utilization (by container instance id/instance id) (Static threshold: above 95 %),
Amazon ECS ContainerInsights instance CPU utilization (Static threshold: above 95 %),
Amazon ECS ContainerInsights instance filesystem utilization (by container instance id/instance id) (Static threshold: above 95 %),
Amazon ECS ContainerInsights instance filesystem utilization (Static threshold: above 95 %),
Amazon ECS ContainerInsights instance CPU reserved capacity (by container instance id/instance id) (Static threshold: above 95 %),
Amazon ECS ContainerInsights instance CPU reserved capacity (Static threshold: above 95 %)

The number of recommended alerting rules depends on the number of your monitored supporting services.
To add recommended alerting rules for a new supporting service, you first need to add the new service to monitoring.

Go to Settings > Cloud and virtualization > AWS.
On the AWS overview page, select the edit button (pencil icon) for the AWS instance you want to edit.
Select Manage services and Add service, choose the service name from the list, and select Add service.
Select Save changes.

Note that not all supporting services have their own predefined alerting rules.

Create and enable alerting rules.

To enable recommended alerting rules, you first need to create them. You can create alerting rules and automatically enable them, or (if you clear Automatically enable created rules) create them and manually enable them after possible configuration changes.

For example, you can create and automatically enable a first batch of alerts. When you start monitoring new services, you can create alerts for these new services without automatically enabling them (because you want to configure them first).
Configure alerting rules. How you edit rules depends on whether you chose to automatically enable alerts.
- If you chose to automatically enable alerts when creating them, go to Adjust recommended alerting rules, expand Enabled recommended alerting rules, and select any rule. This takes you to Edit custom event for alerting, where you can change the configuration rules for that specific service.
- If you didn't choose to automatically enable alerts when creating them, go to Enable recommended alerting rules, expand Disabled recommended alerting rules, and select any of the disabled rules. This takes you to the same Edit custom event for alerting page.
Disable alerting rules.
You can disable all alerting rules, or disable or delete them selectively.
- To disable all alerting rules, go to Adjust recommended alerting rules and select Disable all enabled recommended alerting rules.
- To disable or delete alerting rules selectively, go to Adjust recommended alerting rules and select Metric events. On the Metric events page, you can disable an alert by turning it off in the On/Off column, or you can delete it by selecting x in the Delete column.
If you disable any or all of the alerting rules, you can always re-enable them.

Set up metric events for alerting

Related topics