Cluster event notifications

You can manage the following notifications in a Dynatrace Managed cluster:

  • Insufficient disk space on a cluster node
    Triggered when a disk partition on a cluster node has less disk space than required for a given storage type. In that case, you need to extend your disk or review cluster settings. Otherwise, data retention might be reduced.
  • Insufficient hardware on a cluster node
    Triggered when a cluster node doesn't have enough CPU cores and RAM to meet recommended requirements. For details, see Managed hardware requirements.
  • Metrics storage exceeds supported size (4TiB)
    Triggered when the metrics storage on a cluster node exceeds the supported size. In that case, you need to review your monitoring settings or add nodes to the cluster.
  • Transaction storage retention period truncation
    Triggered when data retention has been automatically reduced to store new data. In that case, we recommend that you adjust the target retention time, review monitoring settings, or extend a disk.
  • Adaptive Load Reduction activity
    Triggered when a cluster node is considered overloaded and unable to keep up with processing incoming requests. In that case, you need to review monitoring settings or increase CPU cores and RAM on the cluster node. For details, see Adaptive Traffic Management for Dynatrace Managed.

To configure cluster event notification settings

To manage cluster event notifications through the Cluster API v2, follow the procedure below:

Authentication

To generate a cluster token with the Write settings and Read settings scopes

  1. Go to Settings > API tokens.
  2. In the Cluster tokens section, select Generate token.
  3. Enter a name for your token and define the Write settings and Read settings access scopes of your cluster API token.
  4. Save and then Copy the token to a secure location.

Configure email notifications

To send your configuration as a JSON payload, use the POST an object endpoint with a cluster token that has the appropriate access scopes:

[
  {
    "schemaId": "builtin:cluster-events-notification-settings",
    "scope": "cluster",
    "value": {
      "insufficientDiskSpace": {
        "sendEmail": true
      },
      "insufficientHardware": {
        "sendEmail": true
      },
      "insufficientMetricStorage": {
        "sendEmail": false
      },
      "transactionStorageTruncation": {
        "sendEmail": false
      },
      "adaptiveLoadReductionActivated": {
        "sendEmail": false
      }
    }
  }
]
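
As a minimal sketch of how such a payload could be assembled programmatically, the Python snippet below builds the same structure from a map of event names to booleans. The property names, schema ID, and scope come from the example above; the function name `build_payload` and the constant names are hypothetical and chosen only for illustration.

```python
import json

# Property names and schema ID as they appear in the example payload above.
SCHEMA_ID = "builtin:cluster-events-notification-settings"
EVENT_KEYS = (
    "insufficientDiskSpace",
    "insufficientHardware",
    "insufficientMetricStorage",
    "transactionStorageTruncation",
    "adaptiveLoadReductionActivated",
)


def build_payload(send_email):
    """Return the POST body: a list holding a single settings object.

    send_email maps every event key to True or False.
    (Hypothetical helper, not part of any Dynatrace library.)
    """
    missing = [key for key in EVENT_KEYS if key not in send_email]
    if missing:
        raise ValueError(f"missing event keys: {missing}")
    value = {key: {"sendEmail": bool(send_email[key])} for key in EVENT_KEYS}
    return [{"schemaId": SCHEMA_ID, "scope": "cluster", "value": value}]


if __name__ == "__main__":
    # Enable email only for the disk space and hardware events.
    enabled = {"insufficientDiskSpace", "insufficientHardware"}
    flags = {key: key in enabled for key in EVENT_KEYS}
    print(json.dumps(build_payload(flags), indent=2))
```

Requiring every key up front is a design choice of this sketch; it avoids relying on whatever the API does with omitted properties.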

Read configuration schema

To learn the JSON format required to post your configuration, use the GET a schema endpoint with a cluster token that has the appropriate access scopes. The configuration schema identifier (schemaId) is builtin:cluster-events-notification-settings.

{
  "dynatrace": "1",
  "schemaId": "builtin:cluster-events-notification-settings",
  "displayName": "Cluster events notification settings",
  "description": "Configuration of emails notifications for cluster events.",
  "documentation": "Email notifications are sent to recipients defined in Settings - Emails - Email notifications.",
  "version": "0",
  "multiObject": false,
  "maxObjects": 1,
  "allowedScopes": [
    "cluster"
  ],
  "enums": {},
  "types": {
    "SingleEventSettings": {
      "version": "0",
      "versionInfo": "",
      "displayName": "Single event settings",
      "summaryPattern": "",
      "description": "Settings for a single cluster event",
      "documentation": "",
      "properties": {
        "sendEmail": {
          "displayName": "Send email",
          "description": "",
          "documentation": "",
          "type": "boolean",
          "nullable": false,
          "maxObjects": 1,
          "modificationPolicy": "DEFAULT",
          "default": true
        }
      },
      "type": "object"
    }
  },
  "properties": {
    "insufficientDiskSpace": {
      "displayName": "Insufficient disk space on a cluster node",
      "description": "Triggered when a disk partition on a cluster node has less disk space than required for a given storage type. In that case, you need to extend your disk or review cluster settings. Otherwise, data retention might be reduced.",
      "documentation": "",
      "type": {
        "$ref": "#/types/SingleEventSettings"
      },
      "nullable": false,
      "maxObjects": 1,
      "modificationPolicy": "DEFAULT"
    },
    "insufficientHardware": {
      "displayName": "Insufficient hardware on a cluster node",
      "description": "Triggered when a cluster node doesn't have enough CPU cores and RAM to meet recommended requirements.",
      "documentation": "",
      "type": {
        "$ref": "#/types/SingleEventSettings"
      },
      "nullable": false,
      "maxObjects": 1,
      "modificationPolicy": "DEFAULT"
    },
    "insufficientMetricStorage": {
      "displayName": "Metrics storage exceeds supported size (4TiB)",
      "description": "Triggered when Metrics storage a cluster node is too high. In that case, you need to review your monitoring settings or add additional nodes to the cluster.",
      "documentation": "",
      "type": {
        "$ref": "#/types/SingleEventSettings"
      },
      "nullable": false,
      "maxObjects": 1,
      "modificationPolicy": "DEFAULT"
    },
    "transactionStorageTruncation": {
      "displayName": "Transaction storage retention period truncation",
      "description": "Triggered when data retention has been automatically reduced to store new data. In that case, it is recommended to adjust target retention time, review monitoring settings or extend a disk.",
      "documentation": "",
      "type": {
        "$ref": "#/types/SingleEventSettings"
      },
      "nullable": false,
      "maxObjects": 1,
      "modificationPolicy": "DEFAULT"
    },
    "adaptiveLoadReductionActivated": {
      "displayName": "Adaptive Load Reduction activity",
      "description": "Triggered when a cluster node has been considered as overloaded and not able to keep up with processing incoming requests. In that case, you need to review monitoring settings or increase CPU cores and RAM on the cluster node.",
      "documentation": "",
      "type": {
        "$ref": "#/types/SingleEventSettings"
      },
      "nullable": false,
      "maxObjects": 1,
      "modificationPolicy": "DEFAULT"
    }
  }
}
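
To illustrate how the schema describes a valid configuration, here is a rough Python sketch that checks a candidate `value` object against the parsed schema before it is posted. It only handles the constructs this particular schema uses (top-level properties that reference a type made of boolean fields). The function name `validate_value` is hypothetical, and treating an omitted property as an error is an assumption of the sketch; the API itself may be more lenient because the schema declares defaults.

```python
def validate_value(schema, value):
    """Return a list of problems found in `value`, checked against `schema`.

    `schema` is the parsed JSON returned by the GET a schema endpoint.
    (Hypothetical helper covering only the constructs used by this schema.)
    """
    problems = []
    expected = schema["properties"]

    # Report properties the schema does not define.
    for name in value:
        if name not in expected:
            problems.append(f"unknown property: {name}")

    for name, spec in expected.items():
        if name not in value:
            # Assumption: this sketch is strict and requires every property.
            problems.append(f"missing property: {name}")
            continue
        # Resolve a reference such as "#/types/SingleEventSettings".
        type_name = spec["type"]["$ref"].rsplit("/", 1)[-1]
        fields = schema["types"][type_name]["properties"]
        entry = value[name]
        if not isinstance(entry, dict):
            problems.append(f"{name} must be an object")
            continue
        for field, field_spec in fields.items():
            if field_spec["type"] == "boolean" and not isinstance(entry.get(field), bool):
                problems.append(f"{name}.{field} must be a boolean")
    return problems
```

An empty list means the value matches the schema as far as this sketch can tell; the cluster remains the authority on what it accepts.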

Read current configuration

To check the current configuration, use the GET objects endpoint (/api/cluster/v2/settings/objects?schemaIds=builtin:cluster-events-notification-settings&scopes=cluster) with a cluster token that has the appropriate access scopes.

  • If these settings have been previously changed, the items list contains a single object. Use the objectId from the list in making subsequent updates.
  • If the items list is empty, the default value is used (not visible in Dynatrace API):
    {
      "value": {
        "insufficientDiskSpace": {
          "sendEmail": true
        },
        "insufficientHardware": {
          "sendEmail": true
        },
        "insufficientMetricStorage": {
          "sendEmail": true
        },
        "transactionStorageTruncation": {
          "sendEmail": false
        },
        "adaptiveLoadReductionActivated": {
          "sendEmail": false
        }
      }
    }
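
The two cases above can be sketched in Python as a small helper that takes the parsed response of the GET objects call and returns the object ID together with the effective value. The helper name `current_settings` is hypothetical, and the sketch assumes the response body is a JSON object with an `items` list whose entries carry `objectId` and `value`. The fallback copies the default value listed above.

```python
import copy

# Default value as listed above; used when the items list is empty.
DEFAULT_VALUE = {
    "insufficientDiskSpace": {"sendEmail": True},
    "insufficientHardware": {"sendEmail": True},
    "insufficientMetricStorage": {"sendEmail": True},
    "transactionStorageTruncation": {"sendEmail": False},
    "adaptiveLoadReductionActivated": {"sendEmail": False},
}


def current_settings(response):
    """Return (object_id, value) for a parsed GET objects response.

    object_id is None when no settings object exists yet.
    (Hypothetical helper; the response shape is an assumption.)
    """
    items = response.get("items", [])
    if not items:
        # Nothing stored: hand back a copy so callers cannot alter the default.
        return None, copy.deepcopy(DEFAULT_VALUE)
    first = items[0]  # the schema allows at most one object
    return first["objectId"], first["value"]
```

Keeping the returned object ID around makes it available for later updates to the same settings object.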

Create notification settings object

To create a cluster event notification settings object, use the POST objects endpoint with a cluster token that has the appropriate access scopes. Use the ID of the newly created object (event notification settings) for later notification settings updates.

In this example, using a POST API call to the /api/cluster/v2/settings/objects endpoint and builtin:cluster-events-notification-settings schema, you create a notification settings object in your cluster settings:

[
  {
    "schemaId": "builtin:cluster-events-notification-settings",
    "scope": "cluster",
    "value": {
      "insufficientDiskSpace": {
        "sendEmail": true
      },
      "insufficientHardware": {
        "sendEmail": true
      },
      "insufficientMetricStorage": {
        "sendEmail": false
      },
      "transactionStorageTruncation": {
        "sendEmail": false
      },
      "adaptiveLoadReductionActivated": {
        "sendEmail": false
      }
    }
  }
]
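
A possible way to issue this call from Python, using only the standard library, is sketched below. The endpoint path comes from the text above. The `Authorization: Api-Token <token>` header format, the helper name `build_create_request`, and the example base URL are assumptions that should be checked against your cluster before use.

```python
import json
import urllib.request


def build_create_request(base_url, token, payload):
    """Prepare (but do not send) the POST request that creates the object.

    base_url: cluster URL, for example "https://managed.example.com" (placeholder).
    token:    cluster token with the Write settings and Read settings scopes.
    payload:  the JSON-serializable list shown in the example above.
    """
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/api/cluster/v2/settings/objects",
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={
            # Assumption: the token is passed in the Api-Token authorization scheme.
            "Authorization": f"Api-Token {token}",
            "Content-Type": "application/json",
        },
    )


# Sending the request (requires network access to the cluster):
#
#     with urllib.request.urlopen(build_create_request(url, token, payload)) as resp:
#         created = json.load(resp)
```

Separating request construction from sending keeps the sketch testable without a live cluster.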

Update notification settings

There are two methods you can use to update cluster events notification settings once the notification settings object is created. In either case, make sure you have a cluster token that has the appropriate access scopes.

  • You can use the same POST method that you used to create the settings object (Create notification settings object). The schema doesn't allow duplicate settings objects, so if you attempt to create another settings object, you will overwrite the existing one.

  • You can modify an existing settings object by making a PUT API call to the /api/cluster/v2/settings/objects/<objectId> endpoint and providing the objectId obtained when creating the initial event notification settings object.

    Make the PUT API call to the /api/cluster/v2/settings/objects/<objectId> endpoint with the following payload:

    {
      "value": {
        "insufficientDiskSpace": {
          "sendEmail": true
        },
        "insufficientHardware": {
          "sendEmail": true
        },
        "insufficientMetricStorage": {
          "sendEmail": false
        },
        "transactionStorageTruncation": {
          "sendEmail": false
        },
        "adaptiveLoadReductionActivated": {
          "sendEmail": false
        }
      }
    }
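
The choice between the two update methods can be sketched as a small Python function that decides which call to make based on the items list returned by the GET objects endpoint. The function name `plan_update` is hypothetical. It only plans the request (method, path, and body) and leaves sending to the caller, and it assumes each item carries an `objectId` field.

```python
from urllib.parse import quote

SCHEMA_ID = "builtin:cluster-events-notification-settings"
OBJECTS_PATH = "/api/cluster/v2/settings/objects"


def plan_update(items, value):
    """Return (method, path, body) for applying `value`.

    items: the items list from the GET objects response.
    value: the settings value, for example {"insufficientDiskSpace": {"sendEmail": True}, ...}.
    (Hypothetical helper; it does not perform any network call.)
    """
    if items:
        # An object already exists: modify it in place with PUT.
        object_id = quote(items[0]["objectId"], safe="")
        return "PUT", f"{OBJECTS_PATH}/{object_id}", {"value": value}
    # No object yet: create one with the same POST body used for creation.
    body = [{"schemaId": SCHEMA_ID, "scope": "cluster", "value": value}]
    return "POST", OBJECTS_PATH, body
```

Note that the PUT body wraps the value in a single object, while the POST body is a list that also names the schema and scope.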

Delete notification settings

You can delete an existing notification settings object by making a DELETE API call to the /api/cluster/v2/settings/objects/<objectId> endpoint (with a cluster token that has the appropriate access scopes) and providing the objectId obtained when creating the initial event notification settings object. After the object is deleted, notification behavior falls back to the default: all notifications trigger an email to configured recipients.

Cluster events and email notifications

The following table lists the cluster event types, their severity levels, and whether each event triggers an email notification, a notification to Mission Control (MC), or both.

Type | Severity | Summary | Email notification | MC notification
ACTIVE_GATE_TOKENS | WARNING | "ActiveGate Token(s) will expire soon." | Yes | No
CASSANDRA | WARNING | "Cassandra node connection lost (%d times in the last hour)." | No | Yes
CASSANDRA | WARNING / INFO | "Cassandra node connection ADDED/LOST/REACTIVATED, host ." | No | Yes
CASSANDRA | INFO | "Cassandra node connection added (%d times in the last hour)." | No | Yes
CASSANDRA | INFO | "Cassandra node connection reestablished (%d times in the last hour)." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "License '%s' will expire within one day." | Yes | Yes
CLUSTER_LIFECYCLE | SEVERE | "License '%s' has expired." | Yes | Yes
CLUSTER_LIFECYCLE | SEVERE | "Your Dynatrace Managed cluster node {0} is undersized!" | Configurable [1] | Yes
CLUSTER_LIFECYCLE | SEVERE | "Storage volume for Dynatrace Managed log files is running out of space on {0}." | Yes | Yes
CLUSTER_LIFECYCLE | SEVERE | "Your Dynatrace Managed cluster should be scaled-out!" | Yes | Yes
CLUSTER_LIFECYCLE | SEVERE | "Could not detect %s activity on %s%s." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "Insufficient system privileges on %s." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "Update your Cluster ActiveGate{0} immediately. You're using {1} ActiveGate{0} that {2} no longer supported!" | Yes | Yes
CLUSTER_LIFECYCLE | SEVERE | "Post-Update data migration failed, please contact Dynatrace Support." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "Dynatrace update %s download failed." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "Cassandra backup problem." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "Dynatrace Managed has stopped collecting monitored data…" | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "SSL certificate expired." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "License '%s' is inactive. Until it is done, you cannot set up any monitoring. Activate your license on Licensing page." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "Failed to import the third-party vulnerability feeds version %s into the cluster's Dynatrace Application Security." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "Failed to import metadata file %s." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "Default cluster features were changed for non-gov license." | No | Yes
CLUSTER_LIFECYCLE | SEVERE | "Cluster traffic control: OneAgent monitoring was disabled on recently connected hosts to avoid cluster overload." | No | Yes
CLUSTER_LIFECYCLE | WARNING / SEVERE | "Host is down." | Yes | Yes
CLUSTER_LIFECYCLE | WARNING / SEVERE | "ElasticSearch backup problem." | No | Yes
CLUSTER_LIFECYCLE | WARNING / SEVERE | "Cluster '%s' ('%s') update failed." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "WebUI nodes settings change failed." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "LDAP connection error." | Yes | Yes
CLUSTER_LIFECYCLE | WARNING | "Upcoming update to version %s is suspended for the cluster '%s' ('%s')." | Yes | Yes
CLUSTER_LIFECYCLE | WARNING | "Node is down - %s." | Yes, if happened outside of the upgrade procedure | Yes
CLUSTER_LIFECYCLE | WARNING | "There is lack of connection to Dynatrace Mission Control." | Yes | Yes
CLUSTER_LIFECYCLE | WARNING | "Backup has been disabled because the configuration is no longer supported." | Yes | Yes
CLUSTER_LIFECYCLE | WARNING | "Fetching OAuth client credentials failed with status code {0}." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "Cluster '%s' ('%s') failed to determine update status." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "Self monitoring download failed on cluster '%s'." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "License check error." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "SSL certificate (%s) refresh failed." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "Login failed." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "Not all tenants from MultiTenant-ActiveGate %s are configured with AuthTokens." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "Let's Encrypt SSL certificate fetching failure." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "Your SSL certificate will expire soon." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "Login failed." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "User welcome e-mail was not sent." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "The Snyk vulnerability feed import failed." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "The NVD CVE vulnerability feed import failed." | No | Yes
CLUSTER_LIFECYCLE | WARNING | "LDAP connection problems." | No | Yes
CLUSTER_LIFECYCLE | WARNING / INFO | "Oidc signature check." | No | Yes
CLUSTER_LIFECYCLE | WARNING / INFO | "Billing archive has been successfully downloaded/downloaded with warning/failed." | No | Yes
CLUSTER_LIFECYCLE | INFO | "WebUI nodes settings changed successfully." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Scheduled update has been resumed for the cluster '%s' ('%s')." | Yes | Yes
CLUSTER_LIFECYCLE | INFO | "Hardware recommendation are fulfilled on {0}." | Configurable [1] | Yes
CLUSTER_LIFECYCLE | INFO | "Storage space for Dynatrace Managed log files is sufficient on {0}." | Yes | Yes
CLUSTER_LIFECYCLE | INFO | "Cluster meets the minimum requirement for the number of nodes." | Yes | Yes
CLUSTER_LIFECYCLE | INFO | "Node is up - %s." | Yes | Yes
CLUSTER_LIFECYCLE | INFO | "Host is up." | Yes | Yes
CLUSTER_LIFECYCLE | INFO | "Connection to Dynatrace Mission Control is back again." | Yes | Yes
CLUSTER_LIFECYCLE | INFO | "<processName> is up on <hostName>." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Node was restored on ." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Dynatrace update %s was successfully downloaded." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Cluster '%s' ('%s') started updating." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Cluster update download finished." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Cluster '%s' update to version %s finished." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Request for remote access." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Successful login to DebugUI." | No | Yes
CLUSTER_LIFECYCLE | INFO | "SSL certificate (%s) refresh succeeded." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Let's Encrypt SSL certificate fetching succeeded." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Cluster '%s' configuration database initialized successfully." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Server %d joined cluster '%s' ('%s')." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Server %d left cluster '%s' ('%s')." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Server %d joined cluster '%s' ('%s') %d times last hour." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Server %d left cluster '%s' ('%s') %d times last hour." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Successful login." | No | Yes
CLUSTER_LIFECYCLE | INFO | "The third-party vulnerability feeds version %s were successfully imported into the cluster's Dynatrace Application Security." | No | Yes
CLUSTER_LIFECYCLE | INFO | "Failed to import the third-party vulnerability feeds version %s into the cluster's Dynatrace Application Security." | No | Yes
CLUSTER_RUNTIME_SETTING | SEVERE | "Responsibility cluster nodes override set on node: %d." | No | Yes
CONFIGURATION_AUDIT | INFO | "Min Agent Version updated to %d by Session Replay." | No | Yes
CONFIGURATION_AUDIT | INFO | "Session State Version updated to v%d by Session Replay." | No | Yes
ELASTICSEARCH | INFO | "Elasticsearch storage service on your Dynatrace Managed cluster might be overloaded!" | No | Yes
ERROR | WARNING | "ElasticSearch update transient settings failed." | No | Yes
ERROR | INFO | "ElasticSearch update transient settings succeeded." | No | Yes
LOG_EVENT_DROP | WARNING | "Ingested log data is trimmed." | No | Yes
LOG_EVENT_DROP | WARNING | "Log ingest queue is full." | No | Yes
LOG_EVENT_DROP | WARNING | "Elasticsearch log queue is full." | No | Yes
LOG_EVENT_DROP | WARNING | "Elasticsearch log storing failed." | No | Yes
MANAGED_INTERNAL | INFO | "Internal: Cassandra has old files." | No | Yes
MANAGED_NODE_ADD | WARNING | "Adding new node is taking more than expected." | No | Yes
MANAGED_NODE_ADD | INFO | "Adding new node operation has been started." | No | Yes
MANAGED_NODE_ADD | INFO | "Adding new node precondition check status." | No | Yes
MANAGED_NODE_ADD | INFO | "Adding new node finished successfully." | No | Yes
MANAGED_NODE_ADD | WARNING | "Adding new node failure." | No | Yes
MANAGED_NODE_REMOVAL | WARNING | "Node removal finished with error %s (id=%d)." | No | Yes
MANAGED_NODE_REMOVAL | INFO | "Node removal operation is not allowed %s (id=%d)." | No | Yes
MANAGED_NODE_REMOVAL | INFO | "Node removal process started successfully %s (id=%d)." | No | Yes
MANAGED_NODE_REMOVAL | INFO | "Node removal process started successfully %s." | No | Yes
MANAGED_NODE_REMOVAL | INFO | "Node removal finished successfully." | No | Yes
MANAGED_NODE_REMOVAL | INFO | "Node removal failure." | No | Yes
SECURITY_GATEWAY_LIFECYCLE | INFO | "ActiveGate (host=%s) registered on cluster." | No | Yes
SECURITY_GATEWAY_LIFECYCLE | INFO | "ActiveGate (host=%s) unregistered on cluster." | No | Yes
SECURITY_GATEWAY_LIFECYCLE | INFO | "ActiveGate (host=%s) lost connection to cluster." | No | Yes
SECURITY_GATEWAY_LIFECYCLE | INFO | "ActiveGate (environment=%s, host=%s) registered on cluster." | No | Yes
SECURITY_GATEWAY_LIFECYCLE | INFO | "ActiveGate (environment=%s, host=%s) unregistered on cluster." | No | Yes
SECURITY_GATEWAY_LIFECYCLE | INFO | "ActiveGate (environment=%s, host=%s) lost connection to cluster." | No | Yes
SERVER_LIFECYCLE | SEVERE | "Transaction storage retention period truncated." | Configurable [1] | Yes
SERVER_LIFECYCLE | SEVERE | "Insufficient disk space on %s on %s." | Configurable [1] | Yes
SERVER_LIFECYCLE | SEVERE | "Long-term Metrics Store size exceeds the maximum acceptable 4 TB on %s." | Configurable [1] | Yes
SERVER_LIFECYCLE | SEVERE | "<component name> is down." | No | Yes
SERVER_LIFECYCLE | SEVERE | "Heap memory: Server %d started memory emergency mode." | No | Yes
SERVER_LIFECYCLE | SEVERE | "Heap memory: Server %d triggered a hard memory cleanup action." | No | Yes
SERVER_LIFECYCLE | SEVERE | "A cluster node can't receive OneAgent traffic." | No | Yes
SERVER_LIFECYCLE | WARNING | "Server %d activated Adaptive Load Reduction." | Configurable [1] | Yes
SERVER_LIFECYCLE | WARNING | "Long-term Metrics Store size exceeds recommended 2 TB on %s." | No | Yes
SERVER_LIFECYCLE | WARNING | "Node cannot read and write to directory: '%s'." | No | Yes
SERVER_LIFECYCLE | WARNING | "Heap memory: Server %d triggered a soft memory cleanup action." | No | Yes
SERVER_LIFECYCLE | WARNING / INFO | "Disabling OneAgent traffic at " | No | Yes
SERVER_LIFECYCLE | WARNING / INFO | "Enabling OneAgent traffic at " | No | Yes
SERVER_LIFECYCLE | INFO | "Failed to import the third-party vulnerability feeds version %s into the cluster's Dynatrace Application Security." | Configurable [1] | Yes
SERVER_LIFECYCLE | INFO | "Server %d deactivated Adaptive Load Reduction." | Configurable [1] | Yes
SERVER_LIFECYCLE | INFO | "Server %d shutdown initiated." | No | Yes
SERVER_LIFECYCLE | INFO | "<component name> is up." | No | Yes
SERVER_LIFECYCLE | INFO | "Server %d startup completed. (Version: %s)." | No | Yes
SERVER_LIFECYCLE | INFO | "Heap memory: Server %d ended memory emergency mode." | No | Yes
SERVER_LIFECYCLE | ? | "Server time of server %d is out of sync. Time difference %d milliseconds. Please enable NTP on all cluster nodes." | No | Yes
TENANT_LIFECYCLE | SEVERE | "Trial environment expired." | No | Yes
TENANT_LIFECYCLE | SEVERE | "The node with id %s is not properly configured." | No | Yes
TENANT_LIFECYCLE | SEVERE | "Environment '%s' with id %s failed to start on server %d. See server logs for details." | No | Yes
TENANT_LIFECYCLE | INFO | "Environment '%s' with id %s created." | No | Yes
TENANT_LIFECYCLE | INFO | "Environment '%s' with id %s updated." | No | Yes
TENANT_LIFECYCLE | INFO | "Environment '%s' with id %s removed." | No | Yes
[1] Configurable means that you can configure the notification through the Settings API (schema ID: builtin:cluster-events-notification-settings). To use this REST API, you need to authenticate with a token that has the settings.read and settings.write permissions.