Azure OpenAI

Dynatrace version 1.272+
Environment ActiveGate version 1.195+

Dynatrace ingests metrics from Azure Metrics API for OpenAI. You can view metrics for each service instance, split metrics into multiple dimensions, and create custom charts that you can pin to your dashboards.

Enable monitoring

To learn how to enable service monitoring, see Enable service monitoring.

View service metrics

You can view the service metrics in your Dynatrace environment either on the custom device overview page or on your Dashboards page.

View metrics on the custom device overview page

To access the custom device overview page

  1. Go to Technologies & Processes or Technologies & Processes Classic (latest Dynatrace).
  2. Filter by service name and select the relevant custom device group.
  3. Once you select the custom device group, you're on the custom device group overview page.
  4. The custom device group overview page lists all instances (custom devices) belonging to the group. Select an instance to view the custom device overview page.

View metrics on your dashboard

Once you add a service to monitoring, a preset dashboard for the respective service containing all recommended metrics is automatically created on your Dashboards page. You can look for specific dashboards by filtering by Preset and then by Name.

For existing monitored services, you might need to resave your credentials for the preset dashboard to appear on the Dashboards page. To resave your credentials, go to Settings > Cloud and virtualization > Azure, select the desired Azure instance, then select Save.

You can't make changes on a preset dashboard directly, but you can clone and edit it. To clone a dashboard, open the browse menu and select Clone.
To remove a dashboard from the dashboards list, you can hide it. To hide a dashboard, open the browse menu and select Hide.

Hiding a dashboard doesn't affect other users.


Available metrics

| Name | Description | Dimensions | Unit | Recommended |
|---|---|---|---|---|
| SuccessRate | Availability rate | API name, Operation name, Ratelimit key, Region | Percent | Applicable |
| BlockedCalls | Blocked calls | API name, Operation name, Ratelimit key, Region | Count | Applicable |
| ClientErrors | Client errors | API name, Operation name, Ratelimit key, Region | Count | Applicable |
| DataIn | Data in | API name, Operation name, Region | Byte | Applicable |
| DataOut | Data out | API name, Operation name, Region | Byte | Applicable |
| GeneratedTokens | Number of generated completion tokens | API name, Model deployment name, Model name, Region, Usage channel | Count | Applicable |
| Latency | Latency | API name, Operation name, Ratelimit key, Region | MilliSecond | Applicable |
| FineTunedTrainingHours | Processed fine tuned training hours | API name, Model deployment name, Model name, Region, Usage channel | Count | Applicable |
| TokenTransaction | Processed inference tokens | API name, Model deployment name, Model name, Region, Usage channel | Count | Applicable |
| ProcessedPromptTokens | Processed prompt tokens | API name, Model deployment name, Model name, Region, Usage channel | Count | Applicable |
| Ratelimit | Ratelimit | Ratelimit key, Region | Count | Applicable |
| ServerErrors | Number of server errors | API name, Operation name, Ratelimit key, Region | Count | Applicable |
| SuccessfulCalls | Number of successful calls | API name, Operation name, Ratelimit key, Region | Count | Applicable |
| TotalCalls | Number of calls | API name, Operation name, Ratelimit key, Region | Count | Applicable |
| TotalErrors | Number of errors | API name, Operation name, Ratelimit key, Region | Count | Applicable |
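
When planning a custom chart, it can help to know which metrics can be split by a given dimension. The following is a minimal sketch that transcribes the metric and dimension names listed above into a lookup table. It does not call any Dynatrace or Azure API, and the names `METRIC_DIMENSIONS` and `metrics_with_dimension` are hypothetical helpers introduced only for illustration.

```python
# Dimension sets transcribed from the "Available metrics" list above.
# These constants and the helper below are illustrative, not part of any Dynatrace API.
CALL_DIMS = ("API name", "Operation name", "Ratelimit key", "Region")
DATA_DIMS = ("API name", "Operation name", "Region")
MODEL_DIMS = (
    "API name",
    "Model deployment name",
    "Model name",
    "Region",
    "Usage channel",
)

METRIC_DIMENSIONS = {
    "SuccessRate": CALL_DIMS,
    "BlockedCalls": CALL_DIMS,
    "ClientErrors": CALL_DIMS,
    "DataIn": DATA_DIMS,
    "DataOut": DATA_DIMS,
    "GeneratedTokens": MODEL_DIMS,
    "Latency": CALL_DIMS,
    "FineTunedTrainingHours": MODEL_DIMS,
    "TokenTransaction": MODEL_DIMS,
    "ProcessedPromptTokens": MODEL_DIMS,
    "Ratelimit": ("Ratelimit key", "Region"),
    "ServerErrors": CALL_DIMS,
    "SuccessfulCalls": CALL_DIMS,
    "TotalCalls": CALL_DIMS,
    "TotalErrors": CALL_DIMS,
}


def metrics_with_dimension(dimension):
    """Return the metric names (in listed order) that can be split by `dimension`."""
    return [name for name, dims in METRIC_DIMENSIONS.items() if dimension in dims]


if __name__ == "__main__":
    # Token-related metrics are the only ones that can be split by model.
    print(metrics_with_dimension("Model name"))
```

For example, `metrics_with_dimension("Model name")` returns only the four token and training metrics, while every listed metric can be split by Region.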