Azure Batch monitoring

  • How-to guide
  • 3-min read
  • Published Jun 25, 2020

The Azure Batch overview page gives you a comprehensive view of how many jobs and tasks were completed over a period of time. You can also track nodes in different states, such as running, idle, or offline.

Prerequisites

  • Dynatrace version 1.196+
  • Environment ActiveGate version 1.195+

Enable monitoring

To learn how to enable service monitoring, see Enable service monitoring.

View service metrics

You can view the service metrics in your Dynatrace environment either on the custom device overview page or on your Dashboards page.

View metrics on the custom device overview page

To access the custom device overview page

  1. Go to Technologies & Processes Classic.
  2. Filter by service name and select the relevant custom device group.
  3. Once you select the custom device group, you're on the custom device group overview page.
  4. The custom device group overview page lists all instances (custom devices) belonging to the group. Select an instance to view the custom device overview page.

View metrics on your dashboard

Once you add a service to monitoring, a preset dashboard for the respective service containing all recommended metrics is automatically created on your Dashboards page. You can look for specific dashboards by filtering by Preset and then by Name.

For existing monitored services, you might need to resave your credentials for the preset dashboard to appear on the Dashboards page. To resave your credentials, go to Settings > Cloud and virtualization > Azure, select the desired Azure instance, then select Save.

You can't make changes on a preset dashboard directly, but you can clone and edit it. To clone a dashboard, open the browse menu () and select Clone.
To remove a dashboard from the dashboards list, you can hide it. To hide a dashboard, open the browse menu () and select Hide.

Hiding a dashboard doesn't affect other users.

Clone hide azure

Azure batch dash

Available metrics

NameDescriptionDimensionsUnitRecommended
CoreCountTotal number of dedicated cores in the batch accountNoneCountApplicable
CreatingNodeCountNumber of nodes being createdNoneCount
IdleNodeCountNumber of idle nodesNoneCountApplicable
JobDeleteCompleteEventTotal number of jobs that have been successfully deletedjobIdCount
JobDeleteStartEventTotal number of jobs that have been requested to be deletedjobIdCount
JobDisableCompleteEventTotal number of jobs that have been successfully disabledjobIdCount
JobDisableStartEventTotal number of jobs that have been requested to be disabledjobIdCount
JobStartEventTotal number of jobs that have been successfully startedjobIdCountApplicable
JobTerminateCompleteEventTotal number of jobs that have been successfully terminatedjobIdCount
JobTerminateStartEventTotal number of jobs that have been requested to be terminatedjobIdCount
LeavingPoolNodeCountNumber of nodes leaving the poolNoneCount
LowPriorityCoreCountTotal number of low-priority cores in the batch accountNoneCountApplicable
LowPriorityNodeCountTotal number of low-priority nodes in the batch accountNoneCountApplicable
OfflineNodeCountNumber of offline nodesNoneCount
PoolCreateEventTotal number of pools that have been createdpoolIdCount
PoolDeleteCompleteEventTotal number of pool deletes that have completedpoolIdCount
PoolDeleteStartEventTotal number of pool deletes that have startedpoolIdCount
PoolResizeCompleteEventTotal number of pool resizes that have completedpoolIdCount
PoolResizeStartEventTotal number of pool resizes that have startedpoolIdCount
PreemptedNodeCountNumber of preempted nodesNoneCount
RebootingNodeCountNumber of rebooting nodesNoneCountApplicable
ReimagingNodeCountNumber of reimaging nodesNoneCount
RunningNodeCountNumber of running nodesNoneCountApplicable
StartTaskFailedNodeCountNumber of nodes where the Start Task has failedNoneCount
StartingNodeCountNumber of nodes startingNoneCountApplicable
TaskCompleteEventTotal number of tasks that have completedpoolId,jobIdCountApplicable
TaskFailEventTotal number of tasks that have completed in a failed statepoolId,jobIdCountApplicable
TaskStartEventTotal number of tasks that have startedpoolId,jobIdCountApplicable
TotalNodeCountTotal number of dedicated nodes in the batch accountNoneCountApplicable
UnusableNodeCountNumber of unusable nodesNoneCount
WaitingForStartTaskNodeCountNumber of nodes waiting for the Start Task to completeNoneCount
Related tags
Infrastructure Observability