Infrastructure & Operations app

Latest Dynatrace

Infrastructure & Operations Infrastructure & Operations simplifies infrastructure health monitoring and facilitates root cause analysis for problems.

  • Health indicators powered by Davis® AI help you detect early signs of performance degradation to quickly see which areas of your environment need attention.
  • Check infrastructure host health metrics, logs, and events for all infrastructure entities.
  • Drill down from Infrastructure & Operations to any infrastructure entity metrics, logs, or events.

For detailed observability, we recommend installing OneAgent.

Infrastructure & Operations presents the data for your data centers, hosts, and network devices in lists.

Use the table options to navigate to the details that you need.

  • To sort the table by a particular column, select the column header.
  • To show or hide columns, select Column settings and then select the columns you want to display.
  • To download the table data as a CSV file, select Download table data. Only the visible columns will be included in the downloaded file.

You can filter the tables in the app by using the filter field with listed suggestions. Currently, you can use basic syntax (grouping filter statements and using logical operators is not supported). As you type, the relevant options are displayed.

You can add several statements to narrow down the filter results. For example, problems >4 name != *1b*. In this case, you narrow the search to hosts with more than 4 problems and names that don't include 1b.

For more details on using filters, see Filter field.

Data centers

Infrastructure & Operations provides visibility into interconnected data centers.

You can sort and filter data centers, for example, by name, ID, type, hosts, problems, and location.

By default, data centers are sorted by the number of open problems.

Select the total hosts number displayed in the table for a chosen data center to view the full information for its hosts.

Hosts

You can sort and filter hosts, for example, by name, ID, operating system, hypervisor, cloud, monitoring type, host group, IP address, problems, traffic volume, CPU, memory, disk, tags, and reachability.

By default, hosts are sorted by the number of open problems.

Side panel

The side panel gives you a convenient overview of the data for each selected host on the same page. To open the side panel for a host, select the host name.

The data in the side panel is organized in the following sections:

Section

Description

Overview

Provides insights into performance analytics, which include CPU usage, memory usage, total disk space used, and network traffic. It also shows the host processes, properties, tags, and ownership for fast host identification.

You can enhance the filtering options by adding tags to each host. For details, see Best practices and recommendations for tagging.

Problems

Allows you to view all problems detected by Davis® AI. Selecting a particular problem shows the affected entity.

Vulnerabilities

Contains all the detected vulnerabilities. Use the filter option to narrow down your selection.

SLOs

Allows you to monitor service-level objectives by viewing their name, status, error budget, and timeframe. Selecting Clone SLO from the Actions column, will redirect you to the SLO wizard to create a new SLO. For more details, see Configure and monitor service-level objectives with Dynatrace.

Logs

Contains pre-made queries for logs. Select Run query to see a detailed view of your logs in a log browser. From the page that opens, you can perform further modifications to get more information about issues.

Full-page view

To display the full-page view

  • Select View details in the Actions column of the hosts' table, or
  • Select View details in the host's side panel.

The data in the full-page view is organized in the following sections:

Section

Description

Overview

Presents the available data for the host in the following panels: Host overview, Technologies, Connections, Network Analysis, Disk Analysis. You can adapt the presentation of data in each section by filtering it and adding or removing columns in the lists.

You can add OS services monitoring to the available data list by configuring the services you want to monitor. For details, see OS services monitoring.

Info

Shows the host tags, properties, and ownership.

Processes

Displays the CPU and memory usage of all processes in a graph view. This allows you to easily identify what each process is doing. To see details on a single process, select it in the table to display process-specific tabs with information such as CPU and memory usage, working set size, incoming and outgoing traffic, and properties and tags.

Selecting Host Properties in the upper-right corner of the page opens a side panel with the host tags, properties, and ownership.

Analyze host overview graphs

You can define which graphs or charts are displayed in the host's full-page view by selecting Expand menu in the upper left corner of each graph or chart.

For a detailed analysis of the presented data in Notebooks, select > Open in Notebook Open with and choose whether to open the graph in a new or existing notebook.

View host processes

Processes for each host can be viewed directly from the list of hosts or by opening the full-page view.

On the Hosts page

  • Hover over the CPU and Memory columns to view the top five processes for a selected host.
  • To display the complete list of processes, hover over the CPU or Memory column and select View all processes.

On the full-page view of the chosen host

  • Select View Processes in the Technologies section of the Overview tab.

Display code-module metrics

There is an option to inject code modules and display the injected data in the app. For more details, see Universal injection of code modules.

After selecting a process from the list, the data is presented in a dedicated tab.

Monitor incoming and outgoing connections

In the full-page view under Connections, you can see a quick overview of each type of connection, the total number of connected entities, and the number of problems. Expand the panel to see a list of connected entities and possible problems for each of them. You can also see which processes from the current host communicate with other processes or services that aren't part of the host.

The Connections table allows you to identify potential sources of problems that don't happen directly on the host. Select the connected entity marked with to navigate to its details page and investigate the issue.

The incoming and outgoing process connections in the table are listed by processes with the most issues based on network and CPU usage.

View data injected via extensions

The full-page view for hosts can include data injected by extensions. The data is presented in a dedicated tab called Extensions.

The data from all extensions linked to a selected host appears on the Extensions tab. You can view what type of data is injected via the extension on the extension's settings page.

You can also add data from a custom extension by following the steps in the documentation. For more details, see Extensions 2.0 concepts.

There are dozens of extensions you can use to load data in the Extensions tab. To check whether a particular extension version supports data injection, see the details of the selected extension in the Hub or, if it's a custom extension, contact its author.

Network devices

Infrastructure & Operations gives you an overview of all the devices in your network.

You can sort and filter the network devices, for example, by name, ID, type, problems, IP address, uptime, interface admin/operator, CPU, memory, saturated interfaces, traffic volume, and reachability.

For the available filtering options, see the Navigation and filtering section.

By default, the list of network devices is sorted by the number of problems on the device.

When you select a device name, a side panel opens, displaying tabs with information such as properties, tags, interfaces, and problems.

Use custom extensions to display network devices data

You can use extensions to display the data for network devices in the app. For more details, see Extensions.

Minimum required versions of extensions

Ensure your system has these minimum versions of the installed extensions to avoid any issues with displaying the data in the Infrastructure & Operations app.

This list concerns extensions for supporting network devices and is not exhaustive. Each network device can have more extensions.

Extension
Minimum version
SNMP Autodiscovery
2.2.8
Generic Cisco Device
2.1.4
F5 BIG-IP
2.10.0
Palo Alto firewalls
2.7.0
Juniper Networks (SNMP)
1.5.0
Generic network device
2.0.0
Fortigate
1.2.21

Measure the reachability of a host or device

The Reachability column on the Hosts and Network devices pages shows how easily you can access a device or a host over the network from a remote location.

The value is calculated as the average of all the availability measurements the Network Availability Monitors (NAM) monitors take during the period you select in Infrastructure & Operations Infrastructure & Operations.

To use this feature, you must configure NAM for the desired device or host. For details, refer to Create a NAM monitor.