Data flow

Latest Dynatrace

With OpenPipeline, you can ingest data into the Dynatrace platform from a wide variety of formats and providers through ingest sources. Data is then routed to pipelines for processing and stored in Grail buckets.

Key terms

Pipeline: Collection of processing instructions to structure, separate, and store data.

Configuration scope

Configuration scopes, such as logs and events, provide observability insights into the health, performance, and behavior of your system, enabling teams to detect, diagnose, and resolve problems. Each configuration scope offers a different perspective because of its unique characteristics. OpenPipeline provides a unified solution for configuring ingestion and processing, while keeping configuration options flexible for each configuration scope.

The following table lists configuration scopes, summarizing availability in OpenPipeline.

Configuration scope	Availability
Business events
Events - Generic
Events - Davis events
Events - Davis problems
Events - SDLC events
Events - Security events (legacy)
Logs
Metrics
Security events (new)
Spans
System events¹
Topology
User events
User sessions

¹ System events supported by OpenPipeline are limited to: App Lifecycle Notifications (event.kind == "AUDIT_EVENT" AND event.provider == "APP_REGISTRY"), Workflow Execution events (event.kind == "WORKFLOW_EVENT" AND event.provider == "AUTOMATION_ENGINE"), and ECC self-monitoring events (event.kind == "EXTENSIONS_EVENT").
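The three conditions in this note can be read as a single predicate. The following is a minimal Python sketch of that logic, not part of OpenPipeline itself; the function name `is_supported_system_event` is hypothetical, and records are assumed to be flat dictionaries keyed by field name.

```python
def is_supported_system_event(record: dict) -> bool:
    """Return True if the record matches one of the three supported system event types."""
    kind = record.get("event.kind")
    provider = record.get("event.provider")
    return (
        # App Lifecycle Notifications
        (kind == "AUDIT_EVENT" and provider == "APP_REGISTRY")
        # Workflow Execution events
        or (kind == "WORKFLOW_EVENT" and provider == "AUTOMATION_ENGINE")
        # ECC self-monitoring events (no provider condition is listed)
        or kind == "EXTENSIONS_EVENT"
    )
```

Note that an `AUDIT_EVENT` from any provider other than `APP_REGISTRY` does not satisfy the predicate.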

Ingest sources

Data reaches the Dynatrace platform via different ingestion sources, such as API endpoints, OneAgent, and extensions, which collect data from data providers. In OpenPipeline, they are defined by a name and a path. You can leverage:

Built-in ingest sources
Custom ingest sources

Custom ingest sources are available for events, excluding Davis problems and Davis events. They support pre-processing and static routing.

Once records reach your Dynatrace SaaS environment via ingest sources, you can route them to a pipeline.

The following table lists the ingest sources for each configuration scope supported by OpenPipeline.

Configuration scope	Ingest source	Path (`dt.openpipeline.source`)
Logs	OneAgent	`-` or `oneagent`
	Extensions	`-` or `extension`
	OpenTelemetry	`/api/v2/otlp/v1/logs`
	Log ingest API	`/api/v2/logs/ingest`
	Amazon Data Firehose	`/api/v2/logs/ingest/aws_firehose`
Metrics	OneAgent	`-` or `oneagent`
	OpenTelemetry metrics ingest API	`/api/v2/otlp/v1/metrics`
	Classic environment API	`/api/v2/metrics/ingest`
Spans	OneAgent	`-` or `oneagent`
	OpenTelemetry	`/api/v2/otlp/v1/traces`
Events - Generic events	Default API	`/platform/ingest/v1/events`
	Custom API	`/platform/ingest/custom/events/<custom-endpoint-name>`
Events - Davis events	OneAgent	`-` or `oneagent`
	Classic environment API	`-` or `events/ingest`
	Data Extraction	`-` or `data_extraction`
Events - Davis problems	Classic root cause analysis	`-`
Events - SDLC events	Endpoint for Software Development Lifecycle events	`/platform/ingest/v1/events.SDLC`
	Custom endpoint for Software Development Lifecycle events	`/platform/ingest/custom/events.SDLC/<custom-endpoint-name>`
Events - Security events (legacy)	Security events endpoint (legacy)	`/platform/ingest/v1/events.security`
	Custom security events API (legacy)	`/platform/ingest/custom/events.security/<custom-endpoint-name>`
Security events (new)	Security events endpoint (new)	`/platform/ingest/v1/security.events`
	Custom security events API (new)	`/platform/ingest/custom/security.events/<custom-endpoint-name>`
Business events	OneAgent	`-` or `oneagent`
	RUM Agent	`-` or `rumagent`
	Business Events API	`/api/v2/bizevents/ingest`
	Data Extraction	`data_extraction`
System events	Internally generated	`-` or `system_events`
	Extensions	`-` or `extension`
User events and User sessions	RUM Agent	`-` or `rumagent`
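To illustrate how the path identifies an ingest source, here is a small Python sketch that maps a `dt.openpipeline.source` value back to its configuration scope. The paths are copied from the table above; the function name `scope_for_path` and the two lookup dictionaries are hypothetical helpers, not a Dynatrace API. Short values such as `oneagent` or `extension` appear under several scopes, so the sketch deliberately returns `None` for them.

```python
from typing import Optional

# Built-in API paths from the table above, keyed by exact path.
EXACT_PATHS = {
    "/api/v2/otlp/v1/logs": "Logs",
    "/api/v2/logs/ingest": "Logs",
    "/api/v2/logs/ingest/aws_firehose": "Logs",
    "/api/v2/otlp/v1/metrics": "Metrics",
    "/api/v2/metrics/ingest": "Metrics",
    "/api/v2/otlp/v1/traces": "Spans",
    "/platform/ingest/v1/events": "Events - Generic events",
    "/platform/ingest/v1/events.SDLC": "Events - SDLC events",
    "/platform/ingest/v1/events.security": "Events - Security events (legacy)",
    "/platform/ingest/v1/security.events": "Security events (new)",
    "/api/v2/bizevents/ingest": "Business events",
}

# Custom ingest sources: the prefix is followed by <custom-endpoint-name>.
CUSTOM_PREFIXES = {
    "/platform/ingest/custom/events/": "Events - Generic events",
    "/platform/ingest/custom/events.SDLC/": "Events - SDLC events",
    "/platform/ingest/custom/events.security/": "Events - Security events (legacy)",
    "/platform/ingest/custom/security.events/": "Security events (new)",
}


def scope_for_path(path: str) -> Optional[str]:
    """Return the configuration scope for a path, or None if it is ambiguous or unknown."""
    if path in EXACT_PATHS:
        return EXACT_PATHS[path]
    for prefix, scope in CUSTOM_PREFIXES.items():
        # Require a non-empty endpoint name after the prefix.
        if path.startswith(prefix) and len(path) > len(prefix):
            return scope
    return None
```

For example, `scope_for_path("/platform/ingest/custom/events.SDLC/my-endpoint")` resolves to the SDLC events scope, where `my-endpoint` is a made-up endpoint name.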

Use cases

Configure multiple pipelines for the same configuration scope, adopting processing instructions specific to the ingest source.

Best practices

To get started with OpenPipeline ingestion via API, see Ingestion APIs.
To learn the path and type of the system events processed in your environment:
1. Go to Notebooks.
2. Create a new notebook containing the following query:
```
fetch dt.system.events
| filter isNotNull(dt.openpipeline.pipelines)
```
3. Select Run.

Pre-processing

Pre-processing is optional data processing that occurs after ingestion and before routing. By setting up pre-processing, you can transform raw data into structured formats as soon as it reaches your Dynatrace SaaS environment. Pre-processed data is then routed to a pipeline and is available for further processing before storage. Pre-processing is available only for custom ingest sources.

Use cases

Apply a unified structure to different providers' data formats.

Best practices

Set up pre-processing to avoid creating complex matching conditions based on provider-specific data formats. This will help you streamline maintenance for routing and processing, for example, when you start ingesting data from a new provider.
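The idea of a unified structure can be sketched in Python as follows. This is only a conceptual illustration: in OpenPipeline, pre-processing is configured on the custom ingest source rather than written as code, and the two provider formats, their field names, and the `normalize` function are all invented for the example.

```python
def normalize(record: dict) -> dict:
    """Map provider-specific field names onto one shared structure."""
    if "alert_title" in record:  # hypothetical format of provider A
        return {
            "event.name": record["alert_title"],
            "severity": str(record.get("level", "unknown")).upper(),
            "event.provider": "provider-a",
        }
    if "summary" in record:  # hypothetical format of provider B
        return {
            "event.name": record["summary"],
            "severity": str(record.get("priority", "unknown")).upper(),
            "event.provider": "provider-b",
        }
    # Unknown format: pass the record through unchanged.
    return dict(record)


def is_critical(record: dict) -> bool:
    """A downstream condition that no longer depends on the provider format."""
    return record.get("severity") == "CRITICAL"
```

Because both formats end up with the same `severity` field, a single condition such as `is_critical` covers every provider, and adding a new provider only requires another branch in `normalize`.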

Routing

After data is ingested (and optionally pre-processed), it's routed to pipelines. Routing depends on the following:

The configuration scope

Pipelines are specific to a configuration scope. Different configuration scopes are routed to different pipelines.

The ingest source

You can configure routing for each ingest source. Multiple ingest sources of the same configuration scope can be routed to the same pipeline.

The routing option

- Dynamic routing: data is routed based on a matching condition. The matching condition is a DQL query that defines the data set you want to route.
- Static routing: data is routed to a specific pipeline, which remains fixed unless manually updated. Static routing is available only for custom ingest sources.

If a record matches a dynamic routing condition but you've already configured static routing for its custom ingest source, the match is skipped and the record is routed directly to the pipeline you specified.
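The precedence described here can be sketched in Python. This is a simplified model rather than the actual routing engine: the `route` function, the pipeline names, and the use of Python callables in place of DQL matching conditions are illustrative, and evaluating dynamic routes in order with the first match winning is an assumption of the sketch.

```python
from typing import Callable, Dict, List, Tuple

Record = dict
Matcher = Callable[[Record], bool]

BUILT_IN_PIPELINE = "built-in"  # placeholder name for the default pipeline


def route(
    record: Record,
    static_routes: Dict[str, str],
    dynamic_routes: List[Tuple[Matcher, str]],
) -> str:
    """Pick a pipeline for a record: static routing first, then dynamic, then the default."""
    source = record.get("dt.openpipeline.source")
    # Static routing on the custom ingest source skips all matching conditions.
    if source in static_routes:
        return static_routes[source]
    # Dynamic routing: assumed here to be evaluated in order, first match wins.
    for matches, pipeline in dynamic_routes:
        if matches(record):
            return pipeline
    # No route applies: fall back to the built-in pipeline.
    return BUILT_IN_PIPELINE
```

With a static route configured for a hypothetical custom source `/platform/ingest/custom/events/orders`, a record from that source goes to the statically assigned pipeline even when it also satisfies a dynamic condition.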

Use cases

Route data of an ingest source to a dedicated pipeline.

Best practices

When multiple routing options are available, choose according to the size of the data set. For example, large data sets benefit more from dynamic routing.

Processing

OpenPipeline processing occurs in pipelines containing instructions on how to structure, separate, and store your data. To learn more, see Processing.

Storage

The Dynatrace Grail database provides a single unified storage solution for all your configuration scopes. OpenPipeline stores data in Grail buckets. You can use built-in buckets and, if available for the configuration scope, create new buckets with custom retention periods. Each bucket is assigned to a DQL database table. Assign permissions to user groups or single users to give them access to specific buckets and tables.

By default, OpenPipeline routes data into a built-in pipeline whose target storage is the built-in Grail bucket of the configuration scope. You can configure storage assignment:

For a custom ingest source, by directly defining its targeted storage.
For a pipeline, based on processing matching conditions.

Exceptions for system events

Storage and retention for system events are not configurable.
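As a rough illustration of pipeline-level storage assignment, the following Python sketch picks a bucket from a list of matching conditions and falls back to the built-in bucket of the scope. It is a conceptual model only: the `assign_bucket` function, the bucket names, and the first-match-wins evaluation are assumptions made for the example, and callables stand in for the matching conditions.

```python
from typing import Callable, List, Tuple

Record = dict
Matcher = Callable[[Record], bool]


def assign_bucket(
    scope: str,
    record: Record,
    rules: List[Tuple[Matcher, str]],
    built_in_bucket: str,
) -> str:
    """Return the target bucket for a record processed by a pipeline."""
    # Storage for system events is not configurable, so rules are ignored.
    if scope == "System events":
        return built_in_bucket
    # Assumed evaluation order: first matching condition wins.
    for matches, bucket in rules:
        if matches(record):
            return bucket
    # No condition matched: keep the built-in bucket of the scope.
    return built_in_bucket
```

For instance, a rule that sends records with `loglevel` equal to `ERROR` to a hypothetical bucket with a longer retention period leaves all other log records in the built-in bucket.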