The Grail data lakehouse at the heart of the Dynatrace platform enables contextual analytics across unified observability, security, and business data. As a data lakehouse, Grail combines the cost efficiency advantages of data lakes with the analytics capabilities of data warehouses, and adds extreme performance through massively parallel processing.
Enterprises collect exponentially growing volumes of data with the aim of extracting value and having all relevant data available when needed. Data lakes are a cost efficient solution in this regard, as they consolidate raw data in a flat data format on cheap storage. Keeping data in a central place enables data analytics teams to access it when required, including use cases such as machine learning or historical analysis.
Grail incorporates the following benefits of data lakes:
Grail avoids the following limitations of data lakes:
Data warehouses, unlike data lakes, are structured databases that store data according to a predefined schema and data model, making data access and analytics easier.
A data lakehouse merges the benefits of data lakes and data warehouses. It allows storing large volumes of structured and unstructured data while providing fast and scalable analytical and processing capabilities, supporting the full flexibility of schema-on-read. This approach makes it possible to manage data more flexibly and efficiently, facilitating advanced analytics and machine learning on a scalable, cost-effective platform.
Data lakehouses have the following attributes:
Grail is a data lakehouse that specializes in observability use cases. It leverages the scalability and flexibility of data lakes and adds the transactional layer of data lakehouses to give meaning to raw data signals, creating a semantic model that supports Smartscape and provides the contextual layer that enables Davis AI.
Grail data lakehouse
Data warehouse
Data lake
Data type support
Structured and unstructured data in its raw format
Structured data
Structured and unstructured data in its raw format
Scalability
Scales to any size at low cost
Scale-up is exponentially complex and expensive
Scales to any size, requiring extra tooling
Flexibility
Full flexibility with additional schema on read
Limited to schema on ingest
Full flexibility without any schema, requires rehydration and separate processing
Performance
High: Structured data allows to quickly access data and gain insights.
High: For indexed data access patterns
Poor
Ease of use
Simple: Structured data allows to quickly access data and gain insights
Simple: Structured data allows to quickly access data and gain insights.
Difficult: Missing structure and large volume make it difficult to find data.
Security
Grail adds a sophisticated permissioning model and additional concepts such as hard data deletion as well as data encryption at rest and in transit and others.
Good security concepts including encryption, auditing, authorization controls.
Storing data in its raw format, risk to become a data swamp, less efforts for governance and access control topics.
Cost
$
$$$$
$