Data Architecture, Data Governance, Metadata and Snapshot

Data Architecture

Data Governance

Metadata

Snapshot

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

AWS Big Data

MARCH 10, 2023

Apache Iceberg overview Iceberg is an open-source table format that brings the power of SQL tables to big data files. It enables ACID transactions on tables, allowing for concurrent data ingestion, updates, and queries, all while using familiar SQL. The Iceberg table is synced with the AWS Glue Data Catalog.

Data Lake

Data Lake Sales Data Warehouse Snapshot

Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation

AWS Big Data

JANUARY 24, 2023

AWS Lake Formation helps with enterprise data governance and is important for a data mesh architecture. It works with the AWS Glue Data Catalog to enforce data access and governance. This solution only replicates metadata in the Data Catalog, not the actual underlying data.

Data Architecture

Data Architecture Metadata Data Lake Snapshot

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

How To Get Promoted In Product Management

MORE WEBINARS

Trending Sources

5 Reasons to Use Apache Iceberg on Cloudera Data Platform (CDP)

Cloudera

MARCH 23, 2022

In fact, we recently announced the integration with our cloud ecosystem bringing the benefits of Iceberg to enterprises as they make their journey to the public cloud, and as they adopt more converged architectures like the Lakehouse. 1: Multi-function analytics . 3: Open Performance.

Metadata

Metadata Data Architecture Machine Learning Cost-Benefit

Webinars

How to Optimize the Developer Experience for Monumental Impact

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

How To Get Promoted In Product Management

MORE WEBINARS

Cloud Data Warehouse Migration 101: Expert Tips

Alation

JULY 28, 2022

What Are the Biggest Drivers of Cloud Data Warehousing? It’s costly and time-consuming to manage on-premises data warehouses — and modern cloud data architectures can deliver business agility and innovation. There are tools to replicate and snapshot data, plus tools to scale and improve performance.”

Data Warehouse

Data Warehouse Cost-Benefit Data Governance Data-driven

Choosing an open table format for your transactional data lake on AWS

AWS Big Data

JUNE 9, 2023

A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale. Non-blocking automatic table services (for example, compaction) that don’t impact writers or readers.

Data Lake

Data Lake Metadata Optimization Statistics

“You Complete Me,” said Data Lineage to DataOps Observability.

DataKitchen

JANUARY 23, 2023

Data lineage can also be used for compliance, auditing, and data governance purposes. DataOps Observability Five on data lineage: Data lineage traces data’s origin, history, and movement through various processing, storage, and analysis stages. What is missing in data lineage?

Testing

Testing Data Governance Data Quality Data-driven

Data Leaders Brief

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation

Webinars

Trending Sources

5 Reasons to Use Apache Iceberg on Cloudera Data Platform (CDP)

Webinars

Cloud Data Warehouse Migration 101: Expert Tips

Choosing an open table format for your transactional data lake on AWS

“You Complete Me,” said Data Lineage to DataOps Observability.

Stay Connected