Big Data, Data Lake and Enterprise

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

AWS Big Data

APRIL 3, 2024

Businesses are constantly evolving, and data leaders are challenged every day to meet new requirements. For many enterprises and large organizations, it is not feasible to have one processing engine or tool to deal with the various business requirements. This post is co-written with Andries Engelbrecht and Scott Teal from Snowflake.

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Migrate an existing data lake to a transactional data lake using Apache Iceberg

Webinars

Trending Sources

Monitor data pipelines in a serverless data lake

Webinars

Differentiating Between Data Lakes and Data Warehouses

12 Considerations When Evaluating Data Lake Engine Vendors for Analytics and BI

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Use Apache Iceberg in a data lake to support incremental data processing

Improve operational efficiencies of Apache Iceberg tables built on Amazon S3 data lakes

Top Considerations for Building an Open Cloud Data Lake

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

Here’s Why Automation For Data Lakes Could Be Important

Build a real-time GDPR-aligned Apache Iceberg data lake

Complexity Drives Costs: A Look Inside BYOD and Azure Data Lakes

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

Data Lakes on Cloud & it’s Usage in Healthcare

Data Lakes: What Are They and Who Needs Them?

A Comprehensive Guide on Delta Lake

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

Data Modeling 301 for the cloud: data lake and NoSQL data modeling and design

Introducing the technology behind watsonx.ai, IBM’s AI and data platform for enterprise

Migrate data from Azure Blob Storage to Amazon S3 using AWS Glue

Build a data lake with Apache Flink on Amazon EMR

Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake Formation

2021 Gift Giving Guide for Data Nerds

Accelerate data science feature engineering on transactional data lakes using Amazon Athena with Apache Iceberg

Building a Beautiful Data Lakehouse

Use AWS Glue ETL to perform merge, partition evolution, and schema evolution on Apache Iceberg

Power enterprise-grade Data Vaults with Amazon Redshift – Part 2

Data Management Requirements for the Enterprise Data Lake

Automatically detect Personally Identifiable Information in Amazon Redshift using AWS Glue

Data governance in the age of generative AI

How BMO improved data security with Amazon Redshift and AWS Lake Formation

The rise of the data lakehouse: A new era of data value

Cloudera announces support for Azure’s next-generation Data Lake Store

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

7 key Microsoft Azure analytics services (plus one extra)

Use IAM runtime roles with Amazon EMR Studio Workspaces and AWS Lake Formation for cross-account fine-grained access control

Big Data Fabric Weaves Together Automation, Scalability, and Intelligence

Munich Re Launches Enterprise-Wide Data-Driven Platform for Analytics

Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 2: AWS Glue Studio Visual Editor

The Future of the Data Lakehouse – Open

3 business realities fueling the need for enterprise data preparation

Stay Connected