Migrate an existing data lake to a transactional data lake using Apache Iceberg

AWS Big Data

A data lake is a centralized repository that you can use to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure it, and then run different types of analytics for better business insights.
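
To make the migration idea concrete, here is a minimal sketch using Apache Iceberg's Spark procedures with a Glue-backed Iceberg catalog; the catalog, database, and table names (glue_catalog, db.sales) and the S3 warehouse path are illustrative assumptions, not details taken from the article.

```python
# Sketch: convert an existing table in a data lake to Iceberg using Spark.
# All names and paths below are placeholders for illustration only.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .config("spark.sql.extensions",
            "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
    .config("spark.sql.catalog.glue_catalog",
            "org.apache.iceberg.spark.SparkCatalog")
    .config("spark.sql.catalog.glue_catalog.catalog-impl",
            "org.apache.iceberg.aws.glue.GlueCatalog")
    .config("spark.sql.catalog.glue_catalog.warehouse",
            "s3://my-example-bucket/warehouse/")
    .getOrCreate()
)

# snapshot creates an Iceberg copy for testing and leaves the source untouched.
spark.sql("CALL glue_catalog.system.snapshot('db.sales', 'db.sales_iceberg_test')")

# Once validated, migrate converts the original table to Iceberg in place.
spark.sql("CALL glue_catalog.system.migrate('db.sales')")
```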

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

AWS Big Data

Amazon Redshift enables you to directly access data stored in Amazon Simple Storage Service (Amazon S3) using SQL queries and join data across your data warehouse and data lake. With Amazon Redshift, you can query the data in your S3 data lake using a central AWS Glue metastore from your Redshift data warehouse.
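
As a rough sketch of that pattern (not code from the article), the snippet below uses the Amazon Redshift Data API to register a Glue Data Catalog database as an external schema and then join a data lake table with a native warehouse table; every resource name, the IAM role ARN, and the workgroup are placeholders.

```python
# Sketch: query S3 data-lake tables from Redshift via a Glue-backed
# external schema. All identifiers below are illustrative assumptions.
import boto3

client = boto3.client("redshift-data")

# One-time setup: map a Glue database into Redshift as an external schema.
setup_sql = """
CREATE EXTERNAL SCHEMA IF NOT EXISTS datalake
FROM DATA CATALOG
DATABASE 'my_glue_db'
IAM_ROLE 'arn:aws:iam::123456789012:role/MyRedshiftRole';
"""

# Join data in the lake with data in the warehouse in a single query.
query_sql = """
SELECT o.order_id, o.amount, c.segment
FROM datalake.orders AS o            -- table in S3, cataloged in Glue
JOIN analytics.customers AS c        -- native Redshift table
  ON o.customer_id = c.customer_id;
"""

for sql in (setup_sql, query_sql):
    client.execute_statement(
        WorkgroupName="my-serverless-workgroup",  # or ClusterIdentifier=...
        Database="dev",
        Sql=sql,
    )
```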

Choosing an open table format for your transactional data lake on AWS

AWS Big Data

A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale.

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

AWS Big Data

Since the deluge of big data over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.
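
The Athena piece of that stack can be sketched as follows, assuming an Iceberg table created directly from Athena DDL; the bucket, database, table, and column names are illustrative assumptions rather than the article's actual solution.

```python
# Sketch: create an Iceberg table from Amazon Athena and run the DDL via
# boto3. All names and S3 paths are placeholders for illustration only.
import boto3

athena = boto3.client("athena")

ddl = """
CREATE TABLE mydb.events (
  event_id   string,
  event_time timestamp,
  payload    string
)
PARTITIONED BY (day(event_time))      -- Iceberg hidden partitioning
LOCATION 's3://my-example-bucket/warehouse/events/'
TBLPROPERTIES ('table_type' = 'ICEBERG');
"""

athena.start_query_execution(
    QueryString=ddl,
    QueryExecutionContext={"Database": "mydb"},
    ResultConfiguration={"OutputLocation": "s3://my-example-bucket/athena-results/"},
)
```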

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

AWS Big Data

With data becoming the driving force behind many industries today, having a modern data architecture is pivotal for organizations to be successful. In this post, we describe Orca’s journey building a transactional data lake using Amazon Simple Storage Service (Amazon S3), Apache Iceberg, and AWS Analytics.

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

AWS Big Data

A modern data architecture is an evolutionary architecture pattern designed to integrate a data lake, data warehouse, and purpose-built stores with a unified governance model. In the use case covered in this post, the company wanted the ability to continue processing operational data in the secondary Region in the rare event of a primary Region failure.
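
For a sense of what such operational processing looks like in code, here is a minimal sketch of an upsert-style write to a Hudi table from a Glue Spark job; the table name, key fields, and S3 paths are assumptions for illustration, not the article's implementation.

```python
# Sketch: apply change records to a Hudi table with an upsert write.
# Table name, key fields, and S3 paths are illustrative placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()
changes_df = spark.read.parquet("s3://my-example-bucket/cdc/orders/")

hudi_options = {
    "hoodie.table.name": "orders",
    "hoodie.datasource.write.recordkey.field": "order_id",
    "hoodie.datasource.write.precombine.field": "updated_at",
    "hoodie.datasource.write.operation": "upsert",
}

(changes_df.write.format("hudi")
    .options(**hudi_options)
    .mode("append")
    .save("s3://my-example-bucket/lake/orders/"))
```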

Use Apache Iceberg in a data lake to support incremental data processing

AWS Big Data

Iceberg has become very popular for its support for ACID transactions in data lakes and features like schema and partition evolution, time travel, and rollback. AWS Glue 3.0 and later supports the Apache Iceberg framework for data lakes. The following diagram illustrates the solution architecture.
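
As a hedged sketch of the incremental-processing idea, Iceberg's Spark reader can scan only the records appended between two snapshots instead of the whole table; the snapshot IDs and table name below are placeholders, not values from the post.

```python
# Sketch: Iceberg incremental read in Spark, processing only the data
# appended between two snapshots. IDs and names are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

increment = (
    spark.read.format("iceberg")
    .option("start-snapshot-id", "4752114624385979611")  # exclusive lower bound
    .option("end-snapshot-id", "8353729625138839818")    # inclusive upper bound
    .load("glue_catalog.db.events")
)
increment.show()
```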
