Analytics, Data Lake and Optimization

Multicloud data lake analytics with Amazon Athena

AWS Big Data

MARCH 18, 2024

Many organizations operate data lakes spanning multiple cloud data stores. In these cases, you may want an integrated query layer to seamlessly run analytical queries across these diverse cloud stores and streamline your data analytics processes. This serves as the S3 data lake data for this post.

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

Choosing an open table format for your transactional data lake on AWS

How Salesforce optimized their detection and response platform using AWS managed services

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Understanding Apache Iceberg on AWS with the new technical guide

Differentiating Between Data Lakes and Data Warehouses

Use Apache Iceberg in a data lake to support incremental data processing

Apache Iceberg optimization: Solving the small files problem in Amazon EMR

Rapidminer Platform Supports Entire Data Science Lifecycle

How to modernize data lakes with a data lakehouse architecture

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Improve operational efficiencies of Apache Iceberg tables built on Amazon S3 data lakes

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Empower your Jira data in a data lake with Amazon AppFlow and AWS Glue

Perform upserts in a data lake using Amazon Athena and Apache Iceberg

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

Deploy and Optimize Your Snowflake Environment Faster With Accelerators

Speed up queries with the cost-based optimizer in Amazon Athena

Enable business users to analyze large datasets in your data lake with Amazon QuickSight

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

How Amazon Devices scaled and optimized real-time demand and supply forecasts using serverless analytics

Navigating Data Entities, BYOD, and Data Lakes in Microsoft Dynamics

Your guide to AWS Analytics at AWS re:Invent 2023

How Ruparupa gained updated insights with an Amazon S3 data lake, AWS Glue, Apache Hudi, and Amazon QuickSight

Optimize data layout by bucketing with Amazon Athena and AWS Glue to accelerate downstream queries

The Differences Between Data Warehouses and Data Lakes

Deriving Value from Data Lakes with AI

Data Lakes: What Are They and Who Needs Them?

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

Efficiently crawl your data lake and improve data access with an AWS Glue crawler using partition indexes

Accelerate data science feature engineering on transactional data lakes using Amazon Athena with Apache Iceberg

7 key Microsoft Azure analytics services (plus one extra)

How GamesKraft uses Amazon Redshift data sharing to support growing analytics workloads

Announcing the AWS Well-Architected Data Analytics Lens

AWS Lake Formation 2023 year in review

Optimize your Go To Market with AI and ML-driven Analytics platforms

Driving Business Value and ROI from a Hybrid Cloud Data Lake

Why optimize your warehouse with a data lakehouse strategy

Introducing Apache Hudi support with AWS Glue crawlers

Lay the groundwork now for advanced analytics and AI

How smava makes loans transparent and affordable using Amazon Redshift Serverless

What I Learned At Gartner Data & Analytics 2022
