Data Lake - Data Leaders Brief

Setting up Data Lake on GCP using Cloud Storage and BigQuery

Analytics Vidhya

FEBRUARY 25, 2023

Introduction A data lake is a centralized and scalable repository storing structured and unstructured data. The need for a data lake arises from the growing volume, variety, and velocity of data companies need to manage and analyze.

Setting up Data Lake on GCP using Cloud Storage and BigQuery

Connecting and Reading Data From Azure Data Lake

Webinars

Trending Sources

An Overview of Using Azure Data Lake Storage Gen2

Webinars

Multicloud data lake analytics with Amazon Athena

Top Considerations for Building an Open Cloud Data Lake

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Monitor data pipelines in a serverless data lake

Delta Lake: A Comprehensive Introduction

Architecture for the Data Lake

The Next-Generation Cloud Data Lake: An Open, No-Copy Data Architecture

Using AWS AppSync and AWS Lake Formation to access a secure data lake through a GraphQL API

Query your Iceberg tables in data lake using Amazon Redshift (Preview)

Data Lakes on Cloud & it’s Usage in Healthcare

Orca Security’s journey to a petabyte-scale data lake with Apache Iceberg and AWS Analytics

12 Considerations When Evaluating Data Lake Engine Vendors for Analytics and BI

How a Delta Lake is Process with Azure Synapse Analytics

Choosing an open table format for your transactional data lake on AWS

Use Apache Iceberg in a data lake to support incremental data processing

Empower your Jira data in a data lake with Amazon AppFlow and AWS Glue

Checklist Report: Preparing for the Next-Generation Cloud Data Architecture

The Key Components of a Successful Data Lake Strategy

The Key Components of a Successful Data Lake Strategy

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Ultimate Guide to the Cloud Data Lake Engine

Secure cloud fabric: Enhancing data management and AI development for the federal government

Race Ahead of Threats with a Security Data Lake

Data Modeling 301 for the cloud: data lake and NoSQL data modeling and design

Enable business users to analyze large datasets in your data lake with Amazon QuickSight

Data Analytics in the Cloud for Developers and Founders

Perform upserts in a data lake using Amazon Athena and Apache Iceberg

Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue

Build a real-time GDPR-aligned Apache Iceberg data lake

Data replication holds the key to hybrid cloud effectiveness

The Differences Between Data Warehouses and Data Lakes

Implement tag-based access control for your data lake and Amazon Redshift data sharing with AWS Lake Formation

DIY cloud cost management: The strategic case for building your own tools

Complexity Drives Costs: A Look Inside BYOD and Azure Data Lakes

Here’s Why Automation For Data Lakes Could Be Important

Rapidminer Platform Supports Entire Data Science Lifecycle

Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

5 things on our data and AI radar for 2021

Deriving Value from Data Lakes with AI

Denodo Advancing Data Virtualization in the Cloud

Stay Connected