Data Architecture, Data Lake, Data Warehouse and Events

Data Architecture

Data Lake

Data Warehouse

Events

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

AWS Big Data

APRIL 3, 2024

They understand that a one-size-fits-all approach no longer works, and recognize the value in adopting scalable, flexible tools and open data formats to support interoperability in a modern data architecture to accelerate the delivery of new solutions.

Data Lake

Data Lake Snapshot Metadata Data Architecture

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

AWS Big Data

SEPTEMBER 13, 2023

The Analytics specialty practice of AWS Professional Services (AWS ProServe) helps customers across the globe with modern data architecture implementations on the AWS Cloud. Of those tables, some are larger (such as in terms of record volume) than others, and some are updated more frequently than others.

Data Lake

Data Lake Data Processing Metadata Snapshot

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Trending Sources

Choosing an open table format for your transactional data lake on AWS

AWS Big Data

JUNE 9, 2023

A modern data architecture enables companies to ingest virtually any type of data through automated pipelines into a data lake, which provides highly durable and cost-effective object storage at petabyte or exabyte scale.

Data Lake

Data Lake Metadata Optimization Statistics

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

AWS Big Data

MARCH 28, 2023

In a data warehouse, a dimension is a structure that categorizes facts and measures in order to enable users to answer business questions. As organizations across the globe are modernizing their data platforms with data lakes on Amazon Simple Storage Service (Amazon S3), handling SCDs in data lakes can be challenging.

Data Lake

Data Lake Testing Snapshot Sales

Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation

AWS Big Data

JANUARY 24, 2023

This solution only replicates metadata in the Data Catalog, not the actual underlying data. To have a redundant data lake using Lake Formation and AWS Glue in an additional Region, we recommend replicating the Amazon S3-based storage using S3 replication , S3 sync, aws-s3-copy-sync-using-batch or S3 Batch replication process.

Data Architecture

Data Architecture Metadata Data Lake Snapshot

Deploy and Optimize Your Snowflake Environment Faster With Accelerators

CDW Research Hub

JULY 18, 2022

While many organizations understand the business need for a data and analytics cloud platform , few can quickly modernize their legacy data warehouse due to a lack of skills, resources, and data literacy. Security Data Lake. Learn more about our Security Data Lake Solution.

Optimization

Optimization Data Lake Data Warehouse Manufacturing

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

AWS Big Data

MARCH 7, 2024

At the same time, they need to optimize operational costs to unlock the value of this data for timely insights and do so with a consistent performance. With this massive data growth, data proliferation across your data stores, data warehouse, and data lakes can become equally challenging.

Data Lake

Data Lake Analytics Dashboards Metrics

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

To speed up the self-service analytics and foster innovation based on data, a solution was needed to provide ways to allow any team to create data products on their own in a decentralized manner. To create and manage the data products, smava uses Amazon Redshift , a cloud data warehouse.

Data Lake

Data Lake Data Warehouse Data-driven B2B

How GamesKraft uses Amazon Redshift data sharing to support growing analytics workloads

AWS Big Data

NOVEMBER 13, 2023

Amazon Redshift is a fully managed data warehousing service that offers both provisioned and serverless options, making it more efficient to run and scale analytics without having to manage your data warehouse. These upstream data sources constitute the data producer components.

Data Warehouse

Data Warehouse Data Lake Analytics Data Science

AWS Lake Formation 2022 year in review

AWS Big Data

JANUARY 31, 2023

We have collected some of the key talks and solutions on data governance, data mesh, and modern data architecture published and presented in AWS re:Invent 2022, and a few data lake solutions built by customers and AWS Partners for easy reference.

Data Lake

Data Lake Data Governance Data Architecture Machine Learning

Exploring real-time streaming for generative AI Applications

AWS Big Data

MARCH 25, 2024

For example, in a chatbot, data events could pertain to an inventory of flights and hotels or price changes that are constantly ingested to a streaming storage engine. Furthermore, data events are filtered, enriched, and transformed to a consumable format using a stream processor.

Data Lake

Data Lake Unstructured Data Management Modeling

How Knowledge Graphs Power Data Mesh and Data Fabric

Ontotext

APRIL 10, 2024

“Any enterprise CEO really ought to be able to ask a question that involves connecting data across the organization, be able to run a company effectively, and especially to be able to respond to unexpected events. Most organizations are missing this ability to connect all the data together.”

Metadata

Metadata Data Lake Data Warehouse Data Quality

Your guide to AWS Analytics at AWS re:Invent 2023

AWS Big Data

NOVEMBER 13, 2023

Visit us at the AWS Analytics Kiosk in the AWS Village at the Expo to discover the AWS Analytics Superhero in you, participate in a playful quiz and AWS book signing events. 11:30 AM – 12:30 PM (PDT) Ceasars Forum ANT318 | Accelerate innovation with end-to-end serverless data architecture.

Analytics

Analytics Data Lake Data Warehouse Data-driven

Simplify access management with Amazon Redshift and AWS Lake Formation for users in an External Identity Provider

AWS Big Data

FEBRUARY 15, 2024

You might be modernizing your data architecture using Amazon Redshift to enable access to your data lake and data in your data warehouse, and are looking for a centralized and scalable way to define and manage the data access based on IdP identities. Select the data share and choose Authorize.

Management

Management Data Lake Sales Data Warehouse

Building a vision for real-time artificial intelligence

CIO Business Intelligence

APRIL 12, 2023

After walking his executive team through the data hops, flows, integrations, and processing across different ingestion software, databases, and analytical platforms, they were shocked by the complexity of their current data architecture and technology stack. It isn’t easy.

Machine Learning

Machine Learning Cost-Benefit Data-driven Strategy

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

You can subscribe to data products that help enrich customer profiles, for example demographics data, advertising data, and financial markets data. Amazon Kinesis ingests streaming events in real time from point-of-sales systems, clickstream data from mobile apps and websites, and social media data.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Demystifying Modern Data Platforms

Cloudera

SEPTEMBER 15, 2022

July brings summer vacations, holiday gatherings, and for the first time in two years, the return of the Massachusetts Institute of Technology (MIT) Chief Data Officer symposium as an in-person event. A key area of focus for the symposium this year was the design and deployment of modern data platforms. What is a data fabric?

Data Lake

Data Lake Data Architecture Data-driven Data Warehouse

Architectural patterns for real-time analytics using Amazon Kinesis Data Streams, part 1

AWS Big Data

JANUARY 8, 2024

It aims to provide a framework to create low-latency streaming applications on the AWS Cloud using Amazon Kinesis Data Streams and AWS purpose-built data analytics services. In this post, we will review the common architectural patterns of two use cases: Time Series Data Analysis and Event Driven Microservices.

Analytics

Analytics IoT Data-driven Snapshot

Generic orchestration framework for data warehousing workloads using Amazon Redshift RSQL

AWS Big Data

APRIL 3, 2023

Tens of thousands of customers run business-critical workloads on Amazon Redshift , AWS’s fast, petabyte-scale cloud data warehouse delivering the best price-performance. With Amazon Redshift, you can query data across your data warehouse, operational data stores, and data lake using standard SQL.

Data Warehouse

Data Warehouse Testing Data Lake Data-driven

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

AWS Big Data

NOVEMBER 9, 2023

The aim was to bolster their analytical capabilities and improve data accessibility while ensuring a quick time to market and high data quality, all with low total cost of ownership (TCO) and no need for additional tools or licenses. This process has been scheduled to run daily, ensuring a consistent batch of fresh data for analysis.

Data Warehouse

Data Warehouse Testing Data Quality Reporting

Data science vs data analytics: Unpacking the differences

IBM Big Data Hub

SEPTEMBER 19, 2023

How effectively and efficiently an organization can conduct data analytics is determined by its data strategy and data architecture , which allows an organization, its users and its applications to access different types of data regardless of where that data resides.

Data Science

Data Science Data Analytics Prescriptive Analytics Analytics

Exploring new ETL and ELT capabilities for Amazon Redshift from the AWS Glue Studio visual editor

AWS Big Data

APRIL 20, 2023

In a modern data architecture, unified analytics enable you to access the data you need, whether it’s stored in a data lake or a data warehouse. For the purposes of this post, we write our own custom query that joins data from the preloaded event table and venue table.

Visualization

Visualization Data Warehouse Big Data Data Lake

Visualize data quality scores and metrics generated by AWS Glue Data Quality

AWS Big Data

JUNE 6, 2023

The following architecture diagram shows an overview of the complete pipeline. These are six main steps in the data pipeline: Amazon EventBridge triggers an AWS Lambda function when the event pattern for AWS Glue Data Quality matches the defined rule. Avik Bhattacharjee is a Senior Partner Solutions Architect at AWS.

Data Quality

Data Quality Metrics Visualization Dashboards

This Structure has Novel Features which are of Considerable Business Interest

Peter James Thomas

APRIL 3, 2020

The way that this consistency of figures is achieved is by all elements of the Structured Reporting Framework drawing their data from the same data repositories. Without paying attention to this, your shiny warehouse or data lake will be a technological curiosity, not an indispensable business tool.

Dashboards

Dashboards Reporting Sales Data Lake

AWS re:Invent Recap: The Future of Cloud

Alation

DECEMBER 14, 2021

How do you provide access and connect the right people to the right data? AWS has created a way to manage policies and access, but this is only for data lake formation. What about other data sources? Redshift , AWS’ data warehouse that powers data exchange, provides 3x performance (3TB, 30 Tb, 100Tb dataset).

Data Lake

Data Lake Data Warehouse Machine Learning Cost-Benefit

An Introduction to Disaster Recovery with the Cloudera Data Platform

Cloudera

AUGUST 9, 2022

Data platforms are no longer skunkworks projects or science experiments. As customers import their mainframe and legacy data warehouse workloads, there is an expectation on the platform that it can meet, if not exceed, the resilience of the prior system and its associated dependencies. Conclusion.

Data Lake

Data Lake Data Warehouse Data-driven IoT

The Cloud Connection: How Governance Supports Security

Alation

APRIL 14, 2022

Supports the ability to interact with the actual data and perform analysis on it. This provides the facility a time or event for a job to run and offers useful post-run information. Similar to a data warehouse schema, this prep tool automates the development of the recipe to match. Scheduling. Target Matching.

Metadata

Metadata Data Governance Modeling Data-driven

What Is Data Modernization? 5 Benefits Worth Knowing

Alation

APRIL 19, 2022

Data modernization is the process of transferring data to modern cloud-based databases from outdated or siloed legacy databases, including structured and unstructured data. In that sense, data modernization is synonymous with cloud migration. Only then can you extract insights across fragmented data architecture.

Cost-Benefit

Cost-Benefit Data Governance Manufacturing Data Architecture

5 Key Takeaways from Flink Forward 2023

Cloudera

NOVEMBER 27, 2023

billion events/s. Consider a few factors: First, many have been using Kafka as long-term storage and have seen their clusters grow without the same elasticity and accessibility one would expect from a modern data lake. For now, Flink plus Iceberg is the compute plus storage solution for streaming data.

Data Lake

Data Lake Advertising ROI Data Warehouse

Data Leaders Brief

Use Apache Iceberg in your data lake with Amazon S3, AWS Glue, and Snowflake

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Webinars

Trending Sources

Choosing an open table format for your transactional data lake on AWS

Webinars

Implement slowly changing dimensions in a data lake using AWS Glue and Delta

Build a multi-Region and highly resilient modern data architecture using AWS Glue and AWS Lake Formation

Deploy and Optimize Your Snowflake Environment Faster With Accelerators

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

How smava makes loans transparent and affordable using Amazon Redshift Serverless

How GamesKraft uses Amazon Redshift data sharing to support growing analytics workloads

AWS Lake Formation 2022 year in review

Exploring real-time streaming for generative AI Applications

How Knowledge Graphs Power Data Mesh and Data Fabric

Your guide to AWS Analytics at AWS re:Invent 2023

Simplify access management with Amazon Redshift and AWS Lake Formation for users in an External Identity Provider

Building a vision for real-time artificial intelligence

Create an end-to-end data strategy for Customer 360 on AWS

Demystifying Modern Data Platforms

Architectural patterns for real-time analytics using Amazon Kinesis Data Streams, part 1

Generic orchestration framework for data warehousing workloads using Amazon Redshift RSQL

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

Data science vs data analytics: Unpacking the differences

Exploring new ETL and ELT capabilities for Amazon Redshift from the AWS Glue Studio visual editor

Visualize data quality scores and metrics generated by AWS Glue Data Quality

This Structure has Novel Features which are of Considerable Business Interest

AWS re:Invent Recap: The Future of Cloud

An Introduction to Disaster Recovery with the Cloudera Data Platform

The Cloud Connection: How Governance Supports Security

What Is Data Modernization? 5 Benefits Worth Knowing

5 Key Takeaways from Flink Forward 2023

Stay Connected