Analytics, Dashboards, Data Architecture and Data Lake

Analytics

Dashboards

Data Architecture

Data Lake

Migrate an existing data lake to a transactional data lake using Apache Iceberg

AWS Big Data

OCTOBER 3, 2023

A data lake is a centralized repository that you can use to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure the data and then run different types of analytics for better business insights.

Data Lake

Data Lake Metadata Snapshot Recreation/Entertainment

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

AWS Big Data

MARCH 7, 2024

At the same time, they need to optimize operational costs to unlock the value of this data for timely insights and do so with a consistent performance. With this massive data growth, data proliferation across your data stores, data warehouse, and data lakes can become equally challenging.

Data Lake

Data Lake Analytics Dashboards Metrics

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Trending Sources

DataOps For Business Analytics Teams

DataKitchen

JANUARY 3, 2022

Their business unit colleagues ask an endless stream of urgent questions that require analytic insights. Business analysts must rapidly deliver value and simultaneously manage fragile and error-prone analytics production pipelines. In business analytics, fire-fighting and stress are common. Analytics Hub and Spoke.

Business Analytics

Business Analytics Analytics Testing Dashboards

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

AWS Big Data

APRIL 24, 2023

Building a data lake on Amazon Simple Storage Service (Amazon S3) provides numerous benefits for an organization. However, many use cases, like performing change data capture (CDC) from an upstream relational database to an Amazon S3-based data lake, require handling data at a record level.

Data Lake

Data Lake Data Governance Cost-Benefit Machine Learning

Centralize Your Data Processes With a DataOps Process Hub

DataKitchen

NOVEMBER 4, 2021

Data organizations often have a mix of centralized and decentralized activity. DataOps concerns itself with the complex flow of data across teams, data centers and organizational boundaries. It expands beyond tools and data architecture and views the data organization from the perspective of its processes and workflows.

Data Processing

Data Processing Data Lake Cost-Benefit Testing

Data science vs data analytics: Unpacking the differences

IBM Big Data Hub

SEPTEMBER 19, 2023

Though you may encounter the terms “data science” and “data analytics” being used interchangeably in conversations or online, they refer to two distinctly different concepts. Meanwhile, data analytics is the act of examining datasets to extract value and find answers to specific questions.

Data Science

Data Science Data Analytics Prescriptive Analytics Analytics

Architectural patterns for real-time analytics using Amazon Kinesis Data Streams, part 1

AWS Big Data

JANUARY 8, 2024

It aims to provide a framework to create low-latency streaming applications on the AWS Cloud using Amazon Kinesis Data Streams and AWS purpose-built data analytics services. In this post, we will review the common architectural patterns of two use cases: Time Series Data Analysis and Event Driven Microservices.

Analytics

Analytics IoT Data-driven Snapshot

Why the Data Journey Manifesto?

DataKitchen

JUNE 12, 2023

We had been talking about “Agile Analytic Operations,” “DevOps for Data Teams,” and “Lean Manufacturing For Data,” but the concept was hard to get across and communicate. I spent much time de-categorizing DataOps: we are not discussing ETL, Data Lake, or Data Science.

Testing

Testing Data Lake Dashboards Data Science

Porsche Carrera Cup Brasil gets real-time data boost

CIO Business Intelligence

MAY 21, 2024

Defining a strategic relationship In July 2023, Dener Motorsport began working with Microsoft Fabric to get at that data in real-time, specifically Fabric components Synapse Real-Time Analytics for data streaming analysis, and Data Activator to monitor and trigger actions in real-time.

Broadcasting

Broadcasting Recreation/Entertainment Manufacturing Data Lake

The New Normal for FP&A: Data Analytics

Jedox

OCTOBER 22, 2020

The term “data analytics” refers to the process of examining datasets to draw conclusions about the information they contain. Data analysis techniques enhance the ability to take raw data and uncover patterns to extract valuable insights from it. Data analytics is not new. Self-service reporting.

Data Analytics

Data Analytics Analytics Unstructured Data Data mining

Deep dive into the AWS ProServe Hadoop Migration Delivery Kit TCO tool

AWS Big Data

FEBRUARY 6, 2023

Amazon QuickSight dashboards showcase the results from the analyzer. With QuickSight, you can visualize YARN log data and conduct analysis against the datasets generated by pre-built dashboard templates and a widget. The following diagram illustrates the HMDK TCO architecture.

Dashboards

Dashboards Optimization Data Lake Cost-Benefit

Introducing the AWS ProServe Hadoop Migration Delivery Kit TCO tool

AWS Big Data

FEBRUARY 6, 2023

Use case overview Migrating Hadoop workloads to Amazon EMR accelerates big data analytics modernization, increases productivity, and reduces operational cost. Refactoring coupled compute and storage to a decoupling architecture is a modern data solution. Let’s see some real customer use cases in the following sections.

Cost-Benefit

Cost-Benefit Data Lake Dashboards Big Data

Building a vision for real-time artificial intelligence

CIO Business Intelligence

APRIL 12, 2023

After walking his executive team through the data hops, flows, integrations, and processing across different ingestion software, databases, and analytical platforms, they were shocked by the complexity of their current data architecture and technology stack. A unified data ecosystem enables this in real time.

Machine Learning

Machine Learning Cost-Benefit Data-driven Strategy

Exploring real-time streaming for generative AI Applications

AWS Big Data

MARCH 25, 2024

To understand the best ways to make API calls via Apache Flink, refer to Common streaming data enrichment patterns in Amazon Kinesis Data Analytics for Apache Flink. OpenSearch Service provides support for native ingestion from Kinesis data streams or MSK topics.

Data Lake

Data Lake Unstructured Data Management Modeling

Visualize data quality scores and metrics generated by AWS Glue Data Quality

AWS Big Data

JUNE 6, 2023

In this post, we highlight the seamless integration of Amazon Athena and Amazon QuickSight , which enables the visualization of operational metrics for AWS Glue Data Quality rule evaluation in an efficient and effective manner. The following architecture diagram shows an overview of the complete pipeline. Choose Visualize.

Data Quality

Data Quality Metrics Visualization Dashboards

Convergent Evolution

Peter James Thomas

AUGUST 18, 2018

That was the Science, here comes the Technology… A Brief Hydrology of Data Lakes. Even back then, these were used for activities such as Analytics , Dashboards , Statistical Modelling , Data Mining and Advanced Visualisation. Of course some architectures featured both paradigms as well.

Data Lake

Data Lake Data Warehouse Data mining Statistics

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

AWS Big Data

NOVEMBER 9, 2023

It allows users to write data transformation code, run it, and test the output, all within the framework it provides. Use case The Enterprise Data Analytics group of a large jewelry retailer embarked on their cloud journey with AWS in 2021. Third-party APIs – These provide analytics and survey data related to ecommerce websites.

Data Warehouse

Data Warehouse Testing Data Quality Reporting

5 misconceptions about cloud data warehouses

IBM Big Data Hub

FEBRUARY 2, 2023

In today’s world, data warehouses are a critical component of any organization’s technology ecosystem. They provide the backbone for a range of use cases such as business intelligence (BI) reporting, dashboarding, and machine-learning (ML)-based predictive analytics, that enable faster decision making and insights.

Data Warehouse

Data Warehouse Cost-Benefit Unstructured Data Data Architecture

This Structure has Novel Features which are of Considerable Business Interest

Peter James Thomas

APRIL 3, 2020

Some espouse the opinion that the term is synonymous with Dashboards. Jane opened up her personal dashboard, which already showed the headline figures the CFO had been citing. Some charts or tables may be replicated across a number of dashboards, but others with be specific to a particular area of the business.

Dashboards

Dashboards Reporting Sales Data Lake

How GamesKraft uses Amazon Redshift data sharing to support growing analytics workloads

AWS Big Data

NOVEMBER 13, 2023

Amazon Redshift is a fully managed data warehousing service that offers both provisioned and serverless options, making it more efficient to run and scale analytics without having to manage your data warehouse. These upstream data sources constitute the data producer components.

Data Warehouse

Data Warehouse Data Lake Analytics Data Science

Eight Top DataOps Trends for 2022

DataKitchen

NOVEMBER 29, 2021

As a result, enterprises will examine their end-to-end data operations and analytics creation workflows. Instead of allowing technology to be a barrier to teamwork, leading data organizations in 2022 will further expand the automation of workflows to improve and facilitate communication and coordination between the groups.

Testing

Testing Data Lake Data Architecture Manufacturing

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

AWS Big Data

NOVEMBER 29, 2023

Since then, customer demands for better scale, higher throughput, and agility in handling a wide variety of changing, but increasingly business critical analytics and machine learning use cases has exploded, and we have been keeping pace. Let’s dive into the highlights.

Data Warehouse

Data Warehouse Data Lake Analytics Machine Learning

Get maximum value out of your cloud data warehouse with Amazon Redshift

AWS Big Data

APRIL 19, 2023

With the right analytics approach, this is possible. In this post, we look at three key challenges that customers face with growing data and how a modern data warehouse and analytics system like Amazon Redshift can meet these challenges across industries and segments.

Data Warehouse

Data Warehouse Data Lake Unstructured Data Optimization

Estimating Scope 1 Carbon Footprint with Amazon Athena

AWS Big Data

AUGUST 2, 2023

In this blog, we will walk through how we can apply existing enterprise data to better understand and estimate Scope 1 carbon footprint using Amazon Simple Storage Service (S3) and Amazon Athena , a serverless interactive analytics service that makes it easy to analyze data using standard SQL.

Data Lake

Data Lake Measurement Visualization Data Architecture

Simplify access management with Amazon Redshift and AWS Lake Formation for users in an External Identity Provider

AWS Big Data

FEBRUARY 15, 2024

You might be modernizing your data architecture using Amazon Redshift to enable access to your data lake and data in your data warehouse, and are looking for a centralized and scalable way to define and manage the data access based on IdP identities. Choose Register location.

Management

Management Data Lake Sales Data Warehouse

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

Cloudera

JUNE 30, 2022

These tools empower analysts and data scientists to easily collaborate on the same data, with their choice of tools and analytic engines. No more lock-in, unnecessary data transformations, or data movement across tools and clouds just to extract insights out of the data.

Data Lake

Data Lake Data Architecture Metadata Data Warehouse

A Day in the Life of a DataOps Engineer

DataKitchen

OCTOBER 11, 2021

First, you must understand the existing challenges of the data team, including the data architecture and end-to-end toolchain. The biggest challenge is broken data pipelines due to highly manual processes. Figure 1 shows a manually executed data analytics pipeline. DataOps Project Design and Implementation.

Testing

Testing Metadata Dashboards Statistics

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

In this post, we discuss how you can use purpose-built AWS services to create an end-to-end data strategy for C360 to unify and govern customer data that address these challenges. We recommend building your data strategy around five pillars of C360, as shown in the following figure.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless

AWS Big Data

MARCH 3, 2023

Building data lakes from continuously changing transactional data of databases and keeping data lakes up to date is a complex task and can be an operational challenge. You can then apply transformations and store data in Delta format for managing inserts, updates, and deletes.

Data Lake

Data Lake Dashboards Metrics Metadata

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

AWS Big Data

NOVEMBER 16, 2023

Amazon Redshift is a popular cloud data warehouse, offering a fully managed cloud-based service that seamlessly integrates with an organization’s Amazon Simple Storage Service (Amazon S3) data lake, real-time streams, machine learning (ML) workflows, transactional workflows, and much more—all while providing up to 7.9x

Enterprise

Enterprise Data Warehouse Data Lake Optimization

A Simple Data Capability Framework

Peter James Thomas

MAY 3, 2019

Collation of Data to provide Information. This area includes what is often described as “traditional” reporting [3] , Dashboards and analysis facilities. Leverage of Data to generate Insight. In this second area we have disciplines such as Analytics and Data Science. Data Architecture / Infrastructure.

Strategy

Strategy Data Architecture Data Quality Data Strategy

Connecting the Data Lifecycle

Cloudera

NOVEMBER 29, 2021

Carrefour Spain , a branch of the larger company (with 1,250 stores), processes over 3 million transactions every day, giving rise to challenges like creating and managing a data lake and honing down key demographic information. . Working with Cloudera, Carrefour Spain was able to create a unified data lake for ease of data handling.

Data Lake

Data Lake Data Warehouse Data Architecture Reporting

Exploring new ETL and ELT capabilities for Amazon Redshift from the AWS Glue Studio visual editor

AWS Big Data

APRIL 20, 2023

In a modern data architecture, unified analytics enable you to access the data you need, whether it’s stored in a data lake or a data warehouse. In this case, the ETL developer would choose TRUNCATE to ensure the data is fully refreshed but the table schema is guaranteed not to change.

Visualization

Visualization Data Warehouse Big Data Data Lake

Your guide to AWS Analytics at AWS re:Invent 2023

AWS Big Data

NOVEMBER 13, 2023

Join the AWS Analytics team at AWS re:Invent this year, where new ideas and exciting innovations come together. For those in the data world, this post provides a curated guide for all analytics sessions that you can use to quickly schedule and build your itinerary. A shapeshifting guardian and protector of data like Data Lynx?

Analytics

Analytics Data Lake Data Warehouse Data-driven

AWS re:Invent 2023 Amazon Redshift Sessions Recap

AWS Big Data

DECEMBER 18, 2023

Amazon Redshift powers data-driven decisions for tens of thousands of customers every day with a fully managed, AI-powered cloud data warehouse, delivering the best price-performance for your analytics workloads.

Data Warehouse

Data Warehouse Machine Learning Data-driven Data Lake

Data democratization: How data architecture can drive business decisions and AI initiatives

IBM Big Data Hub

AUGUST 4, 2023

When effectively implemented, a data democracy simplifies the data stack, eliminates data gatekeepers, and makes the company’s comprehensive data platform easily accessible by different teams via a user-friendly dashboard. Then, it applies these insights to automate and orchestrate the data lifecycle.

Data Architecture

Data Architecture Data Lake Machine Learning Data Governance

Flexible and secure Data-as-a-Service delivered today

Birst BI

OCTOBER 24, 2019

Data-as-a-Service (DaaS) streamlines the chaos of ungoverned data pipelines and reporting silos created by users who are eager to use data, whether they are simple data inquiries from business analysts to more complex data questions from data science teams. How does DaaS fit into this architecture?

Data Warehouse

Data Warehouse Recreation/Entertainment Data Lake Data Architecture

Showpad accelerates data maturity to unlock innovation using Amazon QuickSight

AWS Big Data

APRIL 5, 2023

The company decided to use AWS to unify its business intelligence (BI) and reporting strategy for both internal organization-wide use cases and in-product embedded analytics targeted at its customers. In this post, we share how Showpad used QuickSight to streamline data and insights access across teams and customers.

Dashboards

Dashboards Reporting Cost-Benefit Visualization

How Amazon Finance Automation built a data mesh to support distributed data ownership and centralize governance

AWS Big Data

JULY 14, 2023

In this post, we discuss how the Amazon Finance Automation team used AWS Lake Formation and the AWS Glue Data Catalog to build a data mesh architecture that simplified data governance at scale and provided seamless data access for analytics, AI, and machine learning (ML) use cases.

Finance

Finance Metadata Big Data Recreation/Entertainment

Migrate an existing data lake to a transactional data lake using Apache Iceberg

Petabyte-scale log analytics with Amazon S3, Amazon OpenSearch Service, and Amazon OpenSearch Ingestion

Webinars

Trending Sources

DataOps For Business Analytics Teams

Webinars

Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena

Centralize Your Data Processes With a DataOps Process Hub

Data science vs data analytics: Unpacking the differences

Architectural patterns for real-time analytics using Amazon Kinesis Data Streams, part 1

Why the Data Journey Manifesto?

Porsche Carrera Cup Brasil gets real-time data boost

The New Normal for FP&A: Data Analytics

Deep dive into the AWS ProServe Hadoop Migration Delivery Kit TCO tool

Introducing the AWS ProServe Hadoop Migration Delivery Kit TCO tool

Building a vision for real-time artificial intelligence

Exploring real-time streaming for generative AI Applications

Visualize data quality scores and metrics generated by AWS Glue Data Quality

Convergent Evolution

Create a modern data platform using the Data Build Tool (dbt) in the AWS Cloud

5 misconceptions about cloud data warehouses

This Structure has Novel Features which are of Considerable Business Interest

How GamesKraft uses Amazon Redshift data sharing to support growing analytics workloads

Eight Top DataOps Trends for 2022

Amazon Redshift announcements at AWS re:Invent 2023 to enable analytics on all your data

Get maximum value out of your cloud data warehouse with Amazon Redshift

Estimating Scope 1 Carbon Footprint with Amazon Athena

Simplify access management with Amazon Redshift and AWS Lake Formation for users in an External Identity Provider

Supercharge Your Data Lakehouse with Apache Iceberg in Cloudera Data Platform

A Day in the Life of a DataOps Engineer

Create an end-to-end data strategy for Customer 360 on AWS

Build incremental data pipelines to load transactional data changes using AWS DMS, Delta 2.0, and Amazon EMR Serverless

Power enterprise-grade Data Vaults with Amazon Redshift – Part 1

A Simple Data Capability Framework

Connecting the Data Lifecycle

Exploring new ETL and ELT capabilities for Amazon Redshift from the AWS Glue Studio visual editor

Your guide to AWS Analytics at AWS re:Invent 2023

AWS re:Invent 2023 Amazon Redshift Sessions Recap

Data democratization: How data architecture can drive business decisions and AI initiatives

Flexible and secure Data-as-a-Service delivered today

Showpad accelerates data maturity to unlock innovation using Amazon QuickSight

How Amazon Finance Automation built a data mesh to support distributed data ownership and centralize governance

Stay Connected