Data Lake, Data Warehouse, Interactive and Publishing

Data Lake

Data Warehouse

Interactive

Publishing

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

AWS Big Data

MARCH 10, 2023

Since the deluge of big data over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.

Data Lake

Data Lake Sales Data Warehouse Snapshot

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

AWS Big Data

SEPTEMBER 13, 2023

A modern data architecture is an evolutionary architecture pattern designed to integrate a data lake, data warehouse, and purpose-built stores with a unified governance model. Of those tables, some are larger (such as in terms of record volume) than others, and some are updated more frequently than others.

Data Lake

Data Lake Data Processing Metadata Snapshot

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Trending Sources

Enable business users to analyze large datasets in your data lake with Amazon QuickSight

AWS Big Data

JUNE 23, 2023

Events and many other security data types are stored in Imperva’s Threat Research Multi-Region data lake. Imperva harnesses data to improve their business outcomes. As part of their solution, they are using Amazon QuickSight to unlock insights from their data.

Data Lake

Data Lake Cost-Benefit Dashboards Data Warehouse

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Cloudera Data Warehouse Demonstrates Best-in-Class Cloud-Native Price-Performance

Cloudera

JANUARY 15, 2021

Cloud data warehouses allow users to run analytic workloads with greater agility, better isolation and scale, and lower administrative overhead than ever before. The results demonstrate superior price performance of Cloudera Data Warehouse on the full set of 99 queries from the TPC-DS benchmark. Introduction.

Data Warehouse

Data Warehouse Cost-Benefit Consulting Interactive

Use the Amazon Redshift Data API to interact with Amazon Redshift Serverless

AWS Big Data

APRIL 28, 2023

Amazon Redshift is a fast, scalable, secure, and fully managed cloud data warehouse that makes it simple and cost-effective to analyze all your data using standard SQL and your existing ETL (extract, transform, and load), business intelligence (BI), and reporting tools.

Interactive

Interactive Metadata Data Warehouse Data-driven

Data Modeling 201 for the cloud: designing databases for data warehouses

erwin

JUNE 7, 2022

Designing databases for data warehouses or data marts is intrinsically much different than designing for traditional OLTP systems. Accordingly, data modelers must embrace some new tricks when designing data warehouses and data marts. Figure 1: Pricing for a 4 TB data warehouse in AWS.

Data Warehouse

Data Warehouse Modeling Sales Data Lake

Build and manage your modern data stack using dbt and AWS Glue through dbt-glue, the new “trusted” dbt adapter

AWS Big Data

NOVEMBER 29, 2023

dbt is an open source, SQL-first templating engine that allows you to write repeatable and extensible data transforms in Python and SQL. dbt is predominantly used by data warehouses (such as Amazon Redshift ) customers who are looking to keep their data transform logic separate from storage and engine.

Data Lake

Data Lake Management Metrics Data Warehouse

What is a Data Pipeline?

Jet Global

MAY 9, 2024

The key components of a data pipeline are typically: Data Sources : The origin of the data, such as a relational database , data warehouse, data lake , file, API, or other data store. This can include tasks such as data ingestion, cleansing, filtering, aggregation, or standardization.

Data Lake

Data Lake Data Warehouse Business Intelligence Machine Learning

Automate deployment of an Amazon QuickSight analysis connecting to an Amazon Redshift data warehouse with an AWS CloudFormation template

AWS Big Data

FEBRUARY 16, 2023

Amazon Redshift is the most widely used data warehouse in the cloud, best suited for analyzing exabytes of data and running complex analytical queries. Amazon QuickSight is a fast business analytics service to build visualizations, perform ad hoc analysis, and quickly get business insights from your data.

Data Warehouse

Data Warehouse Sales Visualization Data Processing

Enhance monitoring and debugging for AWS Glue jobs using new job observability metrics, Part 3: Visualization and trend analysis using Amazon QuickSight

AWS Big Data

MARCH 29, 2024

QuickSight makes it straightforward for business users to visualize data in interactive dashboards and reports. venvScriptsactivate.bat After this step, the subsequent steps run within the bounds of the virtual environment on the client machine and interact with the AWS account as needed. Choose Publish dashboard.

Metrics

Metrics Visualization Dashboards Interactive

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

AWS Big Data

MARCH 7, 2023

A data hub contains data at multiple levels of granularity and is often not integrated. It differs from a data lake by offering data that is pre-validated and standardized, allowing for simpler consumption by users. Data hubs and data lakes can coexist in an organization, complementing each other.

Analytics

Analytics Data Warehouse Data Lake Metadata

AWS Lake Formation 2022 year in review

AWS Big Data

JANUARY 31, 2023

We have collected some of the key talks and solutions on data governance, data mesh, and modern data architecture published and presented in AWS re:Invent 2022, and a few data lake solutions built by customers and AWS Partners for easy reference.

Data Lake

Data Lake Data Governance Data Architecture Data Warehouse

Automate large-scale data validation using Amazon EMR and Apache Griffin

AWS Big Data

APRIL 4, 2024

In the depicted architecture and our typical data lake use case, our data either resides n Amazon S3 or is migrated from on premises to Amazon S3 using replication tools such as AWS DataSync or AWS Database Migration Service (AWS DMS). It generates and publishes reports in Amazon S3, which are then accessible via Athena.

Data Quality

Data Quality Data Lake Data Warehouse Data-driven

Accelerate Amazon Redshift secure data use with Satori – Part 1

AWS Big Data

SEPTEMBER 21, 2023

Satori integrates natively with both Amazon Redshift provisioned clusters and Amazon Redshift Serverless for easy setup of your Amazon Redshift data warehouse in the secure Satori portal. In part 2, we will explore how to set up self-service data access with Satori to data stored in Amazon Redshift.

Data Warehouse

Data Warehouse Interactive Data Architecture Data Lake

Data Mesh 101: What it is and Why You Should Care

Ontotext

FEBRUARY 12, 2024

It proposes a technological, architectural, and organizational approach to solving data management problems by breaking up the monolithic data platform and de-centralizing data management across different domain teams and services. Once these domains interact and share data with each other, the mesh emerges.

IT Metadata Data Quality Data Lake

The hidden history of Db2

IBM Big Data Hub

JULY 5, 2022

Back in the 1960s and 70s, vast amounts of data were stored in the world’s new mainframe computers—many of them IBM System/360 machines—and had become a problem. Finally, 13 years after Codd published his paper, IBM Db2 on z/OS was born, and 10 years after that the first IBM Db2 database for LUW was released. . They were expensive.

Data Lake

Data Lake Data Warehouse Publishing Structured Data

What is Data Mesh?

Ontotext

NOVEMBER 16, 2023

In a data mesh, domains are represented by a node, which can be an operational data store (ODS), a data warehouse, or a data lake tailored to the domain’s requirements. Mesh emerges when teams use other domains’ data products and the domains communicate with others in a governed manner.

Metadata

Metadata Data-driven Data Quality Data Architecture

Data Visualization and Visual Analytics: Seeing the World of Data

Sisense

JUNE 30, 2020

Data visualization can either be static or interactive. Interactive visualizations enable users to drill down into data and extract and examine various views of the same dataset, selecting specific data points that they want to see in a visualized format. The role of visualizations in analytics.

Visualization

Visualization Analytics Dashboards Data-driven

What is Data Mapping?

Jet Global

FEBRUARY 23, 2024

This includes cleaning, aggregating, enriching, and restructuring data to fit the desired format. Load : Once data transformation is complete, the transformed data is loaded into the target system, such as a data warehouse, database, or another application.

Data Warehouse

Data Warehouse Reporting Data Transformation Sales

Dimensional modeling in Amazon Redshift

AWS Big Data

JULY 19, 2023

Amazon Redshift is a fully managed and petabyte-scale cloud data warehouse that is used by tens of thousands of customers to process exabytes of data every day to power their analytics workload. You can structure your data, measure business processes, and get valuable insights quickly can be done by using a dimensional model.

Modeling

Modeling Sales Data Warehouse Snapshot

Migrate from Amazon Kinesis Data Analytics for SQL Applications to Amazon Kinesis Data Analytics Studio

AWS Big Data

JUNE 29, 2023

Solution overview For our use case, we use several AWS services to stream, ingest, transform, and analyze sample automotive sensor data in real time using Kinesis Data Analytics Studio. Kinesis Data Analytics Studio allows us to create a notebook, which is a web-based development environment. Provide the following SQL statement.

Data Analytics

Data Analytics Analytics IoT Data Lake

Open Data Lakehouse powered by Iceberg for all your Data Warehouse needs

Cloudera

APRIL 3, 2023

In this blog, we will share with you in detail how Cloudera integrates core compute engines including Apache Hive and Apache Impala in Cloudera Data Warehouse with Iceberg. We will publish follow up blogs for other data services. It allows us to independently upgrade the Virtual Warehouses and Database Catalogs.

Data Warehouse

Data Warehouse Snapshot Metadata Cost-Benefit

Data platform trinity: Competitive or complementary?

IBM Big Data Hub

JANUARY 18, 2023

In another decade, the internet and mobile started the generate data of unforeseen volume, variety and velocity. It required a different data platform solution. Hence, Data Lake emerged, which handles unstructured and structured data with huge volume. Data lakehouse was created to solve these problems.

Data Lake

Data Lake Data Warehouse Data-driven Metadata

Q&A with Greg Rahn – The changing Data Warehouse market

Cloudera

DECEMBER 12, 2018

After having rebuilt their data warehouse, I decided to take a little bit more of a pointed role, and I joined Oracle as a database performance engineer. I spent eight years in the real-world performance group where I specialized in high visibility and high impact data warehousing competes and benchmarks.

Data Warehouse

Data Warehouse Marketing Big Data Data Lake

The Gartner 2021 Leadership Vision for Data & Analytics Leaders Webinar Q&A

Andrew White

JANUARY 11, 2021

This was for the Chief Data Officer, or head of data and analytics. Gartner also published the same piece of research for other roles, such as Application and Software Engineering. Data and Analytics Governance: Whats Broken, and What We Need To Do To Fix It. Link Data to Business Outcomes. Very interesting.

Data Analytics

Data Analytics Analytics Data-driven Finance

How The CIO Can Become The CMO’s Best Ally In The Use Of Data

CIO Business Intelligence

SEPTEMBER 21, 2022

“The good news for many CIOs is that they’ve already laid the groundwork through investments in data governance and migration to the cloud,” LiveRamp noted in a recent report. Inconsistent data , which can result in inaccuracies in interacting with customers, and affect the internal operational use of data.

Risk

Risk Data Lake Marketing Interactive

Introducing watsonx: The future of AI for business

IBM Big Data Hub

MAY 9, 2023

As AI becomes more pervasive, businesses need to feel confident that their models can be relied upon not to “hallucinate” facts or use inappropriate language when interacting with customers. With watsonx.data , businesses can quickly connect to data, get trusted insights and reduce data warehouse costs.

Data Warehouse

Data Warehouse Cost-Benefit Machine Learning Modeling

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

APRIL 3, 2019

Most of the data management moved to back-end servers, e.g., databases. So we had three tiers providing a separation of concerns: presentation, logic, data. Note that data warehouse (DW) and business intelligence (BI) practices both emerged circa 1990. in lieu of simply landing in a data lake.

Data Governance

Data Governance Machine Learning Metadata Big Data

Exploring the AI and data capabilities of watsonx

IBM Big Data Hub

JULY 17, 2023

While they require task-specific labeled data for fine tuning, they also offer clients the best cost performance trade-off for non-generative use cases. offers a Prompt Lab, where users can interact with different prompts using prompt engineering on generative AI models for both zero-shot prompting and few-shot prompting.

Machine Learning

Machine Learning Data Warehouse Modeling Cost-Benefit

Data Leaders Brief

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

Simplify operational data processing in data lakes using AWS Glue and Apache Hudi

Webinars

Trending Sources

Enable business users to analyze large datasets in your data lake with Amazon QuickSight

Webinars

Cloudera Data Warehouse Demonstrates Best-in-Class Cloud-Native Price-Performance

Use the Amazon Redshift Data API to interact with Amazon Redshift Serverless

Data Modeling 201 for the cloud: designing databases for data warehouses

Build and manage your modern data stack using dbt and AWS Glue through dbt-glue, the new “trusted” dbt adapter

What is a Data Pipeline?

Automate deployment of an Amazon QuickSight analysis connecting to an Amazon Redshift data warehouse with an AWS CloudFormation template

Enhance monitoring and debugging for AWS Glue jobs using new job observability metrics, Part 3: Visualization and trend analysis using Amazon QuickSight

How gaming companies can use Amazon Redshift Serverless to build scalable analytical applications faster and easier

AWS Lake Formation 2022 year in review

Automate large-scale data validation using Amazon EMR and Apache Griffin

Accelerate Amazon Redshift secure data use with Satori – Part 1

Data Mesh 101: What it is and Why You Should Care

The hidden history of Db2

What is Data Mesh?

Data Visualization and Visual Analytics: Seeing the World of Data

What is Data Mapping?

Dimensional modeling in Amazon Redshift

Migrate from Amazon Kinesis Data Analytics for SQL Applications to Amazon Kinesis Data Analytics Studio

Open Data Lakehouse powered by Iceberg for all your Data Warehouse needs

Data platform trinity: Competitive or complementary?

Q&A with Greg Rahn – The changing Data Warehouse market

The Gartner 2021 Leadership Vision for Data & Analytics Leaders Webinar Q&A

How The CIO Can Become The CMO’s Best Ally In The Use Of Data

Introducing watsonx: The future of AI for business

Themes and Conferences per Pacoid, Episode 8

Exploring the AI and data capabilities of watsonx

Stay Connected