Data Governance, Data Lake, Data Processing and Management

Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena

AWS Big Data

MARCH 10, 2023

Since the deluge of big data over a decade ago, many organizations have learned to build applications to process and analyze petabytes of data. Data lakes have served as a central repository to store structured and unstructured data at any scale and in various formats.

Data Lake

Data Lake Sales Data Warehouse Snapshot

How Data Governance Protects Sensitive Data

erwin

APRIL 2, 2021

Organizations are managing more data than ever. With more companies increasingly migrating their data to the cloud to ensure availability and scalability, the risks associated with data management and protection also are growing. Data Security Starts with Data Governance.

Data Governance

Data Governance Cost-Benefit Risk Metadata

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

Governing data in relational databases using Amazon DataZone

AWS Big Data

MAY 7, 2024

Data governance is a key enabler for teams adopting a data-driven culture and operational model to drive innovation with data. Amazon DataZone allows you to simply and securely govern end-to-end data assets stored in your Amazon Redshift data warehouses or data lakes cataloged with the AWS Glue Data Catalog.

Metadata

Metadata Data Lake Data Processing Data-driven

Empowering data-driven excellence: How the Bluestone Data Platform embraced data mesh for success

AWS Big Data

FEBRUARY 27, 2024

In this post, we explore how Bluestone uses AWS services, notably the cloud data warehousing service Amazon Redshift, to implement a cutting-edge data mesh architecture, revolutionizing the way they manage, access, and utilize their data assets. This enables data-driven decision-making across the organization.

Data-driven

Data-driven Data Lake Data Quality Data Governance

How Novo Nordisk built distributed data governance and control at scale

AWS Big Data

APRIL 28, 2023

The first post of this series describes the overall architecture and how Novo Nordisk built a decentralized data mesh architecture, including Amazon Athena as the data query engine. The third post will show how end-users can consume data from their tool of choice, without compromising data governance.

Data Governance

Data Governance Management Data-driven Data Lake

AWS Glue crawlers support cross-account crawling to support data mesh architecture

AWS Big Data

MARCH 27, 2023

Data lakes have come a long way, and there’s been tremendous innovation in this space. Today’s modern data lakes are cloud native, work with multiple data types, and make this data easily available to diverse stakeholders across the business.

Data Lake

Data Lake Data-driven Management Data Architecture

Introducing AWS Glue crawler and create table support for Apache Iceberg format

AWS Big Data

AUGUST 16, 2023

Iceberg has become very popular for its support for ACID transactions in data lakes and features like schema and partition evolution, time travel, and rollback. AWS Glue crawlers will extract schema information and update the location of Iceberg metadata and schema updates in the Data Catalog.

Data Lake

Data Lake Metadata Snapshot Management

How smava makes loans transparent and affordable using Amazon Redshift Serverless

AWS Big Data

DECEMBER 21, 2023

To speed up self-service analytics and foster innovation based on data, a solution was needed that would allow any team to create data products on their own in a decentralized manner. To create and manage the data products, smava uses Amazon Redshift, a cloud data warehouse.

Data Lake

Data Lake Data Warehouse Data-driven B2B

Create an end-to-end data strategy for Customer 360 on AWS

AWS Big Data

MARCH 26, 2024

In this post, we discuss how you can use purpose-built AWS services to create an end-to-end data strategy for C360 to unify and govern customer data that address these challenges. AWS Glue Data Quality checks for and alerts on poor data, making it straightforward to spot and fix issues before they harm your business.

Data Strategy

Data Strategy Strategy Data Warehouse Prescriptive Analytics

Modern Data Architecture for Telecommunications

Cloudera

SEPTEMBER 6, 2022

There remain challenges in workforce management, particularly in call centers, and order backlogs for fiber broadband and other physical infrastructure are being worked through. Previously, there were three types of data structures in telco, among them entity data sets (i.e., marketing data lakes).

Data Architecture

Data Architecture Cost-Benefit Digital Transformation Business Driver

How Amazon Finance Automation built a data mesh to support distributed data ownership and centralize governance

AWS Big Data

JULY 14, 2023

In this post, we discuss how the Amazon Finance Automation team used AWS Lake Formation and the AWS Glue Data Catalog to build a data mesh architecture that simplified data governance at scale and provided seamless data access for analytics, AI, and machine learning (ML) use cases.

Finance

Finance Metadata Big Data Recreation/Entertainment

Announcing the 2020 Data Impact Award Winners

Cloudera

NOVEMBER 18, 2020

Data Impact Achievement Award. United Overseas Bank (UOB), a Singaporean multinational banking organization, is recognized as one of the most excellent and professionally managed financial institutions in Asia. UOB understands that the future is data-driven. Winner: United Overseas Bank. Globe Telecom, Inc. Winner: Telkomsel.

Internet Publishing and Broadcasting

Internet Publishing and Broadcasting Data-driven Broadcasting Digital Transformation

Announcing the 2021 Data Impact Awards

Cloudera

MAY 12, 2021

2020 saw us hosting our first ever fully digital Data Impact Awards ceremony, and it certainly was one of the highlights of our year. We saw a record number of entries and incredible examples of how customers were using Cloudera’s platform and services to unlock the power of data. SECURITY AND GOVERNANCE LEADERSHIP.

Digital Transformation

Digital Transformation Machine Learning Optimization Data Lake

How Cloudera Data Flow Enables Successful Data Mesh Architectures

Cloudera

OCTOBER 7, 2021

The concept of the data mesh architecture is not entirely new; its conceptual origins are rooted in the microservices architecture, its design principles (i.e., the need to integrate multiple “point solutions” used in a data ecosystem) and organizational reasons (e.g., the difficulty of achieving a cross-organizational governance model).

Metadata

Metadata Cost-Benefit Enterprise Interactive

Themes and Conferences per Pacoid, Episode 8

Domino Data Lab

APRIL 3, 2019

Paco Nathan’s latest column dives into data governance. This month’s article features updates from one of the early data conferences of the year, Strata Data Conference – which was held just last week in San Francisco. In particular, here’s my Strata SF talk “Overview of Data Governance” presented in article form.

Data Governance

Data Governance Machine Learning Metadata Big Data

Design a data mesh on AWS that reflects the envisioned organization

AWS Big Data

JANUARY 22, 2024

The existing monolithic, centralized architecture was struggling to meet the growing demands of data consumers. Data engineers were finding it increasingly challenging to maintain and scale the data infrastructure, resulting in data access issues, data silos, and inefficiencies in data management.

Data-driven

Data-driven Advertising Metadata Data Architecture

What Is Alation Connected Sheets? Q&A with the Creators

Alation

NOVEMBER 28, 2022

Krishna Bhat, co-founder & CEO, Kloudio and senior director product management, Alation: Alation Connected Sheets enables business users to pull trusted, governed, and accurate data directly from Alation into Google Sheets for analysis. It is also hard to know whether one can trust the data within a spreadsheet.

Metadata

Metadata Enterprise Cost-Benefit Finance

What is Data Mapping?

Jet Global

FEBRUARY 23, 2024

This field guide to data mapping will explore how data mapping connects volumes of data for enhanced decision-making. Why Data Mapping is Important: Data mapping is a critical element of any data management initiative, such as data integration, data migration, data transformation, data warehousing, or automation.

Data Warehouse

Data Warehouse Reporting Data Transformation Sales

Extreme data center pressure? Burst to the cloud with CDP!

Cloudera

NOVEMBER 12, 2020

In addition, costs generated by independent IT projects frequently skyrocket with few controls in place to manage them. At Cloudera, we listened to our customers’ problems and built the Burst to Cloud feature in Workload Manager (WXM), Cloudera’s intelligent workload management tool. A solution. More than likely it is.

Data Warehouse

Data Warehouse Reporting Risk Cost-Benefit

The Gartner 2021 Leadership Vision for Data & Analytics Leaders Webinar Q&A

Andrew White

JANUARY 11, 2021

On January 4th I had the pleasure of hosting a webinar. It was titled, The Gartner 2021 Leadership Vision for Data & Analytics Leaders. This was for the Chief Data Officer, or head of data and analytics. How do you think Technology Business Management plays into this strategy? Governance. Architecture.

Data Analytics

Data Analytics Analytics Data-driven Finance

UAB IT helps fuel genomic breakthroughs

CIO Business Intelligence

MARCH 10, 2022

And you can manage the IT and budget in such a way that people are empowered to focus on pancreatic cancer. … Next up: AI and data lake decisions. To that end, UAB’s next step is to tackle big decisions around expanding its AI and data analytics platforms, says Carver, who is not handling the long-term planning alone.

IT Data Lake Digital Transformation Data Governance

Data Governance for Dummies: Your Questions, Answered

Alation

FEBRUARY 17, 2023

This past week, I had the pleasure of hosting Data Governance for Dummies author Jonathan Reichental for a fireside chat, along with Denise Swanson, Data Governance lead at Alation. Can you have proper data management without establishing a formal data governance program?

Data Governance

Data Governance Data Quality Metadata Cost-Benefit

The disruptive potential of open data lakehouse architectures and IBM watsonx.data

IBM Big Data Hub

JUNE 15, 2023

It comprises commodity cloud object storage, open data and open table formats, and high-performance open-source query engines. To help organizations scale AI workloads, we recently announced IBM watsonx.data, a data store built on an open data lakehouse architecture and part of the watsonx AI and data platform.

Data Warehouse

Data Warehouse Data Lake Optimization Data-driven

The essential check list for effective data democratization

CIO Business Intelligence

JANUARY 20, 2023

Of course, cost is a big consideration, says Orlandini, as well as deciding where to host the data, and having it available in a fiscally responsible way. An organization might also question if the data should be maintained on-premises due to security concerns in the public cloud. “They have data swamps,” he says.

Data Lake

Data Lake Data-driven Finance Data Architecture
