2002 and Metadata - Data Leaders Brief

2002

Metadata

Use AWS Glue ETL to perform merge, partition evolution, and schema evolution on Apache Iceberg

AWS Big Data

MARCH 4, 2024

Apache Iceberg manages these schema changes in a backward-compatible way through its innovative metadata table evolution architecture. With Lake Formation, you can manage fine-grained access control for your data lake data on Amazon S3 and its metadata in the Data Catalog. Iceberg maintains the table state in metadata files.

Snapshot

Snapshot Data Lake Metadata Recreation/Entertainment

Speed up queries with the cost-based optimizer in Amazon Athena

AWS Big Data

NOVEMBER 17, 2023

Starting today, the Athena SQL engine uses a cost-based optimizer (CBO), a new feature that uses table and column statistics stored in the AWS Glue Data Catalog as part of the table’s metadata. By using these statistics, CBO improves query run plans and boosts the performance of queries run in Athena.

Optimization

Optimization Statistics Metadata Data Lake

Join 52,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

Trending Sources

Themes and Conferences per Pacoid, Episode 10

Domino Data Lab

JUNE 2, 2019

It also represents part of the current focus for Project Jupyter : adding support for collaboration, enhanced security, projects as top-level entities, data registry, metadata management, and telemetry about usage. my answer was almost immediate: Daniel Kahneman.

Data-driven

Data-driven Data Science Machine Learning Modeling

Webinars

How to Optimize the Developer Experience for Monumental Impact

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Leading the Development of Profitable and Sustainable Products

MORE WEBINARS

A CDO’s Guide to the Data Catalog

Alation

APRIL 12, 2022

In 2002, Capital One became the first company to appoint a Chief Data Officer (CDO). Through serving as a centralized conduit for discovering and requesting access to data, a data catalog provides CDOs and their data governance teams with information and metadata to determine which people should see and access what data.

Data Quality

Data Quality Data Governance Metrics Data Strategy

Use AWS Glue ETL to perform merge, partition evolution, and schema evolution on Apache Iceberg

Speed up queries with the cost-based optimizer in Amazon Athena

Webinars

Trending Sources

Themes and Conferences per Pacoid, Episode 10

Webinars

A CDO’s Guide to the Data Catalog

Stay Connected