article thumbnail

Use AWS Glue ETL to perform merge, partition evolution, and schema evolution on Apache Iceberg

AWS Big Data

Apache Iceberg manages these schema changes in a backward-compatible way through its innovative metadata table evolution architecture. Due to the security requirements of different organizations, they need to manage fine-grained access control for the analysts through Lake Formation. Iceberg creates snapshots for the table contents.

Snapshot 111
article thumbnail

How to Use Apache Iceberg in CDP’s Open Lakehouse

Cloudera

Time Travel: Reproduce a query as of a given time or snapshot ID, which can be used for historical audits and rollback of erroneous operations, as an example. SDX Integration (Ranger): Manage access to Iceberg tables through Apache Ranger. 8 2001 5967780. 1 2008 7009728. 2 2007 7453215. 3 2006 7141922. 4 2005 7140596.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Clean Harbors’ CIO: Hybrid approach to the cloud is a win-win

CIO Business Intelligence

“Our strategy in taking a hybrid approach has provided the agility we need to do advanced services in the cloud as we go through our digital transformation,” says Gabriel, who joined the company in 2001 and was promoted to executive vice president and CIO of Clean Harbors in 2018.

article thumbnail

Modernize a legacy real-time analytics application with Amazon Managed Service for Apache Flink

AWS Big Data

Organizations with legacy, on-premises, near-real-time analytics solutions typically rely on self-managed relational databases as their data store for analytics workloads. We introduce you to Amazon Managed Service for Apache Flink Studio and get started querying streaming data interactively using Amazon Kinesis Data Streams.