Use Apache Iceberg in a data lake to support incremental data processing
AWS Big Data
MARCH 2, 2023
Iceberg takes advantage of the rich metadata it captures at write time to support techniques such as scan planning, partitioning, pruning, and the use of column-level stats such as min/max values to skip data files that don't contain matching records. It first uses the manifest list, which acts as an index of the manifest files.
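The two-level pruning described above can be illustrated with a small, self-contained sketch. This is not Iceberg's implementation or API: the `Manifest` and `DataFile` classes, the `plan_scan` function, and the file names are hypothetical, and the sketch assumes a single integer column whose value range is summarized both per manifest (as the manifest list would) and per data file (as min/max stats would).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DataFile:
    # Hypothetical stand-in for a data file entry with min/max column stats.
    path: str
    min_value: int
    max_value: int


@dataclass
class Manifest:
    # Hypothetical stand-in for a manifest list entry: a value-range summary
    # for the manifest, plus the data files the manifest tracks.
    path: str
    lower_bound: int
    upper_bound: int
    files: List[DataFile]


def overlaps(lo: int, hi: int, q_lo: int, q_hi: int) -> bool:
    """Return True if the closed range [lo, hi] intersects [q_lo, q_hi]."""
    return lo <= q_hi and hi >= q_lo


def plan_scan(manifest_list: List[Manifest], q_lo: int, q_hi: int) -> List[str]:
    """Select data files that may hold rows with q_lo <= value <= q_hi."""
    selected = []
    for manifest in manifest_list:
        # Level 1: skip a whole manifest using the summary in the manifest list.
        if not overlaps(manifest.lower_bound, manifest.upper_bound, q_lo, q_hi):
            continue
        # Level 2: skip individual data files using their min/max stats.
        for data_file in manifest.files:
            if overlaps(data_file.min_value, data_file.max_value, q_lo, q_hi):
                selected.append(data_file.path)
    return selected


manifest_list = [
    Manifest("m1.avro", 0, 99,
             [DataFile("a.parquet", 0, 49), DataFile("b.parquet", 50, 99)]),
    Manifest("m2.avro", 100, 199,
             [DataFile("c.parquet", 100, 149), DataFile("d.parquet", 150, 199)]),
]

# Filter: value BETWEEN 120 AND 130
print(plan_scan(manifest_list, 120, 130))  # ['c.parquet']
```

In this sketch, `m1.avro` is never opened because its range cannot match the filter, and `d.parquet` is skipped by its min/max stats, so only one of the four files would be read.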