Filter more, pay less with the latest Cloudera Data Warehouse runtime!
Cloudera
MARCH 24, 2021
To enable data pruning, modern columnar formats such as ORC and Parquet maintain indexes, Bloom filters, and statistics that let the reader determine whether a group of data needs to be read at all before any rows are returned to the execution engine.

Map 1 <- Map 4 (BROADCAST_EDGE), Map 5 (BROADCAST_EDGE)
Reducer 2 <- Map 1 (SIMPLE_EDGE)
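As a rough illustration of the pruning check described above, the following Python sketch keeps min/max statistics and a small Bloom filter per group of rows and uses them to decide which groups an equality predicate could match. It is a toy model under stated assumptions, not the ORC or Parquet reader API: `BloomFilter`, `RowGroup`, and `groups_to_read` are hypothetical names invented for this example, and the filter sizing is arbitrary.

```python
import hashlib
from dataclasses import dataclass, field


class BloomFilter:
    """Tiny Bloom filter: may give false positives, never false negatives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits      # arbitrary size chosen for the sketch
        self.num_hashes = num_hashes
        self.bits = 0                 # a Python int used as a bit set

    def _positions(self, value):
        # Derive several bit positions per value by salting one hash function.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{value}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, value):
        for pos in self._positions(value):
            self.bits |= 1 << pos

    def might_contain(self, value):
        return all((self.bits >> pos) & 1 for pos in self._positions(value))


@dataclass
class RowGroup:
    """Hypothetical stand-in for a stripe or row group in a columnar file."""
    values: list
    min_value: int = field(init=False)
    max_value: int = field(init=False)
    bloom: BloomFilter = field(init=False)

    def __post_init__(self):
        # Statistics are computed once, when the group is written.
        self.min_value = min(self.values)
        self.max_value = max(self.values)
        self.bloom = BloomFilter()
        for v in self.values:
            self.bloom.add(v)


def groups_to_read(groups, wanted):
    """Return indexes of groups that could contain `wanted` (column = wanted)."""
    selected = []
    for i, group in enumerate(groups):
        if wanted < group.min_value or wanted > group.max_value:
            continue  # min/max statistics rule this group out
        if not group.bloom.might_contain(wanted):
            continue  # Bloom filter rules this group out
        selected.append(i)  # group must actually be read and filtered
    return selected


groups = [RowGroup([1, 5, 9]), RowGroup([10, 20, 30]), RowGroup([100, 200])]
print(groups_to_read(groups, 20))
```

Only groups that survive both checks are read from storage. Because a Bloom filter can return false positives, the engine still has to apply the predicate to the rows it does read.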