Improving Data Processing with Spark 3.0 & Delta Lake
Smart Data Collective
AUGUST 5, 2021
Developed at Databricks, “Delta Lake is an open-source data storage layer that runs on top of an existing data lake and is fully compatible with Apache Spark APIs. Along with support for ACID transactions and scalable metadata handling, Delta Lake can also unify streaming and batch data processing.”