Improving Data Processing with Spark 3.0 & Delta Lake
Smart Data Collective
AUGUST 5, 2021
What is Delta Lake? Developed at Databricks, Delta Lake is an open-source data storage layer that runs on top of an existing data lake and is fully compatible with Apache Spark APIs. Delta Lake uses versioned Parquet files to store data in the cloud. Advantages of using Delta Lake.
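To make "versioned Parquet files" concrete, here is a toy sketch in plain Python of how an ordered commit log can give a folder of data files a version history. It is loosely modeled on the idea of a transaction log and is not Delta Lake's API: the `_log` directory, the `commit` and `files_at` helpers, the `op`/`file` fields, and the file names are all assumptions made up for illustration, and no real Parquet data is written.

```python
import json
import os
import tempfile


def commit(table_dir, actions):
    """Write one commit file to the toy log and return its version number."""
    log_dir = os.path.join(table_dir, "_log")  # hypothetical log location
    os.makedirs(log_dir, exist_ok=True)
    version = len(os.listdir(log_dir))  # next version = number of commits so far
    # Zero-padded names keep lexical order equal to version order.
    path = os.path.join(log_dir, f"{version:020d}.json")
    with open(path, "w") as f:
        json.dump(actions, f)
    return version


def files_at(table_dir, version):
    """Replay commits 0..version and return the data files live at that version."""
    log_dir = os.path.join(table_dir, "_log")
    live = set()
    for name in sorted(os.listdir(log_dir))[: version + 1]:
        with open(os.path.join(log_dir, name)) as f:
            for action in json.load(f):
                if action["op"] == "add":
                    live.add(action["file"])
                elif action["op"] == "remove":
                    live.discard(action["file"])
    return sorted(live)


# Version 0 adds one data file; version 1 replaces it with a rewritten file.
table = tempfile.mkdtemp()
v0 = commit(table, [{"op": "add", "file": "part-0.parquet"}])
v1 = commit(table, [
    {"op": "remove", "file": "part-0.parquet"},
    {"op": "add", "file": "part-1.parquet"},
])

print(files_at(table, v0))
print(files_at(table, v1))
```

Because old data files are only marked as removed in the log and never overwritten, an earlier version of the table can still be reconstructed by replaying fewer commits.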