Remove Big Data Remove Broadcasting Remove Data Lake Remove Data Science
article thumbnail

Announcing the 2020 Data Impact Award Winners

Cloudera

During the first-ever virtual broadcast of our annual Data Impact Awards (DIA) ceremony, we had the great pleasure of announcing this year’s finalists and winners. It hosts over 150 big data analytics sandboxes across the region with over 200 users utilizing the sandbox for data discovery.

article thumbnail

Improving Data Processing with Spark 3.0 & Delta Lake

Smart Data Collective

In this blog, we will cover an overview of Delta Lakes , its advantages, and how the above challenges can be overcome by moving to Delta Lake and migrating to Spark 3.0 What is Delta Lake? Using the configuration “spark.sql.shuffle.partitions” for increased parallelism on more evenly distributed data. from Spark 2.4. .