Data Leaders Brief

Spark on Kubernetes – Gang Scheduling with YuniKorn

Cloudera

MAY 5, 2021

Apache YuniKorn (Incubating) has just released 0.10.0 (release announcement). As part of this release, a new feature called Gang Scheduling has become available. With Gang Scheduling, scheduling Spark jobs on Kubernetes becomes more efficient. What is Gang Scheduling?

Metadata

Metadata Machine Learning Big Data IT

Optimizing Cloudera Data Engineering Autoscaling Performance

Cloudera

SEPTEMBER 2, 2021

That’s why turning to traditional resource scheduling is not sufficient. When building CDE, we integrated with Apache YuniKorn, which offers rich scheduling capabilities on Kubernetes. ... fixed sized clusters).

Optimization

Optimization Testing Cost-Benefit Measurement

Webinars

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Manufacturing Sustainability Surge: Your Guide to Data-Driven Energy Optimization & Decarbonization

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

MORE WEBINARS

Cloudera Data Engineering 2021 Year End Review

Cloudera

DECEMBER 21, 2021

Since the release of Cloudera Data Engineering (CDE) more than a year ago, our number one goal has been operationalizing Spark pipelines at scale with first-class tooling designed to streamline automation and observability. We also introduced Apache Airflow on Kubernetes as the next-generation orchestration service.

Snapshot

Snapshot Data-driven Optimization Management

Introducing Amazon EMR on EKS job submission with Spark Operator and spark-submit

AWS Big Data

JUNE 6, 2023

Amazon EMR on EKS provides a deployment option for Amazon EMR that allows organizations to run open-source big data frameworks on Amazon Elastic Kubernetes Service (Amazon EKS). With EMR on EKS, Spark applications run on the Amazon EMR runtime for Apache Spark. The EMR runtime provides up to 5.37 times better performance and 76.8%

Optimization

Optimization Data Lake Cost-Benefit Management

Apache Spark on Kubernetes: How Apache YuniKorn (Incubating) helps

Cloudera

OCTOBER 14, 2020

Why choose K8s for Apache Spark? Apache Spark unifies batch processing, real-time processing, stream analytics, machine learning, and interactive query in one platform. Support for multiple Spark versions, Python versions, and version-controlled containers on the shared K8s clusters for both faster iteration and stable production.

Machine Learning

Machine Learning Management Big Data Optimization
