Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support of Apache Uniffle for remote shuffle service #796

Open
wants to merge 3 commits into
base: master
Choose a base branch
from

Conversation

zuston
Copy link
Contributor

@zuston zuston commented Jan 26, 2025

Which issue does this PR close?

Closes #.

Rationale for this change

Uniffle is a high performance, general purpose remote shuffle service for distributed computing engines. It provides the ability to push shuffle data into centralized storage service, changing the shuffle style from "local file pull-like style" to "remote block push-like style". It brings in several advantages like supporting disaggregated storage deployment, super large shuffle jobs, and high elasticity. Currently it supports Apache Spark, Apache Hadoop MapReduce and Apache Tez.

Based on the above advantages, uniffle has been used by several commercial companies. After intergrating with blaze, users' spark jobs will benefit greatly from storage-computation separation and vectorized execution.

What changes are included in this PR?

Following the blaze's rss shuffle manager design to implement the writer + reader

Are there any user-facing changes?

Yes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant