Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spark: Rewrite V2 deletes to V3 DVs #12250

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

nastra
Copy link
Contributor

@nastra nastra commented Feb 13, 2025

No description provided.

@nastra nastra force-pushed the rewrite-position-deletes-to-dvs branch 4 times, most recently from 48f9fb6 to d2910f8 Compare February 13, 2025 11:06
@@ -213,45 +217,61 @@ static class PositionDeletesWriterFactory implements DataWriterFactory {
this.writeProperties = writeProperties;
}

private Table underlyingTable(Table table) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm a little concerned here because it appears this has two negative impacts:

  1. This results in a distributed load of the table metadata
  2. This relies on loading from the metadata file location directly, which we're trying to remove from other actions.

Copy link
Contributor

@amogh-jahagirdar amogh-jahagirdar Feb 14, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah I'm a bit confused why we couldn't just leverage the existing table broadcast? If it's a different type of Table, does broadcasting the underlying table work instead?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've updated this so that we fetch the format version not on the executors but rather earlier

@nastra nastra force-pushed the rewrite-position-deletes-to-dvs branch 2 times, most recently from 37eb0d2 to 95ff7d4 Compare February 14, 2025 14:28
@nastra nastra force-pushed the rewrite-position-deletes-to-dvs branch from 95ff7d4 to 4b0819a Compare February 14, 2025 14:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants