Spark: Test metadata tables with format-version=3 #12135

Merged
nastra merged 1 commit into apache:main from spark-metadata-tables on Feb 4, 2025

nastra
Contributor

@nastra nastra commented Jan 30, 2025

No description provided.

@github-actions github-actions bot added the spark label Jan 30, 2025
@@ -84,6 +84,9 @@ public InternalRow next() {
           rowValues.add(deleteFile.contentOffset());
         } else if (fieldId == MetadataColumns.CONTENT_SIZE_IN_BYTES_COLUMN_ID) {
           rowValues.add(ScanTaskUtil.contentSizeInBytes(deleteFile));
+        } else if (fieldId == MetadataColumns.DELETE_FILE_ROW_FIELD_ID) {
+          // DVs don't track the row that was deleted
+          rowValues.add(null);
Contributor Author

This fixes an issue when reading from the `.position_deletes` metadata table when the underlying table is V3. By default, the schema includes `MetadataColumns.DELETE_FILE_ROW_FIELD_ID`, which we don't track for DVs (deletion vectors), so we return null for that column here.
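
For readers outside the codebase, the field-ID dispatch described above can be sketched in isolation. This is a minimal, self-contained illustration and not the actual Iceberg reader: the class name, the method signature, and the three field-ID values are hypothetical placeholders (the real constants live in `MetadataColumns`), and the exception on unknown IDs is an assumption made only to keep the sketch strict.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the reader touched by this PR; not Iceberg's actual class.
final class PositionDeletesRowSketch {
  // Placeholder field IDs for illustration only; the real values are defined in MetadataColumns.
  static final int CONTENT_OFFSET_ID = 1;
  static final int CONTENT_SIZE_IN_BYTES_ID = 2;
  static final int DELETE_FILE_ROW_ID = 3;

  // Builds one output row for a DV-backed delete file, adding one value per projected field ID.
  static List<Object> project(
      List<Integer> projectedFieldIds, long contentOffset, long contentSizeInBytes) {
    List<Object> rowValues = new ArrayList<>();
    for (int fieldId : projectedFieldIds) {
      if (fieldId == CONTENT_OFFSET_ID) {
        rowValues.add(contentOffset);
      } else if (fieldId == CONTENT_SIZE_IN_BYTES_ID) {
        rowValues.add(contentSizeInBytes);
      } else if (fieldId == DELETE_FILE_ROW_ID) {
        // DVs don't track the row that was deleted, so the column is filled with null.
        rowValues.add(null);
      } else {
        // Assumption for this sketch only: reject IDs the sketch does not model.
        throw new IllegalArgumentException("Unexpected field id: " + fieldId);
      }
    }
    return rowValues;
  }
}
```

With the default projection including the row column, the sketch yields a row whose width matches the number of projected fields, with null in the row position.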

@nastra nastra force-pushed the spark-metadata-tables branch from 95e7d3a to 654a1ed Compare February 4, 2025 14:28
@nastra nastra merged commit e406e3d into apache:main Feb 4, 2025
31 checks passed
@nastra nastra deleted the spark-metadata-tables branch February 4, 2025 15:33