Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix: Ensure deterministic file ordering in iter_package_files #269

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

lilaliu-stripe
Copy link

@lilaliu-stripe lilaliu-stripe commented Jan 15, 2025

cc @lorencarvalho

Summary

Fixed non-deterministic file ordering in the iter_package_files function by adding explicit sorting of package files. This change ensures consistent behavior across builds and prevents issues caused by unordered file iteration.

Changes Made

  • Applied sorted() to file listings in both Python versions (<3.11 and >=3.11).
  • Guaranteed that files are processed in a consistent, lexicographic order.

Why This Fix?

  • Prevents non-deterministic builds caused by unordered file iteration.
  • Improves build reproducibility and enables better caching in CI/CD pipelines.
  • Simplifies code by enforcing consistent ordering within the function, reducing the need for manual sorting in callers.

Testing

  • Rebuilt the project multiple times and verified that output files are now consistent across builds.
  • Compared generated artifacts (e.g., .json files) to confirm deterministic ordering.

Impact

  • No functional changes to how files are processed, only improves determinism.
  • Safer and more reliable builds, especially in CI environments.

Closes #265

### Summary
Fixed non-deterministic file ordering in the `iter_package_files` function by adding explicit sorting of package files. This change ensures consistent behavior across builds and prevents issues caused by unordered file iteration.

### Changes Made
- Applied `sorted()` to file listings in both Python versions (<3.11 and >=3.11).
- Guaranteed that files are processed in a consistent, lexicographic order.

### Why This Fix?
- Prevents non-deterministic builds caused by unordered file iteration.
- Improves build reproducibility and enables better caching in CI/CD pipelines.
- Simplifies code by enforcing consistent ordering within the function, reducing the need for manual sorting in callers.

### Testing
- Rebuilt the project multiple times and verified that output files are now consistent across builds.
- Compared generated artifacts (e.g., `.json` files) to confirm deterministic ordering.

### Impact
- No functional changes to how files are processed, only improves determinism.
- Safer and more reliable builds, especially in CI environments.

Closes linkedin#265
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Produced zipapp is not reproducible due to _bootstrap file ordering
1 participant