Fix clang-format conflicts #1787

Closed
wants to merge 16 commits into from

Conversation

rhornung67
Member

Summary

MrBurmark and others added 16 commits December 30, 2024 13:02
Use a macro to generate the various aliases
This iteration mapping assumes that the number of
iterations equals the size of the range and performs
no bounds checking.
This is useful when mapping GPU blocks, since we often
launch exactly the number we need and do not need to check
whether we are in range. In that case it can give a ~5% speedup
over direct.
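To make the difference concrete, here is a minimal host-side sketch of the idea described in the commit message above. It is not the code from this PR and does not use the library's API: `launch_blocks`, `for_each_direct`, `for_each_unchecked`, and the `doubled_*` helpers are hypothetical names, and a plain loop stands in for a GPU block launch. The direct variant guards every iteration with a range check, while the unchecked variant assumes the launch size equals the range size and skips the check.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a GPU launch: calls body(block) once per "block".
template <typename Body>
void launch_blocks(std::size_t num_blocks, Body body) {
  for (std::size_t block = 0; block < num_blocks; ++block) {
    body(block);
  }
}

// Direct-style mapping (assumed shape): the launch may be larger than the
// range, so every block checks that its index is in range before running.
template <typename Body>
void for_each_direct(std::size_t num_blocks, std::size_t len, Body body) {
  launch_blocks(num_blocks, [&](std::size_t block) {
    if (block < len) {
      body(block);
    }
  });
}

// Unchecked-style mapping (assumed shape): exactly len blocks are launched,
// so the range check is omitted. Launching any other count would be a bug.
template <typename Body>
void for_each_unchecked(std::size_t len, Body body) {
  launch_blocks(len, [&](std::size_t block) { body(block); });
}

// Demo helper: doubles each element using the direct mapping with a
// launch that may be larger than the input.
inline std::vector<int> doubled_direct(const std::vector<int>& in,
                                       std::size_t num_blocks) {
  std::vector<int> out(in.size(), 0);
  for_each_direct(num_blocks, in.size(),
                  [&](std::size_t i) { out[i] = 2 * in[i]; });
  return out;
}

// Demo helper: doubles each element using the unchecked mapping.
inline std::vector<int> doubled_unchecked(const std::vector<int>& in) {
  std::vector<int> out(in.size(), 0);
  for_each_unchecked(in.size(),
                     [&](std::size_t i) { out[i] = 2 * in[i]; });
  return out;
}
```

Both helpers produce the same result when the range is fully covered; the unchecked form saves the per-block comparison, and correctness depends entirely on the caller launching exactly as many blocks as there are iterations.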
There were a number missing for cuda/hip
This adds testing for unchecked policies with cuda and hip
in kernel and launch.
@rhornung67 rhornung67 requested review from MrBurmark and artv3 January 31, 2025 20:32
@rhornung67 rhornung67 closed this Jan 31, 2025
@rhornung67 rhornung67 deleted the task/rhornung67/fix-format-conflicts branch January 31, 2025 23:07
2 participants