Fix clang-format merge conflicts #1788

Closed
wants to merge 16 commits into from

rhornung67
Member

Summary

MrBurmark and others added 16 commits December 30, 2024 13:02
Use a macro to generate the various aliases
This iteration mapping assumes that the number of
iterations equals the size of the range
and performs no range checking.
This is useful when mapping GPU blocks, as we often
launch exactly the number we need and so don't need to check
whether we are in range. This can give a ~5% speedup vs. direct
in this case.
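
The difference between a checked ("direct") mapping and the unchecked mapping described in this commit can be sketched on the host in plain C++. This is only an illustration under stated assumptions: `run_direct`, `run_unchecked`, and `mappings_agree` are hypothetical names, the loop over `block` stands in for GPU blocks, and none of this is RAJA's actual policy API or implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical "direct" mapping: one iteration per launched block,
// guarded by a range check so surplus blocks do nothing.
template <typename Body>
void run_direct(std::size_t launched, std::size_t range_size, Body body) {
  for (std::size_t block = 0; block < launched; ++block) {
    if (block < range_size) {  // per-block range check
      body(block);
    }
  }
}

// Hypothetical "unchecked" mapping: assumes launched == range_size,
// so the per-block range check is dropped entirely.
template <typename Body>
void run_unchecked(std::size_t launched, std::size_t range_size, Body body) {
  // Debug-only statement of the assumption (an addition for this sketch);
  // it compiles away under NDEBUG, leaving no check at all.
  assert(launched == range_size);
  (void)range_size;
  for (std::size_t block = 0; block < launched; ++block) {
    body(block);  // no range check
  }
}

// Returns true when both mappings produce the same result for an
// exact-size launch of n blocks over a range of n elements.
bool mappings_agree(std::size_t n) {
  std::vector<int> a(n, 0);
  std::vector<int> b(n, 0);
  run_direct(n, n, [&](std::size_t i) { a[i] = static_cast<int>(i) * 2; });
  run_unchecked(n, n, [&](std::size_t i) { b[i] = static_cast<int>(i) * 2; });
  return a == b;
}
```

When the launch size matches the range exactly, the two mappings do the same work, and the unchecked one simply skips a branch per block. Launching more blocks than the range holds is only safe with the checked version.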
There were a number missing for cuda/hip
This adds testing for unchecked policies with cuda and hip
in kernel and launch.
@rhornung67 rhornung67 closed this Jan 31, 2025
2 participants