What does this code do?
tiled_mma.accumulate_ = GMMA::ScaleOut::One;
For example, in "cutlass/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp"
I have no idea when it is set to One or Zero; it seems random.
It scales the accumulator input by 1 or 0 before the MMA,
so basically either C += A@B (ScaleOut::One) or C = A@B (ScaleOut::Zero).
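To make the C += A@B versus C = A@B distinction concrete, here is a minimal host-side sketch. It is not CUTLASS code: `ScaleOut`, `ToyMma`, and `toy_mainloop` are hypothetical stand-ins that only mimic the flag's effect on a scalar accumulator. The assumed pattern is the common one where the flag is Zero for the first MMA, so the accumulator does not need to be cleared beforehand, and One for every MMA after that.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for cute::GMMA::ScaleOut (not the CUTLASS definition).
enum class ScaleOut { Zero = 0, One = 1 };

// Hypothetical stand-in for a tiled MMA object that carries the accumulate flag.
struct ToyMma {
    ScaleOut accumulate_ = ScaleOut::One;

    // One "MMA" step: c = a * b + scale * c, where scale comes from accumulate_.
    void operator()(float a, float b, float& c) const {
        const float prev = (accumulate_ == ScaleOut::One) ? c : 0.0f;
        c = a * b + prev;
    }
};

// Dot product over K steps. The accumulator starts with a junk value on
// purpose: the first step uses ScaleOut::Zero, so the junk is discarded.
template <std::size_t K>
float toy_mainloop(const std::array<float, K>& a, const std::array<float, K>& b) {
    ToyMma mma;
    float acc = 12345.0f;              // stands in for an uncleared register
    mma.accumulate_ = ScaleOut::Zero;  // first step: acc = a*b
    for (std::size_t k = 0; k < K; ++k) {
        mma(a[k], b[k], acc);
        mma.accumulate_ = ScaleOut::One;  // later steps: acc += a*b
    }
    return acc;
}
```

Calling `toy_mainloop` on two small arrays returns their plain dot product, regardless of the junk value the accumulator started with. If the flag were left at One for the first step, that junk value would be added into the result.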
Why is the second tiled_mma.accumulate_ = GMMA::ScaleOut::One; on line 713 necessary?
https://github.com/NVIDIA/cutlass/blob/main/include/cutlass/gemm/collective/sm90_mma_tma_gmma_rs_warpspecialized.hpp
I think this would be enough,
tiled_mma.accumulate_ = GMMA::ScaleOut::Zero; tiled_mma.accumulate_ = GMMA::ScaleOut::One;
but they are using it this way
tiled_mma.accumulate_ = GMMA::ScaleOut::Zero; tiled_mma.accumulate_ = GMMA::ScaleOut::One; tiled_mma.accumulate_ = GMMA::ScaleOut::One;
Why is the last one needed?