Hi, I'm looking at For example, I have a batched matrix of shape Is this possible? cc @hwu36
Replies: 1 comment 4 replies
Yes. See https://github.com/NVIDIA/cutlass/blob/main/include/cutlass/gemm/kernel/gemm_universal.h#L506

threadblock_tile_offset.k() is essentially the batch id.
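To make that concrete, here is a minimal host-side sketch in plain C++, not CUTLASS code. It assumes that in batched mode the kernel reaches a batch's operand by advancing the base pointer by the batch index times a batch stride, and that batches are stored densely one after another. `TileCoord`, `batch_pointer`, and `batch_sum` are hypothetical names made up for this illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for the tile coordinate a threadblock receives.
// m and n select the output tile; in batched mode k carries the batch index.
struct TileCoord {
  int m, n, k;
};

// Hypothetical helper: move a base pointer to the start of the batch
// selected by tile.k, i.e. base + batch index * batch stride.
template <typename T>
T* batch_pointer(T* base, const TileCoord& tile, std::int64_t batch_stride) {
  return base + tile.k * batch_stride;
}

// Illustration only: sum the rows x cols elements of the batch selected
// by tile.k, assuming batches are packed back to back (stride = rows * cols).
inline float batch_sum(float const* base, const TileCoord& tile, int rows, int cols) {
  const std::int64_t batch_stride = static_cast<std::int64_t>(rows) * cols;
  float const* p = batch_pointer(base, tile, batch_stride);
  float sum = 0.0f;
  for (int i = 0; i < rows * cols; ++i) {
    sum += p[i];
  }
  return sum;
}
```

With two 2x3 batches stored contiguously, a tile with k = 1 lands on element 6 of the buffer, so anything that needs to know which batch it is working on can read it from the k coordinate.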