Mojo module

matmul_sm90

Functions

cluster_size:
consumer_main_loop:
cpasync_wgmma_kernel:
find_K_alignment_upto_16B: Find alignment among 1B, 2B, 4B, 16B based on the row's bytes.
hopper_matmul_tma_wgmma_kernel:
promote_to_cuda_cores:
tma_wgmma_warp_specialized_gemm_kernel:
tma_wgmma_warp_specialized_gemm_kernel_persistent:
warp_specialize_gemm_with_multicasting:
warp_specialized_gemm_output:
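
The one-line description of find_K_alignment_upto_16B can be illustrated with a minimal sketch. This is a Python illustration, not the module's Mojo source. It assumes "alignment" means the largest of the listed candidates (16B, 4B, 2B, 1B) that evenly divides the row's size in bytes. The name `largest_alignment_upto_16b` and the error handling are hypothetical, and the real function may behave differently, for example for 8-byte multiples.

```python
def largest_alignment_upto_16b(row_bytes: int) -> int:
    """Return the largest of 16, 4, 2, 1 that evenly divides row_bytes.

    Hypothetical helper: the candidate set mirrors the documented
    "1B, 2B, 4B, 16B" list; it is an assumption, not the Mojo source.
    """
    if row_bytes <= 0:
        raise ValueError("row_bytes must be a positive byte count")
    # Try the widest alignment first so the first match is the largest.
    for candidate in (16, 4, 2, 1):
        if row_bytes % candidate == 0:
            return candidate
    return 1  # Unreachable: every positive integer is divisible by 1.
```

Under these assumptions, a row of 64 bytes yields 16, while a row of 24 bytes yields 4 because 8B is not among the listed candidates.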
