Mojo module
mma
This module includes utilities for working with the warp-matrix-matrix-multiplication (wmma) instructions.
Structs
Functions
-
ld_matrix
: Performs warp sync copy from shared memory to registers. Loads in a fashion that can be used directly by tensor core MMA instructions. -
mma
: Performs warp sync Tensor Core based Matrix-multiply and accumulate(MMA) operation. -
wgmma_async
: Performs warp group async Matrix-multiply and accumulate(WGMMA) operation. -
wgmma_commit_group_sync
: -
wgmma_fence_aligned
: -
wgmma_wait_group_sync
:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!