Mojo module
dual_gemm
Aliases
-
binary_fn_type = fn[DType, Int](SIMD[$0, $1], SIMD[$0, $1]) -> SIMD[$0, $1]
:
Functions
-
config_in_smem
: -
dual_gemm
: -
dual_gemv
: -
dual_gemv_kernel
: -
multistage_dual_gemm
: -
multistage_dual_gemm_kernel
: -
multistage_dual_mma
: -
swilu
: -
swishGLU
: Reference: GLU Variants Improve Transformer by Noam Shazeer https://arxiv.org/pdf/2002.05202v1 The implementation follows cutlass, using one kernel invocation and writing to the destination once.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!