Mojo function
tcgen05_alloc
tcgen05_alloc[cta_group: Int32](ptr_tmem_addr: UnsafePointer[UInt32, address_space=AddressSpace(3), alignment=16], num_cols: UInt32)
Allocates tensor memory for use with tcgen05 instructions.
Note: This function is only available on NVIDIA Blackwell GPUs (SM 100+).
Parameters:
- cta_group (Int32): The cooperative thread array (CTA) group ID.
Args:
- ptr_tmem_addr (UnsafePointer): Shared memory pointer to hold tensor memory address.
- num_cols (UInt32): The number of columns to allocate.
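The page does not state which values of num_cols are accepted. Assuming this function lowers to the PTX `tcgen05.alloc` instruction, the PTX ISA requires the column count to be a power of two in the range 32 to 512. The sketch below is host-side Python arithmetic only, not part of the Mojo API: `round_up_tmem_cols` is a hypothetical helper that rounds a requested count up to the next value satisfying that assumed constraint.

```python
# Assumption: these limits come from the PTX `tcgen05.alloc` instruction,
# not from the Mojo documentation for tcgen05_alloc.
MIN_TMEM_COLS = 32
MAX_TMEM_COLS = 512


def round_up_tmem_cols(requested: int) -> int:
    """Return the smallest power of two in [32, 512] that is >= requested.

    Hypothetical helper for illustration; it does not call any GPU API.
    """
    if requested < 1 or requested > MAX_TMEM_COLS:
        raise ValueError(
            f"requested columns must be in [1, {MAX_TMEM_COLS}], got {requested}"
        )
    cols = MIN_TMEM_COLS
    # Double until the allocation covers the request.
    while cols < requested:
        cols *= 2
    return cols
```

For example, a request for 100 columns rounds up to 128, which could then be passed as num_cols if the assumed constraint holds for this function.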