Mojo function
cp_async_bulk_tensor_global_shared_cta
cp_async_bulk_tensor_global_shared_cta[src_type: AnyType, rank: Int, /, eviction_policy: CacheEviction = 0](src_mem: UnsafePointer[src_type, address_space=3], tma_descriptor: UnsafePointer[NoneType], coords: IndexList[rank])
Initiates an asynchronous copy operation on the tensor data from shared cta memory to global memory.
Args:
- src_mem (
UnsafePointer[src_type, address_space=3]
): Pointer to source shared memory. - tma_descriptor (
UnsafePointer[NoneType]
): Pointer to tensor map descriptor. - coords (
IndexList[rank]
): Tile coordinates.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!