Mojo function
cp_async_bulk_tensor_shared_cluster_global
cp_async_bulk_tensor_shared_cluster_global[dst_type: AnyType, mbr_type: AnyType, rank: Int](dst_mem: UnsafePointer[dst_type, address_space=3], tma_descriptor: UnsafePointer[NoneType], mem_bar: UnsafePointer[mbr_type, address_space=3], coords: IndexList[rank])
Initiates an asynchronous copy operation on the tensor data from global memory to shared memory.
Args:
- dst_mem (
UnsafePointer[dst_type, address_space=3]
): Pointer to destination shared memory. - tma_descriptor (
UnsafePointer[NoneType]
): Pointer to tensor map descriptor. - mem_bar (
UnsafePointer[mbr_type, address_space=3]
): A pointer to shared memory barrier. - coords (
IndexList[rank]
): Tile coordinates.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!