Mojo function
async_copy_wait_group
async_copy_wait_group(n: Int32)
Waits for the completion of n
most recently committed cp.async-groups.
This function blocks execution until the specified number of previously committed cp.async-groups have completed their memory transfers.
Notes:
- Only supported on NVIDIA GPUs.
- Maps to the cp.async.wait.group PTX instruction.
- Provides fine-grained control over asynchronous transfer synchronization.
- Can be used to implement a pipeline of asynchronous transfers.
Args:
- n (
Int32
): The number of pending cp.async-groups to wait for. Must be > 0.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!