Mojo function
block_rank_in_cluster
block_rank_in_cluster() -> SIMD[uint32, 1]
Returns the unique identifier (rank) for the current thread block within its cluster.
Note:
- Only supported on NVIDIA SM90+ GPUs.
- Maps directly to the %cluster_ctarank special register in CUDA PTX.
Returns:
A unique identifier in the range [0, cluster_size-1] where cluster_size
is the total number of thread blocks in the cluster.
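Example:
A minimal sketch of how the returned rank might be used to pick one "leader" block per cluster. The import path `gpu.cluster` and the kernel name `leader_only_work` are assumptions for illustration; check the module path against your Mojo version. The kernel must be launched on an SM90+ GPU with a cluster dimension configured, which is not shown here.

```mojo
# Assumed import path; the module location may differ between Mojo releases.
from gpu.cluster import block_rank_in_cluster


fn leader_only_work():
    # Rank of this thread block within its cluster, in [0, cluster_size-1].
    var rank = block_rank_in_cluster()

    if rank == 0:
        # Hypothetical: only the rank-0 block performs one-time,
        # per-cluster setup here.
        pass
```

Because the rank is unique within a cluster, exactly one block in each cluster takes the `rank == 0` branch.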