Mojo module
info
Contains information about GPU architectures and their capabilities.
This module provides detailed specifications for NVIDIA and AMD GPU models, covering compute capability, memory sizes, thread organization, and performance characteristics.
Aliases
- A10 = Info(StringSlice("A10"), Vendor(__init__[__mlir_type.!pop.int_literal](2)), StringSlice("cuda"), StringSlice("ampere"), StringSlice("nvptx-short-ptr=true"), __init__[__mlir_type.!pop.float_literal](8.5999999999999996), StringSlice("sm_86"), 72, 32, 1536, 32, 64, 2048, 32, 102400, 65536, 256, StringSlice("warp"), 255, 65536, 16, 128, 4, 1024)
- A100 = Info(StringSlice("A100"), Vendor(__init__[__mlir_type.!pop.int_literal](2)), StringSlice("cuda"), StringSlice("ampere"), StringSlice("nvptx-short-ptr=true"), __init__[__mlir_type.!pop.float_literal](8), StringSlice("sm_80"), 108, 32, 2048, 32, 64, 2048, 32, 167936, 65536, 256, StringSlice("warp"), 255, 65536, 32, 128, 4, 1024)
- B100 = Info(StringSlice("B100"), Vendor(__init__[__mlir_type.!pop.int_literal](2)), StringSlice("cuda"), StringSlice("blackwell"), StringSlice("nvptx-short-ptr=true"), __init__[__mlir_type.!pop.float_literal](10), StringSlice("sm_100a"), 132, 32, -1, 32, 64, 1536, 32, 59392, 65536, 256, StringSlice("warp"), 255, 65536, 32, 128, 4, 1024)
- B200 = Info(StringSlice("B100"), Vendor(__init__[__mlir_type.!pop.int_literal](2)), StringSlice("cuda"), StringSlice("blackwell"), StringSlice("nvptx-short-ptr=true"), __init__[__mlir_type.!pop.float_literal](10), StringSlice("sm_100a"), 132, 32, -1, 32, 64, 1536, 32, 59392, 65536, 256, StringSlice("warp"), 255, 65536, 32, 128, 4, 1024)
- DEFAULT_GPU = from_name[stdlib::collections::string::string_slice::StringSlice[::Bool()
- DEFAULT_GPU_ARCH = _accelerator_arch()
- DEFAULT_GPU_TARGET = from_name[stdlib::collections::string::string_slice::StringSlice[::Bool().target()
- H100 = Info(StringSlice("H100"), Vendor(__init__[__mlir_type.!pop.int_literal](2)), StringSlice("cuda"), StringSlice("hopper"), StringSlice("nvptx-short-ptr=true"), __init__[__mlir_type.!pop.float_literal](9), StringSlice("sm_90a"), 132, 32, 2048, 32, 64, 2048, 32, 233472, 65536, 256, StringSlice("warp"), 255, 65536, 32, 128, 4, 1024)
- L4 = Info(StringSlice("L4"), Vendor(__init__[__mlir_type.!pop.int_literal](2)), StringSlice("cuda"), StringSlice("ada"), StringSlice("nvptx-short-ptr=true"), __init__[__mlir_type.!pop.float_literal](8.9000000000000004), StringSlice("sm_89"), 58, 32, 1536, 32, 64, 2048, 32, 102400, 65536, 256, StringSlice("warp"), 255, 65536, 24, 128, 4, 1024)
- MI300X = Info(StringSlice("MI300X"), Vendor(__init__[__mlir_type.!pop.int_literal](1)), StringSlice("hip"), StringSlice("gfx942"), StringSlice("amdhsa-code-object-version=5"), __init__[__mlir_type.!pop.float_literal](9.4000000000000003), StringSlice("CDNA3"), 304, 64, 2048, 64, 32, 2048, 2, 65536, 65536, 256, StringSlice("warp"), 255, 65536, 2, 128, 4, 1024)
- NoGPU = Info(StringSlice("NoGPU"), Vendor(__init__[__mlir_type.!pop.int_literal](0)), StringSlice("none"), StringSlice("no_gpu"), StringSlice(""), __init__[__mlir_type.!pop.int_literal](0), StringSlice(""), 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, StringSlice("none"), 0, 0, 0, 0, 0, 0)
- OrinNano = Info(StringSlice("Orin Nano"), Vendor(__init__[__mlir_type.!pop.int_literal](2)), StringSlice("cuda"), StringSlice("ampere"), StringSlice("nvptx-short-ptr=true"), __init__[__mlir_type.!pop.float_literal](8.6999999999999993), StringSlice("sm_87"), 8, 32, 1536, 32, 64, 2048, 32, 167936, 65536, 256, StringSlice("warp"), 255, 65536, 16, 128, 4, 1024)
- RTX5090 = Info(StringSlice("RTX5090"), Vendor(__init__[__mlir_type.!pop.int_literal](2)), StringSlice("cuda"), StringSlice("blackwell"), StringSlice("nvptx-short-ptr=true"), __init__[__mlir_type.!pop.float_literal](12), StringSlice("sm_120a"), 170, 32, -1, 32, 64, 1536, 32, 59392, 65536, 256, StringSlice("warp"), 255, 65536, 32, 128, 4, 1024)
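The aliases above are compile-time constants, so their values can be read directly in Mojo code. The following is a minimal sketch, not a confirmed usage pattern: it assumes the module is importable as `gpu.host.info` and that `Info` exposes fields named `name`, `sm_count`, and `threads_per_multiprocessor`. This page lists only positional values, so those field names are assumptions.

```mojo
# Sketch only: the import path and the Info field names used here
# (name, sm_count, threads_per_multiprocessor) are assumptions that
# this page does not confirm.
from gpu.host.info import A100, H100


def main():
    # Upper bound on resident threads for the whole device:
    # number of SMs multiplied by the maximum threads per SM.
    var a100_threads = A100.sm_count * A100.threads_per_multiprocessor
    var h100_threads = H100.sm_count * H100.threads_per_multiprocessor

    print(A100.name, "max resident threads:", a100_threads)
    print(H100.name, "max resident threads:", h100_threads)
```

Reading the values from the alias rather than hard-coding them keeps sizing logic tied to one source of truth per GPU model.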
Structs
Functions
- is_cpu: Checks if the target is a CPU (compile-time version).
- is_gpu: Checks if the target is a GPU (compile-time version).
- is_valid_target: Checks if the target is valid (compile-time version).
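Because these are compile-time versions, the target is presumably passed as a parameter in square brackets rather than as a runtime argument. The sketch below illustrates that calling shape under stated assumptions: the import path `gpu.host.info`, a string-typed `target` parameter, and the target names `"cpu"` and `"gpu"` are not given on this page and should be checked against the function signatures.

```mojo
# Sketch only: the import path, the parameter form is_gpu[target](),
# and the target strings "cpu" and "gpu" are assumptions.
from gpu.host.info import is_cpu, is_gpu, is_valid_target


def main():
    # Each call is evaluated for a target known at compile time.
    print("cpu is a CPU target:", is_cpu["cpu"]())
    print("gpu is a GPU target:", is_gpu["gpu"]())
    print("gpu is a valid target:", is_valid_target["gpu"]())
```

Checks of this kind are typically used to select a code path per target while compiling, so that only the matching branch ends up in the generated code.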