Mojo module

encodings

Implementations of quantization encodings.

Aliases

  • K_SCALE_SIZE = IntLiteral(12): Size of superblock scales and mins, in bytes.
  • QK_K = IntLiteral(256): Number of quantized elements in a superblock.
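
The two aliases can be sanity-checked with a little arithmetic. The following is a minimal Python sketch, not part of the Mojo module. It assumes a Q4_K-style superblock as used in ggml: 8 sub-blocks of 32 elements, one 6-bit scale and one 6-bit min per sub-block, two float16 superblock factors, and 4-bit quantized values. It also treats QK_K as an element count. The names SUB_BLOCKS, BITS_PER_FIELD, packed_bits, block_bytes, and bits_per_weight are illustrative only.

```python
# Values copied from the alias list above.
K_SCALE_SIZE = 12  # bytes of packed superblock scales and mins
QK_K = 256         # treated here as elements per superblock (assumption)

# Assumed Q4_K-style layout (from ggml, not stated on this page):
# 8 sub-blocks, each with a 6-bit scale and a 6-bit min.
SUB_BLOCKS = 8
BITS_PER_FIELD = 6

# 8 scales + 8 mins at 6 bits each should fill K_SCALE_SIZE bytes exactly.
packed_bits = SUB_BLOCKS * 2 * BITS_PER_FIELD
assert packed_bits == K_SCALE_SIZE * 8

# Superblock size: two float16 factors, the packed scales and mins,
# and QK_K quantized values at 4 bits each.
block_bytes = 2 + 2 + K_SCALE_SIZE + QK_K * 4 // 8
bits_per_weight = block_bytes * 8 / QK_K

print(block_bytes, bits_per_weight)  # 144 4.5
```

Under these assumptions a superblock takes 144 bytes for 256 weights, or 4.5 bits per weight. Other encodings in the same family may pack their scales differently, so the check applies only to the assumed layout.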

Structs