Skip to main content
Log in

Mojo module

swizzle

Structs

Functions

  • eval_composed:
  • make_ldmatrix_swizzle: Make a swizzle to avoid bank conflict for ldmatrix.
  • make_swizzle: 2D swizzle to avoid bank conflict. Access access_size elements in num_rows x row_size in shared memory tile. num_rows should be for minimun access pattern. E.g. store 16x8 mma result to a 64 x 64 tile. The minimum access pattern is 8x8 sub-matrix, num_rows = 8, row_size = 64. We should swizzle the layout to avoid bank conflict for loading in the data in future. The load is most likely 16B, i.e. access_size = 4 for fp32 and 8 for bf16.
  • shiftl:
  • shiftr: