Mojo function

generic_flare_mla_prefill_ragged_paged_plan

generic_flare_mla_prefill_ragged_paged_plan[target: StringSlice[StaticConstantOrigin]](
    input_row_offsets: NDBuffer[DType.uint32, 1, origin, shape, strides],
    kv_collection: PagedKVCacheCollection[dtype_, kv_params_, page_size],
    layer_idx: UInt32,
    buffer_token_size: UInt32,
    buffer_row_offsets: NDBuffer[DType.uint32, 2, origin, shape, strides],
    cache_offsets: NDBuffer[DType.uint32, 2, origin, shape, strides],
    buffer_lengths: NDBuffer[DType.int32, 1, origin, shape, strides],
    context: DeviceContextPtr
)
