Mojo module
conv
Structs
-
ConvDirectNHWC
: Implement the outer loops for direct convolution. Collapse N, HO, WO into one dimension n_ho_wo. Tile n_ho_wo, C, and F. The tile factor for C and F are chosen by a heuristic prioritizing C. n_ho_wo is tiled by micro kernel's height. -
Naive2dConvolution
: Struct wrapper for naive 2d convolution implementation.
Functions
-
accumulate_wo_tile_1d
: Update one row in the output for a given (c, f) tile. -
accumulate_wo_tile_2d
: -
accumulate_wo_tile_3d
: -
check_cudnn_error
: -
conv1d_update_wo_tile
: -
conv2d_gpu_naive_nhwc_rscf
: -
conv2d_update_wo_tile
: -
conv3d_update_wo_tile
: -
conv_cudnn
: -
conv_gpu
: -
conv_nhwc_direct
: -
conv_shape
: Compute the output shape of aconv
operation, and assert the inputs are compatible. -
pack_conv_filter_shape
: Compute the output shape of convolution filter packing. -
pack_filter
: This packs the filter form RSCF to FRSCf. Use the default micro kernel size for dynamic shapes. -
pack_filter_shape
: Compute the shape of packed filter. The packed layout is FRSCf. shape_ref should be allocated with size 5 outside this kernel. -
pack_filter_shape_impl
: Compute the shape of packed filter. The packed layout is FRSCf. shape_ref should be allocated with size 5 outside this kernel.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!