Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
wmma_gemm_multiply_add.h File Reference

Implements warp-level matrix multiply-accumulate operation using CUDA WMMA API. More...

#include "cutlass/wmma_matrix.h"

Go to the source code of this file.