Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
|
Implements a software-pipelined efficient GEMM. More...
Go to the source code of this file.
Classes | |
struct | cutlass::gemm::Launch< Gemm, WithLaunchBounds > |
Partial specialization for launching the GEMM kernel with or without launch bounds. More... | |
struct | cutlass::gemm::Launch< Gemm, false > |
Partial specialization for launching the GEMM kernel with or without launch bounds. More... | |
struct | cutlass::gemm::Gemm< GemmTraits_ > |
Namespaces | |
cutlass | |
cutlass::gemm | |
Functions | |
template<typename Gemm_ > | |
__global__ | cutlass::gemm::__launch_bounds__ (Gemm_::kThreads) void gemm_kernel(typename Gemm_ |
GEMM kernel with launch bounds specified. More... | |
template<typename Gemm_ > | |
__global__ void | cutlass::gemm::gemm_kernel_nolb (typename Gemm_::Params params) |
GEMM kernel without launch bounds specified. More... | |