Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
gemm.h File Reference

Implements an efficient, software-pipelined GEMM.

#include <cuda.h>
#include "cutlass/coord.h"
#include "cutlass/util/platform.h"


Classes

struct  cutlass::gemm::Launch< Gemm, WithLaunchBounds >
 Launches the GEMM kernel with or without launch bounds, as selected by WithLaunchBounds.
 
struct  cutlass::gemm::Launch< Gemm, false >
 Partial specialization that launches the GEMM kernel without launch bounds.
 
struct  cutlass::gemm::Gemm< GemmTraits_ >
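
The two `Launch` entries select a kernel entry point at compile time through a `bool` template parameter. The host-only C++ sketch below mirrors that dispatch pattern without any CUDA dependency. `ToyGemm`, `ToyLaunch`, `run_with_bounds`, and `run_without_bounds` are hypothetical stand-ins, not CUTLASS names, and the default value of the `bool` parameter is an assumption.

```cpp
#include <string>

// Hypothetical stand-in for a GEMM type; only the nested Params type matters here.
struct ToyGemm {
  struct Params {
    int m, n, k;
  };
};

// Stand-ins for the two kernel entry points. Real kernels would be __global__
// functions launched with <<<grid, block>>>; these only report which path ran.
template <typename Gemm>
std::string run_with_bounds(typename Gemm::Params const &) {
  return "with launch bounds";
}

template <typename Gemm>
std::string run_without_bounds(typename Gemm::Params const &) {
  return "without launch bounds";
}

// Primary template: chosen when WithLaunchBounds is true (assumed default).
template <typename Gemm, bool WithLaunchBounds = true>
struct ToyLaunch {
  std::string path;
  explicit ToyLaunch(typename Gemm::Params const &params)
      : path(run_with_bounds<Gemm>(params)) {}
};

// Partial specialization: chosen when WithLaunchBounds is false.
template <typename Gemm>
struct ToyLaunch<Gemm, false> {
  std::string path;
  explicit ToyLaunch(typename Gemm::Params const &params)
      : path(run_without_bounds<Gemm>(params)) {}
};
```

Constructing `ToyLaunch<ToyGemm, false>` picks the specialization, while `ToyLaunch<ToyGemm>` and `ToyLaunch<ToyGemm, true>` both resolve to the primary template. The choice is made by the compiler, so no runtime branch is involved.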
 

Namespaces

 cutlass
 
 cutlass::gemm
 

Functions

template<typename Gemm_ >
__global__ __launch_bounds__ (Gemm_::kThreads) void cutlass::gemm::gemm_kernel (typename Gemm_::Params params)
 GEMM kernel with launch bounds specified.
 
template<typename Gemm_ >
__global__ void cutlass::gemm::gemm_kernel_nolb (typename Gemm_::Params params)
 GEMM kernel without launch bounds specified.
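
Both entry points are templated on `Gemm_` and take a `typename Gemm_::Params` argument. The launch-bounds variant also reads the compile-time constant `Gemm_::kThreads`. The host-only sketch below illustrates that contract under stated assumptions: `ToyGemmImpl`, its `multiply_add` method, the `Params` fields, and the value 256 are hypothetical and not taken from CUTLASS.

```cpp
// Hypothetical type satisfying the contract the entry points rely on:
// a nested Params type and a compile-time thread count kThreads.
struct ToyGemmImpl {
  static int const kThreads = 256;  // illustrative value only

  struct Params {
    int m, n, k;
  };

  explicit ToyGemmImpl(Params const &p) : params(p) {}

  // Stand-in for the real work: returns the multiply-add count of an
  // m-by-n-by-k product instead of computing the product itself.
  long long multiply_add() const {
    return 1LL * params.m * params.n * params.k;
  }

  Params params;
};

// Host analog of a kernel entry point. `typename` is required because
// Gemm_::Params is a type that depends on the template parameter.
template <typename Gemm_>
long long toy_gemm_entry(typename Gemm_::Params params) {
  // In CUDA, kThreads would be passed to __launch_bounds__ on the kernel;
  // here it is only checked at compile time.
  static_assert(Gemm_::kThreads > 0, "kThreads must be a positive constant");
  Gemm_ gemm(params);
  return gemm.multiply_add();
}
```

Because `kThreads` is a static constant of the type, it is available wherever a compile-time value is needed, which is what allows it to appear inside a `__launch_bounds__` qualifier on a real kernel.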