Cutlass
CUDA Templates for Linear Algebra Subroutines and Solvers
Computes the thread offset in (H, W) based on thread ID.
#include <tile_traits_standard.h>
Public Member Functions
CUTLASS_HOST_DEVICE Coord< 4 > operator() () const [inline]
    Basic thread offset function computed from a thread shape.
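The page does not show the body of `operator()`, so the following is only a minimal host-side sketch of one common way to turn a linear thread ID into an (H, W) offset. It is not the CUTLASS implementation. `Coord4`, `ThreadShape`, and `ThreadOffsetSketch` are hypothetical names introduced for illustration. The sketch assumes that threads are laid out row-major over a `kH` x `kW` thread shape, that the coordinate order is (D, H, W, C), and that the thread ID is passed as an argument rather than read from `threadIdx.x`.

```cpp
#include <array>
#include <cassert>

// Hypothetical stand-in for cutlass::Coord<4>; assumed order is (D, H, W, C).
using Coord4 = std::array<int, 4>;

// Hypothetical thread arrangement: kH rows of kW threads each.
template <int kH_, int kW_>
struct ThreadShape {
  static constexpr int kH = kH_;
  static constexpr int kW = kW_;
};

// Illustrative functor (an assumption, not the library's code): maps a linear
// thread ID to an (H, W) offset, assuming a row-major thread layout.
template <typename Threads>
struct ThreadOffsetSketch {
  Coord4 operator()(int thread_id) const {
    // The ID must address a thread inside the kH x kW arrangement.
    assert(thread_id >= 0 && thread_id < Threads::kH * Threads::kW);
    int h = thread_id / Threads::kW;  // row index within the thread shape
    int w = thread_id % Threads::kW;  // column index within the thread shape
    // The D and C components are left at zero in this sketch.
    return Coord4{{0, h, w, 0}};
  }
};
```

Under these assumptions, `ThreadOffsetSketch<ThreadShape<4, 8>>()(10)` yields an H offset of 1 and a W offset of 2. In a device-side version the thread ID would typically come from `threadIdx.x`, and the result may need further scaling by the number of elements each thread accesses.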