|  | CUTLASS
    CUDA Templates for Linear Algebra Subroutines and Solvers | 
Templates implementing loading of tiles from pitch-linear rank=2 tensors. More...
#include "cutlass/cutlass.h"#include "cutlass/tensor_ref.h"#include "cutlass/layout/matrix.h"#include "cutlass/layout/pitch_linear.h"#include "regular_tile_iterator.h"

Go to the source code of this file.
| Classes | |
| class | cutlass::transform::threadblock::RegularTileIterator< Shape_, Element_, layout::PitchLinear, AdvanceRank, ThreadMap_, Alignment > | 
| Regular tile iterator specialized for pitch-linear.  More... | |
| class | cutlass::transform::threadblock::RegularTileIterator< Shape_, Element_, layout::RowMajor, AdvanceRank, ThreadMap_, Alignment > | 
| Regular tile iterator specialized for pitch-linear.  More... | |
| class | cutlass::transform::threadblock::RegularTileIterator< Shape_, Element_, layout::ColumnMajor, AdvanceRank, ThreadMap_, Alignment > | 
| Regular tile iterator specialized for pitch-linear.  More... | |
| Namespaces | |
| cutlass | |
| cutlass::transform | |
| cutlass::transform::threadblock | |
This iterator uses masks to guard out-of-bounds accesses and visits the last "residue" tile first, with the objective of minimizing predicate mask updates during steady-state operation.
A precomputed "Params" object minimizes the amount of state that must be stored in registers, and integer addition is used to advance the pointer through memory.
 1.8.11
 1.8.11