aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Transforms/Utils/LoopVersioning.cpp
diff options
context:
space:
mode:
authorGuray Ozen <guray.ozen@gmail.com>2023-06-16 10:03:30 +0200
committerGuray Ozen <guray.ozen@gmail.com>2023-06-16 13:35:14 +0200
commit58950d4addd6d1dd920801b32cc75ddc8b9f6c3a (patch)
treefc320f936c190d643b9d200aafad7f9a0188fad7 /llvm/lib/Transforms/Utils/LoopVersioning.cpp
parentda7892f7295f31b46486418e2abf15334db96cbb (diff)
downloadllvm-58950d4addd6d1dd920801b32cc75ddc8b9f6c3a.zip
llvm-58950d4addd6d1dd920801b32cc75ddc8b9f6c3a.tar.gz
llvm-58950d4addd6d1dd920801b32cc75ddc8b9f6c3a.tar.bz2
[mlir][nvvm] Implement `mbarrier.init`
NV GPUs provides split arrive/wait barriers that one can syncronize a subgroup of threads in CTA. It is particularly important for Hopper GPUs and allows tracking engines like TMA. See for more details: https://docs.nvidia.com/cuda/parallel-thread-execution/#parallel-synchronization-and-communication-instructions-mbarrier This initial implementation sets the foundation for future enhancements and additions. Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D151334
Diffstat (limited to 'llvm/lib/Transforms/Utils/LoopVersioning.cpp')
0 files changed, 0 insertions, 0 deletions