diff options
author | Durgadoss R <durgadossr@nvidia.com> | 2024-01-20 00:12:33 +0530 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-01-19 10:42:33 -0800 |
commit | 43531e719636e5960d8592a184e10af885be6869 (patch) | |
tree | f381cded6023c1b77e275af93bafb6b2e7c67b24 /llvm/lib/Object/ELFObjectFile.cpp | |
parent | 42b160356fe5d3b41bf07c428d0142d3721b1d44 (diff) | |
download | llvm-43531e719636e5960d8592a184e10af885be6869.zip llvm-43531e719636e5960d8592a184e10af885be6869.tar.gz llvm-43531e719636e5960d8592a184e10af885be6869.tar.bz2 |
[LLVM][NVPTX] Add cp.async.bulk.commit/wait intrinsics (#78698)
This patch adds NVVM intrinsics and NVPTX codegen for the bulk variants
of the async-copy commit/wait instructions.
lit tests are added to verify the generated PTX.
PTX Doc link:
https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-cp-async-bulk-commit-group
Signed-off-by: Durgadoss R <durgadossr@nvidia.com>
Diffstat (limited to 'llvm/lib/Object/ELFObjectFile.cpp')
0 files changed, 0 insertions, 0 deletions