path: root/llvm/lib/Object/ELFObjectFile.cpp
author Durgadoss R <durgadossr@nvidia.com> 2024-01-20 00:12:33 +0530
committer GitHub <noreply@github.com> 2024-01-19 10:42:33 -0800
commit 43531e719636e5960d8592a184e10af885be6869
tree f381cded6023c1b77e275af93bafb6b2e7c67b24 /llvm/lib/Object/ELFObjectFile.cpp
parent 42b160356fe5d3b41bf07c428d0142d3721b1d44
[LLVM][NVPTX] Add cp.async.bulk.commit/wait intrinsics (#78698)
This patch adds NVVM intrinsics and NVPTX codegen for the bulk variants of the async-copy commit/wait instructions. lit tests are added to verify the generated PTX.

PTX Doc link: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-cp-async-bulk-commit-group

Signed-off-by: Durgadoss R <durgadossr@nvidia.com>
Diffstat (limited to 'llvm/lib/Object/ELFObjectFile.cpp')
0 files changed, 0 insertions, 0 deletions