diff options
author | Srinivasa Ravi <srinivasar@nvidia.com> | 2025-03-11 09:27:21 +0530 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-03-11 09:27:21 +0530 |
commit | 9a5a8c9a8072d9af9cea087e506ea213bd89c0f5 (patch) | |
tree | 8a360b2bbff9431dd7d414a9dfca60d020d52624 /clang/lib/CodeGen/ModuleBuilder.cpp | |
parent | 494fe0b4145810d4e4e7b6003cabd194f76cb5d4 (diff) | |
download | llvm-9a5a8c9a8072d9af9cea087e506ea213bd89c0f5.zip llvm-9a5a8c9a8072d9af9cea087e506ea213bd89c0f5.tar.gz llvm-9a5a8c9a8072d9af9cea087e506ea213bd89c0f5.tar.bz2 |
[NVPTX] Add intrinsics for st.bulk instruction (#128856)
Adds NVVM intrinsics and NVPTX codegen for the `st.bulk` instruction
introduced in ptx8.6 for sm_100. Tests added in
`CodeGen/NVPTX/st_bulk.ll` and verified through ptxas 12.8.0.
PTX Spec Reference:
https://docs.nvidia.com/cuda/parallel-thread-execution/#data-movement-and-conversion-instructions-st-bulk
Diffstat (limited to 'clang/lib/CodeGen/ModuleBuilder.cpp')
0 files changed, 0 insertions, 0 deletions