diff options
author | Abhilash Majumder <30946547+abhilash1910@users.noreply.github.com> | 2025-08-07 16:19:21 +0530 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-08-07 16:19:21 +0530 |
commit | fee6e539d0a052ca1f20adf55521856bfc5d5b26 (patch) | |
tree | 98eef015cb7b5e2258402714ac8508bdd300800f /clang/lib/Frontend/CompilerInvocation.cpp | |
parent | 4784585747423a8ed6e3acbe3c8fbe97ba362cc5 (diff) | |
download | llvm-fee6e539d0a052ca1f20adf55521856bfc5d5b26.zip llvm-fee6e539d0a052ca1f20adf55521856bfc5d5b26.tar.gz llvm-fee6e539d0a052ca1f20adf55521856bfc5d5b26.tar.bz2 |
[NVPTX] Add prefetch tensormap variant (#146203)
[NVPTX] Add Prefetch tensormap intrinsics
This PR adds prefetch intrinsics with the relevant tensormap_space.
* Lit tests are added as part of prefetch.ll
* The generated PTX is verified with a 12.3 ptxas executable.
* Added docs for these intrinsics in NVPTXUsage.rst.
For more information, refer to the PTX ISA for prefetch intrinsic :
[Prefetch
Tensormap](https://docs.nvidia.com/cuda/parallel-thread-execution/#data-movement-and-conversion-instructions-prefetch-prefetchu)
@durga4github @schwarzschild-radius
Diffstat (limited to 'clang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions