rocket-tools/riscv-gnu-toolchain/llvm.git

author	Abhilash Majumder <30946547+abhilash1910@users.noreply.github.com>	2025-08-07 16:19:21 +0530
committer	GitHub <noreply@github.com>	2025-08-07 16:19:21 +0530
commit	fee6e539d0a052ca1f20adf55521856bfc5d5b26 (patch)
tree	98eef015cb7b5e2258402714ac8508bdd300800f /clang/lib/Frontend/CompilerInvocation.cpp
parent	4784585747423a8ed6e3acbe3c8fbe97ba362cc5 (diff)

[NVPTX] Add prefetch tensormap variant (#146203)

[NVPTX] Add Prefetch tensormap intrinsics

This PR adds prefetch intrinsics with the relevant tensormap_space.

* Lit tests are added as part of prefetch.ll
* The generated PTX is verified with a 12.3 ptxas executable.
* Added docs for these intrinsics in NVPTXUsage.rst.

For more information, refer to the PTX ISA for prefetch intrinsic: [Prefetch Tensormap](https://docs.nvidia.com/cuda/parallel-thread-execution/#data-movement-and-conversion-instructions-prefetch-prefetchu)

@durga4github @schwarzschild-radius

Diffstat (limited to 'clang/lib/Frontend/CompilerInvocation.cpp')

0 files changed, 0 insertions, 0 deletions

