path: root/llvm/lib/Bitcode/Reader/BitcodeReader.cpp
author: Durgadoss R <durgadossr@nvidia.com> 2025-06-12 15:20:39 +0530
committer: GitHub <noreply@github.com> 2025-06-12 15:20:39 +0530
commit: 3e5d50f9c61bb266ab17919ab5209c7b08520aff
tree: 86d9120e389c2d9c96698d65112944cca49c9d1d /llvm/lib/Bitcode/Reader/BitcodeReader.cpp
parent: 8e4fdff6f02161d878a63900abb35aaa32ff85e9
[NVPTX] Add cta_group support to TMA G2S intrinsics (#143178)

This patch extends the TMA G2S intrinsics with the support for
cta_group::1/2 available from Blackwell onwards. The existing
intrinsics are auto-upgraded with a default value of '0' for the
`cta_group` flag operand.

* lit tests are added for all combinations of the newer variants.
* Negative tests are added to validate the error-handling when the
  value of the cta_group flag falls out-of-range.
* The generated PTX is verified with a 12.8 ptxas executable.

Signed-off-by: Durgadoss R <durgadossr@nvidia.com>

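The auto-upgrade and range check described in the message can be illustrated with a small stand-alone sketch. This is not the code from the patch: it models an intrinsic call as a plain list of integer operands, and the names `upgradeOperands`, `isValidCtaGroup`, `kCtaGroupNone`, and `kCtaGroupMax` are hypothetical. It also assumes that the new flag is appended as the last operand and that the accepted values are 0 (no cta_group), 1, and 2; the message states only the default of '0' and the cta_group::1/2 variants.

```cpp
#include <cstdint>
#include <vector>

// Assumed flag encoding: 0 = no cta_group qualifier, 1 = cta_group::1,
// 2 = cta_group::2. Only the default of 0 is stated in the commit message.
constexpr std::int64_t kCtaGroupNone = 0;
constexpr std::int64_t kCtaGroupMax = 2;

// Hypothetical range check: accepts only flag values 0, 1, and 2.
bool isValidCtaGroup(std::int64_t flag) {
  return flag >= kCtaGroupNone && flag <= kCtaGroupMax;
}

// Hypothetical model of the auto-upgrade: an old-style operand list gains a
// trailing cta_group flag set to the default value 0 (operand position is an
// assumption of this sketch).
std::vector<std::int64_t> upgradeOperands(std::vector<std::int64_t> oldOps) {
  oldOps.push_back(kCtaGroupNone);
  return oldOps;
}
```

Under these assumptions, an old three-operand call becomes a four-operand call whose last operand is 0, and a flag value such as 3 is rejected.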
Diffstat (limited to 'llvm/lib/Bitcode/Reader/BitcodeReader.cpp')
0 files changed, 0 insertions, 0 deletions