author | Durgadoss R <durgadossr@nvidia.com> | 2025-06-12 15:20:39 +0530 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-06-12 15:20:39 +0530 |
commit | 3e5d50f9c61bb266ab17919ab5209c7b08520aff (patch) | |
tree | 86d9120e389c2d9c96698d65112944cca49c9d1d /llvm/lib/Bitcode/Reader/BitcodeReader.cpp | |
parent | 8e4fdff6f02161d878a63900abb35aaa32ff85e9 (diff) | |
[NVPTX] Add cta_group support to TMA G2S intrinsics (#143178)
This patch extends the TMA G2S intrinsics with support for
cta_group::1/2, available from Blackwell onwards.
The existing intrinsics are auto-upgraded with a default
value of '0' for the `cta_group` flag operand.

* lit tests are added for all combinations of the newer variants.
* Negative tests are added to validate the error handling
  when the value of the cta_group flag is out of range.
* The generated PTX is verified with a 12.8 ptxas executable.
Signed-off-by: Durgadoss R <durgadossr@nvidia.com>
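
The auto-upgrade and the range check described in the message can be illustrated with a minimal, standalone sketch. It is not the code from this patch and does not use LLVM's API: `Operands`, `upgradeOperands`, `checkCtaGroup`, and `kMaxCtaGroup` are hypothetical names, and the encoding (0 for no cta_group, 1 for cta_group::1, 2 for cta_group::2) is an assumption inferred from the default of '0' and the two variants named above.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <string>
#include <vector>

// Hypothetical stand-in for the operand list of an intrinsic call.
// This is not an LLVM type; it only models "a list of operand values".
using Operands = std::vector<std::int64_t>;

// Assumed encoding of the flag: 0 = no cta_group, 1 = cta_group::1,
// 2 = cta_group::2. The upper bound is an assumption, not taken from the patch.
constexpr std::int64_t kMaxCtaGroup = 2;

// Models the auto-upgrade: an old-style call keeps all of its operands and
// gains one trailing cta_group flag with the default value 0.
Operands upgradeOperands(const Operands &oldOps, std::size_t oldCount) {
  assert(oldOps.size() == oldCount && "expected the pre-upgrade operand count");
  Operands newOps = oldOps;
  newOps.push_back(0); // default: no cta_group
  return newOps;
}

// Models the negative-test condition: returns an error message when the
// flag is outside the assumed valid range, and no value otherwise.
std::optional<std::string> checkCtaGroup(std::int64_t flag) {
  if (flag < 0 || flag > kMaxCtaGroup)
    return "cta_group flag out of range: " + std::to_string(flag);
  return std::nullopt;
}
```

Under these assumptions, an upgraded call behaves like an old call because the appended flag is 0, and any flag value above 2 is rejected rather than silently accepted.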
Diffstat (limited to 'llvm/lib/Bitcode/Reader/BitcodeReader.cpp')
0 files changed, 0 insertions, 0 deletions