aboutsummaryrefslogtreecommitdiff
path: root/clang/lib/AST/ByteCode/Compiler.cpp
diff options
context:
space:
mode:
authorDurgadoss R <durgadossr@nvidia.com>2025-05-19 14:24:34 +0530
committerGitHub <noreply@github.com>2025-05-19 14:24:34 +0530
commit2483831617a17155afdadc593d227231f83edc05 (patch)
treea1fd9a372037c0506d451fab3155e251875d904a /clang/lib/AST/ByteCode/Compiler.cpp
parentb060022103f551d8ca1dad84122ef73927c86512 (diff)
downloadllvm-2483831617a17155afdadc593d227231f83edc05.zip
llvm-2483831617a17155afdadc593d227231f83edc05.tar.gz
llvm-2483831617a17155afdadc593d227231f83edc05.tar.bz2
[MLIR][NVVM] Extend TMA Bulk Copy Op (#140232)
This patch extends the non-tensor TMA Bulk Copy Op (from shared_cta to global) with an optional byte mask operand. This mask helps selectively copy a particular byte to the destination. * lit tests are added to verify the lowering to the intrinsics. Signed-off-by: Durgadoss R <durgadossr@nvidia.com>
Diffstat (limited to 'clang/lib/AST/ByteCode/Compiler.cpp')
0 files changed, 0 insertions, 0 deletions