diff options
author | Durgadoss R <durgadossr@nvidia.com> | 2025-05-19 14:24:34 +0530 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-05-19 14:24:34 +0530 |
commit | 2483831617a17155afdadc593d227231f83edc05 (patch) | |
tree | a1fd9a372037c0506d451fab3155e251875d904a /clang/lib/AST/ByteCode/Compiler.cpp | |
parent | b060022103f551d8ca1dad84122ef73927c86512 (diff) | |
download | llvm-2483831617a17155afdadc593d227231f83edc05.zip llvm-2483831617a17155afdadc593d227231f83edc05.tar.gz llvm-2483831617a17155afdadc593d227231f83edc05.tar.bz2 |
[MLIR][NVVM] Extend TMA Bulk Copy Op (#140232)
This patch extends the non-tensor TMA Bulk Copy Op
(from shared_cta to global) with an optional
byte mask operand. This mask helps selectively
copy a particular byte to the destination.
* lit tests are added to verify the lowering to the intrinsics.
Signed-off-by: Durgadoss R <durgadossr@nvidia.com>
Diffstat (limited to 'clang/lib/AST/ByteCode/Compiler.cpp')
0 files changed, 0 insertions, 0 deletions