author | Guray Ozen <guray.ozen@gmail.com> | 2023-11-16 14:34:56 +0100 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-11-16 14:34:56 +0100 |
commit | 108380da357e2db513f016d33adede0d58636bea (patch) | |
tree | 7f6c24eda756f96a40f634b01d0dcd69b15b693a /llvm/lib/Bitcode/Reader/BitcodeAnalyzer.cpp | |
parent | 25d0f9fc3bddd50a38eeb44877cfa291c380d408 (diff) | |
download | llvm-108380da357e2db513f016d33adede0d58636bea.zip llvm-108380da357e2db513f016d33adede0d58636bea.tar.gz llvm-108380da357e2db513f016d33adede0d58636bea.tar.bz2 |
[mlir][nvvm] Add `cp.async.bulk.tensor.shared.cluster.global.multicast` (#72429)
This PR introduces the `cp.async.bulk.tensor.shared.cluster.global.multicast`
Op in the NVVM dialect. It uses TMA to load data from global memory into the
shared memory of multiple CTAs in the cluster.
It resolves #72368
Diffstat (limited to 'llvm/lib/Bitcode/Reader/BitcodeAnalyzer.cpp')
0 files changed, 0 insertions, 0 deletions