diff options
| author | Nicolas Vasilache <nicolas.vasilache@gmail.com> | 2023-04-18 04:28:45 -0700 |
|---|---|---|
| committer | Nicolas Vasilache <nicolas.vasilache@gmail.com> | 2023-04-18 05:00:53 -0700 |
| commit | 95cb9862a8dcd3b8e9cdf0a27b5eafb910c9e983 (patch) | |
| tree | d64d974a3cf60abeed14380ccb3aceb88429f187 /llvm/lib/CodeGen/BasicBlockSectionsProfileReader.cpp | |
| parent | 5fdf4d53819e613f5c5be0ca0ec12444c17812d7 (diff) | |
| download | llvm-95cb9862a8dcd3b8e9cdf0a27b5eafb910c9e983.zip llvm-95cb9862a8dcd3b8e9cdf0a27b5eafb910c9e983.tar.gz llvm-95cb9862a8dcd3b8e9cdf0a27b5eafb910c9e983.tar.bz2 | |
[mlir][NVGPU] Support cache all (.ca) in nvgpu.device_async_copy
This patch adds support for cache all (.ca) in conversion from nvgpu-to-nvvm for inline asm `cp.async`.
For sizes other than 16 bytes cp.async cache global is not allowed and cache all is required to generate a valid ptx.
Differential revision: https://reviews.llvm.org/D148604
Authored-by: Manish Gupta <manigupta@google.com>
Diffstat (limited to 'llvm/lib/CodeGen/BasicBlockSectionsProfileReader.cpp')
0 files changed, 0 insertions, 0 deletions
