author | carlobertolli <carlo.bertolli@amd.com> | 2024-01-08 14:17:28 -0600 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-01-08 14:17:28 -0600 |
commit | 22a73e7c4616e0405db85598c049a7ca70cca7cc (patch) | |
tree | 733cc744bea8aa27f6197e5f5fb296d742498ffc /llvm/unittests/Support/CommandLineTest.cpp | |
parent | 87f67c2599410786ea3600d388fd1d2df13e60af (diff) | |
[OpenMP][libomptarget] Enable automatic unified shared memory execution (zero-copy) on MI300A. (#75999)
This patch enables applications that did not request OpenMP
unified_shared_memory to run with the same zero-copy behavior: mapped
memory does not result in extra memory allocations or memory copies,
and CPU-allocated memory is accessed from the device. This behavior is
called "automatic zero-copy". It is triggered only when the runtime
detects all of the following: it is running on an MI300A, the user did
not select unified_shared_memory in their program, and XNACK (unified
memory support) is enabled in the current GPU configuration.
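The three conditions above can be sketched as a single predicate. This is a minimal illustration only: the struct, its field names, and the function name are hypothetical and are not taken from the libomptarget sources or from this patch.

```cpp
// Hypothetical sketch of the decision described in the commit message.
// None of these names come from libomptarget; they are illustrative only.
struct DeviceState {
  bool IsMI300A;         // the runtime detected that it is running on an MI300A
  bool UserRequestedUSM; // the program selected unified_shared_memory
  bool XnackEnabled;     // XNACK is enabled in the current GPU configuration
};

// Returns true only when all three conditions hold: MI300A detected,
// unified_shared_memory not requested by the user, and XNACK enabled.
inline bool shouldUseAutoZeroCopy(const DeviceState &S) {
  return S.IsMI300A && !S.UserRequestedUSM && S.XnackEnabled;
}
```

The predicate models only the automatic path. A program that explicitly requests unified_shared_memory returns false here because its zero-copy behavior comes from that request rather than from automatic detection.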
This patch is still missing support for global variables, which will be
provided in a subsequent patch.
Co-authored-by: Thorsten Blass <thorsten.blass@amd.com>
Diffstat (limited to 'llvm/unittests/Support/CommandLineTest.cpp')
0 files changed, 0 insertions, 0 deletions