aboutsummaryrefslogtreecommitdiff
path: root/llvm/unittests/Support/CommandLineTest.cpp
diff options
context:
space:
mode:
authorcarlobertolli <carlo.bertolli@amd.com>2024-01-08 14:17:28 -0600
committerGitHub <noreply@github.com>2024-01-08 14:17:28 -0600
commit22a73e7c4616e0405db85598c049a7ca70cca7cc (patch)
tree733cc744bea8aa27f6197e5f5fb296d742498ffc /llvm/unittests/Support/CommandLineTest.cpp
parent87f67c2599410786ea3600d388fd1d2df13e60af (diff)
downloadllvm-22a73e7c4616e0405db85598c049a7ca70cca7cc.zip
llvm-22a73e7c4616e0405db85598c049a7ca70cca7cc.tar.gz
llvm-22a73e7c4616e0405db85598c049a7ca70cca7cc.tar.bz2
[OpenMP][libomptarget] Enable automatic unified shared memory executi… (#75999)
…on (zero-copy) on MI300A. This patch enables applications that did not request OpenMP unified_shared_memory to run with the same zero-copy behavior, where mapped memory does not result in extra memory allocations and memory copies, but CPU-allocated memory is accessed from the device. The name for this behavior is "automatic zero-copy" and it relies on detecting: that the runtime is running on a MI300A, that the user did not select unified_shared_memory in their program, and that XNACK (unified memory support) is enabled in the current GPU configuration. If all these conditions are met, then automatic zero-copy is triggered. This patch is still missing support for global variables, which will be provided in a subsequent patch. Co-authored-by: Thorsten Blass <thorsten.blass@amd.com>
Diffstat (limited to 'llvm/unittests/Support/CommandLineTest.cpp')
0 files changed, 0 insertions, 0 deletions