aboutsummaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorThomas Schwinge <thomas@codesourcery.com>2025-04-19 20:57:56 +0000
committerSandra Loosemore <sloosemore@baylibre.com>2025-05-15 20:25:47 +0000
commit98586c2f620d2b6789257c29b4bc77567b1152c6 (patch)
tree9795dab6f08e1855c327e2ed9dd5df09879bfe82
parentb4b83d34766a410848d33e8bee24b3d1fb7f1ec1 (diff)
downloadgcc-98586c2f620d2b6789257c29b4bc77567b1152c6.zip
gcc-98586c2f620d2b6789257c29b4bc77567b1152c6.tar.gz
gcc-98586c2f620d2b6789257c29b4bc77567b1152c6.tar.bz2
libgomp: Document OpenMP 'pinned' memory
libgomp/ * libgomp.texi (AMD Radeon, nvptx): Document OpenMP 'pinned' memory.
-rw-r--r--libgomp/libgomp.texi8
1 files changed, 8 insertions, 0 deletions
diff --git a/libgomp/libgomp.texi b/libgomp/libgomp.texi
index 57460c4..2dc2ef7 100644
--- a/libgomp/libgomp.texi
+++ b/libgomp/libgomp.texi
@@ -6889,6 +6889,9 @@ The implementation remark:
does not support XNACK, consider using @code{ROCR_VISIBLE_DEVICES} to
enable only the APU. If not supported, all AMD GPU devices are removed
from the list of available devices (``host fallback'').
+@item OpenMP @emph{pinned} memory (@code{omp_atk_pinned},
+ @code{ompx_pinned_mem_alloc}, for example)
+ is allocated via @code{mmap}, @code{mlock}.
@item The available stack size can be changed using the @code{GCN_STACK_SIZE}
environment variable; the default is 32 kiB per thread.
@item Low-latency memory (@code{omp_low_lat_mem_space}) is supported when the
@@ -7053,6 +7056,11 @@ The implementation remark:
@uref{https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#um-requirements}}
otherwise, all nvptx device are removed from the list of available
devices (``host fallback'').
+@item OpenMP @emph{pinned} memory (@code{omp_atk_pinned},
+ @code{ompx_pinned_mem_alloc}, for example)
+ is allocated via @code{cuMemHostAlloc} (CUDA Driver API).
+ This potentially helps optimization of host <-> device data
+ transfers.
@item The default per-warp stack size is 128 kiB; see also @code{-msoft-stack}
in the GCC manual.
@item Low-latency memory (@code{omp_low_lat_mem_space}) is supported when the