author: Sameer Sahasrabuddhe <sameer.sahasrabuddhe@amd.com> 2026-02-11 10:56:58 +0530
committer: GitHub <noreply@github.com> 2026-02-11 05:26:58 +0000
commit: b02b395a1e75030708efc099f449f96436a90141
tree: 7a17cebb6b9f404c4556f7505f0e2b8ed88ab26a /libc/test/src/sys/prctl/linux
parent: fbad717b9a09af5abafbeba859646d1ab05213d5
[AMDGPU] Asynchronous loads from global/buffer to LDS on pre-GFX12 (#180466)

The existing "LDS DMA" builtins/intrinsics copy data from global/buffer pointer to LDS. These are now augmented with their ".async" version, where the compiler does not automatically track completion. The completion is now tracked using explicit mark/wait intrinsics, which must be inserted by the user.

This makes it possible to write programs with efficient waits in software pipeline loops. The program can now wait for only the oldest outstanding operations to finish, while launching more operations for later use.

This change only contains the new names of the builtins/intrinsics, which continue to behave exactly like their non-async counterparts. A later change will implement the actual mark/wait semantics in SIInsertWaitcnts.

This is part of a stack split out from #173259:
- #180467
- #180466

Fixes: SWDEV-521121
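The "wait for only the oldest outstanding operations" pattern described above can be illustrated with a small host-side model. This is only a sketch under stated assumptions: `AsyncCopyModel`, `issue_and_mark`, `wait_until_at_most`, and `run_pipeline` are hypothetical names invented for illustration, not the builtins or intrinsics added by this change, and the model assumes copies complete in issue order.

```cpp
#include <cassert>
#include <cstddef>
#include <deque>
#include <vector>

// Hypothetical host-side model of explicit mark/wait tracking.
// None of these names are real AMDGPU builtins or intrinsics.
struct AsyncCopyModel {
    std::deque<int> in_flight;  // marked copies still pending, oldest first
    std::vector<int> landed;    // copies known to have completed

    // Models "launch an async copy, then mark it" as one outstanding group.
    void issue_and_mark(int tile) { in_flight.push_back(tile); }

    // Models "wait until at most max_outstanding marked groups are pending".
    // Assumes in-order completion, so only the oldest groups are retired.
    void wait_until_at_most(std::size_t max_outstanding) {
        while (in_flight.size() > max_outstanding) {
            landed.push_back(in_flight.front());
            in_flight.pop_front();
        }
    }
};

// Software-pipelined loop that keeps up to `depth` copies in flight.
// Returns the order in which tiles were consumed.
std::vector<int> run_pipeline(int num_tiles, int depth) {
    if (depth < 1) depth = 1;  // at least the current tile must be issued
    AsyncCopyModel q;
    std::vector<int> consumed;
    int next = 0;

    // Prologue: fill the pipeline before consuming anything.
    for (; next < num_tiles && next < depth; ++next) q.issue_and_mark(next);

    for (int i = 0; i < num_tiles; ++i) {
        // Wait only for tile i: every copy issued after it may stay pending.
        // In steady state this is depth - 1; it shrinks to 0 in the epilogue.
        q.wait_until_at_most(static_cast<std::size_t>(next - i - 1));
        assert(!q.landed.empty() && q.landed.back() == i);
        consumed.push_back(q.landed.back());

        // Launch a copy for later use while newer copies are still in flight.
        if (next < num_tiles) q.issue_and_mark(next++);
    }
    return consumed;
}
```

Calling `run_pipeline(5, 2)` in this model consumes the tiles in issue order while leaving one newer copy pending at each steady-state wait, which is the overlap that a single "wait for everything" would give up.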
Diffstat (limited to 'libc/test/src/sys/prctl/linux')
0 files changed, 0 insertions, 0 deletions