author: Kevin Sala <kevin.sala@bsc.es> 2022-12-16 15:26:46 +0100
committer: Kevin Sala <kevin.sala@bsc.es> 2022-12-17 00:01:24 +0100
commit: 6bbf9c0cca6f7eab35a524bbba9d6508f764b82d
tree: c60e856625c902456582d128104320ddb8595e13 /llvm/lib/CodeGen/MachineBasicBlock.cpp
parent: 7b979417216f271fe36df50ed6a3480d5957a127
[OpenMP][libomptarget] Add AMDGPU NextGen plugin with asynchronous behavior

This commit adds the AMDGPU NextGen plugin inheriting from PluginInterface's
classes. It also implements the asynchronous behavior in the plugin operations:
kernel launches and memory transfers. To this end, it implements the concept of
streams of asynchronous operations. The streams are implemented using the HSA
signals to define input and output dependencies between asynchronous operations.

Missing features:
  - Retrieve the maximum number of threads per group that a kernel can run. This
    requires reading the image.
  - Implement __tgt_rtl_sync_event, not used on the libomptarget side.

Differential Revision: https://reviews.llvm.org/D138389
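
Illustration (not part of the commit): the stream idea described above can be sketched as a toy model in which each operation waits on the previous operation's output signal before it runs. Every name here (ToySignal, ToyOp, ToyStream, runToyExample) is hypothetical and is not taken from the plugin. The sketch does not call the HSA runtime: a plain integer stands in for a signal, with 1 meaning pending and 0 meaning completed, which is an assumption made for illustration only.

```cpp
#include <cassert>
#include <deque>
#include <functional>
#include <string>
#include <utility>

// Hypothetical stand-in for an HSA signal: 1 = pending, 0 = completed.
struct ToySignal {
  int Value = 1;
};

// One asynchronous operation (for example a kernel launch or a memory copy).
struct ToyOp {
  std::function<void()> Work;
  ToySignal *Input; // Signal this operation waits on (nullptr if none).
  ToySignal Output; // Signal set to 0 once this operation has finished.
  ToyOp(std::function<void()> W, ToySignal *In)
      : Work(std::move(W)), Input(In) {}
};

// Toy stream: each pushed operation takes the previous operation's output
// signal as its input dependency.
class ToyStream {
  // std::deque keeps references to existing elements valid on emplace_back.
  std::deque<ToyOp> Ops;

public:
  void push(std::function<void()> Work) {
    ToySignal *Input = Ops.empty() ? nullptr : &Ops.back().Output;
    Ops.emplace_back(std::move(Work), Input);
  }

  // Simulated device: polls operations in reverse order on purpose, so the
  // execution order is enforced by the signals and not by the scan order.
  void synchronize() {
    bool Progress = true;
    while (Progress) {
      Progress = false;
      for (auto It = Ops.rbegin(); It != Ops.rend(); ++It) {
        bool Done = It->Output.Value == 0;
        bool Ready = It->Input == nullptr || It->Input->Value == 0;
        if (!Done && Ready) {
          It->Work();
          It->Output.Value = 0; // Signal completion to the dependent op.
          Progress = true;
        }
      }
    }
    Ops.clear();
  }
};

// Pushes three dependent operations and returns the order they ran in.
std::string runToyExample() {
  std::string Log;
  ToyStream Stream;
  Stream.push([&Log] { Log += "H2D;"; });
  Stream.push([&Log] { Log += "kernel;"; });
  Stream.push([&Log] { Log += "D2H;"; });
  Stream.synchronize();
  return Log;
}
```

Calling runToyExample() pushes a host-to-device copy, a kernel, and a device-to-host copy; the operations run in push order even though the simulated device scans the queue backwards, because each one is gated on its predecessor's signal.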
Diffstat (limited to 'llvm/lib/CodeGen/MachineBasicBlock.cpp')
0 files changed, 0 insertions, 0 deletions