aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Transforms/Utils/CloneModule.cpp
diff options
context:
space:
mode:
authorArpith Chacko Jacob <acjacob@us.ibm.com>2017-01-18 19:35:00 +0000
committerArpith Chacko Jacob <acjacob@us.ibm.com>2017-01-18 19:35:00 +0000
commit44a87c9f1b330d243eae87c60151711c5b0b486a (patch)
tree66af7f701bfa080c252bcea9549369c3533c4088 /llvm/lib/Transforms/Utils/CloneModule.cpp
parent11255ec7659cec933e0b0415c14d50ca80cb16cd (diff)
downloadllvm-44a87c9f1b330d243eae87c60151711c5b0b486a.zip
llvm-44a87c9f1b330d243eae87c60151711c5b0b486a.tar.gz
llvm-44a87c9f1b330d243eae87c60151711c5b0b486a.tar.bz2
[OpenMP] Codegen for the 'target parallel' directive on the NVPTX device.
This patch adds codegen for the 'target parallel' directive on the NVPTX device. We term offload OpenMP directives such as 'target parallel' and 'target teams distribute parallel for' as SPMD constructs. SPMD constructs, in contrast to Generic ones like the plain 'target', can never contain a serial region. SPMD constructs can be handled more efficiently on the GPU and do not require the Warp Loop of the Generic codegen scheme. This patch adds SPMD codegen support for 'target parallel' on the NVPTX device and can be reused for other SPMD constructs. Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D28755 llvm-svn: 292428
Diffstat (limited to 'llvm/lib/Transforms/Utils/CloneModule.cpp')
0 files changed, 0 insertions, 0 deletions