diff options
author | David Pagan <dave.pagan@amd.com> | 2024-04-10 13:09:17 -0700 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-04-10 13:09:17 -0700 |
commit | a12836647e08c4ad203b9834ac55892fa0b9f2d3 (patch) | |
tree | 69d8c2c362db8bad0db4f13a1e5c8746f6e1fb1f /flang/lib/Frontend/CompilerInvocation.cpp | |
parent | b3792ae42a4adda5cb51d53f3d6a4b9b025b11fd (diff) | |
download | llvm-a12836647e08c4ad203b9834ac55892fa0b9f2d3.zip llvm-a12836647e08c4ad203b9834ac55892fa0b9f2d3.tar.gz llvm-a12836647e08c4ad203b9834ac55892fa0b9f2d3.tar.bz2 |
[OpenMP][CodeGen] Improved codegen for combined loop directives (#87278)
IR for 'target teams loop' is now dependent on suitability of associated
loop-nest.
If a loop-nest:
- does not contain a function call, or
- the -fopenmp-assume-no-nested-parallelism has been specified,
- or the call is to an OpenMP API AND
- does not contain nested loop bind(parallel) directives
then it can be emitted as 'target teams distribute parallel for', which
is the current default. Otherwise, it is emitted as 'target teams
distribute'.
Added debug output indicating how 'target teams loop' was emitted. Flag
is -mllvm -debug-only=target-teams-loop-codegen
Added LIT tests explicitly verifying 'target teams loop' emitted as a
parallel loop and a distribute loop.
Updated other 'loop' related tests as needed to reflect change in IR.
- These updates account for most of the changed files and
additions/deletions.
Diffstat (limited to 'flang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions