aboutsummaryrefslogtreecommitdiff
path: root/lldb/packages/Python/lldbsuite/test/lldbtest.py
diff options
context:
space:
mode:
authorSlava Zakharin <szakharin@nvidia.com>2022-10-20 13:46:49 -0700
committerSlava Zakharin <szakharin@nvidia.com>2022-10-26 11:06:26 -0700
commit36c2a9d54ddfaec123859714cad6073edf468b49 (patch)
tree40bc9bf604b06eedfc26cdcc77ab7ec48397bc2b /lldb/packages/Python/lldbsuite/test/lldbtest.py
parent26fdad031ca8baa2196a6c8c40c0a23c066e611e (diff)
downloadllvm-36c2a9d54ddfaec123859714cad6073edf468b49.zip
llvm-36c2a9d54ddfaec123859714cad6073edf468b49.tar.gz
llvm-36c2a9d54ddfaec123859714cad6073edf468b49.tar.bz2
[flang][runtime] New APIs for copyin/copyout of non-contiguous objects.
The intention is to use these APIs for copyin/copyout of subprogram arguments at the call sites. Currently, Flang generates loop nests to do this, and in some corner cases this results in very long compilation times due to LLVM loop optimizations. For example, Flang produces 25245 loops for 521.wrf/module_dm.f90. If we extract the copyin/copyout loops into runtime, Flang will only produce 207 loops, and the compilation time may reduce by 47x. Given that the copyin/copyout loop nests can not be fused with other loop nests, extracting them into runtime functions should not reduce performance if the runtime optimizes the leading contiguous dimension copies. The implementation will come in separate patches. Differential Revision: https://reviews.llvm.org/D136378
Diffstat (limited to 'lldb/packages/Python/lldbsuite/test/lldbtest.py')
0 files changed, 0 insertions, 0 deletions