author | Slava Zakharin <szakharin@nvidia.com> | 2022-07-08 15:10:33 -0700 |
---|---|---|
committer | Slava Zakharin <szakharin@nvidia.com> | 2022-07-12 08:33:39 -0700 |
commit | a280043b523182ab6bb3ce5caf75e931a26eaf3f (patch) | |
tree | e0cc623d4a131b59f3fc5e748b79f3ecf62a3e5c /clang/lib/CodeGen/CodeGenFunction.cpp | |
parent | d6ef3d20b4e3768dc30fb229dfa938d8059fffef (diff) | |
[flang] Lower TRANSPOSE without using runtime.
Calling runtime TRANSPOSE requires a temporary array for the result,
and, sometimes, a temporary array for the argument. Lowering it inline
should provide faster code.
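
To illustrate the difference described above, here is a minimal C++ sketch. It is not flang's actual lowering or runtime API. The names `transposeToTemporary`, `assignViaCall`, and `assignInline` are hypothetical, and row-major `std::vector<double>` storage is an assumption chosen for brevity. The call-based path materializes the result in a temporary and then copies it, while the inline path writes each element directly to its destination.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a runtime-style routine: it builds the
// transposed matrix in a freshly allocated temporary and returns it.
// src is a rows x cols matrix stored in row-major order.
std::vector<double> transposeToTemporary(const std::vector<double> &src,
                                         std::size_t rows, std::size_t cols) {
  assert(src.size() == rows * cols);
  std::vector<double> tmp(rows * cols);
  for (std::size_t i = 0; i < rows; ++i)
    for (std::size_t j = 0; j < cols; ++j)
      tmp[j * rows + i] = src[i * cols + j];
  return tmp;
}

// Call-based assignment: the result is produced in a temporary first and
// then copied element by element into the preallocated destination.
void assignViaCall(std::vector<double> &dst, const std::vector<double> &src,
                   std::size_t rows, std::size_t cols) {
  assert(dst.size() == rows * cols);
  std::vector<double> tmp = transposeToTemporary(src, rows, cols);
  std::copy(tmp.begin(), tmp.end(), dst.begin());
}

// Inline assignment: a loop nest stores each transposed element straight
// into the destination, so no temporary result array is needed.
// This sketch assumes dst and src do not overlap.
void assignInline(std::vector<double> &dst, const std::vector<double> &src,
                  std::size_t rows, std::size_t cols) {
  assert(src.size() == rows * cols && dst.size() == rows * cols);
  for (std::size_t i = 0; i < rows; ++i)
    for (std::size_t j = 0; j < cols; ++j)
      dst[j * rows + i] = src[i * cols + j];
}
```

Both functions fill `dst` with the same values; the inline version simply skips one allocation and one full pass over the data.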
I added the -opt-transpose control temporarily, just for debugging purposes.
I am going to make driver changes that will disable inline lowering
for -O0. For the time being I would like to enable it by default
to expose the code to more tests.
Differential Revision: https://reviews.llvm.org/D129497
Diffstat (limited to 'clang/lib/CodeGen/CodeGenFunction.cpp')
0 files changed, 0 insertions, 0 deletions