diff options
author | David Sherwood <57997763+david-arm@users.noreply.github.com> | 2024-01-26 14:43:48 +0000 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-01-26 14:43:48 +0000 |
commit | 962fbafecf4730ba84a3b9fd7a662a5c30bb2c7c (patch) | |
tree | d7b1ac70933d78d521d0a21998e3decc6a699870 /llvm/lib/IR/Function.cpp | |
parent | 70fbcdb41d50ee23a36501aee733be33eecfb7da (diff) | |
download | llvm-962fbafecf4730ba84a3b9fd7a662a5c30bb2c7c.zip llvm-962fbafecf4730ba84a3b9fd7a662a5c30bb2c7c.tar.gz llvm-962fbafecf4730ba84a3b9fd7a662a5c30bb2c7c.tar.bz2 |
[LoopVectorize] Refine runtime memory check costs when there is an outer loop (#76034)
When we generate runtime memory checks for an inner loop it's
possible that these checks are invariant in the outer loop and
so will get hoisted out. In such cases, the effective cost of
the checks should reduce to reflect the outer loop trip count.
This fixes a 25% performance regression introduced by commit
49b0e6dcc296792b577ae8f0f674e61a0929b99d
when building the SPEC2017 x264 benchmark with PGO, where we
decided the inner loop trip count wasn't high enough to warrant
the (incorrect) high cost of the runtime checks. Also, when
runtime memory checks consist entirely of diff checks these are
likely to be outer loop invariant.
Diffstat (limited to 'llvm/lib/IR/Function.cpp')
0 files changed, 0 insertions, 0 deletions