author: David Sherwood <david.sherwood@arm.com> 2021-08-18 09:40:21 +0100
committer: David Sherwood <david.sherwood@arm.com> 2021-08-18 17:01:56 +0100
commit: 219d4518fce9aafcb5eba9b92fb778837f0a4827
tree: 9b5b13f75e9f160783327c45a351b2facfae459a /llvm/lib/Bitcode/Writer/BitcodeWriter.cpp
parent: 13d8f000d7271226e5dfc6c0dc25b91cf6233349
[Analysis][AArch64] Make fixed-width ordered reductions slightly more expensive
For tight loops like this:

  float r = 0;
  for (int i = 0; i < n; i++) {
    r += a[i];
  }

it's better not to vectorise at -O3 using fixed-width ordered reductions
on AArch64 targets. Although the resulting number of instructions in the
generated code ends up being comparable to not vectorising at all, there
may be additional costs on some CPUs, for example perhaps the scheduling
is worse. It makes sense to deter vectorisation in tight loops.

Differential Revision: https://reviews.llvm.org/D108292
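As background on why the reduction in such a loop has to be ordered unless reassociation is explicitly allowed: floating-point addition is not associative, so summing in a different order can change the result. The sketch below is illustrative only and is not taken from the patch. The function names `ordered_sum` and `reassociated_sum` are hypothetical, and the four-accumulator version only approximates what an unordered 4-lane vector reduction would compute.

```cpp
#include <cstddef>
#include <vector>

// In-order (strict) sum: performs the additions in exactly the same
// order as the scalar loop in the commit message.
float ordered_sum(const std::vector<float>& a) {
    float r = 0.0f;
    for (std::size_t i = 0; i < a.size(); i++) {
        r += a[i];
    }
    return r;
}

// Reassociated sum: keeps four independent partial sums (one per
// hypothetical vector lane) and combines them at the end. This changes
// the order of the additions relative to the scalar loop.
float reassociated_sum(const std::vector<float>& a) {
    float lane[4] = {0.0f, 0.0f, 0.0f, 0.0f};
    std::size_t i = 0;
    for (; i + 4 <= a.size(); i += 4) {
        for (std::size_t l = 0; l < 4; l++) {
            lane[l] += a[i + l];
        }
    }
    // Combine the partial sums pairwise.
    float r = (lane[0] + lane[1]) + (lane[2] + lane[3]);
    // Handle any remaining tail elements in order.
    for (; i < a.size(); i++) {
        r += a[i];
    }
    return r;
}
```

With the input `{1e8f, 1.0f, -1e8f, 1.0f}` and IEEE 754 single-precision arithmetic (no fast-math flags), the in-order sum yields 1 while the reassociated sum yields 0, because each `1.0f` is lost when added directly to a value of magnitude 1e8. An ordered vector reduction must avoid this discrepancy by adding the lanes one at a time, which is why it gains little over the scalar loop.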
Diffstat (limited to 'llvm/lib/Bitcode/Writer/BitcodeWriter.cpp')
0 files changed, 0 insertions, 0 deletions