diff options
author | Hans Wennborg <hans@hanshq.net> | 2019-09-18 13:41:51 +0000 |
---|---|---|
committer | Hans Wennborg <hans@hanshq.net> | 2019-09-18 13:41:51 +0000 |
commit | 858a1ae37d260ae454751bbf5d74742138f10676 (patch) | |
tree | f1458798e4300dc2c4220e6dd663ef2cd7b5bc4e /llvm/lib/CodeGen/MachineBasicBlock.cpp | |
parent | 89ad7f7a1ba245fccf1f64f95a146ada6f6aaac6 (diff) | |
download | llvm-858a1ae37d260ae454751bbf5d74742138f10676.zip llvm-858a1ae37d260ae454751bbf5d74742138f10676.tar.gz llvm-858a1ae37d260ae454751bbf5d74742138f10676.tar.bz2 |
Revert r372082 "[Clang] Pragma vectorize_width() implies vectorize(enable)"
This broke the Chromium build. Consider the following code:
float ScaleSumSamples_C(const float* src, float* dst, float scale, int width) {
float fsum = 0.f;
int i;
#if defined(__clang__)
#pragma clang loop vectorize_width(4)
#endif
for (i = 0; i < width; ++i) {
float v = *src++;
fsum += v * v;
*dst++ = v * scale;
}
return fsum;
}
Compiling at -Oz, Clang now warns:
$ clang++ -target x86_64 -Oz -c /tmp/a.cc
/tmp/a.cc:1:7: warning: loop not vectorized: the optimizer was unable to
perform the requested transformation; the transformation might be disabled or
specified as part of an unsupported transformation ordering
[-Wpass-failed=transform-warning]
this suggests it's not actually enabling vectorization hard enough.
At -Os it asserts instead:
$ build.release/bin/clang++ -target x86_64 -Os -c /tmp/a.cc
clang-10: /work/llvm.monorepo/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp:2734: void
llvm::InnerLoopVectorizer::emitMemRuntimeChecks(llvm::Loop*, llvm::BasicBlock*): Assertion `
!BB->getParent()->hasOptSize() && "Cannot emit memory checks when optimizing for size"' failed.
Of course neither of these are what the developer expected from the pragma.
> Specifying the vectorization width was supposed to implicitly enable
> vectorization, except that it wasn't really doing this. It was only
> setting the vectorize.width metadata, but not vectorize.enable.
>
> This should fix PR27643.
>
> Differential Revision: https://reviews.llvm.org/D66290
llvm-svn: 372225
Diffstat (limited to 'llvm/lib/CodeGen/MachineBasicBlock.cpp')
0 files changed, 0 insertions, 0 deletions