diff options
author | Richard Biener <rguenther@suse.de> | 2024-07-15 13:01:24 +0200 |
---|---|---|
committer | Richard Biener <rguenth@gcc.gnu.org> | 2024-07-16 09:47:19 +0200 |
commit | 1e3aa9c9278db69d4bdb661a750a7268789188d6 (patch) | |
tree | edd1c339390bdaf1a2bd496ddeadd80c61c11857 /gcc/tree-vect-loop.cc | |
parent | df9451936c6c9e4faea371e3f188e1fc6b6d39e3 (diff) | |
download | gcc-1e3aa9c9278db69d4bdb661a750a7268789188d6.zip gcc-1e3aa9c9278db69d4bdb661a750a7268789188d6.tar.gz gcc-1e3aa9c9278db69d4bdb661a750a7268789188d6.tar.bz2 |
Fixup unaligned load/store cost for znver4
Currently unaligned YMM and ZMM load and store costs are cheaper than
aligned which causes the vectorizer to purposely mis-align accesses
by adding an alignment prologue. It looks like the unaligned costs
were simply left untouched from znver3 where they equate the aligned
costs when tweaking aligned costs for znver4. The following makes
the unaligned costs equal to the aligned costs.
This avoids the miscompile seen in PR115843 but it's of course not
a real fix for the issue uncovered there. But it makes it qualify
as a regression fix.
PR tree-optimization/115843
* config/i386/x86-tune-costs.h (znver4_cost): Update unaligned
load and store cost from the aligned costs.
Diffstat (limited to 'gcc/tree-vect-loop.cc')
0 files changed, 0 insertions, 0 deletions