author | Jan Hubicka <hubicka@ucw.cz> | 2025-04-15 19:04:15 +0200 |
---|---|---|
committer | Jan Hubicka <hubicka@ucw.cz> | 2025-04-15 19:04:15 +0200 |
commit | e2011ab13de3e70774f869b356f5f9c750780b34 (patch) | |
tree | 7a7bfdbccdc149cfa40fda6c5aa5e0428d8e4dda /libjava | |
parent | 25775e73ea4d40a55a26b71c42cc6509caf4845f (diff) | |
Set ADDSS cost to 3 for znver5
Znver5 has an addss latency of 2 in the typical case, while all earlier versions have a latency of 3.
Unfortunately, the addss cost is used to cost many SSE instructions other than just addss, and
setting the cost to 2 makes us vectorize four 64-bit stores into one 256-bit store, which
in turn regresses imagemagick.
This patch sets the cost back to 3. Next stage1 we can untie addss from the other operations
and set it correctly.
Bootstrapped/regtested on x86_64-linux and also benchmarked on SPEC2k17.
gcc/ChangeLog:
PR target/119298
* config/i386/x86-tune-costs.h (znver5_cost): Set ADDSS cost to 3.
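
The coupling described in the message can be illustrated with a toy model. This is only a sketch under stated assumptions, not GCC's vectorizer cost code: `ToyCosts`, `kScalarStore`, `kVectorStore`, `scalar_cost`, `vector_cost` and `vectorize_profitable` are hypothetical names, and all numbers other than the two candidate addss values are made up for illustration. The only parts taken from GCC are the idea of a single `addss` entry that prices several SSE operations and the `COSTS_N_INSNS` scaling of 4 units per instruction.

```cpp
// Same scaling as GCC's COSTS_N_INSNS macro: one instruction = 4 cost units.
constexpr int costs_n_insns(int n) { return n * 4; }

// Hypothetical, heavily reduced cost table. Only the shared knob is modelled.
struct ToyCosts {
    int addss;  // named after the ADDSS entry; reused below for other SSE ops
};

// Made-up costs for the stores themselves (assumptions, not znver5 data).
constexpr int kScalarStore = costs_n_insns(2);  // one 64-bit store
constexpr int kVectorStore = costs_n_insns(1);  // one 256-bit store

// Scalar version: four independent 64-bit stores.
constexpr int scalar_cost() { return 4 * kScalarStore; }

// Vector version: assume three SSE ops to assemble the 256-bit value, each
// priced with the shared addss entry, followed by a single wide store.
constexpr int vector_cost(ToyCosts c) { return 3 * c.addss + kVectorStore; }

// The toy "vectorizer" picks the vector form only when it is strictly cheaper.
constexpr bool vectorize_profitable(ToyCosts c) {
    return vector_cost(c) < scalar_cost();
}

// With addss priced at 2 instructions the vector form wins in this model;
// with addss priced at 3 instructions the scalar stores are kept.
static_assert(vectorize_profitable(ToyCosts{costs_n_insns(2)}),
              "addss = 2: toy model vectorizes the four stores");
static_assert(!vectorize_profitable(ToyCosts{costs_n_insns(3)}),
              "addss = 3: toy model keeps the scalar stores");
```

In this model, changing a value that was meant to describe one instruction flips an unrelated store decision, which is why the message proposes untying addss from the other operations before lowering it.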
Diffstat (limited to 'libjava')
0 files changed, 0 insertions, 0 deletions