diff options
author | liuhongt <hongtao.liu@intel.com> | 2022-03-03 13:31:39 +0800 |
---|---|---|
committer | liuhongt <hongtao.liu@intel.com> | 2022-03-08 09:02:05 +0800 |
commit | b1a741a03041782b34effcb820fbe19ba41a3b8f (patch) | |
tree | 6a29c55f51928c7394f18f1edb88a9c67c2e3b54 /gcc/fortran | |
parent | e6533e2ebec964e77d3a2462abbabd214d677552 (diff) | |
download | gcc-b1a741a03041782b34effcb820fbe19ba41a3b8f.zip gcc-b1a741a03041782b34effcb820fbe19ba41a3b8f.tar.gz gcc-b1a741a03041782b34effcb820fbe19ba41a3b8f.tar.bz2 |
Optimize v4si broadcast for noavx512vl.
This will enable below
- vbroadcastss .LC1(%rip), %xmm0
+ movl $-45, %edx
+ vmovd %edx, %xmm0
+ vpshufd $0, %xmm0, %xmm0
According to microbenchmark, it's faster than broadcast from memory
for TARGET_INTER_UNIT_MOVES_TO_VEC.
gcc/ChangeLog:
* config/i386/sse.md (*vec_dupv4si): Disable memory operand
for !TARGET_INTER_UNIT_MOVES_TO_VEC when prefer_for_speed.
gcc/testsuite/ChangeLog:
* gcc.target/i386/pr100865-8a.c: Adjust testcase.
* gcc.target/i386/pr100865-8c.c: Ditto.
* gcc.target/i386/pr100865-9c.c: Ditto.
Diffstat (limited to 'gcc/fortran')
0 files changed, 0 insertions, 0 deletions