author | H.J. Lu <hjl.tools@gmail.com> | 2019-01-23 06:33:58 -0800 |
---|---|---|
committer | H.J. Lu <hjl.tools@gmail.com> | 2020-07-09 06:35:57 -0700 |
commit | fab263ab0fc10ea08409b80afa7e8569438b8d28 (patch) | |
tree | a8c93ef2193d9ce7dd7128a0956de024c56da927 /gcc/tree-vect-loop.c | |
parent | a1e25d0008791118dd58eaddff5f4c3691f8750e (diff) | |
x86: Enable FMA in rsqrt<mode>2 expander
Enable FMA in the rsqrt<mode>2 expander and fold the rsqrtv16sf2 expander
into the rsqrt<mode>2 expander, which expands to UNSPEC_RSQRT28 for
TARGET_AVX512ER. Although this shows no performance change in our
workloads, FMA can improve other workloads.
gcc/
PR target/88713
* config/i386/i386-expand.c (ix86_emit_swsqrtsf): Enable FMA.
* config/i386/sse.md (VF_AVX512VL_VF1_128_256): New.
(rsqrt<mode>2): Replace VF1_128_256 with VF_AVX512VL_VF1_128_256.
(rsqrtv16sf2): Removed.
gcc/testsuite/
PR target/88713
* gcc.target/i386/pr88713-1.c: New test.
* gcc.target/i386/pr88713-2.c: Likewise.