Add x86 addsub SLP pattern

This addds SLP pattern recognition for the SSE3/AVX [v]addsubp{ds} v0, v1 instructions which compute { v0[0] - v1[0], v0[1], + v1[1], ... } thus subtract, add alternating on lanes, starting with subtract. It adds a corresponding optab and direct internal function, vec_addsub$a3 and renames the existing i386 backend patterns to the new canonical name. The SLP pattern matches the exact alternating lane sequence rather than trying to be clever and anticipating incoming permutes - we could permute the two input vectors to the needed lane alternation, do the addsub and then permute the result vector back but that's only profitable in case the two input or the output permute will vanish - something Tamars refactoring of SLP pattern recog should make possible. 2021-06-17 Richard Biener <rguenther@suse.de> * config/i386/sse.md (avx_addsubv4df3): Rename to vec_addsubv4df3. (avx_addsubv8sf3): Rename to vec_addsubv8sf3. (sse3_addsubv2df3): Rename to vec_addsubv2df3. (sse3_addsubv4sf3): Rename to vec_addsubv4sf3. * config/i386/i386-builtin.def: Adjust. * internal-fn.def (VEC_ADDSUB): New internal optab fn. * optabs.def (vec_addsub_optab): New optab. * tree-vect-slp-patterns.c (class addsub_pattern): New. (slp_patterns): Add addsub_pattern. * tree-vect-slp.c (vect_optimize_slp): Disable propagation across CFN_VEC_ADDSUB. * tree-vectorizer.h (vect_pattern::vect_pattern): Make m_ops optional. * doc/md.texi (vec_addsub<mode>3): Document. * gcc.target/i386/vect-addsubv2df.c: New testcase. * gcc.target/i386/vect-addsubv4sf.c: Likewise. * gcc.target/i386/vect-addsubv4df.c: Likewise. * gcc.target/i386/vect-addsubv8sf.c: Likewise. * gcc.target/i386/vect-addsub-2.c: Likewise. * gcc.target/i386/vect-addsub-3.c: Likewise.
author: Richard Biener <rguenther@suse.de> 2021-05-31 13:19:01 +0200
committer: Richard Biener <rguenther@suse.de> 2021-06-24 13:08:25 +0200
commit: 7a6c31f0f84a7295433ebac09b94fae2d5cc2892 (patch)
tree: ae70aac1f6e8305d23a8ee08ce92d3b57d4a3100 /gcc/tree-vect-slp.c
parent: 9872bd8c35be0f4d475fac739115cf5b82cdabc0 (diff)
download: gcc-7a6c31f0f84a7295433ebac09b94fae2d5cc2892.zip
gcc-7a6c31f0f84a7295433ebac09b94fae2d5cc2892.tar.gz
gcc-7a6c31f0f84a7295433ebac09b94fae2d5cc2892.tar.bz2
1 files changed, 1 insertions, 0 deletions
diff --git a/gcc/tree-vect-slp.c b/gcc/tree-vect-slp.c
index 69ee8fa..227d6aa 100644
--- a/gcc/tree-vect-slp.c
+++ b/gcc/tree-vect-slp.c
@@ -3705,6 +3705,7 @@ vect_optimize_slp (vec_info *vinfo)
 	      case CFN_COMPLEX_ADD_ROT270:
 	      case CFN_COMPLEX_MUL:
 	      case CFN_COMPLEX_MUL_CONJ:
+	      case CFN_VEC_ADDSUB:
 		continue;
 	      default:;
 	      }
author	Richard Biener <rguenther@suse.de>	2021-05-31 13:19:01 +0200
committer	Richard Biener <rguenther@suse.de>	2021-06-24 13:08:25 +0200
commit	7a6c31f0f84a7295433ebac09b94fae2d5cc2892 (patch)
tree	ae70aac1f6e8305d23a8ee08ce92d3b57d4a3100 /gcc/tree-vect-slp.c
parent	9872bd8c35be0f4d475fac739115cf5b82cdabc0 (diff)
download	gcc-7a6c31f0f84a7295433ebac09b94fae2d5cc2892.zip gcc-7a6c31f0f84a7295433ebac09b94fae2d5cc2892.tar.gz gcc-7a6c31f0f84a7295433ebac09b94fae2d5cc2892.tar.bz2