diff options
author | Richard Biener <rguenther@suse.de> | 2024-06-26 14:07:51 +0200 |
---|---|---|
committer | Richard Biener <rguenth@gcc.gnu.org> | 2024-06-28 13:43:28 +0200 |
commit | 0192341a07b8ea30f631cf4afdc6fcf3fa7ce838 (patch) | |
tree | a9634f92dc4bbfb0f2ddc1385d3968dbf6a104c5 /gcc | |
parent | ef0b30212f7756db15d7507bfd871bf377d7d648 (diff) | |
download | gcc-0192341a07b8ea30f631cf4afdc6fcf3fa7ce838.zip gcc-0192341a07b8ea30f631cf4afdc6fcf3fa7ce838.tar.gz gcc-0192341a07b8ea30f631cf4afdc6fcf3fa7ce838.tar.bz2 |
tree-optimization/115640 - outer loop vect with inner SLP permute
The following fixes wrong-code when using outer loop vectorization
and an inner loop SLP access with permutation. A wrong adjustment
to the IV increment is then applied on GCN.
PR tree-optimization/115640
* tree-vect-stmts.cc (vectorizable_load): With an inner
loop SLP access to not apply a gap adjustment.
Diffstat (limited to 'gcc')
-rw-r--r-- | gcc/tree-vect-stmts.cc | 11 |
1 files changed, 8 insertions, 3 deletions
diff --git a/gcc/tree-vect-stmts.cc b/gcc/tree-vect-stmts.cc index 0b0761b..7b889f3 100644 --- a/gcc/tree-vect-stmts.cc +++ b/gcc/tree-vect-stmts.cc @@ -10512,9 +10512,14 @@ vectorizable_load (vec_info *vinfo, whole group, not only the number of vector stmts the permutation result fits in. */ unsigned scalar_lanes = SLP_TREE_LANES (slp_node); - if (slp_perm - && (group_size != scalar_lanes - || !multiple_p (nunits, group_size))) + if (nested_in_vect_loop) + /* We do not support grouped accesses in a nested loop, + instead the access is contiguous but it might be + permuted. No gap adjustment is needed though. */ + vec_num = SLP_TREE_NUMBER_OF_VEC_STMTS (slp_node); + else if (slp_perm + && (group_size != scalar_lanes + || !multiple_p (nunits, group_size))) { /* We don't yet generate such SLP_TREE_LOAD_PERMUTATIONs for variable VF; see vect_transform_slp_perm_load. */ |