diff options
author | Feng Xue <fxue@os.amperecomputing.com> | 2024-10-11 14:55:05 +0800 |
---|---|---|
committer | Feng Xue <fxue@os.amperecomputing.com> | 2024-10-12 23:05:37 +0800 |
commit | a9173a50e7e346a218323916e4d3add8552529ae (patch) | |
tree | 9dc5dc68e888fe4dfc368c6f83644570fa617db7 /gcc/fortran/dump-parse-tree.cc | |
parent | f54d42e00007e7a558b273d87f95b3e5b1938f5a (diff) | |
download | gcc-a9173a50e7e346a218323916e4d3add8552529ae.zip gcc-a9173a50e7e346a218323916e4d3add8552529ae.tar.gz gcc-a9173a50e7e346a218323916e4d3add8552529ae.tar.bz2 |
vect: Fix inconsistency in fully-masked lane-reducing op generation [PR116985]
To align vectorized def/use when lane-reducing op is present in loop reduction,
we may need to insert extra trivial pass-through copies, which would cause
mismatch between lane-reducing vector copy and loop mask index. This could be
fixed by computing the right index around a new counter on effective lane-
reducing vector copies.
2024-10-11 Feng Xue <fxue@os.amperecomputing.com>
gcc/
PR tree-optimization/116985
* tree-vect-loop.cc (vect_transform_reduction): Compute loop mask
index based on effective vector copies for reduction op.
gcc/testsuite/
PR tree-optimization/116985
* gcc.dg/vect/pr116985.c: New testcase.
Diffstat (limited to 'gcc/fortran/dump-parse-tree.cc')
0 files changed, 0 insertions, 0 deletions