diff options
author | Robin Dapp <rdapp@ventanamicro.com> | 2024-06-07 14:36:41 +0200 |
---|---|---|
committer | Robin Dapp <rdapp@ventanamicro.com> | 2024-06-11 20:05:29 +0200 |
commit | 2b438a0d2aa80f051a09b245a58f643540d4004b (patch) | |
tree | 879fdb5f416f5c09c0b99e3e857227c594757dd7 /gcc/tree-vectorizer.h | |
parent | 1588983be6112561c805a50eb7a3c585865beffa (diff) | |
download | gcc-2b438a0d2aa80f051a09b245a58f643540d4004b.zip gcc-2b438a0d2aa80f051a09b245a58f643540d4004b.tar.gz gcc-2b438a0d2aa80f051a09b245a58f643540d4004b.tar.bz2 |
vect: Merge loop mask and cond_op mask in fold-left reduction [PR115382].
Currently we discard the cond-op mask when the loop is fully masked
which causes wrong code in
gcc.dg/vect/vect-cond-reduc-in-order-2-signed-zero.c
when compiled with
-O3 -march=cascadelake --param vect-partial-vector-usage=2.
This patch ANDs both masks.
gcc/ChangeLog:
PR tree-optimization/115382
* tree-vect-loop.cc (vectorize_fold_left_reduction): Use
prepare_vec_mask.
* tree-vect-stmts.cc (check_load_store_for_partial_vectors):
Remove static of prepare_vec_mask.
* tree-vectorizer.h (prepare_vec_mask): Export.
Diffstat (limited to 'gcc/tree-vectorizer.h')
-rw-r--r-- | gcc/tree-vectorizer.h | 3 |
1 files changed, 3 insertions, 0 deletions
diff --git a/gcc/tree-vectorizer.h b/gcc/tree-vectorizer.h index 97ec9c3..6bb0f5c 100644 --- a/gcc/tree-vectorizer.h +++ b/gcc/tree-vectorizer.h @@ -2508,6 +2508,9 @@ extern void vect_free_slp_tree (slp_tree); extern bool compatible_calls_p (gcall *, gcall *); extern int vect_slp_child_index_for_operand (const gimple *, int op, bool); +extern tree prepare_vec_mask (loop_vec_info, tree, tree, tree, + gimple_stmt_iterator *); + /* In tree-vect-patterns.cc. */ extern void vect_mark_pattern_stmts (vec_info *, stmt_vec_info, gimple *, tree); |