riscv-gnu-toolchain/gcc.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Tamar Christina <tamar.christina@arm.com>	2024-08-06 22:41:10 +0100
committer	Tamar Christina <tamar.christina@arm.com>	2024-08-06 22:41:10 +0100
commit	a50916a6c0a6c73c1537d033509d4f7034341f75 (patch)
tree	49b0ea6cd0545f510b63ac075bf40b0e081a7adc /gcc/tree-iterator.cc
parent	77d232522d3eb7a6541fc91c3092c115cc535275 (diff)
download	gcc-a50916a6c0a6c73c1537d033509d4f7034341f75.zip gcc-a50916a6c0a6c73c1537d033509d4f7034341f75.tar.gz gcc-a50916a6c0a6c73c1537d033509d4f7034341f75.tar.bz2

AArch64: take gather/scatter decode overhead into account

Gather and scatters are not usually beneficial when the loop count is small. This is because there's not only a cost to their execution within the loop but there is also some cost to enter loops with them. As such this patch models this overhead. For generic tuning we however still prefer gathers/scatters when the loop costs work out. gcc/ChangeLog: * config/aarch64/aarch64-protos.h (struct sve_vec_cost): Add gather_load_x32_init_cost and gather_load_x64_init_cost. * config/aarch64/aarch64.cc (aarch64_vector_costs): Add m_sve_gather_scatter_init_cost. (aarch64_vector_costs::add_stmt_cost): Use them. (aarch64_vector_costs::finish_cost): Likewise. * config/aarch64/tuning_models/a64fx.h: Update. * config/aarch64/tuning_models/cortexx925.h: Update. * config/aarch64/tuning_models/generic.h: Update. * config/aarch64/tuning_models/generic_armv8_a.h: Update. * config/aarch64/tuning_models/generic_armv9_a.h: Update. * config/aarch64/tuning_models/neoverse512tvb.h: Update. * config/aarch64/tuning_models/neoversen2.h: Update. * config/aarch64/tuning_models/neoversen3.h: Update. * config/aarch64/tuning_models/neoversev1.h: Update. * config/aarch64/tuning_models/neoversev2.h: Update. * config/aarch64/tuning_models/neoversev3.h: Update. * config/aarch64/tuning_models/neoversev3ae.h: Update.

Diffstat (limited to 'gcc/tree-iterator.cc')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: