aboutsummaryrefslogtreecommitdiff
path: root/gcc/tree-inline.c
diff options
context:
space:
mode:
authorKyrylo Tkachov <kyrylo.tkachov@arm.com>2019-07-02 08:24:54 +0000
committerKyrylo Tkachov <ktkachov@gcc.gnu.org>2019-07-02 08:24:54 +0000
commit8bb9a5e66a150b73c97aeffee52b57147022a817 (patch)
treead25eff77ffcceb371d0439a550fcb2b7298963e /gcc/tree-inline.c
parent606a9a8c728f2dae6b9d58397fcefb3cb0fa0db8 (diff)
downloadgcc-8bb9a5e66a150b73c97aeffee52b57147022a817.zip
gcc-8bb9a5e66a150b73c97aeffee52b57147022a817.tar.gz
gcc-8bb9a5e66a150b73c97aeffee52b57147022a817.tar.bz2
[arm/AArch64] Assume unhandled NEON types are neon_arith_basic types when scheduling for Cortex-A57
Some scheduling descriptions, like the Cortex-A57 one, are reused for multiple -mcpu options. Sometimes those other -mcpu cores support more architecture features than the Armv8-A Cortex-A57. For example, the Cortex-A75 and Cortex-A76 support Armv8.2-A as well as the Dot Product instructions. These Dot Product instructions have the neon_dot and neon_dot_q scheduling type, but that type is not handled in cortex-a57.md, since the Cortex-A57 itself doesn't need to care about these instructions. But if we just ignore the neon_dot(_q) type at scheduling we get really terrible codegen when compiling for -mcpu=cortex-a76, for example, because the scheduler just pools all the UDOT instructions at the end of the basic block, since it doesn't assume anything about their behaviour. This patch ameliorates the situation somewhat by telling the Cortex-A57 scheduling model to treat any insn that doesn't get assigned a cortex_a57_neon_type but is actually a is_neon_type instruction as a simple neon_arith_basic instruction. This allows us to treat post-Armv8-A SIMD instructions more sanely without having to model each of them explicitly in cortex-a57.md. * config/arm/cortex-a57.md (cortex_a57_neon_type): Use neon_arith_basic for is_neon_type instructions that have not already been categorized. From-SVN: r272924
Diffstat (limited to 'gcc/tree-inline.c')
0 files changed, 0 insertions, 0 deletions