rocket-tools/riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	David Sherwood <david.sherwood@arm.com>	2024-12-19 10:07:41 +0000
committer	GitHub <noreply@github.com>	2024-12-19 10:07:41 +0000
commit	eaf482f01252a0276a6b422dabe810a1abc7e168 (patch)
tree	f14ade5abc8c7bd50750950c8c798621659a2b11 /clang/lib/CodeGen/CodeGenModule.cpp
parent	c18fda02e1c5dd68ce65b8505d3976f0d5714d52 (diff)
download	llvm-eaf482f01252a0276a6b422dabe810a1abc7e168.zip llvm-eaf482f01252a0276a6b422dabe810a1abc7e168.tar.gz llvm-eaf482f01252a0276a6b422dabe810a1abc7e168.tar.bz2

[AArch64] Tweak truncate costs for some scalable vector types (#119542)

== We were previously returning an invalid cost when truncating anything to <vscale x 2 x i1>, which is incorrect since we can generate perfectly good code for this. == The costs for truncating legal or unpacked types to predicates seemed overly optimistic. For example, when truncating <vscale x 8 x i16> to <vscale x 8 x i1> we typically do something like and z0.h, z0.h, #0x1 cmpne p0.h, p0/z, z0.h, #0 I guess it might depend upon whether the input value is generated in the same block or not and if we can avoid the inreg zero-extend. However, it feels safe to take the more conservative cost here. == The costs for some truncates such as trunc <vscale x 2 x i32> %a to <vscale x 2 x i16> were 1, whereas in actual fact they are free and no instructions are required. == Also, for this trunc <vscale x 8 x i32> %a to <vscale x 8 x i16> it's just a single uzp1 instruction so I reduced the cost to 1. In general, I've added costs for all cases where the destination type is legal or unpacked. One unfortunate side effect of this is the costs for some fixed-width truncates when using SVE now look too optimistic.

Diffstat (limited to 'clang/lib/CodeGen/CodeGenModule.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: