riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Florian Hahn <flo@fhahn.com>	2024-01-25 18:28:44 +0000
committer	GitHub <noreply@github.com>	2024-01-25 18:28:44 +0000
commit	eb678d89933684627ee2c5da47e53751f3fdbaa2 (patch)
tree	adf67bde7b1e9331f494f9d152e1868f688b9683 /llvm/lib/Bitcode/Reader/BitcodeReader.cpp
parent	c92ad411f2f94d8521cd18abcb37285f9a390ecb (diff)
download	llvm-eb678d89933684627ee2c5da47e53751f3fdbaa2.zip llvm-eb678d89933684627ee2c5da47e53751f3fdbaa2.tar.gz llvm-eb678d89933684627ee2c5da47e53751f3fdbaa2.tar.bz2

[AArch64] Combine store (trunc X to <3 x i8>) to sequence of ST1.b. (#78637)

Improve codegen for (trunc X to <3 x i8>) by converting it to a sequence of 3 ST1.b, but first converting the truncate operand to either v8i8 or v16i8, extracting the lanes for the truncate results and storing them. At the moment, there are almost no cases in which such vector operations will be generated automatically. The motivating case is non-power-of-2 SLP vectorization: https://github.com/llvm/llvm-project/pull/77790 PR: https://github.com/llvm/llvm-project/pull/78637

Diffstat (limited to 'llvm/lib/Bitcode/Reader/BitcodeReader.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: