aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Bitcode/Reader/BitcodeReader.cpp
diff options
context:
space:
mode:
authorFlorian Hahn <flo@fhahn.com>2024-01-25 18:28:44 +0000
committerGitHub <noreply@github.com>2024-01-25 18:28:44 +0000
commiteb678d89933684627ee2c5da47e53751f3fdbaa2 (patch)
treeadf67bde7b1e9331f494f9d152e1868f688b9683 /llvm/lib/Bitcode/Reader/BitcodeReader.cpp
parentc92ad411f2f94d8521cd18abcb37285f9a390ecb (diff)
downloadllvm-eb678d89933684627ee2c5da47e53751f3fdbaa2.zip
llvm-eb678d89933684627ee2c5da47e53751f3fdbaa2.tar.gz
llvm-eb678d89933684627ee2c5da47e53751f3fdbaa2.tar.bz2
[AArch64] Combine store (trunc X to <3 x i8>) to sequence of ST1.b. (#78637)
Improve codegen for (trunc X to <3 x i8>) by converting it to a sequence of 3 ST1.b, but first converting the truncate operand to either v8i8 or v16i8, extracting the lanes for the truncate results and storing them. At the moment, there are almost no cases in which such vector operations will be generated automatically. The motivating case is non-power-of-2 SLP vectorization: https://github.com/llvm/llvm-project/pull/77790 PR: https://github.com/llvm/llvm-project/pull/78637
Diffstat (limited to 'llvm/lib/Bitcode/Reader/BitcodeReader.cpp')
0 files changed, 0 insertions, 0 deletions