riscv-gnu-toolchain/llvm.git

author	Simon Pilgrim <llvm-dev@redking.me.uk>	2021-05-24 18:26:16 +0100
committer	Simon Pilgrim <llvm-dev@redking.me.uk>	2021-05-25 11:35:57 +0100
commit	68ef68f8ac5cad1fdcd9c7b0e2a8f134d9f595ae (patch)
tree	c938805bdde9d054b60b399fbebe6f35c18cd6d5 /clang/lib/CodeGen/CodeGenFunction.cpp
parent	e3b8e6d48251a3b85f925fe695ef961013ddb940 (diff)

[CostModel][X86] Improve accuracy of vXi8/vXi16 vector non-uniform shift costs on AVX2/AVX512 targets

llvm-mca analysis shows that AVX2+ capable targets have higher throughput for VPBLENDVB and VPMOVZX ops, making it cheaper to perform shift+select patterns for vXi8 shifts or extend/shift/truncate patterns for vXi16 shifts. Similarly, AVX512BW targets can perform vXi8 shifts as extend/shift/truncate patterns.
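
For readers unfamiliar with the two lowering patterns named above, here is a minimal scalar sketch of what each one computes for a non-uniform (per-lane) left shift. It is an illustration only, not code from this commit or from the X86 backend: the function names shl_v16_extend and shl_v8_select are hypothetical, the loops stand in for vector instructions, and the vXi8 variant assumes every shift amount is in the range 0..7.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <initializer_list>

// Hypothetical helper: extend/shift/truncate model of a per-lane i16 shift.
// Each lane is zero-extended to 32 bits (the role VPMOVZX plays), shifted by
// its own amount, then truncated back to 16 bits.
template <std::size_t N>
std::array<std::uint16_t, N> shl_v16_extend(
    const std::array<std::uint16_t, N>& v,
    const std::array<std::uint16_t, N>& amt) {
  std::array<std::uint16_t, N> out{};
  for (std::size_t i = 0; i < N; ++i) {
    std::uint32_t wide = v[i];                                 // zero-extend
    std::uint32_t shifted = amt[i] < 32 ? wide << amt[i] : 0;  // 32-bit shift
    out[i] = static_cast<std::uint16_t>(shifted);              // truncate
  }
  return out;
}

// Hypothetical helper: shift+select model of a per-lane i8 shift.
// The whole vector is shifted by 4, 2 and 1 in turn, and each lane keeps the
// shifted value only if the matching bit of its amount is set (the role a
// byte blend such as VPBLENDVB plays).
template <std::size_t N>
std::array<std::uint8_t, N> shl_v8_select(
    const std::array<std::uint8_t, N>& v,
    const std::array<std::uint8_t, N>& amt) {
  std::array<std::uint8_t, N> out = v;
  for (unsigned step : {4u, 2u, 1u}) {
    for (std::size_t i = 0; i < N; ++i) {
      assert(amt[i] < 8 && "sketch assumes in-range shift amounts");
      std::uint8_t shifted = static_cast<std::uint8_t>(out[i] << step);
      out[i] = (amt[i] & step) ? shifted : out[i];  // per-lane select
    }
  }
  return out;
}
```

The cost model change concerns how expensive these sequences are estimated to be on AVX2/AVX512 targets; the sketch only shows why both sequences produce the same result as a direct per-lane shift.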

Diffstat (limited to 'clang/lib/CodeGen/CodeGenFunction.cpp')

0 files changed, 0 insertions, 0 deletions

