riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Wei Mi <wmi@google.com>	2016-06-14 18:53:20 +0000
committer	Wei Mi <wmi@google.com>	2016-06-14 18:53:20 +0000
commit	b799a625f9221b017e2ab2503e291c503da6c763 (patch)
tree	11a43258809590cacd2698956315b61d21259186 /clang/lib/CodeGen/CodeGenModule.cpp
parent	07c229c9e7f5520953207ab8cf159674c9bd33f2 (diff)
download	llvm-b799a625f9221b017e2ab2503e291c503da6c763.zip llvm-b799a625f9221b017e2ab2503e291c503da6c763.tar.gz llvm-b799a625f9221b017e2ab2503e291c503da6c763.tar.bz2

[X86] Reduce the width of multiplification when its operands are extended from i8 or i16

For <N x i32> type mul, pmuludq will be used for targets without SSE41, which often introduces many extra pack and unpack instructions in vectorized loop body because pmuludq generates <N/2 x i64> type value. However when the operands of <N x i32> mul are extended from smaller size values like i8 and i16, the type of mul may be shrunk to use pmullw + pmulhw/pmulhuw instead of pmuludq, which generates better code. For targets with SSE41, pmulld is supported so no shrinking is needed. Differential Revision: http://reviews.llvm.org/D20931 llvm-svn: 272694

Diffstat (limited to 'clang/lib/CodeGen/CodeGenModule.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: