riscv-gnu-toolchain/llvm.git

author	Gadi Haber <gadi.haber@intel.com>	2017-03-27 12:13:37 +0000
committer	Gadi Haber <gadi.haber@intel.com>	2017-03-27 12:13:37 +0000
commit	89d5f9391a365920e753d809e28434cec180d5ec (patch)
tree	42920d2c56796332681e43b318fe32ddd8f97c86 /llvm/lib/Support/CommandLine.cpp
parent	7a130c68f0ace620bdfaa7ecf78d997e25ddac72 (diff)

[X86][AVX2] bugzilla bug 21281 Performance regression in vector interleave in AVX2

This is a patch for the ongoing Bugzilla bug 21281, which concerns the X86 code generated for a matrix transpose8x8 subroutine that requires vector interleaving. The code currently generated for AVX2 is suboptimal: it takes 60 instructions, as opposed to only 40 instructions for AVX1. The patch fixes the AVX2 case in which vector unpack instructions need fewer operations than the vector blend operations available in AVX2; in that case, using the vector unpack instructions is more efficient.

Reviewers: zvi delena igorb craig.topper guyblank eladcohen m_zuckerman aymanmus RKSimon

llvm-svn: 298840
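For readers unfamiliar with the interleave step that the commit message refers to, the following is a minimal scalar sketch of what 256-bit unpack instructions such as VUNPCKLPS and VUNPCKHPS compute: they interleave elements of two vectors independently within each 128-bit lane. It is not code from this patch. The names `Vec8`, `unpack_lo`, and `unpack_hi` are hypothetical, and the choice of 32-bit float elements is an assumption made for illustration.

```cpp
#include <array>
#include <cstddef>

// Hypothetical stand-in for a 256-bit vector of eight 32-bit floats.
using Vec8 = std::array<float, 8>;

// Scalar model of a 256-bit "unpack low": in each 128-bit lane (4 floats),
// interleave the two low elements of a with the two low elements of b.
Vec8 unpack_lo(const Vec8& a, const Vec8& b) {
    Vec8 r{};
    for (std::size_t lane = 0; lane < 8; lane += 4) {
        r[lane + 0] = a[lane + 0];
        r[lane + 1] = b[lane + 0];
        r[lane + 2] = a[lane + 1];
        r[lane + 3] = b[lane + 1];
    }
    return r;
}

// Scalar model of a 256-bit "unpack high": in each 128-bit lane,
// interleave the two high elements of a with the two high elements of b.
Vec8 unpack_hi(const Vec8& a, const Vec8& b) {
    Vec8 r{};
    for (std::size_t lane = 0; lane < 8; lane += 4) {
        r[lane + 0] = a[lane + 2];
        r[lane + 1] = b[lane + 2];
        r[lane + 2] = a[lane + 3];
        r[lane + 3] = b[lane + 3];
    }
    return r;
}
```

Applied to two adjacent rows of a matrix, each call yields pairs of elements that belong to the same column, which is why a transpose can be built from interleave steps like these.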

Diffstat (limited to 'llvm/lib/Support/CommandLine.cpp')

0 files changed, 0 insertions, 0 deletions
