aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Support/CommandLine.cpp
diff options
context:
space:
mode:
authorGadi Haber <gadi.haber@intel.com>2017-03-27 12:13:37 +0000
committerGadi Haber <gadi.haber@intel.com>2017-03-27 12:13:37 +0000
commit89d5f9391a365920e753d809e28434cec180d5ec (patch)
tree42920d2c56796332681e43b318fe32ddd8f97c86 /llvm/lib/Support/CommandLine.cpp
parent7a130c68f0ace620bdfaa7ecf78d997e25ddac72 (diff)
downloadllvm-89d5f9391a365920e753d809e28434cec180d5ec.zip
llvm-89d5f9391a365920e753d809e28434cec180d5ec.tar.gz
llvm-89d5f9391a365920e753d809e28434cec180d5ec.tar.bz2
[X86][AVX2] bugzilla bug 21281 Performance regression in vector interleave in AVX2
This is a patch for an on-going bugzilla bug 21281 on the generated X86 code for a matrix transpose8x8 subroutine which requires vector interleaving. The generated code in AVX2 is currently non-optimal and requires 60 instructions as opposed to only 40 instructions generated for AVX1. The patch includes a fix for the AVX2 case where vector unpack instructions use less operations than the vector blend operations available in AVX2. In this case using vector unpack instructions is more efficient. Reviewers: zvi delena igorb craig.topper guyblank eladcohen m_zuckerman aymanmus RKSimon llvm-svn: 298840
Diffstat (limited to 'llvm/lib/Support/CommandLine.cpp')
0 files changed, 0 insertions, 0 deletions