author | Andrea Di Biagio <Andrea_DiBiagio@sn.scee.net> | 2014-06-25 17:41:58 +0000 |
---|---|---|
committer | Andrea Di Biagio <Andrea_DiBiagio@sn.scee.net> | 2014-06-25 17:41:58 +0000 |
commit | 07cdffc324b8fd336b04e8a7f83d7c3ce1ba7815 (patch) | |
tree | d717e6b4dc4e7d246c638db18be77822343093d6 /clang/lib/Basic/VirtualFileSystem.cpp | |
parent | a826147eef05d1d4fa0f3efdd1eeb4c71ae13b9d (diff) | |
[X86] Always prefer to lower a VECTOR_SHUFFLE into a BLENDI instead of SHUFP (or VPERM2X128).
This patch teaches method 'LowerVECTOR_SHUFFLE' to give higher precedence to
the check for 'isBlendMask'. The idea is that, when possible, we should first
check whether a shuffle performs a blend and, if so, try to lower it into a
BLENDI instead of selecting a SHUFP or (worse) a VPERM2X128.
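To illustrate what "a shuffle performs a blend" means, here is a minimal standalone sketch. It is not the code of 'isBlendMask' in the X86 backend; the helper name 'blendImmediate' and its signature are hypothetical, and it assumes the usual shuffle-mask convention (indices 0..N-1 select from the first source, N..2N-1 from the second, negative means undefined). A mask is treated as a blend when every defined element i comes from lane i of one of the two sources, and the immediate gets bit i set when lane i is taken from the second source.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper for illustration only; not LLVM's isBlendMask.
// Returns true if `mask` describes a blend of two N-element vectors
// (N == mask.size()) and stores the blend immediate in `imm`.
// Assumed mask convention: 0..N-1 picks from the first source,
// N..2N-1 picks from the second source, negative means "undefined".
bool blendImmediate(const std::vector<int> &mask, std::uint8_t &imm) {
  const int n = static_cast<int>(mask.size());
  assert(n <= 8 && "an 8-bit immediate covers at most 8 elements");
  std::uint8_t bits = 0;
  for (int i = 0; i < n; ++i) {
    const int m = mask[i];
    if (m < 0 || m == i)
      continue; // undefined lane, or lane i of the first source
    if (m == i + n) {
      bits |= static_cast<std::uint8_t>(1u << i); // lane i of the second source
      continue;
    }
    return false; // the element moves to a different lane: not a blend
  }
  imm = bits;
  return true;
}
```

Under these assumptions, the 4-element mask <0, 5, 2, 7> is a blend with immediate 0b1010 (lanes 1 and 3 come from the second source), while <1, 5, 2, 7> is not a blend because element 0 is taken from lane 1.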
In general:
- AVX VBLENDPS/D always have better latency and throughput than VPERM2F128;
- BLENDPS/D instructions tend to have better 'reciprocal throughput'
than the equivalent SHUFPS/D;
- Both BLENDPS/D and SHUFPS/D are often decoded into the same number of
m-ops; however, an m-op obtained from a BLENDPS/D can be scheduled to more
than one execution port.
This patch:
- Moves the check for 'isBlendMask' immediately before the check for
'isSHUFPMask' within method 'LowerVECTOR_SHUFFLE';
- Updates existing tests for SSE/AVX shuffle/blend instructions to verify
that we select (v)blendps/d when possible (instead of (v)shufps/d or
vperm2f128).
llvm-svn: 211720
Diffstat (limited to 'clang/lib/Basic/VirtualFileSystem.cpp')
0 files changed, 0 insertions, 0 deletions