diff options
author | Roman Lebedev <lebedev.ri@gmail.com> | 2021-11-14 19:53:01 +0300 |
---|---|---|
committer | Roman Lebedev <lebedev.ri@gmail.com> | 2021-11-14 20:01:38 +0300 |
commit | 4dd2f0446cf5de76c16a2663049a874f5c90ac92 (patch) | |
tree | 7725b41980d3e72a0b7be2daa5bb33c3de3830af /llvm/docs/tutorial | |
parent | e876698a5dc48697a077ae51455fb3520ed7410d (diff) | |
download | llvm-4dd2f0446cf5de76c16a2663049a874f5c90ac92.zip llvm-4dd2f0446cf5de76c16a2663049a874f5c90ac92.tar.gz llvm-4dd2f0446cf5de76c16a2663049a874f5c90ac92.tar.bz2 |
[X86][Costmodel] `getReplicationShuffleCost()`: promote 16 bit-wide elements to 32 bit when no AVX512BW
The basic idea is simple, if we don't have native shuffle for this element type,
then we must have native shuffle for wider element type,
so promote, replicate, demote.
I believe, asking `getCastInstrCost(Instruction::Trunc` is correct semantically,
case in point `trunc <32 x i32> to <32 x i8>` aka 2 * ZMM will naively result in
2 * XMM, that then will be packed into 1 * YMM,
and it should count the cost of said packing,
not just the truncations.
Reviewed By: RKSimon
Differential Revision: https://reviews.llvm.org/D113609
Diffstat (limited to 'llvm/docs/tutorial')
0 files changed, 0 insertions, 0 deletions