aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Target/RISCV/Disassembler/RISCVDisassembler.cpp
diff options
context:
space:
mode:
authorpeterbell10 <peterbell10@openai.com>2025-03-29 01:31:33 +0000
committerGitHub <noreply@github.com>2025-03-29 01:31:33 +0000
commit55430f8673b9f8b27779faad3fe9c59954d539df (patch)
tree25fdcf996c86e5b4483a95d80d2742c55a9eebc2 /llvm/lib/Target/RISCV/Disassembler/RISCVDisassembler.cpp
parent22a11be8ab6f31cd3621f37d675e44988218f314 (diff)
downloadllvm-55430f8673b9f8b27779faad3fe9c59954d539df.zip
llvm-55430f8673b9f8b27779faad3fe9c59954d539df.tar.gz
llvm-55430f8673b9f8b27779faad3fe9c59954d539df.tar.bz2
[NVPTX] Customize getScalarizationOverhead (#128077)
We've observed that the SLPVectorizer is too conservative on NVPTX because it over-estimates the cost to build a vector. PTX has a single `mov` instruction that can build e.g. `<2 x half>` vectors from scalars, however the SLPVectorizer over-estimates it as the cost of 2 insert elements. To fix this I customize `getScalarizationOverhead` to lower the cost for building 2x16 types.
Diffstat (limited to 'llvm/lib/Target/RISCV/Disassembler/RISCVDisassembler.cpp')
0 files changed, 0 insertions, 0 deletions