diff options
author | Simon Tatham <simon.tatham@arm.com> | 2020-03-02 09:06:00 +0000 |
---|---|---|
committer | Simon Tatham <simon.tatham@arm.com> | 2020-03-02 10:33:30 +0000 |
commit | b08d2ddd69b4a2209930b31fe456b4d7c1ce148f (patch) | |
tree | fd0a618ff1c5e78da3f513b7a380e3a6eb62395c /llvm/lib/Bitcode/Writer/BitcodeWriter.cpp | |
parent | 69441e53c9f45c58674bbe2fd788baf9b9fe1392 (diff) | |
download | llvm-b08d2ddd69b4a2209930b31fe456b4d7c1ce148f.zip llvm-b08d2ddd69b4a2209930b31fe456b4d7c1ce148f.tar.gz llvm-b08d2ddd69b4a2209930b31fe456b4d7c1ce148f.tar.bz2 |
[ARM,MVE] Add ACLE intrinsics for VCVT.F32.F16 family.
Summary:
These instructions make a vector of `<4 x float>` by widening every
other lane of a vector of `<8 x half>`.
I wondered about representing these using standard IR, along the lines
of a shufflevector to extract elements of the input into a `<4 x half>`
followed by an `fpext` to turn that into `<4 x float>`. But it looks as
if that would take a lot of work in isel lowering to make it match any
pattern I could sensibly write in Tablegen, and also I haven't been
able to think of any other case where that pattern might be generated
in IR, so there wouldn't be any extra code generation win from doing
it that way.
Therefore, I've just used another target-specific intrinsic. We can
always change it to the other way later if anyone thinks of a good
reason.
(In order to put the intrinsic definition near similar things in
`IntrinsicsARM.td`, I've also lifted the definition of the
`MVEMXPredicated` multiclass higher up the file, without changing it.)
Reviewers: MarkMurrayARM, dmgreen, miyuki, ostannard
Reviewed By: miyuki
Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits
Tags: #clang, #llvm
Differential Revision: https://reviews.llvm.org/D75254
Diffstat (limited to 'llvm/lib/Bitcode/Writer/BitcodeWriter.cpp')
0 files changed, 0 insertions, 0 deletions