aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp
diff options
context:
space:
mode:
authorNicolas Miller <nicolas.miller@codeplay.com>2022-03-01 10:34:02 -0800
committerArtem Belevich <tra@google.com>2022-03-01 11:07:11 -0800
commit510fd283fda2d7c5118ae1b451a1f2365cfc3f27 (patch)
tree662a1055737802460ea9b546c2174482399e0a27 /llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp
parenta8951823024b38c455e839d40656ad533b4aa8ff (diff)
downloadllvm-510fd283fda2d7c5118ae1b451a1f2365cfc3f27.zip
llvm-510fd283fda2d7c5118ae1b451a1f2365cfc3f27.tar.gz
llvm-510fd283fda2d7c5118ae1b451a1f2365cfc3f27.tar.bz2
[NVPTX] Add ex2.approx.f16/f16x2 support
NOTE: this is a follow-up commit with the missing clang-side changes. This patch adds builtins and intrinsics for the f16 and f16x2 variants of the ex2 instruction. These two variants were added in PTX7.0, and are supported by sm_75 and above. Note that this isn't wired with the exp2 llvm intrinsic because the ex2 instruction is only available in its approx variant. Running ptxas on the assembly generated by the test f16-ex2.ll works as expected. Differential Revision: https://reviews.llvm.org/D119157
Diffstat (limited to 'llvm/lib/Transforms/Utils/LoopUnrollRuntime.cpp')
0 files changed, 0 insertions, 0 deletions