aboutsummaryrefslogtreecommitdiff
path: root/clang/lib/Basic/Module.cpp
diff options
context:
space:
mode:
authorKrzysztof Drewniak <Krzysztof.Drewniak@amd.com>2023-05-12 15:40:29 +0000
committerKrzysztof Drewniak <Krzysztof.Drewniak@amd.com>2023-09-28 14:44:16 +0000
commit2ebd633f145615a42d7e8b1d07cbdad294c244aa (patch)
treec5f6d579b748b4bfcbd598ae3d04b1ad317fd75b /clang/lib/Basic/Module.cpp
parent0eed8ae7d2b75819ffb1581e99a154097fc69a83 (diff)
downloadllvm-2ebd633f145615a42d7e8b1d07cbdad294c244aa.zip
llvm-2ebd633f145615a42d7e8b1d07cbdad294c244aa.tar.gz
llvm-2ebd633f145615a42d7e8b1d07cbdad294c244aa.tar.bz2
[mlir][AMDGPU] Add packed 8-bit float conversion ops and lowering
Define operations that wrap the gfx940's new operations for converting between f32 and registers containing packed sets of four 8-bit floats. Define rocdl operations for the intrinsics and an AMDGPU dialect wrapper around them (to account for the fact that MLIR distinguishes the two float formats at the type level but that the LLVM IR does not). Define an ArithToAMDGPU pass, meant to run before conversion to LLVM, that replaces relevant calls to arith.extf and arith.truncf with the packed operations in the AMDGPU dialect. Note that the conversion currently only handles scalars and vectors of rank <= 1, as we do not have a usecase for multi-dimensional vector support right now. Reviewed By: jsjodin Differential Revision: https://reviews.llvm.org/D152457
Diffstat (limited to 'clang/lib/Basic/Module.cpp')
0 files changed, 0 insertions, 0 deletions