riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Krzysztof Drewniak <Krzysztof.Drewniak@amd.com>	2023-05-12 15:40:29 +0000
committer	Krzysztof Drewniak <Krzysztof.Drewniak@amd.com>	2023-09-28 14:44:16 +0000
commit	2ebd633f145615a42d7e8b1d07cbdad294c244aa (patch)
tree	c5f6d579b748b4bfcbd598ae3d04b1ad317fd75b /clang/lib/Basic/Module.cpp
parent	0eed8ae7d2b75819ffb1581e99a154097fc69a83 (diff)
download	llvm-2ebd633f145615a42d7e8b1d07cbdad294c244aa.zip llvm-2ebd633f145615a42d7e8b1d07cbdad294c244aa.tar.gz llvm-2ebd633f145615a42d7e8b1d07cbdad294c244aa.tar.bz2

[mlir][AMDGPU] Add packed 8-bit float conversion ops and lowering

Define operations that wrap the gfx940's new operations for converting between f32 and registers containing packed sets of four 8-bit floats. Define rocdl operations for the intrinsics and an AMDGPU dialect wrapper around them (to account for the fact that MLIR distinguishes the two float formats at the type level but that the LLVM IR does not). Define an ArithToAMDGPU pass, meant to run before conversion to LLVM, that replaces relevant calls to arith.extf and arith.truncf with the packed operations in the AMDGPU dialect. Note that the conversion currently only handles scalars and vectors of rank <= 1, as we do not have a usecase for multi-dimensional vector support right now. Reviewed By: jsjodin Differential Revision: https://reviews.llvm.org/D152457

Diffstat (limited to 'clang/lib/Basic/Module.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: