rocket-tools/riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Pravin Jagtap <Pravin.Jagtap@amd.com>	2023-07-24 00:05:42 -0400
committer	Pravin Jagtap <Pravin.Jagtap@amd.com>	2023-07-24 00:06:00 -0400
commit	c48ed93cf8c9db04ce72acf669dce396cc68672a (patch)
tree	dae58b91c2390c2f77b19c0221e57bd6db93ba71 /mlir/lib/Bindings/Python/IRModule.h
parent	49f3435286418c99dd4080fa427809ad72892127 (diff)
download	llvm-c48ed93cf8c9db04ce72acf669dce396cc68672a.zip llvm-c48ed93cf8c9db04ce72acf669dce396cc68672a.tar.gz llvm-c48ed93cf8c9db04ce72acf669dce396cc68672a.tar.bz2

[AMDGPU] Add llvm.amdgcn.wave.reduce.umin/umax Intrinsic.

When input to intrinsic is uniform value, reduced value is same as input whereas if input value is divergent we need to iterate over all active lanes of WaveFront to perform the reduction. The control flow for a `loop` has been set up, which iterates over `only` active lanes to perform reduction. Introduced WAVE_REDUCE_UMIN_PSEUDO_U32 and WAVE_REDUCE_UMAX_PSEUDO_U32 Pseudos which are lowered Post-ISel (in `EmitInstrWithCustomInserter `). Reviewed By: arsenm, #amdgpu Differential Revision: https://reviews.llvm.org/D154858

Diffstat (limited to 'mlir/lib/Bindings/Python/IRModule.h')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: