author: stanley-nod <stanley@nod-labs.com> 2022-11-09 11:26:20 -0800
committer: stanley-nod <stanley@nod-labs.com> 2022-11-09 11:52:17 -0800
commit: d2061530dc093daca93fbb268611e1a146e722de
tree: d914423090828a5c75ebd271a70b415f2ec8c513 /flang/lib/Frontend/CompilerInvocation.cpp
parent: dc9846ce988b9ddfcbc42cd462d5d94b634b3161
[mlir][vector] Modify constraint and interface for warp reduce on f16 and i8
Quantization is crucial and ubiquitous in accelerating machine learning workloads, and most quantization methods use f16 and i8 types. This patch relaxes the type constraints on warp reduce distribution to allow these types.

This patch also changes the interface and moves the initial reduction of data to a single thread into the distributedReductionFn. This gives developers the flexibility to control how the initial lane value is obtained, which might differ based on the input types (e.g. to shuffle a 32-bit-wide type, we need to reduce f16 to 2xf16 rather than to a single element).

Reviewed By: ThomasRaoux

Differential Revision: https://reviews.llvm.org/D137691
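The 2xf16 case can be illustrated with a small simulation. This is only a sketch in plain Python, not the MLIR implementation: `lane_partial`, `butterfly_reduce`, `warp_reduce`, and `WARP_SIZE` are hypothetical names, integers stand in for f16 values, and a xor-butterfly exchange is assumed as the shuffle pattern. Each lane first reduces its local values to a two-element partial (the stand-in for a 32-bit 2xf16 value), the partials are exchanged across lanes, and only at the end are the two halves combined.

```python
import operator
from functools import reduce

WARP_SIZE = 32  # assumed number of lanes; must be a power of two here


def lane_partial(values, combine):
    """Reduce one lane's values to a 2-element partial (stand-in for 2xf16).

    Assumes the lane holds at least two values.
    """
    even = reduce(combine, values[0::2])
    odd = reduce(combine, values[1::2])
    return (even, odd)


def butterfly_reduce(partials, combine):
    """Simulate a xor-shuffle reduction over lanes holding 2-element partials."""
    lanes = list(partials)
    offset = 1
    while offset < len(lanes):
        # Each lane combines its partial with the one held by lane (i ^ offset).
        lanes = [
            (combine(lanes[i][0], lanes[i ^ offset][0]),
             combine(lanes[i][1], lanes[i ^ offset][1]))
            for i in range(len(lanes))
        ]
        offset *= 2
    return lanes


def warp_reduce(per_lane_values, combine=operator.add):
    """Per-lane initial reduction, cross-lane shuffle, then final combine."""
    partials = [lane_partial(v, combine) for v in per_lane_values]
    lanes = butterfly_reduce(partials, combine)
    return [combine(a, b) for a, b in lanes]


# Every lane holds four integers; after the reduction each lane has the total.
data = [[lane * 4 + k for k in range(4)] for lane in range(WARP_SIZE)]
result = warp_reduce(data)
```

Keeping the per-lane step separate from the shuffle mirrors the point of the interface change: the caller decides what shape the initial lane value takes before any cross-lane exchange happens.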