path: root/flang/lib/Frontend/CompilerInvocation.cpp
author: ziereis <44057120+ziereis@users.noreply.github.com> 2025-01-15 09:12:39 +0100
committer: GitHub <noreply@github.com> 2025-01-15 08:12:39 +0000
commit: 929eb500d4c9b3fff0693c49fd55c8093dc1ad62
tree: ce08ace14ab1be19ff053fe6c86f6be8de9963fa /flang/lib/Frontend/CompilerInvocation.cpp
parent: d1d25641f4cb87ab2c07a4136ba1cec4fb6cf578
[mlir] Rewrites for I2 to I8 signed and unsigned extension (#121298)
Adds rewrites for i2 to i8 signed and unsigned extension, similar to the ones that already exist for i4 to i8 conversion. I use this for i6 quantized models, and this gives me roughly a 2x speedup for an i6 4096x4096 dequantization-matmul on an AMD 5950x. I didn't add the rewrite for i8 to i2 truncation because I currently don't use it, but if this is needed, I can add it as well.

---------

Co-authored-by: Andrzej Warzyński <andrzej.warzynski@gmail.com>
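
For readers unfamiliar with sub-byte extension, the arithmetic behind an i2 to i8 extension can be sketched in scalar C++. This is only an illustration of the semantics, not the MLIR rewrite pattern from this commit. The function names `extSI2ToI8` and `extUI2ToI8` are hypothetical, and the sketch assumes four i2 elements are packed into one byte with element 0 in the lowest two bits.

```cpp
#include <array>
#include <cstdint>

// Hypothetical helper: zero-extend the four 2-bit fields of one byte.
// Assumes element k occupies bits [2k, 2k+1] of the packed byte.
std::array<std::uint8_t, 4> extUI2ToI8(std::uint8_t packed) {
  std::array<std::uint8_t, 4> out{};
  for (int k = 0; k < 4; ++k) {
    // Shift field k down to the low bits and mask off everything else.
    out[k] = static_cast<std::uint8_t>((packed >> (2 * k)) & 0x3);
  }
  return out;
}

// Hypothetical helper: sign-extend the four 2-bit fields of one byte.
// Same packing assumption as above.
std::array<std::int8_t, 4> extSI2ToI8(std::uint8_t packed) {
  std::array<std::int8_t, 4> out{};
  for (int k = 0; k < 4; ++k) {
    int field = (packed >> (2 * k)) & 0x3;
    // Portable sign extension of a 2-bit value: flip the sign bit (bit 1),
    // then subtract its weight. Maps 0,1,2,3 to 0,1,-2,-1.
    out[k] = static_cast<std::int8_t>((field ^ 0x2) - 0x2);
  }
  return out;
}
```

For example, the byte `0xE4` (bit pattern `11 10 01 00`) holds the fields 0, 1, 2, 3 from lowest to highest, so the unsigned extension yields 0, 1, 2, 3 and the signed extension yields 0, 1, -2, -1. A vectorized rewrite would process many such bytes at once; the loop here is kept scalar for clarity.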
Diffstat (limited to 'flang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions