riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	ziereis <44057120+ziereis@users.noreply.github.com>	2025-01-15 09:12:39 +0100
committer	GitHub <noreply@github.com>	2025-01-15 08:12:39 +0000
commit	929eb500d4c9b3fff0693c49fd55c8093dc1ad62 (patch)
tree	ce08ace14ab1be19ff053fe6c86f6be8de9963fa /flang/lib/Frontend/CompilerInvocation.cpp
parent	d1d25641f4cb87ab2c07a4136ba1cec4fb6cf578 (diff)
download	llvm-929eb500d4c9b3fff0693c49fd55c8093dc1ad62.zip llvm-929eb500d4c9b3fff0693c49fd55c8093dc1ad62.tar.gz llvm-929eb500d4c9b3fff0693c49fd55c8093dc1ad62.tar.bz2

[mlir] Rewrites for I2 to I8 signed and unsigned extension (#121298)

Adds rewrites for i2 to i8 signed and unsigned extension, similar to the ones that already exist for i4 to i8 conversion. I use this for i6 quantized models, and this gives me roughly a 2x speedup for an i6 4096x4096 dequantization-matmul on an AMD 5950x. I didn't add the rewrite for i8 to i2 truncation because I currently don't use it, but if this is needed, I can add it as well. --------- Co-authored-by: Andrzej Warzyński <andrzej.warzynski@gmail.com>

Diffstat (limited to 'flang/lib/Frontend/CompilerInvocation.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: