author | ziereis <44057120+ziereis@users.noreply.github.com> | 2025-01-15 09:12:39 +0100 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-01-15 08:12:39 +0000 |
commit | 929eb500d4c9b3fff0693c49fd55c8093dc1ad62 (patch) | |
tree | ce08ace14ab1be19ff053fe6c86f6be8de9963fa /flang/lib/Frontend/CompilerInvocation.cpp | |
parent | d1d25641f4cb87ab2c07a4136ba1cec4fb6cf578 (diff) | |
[mlir] Rewrites for I2 to I8 signed and unsigned extension (#121298)
Adds rewrites for i2 to i8 signed and unsigned extension, similar to the
ones that already exist for i4 to i8 conversion.
I use this for i6 quantized models, and this gives me roughly a 2x
speedup for an i6 4096x4096 dequantization-matmul on an AMD 5950x.
I didn't add the rewrite for i8 to i2 truncation because I currently
don't use it, but if this is needed, I can add it as well.
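To illustrate what an i2 to i8 signed or unsigned extension computes, here is a minimal scalar sketch in Python. It is not the code from this patch: the actual rewrites operate on MLIR vectors, and the helper name `unpack_i2` and the lowest-bits-first element order are assumptions made only for this example.

```python
def unpack_i2(byte: int, signed: bool) -> list[int]:
    """Unpack four 2-bit fields from one byte into i8-range integers.

    Assumption for illustration: element k occupies bits 2k and 2k+1.
    """
    out = []
    for k in range(4):
        field = (byte >> (2 * k)) & 0b11  # isolate the 2-bit field
        if signed and field & 0b10:
            field -= 4  # sign-extend: 0b10 -> -2, 0b11 -> -1
        out.append(field)
    return out
```

With `signed=False` each field is zero-extended into the range 0..3; with `signed=True` it is sign-extended into the range -2..1.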
---------
Co-authored-by: Andrzej Warzyński <andrzej.warzynski@gmail.com>
Diffstat (limited to 'flang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions