riscv-gnu-toolchain/llvm.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Jeffrey Byrnes <jeffrey.byrnes@amd.com>	2024-11-06 20:48:20 -0800
committer	GitHub <noreply@github.com>	2024-11-06 20:48:20 -0800
commit	ae6dbed5943d76c61fe95107c15a46f915180772 (patch)
tree	8d7786d045741c21d8802ee1cb71a1ac6ea5c95d /llvm/lib/Target/ARM/Disassembler/ARMDisassembler.cpp
parent	f0e2301b7c3f2576a4fbc53441e9378b966e21ef (diff)
download	llvm-ae6dbed5943d76c61fe95107c15a46f915180772.zip llvm-ae6dbed5943d76c61fe95107c15a46f915180772.tar.gz llvm-ae6dbed5943d76c61fe95107c15a46f915180772.tar.bz2

[AMDGPU] Use correct DWord for v_dot4 S0 operand (#115224)

Fixes a copy-paste typo. The typo resulted in producing bad v_perm based operands for the v_dot4 combine. When adding a corresponding byte pair to the v_dot byte pair chains, we must take note of the byte position in the corresponding source nodes. These byte positions are used to ensure we extract the correct DWord from the ultimate source, and formulate a correct perm_mask from the extracted DWord. With the typo, we the S0 byte would used the DWord offset for the corresponding S1 byte. If this offset was not the same as the true DWord offset for the S0 byte, we would extract and use the wrong byte for S0 in the v_dot. Fixes https://github.com/llvm/llvm-project/issues/112941

Diffstat (limited to 'llvm/lib/Target/ARM/Disassembler/ARMDisassembler.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: