aboutsummaryrefslogtreecommitdiff
path: root/llvm/lib/Target/ARM/Disassembler/ARMDisassembler.cpp
diff options
context:
space:
mode:
authorJeffrey Byrnes <jeffrey.byrnes@amd.com>2024-11-06 20:48:20 -0800
committerGitHub <noreply@github.com>2024-11-06 20:48:20 -0800
commitae6dbed5943d76c61fe95107c15a46f915180772 (patch)
tree8d7786d045741c21d8802ee1cb71a1ac6ea5c95d /llvm/lib/Target/ARM/Disassembler/ARMDisassembler.cpp
parentf0e2301b7c3f2576a4fbc53441e9378b966e21ef (diff)
downloadllvm-ae6dbed5943d76c61fe95107c15a46f915180772.zip
llvm-ae6dbed5943d76c61fe95107c15a46f915180772.tar.gz
llvm-ae6dbed5943d76c61fe95107c15a46f915180772.tar.bz2
[AMDGPU] Use correct DWord for v_dot4 S0 operand (#115224)
Fixes a copy-paste typo. The typo resulted in producing bad v_perm based operands for the v_dot4 combine. When adding a corresponding byte pair to the v_dot byte pair chains, we must take note of the byte position in the corresponding source nodes. These byte positions are used to ensure we extract the correct DWord from the ultimate source, and formulate a correct perm_mask from the extracted DWord. With the typo, we the S0 byte would used the DWord offset for the corresponding S1 byte. If this offset was not the same as the true DWord offset for the S0 byte, we would extract and use the wrong byte for S0 in the v_dot. Fixes https://github.com/llvm/llvm-project/issues/112941
Diffstat (limited to 'llvm/lib/Target/ARM/Disassembler/ARMDisassembler.cpp')
0 files changed, 0 insertions, 0 deletions