diff options
author | Jeffrey Byrnes <jeffrey.byrnes@amd.com> | 2024-11-06 20:48:20 -0800 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-11-06 20:48:20 -0800 |
commit | ae6dbed5943d76c61fe95107c15a46f915180772 (patch) | |
tree | 8d7786d045741c21d8802ee1cb71a1ac6ea5c95d /llvm/lib/Target/ARM/Disassembler/ARMDisassembler.cpp | |
parent | f0e2301b7c3f2576a4fbc53441e9378b966e21ef (diff) | |
download | llvm-ae6dbed5943d76c61fe95107c15a46f915180772.zip llvm-ae6dbed5943d76c61fe95107c15a46f915180772.tar.gz llvm-ae6dbed5943d76c61fe95107c15a46f915180772.tar.bz2 |
[AMDGPU] Use correct DWord for v_dot4 S0 operand (#115224)
Fixes a copy-paste typo.
The typo resulted in producing bad v_perm based operands for the v_dot4
combine. When adding a corresponding byte pair to the v_dot byte pair
chains, we must take note of the byte position in the corresponding
source nodes. These byte positions are used to ensure we extract the
correct DWord from the ultimate source, and formulate a correct
perm_mask from the extracted DWord.
With the typo, we the S0 byte would used the DWord offset for the
corresponding S1 byte. If this offset was not the same as the true DWord
offset for the S0 byte, we would extract and use the wrong byte for S0
in the v_dot.
Fixes https://github.com/llvm/llvm-project/issues/112941
Diffstat (limited to 'llvm/lib/Target/ARM/Disassembler/ARMDisassembler.cpp')
0 files changed, 0 insertions, 0 deletions