diff options
| author | Brox Chen <guochen2@amd.com> | 2025-09-24 11:27:26 -0400 |
|---|---|---|
| committer | GitHub <noreply@github.com> | 2025-09-24 11:27:26 -0400 |
| commit | 934f802731887f5ed183d13f1b538414c518b004 (patch) | |
| tree | ff6b766e13484ea1c8b1e17095e1de20b60fcbe0 /lldb/packages/Python/lldbsuite/test/gdbclientutils.py | |
| parent | 3be8294bf9bef8f046db89b0b590642387312d8c (diff) | |
| download | llvm-934f802731887f5ed183d13f1b538414c518b004.zip llvm-934f802731887f5ed183d13f1b538414c518b004.tar.gz llvm-934f802731887f5ed183d13f1b538414c518b004.tar.bz2 | |
[AMDGPU][True16][CodeGen] true16 isel pattern for fma_mix_f16/bf16 (#159648)
This patch includes:
1. fma_mix inst takes fp16 type as input, but place the operand in
vgpr32. Update selector to insert vgpr32 for true16 mode if necessary.
2. fma_mix inst returns fp16 type as output, but place the vdst in
vgpr32. Create a fma_mix_t16 pesudo inst for isel pattern, and lower it
to mix_lo/hi in the mc lowering pass.
These stop isel from emitting illegal `vgpr32 = COPY vgpr16` and improve
code quality
Diffstat (limited to 'lldb/packages/Python/lldbsuite/test/gdbclientutils.py')
0 files changed, 0 insertions, 0 deletions
