diff options
author | Yuta Mukai <mukai.yuta@fujitsu.com> | 2022-08-08 20:45:54 +0900 |
---|---|---|
committer | KAWASHIMA Takahiro <t-kawashima@fujitsu.com> | 2022-08-09 10:53:40 +0900 |
commit | 3f561996bf7193091bc6670a2e7804b0cb0bb936 (patch) | |
tree | c32670ae377e3d1378eadb0af47fb15f35e7b2d2 /llvm/lib/CodeGen/ModuloSchedule.cpp | |
parent | d9004dfbabc62887f09775297436792077ce4496 (diff) | |
download | llvm-3f561996bf7193091bc6670a2e7804b0cb0bb936.zip llvm-3f561996bf7193091bc6670a2e7804b0cb0bb936.tar.gz llvm-3f561996bf7193091bc6670a2e7804b0cb0bb936.tar.bz2 |
[AArch64] Fix and add A64FX scheduling resource/latency info
1. Missing instruction information (FTSSEL, FMSB, PFIRST and RDFFR)
is added and CompleteModel is set to one.
2. Information for pseudo SVE instructions is added. Those
instructions are present at the time of scheduling.
3. Resource and latency information for SVE instructions is modified
to be more accurate.
For example, the description for CMPEQ, which consumes one cycle
each of unit FLA and PPR, is as follows.
```
Previous:
def A64FXGI01 : ProcResGroup<[A64FXIPFLA, A64FXIPPR]>;
def A64FXWrite_4Cyc_GI01 : SchedWriteRes<[A64FXGI01]> {...
Modified:
def A64FXGI0 : ProcResGroup<[A64FXIPFLA]>;
def A64FXGI1 : ProcResGroup<[A64FXIPPR]>;
def A64FXWrite_CMP : SchedWriteRes<[A64FXGI0, A64FXGI1]> {...
```
Reference: A64FX Microarchitecture Manual (Table 16-3)
https://github.com/fujitsu/A64FX/blob/master/doc/A64FX_Microarchitecture_Manual_en_1.7.pdf
Reviewed By: dmgreen, kawashima-fj
Differential Revision: https://reviews.llvm.org/D131165
Diffstat (limited to 'llvm/lib/CodeGen/ModuloSchedule.cpp')
0 files changed, 0 insertions, 0 deletions