aboutsummaryrefslogtreecommitdiff
path: root/flang/lib/Frontend/CompilerInvocation.cpp
diff options
context:
space:
mode:
authorDhruv Chawla <dhruvc@nvidia.com>2024-05-16 08:08:06 +0530
committerGitHub <noreply@github.com>2024-05-16 08:08:06 +0530
commit1dd0d3cf40f21b842dbee107b3d203db9fbaa4ae (patch)
treeb560af5931929c9744c74afa8a2c03717643ffac /flang/lib/Frontend/CompilerInvocation.cpp
parent3a4c1b9b4428b08d4475decf74c11e0d328c5842 (diff)
downloadllvm-1dd0d3cf40f21b842dbee107b3d203db9fbaa4ae.zip
llvm-1dd0d3cf40f21b842dbee107b3d203db9fbaa4ae.tar.gz
llvm-1dd0d3cf40f21b842dbee107b3d203db9fbaa4ae.tar.bz2
[AArch64][GISel] Fold COPY(y:gpr, DUP(x:fpr, i)) -> UMOV(y:gpr, x:fpr, i) (#89017)
This patch adds a peephole to AArch64PostSelectOptimize for codegen that is caused by RegBankSelect limiting G_EXTRACT_VECTOR_ELT only to FPR registers in both the input and output registers. This can cause a generation of COPY from FPR to GPR when, for example, the output register of the G_EXTRACT_VECTOR_ELT is used in a branch condition. This was noticed when looking at codegen differences between SDAG and GI for the s1279 kernel in the TSVC benchmark.
Diffstat (limited to 'flang/lib/Frontend/CompilerInvocation.cpp')
0 files changed, 0 insertions, 0 deletions