Compute estimated trip counts for multiple exit loops

This change allows us to estimate trip count from profile metadata for all multiple exit loops. We still do the estimate only from the latch, but that's fine as it causes us to overestimate the trip count at worst.

Reviewing the uses of the API, all but one are cases where we restrict a loop transformation (unroll and vectorize, respectively) when we know the trip count is short enough. So, as a result, the change makes these passes strictly less aggressive. The test change illustrates a case where we'd previously have runtime unrolled a loop which ran fewer iterations than the unroll factor. This is definitely unprofitable.

The one case where an upper bound on the estimated trip count could drive a more aggressive transform is peeling, and I duplicated the logic being removed from the generic estimation there to keep peeling's behavior the same. The resulting heuristic makes no sense and should probably be immediately removed, but we can do that in a separate change.

This was noticed when analyzing regressions on D113939.

I plan to come back and incorporate estimated trip counts from other exits, but that's a minor improvement which can follow separately.

Differential Revision: https://reviews.llvm.org/D115362
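To illustrate why a latch-only estimate can only err on the high side, here is a minimal standalone sketch. It is not LLVM's implementation: the helper name `estimateTripCountFromLatch`, its two weight parameters, and the round-to-nearest choice are assumptions made for illustration. The idea is that the estimate is roughly "backedges taken per exit through the latch, plus one".

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Hypothetical helper (not an LLVM API): estimate a loop's trip count from
// the two profile weights on its latch branch.
//   BackedgeWeight: how often the latch branched back to the header.
//   ExitWeight:     how often the latch branched out of the loop.
std::optional<uint64_t> estimateTripCountFromLatch(uint64_t BackedgeWeight,
                                                   uint64_t ExitWeight) {
  // With no recorded exit through the latch there is nothing to divide by.
  if (ExitWeight == 0)
    return std::nullopt;

  // Round-to-nearest division, written to avoid overflow in the numerator.
  uint64_t BackedgeTaken = BackedgeWeight / ExitWeight;
  uint64_t Remainder = BackedgeWeight % ExitWeight;
  if (Remainder >= ExitWeight - Remainder)
    ++BackedgeTaken;

  // The loop body runs once more than the backedge is taken.
  return BackedgeTaken + 1;
}
```

With made-up numbers: if a loop is entered 100 times, takes the backedge 300 times in total, and leaves through the latch only 10 times (the other 90 exits go through a different block), the sketch yields 300 / 10 + 1 = 31, while the true average is about 300 / 100 + 1 = 4. Exits that bypass the latch shrink the divisor, so the estimate can only grow.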
author: Philip Reames <listmail@philipreames.com> 2021-12-09 09:40:03 -0800
committer: Philip Reames <listmail@philipreames.com> 2021-12-09 09:53:49 -0800
commit: 2d31b02517c0eacfa7edf62822cb7265e804e89c (patch)
tree: 9b66af455fd956caaeb731b90799d28ef4e3043b /llvm/lib/Transforms/Utils/LoopPeel.cpp
parent: 06ca0a273308112c660dddf14c92fe14957bd468 (diff)
download: llvm-2d31b02517c0eacfa7edf62822cb7265e804e89c.zip
llvm-2d31b02517c0eacfa7edf62822cb7265e804e89c.tar.gz
llvm-2d31b02517c0eacfa7edf62822cb7265e804e89c.tar.bz2
1 files changed, 27 insertions, 0 deletions
diff --git a/llvm/lib/Transforms/Utils/LoopPeel.cpp b/llvm/lib/Transforms/Utils/LoopPeel.cpp
index f3cf42b..5ce392d 100644
--- a/llvm/lib/Transforms/Utils/LoopPeel.cpp
+++ b/llvm/lib/Transforms/Utils/LoopPeel.cpp
@@ -333,6 +333,31 @@ static unsigned countToEliminateCompares(Loop &L, unsigned MaxPeelCount,
   return DesiredPeelCount;
 }
 
+/// This "heuristic" exactly matches implicit behavior which used to exist
+/// inside getLoopEstimatedTripCount.  It was added here to keep an
+/// improvement inside that API from causing peeling to become more aggressive.
+/// This should probably be removed.
+static bool violatesLegacyMultiExitLoopCheck(Loop *L) {
+  BasicBlock *Latch = L->getLoopLatch();
+  if (!Latch)
+    return true;
+
+  BranchInst *LatchBR = dyn_cast<BranchInst>(Latch->getTerminator());
+  if (!LatchBR || LatchBR->getNumSuccessors() != 2 || !L->isLoopExiting(Latch))
+    return true;
+
+  assert((LatchBR->getSuccessor(0) == L->getHeader() ||
+          LatchBR->getSuccessor(1) == L->getHeader()) &&
+         "At least one edge out of the latch must go to the header");
+
+  SmallVector<BasicBlock *, 4> ExitBlocks;
+  L->getUniqueNonLatchExitBlocks(ExitBlocks);
+  return any_of(ExitBlocks, [](const BasicBlock *EB) {
+      return !EB->getTerminatingDeoptimizeCall();
+    });
+}
+
+
 // Return the number of iterations we want to peel off.
 void llvm::computePeelCount(Loop *L, unsigned LoopSize,
                             TargetTransformInfo::PeelingPreferences &PP,
@@ -436,6 +461,8 @@ void llvm::computePeelCount(Loop *L, unsigned LoopSize,
   // We only do this in the presence of profile information, since otherwise
   // our estimates of the trip count are not reliable enough.
   if (L->getHeader()->getParent()->hasProfileData()) {
+    if (violatesLegacyMultiExitLoopCheck(L))
+      return;
     Optional<unsigned> PeelCount = getLoopEstimatedTripCount(L);
     if (!PeelCount)
       return;
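The shape of the legacy check added in this patch can be sketched outside LLVM as follows. This is a toy model under stated assumptions: `ToyLoop` and its fields are hypothetical stand-ins for the queries the patch makes on `Loop`, `BranchInst`, and `getTerminatingDeoptimizeCall`, and nothing here touches real IR.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical, simplified view of a loop (not an LLVM type).
struct ToyLoop {
  bool HasLatch;                   // the loop has a single latch block
  bool LatchIsTwoWayExitingBranch; // latch ends in a 2-successor branch
                                   // and one successor leaves the loop
  // One entry per unique non-latch exit block: true if that block ends
  // in a deoptimize call.
  std::vector<bool> NonLatchExitDeoptimizes;
};

// Returns true when the loop fails the legacy condition, i.e. when
// profile-driven peeling should bail out as it did before the patch.
bool violatesLegacyCheck(const ToyLoop &L) {
  if (!L.HasLatch || !L.LatchIsTwoWayExitingBranch)
    return true;
  // Any non-latch exit that does not deoptimize counts as a violation.
  return std::any_of(L.NonLatchExitDeoptimizes.begin(),
                     L.NonLatchExitDeoptimizes.end(),
                     [](bool Deoptimizes) { return !Deoptimizes; });
}
```

In this model a single-exit loop (an empty exit list) passes, a loop whose extra exits all deoptimize passes, and a loop with any ordinary extra exit is rejected, which mirrors the early `return` in `computePeelCount`.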