[LAA] Prune dependencies with distance large than access implied by trip count

When we have a dependency with a dependence distance which can only be hit on an iteration beyond the actual trip count of the loop, we can ignore that dependency when analyzing said loop. We already had this code, but had restricted it solely to unknown dependence distances. This change applies it to all dependence distances. Without this code, we relied on the vectorizer reducing VF such that our infeasible dependence was respected. This usually worked out to about the same result, but not always. For fixed length vectorization, this could mean a smaller VF than optimal being chosen or additional runtime checks. For scalable vectorization - where the bounds on access implied by VF are broader - we could often not find a feasible VF at all. Differential Revision: https://reviews.llvm.org/D131924
author: Philip Reames <preames@rivosinc.com> 2022-08-25 14:16:34 -0700
committer: Philip Reames <listmail@philipreames.com> 2022-08-25 14:24:13 -0700
commit: 86b67a310dedf4d0c6a5bc012d8bee7dbac1d2ad (patch)
tree: 4c7082a5be5730781a3e7c94f9881ad74117c932 /llvm/lib/Analysis/LoopAccessAnalysis.cpp
parent: abcb3d58d999d6325c447daf1f9b439a9f4f465d (diff)
download: llvm-86b67a310dedf4d0c6a5bc012d8bee7dbac1d2ad.zip
llvm-86b67a310dedf4d0c6a5bc012d8bee7dbac1d2ad.tar.gz
llvm-86b67a310dedf4d0c6a5bc012d8bee7dbac1d2ad.tar.bz2
1 files changed, 8 insertions, 7 deletions
diff --git a/llvm/lib/Analysis/LoopAccessAnalysis.cpp b/llvm/lib/Analysis/LoopAccessAnalysis.cpp
index a95b432..180f3fe 100644
--- a/llvm/lib/Analysis/LoopAccessAnalysis.cpp
+++ b/llvm/lib/Analysis/LoopAccessAnalysis.cpp
@@ -1710,7 +1710,7 @@ void MemoryDepChecker::mergeInStatus(VectorizationSafetyStatus S) {
     Status = S;
 }
 
-/// Given a non-constant (unknown) dependence-distance \p Dist between two
+/// Given a dependence-distance \p Dist between two
 /// memory accesses, that have the same stride whose absolute value is given
 /// in \p Stride, and that have the same type size \p TypeByteSize,
 /// in a loop whose takenCount is \p BackedgeTakenCount, check if it is
@@ -1738,7 +1738,7 @@ static bool isSafeDependenceDistance(const DataLayout &DL, ScalarEvolution &SE,
   // This is equivalent to the Strong SIV Test (Practical Dependence Testing,
   // Section 4.2.1); Note, that for vectorization it is sufficient to prove
   // that the dependence distance is >= VF; This is checked elsewhere.
-  // But in some cases we can prune unknown dependence distances early, and
+  // But in some cases we can prune dependence distances early, and
   // even before selecting the VF, and without a runtime test, by comparing
   // the distance against the loop iteration count. Since the vectorized code
   // will be executed only if LoopCount >= VF, proving distance >= LoopCount
@@ -1875,13 +1875,14 @@ MemoryDepChecker::isDependent(const MemAccessInfo &A, unsigned AIdx,
   bool HasSameSize =
       DL.getTypeStoreSizeInBits(ATy) == DL.getTypeStoreSizeInBits(BTy);
   uint64_t Stride = std::abs(StrideAPtr);
+
+  if (!isa<SCEVCouldNotCompute>(Dist) && HasSameSize &&
+      isSafeDependenceDistance(DL, SE, *(PSE.getBackedgeTakenCount()), *Dist,
+                               Stride, TypeByteSize))
+    return Dependence::NoDep;
+
   const SCEVConstant *C = dyn_cast<SCEVConstant>(Dist);
   if (!C) {
-    if (!isa<SCEVCouldNotCompute>(Dist) && HasSameSize &&
-        isSafeDependenceDistance(DL, SE, *(PSE.getBackedgeTakenCount()), *Dist,
-                                 Stride, TypeByteSize))
-      return Dependence::NoDep;
-
     LLVM_DEBUG(dbgs() << "LAA: Dependence because of non-constant distance\n");
     FoundNonConstantDistanceDependence = true;
     return Dependence::Unknown;
author	Philip Reames <preames@rivosinc.com>	2022-08-25 14:16:34 -0700
committer	Philip Reames <listmail@philipreames.com>	2022-08-25 14:24:13 -0700
commit	86b67a310dedf4d0c6a5bc012d8bee7dbac1d2ad (patch)
tree	4c7082a5be5730781a3e7c94f9881ad74117c932 /llvm/lib/Analysis/LoopAccessAnalysis.cpp
parent	abcb3d58d999d6325c447daf1f9b439a9f4f465d (diff)
download	llvm-86b67a310dedf4d0c6a5bc012d8bee7dbac1d2ad.zip llvm-86b67a310dedf4d0c6a5bc012d8bee7dbac1d2ad.tar.gz llvm-86b67a310dedf4d0c6a5bc012d8bee7dbac1d2ad.tar.bz2