This is an archive of the discontinued LLVM Phabricator instance.

Differential D72929

[SCEV] Swap guards estimation sequence. NFC
Closed, Public

Authored by dfukalov on Jan 17 2020, 8:28 AM.

Details

Reviewers

skatkov
sanjoy
mkazantsev

Commits

rGde34b54edce4: [SCEV] Swap guards estimation sequence. NFC

Summary

Loop unrolling spends a lot of time in SCEV processing when a function
contains hundreds of simple 'for' loops with fairly complex array indexes, such as:

for (int i = 0; i < 8; ++i) {
  for (int j = 0; j < 32; ++j) {
    C[j*8+i] = B[j*32+i+128] + A[i*64+128];
  }
}
for (int i = 0; i < 8; ++i) {
  for (int j = 0; j < 8; ++j) {
    for (int k = 0; k < 32; ++k) {
      D[k*64+i*8+j] = D[k*64+i*8+j] + E[i+16] * C[k*8+j+256];
    }
  }
}

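The patch speeds up loop unrolling in this edge case. isLoopBackedgeGuardedByCond
takes much less time than isLoopEntryGuardedByCond here, so evaluating it first
lets the && short circuit skip the slower entry check whenever the backedge
check fails.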
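The effect of the reordering can be sketched outside LLVM. The snippet below is an illustration only: GuardChecks, entryGuarded, backedgeGuarded, oldOrder and newOrder are hypothetical names, and the call counters stand in for the real cost of the two SCEV queries. It assumes both checks are free of side effects, which is what makes the swap a non-functional change.

```cpp
// Hypothetical stand-ins for the two guard queries. The names and the
// call counters are illustrative assumptions, not LLVM code.
struct GuardChecks {
  bool EntryResult;     // what the (expensive) entry check would return
  bool BackedgeResult;  // what the (cheap) backedge check would return
  int EntryCalls;       // how many times the entry check actually ran
  int BackedgeCalls;    // how many times the backedge check actually ran

  GuardChecks(bool Entry, bool Backedge)
      : EntryResult(Entry), BackedgeResult(Backedge), EntryCalls(0),
        BackedgeCalls(0) {}

  bool entryGuarded() { ++EntryCalls; return EntryResult; }
  bool backedgeGuarded() { ++BackedgeCalls; return BackedgeResult; }

  // Order before the patch: the expensive check always runs.
  bool oldOrder() { return entryGuarded() && backedgeGuarded(); }

  // Order after the patch: the cheap check runs first, and && skips the
  // expensive one whenever the cheap check returns false.
  bool newOrder() { return backedgeGuarded() && entryGuarded(); }
};
```

Both orders return the same value for every combination of results. They differ only in how often the entry check runs: with the new order it is skipped whenever the backedge check fails.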

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dfukalov created this revision. Jan 17 2020, 8:28 AM

Herald added a project: Restricted Project. Jan 17 2020, 8:28 AM

Herald added subscribers: javed.absar, hiraditya.

Harbormaster completed remote builds in B44281: Diff 238788. Jan 17 2020, 8:34 AM

fhahn added a subscriber: fhahn. Jan 17 2020, 9:28 AM

lgtm, although it would be nice to independently check if isLoopEntryGuardedByCond can be sped up.

Please also add a comment stating that the order and short circuit behavior is intentional.

This revision is now accepted and ready to land. Jan 19 2020, 1:54 PM

Closed by commit rGde34b54edce4: [SCEV] Swap guards estimation sequence. NFC (authored by dfukalov). Jan 20 2020, 5:49 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path: llvm/lib/Analysis/ScalarEvolution.cpp
Size: 8 lines

Diff 239098

llvm/lib/Analysis/ScalarEvolution.cpp

 #endif
   assert (SplitRHS.second != getCouldNotCompute() && "Unexpected CNC");
   // It is possible that init SCEV contains an invariant load but it does
   // not dominate MDL and is not available at MDL loop entry, so we should
   // check it here.
   if (!isAvailableAtLoopEntry(SplitLHS.first, MDL) ||
       !isAvailableAtLoopEntry(SplitRHS.first, MDL))
     return false;
 
-  return isLoopEntryGuardedByCond(MDL, Pred, SplitLHS.first, SplitRHS.first) &&
-         isLoopBackedgeGuardedByCond(MDL, Pred, SplitLHS.second,
-                                     SplitRHS.second);
+  // It seems backedge guard check is faster than entry one so in some cases
+  // it can speed up whole estimation by short circuit
+  return isLoopBackedgeGuardedByCond(MDL, Pred, SplitLHS.second,
+                                     SplitRHS.second) &&
+         isLoopEntryGuardedByCond(MDL, Pred, SplitLHS.first, SplitRHS.first);
 }
 
 bool ScalarEvolution::isKnownPredicate(ICmpInst::Predicate Pred,
                                        const SCEV *LHS, const SCEV *RHS) {
   // Canonicalize the inputs first.
   (void)SimplifyICmpOperands(Pred, LHS, RHS);
 
   if (isKnownViaInduction(Pred, LHS, RHS))