
Details

Reviewers

reames
nikic
mkazantsev
efriedma

Commits

rG5dad4c67882a: [SCEV] Iteratively compute ranges for deeply nested expressions.

Summary

At the moment, getRangeRef may overflow the stack for very deeply nested
expressions.

This patch introduces a new getRangeRefIter function. It first builds a
worklist of N-ary expressions and phi nodes, iteratively adding their
operands, and then computes ranges for the worklist entries in reverse
order so that operands are usually evaluated before their users.

getRangeRef has been extended to take a Depth argument; it switches to
getRangeRefIter once the depth exceeds a threshold
(scev-range-iter-threshold, 32 by default).

This ensures compile-time is not impacted in general. Note that
the iterative algorithm may lead to a slightly different evaluation
order, which could result in slightly worse ranges for cyclic phis.

https://llvm-compile-time-tracker.com/compare.php?from=23c3eb7cdf3478c9db86f6cb5115821a8f0f5f40&to=e0e09fa338e77e53242bfc846e1484350ad79773&stat=instructions

Fixes #49579.
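
A minimal sketch of the scheme described in the summary, using toy types rather than LLVM's. `Expr`, `Range`, `RangeAnalysis` and `makeChain` are hypothetical names, ranges are plain integer intervals, and only addition is modelled. It shows recursion with a depth counter that hands over to a worklist once the depth exceeds a threshold; it is not the code from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <deque>
#include <unordered_map>
#include <unordered_set>
#include <utility>
#include <vector>

// Toy stand-ins for SCEV expressions and ConstantRange (not LLVM types).
struct Expr {
  std::vector<const Expr *> Ops; // empty for leaves
  int64_t Lo = 0, Hi = 0;        // leaf interval [Lo, Hi]
};
using Range = std::pair<int64_t, int64_t>;

struct RangeAnalysis {
  unsigned Threshold;        // plays the role of scev-range-iter-threshold
  unsigned MaxDepthSeen = 0; // deepest recursive call that did real work
  std::unordered_map<const Expr *, Range> Cache;

  explicit RangeAnalysis(unsigned T) : Threshold(T) {}

  // Recursive entry point: sums operand intervals, tracking the depth.
  Range getRange(const Expr *E, unsigned Depth = 0) {
    auto It = Cache.find(E);
    if (It != Cache.end())
      return It->second;
    if (E->Ops.empty())
      return Cache[E] = Range(E->Lo, E->Hi);
    if (Depth > Threshold) // too deep: stop recursing, use the worklist
      return getRangeIter(E);
    if (Depth > MaxDepthSeen)
      MaxDepthSeen = Depth;
    Range R(0, 0);
    for (const Expr *Op : E->Ops) {
      Range O = getRange(Op, Depth + 1);
      R.first += O.first;
      R.second += O.second;
    }
    return Cache[E] = R;
  }

  // Worklist variant: queue uncached non-leaf expressions breadth-first,
  // then evaluate in reverse so operands are usually cached before users.
  Range getRangeIter(const Expr *E) {
    assert(!E->Ops.empty() && "leaves are handled by getRange");
    std::vector<const Expr *> WorkList;
    std::unordered_set<const Expr *> Seen;
    auto Add = [&](const Expr *X) {
      if (Seen.insert(X).second && !Cache.count(X) && !X->Ops.empty())
        WorkList.push_back(X);
    };
    Add(E);
    for (std::size_t I = 0; I != WorkList.size(); ++I)
      for (const Expr *Op : WorkList[I]->Ops)
        Add(Op);
    for (std::size_t I = WorkList.size(); I-- > 1;)
      getRange(WorkList[I]); // restarts at depth 0
    return getRange(E);
  }
};

// Builds ((leaf + leaf) + leaf) + ... with N additions; the leaf is [1, 2].
const Expr *makeChain(std::deque<Expr> &Arena, unsigned N) {
  Arena.emplace_back();
  Arena.back().Lo = 1;
  Arena.back().Hi = 2;
  const Expr *Leaf = &Arena.back();
  const Expr *Cur = Leaf;
  for (unsigned I = 0; I != N; ++I) {
    Arena.emplace_back();
    Arena.back().Ops = {Cur, Leaf};
    Cur = &Arena.back();
  }
  return Cur;
}
```

In this toy model, calling `RangeAnalysis(32).getRange(makeChain(Arena, 100000))` recurses only until the depth exceeds 32 and then evaluates the rest of the chain from the worklist, so the native stack depth stays bounded regardless of how long the chain is.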

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

fhahn created this revision. Jul 28 2022, 1:01 PM

Herald added a project: Restricted Project. Jul 28 2022, 1:01 PM

Herald added a subscriber: hiraditya.

fhahn requested review of this revision. Jul 28 2022, 1:01 PM

hiraditya added inline comments. Jul 28 2022, 2:21 PM

llvm/lib/Analysis/ScalarEvolution.cpp
6477	Can the traversal order be preserved if we populated the Worklist with DFS traversal for operands starting with `S`?

Harbormaster completed remote builds in B178136: Diff 448423. Jul 28 2022, 2:31 PM

Rebased and added logic to avoid infinite cycles when evaluating phis, using PendingPhiRangesIter.

llvm/lib/Analysis/ScalarEvolution.cpp

6477

I had a look at some of the differences and the main issue is that we now may evaluate ranges earlier than previously.

One example is that for AddRecs, getRangeRef may not evaluate the range of the start value, e.g. because it may not be necessary if the addrec overflows. With the iterative approach, we will evaluate the range of the start value before evaluating the addrec. I don't think there is anything we could do about it, except adding expression-dependent logic for when not to evaluate an operand.

I think this is undesirable and should also not be necessary in practice. The differences caused by this are very minor (in some cases it even leads to small improvements) and won't materialize except in very deep expressions. I am also open to increasing the threshold at which the iterative logic triggers. SCEV should just not overflow the stack for valid IR inputs.

Here's where the ranges would be a bit tighter if we unconditionally use the iterative approach:

diff --git a/llvm/test/Analysis/ScalarEvolution/pr49856.ll b/llvm/test/Analysis/ScalarEvolution/pr49856.ll
index 751677f1f9f8..661fd5482ad5 100644
--- a/llvm/test/Analysis/ScalarEvolution/pr49856.ll
+++ b/llvm/test/Analysis/ScalarEvolution/pr49856.ll
@@ -5,7 +5,7 @@ define void @test() {
 ; CHECK-LABEL: 'test'
 ; CHECK-NEXT:  Classifying expressions for: @test
 ; CHECK-NEXT:    %tmp = phi i32 [ 2, %bb ], [ %tmp2, %bb3 ]
-; CHECK-NEXT:    --> %tmp U: [1,-2147483648) S: [0,-2147483648)
+; CHECK-NEXT:    --> %tmp U: [1,-2147483648) S: [1,-2147483648)
 ; CHECK-NEXT:    %tmp2 = add nuw nsw i32 %tmp, 1
 ; CHECK-NEXT:    --> (1 + %tmp)<nuw> U: [1,-2147483647) S: [1,-2147483647)
 ; CHECK-NEXT:  Determining loop execution counts for: @test
diff --git a/llvm/test/Analysis/ScalarEvolution/shift-recurrences.ll b/llvm/test/Analysis/ScalarEvolution/shift-recurrences.ll
index bc0f62e827ea..e5199e027ab3 100644
--- a/llvm/test/Analysis/ScalarEvolution/shift-recurrences.ll
+++ b/llvm/test/Analysis/ScalarEvolution/shift-recurrences.ll
@@ -446,7 +446,7 @@ define void @nonloop_recurrence() {
 ; CHECK-LABEL: 'nonloop_recurrence'
 ; CHECK-NEXT:  Classifying expressions for: @nonloop_recurrence
 ; CHECK-NEXT:    %tmp = phi i32 [ 2, %bb ], [ %tmp2, %bb3 ]
-; CHECK-NEXT:    --> %tmp U: [1,-2147483648) S: [0,-2147483648)
+; CHECK-NEXT:    --> %tmp U: [1,-2147483648) S: [1,-2147483648)
 ; CHECK-NEXT:    %tmp2 = add nuw nsw i32 %tmp, 1
 ; CHECK-NEXT:    --> (1 + %tmp)<nuw> U: [1,-2147483647) S: [1,-2147483647)
 ; CHECK-NEXT:  Determining loop execution counts for: @nonloop_recurrence
@@ -470,7 +470,7 @@ define void @nonloop_recurrence_2() {
 ; CHECK-LABEL: 'nonloop_recurrence_2'
 ; CHECK-NEXT:  Classifying expressions for: @nonloop_recurrence_2
 ; CHECK-NEXT:    %tmp = phi i32 [ 2, %loop ], [ %tmp2, %bb3 ]
-; CHECK-NEXT:    --> %tmp U: [1,-2147483648) S: [0,-2147483648) Exits: <<Unknown>> LoopDispositions: { %loop: Variant }
+; CHECK-NEXT:    --> %tmp U: [1,-2147483648) S: [1,-2147483648) Exits: <<Unknown>> LoopDispositions: { %loop: Variant }
 ; CHECK-NEXT:    %tmp2 = add nuw nsw i32 %tmp, 1
 ; CHECK-NEXT:    --> (1 + %tmp)<nuw> U: [1,-2147483647) S: [1,-2147483647) Exits: <<Unknown>> LoopDispositions: { %loop: Variant }
 ; CHECK-NEXT:  Determining loop execution counts for: @nonloop_recurrence_2

The only regression AFAICT would be in llvm/test/Analysis/ScalarEvolution/addrec-computed-during-addrec-calculation.ll where we fail to hoist the sext out of (sext i32 {%iv,+,1}<nsw><%loop2> to i64)
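
To illustrate the ordering point made earlier in this comment (demand-driven recursion can skip an operand that a worklist evaluates up front), here is a toy sketch. `Expr`, `IsAddRec`, `NoWrap`, `visitLazy` and `visitEager` are hypothetical names, and the rule for when the start operand is skipped is an assumption made for the example, not SCEV's actual logic.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_set>
#include <vector>

// Toy expression (not an LLVM type). An AddRec-like node has
// Ops = {Start, Step}.
struct Expr {
  std::vector<const Expr *> Ops;
  bool IsAddRec = false;
  bool NoWrap = false; // toy rule: only no-wrap AddRecs consult Start
};

using Visited = std::unordered_set<const Expr *>;

// Recursive, demand-driven evaluation: an operand is visited only if the
// parent actually needs it.
void visitLazy(const Expr *E, Visited &V) {
  if (!V.insert(E).second)
    return;
  for (std::size_t I = 0; I != E->Ops.size(); ++I) {
    if (E->IsAddRec && !E->NoWrap && I == 0)
      continue; // start value not needed in this toy model
    visitLazy(E->Ops[I], V);
  }
}

// Worklist-based evaluation: every operand is queued before any parent is
// looked at, so nothing is skipped.
void visitEager(const Expr *E, Visited &V) {
  std::vector<const Expr *> WorkList{E};
  Visited Seen{E};
  for (std::size_t I = 0; I != WorkList.size(); ++I)
    for (const Expr *Op : WorkList[I]->Ops)
      if (Seen.insert(Op).second)
        WorkList.push_back(Op);
  V.insert(WorkList.begin(), WorkList.end());
}
```

For a wrapping AddRec-like node, the lazy walk never touches the start operand while the eager walk does. In a real analysis with caching and cyclic phis, touching an expression earlier can change which range ends up cached for it.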

Harbormaster completed remote builds in B178553: Diff 448994. Aug 1 2022, 5:48 AM

High-level concern: We have lots of code that is working with SCEVs recursively. If we fix getRangeRef(), are we just shifting the problem over to GetMinTrailingZeros(), or getSCEVAtScope(), or ...?

I wonder whether it would make sense to prevent the creation of very deeply nested SCEVs instead. We already have various limits, and the one that is most relevant is probably the huge expression limit, but that one limits the total number of (recursive) children, not the nesting.

In D130728#3691284, @nikic wrote:

High-level concern: We have lots of code that is working with SCEVs recursively. If we fix getRangeRef(), are we just shifting the problem over to GetMinTrailingZeros(), or getSCEVAtScope(), or ...?

That's true. I think we have another known crash due to recursion in getSCEVAtScope, and GetMinTrailingZeros also looks like a likely candidate for issues. As far as I know, there are no reported crashes caused by stack overflows other than the ones in getRangeRef and getSCEVAtScope.

The getRangeRef situation is particularly unfortunate, because not only are we traversing operands of a single SCEV expression tree, but we also look through SCEVUnknown phis.

I wonder whether it would make sense to prevent the creation of very deeply nested SCEVs instead. We already have various limits, and the one that is most relevant is probably the huge expression limit, but that one limits the total number of (recursive) children, not the nesting.

A limit on the nesting may help in some (most?) cases, but one potential drawback would be that SCEVs may depend on the order in which they have been constructed (e.g. if we start constructing from the bottom of the tree we may miss folds for expressions higher up in the tree that may reduce overall height; if we construct from the top first we could end up with a different expression). It probably won't matter in too many cases in practice, but at least conceptually avoiding hard limits seems desirable if possible IMO. Updating most or all places to work iteratively if needed is probably going to take longer and be more work, but so far the changes have been fairly targeted/isolated I think.

As mentioned earlier, getRangeRef looks through SCEVUnknown phis so it may traverse multiple SCEV expressions, so a limit on the size of a single expression may not help in all cases there.

Switching everything in SCEV to use iterative algorithms to avoid stack recursion depth issues seems like the better long term approach to me. I really dislike the notion of one more limit here.
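
One general way to convert a recursive walk into an iterative one, while keeping the depth-first, operands-before-users order of the recursion, is an explicit stack of frames. The sketch below uses a toy `Node` type and hypothetical `evalRecursive` / `evalIterative` helpers; it assumes an acyclic operand graph and is not taken from SCEV.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <unordered_map>
#include <utility>
#include <vector>

// Toy expression node (hypothetical, not an LLVM type).
// The value of a node is Value plus the values of all its operands.
struct Node {
  std::vector<const Node *> Ops;
  int64_t Value = 0;
};

// Reference implementation: plain recursion; native stack depth grows with
// the nesting depth of the expression.
int64_t evalRecursive(const Node *N) {
  int64_t Sum = N->Value;
  for (const Node *Op : N->Ops)
    Sum += evalRecursive(Op);
  return Sum;
}

// Same post-order evaluation with an explicit, heap-allocated stack of
// (node, index of next operand) frames. Assumes there are no cycles.
int64_t evalIterative(const Node *Root) {
  std::unordered_map<const Node *, int64_t> Done;
  std::vector<std::pair<const Node *, std::size_t>> Stack;
  Stack.emplace_back(Root, 0);
  while (!Stack.empty()) {
    const Node *N = Stack.back().first;
    std::size_t I = Stack.back().second;
    if (I < N->Ops.size()) {
      ++Stack.back().second; // resume with the next operand later
      const Node *Op = N->Ops[I];
      if (!Done.count(Op))
        Stack.emplace_back(Op, 0);
      continue;
    }
    // All operands are done: combine them, as the recursive version would
    // after its calls return.
    int64_t Sum = N->Value;
    for (const Node *Op : N->Ops)
      Sum += Done.at(Op);
    Done[N] = Sum;
    Stack.pop_back();
  }
  return Done.at(Root);
}
```

The cost is the extra bookkeeping per frame and a result map. Any logic that decides, per expression kind, whether an operand needs to be evaluated at all would have to be replicated at the point where operands are pushed.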

The other option is we could use the musttail attributes to explicitly write recursion based code without the stack depth problems, but I think that's currently a clang only extension. Not sure if GCC supports it, or where it stands in certification efforts.
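
For reference, a small sketch of what the Clang attribute looks like in use. `TOY_MUSTTAIL`, `ListNode` and `sumList` are hypothetical names; on compilers where the attribute is not detected, the macro expands to nothing and the code falls back to ordinary recursion. The attribute only applies to calls in tail position, so the function is written in accumulator style. The recursive calls in getRangeRef shown in this diff feed their results into further computation, so they would need restructuring before something like this could apply.

```cpp
#include <cassert>
#include <cstdint>

// Use Clang's statement attribute when available; otherwise expand to
// nothing (assumption: plain recursion is acceptable as a fallback here).
#if defined(__clang__)
#if __has_cpp_attribute(clang::musttail)
#define TOY_MUSTTAIL [[clang::musttail]]
#endif
#endif
#ifndef TOY_MUSTTAIL
#define TOY_MUSTTAIL
#endif

// Toy singly linked list (not an LLVM type).
struct ListNode {
  int64_t Value;
  const ListNode *Next;
};

// Tail-recursive sum: the recursive call is the last thing the function
// does, so with musttail the compiler must reuse the current stack frame.
int64_t sumList(const ListNode *N, int64_t Acc) {
  if (!N)
    return Acc;
  TOY_MUSTTAIL return sumList(N->Next, Acc + N->Value);
}
```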

mkazantsev added inline comments. Aug 5 2022, 1:35 AM

llvm/lib/Analysis/ScalarEvolution.cpp
6516	This should be an option.

Add option to customize threshold, ping :)

Harbormaster completed remote builds in B181516: Diff 452991. Aug 16 2022, 7:51 AM

ping

ping :)

ping

Sorry, was on vacation. Taking a look...

I'm not sure if the threshold is ever reached in the tests. Could you pls add a test that

- runs default computation
- runs iterative computation (with cut limit)
- shows that the results don't differ?

llvm/lib/Analysis/ScalarEvolution.cpp
6449	What about div?

LG, still think there could be div support as well. :)

This revision is now accepted and ready to land. Nov 16 2022, 2:52 AM

Thanks! I added a test case & SCEVUDivExpr support. I am planning to land this shortly.

fhahn mentioned this in rG535c2da58dd2: [SCEV] Add range test with phi and division. Nov 21 2022, 11:59 AM

This revision was landed with ongoing or failed builds. Nov 21 2022, 1:58 PM

Closed by commit rG5dad4c67882a: [SCEV] Iteratively compute ranges for deeply nested expressions. (authored by fhahn).

This revision was automatically updated to reflect the committed changes.

fhahn added a commit: rG5dad4c67882a: [SCEV] Iteratively compute ranges for deeply nested expressions.

Harbormaster completed remote builds in B198816: Diff 476937. Nov 21 2022, 5:36 PM

Diff 476996

llvm/include/llvm/Analysis/ScalarEvolution.h

[1,277 lines not shown]	private:
ValueExprMapType ValueExprMap;		ValueExprMapType ValueExprMap;

/// Mark predicate values currently being processed by isImpliedCond.		/// Mark predicate values currently being processed by isImpliedCond.
SmallPtrSet<const Value *, 6> PendingLoopPredicates;		SmallPtrSet<const Value *, 6> PendingLoopPredicates;

/// Mark SCEVUnknown Phis currently being processed by getRangeRef.		/// Mark SCEVUnknown Phis currently being processed by getRangeRef.
SmallPtrSet<const PHINode *, 6> PendingPhiRanges;		SmallPtrSet<const PHINode *, 6> PendingPhiRanges;

		/// Mark SCEVUnknown Phis currently being processed by getRangeRefIter.
		SmallPtrSet<const PHINode *, 6> PendingPhiRangesIter;

// Mark SCEVUnknown Phis currently being processed by isImpliedViaMerge.		// Mark SCEVUnknown Phis currently being processed by isImpliedViaMerge.
SmallPtrSet<const PHINode *, 6> PendingMerges;		SmallPtrSet<const PHINode *, 6> PendingMerges;

/// Set to true by isLoopBackedgeGuardedByCond when we're walking the set of		/// Set to true by isLoopBackedgeGuardedByCond when we're walking the set of
/// conditions dominating the backedge of a loop.		/// conditions dominating the backedge of a loop.
bool WalkingBEDominatingConds = false;		bool WalkingBEDominatingConds = false;

/// Set to true by isKnownPredicateViaSplitting when we're trying to prove a		/// Set to true by isKnownPredicateViaSplitting when we're trying to prove a
[261 lines not shown]	const ConstantRange &setRange(const SCEV *S, RangeSignHint Hint,
if (!Pair.second)		if (!Pair.second)
Pair.first->second = std::move(CR);		Pair.first->second = std::move(CR);
return Pair.first->second;		return Pair.first->second;
}		}

/// Determine the range for a particular SCEV.		/// Determine the range for a particular SCEV.
/// NOTE: This returns a reference to an entry in a cache. It must be		/// NOTE: This returns a reference to an entry in a cache. It must be
/// copied if its needed for longer.		/// copied if its needed for longer.
const ConstantRange &getRangeRef(const SCEV *S, RangeSignHint Hint);		const ConstantRange &getRangeRef(const SCEV *S, RangeSignHint Hint,
		unsigned Depth = 0);

		/// Determine the range for a particular SCEV, but evaluates ranges for
		/// operands iteratively first.
		const ConstantRange &getRangeRefIter(const SCEV *S, RangeSignHint Hint);

/// Determines the range for the affine SCEVAddRecExpr {\p Start,+,\p Step}.		/// Determines the range for the affine SCEVAddRecExpr {\p Start,+,\p Step}.
/// Helper for \c getRange.		/// Helper for \c getRange.
ConstantRange getRangeForAffineAR(const SCEV *Start, const SCEV *Step,		ConstantRange getRangeForAffineAR(const SCEV *Start, const SCEV *Step,
const SCEV *MaxBECount, unsigned BitWidth);		const SCEV *MaxBECount, unsigned BitWidth);

/// Determines the range for the affine non-self-wrapping SCEVAddRecExpr {\p		/// Determines the range for the affine non-self-wrapping SCEVAddRecExpr {\p
/// Start,+,\p Step}<nw>.		/// Start,+,\p Step}<nw>.
[720 lines not shown]

llvm/lib/Analysis/ScalarEvolution.cpp


[214 lines not shown]	MaxAddRecSize("scalar-evolution-max-add-rec-size", cl::Hidden,
cl::desc("Max coefficients in AddRec during evolving"),		cl::desc("Max coefficients in AddRec during evolving"),
cl::init(8));		cl::init(8));

static cl::opt<unsigned>		static cl::opt<unsigned>
HugeExprThreshold("scalar-evolution-huge-expr-threshold", cl::Hidden,		HugeExprThreshold("scalar-evolution-huge-expr-threshold", cl::Hidden,
cl::desc("Size of the expression which is considered huge"),		cl::desc("Size of the expression which is considered huge"),
cl::init(4096));		cl::init(4096));

		static cl::opt<unsigned> RangeIterThreshold(
		"scev-range-iter-threshold", cl::Hidden,
		cl::desc("Threshold for switching to iteratively computing SCEV ranges"),
		cl::init(32));

static cl::opt<bool>		static cl::opt<bool>
ClassifyExpressions("scalar-evolution-classify-expressions",		ClassifyExpressions("scalar-evolution-classify-expressions",
cl::Hidden, cl::init(true),		cl::Hidden, cl::init(true),
cl::desc("When printing analysis, include information on every instruction"));		cl::desc("When printing analysis, include information on every instruction"));

static cl::opt<bool> UseExpensiveRangeSharpening(		static cl::opt<bool> UseExpensiveRangeSharpening(
"scalar-evolution-use-expensive-range-sharpening", cl::Hidden,		"scalar-evolution-use-expensive-range-sharpening", cl::Hidden,
cl::init(false),		cl::init(false),
[6,189 lines not shown]	if (TotalShift.ult(KnownStart.countMinLeadingZeros()))
return ConstantRange(KnownStart.getMinValue(),		return ConstantRange(KnownStart.getMinValue(),
KnownEnd.getMaxValue() + 1);		KnownEnd.getMaxValue() + 1);
break;		break;
}		}
};		};
return FullSet;		return FullSet;
}		}

		const ConstantRange &
		ScalarEvolution::getRangeRefIter(const SCEV *S,
		ScalarEvolution::RangeSignHint SignHint) {
		DenseMap<const SCEV *, ConstantRange> &Cache =
		SignHint == ScalarEvolution::HINT_RANGE_UNSIGNED ? UnsignedRanges
		: SignedRanges;
		SmallVector<const SCEV *> WorkList;
		SmallPtrSet<const SCEV *, 8> Seen;

		// Add Expr to the worklist, if Expr is either an N-ary expression or a
		// SCEVUnknown PHI node.
		auto AddToWorklist = [&WorkList, &Seen, &Cache](const SCEV *Expr) {
		if (!Seen.insert(Expr).second)
		return;
		if (Cache.find(Expr) != Cache.end())
		return;
		if (isa<SCEVNAryExpr>(Expr) || isa<SCEVUDivExpr>(Expr))
		WorkList.push_back(Expr);
		else if (auto *UnknownS = dyn_cast<SCEVUnknown>(Expr))
		if (isa<PHINode>(UnknownS->getValue()))
		WorkList.push_back(Expr);
		};
		AddToWorklist(S);

		// Build worklist by queuing operands of N-ary expressions and phi nodes.
		for (unsigned I = 0; I != WorkList.size(); ++I) {
		const SCEV *P = WorkList[I];
		if (auto *NaryS = dyn_cast<SCEVNAryExpr>(P)) {
		for (const SCEV *Op : NaryS->operands())
		AddToWorklist(Op);
		} else if (auto *UDiv = dyn_cast<SCEVUDivExpr>(P)) {
		AddToWorklist(UDiv->getLHS());
		AddToWorklist(UDiv->getRHS());
		} else {
		auto *UnknownS = cast<SCEVUnknown>(P);
		if (const PHINode *P = dyn_cast<PHINode>(UnknownS->getValue())) {
		if (!PendingPhiRangesIter.insert(P).second)
		continue;
		for (auto &Op : reverse(P->operands()))
		AddToWorklist(getSCEV(Op));
		}
		}
		}

		if (!WorkList.empty()) {
		// Use getRangeRef to compute ranges for items in the worklist in reverse
		// order. This will force ranges for earlier operands to be computed before
		// their users in most cases.
		for (const SCEV *P :
		reverse(make_range(WorkList.begin() + 1, WorkList.end()))) {
		getRangeRef(P, SignHint);

		if (auto *UnknownS = dyn_cast<SCEVUnknown>(P))
		if (const PHINode *P = dyn_cast<PHINode>(UnknownS->getValue()))
		PendingPhiRangesIter.erase(P);
		}
		}

		return getRangeRef(S, SignHint, 0);
		}

/// Determine the range for a particular SCEV. If SignHint is		/// Determine the range for a particular SCEV. If SignHint is
/// HINT_RANGE_UNSIGNED (resp. HINT_RANGE_SIGNED) then getRange prefers ranges		/// HINT_RANGE_UNSIGNED (resp. HINT_RANGE_SIGNED) then getRange prefers ranges
/// with a "cleaner" unsigned (resp. signed) representation.		/// with a "cleaner" unsigned (resp. signed) representation.
const ConstantRange &		const ConstantRange &ScalarEvolution::getRangeRef(
ScalarEvolution::getRangeRef(const SCEV *S,		const SCEV *S, ScalarEvolution::RangeSignHint SignHint, unsigned Depth) {
ScalarEvolution::RangeSignHint SignHint) {
DenseMap<const SCEV *, ConstantRange> &Cache =		DenseMap<const SCEV *, ConstantRange> &Cache =
SignHint == ScalarEvolution::HINT_RANGE_UNSIGNED ? UnsignedRanges		SignHint == ScalarEvolution::HINT_RANGE_UNSIGNED ? UnsignedRanges
: SignedRanges;		: SignedRanges;
ConstantRange::PreferredRangeType RangeType =		ConstantRange::PreferredRangeType RangeType =
SignHint == ScalarEvolution::HINT_RANGE_UNSIGNED		SignHint == ScalarEvolution::HINT_RANGE_UNSIGNED ? ConstantRange::Unsigned
? ConstantRange::Unsigned : ConstantRange::Signed;		: ConstantRange::Signed;

// See if we've computed this range already.		// See if we've computed this range already.
DenseMap<const SCEV *, ConstantRange>::iterator I = Cache.find(S);		DenseMap<const SCEV *, ConstantRange>::iterator I = Cache.find(S);
if (I != Cache.end())		if (I != Cache.end())
return I->second;		return I->second;

if (const SCEVConstant *C = dyn_cast<SCEVConstant>(S))		if (const SCEVConstant *C = dyn_cast<SCEVConstant>(S))
return setRange(C, SignHint, ConstantRange(C->getAPInt()));		return setRange(C, SignHint, ConstantRange(C->getAPInt()));

		// Switch to iteratively computing the range for S, if it is part of a deeply
		// nested expression.
		if (Depth > RangeIterThreshold)
		return getRangeRefIter(S, SignHint);

unsigned BitWidth = getTypeSizeInBits(S->getType());		unsigned BitWidth = getTypeSizeInBits(S->getType());
ConstantRange ConservativeResult(BitWidth, /*isFullSet=*/true);		ConstantRange ConservativeResult(BitWidth, /*isFullSet=*/true);
using OBO = OverflowingBinaryOperator;		using OBO = OverflowingBinaryOperator;

// If the value has known zeros, the maximum value will have those known zeros		// If the value has known zeros, the maximum value will have those known zeros
// as well.		// as well.
uint32_t TZ = GetMinTrailingZeros(S);		uint32_t TZ = GetMinTrailingZeros(S);
if (TZ != 0) {		if (TZ != 0) {
if (SignHint == ScalarEvolution::HINT_RANGE_UNSIGNED)		if (SignHint == ScalarEvolution::HINT_RANGE_UNSIGNED)
ConservativeResult =		ConservativeResult =
ConstantRange(APInt::getMinValue(BitWidth),		ConstantRange(APInt::getMinValue(BitWidth),
APInt::getMaxValue(BitWidth).lshr(TZ).shl(TZ) + 1);		APInt::getMaxValue(BitWidth).lshr(TZ).shl(TZ) + 1);
else		else
ConservativeResult = ConstantRange(		ConservativeResult = ConstantRange(
APInt::getSignedMinValue(BitWidth),		APInt::getSignedMinValue(BitWidth),
APInt::getSignedMaxValue(BitWidth).ashr(TZ).shl(TZ) + 1);		APInt::getSignedMaxValue(BitWidth).ashr(TZ).shl(TZ) + 1);
}		}

if (const SCEVAddExpr *Add = dyn_cast<SCEVAddExpr>(S)) {		if (const SCEVAddExpr *Add = dyn_cast<SCEVAddExpr>(S)) {
ConstantRange X = getRangeRef(Add->getOperand(0), SignHint);		ConstantRange X = getRangeRef(Add->getOperand(0), SignHint, Depth + 1);
unsigned WrapType = OBO::AnyWrap;		unsigned WrapType = OBO::AnyWrap;
if (Add->hasNoSignedWrap())		if (Add->hasNoSignedWrap())
WrapType |= OBO::NoSignedWrap;		WrapType |= OBO::NoSignedWrap;
if (Add->hasNoUnsignedWrap())		if (Add->hasNoUnsignedWrap())
WrapType |= OBO::NoUnsignedWrap;		WrapType |= OBO::NoUnsignedWrap;
for (unsigned i = 1, e = Add->getNumOperands(); i != e; ++i)		for (unsigned i = 1, e = Add->getNumOperands(); i != e; ++i)
X = X.addWithNoWrap(getRangeRef(Add->getOperand(i), SignHint),		X = X.addWithNoWrap(getRangeRef(Add->getOperand(i), SignHint, Depth + 1),
WrapType, RangeType);		WrapType, RangeType);
return setRange(Add, SignHint,		return setRange(Add, SignHint,
ConservativeResult.intersectWith(X, RangeType));		ConservativeResult.intersectWith(X, RangeType));
}		}

if (const SCEVMulExpr *Mul = dyn_cast<SCEVMulExpr>(S)) {		if (const SCEVMulExpr *Mul = dyn_cast<SCEVMulExpr>(S)) {
ConstantRange X = getRangeRef(Mul->getOperand(0), SignHint);		ConstantRange X = getRangeRef(Mul->getOperand(0), SignHint, Depth + 1);
for (unsigned i = 1, e = Mul->getNumOperands(); i != e; ++i)		for (unsigned i = 1, e = Mul->getNumOperands(); i != e; ++i)
X = X.multiply(getRangeRef(Mul->getOperand(i), SignHint));		X = X.multiply(getRangeRef(Mul->getOperand(i), SignHint, Depth + 1));
return setRange(Mul, SignHint,		return setRange(Mul, SignHint,
ConservativeResult.intersectWith(X, RangeType));		ConservativeResult.intersectWith(X, RangeType));
}		}

if (isa<SCEVMinMaxExpr>(S) || isa<SCEVSequentialMinMaxExpr>(S)) {		if (isa<SCEVMinMaxExpr>(S) || isa<SCEVSequentialMinMaxExpr>(S)) {
Intrinsic::ID ID;		Intrinsic::ID ID;
switch (S->getSCEVType()) {		switch (S->getSCEVType()) {
case scUMaxExpr:		case scUMaxExpr:
[9 lines not shown]	if (isa<SCEVMinMaxExpr>(S) || isa<SCEVSequentialMinMaxExpr>(S)) {
case scSMinExpr:		case scSMinExpr:
ID = Intrinsic::smin;		ID = Intrinsic::smin;
break;		break;
default:		default:
llvm_unreachable("Unknown SCEVMinMaxExpr/SCEVSequentialMinMaxExpr.");		llvm_unreachable("Unknown SCEVMinMaxExpr/SCEVSequentialMinMaxExpr.");
}		}

const auto *NAry = cast<SCEVNAryExpr>(S);		const auto *NAry = cast<SCEVNAryExpr>(S);
ConstantRange X = getRangeRef(NAry->getOperand(0), SignHint);		ConstantRange X = getRangeRef(NAry->getOperand(0), SignHint, Depth + 1);
for (unsigned i = 1, e = NAry->getNumOperands(); i != e; ++i)		for (unsigned i = 1, e = NAry->getNumOperands(); i != e; ++i)
X = X.intrinsic(ID, {X, getRangeRef(NAry->getOperand(i), SignHint)});		X = X.intrinsic(
		ID, {X, getRangeRef(NAry->getOperand(i), SignHint, Depth + 1)});
return setRange(S, SignHint,		return setRange(S, SignHint,
ConservativeResult.intersectWith(X, RangeType));		ConservativeResult.intersectWith(X, RangeType));
}		}

if (const SCEVUDivExpr *UDiv = dyn_cast<SCEVUDivExpr>(S)) {		if (const SCEVUDivExpr *UDiv = dyn_cast<SCEVUDivExpr>(S)) {
ConstantRange X = getRangeRef(UDiv->getLHS(), SignHint);		ConstantRange X = getRangeRef(UDiv->getLHS(), SignHint, Depth + 1);
ConstantRange Y = getRangeRef(UDiv->getRHS(), SignHint);		ConstantRange Y = getRangeRef(UDiv->getRHS(), SignHint, Depth + 1);
return setRange(UDiv, SignHint,		return setRange(UDiv, SignHint,
ConservativeResult.intersectWith(X.udiv(Y), RangeType));		ConservativeResult.intersectWith(X.udiv(Y), RangeType));
}		}

if (const SCEVZeroExtendExpr *ZExt = dyn_cast<SCEVZeroExtendExpr>(S)) {		if (const SCEVZeroExtendExpr *ZExt = dyn_cast<SCEVZeroExtendExpr>(S)) {
ConstantRange X = getRangeRef(ZExt->getOperand(), SignHint);		ConstantRange X = getRangeRef(ZExt->getOperand(), SignHint, Depth + 1);
return setRange(ZExt, SignHint,		return setRange(ZExt, SignHint,
ConservativeResult.intersectWith(X.zeroExtend(BitWidth),		ConservativeResult.intersectWith(X.zeroExtend(BitWidth),
RangeType));		RangeType));
}		}

if (const SCEVSignExtendExpr *SExt = dyn_cast<SCEVSignExtendExpr>(S)) {		if (const SCEVSignExtendExpr *SExt = dyn_cast<SCEVSignExtendExpr>(S)) {
ConstantRange X = getRangeRef(SExt->getOperand(), SignHint);		ConstantRange X = getRangeRef(SExt->getOperand(), SignHint, Depth + 1);
return setRange(SExt, SignHint,		return setRange(SExt, SignHint,
ConservativeResult.intersectWith(X.signExtend(BitWidth),		ConservativeResult.intersectWith(X.signExtend(BitWidth),
RangeType));		RangeType));
}		}

if (const SCEVPtrToIntExpr *PtrToInt = dyn_cast<SCEVPtrToIntExpr>(S)) {		if (const SCEVPtrToIntExpr *PtrToInt = dyn_cast<SCEVPtrToIntExpr>(S)) {
ConstantRange X = getRangeRef(PtrToInt->getOperand(), SignHint);		ConstantRange X = getRangeRef(PtrToInt->getOperand(), SignHint, Depth + 1);
return setRange(PtrToInt, SignHint, X);		return setRange(PtrToInt, SignHint, X);
}		}

if (const SCEVTruncateExpr *Trunc = dyn_cast<SCEVTruncateExpr>(S)) {		if (const SCEVTruncateExpr *Trunc = dyn_cast<SCEVTruncateExpr>(S)) {
ConstantRange X = getRangeRef(Trunc->getOperand(), SignHint);		ConstantRange X = getRangeRef(Trunc->getOperand(), SignHint, Depth + 1);
return setRange(Trunc, SignHint,		return setRange(Trunc, SignHint,
ConservativeResult.intersectWith(X.truncate(BitWidth),		ConservativeResult.intersectWith(X.truncate(BitWidth),
RangeType));		RangeType));
}		}

if (const SCEVAddRecExpr *AddRec = dyn_cast<SCEVAddRecExpr>(S)) {		if (const SCEVAddRecExpr *AddRec = dyn_cast<SCEVAddRecExpr>(S)) {
// If there's no unsigned wrap, the value will never be less than its		// If there's no unsigned wrap, the value will never be less than its
// initial value.		// initial value.
[113 lines not shown]	if (Known.getMinValue() != Known.getMaxValue() + 1)
RangeType);		RangeType);
if (NS > 1)		if (NS > 1)
ConservativeResult = ConservativeResult.intersectWith(		ConservativeResult = ConservativeResult.intersectWith(
ConstantRange(APInt::getSignedMinValue(BitWidth).ashr(NS - 1),		ConstantRange(APInt::getSignedMinValue(BitWidth).ashr(NS - 1),
APInt::getSignedMaxValue(BitWidth).ashr(NS - 1) + 1),		APInt::getSignedMaxValue(BitWidth).ashr(NS - 1) + 1),
RangeType);		RangeType);

// A range of Phi is a subset of union of all ranges of its input.		// A range of Phi is a subset of union of all ranges of its input.
if (const PHINode *Phi = dyn_cast<PHINode>(U->getValue())) {		if (PHINode *Phi = dyn_cast<PHINode>(U->getValue())) {
// Make sure that we do not run over cycled Phis.		// Make sure that we do not run over cycled Phis.
if (PendingPhiRanges.insert(Phi).second) {		if (PendingPhiRanges.insert(Phi).second) {
ConstantRange RangeFromOps(BitWidth, /*isFullSet=*/false);		ConstantRange RangeFromOps(BitWidth, /*isFullSet=*/false);

for (const auto &Op : Phi->operands()) {		for (const auto &Op : Phi->operands()) {
auto OpRange = getRangeRef(getSCEV(Op), SignHint);		auto OpRange = getRangeRef(getSCEV(Op), SignHint, Depth + 1);
RangeFromOps = RangeFromOps.unionWith(OpRange);		RangeFromOps = RangeFromOps.unionWith(OpRange);
// No point to continue if we already have a full set.		// No point to continue if we already have a full set.
if (RangeFromOps.isFullSet())		if (RangeFromOps.isFullSet())
break;		break;
}		}
ConservativeResult =		ConservativeResult =
ConservativeResult.intersectWith(RangeFromOps, RangeType);		ConservativeResult.intersectWith(RangeFromOps, RangeType);
bool Erased = PendingPhiRanges.erase(Phi);		bool Erased = PendingPhiRanges.erase(Phi);
[8,230 lines not shown]

llvm/test/Analysis/ScalarEvolution/ranges.ll

	; NOTE: Assertions have been autogenerated by utils/update_analyze_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_analyze_test_checks.py
	; RUN: opt < %s -disable-output "-passes=print<scalar-evolution>" 2>&1 | FileCheck %s			; RUN: opt < %s -disable-output "-passes=print<scalar-evolution>" 2>&1 | FileCheck %s
				; RUN: opt < %s -disable-output "-passes=print<scalar-evolution>" -scev-range-iter-threshold=1 2>&1 | FileCheck %s

	target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64"			target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64"

	; Collection of cases exercising range logic, mostly (but not exclusively)			; Collection of cases exercising range logic, mostly (but not exclusively)
	; involving SCEVUnknowns.			; involving SCEVUnknowns.

	declare void @llvm.assume(i1)			declare void @llvm.assume(i1)


llvm/test/Transforms/IndVarSimplify/range-iter-threshold.ll

This file was added.

				; RUN: opt -passes=indvars -S %s | FileCheck --check-prefix=COMMON --check-prefix=DEFAULT %s
				; RUN: opt -passes=indvars -scev-range-iter-threshold=1 -S %s | FileCheck --check-prefix=COMMON --check-prefix=LIMIT %s

				target datalayout = "e-m:o-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"

				define i32 @test(i1 %c.0, i32 %m) {
				; COMMON-LABEL: @test(
				; COMMON-NEXT: entry:
				; COMMON-NEXT: br label [[OUTER_HEADER:%.*]]
				; COMMON: outer.header:
				; DEFAULT-NEXT: [[INDVARS_IV:%.*]] = phi i32 [ [[INDVARS_IV_NEXT:%.*]], [[OUTER_LATCH:%.*]] ], [ 2, [[ENTRY:%.*]] ]
				; COMMON-NEXT: [[IV_1:%.*]] = phi i32 [ 0, [[ENTRY:%.*]] ], [ [[IV_1_NEXT:%.*]], [[OUTER_LATCH:%.*]] ]
				; COMMON-NEXT: [[MAX_0:%.*]] = phi i32 [ 0, [[ENTRY]] ], [ [[MAX_1:%.*]], [[OUTER_LATCH]] ]
				; COMMON-NEXT: [[TMP0:%.*]] = sext i32 [[IV_1]] to i64
				; COMMON-NEXT: br label [[INNER_1:%.*]]
				; COMMON: inner.1:
				; COMMON-NEXT: [[C_1:%.*]] = icmp slt i64 0, [[TMP0]]
				; COMMON-NEXT: br i1 [[C_1]], label [[INNER_1]], label [[INNER_2_HEADER_PREHEADER:%.*]]
				; COMMON: inner.2.header.preheader:
				; COMMON-NEXT: br label [[INNER_2_HEADER:%.*]]
				; COMMON: inner.2.header:
				; COMMON-NEXT: [[IV_3:%.*]] = phi i32 [ [[IV_3_NEXT:%.*]], [[INNER_2_LATCH:%.*]] ], [ 0, [[INNER_2_HEADER_PREHEADER]] ]
				; COMMON-NEXT: br i1 [[C_0:%.*]], label [[OUTER_LATCH]], label [[INNER_2_LATCH]]
				; COMMON: inner.2.latch:
				; COMMON-NEXT: [[IV_3_NEXT]] = add i32 [[IV_3]], 1
				; DEFAULT-NEXT: [[EXITCOND:%.*]] = icmp eq i32 [[IV_3_NEXT]], [[INDVARS_IV]]
				; LIMIT-NEXT: [[EXITCOND:%.*]] = icmp ugt i32 [[IV_3]], [[IV_1]]
				; COMMON-NEXT: br i1 [[EXITCOND]], label [[OUTER_LATCH]], label [[INNER_2_HEADER]]
				; COMMON: outer.latch:
				; COMMON-NEXT: [[MAX_1]] = phi i32 [ [[M:%.*]], [[INNER_2_LATCH]] ], [ 0, [[INNER_2_HEADER]] ]
				; COMMON-NEXT: [[IV_1_NEXT]] = add nuw i32 [[IV_1]], 1
				; COMMON-NEXT: [[C_3:%.*]] = icmp ugt i32 [[IV_1]], [[MAX_0]]
				; DEFAULT-NEXT: [[INDVARS_IV_NEXT]] = add i32 [[INDVARS_IV]], 1
				; COMMON-NEXT: br i1 [[C_3]], label [[EXIT:%.*]], label [[OUTER_HEADER]], !llvm.loop [[LOOP0:![0-9]+]]
				; COMMON: exit:
				; COMMON-NEXT: ret i32 0
				;
				entry:
				br label %outer.header

				outer.header:
				%iv.1 = phi i32 [ 0, %entry ], [ %iv.1.next, %outer.latch ]
				%iv.2 = phi i32 [ 0, %entry ], [ %iv.2.next , %outer.latch ]
				%max.0 = phi i32 [ 0, %entry ], [ %max.1, %outer.latch ]
				%0 = sext i32 %iv.1 to i64
				br label %inner.1

				inner.1:
				%c.1 = icmp slt i64 0, %0
				br i1 %c.1, label %inner.1, label %inner.2.header

				inner.2.header:
				%iv.3 = phi i32 [ 0, %inner.1 ], [ %iv.3.next, %inner.2.latch ]
				br i1 %c.0, label %outer.latch, label %inner.2.latch

				inner.2.latch:
				%iv.3.next = add i32 %iv.3, 1
				%c.2 = icmp ugt i32 %iv.3, %iv.2
				br i1 %c.2, label %outer.latch, label %inner.2.header

				outer.latch:
				%max.1 = phi i32 [ %m, %inner.2.latch ], [ %iv.3, %inner.2.header ]
				%iv.1.next = add i32 %iv.1, 1
				%iv.2.next = add i32 %iv.2, 1
				%c.3 = icmp ugt i32 %iv.2, %max.0
				br i1 %c.3, label %exit, label %outer.header, !llvm.loop !0

				exit:
				ret i32 0
				}

				!0 = distinct !{!0, !1}
				!1 = !{!"llvm.loop.mustprogress"}

This is an archive of the discontinued LLVM Phabricator instance.

[SCEV] Iteratively compute ranges for deeply nested expressions.
ClosedPublic

Diff 476996

llvm/include/llvm/Analysis/ScalarEvolution.h

llvm/lib/Analysis/ScalarEvolution.cpp

llvm/test/Analysis/ScalarEvolution/ranges.ll

llvm/test/Transforms/IndVarSimplify/range-iter-threshold.ll
