This is an archive of the discontinued LLVM Phabricator instance.

Differential D106331

[ScalarEvolution] Try harder to prove overflow in howManyLessThans.
Changes Planned · Public

Authored by efriedma on Jul 19 2021, 6:24 PM.

Details

Reviewers

reames
mkazantsev

Summary

If we have an instruction "add nsw (IV - Stride), Stride" feeding into the icmp, we know Start - Stride doesn't overflow.

It's unfortunate we can't deduce this more directly, but we don't really have any SCEV infrastructure to support this sort of check.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

Time	Test
3,260 ms	x64 debian > libarcher.critical::critical.c
3,660 ms	x64 debian > libarcher.races::critical-unrelated.c
3,440 ms	x64 debian > libarcher.races::lock-nested-unrelated.c
4,019 ms	x64 debian > libarcher.races::lock-unrelated.c
3,680 ms	x64 debian > libarcher.races::parallel-simple.c
(15 failed in total)

Event Timeline

efriedma created this revision. Jul 19 2021, 6:24 PM

Herald added subscribers: javed.absar, hiraditya. Jul 19 2021, 6:24 PM

efriedma requested review of this revision. Jul 19 2021, 6:24 PM

Herald added a project: Restricted Project. Jul 19 2021, 6:24 PM

Missed a test update

Harbormaster completed remote builds in B114995: Diff 359979. Jul 19 2021, 7:06 PM

Not really a fan of the structure of this change. Bear with me as I talk through a couple of approaches (only the last of which may work.)

Let me throw out an approach for this in terms of SCEVs. I'm going to use the variable names from foo1 in trip-count-unknown-stride.ll to make this understandable.
%i.05 = {0, +, %s}
%add = {0 + %s, +, %s}

The condition we need to prove is that (0 + %s - %s) < (0 + %s). (Remember that the start here is in terms of %add, not %i.05.)

One tactic would be to prove (0 - %s) < 0. (i.e. cancel %s on either side - we don't care whether the addition of %s would wrap in that step.) This reduces to proving %s > 0. This is interesting as we already have a framework for proving strides positive. Maybe we can extend that?

Interestingly, we can do that with SCEV. Consider the following code:

if (!PositiveStride) {
  if (IsSigned && NoWrap &&
      isLoopInvariant(RHS, L) && IV->getStart() == Stride &&
      isLoopEntryGuardedByCond(L, Cond, getZero(RHS->getType()), RHS) && 
      loopIsFiniteByAssumption(L))
    PositiveStride = true;
}

I'd written this for another purpose which didn't work out, but I think the structure is valid. This is maybe too specific for a zero start on i.05 - though maybe we can generalize?

Anyways, that isn't a full solution, but I'd strongly prefer you try to use a SCEV proof here over mixing IR and SCEV proofs.
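The cancellation argument in this comment (proving Start - Stride < Start by proving the stride positive) can be sanity-checked by brute force. The sketch below is only an illustration, not SCEV code: it assumes 8-bit signed values so the search is exhaustive, and the function names are hypothetical. It checks that whenever Start - Stride does not overflow, the comparison holds exactly when Stride > 0, and it shows a wrapping case where a positive stride alone is not enough.

```cpp
#include <cassert>
#include <cstdint>

// True if Start - Stride is representable as a signed 8-bit value,
// i.e. the subtraction would be legal with an nsw flag.
bool subFitsInI8(int Start, int Stride) {
  assert(Start >= INT8_MIN && Start <= INT8_MAX);
  assert(Stride >= INT8_MIN && Stride <= INT8_MAX);
  int Wide = Start - Stride;
  return Wide >= INT8_MIN && Wide <= INT8_MAX;
}

// Exhaustively counts i8 pairs where the subtraction does not overflow but
// "(Start - Stride) <s Start" disagrees with "Stride >s 0".
int countCounterexamples() {
  int Bad = 0;
  for (int Start = INT8_MIN; Start <= INT8_MAX; ++Start)
    for (int Stride = INT8_MIN; Stride <= INT8_MAX; ++Stride) {
      if (!subFitsInI8(Start, Stride))
        continue; // overflow: outside the case being checked
      bool Less = (Start - Stride) < Start;
      if (Less != (Stride > 0))
        ++Bad;
    }
  return Bad;
}

// Same comparison, but with the subtraction wrapping in 8 bits.
bool wrappedSubIsLess(int8_t Start, int8_t Stride) {
  uint8_t Wrapped = static_cast<uint8_t>(static_cast<uint8_t>(Start) -
                                         static_cast<uint8_t>(Stride));
  return static_cast<int8_t>(Wrapped) < Start;
}
```

With Start = -128 and Stride = 1 the wrapped difference is 127, so the comparison fails even though the stride is positive. That is why the no-overflow fact has to come from somewhere, whether from a SCEV proof or from an nsw flag in the IR.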

This revision now requires changes to proceed. Jul 20 2021, 9:58 AM

efriedma planned changes to this revision. Jul 20 2021, 12:08 PM

Added another interesting case to look at. Still considering possible solutions that don't involve poking at IR directly.

Harbormaster completed remote builds in B115162: Diff 360226. Jul 20 2021, 1:19 PM

The easiest thing here would be if SCEV recorded the flags somewhere... but unfortunately, we don't; when we discover the flags aren't universally applicable, we throw them away. We could try to construct some sort of side-table, I guess, to provide an API getNoWrapFlagsWithContext(SCEV*, Instruction*). So basically the same logic as this, but factored away.

One alternative I considered is looking at the nowrap flags of the AddRec "IV - Stride". We do often manage to deduce nsw/nuw, and if the backedge is taken, it proves Start - Stride doesn't overflow. But if we're doing that, I'm not sure how we prove the case where the backedge isn't taken.

Any comments on whether we want to merge this patch? Or how we can encode the necessary information into ScalarEvolution? (Really, I'm more concerned about the pointer_iv_nowrap_guard case than the unknown-stride case.)
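To make the side-table idea from this comment concrete, here is a minimal sketch of what a context-keyed flag lookup might look like. Everything in it is an assumption for illustration: Expr and Inst are empty stand-ins rather than LLVM's SCEV and Instruction classes, the WrapFlags enum is made up, and only the name getNoWrapFlagsWithContext comes from the discussion. It is not an existing ScalarEvolution API.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <utility>

struct Expr {}; // hypothetical stand-in for a SCEV expression
struct Inst {}; // hypothetical stand-in for an IR instruction

// Hypothetical flag bits; not LLVM's SCEV::NoWrapFlags.
enum WrapFlags : unsigned { WrapNone = 0, WrapNUW = 1u << 0, WrapNSW = 1u << 1 };

// Remembers flags that are valid for an expression only at a given instruction.
class ContextualFlagTable {
  using Key = std::pair<const Expr *, const Inst *>;
  struct KeyLess {
    bool operator()(const Key &A, const Key &B) const {
      std::less<const void *> Less; // total order over unrelated pointers
      if (Less(A.first, B.first)) return true;
      if (Less(B.first, A.first)) return false;
      return Less(A.second, B.second);
    }
  };
  std::map<Key, unsigned, KeyLess> Flags;

public:
  void record(const Expr *E, const Inst *Ctx, unsigned F) {
    assert(E && Ctx && "keys must be non-null");
    Flags[Key(E, Ctx)] |= F;
  }

  // Returns WrapNone when nothing was recorded for this (expression, context).
  unsigned getNoWrapFlagsWithContext(const Expr *E, const Inst *Ctx) const {
    auto It = Flags.find(Key(E, Ctx));
    return It == Flags.end() ? WrapNone : It->second;
  }
};

// Records nsw for one context, then queries either that context or another.
unsigned demoLookup(bool SameContext) {
  Expr Start;
  Inst AddFeedingICmp, Unrelated;
  ContextualFlagTable Table;
  Table.record(&Start, &AddFeedingICmp, WrapNSW);
  return Table.getNoWrapFlagsWithContext(
      &Start, SameContext ? &AddFeedingICmp : &Unrelated);
}
```

The point of keying on the instruction is that a flag discarded as "not universally applicable" can still be answered for the one program point where it is known to hold.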

efriedma mentioned this in D106852: [SCEV] Fix getAddExpr for adding loop invariants into start of some AddRec. Aug 5 2021, 10:54 AM

LGTM. I'm not completely opposed to this and don't want to block progress further. I do think we need something better and more general. If you've got some time in the next couple days, I'd really like to get on a call and chat through what that might be.

One, possibly crazy, idea would be to use the SCEV <-> Value maps we have, and then do a forward propagation on the values to see if a given addrec being poison is guaranteed to trigger UB before the latch is taken. In this case, we should be able to map from the addrec to the IR expressions, and then find the poison reaching latch without needing to explicitly pass in the conditions.

llvm/lib/Analysis/ScalarEvolution.cpp
11791	I don't understand this line of the diff. Can you remove for the moment, and do this in a separate change? The mixed usage of OrigStart and Start here is confusing.
llvm/test/Analysis/ScalarEvolution/no-wrap-symbolic-becount.ll
125 ↗	(On Diff #360226)	Please land this separately, and then rebase over so that the landed diff shows the change.

This revision is now accepted and ready to land. Aug 31 2021, 5:19 PM

efriedma mentioned this in rG0375734439a0: [NFC] Add extra test for D106331. Sep 8 2021, 2:19 PM

Committed new test separately. Split off the Start/OrigStart thing to D109465.

@efriedma Can I ask you to prioritize landing this? I suspect this is going to reduce the test diffs implied by any fix to D106852, and I'd like to get this in so we can build on it.

(For context, we're likely to lose the ability to infer flags on the add forming the start expression for the post-increment IV. As such, being able to directly derive the fact from the IR - this patch - becomes much more relevant.)

Rebased

Harbormaster completed remote builds in B123759: Diff 372361. Sep 13 2021, 4:39 PM

efriedma mentioned this in D125792: [SCEV] Use nowrap flags to infer that start > start - stride in howManyLessThans. May 17 2022, 12:34 PM

(This currently doesn't apply cleanly.)

Herald added a project: Restricted Project. May 17 2022, 12:42 PM

bsmith added a subscriber: bsmith. May 18 2022, 3:15 AM

Revision Contents

Path	Size
llvm/include/llvm/Analysis/ScalarEvolution.h	2 lines
llvm/lib/Analysis/ScalarEvolution.cpp	36 lines
llvm/test/Analysis/ScalarEvolution/trip-count-unknown-stride.ll	2 lines
llvm/test/Transforms/LoopReroll/nonconst_lb.ll	24 lines

Diff 359979

llvm/include/llvm/Analysis/ScalarEvolution.h

Show First 20 Lines • Show All 1,805 Lines • ▼ Show 20 Lines	private:
/// \p ControlsExit is true when the LHS < RHS condition directly controls		/// \p ControlsExit is true when the LHS < RHS condition directly controls
/// the branch (loops exits only if condition is true). In this case, we can		/// the branch (loops exits only if condition is true). In this case, we can
/// use NoWrapFlags to skip overflow checks.		/// use NoWrapFlags to skip overflow checks.
///		///
/// If \p AllowPredicates is set, this call will try to use a minimal set of		/// If \p AllowPredicates is set, this call will try to use a minimal set of
/// SCEV predicates in order to return an exact answer.		/// SCEV predicates in order to return an exact answer.
ExitLimit howManyLessThans(const SCEV *LHS, const SCEV *RHS, const Loop *L,		ExitLimit howManyLessThans(const SCEV *LHS, const SCEV *RHS, const Loop *L,
bool isSigned, bool ControlsExit,		bool isSigned, bool ControlsExit,
bool AllowPredicates = false);		bool AllowPredicates, ICmpInst *OrigCond);

ExitLimit howManyGreaterThans(const SCEV *LHS, const SCEV *RHS, const Loop *L,		ExitLimit howManyGreaterThans(const SCEV *LHS, const SCEV *RHS, const Loop *L,
bool isSigned, bool IsSubExpr,		bool isSigned, bool IsSubExpr,
bool AllowPredicates = false);		bool AllowPredicates = false);

/// Return a predecessor of BB (which may not be an immediate predecessor)		/// Return a predecessor of BB (which may not be an immediate predecessor)
/// which has exactly one successor from which BB is reachable, or null if		/// which has exactly one successor from which BB is reachable, or null if
/// no such block is found.		/// no such block is found.
▲ Show 20 Lines • Show All 440 Lines • Show Last 20 Lines

llvm/lib/Analysis/ScalarEvolution.cpp

Show First 20 Lines • Show All 8,102 Lines • ▼ Show 20 Lines	case ICmpInst::ICMP_EQ: { // while (X == Y)
ExitLimit EL = howFarToNonZero(getMinusSCEV(LHS, RHS), L);		ExitLimit EL = howFarToNonZero(getMinusSCEV(LHS, RHS), L);
if (EL.hasAnyInfo()) return EL;		if (EL.hasAnyInfo()) return EL;
break;		break;
}		}
case ICmpInst::ICMP_SLT:		case ICmpInst::ICMP_SLT:
case ICmpInst::ICMP_ULT: { // while (X < Y)		case ICmpInst::ICMP_ULT: { // while (X < Y)
bool IsSigned = Pred == ICmpInst::ICMP_SLT;		bool IsSigned = Pred == ICmpInst::ICMP_SLT;
ExitLimit EL = howManyLessThans(LHS, RHS, L, IsSigned, ControlsExit,		ExitLimit EL = howManyLessThans(LHS, RHS, L, IsSigned, ControlsExit,
AllowPredicates);		AllowPredicates, ExitCond);
if (EL.hasAnyInfo()) return EL;		if (EL.hasAnyInfo()) return EL;
break;		break;
}		}
case ICmpInst::ICMP_SGT:		case ICmpInst::ICMP_SGT:
case ICmpInst::ICMP_UGT: { // while (X > Y)		case ICmpInst::ICMP_UGT: { // while (X > Y)
bool IsSigned = Pred == ICmpInst::ICMP_SGT;		bool IsSigned = Pred == ICmpInst::ICMP_SGT;
ExitLimit EL =		ExitLimit EL =
howManyGreaterThans(LHS, RHS, L, IsSigned, ControlsExit,		howManyGreaterThans(LHS, RHS, L, IsSigned, ControlsExit,
▲ Show 20 Lines • Show All 3,445 Lines • ▼ Show 20 Lines	MaxEnd = IsSigned ? APIntOps::smax(MaxEnd, MinStart)
: APIntOps::umax(MaxEnd, MinStart);		: APIntOps::umax(MaxEnd, MinStart);

MaxBECount = getUDivCeilSCEV(getConstant(MaxEnd - MinStart) /* Delta */,		MaxBECount = getUDivCeilSCEV(getConstant(MaxEnd - MinStart) /* Delta */,
getConstant(StrideForMaxBECount) /* Step */);		getConstant(StrideForMaxBECount) /* Step */);

return MaxBECount;		return MaxBECount;
}		}

ScalarEvolution::ExitLimit		ScalarEvolution::ExitLimit
		Lint: Pre-merge checks: clang-format: please reformat the code
		-ScalarEvolution::ExitLimit
		-ScalarEvolution::howManyLessThans(const SCEV *LHS, const SCEV *RHS,
		-                                  const Loop *L, bool IsSigned,
		-                                  bool ControlsExit, bool AllowPredicates,
		-                                  ICmpInst *OrigCond) {
		+ScalarEvolution::ExitLimit ScalarEvolution::howManyLessThans(
		+    const SCEV *LHS, const SCEV *RHS, const Loop *L, bool IsSigned,
		+    bool ControlsExit, bool AllowPredicates, ICmpInst *OrigCond) {
ScalarEvolution::howManyLessThans(const SCEV *LHS, const SCEV *RHS,		ScalarEvolution::howManyLessThans(const SCEV *LHS, const SCEV *RHS,
const Loop *L, bool IsSigned,		const Loop *L, bool IsSigned,
bool ControlsExit, bool AllowPredicates) {		bool ControlsExit, bool AllowPredicates,
		ICmpInst *OrigCond) {
SmallPtrSet<const SCEVPredicate *, 4> Predicates;		SmallPtrSet<const SCEVPredicate *, 4> Predicates;

const SCEVAddRecExpr *IV = dyn_cast<SCEVAddRecExpr>(LHS);		const SCEVAddRecExpr *IV = dyn_cast<SCEVAddRecExpr>(LHS);
bool PredicatedIV = false;		bool PredicatedIV = false;

if (!IV && AllowPredicates) {		if (!IV && AllowPredicates) {
// Try to make this an AddRec using runtime tests, in the first X		// Try to make this an AddRec using runtime tests, in the first X
// iterations of this loop, where X is the SCEV expression found by the		// iterations of this loop, where X is the SCEV expression found by the
▲ Show 20 Lines • Show All 175 Lines • ▼ Show 20 Lines	ScalarEvolution::howManyLessThans(const SCEV *LHS, const SCEV *RHS,

// We use the expression (max(End,Start)-Start)/Stride to describe the		// We use the expression (max(End,Start)-Start)/Stride to describe the
// backedge count, as if the backedge is taken at least once max(End,Start)		// backedge count, as if the backedge is taken at least once max(End,Start)
// is End and so the result is as above, and if not max(End,Start) is Start		// is End and so the result is as above, and if not max(End,Start) is Start
// so we get a backedge count of zero.		// so we get a backedge count of zero.
const SCEV *BECount = nullptr;		const SCEV *BECount = nullptr;
auto *StartMinusStride = getMinusSCEV(OrigStart, Stride);		auto *StartMinusStride = getMinusSCEV(OrigStart, Stride);
// Can we prove (max(RHS,Start) > Start - Stride?		// Can we prove (max(RHS,Start) > Start - Stride?
if (isLoopEntryGuardedByCond(L, Cond, StartMinusStride, Start) &&		if (isLoopEntryGuardedByCond(L, Cond, StartMinusStride, RHS)) {
isLoopEntryGuardedByCond(L, Cond, StartMinusStride, RHS)) {		auto MaySubOverflow = [&]() {
		// Start - Stride < Start implies no overflow.
		if (isLoopEntryGuardedByCond(L, Cond, StartMinusStride, Start))
		return false;
		// Check if we have an IR instruction feeding into the branch
		// "add nsw IVMinusStride, Stride"
		if (getSCEV(OrigCond->getOperand(0)) == IV) {
		if (auto *Add = dyn_cast<AddOperator>(OrigCond->getOperand(0))) {
		if (IsSigned ? Add->hasNoSignedWrap() : Add->hasNoUnsignedWrap()) {
		if (getSCEV(Add->getOperand(1)) == Stride)
		return false;
		}
		}
		}
		return true;
		};
// In this case, we can use a refined formula for computing backedge taken		// In this case, we can use a refined formula for computing backedge taken
// count. The general formula remains:		// count. The general formula remains:
// "End-Start /uceiling Stride" where "End = max(RHS,Start)"		// "End-Start /uceiling Stride" where "End = max(RHS,Start)"
// We want to use the alternate formula:		// We want to use the alternate formula:
// "((End - 1) - (Start - Stride)) /u Stride"		// "((End - 1) - (Start - Stride)) /u Stride"
// Let's do a quick case analysis to show these are equivalent under		// Let's do a quick case analysis to show these are equivalent under
// our precondition that max(RHS,Start) > Start - Stride.		// our precondition that max(RHS,Start) > Start - Stride.
// * For RHS <= Start, the backedge-taken count must be zero.		// * For RHS <= Start, the backedge-taken count must be zero.
// "((End - 1) - (Start - Stride)) /u Stride" reduces to		// "((End - 1) - (Start - Stride)) /u Stride" reduces to
// "((Start - 1) - (Start - Stride)) /u Stride" which simplies to		// "((Start - 1) - (Start - Stride)) /u Stride" which simplies to
// "Stride - 1 /u Stride" which is indeed zero for all non-zero values		// "Stride - 1 /u Stride" which is indeed zero for all non-zero values
// of Stride. For 0 stride, we've use umin(1,Stride) above, reducing		// of Stride. For 0 stride, we've use umin(1,Stride) above, reducing
// this to the stride of 1 case.		// this to the stride of 1 case.
// * For RHS >= Start, the backedge count must be "RHS-Start /uceil Stride".		// * For RHS >= Start, the backedge count must be "RHS-Start /uceil Stride".
// "((End - 1) - (Start - Stride)) /u Stride" reduces to		// "((End - 1) - (Start - Stride)) /u Stride" reduces to
// "((RHS - 1) - (Start - Stride)) /u Stride" reassociates to		// "((RHS - 1) - (Start - Stride)) /u Stride" reassociates to
// "((RHS - (Start - Stride) - 1) /u Stride".		// "((RHS - (Start - Stride) - 1) /u Stride".
// Our preconditions trivially imply no overflow in that form.		// Our preconditions trivially imply no overflow in that form.
		if (!MaySubOverflow()) {
const SCEV *MinusOne = getMinusOne(Stride->getType());		const SCEV *MinusOne = getMinusOne(Stride->getType());
const SCEV *Numerator =		const SCEV *Numerator =
getMinusSCEV(getAddExpr(RHS, MinusOne), StartMinusStride);		getMinusSCEV(getAddExpr(RHS, MinusOne), StartMinusStride);
if (!isa<SCEVCouldNotCompute>(Numerator)) {		if (!isa<SCEVCouldNotCompute>(Numerator)) {
reames (inline comment): I don't understand this line of the diff. Can you remove for the moment, and do this in a separate change? The mixed usage of OrigStart and Start here is confusing.
BECount = getUDivExpr(Numerator, Stride);		BECount = getUDivExpr(Numerator, Stride);
}		}
}		}
		}

const SCEV *BECountIfBackedgeTaken = nullptr;		const SCEV *BECountIfBackedgeTaken = nullptr;
if (!BECount) {		if (!BECount) {
auto canProveRHSGreaterThanEqualStart = [&]() {		auto canProveRHSGreaterThanEqualStart = [&]() {
auto CondGE = IsSigned ? ICmpInst::ICMP_SGE : ICmpInst::ICMP_UGE;		auto CondGE = IsSigned ? ICmpInst::ICMP_SGE : ICmpInst::ICMP_UGE;
if (isLoopEntryGuardedByCond(L, CondGE, OrigRHS, OrigStart))		if (isLoopEntryGuardedByCond(L, CondGE, OrigRHS, OrigStart))
return true;		return true;

▲ Show 20 Lines • Show All 2,283 Lines • Show Last 20 Lines
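
The case analysis in the howManyLessThans comment above can be checked by brute force. The sketch below is an illustration under stated assumptions, not part of the patch: it covers only the unsigned case at 8 bits with a nonzero stride, and the function names are hypothetical. It compares the refined numerator as the code builds it, "((RHS - 1) - (Start - Stride)) /u Stride", against the general formula, for every triple where Start - Stride does not wrap and Start - Stride < RHS.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// General formula: (max(RHS, Start) - Start) /uceil Stride, in wide arithmetic.
unsigned generalFormula(unsigned Start, unsigned RHS, unsigned Stride) {
  unsigned End = std::max(RHS, Start);
  return (End - Start + Stride - 1) / Stride;
}

// Refined formula: ((RHS - 1) - (Start - Stride)) /u Stride, with every
// intermediate value wrapped to 8 bits.
unsigned refinedFormula(uint8_t Start, uint8_t RHS, uint8_t Stride) {
  assert(Stride != 0 && "stride must be nonzero");
  uint8_t StartMinusStride = static_cast<uint8_t>(Start - Stride);
  uint8_t Numerator =
      static_cast<uint8_t>(static_cast<uint8_t>(RHS - 1) - StartMinusStride);
  return Numerator / Stride;
}

struct CheckResult {
  unsigned long Checked = 0;
  unsigned long Mismatches = 0;
};

// Exhaustive check over all u8 triples that satisfy both preconditions.
CheckResult checkAllU8() {
  CheckResult R;
  for (unsigned Start = 0; Start < 256; ++Start)
    for (unsigned RHS = 0; RHS < 256; ++RHS)
      for (unsigned Stride = 1; Stride < 256; ++Stride) {
        if (Start < Stride)
          continue; // Start - Stride would wrap
        if (!(Start - Stride < RHS))
          continue; // precondition: Start - Stride <u RHS
        ++R.Checked;
        unsigned Refined = refinedFormula(static_cast<uint8_t>(Start),
                                          static_cast<uint8_t>(RHS),
                                          static_cast<uint8_t>(Stride));
        if (Refined != generalFormula(Start, RHS, Stride))
          ++R.Mismatches;
      }
  return R;
}
```

If the no-wrap requirement is dropped, the two formulas can disagree. For Start = 0, RHS = 255, Stride = 3, the wrapped Start - Stride is 253, which is still below RHS, yet the refined formula gives 0 while the general one gives 85. This is the situation the MaySubOverflow check in the patch is meant to rule out.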

llvm/test/Analysis/ScalarEvolution/trip-count-unknown-stride.ll

	; RUN: opt < %s -analyze -enable-new-pm=0 -scalar-evolution | FileCheck %s			; RUN: opt < %s -analyze -enable-new-pm=0 -scalar-evolution | FileCheck %s
	; RUN: opt < %s -disable-output "-passes=print<scalar-evolution>" 2>&1 | FileCheck %s			; RUN: opt < %s -disable-output "-passes=print<scalar-evolution>" 2>&1 | FileCheck %s

	; ScalarEvolution should be able to compute trip count of the loop by proving			; ScalarEvolution should be able to compute trip count of the loop by proving
	; that this is not an infinite loop with side effects.			; that this is not an infinite loop with side effects.

	; CHECK-LABEL: Determining loop execution counts for: @foo1			; CHECK-LABEL: Determining loop execution counts for: @foo1
	; CHECK: backedge-taken count is ((-1 + (%n smax %s)) /u %s)			; CHECK: backedge-taken count is ((-1 + %n) /u %s)

	; We should have a conservative estimate for the max backedge taken count for			; We should have a conservative estimate for the max backedge taken count for
	; loops with unknown stride.			; loops with unknown stride.
	; CHECK: max backedge-taken count is -1			; CHECK: max backedge-taken count is -1

	target datalayout = "e-m:e-p:32:32-f64:32:64-f80:32-n8:16:32-S128"			target datalayout = "e-m:e-p:32:32-f64:32:64-f80:32-n8:16:32-S128"

	define void @foo1(i32* nocapture %A, i32 %n, i32 %s) mustprogress {			define void @foo1(i32* nocapture %A, i32 %n, i32 %s) mustprogress {
	▲ Show 20 Lines • Show All 170 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopReroll/nonconst_lb.ll

	Show All 11 Lines
	; }			; }
	;}			;}
	define void @foo(i32* nocapture %A, i32* nocapture readonly %B, i32 %m, i32 %n) {			define void @foo(i32* nocapture %A, i32* nocapture readonly %B, i32 %m, i32 %n) {
	; CHECK-LABEL: @foo(			; CHECK-LABEL: @foo(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[CMP34:%.*]] = icmp slt i32 [[M:%.*]], [[N:%.*]]			; CHECK-NEXT: [[CMP34:%.*]] = icmp slt i32 [[M:%.*]], [[N:%.*]]
	; CHECK-NEXT: br i1 [[CMP34]], label [[FOR_BODY_PREHEADER:%.*]], label [[FOR_END:%.*]]			; CHECK-NEXT: br i1 [[CMP34]], label [[FOR_BODY_PREHEADER:%.*]], label [[FOR_END:%.*]]
	; CHECK: for.body.preheader:			; CHECK: for.body.preheader:
	; CHECK-NEXT: [[TMP0:%.*]] = add i32 [[M]], 4			; CHECK-NEXT: [[TMP0:%.*]] = add i32 [[N]], -1
	; CHECK-NEXT: [[SMAX:%.*]] = call i32 @llvm.smax.i32(i32 [[N]], i32 [[TMP0]])			; CHECK-NEXT: [[TMP1:%.*]] = sub i32 [[TMP0]], [[M]]
	; CHECK-NEXT: [[TMP1:%.*]] = add i32 [[SMAX]], -1			; CHECK-NEXT: [[TMP2:%.*]] = lshr i32 [[TMP1]], 2
	; CHECK-NEXT: [[TMP2:%.*]] = sub i32 [[TMP1]], [[M]]			; CHECK-NEXT: [[TMP3:%.*]] = shl nuw i32 [[TMP2]], 2
	; CHECK-NEXT: [[TMP3:%.*]] = lshr i32 [[TMP2]], 2			; CHECK-NEXT: [[TMP4:%.*]] = add nuw nsw i32 [[TMP3]], 3
	; CHECK-NEXT: [[TMP4:%.*]] = shl nuw i32 [[TMP3]], 2
	; CHECK-NEXT: [[TMP5:%.*]] = add nuw nsw i32 [[TMP4]], 3
	; CHECK-NEXT: br label [[FOR_BODY:%.*]]			; CHECK-NEXT: br label [[FOR_BODY:%.*]]
	; CHECK: for.body:			; CHECK: for.body:
	; CHECK-NEXT: [[INDVAR:%.*]] = phi i32 [ 0, [[FOR_BODY_PREHEADER]] ], [ [[INDVAR_NEXT:%.*]], [[FOR_BODY]] ]			; CHECK-NEXT: [[INDVAR:%.*]] = phi i32 [ 0, [[FOR_BODY_PREHEADER]] ], [ [[INDVAR_NEXT:%.*]], [[FOR_BODY]] ]
	; CHECK-NEXT: [[TMP6:%.*]] = add i32 [[M]], [[INDVAR]]			; CHECK-NEXT: [[TMP5:%.*]] = add i32 [[M]], [[INDVAR]]
	; CHECK-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds i32, i32* [[B:%.*]], i32 [[TMP6]]			; CHECK-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds i32, i32* [[B:%.*]], i32 [[TMP5]]
	; CHECK-NEXT: [[TMP7:%.*]] = load i32, i32* [[ARRAYIDX]], align 4			; CHECK-NEXT: [[TMP6:%.*]] = load i32, i32* [[ARRAYIDX]], align 4
	; CHECK-NEXT: [[MUL:%.*]] = shl nsw i32 [[TMP7]], 2			; CHECK-NEXT: [[MUL:%.*]] = shl nsw i32 [[TMP6]], 2
	; CHECK-NEXT: [[ARRAYIDX2:%.*]] = getelementptr inbounds i32, i32* [[A:%.*]], i32 [[TMP6]]			; CHECK-NEXT: [[ARRAYIDX2:%.*]] = getelementptr inbounds i32, i32* [[A:%.*]], i32 [[TMP5]]
	; CHECK-NEXT: store i32 [[MUL]], i32* [[ARRAYIDX2]], align 4			; CHECK-NEXT: store i32 [[MUL]], i32* [[ARRAYIDX2]], align 4
	; CHECK-NEXT: [[INDVAR_NEXT]] = add i32 [[INDVAR]], 1			; CHECK-NEXT: [[INDVAR_NEXT]] = add i32 [[INDVAR]], 1
	; CHECK-NEXT: [[EXITCOND:%.*]] = icmp eq i32 [[INDVAR]], [[TMP5]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp eq i32 [[INDVAR]], [[TMP4]]
	; CHECK-NEXT: br i1 [[EXITCOND]], label [[FOR_END_LOOPEXIT:%.*]], label [[FOR_BODY]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[FOR_END_LOOPEXIT:%.*]], label [[FOR_BODY]]
	; CHECK: for.end.loopexit:			; CHECK: for.end.loopexit:
	; CHECK-NEXT: br label [[FOR_END]]			; CHECK-NEXT: br label [[FOR_END]]
	; CHECK: for.end:			; CHECK: for.end:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%cmp34 = icmp slt i32 %m, %n			%cmp34 = icmp slt i32 %m, %n
	▲ Show 20 Lines • Show All 123 Lines • Show Last 20 Lines
