This is an archive of the discontinued LLVM Phabricator instance.

Differential D109845

[SCEV] Correctly propagate nowrap flags across scopes when folding invariant add through addrec
Closed, Public

Authored by reames on Sep 15 2021, 12:26 PM.


Details

Reviewers

nikic
fhahn
mkazantsev
efriedma

Commits

rGf39978b84f1d: [SCEV] Correctly propagate nowrap flags across scopes when folding invariant…

Summary

This fixes a violation of the wrap flag rules introduced in c4048d8f. This is an alternate fix to D106852.

The basic problem being fixed is that we infer a set of flags which is valid at some inner scope S1 (usually by correctly propagating them from IR), and then (incorrectly) extend them to a SCEV in scope S2 where S1 != S2. This is not in general safe per the wrap flags semantics recently defined.
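To make the scope problem concrete, here is a minimal standalone sketch. It is not LLVM code: the function names and the widening-based overflow check are assumptions made purely for illustration. It shows an add that can never be observed to overflow inside a loop that does not execute, yet does overflow once it is evaluated unconditionally in an outer scope.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper: true if a + b does not fit in a signed 32-bit value.
bool addOverflows(int32_t a, int32_t b) {
  int64_t wide = static_cast<int64_t>(a) + static_cast<int64_t>(b);
  return wide > std::numeric_limits<int32_t>::max() ||
         wide < std::numeric_limits<int32_t>::min();
}

// Inner scope (S1): the add is only evaluated when the loop body runs.
// A "no signed wrap" fact justified here says nothing about executions
// that never enter the loop.
bool overflowObservedInLoop(int32_t start, int32_t offset, int32_t n) {
  for (int32_t i = 0; i < n; ++i)
    if (addOverflows(start, offset))
      return true;
  return false;
}

// Outer scope (S2): the same add evaluated unconditionally, as it would be
// after folding the invariant operand into the addrec's start value.
bool overflowObservedHoisted(int32_t start, int32_t offset) {
  return addOverflows(start, offset);
}
```

With a trip count of zero the inner add never runs, so no overflow is ever observed in S1, while the hoisted add in S2 overflows for the same operands. That is the gap the patch closes by requiring proof that entry into S2 implies execution of S1.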

In this patch, I include a simple inference step to handle the case where we can prove that S2 is the preheader of the loop S1, and that entry into S2 implies execution of S1. See the code for a more detailed explanation. I'd welcome input on how to reword the comment to make it easier to follow. I believe the reasoning is sound, but it's super hard to follow.

One worry I have with this patch is that I might be over-fitting what shows up in tests - and thus hiding negative impact we'd see in the real world. My best defense is that the rule used here very closely follows the one used to propagate the flags from IR to the inner add to start with, and thus if one is reasonable, so probably is the other. Curious what others think about that piece.

The test diffs are roughly as expected. Mostly analysis only, with two transform changes. Oddly, the result looks better in the loop-idiom test, and I don't understand the PPC output well enough to tell. Nothing terrible looking though. (For context, without the scope inference peephole, the test delta includes a couple of vectorization tests. Again, not super concerning, but slightly more so.)

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

reames created this revision. · Sep 15 2021, 12:26 PM

Herald added subscribers: javed.absar, bollu, hiraditya and 2 others. · Sep 15 2021, 12:26 PM

reames requested review of this revision. · Sep 15 2021, 12:26 PM

Herald added a project: Restricted Project. · Sep 15 2021, 12:26 PM

reames mentioned this in rG248e430f37c8: precommit test for D109845/D106852. · Sep 15 2021, 12:54 PM

Harbormaster completed remote builds in B124061: Diff 372769. · Sep 15 2021, 12:55 PM

Rebase over additional tests landed in 248e430f3. These were the reduced ones arrived at during discussion; I'd apparently never actually landed them.

reames mentioned this in D106852: [SCEV] Fix getAddExpr for adding loop invariants into start of some AddRec. · Sep 15 2021, 1:00 PM

Harbormaster completed remote builds in B124069: Diff 372779. · Sep 15 2021, 1:41 PM

mkazantsev added inline comments. · Sep 24 2021, 7:58 PM

llvm/lib/Analysis/ScalarEvolution.cpp
2808	Preheader has single successor by definition.
2825	Do we reject constants due to some block-related reasoning? Aren't they always available?
2831	Imagine the situation where the inner loop is entered (and the existing checks basically only test that), but always side-exits on the 1st iteration in a call that is inside the inner loop. So whatever flags the inner AddRec has, they don't matter because they will never actually come into play. Will they still be propagated to the outside?

reames added inline comments. · Sep 27 2021, 2:20 PM

llvm/lib/Analysis/ScalarEvolution.cpp
2808	Yep, good catch. This is overly general from an earlier version of the code which had broader scope.
2825	I don't follow your question? We're analyzing a non-constant expression. If we find a constant operand, we need to prove that this occurrence of the expression must execute on entry to the function. Does that answer your concern?
2831	I believe this is still fine. Here's my reasoning: The flags being present on the inner addrec disallows the case where the call is before all UB triggering uses of the inner addrec. See usage of programUndefinedIfPoison in isSCEVExprNeverPoison. As such, we know that if we reach the inner loop, we must execute UB if the inner add-rec's flags produce poison. We also know that the inner add (not addrec!) must also trigger UB on overflow. (From argument) Together, those let us know that the loop invariant add can have flags within the inner loop. (This doesn't correspond to a SCEV, but is a useful mental stepping stone.) All that's left is proving that all instances of the add in the defining scope must reach the one in the inner loop. Thus, we're good. (Assuming that all flags were set correctly to start at least.)
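The decision described in this thread can be sketched in isolation as follows. This is a simplified model, not the code from the patch: `OperandScope`, `flagsForHoistedAdd`, and the flag enum are hypothetical stand-ins, and the "reaches the loop header" bit is assumed to have been computed elsewhere (in the patch that role is played by `getDefinedScopeRoot` together with `isGuaranteedToTransferExecutionTo`).

```cpp
#include <algorithm>
#include <vector>

// Illustrative flag values only; they are not meant to match SCEV's enum.
enum NoWrapFlags : unsigned { FlagAnyWrap = 0, FlagNUW = 1u << 1, FlagNSW = 1u << 2 };

// Hypothetical summary of one loop-invariant operand: is execution that
// starts at its defining scope known to reach the loop header?
struct OperandScope {
  bool reachesLoopHeader;
};

// Keep the flags computed in the inner (addrec) scope only if at least one
// operand's defining scope is known to reach the loop. Otherwise fall back
// to "any wrap", which is always a safe answer.
unsigned flagsForHoistedAdd(const std::vector<OperandScope> &ops,
                            unsigned innerFlags) {
  bool canPropagate =
      std::any_of(ops.begin(), ops.end(),
                  [](const OperandScope &s) { return s.reachesLoopHeader; });
  return canPropagate ? innerFlags : FlagAnyWrap;
}
```

Dropping to `FlagAnyWrap` loses precision but never correctness, which is why the check can afford to be a cheap, incomplete proof.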

reames mentioned this in D109789: [SCEV] Stop blindly propagating flags from inbound geps to SCEV nodes. · Sep 30 2021, 2:24 PM

reames mentioned this in rG91dfc0840ddf: [test] add coverage for a SCEVUnknown scoped value in isSCEVExprNeverPoison. · Oct 1 2021, 4:47 PM

reames mentioned this in D111001: [SCEV] Infer flags from binop when scope bounded by unknown. · Oct 2 2021, 12:20 PM

reames added a child revision: D111001: [SCEV] Infer flags from binop when scope bounded by unknown.

reames mentioned this in rG26223af256bb: [SCEV] Split isSCEVExprNeverPoison reasoning explicitly into scope and…. · Oct 2 2021, 1:10 PM

Rebase over conceptual split introduced in 26223af. Resulting code is much easier to follow and reason about.

reames mentioned this in D111003: [SCEV] Use full logic when infering flags on add and gep. · Oct 2 2021, 2:41 PM

nikic added inline comments. · Oct 2 2021, 3:03 PM

llvm/lib/Analysis/ScalarEvolution.cpp
6598	Might make sense to just make `&*F.getEntryBlock().begin()` the fallback return value? That one should hold for all SCEVs, not just SCEVConstant in particular.
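A rough model of this suggestion, with every name hypothetical and program points reduced to plain integers, might look like the sketch below. The idea illustrated is only that the function entry is a valid (if imprecise) bound for any expression, so it can serve as the default instead of being special-cased for constants.

```cpp
// Hypothetical expression kinds, loosely mirroring the cases in the patch.
enum class ExprKind { AddRec, Constant, UnknownInstruction, UnknownOther, Add };

// Hypothetical program point; 0 stands for the first instruction of the
// function's entry block.
using ProgramPoint = int;
constexpr ProgramPoint kFunctionEntry = 0;

struct Expr {
  ExprKind kind;
  ProgramPoint anchor; // loop-header start (AddRec) or defining instruction
};

// Return a point at which the expression is known to be defined. Kinds with
// a more precise anchor use it; everything else falls back to the entry.
ProgramPoint definingScopeBound(const Expr &e) {
  switch (e.kind) {
  case ExprKind::AddRec:
  case ExprKind::UnknownInstruction:
    return e.anchor;
  default:
    return kFunctionEntry;
  }
}
```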

reames added inline comments. · Oct 2 2021, 3:40 PM

llvm/lib/Analysis/ScalarEvolution.cpp
6598	Hm, with the current code structure, you're right. Once I build the recursion interface, I'll want to go back to the nullptr form (as a signal to recurse), but this is a good idea.

reames added inline comments. · Oct 2 2021, 4:11 PM

llvm/lib/Analysis/ScalarEvolution.cpp
6598	Ran a test with this, and don't really see much meaningful test diff. Still worth doing, but do you mind if I leave it to a follow on patch?

LGTM

llvm/lib/Analysis/ScalarEvolution.cpp
6598	I'm fine with that, but also don't really see the point of landing it separately. The suggestion was mostly intended as code cleanup, as I found the current treatment of SCEVConstant unintuitive.
6607	These instruction walks have pathological cases for large BBs. We should limit this walk to a small constant number of instructions. E.g. the `programUndefinedIfPoison()` check looks through 32 guaranteed-to-execute instructions. (This is fine as a followup.) Alternatively we could add an ICF-style cache for this, but I am somewhat hesitant to suggest adding yet another cache that requires invalidation to SCEV.
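As an illustration of the capped walk suggested here, the following standalone sketch bounds the scan length and gives up conservatively when the range is too long. It is not the change that landed as the follow-up: `Inst`, `rangeTransfersExecution`, and the default limit of 32 (borrowed from the `programUndefinedIfPoison()` figure quoted above) are assumptions for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for an instruction: true if control is known to
// continue to the next instruction (no throw, no non-returning call, ...).
struct Inst {
  bool transfersToSuccessor;
};

// Returns true only if every instruction in [begin, end) is known to pass
// control onward and the range is at most maxScan instructions long.
// Returning false for long ranges is conservative: the caller simply fails
// to prove the property instead of paying for a walk over a huge block.
bool rangeTransfersExecution(const std::vector<Inst> &block, std::size_t begin,
                             std::size_t end, std::size_t maxScan = 32) {
  assert(begin <= end && end <= block.size());
  if (end - begin > maxScan)
    return false;
  for (std::size_t i = begin; i < end; ++i)
    if (!block[i].transfersToSuccessor)
      return false;
  return true;
}
```

A fixed cap keeps each query constant-time without introducing a cache that would need invalidation, at the cost of missing some provable cases in very large blocks.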

This revision is now accepted and ready to land. · Oct 3 2021, 1:38 AM

Closed by commit rGf39978b84f1d: [SCEV] Correctly propagate nowrap flags across scopes when folding invariant… (authored by reames). · Oct 3 2021, 3:23 PM

This revision was automatically updated to reflect the committed changes.

reames added a commit: rGf39978b84f1d: [SCEV] Correctly propagate nowrap flags across scopes when folding invariant….

reames mentioned this in rGd02db32644b7: [SCEV] Use full logic when infering flags on add and gep. · Oct 3 2021, 3:34 PM

reames mentioned this in rG35ab211c3753: [SCEV] Use trivial bound on defining scope of all SCEVs when computing flags. · Oct 3 2021, 4:15 PM

reames mentioned this in rG5f7a5353301b: [SCEV] Cap the number of instructions scanned when infering flags.

reames added inline comments. · Oct 3 2021, 4:17 PM

llvm/lib/Analysis/ScalarEvolution.cpp
6598	Addressed in 35ab21. As you'll note, I needed to rename a bit and restructure the code to make the bounding explicit to make it clear to me. :)
6607	Addressed in 5f7a53.

Revision Contents

Path	Size
llvm/lib/Analysis/ScalarEvolution.cpp	54 lines
llvm/test/Analysis/ScalarEvolution/flags-from-poison.ll	4 lines
llvm/test/Analysis/ScalarEvolution/incorrect-exit-count.ll	4 lines
llvm/test/Analysis/ScalarEvolution/no-wrap-add-exprs.ll	10 lines
llvm/test/Analysis/ScalarEvolution/nsw-offset-assume.ll	4 lines
llvm/test/Analysis/ScalarEvolution/nsw-offset.ll	4 lines
llvm/test/CodeGen/PowerPC/lsr-profitable-chain.ll	94 lines
llvm/test/Transforms/LoopIdiom/basic.ll	28 lines

Diff 376792

llvm/lib/Analysis/ScalarEvolution.cpp


[2,775 lines not shown]
     for (unsigned i = 0, e = Ops.size(); i != e; ++i)
         LIOps.push_back(Ops[i]);
         Ops.erase(Ops.begin()+i);
         --i; --e;
       }

     // If we found some loop invariants, fold them into the recurrence.
     if (!LIOps.empty()) {
       // Compute nowrap flags for the addition of the loop-invariant ops and
-      // the addrec. Temporarily push it as an operand for that purpose.
+      // the addrec. Temporarily push it as an operand for that purpose. These
+      // flags are valid in the scope of the addrec only.
       LIOps.push_back(AddRec);
       SCEV::NoWrapFlags Flags = ComputeFlags(LIOps);
       LIOps.pop_back();

       // NLI + LI + {Start,+,Step} --> NLI + {LI+Start,+,Step}
       LIOps.push_back(AddRec->getStart());

       SmallVector<const SCEV *, 4> AddRecOps(AddRec->operands());
-      // This follows from the fact that the no-wrap flags on the outer add
-      // expression are applicable on the 0th iteration, when the add recurrence
-      // will be equal to its start value.
-      AddRecOps[0] = getAddExpr(LIOps, Flags, Depth + 1);
+      // It is not in general safe to propagate flags valid on an add within
+      // the addrec scope to one outside it. We must prove that the inner
+      // scope is guaranteed to execute if the outer one does to be able to
+      // safely propagate. We know the program is undefined if poison is
+      // produced on the inner scoped addrec. We also know that for this use
+      // the outer scoped add can't overflow (because of the flags we just
+      // computed for the inner scoped add) without the program being undefined.
+      // Proving that entry to the outer scope neccesitates entry to the inner
+      // scope, thus proves the program undefined if the flags would be violated
+      // in the outer scope.
+      const bool CanPropagateFlags = llvm::any_of(LIOps, [&](const SCEV *S) {
+        auto *ReachI = &*AddRecLoop->getHeader()->begin();
+        if (auto *DefI = getDefinedScopeRoot(S))
+          if (isGuaranteedToTransferExecutionTo(DefI, ReachI))
+            return true;
+        return false;
+      });
+      auto AddFlags = CanPropagateFlags ? Flags : SCEV::FlagAnyWrap;
+      AddRecOps[0] = getAddExpr(LIOps, AddFlags, Depth + 1);

       // Build the new addrec. Propagate the NUW and NSW flags if both the
       // outer add and the inner addrec are guaranteed to have no overflow.
       // Always propagate NW.
       Flags = AddRec->getNoWrapFlags(setFlags(Flags, SCEV::FlagNW));
       const SCEV *NewRec = getAddRecExpr(AddRecOps, AddRecLoop, Flags);

       // If all of the other operands were loop invariant, we are done.
       if (Ops.size() == 1) return NewRec;

       // Otherwise, add the folded AddRec by the non-invariant parts.
       for (unsigned i = 0;; ++i)
         if (Ops[i] == AddRec) {
           Ops[i] = NewRec;
           break;
         }
       return getAddExpr(Ops, SCEV::FlagAnyWrap, Depth + 1);
     }

     // Okay, if there weren't any loop invariants to be folded, check to see if
     // there are multiple AddRec's with the same loop induction variable being
     // added together. If so, we can fold them.
     for (unsigned OtherIdx = Idx+1;
          OtherIdx < Ops.size() && isa<SCEVAddRecExpr>(Ops[OtherIdx]);
          ++OtherIdx) {
       // We expect the AddRecExpr's to be sorted in reverse dominance order,
[3,744 lines not shown]
   if (Flags == SCEV::FlagAnyWrap)
     return SCEV::FlagAnyWrap;

   return isSCEVExprNeverPoison(BinOp) ? Flags : SCEV::FlagAnyWrap;
 }

 const Instruction *ScalarEvolution::getDefinedScopeRoot(const SCEV *S) {
   if (auto *AddRec = dyn_cast<SCEVAddRecExpr>(S))
     return &*AddRec->getLoop()->getHeader()->begin();
-  // TODO: add SCEVConstant and SCEVUnknown caxes here
+  if (isa<SCEVConstant>(S))
+    return &*F.getEntryBlock().begin();
+  if (auto *U = dyn_cast<SCEVUnknown>(S)) {
+    if (auto *I = dyn_cast<Instruction>(U->getValue()))
+      return I;
+    return &*F.getEntryBlock().begin();
+  }
   return nullptr;
 }

 static bool
 isGuaranteedToTransferExecutionToSuccessor(BasicBlock::const_iterator Begin,
                                            BasicBlock::const_iterator End) {
   return llvm::all_of( make_range(Begin, End), [](const Instruction &I) {
     return isGuaranteedToTransferExecutionToSuccessor(&I);
   });
 }

 bool ScalarEvolution::isGuaranteedToTransferExecutionTo(const Instruction *A,
                                                         const Instruction *B) {
   if (A->getParent() == B->getParent() &&
       ::isGuaranteedToTransferExecutionToSuccessor(A->getIterator(),
                                                    B->getIterator()))
     return true;

+  auto *BLoop = LI.getLoopFor(B->getParent());
+  if (BLoop && BLoop->getHeader() == B->getParent() &&
+      BLoop->getLoopPreheader() == A->getParent() &&
+      ::isGuaranteedToTransferExecutionToSuccessor(A->getIterator(),
+                                                   A->getParent()->end()) &&
+      ::isGuaranteedToTransferExecutionToSuccessor(B->getParent()->begin(),
+                                                   B->getIterator()))
+    return true;
   return false;
 }


 bool ScalarEvolution::isSCEVExprNeverPoison(const Instruction *I) {
   // Here we check that I is in the header of the innermost loop containing I,
   // since we only deal with instructions in the loop header. The actual loop we
   // need to check later will come from an add recurrence, but getting that
[19 lines not shown]
   // executed every time we enter that scope. When the bounding scope is a
   // loop (the common case), this is equivalent to proving I executes on every
   // iteration of that loop.
   for (const Use &Op : I->operands()) {
     // I could be an extractvalue from a call to an overflow intrinsic.
     // TODO: We can do better here in some cases.
     if (!isSCEVable(Op->getType()))
       return false;
-    if (auto *DefI = getDefinedScopeRoot(getSCEV(Op)))
-      if (isGuaranteedToTransferExecutionTo(DefI, I))
+    // TODO: the following two lines should be:
+    // if (auto *DefI = getDefinedScopeRoot(getSCEV(Op)))
+    // if (isGuaranteedToTransferExecutionTo(DefI, I))
+    // We use the following instead for the purposes of seperating a bugfix
+    // change from an optimization change. Once pr51817 is fully addressed,
+    // we should unlock this power.
+    if (auto *AddRecS = dyn_cast<SCEVAddRecExpr>(getSCEV(Op)))
+      if (isGuaranteedToExecuteForEveryIteration(I, AddRecS->getLoop()))
         return true;
   }
   return false;
 }

 bool ScalarEvolution::isAddRecNeverPoison(const Instruction *I, const Loop *L) {
   // If we know that \c I can never be poison period, then that's enough.
   if (isSCEVExprNeverPoison(I))
[7,065 lines not shown]

llvm/test/Analysis/ScalarEvolution/flags-from-poison.ll

[1,322 lines not shown]
 define void @test-sub-no-nsw(float* %input, i32 %start, i32 %sub, i32 %numIterations) {
 ; CHECK-LABEL: 'test-sub-no-nsw'
 ; CHECK-NEXT: Classifying expressions for: @test-sub-no-nsw
 ; CHECK-NEXT: %i = phi i32 [ %nexti, %loop ], [ %start, %entry ]
 ; CHECK-NEXT: --> {%start,+,1}<nsw><%loop> U: full-set S: full-set Exits: (-1 + %numIterations) LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: %index32 = sub nsw i32 %i, %sub
 ; CHECK-NEXT: --> {((-1 * %sub) + %start),+,1}<nw><%loop> U: full-set S: full-set Exits: (-1 + (-1 * %sub) + %numIterations) LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: %index64 = sext i32 %index32 to i64
-; CHECK-NEXT: --> {((sext i32 %start to i64) + (-1 * (sext i32 %sub to i64))<nsw>)<nsw>,+,1}<nsw><%loop> U: [-4294967295,8589934591) S: [-4294967295,8589934591) Exits: ((zext i32 (-1 + (-1 * %start) + %numIterations) to i64) + (sext i32 %start to i64) + (-1 * (sext i32 %sub to i64))<nsw>) LoopDispositions: { %loop: Computable }
+; CHECK-NEXT: --> {((sext i32 %start to i64) + (-1 * (sext i32 %sub to i64))<nsw>),+,1}<nsw><%loop> U: [-4294967295,8589934591) S: [-4294967295,8589934591) Exits: ((zext i32 (-1 + (-1 * %start) + %numIterations) to i64) + (sext i32 %start to i64) + (-1 * (sext i32 %sub to i64))<nsw>) LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: %ptr = getelementptr inbounds float, float* %input, i64 %index64
 ; CHECK-NEXT: --> {((4 * (sext i32 %start to i64))<nsw> + (-4 * (sext i32 %sub to i64))<nsw> + %input),+,4}<nw><%loop> U: full-set S: full-set Exits: ((4 * (zext i32 (-1 + (-1 * %start) + %numIterations) to i64))<nuw><nsw> + (4 * (sext i32 %start to i64))<nsw> + (-4 * (sext i32 %sub to i64))<nsw> + %input) LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: %nexti = add nsw i32 %i, 1
 ; CHECK-NEXT: --> {(1 + %start),+,1}<nsw><%loop> U: full-set S: full-set Exits: %numIterations LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: Determining loop execution counts for: @test-sub-no-nsw
 ; CHECK-NEXT: Loop %loop: backedge-taken count is (-1 + (-1 * %start) + %numIterations)
 ; CHECK-NEXT: Loop %loop: max backedge-taken count is -1
 ; CHECK-NEXT: Loop %loop: Predicated backedge-taken count is (-1 + (-1 * %start) + %numIterations)
[24 lines not shown]
 ; CHECK-NEXT: Classifying expressions for: @test-sub-nsw
 ; CHECK-NEXT: %halfsub = ashr i32 %sub, 1
 ; CHECK-NEXT: --> %halfsub U: [-1073741824,1073741824) S: [-1073741824,1073741824)
 ; CHECK-NEXT: %i = phi i32 [ %nexti, %loop ], [ %start, %entry ]
 ; CHECK-NEXT: --> {%start,+,1}<nsw><%loop> U: full-set S: full-set Exits: (-1 + %numIterations) LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: %index32 = sub nsw i32 %i, %halfsub
 ; CHECK-NEXT: --> {((-1 * %halfsub)<nsw> + %start)<nsw>,+,1}<nsw><%loop> U: full-set S: full-set Exits: (-1 + (-1 * %halfsub)<nsw> + %numIterations) LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: %index64 = sext i32 %index32 to i64
-; CHECK-NEXT: --> {((sext i32 %start to i64) + (-1 * (sext i32 %halfsub to i64))<nsw>)<nsw>,+,1}<nsw><%loop> U: [-3221225471,7516192767) S: [-3221225471,7516192767) Exits: ((zext i32 (-1 + (-1 * %start) + %numIterations) to i64) + (sext i32 %start to i64) + (-1 * (sext i32 %halfsub to i64))<nsw>) LoopDispositions: { %loop: Computable }
+; CHECK-NEXT: --> {((sext i32 %start to i64) + (-1 * (sext i32 %halfsub to i64))<nsw>),+,1}<nsw><%loop> U: [-3221225471,7516192767) S: [-3221225471,7516192767) Exits: ((zext i32 (-1 + (-1 * %start) + %numIterations) to i64) + (sext i32 %start to i64) + (-1 * (sext i32 %halfsub to i64))<nsw>) LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: %ptr = getelementptr inbounds float, float* %input, i64 %index64
 ; CHECK-NEXT: --> {((4 * (sext i32 %start to i64))<nsw> + (-4 * (sext i32 %halfsub to i64))<nsw> + %input),+,4}<nw><%loop> U: full-set S: full-set Exits: ((4 * (zext i32 (-1 + (-1 * %start) + %numIterations) to i64))<nuw><nsw> + (4 * (sext i32 %start to i64))<nsw> + (-4 * (sext i32 %halfsub to i64))<nsw> + %input) LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: %nexti = add nsw i32 %i, 1
 ; CHECK-NEXT: --> {(1 + %start),+,1}<nsw><%loop> U: full-set S: full-set Exits: %numIterations LoopDispositions: { %loop: Computable }
 ; CHECK-NEXT: Determining loop execution counts for: @test-sub-nsw
 ; CHECK-NEXT: Loop %loop: backedge-taken count is (-1 + (-1 * %start) + %numIterations)
 ; CHECK-NEXT: Loop %loop: max backedge-taken count is -1
 ; CHECK-NEXT: Loop %loop: Predicated backedge-taken count is (-1 + (-1 * %start) + %numIterations)
[242 lines not shown]

llvm/test/Analysis/ScalarEvolution/incorrect-exit-count.ll

[15 lines not shown]
 ; CHECK-NEXT: Classifying expressions for: @f
 ; CHECK-NEXT: %storemerge23 = phi i32 [ 3, %entry ], [ %dec16, %for.inc13.3 ]
 ; CHECK-NEXT: --> {3,+,-1}<nsw><%outer.loop> U: [1,4) S: [1,4) Exits: <<Unknown>> LoopDispositions: { %outer.loop: Computable, %for.cond6: Invariant, %inner.loop: Invariant }
 ; CHECK-NEXT: %storemerge1921 = phi i32 [ 3, %outer.loop ], [ %dec, %for.end ]
 ; CHECK-NEXT: --> {3,+,-1}<nuw><nsw><%for.cond6> U: [3,4) S: [3,4) Exits: <<Unknown>> LoopDispositions: { %for.cond6: Computable, %outer.loop: Variant }
 ; CHECK-NEXT: %idxprom20 = zext i32 %storemerge1921 to i64
 ; CHECK-NEXT: --> {3,+,4294967295}<nuw><nsw><%for.cond6> U: [3,4) S: [3,4) Exits: <<Unknown>> LoopDispositions: { %for.cond6: Computable, %outer.loop: Variant }
 ; CHECK-NEXT: %arrayidx7 = getelementptr inbounds [1 x [4 x i16]], [1 x [4 x i16]]* @__const.f.g, i64 0, i64 0, i64 %idxprom20
-; CHECK-NEXT: --> {(6 + @__const.f.g)<nuw>,+,8589934590}<nuw><%for.cond6> U: [6,-1) S: [-9223372036854775808,9223372036854775807) Exits: <<Unknown>> LoopDispositions: { %for.cond6: Computable, %outer.loop: Variant }
+; CHECK-NEXT: --> {(6 + @__const.f.g),+,8589934590}<nuw><%for.cond6> U: [0,-1) S: [-9223372036854775808,9223372036854775807) Exits: <<Unknown>> LoopDispositions: { %for.cond6: Computable, %outer.loop: Variant }
 ; CHECK-NEXT: %i = load i16, i16* %arrayidx7, align 2
 ; CHECK-NEXT: --> %i U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %for.cond6: Variant, %outer.loop: Variant }
 ; CHECK-NEXT: %storemerge1822.lcssa.ph = phi i32 [ 0, %for.cond6 ]
 ; CHECK-NEXT: --> 0 U: [0,1) S: [0,1)
 ; CHECK-NEXT: %storemerge1822.lcssa.ph32 = phi i32 [ 3, %inner.loop ]
 ; CHECK-NEXT: --> 3 U: [3,4) S: [3,4)
 ; CHECK-NEXT: %storemerge1822.lcssa = phi i32 [ %storemerge1822.lcssa.ph, %if.end.loopexit ], [ %storemerge1822.lcssa.ph32, %if.end.loopexit31 ]
	; CHECK-NEXT: --> %storemerge1822.lcssa U: [0,4) S: [0,4)			; CHECK-NEXT: --> %storemerge1822.lcssa U: [0,4) S: [0,4)
	; CHECK-NEXT: %i1 = load i32, i32* @e, align 4			; CHECK-NEXT: %i1 = load i32, i32* @e, align 4
	; CHECK-NEXT: --> %i1 U: full-set S: full-set			; CHECK-NEXT: --> %i1 U: full-set S: full-set
	; CHECK-NEXT: %i2 = load volatile i32, i32* @b, align 4			; CHECK-NEXT: %i2 = load volatile i32, i32* @b, align 4
	; CHECK-NEXT: --> %i2 U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %for.cond6: Variant, %outer.loop: Variant }			; CHECK-NEXT: --> %i2 U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %for.cond6: Variant, %outer.loop: Variant }
	; CHECK-NEXT: %dec = add nsw i32 %storemerge1921, -1			; CHECK-NEXT: %dec = add nsw i32 %storemerge1921, -1
	; CHECK-NEXT: --> {2,+,-1}<nsw><%for.cond6> U: [2,3) S: [2,3) Exits: <<Unknown>> LoopDispositions: { %for.cond6: Computable, %outer.loop: Variant }			; CHECK-NEXT: --> {2,+,-1}<nsw><%for.cond6> U: [2,3) S: [2,3) Exits: <<Unknown>> LoopDispositions: { %for.cond6: Computable, %outer.loop: Variant }
	; CHECK-NEXT: %inc.lcssa.lcssa = phi i32 [ 4, %for.inc13.3 ]			; CHECK-NEXT: %inc.lcssa.lcssa = phi i32 [ 4, %for.inc13.3 ]
	; CHECK-NEXT: --> 4 U: [4,5) S: [4,5)			; CHECK-NEXT: --> 4 U: [4,5) S: [4,5)
	; CHECK-NEXT: %retval.0 = phi i32 [ %i1, %if.end ], [ 0, %cleanup.loopexit ]			; CHECK-NEXT: %retval.0 = phi i32 [ %i1, %if.end ], [ 0, %cleanup.loopexit ]
	; CHECK-NEXT: --> %retval.0 U: full-set S: full-set			; CHECK-NEXT: --> %retval.0 U: full-set S: full-set
	; CHECK-NEXT: %storemerge1921.3 = phi i32 [ 3, %for.end ], [ %dec.3, %for.end.3 ]			; CHECK-NEXT: %storemerge1921.3 = phi i32 [ 3, %for.end ], [ %dec.3, %for.end.3 ]
	; CHECK-NEXT: --> {3,+,-1}<nuw><nsw><%inner.loop> U: [3,4) S: [3,4) Exits: <<Unknown>> LoopDispositions: { %inner.loop: Computable, %outer.loop: Variant }			; CHECK-NEXT: --> {3,+,-1}<nuw><nsw><%inner.loop> U: [3,4) S: [3,4) Exits: <<Unknown>> LoopDispositions: { %inner.loop: Computable, %outer.loop: Variant }
	; CHECK-NEXT: %idxprom20.3 = zext i32 %storemerge1921.3 to i64			; CHECK-NEXT: %idxprom20.3 = zext i32 %storemerge1921.3 to i64
	; CHECK-NEXT: --> {3,+,4294967295}<nuw><nsw><%inner.loop> U: [3,4) S: [3,4) Exits: <<Unknown>> LoopDispositions: { %inner.loop: Computable, %outer.loop: Variant }			; CHECK-NEXT: --> {3,+,4294967295}<nuw><nsw><%inner.loop> U: [3,4) S: [3,4) Exits: <<Unknown>> LoopDispositions: { %inner.loop: Computable, %outer.loop: Variant }
	; CHECK-NEXT: %arrayidx7.3 = getelementptr inbounds [1 x [4 x i16]], [1 x [4 x i16]]* @__const.f.g, i64 0, i64 0, i64 %idxprom20.3			; CHECK-NEXT: %arrayidx7.3 = getelementptr inbounds [1 x [4 x i16]], [1 x [4 x i16]]* @__const.f.g, i64 0, i64 0, i64 %idxprom20.3
	; CHECK-NEXT: --> {(6 + @__const.f.g)<nuw>,+,8589934590}<nuw><%inner.loop> U: [6,-1) S: [-9223372036854775808,9223372036854775807) Exits: <<Unknown>> LoopDispositions: { %inner.loop: Computable, %outer.loop: Variant }			; CHECK-NEXT: --> {(6 + @__const.f.g),+,8589934590}<nuw><%inner.loop> U: [0,-1) S: [-9223372036854775808,9223372036854775807) Exits: <<Unknown>> LoopDispositions: { %inner.loop: Computable, %outer.loop: Variant }
	; CHECK-NEXT: %i7 = load i16, i16* %arrayidx7.3, align 2			; CHECK-NEXT: %i7 = load i16, i16* %arrayidx7.3, align 2
	; CHECK-NEXT: --> %i7 U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %inner.loop: Variant, %outer.loop: Variant }			; CHECK-NEXT: --> %i7 U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %inner.loop: Variant, %outer.loop: Variant }
	; CHECK-NEXT: %i8 = load volatile i32, i32* @b, align 4			; CHECK-NEXT: %i8 = load volatile i32, i32* @b, align 4
	; CHECK-NEXT: --> %i8 U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %inner.loop: Variant, %outer.loop: Variant }			; CHECK-NEXT: --> %i8 U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %inner.loop: Variant, %outer.loop: Variant }
	; CHECK-NEXT: %dec.3 = add nsw i32 %storemerge1921.3, -1			; CHECK-NEXT: %dec.3 = add nsw i32 %storemerge1921.3, -1
	; CHECK-NEXT: --> {2,+,-1}<nsw><%inner.loop> U: [2,3) S: [2,3) Exits: <<Unknown>> LoopDispositions: { %inner.loop: Computable, %outer.loop: Variant }			; CHECK-NEXT: --> {2,+,-1}<nsw><%inner.loop> U: [2,3) S: [2,3) Exits: <<Unknown>> LoopDispositions: { %inner.loop: Computable, %outer.loop: Variant }
	; CHECK-NEXT: %storemerge1921.lcssa25.3 = phi i32 [ %storemerge1921.3, %for.end.3 ]			; CHECK-NEXT: %storemerge1921.lcssa25.3 = phi i32 [ %storemerge1921.3, %for.end.3 ]
	; CHECK-NEXT: --> %storemerge1921.lcssa25.3 U: [3,4) S: [3,4) Exits: <<Unknown>> LoopDispositions: { %outer.loop: Variant, %for.cond6: Invariant, %inner.loop: Invariant }			; CHECK-NEXT: --> %storemerge1921.lcssa25.3 U: [3,4) S: [3,4) Exits: <<Unknown>> LoopDispositions: { %outer.loop: Variant, %for.cond6: Invariant, %inner.loop: Invariant }

llvm/test/Analysis/ScalarEvolution/no-wrap-add-exprs.ll

	; and (as shown in the _b variant) the printer frequently makes the actual			; and (as shown in the _b variant) the printer frequently makes the actual
	; bug very hard to see.			; bug very hard to see.
	define i1 @test2_a(i32 %a, i32 %b, i1 %will_overflow) {			define i1 @test2_a(i32 %a, i32 %b, i1 %will_overflow) {
	; CHECK-LABEL: 'test2_a'			; CHECK-LABEL: 'test2_a'
	; CHECK-NEXT: Classifying expressions for: @test2_a			; CHECK-NEXT: Classifying expressions for: @test2_a
	; CHECK-NEXT: %iv = phi i32 [ %a, %entry ], [ %iv.next, %loop ]			; CHECK-NEXT: %iv = phi i32 [ %a, %entry ], [ %iv.next, %loop ]
	; CHECK-NEXT: --> {%a,+,%b}<nuw><nsw><%loop> U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }			; CHECK-NEXT: --> {%a,+,%b}<nuw><nsw><%loop> U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }
	; CHECK-NEXT: %iv.next = add nuw nsw i32 %iv, %b			; CHECK-NEXT: %iv.next = add nuw nsw i32 %iv, %b
	; CHECK-NEXT: --> {(%a + %b)<nuw><nsw>,+,%b}<nuw><nsw><%loop> U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }			; CHECK-NEXT: --> {(%a + %b),+,%b}<nuw><nsw><%loop> U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }
	; CHECK-NEXT: %trap = udiv i32 %a, %iv.next			; CHECK-NEXT: %trap = udiv i32 %a, %iv.next
	; CHECK-NEXT: --> (%a /u {(%a + %b)<nuw><nsw>,+,%b}<nuw><nsw><%loop>) U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }			; CHECK-NEXT: --> (%a /u {(%a + %b),+,%b}<nuw><nsw><%loop>) U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }
	; CHECK-NEXT: %c = add i32 %a, %b			; CHECK-NEXT: %c = add i32 %a, %b
	; CHECK-NEXT: --> (%a + %b)<nuw><nsw> U: full-set S: full-set			; CHECK-NEXT: --> (%a + %b) U: full-set S: full-set
	; CHECK-NEXT: Determining loop execution counts for: @test2_a			; CHECK-NEXT: Determining loop execution counts for: @test2_a
	; CHECK-NEXT: Loop %loop: Unpredictable backedge-taken count.			; CHECK-NEXT: Loop %loop: Unpredictable backedge-taken count.
	; CHECK-NEXT: Loop %loop: Unpredictable max backedge-taken count.			; CHECK-NEXT: Loop %loop: Unpredictable max backedge-taken count.
	; CHECK-NEXT: Loop %loop: Unpredictable predicated backedge-taken count.			; CHECK-NEXT: Loop %loop: Unpredictable predicated backedge-taken count.
	;			;
	entry:			entry:
	br i1 %will_overflow, label %exit1, label %loop			br i1 %will_overflow, label %exit1, label %loop

	define i1 @test2_b(i32 %a, i32 %b, i1 %will_overflow) {			define i1 @test2_b(i32 %a, i32 %b, i1 %will_overflow) {
	; CHECK-LABEL: 'test2_b'			; CHECK-LABEL: 'test2_b'
	; CHECK-NEXT: Classifying expressions for: @test2_b			; CHECK-NEXT: Classifying expressions for: @test2_b
	; CHECK-NEXT: %c = add i32 %a, %b			; CHECK-NEXT: %c = add i32 %a, %b
	; CHECK-NEXT: --> (%a + %b) U: full-set S: full-set			; CHECK-NEXT: --> (%a + %b) U: full-set S: full-set
	; CHECK-NEXT: %iv = phi i32 [ %a, %entry ], [ %iv.next, %loop ]			; CHECK-NEXT: %iv = phi i32 [ %a, %entry ], [ %iv.next, %loop ]
	; CHECK-NEXT: --> {%a,+,%b}<nuw><nsw><%loop> U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }			; CHECK-NEXT: --> {%a,+,%b}<nuw><nsw><%loop> U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }
	; CHECK-NEXT: %iv.next = add nuw nsw i32 %iv, %b			; CHECK-NEXT: %iv.next = add nuw nsw i32 %iv, %b
	; CHECK-NEXT: --> {(%a + %b)<nuw><nsw>,+,%b}<nuw><nsw><%loop> U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }			; CHECK-NEXT: --> {(%a + %b),+,%b}<nuw><nsw><%loop> U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }
	; CHECK-NEXT: %trap = udiv i32 %a, %iv.next			; CHECK-NEXT: %trap = udiv i32 %a, %iv.next
	; CHECK-NEXT: --> (%a /u {(%a + %b)<nuw><nsw>,+,%b}<nuw><nsw><%loop>) U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }			; CHECK-NEXT: --> (%a /u {(%a + %b),+,%b}<nuw><nsw><%loop>) U: full-set S: full-set Exits: <<Unknown>> LoopDispositions: { %loop: Computable }
	; CHECK-NEXT: Determining loop execution counts for: @test2_b			; CHECK-NEXT: Determining loop execution counts for: @test2_b
	; CHECK-NEXT: Loop %loop: Unpredictable backedge-taken count.			; CHECK-NEXT: Loop %loop: Unpredictable backedge-taken count.
	; CHECK-NEXT: Loop %loop: Unpredictable max backedge-taken count.			; CHECK-NEXT: Loop %loop: Unpredictable max backedge-taken count.
	; CHECK-NEXT: Loop %loop: Unpredictable predicated backedge-taken count.			; CHECK-NEXT: Loop %loop: Unpredictable predicated backedge-taken count.
	;			;
	entry:			entry:
	br i1 %will_overflow, label %exit1, label %loop			br i1 %will_overflow, label %exit1, label %loop


llvm/test/Analysis/ScalarEvolution/nsw-offset-assume.ll

	; CHECK-NEXT: --> {0,+,2}<nuw><nsw><%bb> U: [0,2147483645) S: [0,2147483645) Exits: (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {0,+,2}<nuw><nsw><%bb> U: [0,2147483645) S: [0,2147483645) Exits: (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %5 = getelementptr inbounds double, double* %q, i64 %4			; CHECK-NEXT: %5 = getelementptr inbounds double, double* %q, i64 %4
	; CHECK-NEXT: --> {%q,+,16}<nuw><%bb> U: full-set S: full-set Exits: ((16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {%q,+,16}<nuw><%bb> U: full-set S: full-set Exits: ((16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %7 = or i32 %i.01, 1			; CHECK-NEXT: %7 = or i32 %i.01, 1
	; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>)<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>)<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %8 = sext i32 %7 to i64			; CHECK-NEXT: %8 = sext i32 %7 to i64
	; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw>)<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw>)<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %9 = getelementptr inbounds double, double* %q, i64 %8			; CHECK-NEXT: %9 = getelementptr inbounds double, double* %q, i64 %8
	; CHECK-NEXT: --> {(8 + %q)<nuw>,+,16}<nuw><%bb> U: [8,0) S: [8,0) Exits: (8 + (16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {(8 + %q),+,16}<nuw><%bb> U: full-set S: full-set Exits: (8 + (16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %t7 = add nsw i32 %i.01, 1			; CHECK-NEXT: %t7 = add nsw i32 %i.01, 1
	; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>)<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>)<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %t8 = sext i32 %t7 to i64			; CHECK-NEXT: %t8 = sext i32 %t7 to i64
	; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw>)<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw>)<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %t9 = getelementptr inbounds double, double* %q, i64 %t8			; CHECK-NEXT: %t9 = getelementptr inbounds double, double* %q, i64 %t8
	; CHECK-NEXT: --> {(8 + %q)<nuw>,+,16}<nuw><%bb> U: [8,0) S: [8,0) Exits: (8 + (16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {(8 + %q),+,16}<nuw><%bb> U: full-set S: full-set Exits: (8 + (16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %14 = sext i32 %i.01 to i64			; CHECK-NEXT: %14 = sext i32 %i.01 to i64
	; CHECK-NEXT: --> {0,+,2}<nuw><nsw><%bb> U: [0,2147483645) S: [0,2147483645) Exits: (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {0,+,2}<nuw><nsw><%bb> U: [0,2147483645) S: [0,2147483645) Exits: (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %15 = getelementptr inbounds double, double* %d, i64 %14			; CHECK-NEXT: %15 = getelementptr inbounds double, double* %d, i64 %14
	; CHECK-NEXT: --> {%d,+,16}<nuw><%bb> U: full-set S: full-set Exits: ((16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %d) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {%d,+,16}<nuw><%bb> U: full-set S: full-set Exits: ((16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %d) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %16 = add nsw i32 %i.01, 2			; CHECK-NEXT: %16 = add nsw i32 %i.01, 2
	; CHECK-NEXT: --> {2,+,2}<nuw><nsw><%bb> U: [2,2147483647) S: [2,2147483647) Exits: (2 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {2,+,2}<nuw><nsw><%bb> U: [2,2147483647) S: [2,2147483647) Exits: (2 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: Determining loop execution counts for: @foo			; CHECK-NEXT: Determining loop execution counts for: @foo
	; CHECK-NEXT: Loop %bb: backedge-taken count is ((-1 + (2 * (%no /u 2))<nuw>) /u 2)			; CHECK-NEXT: Loop %bb: backedge-taken count is ((-1 + (2 * (%no /u 2))<nuw>) /u 2)

llvm/test/Analysis/ScalarEvolution/nsw-offset.ll

	; CHECK-NEXT: --> {0,+,2}<nuw><nsw><%bb> U: [0,2147483645) S: [0,2147483645) Exits: (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {0,+,2}<nuw><nsw><%bb> U: [0,2147483645) S: [0,2147483645) Exits: (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %5 = getelementptr inbounds double, double* %q, i64 %4			; CHECK-NEXT: %5 = getelementptr inbounds double, double* %q, i64 %4
	; CHECK-NEXT: --> {%q,+,16}<nuw><%bb> U: full-set S: full-set Exits: ((16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {%q,+,16}<nuw><%bb> U: full-set S: full-set Exits: ((16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %7 = or i32 %i.01, 1			; CHECK-NEXT: %7 = or i32 %i.01, 1
	; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>)<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>)<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %8 = sext i32 %7 to i64			; CHECK-NEXT: %8 = sext i32 %7 to i64
	; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw>)<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw>)<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %9 = getelementptr inbounds double, double* %q, i64 %8			; CHECK-NEXT: %9 = getelementptr inbounds double, double* %q, i64 %8
	; CHECK-NEXT: --> {(8 + %q)<nuw>,+,16}<nuw><%bb> U: [8,0) S: [8,0) Exits: (8 + (16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {(8 + %q),+,16}<nuw><%bb> U: full-set S: full-set Exits: (8 + (16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %t7 = add nsw i32 %i.01, 1			; CHECK-NEXT: %t7 = add nsw i32 %i.01, 1
	; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>)<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>)<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %t8 = sext i32 %t7 to i64			; CHECK-NEXT: %t8 = sext i32 %t7 to i64
	; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw>)<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {1,+,2}<nuw><nsw><%bb> U: [1,2147483646) S: [1,2147483646) Exits: (1 + (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw>)<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %t9 = getelementptr inbounds double, double* %q, i64 %t8			; CHECK-NEXT: %t9 = getelementptr inbounds double, double* %q, i64 %t8
	; CHECK-NEXT: --> {(8 + %q)<nuw>,+,16}<nuw><%bb> U: [8,0) S: [8,0) Exits: (8 + (16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {(8 + %q),+,16}<nuw><%bb> U: full-set S: full-set Exits: (8 + (16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %q) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %14 = sext i32 %i.01 to i64			; CHECK-NEXT: %14 = sext i32 %i.01 to i64
	; CHECK-NEXT: --> {0,+,2}<nuw><nsw><%bb> U: [0,2147483645) S: [0,2147483645) Exits: (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {0,+,2}<nuw><nsw><%bb> U: [0,2147483645) S: [0,2147483645) Exits: (2 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %15 = getelementptr inbounds double, double* %d, i64 %14			; CHECK-NEXT: %15 = getelementptr inbounds double, double* %d, i64 %14
	; CHECK-NEXT: --> {%d,+,16}<nuw><%bb> U: full-set S: full-set Exits: ((16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %d) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {%d,+,16}<nuw><%bb> U: full-set S: full-set Exits: ((16 * ((1 + (zext i32 (-2 + (2 * (%no /u 2))<nuw>) to i64))<nuw><nsw> /u 2))<nuw><nsw> + %d) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: %16 = add nsw i32 %i.01, 2			; CHECK-NEXT: %16 = add nsw i32 %i.01, 2
	; CHECK-NEXT: --> {2,+,2}<nuw><nsw><%bb> U: [2,2147483647) S: [2,2147483647) Exits: (2 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>) LoopDispositions: { %bb: Computable }			; CHECK-NEXT: --> {2,+,2}<nuw><nsw><%bb> U: [2,2147483647) S: [2,2147483647) Exits: (2 + (2 * ((-1 + (2 * (%no /u 2))<nuw>) /u 2))<nuw>) LoopDispositions: { %bb: Computable }
	; CHECK-NEXT: Determining loop execution counts for: @foo			; CHECK-NEXT: Determining loop execution counts for: @foo
	; CHECK-NEXT: Loop %bb: backedge-taken count is ((-1 + (2 * (%no /u 2))<nuw>) /u 2)			; CHECK-NEXT: Loop %bb: backedge-taken count is ((-1 + (2 * (%no /u 2))<nuw>) /u 2)

llvm/test/CodeGen/PowerPC/lsr-profitable-chain.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc -verify-machineinstrs -mtriple=powerpc64le-unknown-linux-gnu \			; RUN: llc -verify-machineinstrs -mtriple=powerpc64le-unknown-linux-gnu \
	; RUN: -mcpu=pwr9 < %s | FileCheck %s			; RUN: -mcpu=pwr9 < %s | FileCheck %s

	define void @foo(double* readonly %0, double* %1, i64 %2, i64 %3, i64 %4, i64 %5, i64 %6, i64 %7) {			define void @foo(double* readonly %0, double* %1, i64 %2, i64 %3, i64 %4, i64 %5, i64 %6, i64 %7) {
	; CHECK-LABEL: foo:			; CHECK-LABEL: foo:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: cmpd 5, 7			; CHECK-NEXT: cmpd 5, 7
	; CHECK-NEXT: std 19, -104(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 20, -96(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 21, -88(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 22, -80(1) # 8-byte Folded Spill			; CHECK-NEXT: std 22, -80(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 23, -72(1) # 8-byte Folded Spill			; CHECK-NEXT: std 23, -72(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 24, -64(1) # 8-byte Folded Spill			; CHECK-NEXT: std 24, -64(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 25, -56(1) # 8-byte Folded Spill			; CHECK-NEXT: std 25, -56(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 26, -48(1) # 8-byte Folded Spill			; CHECK-NEXT: std 26, -48(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 27, -40(1) # 8-byte Folded Spill			; CHECK-NEXT: std 27, -40(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 28, -32(1) # 8-byte Folded Spill			; CHECK-NEXT: std 28, -32(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 29, -24(1) # 8-byte Folded Spill			; CHECK-NEXT: std 29, -24(1) # 8-byte Folded Spill
	; CHECK-NEXT: std 30, -16(1) # 8-byte Folded Spill			; CHECK-NEXT: std 30, -16(1) # 8-byte Folded Spill
	; CHECK-NEXT: bge 0, .LBB0_6			; CHECK-NEXT: bge 0, .LBB0_6
	; CHECK-NEXT: # %bb.1: # %.preheader			; CHECK-NEXT: # %bb.1: # %.preheader
	; CHECK-NEXT: addi 30, 5, 1			; CHECK-NEXT: addi 30, 5, 1
	; CHECK-NEXT: addi 28, 5, 3			; CHECK-NEXT: addi 28, 5, 3
	; CHECK-NEXT: addi 27, 5, 2			; CHECK-NEXT: addi 27, 5, 2
	; CHECK-NEXT: mulld 12, 8, 5			; CHECK-NEXT: mulld 12, 8, 5
	; CHECK-NEXT: addi 29, 3, 16			; CHECK-NEXT: addi 29, 3, 16
	; CHECK-NEXT: mulld 0, 9, 8			; CHECK-NEXT: mulld 0, 9, 8
	; CHECK-NEXT: mr 25, 12			; CHECK-NEXT: sldi 11, 10, 3
	; CHECK-NEXT: mulld 30, 8, 30			; CHECK-NEXT: mulld 30, 8, 30
	; CHECK-NEXT: mulld 28, 8, 28			; CHECK-NEXT: mulld 28, 8, 28
	; CHECK-NEXT: mulld 8, 8, 27			; CHECK-NEXT: mulld 8, 8, 27
	; CHECK-NEXT: sldi 11, 10, 3
	; CHECK-NEXT: li 27, 0
	; CHECK-NEXT: mr 26, 30
	; CHECK-NEXT: b .LBB0_3			; CHECK-NEXT: b .LBB0_3
	; CHECK-NEXT: .p2align 4			; CHECK-NEXT: .p2align 4
	; CHECK-NEXT: .LBB0_2:			; CHECK-NEXT: .LBB0_2:
	; CHECK-NEXT: add 5, 5, 9			; CHECK-NEXT: add 5, 5, 9
	; CHECK-NEXT: add 25, 25, 0			; CHECK-NEXT: add 12, 12, 0
	; CHECK-NEXT: add 26, 26, 0			; CHECK-NEXT: add 30, 30, 0
	; CHECK-NEXT: add 28, 28, 0			; CHECK-NEXT: add 28, 28, 0
	; CHECK-NEXT: add 8, 8, 0			; CHECK-NEXT: add 8, 8, 0
	; CHECK-NEXT: addi 27, 27, 1
	; CHECK-NEXT: cmpd 5, 7			; CHECK-NEXT: cmpd 5, 7
	; CHECK-NEXT: bge 0, .LBB0_6			; CHECK-NEXT: bge 0, .LBB0_6
	; CHECK-NEXT: .LBB0_3: # =>This Loop Header: Depth=1			; CHECK-NEXT: .LBB0_3: # =>This Loop Header: Depth=1
	; CHECK-NEXT: # Child Loop BB0_5 Depth 2			; CHECK-NEXT: # Child Loop BB0_5 Depth 2
	; CHECK-NEXT: sub 24, 5, 10			; CHECK-NEXT: sub 27, 5, 10
	; CHECK-NEXT: cmpd 6, 24			; CHECK-NEXT: cmpd 6, 27
	; CHECK-NEXT: bge 0, .LBB0_2			; CHECK-NEXT: bge 0, .LBB0_2
	; CHECK-NEXT: # %bb.4:			; CHECK-NEXT: # %bb.4:
	; CHECK-NEXT: maddld 21, 0, 27, 30			; CHECK-NEXT: add 23, 6, 12
	; CHECK-NEXT: maddld 20, 0, 27, 12			; CHECK-NEXT: add 22, 6, 30
	; CHECK-NEXT: add 23, 6, 28			; CHECK-NEXT: add 26, 6, 28
	; CHECK-NEXT: add 22, 6, 8			; CHECK-NEXT: add 25, 6, 8
	; CHECK-NEXT: add 20, 6, 20			; CHECK-NEXT: sldi 24, 6, 3
	; CHECK-NEXT: add 19, 6, 21			; CHECK-NEXT: sldi 26, 26, 3
				; CHECK-NEXT: sldi 25, 25, 3
	; CHECK-NEXT: sldi 23, 23, 3			; CHECK-NEXT: sldi 23, 23, 3
	; CHECK-NEXT: sldi 22, 22, 3			; CHECK-NEXT: sldi 22, 22, 3
	; CHECK-NEXT: sldi 21, 6, 3			; CHECK-NEXT: add 24, 4, 24
	; CHECK-NEXT: add 23, 29, 23			; CHECK-NEXT: add 26, 29, 26
	; CHECK-NEXT: add 22, 29, 22			; CHECK-NEXT: add 25, 29, 25
	; CHECK-NEXT: sldi 20, 20, 3			; CHECK-NEXT: add 23, 3, 23
	; CHECK-NEXT: sldi 19, 19, 3			; CHECK-NEXT: add 22, 3, 22
	; CHECK-NEXT: add 21, 4, 21
	; CHECK-NEXT: add 20, 3, 20
	; CHECK-NEXT: add 19, 3, 19
	; CHECK-NEXT: .p2align 5			; CHECK-NEXT: .p2align 5
	; CHECK-NEXT: .LBB0_5: # Parent Loop BB0_3 Depth=1			; CHECK-NEXT: .LBB0_5: # Parent Loop BB0_3 Depth=1
	; CHECK-NEXT: # => This Inner Loop Header: Depth=2			; CHECK-NEXT: # => This Inner Loop Header: Depth=2
	; CHECK-NEXT: lfd 0, 0(21)			; CHECK-NEXT: lfd 0, 0(24)
	; CHECK-NEXT: lfd 1, 0(20)			; CHECK-NEXT: lfd 1, 0(23)
	; CHECK-NEXT: add 6, 6, 10			; CHECK-NEXT: add 6, 6, 10
	; CHECK-NEXT: cmpd 6, 24			; CHECK-NEXT: cmpd 6, 27
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 8(20)			; CHECK-NEXT: lfd 1, 8(23)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 16(20)			; CHECK-NEXT: lfd 1, 16(23)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 24(20)			; CHECK-NEXT: lfd 1, 24(23)
	; CHECK-NEXT: add 20, 20, 11			; CHECK-NEXT: add 23, 23, 11
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 0(19)			; CHECK-NEXT: lfd 1, 0(22)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 8(19)			; CHECK-NEXT: lfd 1, 8(22)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 16(19)			; CHECK-NEXT: lfd 1, 16(22)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 24(19)			; CHECK-NEXT: lfd 1, 24(22)
	; CHECK-NEXT: add 19, 19, 11			; CHECK-NEXT: add 22, 22, 11
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, -16(22)			; CHECK-NEXT: lfd 1, -16(25)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, -8(22)			; CHECK-NEXT: lfd 1, -8(25)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 0(22)			; CHECK-NEXT: lfd 1, 0(25)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 8(22)			; CHECK-NEXT: lfd 1, 8(25)
	; CHECK-NEXT: add 22, 22, 11			; CHECK-NEXT: add 25, 25, 11
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, -16(23)			; CHECK-NEXT: lfd 1, -16(26)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, -8(23)			; CHECK-NEXT: lfd 1, -8(26)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 0(23)			; CHECK-NEXT: lfd 1, 0(26)
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: lfd 1, 8(23)			; CHECK-NEXT: lfd 1, 8(26)
	; CHECK-NEXT: add 23, 23, 11			; CHECK-NEXT: add 26, 26, 11
	; CHECK-NEXT: xsadddp 0, 0, 1			; CHECK-NEXT: xsadddp 0, 0, 1
	; CHECK-NEXT: stfd 0, 0(21)			; CHECK-NEXT: stfd 0, 0(24)
	; CHECK-NEXT: add 21, 21, 11			; CHECK-NEXT: add 24, 24, 11
	; CHECK-NEXT: blt 0, .LBB0_5			; CHECK-NEXT: blt 0, .LBB0_5
	; CHECK-NEXT: b .LBB0_2			; CHECK-NEXT: b .LBB0_2
	; CHECK-NEXT: .LBB0_6:			; CHECK-NEXT: .LBB0_6:
	; CHECK-NEXT: ld 30, -16(1) # 8-byte Folded Reload			; CHECK-NEXT: ld 30, -16(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 29, -24(1) # 8-byte Folded Reload			; CHECK-NEXT: ld 29, -24(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 28, -32(1) # 8-byte Folded Reload			; CHECK-NEXT: ld 28, -32(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 27, -40(1) # 8-byte Folded Reload			; CHECK-NEXT: ld 27, -40(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 26, -48(1) # 8-byte Folded Reload			; CHECK-NEXT: ld 26, -48(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 25, -56(1) # 8-byte Folded Reload			; CHECK-NEXT: ld 25, -56(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 24, -64(1) # 8-byte Folded Reload			; CHECK-NEXT: ld 24, -64(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 23, -72(1) # 8-byte Folded Reload			; CHECK-NEXT: ld 23, -72(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 22, -80(1) # 8-byte Folded Reload			; CHECK-NEXT: ld 22, -80(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 21, -88(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 20, -96(1) # 8-byte Folded Reload
	; CHECK-NEXT: ld 19, -104(1) # 8-byte Folded Reload
	; CHECK-NEXT: blr			; CHECK-NEXT: blr
	%9 = icmp slt i64 %2, %4			%9 = icmp slt i64 %2, %4
	br i1 %9, label %10, label %97			br i1 %9, label %10, label %97

	10: ; preds = %8, %93			10: ; preds = %8, %93
	%11 = phi i64 [ %95, %93 ], [ %2, %8 ]			%11 = phi i64 [ %95, %93 ], [ %2, %8 ]
	%12 = phi i64 [ %94, %93 ], [ %3, %8 ]			%12 = phi i64 [ %94, %93 ], [ %3, %8 ]
	%13 = sub nsw i64 %11, %7			%13 = sub nsw i64 %11, %7

llvm/test/Transforms/LoopIdiom/basic.ll

	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[CONV:%.*]] = sext i32 [[C:%.*]] to i64			; CHECK-NEXT: [[CONV:%.*]] = sext i32 [[C:%.*]] to i64
	; CHECK-NEXT: [[MUL:%.*]] = shl nsw i64 [[CONV]], 2			; CHECK-NEXT: [[MUL:%.*]] = shl nsw i64 [[CONV]], 2
	; CHECK-NEXT: [[CALL:%.*]] = tail call noalias i8* @malloc(i64 [[MUL]])			; CHECK-NEXT: [[CALL:%.*]] = tail call noalias i8* @malloc(i64 [[MUL]])
	; CHECK-NEXT: [[TMP0:%.*]] = bitcast i8* [[CALL]] to i32*			; CHECK-NEXT: [[TMP0:%.*]] = bitcast i8* [[CALL]] to i32*
	; CHECK-NEXT: [[TOBOOL_9:%.*]] = icmp eq i32 [[C]], 0			; CHECK-NEXT: [[TOBOOL_9:%.*]] = icmp eq i32 [[C]], 0
	; CHECK-NEXT: br i1 [[TOBOOL_9]], label [[WHILE_END:%.*]], label [[WHILE_BODY_PREHEADER:%.*]]			; CHECK-NEXT: br i1 [[TOBOOL_9]], label [[WHILE_END:%.*]], label [[WHILE_BODY_PREHEADER:%.*]]
	; CHECK: while.body.preheader:			; CHECK: while.body.preheader:
	; CHECK-NEXT: [[TMP1:%.*]] = sext i32 [[C]] to i64			; CHECK-NEXT: [[TMP1:%.*]] = add i32 [[C]], -1
	; CHECK-NEXT: [[TMP2:%.*]] = shl nsw i64 [[TMP1]], 2			; CHECK-NEXT: [[TMP2:%.*]] = sext i32 [[TMP1]] to i64
	; CHECK-NEXT: [[TMP3:%.*]] = add i64 [[TMP2]], -4			; CHECK-NEXT: [[TMP3:%.*]] = shl nsw i64 [[TMP2]], 2
	; CHECK-NEXT: [[TMP4:%.*]] = add nsw i32 [[C]], -1			; CHECK-NEXT: [[TMP4:%.*]] = zext i32 [[TMP1]] to i64
	; CHECK-NEXT: [[TMP5:%.*]] = zext i32 [[TMP4]] to i64			; CHECK-NEXT: [[TMP5:%.*]] = shl nuw nsw i64 [[TMP4]], 2
	; CHECK-NEXT: [[TMP6:%.*]] = shl nuw nsw i64 [[TMP5]], 2			; CHECK-NEXT: [[TMP6:%.*]] = sub i64 [[TMP3]], [[TMP5]]
	; CHECK-NEXT: [[TMP7:%.*]] = sub i64 [[TMP3]], [[TMP6]]			; CHECK-NEXT: [[SCEVGEP:%.*]] = getelementptr i8, i8* [[CALL]], i64 [[TMP6]]
	; CHECK-NEXT: [[SCEVGEP:%.*]] = getelementptr i8, i8* [[CALL]], i64 [[TMP7]]			; CHECK-NEXT: [[TMP7:%.*]] = sub i64 [[TMP2]], [[TMP4]]
	; CHECK-NEXT: [[TMP8:%.*]] = add i64 [[TMP1]], -1			; CHECK-NEXT: [[SCEVGEP1:%.*]] = getelementptr i32, i32* [[A:%.*]], i64 [[TMP7]]
	; CHECK-NEXT: [[TMP9:%.*]] = sub i64 [[TMP8]], [[TMP5]]
	; CHECK-NEXT: [[SCEVGEP1:%.*]] = getelementptr i32, i32* [[A:%.*]], i64 [[TMP9]]
	; CHECK-NEXT: [[SCEVGEP12:%.*]] = bitcast i32* [[SCEVGEP1]] to i8*			; CHECK-NEXT: [[SCEVGEP12:%.*]] = bitcast i32* [[SCEVGEP1]] to i8*
	; CHECK-NEXT: [[TMP10:%.*]] = zext i32 [[C]] to i64			; CHECK-NEXT: [[TMP8:%.*]] = zext i32 [[C]] to i64
	; CHECK-NEXT: [[TMP11:%.*]] = shl nuw nsw i64 [[TMP10]], 2			; CHECK-NEXT: [[TMP9:%.*]] = shl nuw nsw i64 [[TMP8]], 2
	; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 [[SCEVGEP]], i8* align 4 [[SCEVGEP12]], i64 [[TMP11]], i1 false)			; CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 [[SCEVGEP]], i8* align 4 [[SCEVGEP12]], i64 [[TMP9]], i1 false)
	; CHECK-NEXT: br label [[WHILE_BODY:%.*]]			; CHECK-NEXT: br label [[WHILE_BODY:%.*]]
	; CHECK: while.body:			; CHECK: while.body:
	; CHECK-NEXT: [[DEC10_IN:%.*]] = phi i32 [ [[DEC10:%.*]], [[WHILE_BODY]] ], [ [[C]], [[WHILE_BODY_PREHEADER]] ]			; CHECK-NEXT: [[DEC10_IN:%.*]] = phi i32 [ [[DEC10:%.*]], [[WHILE_BODY]] ], [ [[C]], [[WHILE_BODY_PREHEADER]] ]
	; CHECK-NEXT: [[DEC10]] = add nsw i32 [[DEC10_IN]], -1			; CHECK-NEXT: [[DEC10]] = add nsw i32 [[DEC10_IN]], -1
	; CHECK-NEXT: [[IDXPROM:%.*]] = sext i32 [[DEC10]] to i64			; CHECK-NEXT: [[IDXPROM:%.*]] = sext i32 [[DEC10]] to i64
	; CHECK-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds i32, i32* [[A]], i64 [[IDXPROM]]			; CHECK-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds i32, i32* [[A]], i64 [[IDXPROM]]
	; CHECK-NEXT: [[TMP12:%.*]] = load i32, i32* [[ARRAYIDX]], align 4			; CHECK-NEXT: [[TMP10:%.*]] = load i32, i32* [[ARRAYIDX]], align 4
	; CHECK-NEXT: [[ARRAYIDX2:%.*]] = getelementptr inbounds i32, i32* [[TMP0]], i64 [[IDXPROM]]			; CHECK-NEXT: [[ARRAYIDX2:%.*]] = getelementptr inbounds i32, i32* [[TMP0]], i64 [[IDXPROM]]
	; CHECK-NEXT: [[TOBOOL:%.*]] = icmp eq i32 [[DEC10]], 0			; CHECK-NEXT: [[TOBOOL:%.*]] = icmp eq i32 [[DEC10]], 0
	; CHECK-NEXT: br i1 [[TOBOOL]], label [[WHILE_END_LOOPEXIT:%.*]], label [[WHILE_BODY]]			; CHECK-NEXT: br i1 [[TOBOOL]], label [[WHILE_END_LOOPEXIT:%.*]], label [[WHILE_BODY]]
	; CHECK: while.end.loopexit:			; CHECK: while.end.loopexit:
	; CHECK-NEXT: br label [[WHILE_END]]			; CHECK-NEXT: br label [[WHILE_END]]
	; CHECK: while.end:			; CHECK: while.end:
	; CHECK-NEXT: ret i32* [[TMP0]]			; CHECK-NEXT: ret i32* [[TMP0]]
	;			;

Revision Contents

Diff 376792

llvm/lib/Analysis/ScalarEvolution.cpp

llvm/test/Analysis/ScalarEvolution/flags-from-poison.ll

llvm/test/Analysis/ScalarEvolution/incorrect-exit-count.ll

llvm/test/Analysis/ScalarEvolution/no-wrap-add-exprs.ll

llvm/test/Analysis/ScalarEvolution/nsw-offset-assume.ll

llvm/test/Analysis/ScalarEvolution/nsw-offset.ll

llvm/test/CodeGen/PowerPC/lsr-profitable-chain.ll

llvm/test/Transforms/LoopIdiom/basic.ll