This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
lib/Transforms/Scalar/
-
Transforms/
-
Scalar/
-
IndVarSimplify.cpp
-
test/Transforms/IndVarSimplify/
-
Transforms/
-
IndVarSimplify/
-
eliminate-comparison.ll
-
lftr-multi-exit.ll

Differential D62625

LFTR for multiple exit loops
ClosedPublic

Authored by reames on May 29 2019, 1:27 PM.

Download Raw Diff

Details

Reviewers

nikic
apilipenko
sanjoy

Commits

rGeda1ba65ca98: LFTR for multiple exit loops
rL363883: LFTR for multiple exit loops

Summary

Teach IndVarSimply's LinearFunctionTestReplace transform to handle multiple exit loops. LFTR does two key things 1) it rewrites (all) exit tests in terms of a common IV potentially eliminating one in the process and 2) it moves any offset/indexing/f(i) style logic out of the loop.

This turns out to actually be pretty easy to implement. SCEV already has all the information we need to know what the backedge taken count is for each individual exit. (We use that when computing the BE taken count for the loop as a whole.) We basically just need to iterate through the exiting blocks and apply the existing logic with the exit specific BE taken count. (The previously landed NFC makes this super obvious.)

I chose to go ahead and apply this to all loop exits instead of only latch exits as originally proposed. After reviewing other passes, the only case I could find where LFTR form was harmful was LoopPredication. I've fixed the latch case, and guards aren't LFTRed anyways. We'll have some more work to do on the way towards widenable_conditions, but that's easily deferred.

Diff Detail

Repository: rL LLVM

Event Timeline

reames created this revision.May 29 2019, 1:27 PM

Herald added subscribers: sanjoy, javed.absar, bollu, mcrosier. · View Herald TranscriptMay 29 2019, 1:27 PM

Clean up towards a practical patch. Need to land a bunch of tests, and then rebase. Also included the LoopPred fix needed to make the increase in scope non-pessimizing in practice. (There may be more pieces like that needed.)

reames mentioned this in rL362417: [Tests] Add LFTR tests for multiple exit loops.Jun 3 2019, 9:44 AM

reames mentioned this in rG2fcd2bd0df5a: [Tests] Add LFTR tests for multiple exit loops.

Rebase on tests, and minor generalization. Still have a couple of other test failures to track down, so not yet ready for review.

reames mentioned this in rL362426: [Tests] Add LFTR tests for multiple exit loops (try 2).Jun 3 2019, 10:40 AM

reames mentioned this in rG83645d214d4d: [Tests] Add LFTR tests for multiple exit loops (try 2).

reames mentioned this in D62880: Prepare for multi-exit LFTR [NFC].Jun 4 2019, 2:26 PM

The NFC portion of this turned out to be a bit more involved than expected. It's been separated into D62880. Once that lands, I'll rebase for the small remaining functional bit. I'm debating just going to all loop exits instead of just the latch exit. On further reflection, I'm not sure it's really going to matter impact wise, and fleshing out problems faster might be worthwhile.

Also, this is still blocked on one last correctness issue in the existing LFTR implementation. Specifically, we're not clearing the inbounds parameter on a pointer IV when moving to a dynamically dead IV. That needs fixed before we generalize the code this much and increase the number of opportunities for miscompiles sharply.

reames mentioned this in rL362971: Prepare for multi-exit LFTR [NFC].Jun 10 2019, 10:48 AM

reames mentioned this in rG5d84ccb2303b: Prepare for multi-exit LFTR [NFC].

Rebase on landed prep patches. Ready for actual review, though won't land until after https://reviews.llvm.org/D62939.

Herald added a project: Restricted Project. · View Herald TranscriptJun 11 2019, 3:21 PM

Herald added a subscriber: kristof.beyls. · View Herald Transcript

This looks good, but I'd like to see some tests that involve an IV change. In particular I would expect that this transformation can go from exit conditions on two different IVs, to exit conditions on a single IV and it would be nice to see that.

reames mentioned this in rL363217: [Tests] Highlight impact of multiple exit LFTR (D62625) as requested by reviewer.Jun 12 2019, 4:36 PM

reames mentioned this in rG0bded8442fef: [Tests] Highlight impact of multiple exit LFTR (D62625) as requested by reviewer.Jun 12 2019, 4:40 PM

rebase on requested tests

Just to note, I'm deliberately not adding any tests for possibly poison IVs. I don't think there are any unique issues for multiple exit IVs, and giving that I want to leave the coverage concentrated on things which are unique to multiple exits.

LGTM, thanks for the extra tests.

lib/Transforms/Scalar/IndVarSimplify.cpp
2634 ↗	(On Diff #204387)	Rename BETakenCount to ExitCount here and also in linearFunctionTestReplace, possibly as NFC precommit? With multiple exits, this is not the backedge count anymore...

This revision is now accepted and ready to land.Jun 13 2019, 10:10 AM

reames mentioned this in rL363293: [LFTR] Rename variable to minimize confusion [NFC].Jun 13 2019, 11:36 AM

reames marked an inline comment as done.Jun 13 2019, 11:38 AM

reames added inline comments.

lib/Transforms/Scalar/IndVarSimplify.cpp
2634 ↗	(On Diff #204387)	Done in 363293.

reames mentioned this in rGc37be2963421: [LFTR] Rename variable to minimize confusion [NFC].Jun 13 2019, 11:38 AM

dendibakh added a subscriber: dendibakh.Jun 18 2019, 10:12 AM

reames mentioned this in rL363875: [LFTR] Rename variable to minimize confusion [NFC].Jun 19 2019, 1:38 PM

reames mentioned this in rGf8104f01e6bf: [LFTR] Rename variable to minimize confusion [NFC].Jun 19 2019, 1:42 PM

Closed by commit rL363883: LFTR for multiple exit loops (authored by reames). · Explain WhyJun 19 2019, 2:57 PM

This revision was automatically updated to reflect the committed changes.

I just submitted rL363964 which fixes a bug introduced in this change. I can't find an example which actually triggers it so it may be entirely latent, but if anyone is chasing a miscompile tied to this change, try applying that patch.

Hello,
I am going to be a bit unhelpful here. We see some regressions caused by this change, but I didn't have a chance to look into it today and so I don't have a reproducer. And here's my unhelpful question: are you aware or have you seen regressions, or are you still working on this? I am not really familiar with this pass, but from reading the description, it should do good things. Taking a wild guess, I see changes in the tests and they change comparisons from ult to ne, could it for example be that this confuses other passes/analysis?
Anyway, I will try to look into this tomorrow.
Cheers.

In D62625#1552433, @SjoerdMeijer wrote:

Hello,
I am going to be a bit unhelpful here. We see some regressions caused by this change, but I didn't have a chance to look into it today and so I don't have a reproducer. And here's my unhelpful question: are you aware or have you seen regressions, or are you still working on this? I am not really familiar with this pass, but from reading the description, it should do good things. Taking a wild guess, I see changes in the tests and they change comparisons from ult to ne, could it for example be that this confuses other passes/analysis?
Anyway, I will try to look into this tomorrow.
Cheers.

Just to be clear, you mean performance regressions correct? Not correctness ones?

I won't be surprised if this exposes weaknesses in other passes. If you can find me examples, I'm happy to go about trying to fix them. If there's too many, or the regressions are too large, we can also disable this behind a flag while we work through issues.

One case I know of is that you have a hot loop which runs very few iterations, this could end up being a regression. I have ideas on how to address that, but since they're involved, I'd really like to confirm a test case first.

Look forward to more details when you have them.

I just ran into a performance regression due to this commit. With https://martin.st/temp/glew-preproc-i686.c, compiling with clang -target i686-w64-mingw32 -c -O3 glew-preproc-i686.c went from 73 seconds before this commit, to 133 seconds afterwards.

There should be also positive impact on LSR pass from this change, since LSR likes equality comparisons.

In D62625#1552681, @mstorsjo wrote:

I just ran into a performance regression due to this commit. With https://martin.st/temp/glew-preproc-i686.c, compiling with clang -target i686-w64-mingw32 -c -O3 glew-preproc-i686.c went from 73 seconds before this commit, to 133 seconds afterwards.

Clarification here. Do you mean this was a compile time regression? Or a runtime regression? (i.e. is your clang here build with a different compiler? Or is this a self host?)

In D62625#1552791, @reames wrote:

In D62625#1552681, @mstorsjo wrote:

I just ran into a performance regression due to this commit. With https://martin.st/temp/glew-preproc-i686.c, compiling with clang -target i686-w64-mingw32 -c -O3 glew-preproc-i686.c went from 73 seconds before this commit, to 133 seconds afterwards.

Clarification here. Do you mean this was a compile time regression? Or a runtime regression? (i.e. is your clang here build with a different compiler? Or is this a self host?)

Compile time regression. (I build the project it's part of with a 120 second timeout per translation unit, enforced with ulimit -t 120, that's why I noticed.)

Just to confirm that in my case it's a performance regression.
I am going to take a look now.

I think I am mainly looking at micro-architecture sensitivities and some unfortunate knock on effects of this patch.
I don't think I can blame anything of that on this patch. If I change my mind about that, I will let you know. :-) But then I'd of course have to come up with a reproducer (or a fix).
Thanks for your help, sorry for the noise!

In D62625#1553095, @mstorsjo wrote:

Compile time regression. (I build the project it's part of with a 120 second timeout per translation unit, enforced with ulimit -t 120, that's why I noticed.)

If you could provide an IR reproducer, and ideally a hint where to look, I can see what I find. Please file a bug with reproduction instructions and an IR attachment.

In D62625#1553525, @SjoerdMeijer wrote:

I think I am mainly looking at micro-architecture sensitivities and some unfortunate knock on effects of this patch.
I don't think I can blame anything of that on this patch. If I change my mind about that, I will let you know. :-) But then I'd of course have to come up with a reproducer (or a fix).
Thanks for your help, sorry for the noise!

Np. I'm happy to help debate fixes for issues you might find. Feel free to add me as a reviewer (if appropriate), or start an email conversation with preliminary findings if it's helpful.

In D62625#1553942, @reames wrote:

In D62625#1553095, @mstorsjo wrote:

Compile time regression. (I build the project it's part of with a 120 second timeout per translation unit, enforced with ulimit -t 120, that's why I noticed.)

If you could provide an IR reproducer, and ideally a hint where to look, I can see what I find. Please file a bug with reproduction instructions and an IR attachment.

A C level reproducer is already linked (https://martin.st/temp/glew-preproc-i686.c, compiled with clang -target i686-w64-mingw32 -c -O3 glew-preproc-i686.c) I filed https://bugs.llvm.org/show_bug.cgi?id=42357 now as well, with that, and matching IR for reproducing on that level. The issue is not only that compilation takes longer, the output also almost doubled in size.

Revision Contents

Path

Size

llvm/

trunk/

lib/

Transforms/

Scalar/

IndVarSimplify.cpp

33 lines

test/

Transforms/

IndVarSimplify/

eliminate-comparison.ll

52 lines

lftr-multi-exit.ll

69 lines

Diff 205680

llvm/trunk/lib/Transforms/Scalar/IndVarSimplify.cpp

Show First 20 Lines • Show All 2,031 Lines • ▼ Show 20 Lines
/// block is also the latch.		/// block is also the latch.
static ICmpInst getLoopTest(Loop L, BasicBlock *ExitingBB) {		static ICmpInst getLoopTest(Loop L, BasicBlock *ExitingBB) {

BasicBlock *LatchBlock = L->getLoopLatch();		BasicBlock *LatchBlock = L->getLoopLatch();
// Don't bother with LFTR if the loop is not properly simplified.		// Don't bother with LFTR if the loop is not properly simplified.
if (!LatchBlock)		if (!LatchBlock)
return nullptr;		return nullptr;

BranchInst *BI = dyn_cast<BranchInst>(ExitingBB->getTerminator());		BranchInst *BI = cast<BranchInst>(ExitingBB->getTerminator());
assert(BI && "expected exit branch");

return dyn_cast<ICmpInst>(BI->getCondition());		return dyn_cast<ICmpInst>(BI->getCondition());
}		}

/// linearFunctionTestReplace policy. Return true unless we can show that the		/// linearFunctionTestReplace policy. Return true unless we can show that the
/// current exit test is already sufficiently canonical.		/// current exit test is already sufficiently canonical.
static bool needsLFTR(Loop L, BasicBlock ExitingBB) {		static bool needsLFTR(Loop L, BasicBlock ExitingBB) {
		// Avoid converting a constant or loop invariant test back to a runtime
		// test. This is critical for when SCEV's cached ExitCount is less precise
		// than the current IR (such as after we've proven a particular exit is
		// actually dead and thus the BE count never reaches our ExitCount.)
		BranchInst *BI = cast<BranchInst>(ExitingBB->getTerminator());
		if (L->isLoopInvariant(BI->getCondition()))
		return false;

// Do LFTR to simplify the exit condition to an ICMP.		// Do LFTR to simplify the exit condition to an ICMP.
ICmpInst *Cond = getLoopTest(L, ExitingBB);		ICmpInst *Cond = getLoopTest(L, ExitingBB);
if (!Cond)		if (!Cond)
return true;		return true;

// Do LFTR to simplify the exit ICMP to EQ/NE		// Do LFTR to simplify the exit ICMP to EQ/NE
ICmpInst::Predicate Pred = Cond->getPredicate();		ICmpInst::Predicate Pred = Cond->getPredicate();
if (Pred != ICmpInst::ICMP_NE && Pred != ICmpInst::ICMP_EQ)		if (Pred != ICmpInst::ICMP_NE && Pred != ICmpInst::ICMP_EQ)
▲ Show 20 Lines • Show All 185 Lines • ▼ Show 20 Lines	for (BasicBlock::iterator I = L->getHeader()->begin(); isa<PHINode>(I); ++I) {
uint64_t PhiWidth = SE->getTypeSizeInBits(AR->getType());		uint64_t PhiWidth = SE->getTypeSizeInBits(AR->getType());
if (PhiWidth < BCWidth \|\| !DL.isLegalInteger(PhiWidth))		if (PhiWidth < BCWidth \|\| !DL.isLegalInteger(PhiWidth))
continue;		continue;

// Avoid reusing a potentially undef value to compute other values that may		// Avoid reusing a potentially undef value to compute other values that may
// have originally had a concrete definition.		// have originally had a concrete definition.
if (!hasConcreteDef(Phi)) {		if (!hasConcreteDef(Phi)) {
// We explicitly allow unknown phis as long as they are already used by		// We explicitly allow unknown phis as long as they are already used by
// the loop test. In this case we assume that performing LFTR could not		// the loop exit test. This is legal since performing LFTR could not
// increase the number of undef users.		// increase the number of undef users.
// TODO: Generalize this to allow any loop exit which is known to
// execute on each iteration
if (L->getExitingBlock())
if (ICmpInst *Cond = getLoopTest(L, ExitingBB))		if (ICmpInst *Cond = getLoopTest(L, ExitingBB))
if (Phi != getLoopPhiForCounter(Cond->getOperand(0), L) &&		if (Phi != getLoopPhiForCounter(Cond->getOperand(0), L) &&
Phi != getLoopPhiForCounter(Cond->getOperand(1), L))		Phi != getLoopPhiForCounter(Cond->getOperand(1), L))
continue;		continue;
}		}

// Avoid introducing undefined behavior due to poison which didn't exist in		// Avoid introducing undefined behavior due to poison which didn't exist in
// the original program. (Annoyingly, the rules for poison and undef		// the original program. (Annoyingly, the rules for poison and undef
// propagation are distinct, so this does NOT cover the undef case above.)		// propagation are distinct, so this does NOT cover the undef case above.)
// We have to ensure that we don't introduce UB by introducing a use on an		// We have to ensure that we don't introduce UB by introducing a use on an
// iteration where said IV produces poison. Our strategy here differs for		// iteration where said IV produces poison. Our strategy here differs for
// pointers and integer IVs. For integers, we strip and reinfer as needed,		// pointers and integer IVs. For integers, we strip and reinfer as needed,
▲ Show 20 Lines • Show All 429 Lines • ▼ Show 20 Lines	if (ReplaceExitValue != NeverRepl &&
Changed \|= rewriteLoopExitValues(L, Rewriter);		Changed \|= rewriteLoopExitValues(L, Rewriter);

// Eliminate redundant IV cycles.		// Eliminate redundant IV cycles.
NumElimIV += Rewriter.replaceCongruentIVs(L, DT, DeadInsts);		NumElimIV += Rewriter.replaceCongruentIVs(L, DT, DeadInsts);

// If we have a trip count expression, rewrite the loop's exit condition		// If we have a trip count expression, rewrite the loop's exit condition
// using it.		// using it.
if (!DisableLFTR) {		if (!DisableLFTR) {
// For the moment, we only do LFTR for single exit loops. The code is
// structured as it is in the expectation of generalization to multi-exit
// loops in the near future. See D62625 for context.
SmallVector<BasicBlock*, 16> ExitingBlocks;		SmallVector<BasicBlock*, 16> ExitingBlocks;
if (auto *ExitingBB = L->getExitingBlock())		L->getExitingBlocks(ExitingBlocks);
ExitingBlocks.push_back(ExitingBB);
for (BasicBlock *ExitingBB : ExitingBlocks) {		for (BasicBlock *ExitingBB : ExitingBlocks) {
// Can't rewrite non-branch yet.		// Can't rewrite non-branch yet.
if (!isa<BranchInst>(ExitingBB->getTerminator()))		if (!isa<BranchInst>(ExitingBB->getTerminator()))
continue;		continue;

if (!needsLFTR(L, ExitingBB))		if (!needsLFTR(L, ExitingBB))
continue;		continue;

▲ Show 20 Lines • Show All 143 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/IndVarSimplify/eliminate-comparison.ll

	Show First 20 Lines • Show All 171 Lines • ▼ Show 20 Lines


	define i32 @func_11() nounwind uwtable {			define i32 @func_11() nounwind uwtable {
	; CHECK-LABEL: @func_11(			; CHECK-LABEL: @func_11(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[FORCOND:%.*]]			; CHECK-NEXT: br label [[FORCOND:%.*]]
	; CHECK: forcond:			; CHECK: forcond:
	; CHECK-NEXT: [[__KEY6_0:%.]] = phi i32 [ 2, [[ENTRY:%.]] ], [ [[TMP37:%.]], [[NOASSERT:%.]] ]			; CHECK-NEXT: [[__KEY6_0:%.]] = phi i32 [ 2, [[ENTRY:%.]] ], [ [[TMP37:%.]], [[NOASSERT:%.]] ]
	; CHECK-NEXT: [[TMP5:%.*]] = icmp ult i32 [[__KEY6_0]], 10			; CHECK-NEXT: [[EXITCOND1:%.*]] = icmp ne i32 [[__KEY6_0]], 10
	; CHECK-NEXT: br i1 [[TMP5]], label [[NOASSERT]], label [[FORCOND38_PREHEADER:%.*]]			; CHECK-NEXT: br i1 [[EXITCOND1]], label [[NOASSERT]], label [[FORCOND38_PREHEADER:%.*]]
	; CHECK: forcond38.preheader:			; CHECK: forcond38.preheader:
	; CHECK-NEXT: br label [[FORCOND38:%.*]]			; CHECK-NEXT: br label [[FORCOND38:%.*]]
	; CHECK: noassert:			; CHECK: noassert:
	; CHECK-NEXT: [[TMP13:%.*]] = sdiv i32 -32768, [[__KEY6_0]]			; CHECK-NEXT: [[TMP13:%.*]] = sdiv i32 -32768, [[__KEY6_0]]
	; CHECK-NEXT: [[TMP2936:%.*]] = shl i32 [[TMP13]], 24			; CHECK-NEXT: [[TMP2936:%.*]] = shl i32 [[TMP13]], 24
	; CHECK-NEXT: [[SEXT23:%.*]] = shl i32 [[TMP13]], 24			; CHECK-NEXT: [[SEXT23:%.*]] = shl i32 [[TMP13]], 24
	; CHECK-NEXT: [[TMP32:%.*]] = icmp eq i32 [[TMP2936]], [[SEXT23]]			; CHECK-NEXT: [[TMP32:%.*]] = icmp eq i32 [[TMP2936]], [[SEXT23]]
	; CHECK-NEXT: [[TMP37]] = add nuw nsw i32 [[__KEY6_0]], 1			; CHECK-NEXT: [[TMP37]] = add nuw nsw i32 [[__KEY6_0]], 1
	; CHECK-NEXT: br i1 [[TMP32]], label [[FORCOND]], label [[ASSERT33:%.*]]			; CHECK-NEXT: br i1 [[TMP32]], label [[FORCOND]], label [[ASSERT33:%.*]]
	; CHECK: assert33:			; CHECK: assert33:
	; CHECK-NEXT: tail call void @llvm.trap()			; CHECK-NEXT: tail call void @llvm.trap()
	; CHECK-NEXT: unreachable			; CHECK-NEXT: unreachable
	; CHECK: forcond38:			; CHECK: forcond38:
	; CHECK-NEXT: [[__KEY8_0:%.]] = phi i32 [ [[TMP81:%.]], [[NOASSERT68:%.*]] ], [ 2, [[FORCOND38_PREHEADER]] ]			; CHECK-NEXT: [[__KEY8_0:%.]] = phi i32 [ [[TMP81:%.]], [[NOASSERT68:%.*]] ], [ 2, [[FORCOND38_PREHEADER]] ]
	; CHECK-NEXT: [[TMP46:%.*]] = icmp ult i32 [[__KEY8_0]], 10			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[__KEY8_0]], 10
	; CHECK-NEXT: br i1 [[TMP46]], label [[NOASSERT68]], label [[UNROLLEDEND:%.*]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[NOASSERT68]], label [[UNROLLEDEND:%.*]]
	; CHECK: noassert68:			; CHECK: noassert68:
	; CHECK-NEXT: [[TMP57:%.*]] = sdiv i32 -32768, [[__KEY8_0]]			; CHECK-NEXT: [[TMP57:%.*]] = sdiv i32 -32768, [[__KEY8_0]]
	; CHECK-NEXT: [[SEXT34:%.*]] = shl i32 [[TMP57]], 16			; CHECK-NEXT: [[SEXT34:%.*]] = shl i32 [[TMP57]], 16
	; CHECK-NEXT: [[SEXT21:%.*]] = shl i32 [[TMP57]], 16			; CHECK-NEXT: [[SEXT21:%.*]] = shl i32 [[TMP57]], 16
	; CHECK-NEXT: [[TMP76:%.*]] = icmp eq i32 [[SEXT34]], [[SEXT21]]			; CHECK-NEXT: [[TMP76:%.*]] = icmp eq i32 [[SEXT34]], [[SEXT21]]
	; CHECK-NEXT: [[TMP81]] = add nuw nsw i32 [[__KEY8_0]], 1			; CHECK-NEXT: [[TMP81]] = add nuw nsw i32 [[__KEY8_0]], 1
	; CHECK-NEXT: br i1 [[TMP76]], label [[FORCOND38]], label [[ASSERT77:%.*]]			; CHECK-NEXT: br i1 [[TMP76]], label [[FORCOND38]], label [[ASSERT77:%.*]]
	; CHECK: assert77:			; CHECK: assert77:
	▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines

	; In this case the second loop only has a single iteration, fold the header away			; In this case the second loop only has a single iteration, fold the header away
	define i32 @func_12() nounwind uwtable {			define i32 @func_12() nounwind uwtable {
	; CHECK-LABEL: @func_12(			; CHECK-LABEL: @func_12(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[FORCOND:%.*]]			; CHECK-NEXT: br label [[FORCOND:%.*]]
	; CHECK: forcond:			; CHECK: forcond:
	; CHECK-NEXT: [[__KEY6_0:%.]] = phi i32 [ 2, [[ENTRY:%.]] ], [ [[TMP37:%.]], [[NOASSERT:%.]] ]			; CHECK-NEXT: [[__KEY6_0:%.]] = phi i32 [ 2, [[ENTRY:%.]] ], [ [[TMP37:%.]], [[NOASSERT:%.]] ]
	; CHECK-NEXT: [[TMP5:%.*]] = icmp ult i32 [[__KEY6_0]], 10			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[__KEY6_0]], 10
	; CHECK-NEXT: br i1 [[TMP5]], label [[NOASSERT]], label [[FORCOND38_PREHEADER:%.*]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[NOASSERT]], label [[FORCOND38_PREHEADER:%.*]]
	; CHECK: forcond38.preheader:			; CHECK: forcond38.preheader:
	; CHECK-NEXT: br label [[FORCOND38:%.*]]			; CHECK-NEXT: br label [[FORCOND38:%.*]]
	; CHECK: noassert:			; CHECK: noassert:
	; CHECK-NEXT: [[TMP13:%.*]] = sdiv i32 -32768, [[__KEY6_0]]			; CHECK-NEXT: [[TMP13:%.*]] = sdiv i32 -32768, [[__KEY6_0]]
	; CHECK-NEXT: [[TMP2936:%.*]] = shl i32 [[TMP13]], 24			; CHECK-NEXT: [[TMP2936:%.*]] = shl i32 [[TMP13]], 24
	; CHECK-NEXT: [[SEXT23:%.*]] = shl i32 [[TMP13]], 24			; CHECK-NEXT: [[SEXT23:%.*]] = shl i32 [[TMP13]], 24
	; CHECK-NEXT: [[TMP32:%.*]] = icmp eq i32 [[TMP2936]], [[SEXT23]]			; CHECK-NEXT: [[TMP32:%.*]] = icmp eq i32 [[TMP2936]], [[SEXT23]]
	; CHECK-NEXT: [[TMP37]] = add nuw nsw i32 [[__KEY6_0]], 1			; CHECK-NEXT: [[TMP37]] = add nuw nsw i32 [[__KEY6_0]], 1
	▲ Show 20 Lines • Show All 62 Lines • ▼ Show 20 Lines
	}			}

	declare void @side_effect()			declare void @side_effect()

	define void @func_13(i32* %len.ptr) {			define void @func_13(i32* %len.ptr) {
	; CHECK-LABEL: @func_13(			; CHECK-LABEL: @func_13(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[LEN:%.]] = load i32, i32 [[LEN_PTR:%.*]], !range !0			; CHECK-NEXT: [[LEN:%.]] = load i32, i32 [[LEN_PTR:%.*]], !range !0
	; CHECK-NEXT: [[LEN_SUB_1:%.*]] = add i32 [[LEN]], -1
	; CHECK-NEXT: [[LEN_IS_ZERO:%.*]] = icmp eq i32 [[LEN]], 0			; CHECK-NEXT: [[LEN_IS_ZERO:%.*]] = icmp eq i32 [[LEN]], 0
	; CHECK-NEXT: br i1 [[LEN_IS_ZERO]], label [[LEAVE:%.]], label [[LOOP_PREHEADER:%.]]			; CHECK-NEXT: br i1 [[LEN_IS_ZERO]], label [[LEAVE:%.]], label [[LOOP_PREHEADER:%.]]
	; CHECK: loop.preheader:			; CHECK: loop.preheader:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]			; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
	; CHECK: be:			; CHECK: be:
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[BE_COND:%.*]] = icmp ult i32 [[IV]], [[LEN_SUB_1]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_INC]], [[LEN]]
	; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]
	; CHECK: leave.loopexit:			; CHECK: leave.loopexit:
	; CHECK-NEXT: br label [[LEAVE]]			; CHECK-NEXT: br label [[LEAVE]]
	; CHECK: leave:			; CHECK: leave:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%len = load i32, i32* %len.ptr, !range !0			%len = load i32, i32* %len.ptr, !range !0
	%len.sub.1 = add i32 %len, -1			%len.sub.1 = add i32 %len, -1
	Show All 15 Lines
	leave:			leave:
	ret void			ret void
	}			}

	define void @func_14(i32* %len.ptr) {			define void @func_14(i32* %len.ptr) {
	; CHECK-LABEL: @func_14(			; CHECK-LABEL: @func_14(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[LEN:%.]] = load i32, i32 [[LEN_PTR:%.*]], !range !0			; CHECK-NEXT: [[LEN:%.]] = load i32, i32 [[LEN_PTR:%.*]], !range !0
	; CHECK-NEXT: [[LEN_SUB_1:%.*]] = add i32 [[LEN]], -1
	; CHECK-NEXT: [[LEN_IS_ZERO:%.*]] = icmp eq i32 [[LEN]], 0			; CHECK-NEXT: [[LEN_IS_ZERO:%.*]] = icmp eq i32 [[LEN]], 0
	; CHECK-NEXT: [[LEN_IS_INT_MIN:%.*]] = icmp eq i32 [[LEN]], -2147483648			; CHECK-NEXT: [[LEN_IS_INT_MIN:%.*]] = icmp eq i32 [[LEN]], -2147483648
	; CHECK-NEXT: [[NO_ENTRY:%.*]] = or i1 [[LEN_IS_ZERO]], [[LEN_IS_INT_MIN]]			; CHECK-NEXT: [[NO_ENTRY:%.*]] = or i1 [[LEN_IS_ZERO]], [[LEN_IS_INT_MIN]]
	; CHECK-NEXT: br i1 [[NO_ENTRY]], label [[LEAVE:%.]], label [[LOOP_PREHEADER:%.]]			; CHECK-NEXT: br i1 [[NO_ENTRY]], label [[LEAVE:%.]], label [[LOOP_PREHEADER:%.]]
	; CHECK: loop.preheader:			; CHECK: loop.preheader:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]			; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
	; CHECK: be:			; CHECK: be:
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[BE_COND:%.*]] = icmp slt i32 [[IV]], [[LEN_SUB_1]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_INC]], [[LEN]]
	; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]
	; CHECK: leave.loopexit:			; CHECK: leave.loopexit:
	; CHECK-NEXT: br label [[LEAVE]]			; CHECK-NEXT: br label [[LEAVE]]
	; CHECK: leave:			; CHECK: leave:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%len = load i32, i32* %len.ptr, !range !0			%len = load i32, i32* %len.ptr, !range !0
	%len.sub.1 = add i32 %len, -1			%len.sub.1 = add i32 %len, -1
	Show All 29 Lines
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]			; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
	; CHECK: be:			; CHECK: be:
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[BE_COND:%.*]] = icmp ult i32 [[IV]], [[LEN]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_INC]], [[LEN_ADD_1]]
	; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]
	; CHECK: leave.loopexit:			; CHECK: leave.loopexit:
	; CHECK-NEXT: br label [[LEAVE]]			; CHECK-NEXT: br label [[LEAVE]]
	; CHECK: leave:			; CHECK: leave:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%len = load i32, i32* %len.ptr, !range !0			%len = load i32, i32* %len.ptr, !range !0
	%len.add.1 = add i32 %len, 1			%len.add.1 = add i32 %len, 1
	Show All 21 Lines
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[LEN:%.]] = load i32, i32 [[LEN_PTR:%.*]], !range !0			; CHECK-NEXT: [[LEN:%.]] = load i32, i32 [[LEN_PTR:%.*]], !range !0
	; CHECK-NEXT: [[LEN_ADD_5:%.*]] = add i32 [[LEN]], 5			; CHECK-NEXT: [[LEN_ADD_5:%.*]] = add i32 [[LEN]], 5
	; CHECK-NEXT: [[ENTRY_COND_0:%.*]] = icmp slt i32 [[LEN]], 2147483643			; CHECK-NEXT: [[ENTRY_COND_0:%.*]] = icmp slt i32 [[LEN]], 2147483643
	; CHECK-NEXT: [[ENTRY_COND_1:%.*]] = icmp slt i32 4, [[LEN_ADD_5]]			; CHECK-NEXT: [[ENTRY_COND_1:%.*]] = icmp slt i32 4, [[LEN_ADD_5]]
	; CHECK-NEXT: [[ENTRY_COND:%.*]] = and i1 [[ENTRY_COND_0]], [[ENTRY_COND_1]]			; CHECK-NEXT: [[ENTRY_COND:%.*]] = and i1 [[ENTRY_COND_0]], [[ENTRY_COND_1]]
	; CHECK-NEXT: br i1 [[ENTRY_COND]], label [[LOOP_PREHEADER:%.]], label [[LEAVE:%.]]			; CHECK-NEXT: br i1 [[ENTRY_COND]], label [[LOOP_PREHEADER:%.]], label [[LEAVE:%.]]
	; CHECK: loop.preheader:			; CHECK: loop.preheader:
				; CHECK-NEXT: [[TMP0:%.*]] = add nuw nsw i32 [[LEN]], 1
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]			; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
	; CHECK: be:			; CHECK: be:
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[BE_COND:%.*]] = icmp ult i32 [[IV]], [[LEN]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_INC]], [[TMP0]]
	; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]
	; CHECK: leave.loopexit:			; CHECK: leave.loopexit:
	; CHECK-NEXT: br label [[LEAVE]]			; CHECK-NEXT: br label [[LEAVE]]
	; CHECK: leave:			; CHECK: leave:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%len = load i32, i32* %len.ptr, !range !0			%len = load i32, i32* %len.ptr, !range !0
	%len.add.5 = add i32 %len, 5			%len.add.5 = add i32 %len, 5
	▲ Show 20 Lines • Show All 144 Lines • ▼ Show 20 Lines
	; CHECK: loop.preheader:			; CHECK: loop.preheader:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]
	; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]			; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
	; CHECK: be:			; CHECK: be:
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[BE_COND:%.*]] = icmp ult i32 [[IV_INC]], [[LENGTH]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_INC]], [[LENGTH]]
	; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]
	; CHECK: leave.loopexit:			; CHECK: leave.loopexit:
	; CHECK-NEXT: br label [[LEAVE]]			; CHECK-NEXT: br label [[LEAVE]]
	; CHECK: leave:			; CHECK: leave:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%length = load i32, i32* %length.ptr, !range !0			%length = load i32, i32* %length.ptr, !range !0
	%length.is.nonzero = icmp ne i32 %length, 0			%length.is.nonzero = icmp ne i32 %length, 0
	Show All 22 Lines
	; CHECK-NEXT: [[LENGTH:%.]] = load i32, i32 [[LENGTH_PTR:%.*]]			; CHECK-NEXT: [[LENGTH:%.]] = load i32, i32 [[LENGTH_PTR:%.*]]
	; CHECK-NEXT: [[LENGTH_IS_NONZERO:%.*]] = icmp ne i32 [[LENGTH]], 0			; CHECK-NEXT: [[LENGTH_IS_NONZERO:%.*]] = icmp ne i32 [[LENGTH]], 0
	; CHECK-NEXT: br i1 [[LENGTH_IS_NONZERO]], label [[LOOP_PREHEADER:%.]], label [[LEAVE:%.]]			; CHECK-NEXT: br i1 [[LENGTH_IS_NONZERO]], label [[LOOP_PREHEADER:%.]], label [[LEAVE:%.]]
	; CHECK: loop.preheader:			; CHECK: loop.preheader:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]
	; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: [[RANGE_CHECK:%.*]] = icmp ult i32 [[IV]], [[LENGTH]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV]], [[LENGTH]]
	; CHECK-NEXT: br i1 [[RANGE_CHECK]], label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
	; CHECK: be:			; CHECK: be:
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[BE_COND:%.*]] = icmp slt i32 [[IV_INC]], [[LENGTH]]			; CHECK-NEXT: [[BE_COND:%.*]] = icmp slt i32 [[IV_INC]], [[LENGTH]]
	; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]			; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]
	; CHECK: leave.loopexit:			; CHECK: leave.loopexit:
	; CHECK-NEXT: br label [[LEAVE]]			; CHECK-NEXT: br label [[LEAVE]]
	; CHECK: leave:			; CHECK: leave:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	Show All 30 Lines
	; CHECK: loop.preheader:			; CHECK: loop.preheader:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]
	; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]			; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
	; CHECK: be:			; CHECK: be:
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[BE_COND:%.*]] = icmp slt i32 [[IV_INC]], [[LIM]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_INC]], [[LIM]]
	; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]
	; CHECK: leave.loopexit:			; CHECK: leave.loopexit:
	; CHECK-NEXT: br label [[LEAVE]]			; CHECK-NEXT: br label [[LEAVE]]
	; CHECK: leave:			; CHECK: leave:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%length = load i32, i32* %length.ptr, !range !0			%length = load i32, i32* %length.ptr, !range !0
	%lim = sub i32 %length, 1			%lim = sub i32 %length, 1
	Show All 16 Lines
	}			}

	; This checks that the backedge condition, (I + 1) < Length - 1 implies			; This checks that the backedge condition, (I + 1) < Length - 1 implies
	; (I + 1) < Length			; (I + 1) < Length
	define void @func_22(i32* %length.ptr) {			define void @func_22(i32* %length.ptr) {
	; CHECK-LABEL: @func_22(			; CHECK-LABEL: @func_22(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[LENGTH:%.]] = load i32, i32 [[LENGTH_PTR:%.*]], !range !0			; CHECK-NEXT: [[LENGTH:%.]] = load i32, i32 [[LENGTH_PTR:%.*]], !range !0
	; CHECK-NEXT: [[LIM:%.*]] = sub i32 [[LENGTH]], 1
	; CHECK-NEXT: [[ENTRY_COND:%.*]] = icmp sgt i32 [[LENGTH]], 1			; CHECK-NEXT: [[ENTRY_COND:%.*]] = icmp sgt i32 [[LENGTH]], 1
	; CHECK-NEXT: br i1 [[ENTRY_COND]], label [[LOOP_PREHEADER:%.]], label [[LEAVE:%.]]			; CHECK-NEXT: br i1 [[ENTRY_COND]], label [[LOOP_PREHEADER:%.]], label [[LEAVE:%.]]
	; CHECK: loop.preheader:			; CHECK: loop.preheader:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]
	; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]			; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
	; CHECK: be:			; CHECK: be:
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[BE_COND:%.*]] = icmp sle i32 [[IV_INC]], [[LIM]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_INC]], [[LENGTH]]
	; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]
	; CHECK: leave.loopexit:			; CHECK: leave.loopexit:
	; CHECK-NEXT: br label [[LEAVE]]			; CHECK-NEXT: br label [[LEAVE]]
	; CHECK: leave:			; CHECK: leave:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%length = load i32, i32* %length.ptr, !range !0			%length = load i32, i32* %length.ptr, !range !0
	%lim = sub i32 %length, 1			%lim = sub i32 %length, 1
	Show All 24 Lines
	; CHECK: loop.preheader:			; CHECK: loop.preheader:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 4, [[LOOP_PREHEADER]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ [[IV_INC:%.]], [[BE:%.*]] ], [ 4, [[LOOP_PREHEADER]] ]
	; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]			; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
	; CHECK: be:			; CHECK: be:
	; CHECK-NEXT: call void @side_effect()			; CHECK-NEXT: call void @side_effect()
	; CHECK-NEXT: [[BE_COND:%.*]] = icmp ult i32 [[IV_INC]], [[LENGTH]]			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_INC]], [[LENGTH]]
	; CHECK-NEXT: br i1 [[BE_COND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[LEAVE_LOOPEXIT]]
	; CHECK: leave.loopexit:			; CHECK: leave.loopexit:
	; CHECK-NEXT: br label [[LEAVE]]			; CHECK-NEXT: br label [[LEAVE]]
	; CHECK: leave:			; CHECK: leave:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%length = load i32, i32* %length.ptr, !range !0			%length = load i32, i32* %length.ptr, !range !0
	%entry.cond = icmp ult i32 4, %length			%entry.cond = icmp ult i32 4, %length
	▲ Show 20 Lines • Show All 60 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/IndVarSimplify/lftr-multi-exit.ll

	Show All 9 Lines
	@A = external global i32			@A = external global i32

	define void @analyzeable_early_exit(i32 %n) {			define void @analyzeable_early_exit(i32 %n) {
	; CHECK-LABEL: @analyzeable_early_exit(			; CHECK-LABEL: @analyzeable_early_exit(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]
	; CHECK-NEXT: [[EARLYCND:%.]] = icmp ult i32 [[IV]], [[N:%.]]			; CHECK-NEXT: [[EXITCOND:%.]] = icmp ne i32 [[IV]], [[N:%.]]
	; CHECK-NEXT: br i1 [[EARLYCND]], label [[LATCH]], label [[EXIT:%.*]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LATCH]], label [[EXIT:%.*]]
	; CHECK: latch:			; CHECK: latch:
	; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: store i32 [[IV]], i32* @A			; CHECK-NEXT: store i32 [[IV]], i32* @A
	; CHECK-NEXT: [[C:%.*]] = icmp ult i32 [[IV_NEXT]], 1000			; CHECK-NEXT: [[EXITCOND1:%.*]] = icmp ne i32 [[IV_NEXT]], 1000
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT]]			; CHECK-NEXT: br i1 [[EXITCOND1]], label [[LOOP]], label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	br label %loop			br label %loop

	loop:			loop:
	%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]			%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]
	Show All 17 Lines
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]
	; CHECK-NEXT: [[VOL:%.]] = load volatile i32, i32 @A			; CHECK-NEXT: [[VOL:%.]] = load volatile i32, i32 @A
	; CHECK-NEXT: [[EARLYCND:%.*]] = icmp ne i32 [[VOL]], 0			; CHECK-NEXT: [[EARLYCND:%.*]] = icmp ne i32 [[VOL]], 0
	; CHECK-NEXT: br i1 [[EARLYCND]], label [[LATCH]], label [[EXIT:%.*]]			; CHECK-NEXT: br i1 [[EARLYCND]], label [[LATCH]], label [[EXIT:%.*]]
	; CHECK: latch:			; CHECK: latch:
	; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: store i32 [[IV]], i32* @A			; CHECK-NEXT: store i32 [[IV]], i32* @A
	; CHECK-NEXT: [[C:%.*]] = icmp ult i32 [[IV_NEXT]], 1000			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_NEXT]], 1000
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	br label %loop			br label %loop

	loop:			loop:
	%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]			%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]
	Show All 13 Lines


	define void @multiple_early_exits(i32 %n, i32 %m) {			define void @multiple_early_exits(i32 %n, i32 %m) {
	; CHECK-LABEL: @multiple_early_exits(			; CHECK-LABEL: @multiple_early_exits(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]
	; CHECK-NEXT: [[EARLYCND:%.]] = icmp ult i32 [[IV]], [[N:%.]]			; CHECK-NEXT: [[EXITCOND:%.]] = icmp ne i32 [[IV]], [[N:%.]]
	; CHECK-NEXT: br i1 [[EARLYCND]], label [[CONTINUE:%.]], label [[EXIT:%.]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[CONTINUE:%.]], label [[EXIT:%.]]
	; CHECK: continue:			; CHECK: continue:
	; CHECK-NEXT: store volatile i32 [[IV]], i32* @A			; CHECK-NEXT: store volatile i32 [[IV]], i32* @A
	; CHECK-NEXT: [[EARLYCND2:%.]] = icmp ult i32 [[IV]], [[M:%.]]			; CHECK-NEXT: [[EXITCOND1:%.]] = icmp ne i32 [[IV]], [[M:%.]]
	; CHECK-NEXT: br i1 [[EARLYCND2]], label [[LATCH]], label [[EXIT]]			; CHECK-NEXT: br i1 [[EXITCOND1]], label [[LATCH]], label [[EXIT]]
	; CHECK: latch:			; CHECK: latch:
	; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: store volatile i32 [[IV]], i32* @A			; CHECK-NEXT: store volatile i32 [[IV]], i32* @A
	; CHECK-NEXT: [[C:%.*]] = icmp ult i32 [[IV_NEXT]], 1000			; CHECK-NEXT: [[EXITCOND2:%.*]] = icmp ne i32 [[IV_NEXT]], 1000
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT]]			; CHECK-NEXT: br i1 [[EXITCOND2]], label [[LOOP]], label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	br label %loop			br label %loop

	loop:			loop:
	%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]			%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]
	Show All 25 Lines
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]
	; CHECK-NEXT: [[EARLYCND:%.]] = icmp ult i32 [[IV]], [[N:%.]]			; CHECK-NEXT: [[EARLYCND:%.]] = icmp ult i32 [[IV]], [[N:%.]]
	; CHECK-NEXT: [[EARLYCND2:%.]] = icmp ult i32 [[IV]], [[M:%.]]			; CHECK-NEXT: [[EARLYCND2:%.]] = icmp ult i32 [[IV]], [[M:%.]]
	; CHECK-NEXT: [[AND:%.*]] = and i1 [[EARLYCND]], [[EARLYCND2]]			; CHECK-NEXT: [[AND:%.*]] = and i1 [[EARLYCND]], [[EARLYCND2]]
	; CHECK-NEXT: br i1 [[AND]], label [[LATCH]], label [[EXIT:%.*]]			; CHECK-NEXT: br i1 [[AND]], label [[LATCH]], label [[EXIT:%.*]]
	; CHECK: latch:			; CHECK: latch:
	; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: store volatile i32 [[IV]], i32* @A			; CHECK-NEXT: store volatile i32 [[IV]], i32* @A
	; CHECK-NEXT: [[C:%.*]] = icmp ult i32 [[IV_NEXT]], 1000			; CHECK-NEXT: [[EXITCOND:%.*]] = icmp ne i32 [[IV_NEXT]], 1000
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LOOP]], label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	br label %loop			br label %loop

	loop:			loop:
	%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]			%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]
	Show All 14 Lines


	define void @unanalyzeable_latch(i32 %n) {			define void @unanalyzeable_latch(i32 %n) {
	; CHECK-LABEL: @unanalyzeable_latch(			; CHECK-LABEL: @unanalyzeable_latch(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]
	; CHECK-NEXT: [[EARLYCND:%.]] = icmp ult i32 [[IV]], [[N:%.]]			; CHECK-NEXT: [[EXITCOND:%.]] = icmp ne i32 [[IV]], [[N:%.]]
	; CHECK-NEXT: br i1 [[EARLYCND]], label [[LATCH]], label [[EXIT:%.*]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LATCH]], label [[EXIT:%.*]]
	; CHECK: latch:			; CHECK: latch:
	; CHECK-NEXT: [[IV_NEXT]] = add i32 [[IV]], 1			; CHECK-NEXT: [[IV_NEXT]] = add i32 [[IV]], 1
	; CHECK-NEXT: store i32 [[IV]], i32* @A			; CHECK-NEXT: store i32 [[IV]], i32* @A
	; CHECK-NEXT: [[VOL:%.]] = load volatile i32, i32 @A			; CHECK-NEXT: [[VOL:%.]] = load volatile i32, i32 @A
	; CHECK-NEXT: [[C:%.*]] = icmp ult i32 [[VOL]], 1000			; CHECK-NEXT: [[C:%.*]] = icmp ult i32 [[VOL]], 1000
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT]]			; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines
	; Multiple exits which could be LFTRed, but the latch itself is not an			; Multiple exits which could be LFTRed, but the latch itself is not an
	; exiting block.			; exiting block.
	define void @no_latch_exit(i32 %n, i32 %m) {			define void @no_latch_exit(i32 %n, i32 %m) {
	; CHECK-LABEL: @no_latch_exit(			; CHECK-LABEL: @no_latch_exit(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]
	; CHECK-NEXT: [[EARLYCND:%.]] = icmp ult i32 [[IV]], [[N:%.]]			; CHECK-NEXT: [[EXITCOND:%.]] = icmp ne i32 [[IV]], [[N:%.]]
	; CHECK-NEXT: br i1 [[EARLYCND]], label [[CONTINUE:%.]], label [[EXIT:%.]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[CONTINUE:%.]], label [[EXIT:%.]]
	; CHECK: continue:			; CHECK: continue:
	; CHECK-NEXT: store volatile i32 [[IV]], i32* @A			; CHECK-NEXT: store volatile i32 [[IV]], i32* @A
	; CHECK-NEXT: [[EARLYCND2:%.]] = icmp ult i32 [[IV]], [[M:%.]]			; CHECK-NEXT: [[EXITCOND1:%.]] = icmp ne i32 [[IV]], [[M:%.]]
	; CHECK-NEXT: br i1 [[EARLYCND2]], label [[LATCH]], label [[EXIT]]			; CHECK-NEXT: br i1 [[EXITCOND1]], label [[LATCH]], label [[EXIT]]
	; CHECK: latch:			; CHECK: latch:
	; CHECK-NEXT: store volatile i32 [[IV]], i32* @A			; CHECK-NEXT: store volatile i32 [[IV]], i32* @A
	; CHECK-NEXT: [[IV_NEXT]] = add i32 [[IV]], 1			; CHECK-NEXT: [[IV_NEXT]] = add i32 [[IV]], 1
	; CHECK-NEXT: br label [[LOOP]]			; CHECK-NEXT: br label [[LOOP]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	Show All 21 Lines
	;; Show the value of multiple exit LFTR (being able to eliminate all but			;; Show the value of multiple exit LFTR (being able to eliminate all but
	;; one IV when exit tests involve multiple IVs).			;; one IV when exit tests involve multiple IVs).
	define void @combine_ivs(i32 %n) {			define void @combine_ivs(i32 %n) {
	; CHECK-LABEL: @combine_ivs(			; CHECK-LABEL: @combine_ivs(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]
	; CHECK-NEXT: [[IV2:%.]] = phi i32 [ 1, [[ENTRY]] ], [ [[IV2_NEXT:%.]], [[LATCH]] ]			; CHECK-NEXT: [[EXITCOND:%.]] = icmp ne i32 [[IV]], [[N:%.]]
	; CHECK-NEXT: [[EARLYCND:%.]] = icmp ult i32 [[IV]], [[N:%.]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LATCH]], label [[EXIT:%.*]]
	; CHECK-NEXT: br i1 [[EARLYCND]], label [[LATCH]], label [[EXIT:%.*]]
	; CHECK: latch:			; CHECK: latch:
	; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: [[IV2_NEXT]] = add nuw nsw i32 [[IV2]], 1
	; CHECK-NEXT: store volatile i32 [[IV]], i32* @A			; CHECK-NEXT: store volatile i32 [[IV]], i32* @A
	; CHECK-NEXT: [[C:%.*]] = icmp ult i32 [[IV2_NEXT]], 1000			; CHECK-NEXT: [[EXITCOND1:%.*]] = icmp ne i32 [[IV_NEXT]], 999
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT]]			; CHECK-NEXT: br i1 [[EXITCOND1]], label [[LOOP]], label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	br label %loop			br label %loop

	loop:			loop:
	%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]			%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]
	Show All 14 Lines

	; We can remove the decrementing IV entirely			; We can remove the decrementing IV entirely
	define void @combine_ivs2(i32 %n) {			define void @combine_ivs2(i32 %n) {
	; CHECK-LABEL: @combine_ivs2(			; CHECK-LABEL: @combine_ivs2(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]
	; CHECK-NEXT: [[IV2:%.]] = phi i32 [ 1000, [[ENTRY]] ], [ [[IV2_NEXT:%.]], [[LATCH]] ]			; CHECK-NEXT: [[EXITCOND:%.]] = icmp ne i32 [[IV]], [[N:%.]]
	; CHECK-NEXT: [[EARLYCND:%.]] = icmp ult i32 [[IV]], [[N:%.]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LATCH]], label [[EXIT:%.*]]
	; CHECK-NEXT: br i1 [[EARLYCND]], label [[LATCH]], label [[EXIT:%.*]]
	; CHECK: latch:			; CHECK: latch:
	; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: [[IV2_NEXT]] = sub nuw nsw i32 [[IV2]], 1
	; CHECK-NEXT: store volatile i32 [[IV]], i32* @A			; CHECK-NEXT: store volatile i32 [[IV]], i32* @A
	; CHECK-NEXT: [[C:%.*]] = icmp ugt i32 [[IV2_NEXT]], 0			; CHECK-NEXT: [[EXITCOND1:%.*]] = icmp ne i32 [[IV_NEXT]], 1000
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT]]			; CHECK-NEXT: br i1 [[EXITCOND1]], label [[LOOP]], label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	br label %loop			br label %loop

	loop:			loop:
	%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]			%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]
	Show All 15 Lines
	; An example where we can eliminate an f(i) computation entirely			; An example where we can eliminate an f(i) computation entirely
	; from a multiple exit loop with LFTR.			; from a multiple exit loop with LFTR.
	define void @simplify_exit_test(i32 %n) {			define void @simplify_exit_test(i32 %n) {
	; CHECK-LABEL: @simplify_exit_test(			; CHECK-LABEL: @simplify_exit_test(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[LOOP:%.*]]			; CHECK-NEXT: br label [[LOOP:%.*]]
	; CHECK: loop:			; CHECK: loop:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[LATCH:%.]] ]
	; CHECK-NEXT: [[EARLYCND:%.]] = icmp ult i32 [[IV]], [[N:%.]]			; CHECK-NEXT: [[EXITCOND:%.]] = icmp ne i32 [[IV]], [[N:%.]]
	; CHECK-NEXT: br i1 [[EARLYCND]], label [[LATCH]], label [[EXIT:%.*]]			; CHECK-NEXT: br i1 [[EXITCOND]], label [[LATCH]], label [[EXIT:%.*]]
	; CHECK: latch:			; CHECK: latch:
	; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: [[FX:%.*]] = shl i32 [[IV]], 4
	; CHECK-NEXT: store volatile i32 [[IV]], i32* @A			; CHECK-NEXT: store volatile i32 [[IV]], i32* @A
	; CHECK-NEXT: [[C:%.*]] = icmp ult i32 [[FX]], 1024			; CHECK-NEXT: [[EXITCOND1:%.*]] = icmp ne i32 [[IV_NEXT]], 65
	; CHECK-NEXT: br i1 [[C]], label [[LOOP]], label [[EXIT]]			; CHECK-NEXT: br i1 [[EXITCOND1]], label [[LOOP]], label [[EXIT]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	br label %loop			br label %loop

	loop:			loop:
	%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]			%iv = phi i32 [ 0, %entry], [ %iv.next, %latch]
	▲ Show 20 Lines • Show All 54 Lines • Show Last 20 Lines