This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/Scalar/
-
Transforms/
-
Scalar/
-
IndVarSimplify.cpp
-
test/Transforms/IndVarSimplify/
-
Transforms/
-
IndVarSimplify/
-
eliminate-exit.ll
-
loop-predication.ll
-
pr38674.ll

Differential D69009

[IndVars] Eliminate loop exits with equivalent exit counts
ClosedPublic

Authored by reames on Oct 15 2019, 2:56 PM.

Download Raw Diff

Details

Reviewers

nikic
ebrevnov
apilipenko

Commits

rG8cbcd2f484a2: [IndVars] Eliminate loop exits with equivalent exit counts
rL375379: [IndVars] Eliminate loop exits with equivalent exit counts

Summary

We can end up with two loop exits whose exit counts are equivalent, but whose textual representation is different and non-obvious. For the sub-case where we have a series of exits which dominate one another (common), eliminate any exits which would iterate *after* a previous exit on the exiting iteration.

As noted in the TODO being removed, I'd always thought this was a good idea, but I've now seen this in a real workload as well. This needs to be rebased on D68956 as both need the same dominance order check.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

reames created this revision.Oct 15 2019, 2:56 PM

Herald added a project: Restricted Project. · View Herald TranscriptOct 15 2019, 2:56 PM

Herald added subscribers: mgrang, bollu, mcrosier. · View Herald Transcript

ebrevnov added inline comments.Oct 16 2019, 1:05 AM

lib/Transforms/Scalar/IndVarSimplify.cpp
2758 ↗	(On Diff #225121)	That would be a fourth loop filtering out "bad" exits before actual work. And the method becomes pretty long. I think we need to refactor in this or consequent patch.
2793 ↗	(On Diff #225121)	Two notes here. 1) Any reason not to use isKnownViaNonRecursiveReasoning or isKnownPredicate? 2) We can actually eliminate an exit if its exit count not less than exit count of one of its predecessors. None of these is blocking the current patch though.

reames mentioned this in rL375138: [IndVars] Split loop predication out of optimizeLoopExits [NFC].Oct 17 2019, 10:33 AM

reames mentioned this in rGe51d57d64a4d: [IndVars] Split loop predication out of optimizeLoopExits [NFC].

Rebase over landed loop pred change, and some refactoring to address a few concerns brought up in the review.

reames marked 2 inline comments as done.Oct 17 2019, 10:45 AM

reames added inline comments.

lib/Transforms/Scalar/IndVarSimplify.cpp
2758 ↗	(On Diff #225121)	What do you think of the committed NFC reorgs? If you have suggestions, I'm happy to further tweak.
2793 ↗	(On Diff #225121)	I believe you're referring not to the EQ case, but to the provably greater then case just above right? I could generalize, but I don't have an example which illustrates the difference. SCEV tries to canonicalize at construction (as demonstrated by some of the test changes). If you don't mind, I'd like to wait until we have a motivating example before using a more expensive API.

nikic added inline comments.Oct 17 2019, 12:06 PM

lib/Transforms/Scalar/IndVarSimplify.cpp
2804 ↗	(On Diff #225465)	This code is already repeated three times now, extract?

reames marked an inline comment as done.Oct 17 2019, 3:00 PM

reames added inline comments.

lib/Transforms/Scalar/IndVarSimplify.cpp
2804 ↗	(On Diff #225465)	Not quite. We're folding the branch either to taken or untaken, but yes, it could be factored out. Will do, and rebase.

reames mentioned this in rL375191: [IndVars] Factor out some common code into a utility function.Oct 17 2019, 4:49 PM

reames mentioned this in rG8eaa5b9abab3: [IndVars] Factor out some common code into a utility function.

Rebase on requested refactor.

nikic added inline comments.Oct 18 2019, 11:44 AM

lib/Transforms/Scalar/IndVarSimplify.cpp
2796 ↗	(On Diff #225548)	Now that we're traversing the exits in dominating order, I think we should consider a slightly different overall approach: Instead of computing `umin({ exit counts })` upfront and then checking for `umin({ exit counts }) < this exit count`, we can instead build this up incrementally, i.e. take the umin between the current max exit count and the exit count of the currently considered exit at each iteration. This will allow us to check for `umin({ already seen exit counts }) <= this exit count` (note the `<=`!) instead. This should implicitly handle the case of exact equalities, but may also be more powerful in general because we're checking for a weaker predicate.

reames marked an inline comment as done.Oct 19 2019, 10:03 AM

reames added inline comments.

lib/Transforms/Scalar/IndVarSimplify.cpp
2796 ↗	(On Diff #225548)	The suggested approach doesn't handle one of the most common cases. If the latch exit is in fact the minimum exit, we want to be able to discharge earlier exits. (i.e. the difference between the dominating set and the total sets is important) I do agree that your phrasing might be more powerful for eliminating exits which SCEV can prove are <= dominating checks (but not either < or ==). However, I don't have a test case which demonstrates that difference, and until I do, I'd like to avoid further complicating the code. (Assuming I'd need both approaches due to the gap described above.)

nikic added inline comments.Oct 19 2019, 1:28 PM

lib/Transforms/Scalar/IndVarSimplify.cpp
2796 ↗	(On Diff #225548)	For an example, consider the existing `@ult` test just using a `ule` predicate instead: https://godbolt.org/z/j-rZeb This should eliminate the second exit, but doesn't, as we have neither the `<` predicate nor the strict scev identity (but do have a `<=` predicate). I didn't quite follow what you mean regarding the latch exit and why it is special. Is there a test case that shows the issue?

lib/Transforms/Scalar/IndVarSimplify.cpp
2796 ↗	(On Diff #225548)	Okay, I see what you mean now. Not specifically the latch, just any future exit. It does seem like we'd have to combine both approaches to get the full coverage, so let's go with your current variant for now.

This revision is now accepted and ready to land.Oct 19 2019, 2:54 PM

Closed by commit rG8cbcd2f484a2: [IndVars] Eliminate loop exits with equivalent exit counts (authored by reames). · Explain WhyOct 20 2019, 4:43 PM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: hiraditya. · View Herald TranscriptOct 20 2019, 4:43 PM

reames mentioned this in rGe884843d7839: [IndVars] Add a todo to reflect a further oppurtunity identified in D69009.Oct 20 2019, 4:43 PM

reames mentioned this in rL375380: [IndVars] Add a todo to reflect a further oppurtunity identified in D69009.

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Scalar/

IndVarSimplify.cpp

32 lines

test/

Transforms/

IndVarSimplify/

eliminate-exit.ll

34 lines

loop-predication.ll

9 lines

pr38674.ll

5 lines

Diff 225810

llvm/lib/Transforms/Scalar/IndVarSimplify.cpp

Show First 20 Lines • Show All 2,711 Lines • ▼ Show 20 Lines	bool IndVarSimplify::optimizeLoopExits(Loop *L, SCEVExpander &Rewriter) {
if (ExitingBlocks.empty())		if (ExitingBlocks.empty())
return false;		return false;

// Get a symbolic upper bound on the loop backedge taken count.		// Get a symbolic upper bound on the loop backedge taken count.
const SCEV MaxExitCount = getMaxBackedgeTakenCount(SE, *DT, L);		const SCEV MaxExitCount = getMaxBackedgeTakenCount(SE, *DT, L);
if (isa<SCEVCouldNotCompute>(MaxExitCount))		if (isa<SCEVCouldNotCompute>(MaxExitCount))
return false;		return false;

		// Visit our exit blocks in order of dominance. We know from the fact that
		// all exits (left) are analyzeable that the must be a total dominance order
		// between them as each must dominate the latch. The visit order only
		// matters for the provably equal case.
		llvm::sort(ExitingBlocks,
		[&](BasicBlock A, BasicBlock B) {
		// std::sort sorts in ascending order, so we want the inverse of
		// the normal dominance relation.
		if (DT->properlyDominates(A, B)) return true;
		if (DT->properlyDominates(B, A)) return false;
		llvm_unreachable("expected total dominance order!");
		});
		#ifdef ASSERT
		for (unsigned i = 1; i < ExitingBlocks.size(); i++) {
		assert(DT->dominates(ExitingBlocks[i-1], ExitingBlocks[i]));
		}
		#endif

auto FoldExit = [&](BasicBlock *ExitingBB, bool IsTaken) {		auto FoldExit = [&](BasicBlock *ExitingBB, bool IsTaken) {
BranchInst *BI = cast<BranchInst>(ExitingBB->getTerminator());		BranchInst *BI = cast<BranchInst>(ExitingBB->getTerminator());
bool ExitIfTrue = !L->contains(*succ_begin(ExitingBB));		bool ExitIfTrue = !L->contains(*succ_begin(ExitingBB));
auto *OldCond = BI->getCondition();		auto *OldCond = BI->getCondition();
auto *NewCond = ConstantInt::get(OldCond->getType(),		auto *NewCond = ConstantInt::get(OldCond->getType(),
IsTaken ? ExitIfTrue : !ExitIfTrue);		IsTaken ? ExitIfTrue : !ExitIfTrue);
BI->setCondition(NewCond);		BI->setCondition(NewCond);
if (OldCond->use_empty())		if (OldCond->use_empty())
DeadInsts.push_back(OldCond);		DeadInsts.push_back(OldCond);
};		};

bool Changed = false;		bool Changed = false;
		SmallSet<const SCEV*, 8> DominatingExitCounts;
for (BasicBlock *ExitingBB : ExitingBlocks) {		for (BasicBlock *ExitingBB : ExitingBlocks) {
const SCEV *ExitCount = SE->getExitCount(L, ExitingBB);		const SCEV *ExitCount = SE->getExitCount(L, ExitingBB);
assert(!isa<SCEVCouldNotCompute>(ExitCount) && "checked above");		assert(!isa<SCEVCouldNotCompute>(ExitCount) && "checked above");

// If we know we'd exit on the first iteration, rewrite the exit to		// If we know we'd exit on the first iteration, rewrite the exit to
// reflect this. This does not imply the loop must exit through this		// reflect this. This does not imply the loop must exit through this
// exit; there may be an earlier one taken on the first iteration.		// exit; there may be an earlier one taken on the first iteration.
// TODO: Given we know the backedge can't be taken, we should go ahead		// TODO: Given we know the backedge can't be taken, we should go ahead
Show All 21 Lines	for (BasicBlock *ExitingBB : ExitingBlocks) {
// one?		// one?
if (SE->isLoopEntryGuardedByCond(L, CmpInst::ICMP_ULT,		if (SE->isLoopEntryGuardedByCond(L, CmpInst::ICMP_ULT,
MaxExitCount, ExitCount)) {		MaxExitCount, ExitCount)) {
FoldExit(ExitingBB, false);		FoldExit(ExitingBB, false);
Changed = true;		Changed = true;
continue;		continue;
}		}

// TODO: If we can prove that the exiting iteration is equal to the exit		// As we run, keep track of which exit counts we've encountered. If we
// count for this exit and that no previous exit oppurtunities exist within		// find a duplicate, we've found an exit which would have exited on the
// the loop, then we can discharge all other exits. (May fall out of		// exiting iteration, but (from the visit order) strictly follows another
// previous TODO.)		// which does the same and is thus dead.
		if (!DominatingExitCounts.insert(ExitCount).second) {
		FoldExit(ExitingBB, false);
		Changed = true;
		continue;
		}
}		}
return Changed;		return Changed;
}		}

bool IndVarSimplify::predicateLoopExits(Loop *L, SCEVExpander &Rewriter) {		bool IndVarSimplify::predicateLoopExits(Loop *L, SCEVExpander &Rewriter) {
SmallVector<BasicBlock*, 16> ExitingBlocks;		SmallVector<BasicBlock*, 16> ExitingBlocks;
L->getExitingBlocks(ExitingBlocks);		L->getExitingBlocks(ExitingBlocks);

▲ Show 20 Lines • Show All 388 Lines • Show Last 20 Lines

llvm/test/Transforms/IndVarSimplify/eliminate-exit.ll

	Show First 20 Lines • Show All 179 Lines • ▼ Show 20 Lines
	latch:			latch:
	call void @side_effect()			call void @side_effect()
	%cmp2 = icmp ult i64 %iv, %n			%cmp2 = icmp ult i64 %iv, %n
	br i1 %cmp2, label %loop, label %exit			br i1 %cmp2, label %loop, label %exit
	exit:			exit:
	ret void			ret void
	}			}

				define void @mixed_width(i32 %len) {
				; CHECK-LABEL: @mixed_width(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[LEN_ZEXT:%.]] = zext i32 [[LEN:%.]] to i64
				; CHECK-NEXT: br label [[LOOP:%.*]]
				; CHECK: loop:
				; CHECK-NEXT: [[IV:%.]] = phi i64 [ 0, [[ENTRY:%.]] ], [ [[IV_NEXT:%.]], [[BACKEDGE:%.]] ]
				; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i64 [[IV]], 1
				; CHECK-NEXT: [[CMP1:%.*]] = icmp ult i64 [[IV]], [[LEN_ZEXT]]
				; CHECK-NEXT: br i1 [[CMP1]], label [[BACKEDGE]], label [[EXIT:%.*]]
				; CHECK: backedge:
				; CHECK-NEXT: call void @side_effect()
				; CHECK-NEXT: br i1 true, label [[LOOP]], label [[EXIT]]
				; CHECK: exit:
				; CHECK-NEXT: ret void
				;
				entry:
				%len.zext = zext i32 %len to i64
				br label %loop
				loop:
				%iv = phi i64 [0, %entry], [%iv.next, %backedge]
				%iv2 = phi i32 [0, %entry], [%iv2.next, %backedge]
				%iv.next = add i64 %iv, 1
				%iv2.next = add i32 %iv2, 1
				%cmp1 = icmp ult i64 %iv, %len.zext
				br i1 %cmp1, label %backedge, label %exit

				backedge:
				call void @side_effect()
				%cmp2 = icmp ult i32 %iv2, %len
				br i1 %cmp2, label %loop, label %exit
				exit:
				ret void
				}

	declare void @side_effect()			declare void @side_effect()

llvm/test/Transforms/IndVarSimplify/loop-predication.ll

Show First 20 Lines • Show All 458 Lines • ▼ Show 20 Lines
; CHECK-LABEL: @duplicate_checks(		; CHECK-LABEL: @duplicate_checks(
; CHECK-NEXT: loop.preheader:		; CHECK-NEXT: loop.preheader:
; CHECK-NEXT: [[TMP0:%.]] = icmp ugt i32 [[N:%.]], 1		; CHECK-NEXT: [[TMP0:%.]] = icmp ugt i32 [[N:%.]], 1
; CHECK-NEXT: [[UMAX:%.*]] = select i1 [[TMP0]], i32 [[N]], i32 1		; CHECK-NEXT: [[UMAX:%.*]] = select i1 [[TMP0]], i32 [[N]], i32 1
; CHECK-NEXT: [[TMP1:%.*]] = add i32 [[UMAX]], -1		; CHECK-NEXT: [[TMP1:%.*]] = add i32 [[UMAX]], -1
; CHECK-NEXT: [[TMP2:%.]] = icmp ult i32 [[LENGTH:%.]], [[TMP1]]		; CHECK-NEXT: [[TMP2:%.]] = icmp ult i32 [[LENGTH:%.]], [[TMP1]]
; CHECK-NEXT: [[UMIN:%.*]] = select i1 [[TMP2]], i32 [[LENGTH]], i32 [[TMP1]]		; CHECK-NEXT: [[UMIN:%.*]] = select i1 [[TMP2]], i32 [[LENGTH]], i32 [[TMP1]]
; CHECK-NEXT: [[TMP3:%.*]] = icmp ne i32 [[LENGTH]], [[UMIN]]		; CHECK-NEXT: [[TMP3:%.*]] = icmp ne i32 [[LENGTH]], [[UMIN]]
; CHECK-NEXT: [[TMP4:%.*]] = icmp ne i32 [[LENGTH]], [[UMIN]]
; CHECK-NEXT: br label [[LOOP:%.*]]		; CHECK-NEXT: br label [[LOOP:%.*]]
; CHECK: loop:		; CHECK: loop:
; CHECK-NEXT: [[LOOP_ACC:%.]] = phi i32 [ [[LOOP_ACC_NEXT:%.]], [[GUARDED1:%.]] ], [ 0, [[LOOP_PREHEADER:%.]] ]		; CHECK-NEXT: [[LOOP_ACC:%.]] = phi i32 [ [[LOOP_ACC_NEXT:%.]], [[GUARDED1:%.]] ], [ 0, [[LOOP_PREHEADER:%.]] ]
; CHECK-NEXT: [[I:%.]] = phi i32 [ [[I_NEXT:%.]], [[GUARDED1]] ], [ 0, [[LOOP_PREHEADER]] ]		; CHECK-NEXT: [[I:%.]] = phi i32 [ [[I_NEXT:%.]], [[GUARDED1]] ], [ 0, [[LOOP_PREHEADER]] ]
; CHECK-NEXT: br i1 [[TMP3]], label [[GUARDED:%.]], label [[DEOPT:%.]], !prof !0		; CHECK-NEXT: br i1 [[TMP3]], label [[GUARDED:%.]], label [[DEOPT:%.]], !prof !0
; CHECK: deopt:		; CHECK: deopt:
; CHECK-NEXT: call void @prevent_merging()		; CHECK-NEXT: call void @prevent_merging()
; CHECK-NEXT: ret i32 -1		; CHECK-NEXT: ret i32 -1
; CHECK: guarded:		; CHECK: guarded:
; CHECK-NEXT: [[I_I64:%.*]] = zext i32 [[I]] to i64		; CHECK-NEXT: [[I_I64:%.*]] = zext i32 [[I]] to i64
; CHECK-NEXT: [[ARRAY_1_I_PTR:%.]] = getelementptr inbounds i32, i32 [[ARRAY_1:%.*]], i64 [[I_I64]]		; CHECK-NEXT: [[ARRAY_1_I_PTR:%.]] = getelementptr inbounds i32, i32 [[ARRAY_1:%.*]], i64 [[I_I64]]
; CHECK-NEXT: [[ARRAY_1_I:%.]] = load i32, i32 [[ARRAY_1_I_PTR]], align 4		; CHECK-NEXT: [[ARRAY_1_I:%.]] = load i32, i32 [[ARRAY_1_I_PTR]], align 4
; CHECK-NEXT: [[LOOP_ACC_1:%.*]] = add i32 [[LOOP_ACC]], [[ARRAY_1_I]]		; CHECK-NEXT: [[LOOP_ACC_1:%.*]] = add i32 [[LOOP_ACC]], [[ARRAY_1_I]]
; CHECK-NEXT: br i1 [[TMP4]], label [[GUARDED1]], label [[DEOPT2:%.*]], !prof !0		; CHECK-NEXT: br i1 true, label [[GUARDED1]], label [[DEOPT2:%.*]], !prof !0
; CHECK: deopt2:		; CHECK: deopt2:
; CHECK-NEXT: call void @prevent_merging()		; CHECK-NEXT: call void @prevent_merging()
; CHECK-NEXT: ret i32 -1		; CHECK-NEXT: ret i32 -1
; CHECK: guarded1:		; CHECK: guarded1:
; CHECK-NEXT: [[ARRAY_3_I_PTR:%.]] = getelementptr inbounds i32, i32 [[ARRAY_3:%.*]], i64 [[I_I64]]		; CHECK-NEXT: [[ARRAY_3_I_PTR:%.]] = getelementptr inbounds i32, i32 [[ARRAY_3:%.*]], i64 [[I_I64]]
; CHECK-NEXT: [[ARRAY_3_I:%.]] = load i32, i32 [[ARRAY_3_I_PTR]], align 4		; CHECK-NEXT: [[ARRAY_3_I:%.]] = load i32, i32 [[ARRAY_3_I_PTR]], align 4
; CHECK-NEXT: [[LOOP_ACC_NEXT]] = add i32 [[LOOP_ACC_1]], [[ARRAY_3_I]]		; CHECK-NEXT: [[LOOP_ACC_NEXT]] = add i32 [[LOOP_ACC_1]], [[ARRAY_3_I]]
; CHECK-NEXT: [[I_NEXT]] = add nuw i32 [[I]], 1		; CHECK-NEXT: [[I_NEXT]] = add nuw i32 [[I]], 1
▲ Show 20 Lines • Show All 289 Lines • ▼ Show 20 Lines
exit:		exit:
%result = phi i32 [ %loop.acc.next, %guarded ], [0, %entry]		%result = phi i32 [ %loop.acc.next, %guarded ], [0, %entry]
ret i32 %result		ret i32 %result
}		}

; If we have a dominating exit (exit1) which can't be itself rewritten, we		; If we have a dominating exit (exit1) which can't be itself rewritten, we
; can't rewrite a later exit (exit2). Doing so would cause the loop to exit		; can't rewrite a later exit (exit2). Doing so would cause the loop to exit
; from the exit2 when it should have exited from exit1.		; from the exit2 when it should have exited from exit1.
define i32 @neg_dominating_exit(i32* %array, i32 %length, i32 %n) {		define i32 @neg_dominating_exit(i32* %array, i32 %length, i32 %length2, i32 %n) {
; CHECK-LABEL: @neg_dominating_exit(		; CHECK-LABEL: @neg_dominating_exit(
; CHECK-NEXT: loop.preheader:		; CHECK-NEXT: loop.preheader:
; CHECK-NEXT: br label [[LOOP:%.*]]		; CHECK-NEXT: br label [[LOOP:%.*]]
; CHECK: loop:		; CHECK: loop:
; CHECK-NEXT: [[LOOP_ACC:%.]] = phi i32 [ [[LOOP_ACC_NEXT:%.]], [[GUARDED2:%.]] ], [ 0, [[LOOP_PREHEADER:%.]] ]		; CHECK-NEXT: [[LOOP_ACC:%.]] = phi i32 [ [[LOOP_ACC_NEXT:%.]], [[GUARDED2:%.]] ], [ 0, [[LOOP_PREHEADER:%.]] ]
; CHECK-NEXT: [[I:%.]] = phi i32 [ [[I_NEXT:%.]], [[GUARDED2]] ], [ 0, [[LOOP_PREHEADER]] ]		; CHECK-NEXT: [[I:%.]] = phi i32 [ [[I_NEXT:%.]], [[GUARDED2]] ], [ 0, [[LOOP_PREHEADER]] ]
; CHECK-NEXT: [[WITHIN_BOUNDS:%.]] = icmp ult i32 [[I]], [[LENGTH:%.]]		; CHECK-NEXT: [[WITHIN_BOUNDS:%.]] = icmp ult i32 [[I]], [[LENGTH:%.]]
; CHECK-NEXT: br i1 [[WITHIN_BOUNDS]], label [[GUARDED:%.]], label [[DEOPT:%.]], !prof !0		; CHECK-NEXT: br i1 [[WITHIN_BOUNDS]], label [[GUARDED:%.]], label [[DEOPT:%.]], !prof !0
; CHECK: deopt:		; CHECK: deopt:
; CHECK-NEXT: [[RESULT:%.*]] = phi i32 [ [[LOOP_ACC]], [[LOOP]] ]		; CHECK-NEXT: [[RESULT:%.*]] = phi i32 [ [[LOOP_ACC]], [[LOOP]] ]
; CHECK-NEXT: call void @prevent_merging()		; CHECK-NEXT: call void @prevent_merging()
; CHECK-NEXT: ret i32 [[RESULT]]		; CHECK-NEXT: ret i32 [[RESULT]]
; CHECK: guarded:		; CHECK: guarded:
; CHECK-NEXT: [[WITHIN_BOUNDS2:%.*]] = icmp ult i32 [[I]], [[LENGTH]]		; CHECK-NEXT: [[WITHIN_BOUNDS2:%.]] = icmp ult i32 [[I]], [[LENGTH2:%.]]
; CHECK-NEXT: br i1 [[WITHIN_BOUNDS2]], label [[GUARDED2]], label [[DEOPT2:%.*]], !prof !0		; CHECK-NEXT: br i1 [[WITHIN_BOUNDS2]], label [[GUARDED2]], label [[DEOPT2:%.*]], !prof !0
; CHECK: deopt2:		; CHECK: deopt2:
; CHECK-NEXT: call void @prevent_merging()		; CHECK-NEXT: call void @prevent_merging()
; CHECK-NEXT: ret i32 -1		; CHECK-NEXT: ret i32 -1
; CHECK: guarded2:		; CHECK: guarded2:
; CHECK-NEXT: [[I_I64:%.*]] = zext i32 [[I]] to i64		; CHECK-NEXT: [[I_I64:%.*]] = zext i32 [[I]] to i64
; CHECK-NEXT: [[ARRAY_I_PTR:%.]] = getelementptr inbounds i32, i32 [[ARRAY:%.*]], i64 [[I_I64]]		; CHECK-NEXT: [[ARRAY_I_PTR:%.]] = getelementptr inbounds i32, i32 [[ARRAY:%.*]], i64 [[I_I64]]
; CHECK-NEXT: [[ARRAY_I:%.]] = load i32, i32 [[ARRAY_I_PTR]], align 4		; CHECK-NEXT: [[ARRAY_I:%.]] = load i32, i32 [[ARRAY_I_PTR]], align 4
Show All 15 Lines	loop: ; preds = %guarded, %loop.preheader
br i1 %within.bounds, label %guarded, label %deopt, !prof !0		br i1 %within.bounds, label %guarded, label %deopt, !prof !0

deopt: ; preds = %loop		deopt: ; preds = %loop
%result = phi i32 [ %loop.acc, %loop ]		%result = phi i32 [ %loop.acc, %loop ]
call void @prevent_merging()		call void @prevent_merging()
ret i32 %result		ret i32 %result

guarded: ; preds = %loop		guarded: ; preds = %loop
%within.bounds2 = icmp ult i32 %i, %length		%within.bounds2 = icmp ult i32 %i, %length2
br i1 %within.bounds2, label %guarded2, label %deopt2, !prof !0		br i1 %within.bounds2, label %guarded2, label %deopt2, !prof !0

deopt2: ; preds = %loop		deopt2: ; preds = %loop
call void @prevent_merging()		call void @prevent_merging()
ret i32 -1		ret i32 -1

guarded2: ; preds = %loop		guarded2: ; preds = %loop
%i.i64 = zext i32 %i to i64		%i.i64 = zext i32 %i to i64
Show All 18 Lines

llvm/test/Transforms/IndVarSimplify/pr38674.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -indvars < %s \| FileCheck %s			; RUN: opt -S -indvars < %s \| FileCheck %s

	; Check that we don't reuse %zext instead of %inc11 for LCSSA Phi node. Case			; Check that we don't reuse %zext instead of %inc11 for LCSSA Phi node. Case
	; with constants SCEV.			; with constants SCEV.

	define i32 @test_01() {			define i32 @test_01() {
	; CHECK-LABEL: @test_01(			; CHECK-LABEL: @test_01(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[FOR_COND1_PREHEADER:%.*]]			; CHECK-NEXT: br label [[FOR_COND1_PREHEADER:%.*]]
	; CHECK: for.cond1.preheader:			; CHECK: for.cond1.preheader:
	; CHECK-NEXT: br label [[FOR_COND4_PREHEADER:%.*]]			; CHECK-NEXT: br label [[FOR_COND4_PREHEADER:%.*]]
	; CHECK: for.cond4.preheader:			; CHECK: for.cond4.preheader:
	; CHECK-NEXT: [[ZEXT:%.*]] = zext i16 1 to i32			; CHECK-NEXT: [[ZEXT:%.*]] = zext i16 1 to i32
	; CHECK-NEXT: br label [[FOR_BODY6:%.*]]			; CHECK-NEXT: br label [[FOR_BODY6:%.*]]
	; CHECK: for.cond4:			; CHECK: for.cond4:
	; CHECK-NEXT: [[CMP5:%.]] = icmp ult i32 [[INC:%.]], 2			; CHECK-NEXT: br i1 true, label [[FOR_BODY6]], label [[FOR_END:%.*]]
	; CHECK-NEXT: br i1 [[CMP5]], label [[FOR_BODY6]], label [[FOR_END:%.*]]
	; CHECK: for.body6:			; CHECK: for.body6:
	; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[FOR_COND4_PREHEADER]] ], [ [[INC]], [[FOR_COND4:%.]] ]			; CHECK-NEXT: [[IV:%.]] = phi i32 [ 0, [[FOR_COND4_PREHEADER]] ], [ [[INC:%.]], [[FOR_COND4:%.*]] ]
	; CHECK-NEXT: [[TMP0:%.*]] = icmp eq i32 [[IV]], [[ZEXT]]			; CHECK-NEXT: [[TMP0:%.*]] = icmp eq i32 [[IV]], [[ZEXT]]
	; CHECK-NEXT: [[INC]] = add nuw nsw i32 [[IV]], 1			; CHECK-NEXT: [[INC]] = add nuw nsw i32 [[IV]], 1
	; CHECK-NEXT: br i1 [[TMP0]], label [[RETURN_LOOPEXIT:%.*]], label [[FOR_COND4]]			; CHECK-NEXT: br i1 [[TMP0]], label [[RETURN_LOOPEXIT:%.*]], label [[FOR_COND4]]
	; CHECK: for.end:			; CHECK: for.end:
	; CHECK-NEXT: br i1 false, label [[FOR_COND4_PREHEADER]], label [[FOR_END9:%.*]]			; CHECK-NEXT: br i1 false, label [[FOR_COND4_PREHEADER]], label [[FOR_END9:%.*]]
	; CHECK: for.end9:			; CHECK: for.end9:
	; CHECK-NEXT: br i1 false, label [[FOR_COND1_PREHEADER]], label [[RETURN_LOOPEXIT3:%.*]]			; CHECK-NEXT: br i1 false, label [[FOR_COND1_PREHEADER]], label [[RETURN_LOOPEXIT3:%.*]]
	; CHECK: return.loopexit:			; CHECK: return.loopexit:
	▲ Show 20 Lines • Show All 111 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[IndVars] Eliminate loop exits with equivalent exit countsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 225810

llvm/lib/Transforms/Scalar/IndVarSimplify.cpp

llvm/test/Transforms/IndVarSimplify/eliminate-exit.ll

llvm/test/Transforms/IndVarSimplify/loop-predication.ll

llvm/test/Transforms/IndVarSimplify/pr38674.ll

[IndVars] Eliminate loop exits with equivalent exit counts
ClosedPublic