This is an archive of the discontinued LLVM Phabricator instance.

Differential D98697

[IndVars] Provide eliminateIVComparison with context
ClosedPublic

Authored by mkazantsev on Mar 16 2021, 4:38 AM.

Details

Reviewers

lebedev.ri
nikic
reames
asbirlea

Commits

rG16370e02a715: [IndVars] Provide eliminateIVComparison with context

Summary

We can prove more predicates if we have a context when eliminating an ICmp.
As a first (and very obvious) approximation, we can use the ICmp instruction itself,
though in the future we are going to use a common dominator of all its users.
Some refactoring is needed before that.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

mkazantsev created this revision. Mar 16 2021, 4:38 AM

Herald added a subscriber: hiraditya. Mar 16 2021, 4:38 AM

mkazantsev requested review of this revision. Mar 16 2021, 4:38 AM

Herald added a project: Restricted Project. Mar 16 2021, 4:38 AM

Herald added a subscriber: llvm-commits.

lebedev.ri added inline comments. Mar 16 2021, 4:44 AM

llvm/lib/Transforms/Utils/SimplifyIndVar.cpp
264
266	It's usually called `CtxI`
llvm/test/Transforms/IndVarSimplify/eliminate-comparison.ll
822	Please precommit, it's a bit noisy.

mkazantsev planned changes to this revision. Mar 16 2021, 4:54 AM

mkazantsev added inline comments.

llvm/test/Transforms/IndVarSimplify/scev-expander-preserve-lcssa.ll
119 ↗	(On Diff #330941)	Looks like this is a bug that needs to be fixed before we can go ahead with this.

mkazantsev added inline comments. Mar 16 2021, 5:00 AM

llvm/test/Transforms/IndVarSimplify/scev-expander-preserve-lcssa.ll
119 ↗	(On Diff #330941)	The funny thing is that both `false` and `true` are correct here. `false` is obviously correct because `inc` is strictly positive and `i.0` is non-positive, and this is what is proved without context. However, `true` is also correct because this is guarded by `br i1 %cmp8243, label %for.body84, label %return`, which is an impossible condition. In fact, this code is unreachable, so anything is OK.

lebedev.ri added a comment.

And that's what i was expecting to see: https://llvm-compile-time-tracker.com/compare.php?from=9575c48b8959dae3c3e39e0227435ae6ebd71443&to=c0e80b44e3c80fa9f344b14cbb8435c15f75f94e&stat=instructions
Are the checks already ordered from less costly to more costly?

mkazantsev added inline comments. Mar 16 2021, 5:08 AM

llvm/test/Transforms/IndVarSimplify/scev-expander-preserve-lcssa.ll
119 ↗	(On Diff #330941)	By the way, this points to a further compile-time optimization that we are missing: in blocks guarded by `false` conditions, any predicate can be considered proved automatically, which would save some time.
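
To illustrate the idea in the comment above, here is a minimal sketch of a short-circuit for unreachable contexts. It is a toy model, not ScalarEvolution code: `GuardCondition`, `isContextUnreachable`, and `evaluateInContext` are hypothetical names, and each guard is assumed to be the condition that must hold for control to reach the block.

```cpp
#include <cassert>
#include <optional>
#include <vector>

// Hypothetical model: a dominating branch condition is either a known
// constant (true/false) or unknown (std::nullopt).
using GuardCondition = std::optional<bool>;

// A block is unreachable if any condition required to reach it is
// the constant false.
bool isContextUnreachable(const std::vector<GuardCondition> &Guards) {
  for (const GuardCondition &G : Guards)
    if (G.has_value() && !*G)
      return true;
  return false;
}

// Evaluate some predicate in the given context. In unreachable code any
// answer is sound, so we return one immediately and skip the expensive
// reasoning. ExpensiveQueries counts how often the slow path is taken.
std::optional<bool> evaluateInContext(const std::vector<GuardCondition> &Guards,
                                      int &ExpensiveQueries) {
  if (isContextUnreachable(Guards))
    return true; // vacuously true: the context never executes
  ++ExpensiveQueries; // stand-in for the real (costly) proof attempt
  return std::nullopt; // this toy model proves nothing else
}
```

In this sketch the answer for an unreachable context is arbitrarily chosen to be `true`; returning `false` would be equally valid, which matches the observation that both results are correct for the test in question.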

In D98697#2628661, @lebedev.ri wrote:

And that's what i was expecting to see: https://llvm-compile-time-tracker.com/compare.php?from=9575c48b8959dae3c3e39e0227435ae6ebd71443&to=c0e80b44e3c80fa9f344b14cbb8435c15f75f94e&stat=instructions
Are the checks already ordered from less costly to more costly?

That's sad. I think this is happening because, when false is trivially provable without context, we still spend a lot of time proving true via context. I'll try to find a way around it...

Harbormaster completed remote builds in B94020: Diff 330941. Mar 16 2021, 5:17 AM

reames added a comment.

For the compile time concern, one idea you could consider exploring would be to have a version of isKnownPredicateAt which returns an Optional<bool>. Such a routine could use the cheap proof techniques first (in both directions), and then resort to the more expensive ones. In this particular case, it would also simplify the caller code. Looking at a couple other callers, we seem to be missing some optimizations by not asking both.

If we pushed that down all the way to the loop guard processing, that could be a substantial compile time win. (As we'd walk the CFG once, not twice, in the case where we can't prove anything.)
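
A rough sketch of what such a tri-state query could look like, under heavy simplification. This is not the ScalarEvolution API: `Value`, `Guard`, `evaluateLessByRanges`, `evaluateLessByGuards`, and `evaluateLessAt` are hypothetical, values are modeled as inclusive integer ranges, and the only predicate is signed `A < B`. The `Walks` counter stands in for the cost of walking the CFG.

```cpp
#include <cassert>
#include <optional>
#include <vector>

// Hypothetical stand-in for a value: an id plus an inclusive range [Lo, Hi].
struct Value { int Id; long Lo; long Hi; };

// Hypothetical guard fact: value Lhs is known to be less than value Rhs
// at the context instruction.
struct Guard { int Lhs; int Rhs; };

// Cheap technique: try to decide A < B from the ranges alone, in both
// directions. Returns std::nullopt when the ranges overlap.
std::optional<bool> evaluateLessByRanges(const Value &A, const Value &B) {
  if (A.Hi < B.Lo)
    return true;  // every A is below every B
  if (A.Lo >= B.Hi)
    return false; // every A is at least every B
  return std::nullopt;
}

// Expensive technique: a single walk over the guards that looks for a
// proof of either direction, instead of one walk per direction.
std::optional<bool> evaluateLessByGuards(const Value &A, const Value &B,
                                         const std::vector<Guard> &Guards,
                                         int &Walks) {
  ++Walks;
  for (const Guard &G : Guards) {
    if (G.Lhs == A.Id && G.Rhs == B.Id)
      return true;  // guard states A < B
    if (G.Lhs == B.Id && G.Rhs == A.Id)
      return false; // guard states B < A, hence !(A < B)
  }
  return std::nullopt;
}

// Cheap checks first, expensive ones only if the cheap ones are inconclusive.
std::optional<bool> evaluateLessAt(const Value &A, const Value &B,
                                   const std::vector<Guard> &Guards,
                                   int &Walks) {
  if (std::optional<bool> Cheap = evaluateLessByRanges(A, B))
    return Cheap;
  return evaluateLessByGuards(A, B, Guards, Walks);
}
```

A caller would then branch on whether the result has a value and use that value directly as the constant, rather than asking "is it true?" and "is it false?" as two separate queries.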

In D98697#2630151, @reames wrote:

For the compile time concern, one idea you could consider exploring would be to have a version of isKnownPredicateAt which returns an Optional<bool>. Such a routine could use the cheap proof techniques first (in both directions), and then resort to the more expensive ones. In this particular case, it would also simplify the caller code. Looking at a couple other callers, we seem to be missing some optimizations by not asking both.

If we pushed that down all the way to the loop guard processing, that could be a substantial compile time win. (As we'd walk the CFG once, not twice, in the case where we can't prove anything.)

This is a really good way to improve this, thank you for bringing this up. I'll consider this.

mkazantsev added a comment.

Rebased on top of new API for predicate evaluation. Now the order of checks should be from cheap to expensive. Hope this is more CT-friendly.

Harbormaster completed remote builds in B94441: Diff 331533. Mar 18 2021, 6:32 AM

Looks good to me.

In D98697#2634349, @mkazantsev wrote:

Rebased on top of new API for predicate evaluation. Now the order of checks should be from cheap to expensive. Hope this is more CT-friendly.

It didn't really help; it is still within the same ballpark, +/- noise, I would say:
https://llvm-compile-time-tracker.com/compare.php?from=9575c48b8959dae3c3e39e0227435ae6ebd71443&to=c0e80b44e3c80fa9f344b14cbb8435c15f75f94e&stat=instructions
vs
https://llvm-compile-time-tracker.com/compare.php?from=fb04227c7cc3c5ebb3ece90f8ec5c629920eb3af&to=71476f7052b97e7ba9f91624c7fa3d3cbd418aa6&stat=instructions

As discussed in previous SCEV changes, that is probably not a blocker here, although it is not great.

This revision is now accepted and ready to land. Mar 18 2021, 1:02 PM

Well, we clearly can produce better code here. Let's hope it's worth it. I'll spend some time trying to understand how we can make SCEV cheaper.

This revision was landed with ongoing or failed builds. Mar 18 2021, 10:28 PM

Closed by commit rG16370e02a715: [IndVars] Provide eliminateIVComparison with context (authored by mkazantsev).

This revision was automatically updated to reflect the committed changes.

mkazantsev added a commit: rG16370e02a715: [IndVars] Provide eliminateIVComparison with context.

I wrote
https://bugs.llvm.org/show_bug.cgi?id=52327
about a crash starting to happen with this patch.

Revision Contents

Path	Size
llvm/lib/Transforms/Utils/SimplifyIndVar.cpp	7 lines
llvm/test/Transforms/IndVarSimplify/X86/pr35406.ll	4 lines
llvm/test/Transforms/IndVarSimplify/checks_against_min_value.ll	6 lines
llvm/test/Transforms/IndVarSimplify/eliminate-comparison.ll	12 lines
llvm/test/Transforms/LoopLoadElim/pr-49141.ll	8 lines

Diff 331764

llvm/lib/Transforms/Utils/SimplifyIndVar.cpp

[... 255 lines not shown ...] void SimplifyIndvar::eliminateIVComparison(ICmpInst *ICmp, Value *IVOperand) {
    }

    // Get the SCEVs for the ICmp operands (in the specific context of the
    // current loop)
    const Loop *ICmpLoop = LI->getLoopFor(ICmp->getParent());
    const SCEV *S = SE->getSCEVAtScope(ICmp->getOperand(IVOperIdx), ICmpLoop);
    const SCEV *X = SE->getSCEVAtScope(ICmp->getOperand(1 - IVOperIdx), ICmpLoop);

-   // If the condition is always true or always false, replace it with
-   // a constant value.
-   if (auto Ev = SE->evaluatePredicate(Pred, S, X)) {
+   // If the condition is always true or always false in the given context,
+   // replace it with a constant value.
+   // TODO: We can sharpen the context to common dominator of all ICmp's users.
+   if (auto Ev = SE->evaluatePredicateAt(Pred, S, X, ICmp)) {
      ICmp->replaceAllUsesWith(ConstantInt::getBool(ICmp->getContext(), *Ev));
      DeadInsts.emplace_back(ICmp);
      LLVM_DEBUG(dbgs() << "INDVARS: Eliminated comparison: " << *ICmp << '\n');
    } else if (makeIVComparisonInvariant(ICmp, IVOperand)) {
      // fallthrough to end of function
    } else if (ICmpInst::isSigned(OriginalPred) &&
               SE->isKnownNonNegative(S) && SE->isKnownNonNegative(X)) {
      // If we were unable to make anything above, all we can is to canonicalize
[... 1,811 lines not shown ...]

lebedev.ri (inline comment, not done), at the `// If the condition is always true or always false in the given context,` line:

  // If the condition is always true or always false, in the given context,
- // replace it witha constant value.
+ // replace it with a constant value.
  // TODO: We can sharpen the context to common dominator of all ICmp's users.

lebedev.ri (inline comment, not done), at the `// TODO: We can sharpen the context to common dominator of all ICmp's users.` line: It's usually called `CtxI`

llvm/test/Transforms/IndVarSimplify/X86/pr35406.ll

[... 17 lines not shown ...]
  ; CHECK-NEXT: [[TMP1:%.*]] = udiv i64 60392, [[TMP0]]
  ; CHECK-NEXT: br label [[LOOP2:%.*]]
  ; CHECK: loop2:
  ; CHECK-NEXT: [[INDVARS_IV1:%.*]] = phi i64 [ [[TMP1]], [[LOOP2_PREHEADER]] ], [ [[INDVARS_IV_NEXT2:%.*]], [[LOOP2]] ]
  ; CHECK-NEXT: [[INDVARS_IV_NEXT2]] = add nuw nsw i64 [[INDVARS_IV1]], -1
  ; CHECK-NEXT: [[I4:%.*]] = load atomic i64, i64* [[P1:%.*]] unordered, align 8
  ; CHECK-NEXT: [[I6:%.*]] = sub i64 [[I4]], [[INDVARS_IV_NEXT2]]
  ; CHECK-NEXT: store atomic i64 [[I6]], i64* [[P1]] unordered, align 8
- ; CHECK-NEXT: br i1 false, label [[LOOP2_EXIT_LOOPEXIT:%.*]], label [[LOOP2]]
+ ; CHECK-NEXT: br i1 true, label [[LOOP2_EXIT_LOOPEXIT:%.*]], label [[LOOP2]]
  ; CHECK: loop2.exit.loopexit:
  ; CHECK-NEXT: br label [[LOOP2_EXIT]]
  ; CHECK: loop2.exit:
  ; CHECK-NEXT: [[INDVARS_IV_NEXT]] = add nuw nsw i64 [[INDVARS_IV]], 1
  ; CHECK-NEXT: br i1 false, label [[EXIT]], label [[LOOP1]]
  ; CHECK: exit:
  ; CHECK-NEXT: ret i32 0
  ;
[... 54 lines not shown ...]
  ; CHECK-NEXT: [[TMP5:%.*]] = add nsw i64 [[TMP4]], 60392
  ; CHECK-NEXT: br label [[LOOP2:%.*]]
  ; CHECK: loop2:
  ; CHECK-NEXT: [[INDVARS_IV:%.*]] = phi i64 [ [[TMP5]], [[LOOP2_PREHEADER]] ], [ [[INDVARS_IV_NEXT:%.*]], [[LOOP2]] ]
  ; CHECK-NEXT: [[INDVARS_IV_NEXT]] = add nsw i64 [[INDVARS_IV]], -1
  ; CHECK-NEXT: [[I4:%.*]] = load atomic i64, i64* [[P1:%.*]] unordered, align 8
  ; CHECK-NEXT: [[I6:%.*]] = sub i64 [[I4]], [[INDVARS_IV_NEXT]]
  ; CHECK-NEXT: store atomic i64 [[I6]], i64* [[P1]] unordered, align 8
- ; CHECK-NEXT: br i1 false, label [[LOOP2_EXIT_LOOPEXIT:%.*]], label [[LOOP2]]
+ ; CHECK-NEXT: br i1 true, label [[LOOP2_EXIT_LOOPEXIT:%.*]], label [[LOOP2]]
  ; CHECK: loop2.exit.loopexit:
  ; CHECK-NEXT: br label [[LOOP2_EXIT]]
  ; CHECK: loop2.exit:
  ; CHECK-NEXT: [[I9]] = add nuw nsw i32 [[LOCAL_0_]], 1
  ; CHECK-NEXT: br i1 false, label [[EXIT]], label [[LOOP1]]
  ; CHECK: exit:
  ; CHECK-NEXT: ret i32 0
  ;
[... 37 lines not shown ...]

llvm/test/Transforms/IndVarSimplify/checks_against_min_value.ll

[... 9 lines not shown ...]
  ; CHECK: loop.preheader:
  ; CHECK-NEXT: br label [[LOOP:%.*]]
  ; CHECK: loop:
  ; CHECK-NEXT: [[IV:%.*]] = phi i32 [ [[IV_NEXT:%.*]], [[GUARDED:%.*]] ], [ [[START]], [[LOOP_PREHEADER]] ]
  ; CHECK-NEXT: [[IV_NEXT]] = add i32 [[IV]], -1
  ; CHECK-NEXT: [[CHECK:%.*]] = icmp slt i32 [[IV_NEXT]], [[IV]]
  ; CHECK-NEXT: br i1 [[CHECK]], label [[GUARDED]], label [[FAIL:%.*]]
  ; CHECK: guarded:
- ; CHECK-NEXT: [[LOOP_COND:%.*]] = icmp ne i32 [[IV]], -2147483648
- ; CHECK-NEXT: br i1 [[LOOP_COND]], label [[LOOP]], label [[EXIT_LOOPEXIT:%.*]]
+ ; CHECK-NEXT: br i1 true, label [[LOOP]], label [[EXIT_LOOPEXIT:%.*]]
  ; CHECK: exit.loopexit:
  ; CHECK-NEXT: br label [[EXIT]]
  ; CHECK: exit:
  ; CHECK-NEXT: ret void
  ; CHECK: fail:
  ; CHECK-NEXT: unreachable
  ;
  entry:
[... 25 lines not shown ...]
  ; CHECK: loop.preheader:
  ; CHECK-NEXT: br label [[LOOP:%.*]]
  ; CHECK: loop:
  ; CHECK-NEXT: [[IV:%.*]] = phi i32 [ [[IV_NEXT:%.*]], [[GUARDED:%.*]] ], [ [[START]], [[LOOP_PREHEADER]] ]
  ; CHECK-NEXT: [[IV_NEXT]] = add i32 [[IV]], -1
  ; CHECK-NEXT: [[CHECK:%.*]] = icmp ult i32 [[IV_NEXT]], [[IV]]
  ; CHECK-NEXT: br i1 [[CHECK]], label [[GUARDED]], label [[FAIL:%.*]]
  ; CHECK: guarded:
- ; CHECK-NEXT: [[LOOP_COND:%.*]] = icmp ne i32 [[IV]], 0
- ; CHECK-NEXT: br i1 [[LOOP_COND]], label [[LOOP]], label [[EXIT_LOOPEXIT:%.*]]
+ ; CHECK-NEXT: br i1 true, label [[LOOP]], label [[EXIT_LOOPEXIT:%.*]]
  ; CHECK: exit.loopexit:
  ; CHECK-NEXT: br label [[EXIT]]
  ; CHECK: exit:
  ; CHECK-NEXT: ret void
  ; CHECK: fail:
  ; CHECK-NEXT: unreachable
  ;
  entry:
[... 19 lines not shown ...]

llvm/test/Transforms/IndVarSimplify/eliminate-comparison.ll

[... 813 lines not shown ...]
  leave:
    ret void
  }

  ; This checks that the backedge condition, (I + 1) < Length - 1 implies
  ; (I + 1) < Length
  define void @func_22(i32* %length.ptr) {
  ; CHECK-LABEL: @func_22(
  ; CHECK-NEXT: entry:
  ; CHECK-NEXT: [[LENGTH:%.*]] = load i32, i32* [[LENGTH_PTR:%.*]], align 4, !range [[RNG0]]

lebedev.ri (inline comment, not done): Please precommit, it's a bit noisy.

  ; CHECK-NEXT: [[ENTRY_COND:%.*]] = icmp sgt i32 [[LENGTH]], 1
  ; CHECK-NEXT: br i1 [[ENTRY_COND]], label [[LOOP_PREHEADER:%.*]], label [[LEAVE:%.*]]
  ; CHECK: loop.preheader:
  ; CHECK-NEXT: br label [[LOOP:%.*]]
  ; CHECK: loop:
  ; CHECK-NEXT: [[IV:%.*]] = phi i32 [ [[IV_INC:%.*]], [[BE:%.*]] ], [ 0, [[LOOP_PREHEADER]] ]
  ; CHECK-NEXT: [[IV_INC]] = add nuw nsw i32 [[IV]], 1
  ; CHECK-NEXT: br i1 true, label [[BE]], label [[LEAVE_LOOPEXIT:%.*]]
[... 115 lines not shown ...]
  ; CHECK-LABEL: @func_25(
  ; CHECK-NEXT: entry:
  ; CHECK-NEXT: br label [[LOOP:%.*]]
  ; CHECK: loop:
  ; CHECK-NEXT: [[IV:%.*]] = phi i32 [ [[START:%.*]], [[ENTRY:%.*]] ], [ [[IV_NEXT:%.*]], [[BACKEDGE:%.*]] ]
  ; CHECK-NEXT: [[C1:%.*]] = icmp ne i32 [[IV]], 0
  ; CHECK-NEXT: br i1 [[C1]], label [[CHECKED_1:%.*]], label [[FAIL:%.*]]
  ; CHECK: checked.1:
- ; CHECK-NEXT: [[C2:%.*]] = icmp ne i32 [[IV]], 0
- ; CHECK-NEXT: br i1 [[C2]], label [[CHECKED_2:%.*]], label [[FAIL]]
+ ; CHECK-NEXT: br i1 true, label [[CHECKED_2:%.*]], label [[FAIL]]
  ; CHECK: checked.2:
- ; CHECK-NEXT: [[C3:%.*]] = icmp ne i32 [[IV]], 0
- ; CHECK-NEXT: br i1 [[C3]], label [[BACKEDGE]], label [[FAIL]]
+ ; CHECK-NEXT: br i1 true, label [[BACKEDGE]], label [[FAIL]]
  ; CHECK: backedge:
  ; CHECK-NEXT: [[IV_NEXT]] = add i32 [[IV]], 758394
  ; CHECK-NEXT: [[LOOP_COND:%.*]] = call i1 @cond_func()
  ; CHECK-NEXT: br i1 [[LOOP_COND]], label [[LOOP]], label [[EXIT:%.*]]
  ; CHECK: fail:
  ; CHECK-NEXT: unreachable
  ; CHECK: exit:
  ; CHECK-NEXT: [[IV_LCSSA1:%.*]] = phi i32 [ [[IV]], [[BACKEDGE]] ]
[... 31 lines not shown ...]
  ; CHECK-LABEL: @func_26(
  ; CHECK-NEXT: entry:
  ; CHECK-NEXT: br label [[LOOP:%.*]]
  ; CHECK: loop:
  ; CHECK-NEXT: [[IV:%.*]] = phi i32 [ [[START:%.*]], [[ENTRY:%.*]] ], [ [[IV_NEXT:%.*]], [[BACKEDGE:%.*]] ]
  ; CHECK-NEXT: [[C1:%.*]] = icmp slt i32 [[IV]], 0
  ; CHECK-NEXT: br i1 [[C1]], label [[CHECKED_1:%.*]], label [[FAIL:%.*]]
  ; CHECK: checked.1:
- ; CHECK-NEXT: [[C2:%.*]] = icmp slt i32 [[IV]], 1
- ; CHECK-NEXT: br i1 [[C2]], label [[CHECKED_2:%.*]], label [[FAIL]]
+ ; CHECK-NEXT: br i1 true, label [[CHECKED_2:%.*]], label [[FAIL]]
  ; CHECK: checked.2:
- ; CHECK-NEXT: [[C3:%.*]] = icmp slt i32 [[IV]], 2
- ; CHECK-NEXT: br i1 [[C3]], label [[BACKEDGE]], label [[FAIL]]
+ ; CHECK-NEXT: br i1 true, label [[BACKEDGE]], label [[FAIL]]
  ; CHECK: backedge:
  ; CHECK-NEXT: [[IV_NEXT]] = add i32 [[IV]], 758394
  ; CHECK-NEXT: [[LOOP_COND:%.*]] = call i1 @cond_func()
  ; CHECK-NEXT: br i1 [[LOOP_COND]], label [[LOOP]], label [[EXIT:%.*]]
  ; CHECK: fail:
  ; CHECK-NEXT: unreachable
  ; CHECK: exit:
  ; CHECK-NEXT: [[IV_LCSSA1:%.*]] = phi i32 [ [[IV]], [[BACKEDGE]] ]
[... 32 lines not shown ...]

llvm/test/Transforms/LoopLoadElim/pr-49141.ll

  ; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
  ; RUN: opt -S -indvars -loop-load-elim < %s 2>&1 | FileCheck %s

  target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128-ni:1-p2:32:8:8:32-ni:2"
  target triple = "x86_64-unknown-linux-gnu"

  define void @test() {
  ; CHECK-LABEL: @test(
  ; CHECK-NEXT: entry:
  ; CHECK-NEXT: br label [[FOR_BODY:%.*]]
+ ; CHECK: for.cond:
+ ; CHECK-NEXT: br i1 true, label [[FOR_BODY]], label [[FOR_END:%.*]]
  ; CHECK: for.body:
- ; CHECK-NEXT: [[A_01:%.*]] = phi i16 [ undef, [[ENTRY:%.*]] ], [ [[INC:%.*]], [[FOR_BODY]] ]
- ; CHECK-NEXT: [[INC]] = add nsw i16 [[A_01]], 1
- ; CHECK-NEXT: [[CMP:%.*]] = icmp sle i16 [[INC]], 2
- ; CHECK-NEXT: [[OR_COND:%.*]] = and i1 false, [[CMP]]
- ; CHECK-NEXT: br i1 [[OR_COND]], label [[FOR_BODY]], label [[FOR_END:%.*]]
+ ; CHECK-NEXT: br i1 false, label [[FOR_COND:%.*]], label [[FOR_END]]
  ; CHECK: for.end:
  ; CHECK-NEXT: ret void
  ;
  entry:
    br label %for.body

  for.cond: ; preds = %for.body
    %a.0 = phi i16 [ %inc, %for.body ]
[... 11 lines not shown ...]