This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Analysis/
-
Analysis/
3
ScalarEvolution.cpp
-
test/Analysis/ScalarEvolution/
-
Analysis/
-
ScalarEvolution/
-
nuw-add-nested-loops.ll
-
nuw-add-sibling-loops.ll

Differential D106852

[SCEV] Fix getAddExpr for adding loop invariants into start of some AddRec
AbandonedPublic

Authored by skatkov on Jul 27 2021, 12:29 AM.

Download Raw Diff

Details

Reviewers

reames
nikic
efriedma
fhahn
mkazantsev

Summary

getAddExpr utility uses computed flags from AddRec plus loop invariants
in AddRec.Start + loop invariants basing on the an assumption that
0th iteration of the loop exists. However the loop might be dead
(runtime or compile time), in this case the propagation of the flag becomes
invalid.

To fix the bug we need to ensure that 0th iteration happens or use less
strict set of flags.
The patch uses the check for dominating all latches from used loops to ensure
that 0th iteration exists.

Diff Detail

Event Timeline

skatkov created this revision.Jul 27 2021, 12:29 AM

Herald added a subscriber: hiraditya. · View Herald TranscriptJul 27 2021, 12:29 AM

skatkov requested review of this revision.Jul 27 2021, 12:29 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 27 2021, 12:29 AM

skatkov added a parent revision: D106851: [SCEV] Add two tests showing the bug in SCEV getAddExpr.Jul 27 2021, 12:29 AM

Harbormaster completed remote builds in B116357: Diff 361923.Jul 27 2021, 1:18 AM

When you're checking for is0thIterationGuaranteed, do you also need to check for abnormal exits from the loop?

What happens if you just assume is0thIterationGuaranteed is always false? I'm not sure all this work to try to prove the flags is worth doing.

In D106852#2907747, @efriedma wrote:

When you're checking for is0thIterationGuaranteed, do you also need to check for abnormal exits from the loop?

This is a good question. If I understand your question correctly you are asking about abnormal exit in outer loop,
before entering the current loop. Probably it is a problem.

What happens if you just assume is0thIterationGuaranteed is always false? I'm not sure all this work to try to prove the flags is worth doing.

At least no-one LLVM test is failing. Do you propose to just always say that we cannot guarantee 0th iteration?
I'll update the patch. If it will cause any performance issue we can return to discussion how to detect that 0th iteration is possible.

In D106852#2909261, @skatkov wrote:

In D106852#2907747, @efriedma wrote:

What happens if you just assume is0thIterationGuaranteed is always false? I'm not sure all this work to try to prove the flags is worth doing.

At least no-one LLVM test is failing. Do you propose to just always say that we cannot guarantee 0th iteration?
I'll update the patch. If it will cause any performance issue we can return to discussion how to detect that 0th iteration is possible.

ok I was too optimistic, I've got 15 failures which will look into

Failed Tests (15):
  LLVM :: Analysis/LoopAccessAnalysis/number-of-memchecks.ll
  LLVM :: Analysis/LoopAccessAnalysis/reverse-memcheck-bounds.ll
  LLVM :: Analysis/ScalarEvolution/flags-from-poison.ll
  LLVM :: Analysis/ScalarEvolution/incorrect-exit-count.ll
  LLVM :: Analysis/ScalarEvolution/max-backedge-taken-count-guard-info.ll
  LLVM :: Analysis/ScalarEvolution/no-wrap-symbolic-becount.ll
  LLVM :: Analysis/ScalarEvolution/nsw-offset-assume.ll
  LLVM :: Analysis/ScalarEvolution/nsw-offset.ll
  LLVM :: Analysis/ScalarEvolution/nsw.ll
  LLVM :: Analysis/ScalarEvolution/ptrtoint.ll
  LLVM :: Analysis/ScalarEvolution/range_nw_flag.ll
  LLVM :: CodeGen/ARM/ParallelDSP/pr42729.ll
  LLVM :: Transforms/LoopIdiom/basic.ll
  LLVM :: Transforms/LoopStrengthReduce/X86/expander-crashes.ll
  LLVM :: Transforms/LoopVectorize/runtime-check-pointer-element-type.ll

Ok, it happened due to I disabled also the case when Start + Invariants has no AddRecs inside.

please take a look.

Harbormaster completed remote builds in B116634: Diff 362304.Jul 28 2021, 2:20 AM

mkazantsev added a comment.Jul 30 2021, 3:13 AM

This comment was removed by mkazantsev.

efriedma added inline comments.Jul 30 2021, 12:28 PM

llvm/lib/Analysis/ScalarEvolution.cpp
2788	I'm not sure I see the connection between UsedLoops.size() and the treatment of nowrap flags.

I think that the first version of this patch was better. The only problem that was confusing is naming. In fact, is0thIterationGuaranteed should be is0thIterationDominatedByThis. It is totally fine if the 0th iteration of outer loop does not happen (because of abnormal exit or whatsoever). The important bit that it cannot happen withou execution of the given Add. And therefore the no-wrap flags on the addrec may be inferred from the fact that this add has executed.

I suggest returning the initial solution with this renaming.

@efriedma WDYT?

A SCEV add being marked nsw means, essentially, that at any point in the IR where all the operands are defined, the add doesn't wrap.

The problem here is that we're splitting the add into pieces. a+b+1 being nsw doesn't necessarily imply that a+1 is nsw in all contexts; the nsw only applies in contexts where b is defined. This is a little subtle, but it's the only interpretation that's consistent with some of the ways we try to prove nsw flags.

(The definition point for a SCEV is either the point of definition for a SCEVUnknown, or the beginning of the loop header of an AddRec.)

But we can extend the logic a little. If a+1 is nsw in contexts where b is defined, it's also nsw in any context that must eventually flow to the definition of b. If the definition of a is such a context, a+1 is always nsw.

So, for example, if "a" is an addrec for a loop, and "b" is an addrec for a loop nested inside the first loop, and the control flow is simple, nsw on "a+b+1" implies nsw for "a+1". Abnormal exits break this sort of proof, though: we assumed control must eventually flow to b's loop header.

Also, this is possibly an argument for changing the way nsw flags work. The amount of work we do to try to translate simple IR markings in SCEV is getting a bit crazy.

The problem here is that we're splitting the add into pieces. a+b+1 being nsw doesn't necessarily imply that a+1 is nsw in all contexts; the nsw only applies in contexts where b is defined. This is a little subtle, but it's the only interpretation that's consistent with some of the ways we try to prove nsw flags.

I think this is a self-contradictory interpretation. What if we instead computed a+b+1-b <nsw> and then decided to simplify b away, getting a+1<nsw> and no hint that it's only for b's context?

In D106852#2924428, @mkazantsev wrote:

The problem here is that we're splitting the add into pieces. a+b+1 being nsw doesn't necessarily imply that a+1 is nsw in all contexts; the nsw only applies in contexts where b is defined. This is a little subtle, but it's the only interpretation that's consistent with some of the ways we try to prove nsw flags.

I think this is a self-contradictory interpretation. What if we instead computed a+b+1-b <nsw> and then decided to simplify b away, getting a+1<nsw>

a+1 wouldn't be nsw? And in fact, that's what getAddExpr currently does. I don't see how that's a contradiction.

If we say that a+b+1-b <nsw> implies a+1<nsw>, and similarly say ({a,+,1} + b)<nsw> implies (a+b)<nsw>, that leads to a consistent system, I think. But that would imply the bug here isn't in getAddExpr at all; instead, getNoWrapFlagsFromUB() is fundamentally broken.

a+1 wouldn't be nsw? And in fact, that's what getAddExpr currently does. I don't see how that's a contradiction.

Imagine that the point where b is defined is also guarded by other conditions. In particular, this very place can be guarded by a != SINT_MAX. We could put the <nsw> only because we took all these guards into consideration. In this case, a+1+b-b<nsw> implies a+1<nsw> *only at the point where b is defined* and nowhere else.

So whenever b exists, it is legal to say a+1+b-b is nsw because of facts unrelated to value of b, but relevant to the specific point in code.

But there is no way to specify this, right?

In D106852#2927439, @mkazantsev wrote:

But there is no way to specify this, right?

Exactly; we don't have any place to store nsw markings that only apply to a specific region of the function. This is also the reason that D106331 is so awkward.

Spent some time trying to get my head around this review, and how we generally handle flags on SCEV objects used in multiple contexts. https://github.com/preames/public-notes/blob/master/llvm-loop-opt-ideas.rst#scev-wrap-flags

My conclusion so far is that this patch is not a complete fix. Or at least, I found an analogous bit of code which includes several preconditions this code does not.

I'll also note that I'm not yet at a point of really having a feeling for what path forward we should take. The way flags are handled currently appears to be largely the inverse of how flags in IR CSE are handled. I'm still wrapping my head around the implications of a localized fix (how much opt quality do we loose?) and deeper changes to the representation of flags (how scary is it).

Just a side note (maybe not directly related to this one): the way how flags in SCEV are designed now (effectively set *after* SCEV construction is finished and later mutated) has been a source of subtleties and tricky bugs for a good while. I know it's hard, but maybe at some point we should just stop trying to do what we are doing now, and make SCEVs truly immutable. This will, in particular, prevent us from updating of flags of outer loop and fix this bug along with many other bugs of this variety.

This complexity just doesn't seem worth sustaining.

Spent some more time thinking about this, and updated my running summary (https://github.com/preames/public-notes/blob/master/llvm-loop-opt-ideas.rst#scev-wrap-flags) with my thinking on how to fix the root problem.

However, I do want to explicitly note that I'm open to incrementalism here. This *particular* patch is addressing a *particular* instance of our flags problem. I am 100% open to fixing this issue in isolation. (See inline comments)

llvm/lib/Analysis/ScalarEvolution.cpp
2778	This comment is subtle, and almost correct, but not quite. We'd have to prove both that the loop loop is always taken, and that the values are defined in the function scope. (a.g. a GEP off a global pointer would require a non-function scope)
2788	As with Eli, I don't see why the loops used is relevant. Take a look at the interesting example from my writeup, and I think you'll see this check is irrelevant. I'm guessing that this was an attempt to prevent the need for widespread test changes. Can you confirm that and give a feel for how bad they are? (Autoupdate is your friend...) The place I expect this to matter the most is in trip count computation. Given that, I'd be really curious to see if rebasing this over Eli's D106331 helps reduce that test diff. That change should get some of the context sensitivity lost here back, and might very well cut down on the impact.

I wrote up what I think the current semantics are supposed to be in https://reviews.llvm.org/D109553. Assuming we agree, I'd like to land that, and then fix this piece of code accordingly.

I still think we probably want different semantics, but getting to my desired semantics from our current ones looks quite involved, and I think we need to start with getting to *any* consistent model.

reames mentioned this in D106331: [ScalarEvolution] Try harder to prove overflow in howManyLessThans..Sep 10 2021, 11:18 AM

reames mentioned this in D109845: [SCEV] Correctly propagate nowrap flags across scopes when folding invariant add through addrec.Sep 15 2021, 12:26 PM

reames mentioned this in rG248e430f37c8: precommit test for D109845/D106852.Sep 15 2021, 12:54 PM

The semantics proposed in D109553 have been approved and landed. I have a review (D109845) which addresses the same issue as this one, but is based on reasoning in line with the new semantics.

Abandon in favor of D109845

reames mentioned this in rGf39978b84f1d: [SCEV] Correctly propagate nowrap flags across scopes when folding invariant….Oct 3 2021, 3:23 PM

Revision Contents

Path

Size

llvm/

lib/

Analysis/

ScalarEvolution.cpp

18 lines

test/

Analysis/

ScalarEvolution/

nuw-add-nested-loops.ll

24 lines

nuw-add-sibling-loops.ll

4 lines

Diff 362304

llvm/lib/Analysis/ScalarEvolution.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,769 Lines • ▼ Show 20 Lines	if (!LIOps.empty()) {
LIOps.push_back(AddRec);		LIOps.push_back(AddRec);
SCEV::NoWrapFlags Flags = ComputeFlags(LIOps);		SCEV::NoWrapFlags Flags = ComputeFlags(LIOps);
LIOps.pop_back();		LIOps.pop_back();

// NLI + LI + {Start,+,Step} --> NLI + {LI+Start,+,Step}		// NLI + LI + {Start,+,Step} --> NLI + {LI+Start,+,Step}
LIOps.push_back(AddRec->getStart());		LIOps.push_back(AddRec->getStart());

SmallVector<const SCEV *, 4> AddRecOps(AddRec->operands());		SmallVector<const SCEV *, 4> AddRecOps(AddRec->operands());
// This follows from the fact that the no-wrap flags on the outer add		// TODO: If we could prove that the 0th iteration of a loop is guaranteed
		reamesUnsubmitted Not Done Reply Inline Actions This comment is subtle, and almost correct, but not quite. We'd have to prove both that the loop loop is always taken, and that the values are defined in the function scope. (a.g. a GEP off a global pointer would require a non-function scope) reames: This comment is subtle, and almost correct, but not quite. We'd have to prove both that the…
// expression are applicable on the 0th iteration, when the add recurrence		// we could use inferred flags. This follows from the fact that
// will be equal to its start value.		// the no-wrap flags on the outer add expression are applicable on
AddRecOps[0] = getAddExpr(LIOps, Flags, Depth + 1);		// the 0th iteration, when the add recurrence will be equal to
		// its start value. If there is no guarantee for 0th iteration,
		// try our best for infer flags.
		SmallPtrSet<const Loop *, 4> UsedLoops;
		for (auto Op : LIOps)
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: 'auto Op' can be declared as 'const auto Op' [llvm-qualified-auto] not useful Lint: Pre-merge checks:* clang-tidy: warning: 'auto Op' can be declared as 'const auto *Op' [llvm-qualified-auto]…
		getUsedLoops(Op, UsedLoops);
		SCEV::NoWrapFlags FlagsForNewStart = Flags;
		if (UsedLoops.size())
		efriedmaUnsubmitted Not Done Reply Inline Actions I'm not sure I see the connection between UsedLoops.size() and the treatment of nowrap flags. efriedma: I'm not sure I see the connection between UsedLoops.size() and the treatment of nowrap flags.
		reamesUnsubmitted Not Done Reply Inline Actions As with Eli, I don't see why the loops used is relevant. Take a look at the interesting example from my writeup, and I think you'll see this check is irrelevant. I'm guessing that this was an attempt to prevent the need for widespread test changes. Can you confirm that and give a feel for how bad they are? (Autoupdate is your friend...) The place I expect this to matter the most is in trip count computation. Given that, I'd be really curious to see if rebasing this over Eli's D106331 helps reduce that test diff. That change should get some of the context sensitivity lost here back, and might very well cut down on the impact. reames: As with Eli, I don't see why the loops used is relevant. Take a look at the interesting…
		FlagsForNewStart = StrengthenNoWrapFlags(
		this, scAddExpr, LIOps, SCEV::NoWrapFlags::FlagAnyWrap);
		AddRecOps[0] = getAddExpr(LIOps, FlagsForNewStart, Depth + 1);

// Build the new addrec. Propagate the NUW and NSW flags if both the		// Build the new addrec. Propagate the NUW and NSW flags if both the
// outer add and the inner addrec are guaranteed to have no overflow.		// outer add and the inner addrec are guaranteed to have no overflow.
// Always propagate NW.		// Always propagate NW.
Flags = AddRec->getNoWrapFlags(setFlags(Flags, SCEV::FlagNW));		Flags = AddRec->getNoWrapFlags(setFlags(Flags, SCEV::FlagNW));
const SCEV *NewRec = getAddRecExpr(AddRecOps, AddRecLoop, Flags);		const SCEV *NewRec = getAddRecExpr(AddRecOps, AddRecLoop, Flags);

// If all of the other operands were loop invariant, we are done.		// If all of the other operands were loop invariant, we are done.
▲ Show 20 Lines • Show All 11,304 Lines • Show Last 20 Lines

llvm/test/Analysis/ScalarEvolution/nuw-add-nested-loops.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --force-update			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --force-update
	; RUN: opt < %s -indvars -S \| FileCheck %s			; RUN: opt < %s -indvars -S \| FileCheck %s
	; RUN: opt -S -disable-output "-passes=print<scalar-evolution>" < %s 2>&1 \| FileCheck %s --check-prefix=CHECK-ANALYSIS			; RUN: opt -S -disable-output "-passes=print<scalar-evolution>" < %s 2>&1 \| FileCheck %s --check-prefix=CHECK-ANALYSIS

	; At the time test is written it shows that SCEV incorrectly computes flags			; At the time test is written it shows that SCEV incorrectly computes flags
	; for expression corresponding to outer_header loop and indvars does an			; for expression corresponding to outer_header loop and indvars does an
	; incorrect transformation.			; incorrect transformation.

	; CHECK-ANALYSIS: %r.ivi.next = add nuw nsw i32 %r.ivi, 1			; CHECK-ANALYSIS: %r.ivi.next = add nuw nsw i32 %r.ivi, 1
	; CHECK-ANALYSIS-NEXT: --> {{{{}}-399,+,1}<nuw><nsw><%outer_header>,+,1}<nuw><nsw><%right_header>			; CHECK-ANALYSIS-NEXT: --> {{{{}}-399,+,1}<nsw><%outer_header>,+,1}<nuw><nsw><%right_header>
	; CHECK-ANALYSIS: %l.ivi.next = add nsw i32 %l.ivi, 1			; CHECK-ANALYSIS: %l.ivi.next = add nsw i32 %l.ivi, 1
	; CHECK-ANALYSIS-NEXT: --> {{{{}}-399,+,1}<nuw><nsw><%outer_header>,+,1}<nsw><%left_header>			; CHECK-ANALYSIS-NEXT: --> {{{{}}-399,+,1}<nsw><%outer_header>,+,1}<nsw><%left_header>
	define void @test(i1 %c) {			define void @test(i1 %c) {
	; CHECK-LABEL: @test(			; CHECK-LABEL: @test(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[OUTER_HEADER:%.*]]			; CHECK-NEXT: br label [[OUTER_HEADER:%.*]]
	; CHECK: outer_header:			; CHECK: outer_header:
	; CHECK-NEXT: [[IVO:%.]] = phi i32 [ -400, [[ENTRY:%.]] ], [ [[IVO_NEXT:%.]], [[OUTER_BACKEDGE:%.]] ]			; CHECK-NEXT: [[INDVARS_IV4:%.]] = phi i32 [ [[INDVARS_IV_NEXT5:%.]], [[OUTER_BACKEDGE:%.]] ], [ 402, [[ENTRY:%.]] ]
				; CHECK-NEXT: [[INDVARS_IV2:%.]] = phi i32 [ [[INDVARS_IV_NEXT3:%.]], [[OUTER_BACKEDGE]] ], [ 399, [[ENTRY]] ]
				; CHECK-NEXT: [[INDVARS_IV:%.]] = phi i32 [ [[INDVARS_IV_NEXT:%.]], [[OUTER_BACKEDGE]] ], [ -399, [[ENTRY]] ]
				; CHECK-NEXT: [[IVO:%.]] = phi i32 [ -400, [[ENTRY]] ], [ [[IVO_NEXT:%.]], [[OUTER_BACKEDGE]] ]
				; CHECK-NEXT: [[UMAX:%.*]] = call i32 @llvm.umax.i32(i32 [[INDVARS_IV]], i32 400)
				; CHECK-NEXT: [[TMP0:%.*]] = add i32 [[UMAX]], [[INDVARS_IV2]]
				; CHECK-NEXT: [[UMIN:%.*]] = call i32 @llvm.umin.i32(i32 [[INDVARS_IV4]], i32 [[TMP0]])
	; CHECK-NEXT: br i1 [[C:%.]], label [[LEFT_HEADER_PREHEADER:%.]], label [[RIGHT_HEADER_PREHEADER:%.*]]			; CHECK-NEXT: br i1 [[C:%.]], label [[LEFT_HEADER_PREHEADER:%.]], label [[RIGHT_HEADER_PREHEADER:%.*]]
	; CHECK: right_header.preheader:			; CHECK: right_header.preheader:
	; CHECK-NEXT: br label [[RIGHT_HEADER:%.*]]			; CHECK-NEXT: br label [[RIGHT_HEADER:%.*]]
	; CHECK: left_header.preheader:			; CHECK: left_header.preheader:
				; CHECK-NEXT: [[TMP1:%.*]] = icmp ne i32 [[TMP0]], [[UMIN]]
				; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[INDVARS_IV4]], [[UMIN]]
	; CHECK-NEXT: br label [[LEFT_HEADER:%.*]]			; CHECK-NEXT: br label [[LEFT_HEADER:%.*]]
	; CHECK: right_header:			; CHECK: right_header:
	; CHECK-NEXT: [[R_IVI:%.]] = phi i32 [ [[R_IVI_NEXT:%.]], [[RIGHT_HEADER]] ], [ [[IVO]], [[RIGHT_HEADER_PREHEADER]] ]			; CHECK-NEXT: [[R_IVI:%.]] = phi i32 [ [[R_IVI_NEXT:%.]], [[RIGHT_HEADER]] ], [ [[IVO]], [[RIGHT_HEADER_PREHEADER]] ]
	; CHECK-NEXT: [[R_IVI_NEXT]] = add nuw nsw i32 [[R_IVI]], 1			; CHECK-NEXT: [[R_IVI_NEXT]] = add nuw nsw i32 [[R_IVI]], 1
	; CHECK-NEXT: [[R_IVI_NEXT_SDIV:%.*]] = sdiv i32 -1, [[R_IVI_NEXT]]			; CHECK-NEXT: [[R_IVI_NEXT_SDIV:%.*]] = sdiv i32 -1, [[R_IVI_NEXT]]
	; CHECK-NEXT: [[R_IVI_NEXT_ZEXT:%.*]] = zext i32 [[R_IVI_NEXT_SDIV]] to i64			; CHECK-NEXT: [[R_IVI_NEXT_ZEXT:%.*]] = zext i32 [[R_IVI_NEXT_SDIV]] to i64
	; CHECK-NEXT: call void @bar(i64 [[R_IVI_NEXT_ZEXT]])			; CHECK-NEXT: call void @bar(i64 [[R_IVI_NEXT_ZEXT]])
	; CHECK-NEXT: br i1 false, label [[OUTER_BACKEDGE_LOOPEXIT1:%.*]], label [[RIGHT_HEADER]]			; CHECK-NEXT: [[R_C:%.*]] = icmp sgt i32 [[R_IVI_NEXT]], 1
				; CHECK-NEXT: br i1 [[R_C]], label [[OUTER_BACKEDGE_LOOPEXIT1:%.*]], label [[RIGHT_HEADER]]
	; CHECK: left_header:			; CHECK: left_header:
	; CHECK-NEXT: br i1 false, label [[LEFT_BACKEDGE:%.]], label [[OUTER_BACKEDGE_LOOPEXIT:%.]]			; CHECK-NEXT: br i1 [[TMP1]], label [[LEFT_BACKEDGE:%.]], label [[OUTER_BACKEDGE_LOOPEXIT:%.]]
	; CHECK: left_backedge:			; CHECK: left_backedge:
	; CHECK-NEXT: br i1 false, label [[LEFT_EXIT:%.*]], label [[LEFT_HEADER]]			; CHECK-NEXT: br i1 [[TMP2]], label [[LEFT_EXIT:%.*]], label [[LEFT_HEADER]]
	; CHECK: left_exit:			; CHECK: left_exit:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[OUTER_BACKEDGE]]			; CHECK-NEXT: br label [[OUTER_BACKEDGE]]
	; CHECK: outer_backedge.loopexit:			; CHECK: outer_backedge.loopexit:
	; CHECK-NEXT: br label [[OUTER_BACKEDGE]]			; CHECK-NEXT: br label [[OUTER_BACKEDGE]]
	; CHECK: outer_backedge.loopexit1:			; CHECK: outer_backedge.loopexit1:
	; CHECK-NEXT: br label [[OUTER_BACKEDGE]]			; CHECK-NEXT: br label [[OUTER_BACKEDGE]]
	; CHECK: outer_backedge:			; CHECK: outer_backedge:
	; CHECK-NEXT: [[IVO_NEXT]] = add nuw nsw i32 [[IVO]], 1			; CHECK-NEXT: [[IVO_NEXT]] = add nuw nsw i32 [[IVO]], 1
	; CHECK-NEXT: [[C_2:%.*]] = icmp sgt i32 [[IVO]], -2			; CHECK-NEXT: [[C_2:%.*]] = icmp sgt i32 [[IVO]], -2
				; CHECK-NEXT: [[INDVARS_IV_NEXT]] = add nsw i32 [[INDVARS_IV]], 1
				; CHECK-NEXT: [[INDVARS_IV_NEXT3]] = add nsw i32 [[INDVARS_IV2]], -1
				; CHECK-NEXT: [[INDVARS_IV_NEXT5]] = add nsw i32 [[INDVARS_IV4]], -1
	; CHECK-NEXT: br i1 [[C_2]], label [[EXIT:%.*]], label [[OUTER_HEADER]]			; CHECK-NEXT: br i1 [[C_2]], label [[EXIT:%.*]], label [[OUTER_HEADER]]
	; CHECK: exit:			; CHECK: exit:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	br label %outer_header			br label %outer_header

	outer_header:			outer_header:
	Show All 38 Lines

llvm/test/Analysis/ScalarEvolution/nuw-add-sibling-loops.ll

	; RUN: opt -S -disable-output "-passes=print<scalar-evolution>" < %s 2>&1 \| FileCheck %s --check-prefix=CHECK-ANALYSIS			; RUN: opt -S -disable-output "-passes=print<scalar-evolution>" < %s 2>&1 \| FileCheck %s --check-prefix=CHECK-ANALYSIS

	; At the time test is written it shows that SCEV incorrectly computes flags			; At the time test is written it shows that SCEV incorrectly computes flags
	; for expression corresponding to outer_header loop.			; for expression corresponding to outer_header loop.

	; CHECK-ANALYSIS: %r.ivi.next = add nuw nsw i32 %r.ivi, 1			; CHECK-ANALYSIS: %r.ivi.next = add nuw nsw i32 %r.ivi, 1
	; CHECK-ANALYSIS-NEXT: {{{{}}-399,+,1}<nuw><nsw><%outer_header>,+,1}<nuw><nsw><%right_header>			; CHECK-ANALYSIS-NEXT: {{{{}}-399,+,1}<nsw><%outer_header>,+,1}<nuw><nsw><%right_header>
	; CHECK-ANALYSIS: %l.ivi.next = add nsw i32 %l.ivi, 1			; CHECK-ANALYSIS: %l.ivi.next = add nsw i32 %l.ivi, 1
	; CHECK-ANALYSIS-NEXT: {{{{}}-399,+,1}<nuw><nsw><%outer_header>,+,1}<nw><%left_header>			; CHECK-ANALYSIS-NEXT: {{{{}}-399,+,1}<nsw><%outer_header>,+,1}<nw><%left_header>
	define void @test(i1 %c) {			define void @test(i1 %c) {
	entry:			entry:
	br label %outer_header			br label %outer_header

	outer_header:			outer_header:
	%ivo = phi i32 [-400, %entry], [%ivo.next, %outer_header]			%ivo = phi i32 [-400, %entry], [%ivo.next, %outer_header]
	%ivo.next = add nuw nsw i32 %ivo, 1			%ivo.next = add nuw nsw i32 %ivo, 1
	%c.2 = icmp sgt i32 %ivo, -2			%c.2 = icmp sgt i32 %ivo, -2
	Show All 35 Lines