This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Transforms/Scalar/
-
Transforms/
-
Scalar/
22
InductiveRangeCheckElimination.cpp
-
test/Transforms/IRCE/
-
Transforms/
-
IRCE/
2
variable-loop-bounds.ll

Differential D45439

[IRCE] Use NUW flag for indvar
AbandonedPublic

Authored by samparker on Apr 9 2018, 6:47 AM.

Download Raw Diff

Details

Reviewers

mkazantsev
sanjoy

Summary

IsInductionVar currently looks for nsw properties for the induction variable, and this patch now adds the use of nuw. isKnownNonNegative has also been added as a helper function which uses isLoopEntryGuardedByCond to check that the value is greater or equal than zero. If this is valid, I will extract it out into SCEV's isKnownNonNegative.

Diff Detail

Event Timeline

samparker created this revision.Apr 9 2018, 6:47 AM

samparker added a reviewer: sanjoy.Apr 9 2018, 7:38 AM

mkazantsev requested changes to this revision.Apr 9 2018, 10:03 PM

mkazantsev added inline comments.

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
705	I don't think that these two checks make any sense, because `isLoopEntryGuardedByCond` makes trivial checks anyways. You are just duplicating efforts.
712	Are different values of `IsSigned` flag really needed here? :) You could've created zero constant just once.
936	This does not feel right. What if the indvar goes from `SINT_MAX - 10` to `SINT_MIN + 10` and, thus, has signed wrap? In this case it might have `nuw`, but if we deal with signed predicates, we might miscompile. I think the correct approach would be to identify whether the latch predicate is signed or unsigned, and if it is unsigned, then we may check `nuw` instead of `nsw`.
972	If these two changes (i.e. checking of `nuw` flag and advanced non-negative) check can be separated into 2 patches, plase do.

This revision now requires changes to proceed.Apr 9 2018, 10:03 PM

mkazantsev added inline comments.Apr 9 2018, 10:08 PM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
936	BTW, why it is `&&` and not `\|\|`? I think you were extending the scope of optimization, not narrowing it.
936	Ah, forget it, it's `!`... Scratch my 2nd comment on that.

samparker added inline comments.Apr 10 2018, 1:05 AM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
705	Removing isKnownNonNegative has no effects on the tests, but removing isKnownNegative is causing eq_ne and non_known_positive_end tests to fail. I will investigate.
712	:)
936	Good point!

mkazantsev added inline comments.Apr 10 2018, 1:32 AM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
705	It is strange, and maybe it is because of potential problem @ line 936. That is why I'd prefer these changes go separately. The part about `isKnownNonNegative` looks pretty safe, the part about `nuw` may be more fishy.

samparker mentioned this in D45481: [IRCE] isKnownNonNegative helper.Apr 10 2018, 3:19 AM

Split the helper function out to D45481 and rebased.

mkazantsev added inline comments.Apr 10 2018, 3:50 AM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
928	Why not `(Signed && HasNoSignedWrap(AR)) \|\| (!IsSigned && AR->hasNoUnsignedWrap())`?
929	Actually I look into it and the old implementation seems suspicious to me. We only check `nsw`, but when we don't have `nuw` and actually do have an unsigned wrap, why don't we have a bug on unsigned latch predicates? Or we do, but not aware about?.. :)
929	If you look at `HasNoSignedWrap`, it has a bit more complex check than just looking into the immediate flag, it also tries to go through `sext` to find this flag. Can we do the same for `nuw` and `zext` to keep things consistent?
964	It seems to be an independent change, please commit it separately as NFC.
test/Transforms/IRCE/variable-loop-bounds.ll
328	Same, this can go as NFC without approval.
361	Please add `CHECK` or `CHECK-NOT` on existence/non-existence of preloop and postloop, so that we can make sure that IRCE does good job generating/not generating them. Also the same applies to the tests above, you can add these checks as NFC separately (not super-urgent, but useful).

samparker added inline comments.Apr 10 2018, 6:55 AM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
929	Is this case handled on line 915?
929	Ok, I'll make some changes. I also need to handle equality differently, because I believe its classed as unsigned which messes up the NoSignedWrap check.

samparker added inline comments.Apr 10 2018, 9:01 AM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
929	I'm trying to understand test_02 of decrementing_loop. The test looks like it should work, but I can't figure out how the transform is making the decision. If I try to check nuw on unsigned and nsw on signed values, this test stops being transformed because it figures the indvar is NSW but ULT is being used.

Hi Sam,

In test_02 in decrementing loop %idx starts from %len.a and then make a range check on %idx < %len.a which always fails on 1st iteration. In practice it means that we should have a preloop (in this particular case it means that we deoptimize on 1st iteration). So IRCE here basically makes no sense, but if it DOES happen (i.e. the compiler is not smart enough to understand its non-profitability), it MUST make a preloop.

It does not have nuw flag because of %len.a is zero then %idx.next = sub i32 %idx, 1 is calculated with passing from 0 to -1 which is an unsigned overflow, so this instruction does not have a nuw, so SCEV might also not set nuw for the addrec.

I think (I didn't check it, though) that if you insert a check %len.a != 0 above the loop and only go to the loop with this condition, it will have nuw. I hope it helps!

mkazantsev added inline comments.Apr 11 2018, 10:50 PM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
929	Is this case handled on line 915? No, I mean something like for (i = -10; i <u 10; i++) This does have `nsw` but does not have a `nuw` because traversal from `-1` to `0` is an unsigned wrap. However in this case we will fail on 1st iteration, so it sholdn't be a bug in practice, but it's still fishy.
929	See my comment, I hope it will help. I think that either inserting the check `len.a != 0` before the loop or not expecting that IRCE will handle this case will be OK. I would prefer the first option of it works.

Hi Max,

Thanks, the reason why I don't understand why nuw isn't used is because the range data !{i32 0, i32 2147483647} is used for %len.a and the loop is only entered if %len.a != 0. The loop then exits when %idx < 2, so surely we should be able to discern that nuw is applicable? isKnownNonNegativeInLoop is also not helping in this case, but it would seem there's enough information here. Could it be that SCEV not be using the range metadata in conjunction with the entry and exit conditions?

thanks,
sam

Max,

After starting and playing around for the day, I've got to the root question of: why do we do the wrap checks anyway? Aren't the following range checks in the IsIncreasing/Decreasing blocks enough?

In D45439#1065206, @samparker wrote:

Hi Max,

Thanks, the reason why I don't understand why nuw isn't used is because the range data !{i32 0, i32 2147483647} is used for %len.a and the loop is only entered if %len.a != 0. The loop then exits when %idx < 2, so surely we should be able to discern that nuw is applicable? isKnownNonNegativeInLoop is also not helping in this case, but it would seem there's enough information here. Could it be that SCEV not be using the range metadata in conjunction with the entry and exit conditions?

thanks,
sam

Wow, I've overlooked that it already has 1st iteration check. :) Well, we cannot always expect that SCEV is smart enough to do something, maybe in this particular case it simply failed to prove the non-negativity of the indvar. It's not a bug, SCEV is just now as smart as humans are. I will take a look into this case when I have some time, maybe we can improve the situation by teaching SCEV to derive nuw here. Feel free to take a look into it as well.

In D45439#1065630, @samparker wrote:

Max,

After starting and playing around for the day, I've got to the root question of: why do we do the wrap checks anyway? Aren't the following range checks in the IsIncreasing/Decreasing blocks enough?

Just omitting the nsw check is clearly wrong. Look at the logic for ne predicates.

if (Pred == ICmpInst::ICMP_NE && LatchBrExitIdx == 1)
  // while (++i != len) {         while (++i < len) {
  //   ...                 --->     ...
  // }                            }
  // If both parts are known non-negative, it is profitable to use
  // unsigned comparison in increasing loop. This allows us to make the
  // comparison check against "RightSCEV + 1" more optimistic.
  if (SE.isKnownNonNegative(IndVarStart) &&
      SE.isKnownNonNegative(RightSCEV))
    Pred = ICmpInst::ICMP_ULT;
  else
    Pred = ICmpInst::ICMP_SLT;

This will not be an equivalent transform if i is greater than len on 1st iteration but then overflows and can still reach len after that. So with ne we will make a lot of iterations and then exit the loop, and with lt we will fail on the 1st iteration.

However in other cases, when predicate was slt/ult from the beginning, it's an interesting question whether we need this condition or not. I will give it some thought and run fuzzer testing to see what happens, it should either expose some problems which are not obvious or give a clue that you might be right and we don't need NoWrap flag check at all.

Interesting... I've made a dummy patch which lifts the restricton on nsw in cases when we are not dealing with eq/ne, and it passed 5000 complex fuzzed tests (which usually fail if there is some obvious bug). I also don't see a place where we would really need the no wrap other than eq/ne.

@samparker do you have a chance to prepare a patch that lifts this restriction? I can do it as well, but I'm a bit loaded atm. If we can get rid of no-wrap requirements at all, it will be a way better than using nuw. :)

Sure Max, thanks for doing the testing.

samparker mentioned this in D45617: [IRCE] Only check for NSW on equality predicates.Apr 13 2018, 4:33 AM

samparker added a parent revision: D45617: [IRCE] Only check for NSW on equality predicates.Apr 13 2018, 5:57 AM

Now that we're only looking for nsw with equality predicates, we should now be able to use nuw as well as nsw. HasNoSignedWrap has been renamed and queries hasNoSelfWrap, while still attempting to also search for nsw.

mkazantsev requested changes to this revision.Apr 15 2018, 6:09 PM

mkazantsev added inline comments.

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
894	This is incorrect. If counter value goes from `-1` to `(SINT_MAX + 1)` (both included), it has both signed and unsigned wrap and has no self-wrap.

This revision now requires changes to proceed.Apr 15 2018, 6:09 PM

mkazantsev added inline comments.Apr 15 2018, 6:13 PM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
894	Just to clarify the example: we start with `i = -1` and go up with step of `1` and exit the loop when `i = SINT_MAX + 1`. In this case when we pass from `-1` to `0` we are making an unsigned wrap and when we pass from `SINT_MAX` to `SINT_MAX + 1` we are making a signed wrap. There is no self-wrap because this variable will never reach `-1` again.

mkazantsev added inline comments.Apr 15 2018, 6:17 PM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
907	Do you have a plan to do the same for `zext`? I'm OK if it's in a follow-up patch.

samparker added inline comments.Apr 16 2018, 2:52 AM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
894	From the SCEV header comments, I understand that NoSelfWrap can be used to query whether either nsw or nuw has been set.
907	Yes, happy to do that in a follow-up.

Diffusion mentioned this in rL330256: [IRCE] Only check for NSW on equality predicates.Apr 18 2018, 6:53 AM

mkazantsev added inline comments.Jun 6 2018, 8:40 PM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
894	It is not quite right. If either of `nsw` or `nuw` is set then `nw` is also present automatically. But `nw` can also be present of none of those is set, like in my example of iterations from `-1` to `-2^31 + 1`. It crosses both signed and unsigned range borders but does not cross its initial value `-1`.

Herald added a subscriber: javed.absar. · View Herald TranscriptJun 6 2018, 8:40 PM

mkazantsev added inline comments.Jun 6 2018, 8:41 PM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
894	I mean, it iterates from -1 to min_int + 1 with step 1, through overflow.

samparker added inline comments.Jun 8 2018, 2:25 AM

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp
894	Ah, ok, thanks.

samparker abandoned this revision.May 4 2020, 1:05 AM

Revision Contents

Path

Size

lib/

Transforms/

Scalar/

InductiveRangeCheckElimination.cpp

26 lines

test/

Transforms/

IRCE/

variable-loop-bounds.ll

254 lines

Diff 142392

lib/Transforms/Scalar/InductiveRangeCheckElimination.cpp

Context not available.
	}	}
	}	}

	auto HasNoSignedWrap = [&](const SCEVAddRecExpr *AR) {	auto HasNoWrap = [&](const SCEVAddRecExpr *AR) {
	if (AR->getNoWrapFlags(SCEV::FlagNSW))	if (AR->hasNoSelfWrap())
		mkazantsevUnsubmitted Not Done Reply Inline Actions This is incorrect. If counter value goes from `-1` to `(SINT_MAX + 1)` (both included), it has both signed and unsigned wrap and has no self-wrap. mkazantsev: This is incorrect. If counter value goes from `-1` to `(SINT_MAX + 1)` (both included), it has…
		mkazantsevUnsubmitted Not Done Reply Inline Actions Just to clarify the example: we start with `i = -1` and go up with step of `1` and exit the loop when `i = SINT_MAX + 1`. In this case when we pass from `-1` to `0` we are making an unsigned wrap and when we pass from `SINT_MAX` to `SINT_MAX + 1` we are making a signed wrap. There is no self-wrap because this variable will never reach `-1` again. mkazantsev: Just to clarify the example: we start with `i = -1` and go up with step of `1` and exit the…
		samparkerAuthorUnsubmitted Not Done Reply Inline Actions From the SCEV header comments, I understand that NoSelfWrap can be used to query whether either nsw or nuw has been set. samparker: From the SCEV header comments, I understand that NoSelfWrap can be used to query whether either…
		mkazantsevUnsubmitted Not Done Reply Inline Actions It is not quite right. If either of `nsw` or `nuw` is set then `nw` is also present automatically. But `nw` can also be present of none of those is set, like in my example of iterations from `-1` to `-2^31 + 1`. It crosses both signed and unsigned range borders but does not cross its initial value `-1`. mkazantsev: It is not quite right. If either of `nsw` or `nuw` is set then `nw` is also present…
		mkazantsevUnsubmitted Not Done Reply Inline Actions I mean, it iterates from -1 to min_int + 1 with step 1, through overflow. mkazantsev: I mean, it iterates from -1 to min_int + 1 with step 1, through overflow.
		samparkerAuthorUnsubmitted Not Done Reply Inline Actions Ah, ok, thanks. samparker: Ah, ok, thanks.
	return true;	return true;

		// Try to find nsw.
	IntegerType *Ty = cast<IntegerType>(AR->getType());	IntegerType *Ty = cast<IntegerType>(AR->getType());
	IntegerType *WideTy =	IntegerType *WideTy =
	IntegerType::get(Ty->getContext(), Ty->getBitWidth() * 2);	IntegerType::get(Ty->getContext(), Ty->getBitWidth() * 2);

	const SCEVAddRecExpr *ExtendAfterOp =	const SCEVAddRecExpr *ExtendAfterOp =
	dyn_cast<SCEVAddRecExpr>(SE.getSignExtendExpr(AR, WideTy));	dyn_cast<SCEVAddRecExpr>(SE.getSignExtendExpr(AR, WideTy));
	if (ExtendAfterOp) {	if (!ExtendAfterOp)
	const SCEV *ExtendedStart = SE.getSignExtendExpr(AR->getStart(), WideTy);	return false;
	const SCEV *ExtendedStep =
	SE.getSignExtendExpr(AR->getStepRecurrence(SE), WideTy);

	bool NoSignedWrap = ExtendAfterOp->getStart() == ExtendedStart &&
	ExtendAfterOp->getStepRecurrence(SE) == ExtendedStep;

	if (NoSignedWrap)	const SCEV *ExtendedStart = SE.getSignExtendExpr(AR->getStart(), WideTy);
		mkazantsevUnsubmitted Not Done Reply Inline Actions Do you have a plan to do the same for `zext`? I'm OK if it's in a follow-up patch. mkazantsev: Do you have a plan to do the same for `zext`? I'm OK if it's in a follow-up patch.
		samparkerAuthorUnsubmitted Not Done Reply Inline Actions Yes, happy to do that in a follow-up. samparker: Yes, happy to do that in a follow-up.
	return true;	const SCEV *ExtendedStep =
	}	SE.getSignExtendExpr(AR->getStepRecurrence(SE), WideTy);

	// We may have proved this when computing the sign extension above.	return ExtendAfterOp->getStart() == ExtendedStart &&
	return AR->getNoWrapFlags(SCEV::FlagNSW) != SCEV::FlagAnyWrap;	ExtendAfterOp->getStepRecurrence(SE) == ExtendedStep;
	};	};

	// `ICI` is interpreted as taking the backedge if the next value of the	// `ICI` is interpreted as taking the backedge if the next value of the
Context not available.
	FailureReason = "LHS in icmp not induction variable";	FailureReason = "LHS in icmp not induction variable";
	return None;	return None;
	}	}
	if (ICI->isEquality() && !HasNoSignedWrap(IndVarBase)) {	if (ICI->isEquality() && !HasNoWrap(IndVarBase)) {
	FailureReason = "LHS in icmp needs nsw for equality predicates";	FailureReason = "LHS in icmp needs nsw for equality predicates";
	mkazantsevUnsubmitted Not Done Reply Inline Actions Actually I look into it and the old implementation seems suspicious to me. We only check `nsw`, but when we don't have `nuw` and actually do have an unsigned wrap, why don't we have a bug on unsigned latch predicates? Or we do, but not aware about?.. :) mkazantsev: Actually I look into it and the old implementation seems suspicious to me. We only check `nsw`…
	samparkerAuthorUnsubmitted Not Done Reply Inline Actions Is this case handled on line 915? samparker: Is this case handled on line 915?
	mkazantsevUnsubmitted Not Done Reply Inline Actions Is this case handled on line 915? No, I mean something like for (i = -10; i <u 10; i++) This does have `nsw` but does not have a `nuw` because traversal from `-1` to `0` is an unsigned wrap. However in this case we will fail on 1st iteration, so it sholdn't be a bug in practice, but it's still fishy. mkazantsev: > Is this case handled on line 915? No, I mean something like for (i = -10; i <u 10; i++)…
	samparkerAuthorUnsubmitted Not Done Reply Inline Actions I'm trying to understand test_02 of decrementing_loop. The test looks like it should work, but I can't figure out how the transform is making the decision. If I try to check nuw on unsigned and nsw on signed values, this test stops being transformed because it figures the indvar is NSW but ULT is being used. samparker: I'm trying to understand test_02 of decrementing_loop. The test looks like it should work, but…
	mkazantsevUnsubmitted Not Done Reply Inline Actions See my comment, I hope it will help. I think that either inserting the check `len.a != 0` before the loop or not expecting that IRCE will handle this case will be OK. I would prefer the first option of it works. mkazantsev: See my comment, I hope it will help. I think that either inserting the check `len.a != 0`…
	return None;	return None;
	}	}
Context not available.
		mkazantsevUnsubmitted Not Done Reply Inline Actions This does not feel right. What if the indvar goes from `SINT_MAX - 10` to `SINT_MIN + 10` and, thus, has signed wrap? In this case it might have `nuw`, but if we deal with signed predicates, we might miscompile. I think the correct approach would be to identify whether the latch predicate is signed or unsigned, and if it is unsigned, then we may check `nuw` instead of `nsw`. mkazantsev: This does not feel right. What if the indvar goes from `SINT_MAX - 10` to `SINT_MIN + 10` and…
		mkazantsevUnsubmitted Not Done Reply Inline Actions BTW, why it is `&&` and not `\|\|`? I think you were extending the scope of optimization, not narrowing it. mkazantsev: BTW, why it is `&&` and not `\|\|`? I think you were extending the scope of optimization, not…
		mkazantsevUnsubmitted Not Done Reply Inline Actions Ah, forget it, it's `!`... Scratch my 2nd comment on that. mkazantsev: Ah, forget it, it's `!`... Scratch my 2nd comment on that.
		samparkerAuthorUnsubmitted Not Done Reply Inline Actions Good point! samparker: Good point!
		mkazantsevUnsubmitted Not Done Reply Inline Actions If these two changes (i.e. checking of `nuw` flag and advanced non-negative) check can be separated into 2 patches, plase do. mkazantsev: If these two changes (i.e. checking of `nuw` flag and advanced non-negative) check can be…
		mkazantsevUnsubmitted Not Done Reply Inline Actions Why not `(Signed && HasNoSignedWrap(AR)) \|\| (!IsSigned && AR->hasNoUnsignedWrap())`? mkazantsev: Why not `(Signed && HasNoSignedWrap(AR)) \|\| (!IsSigned && AR->hasNoUnsignedWrap())`?
		mkazantsevUnsubmitted Not Done Reply Inline Actions If you look at `HasNoSignedWrap`, it has a bit more complex check than just looking into the immediate flag, it also tries to go through `sext` to find this flag. Can we do the same for `nuw` and `zext` to keep things consistent? mkazantsev: If you look at `HasNoSignedWrap`, it has a bit more complex check than just looking into the…
		samparkerAuthorUnsubmitted Not Done Reply Inline Actions Ok, I'll make some changes. I also need to handle equality differently, because I believe its classed as unsigned which messes up the NoSignedWrap check. samparker: Ok, I'll make some changes. I also need to handle equality differently, because I believe its…
		mkazantsevUnsubmitted Not Done Reply Inline Actions It seems to be an independent change, please commit it separately as NFC. mkazantsev: It seems to be an independent change, please commit it separately as NFC.

test/Transforms/IRCE/variable-loop-bounds.ll

Context not available.
	; CHECK: irce: in function signed_var_imm_dec_sge: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%for.inc<latch><exiting>	; CHECK: irce: in function signed_var_imm_dec_sge: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%for.inc<latch><exiting>
	; CHECK: irce: in function signed_var_imm_dec_ne: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%for.inc<latch><exiting>	; CHECK: irce: in function signed_var_imm_dec_ne: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%for.inc<latch><exiting>
	; CHECK-NOT: irce: in function signed_var_imm_dec_eq: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%for.inc<latch><exiting>	; CHECK-NOT: irce: in function signed_var_imm_dec_eq: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%for.inc<latch><exiting>
		; CHECK: irce: in function test_inc_eq_var_nuw: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%if.then,%for.inc<latch><exiting>
		; CHECK-NOT: irce: in function test_inc_ne_var_nuw: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%if.then,%for.inc<latch><exiting>
		; CHECK: irce: in function test_inc_ult_var_nuw: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%if.then,%for.inc<latch><exiting>
		; CHECK: irce: in function test_dec_ult_ugt_var_imm_nuw: constrained Loop at depth 1 containing: %for.body<header>,%if.else,%if.then,%for.inc<latch><exiting>
		; CHECK-NOT: irce: in function inc_subrange_var_var_ult_nuw: constrained Loop
		; CHECK-NOT: irce: in function inc_subrange_var_var_ult: constrained Loop

	; CHECK-LABEL: test_inc_eq(	; CHECK-LABEL: test_inc_eq(
	; CHECK: main.exit.selector:	; CHECK: main.exit.selector:
		mkazantsevUnsubmitted Not Done Reply Inline Actions Same, this can go as NFC without approval. mkazantsev: Same, this can go as NFC without approval.
Context not available.
	%cmp = icmp eq i32 %dec, %M	%cmp = icmp eq i32 %dec, %M
	br i1 %cmp, label %for.cond.cleanup, label %for.body	br i1 %cmp, label %for.cond.cleanup, label %for.body
	}	}

		mkazantsevUnsubmitted Not Done Reply Inline Actions Please add `CHECK` or `CHECK-NOT` on existence/non-existence of preloop and postloop, so that we can make sure that IRCE does good job generating/not generating them. Also the same applies to the tests above, you can add these checks as NFC separately (not super-urgent, but useful). mkazantsev: Please add `CHECK` or `CHECK-NOT` on existence/non-existence of preloop and postloop, so that…
		; CHECK-LABEL: test_inc_eq_var_nuw(
		; CHECK: for.body:
		; CHECK: br i1 true, label %if.then, label %if.else

		; CHECK: main.exit.selector:
		; CHECK: [[PSEUDO_PHI:%[^ ]+]] = phi i32 [ %inc, %for.inc ]
		; CHECK: [[COND:%[^ ]+]] = icmp ult i32 [[PSEUDO_PHI]], %N
		; CHECK: br i1 [[COND]], label %main.pseudo.exit, label %for.cond.cleanup.loopexit

		; CHECK: for.body.postloop:
		; CHECK: [[POST_IV:%[^ ]+]] = phi i32 [ %inc.postloop, %for.inc.postloop ]
		; CHECK: [[POST_COND:%[^ ]+]] = icmp ult i32 [[POST_IV]], %M
		; CHECK: br i1 [[POST_COND]], label %if.then.postloop, label %if.else.postloop
		define void @test_inc_eq_var_nuw(i32* %a, i32* %b, i32* %c, i32 %N, i32 %M) {
		entry:
		%cmp16 = icmp ugt i32 %N, 0
		br i1 %cmp16, label %for.body, label %for.cond.cleanup

		for.cond.cleanup:
		ret void

		for.body:
		%i.017 = phi i32 [ %inc, %for.inc ], [ 0, %entry ]
		%cmp1 = icmp ult i32 %i.017, %M
		%arrayidx = getelementptr inbounds i32, i32* %b, i32 %i.017
		%0 = load i32, i32* %arrayidx, align 4
		%arrayidx2 = getelementptr inbounds i32, i32* %c, i32 %i.017
		%1 = load i32, i32* %arrayidx2, align 4
		br i1 %cmp1, label %if.then, label %if.else

		if.then:
		%sub = sub i32 %0, %1
		%arrayidx3 = getelementptr inbounds i32, i32* %a, i32 %i.017
		%2 = load i32, i32* %arrayidx3, align 4
		%add = add nsw i32 %sub, %2
		store i32 %add, i32* %arrayidx3, align 4
		br label %for.inc

		if.else:
		%add6 = add nsw i32 %1, %0
		%arrayidx7 = getelementptr inbounds i32, i32* %a, i32 %i.017
		store i32 %add6, i32* %arrayidx7, align 4
		br label %for.inc

		for.inc:
		%inc = add nuw i32 %i.017, 1
		%exitcond = icmp eq i32 %inc, %N
		br i1 %exitcond, label %for.cond.cleanup, label %for.body
		}

		; CHECK-LABEL: test_inc_ne_var_nuw
		; TODO Don't know why eq works but ne doesn't.
		define void @test_inc_ne_var_nuw(i32* %a, i32* %b, i32* %c, i32 %N, i32 %M) {
		entry:
		%cmp16 = icmp ugt i32 %N, 0
		br i1 %cmp16, label %for.body, label %for.cond.cleanup

		for.cond.cleanup:
		ret void

		for.body:
		%i.017 = phi i32 [ %inc, %for.inc ], [ 0, %entry ]
		%cmp1 = icmp ult i32 %i.017, %M
		%arrayidx = getelementptr inbounds i32, i32* %b, i32 %i.017
		%0 = load i32, i32* %arrayidx, align 4
		%arrayidx2 = getelementptr inbounds i32, i32* %c, i32 %i.017
		%1 = load i32, i32* %arrayidx2, align 4
		br i1 %cmp1, label %if.then, label %if.else

		if.then:
		%sub = sub i32 %0, %1
		%arrayidx3 = getelementptr inbounds i32, i32* %a, i32 %i.017
		%2 = load i32, i32* %arrayidx3, align 4
		%add = add nsw i32 %sub, %2
		store i32 %add, i32* %arrayidx3, align 4
		br label %for.inc

		if.else:
		%add6 = add nsw i32 %1, %0
		%arrayidx7 = getelementptr inbounds i32, i32* %a, i32 %i.017
		store i32 %add6, i32* %arrayidx7, align 4
		br label %for.inc

		for.inc:
		%inc = add nuw i32 %i.017, 1
		%exitcond = icmp ne i32 %inc, %N
		br i1 %exitcond, label %for.body, label %for.cond.cleanup
		}

		; CHECK-LABEL: test_inc_ult_var_nuw
		; CHECK: for.body:
		; CHECK: br i1 true, label %if.then, label %if.else

		; CHECK: main.exit.selector:
		; CHECK: [[PSEUDO_PHI:%[^ ]+]] = phi i32 [ %inc, %for.inc ]
		; CHECK: [[COND:%[^ ]+]] = icmp ult i32 [[PSEUDO_PHI]], %N
		; CHECK: br i1 [[COND]], label %main.pseudo.exit, label %for.cond.cleanup.loopexit

		; CHECK: for.body.postloop:
		; CHECK: [[POST_IV:%[^ ]+]] = phi i32 [ %inc.postloop, %for.inc.postloop ]
		; CHECK: [[POST_COND:%[^ ]+]] = icmp ult i32 [[POST_IV]], %M
		; CHECK: br i1 [[POST_COND]], label %if.then.postloop, label %if.else.postloop
		define void @test_inc_ult_var_nuw(i32* %a, i32* %b, i32* %c, i32 %N, i32 %M) {
		entry:
		%cmp16 = icmp ugt i32 %N, 0
		br i1 %cmp16, label %for.body, label %for.cond.cleanup

		for.cond.cleanup:
		ret void

		for.body:
		%i.017 = phi i32 [ %inc, %for.inc ], [ 0, %entry ]
		%cmp1 = icmp ult i32 %i.017, %M
		%arrayidx = getelementptr inbounds i32, i32* %b, i32 %i.017
		%0 = load i32, i32* %arrayidx, align 4
		%arrayidx2 = getelementptr inbounds i32, i32* %c, i32 %i.017
		%1 = load i32, i32* %arrayidx2, align 4
		br i1 %cmp1, label %if.then, label %if.else

		if.then:
		%sub = sub i32 %0, %1
		%arrayidx3 = getelementptr inbounds i32, i32* %a, i32 %i.017
		%2 = load i32, i32* %arrayidx3, align 4
		%add = add nsw i32 %sub, %2
		store i32 %add, i32* %arrayidx3, align 4
		br label %for.inc

		if.else:
		%add6 = add nsw i32 %1, %0
		%arrayidx7 = getelementptr inbounds i32, i32* %a, i32 %i.017
		store i32 %add6, i32* %arrayidx7, align 4
		br label %for.inc

		for.inc:
		%inc = add nuw i32 %i.017, 1
		%exitcond = icmp ult i32 %inc, %N
		br i1 %exitcond, label %for.body, label %for.cond.cleanup
		}

		; CHECK-LABEL: test_dec_ult_ugt_var_imm_nuw(
		; CHECK: for.body:
		; CHECK: [[IV:%[^ ]+]] = phi i32 [ %inc, %for.inc ], [ %i.017.preloop.copy, %mainloop ]
		; CHECK: [[COND:%[^ ]+]] = icmp ult i32 [[IV]], %M
		; CHECK: br i1 true, label %if.then, label %if.else

		; CHECK: for.body.preloop:
		; CHECK: [[PRE_IV:%[^ ]+]] = phi i32 [ %inc.preloop, %for.inc.preloop ], [ 1024, %for.body.preloop.preheader ]
		; CHECK: [[PRE_COND:%[^ ]+]] = icmp ult i32 [[PRE_IV]], %M
		; CHECK: br i1 [[PRE_COND]], label %if.then.preloop, label %if.else.preloop
		define void @test_dec_ult_ugt_var_imm_nuw(i32* %a, i32* %b, i32* %c, i32 %N, i32 %M) {
		entry:
		br label %for.body

		for.cond.cleanup:
		ret void

		for.body:
		%i.017 = phi i32 [ %inc, %for.inc ], [ 1024, %entry ]
		%cmp1 = icmp ult i32 %i.017, %M
		%arrayidx = getelementptr inbounds i32, i32* %b, i32 %i.017
		%0 = load i32, i32* %arrayidx, align 4
		%arrayidx2 = getelementptr inbounds i32, i32* %c, i32 %i.017
		%1 = load i32, i32* %arrayidx2, align 4
		br i1 %cmp1, label %if.then, label %if.else

		if.then:
		%sub = sub i32 %0, %1
		%arrayidx3 = getelementptr inbounds i32, i32* %a, i32 %i.017
		%2 = load i32, i32* %arrayidx3, align 4
		%add = add nsw i32 %sub, %2
		store i32 %add, i32* %arrayidx3, align 4
		br label %for.inc

		if.else:
		%add6 = add nsw i32 %1, %0
		%arrayidx7 = getelementptr inbounds i32, i32* %a, i32 %i.017
		store i32 %add6, i32* %arrayidx7, align 4
		br label %for.inc

		for.inc:
		%inc = add nuw i32 %i.017, -1
		%exitcond = icmp ugt i32 %inc, 0
		br i1 %exitcond, label %for.body, label %for.cond.cleanup
		}

		; TODO: IRCE should be able to understand that the (N-10):(N-1) is safe.
		define void @inc_subrange_var_var_ult_nuw(i32* %a, i32* %b, i32 %N, i32 %M) {
		entry:
		%begin = add nuw i32 %N, -10
		%cmp0 = icmp ugt i32 %begin, 0
		br i1 %cmp0, label %for.body, label %for.cond.cleanup

		for.cond.cleanup:
		ret void

		for.body:
		%iv = phi i32 [ %begin, %entry ], [ %inc, %for.inc ]
		%cmp1 = icmp ult i32 %iv, %M
		br i1 %cmp1, label %if.then, label %if.else

		if.then:
		%arrayidxA = getelementptr inbounds i32, i32* %a, i32 %iv
		store i32 %iv, i32* %arrayidxA, align 4
		br label %for.inc

		if.else:
		%arrayidxB = getelementptr inbounds i32, i32* %b, i32 %iv
		store i32 %iv, i32* %arrayidxB, align 4
		br label %for.inc

		for.inc:
		%inc = add nuw i32 %iv, 1
		%cmp2 = icmp ult i32 %inc, %N
		br i1 %cmp2, label %for.body, label %for.cond.cleanup
		}

		; TODO: IRCE can determine that %iv is nuw, but it also needs to understand
		; that the (N-10):(N-1) range is safe.
		define void @inc_subrange_var_var_ult(i32* %a, i32* %b, i32 %N, i32 %M) {
		entry:
		%begin = add nuw i32 %N, -10
		%cmp0 = icmp ugt i32 %begin, 0
		br i1 %cmp0, label %for.body, label %for.cond.cleanup

		for.cond.cleanup:
		ret void

		for.body:
		%iv = phi i32 [ %begin, %entry ], [ %inc, %for.inc ]
		%cmp1 = icmp ult i32 %iv, %M
		br i1 %cmp1, label %if.then, label %if.else

		if.then:
		%arrayidxA = getelementptr inbounds i32, i32* %a, i32 %iv
		store i32 %iv, i32* %arrayidxA, align 4
		br label %for.inc

		if.else:
		%arrayidxB = getelementptr inbounds i32, i32* %b, i32 %iv
		store i32 %iv, i32* %arrayidxB, align 4
		br label %for.inc

		for.inc:
		%inc = add i32 %iv, 1
		%cmp2 = icmp ult i32 %inc, %N
		br i1 %cmp2, label %for.body, label %for.cond.cleanup
		}
Context not available.