This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Analysis/
-
Analysis/
-
ScalarEvolution.cpp
-
test/Transforms/IndVarSimplify/
-
Transforms/
-
IndVarSimplify/
3
prove_via_monotonicity.ll

Differential D43375

[SCEV] Prove predicates in loops via monotonicity
AbandonedPublic

Authored by mkazantsev on Feb 16 2018, 1:59 AM.

Download Raw Diff

Details

Reviewers

sanjoy
apilipenko
anna
reames

Summary

When we try to prove some predicate being true on every iteration of a loop,
sometimes we cannot prove that backedge is guarded by the condition we need.
But if the predicate is monotonic, it is enough to prove it on the first and last iteration
of the loop, and this will give us its proof on every loop's iteration.

Diff Detail

Event Timeline

mkazantsev created this revision.Feb 16 2018, 1:59 AM

mkazantsev added a parent revision: D43373: [SCEV][NFC] Factor out common logic into a separate method.

This is very similar to what we had as an initial implementation of loop predication. Later we found that this approach is problematic. See https://reviews.llvm.org/D37569 description for the details. This patch likely suffers from a similar problem.

In D43375#1012189, @apilipenko wrote:

This is very similar to what we had as an initial implementation of loop predication. Later we found that this approach is problematic. See https://reviews.llvm.org/D37569 description for the details. This patch likely suffers from a similar problem.

I don't think it does. The problem with Loop Predication was that we made an implicit assumption that the loop backedge is taken, and if it actually wasn't (and the loop exited by guard's condition).

In this patch, we require the number of backedge taken count be computable. If the loop fails the guard after N-th iteration, this number should not exceed N. So the end value we take is not the value on theoretical "last" iteration after which we exit by loop condition, but the value before exiting by either exit condition.

Actually if a loop exit by guard, we should either be able to understand that it happens and limit the backedge taken count to this value, or return SCEVCouldNotCompute at this query. So I don't think it's the case here.

I see the point now. Backedge taken count does not factor in the abnormal exits like guard exits. Looks fishy indeed. Need to check whether we may end up with computable number of iterations when we go via overflow, and maybe make our checks more strict to restrain such cases.

@apilipenko you were right. I was able to construct a test which demonstrates the problem:

define void @test_03(i32* %p, i32* %a) {

entry:
  %len = load i32, i32* %p, align 4, !range !0
  %len.minus.1 = add nsw i32 %len, -1
  %zero_check = icmp eq i32 %len, 0
  br i1 %zero_check, label %loopexit, label %preheader

preheader:
  br label %loop

loopexit:
  ret void

loop:
  %iv = phi i32 [ %iv.next, %loop ], [ -2000000000, %preheader ]
  %iv.wide = zext i32 %iv to i64
  %el = getelementptr inbounds i32, i32* %a, i64 %iv.wide
  %fishy = icmp sgt i32 %iv, -2000000009
  call void(i1, ...) @llvm.experimental.guard(i1 %fishy) [ "deopt"() ]
  store atomic i32 0, i32* %el unordered, align 4
  %iv.next = add nsw i32 %iv, -1
  %loopcond = icmp eq i32 %iv, 2000000000
  br i1 %loopcond, label %loopexit, label %loop
}

-indvars here produces infinite loop instead of deopt after reachng -2000000009. I need to think on alternative solution.

Added checks to avoid circular logic in removing of side exits basing on facts that are derived from side-exits.

mkazantsev added a parent revision: D43610: [SCEV][NFC] Introduce utility functions that measure number of iterations before overflow.Feb 21 2018, 10:49 PM

sanjoy added inline comments.Feb 21 2018, 11:41 PM

test/Transforms/IndVarSimplify/prove_via_monotonicity.ll
19	I think we can optimize these cases without thinking about monotonicity -- in both `@test_01` and `@test_02` to widen the IV you need to prove that if the backedge is taken then `%iv` is not `0`; and SCEV should be able to prove that easily from the BE condition.

mkazantsev added inline comments.Feb 25 2018, 10:48 PM

test/Transforms/IndVarSimplify/prove_via_monotonicity.ll
19	It looks like `isImpliedCondOperandsViaNoOverflow` could have done that, but it doesn't currently support `sgt` predicate (which is weird). I will try to come up with alternative solution there.

Abandoning in favor of https://reviews.llvm.org/D43759

test/Transforms/IndVarSimplify/prove_via_monotonicity.ll
19	https://reviews.llvm.org/D43759

mkazantsev abandoned this revision.Feb 27 2018, 9:38 PM

Revision Contents

Path

Size

lib/

Analysis/

ScalarEvolution.cpp

29 lines

test/

Transforms/

IndVarSimplify/

prove_via_monotonicity.ll

102 lines

Diff 135365

lib/Analysis/ScalarEvolution.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 8,748 Lines • ▼ Show 20 Lines	bool ScalarEvolution::isKnownPredicate(ICmpInst::Predicate Pred,
// Otherwise see what can be done with some simple reasoning.		// Otherwise see what can be done with some simple reasoning.
return isKnownViaNonRecursiveReasoning(Pred, LHS, RHS);		return isKnownViaNonRecursiveReasoning(Pred, LHS, RHS);
}		}

bool ScalarEvolution::isKnownOnEveryIteration(ICmpInst::Predicate Pred,		bool ScalarEvolution::isKnownOnEveryIteration(ICmpInst::Predicate Pred,
const SCEVAddRecExpr *LHS,		const SCEVAddRecExpr *LHS,
const SCEV *RHS) {		const SCEV *RHS) {
const Loop *L = LHS->getLoop();		const Loop *L = LHS->getLoop();
return isLoopEntryGuardedByCond(L, Pred, LHS->getStart(), RHS) &&		// Check if the condition is met on the first iteration.
isLoopBackedgeGuardedByCond(L, Pred, LHS->getPostIncExpr(*this), RHS);		if (!isLoopEntryGuardedByCond(L, Pred, LHS->getStart(), RHS))
		return false;

		// If the backedge is guarded by condition for post-inc LHS, then we've
		// proved that our predicate will be true on every iteration.
		if (isLoopBackedgeGuardedByCond(L, Pred, LHS->getPostIncExpr(*this), RHS))
		return true;

		// If the predicate is monotonic, and we've already proved it on the first
		// iteration, it is enough to prove it on the last iteration. The tricky part
		// is that the monotonicity could be cased on no-wrap flag which was derived
		// from a taken side-exit (for example, a guard), and if we actually did the
		// number of iterations returned by getBackedgeTakenCount(), we would have
		// overflown. That is why we make an extra check that we don't overflow even
		// if we make this many iterations.
		bool Increasing = false;
		if (isMonotonicPredicate(LHS, Pred, Increasing) &&
		isProvedNoOverflowOnNormalExit(LHS, ICmpInst::isSigned(Pred))) {
		const SCEV End = LHS->evaluateAtIteration(getBackedgeTakenCount(L), this);
		if (isAvailableAtLoopEntry(End, L) &&
		isLoopEntryGuardedByCond(L, Pred, End, RHS))
		return true;
		}

		// Failed to prove anything, conservatively return false.
		return false;
}		}

bool ScalarEvolution::isMonotonicPredicate(const SCEVAddRecExpr *LHS,		bool ScalarEvolution::isMonotonicPredicate(const SCEVAddRecExpr *LHS,
ICmpInst::Predicate Pred,		ICmpInst::Predicate Pred,
bool &Increasing) {		bool &Increasing) {
bool Result = isMonotonicPredicateImpl(LHS, Pred, Increasing);		bool Result = isMonotonicPredicateImpl(LHS, Pred, Increasing);

#ifndef NDEBUG		#ifndef NDEBUG
▲ Show 20 Lines • Show All 3,182 Lines • Show Last 20 Lines

test/Transforms/IndVarSimplify/prove_via_monotonicity.ll

This file was added.

				; RUN: opt -indvars -S < %s \| FileCheck %s

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128-ni:1"
				target triple = "x86_64-unknown-linux-gnu"

				declare void @llvm.experimental.guard(i1, ...)

				; Check that we can widen the indvar for the decrementing loop. Phi's range
				; doesn't give us enough information, so we have to prove it using monotonicity.

				define void @test_01(i32* %p, i32* %a) {

				; CHECK-LABEL: @test_01(
				; CHECK-NOT: trunc

				entry:
				%len = load i32, i32* %p, align 4, !range !0
				%len.minus.1 = add nsw i32 %len, -1
				%zero_check = icmp eq i32 %len, 0
				sanjoyUnsubmitted Not Done Reply Inline Actions I think we can optimize these cases without thinking about monotonicity -- in both `@test_01` and `@test_02` to widen the IV you need to prove that if the backedge is taken then `%iv` is not `0`; and SCEV should be able to prove that easily from the BE condition. sanjoy: I think we can optimize these cases without thinking about monotonicity -- in both `@test_01`…
				mkazantsevAuthorUnsubmitted Not Done Reply Inline Actions It looks like `isImpliedCondOperandsViaNoOverflow` could have done that, but it doesn't currently support `sgt` predicate (which is weird). I will try to come up with alternative solution there. mkazantsev: It looks like `isImpliedCondOperandsViaNoOverflow` could have done that, but it doesn't…
				mkazantsevAuthorUnsubmitted Not Done Reply Inline Actions https://reviews.llvm.org/D43759 mkazantsev: https://reviews.llvm.org/D43759
				br i1 %zero_check, label %loopexit, label %preheader

				preheader:
				br label %loop

				loopexit:
				ret void

				loop:
				%iv = phi i32 [ %iv.next, %loop ], [ %len.minus.1, %preheader ]
				; CHECK: %indvars.iv = phi i64
				%iv.wide = zext i32 %iv to i64
				%el = getelementptr inbounds i32, i32* %a, i64 %iv.wide
				store atomic i32 0, i32* %el unordered, align 4
				%iv.next = add nsw i32 %iv, -1
				; CHECK: %loopcond = icmp slt i64 %indvars.iv, 1
				%loopcond = icmp slt i32 %iv, 1
				br i1 %loopcond, label %loopexit, label %loop
				}

				define void @test_02(i32* %p, i32* %a) {

				; CHECK-LABEL: @test_02(
				; CHECK-NOT: trunc

				entry:
				%len = load i32, i32* %p, align 4, !range !0
				%len.minus.1 = add nsw i32 %len, -1
				%zero_check = icmp eq i32 %len, 0
				br i1 %zero_check, label %loopexit, label %preheader

				preheader:
				br label %loop

				loopexit:
				ret void

				loop:
				%iv = phi i32 [ %iv.next, %loop ], [ %len.minus.1, %preheader ]
				; CHECK: %indvars.iv = phi i64
				%iv.wide = zext i32 %iv to i64
				%el = getelementptr inbounds i32, i32* %a, i64 %iv.wide
				store atomic i32 0, i32* %el unordered, align 4
				%iv.next = add nsw i32 %iv, -1
				; CHECK: %loopcond = icmp ult i64 %indvars.iv, 1
				%loopcond = icmp ult i32 %iv, 1
				br i1 %loopcond, label %loopexit, label %loop
				}

				; In this situation the indvar has nsw, but it is derived from the guard. If the
				; side exit wasn't present, we would be unable to prove it. Make sure that we do
				; not optimize away the guard basing on the fact which was derived from this
				; very guard.
				define void @test_03(i32* %p, i32* %a) {

				; CHECK-LABEL: @test_03

				entry:
				%len = load i32, i32* %p, align 4, !range !0
				%len.minus.1 = add nsw i32 %len, -1
				%zero_check = icmp eq i32 %len, 0
				br i1 %zero_check, label %loopexit, label %preheader

				preheader:
				br label %loop

				loopexit:
				ret void

				loop:
				%iv = phi i32 [ %iv.next, %loop ], [ -2000000000, %preheader ]
				%iv.wide = zext i32 %iv to i64
				%el = getelementptr inbounds i32, i32* %a, i64 %iv.wide
				%fishy = icmp sgt i32 %iv, -2000000009
				; CHECK-NOT: call void (i1, ...) @llvm.experimental.guard(i1 true) [ "deopt"() ]
				call void(i1, ...) @llvm.experimental.guard(i1 %fishy) [ "deopt"() ]
				store atomic i32 0, i32* %el unordered, align 4
				%iv.next = add nsw i32 %iv, -1
				%loopcond = icmp eq i32 %iv, 2000000000
				br i1 %loopcond, label %loopexit, label %loop
				}

				!0 = !{i32 0, i32 2147483647}