This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Analysis/
-
Analysis/
-
ScalarEvolution.cpp
-
test/Analysis/ScalarEvolution/
-
Analysis/
-
ScalarEvolution/
-
trip-count.ll

Differential D70623

[SCEV] Compute trip counts w/frozen conditions
AbandonedPublic

Authored by reames on Nov 22 2019, 3:09 PM.

Download Raw Diff

Details

Reviewers

dalegr
sanjoy
nlopes
aqjune

Summary

I'd really appreciate a sceptical eye on this. I'm not 100% sure this is correct.

The motivation is that unswitching a loop nest will produce conditions in the outer loop which are frozen. If that exit would otherwise be computable, we'd really like it to remain computable w/freeze.

My hesitation is that I'm not sure it's sound to propagate an analysis result (potentially based on UB) through freeze. What do folks think? Is this legal? Or not?

If not, suggestions for alternate approaches to recover knowledge in SCEV?

Diff Detail

Event Timeline

reames created this revision.Nov 22 2019, 3:09 PM

Herald added a project: Restricted Project. · View Herald TranscriptNov 22 2019, 3:09 PM

Herald added subscribers: javed.absar, bollu, hiraditya, mcrosier. · View Herald Transcript

Hi,

Currently ScalarEvolution::getBackedgeTakenCount() is commented as follows, which is not clear about the case when instruction returning nondeterministic value is involved (such as freeze).

/// If the specified loop has a predictable backedge-taken count, return it,
/// otherwise return a SCEVCouldNotCompute object. The backedge-taken count is
/// the number of times the loop header will be branched to from within the
/// loop, assuming there are no abnormal exists like exception throws. This is
/// one less than the trip count of the loop, since it doesn't count the first
/// iteration, when the header is branched to from outside the loop.
///  
/// Note that it is not valid to call this method on a loop without a
/// loop-invariant backedge-taken count (see
/// hasLoopInvariantBackedgeTakenCount).

Updating its definition as follows helps the tests frozen_condition, frozen_iv, frozen_inc to have %n backedge-count.
We can simulate a case where the number of loop iteration is %n by properly choosing the freezed values at different iterations.

  /// loop, assuming there are no abnormal exists like exception throws.
+  /// If there are more than one possible backedge-taken counts because its branch
+  /// condition is evaluated from nondeterministic operations like freeze,
+  /// any of possible backedge-taken counts is valid. This assumes that variables
+  /// defined outside the loop are fixed.

In case of the frozen_limit example, the number of iterations is fixed to %freeze (%freeze is defined outside the loop), so it cannot be %n. I think the current CHECK is okay.

This update will support sustaining SCEV's analysis after insertion of freezes, but not very sure whether it is correct.
A possible problematic case is creating a code snippet and inserting it into a loop, based on SCEV's analysis result. Loop unrolling might be the case if it adds increment of the induction variable, because it wouldn't create freezed increment right now.

Interesting question :)

Let's focus on the first example, but with a ule instead:

%iv.inc = add nsw i32 %iv, 1
%becond = icmp ule i32 %iv, %n
%freeze = freeze i1 %becond
br i1 %freeze, label %loop, label %leave

If %n == UINT_MAX, then icmp is always true. If freeze is not there, eventually %iv.inv becomes poison and then we have UB at the branch. So without freeze we can safely bound the number of iterations by %n. With freeze, the loop potentially never terminates (once the IV becomes poison, frozen cond can be forever true).

With ult we don't have this particular problem of the IV becoming poison. But what if the IV initializer is non-constant and poison in the first place? Then the freeze gives a non-terminating loop again (non-deterministically).
Same goes if %n is poison: we can't bound the number of iterations. You would need to push the freeze to %n. Then it's ok.

Essentially we need to push freezes out of loops. A freeze evaluated in the loop body can give a non-deterministic value in each iteration so we can't rely on it for most (any?) analysis.

I agree with the analysis from Nuno, and thus the patch as written is wrong.

Hello all,
I'm seeing this issue after https://reviews.llvm.org/D76483 is merged. LoopStrengthReduce doesn't fire on 49f7513.ll because the induction variable is frozen (LSR successfully optimizes 0019c2f.ll on the other hand).
To resolve this, I'd like to write a transformation suggested by @nlopes; pushing freeze out of a loop, effectively making it as 49f7513.candidate.ll.
Is there an appropriate pass for having this transformation? Or should it be a separate pass?

0019c2f.ll4 KBDownload

49f7513.candidate.ll4 KBDownload

49f7513.ll4 KBDownload

Herald added a subscriber: dantrushin. · View Herald TranscriptApr 1 2020, 6:58 AM

aqjune added a subscriber: sanwou01.Apr 1 2020, 6:58 AM

Revision Contents

Path

Size

llvm/

lib/

Analysis/

ScalarEvolution.cpp

4 lines

test/

Analysis/

ScalarEvolution/

trip-count.ll

95 lines

Diff 230732

llvm/lib/Analysis/ScalarEvolution.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,370 Lines • ▼ Show 20 Lines	if (ConstantInt *CI = dyn_cast<ConstantInt>(ExitCond)) {
if (ExitIfTrue == !CI->getZExtValue())		if (ExitIfTrue == !CI->getZExtValue())
// The backedge is always taken.		// The backedge is always taken.
return getCouldNotCompute();		return getCouldNotCompute();
else		else
// The backedge is never taken.		// The backedge is never taken.
return getZero(CI->getType());		return getZero(CI->getType());
}		}

		if (auto *FI = dyn_cast<FreezeInst>(ExitCond))
		return computeExitLimitFromCondImpl(Cache, L, FI->getOperand(0), ExitIfTrue,
		ControlsExit, AllowPredicates);

// If it's not an integer or pointer comparison then compute it the hard way.		// If it's not an integer or pointer comparison then compute it the hard way.
return computeExitCountExhaustively(L, ExitCond, ExitIfTrue);		return computeExitCountExhaustively(L, ExitCond, ExitIfTrue);
}		}

ScalarEvolution::ExitLimit		ScalarEvolution::ExitLimit
ScalarEvolution::computeExitLimitFromICmp(const Loop *L,		ScalarEvolution::computeExitLimitFromICmp(const Loop *L,
ICmpInst *ExitCond,		ICmpInst *ExitCond,
bool ExitIfTrue,		bool ExitIfTrue,
▲ Show 20 Lines • Show All 5,191 Lines • Show Last 20 Lines

llvm/test/Analysis/ScalarEvolution/trip-count.ll

Show First 20 Lines • Show All 115 Lines • ▼ Show 20 Lines	loop:
%iv.inc = add nsw i32 %iv, 3		%iv.inc = add nsw i32 %iv, 3
call void @may_exit()		call void @may_exit()
%becond = icmp ne i32 %iv.inc, 46		%becond = icmp ne i32 %iv.inc, 46
br i1 %becond, label %loop, label %leave		br i1 %becond, label %loop, label %leave

leave:		leave:
ret void		ret void
}		}

		define void @frozen_condition(i32 %n) {
		; CHECK-LABEL: 'frozen_condition'
		; CHECK-NEXT: Determining loop execution counts for: @frozen_condition
		; CHECK-NEXT: Loop %loop: backedge-taken count is %n
		; CHECK-NEXT: Loop %loop: max backedge-taken count is -1
		; CHECK-NEXT: Loop %loop: Predicated backedge-taken count is %n
		; CHECK-NEXT: Predicates:
		; CHECK: Loop %loop: Trip multiple is 1
		;
		entry:
		br label %loop

		loop:
		%iv = phi i32 [ 0, %entry ], [ %iv.inc, %loop ]
		%iv.inc = add nsw i32 %iv, 1
		call void @may_exit()
		%becond = icmp ult i32 %iv, %n
		%freeze = freeze i1 %becond
		br i1 %freeze, label %loop, label %leave

		leave:
		ret void
		}

		define void @frozen_iv(i32 %n) {
		; CHECK-LABEL: 'frozen_iv'
		; CHECK-NEXT: Determining loop execution counts for: @frozen_iv
		; CHECK-NEXT: Loop %loop: Unpredictable backedge-taken count.
		; CHECK-NEXT: Loop %loop: Unpredictable max backedge-taken count.
		; CHECK-NEXT: Loop %loop: Unpredictable predicated backedge-taken count.
		;
		entry:
		br label %loop

		loop:
		%iv = phi i32 [ 0, %entry ], [ %iv.inc, %loop ]
		%iv.inc = add nsw i32 %iv, 1
		call void @may_exit()
		%freeze = freeze i32 %iv
		%becond = icmp ult i32 %freeze, %n
		br i1 %becond, label %loop, label %leave

		leave:
		ret void
		}

		define void @frozen_inc(i32 %n) {
		; CHECK-LABEL: 'frozen_inc'
		; CHECK-NEXT: Determining loop execution counts for: @frozen_inc
		; CHECK-NEXT: Loop %loop: Unpredictable backedge-taken count.
		; CHECK-NEXT: Loop %loop: Unpredictable max backedge-taken count.
		; CHECK-NEXT: Loop %loop: Unpredictable predicated backedge-taken count.
		;
		entry:
		br label %loop

		loop:
		%iv = phi i32 [ 0, %entry ], [ %freeze, %loop ]
		%iv.inc = add nsw i32 %iv, 1
		%freeze = freeze i32 %iv.inc
		call void @may_exit()
		%becond = icmp ult i32 %iv, %n
		br i1 %becond, label %loop, label %leave

		leave:
		ret void
		}

		define void @frozen_limit(i32 %n) {
		; CHECK-LABEL: 'frozen_limit'
		; CHECK-NEXT: Determining loop execution counts for: @frozen_limit
		; CHECK-NEXT: Loop %loop: backedge-taken count is %freeze
		; CHECK-NEXT: Loop %loop: max backedge-taken count is -1
		; CHECK-NEXT: Loop %loop: Predicated backedge-taken count is %freeze
		; CHECK-NEXT: Predicates:
		; CHECK: Loop %loop: Trip multiple is 1
		;
		entry:
		%freeze = freeze i32 %n
		br label %loop

		loop:
		%iv = phi i32 [ 0, %entry ], [ %iv.inc, %loop ]
		%iv.inc = add nsw i32 %iv, 1
		call void @may_exit()
		%becond = icmp ult i32 %iv, %freeze
		br i1 %becond, label %loop, label %leave

		leave:
		ret void
		}