This is an archive of the discontinued LLVM Phabricator instance.

[SCEV] If max BTC is zero, then so is the exact BTC
ClosedPublic

Authored by reames on Aug 30 2021, 9:16 AM.

Download Raw Diff

Details

Reviewers

nikic
fhahn
efriedma

Commits

rG6600e1759be1: [SCEV] If max BTC is zero, then so is the exact BTC [1 of N]

Summary

The subtle bit is explaining why the two codepaths have a difference while both are correct. The test case with modifications is a good example, so let's discuss in terms of it.

The previous exact bounds for this example of (-126 + (126 smax %n))<nsw> can evaluate to either 0 or 1. Both are "correct" results, but only one of them results in a well defined loop. If %n were 127 (the only possible value producing a trip count of 1), then the loop must execute undefined behavior. As a result, we can ignore the TC computed when %n is 127. All other values produce 0.
The max taken count computation uses the limit (i.e. the maximum value END can be without resulting in UB) to restrict the bound computation. As a result, it returns 0 which is also correct.

WARNING: The logic above only holds for a single exit loop. The current logic for max trip count would be incorrect for multiple exit loops, except that we never call computeMaxBECountForLT except when we can prove either a) no overflow occurs in this IV before exit, or b) this is the sole exit.

An alternate approach here would be to add the limit logic to the symbolic path. I haven't played with this extensively, but I'm hesitant because a) the term is optional and b) I'm not sure it'll reliably simplify away. As such, the resulting code quality from expansion might actually get worse.

This was noticed while trying to figure out why D108848 wasn't NFC, but is otherwise standalone.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

reames created this revision.Aug 30 2021, 9:16 AM

Herald added subscribers: bollu, hiraditya, mcrosier. · View Herald TranscriptAug 30 2021, 9:16 AM

reames requested review of this revision.Aug 30 2021, 9:16 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 30 2021, 9:16 AM

reames added a child revision: D108848: [LoopDeletion] Separate logic in breakBackedgeIfNotTaken using symboic max trip count [nfc].Aug 30 2021, 9:16 AM

reames mentioned this in D108848: [LoopDeletion] Separate logic in breakBackedgeIfNotTaken using symboic max trip count [nfc].

FYI, fixed the description. The current max trip code is correct, but only due to a subtle invariant I keep forgetting. I've now made exactly this same mistake at least 3 times...

In D108921#2972436, @reames wrote:

FYI, fixed the description. The current max trip code is correct, but only due to a subtle invariant I keep forgetting. I've now made exactly this same mistake at least 3 times...

So that there isn't a fourth time, I added a clarifying comment and test in 301fbf9b.

Harbormaster completed remote builds in B121753: Diff 369462.Aug 30 2021, 10:18 AM

This revision is now accepted and ready to land.Aug 30 2021, 2:20 PM

Closed by commit rG6600e1759be1: [SCEV] If max BTC is zero, then so is the exact BTC [1 of N] (authored by reames). · Explain WhyAug 31 2021, 8:50 AM

This revision was automatically updated to reflect the committed changes.

reames added a commit: rG6600e1759be1: [SCEV] If max BTC is zero, then so is the exact BTC [1 of N].

reames mentioned this in D109015: [SCEV] If max BTC is zero, then so is the exact BTC [2 of 2].Aug 31 2021, 12:09 PM

reames mentioned this in rG29fa37ec9fce: [SCEV] If max BTC is zero, then so is the exact BTC [2 of 2].Sep 1 2021, 11:51 AM

Revision Contents

Path

Size

llvm/

lib/

Analysis/

ScalarEvolution.cpp

4 lines

test/

Analysis/

ScalarEvolution/

max-trip-count.ll

2 lines

Diff 369714

llvm/lib/Analysis/ScalarEvolution.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 11,933 Lines • ▼ Show 20 Lines	if (isa<SCEVConstant>(BECount)) {
// If we know exactly how many times the backedge will be taken if it's		// If we know exactly how many times the backedge will be taken if it's
// taken at least once, then the backedge count will either be that or		// taken at least once, then the backedge count will either be that or
// zero.		// zero.
MaxBECount = BECountIfBackedgeTaken;		MaxBECount = BECountIfBackedgeTaken;
MaxOrZero = true;		MaxOrZero = true;
} else {		} else {
MaxBECount = computeMaxBECountForLT(		MaxBECount = computeMaxBECountForLT(
Start, Stride, RHS, getTypeSizeInBits(LHS->getType()), IsSigned);		Start, Stride, RHS, getTypeSizeInBits(LHS->getType()), IsSigned);
		// If we prove the max count is zero, so is the symbolic bound. This can
		// happen due to differences in how we reason about bounds impied by UB.
		if (MaxBECount->isZero())
		BECount = MaxBECount;
}		}

if (isa<SCEVCouldNotCompute>(MaxBECount) &&		if (isa<SCEVCouldNotCompute>(MaxBECount) &&
!isa<SCEVCouldNotCompute>(BECount))		!isa<SCEVCouldNotCompute>(BECount))
MaxBECount = getConstant(getUnsignedRangeMax(BECount));		MaxBECount = getConstant(getUnsignedRangeMax(BECount));

return ExitLimit(BECount, MaxBECount, MaxOrZero, Predicates);		return ExitLimit(BECount, MaxBECount, MaxOrZero, Predicates);
}		}
▲ Show 20 Lines • Show All 2,144 Lines • Show Last 20 Lines

llvm/test/Analysis/ScalarEvolution/max-trip-count.ll

Show First 20 Lines • Show All 452 Lines • ▼ Show 20 Lines	loop:
br i1 %cmp, label %loop, label %loop.exit		br i1 %cmp, label %loop, label %loop.exit

loop.exit:		loop.exit:
ret void		ret void
}		}

define void @max_overflow_se(i8 %n) mustprogress {		define void @max_overflow_se(i8 %n) mustprogress {
; CHECK-LABEL: Determining loop execution counts for: @max_overflow_se		; CHECK-LABEL: Determining loop execution counts for: @max_overflow_se
; CHECK: Loop %loop: backedge-taken count is (-126 + (126 smax %n))<nsw>		; CHECK: Loop %loop: backedge-taken count is 0
; CHECK: Loop %loop: max backedge-taken count is 0		; CHECK: Loop %loop: max backedge-taken count is 0
entry:		entry:
br label %loop		br label %loop

loop:		loop:
%i = phi i8 [ 63, %entry ], [ %i.next, %loop ]		%i = phi i8 [ 63, %entry ], [ %i.next, %loop ]
%i.next = add nsw i8 %i, 63		%i.next = add nsw i8 %i, 63
%t = icmp slt i8 %i.next, %n		%t = icmp slt i8 %i.next, %n
▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines