Download Raw Diff

Details

Reviewers

chandlerc
sanjoy
vsk
junbuml
reames

Commits

rG63c9c8115223: [BPI] Ignore remainder while distributing the remaining probability from…
rL302883: [BPI] Ignore remainder while distributing the remaining probability from…

Summary

This is a follow up patch for https://reviews.llvm.org/rL300440
to address a comment.

To make implementation to be consistent with other cases we just
ignore the tail after distribution of remaining probability between
reachable edges.

If we reduced the probability of some edges coming to unreachable
blocks we should distribute the remaining part across other edges
coming to reachable blocks to satisfy the condition that sum of all
probabilities should be equal to one. If this remaining part is not
divided by number of "reachable" edges then we get this tail.
This tail probability should be pretty small. Other cases just ignore
if the some of probabilities is not equal to one so we do the same.

Diff Detail

Repository: rL LLVM

Event Timeline

Please review.

skatkov added reviewers: junbuml, vsk, sanjoy.Apr 18 2017, 8:58 PM

Thanks for the cleanup.

One note is that it would help to explicitly explain what your patch changes, and why, in your description (the title would be a very condensed version of this). This would make your patch easier to review.

lib/Analysis/BranchProbabilityInfo.cpp
334 ↗	(On Diff #95423)	IIUC, the "tail" is supposed to be some small, non-negative epsilon value. Maybe it's worth asserting this? E.g assert(tail >= 0 && tail < 1).

skatkov updated this revision to Diff 95893.Apr 19 2017, 11:35 PM

skatkov retitled this revision from [BPI] Follow up rL300440. NFC to [BPI] Move tail computation out of the loop. NFC.

skatkov edited the summary of this revision. (Show Details)

skatkov marked an inline comment as done.

Thanks for the updated patch description! Another minor request here.

include/llvm/Support/BranchProbability.h
115–121 ↗	(On Diff #95893)	Add a unittest? I know that much of BranchProbability isn't well covered by unittests, but we should really have good coverage for basic primitives like this and you can start covering the mutation operators.

Sure, will do.

Some test added.

skatkov marked an inline comment as done.Apr 20 2017, 4:29 AM

BTW, probably we should not add the tail to any edge at all... It might be that the information that two edges have exactly the same probability is mot important than the requirement that the sum of all probabilities should be 1. it seems that sum == 1 is not a strict requirement. For example, if we have three edges with the same weight then as a result the sum of probabilities will not be equal to one. What do you think?

In D32124#731921, @skatkov wrote:

Some test added.

The tests and added operators are awesome.

If you want, split that off into a separate change just to the BranchProbability stuff and go ahead and submit that?

In D32124#732170, @skatkov wrote:

BTW, probably we should not add the tail to any edge at all... It might be that the information that two edges have exactly the same probability is mot important than the requirement that the sum of all probabilities should be 1. it seems that sum == 1 is not a strict requirement. For example, if we have three edges with the same weight then as a result the sum of probabilities will not be equal to one. What do you think?

I think we already handle this kind of case in a few places in BPI and we should be consistent there. I can go digging for it but you may already know where it is...

skatkov updated this revision to Diff 96090.Apr 20 2017, 9:00 PM

In D32124#733015, @chandlerc wrote:

In D32124#732170, @skatkov wrote:

BTW, probably we should not add the tail to any edge at all... It might be that the information that two edges have exactly the same probability is mot important than the requirement that the sum of all probabilities should be 1. it seems that sum == 1 is not a strict requirement. For example, if we have three edges with the same weight then as a result the sum of probabilities will not be equal to one. What do you think?

I think we already handle this kind of case in a few places in BPI and we should be consistent there. I can go digging for it but you may already know where it is...

Hi Chandler, I'm not sure what exactly you mean but take a look into the last test @test_unreachable_with_switch_prof4 in test/Analysis/BranchProbabilityInfo/basic.ll:
For the five unreachable edges with the same weight we generate the following probability
; CHECK: edge entry -> case_a probability is 0x1999999a / 0x80000000 = 20.00%
; CHECK: edge entry -> case_b probability is 0x1999999a / 0x80000000 = 20.00%
; CHECK: edge entry -> case_c probability is 0x1999999a / 0x80000000 = 20.00%
; CHECK: edge entry -> case_d probability is 0x1999999a / 0x80000000 = 20.00%
; CHECK: edge entry -> case_e probability is 0x1999999a / 0x80000000 = 20.00%

and 0x1999999a * 5 = 0x80000002 > 0x80000000, so the sum of probabilities is greater than 1.

After reading of the code a bit more I would say that in general no one worries about sum of probabilities is equal to 1. They follow it when operating on weights. But the trouble is that after weights are transformed to probabilities there are rounding errors which breaks the restriction.

Please review.

Ping, could please review this change.

ping

LGTM w/comment addressed.

lib/Analysis/BranchProbabilityInfo.cpp
333 ↗	(On Diff #97406)	I would suggest leaving the decrement in the loop and asserting after the loop that the remainder is less than the number of edges. A comment stating that we explicitly drop the remainder to be consistent with other code seems called for as well.

This revision is now accepted and ready to land.May 10 2017, 9:40 PM

reames added inline comments.May 10 2017, 9:43 PM

lib/Analysis/BranchProbabilityInfo.cpp
333 ↗	(On Diff #97406)	Actually, don't bother with this. The assert would follow trivially from the code just above it, that's fine. Thus, LGTM w/out comments. :)

Closed by commit rL302883: [BPI] Ignore remainder while distributing the remaining probability from… (authored by skatkov). · Explain WhyMay 12 2017, 1:03 AM

This revision was automatically updated to reflect the committed changes.

Diff 98733

llvm/trunk/lib/Analysis/BranchProbabilityInfo.cpp

Show First 20 Lines • Show All 295 Lines • ▼ Show 20 Lines	bool BranchProbabilityInfo::calcMetadataWeights(const BasicBlock *BB) {

if (ScalingFactor > 1) {		if (ScalingFactor > 1) {
WeightSum = 0;		WeightSum = 0;
for (unsigned i = 0, e = TI->getNumSuccessors(); i != e; ++i) {		for (unsigned i = 0, e = TI->getNumSuccessors(); i != e; ++i) {
Weights[i] /= ScalingFactor;		Weights[i] /= ScalingFactor;
WeightSum += Weights[i];		WeightSum += Weights[i];
}		}
}		}
		assert(WeightSum <= UINT32_MAX &&
		"Expected weights to scale down to 32 bits");

if (WeightSum == 0 \|\| ReachableIdxs.size() == 0) {		if (WeightSum == 0 \|\| ReachableIdxs.size() == 0) {
for (unsigned i = 0, e = TI->getNumSuccessors(); i != e; ++i)		for (unsigned i = 0, e = TI->getNumSuccessors(); i != e; ++i)
Weights[i] = 1;		Weights[i] = 1;
WeightSum = TI->getNumSuccessors();		WeightSum = TI->getNumSuccessors();
}		}

// Set the probability.		// Set the probability.
Show All 11 Lines	for (auto i : UnreachableIdxs)
ToDistribute += BP[i] - UnreachableProb;		ToDistribute += BP[i] - UnreachableProb;
BP[i] = UnreachableProb;		BP[i] = UnreachableProb;
}		}

// If we modified the probability of some edges then we must distribute		// If we modified the probability of some edges then we must distribute
// the difference between reachable blocks.		// the difference between reachable blocks.
if (ToDistribute > BranchProbability::getZero()) {		if (ToDistribute > BranchProbability::getZero()) {
BranchProbability PerEdge = ToDistribute / ReachableIdxs.size();		BranchProbability PerEdge = ToDistribute / ReachableIdxs.size();
for (auto i : ReachableIdxs) {		for (auto i : ReachableIdxs)
BP[i] += PerEdge;		BP[i] += PerEdge;
ToDistribute -= PerEdge;
}
// Tail goes to the first reachable edge.
BP[ReachableIdxs[0]] += ToDistribute;
}		}
}		}

for (unsigned i = 0, e = TI->getNumSuccessors(); i != e; ++i)		for (unsigned i = 0, e = TI->getNumSuccessors(); i != e; ++i)
setEdgeProbability(BB, i, BP[i]);		setEdgeProbability(BB, i, BP[i]);

assert(WeightSum <= UINT32_MAX &&
"Expected weights to scale down to 32 bits");

return true;		return true;
}		}

/// \brief Calculate edge weights for edges leading to cold blocks.		/// \brief Calculate edge weights for edges leading to cold blocks.
///		///
/// A cold block is one post-dominated by a block with a call to a		/// A cold block is one post-dominated by a block with a call to a
/// cold function. Those edges are unlikely to be taken, so we give		/// cold function. Those edges are unlikely to be taken, so we give
/// them relatively low weight.		/// them relatively low weight.
▲ Show 20 Lines • Show All 464 Lines • Show Last 20 Lines

llvm/trunk/test/Analysis/BranchProbabilityInfo/basic.ll

	Show First 20 Lines • Show All 446 Lines • ▼ Show 20 Lines
	define i32 @test_unreachable_with_switch_prof1(i32 %i, i32 %a, i32 %b, i32 %c, i32 %d, i32 %e) {			define i32 @test_unreachable_with_switch_prof1(i32 %i, i32 %a, i32 %b, i32 %c, i32 %d, i32 %e) {
	; CHECK: Printing analysis {{.*}} for function 'test_unreachable_with_switch_prof1'			; CHECK: Printing analysis {{.*}} for function 'test_unreachable_with_switch_prof1'
	entry:			entry:
	switch i32 %i, label %case_a [ i32 1, label %case_b			switch i32 %i, label %case_a [ i32 1, label %case_b
	i32 2, label %case_c			i32 2, label %case_c
	i32 3, label %case_d			i32 3, label %case_d
	i32 4, label %case_e ], !prof !8			i32 4, label %case_e ], !prof !8
	; CHECK: edge entry -> case_a probability is 0x00000800 / 0x80000000 = 0.00%			; CHECK: edge entry -> case_a probability is 0x00000800 / 0x80000000 = 0.00%
	; CHECK: edge entry -> case_b probability is 0x07fffe01 / 0x80000000 = 6.25%			; CHECK: edge entry -> case_b probability is 0x07fffdff / 0x80000000 = 6.25%
	; CHECK: edge entry -> case_c probability is 0x67fffdff / 0x80000000 = 81.25% [HOT edge]			; CHECK: edge entry -> case_c probability is 0x67fffdff / 0x80000000 = 81.25% [HOT edge]
	; CHECK: edge entry -> case_d probability is 0x07fffdff / 0x80000000 = 6.25%			; CHECK: edge entry -> case_d probability is 0x07fffdff / 0x80000000 = 6.25%
	; CHECK: edge entry -> case_e probability is 0x07fffdff / 0x80000000 = 6.25%			; CHECK: edge entry -> case_e probability is 0x07fffdff / 0x80000000 = 6.25%

	case_a:			case_a:
	unreachable			unreachable

	case_b:			case_b:
	Show All 26 Lines
	; CHECK: Printing analysis {{.*}} for function 'test_unreachable_with_switch_prof2'			; CHECK: Printing analysis {{.*}} for function 'test_unreachable_with_switch_prof2'
	entry:			entry:
	switch i32 %i, label %case_a [ i32 1, label %case_b			switch i32 %i, label %case_a [ i32 1, label %case_b
	i32 2, label %case_c			i32 2, label %case_c
	i32 3, label %case_d			i32 3, label %case_d
	i32 4, label %case_e ], !prof !9			i32 4, label %case_e ], !prof !9
	; CHECK: edge entry -> case_a probability is 0x00000400 / 0x80000000 = 0.00%			; CHECK: edge entry -> case_a probability is 0x00000400 / 0x80000000 = 0.00%
	; CHECK: edge entry -> case_b probability is 0x00000400 / 0x80000000 = 0.00%			; CHECK: edge entry -> case_b probability is 0x00000400 / 0x80000000 = 0.00%
	; CHECK: edge entry -> case_c probability is 0x6aaaa800 / 0x80000000 = 83.33% [HOT edge]			; CHECK: edge entry -> case_c probability is 0x6aaaa7ff / 0x80000000 = 83.33% [HOT edge]
	; CHECK: edge entry -> case_d probability is 0x0aaaa7ff / 0x80000000 = 8.33%			; CHECK: edge entry -> case_d probability is 0x0aaaa7ff / 0x80000000 = 8.33%
	; CHECK: edge entry -> case_e probability is 0x0aaaa7ff / 0x80000000 = 8.33%			; CHECK: edge entry -> case_e probability is 0x0aaaa7ff / 0x80000000 = 8.33%

	case_a:			case_a:
	unreachable			unreachable

	case_b:			case_b:
	unreachable			unreachable
	Show All 23 Lines
	; CHECK: Printing analysis {{.*}} for function 'test_unreachable_with_switch_prof3'			; CHECK: Printing analysis {{.*}} for function 'test_unreachable_with_switch_prof3'
	entry:			entry:
	switch i32 %i, label %case_a [ i32 1, label %case_b			switch i32 %i, label %case_a [ i32 1, label %case_b
	i32 2, label %case_c			i32 2, label %case_c
	i32 3, label %case_d			i32 3, label %case_d
	i32 4, label %case_e ], !prof !10			i32 4, label %case_e ], !prof !10
	; CHECK: edge entry -> case_a probability is 0x00000000 / 0x80000000 = 0.00%			; CHECK: edge entry -> case_a probability is 0x00000000 / 0x80000000 = 0.00%
	; CHECK: edge entry -> case_b probability is 0x00000400 / 0x80000000 = 0.00%			; CHECK: edge entry -> case_b probability is 0x00000400 / 0x80000000 = 0.00%
	; CHECK: edge entry -> case_c probability is 0x6e08fa2e / 0x80000000 = 85.96% [HOT edge]			; CHECK: edge entry -> case_c probability is 0x6e08fa2d / 0x80000000 = 85.96% [HOT edge]
	; CHECK: edge entry -> case_d probability is 0x08fb80e9 / 0x80000000 = 7.02%			; CHECK: edge entry -> case_d probability is 0x08fb80e9 / 0x80000000 = 7.02%
	; CHECK: edge entry -> case_e probability is 0x08fb80e9 / 0x80000000 = 7.02%			; CHECK: edge entry -> case_e probability is 0x08fb80e9 / 0x80000000 = 7.02%

	case_a:			case_a:
	unreachable			unreachable

	case_b:			case_b:
	unreachable			unreachable
	▲ Show 20 Lines • Show All 53 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[BPI] Ignore tail while distributing the remaining probability from unreachanble
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 98733

llvm/trunk/lib/Analysis/BranchProbabilityInfo.cpp

llvm/trunk/test/Analysis/BranchProbabilityInfo/basic.ll

This is an archive of the discontinued LLVM Phabricator instance.

[BPI] Ignore tail while distributing the remaining probability from unreachanbleClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 98733

llvm/trunk/lib/Analysis/BranchProbabilityInfo.cpp

llvm/trunk/test/Analysis/BranchProbabilityInfo/basic.ll

[BPI] Ignore tail while distributing the remaining probability from unreachanble
ClosedPublic