Diff 65842

lib/CodeGen/MachineBlockPlacement.cpp

Show First 20 Lines • Show All 625 Lines • ▼ Show 20 Lines	bool MachineBlockPlacement::hasBetterLayoutPredecessor(
// means the cost of topological order is greater.		// means the cost of topological order is greater.
// When profile data is not available, however, we need to be more		// When profile data is not available, however, we need to be more
// conservative. If the branch prediction is wrong, breaking the topo-order		// conservative. If the branch prediction is wrong, breaking the topo-order
// will actually yield a layout with large cost. For this reason, we need		// will actually yield a layout with large cost. For this reason, we need
// strong biased branch at block S with Prob(S->BB) in order to select		// strong biased branch at block S with Prob(S->BB) in order to select
// BB->Succ. This is equivalent to looking the CFG backward with backward		// BB->Succ. This is equivalent to looking the CFG backward with backward
// edge: Prob(Succ->BB) needs to >= HotProb in order to be selected (without		// edge: Prob(Succ->BB) needs to >= HotProb in order to be selected (without
// profile data).		// profile data).
		// --------------------------------------------------------------------------
		// Case 3: forked diamond
		// S
		// / \
		// / \
		// BB Pred
		davidxlUnsubmitted Done Reply Inline Actions Nit: can you make the art work like the following to not split S2 into two 'blocks': // // Head (Or Entry, Top) // / \ // / \ // BB Pred // / \ / \| // \| S1 \| // \ / // S2 // davidxl: Nit: can you make the art work like the following to not split S2 into two 'blocks': // //…
		// / \ / \
		// S2 S1 S2
		//
		// The current block is BB and edge BB->S1 is now being evaluated.
		davidxlUnsubmitted Done Reply Inline Actions Nit: can you make the art work like the following to not split S2 into two 'blocks': // // Head (Or Entry, Top) // / \ // / \ // BB Pred // / \ / \| // \| S1 \| // \ / // S2 // davidxl: Nit: can you make the art work like the following to not split S2 into two 'blocks': // //…
		// As above S->BB was already selected because
		// prob(S->BB) > prob(S->Pred). Assume that prob(BB->S1) >= prob(BB->S2).
		//
		// topo-order:
		//
		// S-------\| ---S
		// \| \| \| \|
		// ---BB \| \| BB
		// \| \| \| \|
		// \| Pred----\| \| S1----
		// \| \| \| \|
		// --(S1 or S2) ---Pred--
		//
		// topo-cost = freq(S->Pred) + freq(BB->S1) + freq(BB->S2)
		// + min(freq(Pred->S1), freq(Pred->S2))
		// Non-topo-order cost:
		// In the worst case, S2 will not get layed out after Pred.
		davidxlUnsubmitted Done Reply Inline Actions layed out --> laid out davidxl: layed out --> laid out
		// non-topo-cost = 2 * freq(S->Pred) + freq(BB->S2).
		davidxlUnsubmitted Done Reply Inline Actions Another way to explain in terms of savings instead of cost: the savings is the total freq of the fall through edges. In topo case, the savings is freq(S->BB) + max(freq(Pred->S1), freq(Pred->S2). (1) For non-top case, the saving is: freq(S->BB) + freq(BB->S1) + freq(Pred->S2) (2) When freq(Pred->S2) > freq(Pred->S1), (2) is strictly larger than (1). In the opposite case, the check below will also lead to (2) > (1) davidxl: Another way to explain in terms of savings instead of cost: the savings is the total freq of…
		iterateeAuthorUnsubmitted Not Done Reply Inline Actions I think I'll stick with cost, as that's how the other 2 cases are explained. iteratee: I think I'll stick with cost, as that's how the other 2 cases are explained.
		// To be conservative, we can assume that min(freq(Pred->S1), freq(Pred->S2))
		// is 0. Then the non topo layout is better when
		// freq(S->Pred) < freq(BB->S1).
		// This is exactly what is checked below.
BranchProbability HotProb = getLayoutSuccessorProbThreshold(BB);		BranchProbability HotProb = getLayoutSuccessorProbThreshold(BB);

// Forward checking. For case 2, SuccProb will be 1.
if (SuccProb < HotProb) {
DEBUG(dbgs() << " Not a candidate: " << getBlockName(Succ) << " "
<< "Respecting topological ordering because "
<< "probability is less than prob treshold: "
<< SuccProb << "\n");
return true;
}

// Make sure that a hot successor doesn't have a globally more		// Make sure that a hot successor doesn't have a globally more
// important predecessor.		// important predecessor.
BlockFrequency CandidateEdgeFreq = MBFI->getBlockFreq(BB) * RealSuccProb;		BlockFrequency CandidateEdgeFreq = MBFI->getBlockFreq(BB) * RealSuccProb;
bool BadCFGConflict = false;		bool BadCFGConflict = false;

for (MachineBasicBlock *Pred : Succ->predecessors()) {		for (MachineBasicBlock *Pred : Succ->predecessors()) {
if (Pred == Succ \|\| BlockToChain[Pred] == &SuccChain \|\|		if (Pred == Succ \|\| BlockToChain[Pred] == &SuccChain \|\|
(BlockFilter && !BlockFilter->count(Pred)) \|\|		(BlockFilter && !BlockFilter->count(Pred)) \|\|
BlockToChain[Pred] == &Chain)		BlockToChain[Pred] == &Chain)
continue;		continue;
// Do backward checking. For case 1, it is actually redundant check. For		// Do backward checking.
// case 2 above, we need a backward checking to filter out edges that are		// For case 2 above, we need a backward checking to filter out edges that
		davidxlUnsubmitted Done Reply Inline Actions For case 1 and 2 davidxl: For case 1 and 2
// not 'strongly' biased. With profile data available, the check is mostly		// are not 'strongly' biased. With profile data available, the check is
// redundant too (when threshold prob is set at 50%) unless S has more than		// mostly redundant (when threshold prob is set at 50%) unless S has more
// two successors.		// than two successors.
		// For case 3 above, this test is essential, even with profiling data.
		davidxlUnsubmitted Done Reply Inline Actions This comment does not fit here. With profile data, such check won't be skipped, but just does not need to be as biased. davidxl: This comment does not fit here. With profile data, such check won't be skipped, but just does…
// BB Pred		// BB Pred
// \ /		// \ /
// Succ		// Succ
// We select edge BB->Succ if		// We select edge BB->Succ if
// freq(BB->Succ) > freq(Succ) * HotProb		// freq(BB->Succ) > freq(Succ) * HotProb
// i.e. freq(BB->Succ) > freq(BB->Succ) * HotProb + freq(Pred->Succ) *		// i.e. freq(BB->Succ) > freq(BB->Succ) * HotProb + freq(Pred->Succ) *
// HotProb		// HotProb
// i.e. freq((BB->Succ) * (1 - HotProb) > freq(Pred->Succ) * HotProb		// i.e. freq((BB->Succ) * (1 - HotProb) > freq(Pred->Succ) * HotProb
		// case 1 is covered too, because the first equation reduces to:
		// prob(BB->Succ) > HotProb. (freq(Succ) = freq(BB) for a triangle)
BlockFrequency PredEdgeFreq =		BlockFrequency PredEdgeFreq =
MBFI->getBlockFreq(Pred) * MBPI->getEdgeProbability(Pred, Succ);		MBFI->getBlockFreq(Pred) * MBPI->getEdgeProbability(Pred, Succ);
if (PredEdgeFreq * HotProb >= CandidateEdgeFreq * HotProb.getCompl()) {		if (PredEdgeFreq * HotProb >= CandidateEdgeFreq * HotProb.getCompl()) {
BadCFGConflict = true;		BadCFGConflict = true;
break;		break;
}		}
}		}

▲ Show 20 Lines • Show All 1,140 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Codegen: MachineBlockPlacement Improve probability layout.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 65842

lib/CodeGen/MachineBlockPlacement.cpp

This is an archive of the discontinued LLVM Phabricator instance.

Codegen: MachineBlockPlacement Improve probability layout.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 65842

lib/CodeGen/MachineBlockPlacement.cpp

Codegen: MachineBlockPlacement Improve probability layout.
ClosedPublic