This is an archive of the discontinued LLVM Phabricator instance.

Differential D19847

Codegen: Don't outline in favor of return blocks
AbandonedPublic

Authored by iteratee on May 2 2016, 6:34 PM.

Download Raw Diff

Details

Reviewers

Summary

When outlining optional branches, an early exit condition will make the
rest of the function optional. If we outline in favor of this exit
block, it can place the return block in the middle of the function. In 3
very different benchmarks, I found that not outlining in favor of the
return block would fix the performance regression. Two of the benchmarks
are in the test-suite:
MultiSource/Applications/lambda-0.1.3/lambda.test
SingleSource/Benchmarks/fib2

lambda exhibits far worse instruction decode cache with outlining
enabled and without this patch.

fib2 exhibits worse branch prediction, which by itself isn't an argument
for this patch, but it is another point in a pattern.

The internal benchmark that slowed down was basically a large function
where incorrect exit placement created additional icache misses

Taken together these three data points, along with the absence of any
obvious regressions in the test-suite with outlining enabled vs this
patch suggest that this is a reasonable heuristic.

Diff Detail

Event Timeline

iteratee updated this revision to Diff 55931.May 2 2016, 6:34 PM

iteratee retitled this revision from to Codegen: Don't outline in favor of return blocks.

iteratee updated this object.

iteratee added a reviewer: djasper.

iteratee set the repository for this revision to rL LLVM.

iteratee added a subscriber: llvm-commits.

Please include a test case.

iteratee abandoned this revision.May 18 2016, 11:20 AM

Revision Contents

Path

Size

lib/

CodeGen/

MachineBlockPlacement.cpp

7 lines

Diff 55931

lib/CodeGen/MachineBlockPlacement.cpp

Show First 20 Lines • Show All 447 Lines • ▼ Show 20 Lines	for (MachineBasicBlock *Succ : Successors) {
if (SuccProbN >= SuccProbD)		if (SuccProbN >= SuccProbD)
SuccProb = BranchProbability::getOne();		SuccProb = BranchProbability::getOne();
else		else
SuccProb = BranchProbability(SuccProbN, SuccProbD);		SuccProb = BranchProbability(SuccProbN, SuccProbD);

// If we outline optional branches, look whether Succ is unavoidable, i.e.		// If we outline optional branches, look whether Succ is unavoidable, i.e.
// dominates all terminators of the MachineFunction. If it does, other		// dominates all terminators of the MachineFunction. If it does, other
// successors must be optional. Don't do this for cold branches.		// successors must be optional. Don't do this for cold branches.
		// Also, return branches seem to behave perversely as well. Don't outline in
		// favor of them either, unless the exit branch is hot.
if (OutlineOptionalBranches && SuccProb > HotProb.getCompl() &&		if (OutlineOptionalBranches && SuccProb > HotProb.getCompl() &&
UnavoidableBlocks.count(Succ) > 0) {		UnavoidableBlocks.count(Succ) > 0 &&
		(!Succ->isReturnBlock() \|\| SuccProb > HotProb)) {
auto HasShortOptionalBranch = [&]() {		auto HasShortOptionalBranch = [&]() {
for (MachineBasicBlock *Pred : Succ->predecessors()) {		for (MachineBasicBlock *Pred : Succ->predecessors()) {
// Check whether there is an unplaced optional branch.		// Check whether there is an unplaced optional branch.
if (Pred == Succ \|\| (BlockFilter && !BlockFilter->count(Pred)) \|\|		if (Pred == Succ \|\| (BlockFilter && !BlockFilter->count(Pred)) \|\|
BlockToChain[Pred] == &Chain)		BlockToChain[Pred] == &Chain)
continue;		continue;
// Check whether the optional branch has exactly one BB.		// Check whether the optional branch has exactly one BB.
if (Pred->pred_size() > 1 \|\| *Pred->pred_begin() != BB)		if (Pred->pred_size() > 1 \|\| *Pred->pred_begin() != BB)
▲ Show 20 Lines • Show All 201 Lines • ▼ Show 20 Lines	for (;;) {
assert(BB);		assert(BB);
assert(BlockToChain[BB] == &Chain);		assert(BlockToChain[BB] == &Chain);
assert(*std::prev(Chain.end()) == BB);		assert(*std::prev(Chain.end()) == BB);

// Look for the best viable successor if there is one to place immediately		// Look for the best viable successor if there is one to place immediately
// after this block.		// after this block.
MachineBasicBlock *BestSucc = selectBestSuccessor(BB, Chain, BlockFilter);		MachineBasicBlock *BestSucc = selectBestSuccessor(BB, Chain, BlockFilter);

// If an immediate successor isn't available, look for the best viable		// If an immediate successor isn't available, look for the best viable
// block among those we've identified as not violating the loop's CFG at		// block among those we've identified as not violating the loop's CFG at
// this point. This won't be a fallthrough, but it will increase locality.		// this point. This won't be a fallthrough, but it will increase locality.
if (!BestSucc)		if (!BestSucc)
BestSucc = selectBestCandidateBlock(Chain, BlockWorkList);		BestSucc = selectBestCandidateBlock(Chain, BlockWorkList);
if (!BestSucc)		if (!BestSucc)
BestSucc = selectBestCandidateBlock(Chain, EHPadWorkList);		BestSucc = selectBestCandidateBlock(Chain, EHPadWorkList);

if (!BestSucc) {		if (!BestSucc) {
▲ Show 20 Lines • Show All 852 Lines • Show Last 20 Lines