This is an archive of the discontinued LLVM Phabricator instance.

Adjust the hotness threshold from 99.9% to 99%.
ClosedPublic

Authored by danielcdh on Aug 4 2017, 8:41 AM.


Details

Reviewers

davidxl
tejohnson
eraman

Commits

rG63799512b223: Adjust the hotness threshold from 99.9% to 99%.
rL310065: Adjust the hotness threshold from 99.9% to 99%.

Summary

We originally set the hotness threshold to 99.9% to be consistent with gcc FDO. However, the inline heuristics differ between the two compilers: LLVM uses a bottom-up algorithm while gcc uses a priority-based one. The LLVM algorithm tends to inline too much early on, which prevents hot callsites from being further inlined into their callers. Given this restriction, we think it is reasonable to lower the hotness threshold to give priority to the callsites that are really hot. Our experiments show that this change improves performance on large applications. Note that the inline heuristic has great room for further tuning. Once the inline heuristics are refined, we could adjust this threshold to allow inlining of less hot callsites.

Diff Detail

Build Status

Buildable 9031
Build 9031: arc lint + arc unit

Event Timeline

danielcdh created this revision. Aug 4 2017, 8:41 AM

Herald added a subscriber: sanjoy. Aug 4 2017, 8:41 AM

LGTM

This revision is now accepted and ready to land. Aug 4 2017, 8:44 AM

danielcdh closed this revision. Aug 4 2017, 9:21 AM

Bottom-up inlining can also create lots of cold inline instances. Besides blocking hotter callers from being inlined, the current Machine Block Layout also has problems forming long hot traces, which leaves holes in the code layout.

I wonder whether another way to fix this would be better: 1) compute the working set size at the 99.9% cutoff; 2) if it is too large compared with the working set threshold, drop the hot cutoff.
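The two-step alternative suggested above could be sketched roughly as follows. This is only an illustration, not code from the patch or from LLVM: `workingSetSize`, `chooseHotCutoff`, and `maxWorkingSet` are hypothetical names, the working set is assumed to be measured as a number of counters, and "drop the hot cutoff" is read here as falling back from 99.9% to 99%.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <vector>

// Hypothetical helper (not an LLVM API): number of counters, taken in
// descending order, needed to cover `cutoff` of the total count.
// `cutoff` is a percentile scaled by 10000, so 999000 means 99.9%.
// Assumes total * cutoff fits in 64 bits.
std::size_t workingSetSize(std::vector<uint64_t> counts, uint64_t cutoff) {
  const uint64_t scale = 1000000;
  assert(cutoff <= scale);
  std::sort(counts.begin(), counts.end(), std::greater<uint64_t>());
  uint64_t total = 0;
  for (uint64_t c : counts) total += c;
  const uint64_t desired = total * cutoff / scale;
  uint64_t running = 0;
  std::size_t n = 0;
  while (n < counts.size() && running < desired) running += counts[n++];
  return n;
}

// Hypothetical policy: keep the 99.9% cutoff unless its working set is
// larger than `maxWorkingSet`, in which case fall back to 99%.
uint64_t chooseHotCutoff(const std::vector<uint64_t>& counts,
                         std::size_t maxWorkingSet) {
  const uint64_t defaultCutoff = 999000;  // 99.9%
  const uint64_t reducedCutoff = 990000;  // 99%
  return workingSetSize(counts, defaultCutoff) > maxWorkingSet
             ? reducedCutoff
             : defaultCutoff;
}
```

With counts {900, 90, 9, 1}, the 99.9% working set is three counters, so a limit of two selects the reduced cutoff while a limit of three keeps the default.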

Revision Contents

Path: lib/Analysis/ProfileSummaryInfo.cpp
Size: 2 lines

Diff 109752

lib/Analysis/ProfileSummaryInfo.cpp

 // considered hot/cold. These two parameters are percentile values (multiplied
 // by 10000). If the counts are sorted in descending order, the minimum count to
 // reach ProfileSummaryCutoffHot gives the threshold to determine a hot count.
 // Similarly, the minimum count to reach ProfileSummaryCutoffCold gives the
 // threshold for determining cold count (everything <= this threshold is
 // considered cold).

 static cl::opt<int> ProfileSummaryCutoffHot(
-    "profile-summary-cutoff-hot", cl::Hidden, cl::init(999000), cl::ZeroOrMore,
+    "profile-summary-cutoff-hot", cl::Hidden, cl::init(990000), cl::ZeroOrMore,
     cl::desc("A count is hot if it exceeds the minimum count to"
              " reach this percentile of total counts."));

 static cl::opt<int> ProfileSummaryCutoffCold(
     "profile-summary-cutoff-cold", cl::Hidden, cl::init(999999), cl::ZeroOrMore,
     cl::desc("A count is cold if it is below the minimum count"
              " to reach this percentile of total counts."));

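The comment in the diff describes how a cutoff becomes a count threshold: sort the counts in descending order and take the minimum count needed to reach the cutoff percentile of the total. A minimal sketch of that rule follows, to show the effect of moving from 999000 to 990000. The function name `minCountForCutoff` is hypothetical and this is not the code in ProfileSummaryInfo.cpp; it also assumes that total * cutoff fits in 64 bits.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <functional>
#include <vector>

// Hypothetical helper (not an LLVM API): returns the smallest count that has
// to be included, visiting counts in descending order, for the running sum to
// reach `cutoff` of the total. `cutoff` is a percentile scaled by 10000,
// so 990000 means 99% and 999000 means 99.9%.
uint64_t minCountForCutoff(std::vector<uint64_t> counts, uint64_t cutoff) {
  const uint64_t scale = 1000000;
  assert(cutoff <= scale);
  std::sort(counts.begin(), counts.end(), std::greater<uint64_t>());
  uint64_t total = 0;
  for (uint64_t c : counts) total += c;
  const uint64_t desired = total * cutoff / scale;
  uint64_t running = 0;
  uint64_t last = 0;  // stays 0 when there are no counts
  for (uint64_t c : counts) {
    running += c;
    last = c;
    if (running >= desired) break;
  }
  return last;
}
```

For counts {900, 90, 9, 1} the threshold is 9 at the 99.9% cutoff and 90 at the 99% cutoff, so lowering the cutoff raises the bar a count must clear to be treated as hot.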