Download Raw Diff

Details

Reviewers

kazu
mtrofin

Commits

rGaa6ee0370952: [NFC][Inliner] Introduce another multiplier for cost benefit analysis and make…

Summary

The motivation is to expose tunable knobs to control the aggressiveness of inlines for different backend (e.g., machines with different icache size, and workload with different icache/itlb PMU counters). Tuning inline aggressiveness shows a small (~+0.3%) but stable improvement on workload/hardware that is more frontend bound.
Both multipliers could be overridden from command line.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

mingmingl created this revision.Jun 16 2023, 9:58 AM

Herald added a project: Restricted Project. · View Herald TranscriptJun 16 2023, 9:58 AM

Herald added subscribers: ChuanqiXu, haicheng, hiraditya. · View Herald Transcript

mingmingl requested review of this revision.Jun 16 2023, 9:58 AM

Herald added a project: Restricted Project. · View Herald TranscriptJun 16 2023, 9:58 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

mingmingl added a subscriber: kazu.Jun 16 2023, 9:59 AM

Harbormaster completed remote builds in B239464: Diff 532208.Jun 16 2023, 12:30 PM

Update patch to have per-target parameters. Tests WIP

Herald added a subscriber: kristof.beyls. · View Herald TranscriptJun 20 2023, 9:56 AM

mingmingl added a child revision: D153368: [AArch64][Inliner] Adjust savings multiplier for aarch64 inliner.Jun 20 2023, 10:08 AM

Harbormaster completed remote builds in B240050: Diff 532971.Jun 20 2023, 11:17 AM

kazu added inline comments.Jun 20 2023, 5:34 PM

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h
72 ↗	(On Diff #532971)	IIUC, with this patch, we stop honoring the command line option `-inline-savings-multiplier`. Is there any way you could retain the ability to specify the multiplier as a command line option so that we can easily experiment with different values without rebuilding the compiler? You could do something like: if (InlineSavingsMultiplier.getNumOccurrences()) { // Honor the option. ... } else { // Do whatever architecture-specific things we need to do. ... }
llvm/lib/Analysis/InlineCost.cpp
920–924	IIUC, we are returning `false` if the cost-benefit ratio is below some threshold. Could you somehow mention that we are comparing the cost-benefit ratio against some threshold as a comment? // Do not inline if the savings does not justify the cost of inlining. // Specifically, we evaluate the following inequality: // // CycleSavings PSI->getOrCompHotCountThreshold() // -------------- <= -------------------------------------------------------- // Size TTI.getInliningCostBenefitAnalysisProfitableMultiplier() If you come with some concise language instead of nearly repeating the entire comment block, that's even better.

Revive this NFC patch to expose tunable knobs for workloads/hardwares that could be more frontend bound.

Address feedback.

Let me know if you'd prefer to review this on Github PR or use Phabricator. I'm fine with both.

Harbormaster completed remote builds in B257504: Diff 557191.Sep 21 2023, 12:49 PM

kazu added inline comments.Sep 25 2023, 12:52 PM

llvm/lib/Analysis/InlineCost.cpp
103	I am the one that suggested: if (InlineSavingsMultiplier.getNumOccurrences()) { // Honor the option. ... } else { // Do whatever architecture-specific things we need to do. ... } below. That said, with this approach, we do not use the default value of `InlineSavingsMultiplier` at all. That is, changing the default value at the source code level (as opposed to the command-line level) has no effect. This is very confusing. I am not sure if there is a better way to juggle three things -- the compiler-wide default, the architecture-specific default, and the command-line override. May I suggest a comment like this? // We honor this option only when it is explicitly specified. // The default value below isn't used at all. If you wish to change it, update // TargetTransformInfoImplBase::getInliningCostBenefitAnalysisSavingsMultiplier. static cl::opt<int> InlineSavingsMultiplier(
107	Likewise, may I suggest a comment like this? // We honor this option only when it is explicitly specified. // The default value below isn't used at all. If you wish to change it, update // TargetTransformInfoImplBase::getInliningCostBenefitAnalysisProfitableMultiplier.
903–915	IIUC, you want to return `true` if the ratio of the cycle savings to the size is really high and `false` if it's really low. If the ratio falls somewhere in the middle, then you want to fall back to the cost-based analysis. If you are making this change, I would like to make this picture clear, especially given that it's not easy to read the code that tries to avoid divisions. May I suggest replacing the comment above with something like this? // Let R be the ratio of CycleSavings to Size. We accept the inlining // opportunity if R is really high and reject if R is really low. If R is // somewhere in the middle, we fall back to the cost-based analysis. // // Specifically, we accept the inlining opportunity if: // // PSI->getOrCompHotCountThreshold() // R > ------------------------------------------------- // getInliningCostBenefitAnalysisSavingsMultiplier() // // and reject the inlining opportunity if: // // PSI->getOrCompHotCountThreshold() // R <= ---------------------------------------------------- // getInliningCostBenefitAnalysisProfitableMultiplier() // // Otherwise, we fall back to the cost-based analysis. Sure, we would be repeating comments like "Otherwise, we fall back to" below, but IMHO it's much more important to get the high-level idea across.
928	Remove this empty inline for consistency with the block above.
932	// Otherwise, fall back to the cost-based analysis. Use "fall back" (with a space in between) as a verb. We are falling back to the specific analysis, so insert "the". "Cost-benefit analysis" is a proverbial phrase, but "cost-threshold analysis" isn't. May I suggest "cost-based analysis" or "cost-only analysis"?
llvm/test/Transforms/Inline/inline-cost-benefit-multiplier-override.ll
1 ↗	(On Diff #557191)	Remove this empty line.
69 ↗	(On Diff #557191)	Insert a new line at the end.

Thanks for the suggestions around readability! Resolve comments.

llvm/test/Transforms/Inline/inline-cost-benefit-multiplier-override.ll
69 ↗	(On Diff #557191)	Whoops. This `No newline at end of file` should be an inserted by the editor. I removed it.

Harbormaster completed remote builds in B257582: Diff 557326.Sep 25 2023, 2:51 PM

LGTM. Thanks!

This revision is now accepted and ready to land.Sep 28 2023, 1:30 PM

thanks for the reviews and suggestions! It's great to keep this in Phab (before reviews are fully on Github PR) ˙ᵕ˙

I'll do a sanity check of run, add a brief comment to explain the motivation (hardwares with much smaller icache and weaker frontend might need a per-target multiplier), and then land this.

I did a sanity check of this NFC change; the performance result is neutral (expected).

Closed by commit rGaa6ee0370952: [NFC][Inliner] Introduce another multiplier for cost benefit analysis and make… (authored by mingmingl). · Explain WhyOct 2 2023, 9:27 PM

This revision was automatically updated to reflect the committed changes.

mingmingl added a commit: rGaa6ee0370952: [NFC][Inliner] Introduce another multiplier for cost benefit analysis and make….

Diff 532208

llvm/lib/Analysis/InlineCost.cpp

Show First 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	static cl::opt<int>
ColdCallSiteThreshold("inline-cold-callsite-threshold", cl::Hidden,		ColdCallSiteThreshold("inline-cold-callsite-threshold", cl::Hidden,
cl::init(45),		cl::init(45),
cl::desc("Threshold for inlining cold callsites"));		cl::desc("Threshold for inlining cold callsites"));

static cl::opt<bool> InlineEnableCostBenefitAnalysis(		static cl::opt<bool> InlineEnableCostBenefitAnalysis(
"inline-enable-cost-benefit-analysis", cl::Hidden, cl::init(false),		"inline-enable-cost-benefit-analysis", cl::Hidden, cl::init(false),
cl::desc("Enable the cost-benefit analysis for the inliner"));		cl::desc("Enable the cost-benefit analysis for the inliner"));

		static cl::opt<bool> EnableHybridAnalysis(
		"inline-hybrid-analysis", cl::init(false), cl::Hidden,
		cl::desc("If set to true, a hybrid of cost-benefit analysis and "
		"cost-threshold analysis is used. In parctice, a hybrid model "
		"should be used when machine-functions-splitter is not enabled."));

		static cl::opt<int> AggressiveInlineSavingsMultiplier(
		"aggressive-inline-savings-multiplier", cl::Hidden, cl::init(3),
		cl::desc("The multiplier to multiply cycle savings. If a callee's cycle "
		"saving is greater than cost using this multiplier, it's a "
		"stronger signal that inlining this callee is useful"));

static cl::opt<int> InlineSavingsMultiplier(		static cl::opt<int> InlineSavingsMultiplier(
		kazuUnsubmitted Done Reply Inline Actions I am the one that suggested: if (InlineSavingsMultiplier.getNumOccurrences()) { // Honor the option. ... } else { // Do whatever architecture-specific things we need to do. ... } below. That said, with this approach, we do not use the default value of `InlineSavingsMultiplier` at all. That is, changing the default value at the source code level (as opposed to the command-line level) has no effect. This is very confusing. I am not sure if there is a better way to juggle three things -- the compiler-wide default, the architecture-specific default, and the command-line override. May I suggest a comment like this? // We honor this option only when it is explicitly specified. // The default value below isn't used at all. If you wish to change it, update // TargetTransformInfoImplBase::getInliningCostBenefitAnalysisSavingsMultiplier. static cl::opt<int> InlineSavingsMultiplier( kazu: I am the one that suggested: ``` if (InlineSavingsMultiplier.getNumOccurrences()) { // Honor…
"inline-savings-multiplier", cl::Hidden, cl::init(8),		"inline-savings-multiplier", cl::Hidden, cl::init(8),
cl::desc("Multiplier to multiply cycle savings by during inlining"));		cl::desc("Multiplier to multiply cycle savings by during inlining"));

static cl::opt<int>		static cl::opt<int>
		kazuUnsubmitted Done Reply Inline Actions Likewise, may I suggest a comment like this? // We honor this option only when it is explicitly specified. // The default value below isn't used at all. If you wish to change it, update // TargetTransformInfoImplBase::getInliningCostBenefitAnalysisProfitableMultiplier. kazu: Likewise, may I suggest a comment like this? ``` // We honor this option only when it is…
InlineSizeAllowance("inline-size-allowance", cl::Hidden, cl::init(100),		InlineSizeAllowance("inline-size-allowance", cl::Hidden, cl::init(100),
cl::desc("The maximum size of a callee that get's "		cl::desc("The maximum size of a callee that get's "
"inlined without sufficient cycle savings"));		"inlined without sufficient cycle savings"));

// We introduce this threshold to help performance of instrumentation based		// We introduce this threshold to help performance of instrumentation based
// PGO before we actually hook up inliner with analysis passes such as BPI and		// PGO before we actually hook up inliner with analysis passes such as BPI and
// BFI.		// BFI.
static cl::opt<int> ColdThreshold(		static cl::opt<int> ColdThreshold(
▲ Show 20 Lines • Show All 779 Lines • ▼ Show 20 Lines	std::optional<bool> costBenefitAnalysis() {
int Size = Cost - ColdSize;		int Size = Cost - ColdSize;

// Allow tiny callees to be inlined regardless of whether they meet the		// Allow tiny callees to be inlined regardless of whether they meet the
// savings threshold.		// savings threshold.
Size = Size > InlineSizeAllowance ? Size - InlineSizeAllowance : 1;		Size = Size > InlineSizeAllowance ? Size - InlineSizeAllowance : 1;

CostBenefit.emplace(APInt(128, Size), CycleSavings);		CostBenefit.emplace(APInt(128, Size), CycleSavings);

// Return true if the savings justify the cost of inlining. Specifically,		// Return true if the savings justify the cost of inlining. Specifically,
// we evaluate the following inequality:		// we evaluate the following inequality:
//		//
// CycleSavings PSI->getOrCompHotCountThreshold()		// CycleSavings PSI->getOrCompHotCountThreshold()
// -------------- >= -----------------------------------		// -------------- >= -----------------------------------
// Size InlineSavingsMultiplier		// Size InlineSavingsMultiplier
//		//
// Note that the left hand side is specific to a call site. The right hand		// Note that the left hand side is specific to a call site. The right hand
// side is a constant for the entire executable.		// side is a constant for the entire executable.
APInt LHS = CycleSavings;		APInt LHS = CycleSavings;
LHS *= InlineSavingsMultiplier;		LHS *= AggressiveInlineSavingsMultiplier;
APInt RHS(128, PSI->getOrCompHotCountThreshold());		APInt RHS(128, PSI->getOrCompHotCountThreshold());
RHS *= Size;		RHS *= Size;
		kazuUnsubmitted Done Reply Inline Actions IIUC, you want to return `true` if the ratio of the cycle savings to the size is really high and `false` if it's really low. If the ratio falls somewhere in the middle, then you want to fall back to the cost-based analysis. If you are making this change, I would like to make this picture clear, especially given that it's not easy to read the code that tries to avoid divisions. May I suggest replacing the comment above with something like this? // Let R be the ratio of CycleSavings to Size. We accept the inlining // opportunity if R is really high and reject if R is really low. If R is // somewhere in the middle, we fall back to the cost-based analysis. // // Specifically, we accept the inlining opportunity if: // // PSI->getOrCompHotCountThreshold() // R > ------------------------------------------------- // getInliningCostBenefitAnalysisSavingsMultiplier() // // and reject the inlining opportunity if: // // PSI->getOrCompHotCountThreshold() // R <= ---------------------------------------------------- // getInliningCostBenefitAnalysisProfitableMultiplier() // // Otherwise, we fall back to the cost-based analysis. Sure, we would be repeating comments like "Otherwise, we fall back to" below, but IMHO it's much more important to get the high-level idea across. kazu: IIUC, you want to return `true` if the ratio of the cycle savings to the size is really high…
return LHS.uge(RHS);		if (LHS.uge(RHS))
		return true;

		APInt ConservativeLHS = CycleSavings;
		ConservativeLHS *= InlineSavingsMultiplier;

		if (!ConservativeLHS.uge(RHS))
		return false;

		kazuUnsubmitted Done Reply Inline Actions IIUC, we are returning `false` if the cost-benefit ratio is below some threshold. Could you somehow mention that we are comparing the cost-benefit ratio against some threshold as a comment? // Do not inline if the savings does not justify the cost of inlining. // Specifically, we evaluate the following inequality: // // CycleSavings PSI->getOrCompHotCountThreshold() // -------------- <= -------------------------------------------------------- // Size TTI.getInliningCostBenefitAnalysisProfitableMultiplier() If you come with some concise language instead of nearly repeating the entire comment block, that's even better. kazu: IIUC, we are returning `false` if the cost-benefit ratio is below some threshold. Could you…
		return std::nullopt;
}		}

InlineResult finalizeAnalysis() override {		InlineResult finalizeAnalysis() override {
		kazuUnsubmitted Done Reply Inline Actions Remove this empty inline for consistency with the block above. kazu: Remove this empty inline for consistency with the block above.
// Loops generally act a lot like calls in that they act like barriers to		// Loops generally act a lot like calls in that they act like barriers to
// movement, require a certain amount of setup, etc. So when optimising for		// movement, require a certain amount of setup, etc. So when optimising for
// size, we penalise any call sites that perform loops. We do this after all		// size, we penalise any call sites that perform loops. We do this after all
// other costs here, so will likely only be dealing with relatively small		// other costs here, so will likely only be dealing with relatively small
		kazuUnsubmitted Done Reply Inline Actions // Otherwise, fall back to the cost-based analysis. Use "fall back" (with a space in between) as a verb. We are falling back to the specific analysis, so insert "the". "Cost-benefit analysis" is a proverbial phrase, but "cost-threshold analysis" isn't. May I suggest "cost-based analysis" or "cost-only analysis"? kazu: ``` // Otherwise, fall back to the cost-based analysis. ``` - Use "fall back" (with a space in…
// functions (and hence DT and LI will hopefully be cheap).		// functions (and hence DT and LI will hopefully be cheap).
auto *Caller = CandidateCall.getFunction();		auto *Caller = CandidateCall.getFunction();
if (Caller->hasMinSize()) {		if (Caller->hasMinSize()) {
DominatorTree DT(F);		DominatorTree DT(F);
LoopInfo LI(DT);		LoopInfo LI(DT);
int NumLoops = 0;		int NumLoops = 0;
for (Loop *L : LI) {		for (Loop *L : LI) {
// Ignore loops that will not be executed		// Ignore loops that will not be executed
▲ Show 20 Lines • Show All 2,284 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[NFC][Inliner] Introduce another multiplier for cost benefit analysis and make multipliers overriddable in TargetTransformInfo.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 532208

llvm/lib/Analysis/InlineCost.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[NFC][Inliner] Introduce another multiplier for cost benefit analysis and make multipliers overriddable in TargetTransformInfo.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 532208

llvm/lib/Analysis/InlineCost.cpp

[NFC][Inliner] Introduce another multiplier for cost benefit analysis and make multipliers overriddable in TargetTransformInfo.
ClosedPublic