This is an archive of the discontinued LLVM Phabricator instance.


Differential D94334

[InlineAdvisor] Allow replay of inline decisions for the CGSCC inliner from optimization remarks
Closed, Public

Authored by modimo on Jan 8 2021, 12:36 PM.

Details

Reviewers

mtrofin
wenlei
wmi
davidxl

Commits

rGce7f9cdb50a9: [InlineAdvisor] Allow replay of inline decisions for the CGSCC inliner from…

Summary

This change extends the replay support added for the SampleProfile inliner in D83743 so that it can also be used in the CGSCC inliner. NOTE: currently restricted to non-ML advisors only.

The added switch -cgscc-inline-replay=<remarks file> replays the inlining decisions in that file, where the remarks file is generated via -Rpass=inline. The aim is to let changes that would otherwise perturb inlining decisions be analyzed separately from that effect. Doing so allows easier examination of assembly and runtime behavior against the baseline, rather than digging through the large churn caused by inlining differences.

In LTO compilation, since inlining is done twice, you can specify replay separately by passing the flag to the FE (-cgscc-inline-replay=) and to the linker (-Wl,cgscc-inline-replay=), each with the remarks generated at its respective stage.

Testing on mysqld: comparing the inline decisions between base (which generates remarks.txt) and diff (a replay using identical input/tools with remarks.txt), and examining the inlining sites with diff, shows 14,000 mismatches out of 247,341, for a ~94% replay accuracy. I believe this gap can be narrowed further, though in the general case we may never achieve full accuracy. For my personal use this is close enough to be representative: I set the baseline as the build generated by replay on the identical input/toolset and compare that to my modified input/toolset using the same replay.

Testing:
ninja check-llvm
newly added test correctly replays CGSCC inlining decisions

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

modimo created this revision. Jan 8 2021, 12:36 PM

Herald added subscribers: hoy, wenlei, lxfind and 2 others. Jan 8 2021, 12:36 PM

modimo requested review of this revision. Jan 8 2021, 12:36 PM

Herald added a project: Restricted Project. Jan 8 2021, 12:36 PM

Herald added a subscriber: llvm-commits.

modimo retitled this revision from cgscc replay to [InlineAdvisor] Allow replay of inline decisions for the CGSCC inliner from optimization remarks. Jan 8 2021, 1:39 PM

modimo edited the summary of this revision.

modimo added reviewers: mtrofin, wenlei, wmi.

modimo added a reviewer: davidxl.

Harbormaster completed remote builds in B84520: Diff 315490. Jan 8 2021, 1:52 PM

wenlei added inline comments. Jan 8 2021, 2:17 PM

llvm/lib/Analysis/ReplayInlineAdvisor.cpp
61	This looks redundant/similar to `DefaultInlineAdvice`, is that just for controlling `EmitRemarks`? ORE should be able to handle remark printing (or not) correctly without extra guard.

mtrofin added inline comments. Jan 8 2021, 2:28 PM

llvm/lib/Analysis/ReplayInlineAdvisor.cpp
61	Along the same lines as @wenlei's comment - if the Advisor can both generate and digest the trace of decisions, why rely on ORE and not, instead, use a more structured format that wouldn't need parsing like ReplayInlineAdvisor.cpp:43? Also (if using ORE is desirable, in which case I share @wenlei's question), I think there's a yaml output format ORE generates; perhaps requiring that as input would also simplify ingestion?

modimo mentioned this in D94333: [Inliner] Change inline remark format and update ReplayInlineAdvisor to use it. Jan 8 2021, 4:58 PM

Move the ReplayInlineAdvisor.cpp/h and SampleProfile.cpp files to D94333 as they need to be atomic with the remarks format change.

modimo edited the summary of this revision. Jan 8 2021, 6:15 PM

modimo added inline comments. Jan 8 2021, 6:29 PM

llvm/lib/Analysis/ReplayInlineAdvisor.cpp
61	I've moved this section to D94333, since the replay mechanism change to consume the new `line:col.discriminator` format needed to land together with the format change. I've folded `ReplayInlineAdvice` back into `DefaultInlineAdvice` with the additional features I need. The extra guard is needed because the SampleProfile inliner uses the "legacy PM" mechanism of inline printing rather than bundling it with InlineAdvice calls. Since the use of InlineAdvice in SampleProfile is purely to support replay right now, I'm leaving that refactoring (if we want to go after it) for the future.
	As for using yaml, I like how condensed the format is in remarks form. Something that's a single line in remarks ends up as 24 lines in yaml (like in llvm/test/Transforms/Inline/optimization-remarks-passed-yaml.ll), which makes manual reading and modification tedious, especially on larger binaries. The current text processing is also fairly simple as is, which makes a change here less pressing.
	That being said, I'm not against using the yaml file as the official/supported format. A nice advantage there is that if we wanted to add more replay data (say, negative inline decisions) it would be smoother in yaml than adding new parsing of the text remark.
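
As a rough illustration of the "fairly simple" text processing mentioned in this comment, here is a minimal, self-contained sketch. The remark layout it accepts (`<loc>: <callee> inlined into <caller> ... at callsite <site>;`) and the names `InlineSite` and `parseInlineRemark` are assumptions made for this example; they are not the exact format or code used in ReplayInlineAdvisor.cpp.

```cpp
#include <optional>
#include <string>
#include <string_view>

// Hypothetical record of one replayable decision.
struct InlineSite {
  std::string Callee;
  std::string CallSite;
};

// Parses one remark line of the ASSUMED form
//   "<loc>: <callee> inlined into <caller> ... at callsite <site>;"
// and returns std::nullopt when the line does not match.
inline std::optional<InlineSite> parseInlineRemark(std::string_view Line) {
  constexpr std::string_view AtCallSite = " at callsite ";
  constexpr std::string_view InlinedInto = " inlined into ";

  const auto SitePos = Line.find(AtCallSite);
  if (SitePos == std::string_view::npos)
    return std::nullopt;

  // Everything after " at callsite " up to the first ';' is the call site.
  std::string_view Site = Line.substr(SitePos + AtCallSite.size());
  Site = Site.substr(0, Site.find(';'));

  // The callee is the last ": "-separated token before " inlined into ".
  const std::string_view Head = Line.substr(0, SitePos);
  const auto IntoPos = Head.find(InlinedInto);
  if (IntoPos == std::string_view::npos)
    return std::nullopt;
  std::string_view Callee = Head.substr(0, IntoPos);
  const auto ColonPos = Callee.rfind(": ");
  if (ColonPos != std::string_view::npos)
    Callee = Callee.substr(ColonPos + 2);

  if (Callee.empty() || Site.empty())
    return std::nullopt;
  return InlineSite{std::string(Callee), std::string(Site)};
}
```

A structured format such as yaml would replace this string splitting with a parser-provided record, at the cost of the line density discussed above.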

weiwang added a subscriber: weiwang. Jan 11 2021, 1:07 PM

wenlei added inline comments. Jan 12 2021, 9:07 PM

llvm/lib/Transforms/IPO/Inliner.cpp
644	Plugging in the replay inline advisor here isn't extensible. In the future we want to be able to use inline replay only for a specific function, or enforce/prevent certain inlining at a particular callsite, and fall back to the regular advisor for the rest (see comments in D83743). That means we would need to be able to fall back from the replay advisor to the default advisor (or whatever main advisor is being used) when the replay advisor doesn't have info. For that cascaded model, we would need inline advice to have something like `hasInlineRecommendation` in addition to `isInliningRecommended`. We should probably still record inlining on each advice, but don't want to emit duplicated remarks from each advice. These changes can come later, but the current change had better offer that flexibility - we don't want to stick to the replay advisor for the entire module inliner pass.
llvm/test/Transforms/Inline/cgscc-inline-replay.ll
5	Better verify inline decision without replay first, to make sure the replay has visible impact on inlining. See DEFAULT and REPLAY check for SampleProfile/inline-replay.ll.

modimo edited the summary of this revision. Jan 14 2021, 3:00 PM

modimo added inline comments. Jan 14 2021, 3:15 PM

llvm/lib/Transforms/IPO/Inliner.cpp
644	I think your suggested change is to initialize both Advisors and allow fallback if we defer on one. With how the current scheme is set up, though, advisors are single entities and we only ask once: `auto Advice = Advisor.getAdvice(*CB);`. A proven approach to doing something like this is alias analysis, where we query AA in a specific order until we hit a real recommendation. That would require an overhaul of this function rather than a small tweak, so I don't see a good intermediate step to take for this patch. Let me know if you think there's something to do here now for it.
llvm/test/Transforms/Inline/cgscc-inline-replay.ll
5	Makes sense, added.
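
The ordered-query idea discussed for Inliner.cpp:644 (ask advisors in a fixed order until one gives a real recommendation) can be sketched as follows. This is only an illustration under stated assumptions: `SiteAdvisor`, `AllowListAdvisor`, `ConstantAdvisor` and `firstRecommendation` are hypothetical names, call sites are plain strings, and `std::optional<bool>` stands in for the `hasInlineRecommendation` / `isInliningRecommended` pair mentioned earlier. None of this is LLVM's actual InlineAdvisor API.

```cpp
#include <memory>
#include <optional>
#include <set>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, simplified advisor: it either has an opinion about a call
// site (true = inline, false = do not inline) or defers with std::nullopt.
struct SiteAdvisor {
  virtual ~SiteAdvisor() = default;
  virtual std::optional<bool> advise(const std::string &CallSite) const = 0;
};

// Hypothetical replay-style advisor: has an opinion only for listed sites.
class AllowListAdvisor : public SiteAdvisor {
public:
  explicit AllowListAdvisor(std::set<std::string> Sites)
      : Sites(std::move(Sites)) {}
  std::optional<bool> advise(const std::string &CallSite) const override {
    if (Sites.count(CallSite))
      return true;
    return std::nullopt; // No info: let the next advisor decide.
  }

private:
  std::set<std::string> Sites;
};

// Hypothetical main advisor: always has an answer.
class ConstantAdvisor : public SiteAdvisor {
public:
  explicit ConstantAdvisor(bool Answer) : Answer(Answer) {}
  std::optional<bool> advise(const std::string &) const override {
    return Answer;
  }

private:
  bool Answer;
};

// Queries advisors in priority order and returns the first real
// recommendation, or std::nullopt if every advisor defers. The `if` tests
// whether the optional holds a value, so a "do not inline" answer also
// stops the search.
inline std::optional<bool>
firstRecommendation(const std::vector<std::unique_ptr<SiteAdvisor>> &Chain,
                    const std::string &CallSite) {
  for (const auto &A : Chain)
    if (auto Decision = A->advise(CallSite))
      return Decision;
  return std::nullopt;
}
```

Putting the replay-style advisor first and the main advisor last gives the fallback behavior requested in the review, at the price of restructuring the single `getAdvice` call site.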

Add DEFAULT testing to make sure baseline inlining differs from replay. Fix copy-paste error in flag description for -cgscc-inline-replay

modimo marked an inline comment as not done. Jan 15 2021, 11:56 AM

modimo added inline comments.

llvm/lib/Analysis/ReplayInlineAdvisor.cpp
61	> Along the same lines as @wenlei's comment - if the Advisor can both generate and digest the trace of decisions, why rely on ORE and not, instead, use a more structured format that wouldn't need parsing like ReplayInlineAdvisor.cpp:43?
	@mtrofin I very much agree on this point. My personal front-runners are a CSV file, which keeps the line density but makes parsing easy, or a tree format based on what @wenlei shows in D82213:
	Inlinees for main
	[P] _ZN15largesolidarrayIP6regobjEixEi @ 369
	[P] _Z7random1i @ 363
	[C] _Z8myrandomv @ 2
	[P] _Z7random1i @ 364
	[C] _Z8myrandomv @ 2
	[P] _ZN15largesolidarrayIP6regobjEixEi @ 366
	[P] _ZN6wayobj9createwayEiiiiRP8point16tRi @ 327
	[P] _ZN6wayobj11createwayarEiiRP8point16tRi @ 37.1
	[P] _ZN6wayobj5indexEii @ 143
	[P] _ZN6wayobj5indexEii @ 130
	[P] _ZN6wayobj6indexxEi @ 31
	[P] _ZN6wayobj6indexyEi @ 32
	[C] _ZN8point16tC2Ess @ 2
	[C] _ZN8point16tC2Ess @ 2.1
	I do want to see what users think about the current flow, which is currently the same between the CGSCC and sample inliners, because there are definitely more refinements (additional replay accuracy, more logging, global allow-list/block-list etc.) that can be pursued but for which I don't have a sense of value/priority. I'm hoping that'll give us more information on what path to pursue here.

mtrofin mentioned this in D94825: [NewPM][Inliner] Move the 'always inliner' case in the same CGSCC pass as 'regular' inliner. Jan 15 2021, 3:00 PM

wenlei added inline comments. Jan 18 2021, 11:34 PM

llvm/lib/Transforms/IPO/Inliner.cpp
644	Well, we don't have to have everything set up for the cascaded query like how AA works, but something more flexible than having the entire inliner stick to one advisor would be good (and does not seem like a significant change).
	What I was thinking about is that the main advisor can still go through the `getAdvisor` interface; then for inline replay, we can just let `ModuleInlinerWrapperPass` own an `ExternalInlineAdvisor`, just like how `SampleProfileLoader` owns one. Then it can be passed to `InlinerPass` and serve as a short-circuit lookup or side lookup when available, in addition to the main advisor from `getAdvisor`.
	The changes to add `hasInlineRecommendation` etc. are not what I'm suggesting for this patch, though I don't think these are significant either. It can evolve into cascaded advice support in the framework if needed, but if replay inline advice is the only case needing that support, generalizing it doesn't seem like a must-do.

mtrofin added inline comments. Jan 19 2021, 8:08 AM

llvm/lib/Transforms/IPO/Inliner.cpp
644	+1 to what @wenlei said - you can wrap advisors in advisors, basically. There may be some helper utilities that we need factored into InlineAdvisor, and at one point I was thinking of doing that, but the motivating scenario at the time ended up not really needing it. If this turns out to be that scenario, I'd be happy to help!

Wrap ReplayInlineAdvisor into InlineAdvisor so that we can fall back to the original Advisor if we don't want to follow the replay. I think the composition of advisors here makes sense but I'm not sure so I'm very open to different approaches.

wenlei added inline comments. Jan 21 2021, 4:24 PM

llvm/include/llvm/Analysis/InlineAdvisor.h
199	By adding a `ReplayAdvisor` field into every `InlineAdvisor`, we're allowing advisors to be chained. It'd be weird if a replay advisor itself has a non-empty replay advisor though the current implementation doesn't prohibit that. However if we change the names to be like a fall back advisor, and let replay advisor be just a use case of the fall back chain, I think that would be more reasonable.

mtrofin added inline comments. Jan 21 2021, 5:43 PM

llvm/include/llvm/Analysis/InlineAdvisor.h
199	Instead of all advisors knowing about replay, why not do it vice versa: Replay wraps other advisors. Then, in InlineAdvisorAnalysis::Result::tryCreate (in InlineAdvisor.cpp), you see if replaying is requested, and build the ReplayInlineAdvisor wrapping the advisor requested initially - something like adding, right before return:
	if (ReplayRequired) Advisor = std::make_unique<ReplayInlineAdvisor>(<params>, std::move(Advisor))
	I believe this keeps the concerns (replaying vs regular advising) separated, while also allowing future use cases where the replay advisor can delegate to some other advisor, generically.
	Note: I think you'd also need to have a ReplayInlineAdvice wrapping the InlineAdvice coming from the contained InlineAdvisor.
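
The "Replay wraps other advisors" suggestion is essentially a decorator built at a single creation point. The sketch below shows that shape with deliberately simplified, hypothetical types (`Advisor`, `DefaultAdvisor`, `ReplayAdvisor`, `createAdvisor`, string call sites); it is not the LLVM InlineAdvisor interface, and the delegation to the wrapped advisor follows the discussion in this thread rather than any confirmed implementation.

```cpp
#include <cassert>
#include <memory>
#include <set>
#include <string>
#include <utility>

// Hypothetical, simplified advisor interface (not LLVM's InlineAdvisor).
class Advisor {
public:
  virtual ~Advisor() = default;
  virtual bool shouldInline(const std::string &CallSite) = 0;
};

// Stand-in for the heuristic advisor: this toy version inlines nothing.
class DefaultAdvisor : public Advisor {
public:
  bool shouldInline(const std::string &) override { return false; }
};

// Replay wraps the advisor that was requested originally. Call sites found
// in the replay set are inlined; everything else is delegated.
class ReplayAdvisor : public Advisor {
public:
  ReplayAdvisor(std::set<std::string> Sites, std::unique_ptr<Advisor> Original)
      : Sites(std::move(Sites)), Original(std::move(Original)) {
    assert(this->Original && "replay needs an advisor to fall back to");
  }
  bool shouldInline(const std::string &CallSite) override {
    if (Sites.count(CallSite))
      return true;
    return Original->shouldInline(CallSite);
  }

private:
  std::set<std::string> Sites;
  std::unique_ptr<Advisor> Original;
};

// Single creation point: build the requested advisor first, then wrap it
// only when replay was asked for (here: a non-empty set of sites).
inline std::unique_ptr<Advisor>
createAdvisor(const std::set<std::string> &ReplaySites) {
  std::unique_ptr<Advisor> A = std::make_unique<DefaultAdvisor>();
  if (!ReplaySites.empty())
    A = std::make_unique<ReplayAdvisor>(ReplaySites, std::move(A));
  return A;
}
```

Because the wrapping happens in the factory, the other advisor classes need no knowledge of replay, which is the separation of concerns argued for in the comment.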

Nest original advisors in ReplayInlineAdvisor rather than the other way around.

Nice! LGTM (assuming comment in tryCreate is addressed)

llvm/lib/Analysis/InlineAdvisor.cpp
187	Probably best to check first if Advisor isn't null before line 180, then not bother making a replay advisor if the underlying one can't be made in the first place (and just return false)

This revision is now accepted and ready to land. Jan 22 2021, 1:41 PM

modimo added inline comments. Jan 22 2021, 1:42 PM

llvm/include/llvm/Analysis/InlineAdvisor.h
199	> However if we change the names to be like a fall back advisor, and let replay advisor be just a use case of the fall back chain, I think that would be more reasonable.
	Fall back would be better wording for it, agreed.
	> Instead of all advisors knowing about replay, why not doing it vice-versa: Replay wraps other advisors.
	I like it, knowing about `tryCreate` makes it easier than I first thought given it's a centralizing creation point where we can wrap.
	> Note: I think you'd also need to have a ReplayInlineAdvice wrapping the InlineAdvice coming from the contained InlineAdvisor.
	Can you elaborate on what would go into `ReplayInlineAdvice`? My thinking is that if the `ReplayInlineAdvisor::getAdviceImpl` declines to offer advice then we go to `OriginalAdvisor->getAdvice(CB)` so wrapping doesn't seem needed.

mtrofin added inline comments. Jan 22 2021, 1:49 PM

llvm/include/llvm/Analysis/InlineAdvisor.h
199	TL;DR; right now it's probably fine. Longer story: the ML advisors are stateful - they track module-wide changes. So if we wanted to combine replaying with one of those, then the replayer would always have to get an advice from the underlying advisor, so it'd be able to notify back through it on what actually happened. But the motivation for that scenario is kind of tenuous, I think, and it'd complicate the design unnecessarily. May be better to just disallow replaying with anything else other than the default advisor and we can add there a comment as to why.
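
To make the stateful-advisor concern concrete, here is a small sketch of what replay over such an advisor would have to do: always query the inner advisor and always report the real outcome back, even when the replay decision overrides it. `StatefulAdvisor` and `replayOver` are hypothetical names and the counters merely stand in for module-wide state; this is not how the ML advisors are implemented.

```cpp
#include <string>

// Hypothetical stateful advisor: it counts queries and the inlinings that
// actually happened, standing in for module-wide state an ML advisor tracks.
class StatefulAdvisor {
public:
  // This stand-in never recommends inlining on its own.
  bool advise(const std::string & /*CallSite*/) {
    ++Queries;
    return false;
  }
  // Must be called with what really happened, or the state drifts.
  void recordOutcome(bool Inlined) {
    if (Inlined)
      ++InlinedCount;
  }
  int queries() const { return Queries; }
  int inlinedCount() const { return InlinedCount; }

private:
  int Queries = 0;
  int InlinedCount = 0;
};

// Replay layered on a stateful advisor: the inner advisor is queried for
// every call site and notified of the final decision, including the cases
// where the replay file forces inlining against its own advice.
inline bool replayOver(StatefulAdvisor &Inner, bool SiteInReplayFile,
                       const std::string &CallSite) {
  const bool InnerDecision = Inner.advise(CallSite);
  const bool Final = SiteInReplayFile || InnerDecision;
  Inner.recordOutcome(Final);
  return Final;
}
```

This extra bookkeeping is the complication that restricting replay to the default advisor avoids.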

Restrict replay to default advisor only

Looks great, thanks!

Closed by commit rGce7f9cdb50a9: [InlineAdvisor] Allow replay of inline decisions for the CGSCC inliner from… (authored by modimo). Jan 25 2021, 3:39 PM

This revision was automatically updated to reflect the committed changes.

modimo added a commit: rGce7f9cdb50a9: [InlineAdvisor] Allow replay of inline decisions for the CGSCC inliner from….

Revision Contents

Path                                                        Size
llvm/include/llvm/Analysis/InlineAdvisor.h                  3 lines
llvm/include/llvm/Analysis/ReplayInlineAdvisor.h            6 lines
llvm/include/llvm/Transforms/IPO/Inliner.h                  3 lines
llvm/lib/Analysis/InlineAdvisor.cpp                         12 lines
llvm/lib/Analysis/ReplayInlineAdvisor.cpp                   12 lines
llvm/lib/Transforms/IPO/Inliner.cpp                         24 lines
llvm/lib/Transforms/IPO/SampleProfile.cpp                   3 lines
llvm/test/Transforms/Inline/Inputs/cgscc-inline-replay.txt  2 lines
llvm/test/Transforms/Inline/cgscc-inline-replay.ll          119 lines

Diff 319147

llvm/include/llvm/Analysis/InlineAdvisor.h

@@ 190 lines elided @@ static MandatoryInliningKind getMandatoryKind(CallBase &CB,
                                                 OptimizationRemarkEmitter &ORE);

   OptimizationRemarkEmitter &getCallerORE(CallBase &CB);

 private:
   friend class InlineAdvice;
   void markFunctionAsDeleted(Function *F);
   std::unordered_set<const Function *> DeletedFunctions;
 };

 /// The default (manual heuristics) implementation of the InlineAdvisor. This
 /// implementation does not need to keep state between inliner pass runs, and is
 /// reusable as-is for inliner pass test scenarios, as well as for regular use.
 class DefaultInlineAdvisor : public InlineAdvisor {
 public:
   DefaultInlineAdvisor(Module &M, FunctionAnalysisManager &FAM,
                        InlineParams Params)
@@ 15 lines elided @@ public:
   InlineAdvisorAnalysis() = default;
   struct Result {
     Result(Module &M, ModuleAnalysisManager &MAM) : M(M), MAM(MAM) {}
     bool invalidate(Module &, const PreservedAnalyses &,
                     ModuleAnalysisManager::Invalidator &) {
       // InlineAdvisor must be preserved across analysis invalidations.
       return false;
     }
-    bool tryCreate(InlineParams Params, InliningAdvisorMode Mode);
+    bool tryCreate(InlineParams Params, InliningAdvisorMode Mode,
+                   StringRef ReplayFile);
     InlineAdvisor *getAdvisor() const { return Advisor.get(); }
     void clear() { Advisor.reset(); }

   private:
     Module &M;
     ModuleAnalysisManager &MAM;
     std::unique_ptr<InlineAdvisor> Advisor;
   };
@@ 46 lines elided @@

llvm/include/llvm/Analysis/ReplayInlineAdvisor.h

@@ 19 lines elided @@
 class Module;
 class OptimizationRemarkEmitter;

 /// Replay inline advisor that uses optimization remarks from inlining of
 /// previous build to guide current inlining. This is useful for inliner tuning.
 class ReplayInlineAdvisor : public InlineAdvisor {
 public:
   ReplayInlineAdvisor(Module &M, FunctionAnalysisManager &FAM,
-                      LLVMContext &Context, StringRef RemarksFile,
-                      bool EmitRemarks);
+                      LLVMContext &Context,
+                      std::unique_ptr<InlineAdvisor> OriginalAdvisor,
+                      StringRef RemarksFile, bool EmitRemarks);
   std::unique_ptr<InlineAdvice> getAdviceImpl(CallBase &CB) override;
   bool areReplayRemarksLoaded() const { return HasReplayRemarks; }

 private:
   StringSet<> InlineSitesFromRemarks;
+  std::unique_ptr<InlineAdvisor> OriginalAdvisor;
   bool HasReplayRemarks = false;
   bool EmitRemarks = false;
 };
 } // namespace llvm
 #endif // LLVM_ANALYSIS_REPLAYINLINEADVISOR_H

llvm/include/llvm/Transforms/IPO/Inliner.h

 //===- Inliner.h - Inliner pass and infrastructure ---------------*- C++ -*-===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

 #ifndef LLVM_TRANSFORMS_IPO_INLINER_H
 #define LLVM_TRANSFORMS_IPO_INLINER_H

 #include "llvm/Analysis/CGSCCPassManager.h"
 #include "llvm/Analysis/CallGraphSCCPass.h"
 #include "llvm/Analysis/InlineAdvisor.h"
 #include "llvm/Analysis/InlineCost.h"
 #include "llvm/Analysis/LazyCallGraph.h"
+#include "llvm/Analysis/ReplayInlineAdvisor.h"
 #include "llvm/Analysis/Utils/ImportedFunctionsInliningStatistics.h"
 #include "llvm/IR/PassManager.h"
 #include <utility>

 namespace llvm {

 class AssumptionCacheTracker;
 class CallGraph;
@@ 75 lines elided @@ public:
   InlinerPass(InlinerPass &&Arg) = default;

   PreservedAnalyses run(LazyCallGraph::SCC &C, CGSCCAnalysisManager &AM,
                         LazyCallGraph &CG, CGSCCUpdateResult &UR);

 private:
   InlineAdvisor &getAdvisor(const ModuleAnalysisManagerCGSCCProxy::Result &MAM,
                             FunctionAnalysisManager &FAM, Module &M);
-  std::unique_ptr<DefaultInlineAdvisor> OwnedDefaultAdvisor;
+  std::unique_ptr<InlineAdvisor> OwnedAdvisor;
   const bool OnlyMandatory;
 };

 /// Module pass, wrapping the inliner pass. This works in conjunction with the
 /// InlineAdvisorAnalysis to facilitate inlining decisions taking into account
 /// module-wide state, that need to keep track of inter-inliner pass runs, for
 /// a given module. An InlineAdvisor is configured and kept alive for the
 /// duration of the ModuleInlinerWrapperPass::run.
@@ 31 lines elided @@

llvm/lib/Analysis/InlineAdvisor.cpp

@@ 10 lines elided @@
 //
 //===----------------------------------------------------------------------===//

 #include "llvm/Analysis/InlineAdvisor.h"
 #include "llvm/ADT/Statistic.h"
 #include "llvm/Analysis/InlineCost.h"
 #include "llvm/Analysis/OptimizationRemarkEmitter.h"
 #include "llvm/Analysis/ProfileSummaryInfo.h"
+#include "llvm/Analysis/ReplayInlineAdvisor.h"
 #include "llvm/Analysis/TargetLibraryInfo.h"
 #include "llvm/Analysis/TargetTransformInfo.h"
 #include "llvm/IR/DebugInfoMetadata.h"
 #include "llvm/IR/Instructions.h"
 #include "llvm/Support/CommandLine.h"
 #include "llvm/Support/raw_ostream.h"

 #include <sstream>
@@ 121 lines elided @@ void InlineAdvice::recordInliningWithCalleeDeleted() {
   recordInlineStatsIfNeeded();
   Advisor->markFunctionAsDeleted(Callee);
   recordInliningWithCalleeDeletedImpl();
 }

 AnalysisKey InlineAdvisorAnalysis::Key;

 bool InlineAdvisorAnalysis::Result::tryCreate(InlineParams Params,
-                                              InliningAdvisorMode Mode) {
+                                              InliningAdvisorMode Mode,
+                                              StringRef ReplayFile) {
   auto &FAM = MAM.getResult<FunctionAnalysisManagerModuleProxy>(M).getManager();
   switch (Mode) {
   case InliningAdvisorMode::Default:
     Advisor.reset(new DefaultInlineAdvisor(M, FAM, Params));
+    // Restrict replay to default advisor, ML advisors are stateful so
+    // replay will need augmentations to interleave with them correctly.
+    if (!ReplayFile.empty()) {
+      Advisor = std::make_unique<ReplayInlineAdvisor>(
+          M, FAM, M.getContext(), std::move(Advisor), ReplayFile,
+          /* EmitRemarks =*/true);
+    }
     break;
   case InliningAdvisorMode::Development:
 #ifdef LLVM_HAVE_TF_API
     Advisor =
         llvm::getDevelopmentModeAdvisor(M, MAM, [&FAM, Params](CallBase &CB) {
           auto OIC = getDefaultInlineAdvice(CB, FAM, Params);
           return OIC.hasValue();
         });
 #endif
     break;
   case InliningAdvisorMode::Release:
 #ifdef LLVM_HAVE_TF_AOT
     Advisor = llvm::getReleaseModeAdvisor(M, MAM);
 #endif
     break;
   }

   return !!Advisor;
 }

 /// Return true if inlining of CB can block the caller from being
 /// inlined which is proved to be more beneficial. \p IC is the
 /// estimated inline cost associated with callsite \p CB.
 /// \p TotalSecondaryCost will be set to the estimated cost of inlining the
 /// caller if \p CB is suppressed for inlining.
 static bool
@@ 319 lines elided @@

llvm/lib/Analysis/ReplayInlineAdvisor.cpp

@@ 16 lines elided @@
 #include "llvm/IR/DebugInfoMetadata.h"
 #include "llvm/IR/Instructions.h"
 #include "llvm/Support/LineIterator.h"

 using namespace llvm;

 #define DEBUG_TYPE "inline-replay"

-ReplayInlineAdvisor::ReplayInlineAdvisor(Module &M,
-                                         FunctionAnalysisManager &FAM,
-                                         LLVMContext &Context,
-                                         StringRef RemarksFile,
-                                         bool EmitRemarks)
-    : InlineAdvisor(M, FAM), HasReplayRemarks(false), EmitRemarks(EmitRemarks) {
+ReplayInlineAdvisor::ReplayInlineAdvisor(
+    Module &M, FunctionAnalysisManager &FAM, LLVMContext &Context,
+    std::unique_ptr<InlineAdvisor> OriginalAdvisor, StringRef RemarksFile,
+    bool EmitRemarks)
+    : InlineAdvisor(M, FAM), OriginalAdvisor(std::move(OriginalAdvisor)),
+      HasReplayRemarks(false), EmitRemarks(EmitRemarks) {
   auto BufferOrErr = MemoryBuffer::getFileOrSTDIN(RemarksFile);
   std::error_code EC = BufferOrErr.getError();
   if (EC) {
     Context.emitError("Could not open remarks file: " + EC.message());
     return;
   }

   // Example for inline remarks to parse:
@@ 14 lines elided @@ for (; !LineIt.is_at_eof(); ++LineIt) {
     std::string Combined = (Callee + CallSite).str();
     InlineSitesFromRemarks.insert(Combined);
   }

   HasReplayRemarks = true;
 }

 std::unique_ptr<InlineAdvice> ReplayInlineAdvisor::getAdviceImpl(CallBase &CB) {
   assert(HasReplayRemarks);
		wenlei (Not Done): This looks redundant/similar to `DefaultInlineAdvice`; is that just for controlling `EmitRemarks`? ORE should be able to handle remark printing (or not) correctly without an extra guard.
		mtrofin (Not Done): Along the same lines as @wenlei's comment - if the Advisor can both generate and digest the trace of decisions, why rely on ORE and not, instead, use a more structured format that wouldn't need parsing like ReplayInlineAdvisor.cpp:43? Also (if using ORE is desirable, in which case I share @wenlei's question), I think there's a yaml output format ORE generates; perhaps requiring that as input would also simplify ingestion?
		modimo (Author, Not Done): I've moved this section to D94333, since the replay mechanism change to consume the new `line:col.discriminator` format needed to land together with the format change. I've folded `ReplayInlineAdvice` back into `DefaultInlineAdvice` with the additional features I need. The extra guard is needed because the SampleProfile inliner uses the "legacy PM" mechanism of inline printing rather than bundling it with InlineAdvice calls. Since the use of InlineAdvice in SampleProfile is purely to support replay right now, I'm leaving that refactoring (if we want to go after it) for the future.
		As far as using yaml, I like how condensed the format is in remarks form. Something that's a single line in remarks ends up as 24 lines in yaml (as in llvm/test/Transforms/Inline/optimization-remarks-passed-yaml.ll), which makes manual reading and modification tedious, especially on larger binaries. The current text processing is also fairly simple, which makes a change here less pressing. That being said, I'm not against using the yaml file as the official/supported format. A nice advantage there is that if we wanted to add more replay data (say, negative inline decisions) it would be smoother in yaml than adding new parsing of the text remark.
		modimo (Author, Not Done):
		> Along the same lines as @wenlei's comment - if the Advisor can both generate and digest the trace of decisions, why rely on ORE and not, instead, use a more structured format that wouldn't need parsing like ReplayInlineAdvisor.cpp:43?
		@mtrofin I very much agree on this point. My personal front-runners are a CSV file, which keeps the line density but makes parsing easy, or a tree format based on what @wenlei shows in D82213:
		Inlinees for main [P] _ZN15largesolidarrayIP6regobjEixEi @ 369 [P] _Z7random1i @ 363 [C] _Z8myrandomv @ 2 [P] _Z7random1i @ 364 [C] _Z8myrandomv @ 2 [P] _ZN15largesolidarrayIP6regobjEixEi @ 366 [P] _ZN6wayobj9createwayEiiiiRP8point16tRi @ 327 [P] _ZN6wayobj11createwayarEiiRP8point16tRi @ 37.1 [P] _ZN6wayobj5indexEii @ 143 [P] _ZN6wayobj5indexEii @ 130 [P] _ZN6wayobj6indexxEi @ 31 [P] _ZN6wayobj6indexyEi @ 32 [C] _ZN8point16tC2Ess @ 2 [C] _ZN8point16tC2Ess @ 2.1
		I do want to see what users think about the current flow, which is currently the same between the CGSCC and sample inliners, because there are definitely more refinements (additional replay accuracy, more logging, a global allow-list/block-list, etc.) that can be pursued but for which I don't have a sense of value/priority. I'm hoping that will give us more information on which path to pursue here.

Function &Caller = *CB.getCaller();		Function &Caller = *CB.getCaller();
auto &ORE = FAM.getResult<OptimizationRemarkEmitterAnalysis>(Caller);		auto &ORE = FAM.getResult<OptimizationRemarkEmitterAnalysis>(Caller);

if (InlineSitesFromRemarks.empty())		if (InlineSitesFromRemarks.empty())
return std::make_unique<DefaultInlineAdvice>(this, CB, None, ORE,		return std::make_unique<DefaultInlineAdvice>(this, CB, None, ORE,
EmitRemarks);		EmitRemarks);

Show All 13 Lines

llvm/lib/Transforms/IPO/Inliner.cpp

Show First 20 Lines • Show All 86 Lines • ▼ Show 20 Lines
/// with disabling it and relying fully on lifetime marker based stack		/// with disabling it and relying fully on lifetime marker based stack
/// coloring, you can pass this flag to LLVM.		/// coloring, you can pass this flag to LLVM.
static cl::opt<bool>		static cl::opt<bool>
DisableInlinedAllocaMerging("disable-inlined-alloca-merging",		DisableInlinedAllocaMerging("disable-inlined-alloca-merging",
cl::init(false), cl::Hidden);		cl::init(false), cl::Hidden);

extern cl::opt<InlinerFunctionImportStatsOpts> InlinerFunctionImportStats;		extern cl::opt<InlinerFunctionImportStatsOpts> InlinerFunctionImportStats;

		static cl::opt<std::string> CGSCCInlineReplayFile(
		"cgscc-inline-replay", cl::init(""), cl::value_desc("filename"),
		cl::desc(
		"Optimization remarks file containing inline remarks to be replayed "
		"by inlining from cgscc inline remarks."),
		cl::Hidden);

LegacyInlinerBase::LegacyInlinerBase(char &ID) : CallGraphSCCPass(ID) {}		LegacyInlinerBase::LegacyInlinerBase(char &ID) : CallGraphSCCPass(ID) {}

LegacyInlinerBase::LegacyInlinerBase(char &ID, bool InsertLifetime)		LegacyInlinerBase::LegacyInlinerBase(char &ID, bool InsertLifetime)
: CallGraphSCCPass(ID), InsertLifetime(InsertLifetime) {}		: CallGraphSCCPass(ID), InsertLifetime(InsertLifetime) {}

/// For this class, we declare that we require and preserve the call graph.		/// For this class, we declare that we require and preserve the call graph.
/// If the derived class implements this method, it should		/// If the derived class implements this method, it should
/// always explicitly call the implementation here.		/// always explicitly call the implementation here.
▲ Show 20 Lines • Show All 525 Lines • ▼ Show 20 Lines	for (CallGraphNode *CGN : FunctionsToRemove) {
++NumDeleted;		++NumDeleted;
}		}
return true;		return true;
}		}

InlineAdvisor &		InlineAdvisor &
InlinerPass::getAdvisor(const ModuleAnalysisManagerCGSCCProxy::Result &MAM,		InlinerPass::getAdvisor(const ModuleAnalysisManagerCGSCCProxy::Result &MAM,
FunctionAnalysisManager &FAM, Module &M) {		FunctionAnalysisManager &FAM, Module &M) {
if (OwnedDefaultAdvisor)		if (OwnedAdvisor)
return *OwnedDefaultAdvisor;		return *OwnedAdvisor;
		wenlei (Not Done): Plugging in the replay inline advisor here isn't extensible. In the future we want to be able to use inline replay only for a specific function, or enforce/prevent certain inlining at a particular callsite, and fall back to the regular advisor for the rest (see comments in D83743). That means we would need to be able to fall back from the replay advisor to the default advisor (or whatever main advisor is being used) when the replay advisor doesn't have info. For that cascaded model, we would need inline advice to have something like `hasInlineRecommendation` in addition to `isInliningRecommended`. We should probably still record inlining on each advice, but we don't want to emit duplicated remarks from each advice. These changes can come later, but the current change should offer that flexibility - we don't want to stick to the replay advisor for the entire module inliner pass.
		modimo (Author, Done): I think your suggested change is to initialize both advisors and allow fallback if we defer on one. With how the current scheme is set up, though, advisors are single entities and we only ask once: `auto Advice = Advisor.getAdvice(CB);` A proven approach to doing something like this is alias analysis, where we query AA in a specific order until we hit a real recommendation. That would require an overhaul of this function rather than a small tweak, so I don't see a good intermediate step to take for this patch. Let me know if you think there's something to do here now for it.
		wenlei (Not Done): Well, we don't have to have everything set up for the cascaded query like how AA works, but something more flexible than having the entire inliner stick to one advisor would be good (and does not seem like a significant change). What I was thinking is that the main advisor can still go through the `getAdvisor` interface; then, for inline replay, we can just let `ModuleInlinerWrapperPass` own an `ExternalInlineAdvisor`, just like how `SampleProfileLoader` owns one. It can then be passed to `InlinerPass` and serve as a short-circuit lookup or side lookup, when available, in addition to the main advisor from `getAdvisor`. The changes to add `hasInlineRecommendation` etc. are not what I'm suggesting for this patch, though I don't think these are significant either. It can evolve into cascaded advice support in the framework if needed, but if replay inline advice is the only case needing that support, generalizing it does not seem like a must-do.
		mtrofin (Not Done): +1 to what @wenlei said - you can wrap advisors in advisors, basically. There may be some helper utilities that need to be factored out in InlineAdvisor, and at one point I was thinking of doing that, but the motivating scenario at the time ended up not really needing it. If this turns out to be that scenario, I'd be happy to help!

auto *IAA = MAM.getCachedResult<InlineAdvisorAnalysis>(M);		auto *IAA = MAM.getCachedResult<InlineAdvisorAnalysis>(M);
if (!IAA) {		if (!IAA) {
// It should still be possible to run the inliner as a stand-alone SCC pass,		// It should still be possible to run the inliner as a stand-alone SCC pass,
// for test scenarios. In that case, we default to the		// for test scenarios. In that case, we default to the
// DefaultInlineAdvisor, which doesn't need to keep state between SCC pass		// DefaultInlineAdvisor, which doesn't need to keep state between SCC pass
// runs. It also uses just the default InlineParams.		// runs. It also uses just the default InlineParams.
// In this case, we need to use the provided FAM, which is valid for the		// In this case, we need to use the provided FAM, which is valid for the
// duration of the inliner pass, and thus the lifetime of the owned advisor.		// duration of the inliner pass, and thus the lifetime of the owned advisor.
// The one we would get from the MAM can be invalidated as a result of the		// The one we would get from the MAM can be invalidated as a result of the
// inliner's activity.		// inliner's activity.
OwnedDefaultAdvisor =		OwnedAdvisor =
std::make_unique<DefaultInlineAdvisor>(M, FAM, getInlineParams());		std::make_unique<DefaultInlineAdvisor>(M, FAM, getInlineParams());
return *OwnedDefaultAdvisor;
		if (!CGSCCInlineReplayFile.empty())
		OwnedAdvisor = std::make_unique<ReplayInlineAdvisor>(
		M, FAM, M.getContext(), std::move(OwnedAdvisor),
		CGSCCInlineReplayFile,
		/*EmitRemarks=*/true);

		return *OwnedAdvisor;
}		}
assert(IAA->getAdvisor() &&		assert(IAA->getAdvisor() &&
"Expected a present InlineAdvisorAnalysis also have an "		"Expected a present InlineAdvisorAnalysis also have an "
"InlineAdvisor initialized");		"InlineAdvisor initialized");
return *IAA->getAdvisor();		return *IAA->getAdvisor();
}		}
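
The cascaded model raised in the inline comments on `getAdvisor` (a replay advisor that answers when it has a record and otherwise defers to a wrapped advisor) could look roughly like the sketch below. This is a standalone illustration, not the patch's code and not LLVM's `InlineAdvisor` API: `Decision`, `Advisor`, `FixedAdvisor`, and `CascadedReplayAdvisor` are hypothetical names, and call sites are modelled as plain strings.

```cpp
#include <cassert>
#include <memory>
#include <set>
#include <string>
#include <utility>

// Hypothetical three-way result; NoOpinion plays the role of the
// "hasInlineRecommendation == false" case mentioned in the review.
enum class Decision { NoOpinion, Inline, DoNotInline };

// Hypothetical minimal advisor interface (not llvm::InlineAdvisor).
struct Advisor {
  virtual ~Advisor() = default;
  virtual Decision getAdvice(const std::string &CallSiteKey) = 0;
};

// Stand-in for the main advisor: always returns the same decision.
struct FixedAdvisor : Advisor {
  explicit FixedAdvisor(Decision D) : D(D) {}
  Decision getAdvice(const std::string &) override { return D; }
  Decision D;
};

// Replay advisor that wraps another advisor. Sites recorded in the
// replay set are inlined; everything else is forwarded to the wrapped
// advisor, or reported as NoOpinion when there is none.
class CascadedReplayAdvisor : public Advisor {
public:
  CascadedReplayAdvisor(std::set<std::string> Sites,
                        std::unique_ptr<Advisor> Original)
      : ReplaySites(std::move(Sites)), Original(std::move(Original)) {}

  Decision getAdvice(const std::string &CallSiteKey) override {
    if (ReplaySites.count(CallSiteKey))
      return Decision::Inline;
    if (Original)
      return Original->getAdvice(CallSiteKey);
    return Decision::NoOpinion;
  }

private:
  std::set<std::string> ReplaySites;
  std::unique_ptr<Advisor> Original;
};
```

Note that this differs from the behaviour in the diff, where the replay advisor owns `OriginalAdvisor` but the fallback policy is not shown in the visible hunks; the sketch only illustrates the wrapping idea the reviewers describe.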

PreservedAnalyses InlinerPass::run(LazyCallGraph::SCC &InitialC,		PreservedAnalyses InlinerPass::run(LazyCallGraph::SCC &InitialC,
▲ Show 20 Lines • Show All 333 Lines • ▼ Show 20 Lines	ModuleInlinerWrapperPass::ModuleInlinerWrapperPass(InlineParams Params,
if (MandatoryFirst)		if (MandatoryFirst)
PM.addPass(InlinerPass(/*OnlyMandatory*/ true));		PM.addPass(InlinerPass(/*OnlyMandatory*/ true));
PM.addPass(InlinerPass());		PM.addPass(InlinerPass());
}		}

PreservedAnalyses ModuleInlinerWrapperPass::run(Module &M,		PreservedAnalyses ModuleInlinerWrapperPass::run(Module &M,
ModuleAnalysisManager &MAM) {		ModuleAnalysisManager &MAM) {
auto &IAA = MAM.getResult<InlineAdvisorAnalysis>(M);		auto &IAA = MAM.getResult<InlineAdvisorAnalysis>(M);
if (!IAA.tryCreate(Params, Mode)) {		if (!IAA.tryCreate(Params, Mode, CGSCCInlineReplayFile)) {
M.getContext().emitError(		M.getContext().emitError(
"Could not setup Inlining Advisor for the requested "		"Could not setup Inlining Advisor for the requested "
"mode and/or options");		"mode and/or options");
return PreservedAnalyses::all();		return PreservedAnalyses::all();
}		}

// We wrap the CGSCC pipeline in a devirtualization repeater. This will try		// We wrap the CGSCC pipeline in a devirtualization repeater. This will try
// to detect when we devirtualize indirect calls and iterate the SCC passes		// to detect when we devirtualize indirect calls and iterate the SCC passes
Show All 15 Lines

llvm/lib/Transforms/IPO/SampleProfile.cpp

Show First 20 Lines • Show All 1,961 Lines • ▼ Show 20 Lines	bool SampleProfileLoader::doInitialization(Module &M,
if (ProfAccForSymsInList) {		if (ProfAccForSymsInList) {
NamesInProfile.clear();		NamesInProfile.clear();
if (auto NameTable = Reader->getNameTable())		if (auto NameTable = Reader->getNameTable())
NamesInProfile.insert(NameTable->begin(), NameTable->end());		NamesInProfile.insert(NameTable->begin(), NameTable->end());
}		}

if (FAM && !ProfileInlineReplayFile.empty()) {		if (FAM && !ProfileInlineReplayFile.empty()) {
ExternalInlineAdvisor = std::make_unique<ReplayInlineAdvisor>(		ExternalInlineAdvisor = std::make_unique<ReplayInlineAdvisor>(
M, FAM, Ctx, ProfileInlineReplayFile, /*EmitRemarks=*/false);		M, FAM, Ctx, /*OriginalAdvisor=*/nullptr, ProfileInlineReplayFile,
		/*EmitRemarks=*/false);
if (!ExternalInlineAdvisor->areReplayRemarksLoaded())		if (!ExternalInlineAdvisor->areReplayRemarksLoaded())
ExternalInlineAdvisor.reset();		ExternalInlineAdvisor.reset();
}		}

// Apply tweaks if context-sensitive profile is available.		// Apply tweaks if context-sensitive profile is available.
if (Reader->profileIsCS()) {		if (Reader->profileIsCS()) {
ProfileIsCS = true;		ProfileIsCS = true;
FunctionSamples::ProfileIsCS = true;		FunctionSamples::ProfileIsCS = true;
▲ Show 20 Lines • Show All 195 Lines • Show Last 20 Lines

llvm/test/Transforms/Inline/Inputs/cgscc-inline-replay.txt

This file was added.

				remark: calls.cc:10:0: _Z3sumii inlined into main with (cost=45, threshold=337) at callsite main:3:0.1;
				remark: calls.cc:4:0: _Z3subii inlined into main with (cost=-5, threshold=337) at callsite _Z3sumii:1:0 @ main:3:0.1;
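
Each remark line above carries a callee name and a call-site string, which the replay advisor concatenates into a lookup key (`Callee + CallSite` in the constructor in ReplayInlineAdvisor.cpp). The parsing loop itself is collapsed in this view, so the following is only a sketch of how such a key could be extracted, written against `std::string` rather than LLVM's `StringRef`. The helper name `replayKeyFromRemark` is hypothetical, and the exact delimiters are assumptions inferred from the two sample lines.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper, not part of the patch: builds the replay key
// (callee name immediately followed by the call-site string) from one
// text remark. Returns an empty string if the line does not match the
// assumed "<loc>: <callee> inlined into ... at callsite <site>;" shape.
std::string replayKeyFromRemark(const std::string &Line) {
  const std::string InlinedInto = " inlined into ";
  const std::string AtCallsite = " at callsite ";

  std::size_t InlinedPos = Line.find(InlinedInto);
  if (InlinedPos == std::string::npos)
    return "";

  // The callee is the text between the last ": " and " inlined into ".
  std::size_t CalleeStart = Line.rfind(": ", InlinedPos);
  if (CalleeStart == std::string::npos)
    return "";
  CalleeStart += 2;
  std::string Callee = Line.substr(CalleeStart, InlinedPos - CalleeStart);

  // The call site runs from after " at callsite " up to the ';'
  // (or to the end of the line if no ';' is present).
  std::size_t SiteStart = Line.find(AtCallsite, InlinedPos);
  if (SiteStart == std::string::npos)
    return "";
  SiteStart += AtCallsite.size();
  std::size_t SiteEnd = Line.find(';', SiteStart);
  if (SiteEnd == std::string::npos)
    SiteEnd = Line.size();
  std::string CallSite = Line.substr(SiteStart, SiteEnd - SiteStart);

  if (Callee.empty() || CallSite.empty())
    return "";
  return Callee + CallSite;
}
```

Under these assumptions, the first remark yields the key `_Z3sumiimain:3:0.1`, and the second yields `_Z3subii_Z3sumii:1:0 @ main:3:0.1`, so the full inline stack at the call site is part of the key.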

llvm/test/Transforms/Inline/cgscc-inline-replay.ll

This file was added.

				;; Note that this needs new pass manager for now. Passing `-cgscc-inline-replay` to legacy pass manager is a no-op.

				;; Check replay inline decisions
				; RUN: opt < %s -passes=inline -pass-remarks=inline -S 2>&1 | FileCheck -check-prefix=DEFAULT %s
				; RUN: opt < %s -passes=inline -cgscc-inline-replay=%S/Inputs/cgscc-inline-replay.txt -pass-remarks=inline -S 2>&1 | FileCheck -check-prefix=REPLAY %s
				wenlei (Not Done): Better to verify the inline decision without replay first, to make sure the replay has a visible impact on inlining. See the DEFAULT and REPLAY checks for SampleProfile/inline-replay.ll.
				modimo (Author, Done): Makes sense, added.

				@.str = private unnamed_addr constant [11 x i8] c"sum is %d\0A\00", align 1

				define i32 @_Z3sumii(i32 %x, i32 %y) #0 !dbg !6 {
				entry:
				%x.addr = alloca i32, align 4
				%y.addr = alloca i32, align 4
				store i32 %x, i32* %x.addr, align 4
				store i32 %y, i32* %y.addr, align 4
				%tmp = load i32, i32* %x.addr, align 4, !dbg !8
				%tmp1 = load i32, i32* %y.addr, align 4, !dbg !8
				%add = add nsw i32 %tmp, %tmp1, !dbg !8
				%tmp2 = load i32, i32* %x.addr, align 4, !dbg !8
				%tmp3 = load i32, i32* %y.addr, align 4, !dbg !8
				%call = call i32 @_Z3subii(i32 %tmp2, i32 %tmp3), !dbg !8
				ret i32 %add, !dbg !8
				}

				define i32 @_Z3subii(i32 %x, i32 %y) #0 !dbg !9 {
				entry:
				%x.addr = alloca i32, align 4
				%y.addr = alloca i32, align 4
				store i32 %x, i32* %x.addr, align 4
				store i32 %y, i32* %y.addr, align 4
				%tmp = load i32, i32* %x.addr, align 4, !dbg !10
				%tmp1 = load i32, i32* %y.addr, align 4, !dbg !10
				%add = sub nsw i32 %tmp, %tmp1, !dbg !10
				ret i32 %add, !dbg !11
				}

				define i32 @main() #0 !dbg !12 {
				entry:
				%retval = alloca i32, align 4
				%s = alloca i32, align 4
				%i = alloca i32, align 4
				store i32 0, i32* %retval
				store i32 0, i32* %i, align 4, !dbg !13
				br label %while.cond, !dbg !14

				while.cond: ; preds = %if.end, %entry
				%tmp = load i32, i32* %i, align 4, !dbg !15
				%inc = add nsw i32 %tmp, 1, !dbg !15
				store i32 %inc, i32* %i, align 4, !dbg !15
				%cmp = icmp slt i32 %tmp, 400000000, !dbg !15
				br i1 %cmp, label %while.body, label %while.end, !dbg !15

				while.body: ; preds = %while.cond
				%tmp1 = load i32, i32* %i, align 4, !dbg !17
				%cmp1 = icmp ne i32 %tmp1, 100, !dbg !17
				br i1 %cmp1, label %if.then, label %if.else, !dbg !17

				if.then: ; preds = %while.body
				%tmp2 = load i32, i32* %i, align 4, !dbg !19
				%tmp3 = load i32, i32* %s, align 4, !dbg !19
				%call = call i32 @_Z3sumii(i32 %tmp2, i32 %tmp3), !dbg !19
				store i32 %call, i32* %s, align 4, !dbg !19
				br label %if.end, !dbg !19

				if.else: ; preds = %while.body
				store i32 30, i32* %s, align 4, !dbg !21
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				br label %while.cond, !dbg !23

				while.end: ; preds = %while.cond
				%tmp4 = load i32, i32* %s, align 4, !dbg !25
				%call2 = call i32 (i8*, ...) @printf(i8* getelementptr inbounds ([11 x i8], [11 x i8]* @.str, i32 0, i32 0), i32 %tmp4), !dbg !25
				ret i32 0, !dbg !26
				}

				declare i32 @printf(i8*, ...)

				attributes #0 = { "use-sample-profile" }

				!llvm.dbg.cu = !{!0}
				!llvm.module.flags = !{!3, !4}
				!llvm.ident = !{!5}

				!0 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus, file: !1, producer: "clang version 3.5 ", isOptimized: false, runtimeVersion: 0, emissionKind: NoDebug, enums: !2, retainedTypes: !2, globals: !2, imports: !2)
				!1 = !DIFile(filename: "calls.cc", directory: ".")
				!2 = !{}
				!3 = !{i32 2, !"Dwarf Version", i32 4}
				!4 = !{i32 1, !"Debug Info Version", i32 3}
				!5 = !{!"clang version 3.5 "}
				!6 = distinct !DISubprogram(name: "sum", linkageName: "_Z3sumii", scope: !1, file: !1, line: 3, type: !7, scopeLine: 3, virtualIndex: 6, flags: DIFlagPrototyped, spFlags: DISPFlagDefinition, unit: !0, retainedNodes: !2)
				!7 = !DISubroutineType(types: !2)
				!8 = !DILocation(line: 4, scope: !6)
				!9 = distinct !DISubprogram(name: "sub", linkageName: "_Z3subii", scope: !1, file: !1, line: 20, type: !7, scopeLine: 20, virtualIndex: 6, flags: DIFlagPrototyped, spFlags: DISPFlagDefinition, unit: !0, retainedNodes: !2)
				!10 = !DILocation(line: 20, scope: !9)
				!11 = !DILocation(line: 21, scope: !9)
				!12 = distinct !DISubprogram(name: "main", scope: !1, file: !1, line: 7, type: !7, scopeLine: 7, virtualIndex: 6, flags: DIFlagPrototyped, spFlags: DISPFlagDefinition, unit: !0, retainedNodes: !2)
				!13 = !DILocation(line: 8, scope: !12)
				!14 = !DILocation(line: 9, scope: !12)
				!15 = !DILocation(line: 9, scope: !16)
				!16 = !DILexicalBlockFile(scope: !12, file: !1, discriminator: 2)
				!17 = !DILocation(line: 10, scope: !18)
				!18 = distinct !DILexicalBlock(scope: !12, file: !1, line: 10)
				!19 = !DILocation(line: 10, scope: !20)
				!20 = !DILexicalBlockFile(scope: !18, file: !1, discriminator: 2)
				!21 = !DILocation(line: 10, scope: !22)
				!22 = !DILexicalBlockFile(scope: !18, file: !1, discriminator: 4)
				!23 = !DILocation(line: 10, scope: !24)
				!24 = !DILexicalBlockFile(scope: !18, file: !1, discriminator: 6)
				!25 = !DILocation(line: 11, scope: !12)
				!26 = !DILocation(line: 12, scope: !12)

				; DEFAULT: _Z3subii inlined into _Z3sumii
				; DEFAULT: _Z3sumii inlined into main
				; DEFAULT-NOT: _Z3subii inlined into main

				; REPLAY: _Z3sumii inlined into main
				; REPLAY: _Z3subii inlined into main
				; REPLAY-NOT: _Z3subii inlined into _Z3sumii

Diff 319147
