This is an archive of the discontinued LLVM Phabricator instance.

Refactor inline costs analysis by removing the InlineCostAnalysis class
ClosedPublic

Authored by eraman on Dec 21 2015, 1:26 PM.

Details

Reviewers

Commits

rGb9f7120e7a4c: Refactor inline costs analysis by removing the InlineCostAnalysis class
rL256521: Refactor inline costs analysis by removing the InlineCostAnalysis class

Summary

InlineCostAnalysis is an analysis pass without any need for it to be one. Once it stops being an analysis pass, it doesn't maintain any useful state and the member functions inside can be made free functions. NFC.

(Note: InlineCost.h has the following comment for getInlineCost, a member function of InlineCostAnalysis:

// Note: This is used by out-of-tree passes, please do not remove without
// adding a replacement API.

I am not sure what this means, but this refactoring makes this into a free function with extra parameters)
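The shape of this refactoring can be illustrated with a minimal, self-contained sketch. It does not use real LLVM classes: `FakeTTI`, `FakeACT`, `CostAnalysisBefore`, and the simplified `getCostDelta` are hypothetical stand-ins. They are only meant to show a member function that depends on cached pointers turning into a free function that takes those dependencies as extra parameters.

```cpp
#include <cassert>

// Hypothetical stand-ins for TargetTransformInfo and AssumptionCacheTracker.
struct FakeTTI { int InstrCost; };
struct FakeACT {};

// Before: the query is a member of a class whose only state is two pointers.
class CostAnalysisBefore {
  FakeTTI *TTI;
  FakeACT *ACT;

public:
  CostAnalysisBefore(FakeTTI *TTI, FakeACT *ACT) : TTI(TTI), ACT(ACT) {}

  int getCostDelta(int NumInstrs, int Threshold) const {
    (void)ACT; // unused in this toy cost model
    return Threshold - NumInstrs * TTI->InstrCost;
  }
};

// After: a free function; the former members become explicit parameters.
int getCostDelta(int NumInstrs, int Threshold, FakeTTI &TTI, FakeACT *ACT) {
  (void)ACT; // unused in this toy cost model
  return Threshold - NumInstrs * TTI.InstrCost;
}
```

Both forms compute the same value for the same inputs, which is what "NFC" (no functional change) is meant to convey.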

Diff Detail

Repository: rL LLVM

Event Timeline

eraman updated this revision to Diff 43397. Dec 21 2015, 1:26 PM

eraman retitled this revision from to Refactor inline costs analysis by removing the InlineCostAnalysis class.

eraman updated this object.

eraman added a reviewer: chandlerc.

eraman set the repository for this revision to rL LLVM.

eraman added subscribers: llvm-commits, davidxl.

Thanks Easwaran, this seems like a really nice clean up. Just one minor fix needed below...

lib/Transforms/IPO/InlineSimple.cpp
57–59	These are actually expensive operations. Instead of doing this once per getInlineCost, it would be better to cache them in members of the actual pass (either the SimpleInliner or Inliner depending on what is cleanest).
108–111	Hmm, this makes me believe that at least the base class should manage calling 'getAnalysis' for AssumptionCacheTracker, and probably this class should do so for TargetTransformInfo...

eraman added inline comments. Dec 21 2015, 3:24 PM

lib/Transforms/IPO/InlineSimple.cpp
57–59	Ok.
108–111	Not fully sure what you mean here. The base class calls getAnalysis for AssumptionCacheTracker, and you want to cache the value in the base class so that it could be used in derived class's InlineCost?

chandlerc added inline comments. Dec 21 2015, 3:49 PM

lib/Transforms/IPO/InlineSimple.cpp
108–111	Yea, just make a protected member that the derived class can access when calling getInlineCost so it doesn't have to call getAnalysis again. (The fact that this is somewhat magical is one of many reasons why I don't really like the pattern SimpleInliner and AlwaysInliner use of subclassing a fully formed pass, but I think that's a much bigger refactoring yak to shave.)
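The suggestion above can be sketched as follows. This is a toy model, not LLVM code: `FakeAssumptionTracker`, `lookupTracker`, `InlinerBase`, and `SimpleInlinerSketch` are hypothetical names, and `lookupTracker` merely stands in for an expensive `getAnalysis` call. The base class performs the lookup once per run and stores the result in a protected member that the derived class reads.

```cpp
#include <cassert>

// Hypothetical stand-in for an analysis result such as AssumptionCacheTracker.
struct FakeAssumptionTracker { int Id; };

// Counts lookups so the effect of caching is observable.
static int NumLookups = 0;
static FakeAssumptionTracker GlobalTracker{7};

// Stand-in for an expensive getAnalysis<...>() call.
FakeAssumptionTracker &lookupTracker() {
  ++NumLookups;
  return GlobalTracker;
}

class InlinerBase {
protected:
  // Assigned at the start of every runOnSCC, before any derived-class use.
  FakeAssumptionTracker *ACT;

public:
  virtual ~InlinerBase() = default;
  virtual int getInlineCost(int CallSite) = 0;

  int runOnSCC(int NumCallSites) {
    ACT = &lookupTracker(); // one lookup per run, not one per call site
    int Total = 0;
    for (int CS = 0; CS < NumCallSites; ++CS)
      Total += getInlineCost(CS);
    return Total;
  }
};

class SimpleInlinerSketch : public InlinerBase {
public:
  // Reads the cached pointer instead of repeating the lookup.
  int getInlineCost(int CallSite) override { return CallSite + ACT->Id; }
};
```

With three call sites the lookup still happens only once; the per-call-site work uses the cached pointer.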

eraman updated this revision to Diff 43425. Dec 21 2015, 5:33 PM

eraman edited edge metadata.

Mostly nits...

lib/Analysis/InlineCost.cpp
1322	Rather than putting all of this in the llvm namespace, you can just use the qualified function name "llvm::getInlineCost" in the definition.
lib/Transforms/IPO/InlineSimple.cpp
55–62	Why add this code?
68	You can actually make TTI the member instead of the wrapper pass.
103–104	You want to replace this on every run rather than checking it on each run. Look for other places where we cache a TTI pointer. You should also leave it uninitialized in the constructor so we get an MSan error if we reach code without the run call happening first.
lib/Transforms/IPO/Inliner.cpp
477–478 ↗	(On Diff #43425)	Same comment as above, just overwrite the pointer each time.
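The first nit in this round (defining the function with a qualified name rather than reopening the namespace) can be shown with a small sketch. The namespace `sketch` and the two-argument `getInlineCost` here are hypothetical and much simpler than the real API.

```cpp
#include <cassert>

// Declaration, as it would appear in a header.
namespace sketch {
int getInlineCost(int Cost, int Threshold);
}

// Definition in the .cpp file using the qualified name. No surrounding
// "namespace sketch { ... }" block is needed.
int sketch::getInlineCost(int Cost, int Threshold) {
  return Threshold - Cost;
}
```

One general C++ property worth noting: a qualified definition must match an existing declaration, so a signature mismatch is a compile error. Inside a reopened namespace the same mistake would quietly declare a new overload.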

eraman marked an inline comment as done. Dec 22 2015, 10:56 AM

eraman added inline comments.

lib/Transforms/IPO/InlineSimple.cpp
55–62	Originally, there was an if (Callee) guard before calling getTTI, but Callee can never be NULL here. I agree that assert(Callee) is not very useful and will remove this.
68	I'm confused. Isn't the returned TTI dependent on the function? If I make TTI a member, how do I get the TTI for the callee in getInlineCost without calling getAnalysis on the TTI wrapper pass?
103–104	I hadn't thought about the MSan interaction. It now makes sense to remove the initialization, which means I can't do the if (!TTIWP) check. I am curious whether there is any other reason to avoid this pattern.

chandlerc added inline comments. Dec 22 2015, 11:37 AM

lib/Transforms/IPO/InlineSimple.cpp
68	Quite right. Sorry for the noise!
103–104	It doesn't happen much in-tree, but the pass manager supports running multiple modules through a single pass instance, and it is conceivable that they would have different TTI wrapper passes. Not likely, and not practically reproducible, but the pattern is designed to handle cases where the analysis might not be valid to cache from run to run.
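A rough sketch of the difference between the two caching patterns discussed for lines 103–104. Everything here is hypothetical (`FakeTTIWrapper`, `CachingPassSketch`, `StaleCachePassSketch`); it only models one pass instance being run against two different wrapper objects.

```cpp
#include <cassert>

// Hypothetical stand-in for TargetTransformInfoWrapperPass.
struct FakeTTIWrapper { int ModuleId; };

// Recommended pattern: overwrite the cached pointer on every run.
class CachingPassSketch {
  // Deliberately no initializer, so a use before run() reads uninitialized
  // memory that a tool such as MSan can report.
  FakeTTIWrapper *TTIWP;

public:
  int run(FakeTTIWrapper &Current) {
    TTIWP = &Current; // unconditional overwrite
    return TTIWP->ModuleId;
  }
};

// Discouraged pattern: initialize to null and fill in only once.
class StaleCachePassSketch {
  FakeTTIWrapper *TTIWP = nullptr;

public:
  int run(FakeTTIWrapper &Current) {
    if (!TTIWP)
      TTIWP = &Current; // the second run keeps the first run's pointer
    return TTIWP->ModuleId;
  }
};
```

When the same instance is run with a second wrapper, the first class picks up the new one while the second keeps returning data from the stale pointer.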

Address Chandler's comments

This LGTM, feel free to submit with the argument naming tweak below.

However, this naming tweak raises an interesting question for me. This should *not* be addressed in this patch, but it might be a good thing to put on your queue.

We are passing the *callee* TTI into the inline cost analysis. That doesn't make a lot of sense to me. We're inlining into the *caller*. If there is something *incompatible* about the callee, we should refuse to inline it. If they are compatible, the *caller's* TTI should become dominant. As an example, if we have one function which is marked as optimized for size, but we inline it into code that is not optimized for size, I would expect the inlined body to *not* be optimized for size. I would generally expect the TTI of the call site to determine the cost analysis for inlining because after inlining, the callee is gone. Does that make sense? That will be a non-trivial functional change, so you'll want to benchmark it etc before making it.

Relatedly, I know you're looking heavily at getting our nested call site analysis to be accurate. You should make sure we're considering the possibility of incompatible function attributes that might prevent inlining entirely. I don't think we currently model that correctly, in that the cost analysis skips it but then the inliner itself considers it.

include/llvm/Analysis/InlineCost.h
112–124	I would clarify that this is currently the callee's TTI, maybe just by naming it "CalleeTTI".

This revision is now accepted and ready to land. Dec 28 2015, 9:50 AM

In D15701#317434, @chandlerc wrote:

This LGTM, feel free to submit with the argument naming tweak below.

However, this naming tweak raises an interesting question for me. This should *not* be addressed in this patch, but it might be a good thing to put on your queue.

We are passing the *callee* TTI into the inline cost analysis. That doesn't make a lot of sense to me. We're inlining into the *caller*. If there is something *incompatible* about the callee, we should refuse to inline it. If they are compatible, the *caller's* TTI should become dominant. As an example, if we have one function which is marked as optimized for size, but we inline it into code that is not optimized for size, I would expect the inlined body to *not* be optimized for size. I would generally expect the TTI of the call site to determine the cost analysis for inlining because after inlining, the callee is gone. Does that make sense?

Yes. We will have to pass both the caller and callee TTIs when I get to implementing estimated speedup that requires computing the cost of the inlined and uninlined versions of the callee.

That will be a non-trivial functional change, so you'll want to benchmark it etc before making it.

Ok.

Relatedly, I know you're looking heavily at getting our nested call site analysis to be accurate. You should make sure we're considering the possibility of incompatible function attributes that might prevent inlining entirely. I don't think we currently model that correctly, in that the cost analysis skips it but then the inliner itself considers it.

Sure.

Closed by commit rL256521: Refactor inline costs analysis by removing the InlineCostAnalysis class (authored by eraman). Dec 28 2015, 12:31 PM

This revision was automatically updated to reflect the committed changes.

jevinskie added a subscriber: jevinskie. Dec 28 2015, 12:41 PM

Chandler, do you have more comments?

Revision Contents

Path	Size
include/llvm/Analysis/InlineCost.h	43 lines
include/llvm/InitializePasses.h	1 line
lib/Analysis/InlineCost.cpp	45 lines
lib/Transforms/IPO/InlineAlways.cpp	23 lines
lib/Transforms/IPO/InlineSimple.cpp	24 lines

Diff 43397

include/llvm/Analysis/InlineCost.h

Show All 17 Lines
#include <cassert>		#include <cassert>
#include <climits>		#include <climits>

namespace llvm {		namespace llvm {
class AssumptionCacheTracker;		class AssumptionCacheTracker;
class CallSite;		class CallSite;
class DataLayout;		class DataLayout;
class Function;		class Function;
class TargetTransformInfoWrapperPass;		class TargetTransformInfo;

namespace InlineConstants {		namespace InlineConstants {
// Various magic constants used to adjust heuristics.		// Various magic constants used to adjust heuristics.
const int InstrCost = 5;		const int InstrCost = 5;
const int IndirectCallThreshold = 100;		const int IndirectCallThreshold = 100;
const int CallPenalty = 25;		const int CallPenalty = 25;
const int LastCallToStaticBonus = -15000;		const int LastCallToStaticBonus = -15000;
const int ColdccPenalty = 2000;		const int ColdccPenalty = 2000;
▲ Show 20 Lines • Show All 58 Lines • ▼ Show 20 Lines	public:
}		}

/// \brief Get the cost delta from the threshold for inlining.		/// \brief Get the cost delta from the threshold for inlining.
/// Only valid if the cost is of the variable kind. Returns a negative		/// Only valid if the cost is of the variable kind. Returns a negative
/// value if the cost is too high to inline.		/// value if the cost is too high to inline.
int getCostDelta() const { return Threshold - getCost(); }		int getCostDelta() const { return Threshold - getCost(); }
};		};

/// \brief Cost analyzer used by inliner.
class InlineCostAnalysis : public CallGraphSCCPass {
TargetTransformInfoWrapperPass *TTIWP;
AssumptionCacheTracker *ACT;

public:
static char ID;

InlineCostAnalysis();
~InlineCostAnalysis() override;

// Pass interface implementation.
void getAnalysisUsage(AnalysisUsage &AU) const override;
bool runOnSCC(CallGraphSCC &SCC) override;

/// \brief Get an InlineCost object representing the cost of inlining this		/// \brief Get an InlineCost object representing the cost of inlining this
/// callsite.		/// callsite.
///		///
/// Note that threshold is passed into this function. Only costs below the		/// Note that threshold is passed into this function. Only costs below the
/// threshold are computed with any accuracy. The threshold can be used to		/// threshold are computed with any accuracy. The threshold can be used to
/// bound the computation necessary to determine whether the cost is		/// bound the computation necessary to determine whether the cost is
/// sufficiently low to warrant inlining.		/// sufficiently low to warrant inlining.
///		///
/// Also note that calling this function dynamically computes the cost of		/// Also note that calling this function dynamically computes the cost of
/// inlining the callsite. It is an expensive, heavyweight call.		/// inlining the callsite. It is an expensive, heavyweight call.
InlineCost getInlineCost(CallSite CS, int Threshold);		InlineCost getInlineCost(CallSite CS, int Threshold, TargetTransformInfo &TTI,
		AssumptionCacheTracker *ACT);

/// \brief Get an InlineCost with the callee explicitly specified.		/// \brief Get an InlineCost with the callee explicitly specified.
/// This allows you to calculate the cost of inlining a function via a		/// This allows you to calculate the cost of inlining a function via a
/// pointer. This behaves exactly as the version with no explicit callee		/// pointer. This behaves exactly as the version with no explicit callee
/// parameter in all other respects.		/// parameter in all other respects.
//		//
// Note: This is used by out-of-tree passes, please do not remove without		InlineCost getInlineCost(CallSite CS, Function *Callee, int Threshold,
// adding a replacement API.		TargetTransformInfo &TTI, AssumptionCacheTracker *ACT);
InlineCost getInlineCost(CallSite CS, Function *Callee, int Threshold);

/// \brief Minimal filter to detect invalid constructs for inlining.		/// \brief Minimal filter to detect invalid constructs for inlining.
bool isInlineViable(Function &Callee);		bool isInlineViable(Function &Callee);
};

}		}

#endif		#endif

include/llvm/InitializePasses.h

	Show First 20 Lines • Show All 138 Lines • ▼ Show 20 Lines
	void initializeGlobalOptPass(PassRegistry&);			void initializeGlobalOptPass(PassRegistry&);
	void initializeGlobalsAAWrapperPassPass(PassRegistry&);			void initializeGlobalsAAWrapperPassPass(PassRegistry&);
	void initializeIPCPPass(PassRegistry&);			void initializeIPCPPass(PassRegistry&);
	void initializeIPSCCPPass(PassRegistry&);			void initializeIPSCCPPass(PassRegistry&);
	void initializeIVUsersPass(PassRegistry&);			void initializeIVUsersPass(PassRegistry&);
	void initializeIfConverterPass(PassRegistry&);			void initializeIfConverterPass(PassRegistry&);
	void initializeInductiveRangeCheckEliminationPass(PassRegistry&);			void initializeInductiveRangeCheckEliminationPass(PassRegistry&);
	void initializeIndVarSimplifyPass(PassRegistry&);			void initializeIndVarSimplifyPass(PassRegistry&);
	void initializeInlineCostAnalysisPass(PassRegistry&);
	void initializeInstructionCombiningPassPass(PassRegistry&);			void initializeInstructionCombiningPassPass(PassRegistry&);
	void initializeInstCountPass(PassRegistry&);			void initializeInstCountPass(PassRegistry&);
	void initializeInstNamerPass(PassRegistry&);			void initializeInstNamerPass(PassRegistry&);
	void initializeInternalizePassPass(PassRegistry&);			void initializeInternalizePassPass(PassRegistry&);
	void initializeIntervalPartitionPass(PassRegistry&);			void initializeIntervalPartitionPass(PassRegistry&);
	void initializeJumpThreadingPass(PassRegistry&);			void initializeJumpThreadingPass(PassRegistry&);
	void initializeLCSSAPass(PassRegistry&);			void initializeLCSSAPass(PassRegistry&);
	void initializeLICMPass(PassRegistry&);			void initializeLICMPass(PassRegistry&);
	▲ Show 20 Lines • Show All 157 Lines • Show Last 20 Lines

lib/Analysis/InlineCost.cpp

Show First 20 Lines • Show All 1,313 Lines • ▼ Show 20 Lines	#define DEBUG_PRINT_STAT(x) dbgs() << " " #x ": " << x << "\n"
DEBUG_PRINT_STAT(SROACostSavingsLost);		DEBUG_PRINT_STAT(SROACostSavingsLost);
DEBUG_PRINT_STAT(ContainsNoDuplicateCall);		DEBUG_PRINT_STAT(ContainsNoDuplicateCall);
DEBUG_PRINT_STAT(Cost);		DEBUG_PRINT_STAT(Cost);
DEBUG_PRINT_STAT(Threshold);		DEBUG_PRINT_STAT(Threshold);
#undef DEBUG_PRINT_STAT		#undef DEBUG_PRINT_STAT
}		}
#endif		#endif

INITIALIZE_PASS_BEGIN(InlineCostAnalysis, "inline-cost", "Inline Cost Analysis",		namespace llvm {
true, true)		InlineCost getInlineCost(CallSite CS, int Threshold, TargetTransformInfo &TTI,
INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)		AssumptionCacheTracker *ACT) {
INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)		return getInlineCost(CS, CS.getCalledFunction(), Threshold, TTI, ACT);
INITIALIZE_PASS_END(InlineCostAnalysis, "inline-cost", "Inline Cost Analysis",
true, true)

char InlineCostAnalysis::ID = 0;

InlineCostAnalysis::InlineCostAnalysis() : CallGraphSCCPass(ID) {}

InlineCostAnalysis::~InlineCostAnalysis() {}

void InlineCostAnalysis::getAnalysisUsage(AnalysisUsage &AU) const {
AU.setPreservesAll();
AU.addRequired<AssumptionCacheTracker>();
AU.addRequired<TargetTransformInfoWrapperPass>();
CallGraphSCCPass::getAnalysisUsage(AU);
}

bool InlineCostAnalysis::runOnSCC(CallGraphSCC &SCC) {
TTIWP = &getAnalysis<TargetTransformInfoWrapperPass>();
ACT = &getAnalysis<AssumptionCacheTracker>();
return false;
}

InlineCost InlineCostAnalysis::getInlineCost(CallSite CS, int Threshold) {
return getInlineCost(CS, CS.getCalledFunction(), Threshold);
}		}

/// \brief Test that two functions either have or have not the given attribute		/// \brief Test that two functions either have or have not the given attribute
/// at the same time.		/// at the same time.
template<typename AttrKind>		template<typename AttrKind>
static bool attributeMatches(Function *F1, Function *F2, AttrKind Attr) {		static bool attributeMatches(Function *F1, Function *F2, AttrKind Attr) {
return F1->getFnAttribute(Attr) == F2->getFnAttribute(Attr);		return F1->getFnAttribute(Attr) == F2->getFnAttribute(Attr);
}		}

/// \brief Test that there are no attribute conflicts between Caller and Callee		/// \brief Test that there are no attribute conflicts between Caller and Callee
/// that prevent inlining.		/// that prevent inlining.
static bool functionsHaveCompatibleAttributes(Function *Caller,		static bool functionsHaveCompatibleAttributes(Function *Caller,
Function *Callee,		Function *Callee,
TargetTransformInfo &TTI) {		TargetTransformInfo &TTI) {
return TTI.areInlineCompatible(Caller, Callee) &&		return TTI.areInlineCompatible(Caller, Callee) &&
attributeMatches(Caller, Callee, Attribute::SanitizeAddress) &&		attributeMatches(Caller, Callee, Attribute::SanitizeAddress) &&
attributeMatches(Caller, Callee, Attribute::SanitizeMemory) &&		attributeMatches(Caller, Callee, Attribute::SanitizeMemory) &&
attributeMatches(Caller, Callee, Attribute::SanitizeThread);		attributeMatches(Caller, Callee, Attribute::SanitizeThread);
}		}

InlineCost InlineCostAnalysis::getInlineCost(CallSite CS, Function *Callee,		InlineCost getInlineCost(CallSite CS, Function *Callee, int Threshold,
int Threshold) {		TargetTransformInfo &TTI,
		AssumptionCacheTracker *ACT) {
// Cannot inline indirect calls.		// Cannot inline indirect calls.
if (!Callee)		if (!Callee)
return llvm::InlineCost::getNever();		return llvm::InlineCost::getNever();

// Calls to functions with always-inline attributes should be inlined		// Calls to functions with always-inline attributes should be inlined
// whenever possible.		// whenever possible.
if (CS.hasFnAttr(Attribute::AlwaysInline)) {		if (CS.hasFnAttr(Attribute::AlwaysInline)) {
if (isInlineViable(*Callee))		if (isInlineViable(*Callee))
return llvm::InlineCost::getAlways();		return llvm::InlineCost::getAlways();
return llvm::InlineCost::getNever();		return llvm::InlineCost::getNever();
}		}

// Never inline functions with conflicting attributes (unless callee has		// Never inline functions with conflicting attributes (unless callee has
// always-inline attribute).		// always-inline attribute).
if (!functionsHaveCompatibleAttributes(CS.getCaller(), Callee,		if (!functionsHaveCompatibleAttributes(CS.getCaller(), Callee, TTI))
TTIWP->getTTI(*Callee)))
return llvm::InlineCost::getNever();		return llvm::InlineCost::getNever();

// Don't inline this call if the caller has the optnone attribute.		// Don't inline this call if the caller has the optnone attribute.
if (CS.getCaller()->hasFnAttribute(Attribute::OptimizeNone))		if (CS.getCaller()->hasFnAttribute(Attribute::OptimizeNone))
return llvm::InlineCost::getNever();		return llvm::InlineCost::getNever();

// Don't inline functions which can be redefined at link-time to mean		// Don't inline functions which can be redefined at link-time to mean
// something else. Don't inline functions marked noinline or call sites		// something else. Don't inline functions marked noinline or call sites
// marked noinline.		// marked noinline.
if (Callee->mayBeOverridden() ||		if (Callee->mayBeOverridden() ||
Callee->hasFnAttribute(Attribute::NoInline) || CS.isNoInline())		Callee->hasFnAttribute(Attribute::NoInline) || CS.isNoInline())
return llvm::InlineCost::getNever();		return llvm::InlineCost::getNever();

DEBUG(llvm::dbgs() << " Analyzing call of " << Callee->getName()		DEBUG(llvm::dbgs() << " Analyzing call of " << Callee->getName()
<< "...\n");		<< "...\n");

CallAnalyzer CA(TTIWP->getTTI(*Callee), ACT, *Callee, Threshold, CS);		CallAnalyzer CA(TTI, ACT, *Callee, Threshold, CS);
bool ShouldInline = CA.analyzeCall(CS);		bool ShouldInline = CA.analyzeCall(CS);

DEBUG(CA.dump());		DEBUG(CA.dump());

// Check if there was a reason to force inlining or no inlining.		// Check if there was a reason to force inlining or no inlining.
if (!ShouldInline && CA.getCost() < CA.getThreshold())		if (!ShouldInline && CA.getCost() < CA.getThreshold())
return InlineCost::getNever();		return InlineCost::getNever();
if (ShouldInline && CA.getCost() >= CA.getThreshold())		if (ShouldInline && CA.getCost() >= CA.getThreshold())
return InlineCost::getAlways();		return InlineCost::getAlways();

return llvm::InlineCost::get(CA.getCost(), CA.getThreshold());		return llvm::InlineCost::get(CA.getCost(), CA.getThreshold());
}		}

bool InlineCostAnalysis::isInlineViable(Function &F) {		bool isInlineViable(Function &F) {
bool ReturnsTwice = F.hasFnAttribute(Attribute::ReturnsTwice);		bool ReturnsTwice = F.hasFnAttribute(Attribute::ReturnsTwice);
for (Function::iterator BI = F.begin(), BE = F.end(); BI != BE; ++BI) {		for (Function::iterator BI = F.begin(), BE = F.end(); BI != BE; ++BI) {
// Disallow inlining of functions which contain indirect branches or		// Disallow inlining of functions which contain indirect branches or
// blockaddresses.		// blockaddresses.
if (isa<IndirectBrInst>(BI->getTerminator()) || BI->hasAddressTaken())		if (isa<IndirectBrInst>(BI->getTerminator()) || BI->hasAddressTaken())
return false;		return false;

for (auto &II : *BI) {		for (auto &II : *BI) {
Show All 17 Lines	for (auto &II : *BI) {
CS.getCalledFunction()->getIntrinsicID() ==		CS.getCalledFunction()->getIntrinsicID() ==
llvm::Intrinsic::localescape)		llvm::Intrinsic::localescape)
return false;		return false;
}		}
}		}

return true;		return true;
}		}
		}

lib/Transforms/IPO/InlineAlways.cpp

	Show All 29 Lines
	using namespace llvm;			using namespace llvm;

	#define DEBUG_TYPE "inline"			#define DEBUG_TYPE "inline"

	namespace {			namespace {

	/// \brief Inliner pass which only handles "always inline" functions.			/// \brief Inliner pass which only handles "always inline" functions.
	class AlwaysInliner : public Inliner {			class AlwaysInliner : public Inliner {
	InlineCostAnalysis *ICA;

	public:			public:
	// Use extremely low threshold.			// Use extremely low threshold.
	AlwaysInliner() : Inliner(ID, -2000000000, /*InsertLifetime*/ true),			AlwaysInliner() : Inliner(ID, -2000000000, /*InsertLifetime*/ true) {
	ICA(nullptr) {
	initializeAlwaysInlinerPass(*PassRegistry::getPassRegistry());			initializeAlwaysInlinerPass(*PassRegistry::getPassRegistry());
	}			}

	AlwaysInliner(bool InsertLifetime)			AlwaysInliner(bool InsertLifetime)
	: Inliner(ID, -2000000000, InsertLifetime), ICA(nullptr) {			: Inliner(ID, -2000000000, InsertLifetime) {
	initializeAlwaysInlinerPass(*PassRegistry::getPassRegistry());			initializeAlwaysInlinerPass(*PassRegistry::getPassRegistry());
	}			}

	static char ID; // Pass identification, replacement for typeid			static char ID; // Pass identification, replacement for typeid

	InlineCost getInlineCost(CallSite CS) override;			InlineCost getInlineCost(CallSite CS) override;

	void getAnalysisUsage(AnalysisUsage &AU) const override;
	bool runOnSCC(CallGraphSCC &SCC) override;

	using llvm::Pass::doFinalization;			using llvm::Pass::doFinalization;
	bool doFinalization(CallGraph &CG) override {			bool doFinalization(CallGraph &CG) override {
	return removeDeadFunctions(CG, /*AlwaysInlineOnly=*/ true);			return removeDeadFunctions(CG, /*AlwaysInlineOnly=*/ true);
	}			}
	};			};

	}			}

	char AlwaysInliner::ID = 0;			char AlwaysInliner::ID = 0;
	INITIALIZE_PASS_BEGIN(AlwaysInliner, "always-inline",			INITIALIZE_PASS_BEGIN(AlwaysInliner, "always-inline",
	"Inliner for always_inline functions", false, false)			"Inliner for always_inline functions", false, false)
	INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)			INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)
	INITIALIZE_PASS_DEPENDENCY(CallGraphWrapperPass)			INITIALIZE_PASS_DEPENDENCY(CallGraphWrapperPass)
	INITIALIZE_PASS_DEPENDENCY(InlineCostAnalysis)
	INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)			INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)
	INITIALIZE_PASS_END(AlwaysInliner, "always-inline",			INITIALIZE_PASS_END(AlwaysInliner, "always-inline",
	"Inliner for always_inline functions", false, false)			"Inliner for always_inline functions", false, false)

	Pass *llvm::createAlwaysInlinerPass() { return new AlwaysInliner(); }			Pass *llvm::createAlwaysInlinerPass() { return new AlwaysInliner(); }

	Pass *llvm::createAlwaysInlinerPass(bool InsertLifetime) {			Pass *llvm::createAlwaysInlinerPass(bool InsertLifetime) {
	return new AlwaysInliner(InsertLifetime);			return new AlwaysInliner(InsertLifetime);
	Show All 13 Lines
	/// likely not worth it in practice.			/// likely not worth it in practice.
	InlineCost AlwaysInliner::getInlineCost(CallSite CS) {			InlineCost AlwaysInliner::getInlineCost(CallSite CS) {
	Function *Callee = CS.getCalledFunction();			Function *Callee = CS.getCalledFunction();

	// Only inline direct calls to functions with always-inline attributes			// Only inline direct calls to functions with always-inline attributes
	// that are viable for inlining. FIXME: We shouldn't even get here for			// that are viable for inlining. FIXME: We shouldn't even get here for
	// declarations.			// declarations.
	if (Callee && !Callee->isDeclaration() &&			if (Callee && !Callee->isDeclaration() &&
	CS.hasFnAttr(Attribute::AlwaysInline) &&			CS.hasFnAttr(Attribute::AlwaysInline) && isInlineViable(*Callee))
	ICA->isInlineViable(*Callee))
	return InlineCost::getAlways();			return InlineCost::getAlways();

	return InlineCost::getNever();			return InlineCost::getNever();
	}			}

	bool AlwaysInliner::runOnSCC(CallGraphSCC &SCC) {
	ICA = &getAnalysis<InlineCostAnalysis>();
	return Inliner::runOnSCC(SCC);
	}

	void AlwaysInliner::getAnalysisUsage(AnalysisUsage &AU) const {
	AU.addRequired<InlineCostAnalysis>();
	Inliner::getAnalysisUsage(AU);
	}

lib/Transforms/IPO/InlineSimple.cpp

	//===- InlineSimple.cpp - Code to perform simple function inlining --------===//			//===- InlineSimple.cpp - Code to perform simple function inlining --------===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file implements bottom-up inlining of functions into callees.			// This file implements bottom-up inlining of functions into callees.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "llvm/Transforms/IPO.h"
	#include "llvm/Analysis/AssumptionCache.h"			#include "llvm/Analysis/AssumptionCache.h"
	#include "llvm/Analysis/CallGraph.h"			#include "llvm/Analysis/CallGraph.h"
	#include "llvm/Analysis/InlineCost.h"			#include "llvm/Analysis/InlineCost.h"
	#include "llvm/Analysis/TargetLibraryInfo.h"			#include "llvm/Analysis/TargetLibraryInfo.h"
				#include "llvm/Analysis/TargetTransformInfo.h"
	#include "llvm/IR/CallSite.h"			#include "llvm/IR/CallSite.h"
	#include "llvm/IR/CallingConv.h"			#include "llvm/IR/CallingConv.h"
	#include "llvm/IR/DataLayout.h"			#include "llvm/IR/DataLayout.h"
	#include "llvm/IR/Instructions.h"			#include "llvm/IR/Instructions.h"
	#include "llvm/IR/IntrinsicInst.h"			#include "llvm/IR/IntrinsicInst.h"
	#include "llvm/IR/Module.h"			#include "llvm/IR/Module.h"
	#include "llvm/IR/Type.h"			#include "llvm/IR/Type.h"
				#include "llvm/Transforms/IPO.h"
	#include "llvm/Transforms/IPO/InlinerPass.h"			#include "llvm/Transforms/IPO/InlinerPass.h"

	using namespace llvm;			using namespace llvm;

	#define DEBUG_TYPE "inline"			#define DEBUG_TYPE "inline"

	namespace {			namespace {

	/// \brief Actual inliner pass implementation.			/// \brief Actual inliner pass implementation.
	///			///
	/// The common implementation of the inlining logic is shared between this			/// The common implementation of the inlining logic is shared between this
	/// inliner pass and the always inliner pass. The two passes use different cost			/// inliner pass and the always inliner pass. The two passes use different cost
	/// analyses to determine when to inline.			/// analyses to determine when to inline.
	class SimpleInliner : public Inliner {			class SimpleInliner : public Inliner {
	InlineCostAnalysis *ICA;

	public:			public:
	SimpleInliner() : Inliner(ID), ICA(nullptr) {			SimpleInliner() : Inliner(ID) {
	initializeSimpleInlinerPass(*PassRegistry::getPassRegistry());			initializeSimpleInlinerPass(*PassRegistry::getPassRegistry());
	}			}

	SimpleInliner(int Threshold)			SimpleInliner(int Threshold)
	: Inliner(ID, Threshold, /*InsertLifetime*/ true), ICA(nullptr) {			: Inliner(ID, Threshold, /*InsertLifetime*/ true) {
	initializeSimpleInlinerPass(*PassRegistry::getPassRegistry());			initializeSimpleInlinerPass(*PassRegistry::getPassRegistry());
	}			}

	static char ID; // Pass identification, replacement for typeid			static char ID; // Pass identification, replacement for typeid

	InlineCost getInlineCost(CallSite CS) override {			InlineCost getInlineCost(CallSite CS) override {
	return ICA->getInlineCost(CS, getInlineThreshold(CS));			Function *Callee = CS.getCalledFunction();
				assert(Callee);
				TargetTransformInfo &TTI =
				getAnalysis<TargetTransformInfoWrapperPass>().getTTI(*Callee);
				AssumptionCacheTracker *ACT = &getAnalysis<AssumptionCacheTracker>();

				return llvm::getInlineCost(CS, getInlineThreshold(CS), TTI, ACT);
	}			}

	bool runOnSCC(CallGraphSCC &SCC) override;			bool runOnSCC(CallGraphSCC &SCC) override;
	void getAnalysisUsage(AnalysisUsage &AU) const override;			void getAnalysisUsage(AnalysisUsage &AU) const override;
	};			};

	static int computeThresholdFromOptLevels(unsigned OptLevel,			static int computeThresholdFromOptLevels(unsigned OptLevel,
	unsigned SizeOptLevel) {			unsigned SizeOptLevel) {
	if (OptLevel > 2)			if (OptLevel > 2)
	return 275;			return 275;
	if (SizeOptLevel == 1) // -Os			if (SizeOptLevel == 1) // -Os
	return 75;			return 75;
	if (SizeOptLevel == 2) // -Oz			if (SizeOptLevel == 2) // -Oz
	return 25;			return 25;
	return 225;			return 225;
	}			}

	} // end anonymous namespace			} // end anonymous namespace

	char SimpleInliner::ID = 0;			char SimpleInliner::ID = 0;
	INITIALIZE_PASS_BEGIN(SimpleInliner, "inline",			INITIALIZE_PASS_BEGIN(SimpleInliner, "inline",
	"Function Integration/Inlining", false, false)			"Function Integration/Inlining", false, false)
	INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)			INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)
	INITIALIZE_PASS_DEPENDENCY(CallGraphWrapperPass)			INITIALIZE_PASS_DEPENDENCY(CallGraphWrapperPass)
	INITIALIZE_PASS_DEPENDENCY(InlineCostAnalysis)			INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)
	INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)			INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)
	INITIALIZE_PASS_END(SimpleInliner, "inline",			INITIALIZE_PASS_END(SimpleInliner, "inline",
	"Function Integration/Inlining", false, false)			"Function Integration/Inlining", false, false)

	Pass *llvm::createFunctionInliningPass() { return new SimpleInliner(); }			Pass *llvm::createFunctionInliningPass() { return new SimpleInliner(); }

	Pass *llvm::createFunctionInliningPass(int Threshold) {			Pass *llvm::createFunctionInliningPass(int Threshold) {
	return new SimpleInliner(Threshold);			return new SimpleInliner(Threshold);
	}			}

	Pass *llvm::createFunctionInliningPass(unsigned OptLevel,			Pass *llvm::createFunctionInliningPass(unsigned OptLevel,
	unsigned SizeOptLevel) {			unsigned SizeOptLevel) {
	return new SimpleInliner(			return new SimpleInliner(
	computeThresholdFromOptLevels(OptLevel, SizeOptLevel));			computeThresholdFromOptLevels(OptLevel, SizeOptLevel));
	}			}

	bool SimpleInliner::runOnSCC(CallGraphSCC &SCC) {			bool SimpleInliner::runOnSCC(CallGraphSCC &SCC) {
	ICA = &getAnalysis<InlineCostAnalysis>();
	return Inliner::runOnSCC(SCC);			return Inliner::runOnSCC(SCC);
				chandlerc: You want to replace this on every run rather than checking it on each run. Look for other places where we cache a TTI pointer. You should also leave it uninitialized in the constructor so we get an MSan error if we reach code without the run call happening first.
				eraman (author): I didn't think about the MSan interaction. It now makes sense to remove the initialization, which means I can't do the if (!TTIWP) check. I am curious whether there is any other reason to avoid this pattern.
				chandlerc: It doesn't happen much in-tree, but the pass manager supports running multiple modules through a single pass instance, and it is conceivable that they would have different TTI wrapper passes. Not likely, and not practically reproducible, but the pattern is designed to handle cases where the analysis might not be valid to cache from run to run.
	}			}
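The caching pattern discussed in the comments above can be sketched roughly as follows. This is a minimal illustration, not the code from this revision: FakeTTIWrapper, FakeModuleRun, SketchInliner, and costTarget are hypothetical stand-ins for the real LLVM classes, chosen so the example compiles without LLVM headers. It only shows the cached pointer being overwritten unconditionally at the start of every run and left uninitialized in the constructor.

```cpp
#include <cassert>

// Hypothetical stand-in for TargetTransformInfoWrapperPass.
struct FakeTTIWrapper {
  int TargetId;
};

// Hypothetical stand-in for "whatever the pass manager hands us on this run".
struct FakeModuleRun {
  FakeTTIWrapper *Wrapper;
};

class SketchInliner {
  // Deliberately left uninitialized by the constructor, so that a tool such
  // as MSan can flag any use that happens before runOnSCC has been called.
  FakeTTIWrapper *TTIWP;

public:
  SketchInliner() {}

  bool runOnSCC(FakeModuleRun &Run) {
    // Replace the cached pointer on every run instead of guarding it with
    // "if (!TTIWP)": a later run may come with a different wrapper.
    TTIWP = Run.Wrapper;
    assert(TTIWP && "run must provide a wrapper");
    return false; // This sketch never changes anything.
  }

  // Uses the cached pointer; only valid after runOnSCC.
  int costTarget() const { return TTIWP->TargetId; }
};
```

With two runs that supply different wrappers, costTarget reflects the wrapper of the most recent run, which is the behavior an "if (!TTIWP)" guard would not give.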

	void SimpleInliner::getAnalysisUsage(AnalysisUsage &AU) const {			void SimpleInliner::getAnalysisUsage(AnalysisUsage &AU) const {
	AU.addRequired<InlineCostAnalysis>();			// TargetTransformInfoWrapperPass and AssumptionCacheTracker are
				// needed to perform inline cost analysis. The base Inliner class
				// calls addRequired on AssumptionCacheTracker.
				AU.addRequired<TargetTransformInfoWrapperPass>();
				chandlerc: Hmm, this makes me believe that at least the base class should manage calling 'getAnalysis' for AssumptionCacheTracker, and probably this class should do so for TargetTransformInfo...
				eraman (author): Not fully sure what you mean here. The base class calls getAnalysis for AssumptionCacheTracker, and you want to cache the value in the base class so that it can be used in the derived class's getInlineCost?
				chandlerc: Yea, just make a protected member that the derived class can access when calling getInlineCost so it doesn't have to call getAnalysis again. (The fact that this is somewhat magical is one of many reasons why I don't really like the pattern that SimpleInliner and AlwaysInliner use of subclassing a fully formed pass, but I think that's a much bigger refactoring yak to shave.)
	Inliner::getAnalysisUsage(AU);			Inliner::getAnalysisUsage(AU);
	}			}
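The "protected member" suggestion from the thread above might look something like the sketch below. All names here (FakeAssumptionTracker, FakeTTI, BaseInlinerSketch, SimpleInlinerSketch, Lookups) are hypothetical stand-ins rather than LLVM APIs, and the cost formula is arbitrary. The point is only the shape: the base class fetches the tracker once per run and stores it in a protected member, while the derived class builds the per-callee TTI itself, since TTI depends on the function.

```cpp
#include <cassert>

// Hypothetical stand-ins; these are not the LLVM classes.
struct FakeAssumptionTracker {
  int Id;
};
struct FakeTTI {
  int CalleeId;
};

class BaseInlinerSketch {
protected:
  // Cached by the base class on each run so that derived classes do not
  // have to look the analysis up again inside getInlineCost.
  FakeAssumptionTracker *ACT;

public:
  int Lookups = 0; // Counts how often the "analysis lookup" happened.

  virtual ~BaseInlinerSketch() = default;
  virtual int getInlineCost(int CalleeId) = 0;

  bool runOnSCC(FakeAssumptionTracker &Tracker, int CalleeId) {
    ACT = &Tracker; // One lookup per run, overwritten unconditionally.
    ++Lookups;
    return getInlineCost(CalleeId) > 0;
  }
};

class SimpleInlinerSketch : public BaseInlinerSketch {
public:
  int getInlineCost(int CalleeId) override {
    FakeTTI TTI{CalleeId};         // Per-callee, so it cannot be cached once.
    return ACT->Id + TTI.CalleeId; // Arbitrary placeholder cost.
  }
};
```

Calling getInlineCost repeatedly after a single runOnSCC leaves Lookups at 1, which is the saving the reviewer is asking for.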