This is an archive of the discontinued LLVM Phabricator instance.

[InlineCost] Prevent infinite recursion on function pointers
AcceptedPublic

Authored by paquette on Jan 3 2018, 2:40 PM.

Download Raw Diff

Details

Reviewers

echristo
davide
MatzeB
chandlerc

Summary

Compile the following at -Os:

typedef void (*Foo)(void*);

void Bar(void* FunctionPtr)
{
	((Foo)FunctionPtr)((void*)Bar);
}

int main(int argc, char *argv[]) {
	Bar((void*)Bar);	
}

The inliner will recurse infinitely because it doesn't handle the case where a function takes a function pointer argument, and is called using a pointer to itself as an argument. This patch makes the inliner quit when the examined callsite involves a function pointer parameter.

Diff Detail

Event Timeline

paquette created this revision.Jan 3 2018, 2:40 PM

Herald added subscribers: haicheng, eraman. · View Herald TranscriptJan 3 2018, 2:40 PM

Needs test

lib/Analysis/InlineCost.cpp
1278–1279	I don't think you can rely on the pointee type this way

The bug that's getting triggered by the testcase is that CallAnalyzer::analyzeCall is recursive, without any recursion limit, so it crashes by overflowing the stack.

Your patch doesn't solve that issue in general; it only solves the problem for your specific testcase. You could trigger a similar crash with a program that isn't recursive. Or there might be some other way to trick CallAnalyzer into following a recursive program. If you want to actually fix the bug, the solution is to either add a recursion depth limit, or make the algorithm iterative and add an iteration limit.

Updated patch to prevent general recursion instead of just the one edgecase. Also added some information to the dump function for CallAnalyzer to make it clear if the inliner bailed out because of the recursion limit.

We must have some pre-existing recursion check already as clang doesn't fail for obviously recursive cases. (I guess it's the code setting IsRecursiveCall).
I assume that code isn't working correctly with the testcases here can't we just fix that?

If we are going for arbitrary recursion limits, then we should probably have a comment for why the limit is 100 (and not 50 or 200) and it would probably be a good idea to add a cl::opt so people can tweak the limit if it doesn't work for them.

I assume that code isn't working correctly with the testcases here can't we just fix that?

Even if we did, we would still need some sort of arbitrary limit to handle large callgraphs (if you have Bar1 calls Bar2 calls Bar3...Bar1000, or something like that, we'll eventually overflow compiler's stack through recursion even if the IR doesn't contain any recursion).

Got rid of the magic 100 for the maximum depth and put in a command line option (-inline-recursion-limit) instead. I tried a few different limits with the knob and found that this test crashes around a recursion depth of 3500 on my machine. A max depth of 2000 seemed like a good middle-ground.

Makes sense to me from all I've seen. So tentative LGTM: Wait a few days and if noone else chimes in, it's fine :)

lib/Analysis/InlineCost.cpp
92–94	typo in the help text.
1289	I would recommend duplicating some code we had earlier for known call sites here: if (F == CS.getInstruction()->getFunction()) { // This flag will fully abort the analysis, so don't bother with anything // else. IsRecursiveCall = true; return false; } as we won't inline recursively anyway after this.

This revision is now accepted and ready to land.Jan 9 2018, 4:04 PM

There are two problems this patch is trying to address:
a. A call chain I1->I2->I3...Ik, where each of the I_i ->I_(i+1) are indirect calls that become direct (because we know the target through inline analysis). No recursion in the callgraph of the module being compiled.
b. The callgraph has recursion through a sequence of indirect calls. This is similar to the example in this patch description but could be a more broader cycle (not just self recursion).

One consequence of this patch is that in the second example, we will end up applying a huge bonus to the original callee that is being analyzed. Note that the intent of recursively calling analyzeCall is to apply a bonus (negative cost) by subtracting the cost of indirect call from the indirect threshold parameter. Let's say that you have a self recursive call where the original cost of the body is 25. IndirectCallThreshold is 150. At the maximum depth R, we don't analyze any further and return a cost of 25. In the previous level (R-1), we give a bonus by subtracting 125 (IndirectCallThreshold - call cost (25)). So the cost becomes -125. So at the topmost level the bonus will be something like -125R. While the intent of the heuristic is to give some small bonus to each indirect call promoted to a direct call, this will end up applying the bonus multiple times to a call instruction.

One way to fix this is by keeping track of the callsites visited in this process and not revisit a callsite. A simpler alternative is to limit the recursion depth to just 1 (or some very small number). I don't think we will lose out much doing so.

Revision Contents

Path

Size

include/

llvm/

Analysis/

InlineCost.h

3 lines

lib/

Analysis/

InlineCost.cpp

47 lines

test/

Transforms/

Inline/

inline-functionptr.ll

29 lines

Diff 128977

include/llvm/Analysis/InlineCost.h

	Show First 20 Lines • Show All 125 Lines • ▼ Show 20 Lines
	/// and pass it to \c getInlineCost. Some specialized versions of inliner			/// and pass it to \c getInlineCost. Some specialized versions of inliner
	/// (such as the pre-inliner) might have custom logic to compute \c InlineParams			/// (such as the pre-inliner) might have custom logic to compute \c InlineParams
	/// object.			/// object.

	struct InlineParams {			struct InlineParams {
	/// The default threshold to start with for a callee.			/// The default threshold to start with for a callee.
	int DefaultThreshold;			int DefaultThreshold;

				/// The maximum depth of recursion for analyzing a call.
				Optional<int> RecursionLimit;

	/// Threshold to use for callees with inline hint.			/// Threshold to use for callees with inline hint.
	Optional<int> HintThreshold;			Optional<int> HintThreshold;

	/// Threshold to use for cold callees.			/// Threshold to use for cold callees.
	Optional<int> ColdThreshold;			Optional<int> ColdThreshold;

	/// Threshold to use when the caller is optimized for size.			/// Threshold to use when the caller is optimized for size.
	Optional<int> OptSizeThreshold;			Optional<int> OptSizeThreshold;
	▲ Show 20 Lines • Show All 73 Lines • Show Last 20 Lines

lib/Analysis/InlineCost.cpp

Show First 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	cl::desc("Minimum block frequency, expressed as a multiple of caller's "
"entry frequency, for a callsite to be hot in the absence of "		"entry frequency, for a callsite to be hot in the absence of "
"profile information."));		"profile information."));

static cl::opt<bool> OptComputeFullInlineCost(		static cl::opt<bool> OptComputeFullInlineCost(
"inline-cost-full", cl::Hidden, cl::init(false),		"inline-cost-full", cl::Hidden, cl::init(false),
cl::desc("Compute the full inline cost of a call site even when the cost "		cl::desc("Compute the full inline cost of a call site even when the cost "
"exceeds the threshold."));		"exceeds the threshold."));

		static cl::opt<int> InlineRecursionLimit(
		"inline-recursion-limit", cl::Hidden, cl::init(2000), cl::ZeroOrMore,
		cl::desc("Recursion limit for analyzing calls (default = 3000)"));
		MatzeBUnsubmitted Not Done Reply Inline Actions typo in the help text. MatzeB: typo in the help text.

namespace {		namespace {

class CallAnalyzer : public InstVisitor<CallAnalyzer, bool> {		class CallAnalyzer : public InstVisitor<CallAnalyzer, bool> {
typedef InstVisitor<CallAnalyzer, bool> Base;		typedef InstVisitor<CallAnalyzer, bool> Base;
friend class InstVisitor<CallAnalyzer, bool>;		friend class InstVisitor<CallAnalyzer, bool>;

/// The TargetTransformInfo available for this compilation.		/// The TargetTransformInfo available for this compilation.
const TargetTransformInfo &TTI;		const TargetTransformInfo &TTI;
Show All 19 Lines	class CallAnalyzer : public InstVisitor<CallAnalyzer, bool> {
/// The candidate callsite being analyzed. Please do not use this to do		/// The candidate callsite being analyzed. Please do not use this to do
/// analysis in the caller function; we want the inline cost query to be		/// analysis in the caller function; we want the inline cost query to be
/// easily cacheable. Instead, use the cover function paramHasAttr.		/// easily cacheable. Instead, use the cover function paramHasAttr.
CallSite CandidateCS;		CallSite CandidateCS;

/// Tunable parameters that control the analysis.		/// Tunable parameters that control the analysis.
const InlineParams &Params;		const InlineParams &Params;

		/// The depth of recursion that this CallAnalyzer sits at.
		int CallsAnalyzedRecursionDepth = 0;

		/// True if this CallAnalyzer hit its maximum recursion depth.
		bool HitMaxNumAnalyzeCalls = false;

int Threshold;		int Threshold;
int Cost;		int Cost;
bool ComputeFullInlineCost;		bool ComputeFullInlineCost;

bool IsCallerRecursive;		bool IsCallerRecursive;
bool IsRecursiveCall;		bool IsRecursiveCall;
bool ExposesReturnsTwice;		bool ExposesReturnsTwice;
bool HasDynamicAlloca;		bool HasDynamicAlloca;
▲ Show 20 Lines • Show All 131 Lines • ▼ Show 20 Lines	class CallAnalyzer : public InstVisitor<CallAnalyzer, bool> {
bool visitCatchReturnInst(CatchReturnInst &RI);		bool visitCatchReturnInst(CatchReturnInst &RI);
bool visitUnreachableInst(UnreachableInst &I);		bool visitUnreachableInst(UnreachableInst &I);

public:		public:
CallAnalyzer(const TargetTransformInfo &TTI,		CallAnalyzer(const TargetTransformInfo &TTI,
std::function<AssumptionCache &(Function &)> &GetAssumptionCache,		std::function<AssumptionCache &(Function &)> &GetAssumptionCache,
Optional<function_ref<BlockFrequencyInfo &(Function &)>> &GetBFI,		Optional<function_ref<BlockFrequencyInfo &(Function &)>> &GetBFI,
ProfileSummaryInfo PSI, OptimizationRemarkEmitter ORE,		ProfileSummaryInfo PSI, OptimizationRemarkEmitter ORE,
Function &Callee, CallSite CSArg, const InlineParams &Params)		Function &Callee, CallSite CSArg, const InlineParams &Params,
		const unsigned &Depth)
: TTI(TTI), GetAssumptionCache(GetAssumptionCache), GetBFI(GetBFI),		: TTI(TTI), GetAssumptionCache(GetAssumptionCache), GetBFI(GetBFI),
PSI(PSI), F(Callee), DL(F.getParent()->getDataLayout()), ORE(ORE),		PSI(PSI), F(Callee), DL(F.getParent()->getDataLayout()), ORE(ORE),
CandidateCS(CSArg), Params(Params), Threshold(Params.DefaultThreshold),		CandidateCS(CSArg), Params(Params), CallsAnalyzedRecursionDepth(Depth),
Cost(0), ComputeFullInlineCost(OptComputeFullInlineCost \|\|		Threshold(Params.DefaultThreshold), Cost(0),
		ComputeFullInlineCost(OptComputeFullInlineCost \|\|
Params.ComputeFullInlineCost \|\| ORE),		Params.ComputeFullInlineCost \|\| ORE),
IsCallerRecursive(false), IsRecursiveCall(false),		IsCallerRecursive(false), IsRecursiveCall(false),
ExposesReturnsTwice(false), HasDynamicAlloca(false),		ExposesReturnsTwice(false), HasDynamicAlloca(false),
ContainsNoDuplicateCall(false), HasReturn(false), HasIndirectBr(false),		ContainsNoDuplicateCall(false), HasReturn(false), HasIndirectBr(false),
HasFrameEscape(false), AllocatedSize(0), NumInstructions(0),		HasFrameEscape(false), AllocatedSize(0), NumInstructions(0),
NumVectorInstructions(0), VectorBonus(0), SingleBBBonus(0),		NumVectorInstructions(0), VectorBonus(0), SingleBBBonus(0),
EnableLoadElimination(true), LoadEliminationCost(0), NumConstantArgs(0),		EnableLoadElimination(true), LoadEliminationCost(0), NumConstantArgs(0),
NumConstantOffsetPtrArgs(0), NumAllocaArgs(0), NumConstantPtrCmps(0),		NumConstantOffsetPtrArgs(0), NumAllocaArgs(0), NumConstantPtrCmps(0),
NumConstantPtrDiffs(0), NumInstructionsSimplified(0),		NumConstantPtrDiffs(0), NumInstructionsSimplified(0),
▲ Show 20 Lines • Show All 898 Lines • ▼ Show 20 Lines	if (Constant *C = ConstantFoldCall(CS, F, ConstantArgs)) {
SimplifiedValues[CS.getInstruction()] = C;		SimplifiedValues[CS.getInstruction()] = C;
return true;		return true;
}		}

return false;		return false;
}		}

bool CallAnalyzer::visitCallSite(CallSite CS) {		bool CallAnalyzer::visitCallSite(CallSite CS) {
		// If we've recursed too deep, then quit.
		if (HitMaxNumAnalyzeCalls)
		return false;

if (CS.hasFnAttr(Attribute::ReturnsTwice) &&		if (CS.hasFnAttr(Attribute::ReturnsTwice) &&
!F.hasFnAttribute(Attribute::ReturnsTwice)) {		!F.hasFnAttribute(Attribute::ReturnsTwice)) {
// This aborts the entire analysis.		// This aborts the entire analysis.
ExposesReturnsTwice = true;		ExposesReturnsTwice = true;
return false;		return false;
}		}
if (CS.isCall() && cast<CallInst>(CS.getInstruction())->cannotDuplicate())		if (CS.isCall() && cast<CallInst>(CS.getInstruction())->cannotDuplicate())
ContainsNoDuplicateCall = true;		ContainsNoDuplicateCall = true;
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	if (Function *F = CS.getCalledFunction()) {
return Base::visitCallSite(CS);		return Base::visitCallSite(CS);
}		}

// Otherwise we're in a very special case -- an indirect function call. See		// Otherwise we're in a very special case -- an indirect function call. See
// if we can be particularly clever about this.		// if we can be particularly clever about this.
Value *Callee = CS.getCalledValue();		Value *Callee = CS.getCalledValue();

// First, pay the price of the argument setup. We account for the average		// First, pay the price of the argument setup. We account for the average
// 1 instruction per call argument setup here.		// 1 instruction per call argument setup here.
Cost += CS.arg_size() * InlineConstants::InstrCost;		Cost += CS.arg_size() * InlineConstants::InstrCost;
		arsenmUnsubmitted Not Done Reply Inline Actions I don't think you can rely on the pointee type this way arsenm: I don't think you can rely on the pointee type this way

// Next, check if this happens to be an indirect function call to a known		// Next, check if this happens to be an indirect function call to a known
// function in this inline context. If not, we've done all we can.		// function in this inline context. If not, we've done all we can.
Function *F = dyn_cast_or_null<Function>(SimplifiedValues.lookup(Callee));		Function *F = dyn_cast_or_null<Function>(SimplifiedValues.lookup(Callee));
if (!F) {		if (!F) {
if (!CS.onlyReadsMemory())		if (!CS.onlyReadsMemory())
disableLoadElimination();		disableLoadElimination();
return Base::visitCallSite(CS);		return Base::visitCallSite(CS);
}		}

		MatzeBUnsubmitted Not Done Reply Inline Actions I would recommend duplicating some code we had earlier for known call sites here: if (F == CS.getInstruction()->getFunction()) { // This flag will fully abort the analysis, so don't bother with anything // else. IsRecursiveCall = true; return false; } as we won't inline recursively anyway after this. MatzeB: I would recommend duplicating some code we had earlier for known call sites here: ``` if (F…
// If we have a constant that we are calling as a function, we can peer		// If we have a constant that we are calling as a function, we can peer
// through it and see the function target. This happens not infrequently		// through it and see the function target. This happens not infrequently
// during devirtualization and so we want to give it a hefty bonus for		// during devirtualization and so we want to give it a hefty bonus for
// inlining, but cap that bonus in the event that inlining wouldn't pan		// inlining, but cap that bonus in the event that inlining wouldn't pan
// out. Pretend to inline the function, with a custom threshold.		// out. Pretend to inline the function, with a custom threshold.
auto IndirectCallParams = Params;		auto IndirectCallParams = Params;
IndirectCallParams.DefaultThreshold = InlineConstants::IndirectCallThreshold;		IndirectCallParams.DefaultThreshold = InlineConstants::IndirectCallThreshold;
CallAnalyzer CA(TTI, GetAssumptionCache, GetBFI, PSI, ORE, *F, CS,		CallAnalyzer CA(TTI, GetAssumptionCache, GetBFI, PSI, ORE, *F, CS,
IndirectCallParams);		IndirectCallParams, CallsAnalyzedRecursionDepth);

		// Analyze CS and update the depth of recursion.
if (CA.analyzeCall(CS)) {		if (CA.analyzeCall(CS)) {
// We were able to inline the indirect call! Subtract the cost from the		// We were able to inline the indirect call! Subtract the cost from the
// threshold to get the bonus we want to apply, but don't go below zero.		// threshold to get the bonus we want to apply, but don't go below zero.
Cost -= std::max(0, CA.getThreshold() - CA.getCost());		Cost -= std::max(0, CA.getThreshold() - CA.getCost());
}		}

		// If we hit the maximum recursion when calling analyzeCall, update this
		// CallAnalyzer's max recursion flag.
		if (CA.HitMaxNumAnalyzeCalls)
		HitMaxNumAnalyzeCalls = true;

if (!F->onlyReadsMemory())		if (!F->onlyReadsMemory())
disableLoadElimination();		disableLoadElimination();
return Base::visitCallSite(CS);		return Base::visitCallSite(CS);
}		}

bool CallAnalyzer::visitReturnInst(ReturnInst &RI) {		bool CallAnalyzer::visitReturnInst(ReturnInst &RI) {
// At least one return instruction will be free after inlining.		// At least one return instruction will be free after inlining.
bool Free = !HasReturn;		bool Free = !HasReturn;
▲ Show 20 Lines • Show All 380 Lines • ▼ Show 20 Lines
/// \brief Analyze a call site for potential inlining.		/// \brief Analyze a call site for potential inlining.
///		///
/// Returns true if inlining this call is viable, and false if it is not		/// Returns true if inlining this call is viable, and false if it is not
/// viable. It computes the cost and adjusts the threshold based on numerous		/// viable. It computes the cost and adjusts the threshold based on numerous
/// factors and heuristics. If this method returns false but the computed cost		/// factors and heuristics. If this method returns false but the computed cost
/// is below the computed threshold, then inlining was forcibly disabled by		/// is below the computed threshold, then inlining was forcibly disabled by
/// some artifact of the routine.		/// some artifact of the routine.
bool CallAnalyzer::analyzeCall(CallSite CS) {		bool CallAnalyzer::analyzeCall(CallSite CS) {
		++CallsAnalyzedRecursionDepth;

		// Update the depth of recursion. If we've gone too deep, bail out.
		if (CallsAnalyzedRecursionDepth >= Params.RecursionLimit) {
		HitMaxNumAnalyzeCalls = true;
		return false;
		}

++NumCallsAnalyzed;		++NumCallsAnalyzed;

// Perform some tweaks to the cost and threshold based on the direct		// Perform some tweaks to the cost and threshold based on the direct
// callsite information.		// callsite information.

// We want to more aggressively inline vector-dense kernels, so up the		// We want to more aggressively inline vector-dense kernels, so up the
// threshold, and we'll lower it if the % of vector instructions gets too		// threshold, and we'll lower it if the % of vector instructions gets too
// low. Note that these bonuses are some what arbitrary and evolved over time		// low. Note that these bonuses are some what arbitrary and evolved over time
▲ Show 20 Lines • Show All 185 Lines • ▼ Show 20 Lines	#define DEBUG_PRINT_STAT(x) dbgs() << " " #x ": " << x << "\n"
DEBUG_PRINT_STAT(NumInstructionsSimplified);		DEBUG_PRINT_STAT(NumInstructionsSimplified);
DEBUG_PRINT_STAT(NumInstructions);		DEBUG_PRINT_STAT(NumInstructions);
DEBUG_PRINT_STAT(SROACostSavings);		DEBUG_PRINT_STAT(SROACostSavings);
DEBUG_PRINT_STAT(SROACostSavingsLost);		DEBUG_PRINT_STAT(SROACostSavingsLost);
DEBUG_PRINT_STAT(LoadEliminationCost);		DEBUG_PRINT_STAT(LoadEliminationCost);
DEBUG_PRINT_STAT(ContainsNoDuplicateCall);		DEBUG_PRINT_STAT(ContainsNoDuplicateCall);
DEBUG_PRINT_STAT(Cost);		DEBUG_PRINT_STAT(Cost);
DEBUG_PRINT_STAT(Threshold);		DEBUG_PRINT_STAT(Threshold);
		DEBUG_PRINT_STAT(HitMaxNumAnalyzeCalls);
#undef DEBUG_PRINT_STAT		#undef DEBUG_PRINT_STAT
}		}
#endif		#endif

/// \brief Test that there are no attribute conflicts between Caller and Callee		/// \brief Test that there are no attribute conflicts between Caller and Callee
/// that prevent inlining.		/// that prevent inlining.
static bool functionsHaveCompatibleAttributes(Function *Caller,		static bool functionsHaveCompatibleAttributes(Function *Caller,
Function *Callee,		Function *Callee,
▲ Show 20 Lines • Show All 79 Lines • ▼ Show 20 Lines	InlineCost llvm::getInlineCost(
if (Callee->isInterposable() \|\| Callee->hasFnAttribute(Attribute::NoInline) \|\|		if (Callee->isInterposable() \|\| Callee->hasFnAttribute(Attribute::NoInline) \|\|
CS.isNoInline())		CS.isNoInline())
return llvm::InlineCost::getNever();		return llvm::InlineCost::getNever();

DEBUG(llvm::dbgs() << " Analyzing call of " << Callee->getName()		DEBUG(llvm::dbgs() << " Analyzing call of " << Callee->getName()
<< "... (caller:" << Caller->getName() << ")\n");		<< "... (caller:" << Caller->getName() << ")\n");

CallAnalyzer CA(CalleeTTI, GetAssumptionCache, GetBFI, PSI, ORE, *Callee, CS,		CallAnalyzer CA(CalleeTTI, GetAssumptionCache, GetBFI, PSI, ORE, *Callee, CS,
Params);		Params, 0);
bool ShouldInline = CA.analyzeCall(CS);		bool ShouldInline = CA.analyzeCall(CS);

DEBUG(CA.dump());		DEBUG(CA.dump());

// Check if there was a reason to force inlining or no inlining.		// Check if there was a reason to force inlining or no inlining.
if (!ShouldInline && CA.getCost() < CA.getThreshold())		if (!ShouldInline && CA.getCost() < CA.getThreshold())
return InlineCost::getNever();		return InlineCost::getNever();
if (ShouldInline && CA.getCost() >= CA.getThreshold())		if (ShouldInline && CA.getCost() >= CA.getThreshold())
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	else
Params.DefaultThreshold = Threshold;		Params.DefaultThreshold = Threshold;

// Set the HintThreshold knob from the -inlinehint-threshold.		// Set the HintThreshold knob from the -inlinehint-threshold.
Params.HintThreshold = HintThreshold;		Params.HintThreshold = HintThreshold;

// Set the HotCallSiteThreshold knob from the -hot-callsite-threshold.		// Set the HotCallSiteThreshold knob from the -hot-callsite-threshold.
Params.HotCallSiteThreshold = HotCallSiteThreshold;		Params.HotCallSiteThreshold = HotCallSiteThreshold;

		// Set the inlining recursion limit using -inline-recursion-limit.
		Params.RecursionLimit = InlineRecursionLimit;

// If the -locally-hot-callsite-threshold is explicitly specified, use it to		// If the -locally-hot-callsite-threshold is explicitly specified, use it to
// populate LocallyHotCallSiteThreshold. Later, we populate		// populate LocallyHotCallSiteThreshold. Later, we populate
// Params.LocallyHotCallSiteThreshold from -locally-hot-callsite-threshold if		// Params.LocallyHotCallSiteThreshold from -locally-hot-callsite-threshold if
// we know that optimization level is O3 (in the getInlineParams variant that		// we know that optimization level is O3 (in the getInlineParams variant that
// takes the opt and size levels).		// takes the opt and size levels).
// FIXME: Remove this check (and make the assignment unconditional) after		// FIXME: Remove this check (and make the assignment unconditional) after
// addressing size regression issues at O2.		// addressing size regression issues at O2.
if (LocallyHotCallSiteThreshold.getNumOccurrences() > 0)		if (LocallyHotCallSiteThreshold.getNumOccurrences() > 0)
▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines

test/Transforms/Inline/inline-functionptr.ll

This file was added.

				; RUN: opt -passes='cgscc(inline)' -S %s \| FileCheck %s

				;; The inliner should quit when trying to inline the call to Bar into Bar.
				define void @Bar(i8* nocapture %FunctionPtr) #0 {
				; CHECK-LABEL: Bar
				; CHECK: %0 = bitcast i8* %FunctionPtr to void (i8)
				; CHECK-NEXT: call void %0(i8* bitcast (void (i8) @Bar to i8*)) #0
				; CHECK-NEXT: call void %0(i8* bitcast (void (i8) @Bar to i8*)) #0
				; CHECK-NEXT: ret void
				entry:
				%0 = bitcast i8* %FunctionPtr to void (i8)
				call void %0(i8* bitcast (void (i8) @Bar to i8*)) #0
				call void %0(i8* bitcast (void (i8) @Bar to i8*)) #0
				ret void
				}

				;; The inliner should be able to inline Bar into main
				define i32 @main(i32 %argc, i8** nocapture readnone %argv)
				local_unnamed_addr #0 {
				; CHECK-LABEL: main
				; CHECK: call void @Bar(i8* bitcast (void (i8) @Bar to i8*)) #0
				; CHECK-NEXT: call void @Bar(i8* bitcast (void (i8) @Bar to i8*)) #0
				; CHECK-NEXT: ret i32 0
				entry:
				tail call void @Bar(i8* bitcast (void (i8) @Bar to i8*)) #0
				ret i32 0
				}

				attributes #0 = { nounwind ssp uwtable }