This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/test/Frontend/
-
test/
-
Frontend/
-
optimization-remark-with-hotness-new-pm.c
-
optimization-remark-with-hotness.c
-
llvm/
-
include/llvm/Analysis/
-
llvm/
-
Analysis/
-
InlineAdvisor.h
-
lib/
-
Analysis/
2/5
InlineAdvisor.cpp
-
Transforms/IPO/
-
IPO/
-
SampleProfile.cpp
-
test/Transforms/
-
Transforms/
-
Inline/
-
optimization-remarks-passed-yaml.ll
-
SampleProfile/
-
Inputs/
-
remarks.prof
-
remarks.ll

Differential D82213

[Remarks] Add callsite locations to inline remarks
ClosedPublic

Authored by wenlei on Jun 19 2020, 10:29 AM.

Download Raw Diff

Details

Reviewers

wmi
davidxl
hoy
chandlerc

Commits

rG7c8a6936bf6b: [Remarks] Add callsite locations to inline remarks

Summary

Add call site location info into inline remarks so we can differentiate inline sites.
This can be useful for inliner tuning. We can also reconstruct full hierarchical inline
tree from parsing such remarks. The messege of inline remark is also tweaked so we can
differentiate SampleProfileLoader inline from CGSCC inline.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

wenlei created this revision.Jun 19 2020, 10:29 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptJun 19 2020, 10:29 AM

Herald added subscribers: llvm-commits, cfe-commits, hiraditya. · View Herald Transcript

This sounds useful indeed. @fhahn, @anemet might want to take a look.

Harbormaster completed remote builds in B61065: Diff 272131.Jun 19 2020, 1:04 PM

wenlei added a reviewer: chandlerc.Jun 19 2020, 2:02 PM

Can you add a test case where there is more than one level of inline contexts for the callsite?

llvm/lib/Analysis/InlineAdvisor.cpp
391	is this necessary? User should know if their build has profile or not. What is more useful is when PGO is on, but some callsite does not have profile data, then it is worth reporting.

wenlei marked an inline comment as done.Jun 19 2020, 5:33 PM

wenlei added inline comments.

llvm/lib/Analysis/InlineAdvisor.cpp
391	is this necessary? User should know if their build has profile or not. This was used to differentiate between SampleProfileLoader inline vs CGSCC inline. Maybe the message `by profile guided inliner` isn't great, but can't think of a better and concise way.. With the differentiation in the message, the inlinee tree recovered through some parsing is what I'm looking for (`[P]` for SampleProfileLoader inline, `[C]` for CGSCC inline): Inlinees for main [P] _ZN15largesolidarrayIP6regobjEixEi @ 369 [P] _Z7random1i @ 363 [C] _Z8myrandomv @ 2 [P] _Z7random1i @ 364 [C] _Z8myrandomv @ 2 [P] _ZN15largesolidarrayIP6regobjEixEi @ 366 [P] _ZN6wayobj9createwayEiiiiRP8point16tRi @ 327 [P] _ZN6wayobj11createwayarEiiRP8point16tRi @ 37.1 [P] _ZN6wayobj5indexEii @ 143 [P] _ZN6wayobj5indexEii @ 130 [P] _ZN6wayobj6indexxEi @ 31 [P] _ZN6wayobj6indexyEi @ 32 [C] _ZN8point16tC2Ess @ 2 [C] _ZN8point16tC2Ess @ 2.1 What is more useful is when PGO is on, but some callsite does not have profile data, then it is worth reporting. That can be useful. I was also looking for a way to get call site count printed (if we have a count), but looks like it's not available from `InlineCost`. I'm going to defer that for now if that's ok.

Address David's comments, add test for nested inlinining.

Harbormaster failed remote builds in B61129: Diff 272240!Jun 19 2020, 11:57 PM

davidxl added inline comments.Jun 20 2020, 8:52 AM

llvm/lib/Analysis/InlineAdvisor.cpp
392	Perhaps reword it to " to match profiling context" ..

wenlei marked an inline comment as done.Jun 20 2020, 10:07 AM

wenlei added inline comments.

llvm/lib/Analysis/InlineAdvisor.cpp
392	Sounds good, updated.

Update remark message.

Harbormaster completed remote builds in B61142: Diff 272263.Jun 20 2020, 11:38 AM

lgtm

llvm/lib/Analysis/InlineAdvisor.cpp
383	ProfileGuidedInline --> ForProfileContext

This revision is now accepted and ready to land.Jun 20 2020, 8:03 PM

Closed by commit rG7c8a6936bf6b: [Remarks] Add callsite locations to inline remarks (authored by wenlei). · Explain WhyJun 20 2020, 11:57 PM

This revision was automatically updated to reflect the committed changes.

That's interesting. We are also using something similar for the matrix lowering remarks [1]: we traverse the inlining chain bottom up and emit a remark at each step which contains the expression available at that level. I think those approaches could be useful in general to surface remarks at the right level and it might be worth moving them somewhere so they can be shared. What do you think?

[1] https://github.com/llvm/llvm-project/blob/master/llvm/lib/Transforms/Scalar/LowerMatrixIntrinsics.cpp#L1783

In D82213#2110941, @fhahn wrote:

That's interesting. We are also using something similar for the matrix lowering remarks [1]: we traverse the inlining chain bottom up and emit a remark at each step which contains the expression available at that level. I think those approaches could be useful in general to surface remarks at the right level and it might be worth moving them somewhere so they can be shared. What do you think?

[1] https://github.com/llvm/llvm-project/blob/master/llvm/lib/Transforms/Scalar/LowerMatrixIntrinsics.cpp#L1783

That's indeed similar, though it seems like what you're doing is more than just showing the full inline stack as location. Agreed that if we start to do these in more places for optimization remarks, it'd make sense to build it into remarks infra. But we may not always want full inline stack names as location (considering deep inlining with long template instantiation names that can "pollute" the remark messages), so I'm guessing what we could do is move that into remarks infra, but still use a separate switch to control whether we show inline locations (just like how -fdiagnostics-show-hotness controls whether we show hotness for remarks)?

modimo mentioned this in D94334: [InlineAdvisor] Allow replay of inline decisions for the CGSCC inliner from optimization remarks.Jan 15 2021, 11:56 AM

Revision Contents

Path

Size

clang/

test/

Frontend/

optimization-remark-with-hotness-new-pm.c

2 lines

optimization-remark-with-hotness.c

2 lines

llvm/

include/

llvm/

Analysis/

InlineAdvisor.h

7 lines

lib/

Analysis/

InlineAdvisor.cpp

38 lines

Transforms/

IPO/

SampleProfile.cpp

6 lines

test/

Transforms/

Inline/

optimization-remarks-passed-yaml.ll

6 lines

SampleProfile/

Inputs/

remarks.prof

2 lines

remarks.ll

51 lines

Diff 272289

clang/test/Frontend/optimization-remark-with-hotness-new-pm.c

	Show First 20 Lines • Show All 67 Lines • ▼ Show 20 Lines

	void bar(int x) {			void bar(int x) {
	// HOTNESS_OFF: foo inlined into bar			// HOTNESS_OFF: foo inlined into bar
	// HOTNESS_OFF-NOT: hotness:			// HOTNESS_OFF-NOT: hotness:
	// THRESHOLD-NOT: inlined			// THRESHOLD-NOT: inlined
	// THRESHOLD-NOT: hotness			// THRESHOLD-NOT: hotness
	// NO_PGO: '-fdiagnostics-show-hotness' requires profile-guided optimization information			// NO_PGO: '-fdiagnostics-show-hotness' requires profile-guided optimization information
	// NO_PGO: '-fdiagnostics-hotness-threshold=' requires profile-guided optimization information			// NO_PGO: '-fdiagnostics-hotness-threshold=' requires profile-guided optimization information
	// expected-remark@+1 {{foo inlined into bar with (cost=always): always inline attribute (hotness:}}			// expected-remark@+1 {{foo inlined into bar with (cost=always): always inline attribute at callsite bar:8 (hotness:}}
	sum += foo(x, x - 2);			sum += foo(x, x - 2);
	}			}

	int main(int argc, const char *argv[]) {			int main(int argc, const char *argv[]) {
	for (int i = 0; i < 30; i++)			for (int i = 0; i < 30; i++)
	// expected-remark@+1 {{bar inlined into main with}}			// expected-remark@+1 {{bar inlined into main with}}
	bar(argc);			bar(argc);
	return sum;			return sum;
	}			}

clang/test/Frontend/optimization-remark-with-hotness.c

	Show First 20 Lines • Show All 60 Lines • ▼ Show 20 Lines

	void bar(int x) {			void bar(int x) {
	// HOTNESS_OFF: foo inlined into bar			// HOTNESS_OFF: foo inlined into bar
	// HOTNESS_OFF-NOT: hotness:			// HOTNESS_OFF-NOT: hotness:
	// THRESHOLD-NOT: inlined			// THRESHOLD-NOT: inlined
	// THRESHOLD-NOT: hotness			// THRESHOLD-NOT: hotness
	// NO_PGO: '-fdiagnostics-show-hotness' requires profile-guided optimization information			// NO_PGO: '-fdiagnostics-show-hotness' requires profile-guided optimization information
	// NO_PGO: '-fdiagnostics-hotness-threshold=' requires profile-guided optimization information			// NO_PGO: '-fdiagnostics-hotness-threshold=' requires profile-guided optimization information
	// expected-remark@+1 {{foo inlined into bar with (cost=always): always inliner (hotness:}}			// expected-remark@+1 {{foo inlined into bar with (cost=always): always inliner at callsite bar:8 (hotness:}}
	sum += foo(x, x - 2);			sum += foo(x, x - 2);
	}			}

	int main(int argc, const char *argv[]) {			int main(int argc, const char *argv[]) {
	for (int i = 0; i < 30; i++)			for (int i = 0; i < 30; i++)
	// expected-remark@+1 {{bar not inlined into main because it should never be inlined (cost=never): no alwaysinline attribute (hotness:}}			// expected-remark@+1 {{bar not inlined into main because it should never be inlined (cost=never): no alwaysinline attribute (hotness:}}
	bar(argc);			bar(argc);
	return sum;			return sum;
	}			}

llvm/include/llvm/Analysis/InlineAdvisor.h

	Show First 20 Lines • Show All 211 Lines • ▼ Show 20 Lines
	/// inlining should not be attempted.			/// inlining should not be attempted.
	Optional<InlineCost>			Optional<InlineCost>
	shouldInline(CallBase &CB, function_ref<InlineCost(CallBase &CB)> GetInlineCost,			shouldInline(CallBase &CB, function_ref<InlineCost(CallBase &CB)> GetInlineCost,
	OptimizationRemarkEmitter &ORE, bool EnableDeferral = true);			OptimizationRemarkEmitter &ORE, bool EnableDeferral = true);

	/// Emit ORE message.			/// Emit ORE message.
	void emitInlinedInto(OptimizationRemarkEmitter &ORE, DebugLoc DLoc,			void emitInlinedInto(OptimizationRemarkEmitter &ORE, DebugLoc DLoc,
	const BasicBlock *Block, const Function &Callee,			const BasicBlock *Block, const Function &Callee,
	const Function &Caller, const InlineCost &IC);			const Function &Caller, const InlineCost &IC,
				bool ForProfileContext = false,
				const char *PassName = nullptr);

				/// Add location info to ORE message.
				void addLocationToRemarks(OptimizationRemark &Remark, DebugLoc DLoc);

	/// Set the inline-remark attribute.			/// Set the inline-remark attribute.
	void setInlineRemark(CallBase &CB, StringRef Message);			void setInlineRemark(CallBase &CB, StringRef Message);

	/// Utility for extracting the inline cost message to a string.			/// Utility for extracting the inline cost message to a string.
	std::string inlineCostStr(const InlineCost &IC);			std::string inlineCostStr(const InlineCost &IC);
	} // namespace llvm			} // namespace llvm
	#endif // LLVM_INLINEADVISOR_H_			#endif // LLVM_INLINEADVISOR_H_

llvm/lib/Analysis/InlineAdvisor.cpp

Show All 12 Lines

#include "llvm/Analysis/InlineAdvisor.h"		#include "llvm/Analysis/InlineAdvisor.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/Analysis/InlineCost.h"		#include "llvm/Analysis/InlineCost.h"
#include "llvm/Analysis/OptimizationRemarkEmitter.h"		#include "llvm/Analysis/OptimizationRemarkEmitter.h"
#include "llvm/Analysis/ProfileSummaryInfo.h"		#include "llvm/Analysis/ProfileSummaryInfo.h"
#include "llvm/Analysis/TargetLibraryInfo.h"		#include "llvm/Analysis/TargetLibraryInfo.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
		#include "llvm/IR/DebugInfoMetadata.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"

#include <sstream>		#include <sstream>

using namespace llvm;		using namespace llvm;
#define DEBUG_TYPE "inline"		#define DEBUG_TYPE "inline"

▲ Show 20 Lines • Show All 320 Lines • ▼ Show 20 Lines	if (EnableDeferral &&
return None;		return None;
}		}

LLVM_DEBUG(dbgs() << " Inlining " << inlineCostStr(IC) << ", Call: " << CB		LLVM_DEBUG(dbgs() << " Inlining " << inlineCostStr(IC) << ", Call: " << CB
<< '\n');		<< '\n');
return IC;		return IC;
}		}

		void llvm::addLocationToRemarks(OptimizationRemark &Remark, DebugLoc DLoc) {
		if (!DLoc.get())
		return;

		bool First = true;
		Remark << " at callsite ";
		for (DILocation *DIL = DLoc.get(); DIL; DIL = DIL->getInlinedAt()) {
		if (!First)
		Remark << " @ ";
		unsigned int Offset = DIL->getLine();
		Offset -= DIL->getScope()->getSubprogram()->getLine();
		unsigned int Discriminator = DIL->getBaseDiscriminator();
		StringRef Name = DIL->getScope()->getSubprogram()->getLinkageName();
		if (Name.empty())
		Name = DIL->getScope()->getSubprogram()->getName();
		Remark << Name << ":" << ore::NV("Line", Offset);
		if (Discriminator)
		Remark << "." << ore::NV("Disc", Discriminator);
		First = false;
		}
		}

void llvm::emitInlinedInto(OptimizationRemarkEmitter &ORE, DebugLoc DLoc,		void llvm::emitInlinedInto(OptimizationRemarkEmitter &ORE, DebugLoc DLoc,
const BasicBlock *Block, const Function &Callee,		const BasicBlock *Block, const Function &Callee,
const Function &Caller, const InlineCost &IC) {		const Function &Caller, const InlineCost &IC,
		bool ForProfileContext, const char *PassName) {
		davidxlUnsubmitted Not Done Reply Inline Actions ProfileGuidedInline --> ForProfileContext davidxl: ProfileGuidedInline --> ForProfileContext
ORE.emit([&]() {		ORE.emit([&]() {
bool AlwaysInline = IC.isAlways();		bool AlwaysInline = IC.isAlways();
StringRef RemarkName = AlwaysInline ? "AlwaysInline" : "Inlined";		StringRef RemarkName = AlwaysInline ? "AlwaysInline" : "Inlined";
return OptimizationRemark(DEBUG_TYPE, RemarkName, DLoc, Block)		OptimizationRemark Remark(PassName ? PassName : DEBUG_TYPE, RemarkName,
<< ore::NV("Callee", &Callee) << " inlined into "		DLoc, Block);
<< ore::NV("Caller", &Caller) << " with " << IC;		Remark << ore::NV("Callee", &Callee) << " inlined into ";
		Remark << ore::NV("Caller", &Caller);
		if (ForProfileContext)
		davidxlUnsubmitted Not Done Reply Inline Actions is this necessary? User should know if their build has profile or not. What is more useful is when PGO is on, but some callsite does not have profile data, then it is worth reporting. davidxl: is this necessary? User should know if their build has profile or not. What is more useful is…
		wenleiAuthorUnsubmitted Done Reply Inline Actions is this necessary? User should know if their build has profile or not. This was used to differentiate between SampleProfileLoader inline vs CGSCC inline. Maybe the message `by profile guided inliner` isn't great, but can't think of a better and concise way.. With the differentiation in the message, the inlinee tree recovered through some parsing is what I'm looking for (`[P]` for SampleProfileLoader inline, `[C]` for CGSCC inline): Inlinees for main [P] _ZN15largesolidarrayIP6regobjEixEi @ 369 [P] _Z7random1i @ 363 [C] _Z8myrandomv @ 2 [P] _Z7random1i @ 364 [C] _Z8myrandomv @ 2 [P] _ZN15largesolidarrayIP6regobjEixEi @ 366 [P] _ZN6wayobj9createwayEiiiiRP8point16tRi @ 327 [P] _ZN6wayobj11createwayarEiiRP8point16tRi @ 37.1 [P] _ZN6wayobj5indexEii @ 143 [P] _ZN6wayobj5indexEii @ 130 [P] _ZN6wayobj6indexxEi @ 31 [P] _ZN6wayobj6indexyEi @ 32 [C] _ZN8point16tC2Ess @ 2 [C] _ZN8point16tC2Ess @ 2.1 What is more useful is when PGO is on, but some callsite does not have profile data, then it is worth reporting. That can be useful. I was also looking for a way to get call site count printed (if we have a count), but looks like it's not available from `InlineCost`. I'm going to defer that for now if that's ok. wenlei: > is this necessary? User should know if their build has profile or not. This was used to…
		Remark << " to match profiling context";
		davidxlUnsubmitted Not Done Reply Inline Actions Perhaps reword it to " to match profiling context" .. davidxl: Perhaps reword it to " to match profiling context" ..
		wenleiAuthorUnsubmitted Done Reply Inline Actions Sounds good, updated. wenlei: Sounds good, updated.
		Remark << " with " << IC;
		addLocationToRemarks(Remark, DLoc);
		return Remark;
});		});
}		}

llvm/lib/Transforms/IPO/SampleProfile.cpp

Show All 31 Lines
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"
#include "llvm/Analysis/AssumptionCache.h"		#include "llvm/Analysis/AssumptionCache.h"
#include "llvm/Analysis/CallGraph.h"		#include "llvm/Analysis/CallGraph.h"
#include "llvm/Analysis/CallGraphSCCPass.h"		#include "llvm/Analysis/CallGraphSCCPass.h"
		#include "llvm/Analysis/InlineAdvisor.h"
#include "llvm/Analysis/InlineCost.h"		#include "llvm/Analysis/InlineCost.h"
#include "llvm/Analysis/LoopInfo.h"		#include "llvm/Analysis/LoopInfo.h"
#include "llvm/Analysis/OptimizationRemarkEmitter.h"		#include "llvm/Analysis/OptimizationRemarkEmitter.h"
#include "llvm/Analysis/PostDominators.h"		#include "llvm/Analysis/PostDominators.h"
#include "llvm/Analysis/ProfileSummaryInfo.h"		#include "llvm/Analysis/ProfileSummaryInfo.h"
#include "llvm/Analysis/TargetLibraryInfo.h"		#include "llvm/Analysis/TargetLibraryInfo.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
▲ Show 20 Lines • Show All 863 Lines • ▼ Show 20 Lines	bool SampleProfileLoader::inlineCallInstruction(CallBase &CB) {
if (Cost.isNever()) {		if (Cost.isNever()) {
ORE->emit(OptimizationRemarkAnalysis(CSINLINE_DEBUG, "InlineFail", DLoc, BB)		ORE->emit(OptimizationRemarkAnalysis(CSINLINE_DEBUG, "InlineFail", DLoc, BB)
<< "incompatible inlining");		<< "incompatible inlining");
return false;		return false;
}		}
InlineFunctionInfo IFI(nullptr, GetAC);		InlineFunctionInfo IFI(nullptr, GetAC);
if (InlineFunction(CB, IFI).isSuccess()) {		if (InlineFunction(CB, IFI).isSuccess()) {
// The call to InlineFunction erases I, so we can't pass it here.		// The call to InlineFunction erases I, so we can't pass it here.
ORE->emit(OptimizationRemark(CSINLINE_DEBUG, "InlineSuccess", DLoc, BB)		emitInlinedInto(ORE, DLoc, BB, CalledFunction, *BB->getParent(), Cost,
<< "inlined callee '" << ore::NV("Callee", CalledFunction)		true, CSINLINE_DEBUG);
<< "' into '" << ore::NV("Caller", BB->getParent()) << "'");
return true;		return true;
}		}
return false;		return false;
}		}

bool SampleProfileLoader::shouldInlineColdCallee(CallBase &CallInst) {		bool SampleProfileLoader::shouldInlineColdCallee(CallBase &CallInst) {
if (!ProfileSizeInline)		if (!ProfileSizeInline)
return false;		return false;
▲ Show 20 Lines • Show All 1,066 Lines • Show Last 20 Lines

llvm/test/Transforms/Inline/optimization-remarks-passed-yaml.ll

	Show All 16 Lines
	; is the input:			; is the input:

	; 1 int foo() { return 1; }			; 1 int foo() { return 1; }
	; 2			; 2
	; 3 int bar() {			; 3 int bar() {
	; 4 return foo();			; 4 return foo();
	; 5 }			; 5 }

	; CHECK: remark: /tmp/s.c:4:10: foo inlined into bar with (cost={{[0-9\-]+}}, threshold={{[0-9]+}}) (hotness: 30)			; CHECK: remark: /tmp/s.c:4:10: foo inlined into bar with (cost={{[0-9\-]+}}, threshold={{[0-9]+}}) at callsite bar:1 (hotness: 30)

	; YAML: --- !Passed			; YAML: --- !Passed
	; YAML-NEXT: Pass: inline			; YAML-NEXT: Pass: inline
	; YAML-NEXT: Name: Inlined			; YAML-NEXT: Name: Inlined
	; YAML-NEXT: DebugLoc: { File: '/tmp/s.c', Line: 4, Column: 10 }			; YAML-NEXT: DebugLoc: { File: '/tmp/s.c', Line: 4, Column: 10 }
	; YAML-NEXT: Function: bar			; YAML-NEXT: Function: bar
	; YAML-NEXT: Hotness: 30			; YAML-NEXT: Hotness: 30
	; YAML-NEXT: Args:			; YAML-NEXT: Args:
	; YAML-NEXT: - Callee: foo			; YAML-NEXT: - Callee: foo
	; YAML-NEXT: DebugLoc: { File: '/tmp/s.c', Line: 1, Column: 0 }			; YAML-NEXT: DebugLoc: { File: '/tmp/s.c', Line: 1, Column: 0 }
	; YAML-NEXT: - String: ' inlined into '			; YAML-NEXT: - String: ' inlined into '
	; YAML-NEXT: - Caller: bar			; YAML-NEXT: - Caller: bar
	; YAML-NEXT: DebugLoc: { File: '/tmp/s.c', Line: 3, Column: 0 }			; YAML-NEXT: DebugLoc: { File: '/tmp/s.c', Line: 3, Column: 0 }
	; YAML-NEXT: - String: ' with '			; YAML-NEXT: - String: ' with '
	; YAML-NEXT: - String: '(cost='			; YAML-NEXT: - String: '(cost='
	; YAML-NEXT: - Cost: '{{[0-9\-]+}}'			; YAML-NEXT: - Cost: '{{[0-9\-]+}}'
	; YAML-NEXT: - String: ', threshold='			; YAML-NEXT: - String: ', threshold='
	; YAML-NEXT: - Threshold: '{{[0-9]+}}'			; YAML-NEXT: - Threshold: '{{[0-9]+}}'
	; YAML-NEXT: - String: ')'			; YAML-NEXT: - String: ')'
				; YAML-NEXT: - String: ' at callsite '
				; YAML-NEXT: - String: bar
				; YAML-NEXT: - String: ':'
				; YAML-NEXT: - Line: '1'
	; YAML-NEXT: ...			; YAML-NEXT: ...

	; ModuleID = '/tmp/s.c'			; ModuleID = '/tmp/s.c'
	source_filename = "/tmp/s.c"			source_filename = "/tmp/s.c"
	target datalayout = "e-m:o-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:o-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-apple-macosx10.11.0"			target triple = "x86_64-apple-macosx10.11.0"

	; Function Attrs: nounwind ssp uwtable			; Function Attrs: nounwind ssp uwtable
	Show All 32 Lines

llvm/test/Transforms/SampleProfile/Inputs/remarks.prof

	main:623868:0			main:623868:0
	0: 0			0: 0
	0: _Z3foov:623868			0: _Z3foov:623868
	3: 18346			3: 18346
	4: 0			4: 0
	6: 19475			6: 19475
				6: rand:12093
				0: 11203
	2: 18305			2: 18305

llvm/test/Transforms/SampleProfile/remarks.ll

	Show All 15 Lines
	; 9 sum += -i * rand();			; 9 sum += -i * rand();
	; 10 return sum;			; 10 return sum;
	; 11 }			; 11 }
	; 12			; 12
	; 13 int main() { return foo() > 0; }			; 13 int main() { return foo() > 0; }

	; We are expecting foo() to be inlined in main() (almost all the cycles are			; We are expecting foo() to be inlined in main() (almost all the cycles are
	; spent inside foo).			; spent inside foo).
	; CHECK: remark: remarks.cc:13:21: inlined callee '_Z3foov' into 'main'			; CHECK: remark: remarks.cc:13:21: _Z3foov inlined into main to match profiling context with (cost=130, threshold=225) at callsite main:0
				; CHECK: remark: remarks.cc:9:19: rand inlined into main to match profiling context with (cost=always): always inline attribute at callsite _Z3foov:6 @ main:0

	; The back edge for the loop is the hottest edge in the loop subgraph.			; The back edge for the loop is the hottest edge in the loop subgraph.
	; CHECK: remark: remarks.cc:6:9: most popular destination for conditional branches at remarks.cc:5:3			; CHECK: remark: remarks.cc:6:9: most popular destination for conditional branches at remarks.cc:5:3

	; The predicate almost always chooses the 'else' branch.			; The predicate almost always chooses the 'else' branch.
	; CHECK: remark: remarks.cc:9:15: most popular destination for conditional branches at remarks.cc:6:9			; CHECK: remark: remarks.cc:9:15: most popular destination for conditional branches at remarks.cc:6:9

	; Checking to see if YAML file is generated and contains remarks			; Checking to see if YAML file is generated and contains remarks
	;YAML: --- !Passed			;YAML: --- !Passed
	;YAML-NEXT: Pass: sample-profile-inline			;YAML-NEXT: Pass: sample-profile-inline
	;YAML-NEXT: Name: InlineSuccess			;YAML-NEXT: Name: Inlined
	;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 13, Column: 21 }			;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 13, Column: 21 }
	;YAML-NEXT: Function: main			;YAML-NEXT: Function: main
	;YAML-NEXT: Args:			;YAML-NEXT: Args:
	;YAML-NEXT: - String: 'inlined callee '''
	;YAML-NEXT: - Callee: _Z3foov			;YAML-NEXT: - Callee: _Z3foov
	;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 3, Column: 0 }			;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 3, Column: 0 }
	;YAML-NEXT: - String: ''' into '''			;YAML-NEXT: - String: ' inlined into '
	;YAML-NEXT: - Caller: main			;YAML-NEXT: - Caller: main
	;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 13, Column: 0 }			;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 13, Column: 0 }
	;YAML-NEXT: - String: ''''			;YAML-NEXT: - String: ' to match profiling context'
				;YAML-NEXT: - String: ' with '
				;YAML-NEXT: - String: '(cost='
				;YAML-NEXT: - Cost: '130'
				;YAML-NEXT: - String: ', threshold='
				;YAML-NEXT: - Threshold: '225'
				;YAML-NEXT: - String: ')'
				;YAML-NEXT: - String: ' at callsite '
				;YAML-NEXT: - String: main
				;YAML-NEXT: - String: ':'
				;YAML-NEXT: - Line: '0'
	;YAML-NEXT: ...			;YAML-NEXT: ...
				;YAML: --- !Passed
				;YAML-NEXT: Pass: sample-profile-inline
				;YAML-NEXT: Name: AlwaysInline
				;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 9, Column: 19 }
				;YAML-NEXT: Function: main
				;YAML-NEXT: Args:
				;YAML-NEXT: - Callee: rand
				;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 90, Column: 0 }
				;YAML-NEXT: - String: ' inlined into '
				;YAML-NEXT: - Caller: main
				;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 13, Column: 0 }
				;YAML-NEXT: - String: ' to match profiling context'
				;YAML-NEXT: - String: ' with '
				;YAML-NEXT: - String: '(cost=always)'
				;YAML-NEXT: - String: ': '
				;YAML-NEXT: - Reason: always inline attribute
				;YAML-NEXT: - String: ' at callsite '
				;YAML-NEXT: - String: _Z3foov
				;YAML-NEXT: - String: ':'
				;YAML-NEXT: - Line: '6'
				;YAML-NEXT: - String: ' @ '
				;YAML-NEXT: - String: main
				;YAML-NEXT: - String: ':'
				;YAML-NEXT: - Line: '0'
	;YAML: --- !Analysis			;YAML: --- !Analysis
	;YAML-NEXT: Pass: sample-profile			;YAML-NEXT: Pass: sample-profile
	;YAML-NEXT: Name: AppliedSamples			;YAML-NEXT: Name: AppliedSamples
	;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 5, Column: 8 }			;YAML-NEXT: DebugLoc: { File: remarks.cc, Line: 5, Column: 8 }
	;YAML-NEXT: Function: main			;YAML-NEXT: Function: main
	;YAML-NEXT: Args:			;YAML-NEXT: Args:
	;YAML-NEXT: - String: 'Applied '			;YAML-NEXT: - String: 'Applied '
	;YAML-NEXT: - NumSamples: '18305'			;YAML-NEXT: - NumSamples: '18305'
	▲ Show 20 Lines • Show All 79 Lines • ▼ Show 20 Lines

	; Function Attrs: nounwind argmemonly			; Function Attrs: nounwind argmemonly
	declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture) #1			declare void @llvm.lifetime.start.p0i8(i64, i8* nocapture) #1

	; Function Attrs: nounwind readnone			; Function Attrs: nounwind readnone
	declare void @llvm.dbg.declare(metadata, metadata, metadata) #2			declare void @llvm.dbg.declare(metadata, metadata, metadata) #2

	; Function Attrs: nounwind			; Function Attrs: nounwind
	declare i32 @rand() #3			define i32 @rand() #3 !dbg !59 {
				ret i32 1
				}

	; Function Attrs: nounwind argmemonly			; Function Attrs: nounwind argmemonly
	declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture) #1			declare void @llvm.lifetime.end.p0i8(i64, i8* nocapture) #1

	; Function Attrs: nounwind uwtable			; Function Attrs: nounwind uwtable
	define i32 @main() #0 !dbg !13 {			define i32 @main() #0 !dbg !13 {
	entry:			entry:
	%retval = alloca i32, align 4			%retval = alloca i32, align 4
	store i32 0, i32* %retval, align 4			store i32 0, i32* %retval, align 4
	%call = call i64 @_Z3foov(), !dbg !56			%call = call i64 @_Z3foov(), !dbg !56
	%cmp = icmp sgt i64 %call, 0, !dbg !57			%cmp = icmp sgt i64 %call, 0, !dbg !57
	%conv = zext i1 %cmp to i32, !dbg !56			%conv = zext i1 %cmp to i32, !dbg !56
	ret i32 %conv, !dbg !58			ret i32 %conv, !dbg !58
	}			}

	attributes #0 = { nounwind uwtable "disable-tail-calls"="false" "less-precise-fpmad"="false" "frame-pointer"="none" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2" "unsafe-fp-math"="false" "use-soft-float"="false" "use-sample-profile" }			attributes #0 = { nounwind uwtable "disable-tail-calls"="false" "less-precise-fpmad"="false" "frame-pointer"="none" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2" "unsafe-fp-math"="false" "use-soft-float"="false" "use-sample-profile" }
	attributes #1 = { nounwind argmemonly }			attributes #1 = { nounwind argmemonly }
	attributes #2 = { nounwind readnone }			attributes #2 = { nounwind readnone }
	attributes #3 = { nounwind "disable-tail-calls"="false" "less-precise-fpmad"="false" "frame-pointer"="none" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2" "unsafe-fp-math"="false" "use-soft-float"="false" }			attributes #3 = { nounwind alwaysinline "disable-tail-calls"="false" "less-precise-fpmad"="false" "frame-pointer"="none" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2" "unsafe-fp-math"="false" "use-soft-float"="false" }
	attributes #4 = { nounwind }			attributes #4 = { nounwind }

	!llvm.dbg.cu = !{!0}			!llvm.dbg.cu = !{!0}
	!llvm.module.flags = !{!16, !17}			!llvm.module.flags = !{!16, !17}
	!llvm.ident = !{!18}			!llvm.ident = !{!18}

	!0 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus, file: !1, producer: "clang version 3.8.0 (trunk 251041) (llvm/trunk 251053)", isOptimized: true, runtimeVersion: 0, emissionKind: NoDebug, enums: !2)			!0 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus, file: !1, producer: "clang version 3.8.0 (trunk 251041) (llvm/trunk 251053)", isOptimized: true, runtimeVersion: 0, emissionKind: NoDebug, enums: !2)
	!1 = !DIFile(filename: "remarks.cc", directory: ".")			!1 = !DIFile(filename: "remarks.cc", directory: ".")
	▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	!51 = !DILocation(line: 5, column: 35, scope: !33)			!51 = !DILocation(line: 5, column: 35, scope: !33)
	!52 = !DILocation(line: 5, column: 3, scope: !33)			!52 = !DILocation(line: 5, column: 3, scope: !33)
	!53 = !DILocation(line: 10, column: 10, scope: !4)			!53 = !DILocation(line: 10, column: 10, scope: !4)
	!54 = !DILocation(line: 11, column: 1, scope: !4)			!54 = !DILocation(line: 11, column: 1, scope: !4)
	!55 = !DILocation(line: 10, column: 3, scope: !4)			!55 = !DILocation(line: 10, column: 3, scope: !4)
	!56 = !DILocation(line: 13, column: 21, scope: !13)			!56 = !DILocation(line: 13, column: 21, scope: !13)
	!57 = !DILocation(line: 13, column: 27, scope: !13)			!57 = !DILocation(line: 13, column: 27, scope: !13)
	!58 = !DILocation(line: 13, column: 14, scope: !13)			!58 = !DILocation(line: 13, column: 14, scope: !13)
				!59 = distinct !DISubprogram(name: "rand", linkageName: "rand", scope: !1, file: !1, line: 90, type: !5, isLocal: false, isDefinition: true, scopeLine: 90, flags: DIFlagPrototyped, isOptimized: true, unit: !0)

This is an archive of the discontinued LLVM Phabricator instance.

[Remarks] Add callsite locations to inline remarksClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 272289

clang/test/Frontend/optimization-remark-with-hotness-new-pm.c

clang/test/Frontend/optimization-remark-with-hotness.c

llvm/include/llvm/Analysis/InlineAdvisor.h

llvm/lib/Analysis/InlineAdvisor.cpp

llvm/lib/Transforms/IPO/SampleProfile.cpp

llvm/test/Transforms/Inline/optimization-remarks-passed-yaml.ll

llvm/test/Transforms/SampleProfile/Inputs/remarks.prof

llvm/test/Transforms/SampleProfile/remarks.ll

[Remarks] Add callsite locations to inline remarks
ClosedPublic