Diff 97862

include/llvm/Analysis/ProfileSummaryInfo.h

Show First 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	private:
void computeThresholds();		void computeThresholds();
// Count thresholds to answer isHotCount and isColdCount queries.		// Count thresholds to answer isHotCount and isColdCount queries.
Optional<uint64_t> HotCountThreshold, ColdCountThreshold;		Optional<uint64_t> HotCountThreshold, ColdCountThreshold;

public:		public:
ProfileSummaryInfo(Module &M) : M(M) {}		ProfileSummaryInfo(Module &M) : M(M) {}
ProfileSummaryInfo(ProfileSummaryInfo &&Arg)		ProfileSummaryInfo(ProfileSummaryInfo &&Arg)
: M(Arg.M), Summary(std::move(Arg.Summary)) {}		: M(Arg.M), Summary(std::move(Arg.Summary)) {}
		/// Returns the profile kind for the module.
		static Optional<ProfileSummary::Kind> getKind(const Module *M);
/// Returns the profile count for \p CallInst.		/// Returns the profile count for \p CallInst.
static Optional<uint64_t> getProfileCount(const Instruction *CallInst,		static Optional<uint64_t> getProfileCount(const Instruction *CallInst,
BlockFrequencyInfo *BFI);		BlockFrequencyInfo *BFI,
		ProfileSummary *Summary = nullptr);
		eramanUnsubmitted Not Done Reply Inline Actions A major rationale for adding ProfileSummaryInfo as a separate analysis is to prevent Profilesummary from being directly manipulated, so I believe we shouldn't add an interface that takes ProfileSummary as a parameter. As Dehao mentions below, you anyway get the module from the instruction and get the kind from there, so this is not necessary. My concern there is this becomes unnecessarily expensive as we get an invariant value (kind) many times. eraman: A major rationale for adding ProfileSummaryInfo as a separate analysis is to prevent…
		tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions A major rationale for adding ProfileSummaryInfo as a separate analysis is to prevent Profilesummary from being directly manipulated, so I believe we shouldn't add an interface that takes ProfileSummary as a parameter. Note that this is only passed in when getProfileCount() is called from a ProfileSummaryInfo method (it passes in the Summary object it owns), so I am not sure what the concern is about directly manipulating it? As Dehao mentions below, you anyway get the module from the instruction and get the kind from there, so this is not necessary. My concern there is this becomes unnecessarily expensive as we get an invariant value (kind) many times. Exactly, that's why I don't think we should keep re-finding the profile metadata every time when it is called from the ProfileSummaryInfo. tejohnson: > A major rationale for adding ProfileSummaryInfo as a separate analysis is to prevent…
		eramanUnsubmitted Not Done Reply Inline Actions The concern is you now have a public method that takes ProfileSummaryInfo * (even though no one outside the class passes it). What you can do is have a private helper method that takes a ProfileSummaryInfo . Within the class, call this directly. Note that the class already owns the Summary. External users will still call the public method, where you can get the summary from the instruction and call the private method (with a comment about the overhead). eraman:* The concern is you now have a public method that takes ProfileSummaryInfo * (even though no one…
		tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions Done tejohnson: Done
/// \brief Returns true if \p F has hot function entry.		/// \brief Returns true if \p F has hot function entry.
bool isFunctionEntryHot(const Function *F);		bool isFunctionEntryHot(const Function *F);
/// Returns true if \p F has hot function entry or hot call edge.		/// Returns true if \p F has hot function entry or hot call edge.
bool isFunctionHotInCallGraph(const Function *F);		bool isFunctionHotInCallGraph(const Function *F);
/// \brief Returns true if \p F has cold function entry.		/// \brief Returns true if \p F has cold function entry.
bool isFunctionEntryCold(const Function *F);		bool isFunctionEntryCold(const Function *F);
/// Returns true if \p F has cold function entry or cold call edge.		/// Returns true if \p F has cold function entry or cold call edge.
bool isFunctionColdInCallGraph(const Function *F);		bool isFunctionColdInCallGraph(const Function *F);
▲ Show 20 Lines • Show All 59 Lines • Show Last 20 Lines

include/llvm/IR/Module.h

Show First 20 Lines • Show All 798 Lines • ▼ Show 20 Lines	/// @}

/// @name Utility functions for querying and setting PGO summary		/// @name Utility functions for querying and setting PGO summary
/// @{		/// @{

/// \brief Attach profile summary metadata to this module.		/// \brief Attach profile summary metadata to this module.
void setProfileSummary(Metadata *M);		void setProfileSummary(Metadata *M);

/// \brief Returns profile summary metadata		/// \brief Returns profile summary metadata
Metadata *getProfileSummary();		Metadata *getProfileSummary() const;
/// @}		/// @}

/// Take ownership of the given memory buffer.		/// Take ownership of the given memory buffer.
void setOwnedMemoryBuffer(std::unique_ptr<MemoryBuffer> MB);		void setOwnedMemoryBuffer(std::unique_ptr<MemoryBuffer> MB);
};		};

/// \brief Given "llvm.used" or "llvm.compiler.used" as a global name, collect		/// \brief Given "llvm.used" or "llvm.compiler.used" as a global name, collect
/// the initializer elements of that global in Set and return the global itself.		/// the initializer elements of that global in Set and return the global itself.
Show All 23 Lines

include/llvm/IR/ProfileSummary.h

Show All 12 Lines

#ifndef LLVM_SUPPORT_PROFILE_SUMMARY_H		#ifndef LLVM_SUPPORT_PROFILE_SUMMARY_H
#define LLVM_SUPPORT_PROFILE_SUMMARY_H		#define LLVM_SUPPORT_PROFILE_SUMMARY_H

#include <cstdint>		#include <cstdint>
#include <utility>		#include <utility>
#include <vector>		#include <vector>

		#include "llvm/ADT/Optional.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"

namespace llvm {		namespace llvm {

class LLVMContext;		class LLVMContext;
class Metadata;		class Metadata;
class MDTuple;		class MDTuple;
class MDNode;		class MDNode;
Show All 36 Lines	ProfileSummary(Kind K, SummaryEntryVector DetailedSummary,
uint32_t NumCounts, uint32_t NumFunctions)		uint32_t NumCounts, uint32_t NumFunctions)
: PSK(K), DetailedSummary(std::move(DetailedSummary)),		: PSK(K), DetailedSummary(std::move(DetailedSummary)),
TotalCount(TotalCount), MaxCount(MaxCount),		TotalCount(TotalCount), MaxCount(MaxCount),
MaxInternalCount(MaxInternalCount), MaxFunctionCount(MaxFunctionCount),		MaxInternalCount(MaxInternalCount), MaxFunctionCount(MaxFunctionCount),
NumCounts(NumCounts), NumFunctions(NumFunctions) {}		NumCounts(NumCounts), NumFunctions(NumFunctions) {}
Kind getKind() const { return PSK; }		Kind getKind() const { return PSK; }
/// \brief Return summary information as metadata.		/// \brief Return summary information as metadata.
Metadata *getMD(LLVMContext &Context);		Metadata *getMD(LLVMContext &Context);
		/// Return the profile kind from metadata.
		static Optional<ProfileSummary::Kind> getKindFromMD(Metadata *MD);
/// \brief Construct profile summary from metdata.		/// \brief Construct profile summary from metdata.
static ProfileSummary getFromMD(Metadata MD);		static ProfileSummary getFromMD(Metadata MD);
SummaryEntryVector &getDetailedSummary() { return DetailedSummary; }		SummaryEntryVector &getDetailedSummary() { return DetailedSummary; }
uint32_t getNumFunctions() { return NumFunctions; }		uint32_t getNumFunctions() { return NumFunctions; }
uint64_t getMaxFunctionCount() { return MaxFunctionCount; }		uint64_t getMaxFunctionCount() { return MaxFunctionCount; }
uint32_t getNumCounts() { return NumCounts; }		uint32_t getNumCounts() { return NumCounts; }
uint64_t getTotalCount() { return TotalCount; }		uint64_t getTotalCount() { return TotalCount; }
uint64_t getMaxCount() { return MaxCount; }		uint64_t getMaxCount() { return MaxCount; }
uint64_t getMaxInternalCount() { return MaxInternalCount; }		uint64_t getMaxInternalCount() { return MaxInternalCount; }
};		};

} // end namespace llvm		} // end namespace llvm
#endif		#endif

lib/Analysis/ProfileSummaryInfo.cpp

Show First 20 Lines • Show All 62 Lines • ▼ Show 20 Lines	if (Summary)
return true;		return true;
auto *SummaryMD = M.getProfileSummary();		auto *SummaryMD = M.getProfileSummary();
if (!SummaryMD)		if (!SummaryMD)
return false;		return false;
Summary.reset(ProfileSummary::getFromMD(SummaryMD));		Summary.reset(ProfileSummary::getFromMD(SummaryMD));
return true;		return true;
}		}

Optional<uint64_t>		Optional<ProfileSummary::Kind> ProfileSummaryInfo::getKind(const Module *M) {
ProfileSummaryInfo::getProfileCount(const Instruction *Inst,		auto *SummaryMD = M->getProfileSummary();
BlockFrequencyInfo *BFI) {		if (!SummaryMD)
		return None;
		return ProfileSummary::getKindFromMD(SummaryMD);
		}

		Optional<uint64_t> ProfileSummaryInfo::getProfileCount(
		const Instruction Inst, BlockFrequencyInfo BFI, ProfileSummary *Summary) {
if (!Inst)		if (!Inst)
return None;		return None;
assert((isa<CallInst>(Inst) \|\| isa<InvokeInst>(Inst)) &&		assert((isa<CallInst>(Inst) \|\| isa<InvokeInst>(Inst)) &&
"We can only get profile count for call/invoke instruction.");		"We can only get profile count for call/invoke instruction.");
// Check if there is a profile metadata on the instruction. If it is present,		bool IsSamplePGO = false;
// determine hotness solely based on that.		if (Summary)
		IsSamplePGO = Summary->getKind() == ProfileSummary::PSK_Sample;
		else if (auto Kind = getKind(Inst->getModule()))
		IsSamplePGO = Kind.getValue() == ProfileSummary::PSK_Sample;
		danielcdhUnsubmitted Not Done Reply Inline Actions Looks like you can always get the Kind from Inst, why would you want to pass in the Summary? danielcdh: Looks like you can always get the Kind from Inst, why would you want to pass in the Summary?
		tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions It involves doing some work that is already done when the summary is available, so this was added just for the case where it is invoked without a summary. tejohnson: It involves doing some work that is already done when the summary is available, so this was…
		if (IsSamplePGO) {
		// In sample PGO mode, check if there is a profile metadata on the
		// instruction. If it is present, determine hotness solely based on that,
		// since the sampled entry count may not be accurate.
uint64_t TotalCount;		uint64_t TotalCount;
if (Inst->extractProfTotalWeight(TotalCount))		if (Inst->extractProfTotalWeight(TotalCount))
return TotalCount;		return TotalCount;
		}
		eramanUnsubmitted Not Done Reply Inline Actions I wonder if we should check if Summary is non-null and then the summary kind is PSK_Sample. There is one test case down below (inliner count update) where you had to attach the summary to the test case. Is there any reason the summary has to be present to get the count based on entry count and block frequency? eraman: I wonder if we should check if Summary is non-null and then the summary kind is PSK_Sample.
		tejohnsonAuthorUnsubmitted Not Done Reply Inline Actions Do you mean only do the metadata-based hotness when computeSummary() returns true and the kind is PSK_Sample? I.e. if !computeSummary(), then assume instrumentation based? I could do that, it would mean a few less test changes. tejohnson: Do you mean only do the metadata-based hotness when computeSummary() returns true and the kind…
		eramanUnsubmitted Not Done Reply Inline Actions Yes, that's what I should've written. As long as we have function entry counts, we should return the profile count. eraman: Yes, that's what I should've written. As long as we have function entry counts, we should…
if (BFI)		if (BFI)
return BFI->getBlockProfileCount(Inst->getParent());		return BFI->getBlockProfileCount(Inst->getParent());
return None;		return None;
}		}

/// Returns true if the function's entry is hot. If it returns false, it		/// Returns true if the function's entry is hot. If it returns false, it
/// either means it is not hot or it is unknown whether it is hot or not (for		/// either means it is not hot or it is unknown whether it is hot or not (for
/// example, no profile data is available).		/// example, no profile data is available).
Show All 16 Lines	bool ProfileSummaryInfo::isFunctionHotInCallGraph(const Function *F) {
if (auto FunctionCount = F->getEntryCount())		if (auto FunctionCount = F->getEntryCount())
if (isHotCount(FunctionCount.getValue()))		if (isHotCount(FunctionCount.getValue()))
return true;		return true;

uint64_t TotalCallCount = 0;		uint64_t TotalCallCount = 0;
for (const auto &BB : *F)		for (const auto &BB : *F)
for (const auto &I : BB)		for (const auto &I : BB)
if (isa<CallInst>(I) \|\| isa<InvokeInst>(I))		if (isa<CallInst>(I) \|\| isa<InvokeInst>(I))
if (auto CallCount = getProfileCount(&I, nullptr))		if (auto CallCount = getProfileCount(&I, nullptr, Summary.get()))
TotalCallCount += CallCount.getValue();		TotalCallCount += CallCount.getValue();
return isHotCount(TotalCallCount);		return isHotCount(TotalCallCount);
}		}

/// Returns true if the function's entry and total call edge count is cold.		/// Returns true if the function's entry and total call edge count is cold.
/// If it returns false, it either means it is not cold or it is unknown		/// If it returns false, it either means it is not cold or it is unknown
/// whether it is cold or not (for example, no profile data is available).		/// whether it is cold or not (for example, no profile data is available).
bool ProfileSummaryInfo::isFunctionColdInCallGraph(const Function *F) {		bool ProfileSummaryInfo::isFunctionColdInCallGraph(const Function *F) {
if (!F \|\| !computeSummary())		if (!F \|\| !computeSummary())
return false;		return false;
if (auto FunctionCount = F->getEntryCount())		if (auto FunctionCount = F->getEntryCount())
if (!isColdCount(FunctionCount.getValue()))		if (!isColdCount(FunctionCount.getValue()))
return false;		return false;

uint64_t TotalCallCount = 0;		uint64_t TotalCallCount = 0;
for (const auto &BB : *F)		for (const auto &BB : *F)
for (const auto &I : BB)		for (const auto &I : BB)
if (isa<CallInst>(I) \|\| isa<InvokeInst>(I))		if (isa<CallInst>(I) \|\| isa<InvokeInst>(I))
if (auto CallCount = getProfileCount(&I, nullptr))		if (auto CallCount = getProfileCount(&I, nullptr, Summary.get()))
TotalCallCount += CallCount.getValue();		TotalCallCount += CallCount.getValue();
return isColdCount(TotalCallCount);		return isColdCount(TotalCallCount);
}		}

/// Returns true if the function's entry is a cold. If it returns false, it		/// Returns true if the function's entry is a cold. If it returns false, it
/// either means it is not cold or it is unknown whether it is cold or not (for		/// either means it is not cold or it is unknown whether it is cold or not (for
/// example, no profile data is available).		/// example, no profile data is available).
bool ProfileSummaryInfo::isFunctionEntryCold(const Function *F) {		bool ProfileSummaryInfo::isFunctionEntryCold(const Function *F) {
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines
bool ProfileSummaryInfo::isColdBB(const BasicBlock *B,		bool ProfileSummaryInfo::isColdBB(const BasicBlock *B,
BlockFrequencyInfo *BFI) {		BlockFrequencyInfo *BFI) {
auto Count = BFI->getBlockProfileCount(B);		auto Count = BFI->getBlockProfileCount(B);
return Count && isColdCount(*Count);		return Count && isColdCount(*Count);
}		}

bool ProfileSummaryInfo::isHotCallSite(const CallSite &CS,		bool ProfileSummaryInfo::isHotCallSite(const CallSite &CS,
BlockFrequencyInfo *BFI) {		BlockFrequencyInfo *BFI) {
auto C = getProfileCount(CS.getInstruction(), BFI);		auto C = getProfileCount(CS.getInstruction(), BFI, Summary.get());
return C && isHotCount(*C);		return C && isHotCount(*C);
}		}

bool ProfileSummaryInfo::isColdCallSite(const CallSite &CS,		bool ProfileSummaryInfo::isColdCallSite(const CallSite &CS,
BlockFrequencyInfo *BFI) {		BlockFrequencyInfo *BFI) {
auto C = getProfileCount(CS.getInstruction(), BFI);		auto C = getProfileCount(CS.getInstruction(), BFI, Summary.get());
return C && isColdCount(*C);		return C && isColdCount(*C);
}		}

INITIALIZE_PASS(ProfileSummaryInfoWrapperPass, "profile-summary-info",		INITIALIZE_PASS(ProfileSummaryInfoWrapperPass, "profile-summary-info",
"Profile summary info", false, true)		"Profile summary info", false, true)

ProfileSummaryInfoWrapperPass::ProfileSummaryInfoWrapperPass()		ProfileSummaryInfoWrapperPass::ProfileSummaryInfoWrapperPass()
: ImmutablePass(ID) {		: ImmutablePass(ID) {
Show All 36 Lines

lib/IR/Module.cpp

	Show First 20 Lines • Show All 477 Lines • ▼ Show 20 Lines
	void Module::setPIELevel(PIELevel::Level PL) {			void Module::setPIELevel(PIELevel::Level PL) {
	addModuleFlag(ModFlagBehavior::Error, "PIE Level", PL);			addModuleFlag(ModFlagBehavior::Error, "PIE Level", PL);
	}			}

	void Module::setProfileSummary(Metadata *M) {			void Module::setProfileSummary(Metadata *M) {
	addModuleFlag(ModFlagBehavior::Error, "ProfileSummary", M);			addModuleFlag(ModFlagBehavior::Error, "ProfileSummary", M);
	}			}

	Metadata *Module::getProfileSummary() {			Metadata *Module::getProfileSummary() const {
	return getModuleFlag("ProfileSummary");			return getModuleFlag("ProfileSummary");
	}			}

	void Module::setOwnedMemoryBuffer(std::unique_ptr<MemoryBuffer> MB) {			void Module::setOwnedMemoryBuffer(std::unique_ptr<MemoryBuffer> MB) {
	OwnedMemoryBuffer = std::move(MB);			OwnedMemoryBuffer = std::move(MB);
	}			}

	GlobalVariable *llvm::collectUsedGlobalVariables(			GlobalVariable *llvm::collectUsedGlobalVariables(
	Show All 13 Lines

lib/IR/ProfileSummary.cpp

Show First 20 Lines • Show All 137 Lines • ▼ Show 20 Lines	if (!Op0 \|\| !Op1 \|\| !Op2)
return false;		return false;
Summary.emplace_back(cast<ConstantInt>(Op0->getValue())->getZExtValue(),		Summary.emplace_back(cast<ConstantInt>(Op0->getValue())->getZExtValue(),
cast<ConstantInt>(Op1->getValue())->getZExtValue(),		cast<ConstantInt>(Op1->getValue())->getZExtValue(),
cast<ConstantInt>(Op2->getValue())->getZExtValue());		cast<ConstantInt>(Op2->getValue())->getZExtValue());
}		}
return true;		return true;
}		}

		Optional<ProfileSummary::Kind> ProfileSummary::getKindFromMD(Metadata *MD) {
		if (!MD)
		return None;
		if (!isa<MDTuple>(MD))
		return None;
		MDTuple *Tuple = cast<MDTuple>(MD);
		if (Tuple->getNumOperands() != 8)
		return None;

		auto &FormatMD = Tuple->getOperand(0);
		if (isKeyValuePair(dyn_cast_or_null<MDTuple>(FormatMD), "ProfileFormat",
		"SampleProfile"))
		return PSK_Sample;
		else if (isKeyValuePair(dyn_cast_or_null<MDTuple>(FormatMD), "ProfileFormat",
		"InstrProf"))
		return PSK_Instr;

		return None;
		}

ProfileSummary ProfileSummary::getFromMD(Metadata MD) {		ProfileSummary ProfileSummary::getFromMD(Metadata MD) {
if (!MD)		if (!MD)
return nullptr;		return nullptr;
if (!isa<MDTuple>(MD))		if (!isa<MDTuple>(MD))
return nullptr;		return nullptr;
MDTuple *Tuple = cast<MDTuple>(MD);		MDTuple *Tuple = cast<MDTuple>(MD);
if (Tuple->getNumOperands() != 8)		if (Tuple->getNumOperands() != 8)
return nullptr;		return nullptr;
Show All 38 Lines

test/Bitcode/thinlto-function-summary-callgraph-profile-summary.ll

This file was copied to test/Bitcode/thinlto-function-summary-callgraph-sample-profile-summary.ll.

	Show All 23 Lines
	; "none2"			; "none2"
	; CHECK-NEXT: <FUNCTION op0=37 op1=5			; CHECK-NEXT: <FUNCTION op0=37 op1=5
	; "none3"			; "none3"
	; CHECK-NEXT: <FUNCTION op0=42 op1=5			; CHECK-NEXT: <FUNCTION op0=42 op1=5
	; CHECK-LABEL: <GLOBALVAL_SUMMARY_BLOCK			; CHECK-LABEL: <GLOBALVAL_SUMMARY_BLOCK
	; CHECK-NEXT: <VERSION			; CHECK-NEXT: <VERSION
	; CHECK-NEXT: <VALUE_GUID op0=25 op1=123/>			; CHECK-NEXT: <VALUE_GUID op0=25 op1=123/>
	; op4=hot1 op6=cold op8=hot2 op10=hot4 op12=none1 op14=hot3 op16=none2 op18=none3 op20=123			; op4=hot1 op6=cold op8=hot2 op10=hot4 op12=none1 op14=hot3 op16=none2 op18=none3 op20=123
	; CHECK-NEXT: <PERMODULE_PROFILE {{.*}} op4=1 op5=3 op6=5 op7=1 op8=2 op9=3 op10=4 op11=3 op12=6 op13=2 op14=3 op15=3 op16=7 op17=2 op18=8 op19=2 op20=25 op21=3/>			; CHECK-NEXT: <PERMODULE_PROFILE {{.*}} op4=1 op5=3 op6=5 op7=1 op8=2 op9=3 op10=4 op11=1 op12=6 op13=2 op14=3 op15=3 op16=7 op17=2 op18=8 op19=2 op20=25 op21=3/>
	; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>

	; CHECK: <STRTAB_BLOCK			; CHECK: <STRTAB_BLOCK
	; CHECK-NEXT: blob data = 'hot_functionhot1hot2hot3hot4coldnone1none2none3'			; CHECK-NEXT: blob data = 'hot_functionhot1hot2hot3hot4coldnone1none2none3'

	; COMBINED: <GLOBALVAL_SUMMARY_BLOCK			; COMBINED: <GLOBALVAL_SUMMARY_BLOCK
	; COMBINED-NEXT: <VERSION			; COMBINED-NEXT: <VERSION
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	▲ Show 20 Lines • Show All 81 Lines • Show Last 20 Lines

test/Bitcode/thinlto-function-summary-callgraph-sample-profile-summary.ll

This file was copied from test/Bitcode/thinlto-function-summary-callgraph-profile-summary.ll.

	Show First 20 Lines • Show All 100 Lines • ▼ Show 20 Lines



	!llvm.module.flags = !{!1}			!llvm.module.flags = !{!1}
	!20 = !{!"function_entry_count", i64 110, i64 123}			!20 = !{!"function_entry_count", i64 110, i64 123}

	!1 = !{i32 1, !"ProfileSummary", !2}			!1 = !{i32 1, !"ProfileSummary", !2}
	!2 = !{!3, !4, !5, !6, !7, !8, !9, !10}			!2 = !{!3, !4, !5, !6, !7, !8, !9, !10}
	!3 = !{!"ProfileFormat", !"InstrProf"}			!3 = !{!"ProfileFormat", !"SampleProfile"}
	!4 = !{!"TotalCount", i64 10000}			!4 = !{!"TotalCount", i64 10000}
	!5 = !{!"MaxCount", i64 10}			!5 = !{!"MaxCount", i64 10}
	!6 = !{!"MaxInternalCount", i64 1}			!6 = !{!"MaxInternalCount", i64 1}
	!7 = !{!"MaxFunctionCount", i64 1000}			!7 = !{!"MaxFunctionCount", i64 1000}
	!8 = !{!"NumCounts", i64 3}			!8 = !{!"NumCounts", i64 3}
	!9 = !{!"NumFunctions", i64 3}			!9 = !{!"NumFunctions", i64 3}
	!10 = !{!"DetailedSummary", !11}			!10 = !{!"DetailedSummary", !11}
	!11 = !{!12, !13, !14}			!11 = !{!12, !13, !14}
	!12 = !{i32 10000, i64 100, i32 1}			!12 = !{i32 10000, i64 100, i32 1}
	!13 = !{i32 999000, i64 100, i32 1}			!13 = !{i32 999000, i64 100, i32 1}
	!14 = !{i32 999999, i64 1, i32 2}			!14 = !{i32 999999, i64 1, i32 2}
	!15 = !{!"branch_weights", i32 100}			!15 = !{!"branch_weights", i32 100}

test/Transforms/CodeGenPrepare/section-samplepgo.ll

This file was copied from test/Transforms/CodeGenPrepare/section.ll.

Show All 33 Lines	define void @cold_func() !prof !16 {
ret void		ret void
}		}

; CHECK: ![[HOT_ID]] = !{!"function_section_prefix", !".hot"}		; CHECK: ![[HOT_ID]] = !{!"function_section_prefix", !".hot"}
; CHECK: ![[COLD_ID]] = !{!"function_section_prefix", !".cold"}		; CHECK: ![[COLD_ID]] = !{!"function_section_prefix", !".cold"}
!llvm.module.flags = !{!1}		!llvm.module.flags = !{!1}
!1 = !{i32 1, !"ProfileSummary", !2}		!1 = !{i32 1, !"ProfileSummary", !2}
!2 = !{!3, !4, !5, !6, !7, !8, !9, !10}		!2 = !{!3, !4, !5, !6, !7, !8, !9, !10}
!3 = !{!"ProfileFormat", !"InstrProf"}		!3 = !{!"ProfileFormat", !"SampleProfile"}
!4 = !{!"TotalCount", i64 10000}		!4 = !{!"TotalCount", i64 10000}
!5 = !{!"MaxCount", i64 1000}		!5 = !{!"MaxCount", i64 1000}
!6 = !{!"MaxInternalCount", i64 1}		!6 = !{!"MaxInternalCount", i64 1}
!7 = !{!"MaxFunctionCount", i64 1000}		!7 = !{!"MaxFunctionCount", i64 1000}
!8 = !{!"NumCounts", i64 3}		!8 = !{!"NumCounts", i64 3}
!9 = !{!"NumFunctions", i64 3}		!9 = !{!"NumFunctions", i64 3}
!10 = !{!"DetailedSummary", !11}		!10 = !{!"DetailedSummary", !11}
!11 = !{!12, !13, !14}		!11 = !{!12, !13, !14}
!12 = !{i32 10000, i64 100, i32 1}		!12 = !{i32 10000, i64 100, i32 1}
!13 = !{i32 999000, i64 100, i32 1}		!13 = !{i32 999000, i64 100, i32 1}
!14 = !{i32 999999, i64 1, i32 2}		!14 = !{i32 999999, i64 1, i32 2}
!15 = !{!"function_entry_count", i64 1000}		!15 = !{!"function_entry_count", i64 1000}
!16 = !{!"function_entry_count", i64 1}		!16 = !{!"function_entry_count", i64 1}
!17 = !{!"branch_weights", i32 80}		!17 = !{!"branch_weights", i32 80}
!18 = !{!"branch_weights", i32 1}		!18 = !{!"branch_weights", i32 1}

test/Transforms/CodeGenPrepare/section.ll

This file was copied to test/Transforms/CodeGenPrepare/section-samplepgo.ll.

	; RUN: opt < %s -codegenprepare -S \| FileCheck %s			; RUN: opt < %s -codegenprepare -S \| FileCheck %s

	target triple = "x86_64-pc-linux-gnu"			target triple = "x86_64-pc-linux-gnu"

	; This tests that hot/cold functions get correct section prefix assigned			; This tests that hot/cold functions get correct section prefix assigned

	; CHECK: hot_func{{.*}}!section_prefix ![[HOT_ID:[0-9]+]]			; CHECK: hot_func{{.*}}!section_prefix ![[HOT_ID:[0-9]+]]
	; The entry is hot			; The entry is hot
	define void @hot_func() !prof !15 {			define void @hot_func() !prof !15 {
	ret void			ret void
	}			}

	; CHECK: hot_call_func{{.*}}!section_prefix ![[HOT_ID]]			; For instrumentation based PGO, we should only look at entry counts,
	; The sum of 2 callsites are hot			; not call site VP metadata (which can exist on value profiled memcpy,
	define void @hot_call_func() !prof !16 {			; or possibly left behind after static analysis based devirtualization).
				; CHECK: cold_func1{{.*}}!section_prefix ![[COLD_ID:[0-9]+]]
				define void @cold_func1() !prof !16 {
	call void @hot_func(), !prof !17			call void @hot_func(), !prof !17
	call void @hot_func(), !prof !17			call void @hot_func(), !prof !17
	ret void			ret void
	}			}

	; CHECK-NOT: normal_func{{.*}}!section_prefix			; CHECK: cold_func2{{.*}}!section_prefix
	; The sum of all callsites are neither hot or cold			define void @cold_func2() !prof !16 {
	define void @normal_func() !prof !16 {
	call void @hot_func(), !prof !17			call void @hot_func(), !prof !17
	call void @hot_func(), !prof !18			call void @hot_func(), !prof !18
	call void @hot_func(), !prof !18			call void @hot_func(), !prof !18
	ret void			ret void
	}			}

	; CHECK: cold_func{{.*}}!section_prefix ![[COLD_ID:[0-9]+]]			; CHECK: cold_func3{{.*}}!section_prefix ![[COLD_ID]]
	; The entry and the callsite are both cold			define void @cold_func3() !prof !16 {
	define void @cold_func() !prof !16 {
	call void @hot_func(), !prof !18			call void @hot_func(), !prof !18
	ret void			ret void
	}			}

	; CHECK: ![[HOT_ID]] = !{!"function_section_prefix", !".hot"}			; CHECK: ![[HOT_ID]] = !{!"function_section_prefix", !".hot"}
	; CHECK: ![[COLD_ID]] = !{!"function_section_prefix", !".cold"}			; CHECK: ![[COLD_ID]] = !{!"function_section_prefix", !".cold"}
	!llvm.module.flags = !{!1}			!llvm.module.flags = !{!1}
	!1 = !{i32 1, !"ProfileSummary", !2}			!1 = !{i32 1, !"ProfileSummary", !2}
	Show All 17 Lines

test/Transforms/Inline/prof-update.ll

	; RUN: opt < %s -inline -S \| FileCheck %s			; RUN: opt < %s -inline -S \| FileCheck %s
	; Checks if inliner updates branch_weights annotation for call instructions.			; Checks if inliner updates branch_weights annotation for call instructions.

	declare void @ext();			declare void @ext();
	declare void @ext1();			declare void @ext1();

	; CHECK: define void @callee(i32 %n) !prof ![[ENTRY_COUNT:[0-9]*]]			; CHECK: define void @callee(i32 %n) !prof ![[ENTRY_COUNT:[0-9]*]]
	define void @callee(i32 %n) !prof !1 {			define void @callee(i32 %n) !prof !15 {
	%cond = icmp sle i32 %n, 10			%cond = icmp sle i32 %n, 10
	br i1 %cond, label %cond_true, label %cond_false			br i1 %cond, label %cond_true, label %cond_false
	cond_true:			cond_true:
	; ext1 is optimized away, thus not updated.			; ext1 is optimized away, thus not updated.
	; CHECK: call void @ext1(), !prof ![[COUNT_CALLEE1:[0-9]*]]			; CHECK: call void @ext1(), !prof ![[COUNT_CALLEE1:[0-9]*]]
	call void @ext1(), !prof !2			call void @ext1(), !prof !16
	ret void			ret void
	cond_false:			cond_false:
	; ext is cloned and updated.			; ext is cloned and updated.
	; CHECK: call void @ext(), !prof ![[COUNT_CALLEE:[0-9]*]]			; CHECK: call void @ext(), !prof ![[COUNT_CALLEE:[0-9]*]]
	call void @ext(), !prof !2			call void @ext(), !prof !16
	ret void			ret void
	}			}

	; CHECK: define void @caller()			; CHECK: define void @caller()
	define void @caller() {			define void @caller() {
	; CHECK: call void @ext(), !prof ![[COUNT_CALLER:[0-9]*]]			; CHECK: call void @ext(), !prof ![[COUNT_CALLER:[0-9]*]]
	call void @callee(i32 15), !prof !3			call void @callee(i32 15), !prof !17
	ret void			ret void
	}			}

	!llvm.module.flags = !{!0}			!llvm.module.flags = !{!1}
	!0 = !{i32 1, !"MaxFunctionCount", i32 2000}			!1 = !{i32 1, !"ProfileSummary", !2}
	!1 = !{!"function_entry_count", i64 1000}			!2 = !{!3, !4, !5, !6, !7, !8, !9, !10}
	!2 = !{!"branch_weights", i64 2000}			!3 = !{!"ProfileFormat", !"SampleProfile"}
	!3 = !{!"branch_weights", i64 400}			!4 = !{!"TotalCount", i64 10000}
				!5 = !{!"MaxCount", i64 10}
				!6 = !{!"MaxInternalCount", i64 1}
				!7 = !{!"MaxFunctionCount", i64 2000}
				!8 = !{!"NumCounts", i64 2}
				!9 = !{!"NumFunctions", i64 2}
				!10 = !{!"DetailedSummary", !11}
				!11 = !{!12, !13, !14}
				!12 = !{i32 10000, i64 100, i32 1}
				!13 = !{i32 999000, i64 100, i32 1}
				!14 = !{i32 999999, i64 1, i32 2}
				!15 = !{!"function_entry_count", i64 1000}
				!16 = !{!"branch_weights", i64 2000}
				!17 = !{!"branch_weights", i64 400}
	attributes #0 = { alwaysinline }			attributes #0 = { alwaysinline }
	; CHECK: ![[ENTRY_COUNT]] = !{!"function_entry_count", i64 600}			; CHECK: ![[ENTRY_COUNT]] = !{!"function_entry_count", i64 600}
	; CHECK: ![[COUNT_CALLEE1]] = !{!"branch_weights", i64 2000}			; CHECK: ![[COUNT_CALLEE1]] = !{!"branch_weights", i64 2000}
	; CHECK: ![[COUNT_CALLEE]] = !{!"branch_weights", i32 1200}			; CHECK: ![[COUNT_CALLEE]] = !{!"branch_weights", i32 1200}
	; CHECK: ![[COUNT_CALLER]] = !{!"branch_weights", i32 800}			; CHECK: ![[COUNT_CALLER]] = !{!"branch_weights", i32 800}

unittests/Analysis/ProfileSummaryInfoTest.cpp

Show First 20 Lines • Show All 156 Lines • ▼ Show 20 Lines	TEST_F(ProfileSummaryInfoTest, InstrProf) {
EXPECT_TRUE(PSI.isHotBB(BB3, &BFI));		EXPECT_TRUE(PSI.isHotBB(BB3, &BFI));

CallSite CS1(BB1->getFirstNonPHI());		CallSite CS1(BB1->getFirstNonPHI());
auto *CI2 = BB2->getFirstNonPHI();		auto *CI2 = BB2->getFirstNonPHI();
CallSite CS2(CI2);		CallSite CS2(CI2);

EXPECT_TRUE(PSI.isHotCallSite(CS1, &BFI));		EXPECT_TRUE(PSI.isHotCallSite(CS1, &BFI));
EXPECT_FALSE(PSI.isHotCallSite(CS2, &BFI));		EXPECT_FALSE(PSI.isHotCallSite(CS2, &BFI));

		// Test that adding an MD_prof metadata with a hot count on CS2 does not
		// change its hotness as it has no effect in instrumented profiling.
		MDBuilder MDB(M->getContext());
		CI2->setMetadata(llvm::LLVMContext::MD_prof, MDB.createBranchWeights({400}));
		EXPECT_FALSE(PSI.isHotCallSite(CS2, &BFI));
}		}

TEST_F(ProfileSummaryInfoTest, SampleProf) {		TEST_F(ProfileSummaryInfoTest, SampleProf) {
auto M = makeLLVMModule("SampleProfile");		auto M = makeLLVMModule("SampleProfile");
Function *F = M->getFunction("f");		Function *F = M->getFunction("f");
ProfileSummaryInfo PSI = buildPSI(M.get());		ProfileSummaryInfo PSI = buildPSI(M.get());

BasicBlock &BB0 = F->getEntryBlock();		BasicBlock &BB0 = F->getEntryBlock();
Show All 26 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Restrict call metadata based hotness detection to Sample PGO mode
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 97862

include/llvm/Analysis/ProfileSummaryInfo.h

include/llvm/IR/Module.h

include/llvm/IR/ProfileSummary.h

lib/Analysis/ProfileSummaryInfo.cpp

lib/IR/Module.cpp

lib/IR/ProfileSummary.cpp

test/Bitcode/thinlto-function-summary-callgraph-profile-summary.ll

test/Bitcode/thinlto-function-summary-callgraph-sample-profile-summary.ll

test/Transforms/CodeGenPrepare/section-samplepgo.ll

test/Transforms/CodeGenPrepare/section.ll

test/Transforms/Inline/prof-update.ll

unittests/Analysis/ProfileSummaryInfoTest.cpp

This is an archive of the discontinued LLVM Phabricator instance.

Restrict call metadata based hotness detection to Sample PGO modeClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 97862

include/llvm/Analysis/ProfileSummaryInfo.h

include/llvm/IR/Module.h

include/llvm/IR/ProfileSummary.h

lib/Analysis/ProfileSummaryInfo.cpp

lib/IR/Module.cpp

lib/IR/ProfileSummary.cpp

test/Bitcode/thinlto-function-summary-callgraph-profile-summary.ll

test/Bitcode/thinlto-function-summary-callgraph-sample-profile-summary.ll

test/Transforms/CodeGenPrepare/section-samplepgo.ll

test/Transforms/CodeGenPrepare/section.ll

test/Transforms/Inline/prof-update.ll

unittests/Analysis/ProfileSummaryInfoTest.cpp

Restrict call metadata based hotness detection to Sample PGO mode
ClosedPublic