This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/
-
llvm/
-
Analysis/
-
FuncSpecCost.h
-
ModuleSummaryAnalysis.h
-
IR/
-
ModuleSummaryIndex.h
-
ModuleSummaryIndexYAML.h
-
lib/
-
Analysis/
-
FuncSpecCost.cpp
-
ModuleSummaryAnalysis.cpp
-
AsmParser/
-
LLParser.cpp
-
Bitcode/
-
Reader/
-
BitcodeReader.cpp
-
Writer/
-
BitcodeWriter.cpp
-
IR/
-
AsmWriter.cpp
-
LTO/
-
ThinLTOCodeGenerator.cpp
-
Transforms/IPO/
-
IPO/
-
FunctionImport.cpp
-
ThinLTOBitcodeWriter.cpp
-
WholeProgramDevirt.cpp
-
test/Bitcode/
-
Bitcode/
-
summary_version.ll
-
thinlto-alias.ll
-
thinlto-alias2.ll
-
thinlto-function-summary-callgraph-cast.ll
-
thinlto-function-summary-callgraph-pgo.ll
-
thinlto-function-summary-callgraph-profile-summary.ll
-
thinlto-function-summary-callgraph-relbf.ll
-
thinlto-function-summary-callgraph-sample-profile-summary.ll
-
thinlto-function-summary-callgraph.ll
-
thinlto-function-summary-refgraph.ll

Differential D105966

[WIP] [ThinLTO] Importing function by function specialization
AbandonedPublic

Authored by ChuanqiXu on Jul 14 2021, 12:26 AM.

Download Raw Diff

Details

Reviewers

tejohnson
fhahn
SjoerdMeijer

Summary

This diff is the successive for D105524, which extract the analysis part of function specialization as an independent analysis.

The intention of this diff is to enable ThinLTO to import functions by heuristics for function specialization.
Now the ThinLTO imports functions from other CU by the heuristics for inlining only.
Simply, ThinLTO would import functions whose lines of codes are less than 100 lines by default to enable potential inlining.
This makes sense due to the inlining is the main root cause for IPO in fact.

And function specialization is another important IPO technique which got implemented recently.
My goal was to make ThinLTO to import functions to enable function specialization.
The key point here is that we need to record the information needed for the imported to judge whether it is beneficial to import a function whose LoC is more than 100 lines.

Here is the extra information I planned to add:

Specialize Function Cost. An unsigned number, which is an estimation for the cost to clone one function.
Base Bonus for specializing specific argument. A map from ArgNo to the corresponding base bonus. Here the base bonus means the bonus we could calculate by visiting the function body itself only (in other words, doesn't need to see the call site).
ArgUsage. A map from ArgNo to the extra bonus for each call site. The extra bonus means the bonus we could calculate for specific call site. For example, if we passed a function in the call, we could calculate the possibility that the function could get inlined at the specific call site only.

Before in https://lists.llvm.org/pipermail/llvm-dev/2021-May/150443.html, I raised some problems we may met:

We can't see the funciton body before we import it.
It would repeat traversing the call graph in each translation unit, which is very redundant.
It may specialize functions with the same version, which could make the code size get larger and redundant.

Here the first problem would be solved by D105524 and this patch.
Then the second problem seems not so noisy. Except 502.gcc_r, which CT time increased 30%, we didn't observe any significant CT change in SPEC2017 int.
Finally, the third problem. The average enlarged code size in spec2017 int is 6.7%. And the maximum is 19%.

The numbers may not be so satisfying. And the function specialization pass is not so mature. It's the reason that I marked this patch as [WIP].

The reason why I update the patch is that I want to make sure that I am on the right direction.

Finally, there are some TODOs in this patch:

Didn't implement writer and parser for .ll files.
Need to add and fix tests.

Test Plan: SPEC2017 int rate.

Diff Detail

Unit TestsFailed

	Time	Test
	230 ms	x64 debian > LLVM.Bitcode::thinlto-function-summary-callgraph-sample-profile-summary.ll
	90 ms	x64 debian > LLVM.Bitcode::thinlto-function-summary-refgraph.ll
	140 ms	x64 debian > LLVM.ThinLTO/X86::funcimport2.ll
	2,830 ms	x64 debian > libarcher.critical::critical.c
	2,760 ms	x64 debian > libarcher.parallel::parallel-simple2.c
		View Full Test Results (24 Failed)

Event Timeline

ChuanqiXu created this revision.Jul 14 2021, 12:26 AM

Herald added subscribers: ormris, dexonsmith, wenlei and 4 others. · View Herald TranscriptJul 14 2021, 12:27 AM

ChuanqiXu requested review of this revision.Jul 14 2021, 12:27 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 14 2021, 12:27 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

ChuanqiXu added a parent revision: D105524: [FuncSpec] Extract the analysis part as an analysis pass.Jul 14 2021, 12:27 AM

Harbormaster completed remote builds in B113931: Diff 358525.Jul 14 2021, 1:10 AM

tschuett added a subscriber: tschuett.Jul 29 2021, 6:52 AM

ChuanqiXu mentioned this in D107136: [WIP] [FuncSpec] Handle Available Externally Linkage Function.Jul 30 2021, 12:47 AM

ChuanqiXu added a child revision: D107136: [WIP] [FuncSpec] Handle Available Externally Linkage Function.Aug 9 2021, 6:41 PM

SjoerdMeijer added a subscriber: SjoerdMeijer.Aug 13 2021, 5:53 AM

SjoerdMeijer added reviewers: tejohnson, fhahn, SjoerdMeijer.Aug 13 2021, 5:55 AM

ChuanqiXu planned changes to this revision.Nov 15 2021, 2:25 AM

ChuanqiXu abandoned this revision.Nov 15 2021, 7:38 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

FuncSpecCost.h

61 lines

ModuleSummaryAnalysis.h

1 line

IR/

ModuleSummaryIndex.h

50 lines

ModuleSummaryIndexYAML.h

2 lines

lib/

Analysis/

FuncSpecCost.cpp

48 lines

ModuleSummaryAnalysis.cpp

28 lines

AsmParser/

LLParser.cpp

9 lines

Bitcode/

Reader/

BitcodeReader.cpp

78 lines

Writer/

BitcodeWriter.cpp

28 lines

IR/

AsmWriter.cpp

3 lines

LTO/

ThinLTOCodeGenerator.cpp

2 lines

Transforms/

IPO/

FunctionImport.cpp

59 lines

ThinLTOBitcodeWriter.cpp

9 lines

WholeProgramDevirt.cpp

2 lines

test/

Bitcode/

summary_version.ll

2 lines

thinlto-alias.ll

4 lines

thinlto-alias2.ll

2 lines

thinlto-function-summary-callgraph-cast.ll

4 lines

thinlto-function-summary-callgraph-pgo.ll

4 lines

thinlto-function-summary-callgraph-profile-summary.ll

4 lines

thinlto-function-summary-callgraph-relbf.ll

2 lines

thinlto-function-summary-callgraph-sample-profile-summary.ll

4 lines

thinlto-function-summary-callgraph.ll

2 lines

thinlto-function-summary-refgraph.ll

10 lines

Diff 358525

llvm/include/llvm/Analysis/FuncSpecCost.h

Show All 11 Lines
//===------------------------------------------------------------------------===//		//===------------------------------------------------------------------------===//

#ifndef LLVM_ANALYSIS_FUNCSPECCOST_H		#ifndef LLVM_ANALYSIS_FUNCSPECCOST_H
#define LLVM_ANALYSIS_FUNCSPECCOST_H		#define LLVM_ANALYSIS_FUNCSPECCOST_H

#include "llvm/ADT/MapVector.h"		#include "llvm/ADT/MapVector.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/IR/Argument.h"		#include "llvm/IR/Argument.h"
		#include "llvm/IR/InstrTypes.h"
#include "llvm/IR/PassManager.h"		#include "llvm/IR/PassManager.h"
#include "llvm/IR/Value.h"		#include "llvm/IR/Value.h"
#include "llvm/Pass.h"		#include "llvm/Pass.h"
#include "llvm/Support/InstructionCost.h"		#include "llvm/Support/InstructionCost.h"

namespace llvm {		namespace llvm {

class AssumptionCache;		class AssumptionCache;
class TargetTransformInfo;		class TargetTransformInfo;
class TargetLibraryInfo;		class TargetLibraryInfo;
		class ArgUsage;

/// Cost/Bonus information for specialize a function with		/// Cost/Bonus information for specialize a function with
/// each argument.		/// each argument.
class FuncSpecCostInfo {		class FuncSpecCostInfo {
/// The cost to specialize the function.		/// The cost to specialize the function.
InstructionCost Cost;		InstructionCost Cost;
/// Map from the number of the argument, to the base bonuss for specialize it.		/// Map from the number of the argument, to the base bonuss for specialize it.
/// Base bonus stands for the bonus we could get by the function body.		/// Base bonus stands for the bonus we could get by the function body.
MapVector<unsigned, unsigned> SpecBonusBaseMap;		MapVector<unsigned, unsigned> SpecBonusBaseMap;

unsigned getBonusBase(Argument *Arg) const;		unsigned getBonusBase(Argument *Arg) const;
unsigned getBonusBase(unsigned Index) const;		unsigned getBonusBase(unsigned Index) const;

public:		public:
InstructionCost getCost() const { return Cost; }		InstructionCost getCost() const { return Cost; }
unsigned		unsigned
getBonus(Argument Arg, Constant C,		getBonus(Argument Arg, Constant C,
function_ref<TargetTransformInfo &(Function &)> GetTTI,		function_ref<TargetTransformInfo &(Function &)> GetTTI,
function_ref<AssumptionCache &(Function &)> GetAC,		function_ref<AssumptionCache &(Function &)> GetAC,
function_ref<const TargetLibraryInfo &(Function &)> GetTLI) const;		function_ref<const TargetLibraryInfo &(Function &)> GetTLI) const;

		/// Given an ArgUsage, estimating if we should import corresponding function.
		/// We should only call this when importing.
		///
		/// If there is one argument marks function, we would think it would be
		/// inlined if its lines of codes is less than a specific threshold.
		///
		/// TOOD: Add profiling infomation.
		bool shouldImport(const ArgUsage &) const;

		ArrayRef<std::pair<unsigned, unsigned>> getSpecBonusBaseMap() const {
		return makeArrayRef(&SpecBonusBaseMap.front(),
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - return makeArrayRef(&SpecBonusBaseMap.front(), - SpecBonusBaseMap.size()); + return makeArrayRef(&SpecBonusBaseMap.front(), SpecBonusBaseMap.size()); Lint: Pre-merge checks: clang-format: please reformat the code ``` - return makeArrayRef(&SpecBonusBaseMap.front()…
		SpecBonusBaseMap.size());
		}

FuncSpecCostInfo() {}		FuncSpecCostInfo() {}
FuncSpecCostInfo(FuncSpecCostInfo &&Other)		FuncSpecCostInfo(FuncSpecCostInfo &&Other)
: FuncSpecCostInfo(std::move(Other.Cost),		: FuncSpecCostInfo(std::move(Other.Cost),
std::move(Other.SpecBonusBaseMap)) {}		std::move(Other.SpecBonusBaseMap)) {}
FuncSpecCostInfo(InstructionCost Cost,		FuncSpecCostInfo(InstructionCost Cost,
MapVector<unsigned, unsigned> &&BonusBaseMap)		MapVector<unsigned, unsigned> &&BonusBaseMap)
: Cost(std::move(Cost)) {		: Cost(std::move(Cost)) {
SpecBonusBaseMap.swap(BonusBaseMap);		SpecBonusBaseMap.swap(BonusBaseMap);
Show All 26 Lines	public:

bool runOnFunction(Function &F) override;		bool runOnFunction(Function &F) override;

void getAnalysisUsage(AnalysisUsage &AU) const override;		void getAnalysisUsage(AnalysisUsage &AU) const override;

StringRef getPassName() const override;		StringRef getPassName() const override;
};		};

		/// Represent the usage of args at the callsite used in module summary.
		/// We should keep it as small as possible.
		///
		/// Now we mainly cares if the argument is a function. If yes, it implies
		/// a chance to hoist an indirect call to a direct call by function specialize
		/// pass.
		///
		/// It should be easy to add value infomation about the constantness or value
		/// range.
		class ArgUsage {
		private:
		static unsigned ConstantMarker;
		/// Map from ArgNo to the lines of codes if the corresponding argument refer
		/// to a function. If the corresponding argument is a constant other than
		/// function, we would set the value to ConstantMarker(0xffffffff).
		///
		/// For example, the value of LinesOfArgs for following example:
		/// ```
		/// foo(var, 1, bar); // bar is a function; var is a variable
		/// ```
		/// should be `[<1, 0xffffffff>, <2, lines of bar>]`.
		SmallVector<std::pair<unsigned, unsigned>, 4> LinesOfArgs;

		friend class FuncSpecCostInfo;
		friend class CalleeInfo;

		public:
		ArgUsage() {}
		ArgUsage(const CallBase &);
		ArgUsage(SmallVectorImpl<std::pair<unsigned, unsigned>> &&Uses)
		: LinesOfArgs(std::move(Uses)) {}

		ArgUsage(const ArgUsage &AU) : LinesOfArgs(AU.LinesOfArgs) {}
		ArgUsage(ArgUsage &&AU) : LinesOfArgs(std::move(AU.LinesOfArgs)) {}
		ArgUsage &operator=(ArgUsage &&AU) {
		LinesOfArgs = std::move(AU.LinesOfArgs);
		return *this;
		}
		ArgUsage &operator=(const ArgUsage &AU) {
		LinesOfArgs = AU.LinesOfArgs;
		return *this;
		}

		static bool isConstant(unsigned value) { return value == ConstantMarker; }
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for parameter 'value' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for parameter 'value' [readability-identifier-naming]…
		};
} // namespace llvm		} // namespace llvm

#endif		#endif

llvm/include/llvm/Analysis/ModuleSummaryAnalysis.h

	Show All 30 Lines
	///			///
	/// If operating within a pass manager which has defined ways to compute the \c			/// If operating within a pass manager which has defined ways to compute the \c
	/// BlockFrequencyInfo for a given function, that can be provided via			/// BlockFrequencyInfo for a given function, that can be provided via
	/// a std::function callback. Otherwise, this routine will manually construct			/// a std::function callback. Otherwise, this routine will manually construct
	/// that information.			/// that information.
	ModuleSummaryIndex buildModuleSummaryIndex(			ModuleSummaryIndex buildModuleSummaryIndex(
	const Module &M,			const Module &M,
	std::function<BlockFrequencyInfo *(const Function &F)> GetBFICallback,			std::function<BlockFrequencyInfo *(const Function &F)> GetBFICallback,
				std::function<FuncSpecCostInfo *(const Function &F)> GetFSCICallback,
	ProfileSummaryInfo *PSI,			ProfileSummaryInfo *PSI,
	std::function<const StackSafetyInfo *(const Function &F)> GetSSICallback =			std::function<const StackSafetyInfo *(const Function &F)> GetSSICallback =
	[](const Function &F) -> const StackSafetyInfo * { return nullptr; });			[](const Function &F) -> const StackSafetyInfo * { return nullptr; });

	/// Analysis pass to provide the ModuleSummaryIndex object.			/// Analysis pass to provide the ModuleSummaryIndex object.
	class ModuleSummaryIndexAnalysis			class ModuleSummaryIndexAnalysis
	: public AnalysisInfoMixin<ModuleSummaryIndexAnalysis> {			: public AnalysisInfoMixin<ModuleSummaryIndexAnalysis> {
	friend AnalysisInfoMixin<ModuleSummaryIndexAnalysis>;			friend AnalysisInfoMixin<ModuleSummaryIndexAnalysis>;
	▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

llvm/include/llvm/IR/ModuleSummaryIndex.h

Show All 17 Lines
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallString.h"		#include "llvm/ADT/SmallString.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/TinyPtrVector.h"		#include "llvm/ADT/TinyPtrVector.h"
		#include "llvm/Analysis/FuncSpecCost.h"
#include "llvm/IR/ConstantRange.h"		#include "llvm/IR/ConstantRange.h"
#include "llvm/IR/GlobalValue.h"		#include "llvm/IR/GlobalValue.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/Support/Allocator.h"		#include "llvm/Support/Allocator.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/ScaledNumber.h"		#include "llvm/Support/ScaledNumber.h"
#include "llvm/Support/StringSaver.h"		#include "llvm/Support/StringSaver.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
Show All 31 Lines	struct CalleeInfo {

// The size of the bit-field might need to be adjusted if more values are		// The size of the bit-field might need to be adjusted if more values are
// added to HotnessType enum.		// added to HotnessType enum.
uint32_t Hotness : 3;		uint32_t Hotness : 3;

/// The value stored in RelBlockFreq has to be interpreted as the digits of		/// The value stored in RelBlockFreq has to be interpreted as the digits of
/// a scaled number with a scale of \p -ScaleShift.		/// a scaled number with a scale of \p -ScaleShift.
uint32_t RelBlockFreq : 29;		uint32_t RelBlockFreq : 29;
		/// The infomation about how arguments are used at the callsite.
		/// NOTE: This may enlarge the size of CalleeInfo.
		ArgUsage Usages;
static constexpr int32_t ScaleShift = 8;		static constexpr int32_t ScaleShift = 8;
static constexpr uint64_t MaxRelBlockFreq = (1 << 29) - 1;		static constexpr uint64_t MaxRelBlockFreq = (1 << 29) - 1;

CalleeInfo()		CalleeInfo()
: Hotness(static_cast<uint32_t>(HotnessType::Unknown)), RelBlockFreq(0) {}		: Hotness(static_cast<uint32_t>(HotnessType::Unknown)), RelBlockFreq(0),
explicit CalleeInfo(HotnessType Hotness, uint64_t RelBF)		Usages() {}
: Hotness(static_cast<uint32_t>(Hotness)), RelBlockFreq(RelBF) {}		explicit CalleeInfo(HotnessType Hotness, uint64_t RelBF, ArgUsage &&Usages)
		: Hotness(static_cast<uint32_t>(Hotness)), RelBlockFreq(RelBF),
		Usages(std::move(Usages)) {}

		void updateUsages(ArgUsage &&OtherUsages) { Usages = std::move(OtherUsages); }

		size_t getUsagesSize() const { return Usages.LinesOfArgs.size(); }

		ArrayRef<std::pair<unsigned, unsigned>> getUsages() const {
		return makeArrayRef(Usages.LinesOfArgs.begin(), Usages.LinesOfArgs.end());
		}

void updateHotness(const HotnessType OtherHotness) {		void updateHotness(const HotnessType OtherHotness) {
Hotness = std::max(Hotness, static_cast<uint32_t>(OtherHotness));		Hotness = std::max(Hotness, static_cast<uint32_t>(OtherHotness));
}		}

HotnessType getHotness() const { return HotnessType(Hotness); }		HotnessType getHotness() const { return HotnessType(Hotness); }

/// Update \p RelBlockFreq from \p BlockFreq and \p EntryFreq		/// Update \p RelBlockFreq from \p BlockFreq and \p EntryFreq
▲ Show 20 Lines • Show All 532 Lines • ▼ Show 20 Lines	return FunctionSummary(
/CanAutoHide=/false),		/CanAutoHide=/false),
/NumInsts=/0, FunctionSummary::FFlags{}, /EntryCount=/0,		/NumInsts=/0, FunctionSummary::FFlags{}, /EntryCount=/0,
std::vector<ValueInfo>(), std::move(Edges),		std::vector<ValueInfo>(), std::move(Edges),
std::vector<GlobalValue::GUID>(),		std::vector<GlobalValue::GUID>(),
std::vector<FunctionSummary::VFuncId>(),		std::vector<FunctionSummary::VFuncId>(),
std::vector<FunctionSummary::VFuncId>(),		std::vector<FunctionSummary::VFuncId>(),
std::vector<FunctionSummary::ConstVCall>(),		std::vector<FunctionSummary::ConstVCall>(),
std::vector<FunctionSummary::ConstVCall>(),		std::vector<FunctionSummary::ConstVCall>(),
std::vector<FunctionSummary::ParamAccess>());		std::vector<FunctionSummary::ParamAccess>(),
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - std::vector<FunctionSummary::ParamAccess>(), - FuncSpecCostInfo()); + std::vector<FunctionSummary::ParamAccess>(), FuncSpecCostInfo()); Lint: Pre-merge checks: clang-format: please reformat the code ``` - std::vector<FunctionSummary::ParamAccess>()…
		FuncSpecCostInfo());
}		}

/// A dummy node to reference external functions that aren't in the index		/// A dummy node to reference external functions that aren't in the index
static FunctionSummary ExternalNode;		static FunctionSummary ExternalNode;

private:		private:
/// Number of instructions (ignoring debug instructions, e.g.) computed		/// Number of instructions (ignoring debug instructions, e.g.) computed
/// during the initial compile step when the summary index is first built.		/// during the initial compile step when the summary index is first built.
Show All 11 Lines	private:
std::vector<EdgeTy> CallGraphEdgeList;		std::vector<EdgeTy> CallGraphEdgeList;

std::unique_ptr<TypeIdInfo> TIdInfo;		std::unique_ptr<TypeIdInfo> TIdInfo;

/// Uses for every parameter to this function.		/// Uses for every parameter to this function.
using ParamAccessesTy = std::vector<ParamAccess>;		using ParamAccessesTy = std::vector<ParamAccess>;
std::unique_ptr<ParamAccessesTy> ParamAccesses;		std::unique_ptr<ParamAccessesTy> ParamAccesses;

		FuncSpecCostInfo SpecCostInfo;

public:		public:
FunctionSummary(GVFlags Flags, unsigned NumInsts, FFlags FunFlags,		FunctionSummary(GVFlags Flags, unsigned NumInsts, FFlags FunFlags,
uint64_t EntryCount, std::vector<ValueInfo> Refs,		uint64_t EntryCount, std::vector<ValueInfo> Refs,
std::vector<EdgeTy> CGEdges,		std::vector<EdgeTy> CGEdges,
std::vector<GlobalValue::GUID> TypeTests,		std::vector<GlobalValue::GUID> TypeTests,
std::vector<VFuncId> TypeTestAssumeVCalls,		std::vector<VFuncId> TypeTestAssumeVCalls,
std::vector<VFuncId> TypeCheckedLoadVCalls,		std::vector<VFuncId> TypeCheckedLoadVCalls,
std::vector<ConstVCall> TypeTestAssumeConstVCalls,		std::vector<ConstVCall> TypeTestAssumeConstVCalls,
std::vector<ConstVCall> TypeCheckedLoadConstVCalls,		std::vector<ConstVCall> TypeCheckedLoadConstVCalls,
std::vector<ParamAccess> Params)		std::vector<ParamAccess> Params,
		FuncSpecCostInfo SpecCostInfo)
: GlobalValueSummary(FunctionKind, Flags, std::move(Refs)),		: GlobalValueSummary(FunctionKind, Flags, std::move(Refs)),
InstCount(NumInsts), FunFlags(FunFlags), EntryCount(EntryCount),		InstCount(NumInsts), FunFlags(FunFlags), EntryCount(EntryCount),
CallGraphEdgeList(std::move(CGEdges)) {		CallGraphEdgeList(std::move(CGEdges)),
		SpecCostInfo(std::move(SpecCostInfo)) {
if (!TypeTests.empty() \|\| !TypeTestAssumeVCalls.empty() \|\|		if (!TypeTests.empty() \|\| !TypeTestAssumeVCalls.empty() \|\|
!TypeCheckedLoadVCalls.empty() \|\| !TypeTestAssumeConstVCalls.empty() \|\|		!TypeCheckedLoadVCalls.empty() \|\| !TypeTestAssumeConstVCalls.empty() \|\|
!TypeCheckedLoadConstVCalls.empty())		!TypeCheckedLoadConstVCalls.empty())
TIdInfo = std::make_unique<TypeIdInfo>(		TIdInfo = std::make_unique<TypeIdInfo>(
TypeIdInfo{std::move(TypeTests), std::move(TypeTestAssumeVCalls),		TypeIdInfo{std::move(TypeTests), std::move(TypeTestAssumeVCalls),
std::move(TypeCheckedLoadVCalls),		std::move(TypeCheckedLoadVCalls),
std::move(TypeTestAssumeConstVCalls),		std::move(TypeTestAssumeConstVCalls),
std::move(TypeCheckedLoadConstVCalls)});		std::move(TypeCheckedLoadConstVCalls)});
▲ Show 20 Lines • Show All 91 Lines • ▼ Show 20 Lines	public:
void addTypeTest(GlobalValue::GUID Guid) {		void addTypeTest(GlobalValue::GUID Guid) {
if (!TIdInfo)		if (!TIdInfo)
TIdInfo = std::make_unique<TypeIdInfo>();		TIdInfo = std::make_unique<TypeIdInfo>();
TIdInfo->TypeTests.push_back(Guid);		TIdInfo->TypeTests.push_back(Guid);
}		}

const TypeIdInfo *getTypeIdInfo() const { return TIdInfo.get(); };		const TypeIdInfo *getTypeIdInfo() const { return TIdInfo.get(); };

		unsigned getSpecializeCost() const {
		InstructionCost Cost = SpecCostInfo.getCost();
		if (Cost.isValid())
		return *Cost.getValue();
		return std::numeric_limits<unsigned>::max();
		}

		ArrayRef<std::pair<unsigned, unsigned>> getSpecBonusBase() const {
		return SpecCostInfo.getSpecBonusBaseMap();
		}

		bool shouldImport(const CalleeInfo &CI) const {
		return SpecCostInfo.shouldImport(CI.Usages);
		}

friend struct GraphTraits<ValueInfo>;		friend struct GraphTraits<ValueInfo>;
};		};

template <> struct DenseMapInfo<FunctionSummary::VFuncId> {		template <> struct DenseMapInfo<FunctionSummary::VFuncId> {
static FunctionSummary::VFuncId getEmptyKey() { return {0, uint64_t(-1)}; }		static FunctionSummary::VFuncId getEmptyKey() { return {0, uint64_t(-1)}; }

static FunctionSummary::VFuncId getTombstoneKey() {		static FunctionSummary::VFuncId getTombstoneKey() {
return {0, uint64_t(-2)};		return {0, uint64_t(-2)};
▲ Show 20 Lines • Show All 332 Lines • ▼ Show 20 Lines	ModuleSummaryIndex(bool HaveGVs, bool EnableSplitLTOUnit = false)
: HaveGVs(HaveGVs), EnableSplitLTOUnit(EnableSplitLTOUnit), Saver(Alloc),		: HaveGVs(HaveGVs), EnableSplitLTOUnit(EnableSplitLTOUnit), Saver(Alloc),
BlockCount(0) {}		BlockCount(0) {}

// Current version for the module summary in bitcode files.		// Current version for the module summary in bitcode files.
// The BitcodeSummaryVersion should be bumped whenever we introduce changes		// The BitcodeSummaryVersion should be bumped whenever we introduce changes
// in the way some record are interpreted, like flags for instance.		// in the way some record are interpreted, like flags for instance.
// Note that incrementing this may require changes in both BitcodeReader.cpp		// Note that incrementing this may require changes in both BitcodeReader.cpp
// and BitcodeWriter.cpp.		// and BitcodeWriter.cpp.
static constexpr uint64_t BitcodeSummaryVersion = 9;		// ChangeLog: 9->10. Add ArgUsage in CalleeInfo and SpecCostInfo in
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - // ChangeLog: 9->10. Add ArgUsage in CalleeInfo and SpecCostInfo in + // ChangeLog: 9->10. Add ArgUsage in CalleeInfo and SpecCostInfo in Lint: Pre-merge checks: clang-format: please reformat the code ``` - // ChangeLog: 9->10. Add ArgUsage in CalleeInfo…
		// Function Summary.
		static constexpr uint64_t BitcodeSummaryVersion = 10;

// Regular LTO module name for ASM writer		// Regular LTO module name for ASM writer
static constexpr const char *getRegularLTOModuleName() {		static constexpr const char *getRegularLTOModuleName() {
return "[Regular LTO]";		return "[Regular LTO]";
}		}

bool haveGVs() const { return HaveGVs; }		bool haveGVs() const { return HaveGVs; }

▲ Show 20 Lines • Show All 473 Lines • Show Last 20 Lines

llvm/include/llvm/IR/ModuleSummaryIndexYAML.h

Show First 20 Lines • Show All 228 Lines • ▼ Show 20 Lines	for (auto &FSum : FSums) {
FSum.NotEligibleToImport, FSum.Live, FSum.IsLocal,		FSum.NotEligibleToImport, FSum.Live, FSum.IsLocal,
FSum.CanAutoHide),		FSum.CanAutoHide),
/NumInsts=/0, FunctionSummary::FFlags{}, /EntryCount=/0, Refs,		/NumInsts=/0, FunctionSummary::FFlags{}, /EntryCount=/0, Refs,
ArrayRef<FunctionSummary::EdgeTy>{}, std::move(FSum.TypeTests),		ArrayRef<FunctionSummary::EdgeTy>{}, std::move(FSum.TypeTests),
std::move(FSum.TypeTestAssumeVCalls),		std::move(FSum.TypeTestAssumeVCalls),
std::move(FSum.TypeCheckedLoadVCalls),		std::move(FSum.TypeCheckedLoadVCalls),
std::move(FSum.TypeTestAssumeConstVCalls),		std::move(FSum.TypeTestAssumeConstVCalls),
std::move(FSum.TypeCheckedLoadConstVCalls),		std::move(FSum.TypeCheckedLoadConstVCalls),
ArrayRef<FunctionSummary::ParamAccess>{}));		ArrayRef<FunctionSummary::ParamAccess>{}, FuncSpecCostInfo()));
}		}
}		}
static void output(IO &io, GlobalValueSummaryMapTy &V) {		static void output(IO &io, GlobalValueSummaryMapTy &V) {
for (auto &P : V) {		for (auto &P : V) {
std::vector<FunctionSummaryYaml> FSums;		std::vector<FunctionSummaryYaml> FSums;
for (auto &Sum : P.second.SummaryList) {		for (auto &Sum : P.second.SummaryList) {
if (auto *FSum = dyn_cast<FunctionSummary>(Sum.get())) {		if (auto *FSum = dyn_cast<FunctionSummary>(Sum.get())) {
std::vector<uint64_t> Refs;		std::vector<uint64_t> Refs;
▲ Show 20 Lines • Show All 62 Lines • Show Last 20 Lines

llvm/lib/Analysis/FuncSpecCost.cpp

Show All 22 Lines
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include <cassert>		#include <cassert>
#include <cmath>		#include <cmath>

using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "func-spec-cost"		#define DEBUG_TYPE "func-spec-cost"

		/// Limit on instruction count of imported functions.
		/// The function whose number of instruction below this argument
		/// is considered to be inlined during function importing.
		static cl::opt<unsigned> PotentialInlingLimit(
		"funcspec-potential-inlininglimit", cl::init(100), cl::Hidden,
		cl::value_desc("N"),
		cl::desc("Only functions with less than N instructions are considered to "
		"be inlined during analysis of func spec."));

		static cl::opt<double>
		BonusFactorFromInlining("func-spec-bonus-from-inlining-factor", cl::Hidden,
		cl::desc("Factor for bonus from potetial inling."),
		cl::init(10));

static cl::opt<unsigned>		static cl::opt<unsigned>
AvgLoopIterationCount("func-specialization-avg-iters-cost", cl::Hidden,		AvgLoopIterationCount("func-specialization-avg-iters-cost", cl::Hidden,
cl::desc("Average loop iteration count cost"),		cl::desc("Average loop iteration count cost"),
cl::init(10));		cl::init(10));

static Function getFunction(Value CalledValue) {		static Function getFunction(Value CalledValue) {
// Since the argument is a function pointer, its incoming constant values		// Since the argument is a function pointer, its incoming constant values
// should be functions or constant expressions. The code below attempts to		// should be functions or constant expressions. The code below attempts to
▲ Show 20 Lines • Show All 176 Lines • ▼ Show 20 Lines	bool FunctionSpecializationWrapperPass::runOnFunction(Function &F) {
AssumptionCache &AC =		AssumptionCache &AC =
getAnalysis<AssumptionCacheTracker>().getAssumptionCache(F);		getAnalysis<AssumptionCacheTracker>().getAssumptionCache(F);
Info.emplace(CreateFuncSpecCostInfo(F, AC, TTI, LI));		Info.emplace(CreateFuncSpecCostInfo(F, AC, TTI, LI));
return false;		return false;
}		}

StringRef FunctionSpecializationWrapperPass::getPassName() const {		StringRef FunctionSpecializationWrapperPass::getPassName() const {
return "Function Specialization Cost Analysis";		return "Function Specialization Cost Analysis";
		}

		unsigned ArgUsage::ConstantMarker = ~(unsigned)0;

		ArgUsage::ArgUsage(const CallBase &CB) {
		for (auto &U : CB.args()) {
		Function *F = getFunction(U.get());
		LinesOfArgs.push_back({CB.getArgOperandNo(&U), ConstantMarker});
		if (!F)
		continue;

		LinesOfArgs.push_back({CB.getArgOperandNo(&U), F->getInstructionCount()});
		}
		}

		bool FuncSpecCostInfo::shouldImport(const ArgUsage &AU) const {
		if (!Cost.isValid())
		return false;
		unsigned SpecCost = *Cost.getValue();
		/// FIXME: It would consider the first argument who fits
		/// the condition. It should be fixed after Function Specialization
		/// pass fix this.
		for (auto &IndexValuePair : AU.LinesOfArgs) {
		unsigned Index = IndexValuePair.first;
		unsigned Value = IndexValuePair.second;
		unsigned BonusBase = getBonusBase(Index);
		// Should we consider attributes like `noinline` and `always_inline` here?
		if (!ArgUsage::isConstant(Value) && Value < PotentialInlingLimit)
		BonusBase += BonusFactorFromInlining * (PotentialInlingLimit - Value);

		if (BonusBase > SpecCost)
		return true;
		}
		return false;
}		}
No newline at end of file		No newline at end of file

llvm/lib/Analysis/ModuleSummaryAnalysis.cpp

Show First 20 Lines • Show All 233 Lines • ▼ Show 20 Lines	static bool isNonVolatileStore(const Instruction *I) {
return false;		return false;
}		}

static void computeFunctionSummary(		static void computeFunctionSummary(
ModuleSummaryIndex &Index, const Module &M, const Function &F,		ModuleSummaryIndex &Index, const Module &M, const Function &F,
BlockFrequencyInfo BFI, ProfileSummaryInfo PSI, DominatorTree &DT,		BlockFrequencyInfo BFI, ProfileSummaryInfo PSI, DominatorTree &DT,
bool HasLocalsInUsedOrAsm, DenseSet<GlobalValue::GUID> &CantBePromoted,		bool HasLocalsInUsedOrAsm, DenseSet<GlobalValue::GUID> &CantBePromoted,
bool IsThinLTO,		bool IsThinLTO,
std::function<const StackSafetyInfo *(const Function &F)> GetSSICallback) {		std::function<const StackSafetyInfo *(const Function &F)> GetSSICallback,
		std::function<FuncSpecCostInfo *(const Function &F)> GetFSCICallback) {
// Summary not currently supported for anonymous functions, they should		// Summary not currently supported for anonymous functions, they should
// have been named.		// have been named.
assert(F.hasName());		assert(F.hasName());

unsigned NumInsts = 0;		unsigned NumInsts = 0;
// Map from callee ValueId to profile count. Used to accumulate profile		// Map from callee ValueId to profile count. Used to accumulate profile
// counts for all static calls to a given callee.		// counts for all static calls to a given callee.
MapVector<ValueInfo, CalleeInfo> CallGraphEdges;		MapVector<ValueInfo, CalleeInfo> CallGraphEdges;
▲ Show 20 Lines • Show All 100 Lines • ▼ Show 20 Lines	for (const Instruction &I : BB) {
ValueInfo.updateHotness(Hotness);		ValueInfo.updateHotness(Hotness);
// Add the relative block frequency to CalleeInfo if there is no profile		// Add the relative block frequency to CalleeInfo if there is no profile
// information.		// information.
if (BFI != nullptr && Hotness == CalleeInfo::HotnessType::Unknown) {		if (BFI != nullptr && Hotness == CalleeInfo::HotnessType::Unknown) {
uint64_t BBFreq = BFI->getBlockFreq(&BB).getFrequency();		uint64_t BBFreq = BFI->getBlockFreq(&BB).getFrequency();
uint64_t EntryFreq = BFI->getEntryFreq();		uint64_t EntryFreq = BFI->getEntryFreq();
ValueInfo.updateRelBlockFreq(BBFreq, EntryFreq);		ValueInfo.updateRelBlockFreq(BBFreq, EntryFreq);
}		}

		ValueInfo.updateUsages(ArgUsage(*CB));
} else {		} else {
// Skip inline assembly calls.		// Skip inline assembly calls.
if (CI && CI->isInlineAsm())		if (CI && CI->isInlineAsm())
continue;		continue;
// Skip direct calls.		// Skip direct calls.
if (!CalledValue \|\| isa<Constant>(CalledValue))		if (!CalledValue \|\| isa<Constant>(CalledValue))
continue;		continue;

▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	FunctionSummary::FFlags FunFlags{
F.hasFnAttribute(Attribute::NoRecurse), F.returnDoesNotAlias(),		F.hasFnAttribute(Attribute::NoRecurse), F.returnDoesNotAlias(),
// FIXME: refactor this to use the same code that inliner is using.		// FIXME: refactor this to use the same code that inliner is using.
// Don't try to import functions with noinline attribute.		// Don't try to import functions with noinline attribute.
F.getAttributes().hasFnAttribute(Attribute::NoInline),		F.getAttributes().hasFnAttribute(Attribute::NoInline),
F.hasFnAttribute(Attribute::AlwaysInline)};		F.hasFnAttribute(Attribute::AlwaysInline)};
std::vector<FunctionSummary::ParamAccess> ParamAccesses;		std::vector<FunctionSummary::ParamAccess> ParamAccesses;
if (auto *SSI = GetSSICallback(F))		if (auto *SSI = GetSSICallback(F))
ParamAccesses = SSI->getParamAccesses(Index);		ParamAccesses = SSI->getParamAccesses(Index);
		FuncSpecCostInfo *FSCI = nullptr;
		if (GetFSCICallback)
		FSCI = GetFSCICallback(F);
auto FuncSummary = std::make_unique<FunctionSummary>(		auto FuncSummary = std::make_unique<FunctionSummary>(
Flags, NumInsts, FunFlags, /EntryCount=/0, std::move(Refs),		Flags, NumInsts, FunFlags, /EntryCount=/0, std::move(Refs),
CallGraphEdges.takeVector(), TypeTests.takeVector(),		CallGraphEdges.takeVector(), TypeTests.takeVector(),
TypeTestAssumeVCalls.takeVector(), TypeCheckedLoadVCalls.takeVector(),		TypeTestAssumeVCalls.takeVector(), TypeCheckedLoadVCalls.takeVector(),
TypeTestAssumeConstVCalls.takeVector(),		TypeTestAssumeConstVCalls.takeVector(),
TypeCheckedLoadConstVCalls.takeVector(), std::move(ParamAccesses));		TypeCheckedLoadConstVCalls.takeVector(), std::move(ParamAccesses),
		FSCI ? std::move(*FSCI) : FuncSpecCostInfo());
if (NonRenamableLocal)		if (NonRenamableLocal)
CantBePromoted.insert(F.getGUID());		CantBePromoted.insert(F.getGUID());
Index.addGlobalValueSummary(F, std::move(FuncSummary));		Index.addGlobalValueSummary(F, std::move(FuncSummary));
}		}

/// Find function pointers referenced within the given vtable initializer		/// Find function pointers referenced within the given vtable initializer
/// (or subset of an initializer) \p I. The starting offset of \p I within		/// (or subset of an initializer) \p I. The starting offset of \p I within
/// the vtable initializer is \p StartingOffset. Any discovered function		/// the vtable initializer is \p StartingOffset. Any discovered function
▲ Show 20 Lines • Show All 152 Lines • ▼ Show 20 Lines	static void setLiveRoot(ModuleSummaryIndex &Index, StringRef Name) {
if (ValueInfo VI = Index.getValueInfo(GlobalValue::getGUID(Name)))		if (ValueInfo VI = Index.getValueInfo(GlobalValue::getGUID(Name)))
for (auto &Summary : VI.getSummaryList())		for (auto &Summary : VI.getSummaryList())
Summary->setLive(true);		Summary->setLive(true);
}		}

ModuleSummaryIndex llvm::buildModuleSummaryIndex(		ModuleSummaryIndex llvm::buildModuleSummaryIndex(
const Module &M,		const Module &M,
std::function<BlockFrequencyInfo *(const Function &F)> GetBFICallback,		std::function<BlockFrequencyInfo *(const Function &F)> GetBFICallback,
		std::function<FuncSpecCostInfo *(const Function &F)> GetFSCICallback,
ProfileSummaryInfo *PSI,		ProfileSummaryInfo *PSI,
std::function<const StackSafetyInfo *(const Function &F)> GetSSICallback) {		std::function<const StackSafetyInfo *(const Function &F)> GetSSICallback) {
assert(PSI);		assert(PSI);
bool EnableSplitLTOUnit = false;		bool EnableSplitLTOUnit = false;
if (auto *MD = mdconst::extract_or_null<ConstantInt>(		if (auto *MD = mdconst::extract_or_null<ConstantInt>(
M.getModuleFlag("EnableSplitLTOUnit")))		M.getModuleFlag("EnableSplitLTOUnit")))
EnableSplitLTOUnit = MD->getZExtValue();		EnableSplitLTOUnit = MD->getZExtValue();
ModuleSummaryIndex Index(/HaveGVs=/true, EnableSplitLTOUnit);		ModuleSummaryIndex Index(/HaveGVs=/true, EnableSplitLTOUnit);
▲ Show 20 Lines • Show All 60 Lines • ▼ Show 20 Lines	ModuleSymbolTable::CollectAsmSymbols(
F->hasFnAttribute(Attribute::AlwaysInline)},		F->hasFnAttribute(Attribute::AlwaysInline)},
/EntryCount=/0, ArrayRef<ValueInfo>{},		/EntryCount=/0, ArrayRef<ValueInfo>{},
ArrayRef<FunctionSummary::EdgeTy>{},		ArrayRef<FunctionSummary::EdgeTy>{},
ArrayRef<GlobalValue::GUID>{},		ArrayRef<GlobalValue::GUID>{},
ArrayRef<FunctionSummary::VFuncId>{},		ArrayRef<FunctionSummary::VFuncId>{},
ArrayRef<FunctionSummary::VFuncId>{},		ArrayRef<FunctionSummary::VFuncId>{},
ArrayRef<FunctionSummary::ConstVCall>{},		ArrayRef<FunctionSummary::ConstVCall>{},
ArrayRef<FunctionSummary::ConstVCall>{},		ArrayRef<FunctionSummary::ConstVCall>{},
ArrayRef<FunctionSummary::ParamAccess>{});		ArrayRef<FunctionSummary::ParamAccess>{},
		FuncSpecCostInfo());
Index.addGlobalValueSummary(*GV, std::move(Summary));		Index.addGlobalValueSummary(*GV, std::move(Summary));
} else {		} else {
std::unique_ptr<GlobalVarSummary> Summary =		std::unique_ptr<GlobalVarSummary> Summary =
std::make_unique<GlobalVarSummary>(		std::make_unique<GlobalVarSummary>(
GVFlags,		GVFlags,
GlobalVarSummary::GVarFlags(		GlobalVarSummary::GVarFlags(
false, false, cast<GlobalVariable>(GV)->isConstant(),		false, false, cast<GlobalVariable>(GV)->isConstant(),
GlobalObject::VCallVisibilityPublic),		GlobalObject::VCallVisibilityPublic),
Show All 23 Lines	else if (F.hasProfileData()) {
LoopInfo LI{DT};		LoopInfo LI{DT};
BranchProbabilityInfo BPI{F, LI};		BranchProbabilityInfo BPI{F, LI};
BFIPtr = std::make_unique<BlockFrequencyInfo>(F, BPI, LI);		BFIPtr = std::make_unique<BlockFrequencyInfo>(F, BPI, LI);
BFI = BFIPtr.get();		BFI = BFIPtr.get();
}		}

computeFunctionSummary(Index, M, F, BFI, PSI, DT,		computeFunctionSummary(Index, M, F, BFI, PSI, DT,
!LocalsUsed.empty() \|\| HasLocalInlineAsmSymbol,		!LocalsUsed.empty() \|\| HasLocalInlineAsmSymbol,
CantBePromoted, IsThinLTO, GetSSICallback);		CantBePromoted, IsThinLTO, GetSSICallback, GetFSCICallback);
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - CantBePromoted, IsThinLTO, GetSSICallback, GetFSCICallback); + CantBePromoted, IsThinLTO, GetSSICallback, + GetFSCICallback); Lint: Pre-merge checks: clang-format: please reformat the code ``` - CantBePromoted…
}		}

// Compute summaries for all variables defined in module, and save in the		// Compute summaries for all variables defined in module, and save in the
// index.		// index.
SmallVector<MDNode *, 2> Types;		SmallVector<MDNode *, 2> Types;
for (const GlobalVariable &G : M.globals()) {		for (const GlobalVariable &G : M.globals()) {
if (G.isDeclaration())		if (G.isDeclaration())
continue;		continue;
▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	ModuleSummaryIndexAnalysis::run(Module &M, ModuleAnalysisManager &AM) {
auto &FAM = AM.getResult<FunctionAnalysisManagerModuleProxy>(M).getManager();		auto &FAM = AM.getResult<FunctionAnalysisManagerModuleProxy>(M).getManager();
bool NeedSSI = needsParamAccessSummary(M);		bool NeedSSI = needsParamAccessSummary(M);
return buildModuleSummaryIndex(		return buildModuleSummaryIndex(
M,		M,
[&FAM](const Function &F) {		[&FAM](const Function &F) {
return &FAM.getResult<BlockFrequencyAnalysis>(		return &FAM.getResult<BlockFrequencyAnalysis>(
const_cast<Function >(&F));		const_cast<Function >(&F));
},		},
		[&FAM](const Function &F) {
		return &FAM.getResult<FunctionSpecializationAnalysis>(
		const_cast<Function >(&F));
		},
&PSI,		&PSI,
[&FAM, NeedSSI](const Function &F) -> const StackSafetyInfo * {		[&FAM, NeedSSI](const Function &F) -> const StackSafetyInfo * {
return NeedSSI ? &FAM.getResult<StackSafetyAnalysis>(		return NeedSSI ? &FAM.getResult<StackSafetyAnalysis>(
const_cast<Function &>(F))		const_cast<Function &>(F))
: nullptr;		: nullptr;
});		});
}		}

char ModuleSummaryIndexWrapperPass::ID = 0;		char ModuleSummaryIndexWrapperPass::ID = 0;

INITIALIZE_PASS_BEGIN(ModuleSummaryIndexWrapperPass, "module-summary-analysis",		INITIALIZE_PASS_BEGIN(ModuleSummaryIndexWrapperPass, "module-summary-analysis",
"Module Summary Analysis", false, true)		"Module Summary Analysis", false, true)
INITIALIZE_PASS_DEPENDENCY(BlockFrequencyInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(BlockFrequencyInfoWrapperPass)
INITIALIZE_PASS_DEPENDENCY(ProfileSummaryInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(ProfileSummaryInfoWrapperPass)
INITIALIZE_PASS_DEPENDENCY(StackSafetyInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(StackSafetyInfoWrapperPass)
		INITIALIZE_PASS_DEPENDENCY(FunctionSpecializationWrapperPass)
INITIALIZE_PASS_END(ModuleSummaryIndexWrapperPass, "module-summary-analysis",		INITIALIZE_PASS_END(ModuleSummaryIndexWrapperPass, "module-summary-analysis",
"Module Summary Analysis", false, true)		"Module Summary Analysis", false, true)

ModulePass *llvm::createModuleSummaryIndexWrapperPass() {		ModulePass *llvm::createModuleSummaryIndexWrapperPass() {
return new ModuleSummaryIndexWrapperPass();		return new ModuleSummaryIndexWrapperPass();
}		}

ModuleSummaryIndexWrapperPass::ModuleSummaryIndexWrapperPass()		ModuleSummaryIndexWrapperPass::ModuleSummaryIndexWrapperPass()
: ModulePass(ID) {		: ModulePass(ID) {
initializeModuleSummaryIndexWrapperPassPass(*PassRegistry::getPassRegistry());		initializeModuleSummaryIndexWrapperPassPass(*PassRegistry::getPassRegistry());
}		}

bool ModuleSummaryIndexWrapperPass::runOnModule(Module &M) {		bool ModuleSummaryIndexWrapperPass::runOnModule(Module &M) {
auto *PSI = &getAnalysis<ProfileSummaryInfoWrapperPass>().getPSI();		auto *PSI = &getAnalysis<ProfileSummaryInfoWrapperPass>().getPSI();
bool NeedSSI = needsParamAccessSummary(M);		bool NeedSSI = needsParamAccessSummary(M);
Index.emplace(buildModuleSummaryIndex(		Index.emplace(buildModuleSummaryIndex(
M,		M,
[this](const Function &F) {		[this](const Function &F) {
return &(this->getAnalysis<BlockFrequencyInfoWrapperPass>(		return &(this->getAnalysis<BlockFrequencyInfoWrapperPass>(
const_cast<Function >(&F))		const_cast<Function >(&F))
.getBFI());		.getBFI());
},		},
		[&](const Function &F) {
		return &(this->getAnalysis<FunctionSpecializationWrapperPass>(
		const_cast<Function >(&F))
		.getFuncSpecCost());
		},
PSI,		PSI,
[&](const Function &F) -> const StackSafetyInfo * {		[&](const Function &F) -> const StackSafetyInfo * {
return NeedSSI ? &getAnalysis<StackSafetyInfoWrapperPass>(		return NeedSSI ? &getAnalysis<StackSafetyInfoWrapperPass>(
const_cast<Function &>(F))		const_cast<Function &>(F))
.getResult()		.getResult()
: nullptr;		: nullptr;
}));		}));
return false;		return false;
}		}

bool ModuleSummaryIndexWrapperPass::doFinalization(Module &M) {		bool ModuleSummaryIndexWrapperPass::doFinalization(Module &M) {
Index.reset();		Index.reset();
return false;		return false;
}		}

void ModuleSummaryIndexWrapperPass::getAnalysisUsage(AnalysisUsage &AU) const {		void ModuleSummaryIndexWrapperPass::getAnalysisUsage(AnalysisUsage &AU) const {
AU.setPreservesAll();		AU.setPreservesAll();
AU.addRequired<BlockFrequencyInfoWrapperPass>();		AU.addRequired<BlockFrequencyInfoWrapperPass>();
AU.addRequired<ProfileSummaryInfoWrapperPass>();		AU.addRequired<ProfileSummaryInfoWrapperPass>();
AU.addRequired<StackSafetyInfoWrapperPass>();		AU.addRequired<StackSafetyInfoWrapperPass>();
		AU.addRequired<FunctionSpecializationWrapperPass>();
}		}

char ImmutableModuleSummaryIndexWrapperPass::ID = 0;		char ImmutableModuleSummaryIndexWrapperPass::ID = 0;

ImmutableModuleSummaryIndexWrapperPass::ImmutableModuleSummaryIndexWrapperPass(		ImmutableModuleSummaryIndexWrapperPass::ImmutableModuleSummaryIndexWrapperPass(
const ModuleSummaryIndex *Index)		const ModuleSummaryIndex *Index)
: ImmutablePass(ID), Index(Index) {		: ImmutablePass(ID), Index(Index) {
initializeImmutableModuleSummaryIndexWrapperPassPass(		initializeImmutableModuleSummaryIndexWrapperPassPass(
Show All 15 Lines

llvm/lib/AsmParser/LLParser.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 8,697 Lines • ▼ Show 20 Lines	bool LLParser::parseFunctionSummary(std::string Name, GlobalValue::GUID GUID,

auto FS = std::make_unique<FunctionSummary>(		auto FS = std::make_unique<FunctionSummary>(
GVFlags, InstCount, FFlags, /EntryCount=/0, std::move(Refs),		GVFlags, InstCount, FFlags, /EntryCount=/0, std::move(Refs),
std::move(Calls), std::move(TypeIdInfo.TypeTests),		std::move(Calls), std::move(TypeIdInfo.TypeTests),
std::move(TypeIdInfo.TypeTestAssumeVCalls),		std::move(TypeIdInfo.TypeTestAssumeVCalls),
std::move(TypeIdInfo.TypeCheckedLoadVCalls),		std::move(TypeIdInfo.TypeCheckedLoadVCalls),
std::move(TypeIdInfo.TypeTestAssumeConstVCalls),		std::move(TypeIdInfo.TypeTestAssumeConstVCalls),
std::move(TypeIdInfo.TypeCheckedLoadConstVCalls),		std::move(TypeIdInfo.TypeCheckedLoadConstVCalls),
std::move(ParamAccesses));		std::move(ParamAccesses),
		/// FIXME: Implement actually parser for FuncSpecCostInfo.
		FuncSpecCostInfo());

FS->setModulePath(ModulePath);		FS->setModulePath(ModulePath);

addGlobalValueToIndex(Name, GUID, (GlobalValue::LinkageTypes)GVFlags.Linkage,		addGlobalValueToIndex(Name, GUID, (GlobalValue::LinkageTypes)GVFlags.Linkage,
ID, std::move(FS));		ID, std::move(FS));

return false;		return false;
}		}
▲ Show 20 Lines • Show All 220 Lines • ▼ Show 20 Lines	if (EatIfPresent(lltok::comma)) {
return true;		return true;
}		}
}		}
// Keep track of the Call array index needing a forward reference.		// Keep track of the Call array index needing a forward reference.
// We will save the location of the ValueInfo needing an update, but		// We will save the location of the ValueInfo needing an update, but
// can only do so once the std::vector is finalized.		// can only do so once the std::vector is finalized.
if (VI.getRef() == FwdVIRef)		if (VI.getRef() == FwdVIRef)
IdToIndexMap[GVId].push_back(std::make_pair(Calls.size(), Loc));		IdToIndexMap[GVId].push_back(std::make_pair(Calls.size(), Loc));
Calls.push_back(FunctionSummary::EdgeTy{VI, CalleeInfo(Hotness, RelBF)});
		/// FIXME: Implement parser for new added infomation.
		Calls.push_back(
		FunctionSummary::EdgeTy{VI, CalleeInfo(Hotness, RelBF, ArgUsage())});

if (parseToken(lltok::rparen, "expected ')' in call"))		if (parseToken(lltok::rparen, "expected ')' in call"))
return true;		return true;
} while (EatIfPresent(lltok::comma));		} while (EatIfPresent(lltok::comma));

// Now that the Calls vector is finalized, it is safe to save the locations		// Now that the Calls vector is finalized, it is safe to save the locations
// of any forward GV references that need updating later.		// of any forward GV references that need updating later.
for (auto I : IdToIndexMap) {		for (auto I : IdToIndexMap) {
▲ Show 20 Lines • Show All 678 Lines • Show Last 20 Lines

llvm/lib/Bitcode/Reader/BitcodeReader.cpp

Show First 20 Lines • Show All 785 Lines • ▼ Show 20 Lines
private:		private:
void setValueGUID(uint64_t ValueID, StringRef ValueName,		void setValueGUID(uint64_t ValueID, StringRef ValueName,
GlobalValue::LinkageTypes Linkage,		GlobalValue::LinkageTypes Linkage,
StringRef SourceFileName);		StringRef SourceFileName);
Error parseValueSymbolTable(		Error parseValueSymbolTable(
uint64_t Offset,		uint64_t Offset,
DenseMap<unsigned, GlobalValue::LinkageTypes> &ValueIdToLinkageMap);		DenseMap<unsigned, GlobalValue::LinkageTypes> &ValueIdToLinkageMap);
std::vector<ValueInfo> makeRefList(ArrayRef<uint64_t> Record);		std::vector<ValueInfo> makeRefList(ArrayRef<uint64_t> Record);
		FuncSpecCostInfo makeSpecCostInfo(ArrayRef<uint64_t> Record,
		unsigned SpecCost);
std::vector<FunctionSummary::EdgeTy> makeCallList(ArrayRef<uint64_t> Record,		std::vector<FunctionSummary::EdgeTy> makeCallList(ArrayRef<uint64_t> Record,
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - std::vector<FunctionSummary::EdgeTy> makeCallList(ArrayRef<uint64_t> Record, - uint64_t Version, - bool IsOldProfileFormat, - bool HasProfile, - bool HasRelBF); + std::vector<FunctionSummary::EdgeTy> + makeCallList(ArrayRef<uint64_t> Record, uint64_t Version, + bool IsOldProfileFormat, bool HasProfile, bool HasRelBF); Lint: Pre-merge checks: clang-format: please reformat the code ``` - std::vector<FunctionSummary::EdgeTy> makeCallList…
		uint64_t Version,
bool IsOldProfileFormat,		bool IsOldProfileFormat,
bool HasProfile,		bool HasProfile,
bool HasRelBF);		bool HasRelBF);
		ArgUsage makeArgUsages(ArrayRef<uint64_t> Record, unsigned &I);
Error parseEntireSummary(unsigned ID);		Error parseEntireSummary(unsigned ID);
Error parseModuleStringTable();		Error parseModuleStringTable();
void parseTypeIdCompatibleVtableSummaryRecord(ArrayRef<uint64_t> Record);		void parseTypeIdCompatibleVtableSummaryRecord(ArrayRef<uint64_t> Record);
void parseTypeIdCompatibleVtableInfo(ArrayRef<uint64_t> Record, size_t &Slot,		void parseTypeIdCompatibleVtableInfo(ArrayRef<uint64_t> Record, size_t &Slot,
TypeIdCompatibleVtableInfo &TypeId);		TypeIdCompatibleVtableInfo &TypeId);
std::vector<FunctionSummary::ParamAccess>		std::vector<FunctionSummary::ParamAccess>
parseParamAccesses(ArrayRef<uint64_t> Record);		parseParamAccesses(ArrayRef<uint64_t> Record);

▲ Show 20 Lines • Show All 5,106 Lines • ▼ Show 20 Lines
ModuleSummaryIndexBitcodeReader::makeRefList(ArrayRef<uint64_t> Record) {		ModuleSummaryIndexBitcodeReader::makeRefList(ArrayRef<uint64_t> Record) {
std::vector<ValueInfo> Ret;		std::vector<ValueInfo> Ret;
Ret.reserve(Record.size());		Ret.reserve(Record.size());
for (uint64_t RefValueId : Record)		for (uint64_t RefValueId : Record)
Ret.push_back(getValueInfoFromValueId(RefValueId).first);		Ret.push_back(getValueInfoFromValueId(RefValueId).first);
return Ret;		return Ret;
}		}

		FuncSpecCostInfo
		ModuleSummaryIndexBitcodeReader::makeSpecCostInfo(ArrayRef<uint64_t> Record,
		unsigned SpecCost) {
		MapVector<unsigned, unsigned> BonusBaseMap;
		auto size = Record.size();
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'size' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'size' [readability-identifier-naming]…
		assert(size % 2 == 0 && "The size of base bonus should be even!");
		for (unsigned i = 0; i < size; i += 2)
		Lint: Pre-merge checks Inline Actions clang-tidy: warning: invalid case style for variable 'i' [readability-identifier-naming] not useful Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'i' [readability-identifier-naming]…
		BonusBaseMap.insert({Record[i], Record[i + 1]});
		return FuncSpecCostInfo(SpecCost, std::move(BonusBaseMap));
		}

		ArgUsage
		ModuleSummaryIndexBitcodeReader::makeArgUsages(ArrayRef<uint64_t> Record,
		unsigned &I) {
		SmallVector<std::pair<unsigned, unsigned>, 4> Uses;
		size_t Size = Record[I++];
		assert(Size % 2 == 0);
		assert(I + Size <= Record.size());
		Size += I;
		for (; I != Size; I += 2)
		Uses.push_back({Record[I], Record[I + 1]});
		return ArgUsage(std::move(Uses));
		}

std::vector<FunctionSummary::EdgeTy>		std::vector<FunctionSummary::EdgeTy>
ModuleSummaryIndexBitcodeReader::makeCallList(ArrayRef<uint64_t> Record,		ModuleSummaryIndexBitcodeReader::makeCallList(ArrayRef<uint64_t> Record,
		uint64_t Version,
bool IsOldProfileFormat,		bool IsOldProfileFormat,
bool HasProfile, bool HasRelBF) {		bool HasProfile, bool HasRelBF) {
std::vector<FunctionSummary::EdgeTy> Ret;		std::vector<FunctionSummary::EdgeTy> Ret;
Ret.reserve(Record.size());		Ret.reserve(Record.size());
for (unsigned I = 0, E = Record.size(); I != E; ++I) {		for (unsigned I = 0, E = Record.size(); I != E; ++I) {
CalleeInfo::HotnessType Hotness = CalleeInfo::HotnessType::Unknown;		CalleeInfo::HotnessType Hotness = CalleeInfo::HotnessType::Unknown;
uint64_t RelBF = 0;		uint64_t RelBF = 0;
ValueInfo Callee = getValueInfoFromValueId(Record[I]).first;		ValueInfo Callee = getValueInfoFromValueId(Record[I]).first;
		ArgUsage Uses;
		if (Version >= 10) {
		Uses = makeArgUsages(Record, ++I);
		I -= 1; // Match original order.
		}
if (IsOldProfileFormat) {		if (IsOldProfileFormat) {
I += 1; // Skip old callsitecount field		I += 1; // Skip old callsitecount field
if (HasProfile)		if (HasProfile)
I += 1; // Skip old profilecount field		I += 1; // Skip old profilecount field
} else if (HasProfile)		} else if (HasProfile)
Hotness = static_cast<CalleeInfo::HotnessType>(Record[++I]);		Hotness = static_cast<CalleeInfo::HotnessType>(Record[++I]);
else if (HasRelBF)		else if (HasRelBF)
RelBF = Record[++I];		RelBF = Record[++I];
Ret.push_back(FunctionSummary::EdgeTy{Callee, CalleeInfo(Hotness, RelBF)});		Ret.push_back(FunctionSummary::EdgeTy{
		Callee, CalleeInfo(Hotness, RelBF, std::move(Uses))});
}		}
return Ret;		return Ret;
}		}

static void		static void
parseWholeProgramDevirtResolutionByArg(ArrayRef<uint64_t> Record, size_t &Slot,		parseWholeProgramDevirtResolutionByArg(ArrayRef<uint64_t> Record, size_t &Slot,
WholeProgramDevirtResolution &Wpd) {		WholeProgramDevirtResolution &Wpd) {
uint64_t ArgNum = Record[Slot++];		uint64_t ArgNum = Record[Slot++];
▲ Show 20 Lines • Show All 212 Lines • ▼ Show 20 Lines	while (true) {
case bitc::FS_PERMODULE_PROFILE: {		case bitc::FS_PERMODULE_PROFILE: {
unsigned ValueID = Record[0];		unsigned ValueID = Record[0];
uint64_t RawFlags = Record[1];		uint64_t RawFlags = Record[1];
unsigned InstCount = Record[2];		unsigned InstCount = Record[2];
uint64_t RawFunFlags = 0;		uint64_t RawFunFlags = 0;
unsigned NumRefs = Record[3];		unsigned NumRefs = Record[3];
unsigned NumRORefs = 0, NumWORefs = 0;		unsigned NumRORefs = 0, NumWORefs = 0;
int RefListStartIndex = 4;		int RefListStartIndex = 4;
		unsigned SpecCost = ~(unsigned)0;
		unsigned NumBaseBonus = 0;
if (Version >= 4) {		if (Version >= 4) {
RawFunFlags = Record[3];		RawFunFlags = Record[3];
NumRefs = Record[4];		NumRefs = Record[4];
RefListStartIndex = 5;		RefListStartIndex = 5;
if (Version >= 5) {		if (Version >= 5) {
NumRORefs = Record[5];		NumRORefs = Record[5];
RefListStartIndex = 6;		RefListStartIndex = 6;
if (Version >= 7) {		if (Version >= 7) {
NumWORefs = Record[6];		NumWORefs = Record[6];
RefListStartIndex = 7;		RefListStartIndex = 7;
		if (Version >= 10) {
		SpecCost = Record[7];
		NumBaseBonus = Record[8];
		RefListStartIndex = 9;
		}
}		}
}		}
}		}

auto Flags = getDecodedGVSummaryFlags(RawFlags, Version);		auto Flags = getDecodedGVSummaryFlags(RawFlags, Version);
// The module path string ref set in the summary must be owned by the		// The module path string ref set in the summary must be owned by the
// index's module string table. Since we don't have a module path		// index's module string table. Since we don't have a module path
// string table section in the per-module index, we create a single		// string table section in the per-module index, we create a single
// module path string table entry with an empty (0) ID to take		// module path string table entry with an empty (0) ID to take
// ownership.		// ownership.
int CallGraphEdgeStartIndex = RefListStartIndex + NumRefs;		unsigned FuncSpecStartIndex = RefListStartIndex + NumRefs;
assert(Record.size() >= RefListStartIndex + NumRefs &&		unsigned CallGraphEdgeStartIndex = FuncSpecStartIndex + NumBaseBonus;
		assert(Record.size() >= CallGraphEdgeStartIndex &&
"Record size inconsistent with number of references");		"Record size inconsistent with number of references");
std::vector<ValueInfo> Refs = makeRefList(		std::vector<ValueInfo> Refs = makeRefList(
ArrayRef<uint64_t>(Record).slice(RefListStartIndex, NumRefs));		ArrayRef<uint64_t>(Record).slice(RefListStartIndex, NumRefs));
		FuncSpecCostInfo FSCI = makeSpecCostInfo(
		ArrayRef<uint64_t>(Record).slice(FuncSpecStartIndex, NumBaseBonus),
		SpecCost);
bool HasProfile = (BitCode == bitc::FS_PERMODULE_PROFILE);		bool HasProfile = (BitCode == bitc::FS_PERMODULE_PROFILE);
bool HasRelBF = (BitCode == bitc::FS_PERMODULE_RELBF);		bool HasRelBF = (BitCode == bitc::FS_PERMODULE_RELBF);
std::vector<FunctionSummary::EdgeTy> Calls = makeCallList(		std::vector<FunctionSummary::EdgeTy> Calls = makeCallList(
ArrayRef<uint64_t>(Record).slice(CallGraphEdgeStartIndex),		ArrayRef<uint64_t>(Record).slice(CallGraphEdgeStartIndex),
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - ArrayRef<uint64_t>(Record).slice(CallGraphEdgeStartIndex), - Version, IsOldProfileFormat, HasProfile, HasRelBF); + ArrayRef<uint64_t>(Record).slice(CallGraphEdgeStartIndex), Version, + IsOldProfileFormat, HasProfile, HasRelBF); Lint: Pre-merge checks: clang-format: please reformat the code ``` - ArrayRef<uint64_t>(Record).slice…
IsOldProfileFormat, HasProfile, HasRelBF);		Version, IsOldProfileFormat, HasProfile, HasRelBF);
setSpecialRefs(Refs, NumRORefs, NumWORefs);		setSpecialRefs(Refs, NumRORefs, NumWORefs);
auto FS = std::make_unique<FunctionSummary>(		auto FS = std::make_unique<FunctionSummary>(
Flags, InstCount, getDecodedFFlags(RawFunFlags), /EntryCount=/0,		Flags, InstCount, getDecodedFFlags(RawFunFlags), /EntryCount=/0,
std::move(Refs), std::move(Calls), std::move(PendingTypeTests),		std::move(Refs), std::move(Calls), std::move(PendingTypeTests),
std::move(PendingTypeTestAssumeVCalls),		std::move(PendingTypeTestAssumeVCalls),
std::move(PendingTypeCheckedLoadVCalls),		std::move(PendingTypeCheckedLoadVCalls),
std::move(PendingTypeTestAssumeConstVCalls),		std::move(PendingTypeTestAssumeConstVCalls),
std::move(PendingTypeCheckedLoadConstVCalls),		std::move(PendingTypeCheckedLoadConstVCalls),
std::move(PendingParamAccesses));		std::move(PendingParamAccesses), std::move(FSCI));
auto VIAndOriginalGUID = getValueInfoFromValueId(ValueID);		auto VIAndOriginalGUID = getValueInfoFromValueId(ValueID);
FS->setModulePath(getThisModule()->first());		FS->setModulePath(getThisModule()->first());
FS->setOriginalName(VIAndOriginalGUID.second);		FS->setOriginalName(VIAndOriginalGUID.second);
TheIndex.addGlobalValueSummary(VIAndOriginalGUID.first, std::move(FS));		TheIndex.addGlobalValueSummary(VIAndOriginalGUID.first, std::move(FS));
break;		break;
}		}
// FS_ALIAS: [valueid, flags, valueid]		// FS_ALIAS: [valueid, flags, valueid]
// Aliases must be emitted (and parsed) after all FS_PERMODULE entries, as		// Aliases must be emitted (and parsed) after all FS_PERMODULE entries, as
▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	case bitc::FS_COMBINED_PROFILE: {
uint64_t ModuleId = Record[1];		uint64_t ModuleId = Record[1];
uint64_t RawFlags = Record[2];		uint64_t RawFlags = Record[2];
unsigned InstCount = Record[3];		unsigned InstCount = Record[3];
uint64_t RawFunFlags = 0;		uint64_t RawFunFlags = 0;
uint64_t EntryCount = 0;		uint64_t EntryCount = 0;
unsigned NumRefs = Record[4];		unsigned NumRefs = Record[4];
unsigned NumRORefs = 0, NumWORefs = 0;		unsigned NumRORefs = 0, NumWORefs = 0;
int RefListStartIndex = 5;		int RefListStartIndex = 5;
		unsigned SpecCost = ~(unsigned)0;
		unsigned NumBaseBonus = 0;

if (Version >= 4) {		if (Version >= 4) {
RawFunFlags = Record[4];		RawFunFlags = Record[4];
RefListStartIndex = 6;		RefListStartIndex = 6;
size_t NumRefsIndex = 5;		size_t NumRefsIndex = 5;
if (Version >= 5) {		if (Version >= 5) {
unsigned NumRORefsOffset = 1;		unsigned NumRORefsOffset = 1;
RefListStartIndex = 7;		RefListStartIndex = 7;
if (Version >= 6) {		if (Version >= 6) {
NumRefsIndex = 6;		NumRefsIndex = 6;
EntryCount = Record[5];		EntryCount = Record[5];
RefListStartIndex = 8;		RefListStartIndex = 8;
if (Version >= 7) {		if (Version >= 7) {
RefListStartIndex = 9;		RefListStartIndex = 9;
NumWORefs = Record[8];		NumWORefs = Record[8];
NumRORefsOffset = 2;		NumRORefsOffset = 2;

		if (Version >= 10) {
		SpecCost = Record[9];
		NumBaseBonus = Record[10];
		RefListStartIndex = 11;
		NumRORefsOffset = 4;
		}
}		}
}		}
NumRORefs = Record[RefListStartIndex - NumRORefsOffset];		NumRORefs = Record[RefListStartIndex - NumRORefsOffset];
}		}
NumRefs = Record[NumRefsIndex];		NumRefs = Record[NumRefsIndex];
}		}

auto Flags = getDecodedGVSummaryFlags(RawFlags, Version);		auto Flags = getDecodedGVSummaryFlags(RawFlags, Version);
int CallGraphEdgeStartIndex = RefListStartIndex + NumRefs;
assert(Record.size() >= RefListStartIndex + NumRefs &&		uint64_t FuncSpecStartIndex = RefListStartIndex + NumRefs;
		uint64_t CallGraphEdgeStartIndex = FuncSpecStartIndex + NumBaseBonus;
		assert(Record.size() >= CallGraphEdgeStartIndex &&
"Record size inconsistent with number of references");		"Record size inconsistent with number of references");
std::vector<ValueInfo> Refs = makeRefList(		std::vector<ValueInfo> Refs = makeRefList(
ArrayRef<uint64_t>(Record).slice(RefListStartIndex, NumRefs));		ArrayRef<uint64_t>(Record).slice(RefListStartIndex, NumRefs));
		FuncSpecCostInfo FSCI = makeSpecCostInfo(
		ArrayRef<uint64_t>(Record).slice(FuncSpecStartIndex, NumBaseBonus),
		SpecCost);
bool HasProfile = (BitCode == bitc::FS_COMBINED_PROFILE);		bool HasProfile = (BitCode == bitc::FS_COMBINED_PROFILE);
std::vector<FunctionSummary::EdgeTy> Edges = makeCallList(		std::vector<FunctionSummary::EdgeTy> Edges = makeCallList(
ArrayRef<uint64_t>(Record).slice(CallGraphEdgeStartIndex),		ArrayRef<uint64_t>(Record).slice(CallGraphEdgeStartIndex),
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - ArrayRef<uint64_t>(Record).slice(CallGraphEdgeStartIndex), - Version, IsOldProfileFormat, HasProfile, false); + ArrayRef<uint64_t>(Record).slice(CallGraphEdgeStartIndex), Version, + IsOldProfileFormat, HasProfile, false); Lint: Pre-merge checks: clang-format: please reformat the code ``` - ArrayRef<uint64_t>(Record).slice…
IsOldProfileFormat, HasProfile, false);		Version, IsOldProfileFormat, HasProfile, false);
ValueInfo VI = getValueInfoFromValueId(ValueID).first;		ValueInfo VI = getValueInfoFromValueId(ValueID).first;
setSpecialRefs(Refs, NumRORefs, NumWORefs);		setSpecialRefs(Refs, NumRORefs, NumWORefs);
auto FS = std::make_unique<FunctionSummary>(		auto FS = std::make_unique<FunctionSummary>(
Flags, InstCount, getDecodedFFlags(RawFunFlags), EntryCount,		Flags, InstCount, getDecodedFFlags(RawFunFlags), EntryCount,
std::move(Refs), std::move(Edges), std::move(PendingTypeTests),		std::move(Refs), std::move(Edges), std::move(PendingTypeTests),
std::move(PendingTypeTestAssumeVCalls),		std::move(PendingTypeTestAssumeVCalls),
std::move(PendingTypeCheckedLoadVCalls),		std::move(PendingTypeCheckedLoadVCalls),
std::move(PendingTypeTestAssumeConstVCalls),		std::move(PendingTypeTestAssumeConstVCalls),
std::move(PendingTypeCheckedLoadConstVCalls),		std::move(PendingTypeCheckedLoadConstVCalls),
std::move(PendingParamAccesses));		std::move(PendingParamAccesses), std::move(FSCI));
LastSeenSummary = FS.get();		LastSeenSummary = FS.get();
LastSeenGUID = VI.getGUID();		LastSeenGUID = VI.getGUID();
FS->setModulePath(ModuleIdMap[ModuleId]);		FS->setModulePath(ModuleIdMap[ModuleId]);
TheIndex.addGlobalValueSummary(VI, std::move(FS));		TheIndex.addGlobalValueSummary(VI, std::move(FS));
break;		break;
}		}
// FS_COMBINED_ALIAS: [valueid, modid, flags, valueid]		// FS_COMBINED_ALIAS: [valueid, modid, flags, valueid]
// Aliases must be emitted (and parsed) after all FS_COMBINED entries, as		// Aliases must be emitted (and parsed) after all FS_COMBINED entries, as
▲ Show 20 Lines • Show All 679 Lines • Show Last 20 Lines

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp

Show First 20 Lines • Show All 3,817 Lines • ▼ Show 20 Lines	void ModuleBitcodeWriterBase::writePerModuleFunctionSummaryRecord(

auto SpecialRefCnts = FS->specialRefCounts();		auto SpecialRefCnts = FS->specialRefCounts();
NameVals.push_back(getEncodedGVSummaryFlags(FS->flags()));		NameVals.push_back(getEncodedGVSummaryFlags(FS->flags()));
NameVals.push_back(FS->instCount());		NameVals.push_back(FS->instCount());
NameVals.push_back(getEncodedFFlags(FS->fflags()));		NameVals.push_back(getEncodedFFlags(FS->fflags()));
NameVals.push_back(FS->refs().size());		NameVals.push_back(FS->refs().size());
NameVals.push_back(SpecialRefCnts.first); // rorefcnt		NameVals.push_back(SpecialRefCnts.first); // rorefcnt
NameVals.push_back(SpecialRefCnts.second); // worefcnt		NameVals.push_back(SpecialRefCnts.second); // worefcnt
		NameVals.push_back(FS->getSpecializeCost());
		auto BonusBases = FS->getSpecBonusBase();
		NameVals.push_back(BonusBases.size() * 2);

for (auto &RI : FS->refs())		for (auto &RI : FS->refs())
NameVals.push_back(VE.getValueID(RI.getValue()));		NameVals.push_back(VE.getValueID(RI.getValue()));

		for (auto &I : BonusBases) {
		NameVals.push_back(I.first);
		NameVals.push_back(I.second);
		}

bool HasProfileData =		bool HasProfileData =
F.hasProfileData() \|\| ForceSummaryEdgesCold != FunctionSummary::FSHT_None;		F.hasProfileData() \|\| ForceSummaryEdgesCold != FunctionSummary::FSHT_None;
for (auto &ECI : FS->calls()) {		for (auto &ECI : FS->calls()) {
NameVals.push_back(getValueId(ECI.first));		NameVals.push_back(getValueId(ECI.first));
		NameVals.push_back(ECI.second.getUsagesSize() * 2);
		auto Uses = ECI.second.getUsages();
		for (auto &IndexValuePair : Uses) {
		NameVals.push_back(IndexValuePair.first);
		NameVals.push_back(IndexValuePair.second);
		}
if (HasProfileData)		if (HasProfileData)
NameVals.push_back(static_cast<uint8_t>(ECI.second.Hotness));		NameVals.push_back(static_cast<uint8_t>(ECI.second.Hotness));
else if (WriteRelBFToSummary)		else if (WriteRelBFToSummary)
NameVals.push_back(ECI.second.RelBlockFreq);		NameVals.push_back(ECI.second.RelBlockFreq);
}		}

unsigned FSAbbrev = (HasProfileData ? FSCallsProfileAbbrev : FSCallsAbbrev);		unsigned FSAbbrev = (HasProfileData ? FSCallsProfileAbbrev : FSCallsAbbrev);
unsigned Code =		unsigned Code =
▲ Show 20 Lines • Show All 389 Lines • ▼ Show 20 Lines	forEachSummary([&](GVInfo I, bool IsAliasee) {
NameVals.push_back(FS->instCount());		NameVals.push_back(FS->instCount());
NameVals.push_back(getEncodedFFlags(FS->fflags()));		NameVals.push_back(getEncodedFFlags(FS->fflags()));
NameVals.push_back(FS->entryCount());		NameVals.push_back(FS->entryCount());

// Fill in below		// Fill in below
NameVals.push_back(0); // numrefs		NameVals.push_back(0); // numrefs
NameVals.push_back(0); // rorefcnt		NameVals.push_back(0); // rorefcnt
NameVals.push_back(0); // worefcnt		NameVals.push_back(0); // worefcnt
		NameVals.push_back(FS->getSpecializeCost());
		auto BonusBases = FS->getSpecBonusBase();
		NameVals.push_back(BonusBases.size() * 2);

unsigned Count = 0, RORefCnt = 0, WORefCnt = 0;		unsigned Count = 0, RORefCnt = 0, WORefCnt = 0;
for (auto &RI : FS->refs()) {		for (auto &RI : FS->refs()) {
auto RefValueId = getValueId(RI.getGUID());		auto RefValueId = getValueId(RI.getGUID());
if (!RefValueId)		if (!RefValueId)
continue;		continue;
NameVals.push_back(*RefValueId);		NameVals.push_back(*RefValueId);
if (RI.isReadOnly())		if (RI.isReadOnly())
RORefCnt++;		RORefCnt++;
else if (RI.isWriteOnly())		else if (RI.isWriteOnly())
WORefCnt++;		WORefCnt++;
Count++;		Count++;
}		}
NameVals[6] = Count;		NameVals[6] = Count;
NameVals[7] = RORefCnt;		NameVals[7] = RORefCnt;
NameVals[8] = WORefCnt;		NameVals[8] = WORefCnt;

		for (auto &I : BonusBases) {
		NameVals.push_back(I.first);
		NameVals.push_back(I.second);
		}

bool HasProfileData = false;		bool HasProfileData = false;
for (auto &EI : FS->calls()) {		for (auto &EI : FS->calls()) {
HasProfileData \|=		HasProfileData \|=
EI.second.getHotness() != CalleeInfo::HotnessType::Unknown;		EI.second.getHotness() != CalleeInfo::HotnessType::Unknown;
if (HasProfileData)		if (HasProfileData)
break;		break;
}		}

for (auto &EI : FS->calls()) {		for (auto &EI : FS->calls()) {
// If this GUID doesn't have a value id, it doesn't have a function		// If this GUID doesn't have a value id, it doesn't have a function
// summary and we don't need to record any calls to it.		// summary and we don't need to record any calls to it.
Optional<unsigned> CallValueId = GetValueId(EI.first);		Optional<unsigned> CallValueId = GetValueId(EI.first);
if (!CallValueId)		if (!CallValueId)
continue;		continue;
NameVals.push_back(*CallValueId);		NameVals.push_back(*CallValueId);
		NameVals.push_back(EI.second.getUsagesSize() * 2);
		auto Uses = EI.second.getUsages();
		for (auto &IndexValuePair : Uses) {
		NameVals.push_back(IndexValuePair.first);
		NameVals.push_back(IndexValuePair.second);
		}
if (HasProfileData)		if (HasProfileData)
NameVals.push_back(static_cast<uint8_t>(EI.second.Hotness));		NameVals.push_back(static_cast<uint8_t>(EI.second.Hotness));
}		}

unsigned FSAbbrev = (HasProfileData ? FSCallsProfileAbbrev : FSCallsAbbrev);		unsigned FSAbbrev = (HasProfileData ? FSCallsProfileAbbrev : FSCallsAbbrev);
unsigned Code =		unsigned Code =
(HasProfileData ? bitc::FS_COMBINED_PROFILE : bitc::FS_COMBINED);		(HasProfileData ? bitc::FS_COMBINED_PROFILE : bitc::FS_COMBINED);

▲ Show 20 Lines • Show All 700 Lines • Show Last 20 Lines

llvm/lib/IR/AsmWriter.cpp

Show First 20 Lines • Show All 3,205 Lines • ▼ Show 20 Lines	if (FFlags.ReadNone \| FFlags.ReadOnly \| FFlags.NoRecurse \|
Out << "readNone: " << FFlags.ReadNone;		Out << "readNone: " << FFlags.ReadNone;
Out << ", readOnly: " << FFlags.ReadOnly;		Out << ", readOnly: " << FFlags.ReadOnly;
Out << ", noRecurse: " << FFlags.NoRecurse;		Out << ", noRecurse: " << FFlags.NoRecurse;
Out << ", returnDoesNotAlias: " << FFlags.ReturnDoesNotAlias;		Out << ", returnDoesNotAlias: " << FFlags.ReturnDoesNotAlias;
Out << ", noInline: " << FFlags.NoInline;		Out << ", noInline: " << FFlags.NoInline;
Out << ", alwaysInline: " << FFlags.AlwaysInline;		Out << ", alwaysInline: " << FFlags.AlwaysInline;
Out << ")";		Out << ")";
}		}

		/// FIXME: Add writer for new infomation added for function specialization.

if (!FS->calls().empty()) {		if (!FS->calls().empty()) {
Out << ", calls: (";		Out << ", calls: (";
FieldSeparator IFS;		FieldSeparator IFS;
for (auto &Call : FS->calls()) {		for (auto &Call : FS->calls()) {
Out << IFS;		Out << IFS;
Out << "(callee: ^" << Machine.getGUIDSlot(Call.first.getGUID());		Out << "(callee: ^" << Machine.getGUIDSlot(Call.first.getGUID());
if (Call.second.getHotness() != CalleeInfo::HotnessType::Unknown)		if (Call.second.getHotness() != CalleeInfo::HotnessType::Unknown)
Out << ", hotness: " << getHotnessName(Call.second.getHotness());		Out << ", hotness: " << getHotnessName(Call.second.getHotness());
▲ Show 20 Lines • Show All 1,567 Lines • Show Last 20 Lines

llvm/lib/LTO/ThinLTOCodeGenerator.cpp

Show First 20 Lines • Show All 535 Lines • ▼ Show 20 Lines	ProcessThinLTOModule(Module &TheModule, ModuleSummaryIndex &Index,
saveTempBitcode(TheModule, SaveTempsDir, count, ".4.opt.bc");		saveTempBitcode(TheModule, SaveTempsDir, count, ".4.opt.bc");

if (DisableCodeGen) {		if (DisableCodeGen) {
// Configured to stop before CodeGen, serialize the bitcode and return.		// Configured to stop before CodeGen, serialize the bitcode and return.
SmallVector<char, 128> OutputBuffer;		SmallVector<char, 128> OutputBuffer;
{		{
raw_svector_ostream OS(OutputBuffer);		raw_svector_ostream OS(OutputBuffer);
ProfileSummaryInfo PSI(TheModule);		ProfileSummaryInfo PSI(TheModule);
auto Index = buildModuleSummaryIndex(TheModule, nullptr, &PSI);		auto Index = buildModuleSummaryIndex(TheModule, nullptr, nullptr, &PSI);
WriteBitcodeToFile(TheModule, OS, true, &Index);		WriteBitcodeToFile(TheModule, OS, true, &Index);
}		}
return std::make_unique<SmallVectorMemoryBuffer>(std::move(OutputBuffer));		return std::make_unique<SmallVectorMemoryBuffer>(std::move(OutputBuffer));
}		}

return codegenModule(TheModule, TM);		return codegenModule(TheModule, TM);
}		}

▲ Show 20 Lines • Show All 692 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/FunctionImport.cpp

Show First 20 Lines • Show All 137 Lines • ▼ Show 20 Lines	SummaryFile("summary-file",
cl::desc("The summary file to use for function importing."));		cl::desc("The summary file to use for function importing."));

/// Used when testing importing from distributed indexes via opt		/// Used when testing importing from distributed indexes via opt
// -function-import.		// -function-import.
static cl::opt<bool>		static cl::opt<bool>
ImportAllIndex("import-all-index",		ImportAllIndex("import-all-index",
cl::desc("Import all external functions in index."));		cl::desc("Import all external functions in index."));

		static cl::opt<bool>
		UseFuncSpecCostInfo("import-use-func-spec-info",
		cl::desc("Considering function specialization cost "
		"infomation when importing functions.\n"),
		cl::Hidden, cl::init(true));

// Load lazily a module from \p FileName in \p Context.		// Load lazily a module from \p FileName in \p Context.
static std::unique_ptr<Module> loadFile(const std::string &FileName,		static std::unique_ptr<Module> loadFile(const std::string &FileName,
LLVMContext &Context) {		LLVMContext &Context) {
SMDiagnostic Err;		SMDiagnostic Err;
LLVM_DEBUG(dbgs() << "Loading '" << FileName << "'\n");		LLVM_DEBUG(dbgs() << "Loading '" << FileName << "'\n");
// Metadata isn't loaded until functions are imported, to minimize		// Metadata isn't loaded until functions are imported, to minimize
// the memory overhead.		// the memory overhead.
std::unique_ptr<Module> Result =		std::unique_ptr<Module> Result =
Show All 12 Lines
///		///
/// FIXME: select "best" instead of first that fits. But what is "best"?		/// FIXME: select "best" instead of first that fits. But what is "best"?
/// - The smallest: more likely to be inlined.		/// - The smallest: more likely to be inlined.
/// - The one with the least outgoing edges (already well optimized).		/// - The one with the least outgoing edges (already well optimized).
/// - One from a module already being imported from in order to reduce the		/// - One from a module already being imported from in order to reduce the
/// number of source modules parsed/linked.		/// number of source modules parsed/linked.
/// - One that has PGO data attached.		/// - One that has PGO data attached.
/// - [insert you fancy metric here]		/// - [insert you fancy metric here]
static const GlobalValueSummary *		static const GlobalValueSummary *selectCallee(
selectCallee(const ModuleSummaryIndex &Index,		const ModuleSummaryIndex &Index,
ArrayRef<std::unique_ptr<GlobalValueSummary>> CalleeSummaryList,		ArrayRef<std::unique_ptr<GlobalValueSummary>> CalleeSummaryList,
unsigned Threshold, StringRef CallerModulePath,		const CalleeInfo &CI, unsigned Threshold, StringRef CallerModulePath,
FunctionImporter::ImportFailureReason &Reason,		FunctionImporter::ImportFailureReason &Reason, GlobalValue::GUID GUID) {
GlobalValue::GUID GUID) {
Reason = FunctionImporter::ImportFailureReason::None;		Reason = FunctionImporter::ImportFailureReason::None;
auto It = llvm::find_if(		auto It = llvm::find_if(
CalleeSummaryList,		CalleeSummaryList,
[&](const std::unique_ptr<GlobalValueSummary> &SummaryPtr) {		[&](const std::unique_ptr<GlobalValueSummary> &SummaryPtr) {
auto *GVSummary = SummaryPtr.get();		auto *GVSummary = SummaryPtr.get();
if (!Index.isGlobalValueLive(GVSummary)) {		if (!Index.isGlobalValueLive(GVSummary)) {
Reason = FunctionImporter::ImportFailureReason::NotLive;		Reason = FunctionImporter::ImportFailureReason::NotLive;
return false;		return false;
Show All 37 Lines	auto It = llvm::find_if(
if (GlobalValue::isLocalLinkage(Summary->linkage()) &&		if (GlobalValue::isLocalLinkage(Summary->linkage()) &&
CalleeSummaryList.size() > 1 &&		CalleeSummaryList.size() > 1 &&
Summary->modulePath() != CallerModulePath) {		Summary->modulePath() != CallerModulePath) {
Reason =		Reason =
FunctionImporter::ImportFailureReason::LocalLinkageNotInModule;		FunctionImporter::ImportFailureReason::LocalLinkageNotInModule;
return false;		return false;
}		}

		Reason = [&](FunctionSummary *Summary) {
if ((Summary->instCount() > Threshold) &&		if ((Summary->instCount() > Threshold) &&
!Summary->fflags().AlwaysInline && !ForceImportAll) {		!Summary->fflags().AlwaysInline && !ForceImportAll)
Reason = FunctionImporter::ImportFailureReason::TooLarge;		return FunctionImporter::ImportFailureReason::TooLarge;
return false;
}

// Skip if it isn't legal to import (e.g. may reference unpromotable		// Skip if it isn't legal to import (e.g. may reference unpromotable
// locals).		// locals).
if (Summary->notEligibleToImport()) {		if (Summary->notEligibleToImport())
Reason = FunctionImporter::ImportFailureReason::NotEligible;		return FunctionImporter::ImportFailureReason::NotEligible;
return false;
}

// Don't bother importing if we can't inline it anyway.		// Don't bother importing if we can't inline it anyway.
if (Summary->fflags().NoInline && !ForceImportAll) {		if (Summary->fflags().NoInline && !ForceImportAll)
Reason = FunctionImporter::ImportFailureReason::NoInline;		return FunctionImporter::ImportFailureReason::NoInline;

		return FunctionImporter::ImportFailureReason::None;
		}(Summary);

		if (Reason == FunctionImporter::ImportFailureReason::None)
		return true;

		if (!UseFuncSpecCostInfo)
		return false;

		if (!Summary->shouldImport(CI)) {
		/// FIXME: add new failure type.
		Reason = FunctionImporter::ImportFailureReason::NotEligible;
return false;		return false;
}		}

return true;		return true;
});		});
if (It == CalleeSummaryList.end())		if (It == CalleeSummaryList.end())
return nullptr;		return nullptr;

▲ Show 20 Lines • Show All 207 Lines • ▼ Show 20 Lines	if (CalleeSummary) {
assert(FailureInfo &&		assert(FailureInfo &&
"Expected FailureInfo for previously rejected candidate");		"Expected FailureInfo for previously rejected candidate");
FailureInfo->Attempts++;		FailureInfo->Attempts++;
}		}
continue;		continue;
}		}

FunctionImporter::ImportFailureReason Reason;		FunctionImporter::ImportFailureReason Reason;
CalleeSummary = selectCallee(Index, VI.getSummaryList(), NewThreshold,		CalleeSummary =
		selectCallee(Index, VI.getSummaryList(), Edge.second, NewThreshold,
Summary.modulePath(), Reason, VI.getGUID());		Summary.modulePath(), Reason, VI.getGUID());
if (!CalleeSummary) {		if (!CalleeSummary) {
// Update with new larger threshold if this was a retry (otherwise		// Update with new larger threshold if this was a retry (otherwise
// we would have already inserted with NewThreshold above). Also		// we would have already inserted with NewThreshold above). Also
// update failure info if requested.		// update failure info if requested.
if (PreviouslyVisited) {		if (PreviouslyVisited) {
ProcessedThreshold = NewThreshold;		ProcessedThreshold = NewThreshold;
if (PrintImportFailures) {		if (PrintImportFailures) {
assert(FailureInfo &&		assert(FailureInfo &&
Show All 24 Lines	if (CalleeSummary) {
continue;		continue;
}		}
}		}

// "Resolve" the summary		// "Resolve" the summary
CalleeSummary = CalleeSummary->getBaseObject();		CalleeSummary = CalleeSummary->getBaseObject();
ResolvedCalleeSummary = cast<FunctionSummary>(CalleeSummary);		ResolvedCalleeSummary = cast<FunctionSummary>(CalleeSummary);

		/// FIXME: add cheap condition for assert function specialization.
assert((ResolvedCalleeSummary->fflags().AlwaysInline \|\| ForceImportAll \|\|		assert((ResolvedCalleeSummary->fflags().AlwaysInline \|\| ForceImportAll \|\|
(ResolvedCalleeSummary->instCount() <= NewThreshold)) &&		(ResolvedCalleeSummary->instCount() <= NewThreshold)) &&
"selectCallee() didn't honor the threshold");		"selectCallee() didn't honor the threshold");

auto ExportModulePath = ResolvedCalleeSummary->modulePath();		auto ExportModulePath = ResolvedCalleeSummary->modulePath();
auto ILI = ImportList[ExportModulePath].insert(VI.getGUID());		auto ILI = ImportList[ExportModulePath].insert(VI.getGUID());
// We previously decided to import this GUID definition if it was already		// We previously decided to import this GUID definition if it was already
// inserted in the set of imports from the exporting module.		// inserted in the set of imports from the exporting module.
▲ Show 20 Lines • Show All 908 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp

Show First 20 Lines • Show All 225 Lines • ▼ Show 20 Lines	void splitAndWriteThinLTOBitcode(
raw_ostream &OS, raw_ostream *ThinLinkOS,		raw_ostream &OS, raw_ostream *ThinLinkOS,
function_ref<AAResults &(Function &)> AARGetter, Module &M) {		function_ref<AAResults &(Function &)> AARGetter, Module &M) {
std::string ModuleId = getUniqueModuleId(&M);		std::string ModuleId = getUniqueModuleId(&M);
if (ModuleId.empty()) {		if (ModuleId.empty()) {
// We couldn't generate a module ID for this module, write it out as a		// We couldn't generate a module ID for this module, write it out as a
// regular LTO module with an index for summary-based dead stripping.		// regular LTO module with an index for summary-based dead stripping.
ProfileSummaryInfo PSI(M);		ProfileSummaryInfo PSI(M);
M.addModuleFlag(Module::Error, "ThinLTO", uint32_t(0));		M.addModuleFlag(Module::Error, "ThinLTO", uint32_t(0));
ModuleSummaryIndex Index = buildModuleSummaryIndex(M, nullptr, &PSI);		ModuleSummaryIndex Index =
		buildModuleSummaryIndex(M, nullptr, nullptr, &PSI);
WriteBitcodeToFile(M, OS, /ShouldPreserveUseListOrder=/false, &Index);		WriteBitcodeToFile(M, OS, /ShouldPreserveUseListOrder=/false, &Index);

if (ThinLinkOS)		if (ThinLinkOS)
// We don't have a ThinLTO part, but still write the module to the		// We don't have a ThinLTO part, but still write the module to the
// ThinLinkOS if requested so that the expected output file is produced.		// ThinLinkOS if requested so that the expected output file is produced.
WriteBitcodeToFile(M, ThinLinkOS, /ShouldPreserveUseListOrder=*/false,		WriteBitcodeToFile(M, ThinLinkOS, /ShouldPreserveUseListOrder=*/false,
&Index);		&Index);

▲ Show 20 Lines • Show All 170 Lines • ▼ Show 20 Lines	if (!Symvers.empty()) {
for (auto MD : Symvers)		for (auto MD : Symvers)
NMD->addOperand(MD);		NMD->addOperand(MD);
}		}

simplifyExternals(*MergedM);		simplifyExternals(*MergedM);

// FIXME: Try to re-use BSI and PFI from the original module here.		// FIXME: Try to re-use BSI and PFI from the original module here.
ProfileSummaryInfo PSI(M);		ProfileSummaryInfo PSI(M);
ModuleSummaryIndex Index = buildModuleSummaryIndex(M, nullptr, &PSI);		ModuleSummaryIndex Index = buildModuleSummaryIndex(M, nullptr, nullptr, &PSI);

// Mark the merged module as requiring full LTO. We still want an index for		// Mark the merged module as requiring full LTO. We still want an index for
// it though, so that it can participate in summary-based dead stripping.		// it though, so that it can participate in summary-based dead stripping.
MergedM->addModuleFlag(Module::Error, "ThinLTO", uint32_t(0));		MergedM->addModuleFlag(Module::Error, "ThinLTO", uint32_t(0));
ModuleSummaryIndex MergedMIndex =		ModuleSummaryIndex MergedMIndex =
buildModuleSummaryIndex(*MergedM, nullptr, &PSI);		buildModuleSummaryIndex(*MergedM, nullptr, nullptr, &PSI);

SmallVector<char, 0> Buffer;		SmallVector<char, 0> Buffer;

BitcodeWriter W(Buffer);		BitcodeWriter W(Buffer);
// Save the module hash produced for the full bitcode, which will		// Save the module hash produced for the full bitcode, which will
// be used in the backends, and use that in the minimized bitcode		// be used in the backends, and use that in the minimized bitcode
// produced for the full link.		// produced for the full link.
ModuleHash ModHash = {{0}};		ModuleHash ModHash = {{0}};
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	if (!ModuleId.empty()) {
// FIXME: Probably should not bother building the index at all		// FIXME: Probably should not bother building the index at all
// in the caller of writeThinLTOBitcode (which does so via the		// in the caller of writeThinLTOBitcode (which does so via the
// ModuleSummaryIndexAnalysis pass), since we have to rebuild it		// ModuleSummaryIndexAnalysis pass), since we have to rebuild it
// anyway whenever there is type metadata (here or in		// anyway whenever there is type metadata (here or in
// splitAndWriteThinLTOBitcode). Just always build it once via the		// splitAndWriteThinLTOBitcode). Just always build it once via the
// buildModuleSummaryIndex when Module(s) are ready.		// buildModuleSummaryIndex when Module(s) are ready.
ProfileSummaryInfo PSI(M);		ProfileSummaryInfo PSI(M);
NewIndex = std::make_unique<ModuleSummaryIndex>(		NewIndex = std::make_unique<ModuleSummaryIndex>(
buildModuleSummaryIndex(M, nullptr, &PSI));		buildModuleSummaryIndex(M, nullptr, nullptr, &PSI));
Index = NewIndex.get();		Index = NewIndex.get();
}		}
}		}

// Write it out as an unsplit ThinLTO module.		// Write it out as an unsplit ThinLTO module.

// Save the module hash produced for the full bitcode, which will		// Save the module hash produced for the full bitcode, which will
// be used in the backends, and use that in the minimized bitcode		// be used in the backends, and use that in the minimized bitcode
▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp

Show First 20 Lines • Show All 1,115 Lines • ▼ Show 20 Lines	if (Callee.getSummaryList().empty())
return false;		return false;

// Insert calls into the summary index so that the devirtualized targets		// Insert calls into the summary index so that the devirtualized targets
// are eligible for import.		// are eligible for import.
// FIXME: Annotate type tests with hotness. For now, mark these as hot		// FIXME: Annotate type tests with hotness. For now, mark these as hot
// to better ensure we have the opportunity to inline them.		// to better ensure we have the opportunity to inline them.
bool IsExported = false;		bool IsExported = false;
auto &S = Callee.getSummaryList()[0];		auto &S = Callee.getSummaryList()[0];
CalleeInfo CI(CalleeInfo::HotnessType::Hot, /* RelBF = */ 0);		CalleeInfo CI(CalleeInfo::HotnessType::Hot, /* RelBF = */ 0, ArgUsage());
auto AddCalls = [&](CallSiteInfo &CSInfo) {		auto AddCalls = [&](CallSiteInfo &CSInfo) {
for (auto *FS : CSInfo.SummaryTypeCheckedLoadUsers) {		for (auto *FS : CSInfo.SummaryTypeCheckedLoadUsers) {
FS->addCall({Callee, CI});		FS->addCall({Callee, CI});
IsExported \|= S->modulePath() != FS->modulePath();		IsExported \|= S->modulePath() != FS->modulePath();
}		}
for (auto *FS : CSInfo.SummaryTypeTestAssumeUsers) {		for (auto *FS : CSInfo.SummaryTypeTestAssumeUsers) {
FS->addCall({Callee, CI});		FS->addCall({Callee, CI});
IsExported \|= S->modulePath() != FS->modulePath();		IsExported \|= S->modulePath() != FS->modulePath();
▲ Show 20 Lines • Show All 1,147 Lines • Show Last 20 Lines

llvm/test/Bitcode/summary_version.ll

	; Check summary versioning			; Check summary versioning
	; RUN: opt -module-summary %s -o - \| llvm-bcanalyzer -dump \| FileCheck %s			; RUN: opt -module-summary %s -o - \| llvm-bcanalyzer -dump \| FileCheck %s

	; CHECK: <GLOBALVAL_SUMMARY_BLOCK			; CHECK: <GLOBALVAL_SUMMARY_BLOCK
	; CHECK: <VERSION op0=9/>			; CHECK: <VERSION op0=10/>



	; Need a function for the summary to be populated.			; Need a function for the summary to be populated.
	define void @foo() {			define void @foo() {
	ret void			ret void
	}			}

llvm/test/Bitcode/thinlto-alias.ll

	Show All 15 Lines
	; CHECK-NEXT: <FUNCTION op0=0 op1=4			; CHECK-NEXT: <FUNCTION op0=0 op1=4
	; "analias"			; "analias"
	; CHECK-NEXT: <FUNCTION op0=4 op1=7			; CHECK-NEXT: <FUNCTION op0=4 op1=7
	; CHECK: <GLOBALVAL_SUMMARY_BLOCK			; CHECK: <GLOBALVAL_SUMMARY_BLOCK
	; CHECK-NEXT: <VERSION			; CHECK-NEXT: <VERSION
	; CHECK-NEXT: <FLAGS			; CHECK-NEXT: <FLAGS
	; See if the call to func is registered.			; See if the call to func is registered.
	; The value id 1 matches the second FUNCTION record above.			; The value id 1 matches the second FUNCTION record above.
	; CHECK-NEXT: <PERMODULE {{.*}} op7=1/>			; CHECK-NEXT: <PERMODULE {{.*}} op9=1 op10=0/>
	; CHECK-NEXT: <BLOCK_COUNT op0=1/>			; CHECK-NEXT: <BLOCK_COUNT op0=1/>
	; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>

	; CHECK: <STRTAB_BLOCK			; CHECK: <STRTAB_BLOCK
	; CHECK-NEXT: blob data = 'mainanalias{{.*}}'			; CHECK-NEXT: blob data = 'mainanalias{{.*}}'

	; COMBINED: <GLOBALVAL_SUMMARY_BLOCK			; COMBINED: <GLOBALVAL_SUMMARY_BLOCK
	; COMBINED-NEXT: <VERSION			; COMBINED-NEXT: <VERSION
	; COMBINED-NEXT: <FLAGS			; COMBINED-NEXT: <FLAGS
	; See if the call to analias is registered, using the expected value id.			; See if the call to analias is registered, using the expected value id.
	; COMBINED-NEXT: <VALUE_GUID op0=[[ALIASID:[0-9]+]] op1=-5751648690987223394/>			; COMBINED-NEXT: <VALUE_GUID op0=[[ALIASID:[0-9]+]] op1=-5751648690987223394/>
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <VALUE_GUID op0=[[ALIASEEID:[0-9]+]] op1=-1039159065113703048/>			; COMBINED-NEXT: <VALUE_GUID op0=[[ALIASEEID:[0-9]+]] op1=-1039159065113703048/>
	; COMBINED-NEXT: <COMBINED {{.*}} op9=[[ALIASID]]/>			; COMBINED-NEXT: <COMBINED {{.*}} op11=[[ALIASID]] op12=0/>
	; COMBINED-NEXT: <COMBINED {{.*}}			; COMBINED-NEXT: <COMBINED {{.*}}
	; COMBINED-NEXT: <COMBINED_ALIAS {{.*}} op3=[[ALIASEEID]]			; COMBINED-NEXT: <COMBINED_ALIAS {{.*}} op3=[[ALIASEEID]]
	; COMBINED-NEXT: <BLOCK_COUNT op0=2/>			; COMBINED-NEXT: <BLOCK_COUNT op0=2/>
	; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK			; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK

	; ModuleID = 'thinlto-function-summary-callgraph.ll'			; ModuleID = 'thinlto-function-summary-callgraph.ll'
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"
	Show All 21 Lines

llvm/test/Bitcode/thinlto-alias2.ll

	; Test to check the callgraph for call to alias in module.			; Test to check the callgraph for call to alias in module.
	; RUN: opt -module-summary %s -o %t.o			; RUN: opt -module-summary %s -o %t.o
	; RUN: llvm-bcanalyzer -dump %t.o \| FileCheck %s			; RUN: llvm-bcanalyzer -dump %t.o \| FileCheck %s

	; CHECK: <GLOBALVAL_SUMMARY_BLOCK			; CHECK: <GLOBALVAL_SUMMARY_BLOCK
	; CHECK-NEXT: <VERSION			; CHECK-NEXT: <VERSION
	; CHECK-NEXT: <FLAGS			; CHECK-NEXT: <FLAGS
	; CHECK-NEXT: <PERMODULE {{.*}} op4=0 op5=0 op6=0 op7=[[ALIASID:[0-9]+]]/>			; CHECK-NEXT: <PERMODULE {{.*}} op9=[[ALIASID:[0-9]+]] op10=0/>
	; CHECK-NEXT: <PERMODULE {{.*}} op0=[[ALIASEEID:[0-9]+]]			; CHECK-NEXT: <PERMODULE {{.*}} op0=[[ALIASEEID:[0-9]+]]
	; CHECK-NEXT: <ALIAS {{.}} op0=[[ALIASID]] {{.}} op2=[[ALIASEEID]]/>			; CHECK-NEXT: <ALIAS {{.}} op0=[[ALIASID]] {{.}} op2=[[ALIASEEID]]/>
	; CHECK-NEXT: <BLOCK_COUNT op0=2/>			; CHECK-NEXT: <BLOCK_COUNT op0=2/>
	; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>

	; ModuleID = 'thinlto-alias2.ll'			; ModuleID = 'thinlto-alias2.ll'
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"
	Show All 14 Lines

llvm/test/Bitcode/thinlto-function-summary-callgraph-cast.ll

	; Test to check the callgraph for calls to casts.			; Test to check the callgraph for calls to casts.
	; RUN: opt -module-summary %s -o %t.o			; RUN: opt -module-summary %s -o %t.o
	; RUN: llvm-bcanalyzer -dump %t.o \| FileCheck %s			; RUN: llvm-bcanalyzer -dump %t.o \| FileCheck %s
	; PR34966			; PR34966

	; CHECK: <GLOBALVAL_SUMMARY_BLOCK			; CHECK: <GLOBALVAL_SUMMARY_BLOCK
	; CHECK-NEXT: <VERSION			; CHECK-NEXT: <VERSION
	; CHECK-NEXT: <FLAGS			; CHECK-NEXT: <FLAGS
	; "op7" is a call to "callee" function.			; "op7" is a call to "callee" function.
	; CHECK-NEXT: <PERMODULE {{.*}} op9=3 op10=[[ALIASID:[0-9]+]]/>			; CHECK-NEXT: <PERMODULE {{.*}} op9=3 op10=[[ALIASID:[0-9]+]]
	; "another_caller" has only references but no calls.			; "another_caller" has only references but no calls.
	; CHECK-NEXT: <PERMODULE {{.}} op4=3 {{.}} op9={{[0-9]+}}/>			; CHECK-NEXT: <PERMODULE {{.}} op4=3 {{.}} op9={{[0-9]+}}
	; CHECK-NEXT: <PERMODULE {{.*}} op0=[[ALIASEEID:[0-9]+]]			; CHECK-NEXT: <PERMODULE {{.*}} op0=[[ALIASEEID:[0-9]+]]
	; CHECK-NEXT: <ALIAS {{.}} op0=[[ALIASID]] {{.}} op2=[[ALIASEEID]]/>			; CHECK-NEXT: <ALIAS {{.}} op0=[[ALIASID]] {{.}} op2=[[ALIASEEID]]/>
	; CHECK-NEXT: <BLOCK_COUNT op0=3/>			; CHECK-NEXT: <BLOCK_COUNT op0=3/>
	; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>

	; ModuleID = 'thinlto-function-summary-callgraph-cast.ll'			; ModuleID = 'thinlto-function-summary-callgraph-cast.ll'
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"
	Show All 25 Lines

llvm/test/Bitcode/thinlto-function-summary-callgraph-pgo.ll

	Show All 12 Lines
	; CHECK: <SOURCE_FILENAME			; CHECK: <SOURCE_FILENAME
	; CHECK-NEXT: <FUNCTION			; CHECK-NEXT: <FUNCTION
	; "func"			; "func"
	; CHECK-NEXT: <FUNCTION op0=4 op1=4			; CHECK-NEXT: <FUNCTION op0=4 op1=4
	; CHECK: <GLOBALVAL_SUMMARY_BLOCK			; CHECK: <GLOBALVAL_SUMMARY_BLOCK
	; CHECK-NEXT: <VERSION			; CHECK-NEXT: <VERSION
	; CHECK-NEXT: <FLAGS			; CHECK-NEXT: <FLAGS
	; See if the call to func is registered, using the expected hotness type.			; See if the call to func is registered, using the expected hotness type.
	; CHECK-NEXT: <PERMODULE_PROFILE {{.*}} op7=1 op8=2/>			; CHECK-NEXT: <PERMODULE_PROFILE {{.*}} op9=1 op10=0 op11=2/>
	; CHECK-NEXT: <BLOCK_COUNT op0=1/>			; CHECK-NEXT: <BLOCK_COUNT op0=1/>
	; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>
	; CHECK: <STRTAB_BLOCK			; CHECK: <STRTAB_BLOCK
	; CHECK-NEXT: blob data = 'mainfunc{{.*}}'			; CHECK-NEXT: blob data = 'mainfunc{{.*}}'

	; COMBINED: <GLOBALVAL_SUMMARY_BLOCK			; COMBINED: <GLOBALVAL_SUMMARY_BLOCK
	; COMBINED-NEXT: <VERSION			; COMBINED-NEXT: <VERSION
	; COMBINED-NEXT: <FLAGS			; COMBINED-NEXT: <FLAGS
	; COMBINED-NEXT: <VALUE_GUID op0=[[FUNCID:[0-9]+]] op1=7289175272376759421/>			; COMBINED-NEXT: <VALUE_GUID op0=[[FUNCID:[0-9]+]] op1=7289175272376759421/>
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <COMBINED			; COMBINED-NEXT: <COMBINED
	; See if the call to func is registered, using the expected hotness type.			; See if the call to func is registered, using the expected hotness type.
	; op6=2 which is hotnessType::None.			; op6=2 which is hotnessType::None.
	; COMBINED-NEXT: <COMBINED_PROFILE {{.*}} op9=[[FUNCID]] op10=2/>			; COMBINED-NEXT: <COMBINED_PROFILE {{.*}} op11=[[FUNCID]] op12=0 op13=2/>
	; COMBINED-NEXT: <BLOCK_COUNT op0=2/>			; COMBINED-NEXT: <BLOCK_COUNT op0=2/>
	; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK>

	; ModuleID = 'thinlto-function-summary-callgraph.ll'			; ModuleID = 'thinlto-function-summary-callgraph.ll'
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; Function Attrs: nounwind uwtable			; Function Attrs: nounwind uwtable
	Show All 12 Lines

llvm/test/Bitcode/thinlto-function-summary-callgraph-profile-summary.ll

	Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: <FUNCTION op0=37 op1=5			; CHECK-NEXT: <FUNCTION op0=37 op1=5
	; "none3"			; "none3"
	; CHECK-NEXT: <FUNCTION op0=42 op1=5			; CHECK-NEXT: <FUNCTION op0=42 op1=5
	; CHECK-LABEL: <GLOBALVAL_SUMMARY_BLOCK			; CHECK-LABEL: <GLOBALVAL_SUMMARY_BLOCK
	; CHECK-NEXT: <VERSION			; CHECK-NEXT: <VERSION
	; CHECK-NEXT: <FLAGS			; CHECK-NEXT: <FLAGS
	; CHECK-NEXT: <VALUE_GUID op0=25 op1=123/>			; CHECK-NEXT: <VALUE_GUID op0=25 op1=123/>
	; op4=hot1 op6=cold op8=hot2 op10=hot4 op12=none1 op14=hot3 op16=none2 op18=none3 op20=123			; op4=hot1 op6=cold op8=hot2 op10=hot4 op12=none1 op14=hot3 op16=none2 op18=none3 op20=123
	; CHECK-NEXT: <PERMODULE_PROFILE {{.*}} op7=1 op8=3 op9=5 op10=1 op11=2 op12=3 op13=4 op14=1 op15=6 op16=2 op17=3 op18=3 op19=7 op20=2 op21=8 op22=2 op23=25 op24=4/>			; CHECK-NEXT: <PERMODULE_PROFILE {{.*}} op13=1 op14=0 op15=3 op16=5 op17=0 op18=1 op19=2 op20=0 op21=3 op22=4 op23=0 op24=1 op25=6 op26=0 op27=2 op28=3 op29=0 op30=3 op31=7 op32=0 op33=2 op34=8 op35=0 op36=2 op37=25 op38=0 op39=4/>
	; CHECK-NEXT: <BLOCK_COUNT op0=6/>			; CHECK-NEXT: <BLOCK_COUNT op0=6/>
	; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>

	; CHECK: <STRTAB_BLOCK			; CHECK: <STRTAB_BLOCK
	; CHECK-NEXT: blob data = 'hot_functionhot1hot2hot3hot4coldnone1none2none3{{.*}}'			; CHECK-NEXT: blob data = 'hot_functionhot1hot2hot3hot4coldnone1none2none3{{.*}}'

	; COMBINED: <GLOBALVAL_SUMMARY_BLOCK			; COMBINED: <GLOBALVAL_SUMMARY_BLOCK
	; COMBINED-NEXT: <VERSION			; COMBINED-NEXT: <VERSION
	; COMBINED-NEXT: <FLAGS			; COMBINED-NEXT: <FLAGS
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED_PROFILE {{.}} op9=[[HOT1:.]] op10=3 op11=[[COLD:.]] op12=1 op13=[[HOT2:.]] op14=3 op15=[[NONE1:.]] op16=2 op17=[[HOT3:.]] op18=3 op19=[[NONE2:.]] op20=2 op21=[[NONE3:.]] op22=2/>			; COMBINED-NEXT: <COMBINED_PROFILE {{.}} op16=[[HOT1:.]] op17=3 op18=[[COLD:.]] op20=1 op21=[[HOT2:.]] op23=3 op24=[[NONE1:.]] op26=2 op27=[[HOT3:.]] op29=3 op30=[[NONE2:.]] op32=2 op33=[[NONE3:.]] op35=2/>
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <BLOCK_COUNT op0=13/>			; COMBINED-NEXT: <BLOCK_COUNT op0=13/>
	; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK>


	; ModuleID = 'thinlto-function-summary-callgraph.ll'			; ModuleID = 'thinlto-function-summary-callgraph.ll'
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"
	▲ Show 20 Lines • Show All 83 Lines • Show Last 20 Lines

llvm/test/Bitcode/thinlto-function-summary-callgraph-relbf.ll

	; Test to check the callgraph in summary			; Test to check the callgraph in summary
	; RUN: opt -write-relbf-to-summary -module-summary %s -o %t.o			; RUN: opt -write-relbf-to-summary -module-summary %s -o %t.o
	; RUN: llvm-bcanalyzer -dump %t.o \| FileCheck %s			; RUN: llvm-bcanalyzer -dump %t.o \| FileCheck %s
	; RUN: llvm-dis -o - %t.o \| FileCheck %s --check-prefix=DIS			; RUN: llvm-dis -o - %t.o \| FileCheck %s --check-prefix=DIS
	; Round trip it through llvm-as			; Round trip it through llvm-as
	; RUN: llvm-dis -o - %t.o \| llvm-as -write-relbf-to-summary -o - \| llvm-dis -o - \| FileCheck %s --check-prefix=DIS			; RUN: llvm-dis -o - %t.o \| llvm-as -write-relbf-to-summary -o - \| llvm-dis -o - \| FileCheck %s --check-prefix=DIS

	; CHECK: <SOURCE_FILENAME			; CHECK: <SOURCE_FILENAME
	; CHECK-NEXT: <GLOBALVAR			; CHECK-NEXT: <GLOBALVAR
	; CHECK-NEXT: <FUNCTION			; CHECK-NEXT: <FUNCTION
	; "func"			; "func"
	; CHECK-NEXT: <FUNCTION op0=17 op1=4			; CHECK-NEXT: <FUNCTION op0=17 op1=4
	; CHECK: <GLOBALVAL_SUMMARY_BLOCK			; CHECK: <GLOBALVAL_SUMMARY_BLOCK
	; CHECK-NEXT: <VERSION			; CHECK-NEXT: <VERSION
	; CHECK-NEXT: <FLAGS			; CHECK-NEXT: <FLAGS
	; See if the call to func is registered.			; See if the call to func is registered.
	; CHECK-NEXT: <PERMODULE_RELBF {{.}} op4=1 {{.}} op9=256			; CHECK-NEXT: <PERMODULE_RELBF {{.}} op4=1 {{.}} op12=256
	; CHECK-NEXT: <BLOCK_COUNT op0=1/>			; CHECK-NEXT: <BLOCK_COUNT op0=1/>
	; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>
	; CHECK: <STRTAB_BLOCK			; CHECK: <STRTAB_BLOCK
	; CHECK-NEXT: blob data = 'undefinedglobmainfunc{{.*}}'			; CHECK-NEXT: blob data = 'undefinedglobmainfunc{{.*}}'


	; ModuleID = 'thinlto-function-summary-callgraph.ll'			; ModuleID = 'thinlto-function-summary-callgraph.ll'
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	Show All 19 Lines

llvm/test/Bitcode/thinlto-function-summary-callgraph-sample-profile-summary.ll

	Show All 26 Lines
	; CHECK-NEXT: <FUNCTION op0=44 op1=5			; CHECK-NEXT: <FUNCTION op0=44 op1=5
	; CHECK-NEXT: <FUNCTION op0=49 op1=5			; CHECK-NEXT: <FUNCTION op0=49 op1=5

	; CHECK-LABEL: <GLOBALVAL_SUMMARY_BLOCK			; CHECK-LABEL: <GLOBALVAL_SUMMARY_BLOCK
	; CHECK-NEXT: <VERSION			; CHECK-NEXT: <VERSION
	; CHECK-NEXT: <FLAGS			; CHECK-NEXT: <FLAGS
	; CHECK-NEXT: <VALUE_GUID op0=26 op1=123/>			; CHECK-NEXT: <VALUE_GUID op0=26 op1=123/>
	; op4=none1 op6=hot1 op8=cold1 op10=none2 op12=hot2 op14=cold2 op16=none3 op18=hot3 op20=cold3 op22=123			; op4=none1 op6=hot1 op8=cold1 op10=none2 op12=hot2 op14=cold2 op16=none3 op18=hot3 op20=cold3 op22=123
	; CHECK-NEXT: <PERMODULE_PROFILE {{.*}} op7=7 op8=0 op9=1 op10=3 op11=4 op12=1 op13=8 op14=0 op15=2 op16=3 op17=5 op18=1 op19=9 op20=0 op21=3 op22=3 op23=6 op24=1 op25=26 op26=4/>			; CHECK-NEXT: <PERMODULE_PROFILE {{.*}} op13=7 op14=0 op15=0 op16=1 op17=0 op18=3 op19=4 op20=0 op21=1 op22=8 op23=0 op24=0 op25=2 op26=0 op27=3 op28=5 op29=0 op30=1 op31=9 op32=0 op33=0 op34=3 op35=0 op36=3 op37=6 op38=0 op39=1 op40=26 op41=0 op42=4/>
	; CHECK-NEXT: <BLOCK_COUNT op0=4/>			; CHECK-NEXT: <BLOCK_COUNT op0=4/>
	; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; CHECK-NEXT: </GLOBALVAL_SUMMARY_BLOCK>

	; CHECK: <STRTAB_BLOCK			; CHECK: <STRTAB_BLOCK
	; CHECK-NEXT: blob data = 'hot_functionhot1hot2hot3cold1cold2cold3none1none2none3{{.*}}'			; CHECK-NEXT: blob data = 'hot_functionhot1hot2hot3cold1cold2cold3none1none2none3{{.*}}'

	; COMBINED: <GLOBALVAL_SUMMARY_BLOCK			; COMBINED: <GLOBALVAL_SUMMARY_BLOCK
	; COMBINED-NEXT: <VERSION			; COMBINED-NEXT: <VERSION
	Show All 11 Lines
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <COMBINED_PROFILE {{.}} op9=[[NONE1:.]] op10=0 op11=[[HOT1:.]] op12=3 op13=[[COLD1:.]] op14=1 op15=[[NONE2:.]] op16=0 op17=[[HOT2:.]] op18=3 op19=[[COLD2:.]] op20=1 op21=[[NONE3:.]] op22=0 op23=[[HOT3:.]] op24=3 op25=[[COLD3:.]] op26=1/>			; COMBINED-NEXT: <COMBINED_PROFILE {{.}} op20=3 op21=[[COLD1:.]] op23=1 op24=[[NONE2:.]] op26=0 op27=[[HOT2:.]] op29=3 op30=[[COLD2:.]] op32=1 op33=[[NONE3:.]] op34=0 op35=[[HOT3:.]] op38=3 op39=[[COLD3:.]] op41=1/>
	; COMBINED-NEXT: <COMBINED abbrevid=			; COMBINED-NEXT: <COMBINED abbrevid=
	; COMBINED-NEXT: <BLOCK_COUNT op0=13/>			; COMBINED-NEXT: <BLOCK_COUNT op0=13/>
	; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK>


	; ModuleID = 'thinlto-function-summary-callgraph.ll'			; ModuleID = 'thinlto-function-summary-callgraph.ll'
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"
	▲ Show 20 Lines • Show All 53 Lines • Show Last 20 Lines

llvm/test/Bitcode/thinlto-function-summary-callgraph.ll

	Show All 29 Lines
	; COMBINED-NEXT: <VERSION			; COMBINED-NEXT: <VERSION
	; COMBINED-NEXT: <FLAGS			; COMBINED-NEXT: <FLAGS
	; Only 2 VALUE_GUID since reference to undefinedglob should not be included in			; Only 2 VALUE_GUID since reference to undefinedglob should not be included in
	; combined index.			; combined index.
	; COMBINED-NEXT: <VALUE_GUID op0=[[FUNCID:[0-9]+]] op1=7289175272376759421/>			; COMBINED-NEXT: <VALUE_GUID op0=[[FUNCID:[0-9]+]] op1=7289175272376759421/>
	; COMBINED-NEXT: <VALUE_GUID			; COMBINED-NEXT: <VALUE_GUID
	; COMBINED-NEXT: <COMBINED			; COMBINED-NEXT: <COMBINED
	; See if the call to func is registered.			; See if the call to func is registered.
	; COMBINED-NEXT: <COMBINED {{.*}} op9=[[FUNCID]]/>			; COMBINED-NEXT: <COMBINED {{.*}} op11=[[FUNCID]] op12=0/>
	; COMBINED-NEXT: <BLOCK_COUNT op0=2/>			; COMBINED-NEXT: <BLOCK_COUNT op0=2/>
	; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK>			; COMBINED-NEXT: </GLOBALVAL_SUMMARY_BLOCK>

	; ModuleID = 'thinlto-function-summary-callgraph.ll'			; ModuleID = 'thinlto-function-summary-callgraph.ll'
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; Function Attrs: nounwind uwtable			; Function Attrs: nounwind uwtable
	Show All 12 Lines

llvm/test/Bitcode/thinlto-function-summary-refgraph.ll

	Show All 35 Lines
	; for calls). Use different linkage types for the various test cases to			; for calls). Use different linkage types for the various test cases to
	; distinguish the test cases here (op1 contains the linkage type).			; distinguish the test cases here (op1 contains the linkage type).
	; Note that op3 contains the # non-call references.			; Note that op3 contains the # non-call references.
	; This also ensures that we didn't include a call or reference to intrinsic			; This also ensures that we didn't include a call or reference to intrinsic
	; llvm.ctpop.i8.			; llvm.ctpop.i8.
	; CHECK: <GLOBALVAL_SUMMARY_BLOCK			; CHECK: <GLOBALVAL_SUMMARY_BLOCK
	; Function main contains call to func, as well as address reference to func:			; Function main contains call to func, as well as address reference to func:
	; op0=main op4=func op5=func			; op0=main op4=func op5=func
	; CHECK-DAG: <PERMODULE {{.}} op0=11 op1=0 {{.}} op4=1 op5=0 op6=0 op7=2 op8=2/>			; CHECK-DAG: <PERMODULE {{.}} op0=11 op1=0 {{.}} op4=1 op5=0 op6=0 {{.*}} op9=2 op10=2 op11=0/>
	; Function W contains a call to func3 as well as a reference to globalvar:			; Function W contains a call to func3 as well as a reference to globalvar:
	; op0=W op4=globalvar op5=func3			; op0=W op4=globalvar op5=func3
	; CHECK-DAG: <PERMODULE {{.}} op0=6 op1=5 {{.}} op4=1 op5=0 op6=0 op7=1 op8=5/>			; CHECK-DAG: <PERMODULE {{.}} op0=6 op1=5 {{.}} op4=1 op5=0 {{.*}} op8=0 op9=1 op10=5
	; Function X contains call to foo, as well as address reference to foo			; Function X contains call to foo, as well as address reference to foo
	; which is in the same instruction as the call:			; which is in the same instruction as the call:
	; op0=X op4=foo op5=foo			; op0=X op4=foo op5=foo
	; CHECK-DAG: <PERMODULE {{.}} op0=7 op1=1 {{.}} op4=1 op5=0 op6=0 op7=4 op8=4/>			; CHECK-DAG: <PERMODULE {{.}} op0=7 op1=1 {{.}} op9=4 op10=4
	; Function Y contains call to func2, and ensures we don't incorrectly add			; Function Y contains call to func2, and ensures we don't incorrectly add
	; a reference to it when reached while earlier analyzing the phi using its			; a reference to it when reached while earlier analyzing the phi using its
	; return value:			; return value:
	; op0=Y op4=func2			; op0=Y op4=func2
	; CHECK-DAG: <PERMODULE {{.}} op0=8 op1=72 {{.}} op4=0 op5=0 op6=0 op7=3/>			; CHECK-DAG: <PERMODULE {{.}} op0=8 op1=72 {{.}} op9=0 op10=8 op11=3 op12=0/>
	; Function Z contains call to func2, and ensures we don't incorrectly add			; Function Z contains call to func2, and ensures we don't incorrectly add
	; a reference to it when reached while analyzing subsequent use of its return			; a reference to it when reached while analyzing subsequent use of its return
	; value:			; value:
	; op0=Z op4=func2			; op0=Z op4=func2
	; CHECK-DAG: <PERMODULE {{.}} op0=9 op1=3 {{.}} op4=0 op5=0 op6=0 op7=3/>			; CHECK-DAG: <PERMODULE {{.}} op0=9 op1=3 {{.}} op6=0 op7=10 op8=0 op9=3 op10=0/>
	; Variable bar initialization contains address reference to func:			; Variable bar initialization contains address reference to func:
	; op0=bar op2=func			; op0=bar op2=func
	; CHECK-DAG: <PERMODULE_GLOBALVAR_INIT_REFS {{.*}} op0=0 op1=0 op2=3 op3=2/>			; CHECK-DAG: <PERMODULE_GLOBALVAR_INIT_REFS {{.*}} op0=0 op1=0 op2=3 op3=2/>
	; CHECK: </GLOBALVAL_SUMMARY_BLOCK>			; CHECK: </GLOBALVAL_SUMMARY_BLOCK>

	; CHECK: <STRTAB_BLOCK			; CHECK: <STRTAB_BLOCK
	; CHECK-NEXT: blob data = 'barglobalvarfuncfunc2foofunc3WXYZllvm.ctpop.i8main{{.*}}'			; CHECK-NEXT: blob data = 'barglobalvarfuncfunc2foofunc3WXYZllvm.ctpop.i8main{{.*}}'

	▲ Show 20 Lines • Show All 96 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[WIP] [ThinLTO] Importing function by function specializationAbandonedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 358525

llvm/include/llvm/Analysis/FuncSpecCost.h

llvm/include/llvm/Analysis/ModuleSummaryAnalysis.h

llvm/include/llvm/IR/ModuleSummaryIndex.h

llvm/include/llvm/IR/ModuleSummaryIndexYAML.h

llvm/lib/Analysis/FuncSpecCost.cpp

llvm/lib/Analysis/ModuleSummaryAnalysis.cpp

llvm/lib/AsmParser/LLParser.cpp

llvm/lib/Bitcode/Reader/BitcodeReader.cpp

llvm/lib/Bitcode/Writer/BitcodeWriter.cpp

llvm/lib/IR/AsmWriter.cpp

llvm/lib/LTO/ThinLTOCodeGenerator.cpp

llvm/lib/Transforms/IPO/FunctionImport.cpp

llvm/lib/Transforms/IPO/ThinLTOBitcodeWriter.cpp

llvm/lib/Transforms/IPO/WholeProgramDevirt.cpp

llvm/test/Bitcode/summary_version.ll

llvm/test/Bitcode/thinlto-alias.ll

llvm/test/Bitcode/thinlto-alias2.ll

llvm/test/Bitcode/thinlto-function-summary-callgraph-cast.ll

llvm/test/Bitcode/thinlto-function-summary-callgraph-pgo.ll

llvm/test/Bitcode/thinlto-function-summary-callgraph-profile-summary.ll

llvm/test/Bitcode/thinlto-function-summary-callgraph-relbf.ll

llvm/test/Bitcode/thinlto-function-summary-callgraph-sample-profile-summary.ll

llvm/test/Bitcode/thinlto-function-summary-callgraph.ll

llvm/test/Bitcode/thinlto-function-summary-refgraph.ll

[WIP] [ThinLTO] Importing function by function specialization
AbandonedPublic