This is an archive of the discontinued LLVM Phabricator instance.

llvm/include/llvm/ProfileData/SampleProfReader.h
743	Same as SampleProfileReaderText, can we use std::list instead of std::vector so the storage underlying will not be moved after more elements are inserted?
llvm/lib/ProfileData/SampleProfReader.cpp
238	Looks like it can be a utility function in SampleProfile.h and may be reused somewhere else.
269–276	Can we make it a utility function?
452	This param is unused. Write it as readNameFromTable(bool /* IsContextName */) to make it more explicit.

hoy added inline comments.Aug 20 2021, 4:17 PM

llvm/include/llvm/ProfileData/SampleProfReader.h
743	Std::vector is good for looking up with an offset, which is needed during the load of function profiles. Also, std::vector is more efficient for the extbinary reader, since number of contexts is known beforehand. Once the CS name table section is read, the std::vector should be frozen and no more names should be appended.
llvm/lib/ProfileData/SampleProfReader.cpp
238	It's actually moved out of SampleProfile.h since it's only used by the text reader. Also the text reader should be the only part of the toolchain that needs to parse full context strings. Other parts just need to deal with ArrayRef form of context. What do you think?
452	Sounds good.

hoy added inline comments.Aug 20 2021, 4:21 PM

llvm/lib/ProfileData/SampleProfReader.cpp
452	Actually I can only do that in the header file for the function declaration. The function definition needs a real name for param.

Addressing Wei's comment.

Harbormaster completed remote builds in B120647: Diff 367922.Aug 20 2021, 4:37 PM

wmi added inline comments.Aug 23 2021, 9:50 AM

llvm/include/llvm/ProfileData/SampleProfReader.h
743	Yeah, right, std::vector is good for looking up. Now once cs name table is read, there is no other code appending names to the table. Just want to know whether we can have some additional mechanism to enforce that. Can we save the size of the table after reading cs name table and check that the table size is the same before visiting any element from the table?
llvm/lib/ProfileData/SampleProfReader.cpp
238	Ok, if it is less likely to be used by others, it is fine to keep as it is.

hoy added inline comments.Aug 23 2021, 10:27 AM

llvm/include/llvm/ProfileData/SampleProfReader.h
743	Good point. Setting the underlying object to be constant so that it is immutable once populated.

Addressing Wei's feedback.

Harbormaster completed remote builds in B120817: Diff 368139.Aug 23 2021, 10:49 AM

LGTM.

This revision is now accepted and ready to land.Aug 23 2021, 8:36 PM

Updating D108435: [CSSPGO] split context string II - reader/writer changes

Herald added a subscriber: ormris. · View Herald TranscriptAug 24 2021, 11:55 PM

Harbormaster completed remote builds in B121118: Diff 368562.Aug 24 2021, 11:56 PM

Updating D108435: [CSSPGO] split context string II - reader/writer changes

Harbormaster completed remote builds in B121119: Diff 368563.Aug 24 2021, 11:58 PM

wenlei added inline comments.Aug 26 2021, 5:34 PM

llvm/include/llvm/ProfileData/SampleProfReader.h
135	In this section, we need to add documentation/spec for CSNameTable how context is stored, and how it interacts with NameTable.
507	nit: remove the blank line
578	typo: given
llvm/lib/ProfileData/SampleProfReader.cpp
282–283	Even though we only need to deal with string context in profile reader/writer for text profile, it's probably still cleaner to keep all string context related parsing into SampleContext. `createContext` is more like a ctor. I'd prefer keep string decoding, createContext in its original place in SampleContext. That way, we can construct a context from string, SampleContext::setContext remain a private helper too, and the logic here can be simpler, just like before. SampleContext does still have getContextString and toString, so it's not really isolated from string representation, might as well keep all string stuff together there for consistency.
447	I think `getNameTable` exist as a virtual function only because of the use in sample loader for `profile-sample-accuarate` and `profile-accurate-for-symsinlist`. Here can we still access the NameTable directly without going through virtual call? Same for other places where `NameTable` is directly accessible. For writer, before this change, we indeed only have `NameTable` for `SampleProfileWriterBinary` without virtual getter, and we access it without going through virtual function call there.
452	Can we avoid this parameter altogether? For one profile, it's either going to be CS or non-CS, so the dispatch can be based on state/member of the reader instead of relying on a param for each invocation.
582	Same here, avoid `IsContextName` as param, but dispatch based on `SampleProfileReader::ProfileIsCS`
587–588	nit: this can be confusing, readNameFromTable can mislead people to think FContext is string (or vector of strings) type. the auto also isn't helping. Either spell out the type name, or rename `readNameFromTable` to something like `readSampleContextFromTable`
749	Since we have `SampleProfileReader::ProfileIsCS`, within reader it probable makes more sense to use the reader instance flag as opposed to the global FunctionSamples::ProfileIsCS flag. We were not very consistent in past, might be good to clean up as you're expanding the use of ProfileIsCS.
775	OrderedNames -> OrderedContexts
1042	Let emplace_back take variadic arguments and forward to the constructor directly instead of doing a move copy?
1046	How about we emplace an empty slot before going into the inner loop, then we can operate on the vector directly (`PNameVec.back().emplace_back(...)`) and avoid copying temporary `Context` onto `PNameVec`?
1062–1063	nit: peal/hoist the `get()` so we don't have to call it for every use.

wenlei added inline comments.Aug 26 2021, 10:17 PM

llvm/include/llvm/ProfileData/SampleProfWriter.h
138–139	Now that the parameter is no longer a string name, rename the function as well, e.g. writeContextIdx? Same for addName(const SampleContext &Context) -> addContext
llvm/lib/ProfileData/SampleProfReader.cpp
776–794	Which operation here is the most costly and visible for e2e build time? The insert/sort/find or the prefix check? What's the % of time here for both prelink and postlink?
llvm/lib/ProfileData/SampleProfWriter.cpp
249	nit: this contains leaf frame, so it's `Frames` instead of `Callsites`.
482	assert !Context.hasContext() ?
llvm/tools/llvm-profdata/llvm-profdata.cpp
524–526	I suggest rename these FName (key of the SampleProfileMap) to be FContext accordingly given this is no longer a string map. Same for other places.

hoy marked 2 inline comments as done.Aug 27 2021, 10:57 PM

hoy added inline comments.

llvm/include/llvm/ProfileData/SampleProfReader.h
135	CSNameTable is a concept of the extensible binary format, not the basic binary format, like other concepts such as func name offset table, func metadata table. We could comment about CSNameTable in the `SampleProfileReaderExtBinary/SampleProfileWriterExtBinary`, but I didn't find a good centralized place to do that. Seems that descriptions about other sections are scattered around the implementation code.
507	done.
578	fixed.
llvm/include/llvm/ProfileData/SampleProfWriter.h
138–139	Sounds good.
llvm/lib/ProfileData/SampleProfReader.cpp
282–283	Makes sense. Moved back to SampleContext.
447	Yes, we can just use `NameTable` directly here. Actually all changes here are unnecessary. I just undid them.
452	Good point. Checking `FunctionSamples::ProfileIsCS` should be enough.
587–588	Fixed by using explicit type.
749	That's reasonable. Replaced all `FunctionSamples::ProfileIsCS` usage in the reader with the reader `ProfileIsCS` flag.
775	fixed.
776–794	The insert/sort/find operations are the most expansive. For thinlto postlink, they count to 15% to 20% of the whole backend running time. I haven't measured for prelink but given the similarity of prelink and postlink, they should be expansive there too. I'm actually working on a sample writer change that emits the func offset table in the order of contexts so that we don't need the set operations here. That turns out very effective. Will send out a separate diff for it.
1046	Sounds good, that should be faster.
1062–1063	You mean save `FContext.get()` in a temp and use it in the loop? Thought the compiler would do it.
llvm/lib/ProfileData/SampleProfWriter.cpp
249	`Frames` sounds good.
482	Added the assert.
llvm/tools/llvm-profdata/llvm-profdata.cpp
524–526	Sounds good.

Addressing Wenlei's comment.

Harbormaster completed remote builds in B121589: Diff 369238.Aug 27 2021, 10:58 PM

wenlei added inline comments.Aug 29 2021, 2:42 PM

llvm/include/llvm/ProfileData/SampleProfReader.h
135	Ok, didn't realize we don't use header comment for extbin. That's fine. Yeah, it'd be good to have some comment for CSNameTable somewhere to explain the nesting structure of context and raw names.
llvm/lib/ProfileData/SampleProfReader.cpp
282–283	Thanks. Can you move it back in D107299, so we don't see the change back and forth, just for review? Can we also fold `FName.startswith("[")` in to SampleContext? Additionally, why we do need a `createContextFromString` instead of using overload ctors? This is also inconsistent between how SampleContext is created from text profile (createContextFromString) vs from binary profile (ctor). What I was thinking is have SampleContext decide how to create the object, so there's no changed needed here, just like before.
587–588	Ok, but on 2nd thought, why do we call it readName while it's actually returning a context? Especially given that we've renamed addName to AddContext, writeNameIdx to writeContextIdx.
722	FName to FContext as well?
776–794	I'm actually working on a sample writer change that emits the func offset table in the order of contexts so that we don't need the set operations here. That turns out very effective. Great. I was thinking about exactly that too. The order of binary profile is opaque to users, so we can order them in the file to save sorting on reading. For thinlto postlink, they count to 15% to 20% of the whole backend running time. What was the % before this work when we were all using StringRef?
1062–1063	Yeah, it's less about optimization but for readability to make the code less verbose.

hoy added inline comments.Aug 29 2021, 7:07 PM

llvm/include/llvm/ProfileData/SampleProfReader.h
135	Comment added to the definition of `SampleProfileReaderExtBinaryBase::readCSNameTableSec`.
llvm/lib/ProfileData/SampleProfReader.cpp
282–283	Constructing a CS context will require additional parameter than non-CS profile, especially the underlying context vector. That is causing the inconsistency with the context split work and I'm separating the construction of CS and non-CS contexts. Currently CS context is only constructible from a `SampleContextFrameVector`. Another reason is that I was hoping to construct `SampleContext` in a quick manner from `StringRef` to favor non-CS profile.
722	Oops, this was fixed in my other unified patch, somehow got dropped when rebasing. Will send an update to the other patch as you suggested.
776–794	What was the % before this work when we were all using StringRef? It's not noticeable. Actually I didn't see the previous sorting as a hot routine in the profile. With the presorted func offset stable, we are able to achieve similar performance with the previous approach.

wenlei added inline comments.Aug 29 2021, 9:02 PM

llvm/lib/ProfileData/SampleProfReader.cpp
282–283	Can you move it back in D107299, so we don't see the change back and forth, just for review? Sorry my bad, I meant to say D108433. I think the key blocker for everything to be taken care from within ctor is that you need reader to own the context created from the string. How about having a ctor with StringRef and CSNameTable as parameter - it puts the context into CSNameTable (owned by reader) for CS profile, and the CSNameTable would be ignored for non-CS profile. The current implementation works, but reader has to be aware of the actual string representation of context. I thought it'd be cleaner if such representation is all dealt with from within SampleContext. Currently, it's indeed mostly handled by SampleContext, except it's bleeding into reader here.
587–588	I think it'd be good to establish a convention as to when we call things `Name` vs `Context`. My thought is it goes with the type, what do you think?
1014	Comment added to the definition of SampleProfileReaderExtBinaryBase::readCSNameTableSec. Not seeing comments in the latest update. Did I miss anything?

hoy added inline comments.Aug 29 2021, 10:26 PM

llvm/lib/ProfileData/SampleProfReader.cpp
282–283	Yeah, that's why I originally put everything related to context string parsing in the reader. I thought they were related to profile-specific representation that `SampleContext` shouldn't care. I guess moving the parsing code back is better for extension and code sharing, when we have new module doing the same thing in the future. Adding a new constructor with StringRef and CSNameTable as parameter can work but feel like it's easier and more clear for the reader to know what it constructs. Hiding that from the reader is fine, but asking the reader to provide an underlying CS name table which may not be needed for non-CS sounds a bit confusing. That also kinds of exposes reader-specific implementation to `SampleContext`. I think having the reader be aware of the specific representation might be reasonable, since context representation is a part of the profile format. What do you think?
587–588	Yeah, I've been using `Name` as an identifier of of the context and function name string, and using `Context` for CS profile and `String` for function names. But sometimes `Name` and `String` as mixed. We are using `SampleContext` for both CS and non-CS. And we are also using the word context specifically for CS. Sounds like we need a more general name (instead of `Name`) in the reader for both of them. How about `readIdFromTable`?
1062–1063	Changed to using `*FContext` for readability, like other places. Look good? Alternatively, we can have two variables such as `FContextError` and `FContext`.

Updating D108435: [CSSPGO] split context string II - reader/writer changes

Harbormaster completed remote builds in B121695: Diff 369373.Aug 29 2021, 10:33 PM

wenlei added inline comments.Aug 30 2021, 12:20 PM

llvm/lib/ProfileData/SampleProfReader.cpp
282–283	asking the reader to provide an underlying CS name table which may not be needed for non-CS sounds a bit confusing. That also kinds of exposes reader-specific implementation to SampleContext. I don't see this as exposing reader-specific stuff to sample context. It'd just be the expectation of the sample context API that requires a buffer to hold newly created context. This is in essence no different from how string buffer is passed in getRepInFormat (and only used for MD5), and I don't see it as coupling between reader and context. I think having the reader be aware of the specific representation might be reasonable, since context representation is a part of the profile format. I think the actual string presentation of context is not tied to the format, but rather the format is using the string representation directly. Also if string representation is considered part of format, then all of the string related stuff should be part of reader (like your earlier change). The possible reusing of string functions you mentioned also shows that the string representation is not format-specific.
587–588	I've been using Name as an identifier of of the context and function name string, and using Context for CS profile and String for function names. The problem is you can't do that cleanly because there're cases that covers both CS and non-CS. readNameFromTable is one example. I think that name it according to the type makes things clearer. For non-CS profile we're still using SampleContext as key in the profile map anyways. Adding a new notion of `Id` in addition to name and context seem unnecessary (we have Idx already which can confuse people if we add Id). So I think readSampleContextFromTable/readContextFromTable is better.
1062–1063	either works, thanks.

hoy added inline comments.Aug 30 2021, 12:42 PM

llvm/lib/ProfileData/SampleProfReader.cpp
282–283	This is in essence no different from how string buffer is passed in getRepInFormat (and only used for MD5), and I don't see it as coupling between reader and context. `getRepInFormat` takes a string buffer instead of a string table which is not coupled with the reader. However, `CSNameTable` is an implementation detail of the reader. Having that populated and updated by `SampleContext` sounds like a coupling with the reader. E.g, from `SampleContext` point of view, it is questionable why it has to update a std::list but not a std::vector or std::set. We can pass in a SampleContextFrameVector to the new ctor, and similar with the callsites of `getRepInFormat`, the reader should know where to place it, just like the current code. But then the reader has to redo some checks in the ctor. Anyway, feels that the reader will need to check if it is constructing a CS context, if we don't expose `CSNameTable` to `SampleContext`. Since the reader is currently the only consumer, maybe just keep it in the reader for now? When we have a new user we can then decide what kind of new ctor to make?
587–588	Sounds good. Will use `readSampleContextFromTable`.

Renaming readNameFromTable to readSampleContextFromTable.

Harbormaster completed remote builds in B121804: Diff 369530.Aug 30 2021, 1:04 PM

wenlei added inline comments.Aug 30 2021, 1:33 PM

llvm/lib/ProfileData/SampleProfReader.cpp
282–283	Ok, I think we've probably spent too much time on this. But bear with me a bit more, still hope we can reach a consensus. :) That said, whatever we choose to do, I don't agree with your reasoning above. However, CSNameTable is an implementation detail of the reader. Look at it as buffer, and in this case reader happens to own and provide that buffer. Having that populated and updated by SampleContext sounds like a coupling with the reader. E.g, from SampleContext point of view, it is questionable why it has to update a std::list but not a std::vector or std::set. I don't think this is really a concern, nor is it a coupling tbh. Regardless of what reader does, what makes sense for a buffer type? I think std::list simply makes sense as it avoid reallocating. Yes, reader uses std::list, but this is just a choice that makes sense in general for a buffer. If we go down this level, sure every single API call is a coupling because every param has a type. The way I look at this - sample context abstracts away the string representation (again it's independent of format, and text format just uses that string representation directly), and reader resort to sample context for anything related to that. It's probably not so important to have that logical layering enforced, but if the cost of doing that is small, why not? Clear layering makes it easy to maintain and reason about.

hoy added inline comments.Aug 30 2021, 1:53 PM

llvm/lib/ProfileData/SampleProfReader.cpp
282–283	I don't think this is really a concern, nor is it a coupling tbh. Regardless of what reader does, what makes sense for a buffer type? I think std::list simply makes sense as it avoid reallocating. Yes, reader uses std::list, but this is just a choice that makes sense in general for a buffer. If we go down this level, sure every single API call is a coupling because every param has a type. Yeah, we probably spent too much time on this. I will make a new ctor with `CSNameTable` as a parameter.

Updating D108435: [CSSPGO] split context string II - reader/writer changes

Harbormaster completed remote builds in B121827: Diff 369559.Aug 30 2021, 2:46 PM

wenlei added inline comments.Aug 30 2021, 3:14 PM

llvm/include/llvm/ProfileData/SampleProf.h
533	This can be removed now?

Updating D108435: [CSSPGO] split context string II - reader/writer changes

llvm/include/llvm/ProfileData/SampleProf.h
533	yes, it's removed from the other patch but somehow not updated here.

Harbormaster completed remote builds in B121846: Diff 369582.Aug 30 2021, 4:53 PM

lgtm, thanks for working through the comments!

wenlei mentioned this in D108437: [CSSPGO] split context string III - llvm-profgen changes.Aug 30 2021, 6:24 PM

hoy closed this revision.Aug 31 2021, 4:19 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

ProfileData/

4 lines

3 lines

28 lines

74 lines

lib/

ProfileData/

SampleProf.cpp

45 lines

SampleProfReader.cpp

140 lines

SampleProfWriter.cpp

191 lines

test/

tools/

llvm-profdata/

Inputs/

cs-sample.proftext

26 lines

tools/

llvm-profdata/

llvm-profdata.cpp

97 lines

unittests/

ProfileData/

SampleProfTest.cpp

10 lines

Diff 369582

llvm/include/llvm/ProfileData/ProfileCommon.h

	Show First 20 Lines • Show All 86 Lines • ▼ Show 20 Lines

	class SampleProfileSummaryBuilder final : public ProfileSummaryBuilder {			class SampleProfileSummaryBuilder final : public ProfileSummaryBuilder {
	public:			public:
	SampleProfileSummaryBuilder(std::vector<uint32_t> Cutoffs)			SampleProfileSummaryBuilder(std::vector<uint32_t> Cutoffs)
	: ProfileSummaryBuilder(std::move(Cutoffs)) {}			: ProfileSummaryBuilder(std::move(Cutoffs)) {}

	void addRecord(const sampleprof::FunctionSamples &FS,			void addRecord(const sampleprof::FunctionSamples &FS,
	bool isCallsiteSample = false);			bool isCallsiteSample = false);
	std::unique_ptr<ProfileSummary> computeSummaryForProfiles(			std::unique_ptr<ProfileSummary>
	const StringMap<sampleprof::FunctionSamples> &Profiles);			computeSummaryForProfiles(const sampleprof::SampleProfileMap &Profiles);
	std::unique_ptr<ProfileSummary> getSummary();			std::unique_ptr<ProfileSummary> getSummary();
	};			};

	/// This is called when a count is seen in the profile.			/// This is called when a count is seen in the profile.
	void ProfileSummaryBuilder::addCount(uint64_t Count) {			void ProfileSummaryBuilder::addCount(uint64_t Count) {
	TotalCount += Count;			TotalCount += Count;
	if (Count > MaxCount)			if (Count > MaxCount)
	MaxCount = Count;			MaxCount = Count;
	NumCounts++;			NumCounts++;
	CountFrequencies[Count]++;			CountFrequencies[Count]++;
	}			}

	} // end namespace llvm			} // end namespace llvm

	#endif // LLVM_PROFILEDATA_PROFILECOMMON_H			#endif // LLVM_PROFILEDATA_PROFILECOMMON_H

llvm/include/llvm/ProfileData/SampleProf.h

Show First 20 Lines • Show All 119 Lines • ▼ Show 20 Lines
// value of enum. Only append new ones.		// value of enum. Only append new ones.
enum SecType {		enum SecType {
SecInValid = 0,		SecInValid = 0,
SecProfSummary = 1,		SecProfSummary = 1,
SecNameTable = 2,		SecNameTable = 2,
SecProfileSymbolList = 3,		SecProfileSymbolList = 3,
SecFuncOffsetTable = 4,		SecFuncOffsetTable = 4,
SecFuncMetadata = 5,		SecFuncMetadata = 5,
		SecCSNameTable = 6,
// marker for the first type of profile.		// marker for the first type of profile.
SecFuncProfileFirst = 32,		SecFuncProfileFirst = 32,
SecLBRProfile = SecFuncProfileFirst		SecLBRProfile = SecFuncProfileFirst
};		};

static inline std::string getSecName(SecType Type) {		static inline std::string getSecName(SecType Type) {
switch (Type) {		switch (Type) {
case SecInValid:		case SecInValid:
return "InvalidSection";		return "InvalidSection";
case SecProfSummary:		case SecProfSummary:
return "ProfileSummarySection";		return "ProfileSummarySection";
case SecNameTable:		case SecNameTable:
return "NameTableSection";		return "NameTableSection";
case SecProfileSymbolList:		case SecProfileSymbolList:
return "ProfileSymbolListSection";		return "ProfileSymbolListSection";
case SecFuncOffsetTable:		case SecFuncOffsetTable:
return "FuncOffsetTableSection";		return "FuncOffsetTableSection";
case SecFuncMetadata:		case SecFuncMetadata:
return "FunctionMetadata";		return "FunctionMetadata";
		case SecCSNameTable:
		return "CSNameTableSection";
case SecLBRProfile:		case SecLBRProfile:
return "LBRProfileSection";		return "LBRProfileSection";
}		}
llvm_unreachable("A SecType has no name for output");		llvm_unreachable("A SecType has no name for output");
}		}

// Entry type of section header table used by SampleProfileExtBinaryBaseReader		// Entry type of section header table used by SampleProfileExtBinaryBaseReader
// and SampleProfileExtBinaryBaseWriter.		// and SampleProfileExtBinaryBaseWriter.
▲ Show 20 Lines • Show All 367 Lines • ▼ Show 20 Lines	uint64_t getHashCode() const {
return hasContext() ? hash_value(getFullContext()) : hash_value(getName());		return hasContext() ? hash_value(getFullContext()) : hash_value(getName());
}		}

/// Set the name of the function.		/// Set the name of the function.
void setName(StringRef FunctionName) { Name = FunctionName; }		void setName(StringRef FunctionName) { Name = FunctionName; }

void setContext(SampleContextRefType Context,		void setContext(SampleContextRefType Context,
ContextStateMask CState = RawContext) {		ContextStateMask CState = RawContext) {
FullContext = Context;		FullContext = Context;
		wenleiUnsubmitted Not Done Reply Inline Actions This can be removed now? wenlei: This can be removed now?
		hoyAuthorUnsubmitted Done Reply Inline Actions yes, it's removed from the other patch but somehow not updated here. hoy: yes, it's removed from the other patch but somehow not updated here.
Name = Context.back().CallerName;		Name = Context.back().CallerName;
State = CState;		State = CState;
}		}

bool operator==(const SampleContext &That) const {		bool operator==(const SampleContext &That) const {
return State == That.State && Name == That.Name &&		return State == That.State && Name == That.Name &&
FullContext == That.FullContext;		FullContext == That.FullContext;
}		}
▲ Show 20 Lines • Show All 603 Lines • Show Last 20 Lines

llvm/include/llvm/ProfileData/SampleProfReader.h

Show First 20 Lines • Show All 126 Lines • ▼ Show 20 Lines
// We support the following types of metadata:		// We support the following types of metadata:
//		//
// a. CFG Checksum (a.k.a. function hash):		// a. CFG Checksum (a.k.a. function hash):
// !CFGChecksum: 12345		// !CFGChecksum: 12345
// b. CFG Checksum (see ContextAttributeMask):		// b. CFG Checksum (see ContextAttributeMask):
// !Atribute: 1		// !Atribute: 1
//		//
//		//
// Binary format		// Binary format
		wenleiUnsubmitted Not Done Reply Inline Actions In this section, we need to add documentation/spec for CSNameTable how context is stored, and how it interacts with NameTable. wenlei: In this section, we need to add documentation/spec for CSNameTable how context is stored, and…
		hoyAuthorUnsubmitted Done Reply Inline Actions CSNameTable is a concept of the extensible binary format, not the basic binary format, like other concepts such as func name offset table, func metadata table. We could comment about CSNameTable in the `SampleProfileReaderExtBinary/SampleProfileWriterExtBinary`, but I didn't find a good centralized place to do that. Seems that descriptions about other sections are scattered around the implementation code. hoy: CSNameTable is a concept of the extensible binary format, not the basic binary format, like…
		wenleiUnsubmitted Not Done Reply Inline Actions Ok, didn't realize we don't use header comment for extbin. That's fine. Yeah, it'd be good to have some comment for CSNameTable somewhere to explain the nesting structure of context and raw names. wenlei: Ok, didn't realize we don't use header comment for extbin. That's fine. Yeah, it'd be good to…
		hoyAuthorUnsubmitted Done Reply Inline Actions Comment added to the definition of `SampleProfileReaderExtBinaryBase::readCSNameTableSec`. hoy: Comment added to the definition of `SampleProfileReaderExtBinaryBase::readCSNameTableSec`.
// -------------		// -------------
//		//
// This is a more compact encoding. Numbers are encoded as ULEB128 values		// This is a more compact encoding. Numbers are encoded as ULEB128 values
// and all strings are encoded in a name table. The file is organized in		// and all strings are encoded in a name table. The file is organized in
// the following sections:		// the following sections:
//		//
// MAGIC (uint64_t)		// MAGIC (uint64_t)
// File identifier computed by function SPMagic() (0x5350524f463432ff)		// File identifier computed by function SPMagic() (0x5350524f463432ff)
▲ Show 20 Lines • Show All 93 Lines • ▼ Show 20 Lines
#include "llvm/ProfileData/SampleProf.h"		#include "llvm/ProfileData/SampleProf.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/Discriminator.h"		#include "llvm/Support/Discriminator.h"
#include "llvm/Support/ErrorOr.h"		#include "llvm/Support/ErrorOr.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Support/SymbolRemappingReader.h"		#include "llvm/Support/SymbolRemappingReader.h"
#include <algorithm>		#include <algorithm>
#include <cstdint>		#include <cstdint>
		#include <list>
#include <memory>		#include <memory>
#include <string>		#include <string>
#include <system_error>		#include <system_error>
#include <unordered_set>		#include <unordered_set>
#include <vector>		#include <vector>

namespace llvm {		namespace llvm {

▲ Show 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	if (Remapper)
Remapper->applyRemapping(Ctx);		Remapper->applyRemapping(Ctx);
FunctionSamples::UseMD5 = useMD5();		FunctionSamples::UseMD5 = useMD5();
return sampleprof_error::success;		return sampleprof_error::success;
}		}

/// The implementaion to read sample profiles from the associated file.		/// The implementaion to read sample profiles from the associated file.
virtual std::error_code readImpl() = 0;		virtual std::error_code readImpl() = 0;

/// Print the profile for \p FName on stream \p OS.		/// Print the profile for \p FContext on stream \p OS.
void dumpFunctionProfile(StringRef FName, raw_ostream &OS = dbgs());		void dumpFunctionProfile(SampleContext FContext, raw_ostream &OS = dbgs());

/// Collect functions with definitions in Module M. For reader which		/// Collect functions with definitions in Module M. For reader which
/// support loading function profiles on demand, return true when the		/// support loading function profiles on demand, return true when the
/// reader has been given a module. Always return false for reader		/// reader has been given a module. Always return false for reader
/// which doesn't support loading function profiles on demand.		/// which doesn't support loading function profiles on demand.
virtual bool collectFuncsFromModule() { return false; }		virtual bool collectFuncsFromModule() { return false; }

/// Print all the profiles on stream \p OS.		/// Print all the profiles on stream \p OS.
Show All 38 Lines	if (Remapper) {
if (It != Profiles.end())		if (It != Profiles.end())
return &It->second;		return &It->second;
}		}
}		}
return nullptr;		return nullptr;
}		}

/// Return all the profiles.		/// Return all the profiles.
StringMap<FunctionSamples> &getProfiles() { return Profiles; }		SampleProfileMap &getProfiles() { return Profiles; }

/// Report a parse error message.		/// Report a parse error message.
void reportError(int64_t LineNumber, const Twine &Msg) const {		void reportError(int64_t LineNumber, const Twine &Msg) const {
Ctx.diagnose(DiagnosticInfoSampleProfile(Buffer->getBufferIdentifier(),		Ctx.diagnose(DiagnosticInfoSampleProfile(Buffer->getBufferIdentifier(),
LineNumber, Msg));		LineNumber, Msg));
}		}

/// Create a sample profile reader appropriate to the file format.		/// Create a sample profile reader appropriate to the file format.
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	public:
void setModule(const Module *Mod) { M = Mod; }		void setModule(const Module *Mod) { M = Mod; }

protected:		protected:
/// Map every function to its associated profile.		/// Map every function to its associated profile.
///		///
/// The profile of every function executed at runtime is collected		/// The profile of every function executed at runtime is collected
/// in the structure FunctionSamples. This maps function objects		/// in the structure FunctionSamples. This maps function objects
/// to their corresponding profiles.		/// to their corresponding profiles.
StringMap<FunctionSamples> Profiles;		SampleProfileMap Profiles;
		wenleiUnsubmitted Not Done Reply Inline Actions nit: remove the blank line wenlei: nit: remove the blank line
		hoyAuthorUnsubmitted Done Reply Inline Actions done. hoy: done.

/// LLVM context used to emit diagnostics.		/// LLVM context used to emit diagnostics.
LLVMContext &Ctx;		LLVMContext &Ctx;

/// Memory buffer holding the profile file.		/// Memory buffer holding the profile file.
std::unique_ptr<MemoryBuffer> Buffer;		std::unique_ptr<MemoryBuffer> Buffer;

/// Extra name buffer holding names created on demand.		/// Extra name buffer holding names created on demand.
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	public:
/// Read and validate the file header.		/// Read and validate the file header.
std::error_code readHeader() override { return sampleprof_error::success; }		std::error_code readHeader() override { return sampleprof_error::success; }

/// Read sample profiles from the associated file.		/// Read sample profiles from the associated file.
std::error_code readImpl() override;		std::error_code readImpl() override;

/// Return true if \p Buffer is in the format supported by this class.		/// Return true if \p Buffer is in the format supported by this class.
static bool hasFormat(const MemoryBuffer &Buffer);		static bool hasFormat(const MemoryBuffer &Buffer);

		private:
		/// CSNameTable is used to save full context vectors. This serves as an
		/// underlying immutable buffer for all clients.
		std::list<SampleContextFrameVector> CSNameTable;
};		};

class SampleProfileReaderBinary : public SampleProfileReader {		class SampleProfileReaderBinary : public SampleProfileReader {
		wenleiUnsubmitted Not Done Reply Inline Actions typo: given wenlei: typo: given
		hoyAuthorUnsubmitted Done Reply Inline Actions fixed. hoy: fixed.
public:		public:
SampleProfileReaderBinary(std::unique_ptr<MemoryBuffer> B, LLVMContext &C,		SampleProfileReaderBinary(std::unique_ptr<MemoryBuffer> B, LLVMContext &C,
SampleProfileFormat Format = SPF_None)		SampleProfileFormat Format = SPF_None)
: SampleProfileReader(std::move(B), C, Format) {}		: SampleProfileReader(std::move(B), C, Format) {}

/// Read and validate the file header.		/// Read and validate the file header.
virtual std::error_code readHeader() override;		virtual std::error_code readHeader() override;

▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	protected:
/// Points to the end of the buffer.		/// Points to the end of the buffer.
const uint8_t *End = nullptr;		const uint8_t *End = nullptr;

/// Function name table.		/// Function name table.
std::vector<StringRef> NameTable;		std::vector<StringRef> NameTable;

/// Read a string indirectly via the name table.		/// Read a string indirectly via the name table.
virtual ErrorOr<StringRef> readStringFromTable();		virtual ErrorOr<StringRef> readStringFromTable();
		virtual ErrorOr<SampleContext> readSampleContextFromTable();

private:		private:
std::error_code readSummaryEntry(std::vector<ProfileSummaryEntry> &Entries);		std::error_code readSummaryEntry(std::vector<ProfileSummaryEntry> &Entries);
virtual std::error_code verifySPMagic(uint64_t Magic) = 0;		virtual std::error_code verifySPMagic(uint64_t Magic) = 0;
};		};

class SampleProfileReaderRawBinary : public SampleProfileReaderBinary {		class SampleProfileReaderRawBinary : public SampleProfileReaderBinary {
private:		private:
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines	protected:
std::error_code readSecHdrTableEntry(uint32_t Idx);		std::error_code readSecHdrTableEntry(uint32_t Idx);
std::error_code readSecHdrTable();		std::error_code readSecHdrTable();

std::error_code readFuncMetadata(bool ProfileHasAttribute);		std::error_code readFuncMetadata(bool ProfileHasAttribute);
std::error_code readFuncOffsetTable();		std::error_code readFuncOffsetTable();
std::error_code readFuncProfiles();		std::error_code readFuncProfiles();
std::error_code readMD5NameTable();		std::error_code readMD5NameTable();
std::error_code readNameTableSec(bool IsMD5);		std::error_code readNameTableSec(bool IsMD5);
		std::error_code readCSNameTableSec();
std::error_code readProfileSymbolList();		std::error_code readProfileSymbolList();

virtual std::error_code readHeader() override;		virtual std::error_code readHeader() override;
virtual std::error_code verifySPMagic(uint64_t Magic) override = 0;		virtual std::error_code verifySPMagic(uint64_t Magic) override = 0;
virtual std::error_code readOneSection(const uint8_t *Start, uint64_t Size,		virtual std::error_code readOneSection(const uint8_t *Start, uint64_t Size,
const SecHdrTableEntry &Entry);		const SecHdrTableEntry &Entry);
// placeholder for subclasses to dispatch their own section readers.		// placeholder for subclasses to dispatch their own section readers.
virtual std::error_code readCustomSection(const SecHdrTableEntry &Entry) = 0;		virtual std::error_code readCustomSection(const SecHdrTableEntry &Entry) = 0;
virtual ErrorOr<StringRef> readStringFromTable() override;		virtual ErrorOr<StringRef> readStringFromTable() override;
		virtual ErrorOr<SampleContext> readSampleContextFromTable() override;
		ErrorOr<SampleContextFrames> readContextFromTable();

std::unique_ptr<ProfileSymbolList> ProfSymList;		std::unique_ptr<ProfileSymbolList> ProfSymList;

/// The table mapping from function name to the offset of its FunctionSample		/// The table mapping from function context to the offset of its
/// towards file start.		/// FunctionSample towards file start.
DenseMap<StringRef, uint64_t> FuncOffsetTable;		DenseMap<SampleContext, uint64_t> FuncOffsetTable;
/// The set containing the functions to use when compiling a module.		/// The set containing the functions to use when compiling a module.
DenseSet<StringRef> FuncsToUse;		DenseSet<StringRef> FuncsToUse;

/// Use fixed length MD5 instead of ULEB128 encoding so NameTable doesn't		/// Use fixed length MD5 instead of ULEB128 encoding so NameTable doesn't
/// need to be read in up front and can be directly accessed using index.		/// need to be read in up front and can be directly accessed using index.
bool FixedLengthMD5 = false;		bool FixedLengthMD5 = false;
/// The starting address of NameTable containing fixed length MD5.		/// The starting address of NameTable containing fixed length MD5.
const uint8_t *MD5NameMemStart = nullptr;		const uint8_t *MD5NameMemStart = nullptr;

/// If MD5 is used in NameTable section, the section saves uint64_t data.		/// If MD5 is used in NameTable section, the section saves uint64_t data.
/// The uint64_t data has to be converted to a string and then the string		/// The uint64_t data has to be converted to a string and then the string
/// will be used to initialize StringRef in NameTable.		/// will be used to initialize StringRef in NameTable.
/// Note NameTable contains StringRef so it needs another buffer to own		/// Note NameTable contains StringRef so it needs another buffer to own
/// the string data. MD5StringBuf serves as the string buffer that is		/// the string data. MD5StringBuf serves as the string buffer that is
/// referenced by NameTable (vector of StringRef). We make sure		/// referenced by NameTable (vector of StringRef). We make sure
/// the lifetime of MD5StringBuf is not shorter than that of NameTable.		/// the lifetime of MD5StringBuf is not shorter than that of NameTable.
std::unique_ptr<std::vector<std::string>> MD5StringBuf;		std::unique_ptr<std::vector<std::string>> MD5StringBuf;

		/// CSNameTable is used to save full context vectors. This serves as an
		/// underlying immutable buffer for all clients.
		std::unique_ptr<const std::vector<SampleContextFrameVector>> CSNameTable;
		wmiUnsubmitted Not Done Reply Inline Actions Same as SampleProfileReaderText, can we use std::list instead of std::vector so the storage underlying will not be moved after more elements are inserted? wmi: Same as SampleProfileReaderText, can we use std::list instead of std::vector so the storage…
		hoyAuthorUnsubmitted Done Reply Inline Actions Std::vector is good for looking up with an offset, which is needed during the load of function profiles. Also, std::vector is more efficient for the extbinary reader, since number of contexts is known beforehand. Once the CS name table section is read, the std::vector should be frozen and no more names should be appended. hoy: Std::vector is good for looking up with an offset, which is needed during the load of function…
		wmiUnsubmitted Not Done Reply Inline Actions Yeah, right, std::vector is good for looking up. Now once cs name table is read, there is no other code appending names to the table. Just want to know whether we can have some additional mechanism to enforce that. Can we save the size of the table after reading cs name table and check that the table size is the same before visiting any element from the table? wmi: Yeah, right, std::vector is good for looking up. Now once cs name table is read, there is no…
		hoyAuthorUnsubmitted Done Reply Inline Actions Good point. Setting the underlying object to be constant so that it is immutable once populated. hoy: Good point. Setting the underlying object to be constant so that it is immutable once populated.

/// If SkipFlatProf is true, skip the sections with		/// If SkipFlatProf is true, skip the sections with
/// SecFlagFlat flag.		/// SecFlagFlat flag.
bool SkipFlatProf = false;		bool SkipFlatProf = false;

public:		public:
SampleProfileReaderExtBinaryBase(std::unique_ptr<MemoryBuffer> B,		SampleProfileReaderExtBinaryBase(std::unique_ptr<MemoryBuffer> B,
LLVMContext &C, SampleProfileFormat Format)		LLVMContext &C, SampleProfileFormat Format)
: SampleProfileReaderBinary(std::move(B), C, Format) {}		: SampleProfileReaderBinary(std::move(B), C, Format) {}
▲ Show 20 Lines • Show All 134 Lines • Show Last 20 Lines

llvm/include/llvm/ProfileData/SampleProfWriter.h

Show First 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	public:
/// Write sample profiles in \p S.		/// Write sample profiles in \p S.
///		///
/// \returns status code of the file update operation.		/// \returns status code of the file update operation.
virtual std::error_code writeSample(const FunctionSamples &S) = 0;		virtual std::error_code writeSample(const FunctionSamples &S) = 0;

/// Write all the sample profiles in the given map of samples.		/// Write all the sample profiles in the given map of samples.
///		///
/// \returns status code of the file update operation.		/// \returns status code of the file update operation.
virtual std::error_code write(const StringMap<FunctionSamples> &ProfileMap);		virtual std::error_code write(const SampleProfileMap &ProfileMap);

raw_ostream &getOutputStream() { return *OutputStream; }		raw_ostream &getOutputStream() { return *OutputStream; }

/// Profile writer factory.		/// Profile writer factory.
///		///
/// Create a new file writer based on the value of \p Format.		/// Create a new file writer based on the value of \p Format.
static ErrorOr<std::unique_ptr<SampleProfileWriter>>		static ErrorOr<std::unique_ptr<SampleProfileWriter>>
create(StringRef Filename, SampleProfileFormat Format);		create(StringRef Filename, SampleProfileFormat Format);
Show All 9 Lines	public:
virtual void setPartialProfile() {}		virtual void setPartialProfile() {}
virtual void resetSecLayout(SectionLayout SL) {}		virtual void resetSecLayout(SectionLayout SL) {}

protected:		protected:
SampleProfileWriter(std::unique_ptr<raw_ostream> &OS)		SampleProfileWriter(std::unique_ptr<raw_ostream> &OS)
: OutputStream(std::move(OS)) {}		: OutputStream(std::move(OS)) {}

/// Write a file header for the profile file.		/// Write a file header for the profile file.
virtual std::error_code		virtual std::error_code writeHeader(const SampleProfileMap &ProfileMap) = 0;
writeHeader(const StringMap<FunctionSamples> &ProfileMap) = 0;

// Write function profiles to the profile file.		// Write function profiles to the profile file.
virtual std::error_code		virtual std::error_code writeFuncProfiles(const SampleProfileMap &ProfileMap);
writeFuncProfiles(const StringMap<FunctionSamples> &ProfileMap);

/// Output stream where to emit the profile to.		/// Output stream where to emit the profile to.
std::unique_ptr<raw_ostream> OutputStream;		std::unique_ptr<raw_ostream> OutputStream;

/// Profile summary.		/// Profile summary.
std::unique_ptr<ProfileSummary> Summary;		std::unique_ptr<ProfileSummary> Summary;

/// Compute summary for this profile.		/// Compute summary for this profile.
void computeSummary(const StringMap<FunctionSamples> &ProfileMap);		void computeSummary(const SampleProfileMap &ProfileMap);

/// Profile format.		/// Profile format.
SampleProfileFormat Format = SPF_None;		SampleProfileFormat Format = SPF_None;
};		};

/// Sample-based profile writer (text format).		/// Sample-based profile writer (text format).
class SampleProfileWriterText : public SampleProfileWriter {		class SampleProfileWriterText : public SampleProfileWriter {
public:		public:
std::error_code writeSample(const FunctionSamples &S) override;		std::error_code writeSample(const FunctionSamples &S) override;

protected:		protected:
SampleProfileWriterText(std::unique_ptr<raw_ostream> &OS)		SampleProfileWriterText(std::unique_ptr<raw_ostream> &OS)
: SampleProfileWriter(OS), Indent(0) {}		: SampleProfileWriter(OS), Indent(0) {}

std::error_code		std::error_code writeHeader(const SampleProfileMap &ProfileMap) override {
writeHeader(const StringMap<FunctionSamples> &ProfileMap) override {
return sampleprof_error::success;		return sampleprof_error::success;
}		}

private:		private:
/// Indent level to use when writing.		/// Indent level to use when writing.
///		///
/// This is used when printing inlined callees.		/// This is used when printing inlined callees.
unsigned Indent;		unsigned Indent;

friend ErrorOr<std::unique_ptr<SampleProfileWriter>>		friend ErrorOr<std::unique_ptr<SampleProfileWriter>>
SampleProfileWriter::create(std::unique_ptr<raw_ostream> &OS,		SampleProfileWriter::create(std::unique_ptr<raw_ostream> &OS,
SampleProfileFormat Format);		SampleProfileFormat Format);
};		};

/// Sample-based profile writer (binary format).		/// Sample-based profile writer (binary format).
class SampleProfileWriterBinary : public SampleProfileWriter {		class SampleProfileWriterBinary : public SampleProfileWriter {
public:		public:
SampleProfileWriterBinary(std::unique_ptr<raw_ostream> &OS)		SampleProfileWriterBinary(std::unique_ptr<raw_ostream> &OS)
: SampleProfileWriter(OS) {}		: SampleProfileWriter(OS) {}

virtual std::error_code writeSample(const FunctionSamples &S) override;		virtual std::error_code writeSample(const FunctionSamples &S) override;

protected:		protected:
		virtual MapVector<StringRef, uint32_t> &getNameTable() { return NameTable; }
virtual std::error_code writeMagicIdent(SampleProfileFormat Format);		virtual std::error_code writeMagicIdent(SampleProfileFormat Format);
virtual std::error_code writeNameTable();		virtual std::error_code writeNameTable();
virtual std::error_code		virtual std::error_code
writeHeader(const StringMap<FunctionSamples> &ProfileMap) override;		writeHeader(const SampleProfileMap &ProfileMap) override;
std::error_code writeSummary();		std::error_code writeSummary();
std::error_code writeNameIdx(StringRef FName, bool IsContextName = false);		virtual std::error_code writeContextIdx(const SampleContext &Context);
		std::error_code writeNameIdx(StringRef FName);
		wenleiUnsubmitted Not Done Reply Inline Actions Now that the parameter is no longer a string name, rename the function as well, e.g. writeContextIdx? Same for addName(const SampleContext &Context) -> addContext wenlei: Now that the parameter is no longer a string name, rename the function as well, e.g.
		hoyAuthorUnsubmitted Done Reply Inline Actions Sounds good. hoy: Sounds good.
std::error_code writeBody(const FunctionSamples &S);		std::error_code writeBody(const FunctionSamples &S);
inline void stablizeNameTable(std::set<StringRef> &V);		inline void stablizeNameTable(MapVector<StringRef, uint32_t> &NameTable,
		std::set<StringRef> &V);

MapVector<StringRef, uint32_t> NameTable;		MapVector<StringRef, uint32_t> NameTable;
std::unordered_set<std::string> BracketedContextStr;

void addName(StringRef FName, bool IsContextName = false);		void addName(StringRef FName);
		virtual void addContext(const SampleContext &Context);
void addNames(const FunctionSamples &S);		void addNames(const FunctionSamples &S);

private:		private:
friend ErrorOr<std::unique_ptr<SampleProfileWriter>>		friend ErrorOr<std::unique_ptr<SampleProfileWriter>>
SampleProfileWriter::create(std::unique_ptr<raw_ostream> &OS,		SampleProfileWriter::create(std::unique_ptr<raw_ostream> &OS,
SampleProfileFormat Format);		SampleProfileFormat Format);
};		};

class SampleProfileWriterRawBinary : public SampleProfileWriterBinary {		class SampleProfileWriterRawBinary : public SampleProfileWriterBinary {
using SampleProfileWriterBinary::SampleProfileWriterBinary;		using SampleProfileWriterBinary::SampleProfileWriterBinary;
};		};

const std::array<SmallVector<SecHdrTableEntry, 8>, NumOfLayout>		const std::array<SmallVector<SecHdrTableEntry, 8>, NumOfLayout>
ExtBinaryHdrLayoutTable = {		ExtBinaryHdrLayoutTable = {
// Note that SecFuncOffsetTable section is written after SecLBRProfile		// Note that SecFuncOffsetTable section is written after SecLBRProfile
// in the profile, but is put before SecLBRProfile in SectionHdrLayout.		// in the profile, but is put before SecLBRProfile in SectionHdrLayout.
// This is because sample reader follows the order in SectionHdrLayout		// This is because sample reader follows the order in SectionHdrLayout
// to read each section. To read function profiles on demand, sample		// to read each section. To read function profiles on demand, sample
// reader need to get the offset of each function profile first.		// reader need to get the offset of each function profile first.
//		//
// DefaultLayout		// DefaultLayout
SmallVector<SecHdrTableEntry, 8>({{SecProfSummary, 0, 0, 0, 0},		SmallVector<SecHdrTableEntry, 8>({{SecProfSummary, 0, 0, 0, 0},
{SecNameTable, 0, 0, 0, 0},		{SecNameTable, 0, 0, 0, 0},
		{SecCSNameTable, 0, 0, 0, 0},
{SecFuncOffsetTable, 0, 0, 0, 0},		{SecFuncOffsetTable, 0, 0, 0, 0},
{SecLBRProfile, 0, 0, 0, 0},		{SecLBRProfile, 0, 0, 0, 0},
{SecProfileSymbolList, 0, 0, 0, 0},		{SecProfileSymbolList, 0, 0, 0, 0},
{SecFuncMetadata, 0, 0, 0, 0}}),		{SecFuncMetadata, 0, 0, 0, 0}}),
// CtxSplitLayout		// CtxSplitLayout
SmallVector<SecHdrTableEntry, 8>({{SecProfSummary, 0, 0, 0, 0},		SmallVector<SecHdrTableEntry, 8>({{SecProfSummary, 0, 0, 0, 0},
{SecNameTable, 0, 0, 0, 0},		{SecNameTable, 0, 0, 0, 0},
// profile with context		// profile with context
// for next two sections		// for next two sections
{SecFuncOffsetTable, 0, 0, 0, 0},		{SecFuncOffsetTable, 0, 0, 0, 0},
{SecLBRProfile, 0, 0, 0, 0},		{SecLBRProfile, 0, 0, 0, 0},
// profile without context		// profile without context
// for next two sections		// for next two sections
{SecFuncOffsetTable, 0, 0, 0, 0},		{SecFuncOffsetTable, 0, 0, 0, 0},
{SecLBRProfile, 0, 0, 0, 0},		{SecLBRProfile, 0, 0, 0, 0},
{SecProfileSymbolList, 0, 0, 0, 0},		{SecProfileSymbolList, 0, 0, 0, 0},
{SecFuncMetadata, 0, 0, 0, 0}}),		{SecFuncMetadata, 0, 0, 0, 0}}),
};		};

class SampleProfileWriterExtBinaryBase : public SampleProfileWriterBinary {		class SampleProfileWriterExtBinaryBase : public SampleProfileWriterBinary {
using SampleProfileWriterBinary::SampleProfileWriterBinary;		using SampleProfileWriterBinary::SampleProfileWriterBinary;
public:		public:
virtual std::error_code		virtual std::error_code write(const SampleProfileMap &ProfileMap) override;
write(const StringMap<FunctionSamples> &ProfileMap) override;

virtual void setToCompressAllSections() override;		virtual void setToCompressAllSections() override;
void setToCompressSection(SecType Type);		void setToCompressSection(SecType Type);
virtual std::error_code writeSample(const FunctionSamples &S) override;		virtual std::error_code writeSample(const FunctionSamples &S) override;

// Set to use MD5 to represent string in NameTable.		// Set to use MD5 to represent string in NameTable.
virtual void setUseMD5() override {		virtual void setUseMD5() override {
UseMD5 = true;		UseMD5 = true;
Show All 38 Lines	for (auto &Entry : SectionHdrLayout) {
addSecFlag(Entry, Flag);		addSecFlag(Entry, Flag);
}		}
}		}
template <class SecFlagType>		template <class SecFlagType>
void addSectionFlag(uint32_t SectionIdx, SecFlagType Flag) {		void addSectionFlag(uint32_t SectionIdx, SecFlagType Flag) {
addSecFlag(SectionHdrLayout[SectionIdx], Flag);		addSecFlag(SectionHdrLayout[SectionIdx], Flag);
}		}

		virtual void addContext(const SampleContext &Context) override;

// placeholder for subclasses to dispatch their own section writers.		// placeholder for subclasses to dispatch their own section writers.
virtual std::error_code writeCustomSection(SecType Type) = 0;		virtual std::error_code writeCustomSection(SecType Type) = 0;
// Verify the SecLayout is supported by the format.		// Verify the SecLayout is supported by the format.
virtual void verifySecLayout(SectionLayout SL) = 0;		virtual void verifySecLayout(SectionLayout SL) = 0;

// specify the order to write sections.		// specify the order to write sections.
virtual std::error_code		virtual std::error_code writeSections(const SampleProfileMap &ProfileMap) = 0;
writeSections(const StringMap<FunctionSamples> &ProfileMap) = 0;

// Dispatch section writer for each section. \p LayoutIdx is the sequence		// Dispatch section writer for each section. \p LayoutIdx is the sequence
// number indicating where the section is located in SectionHdrLayout.		// number indicating where the section is located in SectionHdrLayout.
virtual std::error_code		virtual std::error_code writeOneSection(SecType Type, uint32_t LayoutIdx,
writeOneSection(SecType Type, uint32_t LayoutIdx,		const SampleProfileMap &ProfileMap);
const StringMap<FunctionSamples> &ProfileMap);

// Helper function to write name table.		// Helper function to write name table.
virtual std::error_code writeNameTable() override;		virtual std::error_code writeNameTable() override;
		virtual std::error_code
		writeContextIdx(const SampleContext &Context) override;
		std::error_code writeCSNameIdx(const SampleContext &Context);
		std::error_code writeCSNameTableSection();

std::error_code writeFuncMetadata(const StringMap<FunctionSamples> &Profiles);		std::error_code writeFuncMetadata(const SampleProfileMap &Profiles);

// Functions to write various kinds of sections.		// Functions to write various kinds of sections.
std::error_code		std::error_code writeNameTableSection(const SampleProfileMap &ProfileMap);
writeNameTableSection(const StringMap<FunctionSamples> &ProfileMap);
std::error_code writeFuncOffsetTable();		std::error_code writeFuncOffsetTable();
std::error_code writeProfileSymbolListSection();		std::error_code writeProfileSymbolListSection();

SectionLayout SecLayout = DefaultLayout;		SectionLayout SecLayout = DefaultLayout;
// Specifiy the order of sections in section header table. Note		// Specifiy the order of sections in section header table. Note
// the order of sections in SecHdrTable may be different that the		// the order of sections in SecHdrTable may be different that the
// order in SectionHdrLayout. sample Reader will follow the order		// order in SectionHdrLayout. sample Reader will follow the order
// in SectionHdrLayout to read each section.		// in SectionHdrLayout to read each section.
SmallVector<SecHdrTableEntry, 8> SectionHdrLayout =		SmallVector<SecHdrTableEntry, 8> SectionHdrLayout =
ExtBinaryHdrLayoutTable[DefaultLayout];		ExtBinaryHdrLayoutTable[DefaultLayout];

// Save the start of SecLBRProfile so we can compute the offset to the		// Save the start of SecLBRProfile so we can compute the offset to the
// start of SecLBRProfile for each Function's Profile and will keep it		// start of SecLBRProfile for each Function's Profile and will keep it
// in FuncOffsetTable.		// in FuncOffsetTable.
uint64_t SecLBRProfileStart = 0;		uint64_t SecLBRProfileStart = 0;

private:		private:
void allocSecHdrTable();		void allocSecHdrTable();
std::error_code writeSecHdrTable();		std::error_code writeSecHdrTable();
virtual std::error_code		virtual std::error_code
writeHeader(const StringMap<FunctionSamples> &ProfileMap) override;		writeHeader(const SampleProfileMap &ProfileMap) override;
std::error_code compressAndOutput();		std::error_code compressAndOutput();

// We will swap the raw_ostream held by LocalBufStream and that		// We will swap the raw_ostream held by LocalBufStream and that
// held by OutputStream if we try to add a section which needs		// held by OutputStream if we try to add a section which needs
// compression. After the swap, all the data written to output		// compression. After the swap, all the data written to output
// will be temporarily buffered into the underlying raw_string_ostream		// will be temporarily buffered into the underlying raw_string_ostream
// originally held by LocalBufStream. After the data writing for the		// originally held by LocalBufStream. After the data writing for the
// section is completed, compress the data in the local buffer,		// section is completed, compress the data in the local buffer,
// swap the raw_ostream back and write the compressed data to the		// swap the raw_ostream back and write the compressed data to the
// real output.		// real output.
std::unique_ptr<raw_ostream> LocalBufStream;		std::unique_ptr<raw_ostream> LocalBufStream;
// The location where the output stream starts.		// The location where the output stream starts.
uint64_t FileStart;		uint64_t FileStart;
// The location in the output stream where the SecHdrTable should be		// The location in the output stream where the SecHdrTable should be
// written to.		// written to.
uint64_t SecHdrTableOffset;		uint64_t SecHdrTableOffset;
// The table contains SecHdrTableEntry entries in order of how they are		// The table contains SecHdrTableEntry entries in order of how they are
// populated in the writer. It may be different from the order in		// populated in the writer. It may be different from the order in
// SectionHdrLayout which specifies the sequence in which sections will		// SectionHdrLayout which specifies the sequence in which sections will
// be read.		// be read.
std::vector<SecHdrTableEntry> SecHdrTable;		std::vector<SecHdrTableEntry> SecHdrTable;

// FuncOffsetTable maps function name to its profile offset in SecLBRProfile		// FuncOffsetTable maps function context to its profile offset in
// section. It is used to load function profile on demand.		// SecLBRProfile section. It is used to load function profile on demand.
MapVector<StringRef, uint64_t> FuncOffsetTable;		MapVector<SampleContext, uint64_t> FuncOffsetTable;
// Whether to use MD5 to represent string.		// Whether to use MD5 to represent string.
bool UseMD5 = false;		bool UseMD5 = false;

		/// CSNameTable maps function context to its offset in SecCSNameTable section.
		/// The offset will be used everywhere where the context is referenced.
		MapVector<SampleContext, uint32_t> CSNameTable;

ProfileSymbolList *ProfSymList = nullptr;		ProfileSymbolList *ProfSymList = nullptr;
};		};

class SampleProfileWriterExtBinary : public SampleProfileWriterExtBinaryBase {		class SampleProfileWriterExtBinary : public SampleProfileWriterExtBinaryBase {
public:		public:
SampleProfileWriterExtBinary(std::unique_ptr<raw_ostream> &OS)		SampleProfileWriterExtBinary(std::unique_ptr<raw_ostream> &OS)
: SampleProfileWriterExtBinaryBase(OS) {}		: SampleProfileWriterExtBinaryBase(OS) {}

private:		private:
std::error_code		std::error_code writeDefaultLayout(const SampleProfileMap &ProfileMap);
writeDefaultLayout(const StringMap<FunctionSamples> &ProfileMap);		std::error_code writeCtxSplitLayout(const SampleProfileMap &ProfileMap);
std::error_code
writeCtxSplitLayout(const StringMap<FunctionSamples> &ProfileMap);

virtual std::error_code		virtual std::error_code
writeSections(const StringMap<FunctionSamples> &ProfileMap) override;		writeSections(const SampleProfileMap &ProfileMap) override;

virtual std::error_code writeCustomSection(SecType Type) override {		virtual std::error_code writeCustomSection(SecType Type) override {
return sampleprof_error::success;		return sampleprof_error::success;
};		};

virtual void verifySecLayout(SectionLayout SL) override {		virtual void verifySecLayout(SectionLayout SL) override {
assert((SL == DefaultLayout \|\| SL == CtxSplitLayout) &&		assert((SL == DefaultLayout \|\| SL == CtxSplitLayout) &&
"Unsupported layout");		"Unsupported layout");
Show All 30 Lines
//		//
// We need Part2 because profile reader can use it to find out and read		// We need Part2 because profile reader can use it to find out and read
// function offset table without reading Part3 first.		// function offset table without reading Part3 first.
class SampleProfileWriterCompactBinary : public SampleProfileWriterBinary {		class SampleProfileWriterCompactBinary : public SampleProfileWriterBinary {
using SampleProfileWriterBinary::SampleProfileWriterBinary;		using SampleProfileWriterBinary::SampleProfileWriterBinary;

public:		public:
virtual std::error_code writeSample(const FunctionSamples &S) override;		virtual std::error_code writeSample(const FunctionSamples &S) override;
virtual std::error_code		virtual std::error_code write(const SampleProfileMap &ProfileMap) override;
write(const StringMap<FunctionSamples> &ProfileMap) override;

protected:		protected:
/// The table mapping from function name to the offset of its FunctionSample		/// The table mapping from function name to the offset of its FunctionSample
/// towards profile start.		/// towards profile start.
MapVector<StringRef, uint64_t> FuncOffsetTable;		MapVector<StringRef, uint64_t> FuncOffsetTable;
/// The offset of the slot to be filled with the offset of FuncOffsetTable		/// The offset of the slot to be filled with the offset of FuncOffsetTable
/// towards profile start.		/// towards profile start.
uint64_t TableOffset;		uint64_t TableOffset;
virtual std::error_code writeNameTable() override;		virtual std::error_code writeNameTable() override;
virtual std::error_code		virtual std::error_code
writeHeader(const StringMap<FunctionSamples> &ProfileMap) override;		writeHeader(const SampleProfileMap &ProfileMap) override;
std::error_code writeFuncOffsetTable();		std::error_code writeFuncOffsetTable();
};		};

} // end namespace sampleprof		} // end namespace sampleprof
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_PROFILEDATA_SAMPLEPROFWRITER_H		#endif // LLVM_PROFILEDATA_SAMPLEPROFWRITER_H

llvm/lib/ProfileData/SampleProf.cpp

Show First 20 Lines • Show All 193 Lines • ▼ Show 20 Lines

raw_ostream &llvm::sampleprof::operator<<(raw_ostream &OS,		raw_ostream &llvm::sampleprof::operator<<(raw_ostream &OS,
const FunctionSamples &FS) {		const FunctionSamples &FS) {
FS.print(OS);		FS.print(OS);
return OS;		return OS;
}		}

void sampleprof::sortFuncProfiles(		void sampleprof::sortFuncProfiles(
const StringMap<FunctionSamples> &ProfileMap,		const SampleProfileMap &ProfileMap,
std::vector<NameFunctionSamples> &SortedProfiles) {		std::vector<NameFunctionSamples> &SortedProfiles) {
for (const auto &I : ProfileMap) {		for (const auto &I : ProfileMap) {
assert(I.getKey() == I.second.getNameWithContext() &&		assert(I.first == I.second.getContext() && "Inconsistent profile map");
"Inconsistent profile map");		SortedProfiles.push_back(std::make_pair(I.second.getContext(), &I.second));
SortedProfiles.push_back(
std::make_pair(I.second.getNameWithContext(), &I.second));
}		}
llvm::stable_sort(SortedProfiles, [](const NameFunctionSamples &A,		llvm::stable_sort(SortedProfiles, [](const NameFunctionSamples &A,
const NameFunctionSamples &B) {		const NameFunctionSamples &B) {
if (A.second->getTotalSamples() == B.second->getTotalSamples())		if (A.second->getTotalSamples() == B.second->getTotalSamples())
return A.first > B.first;		return A.first < B.first;
return A.second->getTotalSamples() > B.second->getTotalSamples();		return A.second->getTotalSamples() > B.second->getTotalSamples();
});		});
}		}

unsigned FunctionSamples::getOffset(const DILocation *DIL) {		unsigned FunctionSamples::getOffset(const DILocation *DIL) {
return (DIL->getLine() - DIL->getScope()->getSubprogram()->getLine()) &		return (DIL->getLine() - DIL->getScope()->getSubprogram()->getLine()) &
0xffff;		0xffff;
}		}
▲ Show 20 Lines • Show All 116 Lines • ▼ Show 20 Lines	if (!TrimColdContext && !MergeColdContext)
return;		return;

// Nothing to merge if sample threshold is zero		// Nothing to merge if sample threshold is zero
if (ColdCountThreshold == 0)		if (ColdCountThreshold == 0)
return;		return;

// Filter the cold profiles from ProfileMap and move them into a tmp		// Filter the cold profiles from ProfileMap and move them into a tmp
// container		// container
std::vector<std::pair<StringRef, const FunctionSamples *>> ColdProfiles;		std::vector<std::pair<SampleContext, const FunctionSamples *>> ColdProfiles;
for (const auto &I : ProfileMap) {		for (const auto &I : ProfileMap) {
const FunctionSamples &FunctionProfile = I.second;		const FunctionSamples &FunctionProfile = I.second;
if (FunctionProfile.getTotalSamples() >= ColdCountThreshold)		if (FunctionProfile.getTotalSamples() >= ColdCountThreshold)
continue;		continue;
ColdProfiles.emplace_back(I.getKey(), &I.second);		ColdProfiles.emplace_back(I.first, &I.second);
}		}

// Remove the cold profile from ProfileMap and merge them into		// Remove the cold profile from ProfileMap and merge them into
// MergedProfileMap by the last K frames of context		// MergedProfileMap by the last K frames of context
StringMap<FunctionSamples> MergedProfileMap;		SampleProfileMap MergedProfileMap;
for (const auto &I : ColdProfiles) {		for (const auto &I : ColdProfiles) {
if (MergeColdContext) {		if (MergeColdContext) {
auto Ret = MergedProfileMap.try_emplace(		auto MergedContext = I.second->getContext().getContextFrames();
I.second->getContext().getContextWithLastKFrames(		if (ColdContextFrameLength < MergedContext.size())
ColdContextFrameLength),		MergedContext = MergedContext.take_back(ColdContextFrameLength);
FunctionSamples());		auto Ret = MergedProfileMap.emplace(MergedContext, FunctionSamples());
FunctionSamples &MergedProfile = Ret.first->second;		FunctionSamples &MergedProfile = Ret.first->second;
MergedProfile.merge(*I.second);		MergedProfile.merge(*I.second);
}		}
ProfileMap.erase(I.first);		ProfileMap.erase(I.first);
}		}

// Move the merged profiles into ProfileMap;		// Move the merged profiles into ProfileMap;
for (const auto &I : MergedProfileMap) {		for (const auto &I : MergedProfileMap) {
// Filter the cold merged profile		// Filter the cold merged profile
if (TrimColdContext && I.second.getTotalSamples() < ColdCountThreshold &&		if (TrimColdContext && I.second.getTotalSamples() < ColdCountThreshold &&
ProfileMap.find(I.getKey()) == ProfileMap.end())		ProfileMap.find(I.first) == ProfileMap.end())
continue;		continue;
// Merge the profile if the original profile exists, otherwise just insert		// Merge the profile if the original profile exists, otherwise just insert
// as a new profile		// as a new profile
auto Ret = ProfileMap.try_emplace(I.getKey(), FunctionSamples());		auto Ret = ProfileMap.emplace(I.first, FunctionSamples());
if (Ret.second) {		if (Ret.second) {
SampleContext FContext(Ret.first->first(), RawContext);		SampleContext FContext(Ret.first->first, RawContext);
FunctionSamples &FProfile = Ret.first->second;		FunctionSamples &FProfile = Ret.first->second;
FProfile.setContext(FContext);		FProfile.setContext(FContext);
FProfile.setName(FContext.getNameWithoutContext());
}		}
FunctionSamples &OrigProfile = Ret.first->second;		FunctionSamples &OrigProfile = Ret.first->second;
OrigProfile.merge(I.second);		OrigProfile.merge(I.second);
}		}
}		}

void SampleContextTrimmer::canonicalizeContextProfiles() {		void SampleContextTrimmer::canonicalizeContextProfiles() {
std::vector<StringRef> ProfilesToBeRemoved;		std::vector<SampleContext> ProfilesToBeRemoved;
StringMap<FunctionSamples> ProfilesToBeAdded;		SampleProfileMap ProfilesToBeAdded;
for (auto &I : ProfileMap) {		for (auto &I : ProfileMap) {
FunctionSamples &FProfile = I.second;		FunctionSamples &FProfile = I.second;
StringRef ContextStr = FProfile.getNameWithContext();		SampleContext &Context = FProfile.getContext();
if (I.first() == ContextStr)		if (I.first == Context)
continue;		continue;

// Use the context string from FunctionSamples to update the keys of		// Use the context string from FunctionSamples to update the keys of
// ProfileMap. They can get out of sync after context profile promotion		// ProfileMap. They can get out of sync after context profile promotion
// through pre-inliner.		// through pre-inliner.
// Duplicate the function profile for later insertion to avoid a conflict		// Duplicate the function profile for later insertion to avoid a conflict
// caused by a context both to be add and to be removed. This could happen		// caused by a context both to be add and to be removed. This could happen
// when a context is promoted to another context which is also promoted to		// when a context is promoted to another context which is also promoted to
// the third context. For example, given an original context A @ B @ C that		// the third context. For example, given an original context A @ B @ C that
// is promoted to B @ C and the original context B @ C which is promoted to		// is promoted to B @ C and the original context B @ C which is promoted to
// just C, adding B @ C to the profile map while removing same context (but		// just C, adding B @ C to the profile map while removing same context (but
// with different profiles) from the map can cause a conflict if they are		// with different profiles) from the map can cause a conflict if they are
// not handled in a right order. This can be solved by just caching the		// not handled in a right order. This can be solved by just caching the
// profiles to be added.		// profiles to be added.
auto Ret = ProfilesToBeAdded.try_emplace(ContextStr, FProfile);		auto Ret = ProfilesToBeAdded.emplace(Context, FProfile);
(void)Ret;		(void)Ret;
assert(Ret.second && "Context conflict during canonicalization");		assert(Ret.second && "Context conflict during canonicalization");
ProfilesToBeRemoved.push_back(I.first());		ProfilesToBeRemoved.push_back(I.first);
}		}

for (auto &I : ProfilesToBeRemoved) {		for (auto &I : ProfilesToBeRemoved) {
ProfileMap.erase(I);		ProfileMap.erase(I);
}		}

for (auto &I : ProfilesToBeAdded) {		for (auto &I : ProfilesToBeAdded) {
ProfileMap.try_emplace(I.first(), I.second);		ProfileMap.emplace(I.first, I.second);
}		}
}		}

std::error_code ProfileSymbolList::write(raw_ostream &OS) {		std::error_code ProfileSymbolList::write(raw_ostream &OS) {
// Sort the symbols before output. If doing compression.		// Sort the symbols before output. If doing compression.
// It will make the compression much more effective.		// It will make the compression much more effective.
std::vector<StringRef> SortedList(Syms.begin(), Syms.end());		std::vector<StringRef> SortedList(Syms.begin(), Syms.end());
llvm::sort(SortedList);		llvm::sort(SortedList);
Show All 19 Lines

llvm/lib/ProfileData/SampleProfReader.cpp

Show First 20 Lines • Show All 53 Lines • ▼ Show 20 Lines
static cl::opt<bool> ProfileIsFSDisciminator(		static cl::opt<bool> ProfileIsFSDisciminator(
"profile-isfs", cl::Hidden, cl::init(false),		"profile-isfs", cl::Hidden, cl::init(false),
cl::desc("Profile uses flow sensitive discriminators"));		cl::desc("Profile uses flow sensitive discriminators"));

/// Dump the function profile for \p FName.		/// Dump the function profile for \p FName.
///		///
/// \param FName Name of the function to print.		/// \param FName Name of the function to print.
/// \param OS Stream to emit the output to.		/// \param OS Stream to emit the output to.
void SampleProfileReader::dumpFunctionProfile(StringRef FName,		void SampleProfileReader::dumpFunctionProfile(SampleContext FContext,
raw_ostream &OS) {		raw_ostream &OS) {
OS << "Function: " << FName << ": " << Profiles[FName];		OS << "Function: " << FContext.toString() << ": " << Profiles[FContext];
}		}

/// Dump all the function profiles found on stream \p OS.		/// Dump all the function profiles found on stream \p OS.
void SampleProfileReader::dump(raw_ostream &OS) {		void SampleProfileReader::dump(raw_ostream &OS) {
std::vector<NameFunctionSamples> V;		std::vector<NameFunctionSamples> V;
sortFuncProfiles(Profiles, V);		sortFuncProfiles(Profiles, V);
for (const auto &I : V)		for (const auto &I : V)
dumpFunctionProfile(I.first, OS);		dumpFunctionProfile(I.first, OS);
▲ Show 20 Lines • Show All 157 Lines • ▼ Show 20 Lines	if (isDigit(Rest[0])) {
if (Rest.substr(n3 + 1).getAsInteger(10, NumSamples))		if (Rest.substr(n3 + 1).getAsInteger(10, NumSamples))
return false;		return false;
}		}
return true;		return true;
}		}

/// Load samples from a text file.		/// Load samples from a text file.
///		///
/// See the documentation at the top of the file for an explanation of		/// See the documentation at the top of the file for an explanation of
		wmiUnsubmitted Not Done Reply Inline Actions Looks like it can be a utility function in SampleProfile.h and may be reused somewhere else. wmi: Looks like it can be a utility function in SampleProfile.h and may be reused somewhere else.
		hoyAuthorUnsubmitted Done Reply Inline Actions It's actually moved out of SampleProfile.h since it's only used by the text reader. Also the text reader should be the only part of the toolchain that needs to parse full context strings. Other parts just need to deal with ArrayRef form of context. What do you think? hoy: It's actually moved out of SampleProfile.h since it's only used by the text reader. Also the…
		wmiUnsubmitted Not Done Reply Inline Actions Ok, if it is less likely to be used by others, it is fine to keep as it is. wmi: Ok, if it is less likely to be used by others, it is fine to keep as it is.
/// the expected format.		/// the expected format.
///		///
/// \returns true if the file was loaded successfully, false otherwise.		/// \returns true if the file was loaded successfully, false otherwise.
std::error_code SampleProfileReaderText::readImpl() {		std::error_code SampleProfileReaderText::readImpl() {
line_iterator LineIt(Buffer, /SkipBlanks=*/true, '#');		line_iterator LineIt(Buffer, /SkipBlanks=*/true, '#');
sampleprof_error Result = sampleprof_error::success;		sampleprof_error Result = sampleprof_error::success;

InlineCallStack InlineStack;		InlineCallStack InlineStack;
Show All 14 Lines	for (; !LineIt.is_at_eof(); ++LineIt) {
// the compiler decides not to emit the function (e.g., it was inlined		// the compiler decides not to emit the function (e.g., it was inlined
// and removed). In this case, the binary will not have the linkage		// and removed). In this case, the binary will not have the linkage
// name for the function, so the profiler will emit the function's		// name for the function, so the profiler will emit the function's
// unmangled name, which may contain characters like ':' and '>' in its		// unmangled name, which may contain characters like ':' and '>' in its
// name (member functions, templates, etc).		// name (member functions, templates, etc).
//		//
// The only requirement we place on the identifier, then, is that it		// The only requirement we place on the identifier, then, is that it
// should not begin with a number.		// should not begin with a number.
if ((*LineIt)[0] != ' ') {		if ((*LineIt)[0] != ' ') {
uint64_t NumSamples, NumHeadSamples;		uint64_t NumSamples, NumHeadSamples;
StringRef FName;		StringRef FName;
if (!ParseHead(*LineIt, FName, NumSamples, NumHeadSamples)) {		if (!ParseHead(*LineIt, FName, NumSamples, NumHeadSamples)) {
reportError(LineIt.line_number(),		reportError(LineIt.line_number(),
"Expected 'mangled_name:NUM:NUM', found " + *LineIt);		"Expected 'mangled_name:NUM:NUM', found " + *LineIt);
return sampleprof_error::malformed;		return sampleprof_error::malformed;
}		}
		wmiUnsubmitted Not Done Reply Inline Actions Can we make it a utility function? wmi: Can we make it a utility function?
SeenMetadata = false;		SeenMetadata = false;
SampleContext FContext(FName);		SampleContext FContext(FName, CSNameTable);
if (FContext.hasContext())		if (FContext.hasContext())
++CSProfileCount;		++CSProfileCount;
Profiles[FContext] = FunctionSamples();		Profiles[FContext] = FunctionSamples();
FunctionSamples &FProfile = Profiles[FContext];		FunctionSamples &FProfile = Profiles[FContext];
FProfile.setName(FContext.getNameWithoutContext());
FProfile.setContext(FContext);		FProfile.setContext(FContext);
		wenleiUnsubmitted Not Done Reply Inline Actions Even though we only need to deal with string context in profile reader/writer for text profile, it's probably still cleaner to keep all string context related parsing into SampleContext. `createContext` is more like a ctor. I'd prefer keep string decoding, createContext in its original place in SampleContext. That way, we can construct a context from string, SampleContext::setContext remain a private helper too, and the logic here can be simpler, just like before. SampleContext does still have getContextString and toString, so it's not really isolated from string representation, might as well keep all string stuff together there for consistency. wenlei: Even though we only need to deal with string context in profile reader/writer for text profile…
		hoyAuthorUnsubmitted Done Reply Inline Actions Makes sense. Moved back to SampleContext. hoy: Makes sense. Moved back to SampleContext.
		wenleiUnsubmitted Not Done Reply Inline Actions Thanks. Can you move it back in D107299, so we don't see the change back and forth, just for review? Can we also fold `FName.startswith("[")` in to SampleContext? Additionally, why we do need a `createContextFromString` instead of using overload ctors? This is also inconsistent between how SampleContext is created from text profile (createContextFromString) vs from binary profile (ctor). What I was thinking is have SampleContext decide how to create the object, so there's no changed needed here, just like before. wenlei: Thanks. Can you move it back in D107299, so we don't see the change back and forth, just for…
		hoyAuthorUnsubmitted Done Reply Inline Actions Constructing a CS context will require additional parameter than non-CS profile, especially the underlying context vector. That is causing the inconsistency with the context split work and I'm separating the construction of CS and non-CS contexts. Currently CS context is only constructible from a `SampleContextFrameVector`. Another reason is that I was hoping to construct `SampleContext` in a quick manner from `StringRef` to favor non-CS profile. hoy: Constructing a CS context will require additional parameter than non-CS profile, especially the…
		wenleiUnsubmitted Not Done Reply Inline Actions Can you move it back in D107299, so we don't see the change back and forth, just for review? Sorry my bad, I meant to say D108433. I think the key blocker for everything to be taken care from within ctor is that you need reader to own the context created from the string. How about having a ctor with StringRef and CSNameTable as parameter - it puts the context into CSNameTable (owned by reader) for CS profile, and the CSNameTable would be ignored for non-CS profile. The current implementation works, but reader has to be aware of the actual string representation of context. I thought it'd be cleaner if such representation is all dealt with from within SampleContext. Currently, it's indeed mostly handled by SampleContext, except it's bleeding into reader here. wenlei: > Can you move it back in D107299, so we don't see the change back and forth, just for review?
		hoyAuthorUnsubmitted Done Reply Inline Actions Yeah, that's why I originally put everything related to context string parsing in the reader. I thought they were related to profile-specific representation that `SampleContext` shouldn't care. I guess moving the parsing code back is better for extension and code sharing, when we have new module doing the same thing in the future. Adding a new constructor with StringRef and CSNameTable as parameter can work but feel like it's easier and more clear for the reader to know what it constructs. Hiding that from the reader is fine, but asking the reader to provide an underlying CS name table which may not be needed for non-CS sounds a bit confusing. That also kinds of exposes reader-specific implementation to `SampleContext`. I think having the reader be aware of the specific representation might be reasonable, since context representation is a part of the profile format. What do you think? hoy: Yeah, that's why I originally put everything related to context string parsing in the reader. I…
		wenleiUnsubmitted Not Done Reply Inline Actions asking the reader to provide an underlying CS name table which may not be needed for non-CS sounds a bit confusing. That also kinds of exposes reader-specific implementation to SampleContext. I don't see this as exposing reader-specific stuff to sample context. It'd just be the expectation of the sample context API that requires a buffer to hold newly created context. This is in essence no different from how string buffer is passed in getRepInFormat (and only used for MD5), and I don't see it as coupling between reader and context. I think having the reader be aware of the specific representation might be reasonable, since context representation is a part of the profile format. I think the actual string presentation of context is not tied to the format, but rather the format is using the string representation directly. Also if string representation is considered part of format, then all of the string related stuff should be part of reader (like your earlier change). The possible reusing of string functions you mentioned also shows that the string representation is not format-specific. wenlei: > asking the reader to provide an underlying CS name table which may not be needed for non-CS…
		hoyAuthorUnsubmitted Done Reply Inline Actions This is in essence no different from how string buffer is passed in getRepInFormat (and only used for MD5), and I don't see it as coupling between reader and context. `getRepInFormat` takes a string buffer instead of a string table which is not coupled with the reader. However, `CSNameTable` is an implementation detail of the reader. Having that populated and updated by `SampleContext` sounds like a coupling with the reader. E.g, from `SampleContext` point of view, it is questionable why it has to update a std::list but not a std::vector or std::set. We can pass in a SampleContextFrameVector to the new ctor, and similar with the callsites of `getRepInFormat`, the reader should know where to place it, just like the current code. But then the reader has to redo some checks in the ctor. Anyway, feels that the reader will need to check if it is constructing a CS context, if we don't expose `CSNameTable` to `SampleContext`. Since the reader is currently the only consumer, maybe just keep it in the reader for now? When we have a new user we can then decide what kind of new ctor to make? hoy: > This is in essence no different from how string buffer is passed in getRepInFormat (and only…
		wenleiUnsubmitted Not Done Reply Inline Actions Ok, I think we've probably spent too much time on this. But bear with me a bit more, still hope we can reach a consensus. :) That said, whatever we choose to do, I don't agree with your reasoning above. However, CSNameTable is an implementation detail of the reader. Look at it as buffer, and in this case reader happens to own and provide that buffer. Having that populated and updated by SampleContext sounds like a coupling with the reader. E.g, from SampleContext point of view, it is questionable why it has to update a std::list but not a std::vector or std::set. I don't think this is really a concern, nor is it a coupling tbh. Regardless of what reader does, what makes sense for a buffer type? I think std::list simply makes sense as it avoid reallocating. Yes, reader uses std::list, but this is just a choice that makes sense in general for a buffer. If we go down this level, sure every single API call is a coupling because every param has a type. The way I look at this - sample context abstracts away the string representation (again it's independent of format, and text format just uses that string representation directly), and reader resort to sample context for anything related to that. It's probably not so important to have that logical layering enforced, but if the cost of doing that is small, why not? Clear layering makes it easy to maintain and reason about. wenlei: Ok, I think we've probably spent too much time on this. But bear with me a bit more, still hope…
		hoyAuthorUnsubmitted Done Reply Inline Actions I don't think this is really a concern, nor is it a coupling tbh. Regardless of what reader does, what makes sense for a buffer type? I think std::list simply makes sense as it avoid reallocating. Yes, reader uses std::list, but this is just a choice that makes sense in general for a buffer. If we go down this level, sure every single API call is a coupling because every param has a type. Yeah, we probably spent too much time on this. I will make a new ctor with `CSNameTable` as a parameter. hoy: > I don't think this is really a concern, nor is it a coupling tbh. Regardless of what reader…
MergeResult(Result, FProfile.addTotalSamples(NumSamples));		MergeResult(Result, FProfile.addTotalSamples(NumSamples));
MergeResult(Result, FProfile.addHeadSamples(NumHeadSamples));		MergeResult(Result, FProfile.addHeadSamples(NumHeadSamples));
InlineStack.clear();		InlineStack.clear();
InlineStack.push_back(&FProfile);		InlineStack.push_back(&FProfile);
} else {		} else {
uint64_t NumSamples;		uint64_t NumSamples;
StringRef FName;		StringRef FName;
DenseMap<StringRef, uint64_t> TargetCountMap;		DenseMap<StringRef, uint64_t> TargetCountMap;
▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	inline ErrorOr<uint32_t> SampleProfileReaderBinary::readStringIndex(T &Table) {
if (std::error_code EC = Idx.getError())		if (std::error_code EC = Idx.getError())
return EC;		return EC;
if (*Idx >= Table.size())		if (*Idx >= Table.size())
return sampleprof_error::truncated_name_table;		return sampleprof_error::truncated_name_table;
return *Idx;		return *Idx;
}		}

ErrorOr<StringRef> SampleProfileReaderBinary::readStringFromTable() {		ErrorOr<StringRef> SampleProfileReaderBinary::readStringFromTable() {
auto Idx = readStringIndex(NameTable);		auto Idx = readStringIndex(NameTable);
		wenleiUnsubmitted Not Done Reply Inline Actions I think `getNameTable` exist as a virtual function only because of the use in sample loader for `profile-sample-accuarate` and `profile-accurate-for-symsinlist`. Here can we still access the NameTable directly without going through virtual call? Same for other places where `NameTable` is directly accessible. For writer, before this change, we indeed only have `NameTable` for `SampleProfileWriterBinary` without virtual getter, and we access it without going through virtual function call there. wenlei: I think `getNameTable` exist as a virtual function only because of the use in sample loader for…
		hoyAuthorUnsubmitted Done Reply Inline Actions Yes, we can just use `NameTable` directly here. Actually all changes here are unnecessary. I just undid them. hoy: Yes, we can just use `NameTable` directly here. Actually all changes here are unnecessary. I…
if (std::error_code EC = Idx.getError())		if (std::error_code EC = Idx.getError())
return EC;		return EC;

return NameTable[*Idx];		return NameTable[*Idx];
}		}
		wmiUnsubmitted Not Done Reply Inline Actions This param is unused. Write it as readNameFromTable(bool /* IsContextName /) to make it more explicit. wmi:* This param is unused. Write it as readNameFromTable(bool /* IsContextName */) to make it more…
		hoyAuthorUnsubmitted Done Reply Inline Actions Sounds good. hoy: Sounds good.
		hoyAuthorUnsubmitted Done Reply Inline Actions Actually I can only do that in the header file for the function declaration. The function definition needs a real name for param. hoy: Actually I can only do that in the header file for the function declaration. The function…
		wenleiUnsubmitted Not Done Reply Inline Actions Can we avoid this parameter altogether? For one profile, it's either going to be CS or non-CS, so the dispatch can be based on state/member of the reader instead of relying on a param for each invocation. wenlei: Can we avoid this parameter altogether? For one profile, it's either going to be CS or non-CS…
		hoyAuthorUnsubmitted Done Reply Inline Actions Good point. Checking `FunctionSamples::ProfileIsCS` should be enough. hoy: Good point. Checking `FunctionSamples::ProfileIsCS` should be enough.

		ErrorOr<SampleContext> SampleProfileReaderBinary::readSampleContextFromTable() {
		auto FName(readStringFromTable());
		if (std::error_code EC = FName.getError())
		return EC;
		return SampleContext(*FName);
		}

ErrorOr<StringRef> SampleProfileReaderExtBinaryBase::readStringFromTable() {		ErrorOr<StringRef> SampleProfileReaderExtBinaryBase::readStringFromTable() {
if (!FixedLengthMD5)		if (!FixedLengthMD5)
return SampleProfileReaderBinary::readStringFromTable();		return SampleProfileReaderBinary::readStringFromTable();

// read NameTable index.		// read NameTable index.
auto Idx = readStringIndex(NameTable);		auto Idx = readStringIndex(NameTable);
if (std::error_code EC = Idx.getError())		if (std::error_code EC = Idx.getError())
return EC;		return EC;
▲ Show 20 Lines • Show All 105 Lines • ▼ Show 20 Lines	if (std::error_code EC = readProfile(CalleeProfile))
return EC;		return EC;
}		}

return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code		std::error_code
SampleProfileReaderBinary::readFuncProfile(const uint8_t *Start) {		SampleProfileReaderBinary::readFuncProfile(const uint8_t *Start) {
Data = Start;		Data = Start;
		wenleiUnsubmitted Done Reply Inline Actions Same here, avoid `IsContextName` as param, but dispatch based on `SampleProfileReader::ProfileIsCS` wenlei: Same here, avoid `IsContextName` as param, but dispatch based on `SampleProfileReader…
auto NumHeadSamples = readNumber<uint64_t>();		auto NumHeadSamples = readNumber<uint64_t>();
if (std::error_code EC = NumHeadSamples.getError())		if (std::error_code EC = NumHeadSamples.getError())
return EC;		return EC;

auto FName(readStringFromTable());		ErrorOr<SampleContext> FContext(readSampleContextFromTable());
if (std::error_code EC = FName.getError())		if (std::error_code EC = FContext.getError())
		wenleiUnsubmitted Not Done Reply Inline Actions nit: this can be confusing, readNameFromTable can mislead people to think FContext is string (or vector of strings) type. the auto also isn't helping. Either spell out the type name, or rename `readNameFromTable` to something like `readSampleContextFromTable` wenlei: nit: this can be confusing, readNameFromTable can mislead people to think FContext is string…
		hoyAuthorUnsubmitted Done Reply Inline Actions Fixed by using explicit type. hoy: Fixed by using explicit type.
		wenleiUnsubmitted Not Done Reply Inline Actions Ok, but on 2nd thought, why do we call it readName while it's actually returning a context? Especially given that we've renamed addName to AddContext, writeNameIdx to writeContextIdx. wenlei: Ok, but on 2nd thought, why do we call it readName while it's actually returning a context?
		wenleiUnsubmitted Not Done Reply Inline Actions I think it'd be good to establish a convention as to when we call things `Name` vs `Context`. My thought is it goes with the type, what do you think? wenlei: I think it'd be good to establish a convention as to when we call things `Name` vs `Context`.
		hoyAuthorUnsubmitted Done Reply Inline Actions Yeah, I've been using `Name` as an identifier of of the context and function name string, and using `Context` for CS profile and `String` for function names. But sometimes `Name` and `String` as mixed. We are using `SampleContext` for both CS and non-CS. And we are also using the word context specifically for CS. Sounds like we need a more general name (instead of `Name`) in the reader for both of them. How about `readIdFromTable`? hoy: Yeah, I've been using `Name` as an identifier of of the context and function name string, and…
		wenleiUnsubmitted Not Done Reply Inline Actions I've been using Name as an identifier of of the context and function name string, and using Context for CS profile and String for function names. The problem is you can't do that cleanly because there're cases that covers both CS and non-CS. readNameFromTable is one example. I think that name it according to the type makes things clearer. For non-CS profile we're still using SampleContext as key in the profile map anyways. Adding a new notion of `Id` in addition to name and context seem unnecessary (we have Idx already which can confuse people if we add Id). So I think readSampleContextFromTable/readContextFromTable is better. wenlei: > I've been using Name as an identifier of of the context and function name string, and using…
		hoyAuthorUnsubmitted Done Reply Inline Actions Sounds good. Will use `readSampleContextFromTable`. hoy: Sounds good. Will use `readSampleContextFromTable`.
return EC;		return EC;

SampleContext FContext(*FName);		Profiles[*FContext] = FunctionSamples();
Profiles[FContext] = FunctionSamples();		FunctionSamples &FProfile = Profiles[*FContext];
FunctionSamples &FProfile = Profiles[FContext];		FProfile.setContext(*FContext);
FProfile.setName(FContext.getNameWithoutContext());
FProfile.setContext(FContext);
FProfile.addHeadSamples(*NumHeadSamples);		FProfile.addHeadSamples(*NumHeadSamples);

if (FContext.hasContext())		if (FContext->hasContext())
CSProfileCount++;		CSProfileCount++;

if (std::error_code EC = readProfile(FProfile))		if (std::error_code EC = readProfile(FProfile))
return EC;		return EC;
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileReaderBinary::readImpl() {		std::error_code SampleProfileReaderBinary::readImpl() {
ProfileIsFS = ProfileIsFSDisciminator;		ProfileIsFS = ProfileIsFSDisciminator;
while (!at_eof()) {		while (!at_eof()) {
if (std::error_code EC = readFuncProfile(Data))		if (std::error_code EC = readFuncProfile(Data))
return EC;		return EC;
}		}

return sampleprof_error::success;		return sampleprof_error::success;
}		}

		ErrorOr<SampleContextFrames>
		SampleProfileReaderExtBinaryBase::readContextFromTable() {
		auto ContextIdx = readNumber<uint32_t>();
		if (std::error_code EC = ContextIdx.getError())
		return EC;
		if (*ContextIdx >= CSNameTable->size())
		return sampleprof_error::truncated_name_table;
		return (CSNameTable)[ContextIdx];
		}

		ErrorOr<SampleContext>
		SampleProfileReaderExtBinaryBase::readSampleContextFromTable() {
		if (ProfileIsCS) {
		auto FContext(readContextFromTable());
		if (std::error_code EC = FContext.getError())
		return EC;
		return SampleContext(*FContext);
		} else {
		auto FName(readStringFromTable());
		if (std::error_code EC = FName.getError())
		return EC;
		return SampleContext(*FName);
		}
		}

std::error_code SampleProfileReaderExtBinaryBase::readOneSection(		std::error_code SampleProfileReaderExtBinaryBase::readOneSection(
const uint8_t *Start, uint64_t Size, const SecHdrTableEntry &Entry) {		const uint8_t *Start, uint64_t Size, const SecHdrTableEntry &Entry) {
Data = Start;		Data = Start;
End = Start + Size;		End = Start + Size;
switch (Entry.Type) {		switch (Entry.Type) {
case SecProfSummary:		case SecProfSummary:
if (std::error_code EC = readSummary())		if (std::error_code EC = readSummary())
return EC;		return EC;
Show All 11 Lines	case SecNameTable: {
assert((!FixedLengthMD5 \|\| UseMD5) &&		assert((!FixedLengthMD5 \|\| UseMD5) &&
"If FixedLengthMD5 is true, UseMD5 has to be true");		"If FixedLengthMD5 is true, UseMD5 has to be true");
FunctionSamples::HasUniqSuffix =		FunctionSamples::HasUniqSuffix =
hasSecFlag(Entry, SecNameTableFlags::SecFlagUniqSuffix);		hasSecFlag(Entry, SecNameTableFlags::SecFlagUniqSuffix);
if (std::error_code EC = readNameTableSec(UseMD5))		if (std::error_code EC = readNameTableSec(UseMD5))
return EC;		return EC;
break;		break;
}		}
		case SecCSNameTable: {
		if (std::error_code EC = readCSNameTableSec())
		return EC;
		break;
		}
case SecLBRProfile:		case SecLBRProfile:
if (std::error_code EC = readFuncProfiles())		if (std::error_code EC = readFuncProfiles())
return EC;		return EC;
break;		break;
case SecFuncOffsetTable:		case SecFuncOffsetTable:
if (std::error_code EC = readFuncOffsetTable())		if (std::error_code EC = readFuncOffsetTable())
return EC;		return EC;
break;		break;
Show All 35 Lines	std::error_code SampleProfileReaderExtBinaryBase::readFuncOffsetTable() {
FuncOffsetTable.clear();		FuncOffsetTable.clear();

auto Size = readNumber<uint64_t>();		auto Size = readNumber<uint64_t>();
if (std::error_code EC = Size.getError())		if (std::error_code EC = Size.getError())
return EC;		return EC;

FuncOffsetTable.reserve(*Size);		FuncOffsetTable.reserve(*Size);
for (uint32_t I = 0; I < *Size; ++I) {		for (uint32_t I = 0; I < *Size; ++I) {
auto FName(readStringFromTable());		auto FName(readSampleContextFromTable());
		wenleiUnsubmitted Not Done Reply Inline Actions FName to FContext as well? wenlei: FName to FContext as well?
		hoyAuthorUnsubmitted Done Reply Inline Actions Oops, this was fixed in my other unified patch, somehow got dropped when rebasing. Will send an update to the other patch as you suggested. hoy: Oops, this was fixed in my other unified patch, somehow got dropped when rebasing. Will send an…
if (std::error_code EC = FName.getError())		if (std::error_code EC = FName.getError())
return EC;		return EC;

auto Offset = readNumber<uint64_t>();		auto Offset = readNumber<uint64_t>();
if (std::error_code EC = Offset.getError())		if (std::error_code EC = Offset.getError())
return EC;		return EC;

FuncOffsetTable[FName] = Offset;		FuncOffsetTable[FName] = Offset;
Show All 10 Lines	std::error_code SampleProfileReaderExtBinaryBase::readFuncProfiles() {
// NameTable section is read.		// NameTable section is read.
bool LoadFuncsToBeUsed = collectFuncsFromModule();		bool LoadFuncsToBeUsed = collectFuncsFromModule();

// When LoadFuncsToBeUsed is false, load all the function profiles.		// When LoadFuncsToBeUsed is false, load all the function profiles.
const uint8_t *Start = Data;		const uint8_t *Start = Data;
if (!LoadFuncsToBeUsed) {		if (!LoadFuncsToBeUsed) {
while (Data < End) {		while (Data < End) {
if (std::error_code EC = readFuncProfile(Data))		if (std::error_code EC = readFuncProfile(Data))
return EC;		return EC;
		wenleiUnsubmitted Not Done Reply Inline Actions Since we have `SampleProfileReader::ProfileIsCS`, within reader it probable makes more sense to use the reader instance flag as opposed to the global FunctionSamples::ProfileIsCS flag. We were not very consistent in past, might be good to clean up as you're expanding the use of ProfileIsCS. wenlei: Since we have `SampleProfileReader::ProfileIsCS`, within reader it probable makes more sense to…
		hoyAuthorUnsubmitted Done Reply Inline Actions That's reasonable. Replaced all `FunctionSamples::ProfileIsCS` usage in the reader with the reader `ProfileIsCS` flag. hoy: That's reasonable. Replaced all `FunctionSamples::ProfileIsCS` usage in the reader with the…
}		}
assert(Data == End && "More data is read than expected");		assert(Data == End && "More data is read than expected");
} else {		} else {
// Load function profiles on demand.		// Load function profiles on demand.
if (Remapper) {		if (Remapper) {
for (auto Name : FuncsToUse) {		for (auto Name : FuncsToUse) {
Remapper->insert(Name);		Remapper->insert(Name);
}		}
}		}

if (useMD5()) {		if (useMD5()) {
for (auto Name : FuncsToUse) {		for (auto Name : FuncsToUse) {
auto GUID = std::to_string(MD5Hash(Name));		auto GUID = std::to_string(MD5Hash(Name));
auto iter = FuncOffsetTable.find(StringRef(GUID));		auto iter = FuncOffsetTable.find(StringRef(GUID));
if (iter == FuncOffsetTable.end())		if (iter == FuncOffsetTable.end())
continue;		continue;
const uint8_t *FuncProfileAddr = Start + iter->second;		const uint8_t *FuncProfileAddr = Start + iter->second;
assert(FuncProfileAddr < End && "out of LBRProfile section");		assert(FuncProfileAddr < End && "out of LBRProfile section");
if (std::error_code EC = readFuncProfile(FuncProfileAddr))		if (std::error_code EC = readFuncProfile(FuncProfileAddr))
return EC;		return EC;
}		}
} else if (FunctionSamples::ProfileIsCS) {		} else if (ProfileIsCS) {
// Compute the ordered set of names, so we can		// Compute the ordered set of names, so we can
// get all context profiles under a subtree by		// get all context profiles under a subtree by
// iterating through the ordered names.		// iterating through the ordered names.
struct Comparer {		std::set<SampleContext> OrderedContexts;
		wenleiUnsubmitted Not Done Reply Inline Actions OrderedNames -> OrderedContexts wenlei: OrderedNames -> OrderedContexts
		hoyAuthorUnsubmitted Done Reply Inline Actions fixed. hoy: fixed.
// Ignore the closing ']' when ordering context
bool operator()(const StringRef &L, const StringRef &R) const {
return L.substr(0, L.size() - 1) < R.substr(0, R.size() - 1);
}
};
std::set<StringRef, Comparer> OrderedNames;
for (auto Name : FuncOffsetTable) {		for (auto Name : FuncOffsetTable) {
OrderedNames.insert(Name.first);		OrderedContexts.insert(Name.first);
}		}

// For each function in current module, load all		// For each function in current module, load all
// context profiles for the function.		// context profiles for the function.
for (auto NameOffset : FuncOffsetTable) {		for (auto NameOffset : FuncOffsetTable) {
StringRef ContextName = NameOffset.first;		SampleContext FContext = NameOffset.first;
SampleContext FContext(ContextName);		auto FuncName = FContext.getName();
auto FuncName = FContext.getNameWithoutContext();
if (!FuncsToUse.count(FuncName) &&		if (!FuncsToUse.count(FuncName) &&
(!Remapper \|\| !Remapper->exist(FuncName)))		(!Remapper \|\| !Remapper->exist(FuncName)))
continue;		continue;

// For each context profile we need, try to load		// For each context profile we need, try to load
// all context profile in the subtree. This can		// all context profile in the subtree. This can
// help profile guided importing for ThinLTO.		// help profile guided importing for ThinLTO.
auto It = OrderedNames.find(ContextName);		auto It = OrderedContexts.find(FContext);
while (It != OrderedNames.end() &&		while (It != OrderedContexts.end() && FContext.IsPrefixOf(*It)) {
It->startswith(ContextName.substr(0, ContextName.size() - 1))) {
const uint8_t FuncProfileAddr = Start + FuncOffsetTable[It];		const uint8_t FuncProfileAddr = Start + FuncOffsetTable[It];
		wenleiUnsubmitted Not Done Reply Inline Actions Which operation here is the most costly and visible for e2e build time? The insert/sort/find or the prefix check? What's the % of time here for both prelink and postlink? wenlei: Which operation here is the most costly and visible for e2e build time? The insert/sort/find or…
		hoyAuthorUnsubmitted Done Reply Inline Actions The insert/sort/find operations are the most expansive. For thinlto postlink, they count to 15% to 20% of the whole backend running time. I haven't measured for prelink but given the similarity of prelink and postlink, they should be expansive there too. I'm actually working on a sample writer change that emits the func offset table in the order of contexts so that we don't need the set operations here. That turns out very effective. Will send out a separate diff for it. hoy: The insert/sort/find operations are the most expansive. For thinlto postlink, they count to 15%…
		wenleiUnsubmitted Not Done Reply Inline Actions I'm actually working on a sample writer change that emits the func offset table in the order of contexts so that we don't need the set operations here. That turns out very effective. Great. I was thinking about exactly that too. The order of binary profile is opaque to users, so we can order them in the file to save sorting on reading. For thinlto postlink, they count to 15% to 20% of the whole backend running time. What was the % before this work when we were all using StringRef? wenlei: > I'm actually working on a sample writer change that emits the func offset table in the order…
		hoyAuthorUnsubmitted Done Reply Inline Actions What was the % before this work when we were all using StringRef? It's not noticeable. Actually I didn't see the previous sorting as a hot routine in the profile. With the presorted func offset stable, we are able to achieve similar performance with the previous approach. hoy: > What was the % before this work when we were all using StringRef? It's not noticeable.
assert(FuncProfileAddr < End && "out of LBRProfile section");		assert(FuncProfileAddr < End && "out of LBRProfile section");
if (std::error_code EC = readFuncProfile(FuncProfileAddr))		if (std::error_code EC = readFuncProfile(FuncProfileAddr))
return EC;		return EC;
// Remove loaded context profile so we won't		// Remove loaded context profile so we won't
// load it repeatedly.		// load it repeatedly.
It = OrderedNames.erase(It);		It = OrderedContexts.erase(It);
}		}
}		}
} else {		} else {
for (auto NameOffset : FuncOffsetTable) {		for (auto NameOffset : FuncOffsetTable) {
SampleContext FContext(NameOffset.first);		SampleContext FContext(NameOffset.first);
auto FuncName = FContext.getNameWithoutContext();		auto FuncName = FContext.getName();
if (!FuncsToUse.count(FuncName) &&		if (!FuncsToUse.count(FuncName) &&
(!Remapper \|\| !Remapper->exist(FuncName)))		(!Remapper \|\| !Remapper->exist(FuncName)))
continue;		continue;
const uint8_t *FuncProfileAddr = Start + NameOffset.second;		const uint8_t *FuncProfileAddr = Start + NameOffset.second;
assert(FuncProfileAddr < End && "out of LBRProfile section");		assert(FuncProfileAddr < End && "out of LBRProfile section");
if (std::error_code EC = readFuncProfile(FuncProfileAddr))		if (std::error_code EC = readFuncProfile(FuncProfileAddr))
return EC;		return EC;
}		}
▲ Show 20 Lines • Show All 191 Lines • ▼ Show 20 Lines
}		}

std::error_code SampleProfileReaderExtBinaryBase::readNameTableSec(bool IsMD5) {		std::error_code SampleProfileReaderExtBinaryBase::readNameTableSec(bool IsMD5) {
if (IsMD5)		if (IsMD5)
return readMD5NameTable();		return readMD5NameTable();
return SampleProfileReaderBinary::readNameTable();		return SampleProfileReaderBinary::readNameTable();
}		}

std::error_code		// Read in the CS name table section, which basically contains a list of context
		wenleiUnsubmitted Not Done Reply Inline Actions Comment added to the definition of SampleProfileReaderExtBinaryBase::readCSNameTableSec. Not seeing comments in the latest update. Did I miss anything? wenlei: > Comment added to the definition of SampleProfileReaderExtBinaryBase::readCSNameTableSec.
SampleProfileReaderExtBinaryBase::readFuncMetadata(bool ProfileHasAttribute) {		// vectors. Each element of a context vector, aka a frame, refers to the
while (Data < End) {		// underlying raw function names that are stored in the name table, as well as
		// a callsite identifier that only makes sense for non-leaf frames.
		std::error_code SampleProfileReaderExtBinaryBase::readCSNameTableSec() {
		auto Size = readNumber<uint32_t>();
		if (std::error_code EC = Size.getError())
		return EC;

		std::vector<SampleContextFrameVector> *PNameVec =
		new std::vector<SampleContextFrameVector>();
		PNameVec->reserve(*Size);
		for (uint32_t I = 0; I < *Size; ++I) {
		PNameVec->emplace_back(SampleContextFrameVector());
		auto ContextSize = readNumber<uint32_t>();
		if (std::error_code EC = ContextSize.getError())
		return EC;
		for (uint32_t J = 0; J < *ContextSize; ++J) {
auto FName(readStringFromTable());		auto FName(readStringFromTable());
if (std::error_code EC = FName.getError())		if (std::error_code EC = FName.getError())
return EC;		return EC;
		auto LineOffset = readNumber<uint64_t>();
		if (std::error_code EC = LineOffset.getError())
		return EC;

		if (!isOffsetLegal(*LineOffset))
		return std::error_code();

		auto Discriminator = readNumber<uint64_t>();
		wenleiUnsubmitted Done Reply Inline Actions Let emplace_back take variadic arguments and forward to the constructor directly instead of doing a move copy? wenlei: Let emplace_back take variadic arguments and forward to the constructor directly instead of…
		if (std::error_code EC = Discriminator.getError())
		return EC;

		PNameVec->back().emplace_back(
		wenleiUnsubmitted Not Done Reply Inline Actions How about we emplace an empty slot before going into the inner loop, then we can operate on the vector directly (`PNameVec.back().emplace_back(...)`) and avoid copying temporary `Context` onto `PNameVec`? wenlei: How about we emplace an empty slot before going into the inner loop, then we can operate on the…
		hoyAuthorUnsubmitted Done Reply Inline Actions Sounds good, that should be faster. hoy: Sounds good, that should be faster.
		FName.get(), LineLocation(LineOffset.get(), Discriminator.get()));
		}
		}

SampleContext FContext(*FName);		// From this point the underlying object of CSNameTable should be immutable.
bool ProfileInMap = Profiles.count(FContext);		CSNameTable.reset(PNameVec);
		return sampleprof_error::success;
		}

		std::error_code
		SampleProfileReaderExtBinaryBase::readFuncMetadata(bool ProfileHasAttribute) {
		while (Data < End) {
		auto FContext(readSampleContextFromTable());
		if (std::error_code EC = FContext.getError())
		return EC;

		bool ProfileInMap = Profiles.count(*FContext);
		wenleiUnsubmitted Not Done Reply Inline Actions nit: peal/hoist the `get()` so we don't have to call it for every use. wenlei: nit: peal/hoist the `get()` so we don't have to call it for every use.
		hoyAuthorUnsubmitted Done Reply Inline Actions You mean save `FContext.get()` in a temp and use it in the loop? Thought the compiler would do it. hoy: You mean save `FContext.get()` in a temp and use it in the loop? Thought the compiler would do…
		wenleiUnsubmitted Not Done Reply Inline Actions Yeah, it's less about optimization but for readability to make the code less verbose. wenlei: Yeah, it's less about optimization but for readability to make the code less verbose.
		hoyAuthorUnsubmitted Done Reply Inline Actions Changed to using `FContext` for readability, like other places. Look good? Alternatively, we can have two variables such as `FContextError` and `FContext`. hoy:* Changed to using `*FContext` for readability, like other places. Look good? Alternatively, we…
		wenleiUnsubmitted Not Done Reply Inline Actions either works, thanks. wenlei: either works, thanks.
if (ProfileIsProbeBased) {		if (ProfileIsProbeBased) {
auto Checksum = readNumber<uint64_t>();		auto Checksum = readNumber<uint64_t>();
if (std::error_code EC = Checksum.getError())		if (std::error_code EC = Checksum.getError())
return EC;		return EC;
if (ProfileInMap)		if (ProfileInMap)
Profiles[FContext].setFunctionHash(*Checksum);		Profiles[FContext].setFunctionHash(Checksum);
}		}

if (ProfileHasAttribute) {		if (ProfileHasAttribute) {
auto Attributes = readNumber<uint32_t>();		auto Attributes = readNumber<uint32_t>();
if (std::error_code EC = Attributes.getError())		if (std::error_code EC = Attributes.getError())
return EC;		return EC;
if (ProfileInMap)		if (ProfileInMap)
Profiles[FContext].getContext().setAllAttributes(*Attributes);		Profiles[FContext].getContext().setAllAttributes(Attributes);
}		}
}		}

assert(Data == End && "More data is read than expected");		assert(Data == End && "More data is read than expected");
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileReaderCompactBinary::readNameTable() {		std::error_code SampleProfileReaderCompactBinary::readNameTable() {
▲ Show 20 Lines • Show All 725 Lines • Show Last 20 Lines

llvm/lib/ProfileData/SampleProfWriter.cpp

Show All 35 Lines
#include <set>		#include <set>
#include <system_error>		#include <system_error>
#include <utility>		#include <utility>
#include <vector>		#include <vector>

using namespace llvm;		using namespace llvm;
using namespace sampleprof;		using namespace sampleprof;

std::error_code SampleProfileWriter::writeFuncProfiles(		std::error_code
const StringMap<FunctionSamples> &ProfileMap) {		SampleProfileWriter::writeFuncProfiles(const SampleProfileMap &ProfileMap) {
std::vector<NameFunctionSamples> V;		std::vector<NameFunctionSamples> V;
sortFuncProfiles(ProfileMap, V);		sortFuncProfiles(ProfileMap, V);
for (const auto &I : V) {		for (const auto &I : V) {
if (std::error_code EC = writeSample(*I.second))		if (std::error_code EC = writeSample(*I.second))
return EC;		return EC;
}		}
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code		std::error_code SampleProfileWriter::write(const SampleProfileMap &ProfileMap) {
SampleProfileWriter::write(const StringMap<FunctionSamples> &ProfileMap) {
if (std::error_code EC = writeHeader(ProfileMap))		if (std::error_code EC = writeHeader(ProfileMap))
return EC;		return EC;

if (std::error_code EC = writeFuncProfiles(ProfileMap))		if (std::error_code EC = writeFuncProfiles(ProfileMap))
return EC;		return EC;

return sampleprof_error::success;		return sampleprof_error::success;
}		}
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	if (hasSecFlag(Entry, SecCommonFlags::SecFlagCompress)) {
if (std::error_code EC = compressAndOutput())		if (std::error_code EC = compressAndOutput())
return EC;		return EC;
}		}
SecHdrTable.push_back({Type, Entry.Flags, SectionStart - FileStart,		SecHdrTable.push_back({Type, Entry.Flags, SectionStart - FileStart,
OutputStream->tell() - SectionStart, LayoutIdx});		OutputStream->tell() - SectionStart, LayoutIdx});
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterExtBinaryBase::write(		std::error_code
const StringMap<FunctionSamples> &ProfileMap) {		SampleProfileWriterExtBinaryBase::write(const SampleProfileMap &ProfileMap) {
if (std::error_code EC = writeHeader(ProfileMap))		if (std::error_code EC = writeHeader(ProfileMap))
return EC;		return EC;

std::string LocalBuf;		std::string LocalBuf;
LocalBufStream = std::make_unique<raw_string_ostream>(LocalBuf);		LocalBufStream = std::make_unique<raw_string_ostream>(LocalBuf);
if (std::error_code EC = writeSections(ProfileMap))		if (std::error_code EC = writeSections(ProfileMap))
return EC;		return EC;

if (std::error_code EC = writeSecHdrTable())		if (std::error_code EC = writeSecHdrTable())
return EC;		return EC;

return sampleprof_error::success;		return sampleprof_error::success;
}		}

		std::error_code SampleProfileWriterExtBinaryBase::writeContextIdx(
		const SampleContext &Context) {
		if (Context.hasContext())
		return writeCSNameIdx(Context);
		else
		return SampleProfileWriterBinary::writeNameIdx(Context.getName());
		}

		std::error_code
		SampleProfileWriterExtBinaryBase::writeCSNameIdx(const SampleContext &Context) {
		const auto &Ret = CSNameTable.find(Context);
		if (Ret == CSNameTable.end())
		return sampleprof_error::truncated_name_table;
		encodeULEB128(Ret->second, *OutputStream);
		return sampleprof_error::success;
		}

std::error_code		std::error_code
SampleProfileWriterExtBinaryBase::writeSample(const FunctionSamples &S) {		SampleProfileWriterExtBinaryBase::writeSample(const FunctionSamples &S) {
uint64_t Offset = OutputStream->tell();		uint64_t Offset = OutputStream->tell();
StringRef Name = S.getNameWithContext();		auto &Context = S.getContext();
FuncOffsetTable[Name] = Offset - SecLBRProfileStart;		FuncOffsetTable[Context] = Offset - SecLBRProfileStart;
encodeULEB128(S.getHeadSamples(), *OutputStream);		encodeULEB128(S.getHeadSamples(), *OutputStream);
return writeBody(S);		return writeBody(S);
}		}

std::error_code SampleProfileWriterExtBinaryBase::writeFuncOffsetTable() {		std::error_code SampleProfileWriterExtBinaryBase::writeFuncOffsetTable() {
auto &OS = *OutputStream;		auto &OS = *OutputStream;

// Write out the table size.		// Write out the table size.
encodeULEB128(FuncOffsetTable.size(), OS);		encodeULEB128(FuncOffsetTable.size(), OS);

// Write out FuncOffsetTable.		// Write out FuncOffsetTable.
for (auto Entry : FuncOffsetTable) {		for (auto Entry : FuncOffsetTable) {
if (std::error_code EC =		if (std::error_code EC = writeContextIdx(Entry.first))
writeNameIdx(Entry.first, FunctionSamples::ProfileIsCS))
return EC;		return EC;
encodeULEB128(Entry.second, OS);		encodeULEB128(Entry.second, OS);
}		}
FuncOffsetTable.clear();		FuncOffsetTable.clear();
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterExtBinaryBase::writeFuncMetadata(		std::error_code SampleProfileWriterExtBinaryBase::writeFuncMetadata(
const StringMap<FunctionSamples> &Profiles) {		const SampleProfileMap &Profiles) {
if (!FunctionSamples::ProfileIsProbeBased && !FunctionSamples::ProfileIsCS)		if (!FunctionSamples::ProfileIsProbeBased && !FunctionSamples::ProfileIsCS)
return sampleprof_error::success;		return sampleprof_error::success;
auto &OS = *OutputStream;		auto &OS = *OutputStream;
for (const auto &Entry : Profiles) {		for (const auto &Entry : Profiles) {
if (std::error_code EC = writeNameIdx(Entry.second.getNameWithContext(),		if (std::error_code EC = writeContextIdx(Entry.second.getContext()))
FunctionSamples::ProfileIsCS))
return EC;		return EC;
if (FunctionSamples::ProfileIsProbeBased)		if (FunctionSamples::ProfileIsProbeBased)
encodeULEB128(Entry.second.getFunctionHash(), OS);		encodeULEB128(Entry.second.getFunctionHash(), OS);
if (FunctionSamples::ProfileIsCS)		if (FunctionSamples::ProfileIsCS)
encodeULEB128(Entry.second.getContext().getAllAttributes(), OS);		encodeULEB128(Entry.second.getContext().getAllAttributes(), OS);
}		}
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterExtBinaryBase::writeNameTable() {		std::error_code SampleProfileWriterExtBinaryBase::writeNameTable() {
if (!UseMD5)		if (!UseMD5)
return SampleProfileWriterBinary::writeNameTable();		return SampleProfileWriterBinary::writeNameTable();

auto &OS = *OutputStream;		auto &OS = *OutputStream;
std::set<StringRef> V;		std::set<StringRef> V;
stablizeNameTable(V);		stablizeNameTable(NameTable, V);

// Write out the MD5 name table. We wrote unencoded MD5 so reader can		// Write out the MD5 name table. We wrote unencoded MD5 so reader can
// retrieve the name using the name index without having to read the		// retrieve the name using the name index without having to read the
// whole name table.		// whole name table.
encodeULEB128(NameTable.size(), OS);		encodeULEB128(NameTable.size(), OS);
support::endian::Writer Writer(OS, support::little);		support::endian::Writer Writer(OS, support::little);
for (auto N : V)		for (auto N : V)
Writer.write(MD5Hash(N));		Writer.write(MD5Hash(N));
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterExtBinaryBase::writeNameTableSection(		std::error_code SampleProfileWriterExtBinaryBase::writeNameTableSection(
const StringMap<FunctionSamples> &ProfileMap) {		const SampleProfileMap &ProfileMap) {
for (const auto &I : ProfileMap) {		for (const auto &I : ProfileMap) {
assert(I.first() == I.second.getNameWithContext() &&		assert(I.first == I.second.getContext() && "Inconsistent profile map");
"Inconsistent profile map");		addContext(I.second.getContext());
addName(I.second.getNameWithContext(), FunctionSamples::ProfileIsCS);
addNames(I.second);		addNames(I.second);
}		}

// If NameTable contains ".__uniq." suffix, set SecFlagUniqSuffix flag		// If NameTable contains ".__uniq." suffix, set SecFlagUniqSuffix flag
// so compiler won't strip the suffix during profile matching after		// so compiler won't strip the suffix during profile matching after
// seeing the flag in the profile.		// seeing the flag in the profile.
for (const auto &I : NameTable) {		for (const auto &I : NameTable) {
if (I.first.find(FunctionSamples::UniqSuffix) != StringRef::npos) {		if (I.first.find(FunctionSamples::UniqSuffix) != StringRef::npos) {
addSectionFlag(SecNameTable, SecNameTableFlags::SecFlagUniqSuffix);		addSectionFlag(SecNameTable, SecNameTableFlags::SecFlagUniqSuffix);
break;		break;
}		}
}		}

if (auto EC = writeNameTable())		if (auto EC = writeNameTable())
return EC;		return EC;
return sampleprof_error::success;		return sampleprof_error::success;
}		}

		std::error_code SampleProfileWriterExtBinaryBase::writeCSNameTableSection() {
		// Sort the names to make CSNameTable deterministic.
		std::set<SampleContext> OrderedContexts;
		for (const auto &I : CSNameTable)
		OrderedContexts.insert(I.first);
		assert(OrderedContexts.size() == CSNameTable.size() &&
		"Unmatched ordered and unordered contexts");
		uint64_t I = 0;
		for (auto &Context : OrderedContexts)
		CSNameTable[Context] = I++;

		auto &OS = *OutputStream;
		encodeULEB128(OrderedContexts.size(), OS);
		support::endian::Writer Writer(OS, support::little);
		for (auto Context : OrderedContexts) {
		auto Frames = Context.getContextFrames();
		wenleiUnsubmitted Not Done Reply Inline Actions nit: this contains leaf frame, so it's `Frames` instead of `Callsites`. wenlei: nit: this contains leaf frame, so it's `Frames` instead of `Callsites`.
		hoyAuthorUnsubmitted Done Reply Inline Actions `Frames` sounds good. hoy: `Frames` sounds good.
		encodeULEB128(Frames.size(), OS);
		for (auto &Callsite : Frames) {
		if (std::error_code EC = writeNameIdx(Callsite.CallerName))
		return EC;
		encodeULEB128(Callsite.Callsite.LineOffset, OS);
		encodeULEB128(Callsite.Callsite.Discriminator, OS);
		}
		}

		return sampleprof_error::success;
		}

std::error_code		std::error_code
SampleProfileWriterExtBinaryBase::writeProfileSymbolListSection() {		SampleProfileWriterExtBinaryBase::writeProfileSymbolListSection() {
if (ProfSymList && ProfSymList->size() > 0)		if (ProfSymList && ProfSymList->size() > 0)
if (std::error_code EC = ProfSymList->write(*OutputStream))		if (std::error_code EC = ProfSymList->write(*OutputStream))
return EC;		return EC;

return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterExtBinaryBase::writeOneSection(		std::error_code SampleProfileWriterExtBinaryBase::writeOneSection(
SecType Type, uint32_t LayoutIdx,		SecType Type, uint32_t LayoutIdx, const SampleProfileMap &ProfileMap) {
const StringMap<FunctionSamples> &ProfileMap) {
// The setting of SecFlagCompress should happen before markSectionStart.		// The setting of SecFlagCompress should happen before markSectionStart.
if (Type == SecProfileSymbolList && ProfSymList && ProfSymList->toCompress())		if (Type == SecProfileSymbolList && ProfSymList && ProfSymList->toCompress())
setToCompressSection(SecProfileSymbolList);		setToCompressSection(SecProfileSymbolList);
if (Type == SecFuncMetadata && FunctionSamples::ProfileIsProbeBased)		if (Type == SecFuncMetadata && FunctionSamples::ProfileIsProbeBased)
addSectionFlag(SecFuncMetadata, SecFuncMetadataFlags::SecFlagIsProbeBased);		addSectionFlag(SecFuncMetadata, SecFuncMetadataFlags::SecFlagIsProbeBased);
if (Type == SecProfSummary && FunctionSamples::ProfileIsCS)		if (Type == SecProfSummary && FunctionSamples::ProfileIsCS)
addSectionFlag(SecProfSummary, SecProfSummaryFlags::SecFlagFullContext);		addSectionFlag(SecProfSummary, SecProfSummaryFlags::SecFlagFullContext);
if (Type == SecFuncMetadata && FunctionSamples::ProfileIsCS)		if (Type == SecFuncMetadata && FunctionSamples::ProfileIsCS)
addSectionFlag(SecFuncMetadata, SecFuncMetadataFlags::SecFlagHasAttribute);		addSectionFlag(SecFuncMetadata, SecFuncMetadataFlags::SecFlagHasAttribute);
if (Type == SecProfSummary && FunctionSamples::ProfileIsFS)		if (Type == SecProfSummary && FunctionSamples::ProfileIsFS)
addSectionFlag(SecProfSummary, SecProfSummaryFlags::SecFlagFSDiscriminator);		addSectionFlag(SecProfSummary, SecProfSummaryFlags::SecFlagFSDiscriminator);

uint64_t SectionStart = markSectionStart(Type, LayoutIdx);		uint64_t SectionStart = markSectionStart(Type, LayoutIdx);
switch (Type) {		switch (Type) {
case SecProfSummary:		case SecProfSummary:
computeSummary(ProfileMap);		computeSummary(ProfileMap);
if (auto EC = writeSummary())		if (auto EC = writeSummary())
return EC;		return EC;
break;		break;
case SecNameTable:		case SecNameTable:
if (auto EC = writeNameTableSection(ProfileMap))		if (auto EC = writeNameTableSection(ProfileMap))
return EC;		return EC;
break;		break;
		case SecCSNameTable:
		if (auto EC = writeCSNameTableSection())
		return EC;
		break;
case SecLBRProfile:		case SecLBRProfile:
SecLBRProfileStart = OutputStream->tell();		SecLBRProfileStart = OutputStream->tell();
if (std::error_code EC = writeFuncProfiles(ProfileMap))		if (std::error_code EC = writeFuncProfiles(ProfileMap))
return EC;		return EC;
break;		break;
case SecFuncOffsetTable:		case SecFuncOffsetTable:
if (auto EC = writeFuncOffsetTable())		if (auto EC = writeFuncOffsetTable())
return EC;		return EC;
Show All 12 Lines	default:
break;		break;
}		}
if (std::error_code EC = addNewSection(Type, LayoutIdx, SectionStart))		if (std::error_code EC = addNewSection(Type, LayoutIdx, SectionStart))
return EC;		return EC;
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterExtBinary::writeDefaultLayout(		std::error_code SampleProfileWriterExtBinary::writeDefaultLayout(
const StringMap<FunctionSamples> &ProfileMap) {		const SampleProfileMap &ProfileMap) {
// The const indices passed to writeOneSection below are specifying the		// The const indices passed to writeOneSection below are specifying the
// positions of the sections in SectionHdrLayout. Look at		// positions of the sections in SectionHdrLayout. Look at
// initSectionHdrLayout to find out where each section is located in		// initSectionHdrLayout to find out where each section is located in
// SectionHdrLayout.		// SectionHdrLayout.
if (auto EC = writeOneSection(SecProfSummary, 0, ProfileMap))		if (auto EC = writeOneSection(SecProfSummary, 0, ProfileMap))
return EC;		return EC;
if (auto EC = writeOneSection(SecNameTable, 1, ProfileMap))		if (auto EC = writeOneSection(SecNameTable, 1, ProfileMap))
return EC;		return EC;
if (auto EC = writeOneSection(SecLBRProfile, 3, ProfileMap))		if (auto EC = writeOneSection(SecCSNameTable, 2, ProfileMap))
		return EC;
		if (auto EC = writeOneSection(SecLBRProfile, 4, ProfileMap))
return EC;		return EC;
if (auto EC = writeOneSection(SecProfileSymbolList, 4, ProfileMap))		if (auto EC = writeOneSection(SecProfileSymbolList, 5, ProfileMap))
return EC;		return EC;
if (auto EC = writeOneSection(SecFuncOffsetTable, 2, ProfileMap))		if (auto EC = writeOneSection(SecFuncOffsetTable, 3, ProfileMap))
return EC;		return EC;
if (auto EC = writeOneSection(SecFuncMetadata, 5, ProfileMap))		if (auto EC = writeOneSection(SecFuncMetadata, 6, ProfileMap))
return EC;		return EC;
return sampleprof_error::success;		return sampleprof_error::success;
}		}

static void		static void splitProfileMapToTwo(const SampleProfileMap &ProfileMap,
splitProfileMapToTwo(const StringMap<FunctionSamples> &ProfileMap,		SampleProfileMap &ContextProfileMap,
StringMap<FunctionSamples> &ContextProfileMap,		SampleProfileMap &NoContextProfileMap) {
StringMap<FunctionSamples> &NoContextProfileMap) {
for (const auto &I : ProfileMap) {		for (const auto &I : ProfileMap) {
if (I.second.getCallsiteSamples().size())		if (I.second.getCallsiteSamples().size())
ContextProfileMap.insert({I.first(), I.second});		ContextProfileMap.insert({I.first, I.second});
else		else
NoContextProfileMap.insert({I.first(), I.second});		NoContextProfileMap.insert({I.first, I.second});
}		}
}		}

std::error_code SampleProfileWriterExtBinary::writeCtxSplitLayout(		std::error_code SampleProfileWriterExtBinary::writeCtxSplitLayout(
const StringMap<FunctionSamples> &ProfileMap) {		const SampleProfileMap &ProfileMap) {
StringMap<FunctionSamples> ContextProfileMap, NoContextProfileMap;		SampleProfileMap ContextProfileMap, NoContextProfileMap;
splitProfileMapToTwo(ProfileMap, ContextProfileMap, NoContextProfileMap);		splitProfileMapToTwo(ProfileMap, ContextProfileMap, NoContextProfileMap);

if (auto EC = writeOneSection(SecProfSummary, 0, ProfileMap))		if (auto EC = writeOneSection(SecProfSummary, 0, ProfileMap))
return EC;		return EC;
if (auto EC = writeOneSection(SecNameTable, 1, ProfileMap))		if (auto EC = writeOneSection(SecNameTable, 1, ProfileMap))
return EC;		return EC;
if (auto EC = writeOneSection(SecLBRProfile, 3, ContextProfileMap))		if (auto EC = writeOneSection(SecLBRProfile, 3, ContextProfileMap))
return EC;		return EC;
Show All 13 Lines	if (auto EC = writeOneSection(SecProfileSymbolList, 6, ProfileMap))
return EC;		return EC;
if (auto EC = writeOneSection(SecFuncMetadata, 7, ProfileMap))		if (auto EC = writeOneSection(SecFuncMetadata, 7, ProfileMap))
return EC;		return EC;

return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterExtBinary::writeSections(		std::error_code SampleProfileWriterExtBinary::writeSections(
const StringMap<FunctionSamples> &ProfileMap) {		const SampleProfileMap &ProfileMap) {
std::error_code EC;		std::error_code EC;
if (SecLayout == DefaultLayout)		if (SecLayout == DefaultLayout)
EC = writeDefaultLayout(ProfileMap);		EC = writeDefaultLayout(ProfileMap);
else if (SecLayout == CtxSplitLayout)		else if (SecLayout == CtxSplitLayout)
EC = writeCtxSplitLayout(ProfileMap);		EC = writeCtxSplitLayout(ProfileMap);
else		else
llvm_unreachable("Unsupported layout");		llvm_unreachable("Unsupported layout");
return EC;		return EC;
}		}

std::error_code SampleProfileWriterCompactBinary::write(		std::error_code
const StringMap<FunctionSamples> &ProfileMap) {		SampleProfileWriterCompactBinary::write(const SampleProfileMap &ProfileMap) {
if (std::error_code EC = SampleProfileWriter::write(ProfileMap))		if (std::error_code EC = SampleProfileWriter::write(ProfileMap))
return EC;		return EC;
if (std::error_code EC = writeFuncOffsetTable())		if (std::error_code EC = writeFuncOffsetTable())
return EC;		return EC;
return sampleprof_error::success;		return sampleprof_error::success;
}		}

/// Write samples to a text file.		/// Write samples to a text file.
///		///
/// Note: it may be tempting to implement this in terms of		/// Note: it may be tempting to implement this in terms of
/// FunctionSamples::print(). Please don't. The dump functionality is intended		/// FunctionSamples::print(). Please don't. The dump functionality is intended
/// for debugging and has no specified form.		/// for debugging and has no specified form.
///		///
/// The format used here is more structured and deliberate because		/// The format used here is more structured and deliberate because
/// it needs to be parsed by the SampleProfileReaderText class.		/// it needs to be parsed by the SampleProfileReaderText class.
std::error_code SampleProfileWriterText::writeSample(const FunctionSamples &S) {		std::error_code SampleProfileWriterText::writeSample(const FunctionSamples &S) {
auto &OS = *OutputStream;		auto &OS = *OutputStream;
if (FunctionSamples::ProfileIsCS)		if (FunctionSamples::ProfileIsCS)
OS << "[" << S.getNameWithContext() << "]:" << S.getTotalSamples();		OS << "[" << S.getContext().toString() << "]:" << S.getTotalSamples();
else		else
OS << S.getName() << ":" << S.getTotalSamples();		OS << S.getName() << ":" << S.getTotalSamples();

if (Indent == 0)		if (Indent == 0)
OS << ":" << S.getHeadSamples();		OS << ":" << S.getHeadSamples();
OS << "\n";		OS << "\n";

SampleSorter<LineLocation, SampleRecord> SortedSamples(S.getBodySamples());		SampleSorter<LineLocation, SampleRecord> SortedSamples(S.getBodySamples());
Show All 39 Lines	if (FunctionSamples::ProfileIsCS) {
OS.indent(Indent + 1);		OS.indent(Indent + 1);
OS << "!Attributes: " << S.getContext().getAllAttributes() << "\n";		OS << "!Attributes: " << S.getContext().getAllAttributes() << "\n";
}		}
}		}

return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterBinary::writeNameIdx(StringRef FName,		std::error_code
bool IsContextName) {		SampleProfileWriterBinary::writeContextIdx(const SampleContext &Context) {
std::string BracketedName;		assert(!Context.hasContext() && "cs profile is not supported");
		wenleiUnsubmitted Not Done Reply Inline Actions assert !Context.hasContext() ? wenlei: assert !Context.hasContext() ?
		hoyAuthorUnsubmitted Done Reply Inline Actions Added the assert. hoy: Added the assert.
if (IsContextName) {		return writeNameIdx(Context.getName());
BracketedName = "[" + FName.str() + "]";
FName = StringRef(BracketedName);
}		}

const auto &Ret = NameTable.find(FName);		std::error_code SampleProfileWriterBinary::writeNameIdx(StringRef FName) {
if (Ret == NameTable.end())		auto &NTable = getNameTable();
		const auto &Ret = NTable.find(FName);
		if (Ret == NTable.end())
return sampleprof_error::truncated_name_table;		return sampleprof_error::truncated_name_table;
encodeULEB128(Ret->second, *OutputStream);		encodeULEB128(Ret->second, *OutputStream);
return sampleprof_error::success;		return sampleprof_error::success;
}		}

void SampleProfileWriterBinary::addName(StringRef FName, bool IsContextName) {		void SampleProfileWriterBinary::addName(StringRef FName) {
if (IsContextName) {		auto &NTable = getNameTable();
auto It = BracketedContextStr.insert("[" + FName.str() + "]");		NTable.insert(std::make_pair(FName, 0));
FName = StringRef(*It.first);
}		}
NameTable.insert(std::make_pair(FName, 0));
		void SampleProfileWriterBinary::addContext(const SampleContext &Context) {
		addName(Context.getName());
}		}

void SampleProfileWriterBinary::addNames(const FunctionSamples &S) {		void SampleProfileWriterBinary::addNames(const FunctionSamples &S) {
// Add all the names in indirect call targets.		// Add all the names in indirect call targets.
for (const auto &I : S.getBodySamples()) {		for (const auto &I : S.getBodySamples()) {
const SampleRecord &Sample = I.second;		const SampleRecord &Sample = I.second;
for (const auto &J : Sample.getCallTargets())		for (const auto &J : Sample.getCallTargets())
addName(J.first());		addName(J.first());
}		}

// Recursively add all the names for inlined callsites.		// Recursively add all the names for inlined callsites.
for (const auto &J : S.getCallsiteSamples())		for (const auto &J : S.getCallsiteSamples())
for (const auto &FS : J.second) {		for (const auto &FS : J.second) {
const FunctionSamples &CalleeSamples = FS.second;		const FunctionSamples &CalleeSamples = FS.second;
addName(CalleeSamples.getName());		addName(CalleeSamples.getName());
addNames(CalleeSamples);		addNames(CalleeSamples);
}		}
}		}

void SampleProfileWriterBinary::stablizeNameTable(std::set<StringRef> &V) {		void SampleProfileWriterExtBinaryBase::addContext(
		const SampleContext &Context) {
		if (Context.hasContext()) {
		for (auto &Callsite : Context.getContextFrames())
		SampleProfileWriterBinary::addName(Callsite.CallerName);
		CSNameTable.insert(std::make_pair(Context, 0));
		} else {
		SampleProfileWriterBinary::addName(Context.getName());
		}
		}

		void SampleProfileWriterBinary::stablizeNameTable(
		MapVector<StringRef, uint32_t> &NameTable, std::set<StringRef> &V) {
// Sort the names to make NameTable deterministic.		// Sort the names to make NameTable deterministic.
for (const auto &I : NameTable)		for (const auto &I : NameTable)
V.insert(I.first);		V.insert(I.first);
int i = 0;		int i = 0;
for (const StringRef &N : V)		for (const StringRef &N : V)
NameTable[N] = i++;		NameTable[N] = i++;
}		}

std::error_code SampleProfileWriterBinary::writeNameTable() {		std::error_code SampleProfileWriterBinary::writeNameTable() {
auto &OS = *OutputStream;		auto &OS = *OutputStream;
std::set<StringRef> V;		std::set<StringRef> V;
stablizeNameTable(V);		stablizeNameTable(NameTable, V);

// Write out the name table.		// Write out the name table.
encodeULEB128(NameTable.size(), OS);		encodeULEB128(NameTable.size(), OS);
for (auto N : V) {		for (auto N : V) {
OS << N;		OS << N;
encodeULEB128(0, OS);		encodeULEB128(0, OS);
}		}
return sampleprof_error::success;		return sampleprof_error::success;
Show All 12 Lines	std::error_code SampleProfileWriterCompactBinary::writeFuncOffsetTable() {
if (OFS.seek(FuncOffsetTableStart) == (uint64_t)-1)		if (OFS.seek(FuncOffsetTableStart) == (uint64_t)-1)
return sampleprof_error::ostream_seek_unsupported;		return sampleprof_error::ostream_seek_unsupported;

// Write out the table size.		// Write out the table size.
encodeULEB128(FuncOffsetTable.size(), OS);		encodeULEB128(FuncOffsetTable.size(), OS);

// Write out FuncOffsetTable.		// Write out FuncOffsetTable.
for (auto Entry : FuncOffsetTable) {		for (auto Entry : FuncOffsetTable) {
if (std::error_code EC =		if (std::error_code EC = writeNameIdx(Entry.first))
writeNameIdx(Entry.first, FunctionSamples::ProfileIsCS))
return EC;		return EC;
encodeULEB128(Entry.second, OS);		encodeULEB128(Entry.second, OS);
}		}
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterCompactBinary::writeNameTable() {		std::error_code SampleProfileWriterCompactBinary::writeNameTable() {
auto &OS = *OutputStream;		auto &OS = *OutputStream;
std::set<StringRef> V;		std::set<StringRef> V;
stablizeNameTable(V);		stablizeNameTable(NameTable, V);

// Write out the name table.		// Write out the name table.
encodeULEB128(NameTable.size(), OS);		encodeULEB128(NameTable.size(), OS);
for (auto N : V) {		for (auto N : V) {
encodeULEB128(MD5Hash(N), OS);		encodeULEB128(MD5Hash(N), OS);
}		}
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code		std::error_code
SampleProfileWriterBinary::writeMagicIdent(SampleProfileFormat Format) {		SampleProfileWriterBinary::writeMagicIdent(SampleProfileFormat Format) {
auto &OS = *OutputStream;		auto &OS = *OutputStream;
// Write file magic identifier.		// Write file magic identifier.
encodeULEB128(SPMagic(Format), OS);		encodeULEB128(SPMagic(Format), OS);
encodeULEB128(SPVersion(), OS);		encodeULEB128(SPVersion(), OS);
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterBinary::writeHeader(		std::error_code
const StringMap<FunctionSamples> &ProfileMap) {		SampleProfileWriterBinary::writeHeader(const SampleProfileMap &ProfileMap) {
writeMagicIdent(Format);		writeMagicIdent(Format);

computeSummary(ProfileMap);		computeSummary(ProfileMap);
if (auto EC = writeSummary())		if (auto EC = writeSummary())
return EC;		return EC;

// Generate the name table for all the functions referenced in the profile.		// Generate the name table for all the functions referenced in the profile.
for (const auto &I : ProfileMap) {		for (const auto &I : ProfileMap) {
assert(I.first() == I.second.getNameWithContext() &&		assert(I.first == I.second.getContext() && "Inconsistent profile map");
"Inconsistent profile map");		addContext(I.first);
addName(I.first(), FunctionSamples::ProfileIsCS);
addNames(I.second);		addNames(I.second);
}		}

writeNameTable();		writeNameTable();
return sampleprof_error::success;		return sampleprof_error::success;
}		}

void SampleProfileWriterExtBinaryBase::setToCompressAllSections() {		void SampleProfileWriterExtBinaryBase::setToCompressAllSections() {
▲ Show 20 Lines • Show All 57 Lines • ▼ Show 20 Lines	std::error_code SampleProfileWriterExtBinaryBase::writeSecHdrTable() {
// Reset OutputStream.		// Reset OutputStream.
if (OFS.seek(Saved) == (uint64_t)-1)		if (OFS.seek(Saved) == (uint64_t)-1)
return sampleprof_error::ostream_seek_unsupported;		return sampleprof_error::ostream_seek_unsupported;

return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterExtBinaryBase::writeHeader(		std::error_code SampleProfileWriterExtBinaryBase::writeHeader(
const StringMap<FunctionSamples> &ProfileMap) {		const SampleProfileMap &ProfileMap) {
auto &OS = *OutputStream;		auto &OS = *OutputStream;
FileStart = OS.tell();		FileStart = OS.tell();
writeMagicIdent(Format);		writeMagicIdent(Format);

allocSecHdrTable();		allocSecHdrTable();
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterCompactBinary::writeHeader(		std::error_code SampleProfileWriterCompactBinary::writeHeader(
const StringMap<FunctionSamples> &ProfileMap) {		const SampleProfileMap &ProfileMap) {
support::endian::Writer Writer(*OutputStream, support::little);		support::endian::Writer Writer(*OutputStream, support::little);
if (auto EC = SampleProfileWriterBinary::writeHeader(ProfileMap))		if (auto EC = SampleProfileWriterBinary::writeHeader(ProfileMap))
return EC;		return EC;

// Reserve a slot for the offset of function offset table. The slot will		// Reserve a slot for the offset of function offset table. The slot will
// be populated with the offset of FuncOffsetTable later.		// be populated with the offset of FuncOffsetTable later.
TableOffset = OutputStream->tell();		TableOffset = OutputStream->tell();
Writer.write(static_cast<uint64_t>(-2));		Writer.write(static_cast<uint64_t>(-2));
Show All 13 Lines	for (auto Entry : Entries) {
encodeULEB128(Entry.Cutoff, OS);		encodeULEB128(Entry.Cutoff, OS);
encodeULEB128(Entry.MinCount, OS);		encodeULEB128(Entry.MinCount, OS);
encodeULEB128(Entry.NumCounts, OS);		encodeULEB128(Entry.NumCounts, OS);
}		}
return sampleprof_error::success;		return sampleprof_error::success;
}		}
std::error_code SampleProfileWriterBinary::writeBody(const FunctionSamples &S) {		std::error_code SampleProfileWriterBinary::writeBody(const FunctionSamples &S) {
auto &OS = *OutputStream;		auto &OS = *OutputStream;
		if (std::error_code EC = writeContextIdx(S.getContext()))
if (std::error_code EC =
writeNameIdx(S.getNameWithContext(), FunctionSamples::ProfileIsCS))
return EC;		return EC;

encodeULEB128(S.getTotalSamples(), OS);		encodeULEB128(S.getTotalSamples(), OS);

// Emit all the body samples.		// Emit all the body samples.
encodeULEB128(S.getBodySamples().size(), OS);		encodeULEB128(S.getBodySamples().size(), OS);
for (const auto &I : S.getBodySamples()) {		for (const auto &I : S.getBodySamples()) {
LineLocation Loc = I.first;		LineLocation Loc = I.first;
▲ Show 20 Lines • Show All 102 Lines • ▼ Show 20 Lines	SampleProfileWriter::create(std::unique_ptr<raw_ostream> &OS,

if (EC)		if (EC)
return EC;		return EC;

Writer->Format = Format;		Writer->Format = Format;
return std::move(Writer);		return std::move(Writer);
}		}

void SampleProfileWriter::computeSummary(		void SampleProfileWriter::computeSummary(const SampleProfileMap &ProfileMap) {
const StringMap<FunctionSamples> &ProfileMap) {
SampleProfileSummaryBuilder Builder(ProfileSummaryBuilder::DefaultCutoffs);		SampleProfileSummaryBuilder Builder(ProfileSummaryBuilder::DefaultCutoffs);
Summary = Builder.computeSummaryForProfiles(ProfileMap);		Summary = Builder.computeSummaryForProfiles(ProfileMap);
}		}

llvm/test/tools/llvm-profdata/Inputs/cs-sample.proftext

	[main:3 @ _Z5funcAi:1 @ _Z8funcLeafi]:1467299:11			[main:3 @ _Z5funcAi:1 @ _Z8funcLeafi]:1467299:11
	0: 6			0: 6
	1: 6			1: 6
	3: 287884			3: 287884
	4: 287864 _Z3fibi:315608			4: 287864 _Z3fibi:315608
	15: 23			15: 23
	!Attributes: 0			!Attributes: 0
	[main:3.1 @ _Z5funcBi:1 @ _Z8funcLeafi]:500853:20			[main:3.1 @ _Z5funcBi:1 @ _Z8funcLeafi]:500853:20
	0: 15			0: 15
	1: 15			1: 15
	3: 74946			3: 74946
	4: 74941 _Z3fibi:82359			4: 74941 _Z3fibi:82359
	10: 23324			10: 23324
	11: 23327 _Z3fibi:25228			11: 23327 _Z3fibi:25228
	15: 11			15: 11
	!Attributes: 1			!Attributes: 1
				[external:12 @ main]:154:12
				2: 12
				3: 10 _Z5funcAi:7
				3.1: 10 _Z5funcBi:11
				!Attributes: 0
	[main]:154:0			[main]:154:0
	2: 12			2: 12
	3: 18 _Z5funcAi:11			3: 18 _Z5funcAi:11
	3.1: 18 _Z5funcBi:19			3.1: 18 _Z5funcBi:19
	!Attributes: 0			!Attributes: 0
	[external:12 @ main]:154:12			[external:10 @ _Z5funcBi]:120:10
	2: 12			0: 10
	3: 10 _Z5funcAi:7			1: 10
	3.1: 10 _Z5funcBi:11			!Attributes: 0
				[externalA:17 @ _Z5funcBi]:120:3
				0: 3
				1: 3
	!Attributes: 0			!Attributes: 0
	[main:3.1 @ _Z5funcBi]:120:19			[main:3.1 @ _Z5funcBi]:120:19
	0: 19			0: 19
	1: 19 _Z8funcLeafi:20			1: 19 _Z8funcLeafi:20
	3: 12			3: 12
	!Attributes: 1			!Attributes: 1
	[externalA:17 @ _Z5funcBi]:120:3
	0: 3
	1: 3
	!Attributes: 0
	[external:10 @ _Z5funcBi]:120:10
	0: 10
	1: 10
	!Attributes: 0
	[main:3 @ _Z5funcAi]:99:11			[main:3 @ _Z5funcAi]:99:11
	0: 10			0: 10
	1: 10 _Z8funcLeafi:11			1: 10 _Z8funcLeafi:11
	3: 24			3: 24
	!Attributes: 0			!Attributes: 0
				No newline at end of file

llvm/tools/llvm-profdata/llvm-profdata.cpp

Show First 20 Lines • Show All 515 Lines • ▼ Show 20 Lines	uint64_t ColdInstrThreshold =
: ProfileSummaryBuilder::getEntryForPercentile(		: ProfileSummaryBuilder::getEntryForPercentile(
InstrPS.getDetailedSummary(),		InstrPS.getDetailedSummary(),
ProfileSummaryBuilder::DefaultCutoffs[ColdPercentileIdx])		ProfileSummaryBuilder::DefaultCutoffs[ColdPercentileIdx])
.MinCount;		.MinCount;

// Find hot/warm functions in sample profile which is cold in instr profile		// Find hot/warm functions in sample profile which is cold in instr profile
// and adjust the profiles of those functions in the instr profile.		// and adjust the profiles of those functions in the instr profile.
for (const auto &PD : Reader->getProfiles()) {		for (const auto &PD : Reader->getProfiles()) {
StringRef FName = PD.getKey();		auto &FContext = PD.first;
const sampleprof::FunctionSamples &FS = PD.getValue();		const sampleprof::FunctionSamples &FS = PD.second;
auto It = InstrProfileMap.find(FName);		auto It = InstrProfileMap.find(FContext.toString());
		wenleiUnsubmitted Not Done Reply Inline Actions I suggest rename these FName (key of the SampleProfileMap) to be FContext accordingly given this is no longer a string map. Same for other places. wenlei: I suggest rename these FName (key of the SampleProfileMap) to be FContext accordingly given…
		hoyAuthorUnsubmitted Done Reply Inline Actions Sounds good. hoy: Sounds good.
if (FS.getHeadSamples() > ColdSampleThreshold &&		if (FS.getHeadSamples() > ColdSampleThreshold &&
It != InstrProfileMap.end() &&		It != InstrProfileMap.end() &&
It->second.MaxCount <= ColdInstrThreshold &&		It->second.MaxCount <= ColdInstrThreshold &&
FS.getBodySamples().size() >= SupplMinSizeThreshold) {		FS.getBodySamples().size() >= SupplMinSizeThreshold) {
updateInstrProfileEntry(It->second, HotInstrThreshold,		updateInstrProfileEntry(It->second, HotInstrThreshold,
ZeroCounterThreshold);		ZeroCounterThreshold);
}		}
}		}
▲ Show 20 Lines • Show All 150 Lines • ▼ Show 20 Lines
static void		static void
mergeSampleProfile(const WeightedFileVector &Inputs, SymbolRemapper *Remapper,		mergeSampleProfile(const WeightedFileVector &Inputs, SymbolRemapper *Remapper,
StringRef OutputFilename, ProfileFormat OutputFormat,		StringRef OutputFilename, ProfileFormat OutputFormat,
StringRef ProfileSymbolListFile, bool CompressAllSections,		StringRef ProfileSymbolListFile, bool CompressAllSections,
bool UseMD5, bool GenPartialProfile,		bool UseMD5, bool GenPartialProfile,
bool SampleMergeColdContext, bool SampleTrimColdContext,		bool SampleMergeColdContext, bool SampleTrimColdContext,
bool SampleColdContextFrameDepth, FailureMode FailMode) {		bool SampleColdContextFrameDepth, FailureMode FailMode) {
using namespace sampleprof;		using namespace sampleprof;
StringMap<FunctionSamples> ProfileMap;		SampleProfileMap ProfileMap;
SmallVector<std::unique_ptr<sampleprof::SampleProfileReader>, 5> Readers;		SmallVector<std::unique_ptr<sampleprof::SampleProfileReader>, 5> Readers;
LLVMContext Context;		LLVMContext Context;
sampleprof::ProfileSymbolList WriterList;		sampleprof::ProfileSymbolList WriterList;
Optional<bool> ProfileIsProbeBased;		Optional<bool> ProfileIsProbeBased;
Optional<bool> ProfileIsCS;		Optional<bool> ProfileIsCS;
for (const auto &Input : Inputs) {		for (const auto &Input : Inputs) {
auto ReaderOrErr = SampleProfileReader::create(Input.Filename, Context,		auto ReaderOrErr = SampleProfileReader::create(Input.Filename, Context,
FSDiscriminatorPassOption);		FSDiscriminatorPassOption);
Show All 9 Lines	for (const auto &Input : Inputs) {
Readers.push_back(std::move(ReaderOrErr.get()));		Readers.push_back(std::move(ReaderOrErr.get()));
const auto Reader = Readers.back().get();		const auto Reader = Readers.back().get();
if (std::error_code EC = Reader->read()) {		if (std::error_code EC = Reader->read()) {
warnOrExitGivenError(FailMode, EC, Input.Filename);		warnOrExitGivenError(FailMode, EC, Input.Filename);
Readers.pop_back();		Readers.pop_back();
continue;		continue;
}		}

StringMap<FunctionSamples> &Profiles = Reader->getProfiles();		SampleProfileMap &Profiles = Reader->getProfiles();
if (ProfileIsProbeBased.hasValue() &&		if (ProfileIsProbeBased.hasValue() &&
ProfileIsProbeBased != FunctionSamples::ProfileIsProbeBased)		ProfileIsProbeBased != FunctionSamples::ProfileIsProbeBased)
exitWithError(		exitWithError(
"cannot merge probe-based profile with non-probe-based profile");		"cannot merge probe-based profile with non-probe-based profile");
ProfileIsProbeBased = FunctionSamples::ProfileIsProbeBased;		ProfileIsProbeBased = FunctionSamples::ProfileIsProbeBased;
if (ProfileIsCS.hasValue() && ProfileIsCS != FunctionSamples::ProfileIsCS)		if (ProfileIsCS.hasValue() && ProfileIsCS != FunctionSamples::ProfileIsCS)
exitWithError("cannot merge CS profile with non-CS profile");		exitWithError("cannot merge CS profile with non-CS profile");
ProfileIsCS = FunctionSamples::ProfileIsCS;		ProfileIsCS = FunctionSamples::ProfileIsCS;
for (StringMap<FunctionSamples>::iterator I = Profiles.begin(),		for (SampleProfileMap::iterator I = Profiles.begin(), E = Profiles.end();
E = Profiles.end();
I != E; ++I) {		I != E; ++I) {
sampleprof_error Result = sampleprof_error::success;		sampleprof_error Result = sampleprof_error::success;
FunctionSamples Remapped =		FunctionSamples Remapped =
Remapper ? remapSamples(I->second, *Remapper, Result)		Remapper ? remapSamples(I->second, *Remapper, Result)
: FunctionSamples();		: FunctionSamples();
FunctionSamples &Samples = Remapper ? Remapped : I->second;		FunctionSamples &Samples = Remapper ? Remapped : I->second;
StringRef FName = Samples.getNameWithContext();		SampleContext FContext = Samples.getContext();
MergeResult(Result, ProfileMap[FName].merge(Samples, Input.Weight));		MergeResult(Result, ProfileMap[FContext].merge(Samples, Input.Weight));
if (Result != sampleprof_error::success) {		if (Result != sampleprof_error::success) {
std::error_code EC = make_error_code(Result);		std::error_code EC = make_error_code(Result);
handleMergeWriterError(errorCodeToError(EC), Input.Filename, FName);		handleMergeWriterError(errorCodeToError(EC), Input.Filename,
		FContext.toString());
}		}
}		}

std::unique_ptr<sampleprof::ProfileSymbolList> ReaderList =		std::unique_ptr<sampleprof::ProfileSymbolList> ReaderList =
Reader->getProfileSymbolList();		Reader->getProfileSymbolList();
if (ReaderList)		if (ReaderList)
WriterList.merge(*ReaderList);		WriterList.merge(*ReaderList);
}		}
▲ Show 20 Lines • Show All 268 Lines • ▼ Show 20 Lines	static void overlapInstrProfile(const std::string &BaseFilename,
loadInput(WeightedInput, nullptr, &Context);		loadInput(WeightedInput, nullptr, &Context);
overlapInput(BaseFilename, TestFilename, &Context, Overlap, FuncFilter, OS,		overlapInput(BaseFilename, TestFilename, &Context, Overlap, FuncFilter, OS,
IsCS);		IsCS);
Overlap.dump(OS);		Overlap.dump(OS);
}		}

namespace {		namespace {
struct SampleOverlapStats {		struct SampleOverlapStats {
StringRef BaseName;		SampleContext BaseName;
StringRef TestName;		SampleContext TestName;
// Number of overlap units		// Number of overlap units
uint64_t OverlapCount;		uint64_t OverlapCount;
// Total samples of overlap units		// Total samples of overlap units
uint64_t OverlapSample;		uint64_t OverlapSample;
// Number of and total samples of units that only present in base or test		// Number of and total samples of units that only present in base or test
// profile		// profile
uint64_t BaseUniqueCount;		uint64_t BaseUniqueCount;
uint64_t BaseUniqueSample;		uint64_t BaseUniqueSample;
▲ Show 20 Lines • Show All 186 Lines • ▼ Show 20 Lines	public:
/// profiles. This function also computes and keeps the sum of samples and		/// profiles. This function also computes and keeps the sum of samples and
/// max sample counts of each function in BaseStats and TestStats for later		/// max sample counts of each function in BaseStats and TestStats for later
/// use to avoid re-computations.		/// use to avoid re-computations.
void initializeSampleProfileOverlap();		void initializeSampleProfileOverlap();

/// Load profiles specified by BaseFilename and TestFilename.		/// Load profiles specified by BaseFilename and TestFilename.
std::error_code loadProfiles();		std::error_code loadProfiles();

		using FuncSampleStatsMap =
		std::unordered_map<SampleContext, FuncSampleStats, SampleContext::Hash>;

private:		private:
SampleOverlapStats ProfOverlap;		SampleOverlapStats ProfOverlap;
SampleOverlapStats HotFuncOverlap;		SampleOverlapStats HotFuncOverlap;
SampleOverlapStats HotBlockOverlap;		SampleOverlapStats HotBlockOverlap;
std::string BaseFilename;		std::string BaseFilename;
std::string TestFilename;		std::string TestFilename;
std::unique_ptr<sampleprof::SampleProfileReader> BaseReader;		std::unique_ptr<sampleprof::SampleProfileReader> BaseReader;
std::unique_ptr<sampleprof::SampleProfileReader> TestReader;		std::unique_ptr<sampleprof::SampleProfileReader> TestReader;
// BaseStats and TestStats hold FuncSampleStats for each function, with		// BaseStats and TestStats hold FuncSampleStats for each function, with
// function name as the key.		// function name as the key.
StringMap<FuncSampleStats> BaseStats;		FuncSampleStatsMap BaseStats;
StringMap<FuncSampleStats> TestStats;		FuncSampleStatsMap TestStats;
// Low similarity threshold in floating point number		// Low similarity threshold in floating point number
double LowSimilarityThreshold;		double LowSimilarityThreshold;
// Block samples above BaseHotThreshold or TestHotThreshold are considered hot		// Block samples above BaseHotThreshold or TestHotThreshold are considered hot
// for tracking hot blocks.		// for tracking hot blocks.
uint64_t BaseHotThreshold;		uint64_t BaseHotThreshold;
uint64_t TestHotThreshold;		uint64_t TestHotThreshold;
// A small threshold used to round the results of floating point accumulations		// A small threshold used to round the results of floating point accumulations
// to resolve imprecision.		// to resolve imprecision.
Show All 22 Lines	private:
/// this function in test profile ST, compute BS(i) = 1.0 - fabs(BB(i)/SB -		/// this function in test profile ST, compute BS(i) = 1.0 - fabs(BB(i)/SB -
/// BT(i)/ST), ranging in [0.0f to 1.0f] with 0.0 meaning no-overlap.		/// BT(i)/ST), ranging in [0.0f to 1.0f] with 0.0 meaning no-overlap.
double computeBlockSimilarity(uint64_t BaseSample, uint64_t TestSample,		double computeBlockSimilarity(uint64_t BaseSample, uint64_t TestSample,
const SampleOverlapStats &FuncOverlap) const;		const SampleOverlapStats &FuncOverlap) const;

void updateHotBlockOverlap(uint64_t BaseSample, uint64_t TestSample,		void updateHotBlockOverlap(uint64_t BaseSample, uint64_t TestSample,
uint64_t HotBlockCount);		uint64_t HotBlockCount);

void getHotFunctions(const StringMap<FuncSampleStats> &ProfStats,		void getHotFunctions(const FuncSampleStatsMap &ProfStats,
StringMap<FuncSampleStats> &HotFunc,		FuncSampleStatsMap &HotFunc,
uint64_t HotThreshold) const;		uint64_t HotThreshold) const;

void computeHotFuncOverlap();		void computeHotFuncOverlap();

/// This function updates statistics in FuncOverlap, HotBlockOverlap, and		/// This function updates statistics in FuncOverlap, HotBlockOverlap, and
/// Difference for two sample units in a matched function according to the		/// Difference for two sample units in a matched function according to the
/// given match status.		/// given match status.
void updateOverlapStatsForFunction(uint64_t BaseSample, uint64_t TestSample,		void updateOverlapStatsForFunction(uint64_t BaseSample, uint64_t TestSample,
▲ Show 20 Lines • Show All 87 Lines • ▼ Show 20 Lines	if (IsBaseHot)
HotBlockOverlap.BaseCount += HotBlockCount;		HotBlockOverlap.BaseCount += HotBlockCount;
if (IsTestHot)		if (IsTestHot)
HotBlockOverlap.TestCount += HotBlockCount;		HotBlockOverlap.TestCount += HotBlockCount;
if (IsBaseHot && IsTestHot)		if (IsBaseHot && IsTestHot)
HotBlockOverlap.OverlapCount += HotBlockCount;		HotBlockOverlap.OverlapCount += HotBlockCount;
}		}

void SampleOverlapAggregator::getHotFunctions(		void SampleOverlapAggregator::getHotFunctions(
const StringMap<FuncSampleStats> &ProfStats,		const FuncSampleStatsMap &ProfStats, FuncSampleStatsMap &HotFunc,
StringMap<FuncSampleStats> &HotFunc, uint64_t HotThreshold) const {		uint64_t HotThreshold) const {
for (const auto &F : ProfStats) {		for (const auto &F : ProfStats) {
if (isFunctionHot(F.second, HotThreshold))		if (isFunctionHot(F.second, HotThreshold))
HotFunc.try_emplace(F.first(), F.second);		HotFunc.emplace(F.first, F.second);
}		}
}		}

void SampleOverlapAggregator::computeHotFuncOverlap() {		void SampleOverlapAggregator::computeHotFuncOverlap() {
StringMap<FuncSampleStats> BaseHotFunc;		FuncSampleStatsMap BaseHotFunc;
getHotFunctions(BaseStats, BaseHotFunc, BaseHotThreshold);		getHotFunctions(BaseStats, BaseHotFunc, BaseHotThreshold);
HotFuncOverlap.BaseCount = BaseHotFunc.size();		HotFuncOverlap.BaseCount = BaseHotFunc.size();

StringMap<FuncSampleStats> TestHotFunc;		FuncSampleStatsMap TestHotFunc;
getHotFunctions(TestStats, TestHotFunc, TestHotThreshold);		getHotFunctions(TestStats, TestHotFunc, TestHotThreshold);
HotFuncOverlap.TestCount = TestHotFunc.size();		HotFuncOverlap.TestCount = TestHotFunc.size();
HotFuncOverlap.UnionCount = HotFuncOverlap.TestCount;		HotFuncOverlap.UnionCount = HotFuncOverlap.TestCount;

for (const auto &F : BaseHotFunc) {		for (const auto &F : BaseHotFunc) {
if (TestHotFunc.count(F.first()))		if (TestHotFunc.count(F.first))
++HotFuncOverlap.OverlapCount;		++HotFuncOverlap.OverlapCount;
else		else
++HotFuncOverlap.UnionCount;		++HotFuncOverlap.UnionCount;
}		}
}		}

void SampleOverlapAggregator::updateOverlapStatsForFunction(		void SampleOverlapAggregator::updateOverlapStatsForFunction(
uint64_t BaseSample, uint64_t TestSample, uint64_t HotBlockCount,		uint64_t BaseSample, uint64_t TestSample, uint64_t HotBlockCount,
▲ Show 20 Lines • Show All 195 Lines • ▼ Show 20 Lines	double SampleOverlapAggregator::computeSampleFunctionOverlap(
FuncSimilarity = weightForFuncSimilarity(FuncInternalSimilarity,		FuncSimilarity = weightForFuncSimilarity(FuncInternalSimilarity,
BaseFuncSample, TestFuncSample);		BaseFuncSample, TestFuncSample);
return FuncSimilarity;		return FuncSimilarity;
}		}

void SampleOverlapAggregator::computeSampleProfileOverlap(raw_fd_ostream &OS) {		void SampleOverlapAggregator::computeSampleProfileOverlap(raw_fd_ostream &OS) {
using namespace sampleprof;		using namespace sampleprof;

StringMap<const FunctionSamples *> BaseFuncProf;		std::unordered_map<SampleContext, const FunctionSamples *,
		SampleContext::Hash>
		BaseFuncProf;
const auto &BaseProfiles = BaseReader->getProfiles();		const auto &BaseProfiles = BaseReader->getProfiles();
for (const auto &BaseFunc : BaseProfiles) {		for (const auto &BaseFunc : BaseProfiles) {
BaseFuncProf.try_emplace(BaseFunc.second.getNameWithContext(),		BaseFuncProf.emplace(BaseFunc.second.getContext(), &(BaseFunc.second));
&(BaseFunc.second));
}		}
ProfOverlap.UnionCount = BaseFuncProf.size();		ProfOverlap.UnionCount = BaseFuncProf.size();

const auto &TestProfiles = TestReader->getProfiles();		const auto &TestProfiles = TestReader->getProfiles();
for (const auto &TestFunc : TestProfiles) {		for (const auto &TestFunc : TestProfiles) {
SampleOverlapStats FuncOverlap;		SampleOverlapStats FuncOverlap;
FuncOverlap.TestName = TestFunc.second.getNameWithContext();		FuncOverlap.TestName = TestFunc.second.getContext();
assert(TestStats.count(FuncOverlap.TestName) &&		assert(TestStats.count(FuncOverlap.TestName) &&
"TestStats should have records for all functions in test profile "		"TestStats should have records for all functions in test profile "
"except inlinees");		"except inlinees");
FuncOverlap.TestSample = TestStats[FuncOverlap.TestName].SampleSum;		FuncOverlap.TestSample = TestStats[FuncOverlap.TestName].SampleSum;

const auto Match = BaseFuncProf.find(FuncOverlap.TestName);		const auto Match = BaseFuncProf.find(FuncOverlap.TestName);
if (Match == BaseFuncProf.end()) {		if (Match == BaseFuncProf.end()) {
const FuncSampleStats &FuncStats = TestStats[FuncOverlap.TestName];		const FuncSampleStats &FuncStats = TestStats[FuncOverlap.TestName];
Show All 10 Lines	if (Match == BaseFuncProf.end()) {

++ProfOverlap.UnionCount;		++ProfOverlap.UnionCount;
ProfOverlap.UnionSample += FuncStats.SampleSum;		ProfOverlap.UnionSample += FuncStats.SampleSum;
} else {		} else {
++ProfOverlap.OverlapCount;		++ProfOverlap.OverlapCount;

// Two functions match with each other. Compute function-level overlap and		// Two functions match with each other. Compute function-level overlap and
// aggregate them into profile-level overlap.		// aggregate them into profile-level overlap.
FuncOverlap.BaseName = Match->second->getNameWithContext();		FuncOverlap.BaseName = Match->second->getContext();
assert(BaseStats.count(FuncOverlap.BaseName) &&		assert(BaseStats.count(FuncOverlap.BaseName) &&
"BaseStats should have records for all functions in base profile "		"BaseStats should have records for all functions in base profile "
"except inlinees");		"except inlinees");
FuncOverlap.BaseSample = BaseStats[FuncOverlap.BaseName].SampleSum;		FuncOverlap.BaseSample = BaseStats[FuncOverlap.BaseName].SampleSum;

FuncOverlap.Similarity = computeSampleFunctionOverlap(		FuncOverlap.Similarity = computeSampleFunctionOverlap(
Match->second, &TestFunc.second, &FuncOverlap, FuncOverlap.BaseSample,		Match->second, &TestFunc.second, &FuncOverlap, FuncOverlap.BaseSample,
FuncOverlap.TestSample);		FuncOverlap.TestSample);
Show All 16 Lines	for (const auto &TestFunc : TestProfiles) {
// Print function-level similarity information if specified by options.		// Print function-level similarity information if specified by options.
assert(TestStats.count(FuncOverlap.TestName) &&		assert(TestStats.count(FuncOverlap.TestName) &&
"TestStats should have records for all functions in test profile "		"TestStats should have records for all functions in test profile "
"except inlinees");		"except inlinees");
if (TestStats[FuncOverlap.TestName].MaxSample >= FuncFilter.ValueCutoff \|\|		if (TestStats[FuncOverlap.TestName].MaxSample >= FuncFilter.ValueCutoff \|\|
(Match != BaseFuncProf.end() &&		(Match != BaseFuncProf.end() &&
FuncOverlap.Similarity < LowSimilarityThreshold) \|\|		FuncOverlap.Similarity < LowSimilarityThreshold) \|\|
(Match != BaseFuncProf.end() && !FuncFilter.NameFilter.empty() &&		(Match != BaseFuncProf.end() && !FuncFilter.NameFilter.empty() &&
FuncOverlap.BaseName.find(FuncFilter.NameFilter) !=		FuncOverlap.BaseName.toString().find(FuncFilter.NameFilter) !=
FuncOverlap.BaseName.npos)) {		std::string::npos)) {
assert(ProfOverlap.BaseSample > 0 &&		assert(ProfOverlap.BaseSample > 0 &&
"Total samples in base profile should be greater than 0");		"Total samples in base profile should be greater than 0");
FuncOverlap.BaseWeight =		FuncOverlap.BaseWeight =
static_cast<double>(FuncOverlap.BaseSample) / ProfOverlap.BaseSample;		static_cast<double>(FuncOverlap.BaseSample) / ProfOverlap.BaseSample;
assert(ProfOverlap.TestSample > 0 &&		assert(ProfOverlap.TestSample > 0 &&
"Total samples in test profile should be greater than 0");		"Total samples in test profile should be greater than 0");
FuncOverlap.TestWeight =		FuncOverlap.TestWeight =
static_cast<double>(FuncOverlap.TestSample) / ProfOverlap.TestSample;		static_cast<double>(FuncOverlap.TestSample) / ProfOverlap.TestSample;
FuncSimilarityDump.emplace(FuncOverlap.BaseWeight, FuncOverlap);		FuncSimilarityDump.emplace(FuncOverlap.BaseWeight, FuncOverlap);
}		}
}		}

// Traverse through functions in base profile but not in test profile.		// Traverse through functions in base profile but not in test profile.
for (const auto &F : BaseFuncProf) {		for (const auto &F : BaseFuncProf) {
assert(BaseStats.count(F.second->getNameWithContext()) &&		assert(BaseStats.count(F.second->getContext()) &&
"BaseStats should have records for all functions in base profile "		"BaseStats should have records for all functions in base profile "
"except inlinees");		"except inlinees");
const FuncSampleStats &FuncStats =		const FuncSampleStats &FuncStats = BaseStats[F.second->getContext()];
BaseStats[F.second->getNameWithContext()];
++ProfOverlap.BaseUniqueCount;		++ProfOverlap.BaseUniqueCount;
ProfOverlap.BaseUniqueSample += FuncStats.SampleSum;		ProfOverlap.BaseUniqueSample += FuncStats.SampleSum;

updateHotBlockOverlap(FuncStats.SampleSum, 0, FuncStats.HotBlockCount);		updateHotBlockOverlap(FuncStats.SampleSum, 0, FuncStats.HotBlockCount);

double FuncSimilarity = computeSampleFunctionOverlap(		double FuncSimilarity = computeSampleFunctionOverlap(
nullptr, nullptr, nullptr, FuncStats.SampleSum, 0);		nullptr, nullptr, nullptr, FuncStats.SampleSum, 0);
ProfOverlap.Similarity +=		ProfOverlap.Similarity +=
Show All 14 Lines

void SampleOverlapAggregator::initializeSampleProfileOverlap() {		void SampleOverlapAggregator::initializeSampleProfileOverlap() {
const auto &BaseProf = BaseReader->getProfiles();		const auto &BaseProf = BaseReader->getProfiles();
for (const auto &I : BaseProf) {		for (const auto &I : BaseProf) {
++ProfOverlap.BaseCount;		++ProfOverlap.BaseCount;
FuncSampleStats FuncStats;		FuncSampleStats FuncStats;
getFuncSampleStats(I.second, FuncStats, BaseHotThreshold);		getFuncSampleStats(I.second, FuncStats, BaseHotThreshold);
ProfOverlap.BaseSample += FuncStats.SampleSum;		ProfOverlap.BaseSample += FuncStats.SampleSum;
BaseStats.try_emplace(I.second.getNameWithContext(), FuncStats);		BaseStats.emplace(I.second.getContext(), FuncStats);
}		}

const auto &TestProf = TestReader->getProfiles();		const auto &TestProf = TestReader->getProfiles();
for (const auto &I : TestProf) {		for (const auto &I : TestProf) {
++ProfOverlap.TestCount;		++ProfOverlap.TestCount;
FuncSampleStats FuncStats;		FuncSampleStats FuncStats;
getFuncSampleStats(I.second, FuncStats, TestHotThreshold);		getFuncSampleStats(I.second, FuncStats, TestHotThreshold);
ProfOverlap.TestSample += FuncStats.SampleSum;		ProfOverlap.TestSample += FuncStats.SampleSum;
TestStats.try_emplace(I.second.getNameWithContext(), FuncStats);		TestStats.emplace(I.second.getContext(), FuncStats);
}		}

ProfOverlap.BaseName = StringRef(BaseFilename);		ProfOverlap.BaseName = StringRef(BaseFilename);
ProfOverlap.TestName = StringRef(TestFilename);		ProfOverlap.TestName = StringRef(TestFilename);
}		}

void SampleOverlapAggregator::dumpFuncSimilarity(raw_fd_ostream &OS) const {		void SampleOverlapAggregator::dumpFuncSimilarity(raw_fd_ostream &OS) const {
using namespace sampleprof;		using namespace sampleprof;
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	for (const auto &F : FuncSimilarityDump) {
FOS << format("%.2f%%", BaseUniquePercent * 100);		FOS << format("%.2f%%", BaseUniquePercent * 100);
FOS.PadToColumn(TestUniqueCol);		FOS.PadToColumn(TestUniqueCol);
FOS << format("%.2f%%", TestUniquePercent * 100);		FOS << format("%.2f%%", TestUniquePercent * 100);
FOS.PadToColumn(BaseSampleCol);		FOS.PadToColumn(BaseSampleCol);
FOS << F.second.BaseSample;		FOS << F.second.BaseSample;
FOS.PadToColumn(TestSampleCol);		FOS.PadToColumn(TestSampleCol);
FOS << F.second.TestSample;		FOS << F.second.TestSample;
FOS.PadToColumn(FuncNameCol);		FOS.PadToColumn(FuncNameCol);
FOS << F.second.TestName << "\n";		FOS << F.second.TestName.toString() << "\n";
}		}
}		}

void SampleOverlapAggregator::dumpProgramSummary(raw_fd_ostream &OS) const {		void SampleOverlapAggregator::dumpProgramSummary(raw_fd_ostream &OS) const {
OS << "Profile overlap infomation for base_profile: " << ProfOverlap.BaseName		OS << "Profile overlap infomation for base_profile: "
<< " and test_profile: " << ProfOverlap.TestName << "\nProgram level:\n";		<< ProfOverlap.BaseName.toString()
		<< " and test_profile: " << ProfOverlap.TestName.toString()
		<< "\nProgram level:\n";

OS << " Whole program profile similarity: "		OS << " Whole program profile similarity: "
<< format("%.3f%%", ProfOverlap.Similarity * 100) << "\n";		<< format("%.3f%%", ProfOverlap.Similarity * 100) << "\n";

assert(ProfOverlap.UnionSample > 0 &&		assert(ProfOverlap.UnionSample > 0 &&
"Total samples in two profile should be greater than 0");		"Total samples in two profile should be greater than 0");
double OverlapPercent =		double OverlapPercent =
static_cast<double>(ProfOverlap.OverlapSample) / ProfOverlap.UnionSample;		static_cast<double>(ProfOverlap.OverlapSample) / ProfOverlap.UnionSample;
▲ Show 20 Lines • Show All 441 Lines • ▼ Show 20 Lines	WithColor::warning() << "-show-sec-info-only is only supported for "
<< "sample profile in extbinary format and is "		<< "sample profile in extbinary format and is "
<< "ignored for other formats.\n";		<< "ignored for other formats.\n";
return;		return;
}		}
}		}

namespace {		namespace {
struct HotFuncInfo {		struct HotFuncInfo {
StringRef FuncName;		std::string FuncName;
uint64_t TotalCount;		uint64_t TotalCount;
double TotalCountPercent;		double TotalCountPercent;
uint64_t MaxCount;		uint64_t MaxCount;
uint64_t EntryCount;		uint64_t EntryCount;

HotFuncInfo()		HotFuncInfo()
: FuncName(), TotalCount(0), TotalCountPercent(0.0f), MaxCount(0),		: FuncName(), TotalCount(0), TotalCountPercent(0.0f), MaxCount(0),
EntryCount(0) {}		EntryCount(0) {}

HotFuncInfo(StringRef FN, uint64_t TS, double TSP, uint64_t MS, uint64_t ES)		HotFuncInfo(StringRef FN, uint64_t TS, double TSP, uint64_t MS, uint64_t ES)
: FuncName(FN), TotalCount(TS), TotalCountPercent(TSP), MaxCount(MS),		: FuncName(FN.begin(), FN.end()), TotalCount(TS), TotalCountPercent(TSP),
EntryCount(ES) {}		MaxCount(MS), EntryCount(ES) {}
};		};
} // namespace		} // namespace

// Print out detailed information about hot functions in PrintValues vector.		// Print out detailed information about hot functions in PrintValues vector.
// Users specify titles and offset of every columns through ColumnTitle and		// Users specify titles and offset of every columns through ColumnTitle and
// ColumnOffset. The size of ColumnTitle and ColumnOffset need to be the same		// ColumnOffset. The size of ColumnTitle and ColumnOffset need to be the same
// and at least 4. Besides, users can optionally give a HotFuncMetric string to		// and at least 4. Besides, users can optionally give a HotFuncMetric string to
// print out or let it be an empty string.		// print out or let it be an empty string.
Show All 39 Lines	for (const HotFuncInfo &R : PrintValues) {
FOS << R.MaxCount;		FOS << R.MaxCount;
FOS.PadToColumn(ColumnOffset[2]);		FOS.PadToColumn(ColumnOffset[2]);
FOS << R.EntryCount;		FOS << R.EntryCount;
FOS.PadToColumn(ColumnOffset[3]);		FOS.PadToColumn(ColumnOffset[3]);
FOS << R.FuncName << "\n";		FOS << R.FuncName << "\n";
}		}
}		}

static int		static int showHotFunctionList(const sampleprof::SampleProfileMap &Profiles,
showHotFunctionList(const StringMap<sampleprof::FunctionSamples> &Profiles,
ProfileSummary &PS, raw_fd_ostream &OS) {		ProfileSummary &PS, raw_fd_ostream &OS) {
using namespace sampleprof;		using namespace sampleprof;

const uint32_t HotFuncCutoff = 990000;		const uint32_t HotFuncCutoff = 990000;
auto &SummaryVector = PS.getDetailedSummary();		auto &SummaryVector = PS.getDetailedSummary();
uint64_t MinCountThreshold = 0;		uint64_t MinCountThreshold = 0;
for (const ProfileSummaryEntry &SummaryEntry : SummaryVector) {		for (const ProfileSummaryEntry &SummaryEntry : SummaryVector) {
if (SummaryEntry.Cutoff == HotFuncCutoff) {		if (SummaryEntry.Cutoff == HotFuncCutoff) {
MinCountThreshold = SummaryEntry.MinCount;		MinCountThreshold = SummaryEntry.MinCount;
Show All 33 Lines	static int showHotFunctionList(const sampleprof::SampleProfileMap &Profiles,
std::vector<HotFuncInfo> PrintValues;		std::vector<HotFuncInfo> PrintValues;
for (const auto &FuncPair : HotFunc) {		for (const auto &FuncPair : HotFunc) {
const FunctionSamples &Func = *FuncPair.second.first;		const FunctionSamples &Func = *FuncPair.second.first;
double TotalSamplePercent =		double TotalSamplePercent =
(ProfileTotalSample > 0)		(ProfileTotalSample > 0)
? (Func.getTotalSamples() * 100.0) / ProfileTotalSample		? (Func.getTotalSamples() * 100.0) / ProfileTotalSample
: 0;		: 0;
PrintValues.emplace_back(HotFuncInfo(		PrintValues.emplace_back(HotFuncInfo(
Func.getNameWithContext(), Func.getTotalSamples(), TotalSamplePercent,		Func.getContext().toString(), Func.getTotalSamples(),
FuncPair.second.second, Func.getEntrySamples()));		TotalSamplePercent, FuncPair.second.second, Func.getEntrySamples()));
}		}
dumpHotFunctionList(ColumnTitle, ColumnOffset, PrintValues, HotFuncCount,		dumpHotFunctionList(ColumnTitle, ColumnOffset, PrintValues, HotFuncCount,
Profiles.size(), HotFuncSample, ProfileTotalSample,		Profiles.size(), HotFuncSample, ProfileTotalSample,
Metric, OS);		Metric, OS);

return 0;		return 0;
}		}

Show All 17 Lines	static int showSampleProfile(const std::string &Filename, bool ShowCounts,
}		}

if (std::error_code EC = Reader->read())		if (std::error_code EC = Reader->read())
exitWithErrorCode(EC, Filename);		exitWithErrorCode(EC, Filename);

if (ShowAllFunctions \|\| ShowFunction.empty())		if (ShowAllFunctions \|\| ShowFunction.empty())
Reader->dump(OS);		Reader->dump(OS);
else		else
Reader->dumpFunctionProfile(ShowFunction, OS);		// TODO: parse context string to support filtering by contexts.
		Reader->dumpFunctionProfile(StringRef(ShowFunction), OS);

if (ShowProfileSymbolList) {		if (ShowProfileSymbolList) {
std::unique_ptr<sampleprof::ProfileSymbolList> ReaderList =		std::unique_ptr<sampleprof::ProfileSymbolList> ReaderList =
Reader->getProfileSymbolList();		Reader->getProfileSymbolList();
ReaderList->dump(OS);		ReaderList->dump(OS);
}		}

if (ShowDetailedSummary) {		if (ShowDetailedSummary) {
▲ Show 20 Lines • Show All 143 Lines • Show Last 20 Lines

llvm/unittests/ProfileData/SampleProfTest.cpp

Show First 20 Lines • Show All 187 Lines • ▼ Show 20 Lines	void testRoundTrip(SampleProfileFormat Format, bool Remap, bool UseMD5) {

StringRef BooName("_Z3booi");		StringRef BooName("_Z3booi");
FunctionSamples BooSamples;		FunctionSamples BooSamples;
BooSamples.setName(BooName);		BooSamples.setName(BooName);
BooSamples.addTotalSamples(1232);		BooSamples.addTotalSamples(1232);
BooSamples.addHeadSamples(1);		BooSamples.addHeadSamples(1);
BooSamples.addBodySamples(1, 0, 1232);		BooSamples.addBodySamples(1, 0, 1232);

StringMap<FunctionSamples> Profiles;		SampleProfileMap Profiles;
Profiles[FooName] = std::move(FooSamples);		Profiles[FooName] = std::move(FooSamples);
Profiles[BarName] = std::move(BarSamples);		Profiles[BarName] = std::move(BarSamples);
Profiles[BazName] = std::move(BazSamples);		Profiles[BazName] = std::move(BazSamples);
Profiles[BooName] = std::move(BooSamples);		Profiles[BooName] = std::move(BooSamples);

Module M("my_module", Context);		Module M("my_module", Context);
FunctionType *fn_type =		FunctionType *fn_type =
FunctionType::get(Type::getVoidTy(Context), {}, false);		FunctionType::get(Type::getVoidTy(Context), {}, false);
▲ Show 20 Lines • Show All 117 Lines • ▼ Show 20 Lines	void testRoundTrip(SampleProfileFormat Format, bool Remap, bool UseMD5) {

verifyProfileSummary(Summary, M, false, false);		verifyProfileSummary(Summary, M, false, false);

Summary.setPartialProfile(true);		Summary.setPartialProfile(true);
Summary.setPartialProfileRatio(0.5);		Summary.setPartialProfileRatio(0.5);
verifyProfileSummary(Summary, M, true, true);		verifyProfileSummary(Summary, M, true, true);
}		}

void addFunctionSamples(StringMap<FunctionSamples> Smap, const char Fname,		void addFunctionSamples(SampleProfileMap Smap, const char Fname,
uint64_t TotalSamples, uint64_t HeadSamples) {		uint64_t TotalSamples, uint64_t HeadSamples) {
StringRef Name(Fname);		StringRef Name(Fname);
FunctionSamples FcnSamples;		FunctionSamples FcnSamples;
FcnSamples.setName(Name);		FcnSamples.setName(Name);
FcnSamples.addTotalSamples(TotalSamples);		FcnSamples.addTotalSamples(TotalSamples);
FcnSamples.addHeadSamples(HeadSamples);		FcnSamples.addHeadSamples(HeadSamples);
FcnSamples.addBodySamples(1, 0, HeadSamples);		FcnSamples.addBodySamples(1, 0, HeadSamples);
(*Smap)[Name] = FcnSamples;		(*Smap)[Name] = FcnSamples;
}		}

StringMap<FunctionSamples> setupFcnSamplesForElisionTest(StringRef Policy) {		SampleProfileMap setupFcnSamplesForElisionTest(StringRef Policy) {
StringMap<FunctionSamples> Smap;		SampleProfileMap Smap;
addFunctionSamples(&Smap, "foo", uint64_t(20301), uint64_t(1437));		addFunctionSamples(&Smap, "foo", uint64_t(20301), uint64_t(1437));
if (Policy == "" \|\| Policy == "all")		if (Policy == "" \|\| Policy == "all")
return Smap;		return Smap;
addFunctionSamples(&Smap, "foo.bar", uint64_t(20303), uint64_t(1439));		addFunctionSamples(&Smap, "foo.bar", uint64_t(20303), uint64_t(1439));
if (Policy == "selected")		if (Policy == "selected")
return Smap;		return Smap;
addFunctionSamples(&Smap, "foo.llvm.2465", uint64_t(20305), uint64_t(1441));		addFunctionSamples(&Smap, "foo.llvm.2465", uint64_t(20305), uint64_t(1441));
return Smap;		return Smap;
Show All 17 Lines	struct SampleProfTest : ::testing::Test {
}		}

void testSuffixElisionPolicy(SampleProfileFormat Format, StringRef Policy,		void testSuffixElisionPolicy(SampleProfileFormat Format, StringRef Policy,
const StringMap<uint64_t> &Expected) {		const StringMap<uint64_t> &Expected) {
TempFile ProfileFile("profile", "", "", /Unique/ true);		TempFile ProfileFile("profile", "", "", /Unique/ true);

Module M("my_module", Context);		Module M("my_module", Context);
setupModuleForElisionTest(&M, Policy);		setupModuleForElisionTest(&M, Policy);
StringMap<FunctionSamples> ProfMap = setupFcnSamplesForElisionTest(Policy);		SampleProfileMap ProfMap = setupFcnSamplesForElisionTest(Policy);

// write profile		// write profile
createWriter(Format, ProfileFile.path());		createWriter(Format, ProfileFile.path());
std::error_code EC;		std::error_code EC;
EC = Writer->write(ProfMap);		EC = Writer->write(ProfMap);
ASSERT_TRUE(NoError(EC));		ASSERT_TRUE(NoError(EC));
Writer->getOutputStream().flush();		Writer->getOutputStream().flush();

▲ Show 20 Lines • Show All 150 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[CSSPGO] split context string II - reader/writer changesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 369582

llvm/include/llvm/ProfileData/ProfileCommon.h

llvm/include/llvm/ProfileData/SampleProf.h

llvm/include/llvm/ProfileData/SampleProfReader.h

llvm/include/llvm/ProfileData/SampleProfWriter.h

llvm/lib/ProfileData/SampleProf.cpp

llvm/lib/ProfileData/SampleProfReader.cpp

llvm/lib/ProfileData/SampleProfWriter.cpp

llvm/test/tools/llvm-profdata/Inputs/cs-sample.proftext

llvm/tools/llvm-profdata/llvm-profdata.cpp

llvm/unittests/ProfileData/SampleProfTest.cpp

[CSSPGO] split context string II - reader/writer changes
ClosedPublic