This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
bolt/lib/Rewrite/
-
lib/
-
Rewrite/
-
RewriteInstance.cpp
-
llvm/
-
include/llvm/
-
llvm/
-
IR/
1/2
PseudoProbe.h
-
MC/
-
MCObjectFileInfo.h
7/12
MCPseudoProbe.h
-
MCStreamer.h
-
Transforms/IPO/
-
IPO/
-
SampleProfileProbe.h
-
lib/
-
CodeGen/AsmPrinter/
-
AsmPrinter/
-
PseudoProbePrinter.cpp
-
MC/
-
MCAsmStreamer.cpp
-
MCObjectFileInfo.cpp
-
MCParser/
-
AsmParser.cpp
12/14
MCPseudoProbe.cpp
-
MCStreamer.cpp
-
test/
-
Transforms/SampleProfile/
-
SampleProfile/
-
pseudo-probe-emit-inline.ll
-
pseudo-probe-emit.ll
-
tools/llvm-profgen/
-
llvm-profgen/
-
Inputs/
-
func-split.perfbin
-
inline-cs-pseudoprobe.perfbin
-
inline-force-dwarf.test
-
tools/llvm-profgen/
-
llvm-profgen/
-
ProfileGenerator.cpp
-
ProfiledBinary.h
-
ProfiledBinary.cpp

Differential D135912

[PseudoProbe] Replace relocation with offset for entry probe.
ClosedPublic

Authored by hoy on Oct 13 2022, 1:14 PM.

Download Raw Diff

Details

Reviewers

wenlei
wlei
rafauler
Amir
maksfb

Commits

rGd5a963ab8b40: [PseudoProbe] Replace relocation with offset for entry probe.

Summary

Currently pseudo probe encoding for a function is like:

For the first probe, a relocation from it to its physical position in the code body
For subsequent probes, an incremental offset from the current probe to the previous probe

The relocation could potentially cause relocation overflow during link time. I'm now replacing it with an offset from the first probe to the function start address.

A source function could be lowered into multiple split functions due to outlining (e.g, coro-split). Since those split functions have independent link-time layout, to really avoid relocations from .pseudo_probe sections to .text sections, the offset to replace with should really be the offset from the probe's enclosing split function, rather than from the entry of the source function. This requires some changes to previous section-based emission scheme which now switches to be function-based. The assembly form of pseudo probe directive is also changed correspondingly, i.e, reflecting the split function name.

A sentinel probe is emitted for each of the functions with a different name from the source. The sentinel probe indicates the elf symbol name to differentiate subsequent probes from the ones from a different split function. For examples, given source function

Foo() {
  …
  Probe 1
  …
  Probe 2
}

If it is transformed into two split functions:

Foo:
   …

Foo.outlined:
   …

The encoding for the two split functions will be separate:

GUID of Foo
  Probe 1

GUID of Foo 
  Sentinel probe of Foo.outlined 
  Probe 2

Then probe1 will be decoded against binary Foo's address, and Probe 2 will be decoded against Foo.outlined. The sentinel probe of Foo.outlined makes sure there's not accidental relocation from Foo.outlined's probes to Foo's entry address.

Decoding change will be in separate patch.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hoy created this revision.Oct 13 2022, 1:14 PM

Herald added a project: Restricted Project. · View Herald TranscriptOct 13 2022, 1:14 PM

Herald added subscribers: ormris, modimo, wenlei, hiraditya. · View Herald Transcript

hoy requested review of this revision.Oct 13 2022, 1:14 PM

Herald added a project: Restricted Project. · View Herald TranscriptOct 13 2022, 1:14 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

hoy added reviewers: wenlei, wlei.Oct 13 2022, 1:16 PM

Harbormaster completed remote builds in B192041: Diff 467583.Oct 13 2022, 1:53 PM

Updating D135912: [PseudoProbe] Replace relocation with offset for entry probe.

hoy edited the summary of this revision. (Show Details)Oct 17 2022, 10:17 AM

Harbormaster completed remote builds in B192527: Diff 468246.Oct 17 2022, 11:27 AM

hoy mentioned this in D135914: [PseudoProbe] Decode offset based pseudo probe..Oct 17 2022, 11:33 AM

hoy added a child revision: D135914: [PseudoProbe] Decode offset based pseudo probe..

hoy mentioned this in D136394: [BOLT][PseudoProbe] Support new pseudo probe encoding.Oct 20 2022, 4:51 PM

Not an expert, just some hints in case you like the suggestions.

llvm/include/llvm/MC/MCPseudoProbe.h
299	Is this over 80col?
llvm/lib/MC/MCPseudoProbe.cpp
151	typo Santinel->Sentinel
214–228	std::map is an expensive data structure. For usage patterns that are strictly "insert then query", considering using a vector + std::sort. Reference https://llvm.org/docs/ProgrammersManual.html#dss-sortedvectormap

hoy marked 2 inline comments as done.Oct 21 2022, 12:41 PM

hoy added inline comments.

llvm/include/llvm/MC/MCPseudoProbe.h
299	Yes, it is. Fixed.
llvm/lib/MC/MCPseudoProbe.cpp
214–228	Good point. Changed to using vector+sort.

Addressing feedbacks.

Herald added a subscriber: mgrang. · View Herald TranscriptOct 21 2022, 12:41 PM

Harbormaster completed remote builds in B193604: Diff 469718.Oct 21 2022, 1:26 PM

wenlei added inline comments.Oct 24 2022, 10:43 PM

llvm/include/llvm/MC/MCPseudoProbe.h
34–36	I thought we now have GUID for symbol name for sentinel, and address offset for non-sentinel. What does regular probe that doesn't use offset refer to?
287	do you want to rename this to MCPseudoProbeFunctions?
llvm/lib/MC/MCPseudoProbe.cpp
52	Did you mean `assert((LastProbe \|\| IsSentinel) && "...")`?
80–81	To be accurate, this is the GUID of the (split) binary function name / elf symbol name?
140	nit: this message doesn't provide additional info other than the assertion itself. the message could be something more meaningful - why we don't expect root node here
155	Is this checking the top level binary function name is different from the top level source function name? And this is to avoid emitting sentinel for main function body?
215	nit: this is a comparator instead of a sorter. same for the one above.

wenlei added inline comments.Oct 24 2022, 10:54 PM

llvm/include/llvm/MC/MCPseudoProbe.h
17–20	Have you considered removing this GUID field, but always require a sentinel probe for any (split) binary function including the main body?

hoy marked 5 inline comments as done.Oct 25 2022, 10:10 AM

hoy added inline comments.

llvm/include/llvm/MC/MCPseudoProbe.h
17–20	The GUID field here is the GUID of the source/dwarf name, which may be different from the actually symbol linkage name, even for the main body of the function. The source GUID will be used to decode and generate a profile against the source function name.
34–36	This is mainly for downwards compatibility, i.e, to have the new llvm-profgen handle old binaries. Also Bolt re-encoding uses the old encoding scheme for simplicity.
287	This class kind of tells how probe encoding looks physically. Probes of a function will stay in a standalone section (comdat) and that's why section is used here. I'd like to keep it as is, but don't have a strong preference. WDYT?
llvm/lib/MC/MCPseudoProbe.cpp
52	Good catch. Yeah, that's what I actually want.
80–81	Exactly.
140	Fixed.
155	Exactly. Added a comment about that. The encoding for a function with outlined code will look like below. Note that the main entry doesn't need a sentinel probe. GUID of Foo Probe 1 GUID of Foo Sentinel probe of Foo.outlined Probe 2

Addressing feedbacks.

Harbormaster completed remote builds in B194214: Diff 470541.Oct 25 2022, 11:35 AM

For those don't, a sentinel probe is emitted for each of the binary functions with a different name from the source.

Nit but it indeed caused confusion for me: if the definition of binary function is each MCSymbol, the above isn't accurate because we never add sentinel for the original function.

Also the term binary function is a bit confusing too. Intuitively a binary function has 1:1 mapping to source function, which is how BOLT defines binary function I believe. I also didn't find precedence in LLVM/MC where these funclets are called binary function. Can we make it straightforward and just call it split functions in code and comments?

Otherwise this looks good. Thanks for working on relocation removal.

llvm/include/llvm/MC/MCPseudoProbe.h
17–20	Ok, makes sense. Maybe clarify this in this comment section as well?
34–36	How about adding a TODO comment to remove this later?

In D135912#3883462, @wenlei wrote:

For those don't, a sentinel probe is emitted for each of the binary functions with a different name from the source.

Nit but it indeed caused confusion for me: if the definition of binary function is each MCSymbol, the above isn't accurate because we never add sentinel for the original function.

I see. Will just remove "Most of the source functions end up with only one binary function. For those don't, ".

Also the term binary function is a bit confusing too. Intuitively a binary function has 1:1 mapping to source function, which is how BOLT defines binary function I believe. I also didn't find precedence in LLVM/MC where these funclets are called binary function. Can we make it straightforward and just call it split functions in code and comments?

Otherwise this looks good. Thanks for working on relocation removal.

How about using code ranges instead of binary functions?

llvm/include/llvm/MC/MCPseudoProbe.h
17–20	Done.
34–36	Sounds good.

Updating comments.

hoy edited the summary of this revision. (Show Details)Oct 25 2022, 2:38 PM

Harbormaster completed remote builds in B194259: Diff 470603.Oct 25 2022, 3:27 PM

How about using code ranges instead of binary functions?

code range is very general. I'd just use split function to make it straightforward. otoh, if BOLT uses the term binary function to represent those split funclets, it would be ok to do the same here too.

Updating D135912: [PseudoProbe] Replace relocation with offset for entry probe.

Harbormaster completed remote builds in B194313: Diff 470682.Oct 25 2022, 8:22 PM

lgtm, thanks.

llvm/include/llvm/IR/PseudoProbe.h
33	nit on comment: split function entry address? main body doesn't have it.
llvm/lib/MC/MCPseudoProbe.cpp
80–81	nit on comment: function -> split function?

This revision is now accepted and ready to land.Oct 26 2022, 10:37 AM

hoy added inline comments.Oct 26 2022, 11:02 AM

llvm/include/llvm/IR/PseudoProbe.h
33	Sounds good. Strictly speaking, main body can also have a sentinel probe if its source name doesn't equal binary name, like with the .llvm. suffix, but split function here would make it more obvious.
llvm/lib/MC/MCPseudoProbe.cpp
80–81	Done.

Renaming

Harbormaster completed remote builds in B194453: Diff 470872.Oct 26 2022, 12:30 PM

This revision was landed with ongoing or failed builds.Oct 27 2022, 1:28 PM

Closed by commit rGd5a963ab8b40: [PseudoProbe] Replace relocation with offset for entry probe. (authored by hoy). · Explain Why

This revision was automatically updated to reflect the committed changes.

hoy added a commit: rGd5a963ab8b40: [PseudoProbe] Replace relocation with offset for entry probe..

Herald added a reviewer: rafauler. · View Herald TranscriptOct 27 2022, 1:28 PM

Herald added a reviewer: Amir. · View Herald Transcript

Herald added a reviewer: maksfb. · View Herald Transcript

Herald added subscribers: yota9, ayermolo. · View Herald Transcript

Revision Contents

Path

Size

bolt/

lib/

Rewrite/

RewriteInstance.cpp

25 lines

llvm/

include/

llvm/

IR/

PseudoProbe.h

11 lines

MC/

MCObjectFileInfo.h

2 lines

MCPseudoProbe.h

53 lines

MCStreamer.h

3 lines

Transforms/

IPO/

SampleProfileProbe.h

2 lines

lib/

CodeGen/

AsmPrinter/

PseudoProbePrinter.cpp

3 lines

MC/

MCAsmStreamer.cpp

7 lines

MCObjectFileInfo.cpp

6 lines

MCParser/

AsmParser.cpp

8 lines

MCPseudoProbe.cpp

201 lines

MCStreamer.cpp

5 lines

test/

Transforms/

SampleProfile/

pseudo-probe-emit-inline.ll

14 lines

pseudo-probe-emit.ll

21 lines

tools/

llvm-profgen/

Inputs/

func-split.perfbin

inline-cs-pseudoprobe.perfbin

inline-force-dwarf.test

5 lines

tools/

llvm-profgen/

ProfileGenerator.cpp

1 line

ProfiledBinary.h

23 lines

ProfiledBinary.cpp

63 lines

Diff 471279

bolt/lib/Rewrite/RewriteInstance.cpp

Show First 20 Lines • Show All 593 Lines • ▼ Show 20 Lines	void RewriteInstance::parsePseudoProbe() {

StringRef Contents = PseudoProbeDescSection->getContents();		StringRef Contents = PseudoProbeDescSection->getContents();
if (!BC->ProbeDecoder.buildGUID2FuncDescMap(		if (!BC->ProbeDecoder.buildGUID2FuncDescMap(
reinterpret_cast<const uint8_t *>(Contents.data()),		reinterpret_cast<const uint8_t *>(Contents.data()),
Contents.size())) {		Contents.size())) {
errs() << "BOLT-WARNING: fail in building GUID2FuncDescMap\n";		errs() << "BOLT-WARNING: fail in building GUID2FuncDescMap\n";
return;		return;
}		}

		MCPseudoProbeDecoder::Uint64Set GuidFilter;
		MCPseudoProbeDecoder::Uint64Map FuncStartAddrs;
		for (const BinaryFunction *F : BC->getAllBinaryFunctions()) {
		for (const MCSymbol *Sym : F->getSymbols()) {
		FuncStartAddrs[Function::getGUID(NameResolver::restore(Sym->getName()))] =
		F->getAddress();
		}
		}
Contents = PseudoProbeSection->getContents();		Contents = PseudoProbeSection->getContents();
if (!BC->ProbeDecoder.buildAddress2ProbeMap(		if (!BC->ProbeDecoder.buildAddress2ProbeMap(
reinterpret_cast<const uint8_t *>(Contents.data()),		reinterpret_cast<const uint8_t *>(Contents.data()), Contents.size(),
Contents.size())) {		GuidFilter, FuncStartAddrs)) {
BC->ProbeDecoder.getAddress2ProbesMap().clear();		BC->ProbeDecoder.getAddress2ProbesMap().clear();
errs() << "BOLT-WARNING: fail in building Address2ProbeMap\n";		errs() << "BOLT-WARNING: fail in building Address2ProbeMap\n";
return;		return;
}		}

if (opts::PrintPseudoProbes == opts::PrintPseudoProbesOptions::PPP_All \|\|		if (opts::PrintPseudoProbes == opts::PrintPseudoProbesOptions::PPP_All \|\|
opts::PrintPseudoProbes ==		opts::PrintPseudoProbes ==
opts::PrintPseudoProbesOptions::PPP_Probes_Section_Decode) {		opts::PrintPseudoProbesOptions::PPP_Probes_Section_Decode) {
▲ Show 20 Lines • Show All 2,807 Lines • ▼ Show 20 Lines	auto EmitSLEB128IntValue = [&](int64_t Value) {
Contents.append(OSE.str().begin(), OSE.str().end());		Contents.append(OSE.str().begin(), OSE.str().end());
};		};

// Emit indiviual pseudo probes in a inline tree node		// Emit indiviual pseudo probes in a inline tree node
// Probe index, type, attribute, address type and address are encoded		// Probe index, type, attribute, address type and address are encoded
// Address of the first probe is absolute.		// Address of the first probe is absolute.
// Other probes' address are represented by delta		// Other probes' address are represented by delta
auto EmitDecodedPseudoProbe = [&](MCDecodedPseudoProbe *&CurProbe) {		auto EmitDecodedPseudoProbe = [&](MCDecodedPseudoProbe *&CurProbe) {
		assert(!isSentinelProbe(CurProbe->getAttributes()) &&
		"Sentinel probes should not be emitted");
EmitULEB128IntValue(CurProbe->getIndex());		EmitULEB128IntValue(CurProbe->getIndex());
uint8_t PackedType = CurProbe->getType() \| (CurProbe->getAttributes() << 4);		uint8_t PackedType = CurProbe->getType() \| (CurProbe->getAttributes() << 4);
uint8_t Flag =		uint8_t Flag =
LastProbe ? ((int8_t)MCPseudoProbeFlag::AddressDelta << 7) : 0;		LastProbe ? ((int8_t)MCPseudoProbeFlag::AddressDelta << 7) : 0;
EmitInt(Flag \| PackedType, 1);		EmitInt(Flag \| PackedType, 1);
if (LastProbe) {		if (LastProbe) {
// Emit the delta between the address label and LastProbe.		// Emit the delta between the address label and LastProbe.
int64_t Delta = CurProbe->getAddress() - LastProbe->getAddress();		int64_t Delta = CurProbe->getAddress() - LastProbe->getAddress();
▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	if (opts::PrintPseudoProbes == opts::PrintPseudoProbesOptions::PPP_All \|\|
opts::PrintPseudoProbesOptions::PPP_Encoded_Probes) {		opts::PrintPseudoProbesOptions::PPP_Encoded_Probes) {
// create a dummy decoder;		// create a dummy decoder;
MCPseudoProbeDecoder DummyDecoder;		MCPseudoProbeDecoder DummyDecoder;
StringRef DescContents = PseudoProbeDescSection->getContents();		StringRef DescContents = PseudoProbeDescSection->getContents();
DummyDecoder.buildGUID2FuncDescMap(		DummyDecoder.buildGUID2FuncDescMap(
reinterpret_cast<const uint8_t *>(DescContents.data()),		reinterpret_cast<const uint8_t *>(DescContents.data()),
DescContents.size());		DescContents.size());
StringRef ProbeContents = PseudoProbeSection->getOutputContents();		StringRef ProbeContents = PseudoProbeSection->getOutputContents();
		MCPseudoProbeDecoder::Uint64Set GuidFilter;
		MCPseudoProbeDecoder::Uint64Map FuncStartAddrs;
		for (const BinaryFunction *F : BC->getAllBinaryFunctions()) {
		const uint64_t Addr =
		F->isEmitted() ? F->getOutputAddress() : F->getAddress();
		FuncStartAddrs[Function::getGUID(
		NameResolver::restore(F->getOneName()))] = Addr;
		}
DummyDecoder.buildAddress2ProbeMap(		DummyDecoder.buildAddress2ProbeMap(
reinterpret_cast<const uint8_t *>(ProbeContents.data()),		reinterpret_cast<const uint8_t *>(ProbeContents.data()),
ProbeContents.size());		ProbeContents.size(), GuidFilter, FuncStartAddrs);
DummyDecoder.printProbesForAllAddresses(outs());		DummyDecoder.printProbesForAllAddresses(outs());
}		}
}		}

void RewriteInstance::updateSDTMarkers() {		void RewriteInstance::updateSDTMarkers() {
NamedRegionTimer T("updateSDTMarkers", "update SDT markers", TimerGroupName,		NamedRegionTimer T("updateSDTMarkers", "update SDT markers", TimerGroupName,
TimerGroupDesc, opts::TimeRewrite);		TimerGroupDesc, opts::TimeRewrite);

▲ Show 20 Lines • Show All 2,071 Lines • Show Last 20 Lines

llvm/include/llvm/IR/PseudoProbe.h

Show All 18 Lines
#include <limits>		#include <limits>

namespace llvm {		namespace llvm {

class Instruction;		class Instruction;

constexpr const char *PseudoProbeDescMetadataName = "llvm.pseudo_probe_desc";		constexpr const char *PseudoProbeDescMetadataName = "llvm.pseudo_probe_desc";

		enum class PseudoProbeReservedId { Invalid = 0, Last = Invalid };

enum class PseudoProbeType { Block = 0, IndirectCall, DirectCall };		enum class PseudoProbeType { Block = 0, IndirectCall, DirectCall };

		enum class PseudoProbeAttributes {
		Reserved = 0x1,
		Sentinel = 0x2, // A place holder for split function entry address.
		wenleiUnsubmitted Not Done Reply Inline Actions nit on comment: split function entry address? main body doesn't have it. wenlei: nit on comment: split function entry address? main body doesn't have it.
		hoyAuthorUnsubmitted Done Reply Inline Actions Sounds good. Strictly speaking, main body can also have a sentinel probe if its source name doesn't equal binary name, like with the .llvm. suffix, but split function here would make it more obvious. hoy: Sounds good. Strictly speaking, main body can also have a sentinel probe if its source name…
		};

// The saturated distrution factor representing 100% for block probes.		// The saturated distrution factor representing 100% for block probes.
constexpr static uint64_t PseudoProbeFullDistributionFactor =		constexpr static uint64_t PseudoProbeFullDistributionFactor =
std::numeric_limits<uint64_t>::max();		std::numeric_limits<uint64_t>::max();

struct PseudoProbeDwarfDiscriminator {		struct PseudoProbeDwarfDiscriminator {
public:		public:
// The following APIs encodes/decodes per-probe information to/from a		// The following APIs encodes/decodes per-probe information to/from a
// 32-bit integer which is organized as:		// 32-bit integer which is organized as:
Show All 38 Lines	struct PseudoProbe {
uint32_t Type;		uint32_t Type;
uint32_t Attr;		uint32_t Attr;
// Distribution factor that estimates the portion of the real execution count.		// Distribution factor that estimates the portion of the real execution count.
// A saturated distribution factor stands for 1.0 or 100%. A pesudo probe has		// A saturated distribution factor stands for 1.0 or 100%. A pesudo probe has
// a factor with the value ranged from 0.0 to 1.0.		// a factor with the value ranged from 0.0 to 1.0.
float Factor;		float Factor;
};		};

		static inline bool isSentinelProbe(uint32_t Flags) {
		return Flags & (uint32_t)PseudoProbeAttributes::Sentinel;
		}

Optional<PseudoProbe> extractProbe(const Instruction &Inst);		Optional<PseudoProbe> extractProbe(const Instruction &Inst);

void setProbeDistributionFactor(Instruction &Inst, float Factor);		void setProbeDistributionFactor(Instruction &Inst, float Factor);
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_IR_PSEUDOPROBE_H		#endif // LLVM_IR_PSEUDOPROBE_H

llvm/include/llvm/MC/MCObjectFileInfo.h

Show First 20 Lines • Show All 356 Lines • ▼ Show 20 Lines	public:
MCSection *getRemarksSection() const { return RemarksSection; }		MCSection *getRemarksSection() const { return RemarksSection; }

MCSection *getStackSizesSection(const MCSection &TextSec) const;		MCSection *getStackSizesSection(const MCSection &TextSec) const;

MCSection *getBBAddrMapSection(const MCSection &TextSec) const;		MCSection *getBBAddrMapSection(const MCSection &TextSec) const;

MCSection *getKCFITrapSection(const MCSection &TextSec) const;		MCSection *getKCFITrapSection(const MCSection &TextSec) const;

MCSection getPseudoProbeSection(const MCSection TextSec) const;		MCSection *getPseudoProbeSection(const MCSection &TextSec) const;

MCSection *getPseudoProbeDescSection(StringRef FuncName) const;		MCSection *getPseudoProbeDescSection(StringRef FuncName) const;

MCSection getPCSection(StringRef Name, const MCSection TextSec) const;		MCSection getPCSection(StringRef Name, const MCSection TextSec) const;

// ELF specific sections.		// ELF specific sections.
MCSection *getDataRelROSection() const { return DataRelROSection; }		MCSection *getDataRelROSection() const { return DataRelROSection; }
const MCSection *getMergeableConst4Section() const {		const MCSection *getMergeableConst4Section() const {
▲ Show 20 Lines • Show All 124 Lines • Show Last 20 Lines

llvm/include/llvm/MC/MCPseudoProbe.h

//===- MCPseudoProbe.h - Pseudo probe encoding support ---------- C++ --===//		//===- MCPseudoProbe.h - Pseudo probe encoding support ---------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file contains the declaration of the MCPseudoProbe to support the pseudo		// This file contains the declaration of the MCPseudoProbe to support the pseudo
// probe encoding for AutoFDO. Pseudo probes together with their inline context		// probe encoding for AutoFDO. Pseudo probes together with their inline context
// are encoded in a DFS recursive way in the .pseudoprobe sections. For each		// are encoded in a DFS recursive way in the .pseudoprobe sections. For each
// .pseudoprobe section, the encoded binary data consist of a single or mutiple		// .pseudoprobe section, the encoded binary data consist of a single or mutiple
// function records each for one outlined function. A function record has the		// function records each for one outlined function. A function record has the
// following format :		// following format :
//		//
// FUNCTION BODY (one for each outlined function present in the text section)		// FUNCTION BODY (one for each outlined function present in the text section)
// GUID (uint64)		// GUID (uint64)
// GUID of the function		// GUID of the function's source name which may be different from the
		// actual binary linkage name. This GUID will be used to decode and
		// generate a profile against the source function name.
		wenleiUnsubmitted Not Done Reply Inline Actions Have you considered removing this GUID field, but always require a sentinel probe for any (split) binary function including the main body? wenlei: Have you considered removing this GUID field, but always require a sentinel probe for any…
		hoyAuthorUnsubmitted Done Reply Inline Actions The GUID field here is the GUID of the source/dwarf name, which may be different from the actually symbol linkage name, even for the main body of the function. The source GUID will be used to decode and generate a profile against the source function name. hoy: The GUID field here is the GUID of the source/dwarf name, which may be different from the…
		wenleiUnsubmitted Not Done Reply Inline Actions Ok, makes sense. Maybe clarify this in this comment section as well? wenlei: Ok, makes sense. Maybe clarify this in this comment section as well?
		hoyAuthorUnsubmitted Done Reply Inline Actions Done. hoy: Done.
// NPROBES (ULEB128)		// NPROBES (ULEB128)
// Number of probes originating from this function.		// Number of probes originating from this function.
// NUM_INLINED_FUNCTIONS (ULEB128)		// NUM_INLINED_FUNCTIONS (ULEB128)
// Number of callees inlined into this function, aka number of		// Number of callees inlined into this function, aka number of
// first-level inlinees		// first-level inlinees
// PROBE RECORDS		// PROBE RECORDS
// A list of NPROBES entries. Each entry contains:		// A list of NPROBES entries. Each entry contains:
// INDEX (ULEB128)		// INDEX (ULEB128)
// TYPE (uint4)		// TYPE (uint4)
// 0 - block probe, 1 - indirect call, 2 - direct call		// 0 - block probe, 1 - indirect call, 2 - direct call
// ATTRIBUTE (uint3)		// ATTRIBUTE (uint3)
// 1 - reserved		// 1 - reserved
// ADDRESS_TYPE (uint1)		// ADDRESS_TYPE (uint1)
// 0 - code address, 1 - address delta		// 0 - code address for regular probes (for downwards compatibility)
		// - GUID of linkage name for sentinel probes
		// 1 - address delta
		wenleiUnsubmitted Not Done Reply Inline Actions I thought we now have GUID for symbol name for sentinel, and address offset for non-sentinel. What does regular probe that doesn't use offset refer to? wenlei: I thought we now have GUID for symbol name for sentinel, and address offset for non-sentinel.
		hoyAuthorUnsubmitted Done Reply Inline Actions This is mainly for downwards compatibility, i.e, to have the new llvm-profgen handle old binaries. Also Bolt re-encoding uses the old encoding scheme for simplicity. hoy: This is mainly for downwards compatibility, i.e, to have the new llvm-profgen handle old…
		wenleiUnsubmitted Not Done Reply Inline Actions How about adding a TODO comment to remove this later? wenlei: How about adding a TODO comment to remove this later?
		hoyAuthorUnsubmitted Done Reply Inline Actions Sounds good. hoy: Sounds good.
// CODE_ADDRESS (uint64 or ULEB128)		// CODE_ADDRESS (uint64 or ULEB128)
// code address or address delta, depending on ADDRESS_TYPE		// code address or address delta, depending on ADDRESS_TYPE
// INLINED FUNCTION RECORDS		// INLINED FUNCTION RECORDS
// A list of NUM_INLINED_FUNCTIONS entries describing each of the inlined		// A list of NUM_INLINED_FUNCTIONS entries describing each of the inlined
// callees. Each record contains:		// callees. Each record contains:
// INLINE SITE		// INLINE SITE
// ID of the callsite probe (ULEB128)		// ID of the callsite probe (ULEB128)
// FUNCTION BODY		// FUNCTION BODY
// A FUNCTION BODY entry describing the inlined function.		// A FUNCTION BODY entry describing the inlined function.
		//
		// TODO: retire the ADDRESS_TYPE encoding for code addresses once compatibility
		// is no longer an issue.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_MC_MCPSEUDOPROBE_H		#ifndef LLVM_MC_MCPSEUDOPROBE_H
#define LLVM_MC_MCPSEUDOPROBE_H		#define LLVM_MC_MCPSEUDOPROBE_H

		#include "llvm/ADT/DenseSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/IR/PseudoProbe.h"		#include "llvm/IR/PseudoProbe.h"
#include "llvm/Support/ErrorOr.h"		#include "llvm/Support/ErrorOr.h"
#include <list>		#include <list>
#include <map>		#include <map>
#include <memory>		#include <memory>
#include <string>		#include <string>
▲ Show 20 Lines • Show All 216 Lines • ▼ Show 20 Lines	public:
MCDecodedPseudoProbeInlineTree(const InlineSite &Site) : ISite(Site){};		MCDecodedPseudoProbeInlineTree(const InlineSite &Site) : ISite(Site){};

// Return false if it's a dummy inline site		// Return false if it's a dummy inline site
bool hasInlineSite() const { return !isRoot() && !Parent->isRoot(); }		bool hasInlineSite() const { return !isRoot() && !Parent->isRoot(); }
};		};

/// Instances of this class represent the pseudo probes inserted into a compile		/// Instances of this class represent the pseudo probes inserted into a compile
/// unit.		/// unit.
class MCPseudoProbeSection {		class MCPseudoProbeSections {
		wenleiUnsubmitted Not Done Reply Inline Actions do you want to rename this to MCPseudoProbeFunctions? wenlei: do you want to rename this to MCPseudoProbeFunctions?
		hoyAuthorUnsubmitted Done Reply Inline Actions This class kind of tells how probe encoding looks physically. Probes of a function will stay in a standalone section (comdat) and that's why section is used here. I'd like to keep it as is, but don't have a strong preference. WDYT? hoy: This class kind of tells how probe encoding looks physically. Probes of a function will stay in…
public:		public:
void addPseudoProbe(MCSection *Sec, const MCPseudoProbe &Probe,		void addPseudoProbe(MCSymbol *FuncSym, const MCPseudoProbe &Probe,
const MCPseudoProbeInlineStack &InlineStack) {		const MCPseudoProbeInlineStack &InlineStack) {
MCProbeDivisions[Sec].addPseudoProbe(Probe, InlineStack);		MCProbeDivisions[FuncSym].addPseudoProbe(Probe, InlineStack);
}		}

// TODO: Sort by getOrdinal to ensure a determinstic section order		// TODO: Sort by getOrdinal to ensure a determinstic section order
using MCProbeDivisionMap = std::map<MCSection *, MCPseudoProbeInlineTree>;		using MCProbeDivisionMap = std::map<MCSymbol *, MCPseudoProbeInlineTree>;

private:		private:
// A collection of MCPseudoProbe for each text section. The MCPseudoProbes		// A collection of MCPseudoProbe for each function. The MCPseudoProbes are
// are grouped by GUID of the functions where they are from and will be		// grouped by GUIDs due to inlining that can bring probes from different
		rafaulerUnsubmitted Done Reply Inline Actions Is this over 80col? rafauler: Is this over 80col?
		hoyAuthorUnsubmitted Done Reply Inline Actions Yes, it is. Fixed. hoy: Yes, it is. Fixed.
// encoded by groups. In the comdat scenario where a text section really only		// functions into one function.
// contains the code of a function solely, the probes associated with a comdat
// function are still grouped by GUIDs due to inlining that can bring probes
// from different functions into one function.
MCProbeDivisionMap MCProbeDivisions;		MCProbeDivisionMap MCProbeDivisions;

public:		public:
const MCProbeDivisionMap &getMCProbes() const { return MCProbeDivisions; }		const MCProbeDivisionMap &getMCProbes() const { return MCProbeDivisions; }

bool empty() const { return MCProbeDivisions.empty(); }		bool empty() const { return MCProbeDivisions.empty(); }

void emit(MCObjectStreamer *MCOS);		void emit(MCObjectStreamer *MCOS);
};		};

class MCPseudoProbeTable {		class MCPseudoProbeTable {
// A collection of MCPseudoProbe in the current module grouped by text		// A collection of MCPseudoProbe in the current module grouped by
// sections. MCPseudoProbes will be encoded into a corresponding		// functions. MCPseudoProbes will be encoded into a corresponding
// .pseudoprobe section. With functions emitted as separate comdats,		// .pseudoprobe section. With functions emitted as separate comdats,
// a text section really only contains the code of a function solely, and the		// a text section really only contains the code of a function solely, and the
// probes associated with the text section will be emitted into a standalone		// probes associated with the text section will be emitted into a standalone
// .pseudoprobe section that shares the same comdat group with the function.		// .pseudoprobe section that shares the same comdat group with the function.
MCPseudoProbeSection MCProbeSections;		MCPseudoProbeSections MCProbeSections;

public:		public:
static void emit(MCObjectStreamer *MCOS);		static void emit(MCObjectStreamer *MCOS);

MCPseudoProbeSection &getProbeSections() { return MCProbeSections; }		MCPseudoProbeSections &getProbeSections() { return MCProbeSections; }

#ifndef NDEBUG		#ifndef NDEBUG
static int DdgPrintIndent;		static int DdgPrintIndent;
#endif		#endif
};		};

class MCPseudoProbeDecoder {		class MCPseudoProbeDecoder {
// GUID to PseudoProbeFuncDesc map.		// GUID to PseudoProbeFuncDesc map.
Show All 9 Lines	class MCPseudoProbeDecoder {
MCDecodedPseudoProbeInlineTree DummyInlineRoot;		MCDecodedPseudoProbeInlineTree DummyInlineRoot;

/// Points to the current location in the buffer.		/// Points to the current location in the buffer.
const uint8_t *Data = nullptr;		const uint8_t *Data = nullptr;

/// Points to the end of the buffer.		/// Points to the end of the buffer.
const uint8_t *End = nullptr;		const uint8_t *End = nullptr;

		/// Whether encoding is based on a starting probe with absolute code address.
		bool EncodingIsAddrBased = false;

// Decoding helper function		// Decoding helper function
template <typename T> ErrorOr<T> readUnencodedNumber();		template <typename T> ErrorOr<T> readUnencodedNumber();
template <typename T> ErrorOr<T> readUnsignedNumber();		template <typename T> ErrorOr<T> readUnsignedNumber();
template <typename T> ErrorOr<T> readSignedNumber();		template <typename T> ErrorOr<T> readSignedNumber();
ErrorOr<StringRef> readString(uint32_t Size);		ErrorOr<StringRef> readString(uint32_t Size);

public:		public:
		using Uint64Set = DenseSet<uint64_t>;
		using Uint64Map = DenseMap<uint64_t, uint64_t>;

// Decode pseudo_probe_desc section to build GUID to PseudoProbeFuncDesc map.		// Decode pseudo_probe_desc section to build GUID to PseudoProbeFuncDesc map.
bool buildGUID2FuncDescMap(const uint8_t *Start, std::size_t Size);		bool buildGUID2FuncDescMap(const uint8_t *Start, std::size_t Size);

// Decode pseudo_probe section to build address to probes map.
bool buildAddress2ProbeMap(const uint8_t *Start, std::size_t Size);

// Decode pseudo_probe section to build address to probes map for specifed		// Decode pseudo_probe section to build address to probes map for specifed
// functions only.		// functions only.
bool buildAddress2ProbeMap(const uint8_t *Start, std::size_t Size,		bool buildAddress2ProbeMap(const uint8_t *Start, std::size_t Size,
std::unordered_set<uint64_t> &GuildFilter);		const Uint64Set &GuildFilter,
		const Uint64Map &FuncStartAddrs);

bool buildAddress2ProbeMap(MCDecodedPseudoProbeInlineTree *Cur,		bool buildAddress2ProbeMap(MCDecodedPseudoProbeInlineTree *Cur,
uint64_t &LastAddr,		uint64_t &LastAddr, const Uint64Set &GuildFilter,
std::unordered_set<uint64_t> &GuildFilter);		const Uint64Map &FuncStartAddrs);

// Print pseudo_probe_desc section info		// Print pseudo_probe_desc section info
void printGUID2FuncDescMap(raw_ostream &OS);		void printGUID2FuncDescMap(raw_ostream &OS);

// Print pseudo_probe section info, used along with show-disassembly		// Print pseudo_probe section info, used along with show-disassembly
void printProbeForAddress(raw_ostream &OS, uint64_t Address);		void printProbeForAddress(raw_ostream &OS, uint64_t Address);

// do printProbeForAddress for all addresses		// do printProbeForAddress for all addresses
Show All 40 Lines

llvm/include/llvm/MC/MCStreamer.h

Show First 20 Lines • Show All 1,099 Lines • ▼ Show 20 Lines	public:
virtual void emitAddrsigSym(const MCSymbol *Sym) {}		virtual void emitAddrsigSym(const MCSymbol *Sym) {}

/// Emit the given \p Instruction into the current section.		/// Emit the given \p Instruction into the current section.
virtual void emitInstruction(const MCInst &Inst, const MCSubtargetInfo &STI);		virtual void emitInstruction(const MCInst &Inst, const MCSubtargetInfo &STI);

/// Emit the a pseudo probe into the current section.		/// Emit the a pseudo probe into the current section.
virtual void emitPseudoProbe(uint64_t Guid, uint64_t Index, uint64_t Type,		virtual void emitPseudoProbe(uint64_t Guid, uint64_t Index, uint64_t Type,
uint64_t Attr,		uint64_t Attr,
const MCPseudoProbeInlineStack &InlineStack);		const MCPseudoProbeInlineStack &InlineStack,
		MCSymbol *FnSym);

/// Set the bundle alignment mode from now on in the section.		/// Set the bundle alignment mode from now on in the section.
/// The argument is the power of 2 to which the alignment is set. The		/// The argument is the power of 2 to which the alignment is set. The
/// value 0 means turn the bundle alignment off.		/// value 0 means turn the bundle alignment off.
virtual void emitBundleAlignMode(unsigned AlignPow2);		virtual void emitBundleAlignMode(unsigned AlignPow2);

/// The following instructions are a bundle-locked group.		/// The following instructions are a bundle-locked group.
///		///
▲ Show 20 Lines • Show All 57 Lines • Show Last 20 Lines

llvm/include/llvm/Transforms/IPO/SampleProfileProbe.h

	Show All 35 Lines
	using BlockIdMap = std::unordered_map<BasicBlock *, uint32_t>;			using BlockIdMap = std::unordered_map<BasicBlock *, uint32_t>;
	using InstructionIdMap = std::unordered_map<Instruction *, uint32_t>;			using InstructionIdMap = std::unordered_map<Instruction *, uint32_t>;
	// Map from tuples of Probe id and inline stack hash code to distribution			// Map from tuples of Probe id and inline stack hash code to distribution
	// factors.			// factors.
	using ProbeFactorMap = std::unordered_map<std::pair<uint64_t, uint64_t>, float,			using ProbeFactorMap = std::unordered_map<std::pair<uint64_t, uint64_t>, float,
	pair_hash<uint64_t, uint64_t>>;			pair_hash<uint64_t, uint64_t>>;
	using FuncProbeFactorMap = StringMap<ProbeFactorMap>;			using FuncProbeFactorMap = StringMap<ProbeFactorMap>;

	enum class PseudoProbeReservedId { Invalid = 0, Last = Invalid };

	class PseudoProbeDescriptor {			class PseudoProbeDescriptor {
	uint64_t FunctionGUID;			uint64_t FunctionGUID;
	uint64_t FunctionHash;			uint64_t FunctionHash;

	public:			public:
	PseudoProbeDescriptor(uint64_t GUID, uint64_t Hash)			PseudoProbeDescriptor(uint64_t GUID, uint64_t Hash)
	: FunctionGUID(GUID), FunctionHash(Hash) {}			: FunctionGUID(GUID), FunctionHash(Hash) {}
	uint64_t getFunctionGUID() const { return FunctionGUID; }			uint64_t getFunctionGUID() const { return FunctionGUID; }
	▲ Show 20 Lines • Show All 111 Lines • Show Last 20 Lines

llvm/lib/CodeGen/AsmPrinter/PseudoProbePrinter.cpp

Show First 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	if (!CallerGuid)
CallerGuid = Function::getGUID(Name);		CallerGuid = Function::getGUID(Name);
uint64_t CallerProbeId = PseudoProbeDwarfDiscriminator::extractProbeIndex(		uint64_t CallerProbeId = PseudoProbeDwarfDiscriminator::extractProbeIndex(
InlinedAt->getDiscriminator());		InlinedAt->getDiscriminator());
ReversedInlineStack.emplace_back(CallerGuid, CallerProbeId);		ReversedInlineStack.emplace_back(CallerGuid, CallerProbeId);
InlinedAt = InlinedAt->getInlinedAt();		InlinedAt = InlinedAt->getInlinedAt();
}		}

SmallVector<InlineSite, 8> InlineStack(llvm::reverse(ReversedInlineStack));		SmallVector<InlineSite, 8> InlineStack(llvm::reverse(ReversedInlineStack));
Asm->OutStreamer->emitPseudoProbe(Guid, Index, Type, Attr, InlineStack);		Asm->OutStreamer->emitPseudoProbe(Guid, Index, Type, Attr, InlineStack,
		Asm->CurrentFnSym);
}		}

llvm/lib/MC/MCAsmStreamer.cpp

Show First 20 Lines • Show All 372 Lines • ▼ Show 20 Lines	public:

void emitCGProfileEntry(const MCSymbolRefExpr *From,		void emitCGProfileEntry(const MCSymbolRefExpr *From,
const MCSymbolRefExpr *To, uint64_t Count) override;		const MCSymbolRefExpr *To, uint64_t Count) override;

void emitInstruction(const MCInst &Inst, const MCSubtargetInfo &STI) override;		void emitInstruction(const MCInst &Inst, const MCSubtargetInfo &STI) override;

void emitPseudoProbe(uint64_t Guid, uint64_t Index, uint64_t Type,		void emitPseudoProbe(uint64_t Guid, uint64_t Index, uint64_t Type,
uint64_t Attr,		uint64_t Attr,
const MCPseudoProbeInlineStack &InlineStack) override;		const MCPseudoProbeInlineStack &InlineStack, MCSymbol *FnSym) override;

void emitBundleAlignMode(unsigned AlignPow2) override;		void emitBundleAlignMode(unsigned AlignPow2) override;
void emitBundleLock(bool AlignToEnd) override;		void emitBundleLock(bool AlignToEnd) override;
void emitBundleUnlock() override;		void emitBundleUnlock() override;

Optional<std::pair<bool, std::string>>		Optional<std::pair<bool, std::string>>
emitRelocDirective(const MCExpr &Offset, StringRef Name, const MCExpr *Expr,		emitRelocDirective(const MCExpr &Offset, StringRef Name, const MCExpr *Expr,
SMLoc Loc, const MCSubtargetInfo &STI) override;		SMLoc Loc, const MCSubtargetInfo &STI) override;
▲ Show 20 Lines • Show All 1,943 Lines • ▼ Show 20 Lines	void MCAsmStreamer::emitInstruction(const MCInst &Inst,
if (Comments.size() && Comments.back() != '\n')		if (Comments.size() && Comments.back() != '\n')
getCommentOS() << "\n";		getCommentOS() << "\n";

EmitEOL();		EmitEOL();
}		}

void MCAsmStreamer::emitPseudoProbe(		void MCAsmStreamer::emitPseudoProbe(
uint64_t Guid, uint64_t Index, uint64_t Type, uint64_t Attr,		uint64_t Guid, uint64_t Index, uint64_t Type, uint64_t Attr,
const MCPseudoProbeInlineStack &InlineStack) {		const MCPseudoProbeInlineStack &InlineStack, MCSymbol *FnSym) {
OS << "\t.pseudoprobe\t" << Guid << " " << Index << " " << Type << " "		OS << "\t.pseudoprobe\t" << Guid << " " << Index << " " << Type << " "
<< Attr;		<< Attr;
// Emit inline stack like		// Emit inline stack like
// @ GUIDmain:3 @ GUIDCaller:1 @ GUIDDirectCaller:11		// @ GUIDmain:3 @ GUIDCaller:1 @ GUIDDirectCaller:11
for (const auto &Site : InlineStack)		for (const auto &Site : InlineStack)
OS << " @ " << std::get<0>(Site) << ":" << std::get<1>(Site);		OS << " @ " << std::get<0>(Site) << ":" << std::get<1>(Site);

		OS << " " << FnSym->getName();

EmitEOL();		EmitEOL();
}		}

void MCAsmStreamer::emitBundleAlignMode(unsigned AlignPow2) {		void MCAsmStreamer::emitBundleAlignMode(unsigned AlignPow2) {
OS << "\t.bundle_align_mode " << AlignPow2;		OS << "\t.bundle_align_mode " << AlignPow2;
EmitEOL();		EmitEOL();
}		}

▲ Show 20 Lines • Show All 213 Lines • Show Last 20 Lines

llvm/lib/MC/MCObjectFileInfo.cpp

Show First 20 Lines • Show All 1,154 Lines • ▼ Show 20 Lines	MCObjectFileInfo::getKCFITrapSection(const MCSection &TextSec) const {

return Ctx->getELFSection(".kcfi_traps", ELF::SHT_PROGBITS, Flags, 0,		return Ctx->getELFSection(".kcfi_traps", ELF::SHT_PROGBITS, Flags, 0,
GroupName,		GroupName,
/IsComdat=/true, ElfSec.getUniqueID(),		/IsComdat=/true, ElfSec.getUniqueID(),
cast<MCSymbolELF>(TextSec.getBeginSymbol()));		cast<MCSymbolELF>(TextSec.getBeginSymbol()));
}		}

MCSection *		MCSection *
MCObjectFileInfo::getPseudoProbeSection(const MCSection *TextSec) const {		MCObjectFileInfo::getPseudoProbeSection(const MCSection &TextSec) const {
if (Ctx->getObjectFileType() == MCContext::IsELF) {		if (Ctx->getObjectFileType() == MCContext::IsELF) {
const auto ElfSec = static_cast<const MCSectionELF >(TextSec);		const auto &ElfSec = static_cast<const MCSectionELF &>(TextSec);
// Create a separate section for probes that comes with a comdat function.		// Create a separate section for probes that comes with a comdat function.
if (const MCSymbol *Group = ElfSec->getGroup()) {		if (const MCSymbol *Group = ElfSec.getGroup()) {
auto S = static_cast<MCSectionELF >(PseudoProbeSection);		auto S = static_cast<MCSectionELF >(PseudoProbeSection);
auto Flags = S->getFlags() \| ELF::SHF_GROUP;		auto Flags = S->getFlags() \| ELF::SHF_GROUP;
return Ctx->getELFSection(S->getName(), S->getType(), Flags,		return Ctx->getELFSection(S->getName(), S->getType(), Flags,
S->getEntrySize(), Group->getName(),		S->getEntrySize(), Group->getName(),
/IsComdat=/true);		/IsComdat=/true);
}		}
}		}
return PseudoProbeSection;		return PseudoProbeSection;
▲ Show 20 Lines • Show All 47 Lines • Show Last 20 Lines

llvm/lib/MC/MCParser/AsmParser.cpp

Show First 20 Lines • Show All 5,898 Lines • ▼ Show 20 Lines	if (getLexer().is(AsmToken::Integer)) {
"unexpected token in '.pseudoprobe' directive"))		"unexpected token in '.pseudoprobe' directive"))
return true;		return true;
}		}

InlineSite Site(CallerGuid, CallerProbeId);		InlineSite Site(CallerGuid, CallerProbeId);
InlineStack.push_back(Site);		InlineStack.push_back(Site);
}		}

		// Parse function entry name
		StringRef FnName;
		if (parseIdentifier(FnName))
		return Error(getLexer().getLoc(), "unexpected token in '.pseudoprobe' directive");
		MCSymbol *FnSym = getContext().lookupSymbol(FnName);

if (parseEOL())		if (parseEOL())
return true;		return true;

getStreamer().emitPseudoProbe(Guid, Index, Type, Attr, InlineStack);		getStreamer().emitPseudoProbe(Guid, Index, Type, Attr, InlineStack, FnSym);
return false;		return false;
}		}

/// parseDirectiveLTODiscard		/// parseDirectiveLTODiscard
/// ::= ".lto_discard" [ identifier ( , identifier )* ]		/// ::= ".lto_discard" [ identifier ( , identifier )* ]
/// The LTO library emits this directive to discard non-prevailing symbols.		/// The LTO library emits this directive to discard non-prevailing symbols.
/// We ignore symbol assignments and attribute changes for the specified		/// We ignore symbol assignments and attribute changes for the specified
/// symbols.		/// symbols.
▲ Show 20 Lines • Show All 526 Lines • Show Last 20 Lines

llvm/lib/MC/MCPseudoProbe.cpp

//===- lib/MC/MCPseudoProbe.cpp - Pseudo probe encoding support ----------===//		//===- lib/MC/MCPseudoProbe.cpp - Pseudo probe encoding support ----------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/MC/MCPseudoProbe.h"		#include "llvm/MC/MCPseudoProbe.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/MC/MCAsmInfo.h"		#include "llvm/MC/MCAsmInfo.h"
#include "llvm/MC/MCContext.h"		#include "llvm/MC/MCContext.h"
#include "llvm/MC/MCExpr.h"		#include "llvm/MC/MCExpr.h"
#include "llvm/MC/MCFragment.h"		#include "llvm/MC/MCFragment.h"
#include "llvm/MC/MCObjectFileInfo.h"		#include "llvm/MC/MCObjectFileInfo.h"
#include "llvm/MC/MCObjectStreamer.h"		#include "llvm/MC/MCObjectStreamer.h"
		#include "llvm/MC/MCSymbol.h"
#include "llvm/Support/Endian.h"		#include "llvm/Support/Endian.h"
#include "llvm/Support/LEB128.h"		#include "llvm/Support/LEB128.h"
		#include "llvm/Support/MD5.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
		#include <algorithm>
		#include <cassert>
#include <limits>		#include <limits>
#include <memory>		#include <memory>
#include <sstream>		#include <sstream>
		#include <vector>

#define DEBUG_TYPE "mcpseudoprobe"		#define DEBUG_TYPE "mcpseudoprobe"

using namespace llvm;		using namespace llvm;
using namespace support;		using namespace support;

#ifndef NDEBUG		#ifndef NDEBUG
int MCPseudoProbeTable::DdgPrintIndent = 0;		int MCPseudoProbeTable::DdgPrintIndent = 0;
#endif		#endif

static const MCExpr buildSymbolDiff(MCObjectStreamer MCOS, const MCSymbol *A,		static const MCExpr buildSymbolDiff(MCObjectStreamer MCOS, const MCSymbol *A,
const MCSymbol *B) {		const MCSymbol *B) {
MCContext &Context = MCOS->getContext();		MCContext &Context = MCOS->getContext();
MCSymbolRefExpr::VariantKind Variant = MCSymbolRefExpr::VK_None;		MCSymbolRefExpr::VariantKind Variant = MCSymbolRefExpr::VK_None;
const MCExpr *ARef = MCSymbolRefExpr::create(A, Variant, Context);		const MCExpr *ARef = MCSymbolRefExpr::create(A, Variant, Context);
const MCExpr *BRef = MCSymbolRefExpr::create(B, Variant, Context);		const MCExpr *BRef = MCSymbolRefExpr::create(B, Variant, Context);
const MCExpr *AddrDelta =		const MCExpr *AddrDelta =
MCBinaryExpr::create(MCBinaryExpr::Sub, ARef, BRef, Context);		MCBinaryExpr::create(MCBinaryExpr::Sub, ARef, BRef, Context);
return AddrDelta;		return AddrDelta;
}		}

void MCPseudoProbe::emit(MCObjectStreamer *MCOS,		void MCPseudoProbe::emit(MCObjectStreamer *MCOS,
const MCPseudoProbe *LastProbe) const {		const MCPseudoProbe *LastProbe) const {
		bool IsSentinel = isSentinelProbe(getAttributes());
		assert((LastProbe \|\| IsSentinel) &&
		wenleiUnsubmitted Done Reply Inline Actions Did you mean `assert((LastProbe \|\| IsSentinel) && "...")`? wenlei: Did you mean `assert((LastProbe \|\| IsSentinel) && "...")`?
		hoyAuthorUnsubmitted Done Reply Inline Actions Good catch. Yeah, that's what I actually want. hoy: Good catch. Yeah, that's what I actually want.
		"Last probe should not be null for non-sentinel probes");

// Emit Index		// Emit Index
MCOS->emitULEB128IntValue(Index);		MCOS->emitULEB128IntValue(Index);
// Emit Type and the flag:		// Emit Type and the flag:
// Type (bit 0 to 3), with bit 4 to 6 for attributes.		// Type (bit 0 to 3), with bit 4 to 6 for attributes.
// Flag (bit 7, 0 - code address, 1 - address delta). This indicates whether		// Flag (bit 7, 0 - code address, 1 - address delta). This indicates whether
// the following field is a symbolic code address or an address delta.		// the following field is a symbolic code address or an address delta.
assert(Type <= 0xF && "Probe type too big to encode, exceeding 15");		assert(Type <= 0xF && "Probe type too big to encode, exceeding 15");
assert(Attributes <= 0x7 &&		assert(Attributes <= 0x7 &&
"Probe attributes too big to encode, exceeding 7");		"Probe attributes too big to encode, exceeding 7");
uint8_t PackedType = Type \| (Attributes << 4);		uint8_t PackedType = Type \| (Attributes << 4);
uint8_t Flag = LastProbe ? ((int8_t)MCPseudoProbeFlag::AddressDelta << 7) : 0;		uint8_t Flag =
		!IsSentinel ? ((int8_t)MCPseudoProbeFlag::AddressDelta << 7) : 0;
MCOS->emitInt8(Flag \| PackedType);		MCOS->emitInt8(Flag \| PackedType);

if (LastProbe) {		if (!IsSentinel) {
// Emit the delta between the address label and LastProbe.		// Emit the delta between the address label and LastProbe.
const MCExpr *AddrDelta =		const MCExpr *AddrDelta =
buildSymbolDiff(MCOS, Label, LastProbe->getLabel());		buildSymbolDiff(MCOS, Label, LastProbe->getLabel());
int64_t Delta;		int64_t Delta;
if (AddrDelta->evaluateAsAbsolute(Delta, MCOS->getAssemblerPtr())) {		if (AddrDelta->evaluateAsAbsolute(Delta, MCOS->getAssemblerPtr())) {
MCOS->emitSLEB128IntValue(Delta);		MCOS->emitSLEB128IntValue(Delta);
} else {		} else {
MCOS->insert(new MCPseudoProbeAddrFragment(AddrDelta));		MCOS->insert(new MCPseudoProbeAddrFragment(AddrDelta));
}		}
} else {		} else {
// Emit label as a symbolic code address.		// Emit the GUID of the split function that the sentinel probe represents.
MCOS->emitSymbolValue(		MCOS->emitInt64(Guid);
		wenleiUnsubmitted Done Reply Inline Actions To be accurate, this is the GUID of the (split) binary function name / elf symbol name? wenlei: To be accurate, this is the GUID of the (split) binary function name / elf symbol name?
		hoyAuthorUnsubmitted Done Reply Inline Actions Exactly. hoy: Exactly.
		wenleiUnsubmitted Not Done Reply Inline Actions nit on comment: function -> split function? wenlei: nit on comment: function -> split function?
		hoyAuthorUnsubmitted Done Reply Inline Actions Done. hoy: Done.
Label, MCOS->getContext().getAsmInfo()->getCodePointerSize());
}		}

LLVM_DEBUG({		LLVM_DEBUG({
dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);		dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);
dbgs() << "Probe: " << Index << "\n";		dbgs() << "Probe: " << Index << "\n";
});		});
}		}

void MCPseudoProbeInlineTree::addPseudoProbe(		void MCPseudoProbeInlineTree::addPseudoProbe(
const MCPseudoProbe &Probe, const MCPseudoProbeInlineStack &InlineStack) {		const MCPseudoProbe &Probe, const MCPseudoProbeInlineStack &InlineStack) {
// The function should not be called on the root.		// The function should not be called on the root.
assert(isRoot() && "Should not be called on root");		assert(isRoot() && "Should only be called on root");

// When it comes here, the input look like:		// When it comes here, the input look like:
// Probe: GUID of C, ...		// Probe: GUID of C, ...
// InlineStack: [88, A], [66, B]		// InlineStack: [88, A], [66, B]
// which means, Function A inlines function B at call site with a probe id of		// which means, Function A inlines function B at call site with a probe id of
// 88, and B inlines C at probe 66. The tri-tree expects a tree path like {[0,		// 88, and B inlines C at probe 66. The tri-tree expects a tree path like {[0,
// A], [88, B], [66, C]} to locate the tree node where the probe should be		// A], [88, B], [66, C]} to locate the tree node where the probe should be
// added. Note that the edge [0, A] means A is the top-level function we are		// added. Note that the edge [0, A] means A is the top-level function we are
Show All 30 Lines

void MCPseudoProbeInlineTree::emit(MCObjectStreamer *MCOS,		void MCPseudoProbeInlineTree::emit(MCObjectStreamer *MCOS,
const MCPseudoProbe *&LastProbe) {		const MCPseudoProbe *&LastProbe) {
LLVM_DEBUG({		LLVM_DEBUG({
dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);		dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);
dbgs() << "Group [\n";		dbgs() << "Group [\n";
MCPseudoProbeTable::DdgPrintIndent += 2;		MCPseudoProbeTable::DdgPrintIndent += 2;
});		});
		assert(!isRoot() && "Root should be handled seperately");
		wenleiUnsubmitted Done Reply Inline Actions nit: this message doesn't provide additional info other than the assertion itself. the message could be something more meaningful - why we don't expect root node here wenlei: nit: this message doesn't provide additional info other than the assertion itself. the message…
		hoyAuthorUnsubmitted Done Reply Inline Actions Fixed. hoy: Fixed.

// Emit probes grouped by GUID.		// Emit probes grouped by GUID.
if (Guid != 0) {
LLVM_DEBUG({		LLVM_DEBUG({
dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);		dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);
dbgs() << "GUID: " << Guid << "\n";		dbgs() << "GUID: " << Guid << "\n";
});		});
// Emit Guid		// Emit Guid
MCOS->emitInt64(Guid);		MCOS->emitInt64(Guid);
// Emit number of probes in this node		// Emit number of probes in this node, including a sentinel probe for
MCOS->emitULEB128IntValue(Probes.size());		// top-level functions if needed.
		bool NeedSentinel = false;
		rafaulerUnsubmitted Done Reply Inline Actions typo Santinel->Sentinel rafauler: typo Santinel->Sentinel
		if (Parent->isRoot()) {
		assert(isSentinelProbe(LastProbe->getAttributes()) &&
		"Starting probe of a top-level function should be a sentinel probe");
		// The main body of a split function doesn't need a sentinel probe.
		wenleiUnsubmitted Done Reply Inline Actions Is this checking the top level binary function name is different from the top level source function name? And this is to avoid emitting sentinel for main function body? wenlei: Is this checking the top level binary function name is different from the top level source…
		hoyAuthorUnsubmitted Done Reply Inline Actions Exactly. Added a comment about that. The encoding for a function with outlined code will look like below. Note that the main entry doesn't need a sentinel probe. GUID of Foo Probe 1 GUID of Foo Sentinel probe of Foo.outlined Probe 2 hoy: Exactly. Added a comment about that. The encoding for a function with outlined code will look…
		if (LastProbe->getGuid() != Guid)
		NeedSentinel = true;
		}

		MCOS->emitULEB128IntValue(Probes.size() + NeedSentinel);
// Emit number of direct inlinees		// Emit number of direct inlinees
MCOS->emitULEB128IntValue(Children.size());		MCOS->emitULEB128IntValue(Children.size());
		// Emit sentinel probe for top-level functions
		if (NeedSentinel)
		LastProbe->emit(MCOS, nullptr);

// Emit probes in this group		// Emit probes in this group
for (const auto &Probe : Probes) {		for (const auto &Probe : Probes) {
Probe.emit(MCOS, LastProbe);		Probe.emit(MCOS, LastProbe);
LastProbe = &Probe;		LastProbe = &Probe;
}		}
} else {
assert(Probes.empty() && "Root should not have probes");
}

// Emit sorted descendant		// Emit sorted descendant. InlineSite is unique for each pair, so there will
// InlineSite is unique for each pair,		// be no ordering of Inlinee based on MCPseudoProbeInlineTree*
// so there will be no ordering of Inlinee based on MCPseudoProbeInlineTree*		using InlineeType = std::pair<InlineSite, MCPseudoProbeInlineTree *>;
std::map<InlineSite, MCPseudoProbeInlineTree *> Inlinees;		auto Comparer = [](const InlineeType &A, const InlineeType &B) {
for (auto &Child : Children)		return A.first < B.first;
Inlinees[Child.first] = Child.second.get();		};
		std::vector<InlineeType> Inlinees;
		for (const auto &Child : Children)
		Inlinees.emplace_back(Child.first, Child.second.get());
		std::sort(Inlinees.begin(), Inlinees.end(), Comparer);

for (const auto &Inlinee : Inlinees) {		for (const auto &Inlinee : Inlinees) {
if (Guid) {
// Emit probe index		// Emit probe index
MCOS->emitULEB128IntValue(std::get<1>(Inlinee.first));		MCOS->emitULEB128IntValue(std::get<1>(Inlinee.first));
LLVM_DEBUG({		LLVM_DEBUG({
dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);		dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);
dbgs() << "InlineSite: " << std::get<1>(Inlinee.first) << "\n";		dbgs() << "InlineSite: " << std::get<1>(Inlinee.first) << "\n";
});		});
}
// Emit the group		// Emit the group
Inlinee.second->emit(MCOS, LastProbe);		Inlinee.second->emit(MCOS, LastProbe);
}		}

LLVM_DEBUG({		LLVM_DEBUG({
MCPseudoProbeTable::DdgPrintIndent -= 2;		MCPseudoProbeTable::DdgPrintIndent -= 2;
dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);		dbgs().indent(MCPseudoProbeTable::DdgPrintIndent);
dbgs() << "]\n";		dbgs() << "]\n";
});		});
}		}

void MCPseudoProbeSection::emit(MCObjectStreamer *MCOS) {		void MCPseudoProbeSections::emit(MCObjectStreamer *MCOS) {
MCContext &Ctx = MCOS->getContext();		MCContext &Ctx = MCOS->getContext();

for (auto &ProbeSec : MCProbeDivisions) {		for (auto &ProbeSec : MCProbeDivisions) {
const MCPseudoProbe *LastProbe = nullptr;		const auto *FuncSym = ProbeSec.first;
if (auto *S =		const auto &Root = ProbeSec.second;
Ctx.getObjectFileInfo()->getPseudoProbeSection(ProbeSec.first)) {		if (auto *S = Ctx.getObjectFileInfo()->getPseudoProbeSection(
		FuncSym->getSection())) {
// Switch to the .pseudoprobe section or a comdat group.		// Switch to the .pseudoprobe section or a comdat group.
MCOS->switchSection(S);		MCOS->switchSection(S);
// Emit probes grouped by GUID.		// Emit probes grouped by GUID.
ProbeSec.second.emit(MCOS, LastProbe);		// Emit sorted descendant. InlineSite is unique for each pair, so there
		// will be no ordering of Inlinee based on MCPseudoProbeInlineTree*
		using InlineeType = std::pair<InlineSite, MCPseudoProbeInlineTree *>;
		auto Comparer = [](const InlineeType &A, const InlineeType &B) {
		wenleiUnsubmitted Done Reply Inline Actions nit: this is a comparator instead of a sorter. same for the one above. wenlei: nit: this is a comparator instead of a sorter. same for the one above.
		return A.first < B.first;
		};
		std::vector<InlineeType> Inlinees;
		for (const auto &Child : Root.getChildren())
		Inlinees.emplace_back(Child.first, Child.second.get());
		std::sort(Inlinees.begin(), Inlinees.end(), Comparer);

		for (const auto &Inlinee : Inlinees) {
		// Emit the group guarded by a sentinel probe.
		MCPseudoProbe SentinelProbe(const_cast<MCSymbol *>(FuncSym),
		MD5Hash(FuncSym->getName()),
		(uint32_t)PseudoProbeReservedId::Invalid,
		(uint32_t)PseudoProbeType::Block,
		rafaulerUnsubmitted Not Done Reply Inline Actions std::map is an expensive data structure. For usage patterns that are strictly "insert then query", considering using a vector + std::sort. Reference https://llvm.org/docs/ProgrammersManual.html#dss-sortedvectormap rafauler: std::map is an expensive data structure. For usage patterns that are strictly "insert then…
		hoyAuthorUnsubmitted Done Reply Inline Actions Good point. Changed to using vector+sort. hoy: Good point. Changed to using vector+sort.
		(uint32_t)PseudoProbeAttributes::Sentinel);
		const MCPseudoProbe *Probe = &SentinelProbe;
		Inlinee.second->emit(MCOS, Probe);
		}
}		}
}		}
}		}

//		//
// This emits the pseudo probe tables.		// This emits the pseudo probe tables.
//		//
void MCPseudoProbeTable::emit(MCObjectStreamer *MCOS) {		void MCPseudoProbeTable::emit(MCObjectStreamer *MCOS) {
▲ Show 20 Lines • Show All 157 Lines • ▼ Show 20 Lines	while (Data < End) {
GUID2FuncDescMap.emplace(GUID, MCPseudoProbeFuncDesc(GUID, Hash, Name));		GUID2FuncDescMap.emplace(GUID, MCPseudoProbeFuncDesc(GUID, Hash, Name));
}		}
assert(Data == End && "Have unprocessed data in pseudo_probe_desc section");		assert(Data == End && "Have unprocessed data in pseudo_probe_desc section");
return true;		return true;
}		}

bool MCPseudoProbeDecoder::buildAddress2ProbeMap(		bool MCPseudoProbeDecoder::buildAddress2ProbeMap(
MCDecodedPseudoProbeInlineTree *Cur, uint64_t &LastAddr,		MCDecodedPseudoProbeInlineTree *Cur, uint64_t &LastAddr,
std::unordered_set<uint64_t> &GuildFilter) {		const Uint64Set &GuidFilter, const Uint64Map &FuncStartAddrs) {
// The pseudo_probe section encodes an inline forest and each tree has a		// The pseudo_probe section encodes an inline forest and each tree has a
// format like:		// format defined in MCPseudoProbe.h
// FUNCTION BODY (one for each uninlined function present in the text
// section)
// GUID (uint64)
// GUID of the function
// NPROBES (ULEB128)
// Number of probes originating from this function.
// NUM_INLINED_FUNCTIONS (ULEB128)
// Number of callees inlined into this function, aka number of
// first-level inlinees
// PROBE RECORDS
// A list of NPROBES entries. Each entry contains:
// INDEX (ULEB128)
// TYPE (uint4)
// 0 - block probe, 1 - indirect call, 2 - direct call
// ATTRIBUTE (uint3)
// 1 - tail call, 2 - dangling
// ADDRESS_TYPE (uint1)
// 0 - code address, 1 - address delta
// CODE_ADDRESS (uint64 or ULEB128)
// code address or address delta, depending on Flag
// INLINED FUNCTION RECORDS
// A list of NUM_INLINED_FUNCTIONS entries describing each of the
// inlined callees. Each record contains:
// INLINE SITE
// Index of the callsite probe (ULEB128)
// FUNCTION BODY
// A FUNCTION BODY entry describing the inlined function.

uint32_t Index = 0;		uint32_t Index = 0;
if (Cur == &DummyInlineRoot) {		bool IsTopLevelFunc = Cur == &DummyInlineRoot;
		if (IsTopLevelFunc) {
// Use a sequential id for top level inliner.		// Use a sequential id for top level inliner.
Index = Cur->getChildren().size();		Index = Cur->getChildren().size();
} else {		} else {
// Read inline site for inlinees		// Read inline site for inlinees
auto ErrorOrIndex = readUnsignedNumber<uint32_t>();		auto ErrorOrIndex = readUnsignedNumber<uint32_t>();
if (!ErrorOrIndex)		if (!ErrorOrIndex)
return false;		return false;
Index = std::move(*ErrorOrIndex);		Index = std::move(*ErrorOrIndex);
}		}

// Read guid		// Read guid
auto ErrorOrCurGuid = readUnencodedNumber<uint64_t>();		auto ErrorOrCurGuid = readUnencodedNumber<uint64_t>();
if (!ErrorOrCurGuid)		if (!ErrorOrCurGuid)
return false;		return false;
uint64_t Guid = std::move(*ErrorOrCurGuid);		uint64_t Guid = std::move(*ErrorOrCurGuid);

// Decide if top-level node should be disgarded.		// Decide if top-level node should be disgarded.
if (Cur == &DummyInlineRoot && !GuildFilter.empty() &&		if (IsTopLevelFunc && !GuidFilter.empty() && !GuidFilter.count(Guid))
!GuildFilter.count(Guid))
Cur = nullptr;		Cur = nullptr;

// If the incoming node is null, all its children nodes should be disgarded.		// If the incoming node is null, all its children nodes should be disgarded.
if (Cur) {		if (Cur) {
// Switch/add to a new tree node(inlinee)		// Switch/add to a new tree node(inlinee)
Cur = Cur->getOrAddNode(std::make_tuple(Guid, Index));		Cur = Cur->getOrAddNode(std::make_tuple(Guid, Index));
Cur->Guid = Guid;		Cur->Guid = Guid;
		if (IsTopLevelFunc && !EncodingIsAddrBased) {
		if (auto V = FuncStartAddrs.lookup(Guid))
		LastAddr = V;
		}
}		}

// Read number of probes in the current node.		// Read number of probes in the current node.
auto ErrorOrNodeCount = readUnsignedNumber<uint32_t>();		auto ErrorOrNodeCount = readUnsignedNumber<uint32_t>();
if (!ErrorOrNodeCount)		if (!ErrorOrNodeCount)
return false;		return false;
uint32_t NodeCount = std::move(*ErrorOrNodeCount);		uint32_t NodeCount = std::move(*ErrorOrNodeCount);
// Read number of direct inlinees		// Read number of direct inlinees
Show All 22 Lines	if (Value & 0x80) {
return false;		return false;
int64_t Offset = std::move(*ErrorOrOffset);		int64_t Offset = std::move(*ErrorOrOffset);
Addr = LastAddr + Offset;		Addr = LastAddr + Offset;
} else {		} else {
auto ErrorOrAddr = readUnencodedNumber<int64_t>();		auto ErrorOrAddr = readUnencodedNumber<int64_t>();
if (!ErrorOrAddr)		if (!ErrorOrAddr)
return false;		return false;
Addr = std::move(*ErrorOrAddr);		Addr = std::move(*ErrorOrAddr);
		if (isSentinelProbe(Attr)) {
		// For sentinel probe, the addr field actually stores the GUID of the
		// split function. Convert it to the real address.
		if (auto V = FuncStartAddrs.lookup(Addr))
		Addr = V;
		} else {
		// For now we assume all probe encoding should be either based on
		// leading probe address or function start address.
		// The scheme is for downwards compatibility.
		// TODO: retire this scheme once compatibility is no longer an issue.
		EncodingIsAddrBased = true;
		}
}		}

if (Cur) {		if (Cur && !isSentinelProbe(Attr)) {
// Populate Address2ProbesMap		// Populate Address2ProbesMap
auto &Probes = Address2ProbesMap[Addr];		auto &Probes = Address2ProbesMap[Addr];
Probes.emplace_back(Addr, Cur->Guid, Index, PseudoProbeType(Kind), Attr,		Probes.emplace_back(Addr, Cur->Guid, Index, PseudoProbeType(Kind), Attr,
Cur);		Cur);
Cur->addProbes(&Probes.back());		Cur->addProbes(&Probes.back());
}		}
LastAddr = Addr;		LastAddr = Addr;
}		}

uint32_t ChildrenToProcess = std::move(*ErrorOrCurChildrenToProcess);		uint32_t ChildrenToProcess = std::move(*ErrorOrCurChildrenToProcess);
for (uint32_t I = 0; I < ChildrenToProcess; I++) {		for (uint32_t I = 0; I < ChildrenToProcess; I++) {
buildAddress2ProbeMap(Cur, LastAddr, GuildFilter);		buildAddress2ProbeMap(Cur, LastAddr, GuidFilter, FuncStartAddrs);
}		}

return true;		return true;
}		}

bool MCPseudoProbeDecoder::buildAddress2ProbeMap(		bool MCPseudoProbeDecoder::buildAddress2ProbeMap(
const uint8_t *Start, std::size_t Size,		const uint8_t *Start, std::size_t Size, const Uint64Set &GuidFilter,
std::unordered_set<uint64_t> &GuildFilter) {		const Uint64Map &FuncStartAddrs) {
Data = Start;		Data = Start;
End = Data + Size;		End = Data + Size;
uint64_t LastAddr = 0;		uint64_t LastAddr = 0;
while (Data < End)		while (Data < End)
buildAddress2ProbeMap(&DummyInlineRoot, LastAddr, GuildFilter);		buildAddress2ProbeMap(&DummyInlineRoot, LastAddr, GuidFilter,
		FuncStartAddrs);
assert(Data == End && "Have unprocessed data in pseudo_probe section");		assert(Data == End && "Have unprocessed data in pseudo_probe section");
return true;		return true;
}		}

bool MCPseudoProbeDecoder::buildAddress2ProbeMap(const uint8_t *Start,
std::size_t Size) {
std::unordered_set<uint64_t> GuildFilter;
return buildAddress2ProbeMap(Start, Size, GuildFilter);
}

void MCPseudoProbeDecoder::printGUID2FuncDescMap(raw_ostream &OS) {		void MCPseudoProbeDecoder::printGUID2FuncDescMap(raw_ostream &OS) {
OS << "Pseudo Probe Desc:\n";		OS << "Pseudo Probe Desc:\n";
// Make the output deterministic		// Make the output deterministic
std::map<uint64_t, MCPseudoProbeFuncDesc> OrderedMap(GUID2FuncDescMap.begin(),		std::map<uint64_t, MCPseudoProbeFuncDesc> OrderedMap(GUID2FuncDescMap.begin(),
GUID2FuncDescMap.end());		GUID2FuncDescMap.end());
for (auto &I : OrderedMap) {		for (auto &I : OrderedMap) {
I.second.print(OS);		I.second.print(OS);
}		}
▲ Show 20 Lines • Show All 73 Lines • Show Last 20 Lines

llvm/lib/MC/MCStreamer.cpp

Show First 20 Lines • Show All 1,096 Lines • ▼ Show 20 Lines	void MCStreamer::emitInstruction(const MCInst &Inst, const MCSubtargetInfo &) {
// Scan for values.		// Scan for values.
for (unsigned i = Inst.getNumOperands(); i--;)		for (unsigned i = Inst.getNumOperands(); i--;)
if (Inst.getOperand(i).isExpr())		if (Inst.getOperand(i).isExpr())
visitUsedExpr(*Inst.getOperand(i).getExpr());		visitUsedExpr(*Inst.getOperand(i).getExpr());
}		}

void MCStreamer::emitPseudoProbe(uint64_t Guid, uint64_t Index, uint64_t Type,		void MCStreamer::emitPseudoProbe(uint64_t Guid, uint64_t Index, uint64_t Type,
uint64_t Attr,		uint64_t Attr,
const MCPseudoProbeInlineStack &InlineStack) {		const MCPseudoProbeInlineStack &InlineStack,
		MCSymbol *FnSym) {
auto &Context = getContext();		auto &Context = getContext();

// Create a symbol at in the current section for use in the probe.		// Create a symbol at in the current section for use in the probe.
MCSymbol *ProbeSym = Context.createTempSymbol();		MCSymbol *ProbeSym = Context.createTempSymbol();

// Set the value of the symbol to use for the MCPseudoProbe.		// Set the value of the symbol to use for the MCPseudoProbe.
emitLabel(ProbeSym);		emitLabel(ProbeSym);

// Create a (local) probe entry with the symbol.		// Create a (local) probe entry with the symbol.
MCPseudoProbe Probe(ProbeSym, Guid, Index, Type, Attr);		MCPseudoProbe Probe(ProbeSym, Guid, Index, Type, Attr);

// Add the probe entry to this section's entries.		// Add the probe entry to this section's entries.
Context.getMCPseudoProbeTable().getProbeSections().addPseudoProbe(		Context.getMCPseudoProbeTable().getProbeSections().addPseudoProbe(
getCurrentSectionOnly(), Probe, InlineStack);		FnSym, Probe, InlineStack);
}		}

void MCStreamer::emitAbsoluteSymbolDiff(const MCSymbol Hi, const MCSymbol Lo,		void MCStreamer::emitAbsoluteSymbolDiff(const MCSymbol Hi, const MCSymbol Lo,
unsigned Size) {		unsigned Size) {
// Get the Hi-Lo expression.		// Get the Hi-Lo expression.
const MCExpr *Diff =		const MCExpr *Diff =
MCBinaryExpr::createSub(MCSymbolRefExpr::create(Hi, Context),		MCBinaryExpr::createSub(MCSymbolRefExpr::create(Hi, Context),
MCSymbolRefExpr::create(Lo, Context), Context);		MCSymbolRefExpr::create(Lo, Context), Context);
▲ Show 20 Lines • Show All 287 Lines • Show Last 20 Lines

llvm/test/Transforms/SampleProfile/pseudo-probe-emit-inline.ll

	; REQUIRES: x86_64-linux			; REQUIRES: x86_64-linux
	; RUN: opt < %s -passes='pseudo-probe,cgscc(inline)' -function-sections -mtriple=x86_64-unknown-linux-gnu -S -o %t			; RUN: opt < %s -passes='pseudo-probe,cgscc(inline)' -function-sections -mtriple=x86_64-unknown-linux-gnu -S -o %t
	; RUN: FileCheck %s < %t --check-prefix=CHECK-IL			; RUN: FileCheck %s < %t --check-prefix=CHECK-IL
	; RUN: llc -function-sections <%t -filetype=asm -o %t1			; RUN: llc -function-sections <%t -filetype=asm -o %t1
	; RUN: FileCheck %s < %t1 --check-prefix=CHECK-ASM			; RUN: FileCheck %s < %t1 --check-prefix=CHECK-ASM
	; RUN: llc -function-sections <%t -filetype=obj -o %t2			; RUN: llc -function-sections <%t -filetype=obj -o %t2
	; RUN: llvm-objdump --section-headers %t2 \| FileCheck %s --check-prefix=CHECK-OBJ			; RUN: llvm-objdump --section-headers %t2 \| FileCheck %s --check-prefix=CHECK-OBJ
	; RUN: llvm-mc -filetype=asm <%t1 -o %t3			; RUN: llvm-mc -filetype=asm <%t1 -o %t3
	; RUN: FileCheck %s < %t3 --check-prefix=CHECK-ASM			; RUN: FileCheck %s < %t3 --check-prefix=CHECK-ASM
	; RUN: llvm-mc -filetype=obj <%t1 -o %t4			; RUN: llvm-mc -filetype=obj <%t1 -o %t4
	; RUN: llvm-objdump --section-headers %t4 \| FileCheck %s --check-prefix=CHECK-OBJ			; RUN: llvm-objdump --section-headers %t4 \| FileCheck %s --check-prefix=CHECK-OBJ


	define dso_local void @foo2() !dbg !7 {			define dso_local void @foo2() !dbg !7 {
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID1:]], i64 1, i32 0, i64 -1), !dbg ![[#]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID1:]], i64 1, i32 0, i64 -1), !dbg ![[#]]
	; CHECK-ASM: .pseudoprobe [[#GUID1:]] 1 0 0			; CHECK-ASM: .pseudoprobe [[#GUID1:]] 1 0 0 foo2
	ret void, !dbg !10			ret void, !dbg !10
	}			}

	define dso_local void @foo() #0 !dbg !11 {			define dso_local void @foo() #0 !dbg !11 {
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID2:]], i64 1, i32 0, i64 -1), !dbg ![[#]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID2:]], i64 1, i32 0, i64 -1), !dbg ![[#]]
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID1]], i64 1, i32 0, i64 -1), !dbg ![[#DL1:]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID1]], i64 1, i32 0, i64 -1), !dbg ![[#DL1:]]
	; CHECK-ASM: .pseudoprobe [[#GUID2:]] 1 0 0			; CHECK-ASM: .pseudoprobe [[#GUID2:]] 1 0 0 foo
	; CHECK-ASM: .pseudoprobe [[#GUID1]] 1 0 0 @ [[#GUID2]]:2			; CHECK-ASM: .pseudoprobe [[#GUID1]] 1 0 0 @ [[#GUID2]]:2 foo
	call void @foo2(), !dbg !12			call void @foo2(), !dbg !12
	ret void, !dbg !13			ret void, !dbg !13
	}			}

	define dso_local i32 @entry() !dbg !14 {			define dso_local i32 @entry() !dbg !14 {
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID3:]], i64 1, i32 0, i64 -1), !dbg ![[#]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID3:]], i64 1, i32 0, i64 -1), !dbg ![[#]]
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID2]], i64 1, i32 0, i64 -1), !dbg ![[#DL2:]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID2]], i64 1, i32 0, i64 -1), !dbg ![[#DL2:]]
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID1]], i64 1, i32 0, i64 -1), !dbg ![[#DL3:]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID1]], i64 1, i32 0, i64 -1), !dbg ![[#DL3:]]
	; CHECK-ASM: .pseudoprobe [[#GUID3:]] 1 0 0			; CHECK-ASM: .pseudoprobe [[#GUID3:]] 1 0 0 entry
	; CHECK-ASM: .pseudoprobe [[#GUID2]] 1 0 0 @ [[#GUID3]]:2			; CHECK-ASM: .pseudoprobe [[#GUID2]] 1 0 0 @ [[#GUID3]]:2 entry
	; CHECK-ASM: .pseudoprobe [[#GUID1]] 1 0 0 @ [[#GUID3]]:2 @ [[#GUID2]]:2			; CHECK-ASM: .pseudoprobe [[#GUID1]] 1 0 0 @ [[#GUID3]]:2 @ [[#GUID2]]:2 entry
	call void @foo(), !dbg !18			call void @foo(), !dbg !18
	ret i32 0, !dbg !19			ret i32 0, !dbg !19
	}			}


	; CHECK-IL: ![[#SCOPE1:]] = distinct !DISubprogram(name: "foo2"			; CHECK-IL: ![[#SCOPE1:]] = distinct !DISubprogram(name: "foo2"
	; CHECK-IL: ![[#SCOPE2:]] = distinct !DISubprogram(name: "foo"			; CHECK-IL: ![[#SCOPE2:]] = distinct !DISubprogram(name: "foo"
	; CHECK-IL: ![[#DL1]] = !DILocation(line: 3, column: 1, scope: ![[#SCOPE1]], inlinedAt: ![[#INL1:]])			; CHECK-IL: ![[#DL1]] = !DILocation(line: 3, column: 1, scope: ![[#SCOPE1]], inlinedAt: ![[#INL1:]])
	Show All 23 Lines
	; CHECK-ASM-NEXT: .section .pseudo_probe_desc,"G",@progbits,.pseudo_probe_desc_entry,comdat			; CHECK-ASM-NEXT: .section .pseudo_probe_desc,"G",@progbits,.pseudo_probe_desc_entry,comdat
	; CHECK-ASM-NEXT: .quad [[#GUID3]]			; CHECK-ASM-NEXT: .quad [[#GUID3]]
	; CHECK-ASM-NEXT: .quad [[#HASH3:]]			; CHECK-ASM-NEXT: .quad [[#HASH3:]]
	; CHECK-ASM-NEXT: .byte 5			; CHECK-ASM-NEXT: .byte 5
	; CHECK-ASM-NEXT: .ascii "entry"			; CHECK-ASM-NEXT: .ascii "entry"

	; CHECK-OBJ: .pseudo_probe_desc			; CHECK-OBJ: .pseudo_probe_desc
	; CHECK-OBJ: .pseudo_probe			; CHECK-OBJ: .pseudo_probe
				; CHECK-OBJ-NOT: .rela.pseudo_probe

	!llvm.dbg.cu = !{!0}			!llvm.dbg.cu = !{!0}
	!llvm.module.flags = !{!3, !4}			!llvm.module.flags = !{!3, !4}

	!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1, emissionKind: FullDebug)			!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1, emissionKind: FullDebug)
	!1 = !DIFile(filename: "foo.c", directory: "any")			!1 = !DIFile(filename: "foo.c", directory: "any")
	!2 = !{}			!2 = !{}
	!3 = !{i32 2, !"Dwarf Version", i32 4}			!3 = !{i32 2, !"Dwarf Version", i32 4}
	Show All 14 Lines

llvm/test/Transforms/SampleProfile/pseudo-probe-emit.ll

	Show All 12 Lines

	@a = dso_local global i32 0, align 4			@a = dso_local global i32 0, align 4

	define void @foo(i32 %x) !dbg !3 {			define void @foo(i32 %x) !dbg !3 {
	bb0:			bb0:
	%cmp = icmp eq i32 %x, 0			%cmp = icmp eq i32 %x, 0
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID:]], i64 1, i32 0, i64 -1), !dbg ![[#FAKELINE:]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID:]], i64 1, i32 0, i64 -1), !dbg ![[#FAKELINE:]]
	; CHECK-MIR: PSEUDO_PROBE [[#GUID:]], 1, 0, 0			; CHECK-MIR: PSEUDO_PROBE [[#GUID:]], 1, 0, 0
	; CHECK-ASM: .pseudoprobe [[#GUID:]] 1 0 0			; CHECK-ASM: .pseudoprobe [[#GUID:]] 1 0 0 foo
	br i1 %cmp, label %bb1, label %bb2			br i1 %cmp, label %bb1, label %bb2

	bb1:			bb1:
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID:]], i64 2, i32 0, i64 -1), !dbg ![[#FAKELINE]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID:]], i64 2, i32 0, i64 -1), !dbg ![[#FAKELINE]]
	; CHECK-MIR: PSEUDO_PROBE [[#GUID]], 3, 0, 0			; CHECK-MIR: PSEUDO_PROBE [[#GUID]], 3, 0, 0
	; CHECK-MIR: PSEUDO_PROBE [[#GUID]], 4, 0, 0			; CHECK-MIR: PSEUDO_PROBE [[#GUID]], 4, 0, 0
	; CHECK-ASM: .pseudoprobe [[#GUID]] 3 0 0			; CHECK-ASM: .pseudoprobe [[#GUID]] 3 0 0 foo
	; CHECK-ASM: .pseudoprobe [[#GUID]] 4 0 0			; CHECK-ASM: .pseudoprobe [[#GUID]] 4 0 0 foo
	store i32 6, ptr @a, align 4			store i32 6, ptr @a, align 4
	br label %bb3			br label %bb3

	bb2:			bb2:
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID:]], i64 3, i32 0, i64 -1), !dbg ![[#FAKELINE]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID:]], i64 3, i32 0, i64 -1), !dbg ![[#FAKELINE]]
	; CHECK-MIR: PSEUDO_PROBE [[#GUID]], 2, 0, 0			; CHECK-MIR: PSEUDO_PROBE [[#GUID]], 2, 0, 0
	; CHECK-MIR: PSEUDO_PROBE [[#GUID]], 4, 0, 0			; CHECK-MIR: PSEUDO_PROBE [[#GUID]], 4, 0, 0
	; CHECK-ASM: .pseudoprobe [[#GUID]] 2 0 0			; CHECK-ASM: .pseudoprobe [[#GUID]] 2 0 0 foo
	; CHECK-ASM: .pseudoprobe [[#GUID]] 4 0 0			; CHECK-ASM: .pseudoprobe [[#GUID]] 4 0 0 foo
	store i32 8, ptr @a, align 4			store i32 8, ptr @a, align 4
	br label %bb3			br label %bb3

	bb3:			bb3:
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID]], i64 4, i32 0, i64 -1), !dbg ![[#REALLINE:]]			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID]], i64 4, i32 0, i64 -1), !dbg ![[#REALLINE:]]
	ret void, !dbg !12			ret void, !dbg !12
	}			}

	declare void @bar(i32 %x)			declare void @bar(i32 %x)

	define internal void @foo2(ptr %f) !dbg !4 {			define internal void @foo2(ptr %f) !dbg !4 {
	entry:			entry:
	; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID2:]], i64 1, i32 0, i64 -1)			; CHECK-IL: call void @llvm.pseudoprobe(i64 [[#GUID2:]], i64 1, i32 0, i64 -1)
	; CHECK-MIR: PSEUDO_PROBE [[#GUID2:]], 1, 0, 0			; CHECK-MIR: PSEUDO_PROBE [[#GUID2:]], 1, 0, 0
	; CHECK-ASM: .pseudoprobe [[#GUID2:]] 1 0 0			; CHECK-ASM: .pseudoprobe [[#GUID2:]] 1 0 0 foo2
	; Check pseudo_probe metadata attached to the indirect call instruction.			; Check pseudo_probe metadata attached to the indirect call instruction.
	; CHECK-IL: call void %f(i32 1), !dbg ![[#PROBE0:]]			; CHECK-IL: call void %f(i32 1), !dbg ![[#PROBE0:]]
	; CHECK-MIR: PSEUDO_PROBE [[#GUID2]], 2, 1, 0			; CHECK-MIR: PSEUDO_PROBE [[#GUID2]], 2, 1, 0
	; CHECK-ASM: .pseudoprobe [[#GUID2]] 2 1 0			; CHECK-ASM: .pseudoprobe [[#GUID2]] 2 1 0 foo2
	call void %f(i32 1), !dbg !13			call void %f(i32 1), !dbg !13
	; Check pseudo_probe metadata attached to the direct call instruction.			; Check pseudo_probe metadata attached to the direct call instruction.
	; CHECK-IL: call void @bar(i32 1), !dbg ![[#PROBE1:]]			; CHECK-IL: call void @bar(i32 1), !dbg ![[#PROBE1:]]
	; CHECK-MIR: PSEUDO_PROBE [[#GUID2]], 3, 2, 0			; CHECK-MIR: PSEUDO_PROBE [[#GUID2]], 3, 2, 0
	; CHECK-ASM: .pseudoprobe [[#GUID2]] 3 2 0			; CHECK-ASM: .pseudoprobe [[#GUID2]] 3 2 0 foo2
	call void @bar(i32 1)			call void @bar(i32 1)
	ret void			ret void
	}			}

	; CHECK-IL: Function Attrs: inaccessiblememonly nocallback nofree nosync nounwind willreturn			; CHECK-IL: Function Attrs: inaccessiblememonly nocallback nofree nosync nounwind willreturn
	; CHECK-IL-NEXT: declare void @llvm.pseudoprobe(i64, i64, i32, i64)			; CHECK-IL-NEXT: declare void @llvm.pseudoprobe(i64, i64, i32, i64)

	; CHECK-IL: ![[#FOO:]] = distinct !DISubprogram(name: "foo"			; CHECK-IL: ![[#FOO:]] = distinct !DISubprogram(name: "foo"
	Show All 16 Lines
	; CHECK-ASM-NEXT: .ascii "foo"			; CHECK-ASM-NEXT: .ascii "foo"
	; CHECK-ASM-NEXT: .section .pseudo_probe_desc,"G",@progbits,.pseudo_probe_desc_foo2,comdat			; CHECK-ASM-NEXT: .section .pseudo_probe_desc,"G",@progbits,.pseudo_probe_desc_foo2,comdat
	; CHECK-ASM-NEXT: .quad [[#GUID2]]			; CHECK-ASM-NEXT: .quad [[#GUID2]]
	; CHECK-ASM-NEXT: .quad [[#HASH2:]]			; CHECK-ASM-NEXT: .quad [[#HASH2:]]
	; CHECK-ASM-NEXT: .byte 4			; CHECK-ASM-NEXT: .byte 4
	; CHECK-ASM-NEXT: .ascii "foo2"			; CHECK-ASM-NEXT: .ascii "foo2"

	; CHECK-OBJ-COUNT-2: .pseudo_probe_desc			; CHECK-OBJ-COUNT-2: .pseudo_probe_desc
	; CHECK-OBJ-COUNT-2: .pseudo_probe			; CHECK-OBJ: .pseudo_probe
				; CHECK-OBJ-NOT: .rela.pseudo_probe

	!llvm.dbg.cu = !{!0}			!llvm.dbg.cu = !{!0}
	!llvm.module.flags = !{!9, !10}			!llvm.module.flags = !{!9, !10}

	!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1)			!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1)
	!1 = !DIFile(filename: "test.c", directory: "")			!1 = !DIFile(filename: "test.c", directory: "")
	!2 = !{}			!2 = !{}
	!3 = distinct !DISubprogram(name: "foo", scope: !1, file: !1, line: 1, type: !5, unit: !0, retainedNodes: !2)			!3 = distinct !DISubprogram(name: "foo", scope: !1, file: !1, line: 1, type: !5, unit: !0, retainedNodes: !2)
	Show All 9 Lines

llvm/test/tools/llvm-profgen/Inputs/func-split.perfbin

This is a binary file.

llvm/test/tools/llvm-profgen/Inputs/inline-cs-pseudoprobe.perfbin

This is a binary file.

llvm/test/tools/llvm-profgen/inline-force-dwarf.test

	; RUN: llvm-profgen --format=text --ignore-stack-samples --use-dwarf-correlation --perfscript=%S/Inputs/inline-cs-pseudoprobe.perfscript --binary=%S/Inputs/inline-cs-pseudoprobe.perfbin --output=%t --profile-summary-cold-count=0			; RUN: llvm-profgen --format=text --ignore-stack-samples --use-dwarf-correlation --perfscript=%S/Inputs/inline-cs-pseudoprobe.perfscript --binary=%S/Inputs/inline-cs-pseudoprobe.perfbin --output=%t --profile-summary-cold-count=0
	; RUN: FileCheck %s --input-file %t			; RUN: FileCheck %s --input-file %t

	; CHECK: main:1109:0			; CHECK: main:1109:0
	; CHECK-NEXT: 0: 0			; CHECK-NEXT: 0: 0
	; CHECK-NEXT: 2: 0			; CHECK-NEXT: 2: 0
	; CHECK-NEXT: 1: foo:1109			; CHECK-NEXT: 1: foo:1109
	; CHECK-NEXT: 2: 14			; CHECK-NEXT: 2: 14
	; CHECK-NEXT: 3: 15			; CHECK-NEXT: 3: 15
	; CHECK-NEXT: 4: 0			; CHECK-NEXT: 4: 0
	; CHECK-NEXT: 65526: 14			; CHECK-NEXT: 65525: 14
	; CHECK-NEXT: 3: bar:224			; CHECK-NEXT: 3: bar:196
	; CHECK-NEXT: 1: 14			; CHECK-NEXT: 1: 14
	; CHECK-NEXT: 65533: 14


	; clang -O3 -fuse-ld=lld -fpseudo-probe-for-profiling			; clang -O3 -fuse-ld=lld -fpseudo-probe-for-profiling
	; -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -Xclang -mdisable-tail-calls			; -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -Xclang -mdisable-tail-calls
	; -g test.c -o a.out			; -g test.c -o a.out

	#include <stdio.h>			#include <stdio.h>

	Show All 18 Lines

llvm/tools/llvm-profgen/ProfileGenerator.cpp

Show First 20 Lines • Show All 781 Lines • ▼ Show 20 Lines	void CSProfileGenerator::generateProfile() {

if (Binary->getTrackFuncContextSize())		if (Binary->getTrackFuncContextSize())
computeSizeForProfiledFunctions();		computeSizeForProfiledFunctions();

postProcessProfiles();		postProcessProfiles();
}		}

void CSProfileGenerator::computeSizeForProfiledFunctions() {		void CSProfileGenerator::computeSizeForProfiledFunctions() {
std::unordered_set<const BinaryFunction *> ProfiledFunctions;
for (auto *Func : Binary->getProfiledFunctions())		for (auto *Func : Binary->getProfiledFunctions())
Binary->computeInlinedContextSizeForFunc(Func);		Binary->computeInlinedContextSizeForFunc(Func);

// Flush the symbolizer to save memory.		// Flush the symbolizer to save memory.
Binary->flushSymbolizer();		Binary->flushSymbolizer();
}		}

void CSProfileGenerator::updateFunctionSamples() {		void CSProfileGenerator::updateFunctionSamples() {
▲ Show 20 Lines • Show All 441 Lines • Show Last 20 Lines

llvm/tools/llvm-profgen/ProfiledBinary.h

//===-- ProfiledBinary.h - Binary decoder ------------------------ C++ --===//		//===-- ProfiledBinary.h - Binary decoder ------------------------ C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_TOOLS_LLVM_PROFGEN_PROFILEDBINARY_H		#ifndef LLVM_TOOLS_LLVM_PROFGEN_PROFILEDBINARY_H
#define LLVM_TOOLS_LLVM_PROFGEN_PROFILEDBINARY_H		#define LLVM_TOOLS_LLVM_PROFGEN_PROFILEDBINARY_H

#include "CallContext.h"		#include "CallContext.h"
#include "ErrorHandling.h"		#include "ErrorHandling.h"
		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/Optional.h"		#include "llvm/ADT/Optional.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/StringSet.h"		#include "llvm/ADT/StringSet.h"
#include "llvm/DebugInfo/DWARF/DWARFContext.h"		#include "llvm/DebugInfo/DWARF/DWARFContext.h"
#include "llvm/DebugInfo/Symbolize/Symbolize.h"		#include "llvm/DebugInfo/Symbolize/Symbolize.h"
#include "llvm/MC/MCAsmInfo.h"		#include "llvm/MC/MCAsmInfo.h"
#include "llvm/MC/MCContext.h"		#include "llvm/MC/MCContext.h"
#include "llvm/MC/MCDisassembler/MCDisassembler.h"		#include "llvm/MC/MCDisassembler/MCDisassembler.h"
▲ Show 20 Lines • Show All 139 Lines • ▼ Show 20 Lines	public:
uint32_t getFuncSizeForContext(const ContextTrieNode *Context);		uint32_t getFuncSizeForContext(const ContextTrieNode *Context);

// For inlinees that are full optimized away, we can establish zero size using		// For inlinees that are full optimized away, we can establish zero size using
// their remaining probes.		// their remaining probes.
void trackInlineesOptimizedAway(MCPseudoProbeDecoder &ProbeDecoder);		void trackInlineesOptimizedAway(MCPseudoProbeDecoder &ProbeDecoder);

using ProbeFrameStack = SmallVector<std::pair<StringRef, uint32_t>>;		using ProbeFrameStack = SmallVector<std::pair<StringRef, uint32_t>>;
void trackInlineesOptimizedAway(MCPseudoProbeDecoder &ProbeDecoder,		void trackInlineesOptimizedAway(MCPseudoProbeDecoder &ProbeDecoder,
MCDecodedPseudoProbeInlineTree &ProbeNode,		MCDecodedPseudoProbeInlineTree &ProbeNode,
ProbeFrameStack &Context);		ProbeFrameStack &Context);

void dump() { RootContext.dumpTree(); }		void dump() { RootContext.dumpTree(); }

private:		private:
// Root node for context trie tree, node that this is a reverse context trie		// Root node for context trie tree, node that this is a reverse context trie
// with callee as parent and caller as child. This way we can traverse from		// with callee as parent and caller as child. This way we can traverse from
// root to find the best/longest matching context if an exact match does not		// root to find the best/longest matching context if an exact match does not
// exist. It gives us the best possible estimate for function's post-inline,		// exist. It gives us the best possible estimate for function's post-inline,
Show All 34 Lines	class ProfiledBinary {
std::set<std::pair<uint64_t, uint64_t>> TextSections;		std::set<std::pair<uint64_t, uint64_t>> TextSections;

// A map of mapping function name to BinaryFunction info.		// A map of mapping function name to BinaryFunction info.
std::unordered_map<std::string, BinaryFunction> BinaryFunctions;		std::unordered_map<std::string, BinaryFunction> BinaryFunctions;

// A list of binary functions that have samples.		// A list of binary functions that have samples.
std::unordered_set<const BinaryFunction *> ProfiledFunctions;		std::unordered_set<const BinaryFunction *> ProfiledFunctions;

		// GUID to Elf symbol start address map
		DenseMap<uint64_t, uint64_t> SymbolStartAddrs;

		// Start address to Elf symbol GUID map
		std::unordered_multimap<uint64_t, uint64_t> StartAddrToSymMap;

// An ordered map of mapping function's start address to function range		// An ordered map of mapping function's start address to function range
// relevant info. Currently to determine if the address of ELF is the start of		// relevant info. Currently to determine if the offset of ELF is the start of
// a real function, we leverage the function range info from DWARF.		// a real function, we leverage the function range info from DWARF.
std::map<uint64_t, FuncRange> StartAddrToFuncRangeMap;		std::map<uint64_t, FuncRange> StartAddrToFuncRangeMap;

// Address to context location map. Used to expand the context.		// Address to context location map. Used to expand the context.
std::unordered_map<uint64_t, SampleContextFrameVector> AddressToLocStackMap;		std::unordered_map<uint64_t, SampleContextFrameVector> AddressToLocStackMap;

// Address to instruction size map. Also used for quick Address lookup.		// Address to instruction size map. Also used for quick Address lookup.
std::unordered_map<uint64_t, uint64_t> AddressToInstSizeMap;		std::unordered_map<uint64_t, uint64_t> AddressToInstSizeMap;
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	class ProfiledBinary {
// the preferred address		// the preferred address
bool IsLoadedByMMap = false;		bool IsLoadedByMMap = false;
// Use to avoid redundant warning.		// Use to avoid redundant warning.
bool MissingMMapWarned = false;		bool MissingMMapWarned = false;

void setPreferredTextSegmentAddresses(const ELFObjectFileBase *O);		void setPreferredTextSegmentAddresses(const ELFObjectFileBase *O);

template <class ELFT>		template <class ELFT>
void setPreferredTextSegmentAddresses(const ELFFile<ELFT> &Obj, StringRef FileName);		void setPreferredTextSegmentAddresses(const ELFFile<ELFT> &Obj,
		StringRef FileName);

void checkPseudoProbe(const ELFObjectFileBase *Obj);		void checkPseudoProbe(const ELFObjectFileBase *Obj);

void decodePseudoProbe(const ELFObjectFileBase *Obj);		void decodePseudoProbe(const ELFObjectFileBase *Obj);

void		void
checkUseFSDiscriminator(const ELFObjectFileBase *Obj,		checkUseFSDiscriminator(const ELFObjectFileBase *Obj,
std::map<SectionRef, SectionSymbolsTy> &AllSymbols);		std::map<SectionRef, SectionSymbolsTy> &AllSymbols);

// Set up disassembler and related components.		// Set up disassembler and related components.
void setUpDisassembler(const ELFObjectFileBase *Obj);		void setUpDisassembler(const ELFObjectFileBase *Obj);
void setupSymbolizer();		void setupSymbolizer();

// Load debug info of subprograms from DWARF section.		// Load debug info of subprograms from DWARF section.
void loadSymbolsFromDWARF(ObjectFile &Obj);		void loadSymbolsFromDWARF(ObjectFile &Obj);

// Load debug info from DWARF unit.		// Load debug info from DWARF unit.
void loadSymbolsFromDWARFUnit(DWARFUnit &CompilationUnit);		void loadSymbolsFromDWARFUnit(DWARFUnit &CompilationUnit);

		// Create elf symbol to its start address mapping.
		void populateElfSymbolAddressList(const ELFObjectFileBase *O);

// A function may be spilt into multiple non-continuous address ranges. We use		// A function may be spilt into multiple non-continuous address ranges. We use
// this to set whether start address of a function is the real entry of the		// this to set whether start address of a function is the real entry of the
// function and also set false to the non-function label.		// function and also set false to the non-function label.
void setIsFuncEntry(uint64_t Address, StringRef RangeSymName);		void setIsFuncEntry(uint64_t Address, StringRef RangeSymName);

// Warn if no entry range exists in the function.		// Warn if no entry range exists in the function.
void warnNoFuncEntry();		void warnNoFuncEntry();

Show All 34 Lines	public:
uint64_t getBaseAddress() const { return BaseAddress; }		uint64_t getBaseAddress() const { return BaseAddress; }
void setBaseAddress(uint64_t Address) { BaseAddress = Address; }		void setBaseAddress(uint64_t Address) { BaseAddress = Address; }

// Canonicalize to use preferred load address as base address.		// Canonicalize to use preferred load address as base address.
uint64_t canonicalizeVirtualAddress(uint64_t Address) {		uint64_t canonicalizeVirtualAddress(uint64_t Address) {
return Address - BaseAddress + getPreferredBaseAddress();		return Address - BaseAddress + getPreferredBaseAddress();
}		}
// Return the preferred load address for the first executable segment.		// Return the preferred load address for the first executable segment.
uint64_t getPreferredBaseAddress() const { return PreferredTextSegmentAddresses[0]; }		uint64_t getPreferredBaseAddress() const {
		return PreferredTextSegmentAddresses[0];
		}
// Return the preferred load address for the first loadable segment.		// Return the preferred load address for the first loadable segment.
uint64_t getFirstLoadableAddress() const { return FirstLoadableAddress; }		uint64_t getFirstLoadableAddress() const { return FirstLoadableAddress; }
// Return the file offset for the first executable segment.		// Return the file offset for the first executable segment.
uint64_t getTextSegmentOffset() const { return TextSegmentOffsets[0]; }		uint64_t getTextSegmentOffset() const { return TextSegmentOffsets[0]; }
const std::vector<uint64_t> &getPreferredTextSegmentAddresses() const {		const std::vector<uint64_t> &getPreferredTextSegmentAddresses() const {
return PreferredTextSegmentAddresses;		return PreferredTextSegmentAddresses;
}		}
const std::vector<uint64_t> &getTextSegmentOffsets() const {		const std::vector<uint64_t> &getTextSegmentOffsets() const {
▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

llvm/tools/llvm-profgen/ProfiledBinary.cpp

Show First 20 Lines • Show All 198 Lines • ▼ Show 20 Lines	void ProfiledBinary::load() {
// Current only support X86		// Current only support X86
if (!TheTriple.isX86())		if (!TheTriple.isX86())
exitWithError("unsupported target", TheTriple.getTriple());		exitWithError("unsupported target", TheTriple.getTriple());
LLVM_DEBUG(dbgs() << "Loading " << Path << "\n");		LLVM_DEBUG(dbgs() << "Loading " << Path << "\n");

// Find the preferred load address for text sections.		// Find the preferred load address for text sections.
setPreferredTextSegmentAddresses(Obj);		setPreferredTextSegmentAddresses(Obj);

checkPseudoProbe(Obj);

if (ShowDisassemblyOnly)
decodePseudoProbe(Obj);

// Load debug info of subprograms from DWARF section.		// Load debug info of subprograms from DWARF section.
// If path of debug info binary is specified, use the debug info from it,		// If path of debug info binary is specified, use the debug info from it,
// otherwise use the debug info from the executable binary.		// otherwise use the debug info from the executable binary.
if (!DebugBinaryPath.empty()) {		if (!DebugBinaryPath.empty()) {
OwningBinary<Binary> DebugPath =		OwningBinary<Binary> DebugPath =
unwrapOrError(createBinary(DebugBinaryPath), DebugBinaryPath);		unwrapOrError(createBinary(DebugBinaryPath), DebugBinaryPath);
loadSymbolsFromDWARF(*cast<ObjectFile>(DebugPath.getBinary()));		loadSymbolsFromDWARF(*cast<ObjectFile>(DebugPath.getBinary()));
} else {		} else {
loadSymbolsFromDWARF(*cast<ObjectFile>(&ExeBinary));		loadSymbolsFromDWARF(*cast<ObjectFile>(&ExeBinary));
}		}

		DisassembleFunctionSet.insert(DisassembleFunctions.begin(),
		DisassembleFunctions.end());

		checkPseudoProbe(Obj);

		if (UsePseudoProbes)
		populateElfSymbolAddressList(Obj);

		if (ShowDisassemblyOnly)
		decodePseudoProbe(Obj);

// Disassemble the text sections.		// Disassemble the text sections.
disassemble(Obj);		disassemble(Obj);

// Use function start and return address to infer prolog and epilog		// Use function start and return address to infer prolog and epilog
ProEpilogTracker.inferPrologAddresses(StartAddrToFuncRangeMap);		ProEpilogTracker.inferPrologAddresses(StartAddrToFuncRangeMap);
ProEpilogTracker.inferEpilogAddresses(RetAddressSet);		ProEpilogTracker.inferEpilogAddresses(RetAddressSet);

warnNoFuncEntry();		warnNoFuncEntry();
▲ Show 20 Lines • Show All 116 Lines • ▼ Show 20 Lines	void ProfiledBinary::checkPseudoProbe(const ELFObjectFileBase *Obj) {
// set UsePseudoProbes flag, used for PerfReader		// set UsePseudoProbes flag, used for PerfReader
UsePseudoProbes = HasProbeDescSection && HasPseudoProbeSection;		UsePseudoProbes = HasProbeDescSection && HasPseudoProbeSection;
}		}

void ProfiledBinary::decodePseudoProbe(const ELFObjectFileBase *Obj) {		void ProfiledBinary::decodePseudoProbe(const ELFObjectFileBase *Obj) {
if (!UsePseudoProbes)		if (!UsePseudoProbes)
return;		return;

std::unordered_set<uint64_t> ProfiledGuids;		MCPseudoProbeDecoder::Uint64Set GuidFilter;
if (!ShowDisassemblyOnly)		MCPseudoProbeDecoder::Uint64Map FuncStartAddresses;
for (auto *F : ProfiledFunctions)		if (ShowDisassemblyOnly) {
ProfiledGuids.insert(Function::getGUID(F->FuncName));		if (DisassembleFunctionSet.empty()) {
		FuncStartAddresses = SymbolStartAddrs;
		} else {
		for (auto &F : DisassembleFunctionSet) {
		auto GUID = Function::getGUID(F.first());
		if (auto StartAddr = SymbolStartAddrs.lookup(GUID)) {
		FuncStartAddresses[GUID] = StartAddr;
		FuncRange &Range = StartAddrToFuncRangeMap[StartAddr];
		GuidFilter.insert(Function::getGUID(Range.getFuncName()));
		}
		}
		}
		} else {
		for (auto *F : ProfiledFunctions) {
		GuidFilter.insert(Function::getGUID(F->FuncName));
		for (auto &Range : F->Ranges) {
		auto GUIDs = StartAddrToSymMap.equal_range(Range.first);
		for (auto I = GUIDs.first; I != GUIDs.second; ++I)
		FuncStartAddresses[I->second] = I->first;
		}
		}
		}

StringRef FileName = Obj->getFileName();		StringRef FileName = Obj->getFileName();
for (section_iterator SI = Obj->section_begin(), SE = Obj->section_end();		for (section_iterator SI = Obj->section_begin(), SE = Obj->section_end();
SI != SE; ++SI) {		SI != SE; ++SI) {
const SectionRef &Section = *SI;		const SectionRef &Section = *SI;
StringRef SectionName = unwrapOrError(Section.getName(), FileName);		StringRef SectionName = unwrapOrError(Section.getName(), FileName);

if (SectionName == ".pseudo_probe_desc") {		if (SectionName == ".pseudo_probe_desc") {
StringRef Contents = unwrapOrError(Section.getContents(), FileName);		StringRef Contents = unwrapOrError(Section.getContents(), FileName);
if (!ProbeDecoder.buildGUID2FuncDescMap(		if (!ProbeDecoder.buildGUID2FuncDescMap(
reinterpret_cast<const uint8_t *>(Contents.data()),		reinterpret_cast<const uint8_t *>(Contents.data()),
Contents.size()))		Contents.size()))
exitWithError(		exitWithError(
"Pseudo Probe decoder fail in .pseudo_probe_desc section");		"Pseudo Probe decoder fail in .pseudo_probe_desc section");
} else if (SectionName == ".pseudo_probe") {		} else if (SectionName == ".pseudo_probe") {
StringRef Contents = unwrapOrError(Section.getContents(), FileName);		StringRef Contents = unwrapOrError(Section.getContents(), FileName);
if (!ProbeDecoder.buildAddress2ProbeMap(		if (!ProbeDecoder.buildAddress2ProbeMap(
reinterpret_cast<const uint8_t *>(Contents.data()),		reinterpret_cast<const uint8_t *>(Contents.data()),
Contents.size(), ProfiledGuids))		Contents.size(), GuidFilter, FuncStartAddresses))
exitWithError("Pseudo Probe decoder fail in .pseudo_probe section");		exitWithError("Pseudo Probe decoder fail in .pseudo_probe section");
}		}
}		}

// Build TopLevelProbeFrameMap to track size for optimized inlinees when probe		// Build TopLevelProbeFrameMap to track size for optimized inlinees when probe
// is available		// is available
if (TrackFuncContextSize) {		if (TrackFuncContextSize) {
for (const auto &Child : ProbeDecoder.getDummyInlineRoot().getChildren()) {		for (const auto &Child : ProbeDecoder.getDummyInlineRoot().getChildren()) {
▲ Show 20 Lines • Show All 187 Lines • ▼ Show 20 Lines	for (const SymbolRef &Symbol : Obj->symbols()) {
if (SecI != Obj->section_end())		if (SecI != Obj->section_end())
AllSymbols[*SecI].push_back(SymbolInfoTy(Addr, Name, ELF::STT_NOTYPE));		AllSymbols[*SecI].push_back(SymbolInfoTy(Addr, Name, ELF::STT_NOTYPE));
}		}

// Sort all the symbols. Use a stable sort to stabilize the output.		// Sort all the symbols. Use a stable sort to stabilize the output.
for (std::pair<const SectionRef, SectionSymbolsTy> &SecSyms : AllSymbols)		for (std::pair<const SectionRef, SectionSymbolsTy> &SecSyms : AllSymbols)
stable_sort(SecSyms.second);		stable_sort(SecSyms.second);

DisassembleFunctionSet.insert(DisassembleFunctions.begin(),
DisassembleFunctions.end());
assert((DisassembleFunctionSet.empty() \|\| ShowDisassemblyOnly) &&		assert((DisassembleFunctionSet.empty() \|\| ShowDisassemblyOnly) &&
"Functions to disassemble should be only specified together with "		"Functions to disassemble should be only specified together with "
"--show-disassembly-only");		"--show-disassembly-only");

if (ShowDisassemblyOnly)		if (ShowDisassemblyOnly)
outs() << "\nDisassembly of " << FileName << ":\n";		outs() << "\nDisassembly of " << FileName << ":\n";

// Dissassemble a text section.		// Dissassemble a text section.
▲ Show 20 Lines • Show All 57 Lines • ▼ Show 20 Lines	for (std::size_t SI = 0, SE = Symbols.size(); SI != SE; ++SI) {
if (Symbols[SI].Name == FSDiscriminatorVar) {		if (Symbols[SI].Name == FSDiscriminatorVar) {
UseFSDiscriminator = true;		UseFSDiscriminator = true;
return;		return;
}		}
}		}
}		}
}		}

		void ProfiledBinary::populateElfSymbolAddressList(
		const ELFObjectFileBase *Obj) {
		// Create a mapping from virtual address to symbol GUID and the other way
		// around.
		StringRef FileName = Obj->getFileName();
		for (const SymbolRef &Symbol : Obj->symbols()) {
		const uint64_t Addr = unwrapOrError(Symbol.getAddress(), FileName);
		const StringRef Name = unwrapOrError(Symbol.getName(), FileName);
		uint64_t GUID = Function::getGUID(Name);
		SymbolStartAddrs[GUID] = Addr;
		StartAddrToSymMap.emplace(Addr, GUID);
		}
		}

void ProfiledBinary::loadSymbolsFromDWARFUnit(DWARFUnit &CompilationUnit) {		void ProfiledBinary::loadSymbolsFromDWARFUnit(DWARFUnit &CompilationUnit) {
for (const auto &DieInfo : CompilationUnit.dies()) {		for (const auto &DieInfo : CompilationUnit.dies()) {
llvm::DWARFDie Die(&CompilationUnit, &DieInfo);		llvm::DWARFDie Die(&CompilationUnit, &DieInfo);

if (!Die.isSubprogramDIE())		if (!Die.isSubprogramDIE())
continue;		continue;
auto Name = Die.getName(llvm::DINameKind::LinkageName);		auto Name = Die.getName(llvm::DINameKind::LinkageName);
if (!Name)		if (!Name)
▲ Show 20 Lines • Show All 216 Lines • Show Last 20 Lines