This is an archive of the discontinued LLVM Phabricator instance.

[PseudoProbe] Disgard probes for dead functions.
AbandonedPublic

Authored by hoy on Jun 9 2023, 4:42 PM.

Download Raw Diff

Details

Reviewers

wenlei
wlei

Summary

Probes of dead functions may be left over in the final binary for some reason, and they should be disgarded during decoding. The llvm-profgen profile generation path should already disgarded them due to its on-demand style decoding. I'm fixing for the --show-disassembly path which unconditionally decodes all probes.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hoy created this revision.Jun 9 2023, 4:42 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 9 2023, 4:42 PM

Herald added subscribers: wlei, modimo, wenlei, hiraditya. · View Herald Transcript

hoy requested review of this revision.Jun 9 2023, 4:42 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 9 2023, 4:42 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

hoy added a parent revision: D152546: [Pseudo Probe] Placing .pseudoprobe section in a comdat group.Jun 9 2023, 4:42 PM

hoy added reviewers: wenlei, wlei.

Harbormaster completed remote builds in B237886: Diff 530115.Jun 9 2023, 4:43 PM

Probes of dead functions may be left over in the final binary for some reason,

How many probes (in terms of % size) belong to dead functions? Can we avoid emitting these probes in the first place?

In D152595#4410514, @wenlei wrote:

Probes of dead functions may be left over in the final binary for some reason,

How many probes (in terms of % size) belong to dead functions? Can we avoid emitting these probes in the first place?

Very little. Currently probes are emitted along with the binary code for functions and the linker is responsible for removing dead code. For probes, the linker can also remove them with the implementation D146853: [Pseudo Probe] Placing .pseudoprobe section in the same comdat group with .text. However, since dead probes are very little, probably due to the thinLTO compiler backend doing a good job removing dead functions already, I'm switching to the implementation D152546: [Pseudo Probe] Placing .pseudoprobe section in a comdat group, which doesn't enable the linker to remove dead probes but has other benefits.

Currently probes are emitted along with the binary code for functions and the linker is responsible for removing dead code. For probes, the linker can also remove them with the implementation

why do we still have probe for dead functions with this current implementation though?

Very little.

How much in terms .pseudo_probe section size?

I'm switching to the implementation D152546: [Pseudo Probe] Placing .pseudoprobe section in a comdat group, which doesn't enable the linker to remove dead probes but has other benefits.

It looks the trade off with the new implementation is that, we would allow ICF for probe section, but would also disable REF when its associated code is being removed?

In D152595#4416048, @wenlei wrote:

Currently probes are emitted along with the binary code for functions and the linker is responsible for removing dead code. For probes, the linker can also remove them with the implementation

why do we still have probe for dead functions with this current implementation though?

The current implementation doesn't place pseudo probes in the same comdat group with the text section, so a probe section won't be removed when the corresponding text section is dead removed by the linker. This is what D146853 is supposed to fix.

Dead probes can be completed removed by the thinLTO backend only. The only ones survive are for the functions removed by the native linker.

Very little.

How much in terms .pseudo_probe section size?

Less than 1% savings when I enabled D146853.

I'm switching to the implementation D152546: [Pseudo Probe] Placing .pseudoprobe section in a comdat group, which doesn't enable the linker to remove dead probes but has other benefits.

It looks the trade off with the new implementation is that, we would allow ICF for probe section, but would also disable REF when its associated code is being removed?

Right, there is no connection between probes and code with the new implementation. It has a benefit of fully deduplicating probes even for static functions injected by the compiler which does not have unique linkage name.

Discussed offline. It makes more sense to move on with D146853: [Pseudo Probe] Placing .pseudoprobe section in the same comdat group with .text.

Revision Contents

Path

Size

llvm/

lib/

MC/

MCPseudoProbe.cpp

12 lines

Diff 530115

llvm/lib/MC/MCPseudoProbe.cpp

Show First 20 Lines • Show All 440 Lines • ▼ Show 20 Lines	if (IsTopLevelFunc && !GuidFilter.empty() && !GuidFilter.count(Guid))
Cur = nullptr;		Cur = nullptr;

// If the incoming node is null, all its children nodes should be disgarded.		// If the incoming node is null, all its children nodes should be disgarded.
if (Cur) {		if (Cur) {
// Switch/add to a new tree node(inlinee)		// Switch/add to a new tree node(inlinee)
Cur = Cur->getOrAddNode(std::make_tuple(Guid, Index));		Cur = Cur->getOrAddNode(std::make_tuple(Guid, Index));
Cur->Guid = Guid;		Cur->Guid = Guid;
if (IsTopLevelFunc && !EncodingIsAddrBased) {		if (IsTopLevelFunc && !EncodingIsAddrBased) {
if (auto V = FuncStartAddrs.lookup(Guid))		if (auto V = FuncStartAddrs.lookup(Guid)) {
LastAddr = V;		LastAddr = V;
		} else {
		// The probe metadata is for a dead function, so discard it.
		Cur = nullptr;
		}
}		}
}		}

// Read number of probes in the current node.		// Read number of probes in the current node.
auto ErrorOrNodeCount = readUnsignedNumber<uint32_t>();		auto ErrorOrNodeCount = readUnsignedNumber<uint32_t>();
if (!ErrorOrNodeCount)		if (!ErrorOrNodeCount)
return false;		return false;
uint32_t NodeCount = std::move(*ErrorOrNodeCount);		uint32_t NodeCount = std::move(*ErrorOrNodeCount);
Show All 26 Lines	for (std::size_t I = 0; I < NodeCount; I++) {
} else {		} else {
auto ErrorOrAddr = readUnencodedNumber<int64_t>();		auto ErrorOrAddr = readUnencodedNumber<int64_t>();
if (!ErrorOrAddr)		if (!ErrorOrAddr)
return false;		return false;
Addr = std::move(*ErrorOrAddr);		Addr = std::move(*ErrorOrAddr);
if (isSentinelProbe(Attr)) {		if (isSentinelProbe(Attr)) {
// For sentinel probe, the addr field actually stores the GUID of the		// For sentinel probe, the addr field actually stores the GUID of the
// split function. Convert it to the real address.		// split function. Convert it to the real address.
if (auto V = FuncStartAddrs.lookup(Addr))		if (auto V = FuncStartAddrs.lookup(Addr)) {
Addr = V;		Addr = V;
} else {		} else {
		// The probe metadata is for a dead function, so discard it.
		Cur = nullptr;
		}
		} else {
// For now we assume all probe encoding should be either based on		// For now we assume all probe encoding should be either based on
// leading probe address or function start address.		// leading probe address or function start address.
// The scheme is for downwards compatibility.		// The scheme is for downwards compatibility.
// TODO: retire this scheme once compatibility is no longer an issue.		// TODO: retire this scheme once compatibility is no longer an issue.
EncodingIsAddrBased = true;		EncodingIsAddrBased = true;
}		}
}		}

▲ Show 20 Lines • Show All 120 Lines • Show Last 20 Lines