This is an archive of the discontinued LLVM Phabricator instance.

Differential D106421

Encode address offsets of basic blocks relative to the end of the previous basic blocks.
Closed, Public

Authored by rahmanl on Jul 20 2021, 10:23 PM.

Details

Reviewers

tmsriram
jhenderson

Commits

rG029283c1c0d8: Encode address offsets of basic blocks relative to the end of the previous…

Summary

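Conceptually, the new encoding emits each basic block's offset and size as label differences. The offset is the difference between the block's begin label and the end label of the previous block (the function symbol for the first block), and the size is the difference between the block's end and begin labels. When decoding, the offsets must be accumulated together with the basic block sizes to recover each block's offset relative to the function.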
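The decoding step described above can be sketched as follows. This is a minimal standalone illustration, not the code from this revision: `RelativeBBEntry` and `toFunctionRelativeOffsets` are hypothetical names, and the struct only mimics the `Offset` and `Size` fields of a decoded entry.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical stand-in for one decoded entry (not LLVM's BBAddrMap::BBEntry).
struct RelativeBBEntry {
  uint32_t Offset; // Gap from the end of the previous block to this block.
  uint32_t Size;   // Size of this block in bytes.
};

// Accumulates gaps and sizes to recover each block's offset from the
// start of the function.
std::vector<uint32_t>
toFunctionRelativeOffsets(const std::vector<RelativeBBEntry> &Entries) {
  std::vector<uint32_t> Offsets;
  Offsets.reserve(Entries.size());
  uint32_t Cursor = 0; // Current position relative to the function start.
  for (const RelativeBBEntry &E : Entries) {
    Cursor += E.Offset;        // Skip any padding before the block.
    Offsets.push_back(Cursor); // The block begins here.
    Cursor += E.Size;          // Advance to the end of the block.
  }
  return Offsets;
}
```

With the two entries from the updated bb-addr-map.test (offset 0x0 with size 0x1, then a raw offset of 0x3), the second block lands at 0x4, which is the value the test now expects.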

This encoding uses smaller values than the existing one (offsets relative to the function symbol).
Smaller values tend to occupy fewer bytes in ULEB128 encoding. As a result, we get about a 25% reduction
in the size of the bb-address-map section (from about 9MB to 7MB).
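To illustrate why smaller values help, here is a minimal sketch of how many bytes a value occupies in ULEB128, which stores 7 payload bits per byte. `uleb128Size` is a hypothetical standalone helper written for this example; it does not use LLVM's own LEB128 utilities.

```cpp
#include <cstdint>

// Returns the number of bytes needed to encode Value as ULEB128.
// Each byte carries 7 bits of payload, so one byte is added per 7 bits.
unsigned uleb128Size(uint64_t Value) {
  unsigned Size = 0;
  do {
    Value >>= 7; // Consume the 7 bits stored in this byte.
    ++Size;
  } while (Value != 0);
  return Size;
}
```

Under this scheme a function-relative offset of 300 needs two bytes, while a gap of 0 between adjacent blocks needs only one.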

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

rahmanl created this revision. Jul 20 2021, 10:23 PM

Herald added a reviewer: jhenderson. Jul 20 2021, 10:23 PM

Herald added subscribers: pengfei, rupprecht, hiraditya, emaste.

rahmanl requested review of this revision. Jul 20 2021, 10:23 PM

Herald added a project: Restricted Project. Jul 20 2021, 10:23 PM

Herald added subscribers: llvm-commits, MaskRay.

rahmanl edited the summary of this revision. Jul 20 2021, 10:34 PM

Harbormaster completed remote builds in B115249: Diff 360359. Jul 20 2021, 11:26 PM

Seems reasonable to me, but I don't know the codegen code well enough to be 100% confident in that aspect, so it might be best to get a second opinion.

This revision is now accepted and ready to land. Jul 26 2021, 1:21 AM

LGTM

llvm/tools/llvm-readobj/ELFDumper.cpp
6971	Just making sure, is it possible that there could be some padding between basic blocks which might not get accounted for in size calculations? I guess this is being done after the assembler has done its work so it should be fine?

Thanks for the review @tmsriram and @jhenderson.

llvm/tools/llvm-readobj/ELFDumper.cpp
6971	Yes. The padding you are mentioning is `BBE.Offset`. This is the offset from the end of a block to the beginning of the next. So it will be accounted for as well (at line 6759).
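The point about padding can be made concrete with a small encoder sketch. It assumes blocks are given as function-relative begin/end positions in layout order; `BlockRange`, `RelativeEntry`, and `encodeRelative` are hypothetical names for illustration and are not part of the revision.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical function-relative extent of a basic block: [Begin, End).
struct BlockRange {
  uint32_t Begin;
  uint32_t End;
};

// Hypothetical encoded entry: gap from the previous block's end, then size.
struct RelativeEntry {
  uint32_t Offset;
  uint32_t Size;
};

// Any alignment padding between two blocks shows up in Offset, so it is
// not lost even though Size covers only the block itself.
std::vector<RelativeEntry>
encodeRelative(const std::vector<BlockRange> &Blocks) {
  std::vector<RelativeEntry> Entries;
  uint32_t PrevEnd = 0; // The function start acts as the first "previous end".
  for (const BlockRange &B : Blocks) {
    assert(B.Begin >= PrevEnd && B.End >= B.Begin &&
           "blocks must be non-overlapping and in layout order");
    Entries.push_back({B.Begin - PrevEnd, B.End - B.Begin});
    PrevEnd = B.End;
  }
  return Entries;
}
```

For blocks at [0, 1) and [4, 8), the second entry gets an offset of 3 (the padding) and a size of 4, so a decoder that sums offsets and sizes still arrives at position 4.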

rebase.

Harbormaster completed remote builds in B117084: Diff 362938. Jul 29 2021, 6:33 PM

Rebase.

This revision was landed with ongoing or failed builds. Feb 22 2022, 3:47 PM

Closed by commit rG029283c1c0d8: Encode address offsets of basic blocks relative to the end of the previous… (authored by rahmanl).

This revision was automatically updated to reflect the committed changes.

rahmanl added a commit: rG029283c1c0d8: Encode address offsets of basic blocks relative to the end of the previous….

Harbormaster completed remote builds in B150948: Diff 410655. Feb 22 2022, 3:58 PM

rahmanl added a reverting change: D120457: Revert "Encode address offsets of basic blocks relative to the end of the previous basic blocks.". Feb 24 2022, 12:10 AM

rahmanl added a reverting change: rGaeec9671fb4c: Revert "Encode address offsets of basic blocks relative to the end of the…. Feb 24 2022, 1:31 PM

rahmanl mentioned this in D121346: [Propeller] Encode address offsets of basic blocks relative to the end of the previous basic blocks.. Mar 9 2022, 4:27 PM

rahmanl mentioned this in rG0aa6df65756d: [Propeller] Encode address offsets of basic blocks relative to the end of the…. Jun 28 2022, 7:43 AM

Revision Contents

Path                                                    Size
llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp              7 lines
llvm/lib/CodeGen/BasicBlockSections.cpp                 2 lines
llvm/test/CodeGen/X86/basic-block-sections-labels.ll    6 lines
llvm/test/tools/llvm-readobj/ELF/bb-addr-map.test       2 lines
llvm/tools/llvm-readobj/ELFDumper.cpp                   5 lines

Diff 410662

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp

@@ void AsmPrinter::emitBBAddrMapSection(const MachineFunction &MF) {
   const MCSymbol *FunctionSymbol = getFunctionBegin();
 
   OutStreamer->PushSection();
   OutStreamer->SwitchSection(BBAddrMapSection);
   OutStreamer->emitSymbolValue(FunctionSymbol, getPointerSize());
   // Emit the total number of basic blocks in this function.
   OutStreamer->emitULEB128IntValue(MF.size());
+  const MCSymbol *PrevMBBEndSymbol = FunctionSymbol;
   // Emit BB Information for each basic block in the funciton.
   for (const MachineBasicBlock &MBB : MF) {
     const MCSymbol *MBBSymbol =
         MBB.isEntryBlock() ? FunctionSymbol : MBB.getSymbol();
-    // Emit the basic block offset.
-    emitLabelDifferenceAsULEB128(MBBSymbol, FunctionSymbol);
+    // Emit the basic block offset relative to the end of the previous block.
+    // This is zero unless the block is padded due to alignment.
+    emitLabelDifferenceAsULEB128(MBBSymbol, PrevMBBEndSymbol);
     // Emit the basic block size. When BBs have alignments, their size cannot
     // always be computed from their offsets.
     emitLabelDifferenceAsULEB128(MBB.getEndSymbol(), MBBSymbol);
     OutStreamer->emitULEB128IntValue(getBBAddrMapMetadata(MBB));
+    PrevMBBEndSymbol = MBB.getEndSymbol();
   }
   OutStreamer->PopSection();
 }
 
 void AsmPrinter::emitPseudoProbe(const MachineInstr &MI) {
   auto GUID = MI.getOperand(0).getImm();
   auto Index = MI.getOperand(1).getImm();
   auto Type = MI.getOperand(2).getImm();

llvm/lib/CodeGen/BasicBlockSections.cpp

 // needs special handling with basic block sections. DebugInfo needs to be
 // emitted with more relocations as basic block sections can break a
 // function into potentially several disjoint pieces, and CFI needs to be
 // emitted per cluster. This also bloats the object file and binary sizes.
 //
 // Basic Block Labels
 // ==================
 //
-// With -fbasic-block-sections=labels, we emit the offsets of BB addresses of
+// With -fbasic-block-sections=labels, we encode the offsets of BB addresses of
 // every function into the .llvm_bb_addr_map section. Along with the function
 // symbols, this allows for mapping of virtual addresses in PMU profiles back to
 // the corresponding basic blocks. This logic is implemented in AsmPrinter. This
 // pass only assigns the BBSectionType of every function to ``labels``.
 //
 //===----------------------------------------------------------------------===//
 
 #include "llvm/ADT/Optional.h"

llvm/test/CodeGen/X86/basic-block-sections-labels.ll

 ; UNIQ: .section .llvm_bb_addr_map,"o",@llvm_bb_addr_map,.text._Z3bazb{{$}}
 ;; Verify that with -unique-section-names=false, the unique id of the text section gets assigned to the llvm_bb_addr_map section.
 ; NOUNIQ: .section .llvm_bb_addr_map,"o",@llvm_bb_addr_map,.text,unique,1
 ; CHECK-NEXT: .quad .Lfunc_begin0
 ; CHECK-NEXT: .byte 4
 ; CHECK-NEXT: .uleb128 .Lfunc_begin0-.Lfunc_begin0
 ; CHECK-NEXT: .uleb128 .LBB_END0_0-.Lfunc_begin0
 ; CHECK-NEXT: .byte 8
-; CHECK-NEXT: .uleb128 .LBB0_1-.Lfunc_begin0
+; CHECK-NEXT: .uleb128 .LBB0_1-.LBB_END0_0
 ; CHECK-NEXT: .uleb128 .LBB_END0_1-.LBB0_1
 ; CHECK-NEXT: .byte 8
-; CHECK-NEXT: .uleb128 .LBB0_2-.Lfunc_begin0
+; CHECK-NEXT: .uleb128 .LBB0_2-.LBB_END0_1
 ; CHECK-NEXT: .uleb128 .LBB_END0_2-.LBB0_2
 ; CHECK-NEXT: .byte 1
-; CHECK-NEXT: .uleb128 .LBB0_3-.Lfunc_begin0
+; CHECK-NEXT: .uleb128 .LBB0_3-.LBB_END0_2
 ; CHECK-NEXT: .uleb128 .LBB_END0_3-.LBB0_3
 ; CHECK-NEXT: .byte 5

llvm/test/tools/llvm-readobj/ELF/bb-addr-map.test

 # LLVM-NEXT: Offset: 0x0
 # LLVM-NEXT: Size: 0x1
 # LLVM-NEXT: HasReturn: No
 # LLVM-NEXT: HasTailCall: Yes
 # LLVM-NEXT: IsEHPad: No
 # LLVM-NEXT: CanFallThrough: No
 # LLVM-NEXT: }
 # LLVM-NEXT: {
-# LLVM-NEXT: Offset: 0x3
+# LLVM-NEXT: Offset: 0x4
 # LLVM-NEXT: Size: 0x4
 # LLVM-NEXT: HasReturn: Yes
 # LLVM-NEXT: HasTailCall: No
 # LLVM-NEXT: IsEHPad: Yes
 # LLVM-NEXT: CanFallThrough: No
 # LLVM-NEXT: }
 # LLVM-NEXT: ]
 # LLVM-NEXT: }

llvm/tools/llvm-readobj/ELFDumper.cpp

@@ for (const BBAddrMap &AM : *BBAddrMapOrErr) {
         this->reportUniqueWarning(
             "could not identify function symbol for address (0x" +
             Twine::utohexstr(AM.Addr) + ") in " + this->describe(Sec));
       else
         FuncName = this->getStaticSymbolName(FuncSymIndex.front());
       W.printString("Name", FuncName);
 
       ListScope L(W, "BB entries");
+      uint32_t FunctionRelativeAddress = 0;
       for (const BBAddrMap::BBEntry &BBE : AM.BBEntries) {
         DictScope L(W);
-        W.printHex("Offset", BBE.Offset);
+        FunctionRelativeAddress += BBE.Offset;
+        W.printHex("Offset", FunctionRelativeAddress);
         W.printHex("Size", BBE.Size);
+        FunctionRelativeAddress += BBE.Size;
         W.printBoolean("HasReturn", BBE.HasReturn);
         W.printBoolean("HasTailCall", BBE.HasTailCall);
         W.printBoolean("IsEHPad", BBE.IsEHPad);
         W.printBoolean("CanFallThrough", BBE.CanFallThrough);
       }
     }
   }
 }