This is an archive of the discontinued LLVM Phabricator instance.

Differential D87840

[lld] Make -z keep-text-section-prefix recognize .text.split. as a prefix.
Closed · Public

Authored by snehasish on Sep 17 2020, 10:07 AM.

Details

Reviewers

MaskRay
• espindola

Commits

rG070555c6c008: [lld] Make -z keep-text-section-prefix recognize .text.split. as a prefix.

Summary

".text.split." holds symbols which are split out from functions in
other input sections. For example, with -fsplit-machine-functions,
placing the cold parts in .text.split instead of .text.unlikely mitigates
against profile inaccuracy. Techniques such as hugepage remapping can
make conservative decisions at the section granularity.
Additionally, we find a small improvement in icache and TLB metrics (3-5%)
due to improved locality.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

snehasish created this revision. · Sep 17 2020, 10:07 AM

Herald added a reviewer: • espindola. · Sep 17 2020, 10:07 AM

Herald added a project: Restricted Project.

Herald added subscribers: llvm-commits, arichardson, emaste.

snehasish requested review of this revision. · Sep 17 2020, 10:07 AM

snehasish mentioned this in D87813: [llvm] Add -bbsections-cold-text-prefix to emit cold clusters to a different section. · Sep 17 2020, 10:17 AM

Harbormaster completed remote builds in B72040: Diff 292551. · Sep 17 2020, 10:37 AM

Carrying over the discussion from D87813 since it's more appropriate here:

@MaskRay

For your future lld patch: why can't the split sections be placed in .test.cold? This just affects how -z keep-text-section-prefix groups input sections into output sections.

The intent here is to create a new "known" output section prefix which holds the split parts of functions. The rationale for this is outlined below.

@tmsriram

Is there a good reason to put this into .text.split? Does having a new output section give you more leverage on how to manage the mapping of such code?

Yes, having them placed in a separate section does provide additional leverage. In particular, your suggestion of placing the split parts in .text would benefit from hugepages; however, this may result in suboptimal icache performance since the split parts are interspersed with other code.

Interaction with hugepages

Placing the split parts in a separate section allows us to experiment with whether keeping them on hugepages is beneficial. For example, we may choose to unlock them for FDO targets but keep them on hugepages for AFDO targets to mitigate against profile quality issues.

Loss of locality

From our experiments we found that keeping them in a separate section (as opposed to placing them in .text.unlikely or .text) improved metrics such as icache and itlb misses. For itlb misses, this is because the split parts are placed on hugepages vs. regular pages. For icache misses, we posit that without a separate section the split parts are distributed across the section, reducing locality. The loss here is significant and can be up to a 5% difference (Search B, L2i miss). Note that in this experiment we apply an aggressive 99% threshold for splitting out cold blocks.

            .text.unlikely          .text.split
            Search A   Search B     Search A   Search B
l1i_miss        3.83      -1.70         0.65      -5.17
l2_miss         5.48       6.93         0.64       1.68
itlb_miss     -32.27     -10.25       -35.41     -15.90
stlb_miss     -59.39     -42.36       -67.56     -62.20

Ease of monitoring hotness

Another benefit of placing them in a separate section is ease of monitoring by collecting sampling data from production. We can keep an eye on the hotness of the split output section to ensure that our tuning thresholds for splitting are optimal for the fleet. While I believe it is still possible to monitor split parts (disambiguate symbols using ".cold" suffix) for function splitting, it is more tedious.
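
As a rough illustration of why a dedicated output section simplifies monitoring: once the split code occupies its own address range, attributing profile samples to it is a plain range lookup with no per-symbol suffix matching. The sketch below assumes the section ranges and sample addresses have already been extracted by some other tool; `SectionRange` and `countSamplesBySection` are hypothetical names, not part of lld or any profiler.

```cpp
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Hypothetical description of an output section's address range.
struct SectionRange {
  std::string name;
  std::uint64_t start;
  std::uint64_t size;
};

// Count how many sampled addresses fall inside each section.
// Samples that match no section are ignored.
std::map<std::string, std::uint64_t>
countSamplesBySection(const std::vector<SectionRange> &sections,
                      const std::vector<std::uint64_t> &sampleAddrs) {
  std::map<std::string, std::uint64_t> counts;
  for (std::uint64_t addr : sampleAddrs) {
    for (const SectionRange &s : sections) {
      if (addr >= s.start && addr - s.start < s.size) {
        ++counts[s.name];
        break;
      }
    }
  }
  return counts;
}
```

The share of samples landing in `.text.split` then gives a direct signal of how hot the supposedly cold split code is.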

This looks good to me. Please add a test to test/ELF/text-section-prefix.s

lld/ELF/Writer.cpp
Line 138: `Additionally we find small improvement in icache and tlb metrics due to improved locality.` might be too specific. It can probably be removed.

[lld] Add a new known text prefix - ".text.split."

Make -z keep-text-section-prefix recognize .text.split. as a prefix

Update description, add test.

Update commit message and comment.
Add a test for -z keep-text-section-prefix.

Updated description and tests, PTAL thanks!

LGTM.

This revision is now accepted and ready to land. · Sep 24 2020, 2:53 PM

Harbormaster completed remote builds in B72879: Diff 294176. · Sep 24 2020, 2:57 PM

Closed by commit rG070555c6c008: [lld] Make -z keep-text-section-prefix recognize .text.split. as a prefix. (authored by snehasish). · Sep 24 2020, 3:06 PM

This revision was automatically updated to reflect the committed changes.

snehasish added a commit: rG070555c6c008: [lld] Make -z keep-text-section-prefix recognize .text.split. as a prefix.

snehasish mentioned this in rGd2696dec45cd: [llvm] Add -bbsections-cold-text-prefix to emit cold clusters to a different…. · Sep 24 2020, 3:30 PM

Revision Contents

Path                                  Size
lld/ELF/Writer.cpp                    7 lines
lld/test/ELF/text-section-prefix.s    11 lines

Diff 294183

lld/ELF/Writer.cpp

 StringRef elf::getOutputSectionName(const InputSectionBase *s) {
   ...
   // ".text.unlikely.", ".text.startup." or ".text.exit." before others.
   // We provide an option -z keep-text-section-prefix to group such sections
   // into separate output sections. This is more flexible. See also
   // sortISDBySectionOrder().
   // ".text.unknown" means the hotness of the section is unknown. When
   // SampleFDO is used, if a function doesn't have sample, it could be very
   // cold or it could be a new function never being sampled. Those functions
   // will be kept in the ".text.unknown" section.
+  // ".text.split." holds symbols which are split out from functions in other
+  // input sections. For example, with -fsplit-machine-functions, placing the
+  // cold parts in .text.split instead of .text.unlikely mitigates against poor
+  // profile inaccuracy. Techniques such as hugepage remapping can make
+  // conservative decisions at the section granularity.
   if (config->zKeepTextSectionPrefix)
     for (StringRef v : {".text.hot.", ".text.unknown.", ".text.unlikely.",
-                        ".text.startup.", ".text.exit."})
+                        ".text.startup.", ".text.exit.", ".text.split."})
       if (isSectionPrefix(v, s->name))
         return v.drop_back();

   for (StringRef v :
        {".text.", ".rodata.", ".data.rel.ro.", ".data.", ".bss.rel.ro.",
         ".bss.", ".init_array.", ".fini_array.", ".ctors.", ".dtors.", ".tbss.",
         ".gcc_except_table.", ".tdata.", ".ARM.exidx.", ".ARM.extab."})
     if (isSectionPrefix(v, s->name))
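
To make the effect of the Writer.cpp change concrete, here is a minimal standalone sketch of the prefix-matching logic using only the standard library. It is not the lld implementation: `outputSectionName` is a hypothetical stand-in for `elf::getOutputSectionName`, the `isSectionPrefix` helper is written under the assumption that it matches either a name starting with the prefix or the prefix without its trailing dot, and only the `.text.` generic prefix is handled.

```cpp
#include <string_view>

// Assumed behavior of isSectionPrefix: `name` starts with `prefix`
// (e.g. ".text.split.foo") or equals `prefix` minus its trailing '.'
// (e.g. ".text.split").
inline bool isSectionPrefix(std::string_view prefix, std::string_view name) {
  std::string_view bare = prefix.substr(0, prefix.size() - 1);
  return name.substr(0, prefix.size()) == prefix || name == bare;
}

// Hypothetical, simplified stand-in for elf::getOutputSectionName().
// `keepTextSectionPrefix` models -z keep-text-section-prefix.
inline std::string_view outputSectionName(std::string_view name,
                                          bool keepTextSectionPrefix) {
  static constexpr std::string_view kKnownTextPrefixes[] = {
      ".text.hot.",     ".text.unknown.", ".text.unlikely.",
      ".text.startup.", ".text.exit.",    ".text.split."};
  if (keepTextSectionPrefix)
    for (std::string_view v : kKnownTextPrefixes)
      if (isSectionPrefix(v, name))
        return v.substr(0, v.size() - 1); // drop the trailing '.'

  // Other generic prefixes (".rodata.", ".data.", ...) are omitted here.
  if (isSectionPrefix(".text.", name))
    return ".text";
  return name;
}
```

Under these assumptions, an input section named `.text.split.f_split` maps to `.text.split` when the option is enabled and to `.text` otherwise, which is the grouping the KEEP and NOKEEP checks in the test exercise.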

lld/test/ELF/text-section-prefix.s

 # RUN: cmp %t1 %t2

 # RUN: ld.lld -z keep-text-section-prefix %t.o -o %t.keep
 # RUN: llvm-readelf -S %t.keep | FileCheck --check-prefix=KEEP %s

 # KEEP: [ 1] .text
 # KEEP-NEXT: [ 2] .text.hot
 # KEEP-NEXT: [ 3] .text.unknown
-# KEEP-NEXT: [ 4] .text.startup
-# KEEP-NEXT: [ 5] .text.exit
-# KEEP-NEXT: [ 6] .text.unlikely
+# KEEP-NEXT: [ 4] .text.split
+# KEEP-NEXT: [ 5] .text.startup
+# KEEP-NEXT: [ 6] .text.exit
+# KEEP-NEXT: [ 7] .text.unlikely

 # NOKEEP: [ 1] .text
 # NOKEEP-NOT: .text

 ## With a SECTIONS command, orphan sections are created verbatim.
 ## No grouping is performed for them.
 # RUN: echo 'SECTIONS {}' > %t.lds
 # RUN: ld.lld -T %t.lds -z keep-text-section-prefix %t.o -o %t.script
 # RUN: llvm-readelf -S %t.script | FileCheck --check-prefix=SCRIPT %s

 # SCRIPT: .text
 # SCRIPT-NEXT: .text.f
 # SCRIPT-NEXT: .text.hot.f_hot
 # SCRIPT-NEXT: .text.unknown.f_unknown
+# SCRIPT-NEXT: .text.split.f_split
 # SCRIPT-NEXT: .text.startup.f_startup
 # SCRIPT-NEXT: .text.exit.f_exit
 # SCRIPT-NEXT: .text.unlikely.f_unlikely

 .globl _start
 _start:
   ret

 .section .text.f,"ax"
   nop

 .section .text.hot.f_hot,"ax"
   nop

 .section .text.unknown.f_unknown,"ax"
   nop

+.section .text.split.f_split,"ax"
+  nop
+
 .section .text.startup.f_startup,"ax"
   nop

 .section .text.exit.f_exit,"ax"
   nop

 .section .text.unlikely.f_unlikely,"ax"
   nop