Download Raw Diff

Details

Reviewers

ruiu
MaskRay
• espindola

Commits

rG68142324290f: [LLD][ELF] Support --[no-]mmap-output-file with F_no_mmap

Summary

Add a flag F_no_mmap to FileOutputBuffer to support
--[no-]mmap-output-file in ELF LLD. LLD currently explicitly ignores
this flag for compatibility with GNU ld and gold.

We need this flag to speed up link time for large binaries in certain
scenarios. When we link some of our larger binaries we find that LLD
takes 50+ GB of memory, which causes memory pressure. The memory
pressure causes the VM to flush dirty pages of the output file to disk.
This is normally okay, since we should be flushing cold pages. However,
when using BtrFS with compression we need to write 128KB at a time when
we flush a page. If any page in that 128KB block is written again, then
it must be flushed a second time, and so on. Since LLD doesn't write
sequentially this causes write amplification. The same 128KB block will
end up being flushed multiple times, causing the linker to many times
more IO than necessary. We've observed 3-5x faster builds with
-no-mmap-output-file when we hit this scenario.

The bad scenario only applies to compressed filesystems, which group
together multiple pages into a single compressed block. I've tested
BtrFS, but the problem will be present for any compressed filesystem
on Linux, since it is caused by the VM.

Silently ignoring --no-mmap-output-file caused a silent regression when
we switched from gold to lld. We pass --no-mmap-output-file to fix this
edge case, but since lld silently ignored the flag we didn't realize it
wasn't being respected.

Test Plan:

ninja check-llvm
ninja check-lld

Benchmark building a 9 GB binary that exposes this edge case. I linked 3
times with --mmap-output-file and 3 times with --no-mmap-output-file and
took the average. The machine has 24 cores @ 2.4 GHz, 112 GB of RAM,
BtrFS mounted with -compress-force=zstd, and an 80% full disk.

Mode	Time
mmap	894 s
no mmap	126 s

When compression is disabled, BtrFS performs just as well with and
without mmap on this benchmark.

I was unable to reproduce the regression with any binaries in
lld-speed-test.

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 39896
Build 39950: arc lint + arc unit

Event Timeline

terrelln created this revision.Oct 21 2019, 8:42 PM

Herald added a reviewer: • espindola. · View Herald TranscriptOct 21 2019, 8:42 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: arichardson, emaste. · View Herald Transcript

terrelln added a parent revision: D69293: [Support] Add in memory flag for FileOutputBuffer.Oct 21 2019, 8:43 PM

Harbormaster completed remote builds in B39896: Diff 225990.Oct 21 2019, 8:50 PM

I don't know btrfs well but my current understanding is that there is a trade-off between compression rates and performance.

--no-mmap-output-file is supported though currently ignored, so filling in its functionality seems fine. I've got some suggestions and questions about the description.

Add support for -[no-]mmap-output-file to ELF. LLD currently explicitly ignores this flag for compatibility with the GNU linker.

GNU linker*s*, or more explicitly, GNU ld and gold.

However, when BtrFS compression is enabled, BtrFS writes 128KB blocks. Since LLD doesn't write the output file sequentially, this causes massive write and compression amplification.

Do the flushes of random accessed dirty pages make it slow under memory pressure?

FileOutputBuffer::F_in_memory

FileOutputBuffer does no have this mode. Do you have another patch that has not been sent?

-compress-force=zstd

What is compression is turned off? Have you measured the performance differences with and without --no-mmap-output-file? I think I have asked too much, but if you have access to some ext4/XFS/ZFS machines, the numbers will be useful... CentOS has defaulted to XFS and ext4 is the most widely used fs on Linux.

lld/test/ELF/mmap-output-file.s
4	Just check the output is the same. llvm-mc -filetype=obj -triple=x86-unknown-linux /dev/null -o %t.o ld.lld %t.o -o %t ld.lld --no-mmap-output-file %t.o -o %t1 cmp %t %t1 ld.lld --mmap-output-file %t.o -o %t2 cmp %t %t2

I should have reviewed this one first, sorry. I'm a little sad that BtrFS behaves poorly under a certain condition, and I feel like the filesystem is at fault than the linker, but I'm fine with actually supporting -no-mmap-output-file option as a workaround. That's practical and easy to implement.

lld/ELF/Config.h
199	Sort asciibetically.
lld/ELF/Options.td
410	Map -> Mmap
411	map -> mmap

I'll update the description to answer questions, and make the requested changes. I've also answered questions inline below.

I'm a little sad that BtrFS behaves poorly under a certain condition, and I feel like the filesystem is at fault than the linker

I'd say it is a combination of the linker and BtrFS causing the problem. With compression enabled, BtrFS has to write 128 KB at a time. When there is memory pressure, because the linker uses ~50 GB of memory, the kernel is forced to flush the page cache, so BtrFS is forced to do a write. I suspect you'd see the same problem on any compressed filesystem when under memory pressure. Reducing the memory footprint of the linker would also alleviate the problem.

It is possible the kernel page eviction algorithm could be improved if it knew the "block size" of the filesystem, and I've asked our BtrFS folks if they see a way this case can be improved.

Do the flushes of random accessed dirty pages make it slow under memory pressure?

Yeah, BtrFS is forced to flush the dirty pages, but it has to write 128 KB at a time. This drastically slows down non-sequential write patterns since the same 128 KB block is written multiple times.

FileOutputBuffer::F_in_memory

Thats the parent diff D69293

What is compression is turned off? Have you measured the performance differences with and without --no-mmap-output-file? I think I have asked too much, but if you have access to some ext4/XFS/ZFS machines, the numbers will be useful... CentOS has defaulted to XFS and ext4 is the most widely used fs on Linux.

When compression is disabled I don't see the same problem, and I don't expect to see it on other filesystems (without compression enabled). The reason is we don't see as much write amplification. When compression is disabled flushing a 4 KB page only writes 4 KB. And, the kernel should only flush colder pages, so the write amplification should be minimized. But, maybe the kernel isn't as smart about flushing a cold 4 KB page in a warm 128 KB range.

Can you try posix_fallocate(fd, 0, len) and see how it performs under memory pressure with btrfs+compression?

Can you try posix_fallocate(fd, 0, len) and see how it performs under memory pressure with btrfs+compression?

I see the same behavior, which is expected, since we'll still see the write amplification.

Merge D69293 into this diff.
Fix all comments.

Herald added a subscriber: hiraditya. · View Herald TranscriptOct 24 2019, 6:02 PM

terrelln mentioned this in D69293: [Support] Add in memory flag for FileOutputBuffer.Oct 24 2019, 6:02 PM

terrelln marked 4 inline comments as done.

terrelln retitled this revision from [LLD][ELF] Support -[no-]mmap-output-file to [LLD][ELF] Support -[no-]mmap-output-file with F_no_mmap.Oct 24 2019, 6:04 PM

terrelln edited the summary of this revision. (Show Details)

terrelln added a project: lld.

Harbormaster completed remote builds in B40038: Diff 226369.Oct 24 2019, 6:08 PM

terrelln edited the summary of this revision. (Show Details)Oct 24 2019, 6:09 PM

smeenai added a subscriber: smeenai.Oct 24 2019, 6:23 PM

LGTM

lld/ELF/Writer.cpp
2596–2598	nit: can you replace this `=` with `\|=` for consistency?

This revision is now accepted and ready to land.Oct 24 2019, 7:52 PM

MaskRay accepted this revision.Oct 24 2019, 10:32 PM

MaskRay added inline comments.

lld/test/ELF/mmap-output-file.s
6	Prefer `--m` in tests. Due to the way the parser works, `-m emulation` is not considered, but we don't want users to use `-mmap-output-file`

Fix comments

terrelln marked 2 inline comments as done.Oct 25 2019, 11:11 AM

Harbormaster completed remote builds in B40071: Diff 226466.Oct 25 2019, 11:12 AM

Will you need someone to commit this for you?

@smeenai yeah I will

-[no-]mmap-output-file

Please also change the title to use the double-dashed form.

lld/test/ELF/mmap-output-file.s
4	Just use `x86_64`. This is not Linux specific.

terrelln retitled this revision from [LLD][ELF] Support -[no-]mmap-output-file with F_no_mmap to [LLD][ELF] Support --[no-]mmap-output-file with F_no_mmap.Oct 28 2019, 9:47 PM

terrelln edited the summary of this revision. (Show Details)

Use triple x86_64 in tests.

Harbormaster completed remote builds in B40174: Diff 226835.Oct 28 2019, 9:50 PM

terrelln marked an inline comment as done.Oct 28 2019, 9:50 PM

Feel free to submit.

I can commit this for you tomorrow if one of your reviewers hasn't gotten to it by then.

@smeenai I just tried

curl -L 'https://reviews.llvm.org/D69294?download=true' | patch -p1; git commit -m 'Differential Revision: https://reviews.llvm.org/D69294'; arc amend

The nice thing with git is that %an and %ae (author name and email) will be just correct. The old Patch by Name svn way does not give enough attribution to the author :(

Closed by commit rG68142324290f: [LLD][ELF] Support --[no-]mmap-output-file with F_no_mmap (authored by terrelln, committed by MaskRay). · Explain WhyOct 29 2019, 5:45 PM

This revision was automatically updated to reflect the committed changes.

Diff 225990

lld/ELF/Config.h

Show First 20 Lines • Show All 190 Lines • ▼ Show 20 Lines	struct Configuration {
bool undefinedVersion;		bool undefinedVersion;
bool useAndroidRelrTags = false;		bool useAndroidRelrTags = false;
bool warnBackrefs;		bool warnBackrefs;
bool warnCommon;		bool warnCommon;
bool warnIfuncTextrel;		bool warnIfuncTextrel;
bool warnMissingEntry;		bool warnMissingEntry;
bool warnSymbolOrdering;		bool warnSymbolOrdering;
bool writeAddends;		bool writeAddends;
		bool mmapOutputFile;
		ruiuUnsubmitted Done Reply Inline Actions Sort asciibetically. ruiu: Sort asciibetically.
bool zCombreloc;		bool zCombreloc;
bool zCopyreloc;		bool zCopyreloc;
bool zExecstack;		bool zExecstack;
bool zGlobal;		bool zGlobal;
bool zHazardplt;		bool zHazardplt;
bool zIfuncNoplt;		bool zIfuncNoplt;
bool zInitfirst;		bool zInitfirst;
bool zInterpose;		bool zInterpose;
▲ Show 20 Lines • Show All 123 Lines • Show Last 20 Lines

lld/ELF/Driver.cpp

Show First 20 Lines • Show All 941 Lines • ▼ Show 20 Lines	static void readConfigs(opt::InputArgList &args) {
config->unresolvedSymbols = getUnresolvedSymbolPolicy(args);		config->unresolvedSymbols = getUnresolvedSymbolPolicy(args);
config->warnBackrefs =		config->warnBackrefs =
args.hasFlag(OPT_warn_backrefs, OPT_no_warn_backrefs, false);		args.hasFlag(OPT_warn_backrefs, OPT_no_warn_backrefs, false);
config->warnCommon = args.hasFlag(OPT_warn_common, OPT_no_warn_common, false);		config->warnCommon = args.hasFlag(OPT_warn_common, OPT_no_warn_common, false);
config->warnIfuncTextrel =		config->warnIfuncTextrel =
args.hasFlag(OPT_warn_ifunc_textrel, OPT_no_warn_ifunc_textrel, false);		args.hasFlag(OPT_warn_ifunc_textrel, OPT_no_warn_ifunc_textrel, false);
config->warnSymbolOrdering =		config->warnSymbolOrdering =
args.hasFlag(OPT_warn_symbol_ordering, OPT_no_warn_symbol_ordering, true);		args.hasFlag(OPT_warn_symbol_ordering, OPT_no_warn_symbol_ordering, true);
		config->mmapOutputFile =
		args.hasFlag(OPT_mmap_output_file, OPT_no_mmap_output_file, true);
config->zCombreloc = getZFlag(args, "combreloc", "nocombreloc", true);		config->zCombreloc = getZFlag(args, "combreloc", "nocombreloc", true);
config->zCopyreloc = getZFlag(args, "copyreloc", "nocopyreloc", true);		config->zCopyreloc = getZFlag(args, "copyreloc", "nocopyreloc", true);
config->zExecstack = getZFlag(args, "execstack", "noexecstack", false);		config->zExecstack = getZFlag(args, "execstack", "noexecstack", false);
config->zGlobal = hasZOption(args, "global");		config->zGlobal = hasZOption(args, "global");
config->zHazardplt = hasZOption(args, "hazardplt");		config->zHazardplt = hasZOption(args, "hazardplt");
config->zIfuncNoplt = hasZOption(args, "ifunc-noplt");		config->zIfuncNoplt = hasZOption(args, "ifunc-noplt");
config->zInitfirst = hasZOption(args, "initfirst");		config->zInitfirst = hasZOption(args, "initfirst");
config->zInterpose = hasZOption(args, "interpose");		config->zInterpose = hasZOption(args, "interpose");
▲ Show 20 Lines • Show All 1,020 Lines • Show Last 20 Lines

lld/ELF/Options.td

	Show First 20 Lines • Show All 400 Lines • ▼ Show 20 Lines
	defm warn_ifunc_textrel: B<"warn-ifunc-textrel",			defm warn_ifunc_textrel: B<"warn-ifunc-textrel",
	"Warn about using ifunc symbols with text relocations",			"Warn about using ifunc symbols with text relocations",
	"Do not warn about using ifunc symbols with text relocations (default)">;			"Do not warn about using ifunc symbols with text relocations (default)">;

	defm warn_symbol_ordering: B<"warn-symbol-ordering",			defm warn_symbol_ordering: B<"warn-symbol-ordering",
	"Warn about problems with the symbol ordering file (default)",			"Warn about problems with the symbol ordering file (default)",
	"Do not warn about problems with the symbol ordering file">;			"Do not warn about problems with the symbol ordering file">;

				defm mmap_output_file: B<"mmap-output-file",
				"Map the output file for writing (default)",
				ruiuUnsubmitted Done Reply Inline Actions Map -> Mmap ruiu: Map -> Mmap
				"Do not map the output file for writing">;
				ruiuUnsubmitted Done Reply Inline Actions map -> mmap ruiu: map -> mmap

	def warn_unresolved_symbols: F<"warn-unresolved-symbols">,			def warn_unresolved_symbols: F<"warn-unresolved-symbols">,
	HelpText<"Report unresolved symbols as warnings">;			HelpText<"Report unresolved symbols as warnings">;

	defm whole_archive: B<"whole-archive",			defm whole_archive: B<"whole-archive",
	"Force load of all members in a static library",			"Force load of all members in a static library",
	"Do not force load of all members in a static library (default)">;			"Do not force load of all members in a static library (default)">;

	defm wrap: Eq<"wrap", "Use wrapper functions for symbol">,			defm wrap: Eq<"wrap", "Use wrapper functions for symbol">,
	▲ Show 20 Lines • Show All 144 Lines • ▼ Show 20 Lines
	// Options listed below are silently ignored for now for compatibility.			// Options listed below are silently ignored for now for compatibility.
	def: F<"detect-odr-violations">;			def: F<"detect-odr-violations">;
	def: Flag<["-"], "g">;			def: Flag<["-"], "g">;
	def: F<"long-plt">;			def: F<"long-plt">;
	def: F<"no-add-needed">;			def: F<"no-add-needed">;
	def: F<"no-copy-dt-needed-entries">;			def: F<"no-copy-dt-needed-entries">;
	def: F<"no-ctors-in-init-array">;			def: F<"no-ctors-in-init-array">;
	def: F<"no-keep-memory">;			def: F<"no-keep-memory">;
	def: F<"no-mmap-output-file">;
	def: F<"no-pipeline-knowledge">;			def: F<"no-pipeline-knowledge">;
	def: F<"no-warn-mismatch">;			def: F<"no-warn-mismatch">;
	def: Flag<["-"], "p">;			def: Flag<["-"], "p">;
	def: Separate<["--", "-"], "rpath-link">;			def: Separate<["--", "-"], "rpath-link">;
	def: J<"rpath-link=">;			def: J<"rpath-link=">;
	def: F<"secure-plt">;			def: F<"secure-plt">;
	def: F<"sort-common">;			def: F<"sort-common">;
	def: F<"stats">;			def: F<"stats">;
	Show All 12 Lines

lld/ELF/Writer.cpp

Show First 20 Lines • Show All 2,587 Lines • ▼ Show 20 Lines	template <class ELFT> void Writer<ELFT>::openFile() {
if (fileSize != size_t(fileSize) \|\| maxSize < fileSize) {		if (fileSize != size_t(fileSize) \|\| maxSize < fileSize) {
error("output file too large: " + Twine(fileSize) + " bytes");		error("output file too large: " + Twine(fileSize) + " bytes");
return;		return;
}		}

unlinkAsync(config->outputFile);		unlinkAsync(config->outputFile);
unsigned flags = 0;		unsigned flags = 0;
if (!config->relocatable)		if (!config->relocatable)
flags = FileOutputBuffer::F_executable;		flags = FileOutputBuffer::F_executable;
		if (!config->mmapOutputFile)
		flags \|= FileOutputBuffer::F_in_memory;
		ruiuUnsubmitted Done Reply Inline Actions nit: can you replace this `=` with `\|=` for consistency? ruiu: nit: can you replace this `=` with `\|=` for consistency?
Expected<std::unique_ptr<FileOutputBuffer>> bufferOrErr =		Expected<std::unique_ptr<FileOutputBuffer>> bufferOrErr =
FileOutputBuffer::create(config->outputFile, fileSize, flags);		FileOutputBuffer::create(config->outputFile, fileSize, flags);

if (!bufferOrErr) {		if (!bufferOrErr) {
error("failed to open " + config->outputFile + ": " +		error("failed to open " + config->outputFile + ": " +
llvm::toString(bufferOrErr.takeError()));		llvm::toString(bufferOrErr.takeError()));
return;		return;
}		}
▲ Show 20 Lines • Show All 146 Lines • Show Last 20 Lines

lld/test/ELF/mmap-output-file.s

This file was added.

				# REQUIRES: x86

				# RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %s -o %t.o
				# RUN: ld.lld %t.o -mmap-output-file -o %t1
				MaskRayUnsubmitted Done Reply Inline Actions Just check the output is the same. llvm-mc -filetype=obj -triple=x86-unknown-linux /dev/null -o %t.o ld.lld %t.o -o %t ld.lld --no-mmap-output-file %t.o -o %t1 cmp %t %t1 ld.lld --mmap-output-file %t.o -o %t2 cmp %t %t2 MaskRay: Just check the output is the same. ``` llvm-mc -filetype=obj -triple=x86-unknown-linux…
				MaskRayUnsubmitted Done Reply Inline Actions Just use `x86_64`. This is not Linux specific. MaskRay: Just use `x86_64`. This is not Linux specific.
				# RUN: llvm-objdump -d %t1 \| FileCheck %s

				MaskRayUnsubmitted Done Reply Inline Actions Prefer `--m` in tests. Due to the way the parser works, `-m emulation` is not considered, but we don't want users to use `-mmap-output-file` MaskRay: Prefer `--m` in tests. Due to the way the parser works, `-m emulation` is not considered, but…
				# CHECK: nop

				# RUN: ld.lld %t.o -no-mmap-output-file -o %t2
				# RUN: diff %t1 %t2

				.globl _start
				_start:
				nop

lld/test/ELF/silent-ignore.test

	RUN: ld.lld --version \			RUN: ld.lld --version \
	RUN: -detect-odr-violations \			RUN: -detect-odr-violations \
	RUN: -g \			RUN: -g \
	RUN: -long-plt \			RUN: -long-plt \
	RUN: -no-add-needed \			RUN: -no-add-needed \
	RUN: -no-copy-dt-needed-entries \			RUN: -no-copy-dt-needed-entries \
	RUN: -no-ctors-in-init-array \			RUN: -no-ctors-in-init-array \
	RUN: -no-keep-memory \			RUN: -no-keep-memory \
	RUN: -no-mmap-output-file \
	RUN: -no-pipeline-knowledge \			RUN: -no-pipeline-knowledge \
	RUN: -no-warn-mismatch \			RUN: -no-warn-mismatch \
	RUN: -p \			RUN: -p \
	RUN: -rpath-link . \			RUN: -rpath-link . \
	RUN: -secure-plt \			RUN: -secure-plt \
	RUN: -sort-common \			RUN: -sort-common \
	RUN: -stats \			RUN: -stats \
	RUN: -warn-execstack \			RUN: -warn-execstack \
	RUN: -warn-once \			RUN: -warn-once \
	RUN: -warn-shared-textrel \			RUN: -warn-shared-textrel \
	RUN: -EB \			RUN: -EB \
	RUN: -EL \			RUN: -EL \
	RUN: -G 0 \			RUN: -G 0 \
	RUN: -Qy			RUN: -Qy
	RUN: not ld.lld --version --not-an-ignored-argument			RUN: not ld.lld --version --not-an-ignored-argument

This is an archive of the discontinued LLVM Phabricator instance.

[LLD][ELF] Support --[no-]mmap-output-file with F_no_mmap
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 225990

lld/ELF/Config.h

lld/ELF/Driver.cpp

lld/ELF/Options.td

lld/ELF/Writer.cpp

lld/test/ELF/mmap-output-file.s

lld/test/ELF/silent-ignore.test

This is an archive of the discontinued LLVM Phabricator instance.

[LLD][ELF] Support --[no-]mmap-output-file with F_no_mmapClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 225990

lld/ELF/Config.h

lld/ELF/Driver.cpp

lld/ELF/Options.td

lld/ELF/Writer.cpp

lld/test/ELF/mmap-output-file.s

lld/test/ELF/silent-ignore.test

[LLD][ELF] Support --[no-]mmap-output-file with F_no_mmap
ClosedPublic