This is an archive of the discontinued LLVM Phabricator instance.


Differential D102964

[lld-macho] Implement cstring deduplication
ClosedPublic

Authored by int3 on May 21 2021, 8:35 PM.


Details

Reviewers

gkm

Group Reviewers

Restricted Project

Commits

rG04259cde15a9: [lld-macho] Implement cstring deduplication

Summary

Our implementation draws heavily from LLD-ELF's, which in turn delegates
its string deduplication to llvm-mc's StringTableBuilder. The messiness of
this diff is largely due to the fact that we've previously assumed that
all InputSections get concatenated together to form the output. This is
no longer true with CStringInputSections, which split their contents into
StringPieces. StringPieces are much more lightweight than InputSections,
which is important as we create a lot of them. They may also overlap in
the output, which makes it possible for strings to be tail-merged. In
fact, the initial version of this diff implemented tail merging, but
I've dropped it for reasons I'll explain later.
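
To make the splitting and deduplication described above concrete, here is a minimal C++ sketch. It is not the code from this diff: `Piece`, `splitIntoPieces`, and `dedupPieces` are hypothetical names, and a plain `std::unordered_map` stands in for StringTableBuilder. It splits a section's bytes at NUL terminators and gives identical strings the same output offset, with no alignment and no tail merging.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for a StringPiece: one NUL-terminated string inside
// an input section, plus the offset it is assigned in the output section.
struct Piece {
  uint32_t inSecOff;  // offset of the string within its input section
  uint64_t outSecOff; // offset of the (possibly shared) copy in the output
};

// Split a section's bytes into NUL-terminated pieces. Returns false if the
// data does not end with a NUL terminator.
inline bool splitIntoPieces(const std::string &data,
                            std::vector<Piece> &pieces) {
  size_t off = 0;
  while (off < data.size()) {
    size_t end = data.find('\0', off);
    if (end == std::string::npos)
      return false; // the last string is not NUL-terminated
    Piece p;
    p.inSecOff = static_cast<uint32_t>(off);
    p.outSecOff = 0;
    pieces.push_back(p);
    off = end + 1;
  }
  return true;
}

// Assign output offsets so that identical strings share a single copy.
// Returns the total size of the deduplicated output.
inline uint64_t dedupPieces(const std::string &data,
                            std::vector<Piece> &pieces) {
  std::unordered_map<std::string, uint64_t> seen; // contents -> output offset
  uint64_t size = 0;
  for (Piece &p : pieces) {
    size_t end = data.find('\0', p.inSecOff);
    assert(end != std::string::npos && "pieces must come from splitIntoPieces");
    // Keep the terminator so that the stored size matches the emitted bytes.
    std::string str = data.substr(p.inSecOff, end - p.inSecOff + 1);
    auto it = seen.find(str);
    if (it != seen.end()) {
      p.outSecOff = it->second; // duplicate: reuse the earlier copy
      continue;
    }
    seen.emplace(str, size);
    p.outSecOff = size;
    size += str.size();
  }
  return size;
}
```

Under these assumptions, a section containing "foo", "bar", "foo" and an empty string yields four pieces but only nine output bytes, and both "foo" pieces end up at output offset 0.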

Alignment Issues

Mergeable cstring literals are found under the __TEXT,__cstring
section. In contrast to ELF, which puts strings that need different
alignments into different sections, clang's Mach-O backend puts them all
in one section. Strings that need to be aligned have the .p2align
directive emitted before them, which simply translates into zero padding
in the object file.

I *think* ld64 extracts the desired per-string alignment from this data
by preserving each string's offset from the last section-aligned
address. I'm not entirely certain since it doesn't seem consistent about
doing this; but perhaps this can be chalked up to cases where ld64 has
to deduplicate strings with different offset/alignment combos -- it
seems to pick one of their alignments to preserve. This doesn't seem
correct in general; we can in fact induce ld64 to produce a crashing
binary just by linking in an additional object file that only contains
cstrings and no code. See PR50563 for details.

Moreover, this scheme seems rather inefficient: since unaligned and
aligned strings are all put in the same section, which has a single
alignment value, it doesn't seem possible to tell whether a given string
has any alignment requirements. Preserving offsets and alignments
for strings that don't need them is wasteful.

In practice, the crashes seen so far seem to stem from x86_64 SIMD
operations on cstrings. x86_64's aligned SSE memory accesses (e.g. movaps)
must be 16-byte-aligned. So for now, I'm thinking of just aligning all strings
to 16 bytes on x86_64. This is indeed wasteful, but implementation-wise
it's simpler than preserving per-string alignment+offsets. It also
avoids the aforementioned crash after deduplication of
differently-aligned strings. Finally, the overhead is not huge: using
16-byte alignment (vs no alignment) is only a 0.5% size overhead when
linking chromium_framework.
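
As a rough illustration of where the padding overhead comes from, the sketch below lays out a list of string sizes with and without a fixed alignment. `alignTo` here is a local stand-in written for this example (not the LLVM helper), `sectionSize` is a hypothetical name, and the sizes are made up; the 0.5% figure above comes from the real link, not from this code.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Round `value` up to the next multiple of `align`, which must be a power of
// two. Local helper for this sketch only.
inline uint64_t alignTo(uint64_t value, uint64_t align) {
  assert(align != 0 && (align & (align - 1)) == 0);
  return (value + align - 1) & ~(align - 1);
}

// Total section size when every string starts at a multiple of `align`.
// Each entry of `sizes` is a string's length including its NUL terminator.
inline uint64_t sectionSize(const std::vector<uint64_t> &sizes,
                            uint64_t align) {
  uint64_t end = 0;
  for (uint64_t size : sizes)
    end = alignTo(end, align) + size;
  return end;
}
```

For strings of 4, 17 and 1 bytes, `sectionSize` gives 22 bytes with an alignment of 1 and 49 bytes with an alignment of 16. Short strings pay proportionally the most, so the real-world overhead depends on the length distribution of the strings being linked.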

With these alignment requirements, it doesn't make sense to attempt tail
merging -- most strings will not be eligible since their overlaps aren't
likely to start at a 16-byte boundary. Tail-merging (with alignment) for
chromium_framework only improves size by 0.3%.
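
The interaction between tail merging and alignment can be sketched as follows. This is an illustration of the argument above, not code from the diff: `canTailMerge` is a hypothetical helper, and it assumes the longer string itself starts at an aligned address.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Returns true if `shorter` could be stored as the tail of `longer` while
// still starting at a multiple of `align`, assuming `longer` starts at an
// aligned address.
inline bool canTailMerge(const std::string &longer, const std::string &shorter,
                         uint64_t align) {
  assert(align != 0);
  if (shorter.size() > longer.size())
    return false;
  uint64_t start = longer.size() - shorter.size();
  // `shorter` must be a suffix of `longer`...
  if (longer.compare(start, shorter.size(), shorter) != 0)
    return false;
  // ...and the suffix must begin on an alignment boundary.
  return start % align == 0;
}
```

With an alignment of 1, "bar" can share the tail of "foobar". With an alignment of 16 it cannot, because the suffix starts at offset 3; only suffixes starting at offsets 0, 16, 32 and so on remain eligible.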

It's worth noting that LLD-ELF only does tail merging at -O2. By
default (at -O1), it just deduplicates w/o tail merging. @thakis has
also mentioned that they saw it regress compressed size in some cases
and therefore turned it off. ld64 does not seem to do tail merging at
all.

Performance Numbers

CString deduplication reduces chromium_framework from 250MB to 242MB, or
about a 3.2% reduction.

Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W:

    N           Min           Max        Median           Avg        Stddev
x  20          3.91          4.03         3.935          3.95   0.034641016
+  20          3.99          4.14         4.015        4.0365     0.0492336
Difference at 95.0% confidence
        0.0865 +/- 0.027245
        2.18987% +/- 0.689746%
        (Student's t, pooled s = 0.0425673)

As expected, cstring merging incurs some non-trivial overhead.
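
The summary figures in the table above can be reproduced from the per-sample means and standard deviations. The sketch below assumes the standard pooled two-sample Student's t formulas; `Sample`, `pooledStddev` and `halfWidth` are names invented for this example, and the critical value 2.024 (two-sided 95%, 38 degrees of freedom) is supplied by hand as an approximation.

```cpp
#include <cassert>
#include <cmath>

// Summary statistics for one set of timing runs.
struct Sample {
  int n;         // number of runs
  double mean;   // average wall time
  double stddev; // sample standard deviation
};

// Pooled standard deviation of two samples.
inline double pooledStddev(const Sample &a, const Sample &b) {
  assert(a.n > 1 && b.n > 1);
  double num = (a.n - 1) * a.stddev * a.stddev +
               (b.n - 1) * b.stddev * b.stddev;
  return std::sqrt(num / (a.n + b.n - 2));
}

// Half-width of the confidence interval for the difference of the means,
// given the Student's t critical value `tCrit` for a.n + b.n - 2 degrees of
// freedom.
inline double halfWidth(const Sample &a, const Sample &b, double tCrit) {
  return tCrit * pooledStddev(a, b) * std::sqrt(1.0 / a.n + 1.0 / b.n);
}
```

Plugging in the two rows (n = 20, averages 3.95 and 4.0365) gives a pooled s of about 0.04257 and a half-width of about 0.0272, matching the table. The relative slowdown is the difference of the averages over the baseline: 0.0865 / 3.95, or roughly 2.19%.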

When passing --no-literal-merge, it seems that performance is the
same, i.e. the refactoring in this diff didn't cost us.

    N           Min           Max        Median           Avg        Stddev
x  20          3.91          4.03         3.935          3.95   0.034641016
+  20          3.89          4.02         3.935        3.9435   0.043197831
No difference proven at 95.0% confidence

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

int3 created this revision. · May 21 2021, 8:35 PM

Herald added a reviewer: gkm. · May 21 2021, 8:35 PM

Herald added a project: Restricted Project.

Herald added subscribers: dang, mgrang, mgorny.

int3 requested review of this revision. · May 21 2021, 8:35 PM

Herald added a project: Restricted Project. · May 21 2021, 8:35 PM

Herald added a subscriber: llvm-commits.

int3 edited the summary of this revision. · May 21 2021, 8:35 PM

Harbormaster completed remote builds in B105742: Diff 347169. · May 21 2021, 8:35 PM

update commit msg

Harbormaster completed remote builds in B105743: Diff 347170. · May 21 2021, 8:38 PM

int3 edited the summary of this revision. · May 21 2021, 8:49 PM

remove FIXME

Harbormaster completed remote builds in B105744: Diff 347171. · May 21 2021, 8:54 PM

remove forward declaration

Harbormaster completed remote builds in B105745: Diff 347172. · May 21 2021, 8:56 PM

int3 added inline comments. · May 21 2021, 9:00 PM

lld/MachO/Writer.cpp
936–937	An earlier implementation of this diff always created the CStringLiteralSection, even if literal merging was disabled. I therefore hoisted out this check to avoid having a conflict between the unneeded CStringLiteralSection and the actual ConcatOutputSection when literal merging was not being done. We now only create the CStringLiteralSection as-needed, so this is likely unnecessary. However, I think it still makes sense to avoid unnecessary section name conflicts, so I've left it in.

Harbormaster completed remote builds in B105746: Diff 347173. · May 21 2021, 10:28 PM

int3 added inline comments. · May 21 2021, 11:51 PM

lld/test/MachO/load-command-sequence.s
33–34	while fixing section ordering issues, I noticed that ld64 orders `__const` after `__got`, hence this fix.

int3 edited the summary of this revision. · May 22 2021, 12:18 AM

int3 edited the summary of this revision. · May 22 2021, 3:00 PM

gkm added a comment. · May 22 2021, 3:21 PM

This comment was removed by gkm.

I think the class hierarchy change makes sense as part of this diff, since it's motivated by needs of CStringInputSection's implementation, but yeah I can split out some of the other parts

int3 mentioned this in D102971: [lld-macho][nfc] Rename MergedOutputSection to ConcatOutputSection. · May 22 2021, 5:52 PM

int3 mentioned this in D102972: [lld-macho][nfc] Sort OutputSections based on explicit order of command-line inputs.

int3 added a parent revision: D102972: [lld-macho][nfc] Sort OutputSections based on explicit order of command-line inputs. · May 22 2021, 5:53 PM

rebase

Harbormaster completed remote builds in B105777: Diff 347220. · May 22 2021, 5:54 PM

(FWIW we implemented this in lld-link but then ended up turning it off in chromium since it increased compressed size: https://crbug.com/838449)

(see also D44504)

Huh, that makes sense I guess. I suppose we should plan on implementing dedup-without-merge in the future then. Shouldn't be hard to fit into the existing structure.

int3 mentioned this in rG33706191d88d: [lld-macho][nfc] Rename MergedOutputSection to ConcatOutputSection. · May 25 2021, 11:59 AM

int3 mentioned this in rGfcab06bd85d1: [lld-macho][nfc] Sort OutputSections based on explicit order of command-line….

reword error message

need to double-check some correctness issues

Harbormaster completed remote builds in B106146: Diff 347764. · May 25 2021, 1:51 PM

fix alignment issues

Herald added a subscriber: pengfei. · Jun 2 2021, 3:18 PM

Harbormaster completed remote builds in B107336: Diff 349397. · Jun 2 2021, 4:00 PM

A random review might not be the best place for this, but:

It's great we're looking at size of the output binary!

Maybe it makes sense to look for lower-hanging fruit before implementing somewhat expensive things?

Here's a bloaty (https://github.com/google/bloaty) diff between ld64-linked Chromium Framework (after --, that's where the old version goes) and lld-linked Chromium Framework:

% ~/src/bloaty/bloaty 'Chromium Framework' --  'Chromium.app/Contents/Frameworks/Chromium Framework.framework/Versions/Current/Chromium Framework'
    FILE SIZE        VM SIZE
 --------------  --------------
  [NEW] +6.98Mi  [NEW] +6.98Mi    __DATA_CONST,__const
 +36e2% +6.42Mi +36e2% +6.42Mi    Rebase Info
   +62% +3.19Mi   +62% +3.19Mi    __TEXT,__cstring
  [NEW] +1.52Mi  [NEW] +1.52Mi    __TEXT,__literal16
  +139% +1.11Mi  +139% +1.11Mi    Function Start Addresses
  +1.5%  +940Ki  +1.5%  +940Ki    String Table
 +38e2%  +530Ki +38e2%  +530Ki    __TEXT,__eh_frame
   +27%  +124Ki   +27%  +124Ki    __TEXT,__objc_methtype
  +102%  +118Ki  +102%  +118Ki    __TEXT,__objc_methname
  +1.1%  +114Ki  +1.1%  +114Ki    Symbol Table
  [NEW] +88.0Ki  [NEW] +88.0Ki    __TEXT,__literal8
  +263% +80.9Ki  +263% +80.9Ki    Binding Info
  [NEW] +62.3Ki  [NEW] +62.3Ki    __TEXT,__literal4
  [NEW] +32.4Ki  [NEW] +32.4Ki    __DATA_CONST,__cfstring
  +136% +28.2Ki  +136% +28.2Ki    __DATA,__objc_selrefs
  +0.0% +8.38Ki  -0.0% -6.59Ki    [31 Others]
  [DEL] -26.8Ki  [DEL] -26.8Ki    __DATA,__cfstring
  [DEL] -53.4Ki  [DEL] -53.4Ki    Table of Non-instructions
 -30.7% -57.9Ki -30.7% -57.9Ki    __DATA,__objc_const
  -1.0% -83.3Ki  -1.0% -83.3Ki    __TEXT,__const
  [DEL] -6.98Mi  [DEL] -6.98Mi    __DATA,__const
  +6.0% +14.1Mi  +5.9% +14.1Mi    TOTAL

__cstring is indeed on the list, but there are other things before it. The __DATA_CONST,__const data looks like it's just in __DATA in ld64 and was merely moved around (see the 2nd-to-last line), but our rebase info and LC_FUNCTION_STARTS sections are way larger and possibly easier to fix (LC_FUNCTION_STARTS is 2003880 vs 838208 bytes -- that's 1.2 MB that is likely a cheap fix).

3% smaller is great, but 2% slower isn't exactly cheap. It's not super expensive either, but 10% here and 10% there and suddenly you take twice as long. Being much faster is one of the big selling points of lld so we should try hard not to regress on that. Several thoughts on that:

Maybe this should be opt in (only at -O2, and/or lto or what)? People who really want optimized binaries over link time probably do (thin) LTO.
If the main cost is the hash:
1. That should parallelize well
2. Is there some way we could compute the string hash at compile time and stash it somewhere? (Similar in idea to http://blog.llvm.org/2018/01/improving-link-time-on-windows-with.html)

(Some of this also applies to the ICF patch – looks like we're doing a more thorough job than ld64 with ICF but it's also more expensive.)

Ah, I should probably have added a bit more motivation. The internal program I've been analyzing has significant size overhead from duplicated CFStrings. These CFStrings are essentially boxed cstrings, with an additional field that needs to be bound by dyld. As such, they bloat not just the __cfstring section but also the binding info. I didn't quantify exactly how much of the binding info could be attributed to them, but it seemed significant.

Ultimately, I think we'll have ICF dedup these CFStrings, but in order to do so we must first dedup the cstrings they point to. Hence this diff.

I'm fine with turning merging off by default for now, until we get it integrated with ICF for a bigger win. And maybe only turn it on together with ICF. How does that sound?

In terms of prioritization, I'd like to keep the implementation of these optimizations simple for now, until we are sure that they are operating correctly. (E.g. as the commit message indicates, I uncovered alignment issues while implementing this, and I'm still not entirely sure this is the best way to handle them.) I think parallelization can wait till we're more certain that the output works...

try to fix test on linux

Harbormaster completed remote builds in B108058: Diff 350397. · Jun 7 2021, 1:45 PM

LGTM

lld/MachO/InputSection.h
101	Why are we truncating 64-bit hashes to 32 bits? Because the low-order 32 bits are sufficient, and it's more important that `StringPiece` be 16 bytes vs. 24 bytes?
lld/MachO/SyntheticSections.cpp
1072–1073	The extremity of the target-dependent difference in alignment requirement is surprising, and worthy of a comment.
lld/test/MachO/cstring-merging.s
55–66	Is there value in testing ... Strings of length other than 3? Zero length? Non-null terminated? Prefix matches? (e.g. "foo" and "fool", or "bar" and "barf")

This revision is now accepted and ready to land. · Jun 7 2021, 3:34 PM

alexander-shaposhnikov added a subscriber: alexander-shaposhnikov. · Jun 7 2021, 3:48 PM

alexander-shaposhnikov added inline comments.

lld/MachO/InputSection.h
100	would be good to add comments for these fields (inSecOff, outSecOff)

alexander-shaposhnikov added inline comments. · Jun 7 2021, 4:03 PM

lld/MachO/InputSection.h
75	explicit
123	khm, wouldn't const StringPiece &getStringPiece(uint64_t offset) const be a cleaner interface ?
141	does it need to be `public` ?

int3 marked an inline comment as done. · Jun 7 2021, 7:19 PM

int3 added inline comments.

lld/MachO/InputSection.h
101	This was copied from LLD-ELF's implementation, and yeah the motivation is to reduce the memory cost. I'll copy over the comment too...
141	`CStringSection::finalize()` needs it to be public
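
The memory argument behind the 32-bit hash can be sketched with two hypothetical layouts. Only the field names inSecOff and outSecOff come from the review comments; the struct names, field types and ordering below are assumptions for illustration, and the sizes hold on common 64-bit ABIs where `uint64_t` is 8-byte aligned.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical layout with a truncated 32-bit hash: the two 32-bit fields
// pack into one 8-byte slot, so the struct occupies 16 bytes.
struct PieceHash32 {
  uint32_t inSecOff;
  uint32_t hash;
  uint64_t outSecOff;
};

// Hypothetical layout with a full 64-bit hash: 4 bytes of padding follow
// inSecOff so that `hash` is 8-byte aligned, giving 24 bytes.
struct PieceHash64 {
  uint32_t inSecOff;
  uint64_t hash;
  uint64_t outSecOff;
};

static_assert(sizeof(PieceHash32) == 16, "expected 16 bytes on 64-bit ABIs");
static_assert(sizeof(PieceHash64) == 24, "expected 24 bytes on 64-bit ABIs");

// Keep only the low-order 32 bits of a 64-bit hash.
inline uint32_t truncateHash(uint64_t fullHash) {
  return static_cast<uint32_t>(fullHash);
}
```

Since a link creates a very large number of pieces, saving 8 bytes per piece adds up, at the cost of a somewhat higher chance that two different strings share the stored hash bits.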

address comments + disable literal dedup by default, per @thakis' suggestion

This revision was landed with ongoing or failed builds. · Jun 7 2021, 8:48 PM

Closed by commit rG04259cde15a9: [lld-macho] Implement cstring deduplication (authored by int3).

This revision was automatically updated to reflect the committed changes.

int3 added a commit: rG04259cde15a9: [lld-macho] Implement cstring deduplication.

Harbormaster completed remote builds in B108125: Diff 350483. · Jun 7 2021, 9:19 PM

int3 added inline comments. · Jun 7 2021, 9:29 PM

lld/MachO/SyntheticSections.cpp
1072–1073	Good point. I've copied the relevant bits of the commit message.
lld/test/MachO/cstring-merging.s
55–66	yeah I got lazy here... those are good suggestions. I don't think prefix matches are necessary since we are no longer doing tail merging, but the rest seem useful.

int3 mentioned this in D104159: [not for review][lld-macho] Simple cstring literal implementation. · Jun 11 2021, 4:33 PM

Revision Contents

Path

Size

lld/

MachO/

CMakeLists.txt

2 lines

	ConcatOutputSection.h
	MergedOutputSection.h

20 lines

	ConcatOutputSection.cpp
	MergedOutputSection.cpp

28 lines

1 line

3 lines

4 lines

75 lines

87 lines

54 lines

MergedOutputSection.h

MergedOutputSection.cpp

1 line

4 lines

3 lines

5 lines

6 lines

24 lines

SyntheticSections.cpp

50 lines

UnwindInfoSection.h

6 lines

UnwindInfoSection.cpp

8 lines

Writer.cpp

63 lines

test/

MachO/

cstring-merging.s

96 lines

invalid/

cstring-merging.s

31 lines

reserved-section-name.s

7 lines

load-command-sequence.s

4 lines

section-order.s

35 lines

subsections-section-relocs.s

13 lines

weak-binding.s

5 lines

x86-64-relocs.s

50 lines

Diff 347173

lld/MachO/CMakeLists.txt

	set(LLVM_TARGET_DEFINITIONS Options.td)			set(LLVM_TARGET_DEFINITIONS Options.td)
	tablegen(LLVM Options.inc -gen-opt-parser-defs)			tablegen(LLVM Options.inc -gen-opt-parser-defs)
	add_public_tablegen_target(MachOOptionsTableGen)			add_public_tablegen_target(MachOOptionsTableGen)

	include_directories(${LLVM_MAIN_SRC_DIR}/../libunwind/include)			include_directories(${LLVM_MAIN_SRC_DIR}/../libunwind/include)

	add_lld_library(lldMachO2			add_lld_library(lldMachO2
	Arch/ARM.cpp			Arch/ARM.cpp
	Arch/ARM64.cpp			Arch/ARM64.cpp
	Arch/ARM64Common.cpp			Arch/ARM64Common.cpp
	Arch/ARM64_32.cpp			Arch/ARM64_32.cpp
	Arch/X86_64.cpp			Arch/X86_64.cpp
	UnwindInfoSection.cpp			UnwindInfoSection.cpp
				ConcatOutputSection.cpp
	Driver.cpp			Driver.cpp
	DriverUtils.cpp			DriverUtils.cpp
	Dwarf.cpp			Dwarf.cpp
	ExportTrie.cpp			ExportTrie.cpp
	InputFiles.cpp			InputFiles.cpp
	InputSection.cpp			InputSection.cpp
	LTO.cpp			LTO.cpp
	MergedOutputSection.cpp
	ObjC.cpp			ObjC.cpp
	OutputSection.cpp			OutputSection.cpp
	OutputSegment.cpp			OutputSegment.cpp
	Relocations.cpp			Relocations.cpp
	SymbolTable.cpp			SymbolTable.cpp
	Symbols.cpp			Symbols.cpp
	SyntheticSections.cpp			SyntheticSections.cpp
	Target.cpp			Target.cpp
	Show All 26 Lines

lld/MachO/ConcatOutputSection.h

This file was moved from lld/MachO/MergedOutputSection.h.

	//===- OutputSection.h ------------------------------------------- C++ --===//			//===- ConcatOutputSection.h ------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLD_MACHO_MERGED_OUTPUT_SECTION_H			#ifndef LLD_MACHO_MERGED_OUTPUT_SECTION_H
	Show All 9 Lines
	namespace macho {			namespace macho {

	class Defined;			class Defined;

	// Linking multiple files will inevitably mean resolving sections in different			// Linking multiple files will inevitably mean resolving sections in different
	// files that are labeled with the same segment and section name. This class			// files that are labeled with the same segment and section name. This class
	// contains all such sections and writes the data from each section sequentially			// contains all such sections and writes the data from each section sequentially
	// in the final binary.			// in the final binary.
	class MergedOutputSection : public OutputSection {			class ConcatOutputSection : public OutputSection {
	public:			public:
	MergedOutputSection(StringRef name) : OutputSection(MergedKind, name) {}			ConcatOutputSection(StringRef name) : OutputSection(MergedKind, name) {}

	const InputSection *firstSection() const { return inputs.front(); }			const ConcatInputSection *firstSection() const { return inputs.front(); }
	const InputSection *lastSection() const { return inputs.back(); }			const ConcatInputSection *lastSection() const { return inputs.back(); }

	// These accessors will only be valid after finalizing the section			// These accessors will only be valid after finalizing the section
	uint64_t getSize() const override { return size; }			uint64_t getSize() const override { return size; }
	uint64_t getFileSize() const override { return fileSize; }			uint64_t getFileSize() const override { return fileSize; }

	void mergeInput(InputSection *input);			void addInput(ConcatInputSection *input);
	void finalize() override;			void finalize() override;
	bool needsThunks() const;			bool needsThunks() const;
	uint64_t estimateStubsInRangeVA(size_t callIdx) const;			uint64_t estimateStubsInRangeVA(size_t callIdx) const;

	void writeTo(uint8_t *buf) const override;			void writeTo(uint8_t *buf) const override;

	std::vector<InputSection *> inputs;			std::vector<ConcatInputSection *> inputs;
	std::vector<InputSection *> thunks;			std::vector<ConcatInputSection *> thunks;

	static bool classof(const OutputSection *sec) {			static bool classof(const OutputSection *sec) {
	return sec->kind() == MergedKind;			return sec->kind() == MergedKind;
	}			}

	private:			private:
	void mergeFlags(InputSection *input);			void mergeFlags(InputSection *input);

	size_t size = 0;			size_t size = 0;
	uint64_t fileSize = 0;			uint64_t fileSize = 0;
	};			};

	// We maintain one ThunkInfo per real function.			// We maintain one ThunkInfo per real function.
	//			//
	// The "active thunk" is represented by the sym/isec pair that			// The "active thunk" is represented by the sym/isec pair that
	// turns-over during finalize(): as the call-site address advances,			// turns-over during finalize(): as the call-site address advances,
	// the active thunk goes out of branch-range, and we create a new			// the active thunk goes out of branch-range, and we create a new
	// thunk to take its place.			// thunk to take its place.
	//			//
	// The remaining members -- bools and counters -- apply to the			// The remaining members -- bools and counters -- apply to the
	// collection of thunks associated with the real function.			// collection of thunks associated with the real function.

	struct ThunkInfo {			struct ThunkInfo {
	// These denote the active thunk:			// These denote the active thunk:
	Defined *sym = nullptr; // private-extern symbol for active thunk			Defined *sym = nullptr; // private-extern symbol for active thunk
	InputSection *isec = nullptr; // input section for active thunk			ConcatInputSection *isec = nullptr; // input section for active thunk

	// The following values are cumulative across all thunks on this function			// The following values are cumulative across all thunks on this function
	uint32_t callSiteCount = 0; // how many calls to the real function?			uint32_t callSiteCount = 0; // how many calls to the real function?
	uint32_t callSitesUsed = 0; // how many call sites processed so-far?			uint32_t callSitesUsed = 0; // how many call sites processed so-far?
	uint32_t thunkCallCount = 0; // how many call sites went to thunk?			uint32_t thunkCallCount = 0; // how many call sites went to thunk?
	uint8_t sequence = 0; // how many thunks created so-far?			uint8_t sequence = 0; // how many thunks created so-far?
	};			};

	extern llvm::DenseMap<Symbol *, ThunkInfo> thunkMap;			extern llvm::DenseMap<Symbol *, ThunkInfo> thunkMap;

	} // namespace macho			} // namespace macho
	} // namespace lld			} // namespace lld

	#endif			#endif

lld/MachO/ConcatOutputSection.cpp

This file was moved from lld/MachO/MergedOutputSection.cpp.

//===- OutputSection.cpp --------------------------------------------------===//		//===- ConcatOutputSection.cpp --------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "MergedOutputSection.h"		#include "ConcatOutputSection.h"
#include "Config.h"		#include "Config.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "SymbolTable.h"		#include "SymbolTable.h"
#include "Symbols.h"		#include "Symbols.h"
#include "SyntheticSections.h"		#include "SyntheticSections.h"
#include "Target.h"		#include "Target.h"
#include "lld/Common/ErrorHandler.h"		#include "lld/Common/ErrorHandler.h"
#include "lld/Common/Memory.h"		#include "lld/Common/Memory.h"
#include "llvm/BinaryFormat/MachO.h"		#include "llvm/BinaryFormat/MachO.h"
#include "llvm/Support/ScopedPrinter.h"		#include "llvm/Support/ScopedPrinter.h"

#include <algorithm>		#include <algorithm>

using namespace llvm;		using namespace llvm;
using namespace llvm::MachO;		using namespace llvm::MachO;
using namespace lld;		using namespace lld;
using namespace lld::macho;		using namespace lld::macho;

void MergedOutputSection::mergeInput(InputSection *input) {		void ConcatOutputSection::addInput(ConcatInputSection *input) {
if (inputs.empty()) {		if (inputs.empty()) {
align = input->align;		align = input->align;
flags = input->flags;		flags = input->flags;
} else {		} else {
align = std::max(align, input->align);		align = std::max(align, input->align);
mergeFlags(input);		mergeFlags(input);
}		}
inputs.push_back(input);		inputs.push_back(input);
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines

DenseMap<Symbol *, ThunkInfo> lld::macho::thunkMap;		DenseMap<Symbol *, ThunkInfo> lld::macho::thunkMap;

// Determine whether we need thunks, which depends on the target arch -- RISC		// Determine whether we need thunks, which depends on the target arch -- RISC
// (i.e., ARM) generally does because it has limited-range branch/call		// (i.e., ARM) generally does because it has limited-range branch/call
// instructions, whereas CISC (i.e., x86) generally doesn't. RISC only needs		// instructions, whereas CISC (i.e., x86) generally doesn't. RISC only needs
// thunks for programs so large that branch source & destination addresses		// thunks for programs so large that branch source & destination addresses
// might differ more than the range of branch instruction(s).		// might differ more than the range of branch instruction(s).
bool MergedOutputSection::needsThunks() const {		bool ConcatOutputSection::needsThunks() const {
if (!target->usesThunks())		if (!target->usesThunks())
return false;		return false;
uint64_t isecAddr = addr;		uint64_t isecAddr = addr;
for (InputSection *isec : inputs)		for (InputSection *isec : inputs)
isecAddr = alignTo(isecAddr, isec->align) + isec->getSize();		isecAddr = alignTo(isecAddr, isec->align) + isec->getSize();
if (isecAddr - addr + in.stubs->getSize() <= target->branchRange)		if (isecAddr - addr + in.stubs->getSize() <= target->branchRange)
return false;		return false;
// Yes, this program is large enough to need thunks.		// Yes, this program is large enough to need thunks.
for (InputSection *isec : inputs) {		for (InputSection *isec : inputs) {
for (Reloc &r : isec->relocs) {		for (Reloc &r : isec->relocs) {
if (!target->hasAttr(r.type, RelocAttrBits::BRANCH))		if (!target->hasAttr(r.type, RelocAttrBits::BRANCH))
continue;		continue;
auto *sym = r.referent.get<Symbol *>();		auto *sym = r.referent.get<Symbol *>();
// Pre-populate the thunkMap and memoize call site counts for every		// Pre-populate the thunkMap and memoize call site counts for every
// InputSection and ThunkInfo. We do this for the benefit of		// InputSection and ThunkInfo. We do this for the benefit of
// MergedOutputSection::estimateStubsInRangeVA()		// ConcatOutputSection::estimateStubsInRangeVA()
ThunkInfo &thunkInfo = thunkMap[sym];		ThunkInfo &thunkInfo = thunkMap[sym];
// Knowing ThunkInfo call site count will help us know whether or not we		// Knowing ThunkInfo call site count will help us know whether or not we
// might need to create more for this referent at the time we are		// might need to create more for this referent at the time we are
// estimating distance to __stubs in .		// estimating distance to __stubs in .
++thunkInfo.callSiteCount;		++thunkInfo.callSiteCount;
// Knowing InputSection call site count will help us avoid work on those		// Knowing InputSection call site count will help us avoid work on those
// that have no BRANCH relocs.		// that have no BRANCH relocs.
++isec->callSiteCount;		++isec->callSiteCount;
}		}
}		}
return true;		return true;
}		}

// Since __stubs is placed after __text, we must estimate the address		// Since __stubs is placed after __text, we must estimate the address
// beyond which stubs are within range of a simple forward branch.		// beyond which stubs are within range of a simple forward branch.
uint64_t MergedOutputSection::estimateStubsInRangeVA(size_t callIdx) const {		uint64_t ConcatOutputSection::estimateStubsInRangeVA(size_t callIdx) const {
uint64_t branchRange = target->branchRange;		uint64_t branchRange = target->branchRange;
size_t endIdx = inputs.size();		size_t endIdx = inputs.size();
InputSection *isec = inputs[callIdx];		ConcatInputSection *isec = inputs[callIdx];
uint64_t isecVA = isec->getVA();		uint64_t isecVA = isec->getVA();
// Tally the non-stub functions which still have call sites		// Tally the non-stub functions which still have call sites
// remaining to process, which yields the maximum number		// remaining to process, which yields the maximum number
// of thunks we might yet place.		// of thunks we might yet place.
size_t maxPotentialThunks = 0;		size_t maxPotentialThunks = 0;
for (auto &tp : thunkMap) {		for (auto &tp : thunkMap) {
ThunkInfo &ti = tp.second;		ThunkInfo &ti = tp.second;
maxPotentialThunks +=		maxPotentialThunks +=
Show All 14 Lines	log("thunks = " + std::to_string(thunkMap.size()) +
", stubs = " + std::to_string(in.stubs->getSize()) + ", isecVA = " +		", stubs = " + std::to_string(in.stubs->getSize()) + ", isecVA = " +
to_hexString(isecVA) + ", threshold = " + to_hexString(stubsInRangeVA) +		to_hexString(isecVA) + ", threshold = " + to_hexString(stubsInRangeVA) +
", isecEnd = " + to_hexString(isecEnd) +		", isecEnd = " + to_hexString(isecEnd) +
", tail = " + to_hexString(isecEnd - isecVA) +		", tail = " + to_hexString(isecEnd - isecVA) +
", slop = " + to_hexString(branchRange - (isecEnd - isecVA)));		", slop = " + to_hexString(branchRange - (isecEnd - isecVA)));
return stubsInRangeVA;		return stubsInRangeVA;
}		}

void MergedOutputSection::finalize() {		void ConcatOutputSection::finalize() {
uint64_t isecAddr = addr;		uint64_t isecAddr = addr;
uint64_t isecFileOff = fileOff;		uint64_t isecFileOff = fileOff;
auto finalizeOne = [&](InputSection *isec) {		auto finalizeOne = [&](ConcatInputSection *isec) {
isecAddr = alignTo(isecAddr, isec->align);		isecAddr = alignTo(isecAddr, isec->align);
isecFileOff = alignTo(isecFileOff, isec->align);		isecFileOff = alignTo(isecFileOff, isec->align);
isec->outSecOff = isecAddr - addr;		isec->outSecOff = isecAddr - addr;
isec->outSecFileOff = isecFileOff - fileOff;		isec->outSecFileOff = isecFileOff - fileOff;
isec->isFinal = true;		isec->isFinal = true;
isecAddr += isec->getSize();		isecAddr += isec->getSize();
isecFileOff += isec->getFileSize();		isecFileOff += isec->getFileSize();
};		};

if (!needsThunks()) {		if (!needsThunks()) {
for (InputSection *isec : inputs)		for (ConcatInputSection *isec : inputs)
finalizeOne(isec);		finalizeOne(isec);
size = isecAddr - addr;		size = isecAddr - addr;
fileSize = isecFileOff - fileOff;		fileSize = isecFileOff - fileOff;
return;		return;
}		}

uint64_t branchRange = target->branchRange;		uint64_t branchRange = target->branchRange;
uint64_t stubsInRangeVA = TargetInfo::outOfRangeVA;		uint64_t stubsInRangeVA = TargetInfo::outOfRangeVA;
size_t thunkSize = target->thunkSize;		size_t thunkSize = target->thunkSize;
size_t relocCount = 0;		size_t relocCount = 0;
size_t callSiteCount = 0;		size_t callSiteCount = 0;
size_t thunkCallCount = 0;		size_t thunkCallCount = 0;
size_t thunkCount = 0;		size_t thunkCount = 0;

// inputs[finalIdx] is for finalization (address-assignment)		// inputs[finalIdx] is for finalization (address-assignment)
size_t finalIdx = 0;		size_t finalIdx = 0;
// Kick-off by ensuring that the first input section has an address		// Kick-off by ensuring that the first input section has an address
for (size_t callIdx = 0, endIdx = inputs.size(); callIdx < endIdx;		for (size_t callIdx = 0, endIdx = inputs.size(); callIdx < endIdx;
++callIdx) {		++callIdx) {
if (finalIdx == callIdx)		if (finalIdx == callIdx)
finalizeOne(inputs[finalIdx++]);		finalizeOne(inputs[finalIdx++]);
InputSection *isec = inputs[callIdx];		ConcatInputSection *isec = inputs[callIdx];
assert(isec->isFinal);		assert(isec->isFinal);
uint64_t isecVA = isec->getVA();		uint64_t isecVA = isec->getVA();
// Assign addresses up-to the forward branch-range limit		// Assign addresses up-to the forward branch-range limit
while (finalIdx < endIdx &&		while (finalIdx < endIdx &&
isecAddr + inputs[finalIdx]->getSize() < isecVA + branchRange)		isecAddr + inputs[finalIdx]->getSize() < isecVA + branchRange)
finalizeOne(inputs[finalIdx++]);		finalizeOne(inputs[finalIdx++]);
if (isec->callSiteCount == 0)		if (isec->callSiteCount == 0)
continue;		continue;
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	for (Reloc &r : reverse(relocs)) {
// isecAddr and the distance between subsequent call sites is		// isecAddr and the distance between subsequent call sites is
// smaller than thunkSize, then a new thunk can go out of		// smaller than thunkSize, then a new thunk can go out of
// range. Fix by unfinalizing inputs[finalIdx] to reduce the		// range. Fix by unfinalizing inputs[finalIdx] to reduce the
// distance between callVA and highVA, then shift some thunks		// distance between callVA and highVA, then shift some thunks
// to occupy address-space formerly occupied by the		// to occupy address-space formerly occupied by the
// unfinalized inputs[finalIdx].		// unfinalized inputs[finalIdx].
fatal(Twine(__FUNCTION__) + ": FIXME: thunk range overrun");		fatal(Twine(__FUNCTION__) + ": FIXME: thunk range overrun");
}		}
thunkInfo.isec = make<InputSection>();		thunkInfo.isec = make<ConcatInputSection>();
thunkInfo.isec->name = isec->name;		thunkInfo.isec->name = isec->name;
thunkInfo.isec->segname = isec->segname;		thunkInfo.isec->segname = isec->segname;
thunkInfo.isec->parent = this;		thunkInfo.isec->parent = this;
StringRef thunkName = saver.save(funcSym->getName() + ".thunk." +		StringRef thunkName = saver.save(funcSym->getName() + ".thunk." +
std::to_string(thunkInfo.sequence++));		std::to_string(thunkInfo.sequence++));
r.referent = thunkInfo.sym = symtab->addDefined(		r.referent = thunkInfo.sym = symtab->addDefined(
thunkName, /*file=*/nullptr, thunkInfo.isec, /*value=*/0,		thunkName, /*file=*/nullptr, thunkInfo.isec, /*value=*/0,
/*size=*/thunkSize, /*isWeakDef=*/false, /*isPrivateExtern=*/true,		/*size=*/thunkSize, /*isWeakDef=*/false, /*isPrivateExtern=*/true,
Show All 10 Lines	void ConcatOutputSection::finalize() {
log("thunks for " + parent->name + "," + name +		log("thunks for " + parent->name + "," + name +
": funcs = " + std::to_string(thunkMap.size()) +		": funcs = " + std::to_string(thunkMap.size()) +
", relocs = " + std::to_string(relocCount) +		", relocs = " + std::to_string(relocCount) +
", all calls = " + std::to_string(callSiteCount) +		", all calls = " + std::to_string(callSiteCount) +
", thunk calls = " + std::to_string(thunkCallCount) +		", thunk calls = " + std::to_string(thunkCallCount) +
", thunks = " + std::to_string(thunkCount));		", thunks = " + std::to_string(thunkCount));
}		}

void MergedOutputSection::writeTo(uint8_t *buf) const {		void ConcatOutputSection::writeTo(uint8_t *buf) const {
// Merge input sections from thunk & ordinary vectors		// Merge input sections from thunk & ordinary vectors
size_t i = 0, ie = inputs.size();		size_t i = 0, ie = inputs.size();
size_t t = 0, te = thunks.size();		size_t t = 0, te = thunks.size();
while (i < ie || t < te) {		while (i < ie || t < te) {
while (i < ie && (t == te || inputs[i]->getSize() == 0 ||		while (i < ie && (t == te || inputs[i]->getSize() == 0 ||
inputs[i]->outSecOff < thunks[t]->outSecOff)) {		inputs[i]->outSecOff < thunks[t]->outSecOff)) {
inputs[i]->writeTo(buf + inputs[i]->outSecFileOff);		inputs[i]->writeTo(buf + inputs[i]->outSecFileOff);
++i;		++i;
}		}
while (t < te && (i == ie || thunks[t]->outSecOff < inputs[i]->outSecOff)) {		while (t < te && (i == ie || thunks[t]->outSecOff < inputs[i]->outSecOff)) {
thunks[t]->writeTo(buf + thunks[t]->outSecFileOff);		thunks[t]->writeTo(buf + thunks[t]->outSecFileOff);
++t;		++t;
}		}
}		}
}		}

// TODO: this is most likely wrong; reconsider how section flags		// TODO: this is most likely wrong; reconsider how section flags
// are actually merged. The logic presented here was written without		// are actually merged. The logic presented here was written without
// any form of informed research.		// any form of informed research.
void MergedOutputSection::mergeFlags(InputSection *input) {		void ConcatOutputSection::mergeFlags(InputSection *input) {
uint8_t baseType = flags & SECTION_TYPE;		uint8_t baseType = flags & SECTION_TYPE;
uint8_t inputType = input->flags & SECTION_TYPE;		uint8_t inputType = input->flags & SECTION_TYPE;
if (baseType != inputType)		if (baseType != inputType)
error("Cannot merge section " + input->name + " (type=0x" +		error("Cannot merge section " + input->name + " (type=0x" +
to_hexString(inputType) + ") into " + name + " (type=0x" +		to_hexString(inputType) + ") into " + name + " (type=0x" +
to_hexString(baseType) + "): inconsistent types");		to_hexString(baseType) + "): inconsistent types");

constexpr uint32_t strictFlags = S_ATTR_DEBUG | S_ATTR_STRIP_STATIC_SYMS |		constexpr uint32_t strictFlags = S_ATTR_DEBUG | S_ATTR_STRIP_STATIC_SYMS |
Show All 13 Lines
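
The address assignment done by `finalizeOne` in the thunk-free path above can be illustrated in isolation. The following is only a minimal sketch, not lld's code: `SketchSection`, `alignUp`, and `layoutSections` are hypothetical names, and it assumes every alignment is a power of two. It tracks addresses only and leaves out the parallel file-offset bookkeeping.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for an input section; not lld's real class.
struct SketchSection {
  uint64_t size;
  uint64_t align;     // assumed to be a power of two
  uint64_t outSecOff; // filled in by layoutSections()
};

// Round `value` up to the next multiple of `align` (a power of two).
inline uint64_t alignUp(uint64_t value, uint64_t align) {
  return (value + align - 1) & ~(align - 1);
}

// Give each input an offset relative to the output section that starts at
// `addr`, aligning the absolute address first, and return the total size.
inline uint64_t layoutSections(std::vector<SketchSection> &inputs,
                               uint64_t addr) {
  uint64_t cur = addr;
  for (SketchSection &s : inputs) {
    cur = alignUp(cur, s.align);
    s.outSecOff = cur - addr;
    cur += s.size;
  }
  return cur - addr;
}
```

As in the diff, the alignment is applied to the running absolute address rather than to the section-relative offset, so the result depends on where the output section itself starts.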

lld/MachO/Config.h

Show First 20 Lines • Show All 93 Lines • ▼ Show 20 Lines	struct Configuration {
bool searchDylibsFirst = false;		bool searchDylibsFirst = false;
bool saveTemps = false;		bool saveTemps = false;
bool adhocCodesign = false;		bool adhocCodesign = false;
bool emitFunctionStarts = false;		bool emitFunctionStarts = false;
bool emitBitcodeBundle = false;		bool emitBitcodeBundle = false;
bool emitEncryptionInfo = false;		bool emitEncryptionInfo = false;
bool timeTraceEnabled = false;		bool timeTraceEnabled = false;
bool dataConst = false;		bool dataConst = false;
		bool mergeLiterals = true;
uint32_t headerPad;		uint32_t headerPad;
uint32_t dylibCompatibilityVersion = 0;		uint32_t dylibCompatibilityVersion = 0;
uint32_t dylibCurrentVersion = 0;		uint32_t dylibCurrentVersion = 0;
uint32_t timeTraceGranularity = 500;		uint32_t timeTraceGranularity = 500;
std::string progName;		std::string progName;
llvm::StringRef installName;		llvm::StringRef installName;
llvm::StringRef mapFile;		llvm::StringRef mapFile;
llvm::StringRef outputFile;		llvm::StringRef outputFile;
▲ Show 20 Lines • Show All 53 Lines • Show Last 20 Lines

lld/MachO/Driver.cpp

Show First 20 Lines • Show All 499 Lines • ▼ Show 20 Lines
// any CommonSymbols.		// any CommonSymbols.
static void replaceCommonSymbols() {		static void replaceCommonSymbols() {
TimeTraceScope timeScope("Replace common symbols");		TimeTraceScope timeScope("Replace common symbols");
for (Symbol *sym : symtab->getSymbols()) {		for (Symbol *sym : symtab->getSymbols()) {
auto *common = dyn_cast<CommonSymbol>(sym);		auto *common = dyn_cast<CommonSymbol>(sym);
if (common == nullptr)		if (common == nullptr)
continue;		continue;

auto *isec = make<InputSection>();		auto *isec = make<ConcatInputSection>();
isec->file = common->getFile();		isec->file = common->getFile();
isec->name = section_names::common;		isec->name = section_names::common;
isec->segname = segment_names::data;		isec->segname = segment_names::data;
isec->align = common->align;		isec->align = common->align;
// Casting to size_t will truncate large values on 32-bit architectures,		// Casting to size_t will truncate large values on 32-bit architectures,
// but it's not really worth supporting the linking of 64-bit programs on		// but it's not really worth supporting the linking of 64-bit programs on
// 32-bit archs.		// 32-bit archs.
isec->data = {nullptr, static_cast<size_t>(common->size)};		isec->data = {nullptr, static_cast<size_t>(common->size)};
▲ Show 20 Lines • Show All 463 Lines • ▼ Show 20 Lines	config->ltoNewPassManager =
LLVM_ENABLE_NEW_PASS_MANAGER);		LLVM_ENABLE_NEW_PASS_MANAGER);
config->runtimePaths = args::getStrings(args, OPT_rpath);		config->runtimePaths = args::getStrings(args, OPT_rpath);
config->allLoad = args.hasArg(OPT_all_load);		config->allLoad = args.hasArg(OPT_all_load);
config->forceLoadObjC = args.hasArg(OPT_ObjC);		config->forceLoadObjC = args.hasArg(OPT_ObjC);
config->demangle = args.hasArg(OPT_demangle);		config->demangle = args.hasArg(OPT_demangle);
config->implicitDylibs = !args.hasArg(OPT_no_implicit_dylibs);		config->implicitDylibs = !args.hasArg(OPT_no_implicit_dylibs);
config->emitFunctionStarts = !args.hasArg(OPT_no_function_starts);		config->emitFunctionStarts = !args.hasArg(OPT_no_function_starts);
config->emitBitcodeBundle = args.hasArg(OPT_bitcode_bundle);		config->emitBitcodeBundle = args.hasArg(OPT_bitcode_bundle);
		config->mergeLiterals = !args.hasArg(OPT_no_literal_merge);

std::array<PlatformKind, 3> encryptablePlatforms{		std::array<PlatformKind, 3> encryptablePlatforms{
PlatformKind::iOS, PlatformKind::watchOS, PlatformKind::tvOS};		PlatformKind::iOS, PlatformKind::watchOS, PlatformKind::tvOS};
config->emitEncryptionInfo =		config->emitEncryptionInfo =
args.hasFlag(OPT_encryptable, OPT_no_encryption,		args.hasFlag(OPT_encryptable, OPT_no_encryption,
is_contained(encryptablePlatforms, config->platform()));		is_contained(encryptablePlatforms, config->platform()));

#ifndef HAVE_LIBXAR		#ifndef HAVE_LIBXAR
▲ Show 20 Lines • Show All 268 Lines • Show Last 20 Lines

lld/MachO/InputFiles.h

	Show All 33 Lines
	} // namespace MachO			} // namespace MachO
	class TarWriter;			class TarWriter;
	} // namespace llvm			} // namespace llvm

	namespace lld {			namespace lld {
	namespace macho {			namespace macho {

	struct PlatformInfo;			struct PlatformInfo;
	class InputSection;			class ConcatInputSection;
	class Symbol;			class Symbol;
	struct Reloc;			struct Reloc;
	enum class RefState : uint8_t;			enum class RefState : uint8_t;

	// If --reproduce option is given, all input files are written			// If --reproduce option is given, all input files are written
	// to this tar archive.			// to this tar archive.
	extern std::unique_ptr<llvm::TarWriter> tar;			extern std::unique_ptr<llvm::TarWriter> tar;

	▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
	// .o file			// .o file
	class ObjFile : public InputFile {			class ObjFile : public InputFile {
	public:			public:
	ObjFile(MemoryBufferRef mb, uint32_t modTime, StringRef archiveName);			ObjFile(MemoryBufferRef mb, uint32_t modTime, StringRef archiveName);
	static bool classof(const InputFile *f) { return f->kind() == ObjKind; }			static bool classof(const InputFile *f) { return f->kind() == ObjKind; }

	llvm::DWARFUnit *compileUnit = nullptr;			llvm::DWARFUnit *compileUnit = nullptr;
	const uint32_t modTime;			const uint32_t modTime;
	std::vector<InputSection *> debugSections;			std::vector<ConcatInputSection *> debugSections;

	private:			private:
	template <class LP> void parse();			template <class LP> void parse();
	template <class Section> void parseSections(ArrayRef<Section>);			template <class Section> void parseSections(ArrayRef<Section>);
	template <class LP>			template <class LP>
	void parseSymbols(ArrayRef<typename LP::section> sectionHeaders,			void parseSymbols(ArrayRef<typename LP::section> sectionHeaders,
	ArrayRef<typename LP::nlist> nList, const char *strtab,			ArrayRef<typename LP::nlist> nList, const char *strtab,
	bool subsectionsViaSymbols);			bool subsectionsViaSymbols);
	▲ Show 20 Lines • Show All 119 Lines • Show Last 20 Lines

lld/MachO/InputFiles.cpp

Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
#include "ExportTrie.h"		#include "ExportTrie.h"
#include "InputSection.h"		#include "InputSection.h"
#include "MachOStructs.h"		#include "MachOStructs.h"
#include "ObjC.h"		#include "ObjC.h"
#include "OutputSection.h"		#include "OutputSection.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "SymbolTable.h"		#include "SymbolTable.h"
#include "Symbols.h"		#include "Symbols.h"
		#include "SyntheticSections.h"
#include "Target.h"		#include "Target.h"

#include "lld/Common/DWARF.h"		#include "lld/Common/DWARF.h"
#include "lld/Common/ErrorHandler.h"		#include "lld/Common/ErrorHandler.h"
#include "lld/Common/Memory.h"		#include "lld/Common/Memory.h"
#include "lld/Common/Reproduce.h"		#include "lld/Common/Reproduce.h"
#include "llvm/ADT/iterator.h"		#include "llvm/ADT/iterator.h"
#include "llvm/BinaryFormat/MachO.h"		#include "llvm/BinaryFormat/MachO.h"
▲ Show 20 Lines • Show All 172 Lines • ▼ Show 20 Lines	Optional<MemoryBufferRef> macho::readFile(StringRef path) {
error("unable to find matching architecture in " + path);		error("unable to find matching architecture in " + path);
return None;		return None;
}		}

InputFile::InputFile(Kind kind, const InterfaceFile &interface)		InputFile::InputFile(Kind kind, const InterfaceFile &interface)
: id(idCount++), fileKind(kind), name(saver.save(interface.getPath())) {}		: id(idCount++), fileKind(kind), name(saver.save(interface.getPath())) {}

template <class Section>		template <class Section>
void ObjFile::parseSections(ArrayRef<Section> sections) {		static void parseSection(ObjFile *file, const uint8_t *buf, const Section &sec,
subsections.reserve(sections.size());		InputSection *isec) {
auto *buf = reinterpret_cast<const uint8_t *>(mb.getBufferStart());		isec->file = file;

for (const Section &sec : sections) {
InputSection *isec = make<InputSection>();
isec->file = this;
isec->name =		isec->name =
StringRef(sec.sectname, strnlen(sec.sectname, sizeof(sec.sectname)));		StringRef(sec.sectname, strnlen(sec.sectname, sizeof(sec.sectname)));
isec->segname =		isec->segname =
StringRef(sec.segname, strnlen(sec.segname, sizeof(sec.segname)));		StringRef(sec.segname, strnlen(sec.segname, sizeof(sec.segname)));
isec->data = {isZeroFill(sec.flags) ? nullptr : buf + sec.offset,		isec->data = {isZeroFill(sec.flags) ? nullptr : buf + sec.offset,
static_cast<size_t>(sec.size)};		static_cast<size_t>(sec.size)};
if (sec.align >= 32)		if (sec.align >= 32)
error("alignment " + std::to_string(sec.align) + " of section " +		error("alignment " + std::to_string(sec.align) + " of section " +
isec->name + " is too large");		isec->name + " is too large");
else		else
isec->align = 1 << sec.align;		isec->align = 1 << sec.align;
isec->flags = sec.flags;		isec->flags = sec.flags;
		}

		template <class Section>
		void ObjFile::parseSections(ArrayRef<Section> sections) {
		subsections.reserve(sections.size());
		auto *buf = reinterpret_cast<const uint8_t *>(mb.getBufferStart());

		for (const Section &sec : sections) {
		if (config->mergeLiterals && sectionType(sec.flags) == S_CSTRING_LITERALS) {
		if (sec.nreloc)
		fatal(toString(this) +
		" contains relocations in a cstring_literals section, so LLD "
		"cannot do string merging. Try re-running with "
		"--no-literal-merge.");

		auto *isec = make<CStringInputSection>();
		parseSection(this, buf, sec, isec);
		isec->splitIntoPieces(); // FIXME: parallelize this?
		subsections.push_back({{0, isec}});
		} else {
		auto *isec = make<ConcatInputSection>();
		parseSection(this, buf, sec, isec);
if (!(isDebugSection(isec->flags) &&		if (!(isDebugSection(isec->flags) &&
isec->segname == segment_names::dwarf)) {		isec->segname == segment_names::dwarf)) {
subsections.push_back({{0, isec}});		subsections.push_back({{0, isec}});
} else {		} else {
// Instead of emitting DWARF sections, we emit STABS symbols to the		// Instead of emitting DWARF sections, we emit STABS symbols to the
// object files that contain them. We filter them out early to avoid		// object files that contain them. We filter them out early to avoid
// parsing their relocations unnecessarily. But we must still push an		// parsing their relocations unnecessarily. But we must still push an
// empty map to ensure the indices line up for the remaining sections.		// empty map to ensure the indices line up for the remaining sections.
subsections.push_back({});		subsections.push_back({});
debugSections.push_back(isec);		debugSections.push_back(isec);
}		}
}		}
}		}
		}

// Find the subsection corresponding to the greatest section offset that is <=		// Find the subsection corresponding to the greatest section offset that is <=
// that of the given offset.		// that of the given offset.
//		//
// offset: an offset relative to the start of the original InputSection (before		// offset: an offset relative to the start of the original InputSection (before
// any subsection splitting has occurred). It will be updated to represent the		// any subsection splitting has occurred). It will be updated to represent the
// same location as an offset relative to the start of the containing		// same location as an offset relative to the start of the containing
// subsection.		// subsection.
▲ Show 20 Lines • Show All 311 Lines • ▼ Show 20 Lines	for (size_t j = 0; j < symbolIndices.size(); ++j) {
InputSection *isec = subsecEntry.isec;		InputSection *isec = subsecEntry.isec;

uint64_t subsecAddr = sectionAddr + subsecEntry.offset;		uint64_t subsecAddr = sectionAddr + subsecEntry.offset;
uint64_t symbolOffset = sym.n_value - subsecAddr;		uint64_t symbolOffset = sym.n_value - subsecAddr;
uint64_t symbolSize =		uint64_t symbolSize =
j + 1 < symbolIndices.size()		j + 1 < symbolIndices.size()
? nList[symbolIndices[j + 1]].n_value - sym.n_value		? nList[symbolIndices[j + 1]].n_value - sym.n_value
: isec->data.size() - symbolOffset;		: isec->data.size() - symbolOffset;
// There are 3 cases where we do not need to create a new subsection:		// There are 4 cases where we do not need to create a new subsection:
// 1. If the input file does not use subsections-via-symbols.		// 1. If the input file does not use subsections-via-symbols.
// 2. Multiple symbols at the same address only induce one subsection.		// 2. Multiple symbols at the same address only induce one subsection.
// (The symbolOffset == 0 check covers both this case as well as		// (The symbolOffset == 0 check covers both this case as well as
// the first loop iteration.)		// the first loop iteration.)
// 3. Alternative entry points do not induce new subsections.		// 3. Alternative entry points do not induce new subsections.
		// 4. If we have a literal section (e.g. __cstring and __literal4).
if (!subsectionsViaSymbols || symbolOffset == 0 ||		if (!subsectionsViaSymbols || symbolOffset == 0 ||
sym.n_desc & N_ALT_ENTRY) {		sym.n_desc & N_ALT_ENTRY || !isa<ConcatInputSection>(isec)) {
symbols[symIndex] =		symbols[symIndex] =
createDefined(sym, name, isec, symbolOffset, symbolSize);		createDefined(sym, name, isec, symbolOffset, symbolSize);
continue;		continue;
}		}
		auto *concatIsec = cast<ConcatInputSection>(isec);

auto *nextIsec = make<InputSection>(*isec);		auto *nextIsec = make<ConcatInputSection>(*concatIsec);
nextIsec->data = isec->data.slice(symbolOffset);		nextIsec->data = isec->data.slice(symbolOffset);
nextIsec->numRefs = 0;		nextIsec->numRefs = 0;
nextIsec->canOmitFromOutput = false;		nextIsec->canOmitFromOutput = false;
isec->data = isec->data.slice(0, symbolOffset);		isec->data = isec->data.slice(0, symbolOffset);

// By construction, the symbol will be at offset zero in the new		// By construction, the symbol will be at offset zero in the new
// subsection.		// subsection.
symbols[symIndex] =		symbols[symIndex] =
createDefined(sym, name, nextIsec, /*value=*/0, symbolSize);		createDefined(sym, name, nextIsec, /*value=*/0, symbolSize);
// TODO: ld64 appears to preserve the original alignment as well as each		// TODO: ld64 appears to preserve the original alignment as well as each
// subsection's offset from the last aligned address. We should consider		// subsection's offset from the last aligned address. We should consider
// emulating that behavior.		// emulating that behavior.
nextIsec->align = MinAlign(sectionAlign, sym.n_value);		nextIsec->align = MinAlign(sectionAlign, sym.n_value);
subsecMap.push_back({sym.n_value - sectionAddr, nextIsec});		subsecMap.push_back({sym.n_value - sectionAddr, nextIsec});
subsecEntry = subsecMap.back();		subsecEntry = subsecMap.back();
}		}
}		}
}		}

OpaqueFile::OpaqueFile(MemoryBufferRef mb, StringRef segName,		OpaqueFile::OpaqueFile(MemoryBufferRef mb, StringRef segName,
StringRef sectName)		StringRef sectName)
: InputFile(OpaqueKind, mb) {		: InputFile(OpaqueKind, mb) {
InputSection *isec = make<InputSection>();		ConcatInputSection *isec = make<ConcatInputSection>();
isec->file = this;		isec->file = this;
isec->name = sectName.take_front(16);		isec->name = sectName.take_front(16);
isec->segname = segName.take_front(16);		isec->segname = segName.take_front(16);
const auto *buf = reinterpret_cast<const uint8_t *>(mb.getBufferStart());		const auto *buf = reinterpret_cast<const uint8_t *>(mb.getBufferStart());
isec->data = {buf, mb.getBufferSize()};		isec->data = {buf, mb.getBufferSize()};
subsections.push_back({{0, isec}});		subsections.push_back({{0, isec}});
}		}

▲ Show 20 Lines • Show All 389 Lines • Show Last 20 Lines
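
The new fourth case in the symbol-splitting loop relies on `isa<ConcatInputSection>(isec)`, which in turn depends on each section class exposing a kind tag and a `classof` predicate. Below is a rough, self-contained sketch of that pattern without LLVM's casting headers. All names (`SketchSection`, `SketchConcat`, `SketchCString`, `isaSketch`, `maySplitAtSymbol`) are hypothetical and only stand in for the real classes.

```cpp
#include <cassert>

// Hypothetical miniature of a kind-tagged class hierarchy.
class SketchSection {
public:
  enum Kind { ConcatKind, CStringLiteralKind };
  Kind kind() const { return sectionKind; }
  virtual ~SketchSection() = default;

protected:
  explicit SketchSection(Kind k) : sectionKind(k) {}

private:
  Kind sectionKind;
};

class SketchConcat : public SketchSection {
public:
  SketchConcat() : SketchSection(ConcatKind) {}
  static bool classof(const SketchSection *s) {
    return s->kind() == ConcatKind;
  }
};

class SketchCString : public SketchSection {
public:
  SketchCString() : SketchSection(CStringLiteralKind) {}
  static bool classof(const SketchSection *s) {
    return s->kind() == CStringLiteralKind;
  }
};

// Hand-rolled stand-in for an isa<> query: defer to the target's classof.
template <class To> bool isaSketch(const SketchSection *s) {
  return To::classof(s);
}

// Mirrors the idea of case 4: only concat sections are split at symbols.
inline bool maySplitAtSymbol(const SketchSection *s) {
  return isaSketch<SketchConcat>(s);
}
```

The tag check is a plain integer comparison, so the query needs no `dynamic_cast` or compiler RTTI.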

lld/MachO/InputSection.h

	//===- InputSection.h -------------------------------------------*- C++ -*-===//			//===- InputSection.h -------------------------------------------*- C++ -*-===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLD_MACHO_INPUT_SECTION_H			#ifndef LLD_MACHO_INPUT_SECTION_H
	#define LLD_MACHO_INPUT_SECTION_H			#define LLD_MACHO_INPUT_SECTION_H

	#include "Relocations.h"			#include "Relocations.h"

	#include "lld/Common/LLVM.h"			#include "lld/Common/LLVM.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
				#include "llvm/ADT/CachedHashString.h"
	#include "llvm/BinaryFormat/MachO.h"			#include "llvm/BinaryFormat/MachO.h"

	namespace lld {			namespace lld {
	namespace macho {			namespace macho {

	class InputFile;			class InputFile;
	class OutputSection;			class OutputSection;

	class InputSection {			class InputSection {
	public:			public:
				enum Kind {
				ConcatKind,
				CStringLiteralKind,
				};

				Kind kind() const { return sectionKind; }
	virtual ~InputSection() = default;			virtual ~InputSection() = default;
	virtual uint64_t getSize() const { return data.size(); }			virtual uint64_t getSize() const { return data.size(); }
	uint64_t getFileSize() const;			uint64_t getFileSize() const;
	uint64_t getFileOffset() const;			// Translates \p off -- an offset relative to this InputSection -- into an
	uint64_t getVA() const;			// offset from the beginning of its parent OutputSection.
				virtual uint64_t getOffset(uint64_t off) const = 0;
				// The offset from the beginning of the file.
				virtual uint64_t getFileOffset(uint64_t off) const = 0;
				uint64_t getVA(uint64_t off) const;

	void writeTo(uint8_t *buf);			void writeTo(uint8_t *buf);

	InputFile *file = nullptr;			InputFile *file = nullptr;
	StringRef name;			StringRef name;
	StringRef segname;			StringRef segname;

	OutputSection *parent = nullptr;			OutputSection *parent = nullptr;
	uint64_t outSecOff = 0;
	uint64_t outSecFileOff = 0;

	uint32_t align = 1;			uint32_t align = 1;
	uint32_t flags = 0;			uint32_t flags = 0;
	uint32_t callSiteCount = 0;			uint32_t callSiteCount = 0;
	bool isFinal = false; // is address assigned?			bool isFinal = false; // is address assigned?

	// How many symbols refer to this InputSection.			// How many symbols refer to this InputSection.
	uint32_t numRefs = 0;			uint32_t numRefs = 0;

	// True if this InputSection could not be written to the output file.			// True if this InputSection could not be written to the output file.
	// With subsections_via_symbols, most symbols have their own InputSection,			// With subsections_via_symbols, most symbols have their own InputSection,
	// and for weak symbols (e.g. from inline functions), only the			// and for weak symbols (e.g. from inline functions), only the
	// InputSection from one translation unit will make it to the output,			// InputSection from one translation unit will make it to the output,
	// while all copies in other translation units are coalesced into the			// while all copies in other translation units are coalesced into the
	// first and not copied to the output.			// first and not copied to the output.
	bool canOmitFromOutput = false;			bool canOmitFromOutput = false;

	bool shouldOmitFromOutput() const {			bool shouldOmitFromOutput() const {
	return canOmitFromOutput && numRefs == 0;			return canOmitFromOutput && numRefs == 0;
	}			}

	ArrayRef<uint8_t> data;			ArrayRef<uint8_t> data;
	std::vector<Reloc> relocs;			std::vector<Reloc> relocs;

				protected:
				InputSection(Kind kind) : sectionKind(kind) {}
				alexander-shaposhnikov: explicit

				private:
				Kind sectionKind;
				};

				// ConcatInputSections are combined into (Concat)OutputSections through simple
				// concatenation, in contrast with literal sections which may have their
				// contents merged before output.
				class ConcatInputSection : public InputSection {
				public:
				ConcatInputSection() : InputSection(ConcatKind) {}
				uint64_t getFileOffset(uint64_t off) const override;
				uint64_t getOffset(uint64_t off) const override { return outSecOff + off; }
				uint64_t getVA() const { return InputSection::getVA(0); }

				static bool classof(const InputSection *isec) {
				return isec->kind() == ConcatKind;
				}

				uint64_t outSecOff = 0;
				uint64_t outSecFileOff = 0;
				};

				struct StringPiece {
				uint32_t inSecOff;
				alexander-shaposhnikov: would be good to add comments for these fields (inSecOff, outSecOff)
				uint32_t hash;
				gkm: Why are we truncating 64-bit hashes to 32 bits? Because the low-order 32 bits are sufficient, and it's more important that `StringPiece` be 16 bytes vs. 24 bytes?
				int3 (author): This was copied from LLD-ELF's implementation, and yeah the motivation is to reduce the memory cost. I'll copy over the comment too...
				uint64_t outSecOff;

				StringPiece(uint64_t off, uint32_t hash) : inSecOff(off), hash(hash) {}
				};

				// CStringInputSections are composed of multiple null-terminated string
				// literals, which we represent using StringPieces. These literals can be
				// deduplicated and tail-merged, so translating offsets between the input and
				// output sections is more complicated.
				//
				// NOTE: One significant difference between LLD and ld64 is that we merge all
				// cstring literals, even those referenced directly by non-private symbols.
				// ld64 is more conservative and does not do that. This was mostly done for
				// implementation simplicity; if we find programs that need the more
				// conservative behavior we can certainly implement that.
				class CStringInputSection : public InputSection {
				public:
				CStringInputSection() : InputSection(CStringLiteralKind) {}
				uint64_t getFileOffset(uint64_t off) const override;
				uint64_t getOffset(uint64_t off) const override;
				// Find the StringPiece that contains this offset.
				const StringPiece *getStringPiece(uint64_t off) const;
				alexander-shaposhnikov: khm, wouldn't `const StringPiece &getStringPiece(uint64_t offset) const` be a cleaner interface?
				// Split at each null byte.
				void splitIntoPieces();

				// Returns i'th piece as a CachedHashStringRef. This function is very hot when
				// string merging is enabled, so we want to inline.
				LLVM_ATTRIBUTE_ALWAYS_INLINE
				llvm::CachedHashStringRef getCachedHashStringRef(size_t i) const {
				size_t begin = pieces[i].inSecOff;
				size_t end =
				(pieces.size() - 1 == i) ? data.size() : pieces[i + 1].inSecOff;
				return {toStringRef(data.slice(begin, end - begin)), pieces[i].hash};
				}

				static bool classof(const InputSection *isec) {
				return isec->kind() == CStringLiteralKind;
				}

				std::vector<StringPiece> pieces;
				alexander-shaposhnikov: does it need to be `public`?
				int3 (author): `CStringSection::finalize()` needs it to be public
	};			};

	inline uint8_t sectionType(uint32_t flags) {			inline uint8_t sectionType(uint32_t flags) {
	return flags & llvm::MachO::SECTION_TYPE;			return flags & llvm::MachO::SECTION_TYPE;
	}			}

	inline bool isZeroFill(uint32_t flags) {			inline bool isZeroFill(uint32_t flags) {
	return llvm::MachO::isVirtualSection(sectionType(flags));			return llvm::MachO::isVirtualSection(sectionType(flags));
	Show All 19 Lines
	extern std::vector<InputSection *> inputSections;			extern std::vector<InputSection *> inputSections;

	namespace section_names {			namespace section_names {

	constexpr const char authGot[] = "__auth_got";			constexpr const char authGot[] = "__auth_got";
	constexpr const char authPtr[] = "__auth_ptr";			constexpr const char authPtr[] = "__auth_ptr";
	constexpr const char binding[] = "__binding";			constexpr const char binding[] = "__binding";
	constexpr const char bitcodeBundle[] = "__bundle";			constexpr const char bitcodeBundle[] = "__bundle";
				constexpr const char cString[] = "__cstring";
	constexpr const char cfString[] = "__cfstring";			constexpr const char cfString[] = "__cfstring";
	constexpr const char codeSignature[] = "__code_signature";			constexpr const char codeSignature[] = "__code_signature";
	constexpr const char common[] = "__common";			constexpr const char common[] = "__common";
	constexpr const char compactUnwind[] = "__compact_unwind";			constexpr const char compactUnwind[] = "__compact_unwind";
	constexpr const char data[] = "__data";			constexpr const char data[] = "__data";
	constexpr const char debugAbbrev[] = "__debug_abbrev";			constexpr const char debugAbbrev[] = "__debug_abbrev";
	constexpr const char debugInfo[] = "__debug_info";			constexpr const char debugInfo[] = "__debug_info";
	constexpr const char debugStr[] = "__debug_str";			constexpr const char debugStr[] = "__debug_str";
	▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines
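
The bodies of `splitIntoPieces()` and `getStringPiece()` are not shown in this part of the diff, so the following is only a hedged sketch of how splitting at null bytes and offset lookup could work, based on the declarations and comments above. `SketchPiece`, `fnv1a`, `splitAtNulls`, and `findPiece` are hypothetical names. FNV-1a is used purely as a stand-in hash, since the diff includes `llvm/Support/xxhash.h`. The binary search is an assumption about one reasonable lookup strategy, not a claim about what the patch does.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical piece record with the 4 + 4 + 8 byte layout from the review.
struct SketchPiece {
  uint32_t inSecOff;  // where the string starts in the input section
  uint32_t hash;      // truncated hash of the string contents
  uint64_t outSecOff; // offset in the output section, assigned later
};
static_assert(sizeof(SketchPiece) == 16, "piece should stay compact");

// FNV-1a, a stand-in only; not the hash used by the patch.
inline uint32_t fnv1a(const char *p, size_t n) {
  uint32_t h = 2166136261u;
  for (size_t i = 0; i < n; ++i) {
    h ^= static_cast<unsigned char>(p[i]);
    h *= 16777619u;
  }
  return h;
}

// Split `data` after each null byte, recording where every string starts.
inline std::vector<SketchPiece> splitAtNulls(const std::string &data) {
  std::vector<SketchPiece> pieces;
  size_t off = 0;
  while (off < data.size()) {
    size_t nul = data.find('\0', off);
    size_t end = (nul == std::string::npos) ? data.size() : nul + 1;
    pieces.push_back({static_cast<uint32_t>(off),
                      fnv1a(data.data() + off, end - off), 0});
    off = end;
  }
  return pieces;
}

// Return the last piece whose inSecOff is <= off, or nullptr if there is
// none. No check is made that `off` lies within the section.
inline const SketchPiece *findPiece(const std::vector<SketchPiece> &pieces,
                                    uint64_t off) {
  auto it = std::upper_bound(
      pieces.begin(), pieces.end(), off,
      [](uint64_t o, const SketchPiece &p) { return o < p.inSecOff; });
  if (it == pieces.begin())
    return nullptr;
  return &*(it - 1);
}
```

Because pieces are appended in increasing `inSecOff` order, the vector is already sorted and the lookup costs O(log n) per query. Keeping the hash at 32 bits is what lets the record fit in 16 bytes, which is the trade-off raised in the inline comments.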

lld/MachO/InputSection.cpp

Show All 9 Lines
#include "InputFiles.h"		#include "InputFiles.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "Symbols.h"		#include "Symbols.h"
#include "SyntheticSections.h"		#include "SyntheticSections.h"
#include "Target.h"		#include "Target.h"
#include "Writer.h"		#include "Writer.h"
#include "lld/Common/Memory.h"		#include "lld/Common/Memory.h"
#include "llvm/Support/Endian.h"		#include "llvm/Support/Endian.h"
		#include "llvm/Support/xxhash.h"

using namespace llvm;		using namespace llvm;
using namespace llvm::MachO;		using namespace llvm::MachO;
using namespace llvm::support;		using namespace llvm::support;
using namespace lld;		using namespace lld;
using namespace lld::macho;		using namespace lld::macho;

std::vector<InputSection *> macho::inputSections;		std::vector<InputSection *> macho::inputSections;

uint64_t InputSection::getFileOffset() const {		uint64_t ConcatInputSection::getFileOffset(uint64_t off) const {
return parent->fileOff + outSecFileOff;		return parent->fileOff + outSecFileOff + off;
}		}

uint64_t InputSection::getFileSize() const {		uint64_t InputSection::getFileSize() const {
return isZeroFill(flags) ? 0 : getSize();		return isZeroFill(flags) ? 0 : getSize();
}		}

uint64_t InputSection::getVA() const { return parent->addr + outSecOff; }		uint64_t InputSection::getVA(uint64_t off) const {
		return parent->addr + getOffset(off);
		}

static uint64_t resolveSymbolVA(const Symbol *sym, uint8_t type) {		static uint64_t resolveSymbolVA(const Symbol *sym, uint8_t type) {
const RelocAttrs &relocAttrs = target->getRelocAttrs(type);		const RelocAttrs &relocAttrs = target->getRelocAttrs(type);
if (relocAttrs.hasAttr(RelocAttrBits::BRANCH))		if (relocAttrs.hasAttr(RelocAttrBits::BRANCH))
return sym->resolveBranchVA();		return sym->resolveBranchVA();
else if (relocAttrs.hasAttr(RelocAttrBits::GOT))		else if (relocAttrs.hasAttr(RelocAttrBits::GOT))
return sym->resolveGotVA();		return sym->resolveGotVA();
else if (relocAttrs.hasAttr(RelocAttrBits::TLV))		else if (relocAttrs.hasAttr(RelocAttrBits::TLV))
Show All 13 Lines	for (size_t i = 0; i < relocs.size(); i++) {
const Reloc &r = relocs[i];		const Reloc &r = relocs[i];
uint8_t *loc = buf + r.offset;		uint8_t *loc = buf + r.offset;
uint64_t referentVA = 0;		uint64_t referentVA = 0;
if (target->hasAttr(r.type, RelocAttrBits::SUBTRAHEND)) {		if (target->hasAttr(r.type, RelocAttrBits::SUBTRAHEND)) {
const Symbol *fromSym = r.referent.get<Symbol *>();		const Symbol *fromSym = r.referent.get<Symbol *>();
const Reloc &minuend = relocs[++i];		const Reloc &minuend = relocs[++i];
uint64_t minuendVA;		uint64_t minuendVA;
if (const Symbol *toSym = minuend.referent.dyn_cast<Symbol *>())		if (const Symbol *toSym = minuend.referent.dyn_cast<Symbol *>())
minuendVA = toSym->getVA();		minuendVA = toSym->getVA() + minuend.addend;
else {		else {
auto *referentIsec = minuend.referent.get<InputSection *>();		auto *referentIsec = minuend.referent.get<InputSection *>();
assert(!referentIsec->shouldOmitFromOutput());		assert(!referentIsec->shouldOmitFromOutput());
minuendVA = referentIsec->getVA();		minuendVA = referentIsec->getVA(minuend.addend);
}		}
referentVA = minuendVA - fromSym->getVA() + minuend.addend;		referentVA = minuendVA - fromSym->getVA();
} else if (auto *referentSym = r.referent.dyn_cast<Symbol *>()) {		} else if (auto *referentSym = r.referent.dyn_cast<Symbol *>()) {
if (target->hasAttr(r.type, RelocAttrBits::LOAD) &&		if (target->hasAttr(r.type, RelocAttrBits::LOAD) &&
!referentSym->isInGot())		!referentSym->isInGot())
target->relaxGotLoad(loc, r.type);		target->relaxGotLoad(loc, r.type);
referentVA = resolveSymbolVA(referentSym, r.type);		referentVA = resolveSymbolVA(referentSym, r.type) + r.addend;

if (isThreadLocalVariables(flags)) {		if (isThreadLocalVariables(flags)) {
// References from thread-local variable sections are treated as offsets		// References from thread-local variable sections are treated as offsets
// relative to the start of the thread-local data memory area, which		// relative to the start of the thread-local data memory area, which
// is initialized via copying all the TLV data sections (which are all		// is initialized via copying all the TLV data sections (which are all
// contiguous).		// contiguous).
if (isa<Defined>(referentSym))		if (isa<Defined>(referentSym))
referentVA -= firstTLVDataSection->addr;		referentVA -= firstTLVDataSection->addr;
}		}
} else if (auto *referentIsec = r.referent.dyn_cast<InputSection *>()) {		} else if (auto *referentIsec = r.referent.dyn_cast<InputSection *>()) {
assert(!referentIsec->shouldOmitFromOutput());		assert(!referentIsec->shouldOmitFromOutput());
referentVA = referentIsec->getVA();		referentVA = referentIsec->getVA(r.addend);
		}
		target->relocateOne(loc, r, referentVA, getVA(r.offset));
		}
		}

		void CStringInputSection::splitIntoPieces() {
		size_t off = 0;
		StringRef s = toStringRef(data);
		while (!s.empty()) {
		size_t end = s.find(0);
		if (end == StringRef::npos)
		fatal(toString(this) + ": string is not null terminated");
		size_t size = end + 1;
		pieces.emplace_back(off, xxHash64(s.substr(0, size)));
		s = s.substr(size);
		off += size;
}		}
target->relocateOne(loc, r, referentVA + r.addend, getVA() + r.offset);
}		}

		const StringPiece *CStringInputSection::getStringPiece(uint64_t off) const {
		if (off >= data.size())
		fatal(toString(this) + ": offset is outside the section");

		auto it =
		partition_point(pieces, [=](StringPiece p) { return p.inSecOff <= off; });
		return &it[-1];
		}

		uint64_t CStringInputSection::getFileOffset(uint64_t off) const {
		return parent->fileOff + getOffset(off);
		}

		uint64_t CStringInputSection::getOffset(uint64_t off) const {
		const StringPiece &piece = *getStringPiece(off);
		uint64_t addend = off - piece.inSecOff;
		return piece.outSecOff + addend;
}		}

bool macho::isCodeSection(const InputSection *isec) {		bool macho::isCodeSection(const InputSection *isec) {
uint32_t type = isec->flags & SECTION_TYPE;		uint32_t type = isec->flags & SECTION_TYPE;
if (type != S_REGULAR && type != S_COALESCED)		if (type != S_REGULAR && type != S_COALESCED)
return false;		return false;

uint32_t attr = isec->flags & SECTION_ATTRIBUTES_USR;		uint32_t attr = isec->flags & SECTION_ATTRIBUTES_USR;
Show All 14 Lines
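The splitting and lookup logic added to InputSection.cpp above can be tried out in isolation. The following is a minimal standalone sketch, not the lld code itself: `Piece`, `splitIntoPieces` and `getOffset` are hypothetical free-function stand-ins for `StringPiece`, `CStringInputSection::splitIntoPieces()` and `CStringInputSection::getOffset()`, hashing is left out, and an exception stands in for `fatal()`.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string_view>
#include <vector>

// Hypothetical stand-in for lld's StringPiece: one null-terminated string.
struct Piece {
  uint64_t inSecOff = 0;  // offset of the string within the input section
  uint64_t outSecOff = 0; // offset within the output section, set after dedup
};

// Split raw section contents into one piece per null-terminated string.
std::vector<Piece> splitIntoPieces(std::string_view data) {
  std::vector<Piece> pieces;
  uint64_t off = 0;
  while (!data.empty()) {
    std::size_t end = data.find('\0');
    if (end == std::string_view::npos)
      throw std::runtime_error("string is not null terminated");
    std::size_t size = end + 1; // include the terminator
    Piece p;
    p.inSecOff = off;
    pieces.push_back(p);
    data.remove_prefix(size);
    off += size;
  }
  return pieces;
}

// Translate an input-section offset into an output-section offset.
uint64_t getOffset(const std::vector<Piece> &pieces, uint64_t off) {
  // Find the first piece that starts after `off`; the piece just before it
  // is the one that contains `off`.
  auto it = std::partition_point(
      pieces.begin(), pieces.end(),
      [off](const Piece &p) { return p.inSecOff <= off; });
  assert(it != pieces.begin() && "offset precedes the first piece");
  const Piece &piece = *(it - 1);
  return piece.outSecOff + (off - piece.inSecOff);
}
```

Because pieces are appended in increasing `inSecOff` order, the binary search is valid without any extra sorting step.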

lld/MachO/MergedOutputSection.h

This file was moved to lld/MachO/ConcatOutputSection.h.

lld/MachO/MergedOutputSection.cpp

This file was moved to lld/MachO/ConcatOutputSection.cpp.

lld/MachO/Options.td

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	def lto_legacy_pass_manager: Flag<["--"], "lto-legacy-pass-manager">,
Group<grp_lld>;		Group<grp_lld>;
def no_lto_legacy_pass_manager : Flag<["--"], "no-lto-legacy-pass-manager">,		def no_lto_legacy_pass_manager : Flag<["--"], "no-lto-legacy-pass-manager">,
HelpText<"Use the new pass manager in LLVM">,		HelpText<"Use the new pass manager in LLVM">,
Group<grp_lld>;		Group<grp_lld>;
def time_trace: Flag<["--"], "time-trace">, HelpText<"Record time trace">;		def time_trace: Flag<["--"], "time-trace">, HelpText<"Record time trace">;
def time_trace_granularity_eq: Joined<["--"], "time-trace-granularity=">,		def time_trace_granularity_eq: Joined<["--"], "time-trace-granularity=">,
HelpText<"Minimum time granularity (in microseconds) traced by time profiler">;		HelpText<"Minimum time granularity (in microseconds) traced by time profiler">;
def time_trace_file_eq: Joined<["--"], "time-trace-file=">, HelpText<"Specify time trace output file">;		def time_trace_file_eq: Joined<["--"], "time-trace-file=">, HelpText<"Specify time trace output file">;
		def no_literal_merge: Flag<["--"], "no-literal-merge">, HelpText<"Disable literal merging">;

// This is a complete Options.td compiled from Apple's ld(1) manpage		// This is a complete Options.td compiled from Apple's ld(1) manpage
// dated 2018-03-07 and cross checked with ld64 source code in repo		// dated 2018-03-07 and cross checked with ld64 source code in repo
// https://github.com/apple-opensource/ld64 at git tag "512.4" dated		// https://github.com/apple-opensource/ld64 at git tag "512.4" dated
// 2018-03-18.		// 2018-03-18.

// Flags<[HelpHidden]> marks options that are not yet ported to lld,		// Flags<[HelpHidden]> marks options that are not yet ported to lld,
// and serve as a scoreboard for annotating our progress toward		// and serve as a scoreboard for annotating our progress toward
▲ Show 20 Lines • Show All 1,254 Lines • Show Last 20 Lines

lld/MachO/OutputSection.h

Show First 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	virtual void finalize() {
// TODO investigate refactoring synthetic section finalization logic into		// TODO investigate refactoring synthetic section finalization logic into
// overrides of this function.		// overrides of this function.
}		}

virtual void writeTo(uint8_t *buf) const = 0;		virtual void writeTo(uint8_t *buf) const = 0;

StringRef name;		StringRef name;
OutputSegment *parent = nullptr;		OutputSegment *parent = nullptr;
		// For output sections that don't have explicit ordering requirements, their
		// output order should be based on the relative ordering of the input sections
		// they contain.
		size_t inputOrder = 0;

uint32_t index = 0;		uint32_t index = 0;
uint64_t addr = 0;		uint64_t addr = 0;
uint64_t fileOff = 0;		uint64_t fileOff = 0;
uint32_t align = 1;		uint32_t align = 1;
uint32_t flags = 0;		uint32_t flags = 0;
uint32_t reserved1 = 0;		uint32_t reserved1 = 0;
uint32_t reserved2 = 0;		uint32_t reserved2 = 0;
Show All 9 Lines

lld/MachO/OutputSegment.h

	Show All 36 Lines
	class OutputSegment {			class OutputSegment {
	public:			public:
	const OutputSection *firstSection() const { return sections.front(); }			const OutputSection *firstSection() const { return sections.front(); }
	const OutputSection *lastSection() const { return sections.back(); }			const OutputSection *lastSection() const { return sections.back(); }

	void addOutputSection(OutputSection *os);			void addOutputSection(OutputSection *os);
	void sortOutputSections(			void sortOutputSections(
	llvm::function_ref<bool(OutputSection *, OutputSection *)> comparator) {			llvm::function_ref<bool(OutputSection *, OutputSection *)> comparator) {
	llvm::stable_sort(sections, comparator);			llvm::sort(sections, comparator);
	}			}

	const std::vector<OutputSection *> &getSections() const { return sections; }			const std::vector<OutputSection *> &getSections() const { return sections; }
	size_t numNonHiddenSections() const;			size_t numNonHiddenSections() const;

	uint64_t fileOff = 0;			uint64_t fileOff = 0;
	uint64_t fileSize = 0;			uint64_t fileSize = 0;
	uint64_t vmSize = 0;			uint64_t vmSize = 0;
				size_t inputOrder = 0;
	StringRef name;			StringRef name;
	uint32_t maxProt = 0;			uint32_t maxProt = 0;
	uint32_t initProt = 0;			uint32_t initProt = 0;
	uint8_t index;			uint8_t index;

	private:			private:
	std::vector<OutputSection *> sections;			std::vector<OutputSection *> sections;
	};			};
	Show All 9 Lines

lld/MachO/OutputSegment.cpp

	//===- OutputSegment.cpp --------------------------------------------------===//			//===- OutputSegment.cpp --------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "OutputSegment.h"			#include "OutputSegment.h"
				#include "ConcatOutputSection.h"
	#include "InputSection.h"			#include "InputSection.h"
	#include "MergedOutputSection.h"
	#include "SyntheticSections.h"			#include "SyntheticSections.h"

	#include "lld/Common/ErrorHandler.h"			#include "lld/Common/ErrorHandler.h"
	#include "lld/Common/Memory.h"			#include "lld/Common/Memory.h"
	#include "llvm/BinaryFormat/MachO.h"			#include "llvm/BinaryFormat/MachO.h"

	using namespace llvm;			using namespace llvm;
	using namespace llvm::MachO;			using namespace llvm::MachO;
	Show All 25 Lines
	size_t OutputSegment::numNonHiddenSections() const {			size_t OutputSegment::numNonHiddenSections() const {
	size_t count = 0;			size_t count = 0;
	for (const OutputSection *osec : sections)			for (const OutputSection *osec : sections)
	count += (!osec->isHidden() ? 1 : 0);			count += (!osec->isHidden() ? 1 : 0);
	return count;			return count;
	}			}

	void OutputSegment::addOutputSection(OutputSection *osec) {			void OutputSegment::addOutputSection(OutputSection *osec) {
				if (sections.empty())
				inputOrder = osec->inputOrder;

	osec->parent = this;			osec->parent = this;
	sections.push_back(osec);			sections.push_back(osec);

	for (const SectionAlign &sectAlign : config->sectionAlignments)			for (const SectionAlign &sectAlign : config->sectionAlignments)
	if (sectAlign.segName == name && sectAlign.sectName == osec->name)			if (sectAlign.segName == name && sectAlign.sectName == osec->name)
	osec->align = sectAlign.align;			osec->align = sectAlign.align;
	}			}

	Show All 16 Lines

lld/MachO/Symbols.cpp

Show All 34 Lines	uint64_t Defined::getVA() const {
if (isAbsolute())		if (isAbsolute())
return value;		return value;

if (!isec->isFinal) {		if (!isec->isFinal) {
// A target arch that does not use thunks ought never ask for		// A target arch that does not use thunks ought never ask for
// the address of a function that has not yet been finalized.		// the address of a function that has not yet been finalized.
assert(target->usesThunks());		assert(target->usesThunks());

// MergedOutputSection::finalize() can seek the address of a		// ConcatOutputSection::finalize() can seek the address of a
// function before its address is assigned. The thunking algorithm		// function before its address is assigned. The thunking algorithm
// knows that unfinalized functions will be out of range, so it is		// knows that unfinalized functions will be out of range, so it is
// expedient to return a contrived out-of-range address.		// expedient to return a contrived out-of-range address.
return TargetInfo::outOfRangeVA;		return TargetInfo::outOfRangeVA;
}		}
return isec->getVA() + value;		return isec->getVA(value);
}		}

uint64_t Defined::getFileOffset() const {		uint64_t Defined::getFileOffset() const {
if (isAbsolute()) {		if (isAbsolute()) {
error("absolute symbol " + toString(*this) +		error("absolute symbol " + toString(*this) +
" does not have a file offset");		" does not have a file offset");
return 0;		return 0;
}		}
return isec->getFileOffset() + value;		return isec->getFileOffset(value);
}		}

uint64_t DylibSymbol::getVA() const {		uint64_t DylibSymbol::getVA() const {
return isInStubs() ? getStubVA() : Symbol::getVA();		return isInStubs() ? getStubVA() : Symbol::getVA();
}		}

void LazySymbol::fetchArchiveMember() { getFile()->fetch(sym); }		void LazySymbol::fetchArchiveMember() { getFile()->fetch(sym); }
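The change from `isec->getVA() + value` to `isec->getVA(value)` matters because a section split into pieces no longer maps offsets linearly. A small sketch of the idea follows, using hypothetical simplified classes (`Section`, `ConcatSection`, `PiecewiseSection`) rather than lld's actual `InputSection` hierarchy; the `std::map` lookup is an assumption made for brevity.

```cpp
#include <cassert>
#include <cstdint>
#include <map>

// Hypothetical simplified base class: the VA of an offset is the parent's
// address plus whatever output offset the section maps that offset to.
struct Section {
  uint64_t parentAddr = 0;
  virtual ~Section() = default;
  virtual uint64_t getOffset(uint64_t off) const = 0;
  uint64_t getVA(uint64_t off) const { return parentAddr + getOffset(off); }
};

// Concatenated sections keep their contents contiguous, so the mapping is
// a constant shift.
struct ConcatSection : Section {
  uint64_t outSecOff = 0;
  uint64_t getOffset(uint64_t off) const override { return outSecOff + off; }
};

// Sections split into pieces may have each piece placed independently.
struct PiecewiseSection : Section {
  // Maps the input offset of each piece start to its output offset.
  std::map<uint64_t, uint64_t> pieces;
  uint64_t getOffset(uint64_t off) const override {
    auto it = pieces.upper_bound(off); // first piece starting after `off`
    assert(it != pieces.begin() && "offset precedes the first piece");
    --it;
    return it->second + (off - it->first);
  }
};
```

For a `ConcatSection`, `getVA(off)` and `getVA(0) + off` agree; for a `PiecewiseSection` they can differ whenever the pieces are reordered or merged in the output.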

lld/MachO/SyntheticSections.h

Show All 12 Lines
#include "ExportTrie.h"		#include "ExportTrie.h"
#include "InputSection.h"		#include "InputSection.h"
#include "OutputSection.h"		#include "OutputSection.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "Target.h"		#include "Target.h"

#include "llvm/ADT/PointerUnion.h"		#include "llvm/ADT/PointerUnion.h"
#include "llvm/ADT/SetVector.h"		#include "llvm/ADT/SetVector.h"
		#include "llvm/MC/StringTableBuilder.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"

namespace llvm {		namespace llvm {
class DWARFUnit;		class DWARFUnit;
} // namespace llvm		} // namespace llvm

namespace lld {		namespace lld {
▲ Show 20 Lines • Show All 120 Lines • ▼ Show 20 Lines
};		};

struct Location {		struct Location {
const InputSection *isec;		const InputSection *isec;
uint64_t offset;		uint64_t offset;

Location(const InputSection *isec, uint64_t offset)		Location(const InputSection *isec, uint64_t offset)
: isec(isec), offset(offset) {}		: isec(isec), offset(offset) {}
uint64_t getVA() const { return isec->getVA() + offset; }		uint64_t getVA() const { return isec->getVA(offset); }
};		};

// Stores rebase opcodes, which tell dyld where absolute addresses have been		// Stores rebase opcodes, which tell dyld where absolute addresses have been
// encoded in the binary. If the binary is not loaded at its preferred address,		// encoded in the binary. If the binary is not loaded at its preferred address,
// dyld has to rebase these addresses by adding an offset to them.		// dyld has to rebase these addresses by adding an offset to them.
class RebaseSection : public LinkEditSection {		class RebaseSection : public LinkEditSection {
public:		public:
RebaseSection();		RebaseSection();
▲ Show 20 Lines • Show All 125 Lines • ▼ Show 20 Lines	public:
bool isNeeded() const override { return !entries.empty(); }		bool isNeeded() const override { return !entries.empty(); }
void finalize() override;		void finalize() override;
void writeTo(uint8_t *buf) const override;		void writeTo(uint8_t *buf) const override;
const llvm::SetVector<Symbol *> &getEntries() const { return entries; }		const llvm::SetVector<Symbol *> &getEntries() const { return entries; }
// Returns whether the symbol was added. Note that every stubs entry will		// Returns whether the symbol was added. Note that every stubs entry will
// have a corresponding entry in the LazyPointerSection.		// have a corresponding entry in the LazyPointerSection.
bool addEntry(Symbol *);		bool addEntry(Symbol *);
uint64_t getVA(uint32_t stubsIndex) const {		uint64_t getVA(uint32_t stubsIndex) const {
// MergedOutputSection::finalize() can seek the address of a		// ConcatOutputSection::finalize() can seek the address of a
// stub before its address is assigned. Before __stubs is		// stub before its address is assigned. Before __stubs is
// finalized, return a contrived out-of-range address.		// finalized, return a contrived out-of-range address.
return isFinal ? addr + stubsIndex * target->stubSize		return isFinal ? addr + stubsIndex * target->stubSize
: TargetInfo::outOfRangeVA;		: TargetInfo::outOfRangeVA;
}		}

bool isFinal = false; // is address assigned?		bool isFinal = false; // is address assigned?

Show All 13 Lines	public:
DylibSymbol *stubBinder = nullptr;		DylibSymbol *stubBinder = nullptr;
Defined *dyldPrivate = nullptr;		Defined *dyldPrivate = nullptr;
};		};

// This section contains space for just a single word, and will be used by dyld		// This section contains space for just a single word, and will be used by dyld
// to cache an address to the image loader it uses. Note that unlike the other		// to cache an address to the image loader it uses. Note that unlike the other
// synthetic sections, which are OutputSections, the ImageLoaderCacheSection is		// synthetic sections, which are OutputSections, the ImageLoaderCacheSection is
// an InputSection that gets merged into the __data OutputSection.		// an InputSection that gets merged into the __data OutputSection.
class ImageLoaderCacheSection : public InputSection {		class ImageLoaderCacheSection : public ConcatInputSection {
public:		public:
ImageLoaderCacheSection();		ImageLoaderCacheSection();
uint64_t getSize() const override { return target->wordSize; }		uint64_t getSize() const override { return target->wordSize; }
};		};

// Note that this section may also be targeted by non-lazy bindings. In		// Note that this section may also be targeted by non-lazy bindings. In
// particular, this happens when branch relocations target weak symbols.		// particular, this happens when branch relocations target weak symbols.
class LazyPointerSection : public SyntheticSection {		class LazyPointerSection : public SyntheticSection {
▲ Show 20 Lines • Show All 172 Lines • ▼ Show 20 Lines	public:
void finalize() override;		void finalize() override;
void writeTo(uint8_t *buf) const override;		void writeTo(uint8_t *buf) const override;

private:		private:
llvm::SmallString<261> xarPath;		llvm::SmallString<261> xarPath;
uint64_t xarSize;		uint64_t xarSize;
};		};

static_assert((CodeSignatureSection::blobHeadersSize % 8) == 0, "");		class CStringSection : public SyntheticSection {
static_assert((CodeSignatureSection::fixedHeadersSize % 8) == 0, "");		public:
		CStringSection();
		void addInput(CStringInputSection *);
		uint64_t getSize() const override { return builder.getSize(); }
		void finalize() override;
		bool isNeeded() const override { return !inputs.empty(); }
		void writeTo(uint8_t *buf) const override { builder.write(buf); }

		std::vector<CStringInputSection *> inputs;

		private:
		llvm::StringTableBuilder builder;
		};

struct InStruct {		struct InStruct {
MachHeaderSection *header = nullptr;		MachHeaderSection *header = nullptr;
		CStringSection *cStringSection = nullptr;
RebaseSection *rebase = nullptr;		RebaseSection *rebase = nullptr;
BindingSection *binding = nullptr;		BindingSection *binding = nullptr;
WeakBindingSection *weakBinding = nullptr;		WeakBindingSection *weakBinding = nullptr;
LazyBindingSection *lazyBinding = nullptr;		LazyBindingSection *lazyBinding = nullptr;
ExportSection *exports = nullptr;		ExportSection *exports = nullptr;
GotSection *got = nullptr;		GotSection *got = nullptr;
TlvPointerSection *tlvPointers = nullptr;		TlvPointerSection *tlvPointers = nullptr;
LazyPointerSection *lazyPointers = nullptr;		LazyPointerSection *lazyPointers = nullptr;
Show All 15 Lines

lld/MachO/SyntheticSections.cpp

//===- SyntheticSections.cpp ---------------------------------------------===//		//===- SyntheticSections.cpp ---------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "SyntheticSections.h"		#include "SyntheticSections.h"
		#include "ConcatOutputSection.h"
#include "Config.h"		#include "Config.h"
#include "ExportTrie.h"		#include "ExportTrie.h"
#include "InputFiles.h"		#include "InputFiles.h"
#include "MachOStructs.h"		#include "MachOStructs.h"
#include "MergedOutputSection.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "SymbolTable.h"		#include "SymbolTable.h"
#include "Symbols.h"		#include "Symbols.h"
#include "Writer.h"		#include "Writer.h"

#include "lld/Common/ErrorHandler.h"		#include "lld/Common/ErrorHandler.h"
#include "lld/Common/Memory.h"		#include "lld/Common/Memory.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
Show All 20 Lines
using namespace lld;		using namespace lld;
using namespace lld::macho;		using namespace lld::macho;

InStruct macho::in;		InStruct macho::in;
std::vector<SyntheticSection *> macho::syntheticSections;		std::vector<SyntheticSection *> macho::syntheticSections;

SyntheticSection::SyntheticSection(const char *segname, const char *name)		SyntheticSection::SyntheticSection(const char *segname, const char *name)
: OutputSection(SyntheticKind, name), segname(segname) {		: OutputSection(SyntheticKind, name), segname(segname) {
isec = make<InputSection>();		isec = make<ConcatInputSection>();
isec->segname = segname;		isec->segname = segname;
isec->name = name;		isec->name = name;
isec->parent = this;		isec->parent = this;
isec->outSecOff = 0;
syntheticSections.push_back(this);		syntheticSections.push_back(this);
}		}

// dyld3's MachOLoaded::getSlide() assumes that the __TEXT segment starts		// dyld3's MachOLoaded::getSlide() assumes that the __TEXT segment starts
// from the beginning of the file (i.e. the header).		// from the beginning of the file (i.e. the header).
MachHeaderSection::MachHeaderSection()		MachHeaderSection::MachHeaderSection()
: SyntheticSection(segment_names::text, section_names::header) {		: SyntheticSection(segment_names::text, section_names::header) {
// XXX: This is a hack. (See D97007)		// XXX: This is a hack. (See D97007)
▲ Show 20 Lines • Show All 133 Lines • ▼ Show 20 Lines	if (locations.empty())
return;		return;

raw_svector_ostream os{contents};		raw_svector_ostream os{contents};
Rebase lastRebase;		Rebase lastRebase;

os << static_cast<uint8_t>(REBASE_OPCODE_SET_TYPE_IMM | REBASE_TYPE_POINTER);		os << static_cast<uint8_t>(REBASE_OPCODE_SET_TYPE_IMM | REBASE_TYPE_POINTER);

llvm::sort(locations, [](const Location &a, const Location &b) {		llvm::sort(locations, [](const Location &a, const Location &b) {
return a.isec->getVA() < b.isec->getVA();		return a.isec->getVA(a.offset) < b.isec->getVA(b.offset);
});		});
for (const Location &loc : locations)		for (const Location &loc : locations)
encodeRebase(loc.isec->parent, loc.isec->outSecOff + loc.offset, lastRebase,		encodeRebase(loc.isec->parent, loc.isec->getOffset(loc.offset), lastRebase,
os);		os);
if (lastRebase.consecutiveCount != 0)		if (lastRebase.consecutiveCount != 0)
encodeDoRebase(lastRebase, os);		encodeDoRebase(lastRebase, os);

os << static_cast<uint8_t>(REBASE_OPCODE_DONE);		os << static_cast<uint8_t>(REBASE_OPCODE_DONE);
}		}

void RebaseSection::writeTo(uint8_t *buf) const {		void RebaseSection::writeTo(uint8_t *buf) const {
▲ Show 20 Lines • Show All 144 Lines • ▼ Show 20 Lines	void BindingSection::finalizeContents() {
});		});
for (const BindingEntry &b : bindings) {		for (const BindingEntry &b : bindings) {
int16_t ordinal = ordinalForDylibSymbol(*b.dysym);		int16_t ordinal = ordinalForDylibSymbol(*b.dysym);
if (ordinal != lastBinding.ordinal) {		if (ordinal != lastBinding.ordinal) {
encodeDylibOrdinal(ordinal, os);		encodeDylibOrdinal(ordinal, os);
lastBinding.ordinal = ordinal;		lastBinding.ordinal = ordinal;
}		}
encodeBinding(b.dysym, b.target.isec->parent,		encodeBinding(b.dysym, b.target.isec->parent,
b.target.isec->outSecOff + b.target.offset, b.addend,		b.target.isec->getOffset(b.target.offset), b.addend,
/*isWeakBinding=*/false, lastBinding, os);		/*isWeakBinding=*/false, lastBinding, os);
}		}
if (!bindings.empty())		if (!bindings.empty())
os << static_cast<uint8_t>(BIND_OPCODE_DONE);		os << static_cast<uint8_t>(BIND_OPCODE_DONE);
}		}

void BindingSection::writeTo(uint8_t *buf) const {		void BindingSection::writeTo(uint8_t *buf) const {
memcpy(buf, contents.data(), contents.size());		memcpy(buf, contents.data(), contents.size());
Show All 12 Lines	void WeakBindingSection::finalizeContents() {
// Since bindings are delta-encoded, sorting them allows for a more compact		// Since bindings are delta-encoded, sorting them allows for a more compact
// result.		// result.
llvm::sort(bindings,		llvm::sort(bindings,
[](const WeakBindingEntry &a, const WeakBindingEntry &b) {		[](const WeakBindingEntry &a, const WeakBindingEntry &b) {
return a.target.getVA() < b.target.getVA();		return a.target.getVA() < b.target.getVA();
});		});
for (const WeakBindingEntry &b : bindings)		for (const WeakBindingEntry &b : bindings)
encodeBinding(b.symbol, b.target.isec->parent,		encodeBinding(b.symbol, b.target.isec->parent,
b.target.isec->outSecOff + b.target.offset, b.addend,		b.target.isec->getOffset(b.target.offset), b.addend,
/*isWeakBinding=*/true, lastBinding, os);		/*isWeakBinding=*/true, lastBinding, os);
if (!bindings.empty() || !definitions.empty())		if (!bindings.empty() || !definitions.empty())
os << static_cast<uint8_t>(BIND_OPCODE_DONE);		os << static_cast<uint8_t>(BIND_OPCODE_DONE);
}		}

void WeakBindingSection::writeTo(uint8_t *buf) const {		void WeakBindingSection::writeTo(uint8_t *buf) const {
memcpy(buf, contents.data(), contents.size());		memcpy(buf, contents.data(), contents.size());
}		}
▲ Show 20 Lines • Show All 510 Lines • ▼ Show 20 Lines
void StringTableSection::writeTo(uint8_t *buf) const {		void StringTableSection::writeTo(uint8_t *buf) const {
uint32_t off = 0;		uint32_t off = 0;
for (StringRef str : strings) {		for (StringRef str : strings) {
memcpy(buf + off, str.data(), str.size());		memcpy(buf + off, str.data(), str.size());
off += str.size() + 1; // account for null terminator		off += str.size() + 1; // account for null terminator
}		}
}		}

		static_assert((CodeSignatureSection::blobHeadersSize % 8) == 0, "");
		static_assert((CodeSignatureSection::fixedHeadersSize % 8) == 0, "");

CodeSignatureSection::CodeSignatureSection()		CodeSignatureSection::CodeSignatureSection()
: LinkEditSection(segment_names::linkEdit, section_names::codeSignature) {		: LinkEditSection(segment_names::linkEdit, section_names::codeSignature) {
align = 16; // required by libstuff		align = 16; // required by libstuff
fileName = config->outputFile;		fileName = config->outputFile;
size_t slashIndex = fileName.rfind("/");		size_t slashIndex = fileName.rfind("/");
if (slashIndex != std::string::npos)		if (slashIndex != std::string::npos)
fileName = fileName.drop_front(slashIndex + 1);		fileName = fileName.drop_front(slashIndex + 1);
allHeadersSize = alignTo<16>(fixedHeadersSize + fileName.size() + 1);		allHeadersSize = alignTo<16>(fixedHeadersSize + fileName.size() + 1);
▲ Show 20 Lines • Show All 126 Lines • ▼ Show 20 Lines	void BitcodeBundleSection::writeTo(uint8_t *buf) const {
if (ec)		if (ec)
fatal("failed to map XAR file");		fatal("failed to map XAR file");
memcpy(buf, xarMap.const_data(), xarSize);		memcpy(buf, xarMap.const_data(), xarSize);

closeFile(handle);		closeFile(handle);
remove(xarPath);		remove(xarPath);
}		}

		CStringSection::CStringSection()
		: SyntheticSection(segment_names::text, section_names::cString),
		builder(StringTableBuilder::RAW) {
		flags = S_CSTRING_LITERALS; // FIXME merge flags properly
		}
		gkm: The extremity of the target-dependent difference in alignment requirement is surprising, and worthy of a comment.
		int3 (author): Good point. I've copied the relevant bits of the commit message.

		void CStringSection::addInput(CStringInputSection *isec) {
		align = std::max(align, isec->align);
		isec->parent = this;
		inputs.push_back(isec);
		}

		void CStringSection::finalize() {
		// Add all string pieces to the string table builder to create section
		// contents.
		for (const CStringInputSection *isec : inputs)
		for (size_t i = 0, e = isec->pieces.size(); i != e; ++i)
		builder.add(isec->getCachedHashStringRef(i));

		// Fix the string table content. After this, the contents will never change.
		builder.finalize();

		// finalize() fixed tail-optimized strings, so we can now get
		// offsets of strings. Get an offset for each string and save it
		// to a corresponding SectionPiece for easy access.
		for (CStringInputSection *isec : inputs) {
		for (size_t i = 0, e = isec->pieces.size(); i != e; ++i) {
		isec->pieces[i].outSecOff =
		builder.getOffset(isec->getCachedHashStringRef(i));
		isec->isFinal = true;
		}
		}
		}

void macho::createSyntheticSymbols() {		void macho::createSyntheticSymbols() {
auto addHeaderSymbol = [](const char *name) {		auto addHeaderSymbol = [](const char *name) {
symtab->addSynthetic(name, in.header->isec, /*value=*/0,		symtab->addSynthetic(name, in.header->isec, /*value=*/0,
/*privateExtern=*/true, /*includeInSymtab=*/false,		/*privateExtern=*/true, /*includeInSymtab=*/false,
/*referencedDynamically=*/false);		/*referencedDynamically=*/false);
};		};

switch (config->outputType) {		switch (config->outputType) {
▲ Show 20 Lines • Show All 46 Lines • Show Last 20 Lines
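`CStringSection::finalize()` above relies on `llvm::StringTableBuilder` to assign one output offset per distinct string. As a rough illustration of the deduplication step only, here is a sketch built on a plain hash map. `DedupBuilder` is a hypothetical class, not part of LLVM; it assigns offsets eagerly in insertion order and performs no tail merging, so it should not be read as a description of what `StringTableBuilder` does internally.

```cpp
#include <cstdint>
#include <string>
#include <unordered_map>

// Hypothetical simplified builder: every distinct string gets exactly one
// offset in the output, and repeated strings reuse it.
class DedupBuilder {
public:
  // Returns the output offset of `s`, reserving space on first sight.
  uint64_t add(const std::string &s) {
    auto res = offsets.emplace(s, size);
    if (res.second)
      size += s.size() + 1; // account for the null terminator
    return res.first->second;
  }

  // Total size of the deduplicated output, in bytes.
  uint64_t getSize() const { return size; }

private:
  std::unordered_map<std::string, uint64_t> offsets;
  uint64_t size = 0;
};
```

In this simplified model, the value returned by `add()` plays the role of `StringPiece::outSecOff`: every input piece holding the same string ends up pointing at the same output offset.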

lld/MachO/UnwindInfoSection.h

	//===- UnwindInfoSection.h ------------------------------------------------===//			//===- UnwindInfoSection.h ------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLD_MACHO_UNWIND_INFO_H			#ifndef LLD_MACHO_UNWIND_INFO_H
	#define LLD_MACHO_UNWIND_INFO_H			#define LLD_MACHO_UNWIND_INFO_H

	#include "MergedOutputSection.h"			#include "ConcatOutputSection.h"
	#include "SyntheticSections.h"			#include "SyntheticSections.h"

	#include "mach-o/compact_unwind_encoding.h"			#include "mach-o/compact_unwind_encoding.h"
	#include "llvm/ADT/DenseMap.h"			#include "llvm/ADT/DenseMap.h"

	#include <vector>			#include <vector>

	namespace lld {			namespace lld {
	namespace macho {			namespace macho {

	class UnwindInfoSection : public SyntheticSection {			class UnwindInfoSection : public SyntheticSection {
	public:			public:
	bool isNeeded() const override { return compactUnwindSection != nullptr; }			bool isNeeded() const override { return compactUnwindSection != nullptr; }
	uint64_t getSize() const override { return unwindInfoSize; }			uint64_t getSize() const override { return unwindInfoSize; }
	virtual void prepareRelocations(InputSection *) = 0;			virtual void prepareRelocations(InputSection *) = 0;

	void setCompactUnwindSection(MergedOutputSection *cuSection) {			void setCompactUnwindSection(ConcatOutputSection *cuSection) {
	compactUnwindSection = cuSection;			compactUnwindSection = cuSection;
	}			}

	protected:			protected:
	UnwindInfoSection()			UnwindInfoSection()
	: SyntheticSection(segment_names::text, section_names::unwindInfo) {			: SyntheticSection(segment_names::text, section_names::unwindInfo) {
	align = 4;			align = 4;
	}			}

	MergedOutputSection *compactUnwindSection = nullptr;			ConcatOutputSection *compactUnwindSection = nullptr;
	uint64_t unwindInfoSize = 0;			uint64_t unwindInfoSize = 0;
	};			};

	UnwindInfoSection *makeUnwindInfoSection();			UnwindInfoSection *makeUnwindInfoSection();
	void prepareCompactUnwind(InputSection *isec);			void prepareCompactUnwind(InputSection *isec);

	} // namespace macho			} // namespace macho
	} // namespace lld			} // namespace lld

	#endif			#endif

lld/MachO/UnwindInfoSection.cpp

	//===- UnwindInfoSection.cpp ----------------------------------------------===//			//===- UnwindInfoSection.cpp ----------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "UnwindInfoSection.h"			#include "UnwindInfoSection.h"
				#include "ConcatOutputSection.h"
	#include "Config.h"			#include "Config.h"
	#include "InputSection.h"			#include "InputSection.h"
	#include "MergedOutputSection.h"
	#include "OutputSection.h"			#include "OutputSection.h"
	#include "OutputSegment.h"			#include "OutputSegment.h"
	#include "SymbolTable.h"			#include "SymbolTable.h"
	#include "Symbols.h"			#include "Symbols.h"
	#include "SyntheticSections.h"			#include "SyntheticSections.h"
	#include "Target.h"			#include "Target.h"

	#include "lld/Common/ErrorHandler.h"			#include "lld/Common/ErrorHandler.h"
	[186 lines not shown]
	}			}

	// We need to apply the relocations to the pre-link compact unwind section			// We need to apply the relocations to the pre-link compact unwind section
	// before converting it to post-link form. There should only be absolute			// before converting it to post-link form. There should only be absolute
	// relocations here: since we are not emitting the pre-link CU section, there			// relocations here: since we are not emitting the pre-link CU section, there
	// is no source address to make a relative location meaningful.			// is no source address to make a relative location meaningful.
	template <class Ptr>			template <class Ptr>
	static void			static void
	relocateCompactUnwind(MergedOutputSection *compactUnwindSection,			relocateCompactUnwind(ConcatOutputSection *compactUnwindSection,
	std::vector<CompactUnwindEntry<Ptr>> &cuVector) {			std::vector<CompactUnwindEntry<Ptr>> &cuVector) {
	for (const InputSection *isec : compactUnwindSection->inputs) {			for (const ConcatInputSection *isec : compactUnwindSection->inputs) {
	assert(isec->parent == compactUnwindSection);			assert(isec->parent == compactUnwindSection);

	uint8_t *buf =			uint8_t *buf =
	reinterpret_cast<uint8_t *>(cuVector.data()) + isec->outSecFileOff;			reinterpret_cast<uint8_t *>(cuVector.data()) + isec->outSecFileOff;
	memcpy(buf, isec->data.data(), isec->data.size());			memcpy(buf, isec->data.data(), isec->data.size());

	for (const Reloc &r : isec->relocs) {			for (const Reloc &r : isec->relocs) {
	uint64_t referentVA = 0;			uint64_t referentVA = 0;
	if (auto referentSym = r.referent.dyn_cast<Symbol *>()) {			if (auto referentSym = r.referent.dyn_cast<Symbol *>()) {
	if (!isa<Undefined>(referentSym)) {			if (!isa<Undefined>(referentSym)) {
	assert(referentSym->isInGot());			assert(referentSym->isInGot());
	if (auto *defined = dyn_cast<Defined>(referentSym))			if (auto *defined = dyn_cast<Defined>(referentSym))
	checkTextSegment(defined->isec);			checkTextSegment(defined->isec);
	// At this point in the link, we may not yet know the final address of			// At this point in the link, we may not yet know the final address of
	// the GOT, so we just encode the index. We make it a 1-based index so			// the GOT, so we just encode the index. We make it a 1-based index so
	// that we can distinguish the null pointer case.			// that we can distinguish the null pointer case.
	referentVA = referentSym->gotIndex + 1;			referentVA = referentSym->gotIndex + 1;
	}			}
	} else if (auto referentIsec = r.referent.dyn_cast<InputSection *>()) {			} else if (auto referentIsec = r.referent.dyn_cast<InputSection *>()) {
	checkTextSegment(referentIsec);			checkTextSegment(referentIsec);
	if (referentIsec->shouldOmitFromOutput())			if (referentIsec->shouldOmitFromOutput())
	referentVA = UINT64_MAX; // Tombstone value			referentVA = UINT64_MAX; // Tombstone value
	else			else
	referentVA = referentIsec->getVA() + r.addend;			referentVA = referentIsec->getVA(r.addend);
	}			}

	writeAddress(buf + r.offset, referentVA, r.length);			writeAddress(buf + r.offset, referentVA, r.length);
	}			}
	}			}
	}			}
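The 1-based GOT index mentioned in the comment above can be sketched in isolation. This is only an illustration of the encoding idea, assuming a 32-bit index; `encodeGotIndex` and `decodeGotIndex` are hypothetical helpers that do not exist in LLD.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Encode an optional GOT index so that 0 stays free to mean "null pointer".
uint64_t encodeGotIndex(std::optional<uint32_t> gotIndex) {
  return gotIndex ? uint64_t(*gotIndex) + 1 : 0;
}

// Undo the encoding: 0 maps back to "no entry", anything else to index - 1.
std::optional<uint32_t> decodeGotIndex(uint64_t encoded) {
  if (encoded == 0)
    return std::nullopt;
  assert(encoded - 1 <= UINT32_MAX && "encoded index out of range");
  return static_cast<uint32_t>(encoded - 1);
}
```

Without the `+ 1`, the first GOT slot (index 0) would be indistinguishable from an entry that has no referent at all.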

	// There should only be a handful of unique personality pointers, so we can			// There should only be a handful of unique personality pointers, so we can
	[307 lines not shown]

lld/MachO/Writer.cpp

//===- Writer.cpp ---------------------------------------------------------===//		//===- Writer.cpp ---------------------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "Writer.h"		#include "Writer.h"
		#include "ConcatOutputSection.h"
#include "Config.h"		#include "Config.h"
#include "InputFiles.h"		#include "InputFiles.h"
#include "InputSection.h"		#include "InputSection.h"
#include "MapFile.h"		#include "MapFile.h"
#include "MergedOutputSection.h"
#include "OutputSection.h"		#include "OutputSection.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "SymbolTable.h"		#include "SymbolTable.h"
#include "Symbols.h"		#include "Symbols.h"
#include "SyntheticSections.h"		#include "SyntheticSections.h"
#include "Target.h"		#include "Target.h"
#include "UnwindInfoSection.h"		#include "UnwindInfoSection.h"

[737 lines not shown]
return StringSwitch<int>(seg->name)
.Case(segment_names::pageZero, -4)		.Case(segment_names::pageZero, -4)
.Case(segment_names::text, -3)		.Case(segment_names::text, -3)
.Case(segment_names::dataConst, -2)		.Case(segment_names::dataConst, -2)
.Case(segment_names::data, -1)		.Case(segment_names::data, -1)
.Case(segment_names::llvm, std::numeric_limits<int>::max() - 1)		.Case(segment_names::llvm, std::numeric_limits<int>::max() - 1)
// Make sure __LINKEDIT is the last segment (i.e. all its hidden		// Make sure __LINKEDIT is the last segment (i.e. all its hidden
// sections must be ordered after other sections).		// sections must be ordered after other sections).
.Case(segment_names::linkEdit, std::numeric_limits<int>::max())		.Case(segment_names::linkEdit, std::numeric_limits<int>::max())
.Default(0);		.Default(seg->inputOrder);
}		}

static int sectionOrder(OutputSection *osec) {		static int sectionOrder(OutputSection *osec) {
StringRef segname = osec->parent->name;		StringRef segname = osec->parent->name;
// Sections are uniquely identified by their segment + section name.		// Sections are uniquely identified by their segment + section name.
if (segname == segment_names::text) {		if (segname == segment_names::text) {
return StringSwitch<int>(osec->name)		return StringSwitch<int>(osec->name)
.Case(section_names::header, -4)		.Case(section_names::header, -4)
.Case(section_names::text, -3)		.Case(section_names::text, -3)
.Case(section_names::stubs, -2)		.Case(section_names::stubs, -2)
.Case(section_names::stubHelper, -1)		.Case(section_names::stubHelper, -1)
.Case(section_names::unwindInfo, std::numeric_limits<int>::max() - 1)		.Case(section_names::unwindInfo, std::numeric_limits<int>::max() - 1)
.Case(section_names::ehFrame, std::numeric_limits<int>::max())		.Case(section_names::ehFrame, std::numeric_limits<int>::max())
.Default(0);		.Default(osec->inputOrder);
} else if (segname == segment_names::data ||		} else if (segname == segment_names::data ||
segname == segment_names::dataConst) {		segname == segment_names::dataConst) {
// For each thread spawned, dyld will initialize its TLVs by copying the		// For each thread spawned, dyld will initialize its TLVs by copying the
// address range from the start of the first thread-local data section to		// address range from the start of the first thread-local data section to
// the end of the last one. We therefore arrange these sections contiguously		// the end of the last one. We therefore arrange these sections contiguously
// to minimize the amount of memory used. Additionally, since zerofill		// to minimize the amount of memory used. Additionally, since zerofill
// sections must be at the end of their segments, and since TLV data		// sections must be at the end of their segments, and since TLV data
// sections can be zerofills, we end up putting all TLV data sections at the		// sections can be zerofills, we end up putting all TLV data sections at the
// end of the segment.		// end of the segment.
switch (sectionType(osec->flags)) {		switch (sectionType(osec->flags)) {
case S_THREAD_LOCAL_REGULAR:		case S_THREAD_LOCAL_REGULAR:
return std::numeric_limits<int>::max() - 2;		return std::numeric_limits<int>::max() - 2;
case S_THREAD_LOCAL_ZEROFILL:		case S_THREAD_LOCAL_ZEROFILL:
return std::numeric_limits<int>::max() - 1;		return std::numeric_limits<int>::max() - 1;
case S_ZEROFILL:		case S_ZEROFILL:
return std::numeric_limits<int>::max();		return std::numeric_limits<int>::max();
default:		default:
return StringSwitch<int>(osec->name)		return StringSwitch<int>(osec->name)
		.Case(section_names::got, -3)
.Case(section_names::lazySymbolPtr, -2)		.Case(section_names::lazySymbolPtr, -2)
.Case(section_names::data, -1)		.Case(section_names::const_, -1)
.Default(0);		.Default(osec->inputOrder);
}		}
} else if (segname == segment_names::linkEdit) {		} else if (segname == segment_names::linkEdit) {
return StringSwitch<int>(osec->name)		return StringSwitch<int>(osec->name)
.Case(section_names::rebase, -9)		.Case(section_names::rebase, -9)
.Case(section_names::binding, -8)		.Case(section_names::binding, -8)
.Case(section_names::weakBinding, -7)		.Case(section_names::weakBinding, -7)
.Case(section_names::lazyBinding, -6)		.Case(section_names::lazyBinding, -6)
.Case(section_names::export_, -5)		.Case(section_names::export_, -5)
.Case(section_names::functionStarts, -4)		.Case(section_names::functionStarts, -4)
.Case(section_names::symbolTable, -3)		.Case(section_names::symbolTable, -3)
.Case(section_names::indirectSymbolTable, -2)		.Case(section_names::indirectSymbolTable, -2)
.Case(section_names::stringTable, -1)		.Case(section_names::stringTable, -1)
.Case(section_names::codeSignature, std::numeric_limits<int>::max())		.Case(section_names::codeSignature, std::numeric_limits<int>::max())
.Default(0);		.Default(osec->inputOrder);
}		}
// ZeroFill sections must always be at the end of their segments,		// ZeroFill sections must always be at the end of their segments,
// otherwise subsequent sections may get overwritten with zeroes at runtime.		// otherwise subsequent sections may get overwritten with zeroes at runtime.
if (sectionType(osec->flags) == S_ZEROFILL)		if (sectionType(osec->flags) == S_ZEROFILL)
return std::numeric_limits<int>::max();		return std::numeric_limits<int>::max();
return 0;		return osec->inputOrder;
}		}

template <typename T, typename F>		template <typename T, typename F>
static std::function<bool(T, T)> compareByOrder(F ord) {		static std::function<bool(T, T)> compareByOrder(F ord) {
return [=](T a, T b) { return ord(a) < ord(b); };		return [=](T a, T b) { return ord(a) < ord(b); };
}		}
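The change from `.Default(0)` to `.Default(osec->inputOrder)` makes the sort key itself carry the input order, so the ordering no longer has to lean on the stability of the sort. A minimal sketch of that idea, using a hypothetical `Section` struct and only three of the reserved names:

```cpp
#include <algorithm>
#include <cassert>
#include <limits>
#include <string>
#include <vector>

// Hypothetical stand-in for OutputSection; only the fields the sort needs.
struct Section {
  std::string name;
  int inputOrder; // index of the first input section that landed here
};

// Reserved names get fixed keys; everything else keeps its input order.
int sectionOrder(const Section &s) {
  // Input indices are non-negative, so they cannot collide with the
  // negative keys reserved for sections that must come first.
  assert(s.inputOrder >= 0);
  if (s.name == "__text")
    return -3;
  if (s.name == "__stubs")
    return -2;
  if (s.name == "__eh_frame")
    return std::numeric_limits<int>::max();
  return s.inputOrder;
}

// Sorts by key and returns the resulting section names.
std::vector<std::string> sortedNames(std::vector<Section> secs) {
  std::sort(secs.begin(), secs.end(), [](const Section &a, const Section &b) {
    return sectionOrder(a) < sectionOrder(b);
  });
  std::vector<std::string> names;
  for (const Section &s : secs)
    names.push_back(s.name);
  return names;
}
```

As long as every section has a distinct key, an unstable sort gives the same result as a stable one, which is presumably why the segment sort in this diff can use `llvm::sort`.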

// Sorting only can happen once all outputs have been collected. Here we sort		// Sorting only can happen once all outputs have been collected. Here we sort
// segments, output sections within each segment, and input sections within each		// segments, output sections within each segment, and input sections within each
// output segment.		// output segment.
static void sortSegmentsAndSections() {		static void sortSegmentsAndSections() {
TimeTraceScope timeScope("Sort segments and sections");		TimeTraceScope timeScope("Sort segments and sections");

llvm::stable_sort(outputSegments,		llvm::sort(outputSegments, compareByOrder<OutputSegment *>(segmentOrder));
compareByOrder<OutputSegment *>(segmentOrder));

DenseMap<const InputSection *, size_t> isecPriorities =		DenseMap<const InputSection *, size_t> isecPriorities =
buildInputSectionPriorities();		buildInputSectionPriorities();

uint32_t sectionIndex = 0;		uint32_t sectionIndex = 0;
for (OutputSegment *seg : outputSegments) {		for (OutputSegment *seg : outputSegments) {
seg->sortOutputSections(compareByOrder<OutputSection *>(sectionOrder));		seg->sortOutputSections(compareByOrder<OutputSection *>(sectionOrder));
for (OutputSection *osec : seg->getSections()) {		for (OutputSection *osec : seg->getSections()) {
// Now that the output sections are sorted, assign the final		// Now that the output sections are sorted, assign the final
// output section indices.		// output section indices.
if (!osec->isHidden())		if (!osec->isHidden())
osec->index = ++sectionIndex;		osec->index = ++sectionIndex;
if (!firstTLVDataSection && isThreadLocalData(osec->flags))		if (!firstTLVDataSection && isThreadLocalData(osec->flags))
firstTLVDataSection = osec;		firstTLVDataSection = osec;

if (!isecPriorities.empty()) {		if (!isecPriorities.empty()) {
if (auto *merged = dyn_cast<MergedOutputSection>(osec)) {		if (auto *merged = dyn_cast<ConcatOutputSection>(osec)) {
llvm::stable_sort(merged->inputs,		llvm::stable_sort(merged->inputs,
[&](InputSection *a, InputSection *b) {		[&](InputSection *a, InputSection *b) {
return isecPriorities[a] > isecPriorities[b];		return isecPriorities[a] > isecPriorities[b];
});		});
}		}
}		}
}		}
}		}
[28 lines not shown]
case MH_EXECUTE:
break;		break;
case MH_DYLIB:		case MH_DYLIB:
case MH_BUNDLE:		case MH_BUNDLE:
break;		break;
default:		default:
llvm_unreachable("unhandled output file type");		llvm_unreachable("unhandled output file type");
}		}

// Then merge input sections into output sections.		// Then add input sections to output sections.
MapVector<NamePair, MergedOutputSection *> mergedOutputSections;		DenseMap<NamePair, ConcatOutputSection *> mergedOutputSections;
for (InputSection *isec : inputSections) {		for (const auto &p : enumerate(inputSections)) {
		InputSection *isec = p.value();
if (isec->shouldOmitFromOutput())		if (isec->shouldOmitFromOutput())
continue;		continue;
		if (auto *concatIsec = dyn_cast<ConcatInputSection>(isec)) {
NamePair names = maybeRenameSection({isec->segname, isec->name});		NamePair names = maybeRenameSection({isec->segname, isec->name});
MergedOutputSection *&osec = mergedOutputSections[names];		ConcatOutputSection *&osec = mergedOutputSections[names];
if (osec == nullptr)		if (osec == nullptr) {
osec = make<MergedOutputSection>(names.second);		osec = make<ConcatOutputSection>(names.second);
osec->mergeInput(isec);		osec->inputOrder = p.index();
		}
		osec->addInput(concatIsec);
		} else if (auto *cStringIsec = dyn_cast<CStringInputSection>(isec)) {
		if (in.cStringSection->inputs.empty())
		in.cStringSection->inputOrder = p.index();
		in.cStringSection->addInput(cStringIsec);
		}
}		}

		// Once all the inputs are added, we can finalize the output section
		// properties and create the corresponding output segments.
for (const auto &it : mergedOutputSections) {		for (const auto &it : mergedOutputSections) {
StringRef segname = it.first.first;		StringRef segname = it.first.first;
MergedOutputSection *osec = it.second;		ConcatOutputSection *osec = it.second;
if (segname == segment_names::ld) {		if (segname == segment_names::ld) {
assert(osec->name == section_names::compactUnwind);		assert(osec->name == section_names::compactUnwind);
in.unwindInfo->setCompactUnwindSection(osec);		in.unwindInfo->setCompactUnwindSection(osec);
} else {		} else {
getOrCreateOutputSegment(segname)->addOutputSection(osec);		getOrCreateOutputSegment(segname)->addOutputSection(osec);
}		}
}		}

for (SyntheticSection *ssec : syntheticSections) {		for (SyntheticSection *ssec : syntheticSections) {
auto it = mergedOutputSections.find({ssec->segname, ssec->name});		auto it = mergedOutputSections.find({ssec->segname, ssec->name});
		if (ssec->isNeeded()) {
if (it == mergedOutputSections.end()) {		if (it == mergedOutputSections.end()) {
		int3 (author): An earlier implementation of this diff always created the CStringLiteralSection, even if literal merging was disabled. I therefore hoisted out this check to avoid having a conflict between the unneeded CStringLiteralSection and the actual ConcatOutputSection when literal merging was not being done. We now only create the CStringLiteralSection as-needed, so this is likely unnecessary. However, I think it still makes sense to avoid unnecessary section name conflicts, so I've left it in.
if (ssec->isNeeded())
getOrCreateOutputSegment(ssec->segname)->addOutputSection(ssec);		getOrCreateOutputSegment(ssec->segname)->addOutputSection(ssec);
} else {		} else {
error("section from " + toString(it->second->firstSection()->file) +		fatal("section from " + toString(it->second->firstSection()->file) +
" conflicts with synthetic section " + ssec->segname + "," +		" conflicts with synthetic section " + ssec->segname + "," +
ssec->name);		ssec->name);
}		}
}		}
		}

// dyld requires __LINKEDIT segment to always exist (even if empty).		// dyld requires __LINKEDIT segment to always exist (even if empty).
linkEditSegment = getOrCreateOutputSegment(segment_names::linkEdit);		linkEditSegment = getOrCreateOutputSegment(segment_names::linkEdit);
}		}
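The switch from `MapVector` to `DenseMap` above means the map no longer remembers insertion order, so the diff records `inputOrder` explicitly when an output section is first created. A rough sketch of that bookkeeping with standard containers, where `Group` and `groupByName` are hypothetical names and `std::unordered_map` stands in for `DenseMap`:

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for an output section built from several inputs.
struct Group {
  size_t inputOrder = 0;       // index of the first input that created it
  std::vector<size_t> members; // indices of all inputs added to it
};

// Buckets inputs by name and remembers where each bucket was first seen.
std::unordered_map<std::string, Group>
groupByName(const std::vector<std::string> &inputNames) {
  std::unordered_map<std::string, Group> groups;
  for (size_t i = 0; i < inputNames.size(); ++i) {
    auto [it, inserted] = groups.try_emplace(inputNames[i]);
    if (inserted)
      it->second.inputOrder = i;
    // The first-seen index can never be later than the current input.
    assert(it->second.inputOrder <= i);
    it->second.members.push_back(i);
  }
  return groups;
}
```

Iterating the resulting map yields groups in an unspecified order, so any later pass that needs input order has to sort by the recorded `inputOrder` field.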

void Writer::finalizeAddresses() {		void Writer::finalizeAddresses() {
TimeTraceScope timeScope("Finalize addresses");		TimeTraceScope timeScope("Finalize addresses");
uint64_t pageSize = target->getPageSize();		uint64_t pageSize = target->getPageSize();
[130 lines not shown]
template <class LP> void Writer::run() {
writeMapFile();		writeMapFile();
writeOutputFile();		writeOutputFile();
}		}

template <class LP> void macho::writeResult() { Writer().run<LP>(); }		template <class LP> void macho::writeResult() { Writer().run<LP>(); }

void macho::createSyntheticSections() {		void macho::createSyntheticSections() {
in.header = make<MachHeaderSection>();		in.header = make<MachHeaderSection>();
		in.cStringSection = config->mergeLiterals ? make<CStringSection>() : nullptr;
in.rebase = make<RebaseSection>();		in.rebase = make<RebaseSection>();
in.binding = make<BindingSection>();		in.binding = make<BindingSection>();
in.weakBinding = make<WeakBindingSection>();		in.weakBinding = make<WeakBindingSection>();
in.lazyBinding = make<LazyBindingSection>();		in.lazyBinding = make<LazyBindingSection>();
in.exports = make<ExportSection>();		in.exports = make<ExportSection>();
in.got = make<GotSection>();		in.got = make<GotSection>();
in.tlvPointers = make<TlvPointerSection>();		in.tlvPointers = make<TlvPointerSection>();
in.lazyPointers = make<LazyPointerSection>();		in.lazyPointers = make<LazyPointerSection>();
[10 lines not shown]

lld/test/MachO/cstring-merging.s

This file was added.

				# REQUIRES: x86
				# RUN: rm -rf %t; split-file %s %t
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/test.s -o %t/test.o
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/more-foo.s -o %t/more-foo.o
				# RUN: %lld -dylib %t/test.o %t/more-foo.o -o %t/test
				# RUN: llvm-objdump --macho --section="__TEXT,__cstring" --section="__DATA,ptrs" --syms %t/test | FileCheck %s
				# RUN: llvm-readobj --section-headers %t/test | FileCheck %s --check-prefix=HEADER

				## Make sure we only have two deduplicated strings in __cstring.
				# CHECK: Contents of (__TEXT,__cstring) section
				# CHECK-NEXT: bar
				# CHECK-NEXT: foo

				## Make sure both symbol and section relocations point to the right thing.
				# CHECK-NEXT: Contents of (__DATA,ptrs) section
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:bar
				# CHECK-NEXT: __TEXT:__cstring:bar
				# CHECK-NEXT: __TEXT:__cstring:ar
				# CHECK-NEXT: __TEXT:__cstring:r

				## Make sure the symbol addresses are correct too.
				# CHECK: SYMBOL TABLE:
				# CHECK-DAG: [[#%.16x,FOO:]] l O __TEXT,__cstring _local_foo1
				# CHECK-DAG: [[#FOO]] l O __TEXT,__cstring _local_foo2
				# CHECK-DAG: [[#FOO]] g O __TEXT,__cstring _globl_foo1
				# CHECK-DAG: [[#FOO]] g O __TEXT,__cstring _globl_foo2
				# CHECK-DAG: [[#%.16x,BAR:]] l O __TEXT,__cstring _bar1
				# CHECK-DAG: [[#BAR]] l O __TEXT,__cstring _bar2
				# CHECK-DAG: [[#BAR+1]] l O __TEXT,__cstring _ar
				# CHECK-DAG: [[#BAR+2]] l O __TEXT,__cstring _r

				## Make sure we set the right alignment and flags.
				# HEADER: Name: __cstring
				# HEADER-NEXT: Segment: __TEXT
				# HEADER-NEXT: Address:
				# HEADER-NEXT: Size:
				# HEADER-NEXT: Offset:
				# HEADER-NEXT: Alignment: 4
				# HEADER-NEXT: RelocationOffset:
				# HEADER-NEXT: RelocationCount: 0
				# HEADER-NEXT: Type: CStringLiterals
				# HEADER-NEXT: Attributes [ (0x0)
				# HEADER-NEXT: ]
				# HEADER-NEXT: Reserved1: 0x0
				# HEADER-NEXT: Reserved2: 0x0
				# HEADER-NEXT: Reserved3: 0x0

				#--- test.s
				.cstring
				.p2align 2
				_local_foo1:
				.asciz "foo"
				_local_foo2:
				.asciz "foo"
				L_.foo1:
				.asciz "foo"
				L_.foo2:
				.asciz "foo"
				_bar1:
				.asciz "bar"
				gkm: Is there value in testing ...
				* Strings of length other than 3?
				* Zero length?
				* Non-null terminated?
				* Prefix matches? (e.g. "foo" and "fool", or "bar" and "barf")
				int3 (author): yeah I got lazy here... those are good suggestions. I don't think prefix matches are necessary since we are no longer doing tail merging, but the rest seem useful.
				_bar2:
				.asciz "bar"
				_ar:
				.asciz "ar"
				_r:
				.asciz "r"

				.text
				movq _r@GOTPCREL(%rip), %rax

				.section __DATA,ptrs,literal_pointers
				.quad L_.foo1
				.quad L_.foo2
				.quad _local_foo1
				.quad _local_foo2
				.quad _globl_foo1
				.quad _globl_foo2
				.quad _bar1
				.quad _bar2
				.quad _ar
				.quad _r

				#--- more-foo.s
				.globl _globl_foo1, _globl_foo2
				.cstring
				.p2align 4
				_globl_foo1:
				.asciz "foo"
				_globl_foo2:
				.asciz "foo"

lld/test/MachO/invalid/cstring-merging.s

This file was added.

				# REQUIRES: x86
				# RUN: rm -rf %t; split-file %s %t
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/not-terminated.s -o %t/not-terminated.o
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/relocs.s -o %t/relocs.o

				# RUN: not %lld -dylib %t/not-terminated.o 2>&1 | FileCheck %s --check-prefix=TERM
				# RUN: not %lld -dylib %t/relocs.o 2>&1 | FileCheck %s --check-prefix=RELOCS

				# TERM: not-terminated.o:(__cstring): string is not null terminated
				# RELOCS: relocs.o contains relocations in a cstring_literals section, so LLD cannot do string merging. Try re-running with --no-literal-merge.

				## Make sure things still work reasonably if literal merging is disabled
				# RUN: %lld -dylib --no-literal-merge %t/relocs.o -o %t/relocs
				# RUN: llvm-objdump --macho --syms --full-contents %t/relocs | FileCheck %s

				# CHECK: Contents of section __TEXT,__cstring:
				# CHECK: 02d8 666f6f00 d8020000 00000000

				# CHECK: SYMBOL TABLE:
				# CHECK: 00000000000002d8 l O __TEXT,__cstring _str

				#--- not-terminated.s
				.cstring
				.asciz "foo"
				.ascii "oh no"

				#--- relocs.s
				.cstring
				_str:
				.asciz "foo"
				.quad _str
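The `not-terminated.s` case above exercises the check that every piece of a cstring section ends in a NUL byte. A minimal sketch of such a splitter follows. It is an assumption about the general approach, not LLD's actual code: `Piece` and `splitCStrings` are hypothetical names, and the real linker reports the error through its own diagnostics and does not throw.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <string_view>
#include <vector>

// Hypothetical piece record: where the string starts in the input section.
struct Piece {
  size_t inSecOff;
  std::string_view str; // contents without the trailing NUL
};

// Splits raw section data into NUL-terminated pieces.
// Throws if the final piece has no terminator.
std::vector<Piece> splitCStrings(std::string_view data) {
  std::vector<Piece> pieces;
  size_t off = 0;
  while (off < data.size()) {
    size_t end = data.find('\0', off);
    if (end == std::string_view::npos)
      throw std::runtime_error("string is not null terminated");
    assert(end >= off);
    pieces.push_back({off, data.substr(off, end - off)});
    off = end + 1; // skip past the terminator
  }
  return pieces;
}
```

An empty string (a lone NUL byte) still produces a piece of length zero, which covers the zero-length case raised in review.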

lld/test/MachO/invalid/reserved-section-name.s

	# REQUIRES: x86			# REQUIRES: x86
	# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %s -o %t.o
	# RUN: not %lld -o %t %t.o 2>&1 | FileCheck %s -DFILE=%t.o			# RUN: not %lld -o %t %t.o 2>&1 | FileCheck %s -DFILE=%t.o
	# CHECK: error: section from [[FILE]] conflicts with synthetic section __DATA_CONST,__got			# CHECK: error: section from [[FILE]] conflicts with synthetic section __DATA_CONST,__got

	.globl _main			.globl _main

	.section __DATA_CONST,__got			.section __DATA_CONST,__got
	.space 1			.space 1

				.data
				_foo:
				.space 1

	.text			.text
	_main:			_main:
	mov $0, %rax			## make sure the GOT will be needed
				pushq _foo@GOTPCREL(%rip)
	ret			ret

lld/test/MachO/load-command-sequence.s

	[22 lines not shown]
	# COMMON: cmd LC_SEGMENT_64			# COMMON: cmd LC_SEGMENT_64
	# COMMON: segname __TEXT			# COMMON: segname __TEXT
	# COMMON: sectname __text			# COMMON: sectname __text
	# COMMON: segname __TEXT			# COMMON: segname __TEXT
	# COMMON: sectname __cstring			# COMMON: sectname __cstring
	# COMMON: segname __TEXT			# COMMON: segname __TEXT
	# COMMON: cmd LC_SEGMENT_64			# COMMON: cmd LC_SEGMENT_64
	# COMMON: segname __DATA_CONST			# COMMON: segname __DATA_CONST
	# COMMON: sectname __const
	# COMMON: segname __DATA_CONST
	# COMMON: sectname __got			# COMMON: sectname __got
	# COMMON: segname __DATA_CONST			# COMMON: segname __DATA_CONST
				# COMMON: sectname __const
				# COMMON: segname __DATA_CONST
				int3 (author): while fixing section ordering issues, I noticed that ld64 orders `__const` after `__got`, hence this fix.
	# COMMON: cmd LC_SEGMENT_64			# COMMON: cmd LC_SEGMENT_64
	# COMMON: segname __DATA			# COMMON: segname __DATA
	# COMMON: sectname __data			# COMMON: sectname __data
	# COMMON: segname __DATA			# COMMON: segname __DATA
	# COMMON: cmd LC_SEGMENT_64			# COMMON: cmd LC_SEGMENT_64
	# COMMON: segname __LINKEDIT			# COMMON: segname __LINKEDIT
	# COMMON: cmd LC_DYLD_INFO_ONLY			# COMMON: cmd LC_DYLD_INFO_ONLY
	# COMMON: cmd LC_SYMTAB			# COMMON: cmd LC_SYMTAB
	[29 lines not shown]

lld/test/MachO/section-order.s

This file was added.

				# REQUIRES: x86
				## Check that section ordering follows from input file ordering.
				# RUN: rm -rf %t; split-file %s %t
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/1.s -o %t/1.o
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/2.s -o %t/2.o
				# RUN: %lld -dylib %t/1.o %t/2.o -o %t/12
				# RUN: %lld -dylib %t/2.o %t/1.o -o %t/21
				# RUN: llvm-objdump --macho --section-headers %t/12 | FileCheck %s --check-prefix=CHECK-12
				# RUN: llvm-objdump --macho --section-headers %t/21 | FileCheck %s --check-prefix=CHECK-21

				# CHECK-12: __text
				# CHECK-12-NEXT: foo
				# CHECK-12-NEXT: bar
				# CHECK-12-NEXT: __cstring

				# CHECK-21: __text
				# CHECK-21-NEXT: __cstring
				# CHECK-21-NEXT: bar
				# CHECK-21-NEXT: foo

				#--- 1.s
				.section __TEXT,foo
				.space 1
				.section __TEXT,bar
				.space 1
				.cstring
				.asciz ""

				#--- 2.s
				.cstring
				.asciz ""
				.section __TEXT,bar
				.space 1
				.section __TEXT,foo
				.space 1

lld/test/MachO/subsections-section-relocs.s

	# REQUIRES: x86			# REQUIRES: x86
	# RUN: rm -rf %t; split-file %s %t			# RUN: rm -rf %t; split-file %s %t
	# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/test.s -o %t/test.o			# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/test.s -o %t/test.o

	# RUN: %lld -o %t/test %t/test.o -order_file %t/order-file			# RUN: %lld --no-literal-merge -o %t/test %t/test.o -order_file %t/order-file
	# RUN: llvm-objdump --section-headers -d --no-show-raw-insn %t/test | FileCheck %s			# RUN: llvm-objdump --section-headers -d --no-show-raw-insn %t/test | FileCheck %s
	# CHECK-LABEL: Sections:			# CHECK-LABEL: Sections:
	# CHECK: __cstring {{[^ ]*}} {{0*}}[[#%x, CSTRING_ADDR:]]			# CHECK: __cstring {{[^ ]*}} {{0*}}[[#%x, CSTRING_ADDR:]]
	# CHECK-LABEL: Disassembly of section __TEXT,__text:			# CHECK-LABEL: Disassembly of section __TEXT,__text:
	## L._str should end up at CSTRING_ADDR + 4, and leaq is 7 bytes long so we			## L._str should end up at CSTRING_ADDR + 4, and leaq is 7 bytes long so we
	## have RIP = ADDR + 7			## have RIP = ADDR + 7
	# CHECK: [[#%x, ADDR:]]: leaq			# CHECK: [[#%x, ADDR:]]: leaq
	# CHECK-SAME: [[#%u, CSTRING_ADDR + 4 - ADDR - 7]](%rip), %rsi {{.*}} <_bar_str+0x4>			# CHECK-SAME: [[#%u, CSTRING_ADDR + 4 - ADDR - 7]](%rip), %rsi {{.*}} <_bar_str+0x4>
	[20 lines not shown]
	_foo_str:			_foo_str:
	.asciz "foo"			.asciz "foo"

	_bar_str:			_bar_str:
	.asciz "bar"			.asciz "bar"

	## References to this generate a section relocation			## References to this generate a section relocation
	## N.B.: ld64 doesn't actually reorder symbols in __cstring based on the order			## N.B.: ld64 doesn't actually reorder symbols in __cstring based on the order
	## file. Only our implementation does. However, I'm not sure how else to			## file. Our implementation only does so if --no-literal-merge is
	## test section relocations that target an address inside a relocated			## specified. I'm not sure how else to test section relocations that
	## symbol: using a non-__cstring section would cause llvm-mc to emit a			## target an address inside a relocated symbol: using a non-__cstring
	## symbol relocation instead using the nearest symbol.			## section would cause llvm-mc to emit a symbol relocation instead using
				## the nearest symbol. It might be more consistent for LLD to disable
				## symbol-based cstring reordering altogether and leave this functionality
				## untested, at least until we find a real-world use case...
	L_.str:			L_.str:
	.asciz "Private symbol"			.asciz "Private symbol"

	.subsections_via_symbols			.subsections_via_symbols
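
The `CHECK-SAME` expression in this test, `CSTRING_ADDR + 4 - ADDR - 7`, is ordinary RIP-relative arithmetic: the displacement is measured from the address of the next instruction, which is the `leaq` address plus its 7-byte length. A minimal C++ sketch of that calculation follows. The helper name `ripRelDisp` and the addresses are hypothetical and chosen only for illustration; none of them come from LLD or from this test's actual output.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of LLD): the displacement stored in a
// RIP-relative instruction. RIP refers to the *next* instruction, i.e. the
// instruction's own address plus its length.
constexpr int64_t ripRelDisp(uint64_t target, uint64_t insnAddr,
                             uint64_t insnLen) {
  return static_cast<int64_t>(target) -
         static_cast<int64_t>(insnAddr + insnLen);
}

// Illustrative self-check using made-up addresses.
inline void checkRipRelDispExample() {
  const uint64_t cstringAddr = 0x100000400; // assumed start of __cstring
  const uint64_t leaqAddr = 0x100000300;    // assumed address of the leaq
  // Mirrors CSTRING_ADDR + 4 - ADDR - 7 from the CHECK-SAME line.
  assert(ripRelDisp(cstringAddr + 4, leaqAddr, 7) == 0xFD);
}
```

With these assumed addresses the target is 0x100000404 and RIP is 0x100000307, so the encoded displacement is 0xFD.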

lld/test/MachO/weak-binding.s

	(31 lines not shown)
	# CHECK-LABEL: Rebase table:			# CHECK-LABEL: Rebase table:
	# CHECK: __DATA __la_symbol_ptr 0x[[#%x,WEAK_EXT_FN:]] pointer			# CHECK: __DATA __la_symbol_ptr 0x[[#%x,WEAK_EXT_FN:]] pointer

	# CHECK-LABEL: Bind table:			# CHECK-LABEL: Bind table:
	# CHECK-DAG: __DATA_CONST __got 0x[[#WEAK_DY_GOT_ADDR]] pointer 0 libfoo _weak_dysym_for_gotpcrel			# CHECK-DAG: __DATA_CONST __got 0x[[#WEAK_DY_GOT_ADDR]] pointer 0 libfoo _weak_dysym_for_gotpcrel
	# CHECK-DAG: __DATA __la_symbol_ptr 0x[[#%x,WEAK_DY_FN:]] pointer 0 libfoo _weak_dysym_fn			# CHECK-DAG: __DATA __la_symbol_ptr 0x[[#%x,WEAK_DY_FN:]] pointer 0 libfoo _weak_dysym_fn
	# CHECK-DAG: __DATA __data 0x[[#%x,WEAK_DY:]] pointer 0 libfoo _weak_dysym			# CHECK-DAG: __DATA __data 0x[[#%x,WEAK_DY:]] pointer 0 libfoo _weak_dysym
	# CHECK-DAG: __DATA __thread_vars 0x{{[0-9a-f]*}} pointer 0 libSystem __tlv_bootstrap			# CHECK-DAG: __DATA __thread_vars 0x{{[0-9a-f]*}} pointer 0 libSystem __tlv_bootstrap
				# CHECK-DAG: __DATA __thread_vars 0x{{[0-9a-f]*}} pointer 0 libSystem __tlv_bootstrap
	# CHECK-DAG: __DATA __thread_ptrs 0x[[#WEAK_DY_TLV_ADDR]] pointer 0 libfoo _weak_dysym_tlv			# CHECK-DAG: __DATA __thread_ptrs 0x[[#WEAK_DY_TLV_ADDR]] pointer 0 libfoo _weak_dysym_tlv
	## Check that we don't have any other bindings			## FIXME: This was supposed to check that we don't have any other bindings in
				## the bind table, but in practice it just checks that we don't have a binding
				## immediately after the last -DAG match...
	# CHECK-NOT: pointer			# CHECK-NOT: pointer

	# CHECK-LABEL: Lazy bind table:			# CHECK-LABEL: Lazy bind table:
	## Verify that we have no lazy bindings			## Verify that we have no lazy bindings
	# CHECK-NOT: pointer			# CHECK-NOT: pointer

	# CHECK-LABEL: Weak bind table:			# CHECK-LABEL: Weak bind table:
	# CHECK-DAG: __DATA_CONST __got 0x[[#WEAK_DY_GOT_ADDR]] pointer 0 _weak_dysym_for_gotpcrel			# CHECK-DAG: __DATA_CONST __got 0x[[#WEAK_DY_GOT_ADDR]] pointer 0 _weak_dysym_for_gotpcrel
	(97 lines not shown)

lld/test/MachO/x86-64-relocs.s

	# REQUIRES: x86			# REQUIRES: x86
	# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %s -o %t.o
	# RUN: %lld -lSystem -o %t %t.o			# RUN: %lld -lSystem -o %t %t.o
	# RUN: llvm-objdump --section-headers --syms -d %t \| FileCheck %s			# RUN: llvm-objdump --section-headers --syms -d %t \| FileCheck %s

	# CHECK-LABEL: Sections:			# CHECK-LABEL: Sections:
	# CHECK: __cstring {{[0-9a-z]+}} [[#%x, CSTRING_ADDR:]]			# CHECK: __data {{[0-9a-z]+}} [[#%x, DATA_ADDR:]]

	# CHECK-LABEL: SYMBOL TABLE:			# CHECK-LABEL: SYMBOL TABLE:
	# CHECK: [[#%x, F_ADDR:]] {{.*}} _f			# CHECK: [[#%x, F_ADDR:]] {{.*}} _f

	# CHECK-LABEL: <_main>:			# CHECK-LABEL: <_main>:
	## Test X86_64_RELOC_BRANCH			## Test X86_64_RELOC_BRANCH
	# CHECK: callq 0x[[#%x, F_ADDR]] <_f>			# CHECK: callq 0x[[#%x, F_ADDR]] <_f>
	## Test extern (symbol) X86_64_RELOC_SIGNED			## Test extern (symbol) X86_64_RELOC_SIGNED
	# CHECK: leaq [[#%u, STR_OFF:]](%rip), %rsi			# CHECK: leaq [[#%u, LOCAL_OFF:]](%rip), %rsi
	# CHECK-NEXT: [[#%x, CSTRING_ADDR - STR_OFF]]			# CHECK-NEXT: [[#%x, DATA_ADDR - LOCAL_OFF]]
	## Test non-extern (section) X86_64_RELOC_SIGNED			## Test non-extern (section) X86_64_RELOC_SIGNED
	# CHECK: leaq [[#%u, LSTR_OFF:]](%rip), %rsi			# CHECK: leaq [[#%u, PRIVATE_OFF:]](%rip), %rsi
	# CHECK-NEXT: [[#%x, CSTRING_ADDR + 22 - LSTR_OFF]]			# CHECK-NEXT: [[#%x, DATA_ADDR + 8 - PRIVATE_OFF]]

	# RUN: llvm-objdump --section=__const --full-contents %t \| FileCheck %s --check-prefix=NONPCREL			# RUN: llvm-objdump --section=__const --full-contents %t \| FileCheck %s --check-prefix=NONPCREL
	# NONPCREL: Contents of section __DATA_CONST,__const:			# NONPCREL: Contents of section __DATA_CONST,__const:
	# NONPCREL-NEXT: 100001000 18040000 01000000 18040000 01000000			# NONPCREL-NEXT: 100001000 08200000 01000000 08200000 01000000

	.section __TEXT,__text			.section __TEXT,__text
	.globl _main, _f			.globl _main, _f
	_main:			_main:
	callq _f # X86_64_RELOC_BRANCH			callq _f # X86_64_RELOC_BRANCH
	mov $0, %rax			mov $0, %rax
	ret			ret

	_f:			_f:
	movl $0x2000004, %eax # write() syscall			leaq _local(%rip), %rsi # Generates a X86_64_RELOC_SIGNED pcrel symbol relocation
	mov $1, %rdi # stdout			leaq L_.private(%rip), %rsi # Generates a X86_64_RELOC_SIGNED pcrel section relocation
	leaq _str(%rip), %rsi # Generates a X86_64_RELOC_SIGNED pcrel symbol relocation			movq L_.ptr_1(%rip), %rsi
	mov $21, %rdx # length of str
	syscall

	movl $0x2000004, %eax # write() syscall
	mov $1, %rdi # stdout
	leaq L_.str(%rip), %rsi # Generates a X86_64_RELOC_SIGNED pcrel section relocation
	mov $15, %rdx # length of str
	syscall

	movl $0x2000004, %eax # write() syscall
	mov $1, %rdi # stdout
	movq L_.ptr_1_to_str(%rip), %rsi
	mov $15, %rdx # length of str
	syscall
	ret			ret

	.section __TEXT,__cstring			.data
	## References to this generate a symbol relocation			## References to this generate a symbol relocation
	_str:			_local:
	.asciz "Local defined symbol\n"			.quad 123
	## References to this generate a section relocation			## References to this generate a section relocation
	L_.str:			L_.private:
	.asciz "Private symbol\n"			.quad 123

	.section __DATA,__const			.section __DATA,__const
	## These generate X86_64_RELOC_UNSIGNED non-pcrel section relocations			## These generate X86_64_RELOC_UNSIGNED non-pcrel section relocations
	L_.ptr_1_to_str:			L_.ptr_1:
	.quad L_.str			.quad L_.private
	L_.ptr_2_to_str:			L_.ptr_2:
	.quad L_.str			.quad L_.private

Revision Contents

Diff 347173
