This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/
-
MachO/
-
ConcatOutputSection.h
-
ConcatOutputSection.cpp
-
Config.h
-
Driver.cpp
-
InputFiles.h
-
InputFiles.cpp
7/7
InputSection.h
-
InputSection.cpp
-
Options.td
-
Symbols.cpp
-
SyntheticSections.h
2/2
SyntheticSections.cpp
-
UnwindInfoSection.cpp
1/1
Writer.cpp
-
test/MachO/
-
MachO/
-
cstring-dedup.s
-
invalid/
-
cstring-dedup.s
-
reserved-section-name.s
-
subsections-section-relocs.s
-
x86-64-relocs.s

Differential D102964

[lld-macho] Implement cstring deduplication
ClosedPublic

Authored by int3 on May 21 2021, 8:35 PM.

Download Raw Diff

Details

Reviewers

gkm

Group Reviewers

Restricted Project

Commits

rG04259cde15a9: [lld-macho] Implement cstring deduplication

Summary

Our implementation draws heavily from LLD-ELF's, which in turn delegates
its string deduplication to llvm-mc's StringTableBuilder. The messiness of
this diff is largely due to the fact that we've previously assumed that
all InputSections get concatenated together to form the output. This is
no longer true with CStringInputSections, which split their contents into
StringPieces. StringPieces are much more lightweight than InputSections,
which is important as we create a lot of them. They may also overlap in
the output, which makes it possible for strings to be tail-merged. In
fact, the initial version of this diff implemented tail merging, but
I've dropped it for reasons I'll explain later.

Alignment Issues

Mergeable cstring literals are found under the __TEXT,__cstring
section. In contrast to ELF, which puts strings that need different
alignments into different sections, clang's Mach-O backend puts them all
in one section. Strings that need to be aligned have the .p2align
directive emitted before them, which simply translates into zero padding
in the object file.

I *think* ld64 extracts the desired per-string alignment from this data
by preserving each string's offset from the last section-aligned
address. I'm not entirely certain since it doesn't seem consistent about
doing this; but perhaps this can be chalked up to cases where ld64 has
to deduplicate strings with different offset/alignment combos -- it
seems to pick one of their alignments to preserve. This doesn't seem
correct in general; we can in fact can induce ld64 to produce a crashing
binary just by linking in an additional object file that only contains
cstrings and no code. See PR50563 for details.

Moreover, this scheme seems rather inefficient: since unaligned and
aligned strings are all put in the same section, which has a single
alignment value, it doesn't seem possible to tell whether a given string
doesn't have any alignment requirements. Preserving offset+alignments
for strings that don't need it is wasteful.

In practice, the crashes seen so far seem to stem from x86_64 SIMD
operations on cstrings. X86_64 requires SIMD accesses to be
16-byte-aligned. So for now, I'm thinking of just aligning all strings
to 16 bytes on x86_64. This is indeed wasteful, but implementation-wise
it's simpler than preserving per-string alignment+offsets. It also
avoids the aforementioned crash after deduplication of
differently-aligned strings. Finally, the overhead is not huge: using
16-byte alignment (vs no alignment) is only a 0.5% size overhead when
linking chromium_framework.

With these alignment requirements, it doesn't make sense to attempt tail
merging -- most strings will not be eligible since their overlaps aren't
likely to start at a 16-byte boundary. Tail-merging (with alignment) for
chromium_framework only improves size by 0.3%.

It's worth noting that LLD-ELF only does tail merging at -O2. By
default (at -O1), it just deduplicates w/o tail merging. @thakis has
also mentioned that they saw it regress compressed size in some cases
and therefore turned it off. ld64 does not seem to do tail merging at
all.

Performance Numbers

CString deduplication reduces chromium_framework from 250MB to 242MB, or
about a 3.2% reduction.

Numbers for linking chromium_framework on my 3.2 GHz 16-Core Intel Xeon W:

    N           Min           Max        Median           Avg        Stddev
x  20          3.91          4.03         3.935          3.95   0.034641016
+  20          3.99          4.14         4.015        4.0365     0.0492336
Difference at 95.0% confidence
        0.0865 +/- 0.027245
        2.18987% +/- 0.689746%
        (Student's t, pooled s = 0.0425673)

As expected, cstring merging incurs some non-trivial overhead.

When passing --no-literal-merge, it seems that performance is the
same, i.e. the refactoring in this diff didn't cost us.

    N           Min           Max        Median           Avg        Stddev
x  20          3.91          4.03         3.935          3.95   0.034641016
+  20          3.89          4.02         3.935        3.9435   0.043197831
No difference proven at 95.0% confidence

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

int3 created this revision.May 21 2021, 8:35 PM

Herald added a reviewer: gkm. · View Herald TranscriptMay 21 2021, 8:35 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: dang, mgrang, mgorny. · View Herald Transcript

int3 requested review of this revision.May 21 2021, 8:35 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 21 2021, 8:35 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

int3 edited the summary of this revision. (Show Details)May 21 2021, 8:35 PM

Harbormaster completed remote builds in B105742: Diff 347169.May 21 2021, 8:35 PM

update commit msg

Harbormaster completed remote builds in B105743: Diff 347170.May 21 2021, 8:38 PM

int3 edited the summary of this revision. (Show Details)May 21 2021, 8:49 PM

remove FIXME

Harbormaster completed remote builds in B105744: Diff 347171.May 21 2021, 8:54 PM

remove forward declaration

Harbormaster completed remote builds in B105745: Diff 347172.May 21 2021, 8:56 PM

int3 added inline comments.May 21 2021, 9:00 PM

lld/MachO/Writer.cpp
896–897	An earlier implementation of this diff always created the CStringLiteralSection, even if literal merging was disabled. I therefore hoisted out this check to avoid having a conflict between the unneeded CStringLiteralSection and the actual ConcatOutputSection when literal merging was not being done. We now only create the CStringLiteralSection as-needed, so this is likely unnecessary. However, I think it still makes sense to avoid unnecessary section name conflicts, so I've left it in.

Harbormaster completed remote builds in B105746: Diff 347173.May 21 2021, 10:28 PM

int3 added inline comments.May 21 2021, 11:51 PM

lld/test/MachO/load-command-sequence.s
33–34 ↗	(On Diff #347173)	while fixing section ordering issues, I noticed that ld64 orders `__const` after `__got`, hence this fix.

int3 edited the summary of this revision. (Show Details)May 22 2021, 12:18 AM

int3 edited the summary of this revision. (Show Details)May 22 2021, 3:00 PM

gkm added a comment.May 22 2021, 3:21 PM

This comment was removed by gkm.

I think the class hierarchy change makes sense as part of this diff, since it's motivated by needs of CStringInputSection's implementation, but yeah I can split out some of the other parts

int3 mentioned this in D102971: [lld-macho][nfc] Rename MergedOutputSection to ConcatOutputSection.May 22 2021, 5:52 PM

int3 mentioned this in D102972: [lld-macho][nfc] Sort OutputSections based on explicit order of command-line inputs.

int3 added a parent revision: D102972: [lld-macho][nfc] Sort OutputSections based on explicit order of command-line inputs.May 22 2021, 5:53 PM

rebase

Harbormaster completed remote builds in B105777: Diff 347220.May 22 2021, 5:54 PM

(FWIW we implemented this in lld-link but then ended up turning it off in chromium since it increased compressed size: https://crbug.com/838449)

(see also D44504)

Huh, that makes sense I guess. I suppose we should plan on implementing dedup-without-merge in the future then. Shouldn't be hard to fit into the existing structure.

int3 mentioned this in rG33706191d88d: [lld-macho][nfc] Rename MergedOutputSection to ConcatOutputSection.May 25 2021, 11:59 AM

int3 mentioned this in rGfcab06bd85d1: [lld-macho][nfc] Sort OutputSections based on explicit order of command-line….

reword error message

need to double-check some correctness issues

Harbormaster completed remote builds in B106146: Diff 347764.May 25 2021, 1:51 PM

fix alignment issues

Herald added a subscriber: pengfei. · View Herald TranscriptJun 2 2021, 3:18 PM

Harbormaster completed remote builds in B107336: Diff 349397.Jun 2 2021, 4:00 PM

A random review might not be the best place for this, but:

It's great we're looking at size of the output binary!

Maybe it makes sense to look for lower-hanging fruit before implementing somewhat expensive things?

Here's a bloaty (https://github.com/google/bloaty) diff between ld64-linked Chromium Framework (after --, that's where the old version goes) and lld-linked Chromium Framework:

% ~/src/bloaty/bloaty 'Chromium Framework' --  'Chromium.app/Contents/Frameworks/Chromium Framework.framework/Versions/Current/Chromium Framework'
    FILE SIZE        VM SIZE
 --------------  --------------
  [NEW] +6.98Mi  [NEW] +6.98Mi    __DATA_CONST,__const
 +36e2% +6.42Mi +36e2% +6.42Mi    Rebase Info
   +62% +3.19Mi   +62% +3.19Mi    __TEXT,__cstring
  [NEW] +1.52Mi  [NEW] +1.52Mi    __TEXT,__literal16
  +139% +1.11Mi  +139% +1.11Mi    Function Start Addresses
  +1.5%  +940Ki  +1.5%  +940Ki    String Table
 +38e2%  +530Ki +38e2%  +530Ki    __TEXT,__eh_frame
   +27%  +124Ki   +27%  +124Ki    __TEXT,__objc_methtype
  +102%  +118Ki  +102%  +118Ki    __TEXT,__objc_methname
  +1.1%  +114Ki  +1.1%  +114Ki    Symbol Table
  [NEW] +88.0Ki  [NEW] +88.0Ki    __TEXT,__literal8
  +263% +80.9Ki  +263% +80.9Ki    Binding Info
  [NEW] +62.3Ki  [NEW] +62.3Ki    __TEXT,__literal4
  [NEW] +32.4Ki  [NEW] +32.4Ki    __DATA_CONST,__cfstring
  +136% +28.2Ki  +136% +28.2Ki    __DATA,__objc_selrefs
  +0.0% +8.38Ki  -0.0% -6.59Ki    [31 Others]
  [DEL] -26.8Ki  [DEL] -26.8Ki    __DATA,__cfstring
  [DEL] -53.4Ki  [DEL] -53.4Ki    Table of Non-instructions
 -30.7% -57.9Ki -30.7% -57.9Ki    __DATA,__objc_const
  -1.0% -83.3Ki  -1.0% -83.3Ki    __TEXT,__const
  [DEL] -6.98Mi  [DEL] -6.98Mi    __DATA,__const
  +6.0% +14.1Mi  +5.9% +14.1Mi    TOTAL

__cstring is indeed on the list, but there are other things before it. The _DATA_CONST looks like it's just in __DATA in ld64 and just moved around (see 2nd-to-last line), but our rebase info and LC_FUNCTION_STARTS sections are way larger and possibly easier to fix (LC_FUNCTION_STARTS is 2003880 vs 838208 -- that's 1.2 MB that are likely a cheap fix).

3% smaller is great, but 2% slower isn't exactly cheap. It's not super expensive either, but 10% here and 10% there and suddenly you take twice as long. Being much faster is one of the big selling points of lld so we should try hard not to regress on that. Several thoughts on that:

Maybe this should be opt in (only at -O2, and/or lto or what)? People who really want optimized binaries over link time probably do (thin) LTO.
If the main cost is the hash:
1. That should parallelize well
2. Is there some way we could compute the string hash at compile time and stash it somewhere? (Similar in idea to http://blog.llvm.org/2018/01/improving-link-time-on-windows-with.html)

(Some of this also applies to the ICF patch – looks like we're doing a more thorough job than ld64 with ICF but it's also more expensive.)

Ah, I should probably have added a bit more motivation. The internal program I've been analyzing has significant size overhead from duplicated CFStrings. These CFStrings are essentially boxed cstrings, with an additional field that needs to be bound by dyld. As such, they bloat not just the __cfstring section but also the binding info. I didn't quantify exactly how much of the binding info could be attributed to them, but it seemed significant.

Ultimately, I think we'll have ICF dedup these CFStrings, but in order to do so we must first dedup the cstrings they point to. Hence this diff.

I'm fine with turning merging off by default for now, until we get it integrated with ICF for a bigger win. And maybe only turn it on together with ICF. How does that sound?

In terms of prioritization, I'd like to keep the implementation of these optimizations simple for now, until we are sure that they are operating correctly. (E.g. as the commit message indicates, I uncovered alignment issues while implementing this, and I'm still not entirely sure this is the best way to handle them.) I think parallelization can wait till we're more certain that the output works...

try to fix test on linux

Harbormaster completed remote builds in B108058: Diff 350397.Jun 7 2021, 1:45 PM

LGTM

lld/MachO/InputSection.h
102	Why are we truncating 64-bit hashes to 32 bits? Because the low-order 32 bits are sufficient, and it's more important that `StringPiece` be 16 bytes vs. 24 bytes?
lld/MachO/SyntheticSections.cpp
1081–1082	The extremity of the target-dependent difference in alignment requirement is surprising, and worthy of a comment.
lld/test/MachO/cstring-merging.s
54–65 ↗	(On Diff #350397)	Is there value in testing ... Strings of length other than 3? Zero length? Non-null terminated? Prefix matches? (e.g. "foo" and "fool", or "bar" and "barf")

This revision is now accepted and ready to land.Jun 7 2021, 3:34 PM

alexander-shaposhnikov added a subscriber: alexander-shaposhnikov.Jun 7 2021, 3:48 PM

alexander-shaposhnikov added inline comments.

lld/MachO/InputSection.h
101	would be good to add comments for these fields (inSecOff, outSecOff)

alexander-shaposhnikov added inline comments.Jun 7 2021, 4:03 PM

lld/MachO/InputSection.h
76	explicit
124	khm, wouldn't const StringPiece &getStringPiece(uint64_t offset) const be a cleaner interface ?
142	does it need to be `public` ?

int3 marked an inline comment as done.Jun 7 2021, 7:19 PM

int3 added inline comments.

lld/MachO/InputSection.h
102	This was copied from LLD-ELF's implementation, and yeah the motivation is to reduce the memory cost. I'll copy over the comment too...
142	`CStringSection::finalize()` needs it to be public

address comments + disable literal dedup by default, per @thakis' suggestion

This revision was landed with ongoing or failed builds.Jun 7 2021, 8:48 PM

Closed by commit rG04259cde15a9: [lld-macho] Implement cstring deduplication (authored by int3). · Explain Why

This revision was automatically updated to reflect the committed changes.

int3 added a commit: rG04259cde15a9: [lld-macho] Implement cstring deduplication.

Harbormaster completed remote builds in B108125: Diff 350483.Jun 7 2021, 9:19 PM

int3 added inline comments.Jun 7 2021, 9:29 PM

lld/MachO/SyntheticSections.cpp
1081–1082	Good point. I've copied the relevant bits of the commit message.
lld/test/MachO/cstring-merging.s
54–65 ↗	(On Diff #350397)	yeah I got lazy here... those are good suggestions. I don't think prefix matches are necessary since we are no longer doing tail merging, but the rest seem useful.

int3 mentioned this in D104159: [not for review][lld-macho] Simple cstring literal implementation.Jun 11 2021, 4:33 PM

Revision Contents

Path

Size

lld/

MachO/

ConcatOutputSection.h

14 lines

ConcatOutputSection.cpp

12 lines

1 line

3 lines

4 lines

75 lines

91 lines

54 lines

1 line

4 lines

21 lines

SyntheticSections.cpp

68 lines

UnwindInfoSection.cpp

4 lines

Writer.cpp

34 lines

test/

MachO/

cstring-dedup.s

107 lines

invalid/

cstring-dedup.s

21 lines

reserved-section-name.s

7 lines

subsections-section-relocs.s

11 lines

x86-64-relocs.s

50 lines

Diff 350484

lld/MachO/ConcatOutputSection.h

	Show All 23 Lines
	// files that are labeled with the same segment and section name. This class			// files that are labeled with the same segment and section name. This class
	// contains all such sections and writes the data from each section sequentially			// contains all such sections and writes the data from each section sequentially
	// in the final binary.			// in the final binary.
	class ConcatOutputSection : public OutputSection {			class ConcatOutputSection : public OutputSection {
	public:			public:
	explicit ConcatOutputSection(StringRef name)			explicit ConcatOutputSection(StringRef name)
	: OutputSection(ConcatKind, name) {}			: OutputSection(ConcatKind, name) {}

	const InputSection *firstSection() const { return inputs.front(); }			const ConcatInputSection *firstSection() const { return inputs.front(); }
	const InputSection *lastSection() const { return inputs.back(); }			const ConcatInputSection *lastSection() const { return inputs.back(); }

	// These accessors will only be valid after finalizing the section			// These accessors will only be valid after finalizing the section
	uint64_t getSize() const override { return size; }			uint64_t getSize() const override { return size; }
	uint64_t getFileSize() const override { return fileSize; }			uint64_t getFileSize() const override { return fileSize; }

	void addInput(InputSection *input);			void addInput(ConcatInputSection *input);
	void finalize() override;			void finalize() override;
	bool needsThunks() const;			bool needsThunks() const;
	uint64_t estimateStubsInRangeVA(size_t callIdx) const;			uint64_t estimateStubsInRangeVA(size_t callIdx) const;

	void writeTo(uint8_t *buf) const override;			void writeTo(uint8_t *buf) const override;

	std::vector<InputSection *> inputs;			std::vector<ConcatInputSection *> inputs;
	std::vector<InputSection *> thunks;			std::vector<ConcatInputSection *> thunks;

	static bool classof(const OutputSection *sec) {			static bool classof(const OutputSection *sec) {
	return sec->kind() == ConcatKind;			return sec->kind() == ConcatKind;
	}			}

	private:			private:
	void mergeFlags(InputSection *input);			void mergeFlags(InputSection *input);

	size_t size = 0;			size_t size = 0;
	uint64_t fileSize = 0;			uint64_t fileSize = 0;
	};			};

	// We maintain one ThunkInfo per real function.			// We maintain one ThunkInfo per real function.
	//			//
	// The "active thunk" is represented by the sym/isec pair that			// The "active thunk" is represented by the sym/isec pair that
	// turns-over during finalize(): as the call-site address advances,			// turns-over during finalize(): as the call-site address advances,
	// the active thunk goes out of branch-range, and we create a new			// the active thunk goes out of branch-range, and we create a new
	// thunk to take its place.			// thunk to take its place.
	//			//
	// The remaining members -- bools and counters -- apply to the			// The remaining members -- bools and counters -- apply to the
	// collection of thunks associated with the real function.			// collection of thunks associated with the real function.

	struct ThunkInfo {			struct ThunkInfo {
	// These denote the active thunk:			// These denote the active thunk:
	Defined *sym = nullptr; // private-extern symbol for active thunk			Defined *sym = nullptr; // private-extern symbol for active thunk
	InputSection *isec = nullptr; // input section for active thunk			ConcatInputSection *isec = nullptr; // input section for active thunk

	// The following values are cumulative across all thunks on this function			// The following values are cumulative across all thunks on this function
	uint32_t callSiteCount = 0; // how many calls to the real function?			uint32_t callSiteCount = 0; // how many calls to the real function?
	uint32_t callSitesUsed = 0; // how many call sites processed so-far?			uint32_t callSitesUsed = 0; // how many call sites processed so-far?
	uint32_t thunkCallCount = 0; // how many call sites went to thunk?			uint32_t thunkCallCount = 0; // how many call sites went to thunk?
	uint8_t sequence = 0; // how many thunks created so-far?			uint8_t sequence = 0; // how many thunks created so-far?
	};			};

	extern llvm::DenseMap<Symbol *, ThunkInfo> thunkMap;			extern llvm::DenseMap<Symbol *, ThunkInfo> thunkMap;

	} // namespace macho			} // namespace macho
	} // namespace lld			} // namespace lld

	#endif			#endif

lld/MachO/ConcatOutputSection.cpp

Show All 19 Lines

#include <algorithm>		#include <algorithm>

using namespace llvm;		using namespace llvm;
using namespace llvm::MachO;		using namespace llvm::MachO;
using namespace lld;		using namespace lld;
using namespace lld::macho;		using namespace lld::macho;

void ConcatOutputSection::addInput(InputSection *input) {		void ConcatOutputSection::addInput(ConcatInputSection *input) {
if (inputs.empty()) {		if (inputs.empty()) {
align = input->align;		align = input->align;
flags = input->flags;		flags = input->flags;
} else {		} else {
align = std::max(align, input->align);		align = std::max(align, input->align);
mergeFlags(input);		mergeFlags(input);
}		}
inputs.push_back(input);		inputs.push_back(input);
▲ Show 20 Lines • Show All 112 Lines • ▼ Show 20 Lines	bool ConcatOutputSection::needsThunks() const {
return true;		return true;
}		}

// Since __stubs is placed after __text, we must estimate the address		// Since __stubs is placed after __text, we must estimate the address
// beyond which stubs are within range of a simple forward branch.		// beyond which stubs are within range of a simple forward branch.
uint64_t ConcatOutputSection::estimateStubsInRangeVA(size_t callIdx) const {		uint64_t ConcatOutputSection::estimateStubsInRangeVA(size_t callIdx) const {
uint64_t branchRange = target->branchRange;		uint64_t branchRange = target->branchRange;
size_t endIdx = inputs.size();		size_t endIdx = inputs.size();
InputSection *isec = inputs[callIdx];		ConcatInputSection *isec = inputs[callIdx];
uint64_t isecVA = isec->getVA();		uint64_t isecVA = isec->getVA();
// Tally the non-stub functions which still have call sites		// Tally the non-stub functions which still have call sites
// remaining to process, which yields the maximum number		// remaining to process, which yields the maximum number
// of thunks we might yet place.		// of thunks we might yet place.
size_t maxPotentialThunks = 0;		size_t maxPotentialThunks = 0;
for (auto &tp : thunkMap) {		for (auto &tp : thunkMap) {
ThunkInfo &ti = tp.second;		ThunkInfo &ti = tp.second;
maxPotentialThunks +=		maxPotentialThunks +=
Show All 17 Lines	log("thunks = " + std::to_string(thunkMap.size()) +
", tail = " + to_hexString(isecEnd - isecVA) +		", tail = " + to_hexString(isecEnd - isecVA) +
", slop = " + to_hexString(branchRange - (isecEnd - isecVA)));		", slop = " + to_hexString(branchRange - (isecEnd - isecVA)));
return stubsInRangeVA;		return stubsInRangeVA;
}		}

void ConcatOutputSection::finalize() {		void ConcatOutputSection::finalize() {
uint64_t isecAddr = addr;		uint64_t isecAddr = addr;
uint64_t isecFileOff = fileOff;		uint64_t isecFileOff = fileOff;
auto finalizeOne = [&](InputSection *isec) {		auto finalizeOne = [&](ConcatInputSection *isec) {
isecAddr = alignTo(isecAddr, isec->align);		isecAddr = alignTo(isecAddr, isec->align);
isecFileOff = alignTo(isecFileOff, isec->align);		isecFileOff = alignTo(isecFileOff, isec->align);
isec->outSecOff = isecAddr - addr;		isec->outSecOff = isecAddr - addr;
isec->outSecFileOff = isecFileOff - fileOff;		isec->outSecFileOff = isecFileOff - fileOff;
isec->isFinal = true;		isec->isFinal = true;
isecAddr += isec->getSize();		isecAddr += isec->getSize();
isecFileOff += isec->getFileSize();		isecFileOff += isec->getFileSize();
};		};

if (!needsThunks()) {		if (!needsThunks()) {
for (InputSection *isec : inputs)		for (ConcatInputSection *isec : inputs)
finalizeOne(isec);		finalizeOne(isec);
size = isecAddr - addr;		size = isecAddr - addr;
fileSize = isecFileOff - fileOff;		fileSize = isecFileOff - fileOff;
return;		return;
}		}

uint64_t branchRange = target->branchRange;		uint64_t branchRange = target->branchRange;
uint64_t stubsInRangeVA = TargetInfo::outOfRangeVA;		uint64_t stubsInRangeVA = TargetInfo::outOfRangeVA;
size_t thunkSize = target->thunkSize;		size_t thunkSize = target->thunkSize;
size_t relocCount = 0;		size_t relocCount = 0;
size_t callSiteCount = 0;		size_t callSiteCount = 0;
size_t thunkCallCount = 0;		size_t thunkCallCount = 0;
size_t thunkCount = 0;		size_t thunkCount = 0;

// inputs[finalIdx] is for finalization (address-assignment)		// inputs[finalIdx] is for finalization (address-assignment)
size_t finalIdx = 0;		size_t finalIdx = 0;
// Kick-off by ensuring that the first input section has an address		// Kick-off by ensuring that the first input section has an address
for (size_t callIdx = 0, endIdx = inputs.size(); callIdx < endIdx;		for (size_t callIdx = 0, endIdx = inputs.size(); callIdx < endIdx;
++callIdx) {		++callIdx) {
if (finalIdx == callIdx)		if (finalIdx == callIdx)
finalizeOne(inputs[finalIdx++]);		finalizeOne(inputs[finalIdx++]);
InputSection *isec = inputs[callIdx];		ConcatInputSection *isec = inputs[callIdx];
assert(isec->isFinal);		assert(isec->isFinal);
uint64_t isecVA = isec->getVA();		uint64_t isecVA = isec->getVA();
// Assign addresses up-to the forward branch-range limit		// Assign addresses up-to the forward branch-range limit
while (finalIdx < endIdx &&		while (finalIdx < endIdx &&
isecAddr + inputs[finalIdx]->getSize() < isecVA + branchRange)		isecAddr + inputs[finalIdx]->getSize() < isecVA + branchRange)
finalizeOne(inputs[finalIdx++]);		finalizeOne(inputs[finalIdx++]);
if (isec->callSiteCount == 0)		if (isec->callSiteCount == 0)
continue;		continue;
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	for (Reloc &r : reverse(relocs)) {
// isecAddr and the distance between subsequent call sites is		// isecAddr and the distance between subsequent call sites is
// smaller than thunkSize, then a new thunk can go out of		// smaller than thunkSize, then a new thunk can go out of
// range. Fix by unfinalizing inputs[finalIdx] to reduce the		// range. Fix by unfinalizing inputs[finalIdx] to reduce the
// distance between callVA and highVA, then shift some thunks		// distance between callVA and highVA, then shift some thunks
// to occupy address-space formerly occupied by the		// to occupy address-space formerly occupied by the
// unfinalized inputs[finalIdx].		// unfinalized inputs[finalIdx].
fatal(Twine(__FUNCTION__) + ": FIXME: thunk range overrun");		fatal(Twine(__FUNCTION__) + ": FIXME: thunk range overrun");
}		}
thunkInfo.isec = make<InputSection>();		thunkInfo.isec = make<ConcatInputSection>();
thunkInfo.isec->name = isec->name;		thunkInfo.isec->name = isec->name;
thunkInfo.isec->segname = isec->segname;		thunkInfo.isec->segname = isec->segname;
thunkInfo.isec->parent = this;		thunkInfo.isec->parent = this;
StringRef thunkName = saver.save(funcSym->getName() + ".thunk." +		StringRef thunkName = saver.save(funcSym->getName() + ".thunk." +
std::to_string(thunkInfo.sequence++));		std::to_string(thunkInfo.sequence++));
r.referent = thunkInfo.sym = symtab->addDefined(		r.referent = thunkInfo.sym = symtab->addDefined(
thunkName, /file=/nullptr, thunkInfo.isec, /value=/0,		thunkName, /file=/nullptr, thunkInfo.isec, /value=/0,
/size=/thunkSize, /isWeakDef=/false, /isPrivateExtern=/true,		/size=/thunkSize, /isWeakDef=/false, /isPrivateExtern=/true,
▲ Show 20 Lines • Show All 61 Lines • Show Last 20 Lines

lld/MachO/Config.h

Show First 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	struct Configuration {
bool searchDylibsFirst = false;		bool searchDylibsFirst = false;
bool saveTemps = false;		bool saveTemps = false;
bool adhocCodesign = false;		bool adhocCodesign = false;
bool emitFunctionStarts = false;		bool emitFunctionStarts = false;
bool emitBitcodeBundle = false;		bool emitBitcodeBundle = false;
bool emitEncryptionInfo = false;		bool emitEncryptionInfo = false;
bool timeTraceEnabled = false;		bool timeTraceEnabled = false;
bool dataConst = false;		bool dataConst = false;
		bool dedupLiterals = true;
uint32_t headerPad;		uint32_t headerPad;
uint32_t dylibCompatibilityVersion = 0;		uint32_t dylibCompatibilityVersion = 0;
uint32_t dylibCurrentVersion = 0;		uint32_t dylibCurrentVersion = 0;
uint32_t timeTraceGranularity = 500;		uint32_t timeTraceGranularity = 500;
std::string progName;		std::string progName;
llvm::StringRef installName;		llvm::StringRef installName;
llvm::StringRef mapFile;		llvm::StringRef mapFile;
llvm::StringRef outputFile;		llvm::StringRef outputFile;
▲ Show 20 Lines • Show All 57 Lines • Show Last 20 Lines

lld/MachO/Driver.cpp

Show First 20 Lines • Show All 523 Lines • ▼ Show 20 Lines
// any CommonSymbols.		// any CommonSymbols.
static void replaceCommonSymbols() {		static void replaceCommonSymbols() {
TimeTraceScope timeScope("Replace common symbols");		TimeTraceScope timeScope("Replace common symbols");
for (Symbol *sym : symtab->getSymbols()) {		for (Symbol *sym : symtab->getSymbols()) {
auto *common = dyn_cast<CommonSymbol>(sym);		auto *common = dyn_cast<CommonSymbol>(sym);
if (common == nullptr)		if (common == nullptr)
continue;		continue;

auto *isec = make<InputSection>();		auto *isec = make<ConcatInputSection>();
isec->file = common->getFile();		isec->file = common->getFile();
isec->name = section_names::common;		isec->name = section_names::common;
isec->segname = segment_names::data;		isec->segname = segment_names::data;
isec->align = common->align;		isec->align = common->align;
// Casting to size_t will truncate large values on 32-bit architectures,		// Casting to size_t will truncate large values on 32-bit architectures,
// but it's not really worth supporting the linking of 64-bit programs on		// but it's not really worth supporting the linking of 64-bit programs on
// 32-bit archs.		// 32-bit archs.
isec->data = {nullptr, static_cast<size_t>(common->size)};		isec->data = {nullptr, static_cast<size_t>(common->size)};
▲ Show 20 Lines • Show All 491 Lines • ▼ Show 20 Lines	bool macho::link(ArrayRef<const char *> argsArr, bool canExitEarly,
config->runtimePaths = args::getStrings(args, OPT_rpath);		config->runtimePaths = args::getStrings(args, OPT_rpath);
config->allLoad = args.hasArg(OPT_all_load);		config->allLoad = args.hasArg(OPT_all_load);
config->forceLoadObjC = args.hasArg(OPT_ObjC);		config->forceLoadObjC = args.hasArg(OPT_ObjC);
config->deadStripDylibs = args.hasArg(OPT_dead_strip_dylibs);		config->deadStripDylibs = args.hasArg(OPT_dead_strip_dylibs);
config->demangle = args.hasArg(OPT_demangle);		config->demangle = args.hasArg(OPT_demangle);
config->implicitDylibs = !args.hasArg(OPT_no_implicit_dylibs);		config->implicitDylibs = !args.hasArg(OPT_no_implicit_dylibs);
config->emitFunctionStarts = !args.hasArg(OPT_no_function_starts);		config->emitFunctionStarts = !args.hasArg(OPT_no_function_starts);
config->emitBitcodeBundle = args.hasArg(OPT_bitcode_bundle);		config->emitBitcodeBundle = args.hasArg(OPT_bitcode_bundle);
		config->dedupLiterals = args.hasArg(OPT_deduplicate_literals);

// FIXME: Add a commandline flag for this too.		// FIXME: Add a commandline flag for this too.
config->zeroModTime = getenv("ZERO_AR_DATE");		config->zeroModTime = getenv("ZERO_AR_DATE");

std::array<PlatformKind, 3> encryptablePlatforms{		std::array<PlatformKind, 3> encryptablePlatforms{
PlatformKind::iOS, PlatformKind::watchOS, PlatformKind::tvOS};		PlatformKind::iOS, PlatformKind::watchOS, PlatformKind::tvOS};
config->emitEncryptionInfo =		config->emitEncryptionInfo =
args.hasFlag(OPT_encryptable, OPT_no_encryption,		args.hasFlag(OPT_encryptable, OPT_no_encryption,
▲ Show 20 Lines • Show All 274 Lines • Show Last 20 Lines

lld/MachO/InputFiles.h

	Show All 33 Lines
	} // namespace MachO			} // namespace MachO
	class TarWriter;			class TarWriter;
	} // namespace llvm			} // namespace llvm

	namespace lld {			namespace lld {
	namespace macho {			namespace macho {

	struct PlatformInfo;			struct PlatformInfo;
	class InputSection;			class ConcatInputSection;
	class Symbol;			class Symbol;
	struct Reloc;			struct Reloc;
	enum class RefState : uint8_t;			enum class RefState : uint8_t;

	// If --reproduce option is given, all input files are written			// If --reproduce option is given, all input files are written
	// to this tar archive.			// to this tar archive.
	extern std::unique_ptr<llvm::TarWriter> tar;			extern std::unique_ptr<llvm::TarWriter> tar;

	▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
	// .o file			// .o file
	class ObjFile : public InputFile {			class ObjFile : public InputFile {
	public:			public:
	ObjFile(MemoryBufferRef mb, uint32_t modTime, StringRef archiveName);			ObjFile(MemoryBufferRef mb, uint32_t modTime, StringRef archiveName);
	static bool classof(const InputFile *f) { return f->kind() == ObjKind; }			static bool classof(const InputFile *f) { return f->kind() == ObjKind; }

	llvm::DWARFUnit *compileUnit = nullptr;			llvm::DWARFUnit *compileUnit = nullptr;
	const uint32_t modTime;			const uint32_t modTime;
	std::vector<InputSection *> debugSections;			std::vector<ConcatInputSection *> debugSections;

	private:			private:
	template <class LP> void parse();			template <class LP> void parse();
	template <class Section> void parseSections(ArrayRef<Section>);			template <class Section> void parseSections(ArrayRef<Section>);
	template <class LP>			template <class LP>
	void parseSymbols(ArrayRef<typename LP::section> sectionHeaders,			void parseSymbols(ArrayRef<typename LP::section> sectionHeaders,
	ArrayRef<typename LP::nlist> nList, const char *strtab,			ArrayRef<typename LP::nlist> nList, const char *strtab,
	bool subsectionsViaSymbols);			bool subsectionsViaSymbols);
	▲ Show 20 Lines • Show All 138 Lines • Show Last 20 Lines

lld/MachO/InputFiles.cpp

Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
#include "ExportTrie.h"		#include "ExportTrie.h"
#include "InputSection.h"		#include "InputSection.h"
#include "MachOStructs.h"		#include "MachOStructs.h"
#include "ObjC.h"		#include "ObjC.h"
#include "OutputSection.h"		#include "OutputSection.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "SymbolTable.h"		#include "SymbolTable.h"
#include "Symbols.h"		#include "Symbols.h"
		#include "SyntheticSections.h"
#include "Target.h"		#include "Target.h"

#include "lld/Common/DWARF.h"		#include "lld/Common/DWARF.h"
#include "lld/Common/ErrorHandler.h"		#include "lld/Common/ErrorHandler.h"
#include "lld/Common/Memory.h"		#include "lld/Common/Memory.h"
#include "lld/Common/Reproduce.h"		#include "lld/Common/Reproduce.h"
#include "llvm/ADT/iterator.h"		#include "llvm/ADT/iterator.h"
#include "llvm/BinaryFormat/MachO.h"		#include "llvm/BinaryFormat/MachO.h"
▲ Show 20 Lines • Show All 172 Lines • ▼ Show 20 Lines	Optional<MemoryBufferRef> macho::readFile(StringRef path) {
error("unable to find matching architecture in " + path);		error("unable to find matching architecture in " + path);
return None;		return None;
}		}

InputFile::InputFile(Kind kind, const InterfaceFile &interface)		InputFile::InputFile(Kind kind, const InterfaceFile &interface)
: id(idCount++), fileKind(kind), name(saver.save(interface.getPath())) {}		: id(idCount++), fileKind(kind), name(saver.save(interface.getPath())) {}

template <class Section>		template <class Section>
void ObjFile::parseSections(ArrayRef<Section> sections) {		static void parseSection(ObjFile file, const uint8_t buf, const Section &sec,
subsections.reserve(sections.size());		InputSection *isec) {
auto buf = reinterpret_cast<const uint8_t >(mb.getBufferStart());		isec->file = file;

for (const Section &sec : sections) {
InputSection *isec = make<InputSection>();
isec->file = this;
isec->name =		isec->name =
StringRef(sec.sectname, strnlen(sec.sectname, sizeof(sec.sectname)));		StringRef(sec.sectname, strnlen(sec.sectname, sizeof(sec.sectname)));
isec->segname =		isec->segname =
StringRef(sec.segname, strnlen(sec.segname, sizeof(sec.segname)));		StringRef(sec.segname, strnlen(sec.segname, sizeof(sec.segname)));
isec->data = {isZeroFill(sec.flags) ? nullptr : buf + sec.offset,		isec->data = {isZeroFill(sec.flags) ? nullptr : buf + sec.offset,
static_cast<size_t>(sec.size)};		static_cast<size_t>(sec.size)};
if (sec.align >= 32)		if (sec.align >= 32)
error("alignment " + std::to_string(sec.align) + " of section " +		error("alignment " + std::to_string(sec.align) + " of section " +
isec->name + " is too large");		isec->name + " is too large");
else		else
isec->align = 1 << sec.align;		isec->align = 1 << sec.align;
isec->flags = sec.flags;		isec->flags = sec.flags;
		}

		template <class Section>
		void ObjFile::parseSections(ArrayRef<Section> sections) {
		subsections.reserve(sections.size());
		auto buf = reinterpret_cast<const uint8_t >(mb.getBufferStart());

		for (const Section &sec : sections) {
		if (config->dedupLiterals && sectionType(sec.flags) == S_CSTRING_LITERALS) {
		if (sec.nreloc)
		fatal(toString(this) + " contains relocations in " + sec.segname + "," +
		sec.sectname +
		", so LLD cannot deduplicate literals. Try re-running without "
		"--deduplicate-literals.");

		auto *isec = make<CStringInputSection>();
		parseSection(this, buf, sec, isec);
		isec->splitIntoPieces(); // FIXME: parallelize this?
		subsections.push_back({{0, isec}});
		} else {
		auto *isec = make<ConcatInputSection>();
		parseSection(this, buf, sec, isec);
if (!(isDebugSection(isec->flags) &&		if (!(isDebugSection(isec->flags) &&
isec->segname == segment_names::dwarf)) {		isec->segname == segment_names::dwarf)) {
subsections.push_back({{0, isec}});		subsections.push_back({{0, isec}});
} else {		} else {
// Instead of emitting DWARF sections, we emit STABS symbols to the		// Instead of emitting DWARF sections, we emit STABS symbols to the
// object files that contain them. We filter them out early to avoid		// object files that contain them. We filter them out early to avoid
// parsing their relocations unnecessarily. But we must still push an		// parsing their relocations unnecessarily. But we must still push an
// empty map to ensure the indices line up for the remaining sections.		// empty map to ensure the indices line up for the remaining sections.
subsections.push_back({});		subsections.push_back({});
debugSections.push_back(isec);		debugSections.push_back(isec);
}		}
}		}
}		}
		}

// Find the subsection corresponding to the greatest section offset that is <=		// Find the subsection corresponding to the greatest section offset that is <=
// that of the given offset.		// that of the given offset.
//		//
// offset: an offset relative to the start of the original InputSection (before		// offset: an offset relative to the start of the original InputSection (before
// any subsection splitting has occurred). It will be updated to represent the		// any subsection splitting has occurred). It will be updated to represent the
// same location as an offset relative to the start of the containing		// same location as an offset relative to the start of the containing
// subsection.		// subsection.
▲ Show 20 Lines • Show All 311 Lines • ▼ Show 20 Lines	for (size_t j = 0; j < symbolIndices.size(); ++j) {
InputSection *isec = subsecEntry.isec;		InputSection *isec = subsecEntry.isec;

uint64_t subsecAddr = sectionAddr + subsecEntry.offset;		uint64_t subsecAddr = sectionAddr + subsecEntry.offset;
uint64_t symbolOffset = sym.n_value - subsecAddr;		uint64_t symbolOffset = sym.n_value - subsecAddr;
uint64_t symbolSize =		uint64_t symbolSize =
j + 1 < symbolIndices.size()		j + 1 < symbolIndices.size()
? nList[symbolIndices[j + 1]].n_value - sym.n_value		? nList[symbolIndices[j + 1]].n_value - sym.n_value
: isec->data.size() - symbolOffset;		: isec->data.size() - symbolOffset;
// There are 3 cases where we do not need to create a new subsection:		// There are 4 cases where we do not need to create a new subsection:
// 1. If the input file does not use subsections-via-symbols.		// 1. If the input file does not use subsections-via-symbols.
// 2. Multiple symbols at the same address only induce one subsection.		// 2. Multiple symbols at the same address only induce one subsection.
// (The symbolOffset == 0 check covers both this case as well as		// (The symbolOffset == 0 check covers both this case as well as
// the first loop iteration.)		// the first loop iteration.)
// 3. Alternative entry points do not induce new subsections.		// 3. Alternative entry points do not induce new subsections.
		// 4. If we have a literal section (e.g. __cstring and __literal4).
if (!subsectionsViaSymbols \|\| symbolOffset == 0 \|\|		if (!subsectionsViaSymbols \|\| symbolOffset == 0 \|\|
sym.n_desc & N_ALT_ENTRY) {		sym.n_desc & N_ALT_ENTRY \|\| !isa<ConcatInputSection>(isec)) {
symbols[symIndex] =		symbols[symIndex] =
createDefined(sym, name, isec, symbolOffset, symbolSize);		createDefined(sym, name, isec, symbolOffset, symbolSize);
continue;		continue;
}		}
		auto *concatIsec = cast<ConcatInputSection>(isec);

auto nextIsec = make<InputSection>(isec);		auto nextIsec = make<ConcatInputSection>(concatIsec);
nextIsec->data = isec->data.slice(symbolOffset);		nextIsec->data = isec->data.slice(symbolOffset);
nextIsec->numRefs = 0;		nextIsec->numRefs = 0;
nextIsec->wasCoalesced = false;		nextIsec->wasCoalesced = false;
isec->data = isec->data.slice(0, symbolOffset);		isec->data = isec->data.slice(0, symbolOffset);

// By construction, the symbol will be at offset zero in the new		// By construction, the symbol will be at offset zero in the new
// subsection.		// subsection.
symbols[symIndex] =		symbols[symIndex] =
createDefined(sym, name, nextIsec, /value=/0, symbolSize);		createDefined(sym, name, nextIsec, /value=/0, symbolSize);
// TODO: ld64 appears to preserve the original alignment as well as each		// TODO: ld64 appears to preserve the original alignment as well as each
// subsection's offset from the last aligned address. We should consider		// subsection's offset from the last aligned address. We should consider
// emulating that behavior.		// emulating that behavior.
nextIsec->align = MinAlign(sectionAlign, sym.n_value);		nextIsec->align = MinAlign(sectionAlign, sym.n_value);
subsecMap.push_back({sym.n_value - sectionAddr, nextIsec});		subsecMap.push_back({sym.n_value - sectionAddr, nextIsec});
subsecEntry = subsecMap.back();		subsecEntry = subsecMap.back();
}		}
}		}
}		}

OpaqueFile::OpaqueFile(MemoryBufferRef mb, StringRef segName,		OpaqueFile::OpaqueFile(MemoryBufferRef mb, StringRef segName,
StringRef sectName)		StringRef sectName)
: InputFile(OpaqueKind, mb) {		: InputFile(OpaqueKind, mb) {
InputSection *isec = make<InputSection>();		ConcatInputSection *isec = make<ConcatInputSection>();
isec->file = this;		isec->file = this;
isec->name = sectName.take_front(16);		isec->name = sectName.take_front(16);
isec->segname = segName.take_front(16);		isec->segname = segName.take_front(16);
const auto buf = reinterpret_cast<const uint8_t >(mb.getBufferStart());		const auto buf = reinterpret_cast<const uint8_t >(mb.getBufferStart());
isec->data = {buf, mb.getBufferSize()};		isec->data = {buf, mb.getBufferSize()};
isec->live = true;		isec->live = true;
subsections.push_back({{0, isec}});		subsections.push_back({{0, isec}});
}		}
▲ Show 20 Lines • Show All 521 Lines • Show Last 20 Lines

lld/MachO/InputSection.h

	//===- InputSection.h -------------------------------------------- C++ --===//			//===- InputSection.h -------------------------------------------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLD_MACHO_INPUT_SECTION_H			#ifndef LLD_MACHO_INPUT_SECTION_H
	#define LLD_MACHO_INPUT_SECTION_H			#define LLD_MACHO_INPUT_SECTION_H

	#include "Config.h"			#include "Config.h"
	#include "Relocations.h"			#include "Relocations.h"

	#include "lld/Common/LLVM.h"			#include "lld/Common/LLVM.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
				#include "llvm/ADT/CachedHashString.h"
	#include "llvm/BinaryFormat/MachO.h"			#include "llvm/BinaryFormat/MachO.h"

	namespace lld {			namespace lld {
	namespace macho {			namespace macho {

	class InputFile;			class InputFile;
	class OutputSection;			class OutputSection;

	class InputSection {			class InputSection {
	public:			public:
				enum Kind {
				ConcatKind,
				CStringLiteralKind,
				};

				Kind kind() const { return sectionKind; }
	virtual ~InputSection() = default;			virtual ~InputSection() = default;
	virtual uint64_t getSize() const { return data.size(); }			virtual uint64_t getSize() const { return data.size(); }
	uint64_t getFileSize() const;			uint64_t getFileSize() const;
	uint64_t getFileOffset() const;			// Translates \p off -- an offset relative to this InputSection -- into an
	uint64_t getVA() const;			// offset from the beginning of its parent OutputSection.
				virtual uint64_t getOffset(uint64_t off) const = 0;
				// The offset from the beginning of the file.
				virtual uint64_t getFileOffset(uint64_t off) const = 0;
				uint64_t getVA(uint64_t off) const;

	void writeTo(uint8_t *buf);			void writeTo(uint8_t *buf);

	InputFile *file = nullptr;			InputFile *file = nullptr;
	StringRef name;			StringRef name;
	StringRef segname;			StringRef segname;

	OutputSection *parent = nullptr;			OutputSection *parent = nullptr;
	uint64_t outSecOff = 0;
	uint64_t outSecFileOff = 0;

	uint32_t align = 1;			uint32_t align = 1;
	uint32_t flags = 0;			uint32_t flags = 0;
	uint32_t callSiteCount = 0;			uint32_t callSiteCount = 0;
	bool isFinal = false; // is address assigned?			bool isFinal = false; // is address assigned?

	// How many symbols refer to this InputSection.			// How many symbols refer to this InputSection.
	uint32_t numRefs = 0;			uint32_t numRefs = 0;

	// With subsections_via_symbols, most symbols have their own InputSection,			// With subsections_via_symbols, most symbols have their own InputSection,
	// and for weak symbols (e.g. from inline functions), only the			// and for weak symbols (e.g. from inline functions), only the
	// InputSection from one translation unit will make it to the output,			// InputSection from one translation unit will make it to the output,
	// while all copies in other translation units are coalesced into the			// while all copies in other translation units are coalesced into the
	// first and not copied to the output.			// first and not copied to the output.
	bool wasCoalesced = false;			bool wasCoalesced = false;

	bool isCoalescedWeak() const { return wasCoalesced && numRefs == 0; }			bool isCoalescedWeak() const { return wasCoalesced && numRefs == 0; }
	bool shouldOmitFromOutput() const { return !live \|\| isCoalescedWeak(); }			bool shouldOmitFromOutput() const { return !live \|\| isCoalescedWeak(); }

	bool live = !config->deadStrip;			bool live = !config->deadStrip;

	ArrayRef<uint8_t> data;			ArrayRef<uint8_t> data;
	std::vector<Reloc> relocs;			std::vector<Reloc> relocs;

				protected:
				explicit InputSection(Kind kind) : sectionKind(kind) {}
				alexander-shaposhnikovUnsubmitted Done Reply Inline Actions explicit alexander-shaposhnikov: explicit

				private:
				Kind sectionKind;
				};

				// ConcatInputSections are combined into (Concat)OutputSections through simple
				// concatentation, in contrast with literal sections which may have their
				// contents merged before output.
				class ConcatInputSection : public InputSection {
				public:
				ConcatInputSection() : InputSection(ConcatKind) {}
				uint64_t getFileOffset(uint64_t off) const override;
				uint64_t getOffset(uint64_t off) const override { return outSecOff + off; }
				uint64_t getVA() const { return InputSection::getVA(0); }

				static bool classof(const InputSection *isec) {
				return isec->kind() == ConcatKind;
				}

				uint64_t outSecOff = 0;
				uint64_t outSecFileOff = 0;
				};

				// We allocate a lot of these and binary search on them, so they should be as
				// compact as possible. Hence the use of 32 rather than 64 bits for the hash.
				alexander-shaposhnikovUnsubmitted Done Reply Inline Actions would be good to add comments for these fields (inSecOff, outSecOff) alexander-shaposhnikov: would be good to add comments for these fields (inSecOff, outSecOff)
				struct StringPiece {
				gkmUnsubmitted Done Reply Inline Actions Why are we truncating 64-bit hashes to 32 bits? Because the low-order 32 bits are sufficient, and it's more important that `StringPiece` be 16 bytes vs. 24 bytes? gkm: Why are we truncating 64-bit hashes to 32 bits? Because the low-order 32 bits are sufficient…
				int3AuthorUnsubmitted Done Reply Inline Actions This was copied from LLD-ELF's implementation, and yeah the motivation is to reduce the memory cost. I'll copy over the comment too... int3: This was copied from LLD-ELF's implementation, and yeah the motivation is to reduce the memory…
				// Offset from the start of the containing input section.
				uint32_t inSecOff;
				uint32_t hash;
				// Offset from the start of the containing output section.
				uint64_t outSecOff;

				StringPiece(uint64_t off, uint32_t hash) : inSecOff(off), hash(hash) {}
				};

				// CStringInputSections are composed of multiple null-terminated string
				// literals, which we represent using StringPieces. These literals can be
				// deduplicated and tail-merged, so translating offsets between the input and
				// outputs sections is more complicated.
				//
				// NOTE: One significant difference between LLD and ld64 is that we merge all
				// cstring literals, even those referenced directly by non-private symbols.
				// ld64 is more conservative and does not do that. This was mostly done for
				// implementation simplicity; if we find programs that need the more
				// conservative behavior we can certainly implement that.
				class CStringInputSection : public InputSection {
				public:
				CStringInputSection() : InputSection(CStringLiteralKind) {}
				alexander-shaposhnikovUnsubmitted Done Reply Inline Actions khm, wouldn't const StringPiece &getStringPiece(uint64_t offset) const be a cleaner interface ? alexander-shaposhnikov: khm, wouldn't ``` const StringPiece &getStringPiece(uint64_t offset) const ``` be a cleaner…
				uint64_t getFileOffset(uint64_t off) const override;
				uint64_t getOffset(uint64_t off) const override;
				// Find the StringPiece that contains this offset.
				const StringPiece &getStringPiece(uint64_t off) const;
				// Split at each null byte.
				void splitIntoPieces();

				// Returns i'th piece as a CachedHashStringRef. This function is very hot when
				// string merging is enabled, so we want to inline.
				LLVM_ATTRIBUTE_ALWAYS_INLINE
				llvm::CachedHashStringRef getCachedHashStringRef(size_t i) const {
				size_t begin = pieces[i].inSecOff;
				size_t end =
				(pieces.size() - 1 == i) ? data.size() : pieces[i + 1].inSecOff;
				return {toStringRef(data.slice(begin, end - begin)), pieces[i].hash};
				}

				static bool classof(const InputSection *isec) {
				alexander-shaposhnikovUnsubmitted Done Reply Inline Actions does it need to be `public` ? alexander-shaposhnikov: does it need to be `public` ?
				int3AuthorUnsubmitted Done Reply Inline Actions `CStringSection::finalize()` needs it to be public int3: `CStringSection::finalize()` needs it to be public
				return isec->kind() == CStringLiteralKind;
				}

				std::vector<StringPiece> pieces;
	};			};

	inline uint8_t sectionType(uint32_t flags) {			inline uint8_t sectionType(uint32_t flags) {
	return flags & llvm::MachO::SECTION_TYPE;			return flags & llvm::MachO::SECTION_TYPE;
	}			}

	inline bool isZeroFill(uint32_t flags) {			inline bool isZeroFill(uint32_t flags) {
	return llvm::MachO::isVirtualSection(sectionType(flags));			return llvm::MachO::isVirtualSection(sectionType(flags));
	Show All 19 Lines
	extern std::vector<InputSection *> inputSections;			extern std::vector<InputSection *> inputSections;

	namespace section_names {			namespace section_names {

	constexpr const char authGot[] = "__auth_got";			constexpr const char authGot[] = "__auth_got";
	constexpr const char authPtr[] = "__auth_ptr";			constexpr const char authPtr[] = "__auth_ptr";
	constexpr const char binding[] = "__binding";			constexpr const char binding[] = "__binding";
	constexpr const char bitcodeBundle[] = "__bundle";			constexpr const char bitcodeBundle[] = "__bundle";
				constexpr const char cString[] = "__cstring";
	constexpr const char cfString[] = "__cfstring";			constexpr const char cfString[] = "__cfstring";
	constexpr const char codeSignature[] = "__code_signature";			constexpr const char codeSignature[] = "__code_signature";
	constexpr const char common[] = "__common";			constexpr const char common[] = "__common";
	constexpr const char compactUnwind[] = "__compact_unwind";			constexpr const char compactUnwind[] = "__compact_unwind";
	constexpr const char data[] = "__data";			constexpr const char data[] = "__data";
	constexpr const char debugAbbrev[] = "__debug_abbrev";			constexpr const char debugAbbrev[] = "__debug_abbrev";
	constexpr const char debugInfo[] = "__debug_info";			constexpr const char debugInfo[] = "__debug_info";
	constexpr const char debugStr[] = "__debug_str";			constexpr const char debugStr[] = "__debug_str";
	▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

lld/MachO/InputSection.cpp

Show All 9 Lines
#include "InputFiles.h"		#include "InputFiles.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "Symbols.h"		#include "Symbols.h"
#include "SyntheticSections.h"		#include "SyntheticSections.h"
#include "Target.h"		#include "Target.h"
#include "Writer.h"		#include "Writer.h"
#include "lld/Common/Memory.h"		#include "lld/Common/Memory.h"
#include "llvm/Support/Endian.h"		#include "llvm/Support/Endian.h"
		#include "llvm/Support/xxhash.h"

using namespace llvm;		using namespace llvm;
using namespace llvm::MachO;		using namespace llvm::MachO;
using namespace llvm::support;		using namespace llvm::support;
using namespace lld;		using namespace lld;
using namespace lld::macho;		using namespace lld::macho;

std::vector<InputSection *> macho::inputSections;		std::vector<InputSection *> macho::inputSections;

uint64_t InputSection::getFileOffset() const {		uint64_t ConcatInputSection::getFileOffset(uint64_t off) const {
return parent->fileOff + outSecFileOff;		return parent->fileOff + outSecFileOff + off;
}		}

uint64_t InputSection::getFileSize() const {		uint64_t InputSection::getFileSize() const {
return isZeroFill(flags) ? 0 : getSize();		return isZeroFill(flags) ? 0 : getSize();
}		}

uint64_t InputSection::getVA() const { return parent->addr + outSecOff; }		uint64_t InputSection::getVA(uint64_t off) const {
		return parent->addr + getOffset(off);
		}

static uint64_t resolveSymbolVA(const Symbol *sym, uint8_t type) {		static uint64_t resolveSymbolVA(const Symbol *sym, uint8_t type) {
const RelocAttrs &relocAttrs = target->getRelocAttrs(type);		const RelocAttrs &relocAttrs = target->getRelocAttrs(type);
if (relocAttrs.hasAttr(RelocAttrBits::BRANCH))		if (relocAttrs.hasAttr(RelocAttrBits::BRANCH))
return sym->resolveBranchVA();		return sym->resolveBranchVA();
else if (relocAttrs.hasAttr(RelocAttrBits::GOT))		else if (relocAttrs.hasAttr(RelocAttrBits::GOT))
return sym->resolveGotVA();		return sym->resolveGotVA();
else if (relocAttrs.hasAttr(RelocAttrBits::TLV))		else if (relocAttrs.hasAttr(RelocAttrBits::TLV))
Show All 13 Lines	for (size_t i = 0; i < relocs.size(); i++) {
const Reloc &r = relocs[i];		const Reloc &r = relocs[i];
uint8_t *loc = buf + r.offset;		uint8_t *loc = buf + r.offset;
uint64_t referentVA = 0;		uint64_t referentVA = 0;
if (target->hasAttr(r.type, RelocAttrBits::SUBTRAHEND)) {		if (target->hasAttr(r.type, RelocAttrBits::SUBTRAHEND)) {
const Symbol fromSym = r.referent.get<Symbol >();		const Symbol fromSym = r.referent.get<Symbol >();
const Reloc &minuend = relocs[++i];		const Reloc &minuend = relocs[++i];
uint64_t minuendVA;		uint64_t minuendVA;
if (const Symbol toSym = minuend.referent.dyn_cast<Symbol >())		if (const Symbol toSym = minuend.referent.dyn_cast<Symbol >())
minuendVA = toSym->getVA();		minuendVA = toSym->getVA() + minuend.addend;
else {		else {
auto referentIsec = minuend.referent.get<InputSection >();		auto referentIsec = minuend.referent.get<InputSection >();
assert(!referentIsec->shouldOmitFromOutput());		assert(!referentIsec->shouldOmitFromOutput());
minuendVA = referentIsec->getVA();		minuendVA = referentIsec->getVA(minuend.addend);
}		}
referentVA = minuendVA - fromSym->getVA() + minuend.addend;		referentVA = minuendVA - fromSym->getVA();
} else if (auto referentSym = r.referent.dyn_cast<Symbol >()) {		} else if (auto referentSym = r.referent.dyn_cast<Symbol >()) {
if (target->hasAttr(r.type, RelocAttrBits::LOAD) &&		if (target->hasAttr(r.type, RelocAttrBits::LOAD) &&
!referentSym->isInGot())		!referentSym->isInGot())
target->relaxGotLoad(loc, r.type);		target->relaxGotLoad(loc, r.type);
referentVA = resolveSymbolVA(referentSym, r.type);		referentVA = resolveSymbolVA(referentSym, r.type) + r.addend;

if (isThreadLocalVariables(flags)) {		if (isThreadLocalVariables(flags)) {
// References from thread-local variable sections are treated as offsets		// References from thread-local variable sections are treated as offsets
// relative to the start of the thread-local data memory area, which		// relative to the start of the thread-local data memory area, which
// is initialized via copying all the TLV data sections (which are all		// is initialized via copying all the TLV data sections (which are all
// contiguous).		// contiguous).
if (isa<Defined>(referentSym))		if (isa<Defined>(referentSym))
referentVA -= firstTLVDataSection->addr;		referentVA -= firstTLVDataSection->addr;
}		}
} else if (auto referentIsec = r.referent.dyn_cast<InputSection >()) {		} else if (auto referentIsec = r.referent.dyn_cast<InputSection >()) {
assert(!referentIsec->shouldOmitFromOutput());		assert(!referentIsec->shouldOmitFromOutput());
referentVA = referentIsec->getVA();		referentVA = referentIsec->getVA(r.addend);
		}
		target->relocateOne(loc, r, referentVA, getVA(r.offset));
		}
		}

		void CStringInputSection::splitIntoPieces() {
		size_t off = 0;
		StringRef s = toStringRef(data);
		while (!s.empty()) {
		size_t end = s.find(0);
		if (end == StringRef::npos)
		fatal(toString(this) + ": string is not null terminated");
		size_t size = end + 1;
		pieces.emplace_back(off, xxHash64(s.substr(0, size)));
		s = s.substr(size);
		off += size;
}		}
target->relocateOne(loc, r, referentVA + r.addend, getVA() + r.offset);
}		}

		const StringPiece &CStringInputSection::getStringPiece(uint64_t off) const {
		if (off >= data.size())
		fatal(toString(this) + ": offset is outside the section");

		auto it =
		partition_point(pieces, [=](StringPiece p) { return p.inSecOff <= off; });
		return it[-1];
		}

		uint64_t CStringInputSection::getFileOffset(uint64_t off) const {
		return parent->fileOff + getOffset(off);
		}

		uint64_t CStringInputSection::getOffset(uint64_t off) const {
		const StringPiece &piece = getStringPiece(off);
		uint64_t addend = off - piece.inSecOff;
		return piece.outSecOff + addend;
}		}

bool macho::isCodeSection(const InputSection *isec) {		bool macho::isCodeSection(const InputSection *isec) {
uint32_t type = isec->flags & SECTION_TYPE;		uint32_t type = isec->flags & SECTION_TYPE;
if (type != S_REGULAR && type != S_COALESCED)		if (type != S_REGULAR && type != S_COALESCED)
return false;		return false;

uint32_t attr = isec->flags & SECTION_ATTRIBUTES_USR;		uint32_t attr = isec->flags & SECTION_ATTRIBUTES_USR;
Show All 14 Lines

lld/MachO/Options.td

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	def lto_legacy_pass_manager: Flag<["--"], "lto-legacy-pass-manager">,
Group<grp_lld>;		Group<grp_lld>;
def no_lto_legacy_pass_manager : Flag<["--"], "no-lto-legacy-pass-manager">,		def no_lto_legacy_pass_manager : Flag<["--"], "no-lto-legacy-pass-manager">,
HelpText<"Use the new pass manager in LLVM">,		HelpText<"Use the new pass manager in LLVM">,
Group<grp_lld>;		Group<grp_lld>;
def time_trace: Flag<["--"], "time-trace">, HelpText<"Record time trace">;		def time_trace: Flag<["--"], "time-trace">, HelpText<"Record time trace">;
def time_trace_granularity_eq: Joined<["--"], "time-trace-granularity=">,		def time_trace_granularity_eq: Joined<["--"], "time-trace-granularity=">,
HelpText<"Minimum time granularity (in microseconds) traced by time profiler">;		HelpText<"Minimum time granularity (in microseconds) traced by time profiler">;
def time_trace_file_eq: Joined<["--"], "time-trace-file=">, HelpText<"Specify time trace output file">;		def time_trace_file_eq: Joined<["--"], "time-trace-file=">, HelpText<"Specify time trace output file">;
		def deduplicate_literals: Flag<["--"], "deduplicate-literals">, HelpText<"Enable literal deduplication">;

// This is a complete Options.td compiled from Apple's ld(1) manpage		// This is a complete Options.td compiled from Apple's ld(1) manpage
// dated 2018-03-07 and cross checked with ld64 source code in repo		// dated 2018-03-07 and cross checked with ld64 source code in repo
// https://github.com/apple-opensource/ld64 at git tag "512.4" dated		// https://github.com/apple-opensource/ld64 at git tag "512.4" dated
// 2018-03-18.		// 2018-03-18.

// Flags<[HelpHidden]> marks options that are not yet ported to lld,		// Flags<[HelpHidden]> marks options that are not yet ported to lld,
// and serve as a scoreboard for annotating our progress toward		// and serve as a scoreboard for annotating our progress toward
▲ Show 20 Lines • Show All 1,261 Lines • Show Last 20 Lines

lld/MachO/Symbols.cpp

Show First 20 Lines • Show All 62 Lines • ▼ Show 20 Lines	if (!isec->isFinal) {
assert(target->usesThunks());		assert(target->usesThunks());

// ConcatOutputSection::finalize() can seek the address of a		// ConcatOutputSection::finalize() can seek the address of a
// function before its address is assigned. The thunking algorithm		// function before its address is assigned. The thunking algorithm
// knows that unfinalized functions will be out of range, so it is		// knows that unfinalized functions will be out of range, so it is
// expedient to return a contrived out-of-range address.		// expedient to return a contrived out-of-range address.
return TargetInfo::outOfRangeVA;		return TargetInfo::outOfRangeVA;
}		}
return isec->getVA() + value;		return isec->getVA(value);
}		}

uint64_t Defined::getFileOffset() const {		uint64_t Defined::getFileOffset() const {
if (isAbsolute()) {		if (isAbsolute()) {
error("absolute symbol " + toString(*this) +		error("absolute symbol " + toString(*this) +
" does not have a file offset");		" does not have a file offset");
return 0;		return 0;
}		}
return isec->getFileOffset() + value;		return isec->getFileOffset(value);
}		}

uint64_t DylibSymbol::getVA() const {		uint64_t DylibSymbol::getVA() const {
return isInStubs() ? getStubVA() : Symbol::getVA();		return isInStubs() ? getStubVA() : Symbol::getVA();
}		}

void LazySymbol::fetchArchiveMember() { getFile()->fetch(sym); }		void LazySymbol::fetchArchiveMember() { getFile()->fetch(sym); }

lld/MachO/SyntheticSections.h

Show All 12 Lines
#include "ExportTrie.h"		#include "ExportTrie.h"
#include "InputSection.h"		#include "InputSection.h"
#include "OutputSection.h"		#include "OutputSection.h"
#include "OutputSegment.h"		#include "OutputSegment.h"
#include "Target.h"		#include "Target.h"

#include "llvm/ADT/Hashing.h"		#include "llvm/ADT/Hashing.h"
#include "llvm/ADT/SetVector.h"		#include "llvm/ADT/SetVector.h"
		#include "llvm/MC/StringTableBuilder.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"

namespace llvm {		namespace llvm {
class DWARFUnit;		class DWARFUnit;
} // namespace llvm		} // namespace llvm

namespace lld {		namespace lld {
▲ Show 20 Lines • Show All 120 Lines • ▼ Show 20 Lines
};		};

struct Location {		struct Location {
const InputSection *isec;		const InputSection *isec;
uint64_t offset;		uint64_t offset;

Location(const InputSection *isec, uint64_t offset)		Location(const InputSection *isec, uint64_t offset)
: isec(isec), offset(offset) {}		: isec(isec), offset(offset) {}
uint64_t getVA() const { return isec->getVA() + offset; }		uint64_t getVA() const { return isec->getVA(offset); }
};		};

// Stores rebase opcodes, which tell dyld where absolute addresses have been		// Stores rebase opcodes, which tell dyld where absolute addresses have been
// encoded in the binary. If the binary is not loaded at its preferred address,		// encoded in the binary. If the binary is not loaded at its preferred address,
// dyld has to rebase these addresses by adding an offset to them.		// dyld has to rebase these addresses by adding an offset to them.
class RebaseSection : public LinkEditSection {		class RebaseSection : public LinkEditSection {
public:		public:
RebaseSection();		RebaseSection();
▲ Show 20 Lines • Show All 155 Lines • ▼ Show 20 Lines	public:
DylibSymbol *stubBinder = nullptr;		DylibSymbol *stubBinder = nullptr;
Defined *dyldPrivate = nullptr;		Defined *dyldPrivate = nullptr;
};		};

// This section contains space for just a single word, and will be used by dyld		// This section contains space for just a single word, and will be used by dyld
// to cache an address to the image loader it uses. Note that unlike the other		// to cache an address to the image loader it uses. Note that unlike the other
// synthetic sections, which are OutputSections, the ImageLoaderCacheSection is		// synthetic sections, which are OutputSections, the ImageLoaderCacheSection is
// an InputSection that gets merged into the __data OutputSection.		// an InputSection that gets merged into the __data OutputSection.
class ImageLoaderCacheSection : public InputSection {		class ImageLoaderCacheSection : public ConcatInputSection {
public:		public:
ImageLoaderCacheSection();		ImageLoaderCacheSection();
uint64_t getSize() const override { return target->wordSize; }		uint64_t getSize() const override { return target->wordSize; }
};		};

// Note that this section may also be targeted by non-lazy bindings. In		// Note that this section may also be targeted by non-lazy bindings. In
// particular, this happens when branch relocations target weak symbols.		// particular, this happens when branch relocations target weak symbols.
class LazyPointerSection : public SyntheticSection {		class LazyPointerSection : public SyntheticSection {
▲ Show 20 Lines • Show All 172 Lines • ▼ Show 20 Lines	public:
void finalize() override;		void finalize() override;
void writeTo(uint8_t *buf) const override;		void writeTo(uint8_t *buf) const override;

private:		private:
llvm::SmallString<261> xarPath;		llvm::SmallString<261> xarPath;
uint64_t xarSize;		uint64_t xarSize;
};		};

		class CStringSection : public SyntheticSection {
		public:
		CStringSection();
		void addInput(CStringInputSection *);
		uint64_t getSize() const override { return builder.getSize(); }
		void finalize() override;
		bool isNeeded() const override { return !inputs.empty(); }
		void writeTo(uint8_t *buf) const override { builder.write(buf); }

		std::vector<CStringInputSection *> inputs;

		private:
		llvm::StringTableBuilder builder;
		};

struct InStruct {		struct InStruct {
MachHeaderSection *header = nullptr;		MachHeaderSection *header = nullptr;
		CStringSection *cStringSection = nullptr;
RebaseSection *rebase = nullptr;		RebaseSection *rebase = nullptr;
BindingSection *binding = nullptr;		BindingSection *binding = nullptr;
WeakBindingSection *weakBinding = nullptr;		WeakBindingSection *weakBinding = nullptr;
LazyBindingSection *lazyBinding = nullptr;		LazyBindingSection *lazyBinding = nullptr;
ExportSection *exports = nullptr;		ExportSection *exports = nullptr;
GotSection *got = nullptr;		GotSection *got = nullptr;
TlvPointerSection *tlvPointers = nullptr;		TlvPointerSection *tlvPointers = nullptr;
LazyPointerSection *lazyPointers = nullptr;		LazyPointerSection *lazyPointers = nullptr;
Show All 15 Lines

lld/MachO/SyntheticSections.cpp

Show First 20 Lines • Show All 42 Lines • ▼ Show 20 Lines
using namespace lld;		using namespace lld;
using namespace lld::macho;		using namespace lld::macho;

InStruct macho::in;		InStruct macho::in;
std::vector<SyntheticSection *> macho::syntheticSections;		std::vector<SyntheticSection *> macho::syntheticSections;

SyntheticSection::SyntheticSection(const char segname, const char name)		SyntheticSection::SyntheticSection(const char segname, const char name)
: OutputSection(SyntheticKind, name), segname(segname) {		: OutputSection(SyntheticKind, name), segname(segname) {
isec = make<InputSection>();		isec = make<ConcatInputSection>();
isec->segname = segname;		isec->segname = segname;
isec->name = name;		isec->name = name;
isec->parent = this;		isec->parent = this;
isec->outSecOff = 0;
syntheticSections.push_back(this);		syntheticSections.push_back(this);
}		}

// dyld3's MachOLoaded::getSlide() assumes that the __TEXT segment starts		// dyld3's MachOLoaded::getSlide() assumes that the __TEXT segment starts
// from the beginning of the file (i.e. the header).		// from the beginning of the file (i.e. the header).
MachHeaderSection::MachHeaderSection()		MachHeaderSection::MachHeaderSection()
: SyntheticSection(segment_names::text, section_names::header) {		: SyntheticSection(segment_names::text, section_names::header) {
// XXX: This is a hack. (See D97007)		// XXX: This is a hack. (See D97007)
▲ Show 20 Lines • Show All 133 Lines • ▼ Show 20 Lines	if (locations.empty())
return;		return;

raw_svector_ostream os{contents};		raw_svector_ostream os{contents};
Rebase lastRebase;		Rebase lastRebase;

os << static_cast<uint8_t>(REBASE_OPCODE_SET_TYPE_IMM \| REBASE_TYPE_POINTER);		os << static_cast<uint8_t>(REBASE_OPCODE_SET_TYPE_IMM \| REBASE_TYPE_POINTER);

llvm::sort(locations, [](const Location &a, const Location &b) {		llvm::sort(locations, [](const Location &a, const Location &b) {
return a.isec->getVA() < b.isec->getVA();		return a.isec->getVA(a.offset) < b.isec->getVA(b.offset);
});		});
for (const Location &loc : locations)		for (const Location &loc : locations)
encodeRebase(loc.isec->parent, loc.isec->outSecOff + loc.offset, lastRebase,		encodeRebase(loc.isec->parent, loc.isec->getOffset(loc.offset), lastRebase,
os);		os);
if (lastRebase.consecutiveCount != 0)		if (lastRebase.consecutiveCount != 0)
encodeDoRebase(lastRebase, os);		encodeDoRebase(lastRebase, os);

os << static_cast<uint8_t>(REBASE_OPCODE_DONE);		os << static_cast<uint8_t>(REBASE_OPCODE_DONE);
}		}

void RebaseSection::writeTo(uint8_t *buf) const {		void RebaseSection::writeTo(uint8_t *buf) const {
▲ Show 20 Lines • Show All 145 Lines • ▼ Show 20 Lines	void BindingSection::finalizeContents() {
});		});
for (const BindingEntry &b : bindings) {		for (const BindingEntry &b : bindings) {
int16_t ordinal = ordinalForDylibSymbol(*b.dysym);		int16_t ordinal = ordinalForDylibSymbol(*b.dysym);
if (ordinal != lastBinding.ordinal) {		if (ordinal != lastBinding.ordinal) {
encodeDylibOrdinal(ordinal, os);		encodeDylibOrdinal(ordinal, os);
lastBinding.ordinal = ordinal;		lastBinding.ordinal = ordinal;
}		}
encodeBinding(b.dysym, b.target.isec->parent,		encodeBinding(b.dysym, b.target.isec->parent,
b.target.isec->outSecOff + b.target.offset, b.addend,		b.target.isec->getOffset(b.target.offset), b.addend,
/isWeakBinding=/false, lastBinding, os);		/isWeakBinding=/false, lastBinding, os);
}		}
if (!bindings.empty())		if (!bindings.empty())
os << static_cast<uint8_t>(BIND_OPCODE_DONE);		os << static_cast<uint8_t>(BIND_OPCODE_DONE);
}		}

void BindingSection::writeTo(uint8_t *buf) const {		void BindingSection::writeTo(uint8_t *buf) const {
memcpy(buf, contents.data(), contents.size());		memcpy(buf, contents.data(), contents.size());
Show All 12 Lines	void WeakBindingSection::finalizeContents() {
// Since bindings are delta-encoded, sorting them allows for a more compact		// Since bindings are delta-encoded, sorting them allows for a more compact
// result.		// result.
llvm::sort(bindings,		llvm::sort(bindings,
[](const WeakBindingEntry &a, const WeakBindingEntry &b) {		[](const WeakBindingEntry &a, const WeakBindingEntry &b) {
return a.target.getVA() < b.target.getVA();		return a.target.getVA() < b.target.getVA();
});		});
for (const WeakBindingEntry &b : bindings)		for (const WeakBindingEntry &b : bindings)
encodeBinding(b.symbol, b.target.isec->parent,		encodeBinding(b.symbol, b.target.isec->parent,
b.target.isec->outSecOff + b.target.offset, b.addend,		b.target.isec->getOffset(b.target.offset), b.addend,
/isWeakBinding=/true, lastBinding, os);		/isWeakBinding=/true, lastBinding, os);
if (!bindings.empty() \|\| !definitions.empty())		if (!bindings.empty() \|\| !definitions.empty())
os << static_cast<uint8_t>(BIND_OPCODE_DONE);		os << static_cast<uint8_t>(BIND_OPCODE_DONE);
}		}

void WeakBindingSection::writeTo(uint8_t *buf) const {		void WeakBindingSection::writeTo(uint8_t *buf) const {
memcpy(buf, contents.data(), contents.size());		memcpy(buf, contents.data(), contents.size());
}		}
▲ Show 20 Lines • Show All 663 Lines • ▼ Show 20 Lines	void BitcodeBundleSection::writeTo(uint8_t *buf) const {
if (ec)		if (ec)
fatal("failed to map XAR file");		fatal("failed to map XAR file");
memcpy(buf, xarMap.const_data(), xarSize);		memcpy(buf, xarMap.const_data(), xarSize);

closeFile(handle);		closeFile(handle);
remove(xarPath);		remove(xarPath);
}		}

		// Mergeable cstring literals are found under the __TEXT,__cstring section. In
		// contrast to ELF, which puts strings that need different alignments into
		// different sections, clang's Mach-O backend puts them all in one section.
		// Strings that need to be aligned have the .p2align directive emitted before
		// them, which simply translates into zero padding in the object file.
		gkmUnsubmitted Done Reply Inline Actions The extremity of the target-dependent difference in alignment requirement is surprising, and worthy of a comment. gkm: The extremity of the target-dependent difference in alignment requirement is surprising, and…
		int3AuthorUnsubmitted Done Reply Inline Actions Good point. I've copied the relevant bits of the commit message. int3: Good point. I've copied the relevant bits of the commit message.
		//
		// I think ld64 extracts the desired per-string alignment from this data by
		// preserving each string's offset from the last section-aligned address. I'm
		// not entirely certain since it doesn't seem consistent about doing this, and
		// in fact doesn't seem to be correct in general: we can in fact can induce ld64
		// to produce a crashing binary just by linking in an additional object file
		// that only contains a duplicate cstring at a different alignment. See PR50563
		// for details.
		//
		// In practice, the cstrings we've seen so far that require special aligment are
		// all accessed by x86_64 SIMD operations -- x86_64 requires SIMD accesses to be
		// 16-byte-aligned. So for now, I'm just aligning all strings to 16 bytes on
		// x86_64. This is indeed wasteful, but implementation-wise it's simpler than
		// preserving per-string alignment+offsets. It also avoids the aforementioned
		// crash after deduplication of differently-aligned strings. Finally, the
		// overhead is not huge: using 16-byte alignment (vs no alignment) is only a
		// 0.5% size overhead when linking chromium_framework.
		CStringSection::CStringSection()
		: SyntheticSection(segment_names::text, section_names::cString),
		builder(StringTableBuilder::RAW,
		/Alignment=/target->cpuType == CPU_TYPE_X86_64 ? 16 : 1) {
		align = target->cpuType == CPU_TYPE_X86_64 ? 16 : 1;
		flags = S_CSTRING_LITERALS;
		}

		void CStringSection::addInput(CStringInputSection *isec) {
		isec->parent = this;
		inputs.push_back(isec);
		}

		void CStringSection::finalize() {
		// Add all string pieces to the string table builder to create section
		// contents.
		for (const CStringInputSection *isec : inputs)
		for (size_t i = 0, e = isec->pieces.size(); i != e; ++i)
		builder.add(isec->getCachedHashStringRef(i));

		// Fix the string table content. After this, the contents will never change.
		builder.finalizeInOrder();

		// finalize() fixed tail-optimized strings, so we can now get
		// offsets of strings. Get an offset for each string and save it
		// to a corresponding SectionPiece for easy access.
		for (CStringInputSection *isec : inputs) {
		for (size_t i = 0, e = isec->pieces.size(); i != e; ++i) {
		isec->pieces[i].outSecOff =
		builder.getOffset(isec->getCachedHashStringRef(i));
		isec->isFinal = true;
		}
		}
		}

void macho::createSyntheticSymbols() {		void macho::createSyntheticSymbols() {
auto addHeaderSymbol = [](const char *name) {		auto addHeaderSymbol = [](const char *name) {
symtab->addSynthetic(name, in.header->isec, /value=/0,		symtab->addSynthetic(name, in.header->isec, /value=/0,
/privateExtern=/true, /includeInSymtab=/false,		/privateExtern=/true, /includeInSymtab=/false,
/referencedDynamically=/false);		/referencedDynamically=/false);
};		};

switch (config->outputType) {		switch (config->outputType) {
▲ Show 20 Lines • Show All 46 Lines • Show Last 20 Lines

lld/MachO/UnwindInfoSection.cpp

	Show First 20 Lines • Show All 208 Lines • ▼ Show 20 Lines
	// We need to apply the relocations to the pre-link compact unwind section			// We need to apply the relocations to the pre-link compact unwind section
	// before converting it to post-link form. There should only be absolute			// before converting it to post-link form. There should only be absolute
	// relocations here: since we are not emitting the pre-link CU section, there			// relocations here: since we are not emitting the pre-link CU section, there
	// is no source address to make a relative location meaningful.			// is no source address to make a relative location meaningful.
	template <class Ptr>			template <class Ptr>
	static void			static void
	relocateCompactUnwind(ConcatOutputSection *compactUnwindSection,			relocateCompactUnwind(ConcatOutputSection *compactUnwindSection,
	std::vector<CompactUnwindEntry<Ptr>> &cuVector) {			std::vector<CompactUnwindEntry<Ptr>> &cuVector) {
	for (const InputSection *isec : compactUnwindSection->inputs) {			for (const ConcatInputSection *isec : compactUnwindSection->inputs) {
	assert(isec->parent == compactUnwindSection);			assert(isec->parent == compactUnwindSection);

	uint8_t *buf =			uint8_t *buf =
	reinterpret_cast<uint8_t *>(cuVector.data()) + isec->outSecFileOff;			reinterpret_cast<uint8_t *>(cuVector.data()) + isec->outSecFileOff;
	memcpy(buf, isec->data.data(), isec->data.size());			memcpy(buf, isec->data.data(), isec->data.size());

	for (const Reloc &r : isec->relocs) {			for (const Reloc &r : isec->relocs) {
	uint64_t referentVA = 0;			uint64_t referentVA = 0;
	if (auto referentSym = r.referent.dyn_cast<Symbol >()) {			if (auto referentSym = r.referent.dyn_cast<Symbol >()) {
	if (!isa<Undefined>(referentSym)) {			if (!isa<Undefined>(referentSym)) {
	assert(referentSym->isInGot());			assert(referentSym->isInGot());
	if (auto *defined = dyn_cast<Defined>(referentSym))			if (auto *defined = dyn_cast<Defined>(referentSym))
	checkTextSegment(defined->isec);			checkTextSegment(defined->isec);
	// At this point in the link, we may not yet know the final address of			// At this point in the link, we may not yet know the final address of
	// the GOT, so we just encode the index. We make it a 1-based index so			// the GOT, so we just encode the index. We make it a 1-based index so
	// that we can distinguish the null pointer case.			// that we can distinguish the null pointer case.
	referentVA = referentSym->gotIndex + 1;			referentVA = referentSym->gotIndex + 1;
	}			}
	} else if (auto referentIsec = r.referent.dyn_cast<InputSection >()) {			} else if (auto referentIsec = r.referent.dyn_cast<InputSection >()) {
	checkTextSegment(referentIsec);			checkTextSegment(referentIsec);
	if (referentIsec->shouldOmitFromOutput())			if (referentIsec->shouldOmitFromOutput())
	referentVA = UINT64_MAX; // Tombstone value			referentVA = UINT64_MAX; // Tombstone value
	else			else
	referentVA = referentIsec->getVA() + r.addend;			referentVA = referentIsec->getVA(r.addend);
	}			}

	writeAddress(buf + r.offset, referentVA, r.length);			writeAddress(buf + r.offset, referentVA, r.length);
	}			}
	}			}
	}			}

	// There should only be a handful of unique personality pointers, so we can			// There should only be a handful of unique personality pointers, so we can
	▲ Show 20 Lines • Show All 307 Lines • Show Last 20 Lines

lld/MachO/Writer.cpp

Show First 20 Lines • Show All 857 Lines • ▼ Show 20 Lines	template <class LP> void Writer::createOutputSections() {
}		}

// Then add input sections to output sections.		// Then add input sections to output sections.
DenseMap<NamePair, ConcatOutputSection *> concatOutputSections;		DenseMap<NamePair, ConcatOutputSection *> concatOutputSections;
for (const auto &p : enumerate(inputSections)) {		for (const auto &p : enumerate(inputSections)) {
InputSection *isec = p.value();		InputSection *isec = p.value();
if (isec->shouldOmitFromOutput())		if (isec->shouldOmitFromOutput())
continue;		continue;
		if (auto *concatIsec = dyn_cast<ConcatInputSection>(isec)) {
NamePair names = maybeRenameSection({isec->segname, isec->name});		NamePair names = maybeRenameSection({isec->segname, isec->name});
ConcatOutputSection *&osec = concatOutputSections[names];		ConcatOutputSection *&osec = concatOutputSections[names];
if (osec == nullptr) {		if (osec == nullptr) {
osec = make<ConcatOutputSection>(names.second);		osec = make<ConcatOutputSection>(names.second);
osec->inputOrder = p.index();		osec->inputOrder = p.index();
}		}
osec->addInput(isec);		osec->addInput(concatIsec);
		} else if (auto *cStringIsec = dyn_cast<CStringInputSection>(isec)) {
		if (in.cStringSection->inputs.empty())
		in.cStringSection->inputOrder = p.index();
		in.cStringSection->addInput(cStringIsec);
		}
}		}

		// Once all the inputs are added, we can finalize the output section
		// properties and create the corresponding output segments.
for (const auto &it : concatOutputSections) {		for (const auto &it : concatOutputSections) {
StringRef segname = it.first.first;		StringRef segname = it.first.first;
ConcatOutputSection *osec = it.second;		ConcatOutputSection *osec = it.second;
if (segname == segment_names::ld) {		if (segname == segment_names::ld) {
assert(osec->name == section_names::compactUnwind);		assert(osec->name == section_names::compactUnwind);
in.unwindInfo->setCompactUnwindSection(osec);		in.unwindInfo->setCompactUnwindSection(osec);
} else {		} else {
getOrCreateOutputSegment(segname)->addOutputSection(osec);		getOrCreateOutputSegment(segname)->addOutputSection(osec);
}		}
}		}

for (SyntheticSection *ssec : syntheticSections) {		for (SyntheticSection *ssec : syntheticSections) {
auto it = concatOutputSections.find({ssec->segname, ssec->name});		auto it = concatOutputSections.find({ssec->segname, ssec->name});
		if (ssec->isNeeded()) {
if (it == concatOutputSections.end()) {		if (it == concatOutputSections.end()) {
		int3AuthorUnsubmitted Done Reply Inline Actions An earlier implementation of this diff always created the CStringLiteralSection, even if literal merging was disabled. I therefore hoisted out this check to avoid having a conflict between the unneeded CStringLiteralSection and the actual ConcatOutputSection when literal merging was not being done. We now only create the CStringLiteralSection as-needed, so this is likely unnecessary. However, I think it still makes sense to avoid unnecessary section name conflicts, so I've left it in. int3: An earlier implementation of this diff always created the CStringLiteralSection, even if…
if (ssec->isNeeded())
getOrCreateOutputSegment(ssec->segname)->addOutputSection(ssec);		getOrCreateOutputSegment(ssec->segname)->addOutputSection(ssec);
} else {		} else {
error("section from " + toString(it->second->firstSection()->file) +		fatal("section from " + toString(it->second->firstSection()->file) +
" conflicts with synthetic section " + ssec->segname + "," +		" conflicts with synthetic section " + ssec->segname + "," +
ssec->name);		ssec->name);
}		}
}		}
		}

// dyld requires __LINKEDIT segment to always exist (even if empty).		// dyld requires __LINKEDIT segment to always exist (even if empty).
linkEditSegment = getOrCreateOutputSegment(segment_names::linkEdit);		linkEditSegment = getOrCreateOutputSegment(segment_names::linkEdit);
}		}

void Writer::finalizeAddresses() {		void Writer::finalizeAddresses() {
TimeTraceScope timeScope("Finalize addresses");		TimeTraceScope timeScope("Finalize addresses");
uint64_t pageSize = target->getPageSize();		uint64_t pageSize = target->getPageSize();
▲ Show 20 Lines • Show All 130 Lines • ▼ Show 20 Lines	template <class LP> void Writer::run() {
writeMapFile();		writeMapFile();
writeOutputFile();		writeOutputFile();
}		}

template <class LP> void macho::writeResult() { Writer().run<LP>(); }		template <class LP> void macho::writeResult() { Writer().run<LP>(); }

void macho::createSyntheticSections() {		void macho::createSyntheticSections() {
in.header = make<MachHeaderSection>();		in.header = make<MachHeaderSection>();
		in.cStringSection = config->dedupLiterals ? make<CStringSection>() : nullptr;
in.rebase = make<RebaseSection>();		in.rebase = make<RebaseSection>();
in.binding = make<BindingSection>();		in.binding = make<BindingSection>();
in.weakBinding = make<WeakBindingSection>();		in.weakBinding = make<WeakBindingSection>();
in.lazyBinding = make<LazyBindingSection>();		in.lazyBinding = make<LazyBindingSection>();
in.exports = make<ExportSection>();		in.exports = make<ExportSection>();
in.got = make<GotSection>();		in.got = make<GotSection>();
in.tlvPointers = make<TlvPointerSection>();		in.tlvPointers = make<TlvPointerSection>();
in.lazyPointers = make<LazyPointerSection>();		in.lazyPointers = make<LazyPointerSection>();
Show All 10 Lines

lld/test/MachO/cstring-dedup.s

This file was added.

				# REQUIRES: x86
				# RUN: rm -rf %t; split-file %s %t
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/test.s -o %t/test.o
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/more-foo.s -o %t/more-foo.o
				# RUN: %lld -dylib --deduplicate-literals %t/test.o %t/more-foo.o -o %t/test
				# RUN: llvm-objdump --macho --section="__TEXT,__cstring" %t/test \| \
				# RUN: FileCheck %s --check-prefix=STR --implicit-check-not foo --implicit-check-not bar
				# RUN: llvm-objdump --macho --section="__DATA,ptrs" --syms %t/test \| FileCheck %s
				# RUN: llvm-readobj --section-headers %t/test \| FileCheck %s --check-prefix=HEADER

				## Make sure we only have 3 deduplicated strings in __cstring, and that they
				## are 16-byte-aligned.
				# STR: Contents of (__TEXT,__cstring) section
				# STR: {{.*}}0 foo
				# STR: {{.*}}0 barbaz
				# STR: {{.*}}0 {{$}}

				## Make sure both symbol and section relocations point to the right thing.
				# CHECK: Contents of (__DATA,ptrs) section
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:foo
				# CHECK-NEXT: __TEXT:__cstring:barbaz
				# CHECK-NEXT: __TEXT:__cstring:baz
				# CHECK-NEXT: __TEXT:__cstring:barbaz
				# CHECK-NEXT: __TEXT:__cstring:baz
				# CHECK-NEXT: __TEXT:__cstring:{{$}}
				# CHECK-NEXT: __TEXT:__cstring:{{$}}

				## Make sure the symbol addresses are correct too.
				# CHECK: SYMBOL TABLE:
				# CHECK-DAG: [[#%.16x,FOO:]] l O __TEXT,__cstring _local_foo1
				# CHECK-DAG: [[#FOO]] l O __TEXT,__cstring _local_foo2
				# CHECK-DAG: [[#FOO]] g O __TEXT,__cstring _globl_foo1
				# CHECK-DAG: [[#FOO]] g O __TEXT,__cstring _globl_foo2
				# CHECK-DAG: [[#%.16x,BAR:]] l O __TEXT,__cstring _bar1
				# CHECK-DAG: [[#BAR]] l O __TEXT,__cstring _bar2
				# CHECK-DAG: [[#%.16x,ZERO:]] l O __TEXT,__cstring _zero1
				# CHECK-DAG: [[#ZERO]] l O __TEXT,__cstring _zero2

				## Make sure we set the right alignment and flags.
				# HEADER: Name: __cstring
				# HEADER-NEXT: Segment: __TEXT
				# HEADER-NEXT: Address:
				# HEADER-NEXT: Size:
				# HEADER-NEXT: Offset:
				# HEADER-NEXT: Alignment: 4
				# HEADER-NEXT: RelocationOffset:
				# HEADER-NEXT: RelocationCount: 0
				# HEADER-NEXT: Type: CStringLiterals
				# HEADER-NEXT: Attributes [ (0x0)
				# HEADER-NEXT: ]
				# HEADER-NEXT: Reserved1: 0x0
				# HEADER-NEXT: Reserved2: 0x0
				# HEADER-NEXT: Reserved3: 0x0

				#--- test.s
				.cstring
				.p2align 2
				_local_foo1:
				.asciz "foo"
				_local_foo2:
				.asciz "foo"
				L_.foo1:
				.asciz "foo"
				L_.foo2:
				.asciz "foo"

				_bar1:
				.ascii "bar"
				_baz1:
				.asciz "baz"
				_bar2:
				.ascii "bar"
				_baz2:
				.asciz "baz"

				_zero1:
				.asciz ""
				_zero2:
				.asciz ""

				.section __DATA,ptrs,literal_pointers
				.quad L_.foo1
				.quad L_.foo2
				.quad _local_foo1
				.quad _local_foo2
				.quad _globl_foo1
				.quad _globl_foo2
				.quad _bar1
				.quad _baz1
				.quad _bar2
				.quad _baz2
				.quad _zero1
				.quad _zero2

				#--- more-foo.s
				.globl _globl_foo1, _globl_foo2
				.cstring
				.p2align 4
				_globl_foo1:
				.asciz "foo"
				_globl_foo2:
				.asciz "foo"

lld/test/MachO/invalid/cstring-dedup.s

This file was added.

				# REQUIRES: x86
				# RUN: rm -rf %t; split-file %s %t
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/not-terminated.s -o %t/not-terminated.o
				# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %t/relocs.s -o %t/relocs.o

				# RUN: not %lld -dylib --deduplicate-literals %t/not-terminated.o 2>&1 \| FileCheck %s --check-prefix=TERM
				# RUN: not %lld -dylib --deduplicate-literals %t/relocs.o 2>&1 \| FileCheck %s --check-prefix=RELOCS

				# TERM: not-terminated.o:(__cstring): string is not null terminated
				# RELOCS: relocs.o contains relocations in __TEXT,__cstring, so LLD cannot deduplicate literals. Try re-running without --deduplicate-literals.

				#--- not-terminated.s
				.cstring
				.asciz "foo"
				.ascii "oh no"

				#--- relocs.s
				.cstring
				_str:
				.asciz "foo"
				.quad _str

lld/test/MachO/invalid/reserved-section-name.s

	# REQUIRES: x86			# REQUIRES: x86
	# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %s -o %t.o
	# RUN: not %lld -o %t %t.o 2>&1 \| FileCheck %s -DFILE=%t.o			# RUN: not %lld -o %t %t.o 2>&1 \| FileCheck %s -DFILE=%t.o
	# CHECK: error: section from [[FILE]] conflicts with synthetic section __DATA_CONST,__got			# CHECK: error: section from [[FILE]] conflicts with synthetic section __DATA_CONST,__got

	.globl _main			.globl _main

	.section __DATA_CONST,__got			.section __DATA_CONST,__got
	.space 1			.space 1

				.data
				_foo:
				.space 1

	.text			.text
	_main:			_main:
	mov $0, %rax			## make sure the GOT will be needed
				pushq _foo@GOTPCREL(%rip)
	ret			ret

lld/test/MachO/subsections-section-relocs.s

	Show All 33 Lines
	_foo_str:			_foo_str:
	.asciz "foo"			.asciz "foo"

	_bar_str:			_bar_str:
	.asciz "bar"			.asciz "bar"

	## References to this generate a section relocation			## References to this generate a section relocation
	## N.B.: ld64 doesn't actually reorder symbols in __cstring based on the order			## N.B.: ld64 doesn't actually reorder symbols in __cstring based on the order
	## file. Only our implementation does. However, I'm not sure how else to			## file. Our implementation only does does so if --no-literal-merge is
	## test section relocations that target an address inside a relocated			## specified. I'm not sure how else to test section relocations that
	## symbol: using a non-__cstring section would cause llvm-mc to emit a			## target an address inside a relocated symbol: using a non-__cstring
	## symbol relocation instead using the nearest symbol.			## section would cause llvm-mc to emit a symbol relocation instead using
				## the nearest symbol. It might be more consistent for LLD to disable
				## symbol-based cstring reordering altogether and leave this functionality
				## untested, at least until we find a real-world use case...
	L_.str:			L_.str:
	.asciz "Private symbol"			.asciz "Private symbol"

	.subsections_via_symbols			.subsections_via_symbols

lld/test/MachO/x86-64-relocs.s

	# REQUIRES: x86			# REQUIRES: x86
	# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=x86_64-apple-darwin %s -o %t.o
	# RUN: %lld -lSystem -o %t %t.o			# RUN: %lld -lSystem -o %t %t.o
	# RUN: llvm-objdump --section-headers --syms -d %t \| FileCheck %s			# RUN: llvm-objdump --section-headers --syms -d %t \| FileCheck %s

	# CHECK-LABEL: Sections:			# CHECK-LABEL: Sections:
	# CHECK: __cstring {{[0-9a-z]+}} [[#%x, CSTRING_ADDR:]]			# CHECK: __data {{[0-9a-z]+}} [[#%x, DATA_ADDR:]]

	# CHECK-LABEL: SYMBOL TABLE:			# CHECK-LABEL: SYMBOL TABLE:
	# CHECK: [[#%x, F_ADDR:]] {{.*}} _f			# CHECK: [[#%x, F_ADDR:]] {{.*}} _f

	# CHECK-LABEL: <_main>:			# CHECK-LABEL: <_main>:
	## Test X86_64_RELOC_BRANCH			## Test X86_64_RELOC_BRANCH
	# CHECK: callq 0x[[#%x, F_ADDR]] <_f>			# CHECK: callq 0x[[#%x, F_ADDR]] <_f>
	## Test extern (symbol) X86_64_RELOC_SIGNED			## Test extern (symbol) X86_64_RELOC_SIGNED
	# CHECK: leaq [[#%u, STR_OFF:]](%rip), %rsi			# CHECK: leaq [[#%u, LOCAL_OFF:]](%rip), %rsi
	# CHECK-NEXT: [[#%x, CSTRING_ADDR - STR_OFF]]			# CHECK-NEXT: [[#%x, DATA_ADDR - LOCAL_OFF]]
	## Test non-extern (section) X86_64_RELOC_SIGNED			## Test non-extern (section) X86_64_RELOC_SIGNED
	# CHECK: leaq [[#%u, LSTR_OFF:]](%rip), %rsi			# CHECK: leaq [[#%u, PRIVATE_OFF:]](%rip), %rsi
	# CHECK-NEXT: [[#%x, CSTRING_ADDR + 22 - LSTR_OFF]]			# CHECK-NEXT: [[#%x, DATA_ADDR + 8 - PRIVATE_OFF]]

	# RUN: llvm-objdump --section=__const --full-contents %t \| FileCheck %s --check-prefix=NONPCREL			# RUN: llvm-objdump --section=__const --full-contents %t \| FileCheck %s --check-prefix=NONPCREL
	# NONPCREL: Contents of section __DATA_CONST,__const:			# NONPCREL: Contents of section __DATA_CONST,__const:
	# NONPCREL-NEXT: 100001000 18040000 01000000 18040000 01000000			# NONPCREL-NEXT: 100001000 08200000 01000000 08200000 01000000

	.section __TEXT,__text			.section __TEXT,__text
	.globl _main, _f			.globl _main, _f
	_main:			_main:
	callq _f # X86_64_RELOC_BRANCH			callq _f # X86_64_RELOC_BRANCH
	mov $0, %rax			mov $0, %rax
	ret			ret

	_f:			_f:
	movl $0x2000004, %eax # write() syscall			leaq _local(%rip), %rsi # Generates a X86_64_RELOC_SIGNED pcrel symbol relocation
	mov $1, %rdi # stdout			leaq L_.private(%rip), %rsi # Generates a X86_64_RELOC_SIGNED pcrel section relocation
	leaq _str(%rip), %rsi # Generates a X86_64_RELOC_SIGNED pcrel symbol relocation			movq L_.ptr_1(%rip), %rsi
	mov $21, %rdx # length of str
	syscall

	movl $0x2000004, %eax # write() syscall
	mov $1, %rdi # stdout
	leaq L_.str(%rip), %rsi # Generates a X86_64_RELOC_SIGNED pcrel section relocation
	mov $15, %rdx # length of str
	syscall

	movl $0x2000004, %eax # write() syscall
	mov $1, %rdi # stdout
	movq L_.ptr_1_to_str(%rip), %rsi
	mov $15, %rdx # length of str
	syscall
	ret			ret

	.section __TEXT,__cstring			.data
	## References to this generate a symbol relocation			## References to this generate a symbol relocation
	_str:			_local:
	.asciz "Local defined symbol\n"			.quad 123
	## References to this generate a section relocation			## References to this generate a section relocation
	L_.str:			L_.private:
	.asciz "Private symbol\n"			.quad 123

	.section __DATA,__const			.section __DATA,__const
	## These generate X86_64_RELOC_UNSIGNED non-pcrel section relocations			## These generate X86_64_RELOC_UNSIGNED non-pcrel section relocations
	L_.ptr_1_to_str:			L_.ptr_1:
	.quad L_.str			.quad L_.private
	L_.ptr_2_to_str:			L_.ptr_2:
	.quad L_.str			.quad L_.private

This is an archive of the discontinued LLVM Phabricator instance.

[lld-macho] Implement cstring deduplicationClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 350484

lld/MachO/ConcatOutputSection.h

lld/MachO/ConcatOutputSection.cpp

lld/MachO/Config.h

lld/MachO/Driver.cpp

lld/MachO/InputFiles.h

lld/MachO/InputFiles.cpp

lld/MachO/InputSection.h

lld/MachO/InputSection.cpp

lld/MachO/Options.td

lld/MachO/Symbols.cpp

lld/MachO/SyntheticSections.h

lld/MachO/SyntheticSections.cpp

lld/MachO/UnwindInfoSection.cpp

lld/MachO/Writer.cpp

lld/test/MachO/cstring-dedup.s

lld/test/MachO/invalid/cstring-dedup.s

lld/test/MachO/invalid/reserved-section-name.s

lld/test/MachO/subsections-section-relocs.s

lld/test/MachO/x86-64-relocs.s

[lld-macho] Implement cstring deduplication
ClosedPublic