This is an archive of the discontinued LLVM Phabricator instance.


Differential D98571

[lld-macho] Optimize getRelocAttrs()
Abandoned · Public

Authored by int3 on Mar 12 2021, 6:05 PM.

Details

Reviewers

None

Group Reviewers

Restricted Project

Summary

Investigation of PR49480 showed that D95121 caused about a 5.0% speed
regression when linking chromium_framework. That diff introduces a (very
useful) additional layer of abstraction over relocations, so the perf
overhead is not too surprising. The diff is pretty large, and perf
didn't give me any great hints, so I just went with optimizing the
likely candidate -- getRelocAttrs(). I managed to claw back about
1.4% of perf this way. Making the relocAttrsArray a global and
devirtualizing getRelocAttrs() gave most of the win; I also marked
the array range check with LLVM_UNLIKELY for good measure.
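
The shape of the optimization can be sketched in isolation. The following is a minimal, simplified illustration and not the patch itself: `DemoAttrs`, `demoX86Table`, `selectDemoTarget`, `getDemoAttrs`, and `DEMO_UNLIKELY` are hypothetical names standing in for `RelocAttrs`, the per-target arrays, the constructor assignments, `getRelocAttrs()`, and `LLVM_UNLIKELY`. It assumes a GCC/Clang-style `__builtin_expect` for the branch hint and falls back to a plain condition elsewhere.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Stand-in for LLVM_UNLIKELY (assumption: GCC/Clang builtin, no-op elsewhere).
#if defined(__GNUC__) || defined(__clang__)
#define DEMO_UNLIKELY(x) __builtin_expect(!!(x), 0)
#else
#define DEMO_UNLIKELY(x) (x)
#endif

// Hypothetical, simplified stand-in for lld's RelocAttrs.
struct DemoAttrs {
  const char *name;
  uint32_t bits;
};

static const DemoAttrs demoInvalidAttrs{"INVALID", 0};

// One table per target; here only a toy three-entry table.
static const std::array<DemoAttrs, 3> demoX86Table{{
    {"UNSIGNED", 1u << 0},
    {"SIGNED", 1u << 1},
    {"BRANCH", 1u << 2},
}};

// Globals that are set once when the target is selected, so the hot-path
// lookup needs no virtual call and no load through a target object.
static const DemoAttrs *demoAttrsTable = nullptr;
static size_t demoAttrsTableSize = 0;

static inline void selectDemoTarget(const DemoAttrs *table, size_t size) {
  assert(table != nullptr && "target must provide a table");
  demoAttrsTable = table;
  demoAttrsTableSize = size;
}

// Non-virtual lookup: a bounds check hinted as unlikely, then a direct index.
static inline const DemoAttrs &getDemoAttrs(uint8_t type) {
  if (DEMO_UNLIKELY(type >= demoAttrsTableSize))
    return demoInvalidAttrs;
  return demoAttrsTable[type];
}
```

After `selectDemoTarget(demoX86Table.data(), demoX86Table.size())`, a call such as `getDemoAttrs(1)` resolves to a compare and an indexed load, and an out-of-range type falls back to the shared invalid entry.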

The numbers above are quoted for chromium_framework (from the tarball in
PR48657).

    N           Min           Max        Median           Avg        Stddev
x  20           4.5          4.66          4.56        4.5715   0.044871161
+  20          4.42          4.61           4.5        4.5075   0.053001986
Difference at 95.0% confidence
        -0.064 +/- 0.0314295
        -1.39998% +/- 0.68751%
        (Student's t, pooled s = 0.0491052)

I also measured v8_unittests:

    N           Min           Max        Median           Avg        Stddev
x  20          0.62          0.65          0.64        0.6355  0.0082557795
+  20          0.61          0.63          0.62         0.616  0.0059824304
Difference at 95.0% confidence
        -0.0195 +/- 0.00461426
        -3.06845% +/- 0.726084%
        (Student's t, pooled s = 0.00720928)
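
As a sanity check on how the summary rows relate to the per-sample rows, the arithmetic can be reproduced with a short sketch. This is only an illustration of the standard pooled-standard-deviation formula; `Sample`, `Comparison`, and `compareSamples` are hypothetical names, and the confidence half-width is left out because it depends on a t-table value that the output above does not state.

```cpp
#include <cassert>
#include <cmath>

// Summary statistics for one set of timing runs (hypothetical helper type).
struct Sample {
  int n;         // number of runs
  double mean;   // average time
  double stddev; // sample standard deviation
};

struct Comparison {
  double diff;           // test mean minus baseline mean
  double relDiffPercent; // diff relative to the baseline mean, in percent
  double pooledStddev;   // pooled standard deviation of the two samples
};

static Comparison compareSamples(const Sample &base, const Sample &test) {
  // Pooled variance weights each sample variance by its degrees of freedom.
  double pooledVar = ((base.n - 1) * base.stddev * base.stddev +
                      (test.n - 1) * test.stddev * test.stddev) /
                     (base.n + test.n - 2);
  Comparison c;
  c.diff = test.mean - base.mean;
  c.relDiffPercent = 100.0 * c.diff / base.mean;
  c.pooledStddev = std::sqrt(pooledVar);
  return c;
}
```

Feeding in the chromium_framework rows (N = 20, averages 4.5715 and 4.5075) gives a difference of about -0.064, a relative difference of about -1.4%, and a pooled s of about 0.0491, in line with the figures quoted above.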

The v8 difference is likely larger because it doesn't use an order file.
Symbol ordering is actually one of the most expensive steps when linking
chromium_framework, and probably is a target for further optimization.

The other hotspot is the assignment of relocations to subsections. I'm
curious as to whether replacing the RB-tree in std::map with a radix
trie would be an improvement...

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

int3 created this revision. · Mar 12 2021, 6:05 PM

Herald added a project: Restricted Project. · Mar 12 2021, 6:05 PM

int3 requested review of this revision. · Mar 12 2021, 6:05 PM

Herald added a subscriber: llvm-commits.

int3 edited the summary of this revision. · Mar 12 2021, 6:06 PM

Harbormaster completed remote builds in B93633: Diff 330411. · Mar 12 2021, 6:37 PM

LLVM is not a big fan of std::map:
https://llvm.org/docs/ProgrammersManual.html#map

I'm aware :) We're using it here because we need upper_bound(). That said, a sorted vector (as mentioned in the manual) might help here: parseSymbols interleaves inserts and queries to the SubsectionMaps, but parseRelocations only does queries, so we could convert the map to a vector in the latter case.
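
A rough sketch of the sorted-vector idea, for illustration only: once no more inserts happen, the same "last entry at or before this offset" query can be answered with `std::upper_bound` over a vector of (offset, subsection) pairs. `DemoSubsection`, `DemoSubsectionVec`, and `findContainingDemo` are hypothetical names, and the sketch assumes the vector is already sorted by offset and that its first entry starts at or before the queried offset.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical stand-in for an InputSection.
struct DemoSubsection {
  const char *name;
};

// (start offset, subsection) pairs, sorted by start offset.
using DemoSubsectionVec = std::vector<std::pair<uint32_t, DemoSubsection *>>;

// Mirrors the std::prev(map.upper_bound(off)) pattern on a sorted vector:
// find the first entry starting after *offset, then step back one entry.
static DemoSubsection *findContainingDemo(const DemoSubsectionVec &vec,
                                          uint32_t *offset) {
  auto it = std::upper_bound(
      vec.begin(), vec.end(), *offset,
      [](uint32_t off, const std::pair<uint32_t, DemoSubsection *> &entry) {
        return off < entry.first;
      });
  assert(it != vec.begin() && "offset precedes the first subsection");
  --it;
  *offset -= it->first; // rewrite the offset relative to the subsection start
  return it->second;
}
```

With subsections starting at offsets 0 and 16, a query for offset 20 lands in the second subsection and rewrites the offset to 4. The lookup is still logarithmic, but it walks contiguous memory instead of chasing tree nodes.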

update

Harbormaster completed remote builds in B93684: Diff 330476. · Mar 13 2021, 3:20 PM

Maybe using templates would be better here, similar to LLD-ELF's ELFT type. Thanks @smeenai for the idea.
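
To make the template idea concrete, here is a small sketch under loud assumptions: it is not how LLD-ELF's ELFT works in detail, and `TraitAttrs`, `DemoX86Traits`, `DemoARM64Traits`, and `getAttrsFor` are hypothetical names. The point is only that when the architecture is a template parameter, the table is known at compile time and no global pointer or virtual call is needed.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical, simplified stand-in for lld's RelocAttrs.
struct TraitAttrs {
  const char *name;
  uint32_t bits;
};

// Each architecture is a type that exposes its own table.
struct DemoX86Traits {
  static const std::array<TraitAttrs, 2> &table() {
    static const std::array<TraitAttrs, 2> t{{
        {"UNSIGNED", 1},
        {"SIGNED", 2},
    }};
    return t;
  }
};

struct DemoARM64Traits {
  static const std::array<TraitAttrs, 3> &table() {
    static const std::array<TraitAttrs, 3> t{{
        {"UNSIGNED", 1},
        {"SUBTRACTOR", 2},
        {"BRANCH26", 4},
    }};
    return t;
  }
};

// The architecture is fixed at compile time, so the compiler sees the
// concrete table and its size at every call site.
template <class Arch> const TraitAttrs &getAttrsFor(uint8_t type) {
  static const TraitAttrs invalid{"INVALID", 0};
  const auto &t = Arch::table();
  return type < t.size() ? t[type] : invalid;
}
```

The cost is that every function on the hot path has to be templated on the architecture as well, which is the usual trade-off with this style.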

This is kind of ugly, and I think there's lower-hanging performance fruit, so we can revisit this later

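Revision Contents

Path                                 Size
lld/MachO/Arch/ARM64.cpp             42 lines
lld/MachO/Arch/X86_64.cpp            36 lines
lld/MachO/InputFiles.cpp             10 lines
lld/MachO/InputSection.cpp           7 lines
lld/MachO/Relocations.cpp            7 lines
lld/MachO/Target.h                   23 lines
lld/MachO/Target.cpp                 2 lines
lld/MachO/UnwindInfoSection.cpp      2 lines
lld/MachO/Writer.cpp                 4 lines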

Diff 330411

lld/MachO/Arch/ARM64.cpp

[33 lines not shown]
void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,
uint64_t pc) const override;		uint64_t pc) const override;

void writeStub(uint8_t *buf, const macho::Symbol &) const override;		void writeStub(uint8_t *buf, const macho::Symbol &) const override;
void writeStubHelperHeader(uint8_t *buf) const override;		void writeStubHelperHeader(uint8_t *buf) const override;
void writeStubHelperEntry(uint8_t *buf, const DylibSymbol &,		void writeStubHelperEntry(uint8_t *buf, const DylibSymbol &,
uint64_t entryAddr) const override;		uint64_t entryAddr) const override;

void relaxGotLoad(uint8_t *loc, uint8_t type) const override;		void relaxGotLoad(uint8_t *loc, uint8_t type) const override;
const RelocAttrs &getRelocAttrs(uint8_t type) const override;
uint64_t getPageSize() const override { return 16 * 1024; }		uint64_t getPageSize() const override { return 16 * 1024; }
};		};

} // namespace		} // namespace

// Random notes on reloc types:		// Random notes on reloc types:
// ADDEND always pairs with BRANCH26, PAGE21, or PAGEOFF12		// ADDEND always pairs with BRANCH26, PAGE21, or PAGEOFF12
// POINTER_TO_GOT: ld64 supports a 4-byte pc-relative form as well as an 8-byte		// POINTER_TO_GOT: ld64 supports a 4-byte pc-relative form as well as an 8-byte
// absolute version of this relocation. The semantics of the absolute relocation		// absolute version of this relocation. The semantics of the absolute relocation
// are weird -- it results in the value of the GOT slot being written, instead		// are weird -- it results in the value of the GOT slot being written, instead
// of the address. Let's not support it unless we find a real-world use case.		// of the address. Let's not support it unless we find a real-world use case.

const RelocAttrs &ARM64::getRelocAttrs(uint8_t type) const {		static const std::array<RelocAttrs, 11> ARM64RelocAttrsArray{{
		Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'ARM64RelocAttrsArray' [readability-identifier-naming]
static const std::array<RelocAttrs, 11> relocAttrsArray{{
#define B(x) RelocAttrBits::x		#define B(x) RelocAttrBits::x
{"UNSIGNED", B(UNSIGNED) | B(ABSOLUTE) | B(EXTERN) | B(LOCAL) |		{"UNSIGNED", B(UNSIGNED) | B(ABSOLUTE) | B(EXTERN) | B(LOCAL) | B(DYSYM8) |
B(DYSYM8) | B(BYTE4) | B(BYTE8)},		B(BYTE4) | B(BYTE8)},
{"SUBTRACTOR", B(SUBTRAHEND) | B(BYTE4) | B(BYTE8)},		{"SUBTRACTOR", B(SUBTRAHEND) | B(BYTE4) | B(BYTE8)},
{"BRANCH26", B(PCREL) | B(EXTERN) | B(BRANCH) | B(BYTE4)},		{"BRANCH26", B(PCREL) | B(EXTERN) | B(BRANCH) | B(BYTE4)},
{"PAGE21", B(PCREL) | B(EXTERN) | B(BYTE4)},		{"PAGE21", B(PCREL) | B(EXTERN) | B(BYTE4)},
{"PAGEOFF12", B(ABSOLUTE) | B(EXTERN) | B(BYTE4)},		{"PAGEOFF12", B(ABSOLUTE) | B(EXTERN) | B(BYTE4)},
{"GOT_LOAD_PAGE21", B(PCREL) | B(EXTERN) | B(GOT) | B(BYTE4)},		{"GOT_LOAD_PAGE21", B(PCREL) | B(EXTERN) | B(GOT) | B(BYTE4)},
{"GOT_LOAD_PAGEOFF12",		{"GOT_LOAD_PAGEOFF12",
B(ABSOLUTE) | B(EXTERN) | B(GOT) | B(LOAD) | B(BYTE4)},		B(ABSOLUTE) | B(EXTERN) | B(GOT) | B(LOAD) | B(BYTE4)},
{"POINTER_TO_GOT", B(PCREL) | B(EXTERN) | B(GOT) | B(POINTER) | B(BYTE4)},		{"POINTER_TO_GOT", B(PCREL) | B(EXTERN) | B(GOT) | B(POINTER) | B(BYTE4)},
{"TLVP_LOAD_PAGE21", B(PCREL) | B(EXTERN) | B(TLV) | B(BYTE4)},		{"TLVP_LOAD_PAGE21", B(PCREL) | B(EXTERN) | B(TLV) | B(BYTE4)},
{"TLVP_LOAD_PAGEOFF12",		{"TLVP_LOAD_PAGEOFF12",
B(ABSOLUTE) | B(EXTERN) | B(TLV) | B(LOAD) | B(BYTE4)},		B(ABSOLUTE) | B(EXTERN) | B(TLV) | B(LOAD) | B(BYTE4)},
{"ADDEND", B(ADDEND)},		{"ADDEND", B(ADDEND)},
#undef B		#undef B
}};		}};
assert(type < relocAttrsArray.size() && "invalid relocation type");
if (type >= relocAttrsArray.size())
return invalidRelocAttrs;
return relocAttrsArray[type];
}

int64_t ARM64::getEmbeddedAddend(MemoryBufferRef mb, const section_64 &sec,		int64_t ARM64::getEmbeddedAddend(MemoryBufferRef mb, const section_64 &sec,
const relocation_info rel) const {		const relocation_info rel) const {
if (rel.r_type != ARM64_RELOC_UNSIGNED &&		if (rel.r_type != ARM64_RELOC_UNSIGNED &&
rel.r_type != ARM64_RELOC_SUBTRACTOR) {		rel.r_type != ARM64_RELOC_SUBTRACTOR) {
// All other reloc types should use the ADDEND relocation to store their		// All other reloc types should use the ADDEND relocation to store their
// addends.		// addends.
// TODO(gkm): extract embedded addend just so we can assert that it is 0		// TODO(gkm): extract embedded addend just so we can assert that it is 0
[206 lines not shown]

ARM64::ARM64() {		ARM64::ARM64() {
cpuType = CPU_TYPE_ARM64;		cpuType = CPU_TYPE_ARM64;
cpuSubtype = CPU_SUBTYPE_ARM64_ALL;		cpuSubtype = CPU_SUBTYPE_ARM64_ALL;

stubSize = sizeof(stubCode);		stubSize = sizeof(stubCode);
stubHelperHeaderSize = sizeof(stubHelperHeaderCode);		stubHelperHeaderSize = sizeof(stubHelperHeaderCode);
stubHelperEntrySize = sizeof(stubHelperEntryCode);		stubHelperEntrySize = sizeof(stubHelperEntryCode);

		relocAttrsArray = ARM64RelocAttrsArray.data();
		relocAttrsArraySize = ARM64RelocAttrsArray.size();
}		}

TargetInfo *macho::createARM64TargetInfo() {		TargetInfo *macho::createARM64TargetInfo() {
static ARM64 t;		static ARM64 t;
return &t;		return &t;
}		}

lld/MachO/Arch/X86_64.cpp

[30 lines not shown]
void relocateOne(uint8_t *loc, const Reloc &, uint64_t va,
uint64_t relocVA) const override;		uint64_t relocVA) const override;

void writeStub(uint8_t *buf, const macho::Symbol &) const override;		void writeStub(uint8_t *buf, const macho::Symbol &) const override;
void writeStubHelperHeader(uint8_t *buf) const override;		void writeStubHelperHeader(uint8_t *buf) const override;
void writeStubHelperEntry(uint8_t *buf, const DylibSymbol &,		void writeStubHelperEntry(uint8_t *buf, const DylibSymbol &,
uint64_t entryAddr) const override;		uint64_t entryAddr) const override;

void relaxGotLoad(uint8_t *loc, uint8_t type) const override;		void relaxGotLoad(uint8_t *loc, uint8_t type) const override;
const RelocAttrs &getRelocAttrs(uint8_t type) const override;
uint64_t getPageSize() const override { return 4 * 1024; }		uint64_t getPageSize() const override { return 4 * 1024; }
};		};

} // namespace		} // namespace

const RelocAttrs &X86_64::getRelocAttrs(uint8_t type) const {		static const std::array<RelocAttrs, 10> X86_64RelocAttrsArray{{
		Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'X86_64RelocAttrsArray' [readability-identifier-naming]
static const std::array<RelocAttrs, 10> relocAttrsArray{{
#define B(x) RelocAttrBits::x		#define B(x) RelocAttrBits::x
{"UNSIGNED", B(UNSIGNED) | B(ABSOLUTE) | B(EXTERN) | B(LOCAL) |		{"UNSIGNED", B(UNSIGNED) | B(ABSOLUTE) | B(EXTERN) | B(LOCAL) | B(DYSYM8) |
B(DYSYM8) | B(BYTE4) | B(BYTE8)},		B(BYTE4) | B(BYTE8)},
{"SIGNED", B(PCREL) | B(EXTERN) | B(LOCAL) | B(BYTE4)},		{"SIGNED", B(PCREL) | B(EXTERN) | B(LOCAL) | B(BYTE4)},
{"BRANCH", B(PCREL) | B(EXTERN) | B(BRANCH) | B(BYTE4)},		{"BRANCH", B(PCREL) | B(EXTERN) | B(BRANCH) | B(BYTE4)},
{"GOT_LOAD", B(PCREL) | B(EXTERN) | B(GOT) | B(LOAD) | B(BYTE4)},		{"GOT_LOAD", B(PCREL) | B(EXTERN) | B(GOT) | B(LOAD) | B(BYTE4)},
{"GOT", B(PCREL) | B(EXTERN) | B(GOT) | B(POINTER) | B(BYTE4)},		{"GOT", B(PCREL) | B(EXTERN) | B(GOT) | B(POINTER) | B(BYTE4)},
{"SUBTRACTOR", B(SUBTRAHEND) | B(BYTE4) | B(BYTE8)},		{"SUBTRACTOR", B(SUBTRAHEND) | B(BYTE4) | B(BYTE8)},
{"SIGNED_1", B(PCREL) | B(EXTERN) | B(LOCAL) | B(BYTE4)},		{"SIGNED_1", B(PCREL) | B(EXTERN) | B(LOCAL) | B(BYTE4)},
{"SIGNED_2", B(PCREL) | B(EXTERN) | B(LOCAL) | B(BYTE4)},		{"SIGNED_2", B(PCREL) | B(EXTERN) | B(LOCAL) | B(BYTE4)},
{"SIGNED_4", B(PCREL) | B(EXTERN) | B(LOCAL) | B(BYTE4)},		{"SIGNED_4", B(PCREL) | B(EXTERN) | B(LOCAL) | B(BYTE4)},
{"TLV", B(PCREL) | B(EXTERN) | B(TLV) | B(LOAD) | B(BYTE4)},		{"TLV", B(PCREL) | B(EXTERN) | B(TLV) | B(LOAD) | B(BYTE4)},
#undef B		#undef B
}};		}};
assert(type < relocAttrsArray.size() && "invalid relocation type");
if (type >= relocAttrsArray.size())
return invalidRelocAttrs;
return relocAttrsArray[type];
}

static int pcrelOffset(uint8_t type) {		static int pcrelOffset(uint8_t type) {
switch (type) {		switch (type) {
case X86_64_RELOC_SIGNED_1:		case X86_64_RELOC_SIGNED_1:
return 1;		return 1;
case X86_64_RELOC_SIGNED_2:		case X86_64_RELOC_SIGNED_2:
return 2;		return 2;
case X86_64_RELOC_SIGNED_4:		case X86_64_RELOC_SIGNED_4:
[110 lines not shown]

X86_64::X86_64() {		X86_64::X86_64() {
cpuType = CPU_TYPE_X86_64;		cpuType = CPU_TYPE_X86_64;
cpuSubtype = CPU_SUBTYPE_X86_64_ALL;		cpuSubtype = CPU_SUBTYPE_X86_64_ALL;

stubSize = sizeof(stub);		stubSize = sizeof(stub);
stubHelperHeaderSize = sizeof(stubHelperHeader);		stubHelperHeaderSize = sizeof(stubHelperHeader);
stubHelperEntrySize = sizeof(stubHelperEntry);		stubHelperEntrySize = sizeof(stubHelperEntry);

		relocAttrsArray = X86_64RelocAttrsArray.data();
		relocAttrsArraySize = X86_64RelocAttrsArray.size();
}		}

TargetInfo *macho::createX86_64TargetInfo() {		TargetInfo *macho::createX86_64TargetInfo() {
static X86_64 t;		static X86_64 t;
return &t;		return &t;
}		}

lld/MachO/InputFiles.cpp

[197 lines not shown]
static InputSection *findContainingSubsection(SubsectionMap &map,
uint32_t *offset) {		uint32_t *offset) {
auto it = std::prev(map.upper_bound(*offset));		auto it = std::prev(map.upper_bound(*offset));
*offset -= it->first;		*offset -= it->first;
return it->second;		return it->second;
}		}

static bool validateRelocationInfo(InputFile *file, const section_64 &sec,		static bool validateRelocationInfo(InputFile *file, const section_64 &sec,
relocation_info rel) {		relocation_info rel) {
const RelocAttrs &relocAttrs = target->getRelocAttrs(rel.r_type);		const RelocAttrs &relocAttrs = getRelocAttrs(rel.r_type);
bool valid = true;		bool valid = true;
auto message = [relocAttrs, file, sec, rel, &valid](const Twine &diagnostic) {		auto message = [relocAttrs, file, sec, rel, &valid](const Twine &diagnostic) {
valid = false;		valid = false;
return (relocAttrs.name + " relocation " + diagnostic + " at offset " +		return (relocAttrs.name + " relocation " + diagnostic + " at offset " +
std::to_string(rel.r_address) + " of " + sec.segname + "," +		std::to_string(rel.r_address) + " of " + sec.segname + "," +
sec.sectname + " in " + toString(file))		sec.sectname + " in " + toString(file))
.str();		.str();
};		};
[46 lines not shown]
for (size_t i = 0; i < relInfos.size(); i++) {
// instruction-stream bytes that would otherwise contain zeroes. By		// instruction-stream bytes that would otherwise contain zeroes. By
// contrast, RISC ISAs such as ARM64 mix opcode bits with with		// contrast, RISC ISAs such as ARM64 mix opcode bits with with
// address bits so that bitwise arithmetic is necessary to extract		// address bits so that bitwise arithmetic is necessary to extract
// and insert them. Storing addends in the instruction stream is		// and insert them. Storing addends in the instruction stream is
// possible, but inconvenient and more costly at link time.		// possible, but inconvenient and more costly at link time.

int64_t pairedAddend = 0;		int64_t pairedAddend = 0;
relocation_info relInfo = relInfos[i];		relocation_info relInfo = relInfos[i];
if (target->hasAttr(relInfo.r_type, RelocAttrBits::ADDEND)) {		if (hasRelocAttr(relInfo.r_type, RelocAttrBits::ADDEND)) {
pairedAddend = SignExtend64<24>(relInfo.r_symbolnum);		pairedAddend = SignExtend64<24>(relInfo.r_symbolnum);
relInfo = relInfos[++i];		relInfo = relInfos[++i];
}		}
assert(i < relInfos.size());		assert(i < relInfos.size());
if (!validateRelocationInfo(this, sec, relInfo))		if (!validateRelocationInfo(this, sec, relInfo))
continue;		continue;
if (relInfo.r_address & R_SCATTERED)		if (relInfo.r_address & R_SCATTERED)
fatal("TODO: Scattered relocations not supported");		fatal("TODO: Scattered relocations not supported");
[15 lines not shown]
if (relInfo.r_extern) {
uint32_t referentOffset;		uint32_t referentOffset;
if (relInfo.r_pcrel) {		if (relInfo.r_pcrel) {
// The implicit addend for pcrel section relocations is the pcrel offset		// The implicit addend for pcrel section relocations is the pcrel offset
// in terms of the addresses in the input file. Here we adjust it so		// in terms of the addresses in the input file. Here we adjust it so
// that it describes the offset from the start of the referent section.		// that it describes the offset from the start of the referent section.
// FIXME This logic was written around x86_64 behavior -- ARM64 doesn't		// FIXME This logic was written around x86_64 behavior -- ARM64 doesn't
// have pcrel section relocations. We may want to factor this out into		// have pcrel section relocations. We may want to factor this out into
// the arch-specific .cpp file.		// the arch-specific .cpp file.
assert(target->hasAttr(r.type, RelocAttrBits::BYTE4));		assert(hasRelocAttr(r.type, RelocAttrBits::BYTE4));
referentOffset =		referentOffset =
sec.addr + relInfo.r_address + 4 + totalAddend - referentSec.addr;		sec.addr + relInfo.r_address + 4 + totalAddend - referentSec.addr;
} else {		} else {
// The addend for a non-pcrel relocation is its absolute address.		// The addend for a non-pcrel relocation is its absolute address.
referentOffset = totalAddend - referentSec.addr;		referentOffset = totalAddend - referentSec.addr;
}		}
r.referent = findContainingSubsection(referentSubsecMap, &referentOffset);		r.referent = findContainingSubsection(referentSubsecMap, &referentOffset);
r.addend = referentOffset;		r.addend = referentOffset;
}		}

InputSection *subsec = findContainingSubsection(subsecMap, &r.offset);		InputSection *subsec = findContainingSubsection(subsecMap, &r.offset);
subsec->relocs.push_back(r);		subsec->relocs.push_back(r);

if (target->hasAttr(r.type, RelocAttrBits::SUBTRAHEND)) {		if (hasRelocAttr(r.type, RelocAttrBits::SUBTRAHEND)) {
relInfo = relInfos[++i];		relInfo = relInfos[++i];
// SUBTRACTOR relocations should always be followed by an UNSIGNED one		// SUBTRACTOR relocations should always be followed by an UNSIGNED one
// indicating the minuend symbol.		// indicating the minuend symbol.
assert(target->hasAttr(relInfo.r_type, RelocAttrBits::UNSIGNED) &&		assert(hasRelocAttr(relInfo.r_type, RelocAttrBits::UNSIGNED) &&
relInfo.r_extern);		relInfo.r_extern);
Reloc p;		Reloc p;
p.type = relInfo.r_type;		p.type = relInfo.r_type;
p.referent = symbols[relInfo.r_symbolnum];		p.referent = symbols[relInfo.r_symbolnum];
subsec->relocs.push_back(p);		subsec->relocs.push_back(p);
}		}
}		}
}		}
[545 lines not shown]

lld/MachO/InputSection.cpp

	[30 lines not shown]
	uint64_t InputSection::getFileSize() const {			uint64_t InputSection::getFileSize() const {
	return isZeroFill(flags) ? 0 : getSize();			return isZeroFill(flags) ? 0 : getSize();
	}			}

	uint64_t InputSection::getVA() const { return parent->addr + outSecOff; }			uint64_t InputSection::getVA() const { return parent->addr + outSecOff; }

	static uint64_t resolveSymbolVA(uint8_t *loc, const lld::macho::Symbol &sym,			static uint64_t resolveSymbolVA(uint8_t *loc, const lld::macho::Symbol &sym,
	uint8_t type) {			uint8_t type) {
	const RelocAttrs &relocAttrs = target->getRelocAttrs(type);			const RelocAttrs &relocAttrs = getRelocAttrs(type);
	if (relocAttrs.hasAttr(RelocAttrBits::BRANCH)) {			if (relocAttrs.hasAttr(RelocAttrBits::BRANCH)) {
	if (sym.isInStubs())			if (sym.isInStubs())
	return in.stubs->addr + sym.stubsIndex * target->stubSize;			return in.stubs->addr + sym.stubsIndex * target->stubSize;
	} else if (relocAttrs.hasAttr(RelocAttrBits::GOT)) {			} else if (relocAttrs.hasAttr(RelocAttrBits::GOT)) {
	if (sym.isInGot())			if (sym.isInGot())
	return in.got->addr + sym.gotIndex * WordSize;			return in.got->addr + sym.gotIndex * WordSize;
	} else if (relocAttrs.hasAttr(RelocAttrBits::TLV)) {			} else if (relocAttrs.hasAttr(RelocAttrBits::TLV)) {
	if (sym.isInGot())			if (sym.isInGot())
	return in.tlvPointers->addr + sym.gotIndex * WordSize;			return in.tlvPointers->addr + sym.gotIndex * WordSize;
	assert(isa<Defined>(&sym));			assert(isa<Defined>(&sym));
	}			}
	return sym.getVA();			return sym.getVA();
	}			}

	void InputSection::writeTo(uint8_t *buf) {			void InputSection::writeTo(uint8_t *buf) {
	if (getFileSize() == 0)			if (getFileSize() == 0)
	return;			return;

	memcpy(buf, data.data(), data.size());			memcpy(buf, data.data(), data.size());

	for (size_t i = 0; i < relocs.size(); i++) {			for (size_t i = 0; i < relocs.size(); i++) {
	const Reloc &r = relocs[i];			const Reloc &r = relocs[i];
	uint8_t *loc = buf + r.offset;			uint8_t *loc = buf + r.offset;
	uint64_t referentVA = 0;			uint64_t referentVA = 0;
	if (target->hasAttr(r.type, RelocAttrBits::SUBTRAHEND)) {			if (hasRelocAttr(r.type, RelocAttrBits::SUBTRAHEND)) {
	const Symbol *fromSym = r.referent.get<Symbol *>();			const Symbol *fromSym = r.referent.get<Symbol *>();
	const Symbol *toSym = relocs[++i].referent.get<Symbol *>();			const Symbol *toSym = relocs[++i].referent.get<Symbol *>();
	referentVA = toSym->getVA() - fromSym->getVA();			referentVA = toSym->getVA() - fromSym->getVA();
	} else if (auto *referentSym = r.referent.dyn_cast<Symbol *>()) {			} else if (auto *referentSym = r.referent.dyn_cast<Symbol *>()) {
	if (target->hasAttr(r.type, RelocAttrBits::LOAD) &&			if (hasRelocAttr(r.type, RelocAttrBits::LOAD) && !referentSym->isInGot())
	!referentSym->isInGot())
	target->relaxGotLoad(loc, r.type);			target->relaxGotLoad(loc, r.type);
	referentVA = resolveSymbolVA(loc, *referentSym, r.type);			referentVA = resolveSymbolVA(loc, *referentSym, r.type);

	if (isThreadLocalVariables(flags)) {			if (isThreadLocalVariables(flags)) {
	// References from thread-local variable sections are treated as offsets			// References from thread-local variable sections are treated as offsets
	// relative to the start of the thread-local data memory area, which			// relative to the start of the thread-local data memory area, which
	// is initialized via copying all the TLV data sections (which are all			// is initialized via copying all the TLV data sections (which are all
	// contiguous).			// contiguous).
	[30 lines not shown]

lld/MachO/Relocations.cpp

	[13 lines not shown]
	#include "lld/Common/ErrorHandler.h"			#include "lld/Common/ErrorHandler.h"

	using namespace llvm;			using namespace llvm;
	using namespace lld;			using namespace lld;
	using namespace lld::macho;			using namespace lld::macho;

	bool macho::validateSymbolRelocation(const Symbol *sym,			bool macho::validateSymbolRelocation(const Symbol *sym,
	const InputSection *isec, const Reloc &r) {			const InputSection *isec, const Reloc &r) {
	const RelocAttrs &relocAttrs = target->getRelocAttrs(r.type);			const RelocAttrs &relocAttrs = getRelocAttrs(r.type);
	bool valid = true;			bool valid = true;
	auto message = [relocAttrs, sym, isec, &valid](const Twine &diagnostic) {			auto message = [relocAttrs, sym, isec, &valid](const Twine &diagnostic) {
	valid = false;			valid = false;
	return (relocAttrs.name + " relocation " + diagnostic + " for `" +			return (relocAttrs.name + " relocation " + diagnostic + " for `" +
	sym->getName() + "' in " + toString(isec))			sym->getName() + "' in " + toString(isec))
	.str();			.str();
	};			};

	[9 lines not shown]
	}			}

	void macho::reportRangeError(const Reloc &r, const Twine &v, uint8_t bits,			void macho::reportRangeError(const Reloc &r, const Twine &v, uint8_t bits,
	int64_t min, uint64_t max) {			int64_t min, uint64_t max) {
	std::string hint;			std::string hint;
	if (auto *sym = r.referent.dyn_cast<Symbol *>())			if (auto *sym = r.referent.dyn_cast<Symbol *>())
	hint = "; references " + toString(*sym);			hint = "; references " + toString(*sym);
	// TODO: get location of reloc using something like LLD-ELF's getErrorPlace()			// TODO: get location of reloc using something like LLD-ELF's getErrorPlace()
	error("relocation " + target->getRelocAttrs(r.type).name +			error("relocation " + getRelocAttrs(r.type).name + " is out of range: " + v +
	" is out of range: " + v + " is not in [" + Twine(min) + ", " +			" is not in [" + Twine(min) + ", " + Twine(max) + "]" + hint);
	Twine(max) + "]" + hint);
	}			}

	void macho::reportRangeError(SymbolDiagnostic d, const Twine &v, uint8_t bits,			void macho::reportRangeError(SymbolDiagnostic d, const Twine &v, uint8_t bits,
	int64_t min, uint64_t max) {			int64_t min, uint64_t max) {
	std::string hint;			std::string hint;
	if (d.symbol)			if (d.symbol)
	hint = "; references " + toString(*d.symbol);			hint = "; references " + toString(*d.symbol);
	error(d.reason + " is out of range: " + v + " is not in [" + Twine(min) +			error(d.reason + " is out of range: " + v + " is not in [" + Twine(min) +
	", " + Twine(max) + "]" + hint);			", " + Twine(max) + "]" + hint);
	}			}

	const RelocAttrs macho::invalidRelocAttrs{"INVALID", RelocAttrBits::_0};			const RelocAttrs macho::invalidRelocAttrs{"INVALID", RelocAttrBits::_0};

lld/MachO/Target.h

//===- Target.h -------------------------------------------------*- C++ -*-===//		//===- Target.h -------------------------------------------------*- C++ -*-===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLD_MACHO_TARGET_H		#ifndef LLD_MACHO_TARGET_H
#define LLD_MACHO_TARGET_H		#define LLD_MACHO_TARGET_H

#include "Relocations.h"		#include "Relocations.h"

#include "llvm/ADT/BitmaskEnum.h"		#include "llvm/ADT/BitmaskEnum.h"
#include "llvm/BinaryFormat/MachO.h"		#include "llvm/BinaryFormat/MachO.h"
		#include "llvm/Support/Compiler.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"

#include <cstddef>		#include <cstddef>
#include <cstdint>		#include <cstdint>

namespace lld {		namespace lld {
namespace macho {		namespace macho {
LLVM_ENABLE_BITMASK_ENUMS_IN_NAMESPACE();		LLVM_ENABLE_BITMASK_ENUMS_IN_NAMESPACE();
[30 lines not shown]
public:

// Symbols may be referenced via either the GOT or the stubs section,		// Symbols may be referenced via either the GOT or the stubs section,
// depending on the relocation type. prepareSymbolRelocation() will set up the		// depending on the relocation type. prepareSymbolRelocation() will set up the
// GOT/stubs entries, and resolveSymbolVA() will return the addresses of those		// GOT/stubs entries, and resolveSymbolVA() will return the addresses of those
// entries. resolveSymbolVA() may also relax the target instructions to save		// entries. resolveSymbolVA() may also relax the target instructions to save
// on a level of address indirection.		// on a level of address indirection.
virtual void relaxGotLoad(uint8_t *loc, uint8_t type) const = 0;		virtual void relaxGotLoad(uint8_t *loc, uint8_t type) const = 0;

virtual const RelocAttrs &getRelocAttrs(uint8_t type) const = 0;

virtual uint64_t getPageSize() const = 0;		virtual uint64_t getPageSize() const = 0;

bool hasAttr(uint8_t type, RelocAttrBits bit) const {
return getRelocAttrs(type).hasAttr(bit);
}

bool validateRelocationInfo(llvm::MemoryBufferRef,		bool validateRelocationInfo(llvm::MemoryBufferRef,
const llvm::MachO::section_64 &sec,		const llvm::MachO::section_64 &sec,
llvm::MachO::relocation_info);		llvm::MachO::relocation_info);
void prepareSymbolRelocation(Symbol *, const InputSection *, const Reloc &);		void prepareSymbolRelocation(Symbol *, const InputSection *, const Reloc &);

uint32_t cpuType;		uint32_t cpuType;
uint32_t cpuSubtype;		uint32_t cpuSubtype;

size_t stubSize;		size_t stubSize;
size_t stubHelperHeaderSize;		size_t stubHelperHeaderSize;
size_t stubHelperEntrySize;		size_t stubHelperEntrySize;
};		};

TargetInfo *createX86_64TargetInfo();		TargetInfo *createX86_64TargetInfo();
TargetInfo *createARM64TargetInfo();		TargetInfo *createARM64TargetInfo();

extern TargetInfo *target;		extern TargetInfo *target;

		// These values are target-specific, but we have hoisted them out of Target
		// because they are accessed in hot loops. Avoiding the extra indirection is a
		// perf win.
		extern size_t relocAttrsArraySize;
		extern const RelocAttrs *relocAttrsArray;

		inline const RelocAttrs &getRelocAttrs(uint8_t type) {
		if (LLVM_UNLIKELY(type >= relocAttrsArraySize))
		return invalidRelocAttrs;
		return relocAttrsArray[type];
		}

		inline bool hasRelocAttr(uint8_t type, RelocAttrBits bit) {
		return getRelocAttrs(type).hasAttr(bit);
		}

} // namespace macho		} // namespace macho
} // namespace lld		} // namespace lld

#endif		#endif

lld/MachO/Target.cpp

	//===- Target.cpp ---------------------------------------------------------===//			//===- Target.cpp ---------------------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "Target.h"			#include "Target.h"

	using namespace lld;			using namespace lld;
	using namespace lld::macho;			using namespace lld::macho;

	TargetInfo *macho::target = nullptr;			TargetInfo *macho::target = nullptr;
				size_t macho::relocAttrsArraySize = 0;
				const RelocAttrs *macho::relocAttrsArray = nullptr;

lld/MachO/UnwindInfoSection.cpp

	[107 lines not shown]
	// rebase opcodes for __LD,__compact_unwind, because that section doesn't			// rebase opcodes for __LD,__compact_unwind, because that section doesn't
	// actually end up in the final binary. Second, personality pointers always			// actually end up in the final binary. Second, personality pointers always
	// reside in the GOT and must be treated specially.			// reside in the GOT and must be treated specially.
	void macho::prepareCompactUnwind(InputSection *isec) {			void macho::prepareCompactUnwind(InputSection *isec) {
	assert(isec->segname == segment_names::ld &&			assert(isec->segname == segment_names::ld &&
	isec->name == section_names::compactUnwind);			isec->name == section_names::compactUnwind);

	for (Reloc &r : isec->relocs) {			for (Reloc &r : isec->relocs) {
	assert(target->hasAttr(r.type, RelocAttrBits::UNSIGNED));			assert(hasRelocAttr(r.type, RelocAttrBits::UNSIGNED));
	if (r.offset % sizeof(CompactUnwindEntry64) !=			if (r.offset % sizeof(CompactUnwindEntry64) !=
	offsetof(struct CompactUnwindEntry64, personality))			offsetof(struct CompactUnwindEntry64, personality))
	continue;			continue;

	if (auto *s = r.referent.dyn_cast<lld::macho::Symbol *>()) {			if (auto *s = r.referent.dyn_cast<lld::macho::Symbol *>()) {
	if (auto *undefined = dyn_cast<Undefined>(s)) {			if (auto *undefined = dyn_cast<Undefined>(s)) {
	treatUndefinedSymbol(*undefined);			treatUndefinedSymbol(*undefined);
	// treatUndefinedSymbol() can replace s with a DylibSymbol; re-check.			// treatUndefinedSymbol() can replace s with a DylibSymbol; re-check.
	[360 lines not shown]

lld/MachO/Writer.cpp

[438 lines not shown]
public:

CodeSignatureSection *section;		CodeSignatureSection *section;
};		};

} // namespace		} // namespace

static void prepareSymbolRelocation(lld::macho::Symbol *sym,		static void prepareSymbolRelocation(lld::macho::Symbol *sym,
const InputSection *isec, const Reloc &r) {		const InputSection *isec, const Reloc &r) {
const RelocAttrs &relocAttrs = target->getRelocAttrs(r.type);		const RelocAttrs &relocAttrs = getRelocAttrs(r.type);

if (relocAttrs.hasAttr(RelocAttrBits::BRANCH)) {		if (relocAttrs.hasAttr(RelocAttrBits::BRANCH)) {
prepareBranchTarget(sym);		prepareBranchTarget(sym);
} else if (relocAttrs.hasAttr(RelocAttrBits::GOT)) {		} else if (relocAttrs.hasAttr(RelocAttrBits::GOT)) {
if (relocAttrs.hasAttr(RelocAttrBits::POINTER) || needsBinding(sym))		if (relocAttrs.hasAttr(RelocAttrBits::POINTER) || needsBinding(sym))
in.got->addEntry(sym);		in.got->addEntry(sym);
} else if (relocAttrs.hasAttr(RelocAttrBits::TLV)) {		} else if (relocAttrs.hasAttr(RelocAttrBits::TLV)) {
if (needsBinding(sym))		if (needsBinding(sym))
[11 lines not shown]
void Writer::scanRelocations() {
for (InputSection *isec : inputSections) {		for (InputSection *isec : inputSections) {
if (isec->segname == segment_names::ld) {		if (isec->segname == segment_names::ld) {
prepareCompactUnwind(isec);		prepareCompactUnwind(isec);
continue;		continue;
}		}

for (auto it = isec->relocs.begin(); it != isec->relocs.end(); ++it) {		for (auto it = isec->relocs.begin(); it != isec->relocs.end(); ++it) {
Reloc &r = *it;		Reloc &r = *it;
if (target->hasAttr(r.type, RelocAttrBits::SUBTRAHEND)) {		if (hasRelocAttr(r.type, RelocAttrBits::SUBTRAHEND)) {
// Skip over the following UNSIGNED relocation -- it's just there as the		// Skip over the following UNSIGNED relocation -- it's just there as the
// minuend, and doesn't have the usual UNSIGNED semantics. We don't want		// minuend, and doesn't have the usual UNSIGNED semantics. We don't want
// to emit rebase opcodes for it.		// to emit rebase opcodes for it.
it = std::next(it);		it = std::next(it);
assert(isa<Defined>(it->referent.dyn_cast<lld::macho::Symbol *>()));		assert(isa<Defined>(it->referent.dyn_cast<lld::macho::Symbol *>()));
continue;		continue;
}		}
if (auto *sym = r.referent.dyn_cast<lld::macho::Symbol *>()) {		if (auto *sym = r.referent.dyn_cast<lld::macho::Symbol *>()) {
[435 lines not shown]