This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/MachO/
-
MachO/
-
ConcatOutputSection.h
-
ConcatOutputSection.cpp
3/3
Driver.cpp
-
ICF.h
-
ICF.cpp
-
InputSection.h
-
InputSection.cpp
-
MarkLive.cpp
-
SyntheticSections.h
1/1
SyntheticSections.cpp
-
UnwindInfoSection.h
1/2
UnwindInfoSection.cpp
2/2
Writer.cpp

Differential D105044

[lld-macho] Move ICF earlier to avoid emitting redundant binds
ClosedPublic

Authored by int3 on Jun 28 2021, 11:34 AM.

Download Raw Diff

Details

Reviewers

gkm
oontvoo

Group Reviewers

Restricted Project

Commits

rG3a11528d97a7: [lld-macho] Move ICF earlier to avoid emitting redundant binds

Summary

This is a pretty big refactoring diff, so here are the motivations:

Previously, ICF ran after scanRelocations(), where we emitting
bind/rebase opcodes etc. So we had a bunch of redundant leftovers after
ICF. Having ICF run before Writer seems like a better design, and is
what LLD-ELF does, so this diff refactors it accordingly.

However, ICF had two dependencies on things occurring in Writer: 1) it
needs literals to be deduplicated beforehand and 2) it needs to know
which functions have unwind info, which was being handled by
UnwindInfoSection::prepareRelocations().

In order to do literal deduplication earlier, we need to add literal
input sections to their corresponding output sections. So instead of
putting all input sections into the big inputSections vector, and then
filtering them by type later on, I've changed things so that literal
sections get added directly to their output sections during the 'gather'
phase. Likewise for compact unwind sections -- they get added directly
to the UnwindInfoSection now. This latter change is not strictly
necessary, but makes it easier for ICF to determine which functions have
unwind info.

Adding literal sections directly to their output sections means that we
can no longer determine inputOrder from iterating over
inputSections. Instead, we store that order explicitly on
InputSection. Bloating the size of InputSection for this purpose would
be unfortunate -- but LLD-ELF has already solved this problem: it reuses
outSecOff to store this order value.

One downside of this refactor is that we now make an additional pass
over the unwind info relocations to figure out which functions have
unwind info, since want to know that before processRelocations(). I've
made sure to run that extra loop only if ICF is enabled, so there should
be no overhead in non-optimizing runs of the linker.

The upside of all this is that the inputSections vector now contains
only ConcatInputSections that are destined for ConcatOutputSections, so
we can clean up a bunch of code that just existed to filter out other
elements from that vector.

I will test for the lack of redundant binds/rebases in the upcoming
cfstring deduplication diff. While binds/rebases can also happen in the
regular .text section, they're more common in .data sections, so it
seems more natural to test it that way.

This change is perf-neutral when linking chromium_framework.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

int3 created this revision.Jun 28 2021, 11:34 AM

Herald added a reviewer: gkm. · View Herald TranscriptJun 28 2021, 11:34 AM

Herald added a project: Restricted Project. · View Herald Transcript

int3 requested review of this revision.Jun 28 2021, 11:34 AM

Herald added a project: Restricted Project. · View Herald TranscriptJun 28 2021, 11:34 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B111326: Diff 354958.Jun 28 2021, 11:34 AM

int3 added inline comments.Jun 28 2021, 4:41 PM

lld/MachO/SyntheticSections.cpp
778	another benefit of moving ICF earlier: the lack of canonicalization is now obvious, since the `parent` point of coalesced InputSections is null.

int3 planned changes to this revision.Jun 28 2021, 10:16 PM

change how we implement inputOrder

int3 mentioned this in D104671: [lld-macho] Extend ICF to literal sections.Jun 28 2021, 10:37 PM

Harbormaster completed remote builds in B111443: Diff 355114.Jun 28 2021, 11:11 PM

oontvoo added a subscriber: oontvoo.Jun 30 2021, 9:26 AM

oontvoo added inline comments.

lld/MachO/Driver.cpp
994	(clang-tidy suggestion)
1002	why not leave it as 0-based rather than 1? not saying it's wrong ... just curious :)
lld/MachO/UnwindInfoSection.cpp
133	why "4"? is this not platform dependent?
lld/MachO/Writer.cpp
615
894	nit: just for consistency

int3 mentioned this in D105075: [lld-macho] Only emit one BIND_OPCODE_SET_SYMBOL per symbol.Jun 30 2021, 2:18 PM

int3 added inline comments.Jun 30 2021, 6:59 PM

lld/MachO/Driver.cpp
1002	well some reviewers seem to dislike post-increments :) either a zero- or 1-based index works here, but yeah, maybe keeping it zero-based will be less surprising. I'll change it back
lld/MachO/UnwindInfoSection.cpp
133	this is existing code, I think @gkm wrote this

address comments

Harbormaster completed remote builds in B111897: Diff 355752.Jun 30 2021, 8:13 PM

int3 added a child revision: D105045: [lld-macho] Deduplicate CFStrings.Jun 30 2021, 9:51 PM

int3 edited the summary of this revision. (Show Details)Jul 1 2021, 6:30 AM

LGTM - thanks! (but still curious about the hard-coded alignment)

This revision is now accepted and ready to land.Jul 1 2021, 2:47 PM

A quick check of ld64's output shows that __unwind_info is aligned to 4 across x86_64/x86/arm64/armv7, so I think we're good here :)

Closed by commit rG3a11528d97a7: [lld-macho] Move ICF earlier to avoid emitting redundant binds (authored by int3). · Explain WhyJul 1 2021, 6:23 PM

This revision was automatically updated to reflect the committed changes.

int3 added a commit: rG3a11528d97a7: [lld-macho] Move ICF earlier to avoid emitting redundant binds.

int3 mentioned this in D106214: [lld-macho] ICF: Fold some sections with differing addends.Jul 16 2021, 11:42 PM

Revision Contents

Path

Size

lld/

MachO/

ConcatOutputSection.h

1 line

ConcatOutputSection.cpp

9 lines

66 lines

21 lines

78 lines

9 lines

12 lines

58 lines

3 lines

SyntheticSections.cpp

72 lines

UnwindInfoSection.h

20 lines

UnwindInfoSection.cpp

32 lines

Writer.cpp

119 lines

Diff 356067

lld/MachO/ConcatOutputSection.h

Show All 34 Lines	public:
// These accessors will only be valid after finalizing the section		// These accessors will only be valid after finalizing the section
uint64_t getSize() const override { return size; }		uint64_t getSize() const override { return size; }
uint64_t getFileSize() const override { return fileSize; }		uint64_t getFileSize() const override { return fileSize; }

void addInput(ConcatInputSection *input);		void addInput(ConcatInputSection *input);
void finalize() override;		void finalize() override;
bool needsThunks() const;		bool needsThunks() const;
uint64_t estimateStubsInRangeVA(size_t callIdx) const;		uint64_t estimateStubsInRangeVA(size_t callIdx) const;
void eraseOmittedInputSections();

void writeTo(uint8_t *buf) const override;		void writeTo(uint8_t *buf) const override;

std::vector<ConcatInputSection *> inputs;		std::vector<ConcatInputSection *> inputs;
std::vector<ConcatInputSection *> thunks;		std::vector<ConcatInputSection *> thunks;

static bool classof(const OutputSection *sec) {		static bool classof(const OutputSection *sec) {
return sec->kind() == ConcatKind;		return sec->kind() == ConcatKind;
Show All 37 Lines

lld/MachO/ConcatOutputSection.cpp

Show First 20 Lines • Show All 349 Lines • ▼ Show 20 Lines	void ConcatOutputSection::finalizeFlags(InputSection *input) {
case S_THREAD_LOCAL_INIT_FUNCTION_POINTERS:		case S_THREAD_LOCAL_INIT_FUNCTION_POINTERS:
case S_THREAD_LOCAL_VARIABLE_POINTERS:		case S_THREAD_LOCAL_VARIABLE_POINTERS:
case S_NON_LAZY_SYMBOL_POINTERS:		case S_NON_LAZY_SYMBOL_POINTERS:
case S_SYMBOL_STUBS:		case S_SYMBOL_STUBS:
flags \|= input->flags;		flags \|= input->flags;
break;		break;
}		}
}		}

void ConcatOutputSection::eraseOmittedInputSections() {
// Remove the duplicates from inputs
inputs.erase(std::remove_if(inputs.begin(), inputs.end(),
[](const ConcatInputSection *isec) -> bool {
return isec->shouldOmitFromOutput();
}),
inputs.end());
}

lld/MachO/Driver.cpp

//===- Driver.cpp ---------------------------------------------------------===//

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

//===----------------------------------------------------------------------===//

#include "Driver.h"

#include "Config.h"

#include "ICF.h"

#include "InputFiles.h"

#include "LTO.h"

#include "MarkLive.h"

#include "ObjC.h"

#include "OutputSection.h"

#include "OutputSegment.h"

#include "SymbolTable.h"

#include "Symbols.h"

#include "SyntheticSections.h"

#include "Target.h"

#include "UnwindInfoSection.h"

#include "Writer.h"

#include "lld/Common/Args.h"

#include "lld/Common/Driver.h"

#include "lld/Common/ErrorHandler.h"

#include "lld/Common/LLVM.h"

#include "lld/Common/Memory.h"

#include "lld/Common/Reproduce.h"

▲ Show 20 Lines • Show All 949 Lines • ▼ Show 20 Lines

case OPT_weak_framework:

opt.getID() == OPT_reexport_framework, /*isExplicit=*/true);

break;

default:

break;

}

static void gatherInputSections() {

TimeTraceScope timeScope("Gathering input sections");

int inputOrder = 0;

for (const InputFile *file : inputFiles) {

for (const SubsectionMap &map : file->subsections) {

for (const SubsectionEntry &entry : map) {

if (auto *isec = dyn_cast<ConcatInputSection>(entry.isec)) {

oontvooUnsubmitted

Done

for (const SubsectionEntry &entry : map) {

- if (auto isec = dyn_cast<ConcatInputSection>(entry.isec)) {

+ if (auto *isec = dyn_cast<ConcatInputSection>(entry.isec)) {

if (isec->isCoalescedWeak())

(clang-tidy suggestion)

oontvoo: (clang-tidy suggestion)

if (isec->isCoalescedWeak())

continue;

if (isec->segname == segment_names::ld) {

assert(isec->name == section_names::compactUnwind);

in.unwindInfo->addInput(isec);

continue;

}

isec->outSecOff = inputOrder++;

oontvooUnsubmitted

Done

why not leave it as 0-based rather than 1?

not saying it's wrong ... just curious :)

oontvoo: why not leave it as 0-based rather than 1? not saying it's wrong ... just curious :)

int3AuthorUnsubmitted

Done

well some reviewers seem to dislike post-increments :)

either a zero- or 1-based index works here, but yeah, maybe keeping it zero-based will be less surprising. I'll change it back

int3: well some reviewers seem to dislike post-increments :) either a zero- or 1-based index works…

inputSections.push_back(isec);

} else if (auto *isec = dyn_cast<CStringInputSection>(entry.isec)) {

if (in.cStringSection->inputOrder == UnspecifiedInputOrder)

in.cStringSection->inputOrder = inputOrder++;

in.cStringSection->addInput(isec);

} else if (auto *isec = dyn_cast<WordLiteralInputSection>(entry.isec)) {

if (in.wordLiteralSection->inputOrder == UnspecifiedInputOrder)

in.wordLiteralSection->inputOrder = inputOrder++;

in.wordLiteralSection->addInput(isec);

} else {

llvm_unreachable("unexpected input section kind");

}

assert(inputOrder <= UnspecifiedInputOrder);

}

static void foldIdenticalLiterals() {

// We always create a cStringSection, regardless of whether dedupLiterals is

// true. If it isn't, we simply create a non-deduplicating CStringSection.

// Either way, we must unconditionally finalize it here.

in.cStringSection->finalizeContents();

if (in.wordLiteralSection)

in.wordLiteralSection->finalizeContents();

}

bool macho::link(ArrayRef<const char *> argsArr, bool canExitEarly,

raw_ostream &stdoutOS, raw_ostream &stderrOS) {

lld::stdoutOS = &stdoutOS;

lld::stderrOS = &stderrOS;

errorHandler().cleanupCallback = []() { freeArena(); };

errorHandler().logName = args::getFilenameWithoutExe(argsArr[0]);

▲ Show 20 Lines • Show All 345 Lines • ▼ Show 20 Lines

for (const Arg *arg : args.filtered(OPT_sectcreate)) {

StringRef segName = arg->getValue(0);

StringRef sectName = arg->getValue(1);

StringRef fileName = arg->getValue(2);

Optional<MemoryBufferRef> buffer = readFile(fileName);

if (buffer)

inputFiles.insert(make<OpaqueFile>(*buffer, segName, sectName));

}

{

gatherInputSections();

TimeTraceScope timeScope("Gathering input sections");

// Gather all InputSections into one vector.

for (const InputFile *file : inputFiles) {

for (const SubsectionMap &map : file->subsections) {

for (const SubsectionEntry &entry : map) {

if (auto concatIsec = dyn_cast<ConcatInputSection>(entry.isec))

if (concatIsec->isCoalescedWeak())

continue;

inputSections.push_back(entry.isec);

}

assert(inputSections.size() < UnspecifiedInputOrder);

}

if (config->deadStrip)

markLive();

// ICF assumes that all literals have been folded already, so we must run

// foldIdenticalLiterals before foldIdenticalSections.

foldIdenticalLiterals();

if (config->icfLevel != ICFLevel::none)

foldIdenticalSections();

// Write to an output file.

if (target->wordSize == 8)

writeResult<LP64>();

else

writeResult<ILP32>();

depTracker->write(getLLDVersion(), inputFiles, config->outputFile);

}

Show All 17 Lines

lld/MachO/ICF.h

	Show All 9 Lines
	#define LLD_MACHO_ICF_H			#define LLD_MACHO_ICF_H

	#include "lld/Common/LLVM.h"			#include "lld/Common/LLVM.h"
	#include <vector>			#include <vector>

	namespace lld {			namespace lld {
	namespace macho {			namespace macho {

	class ConcatInputSection;			void foldIdenticalSections();

	class ICF {
	public:
	ICF(std::vector<ConcatInputSection *> &inputs);

	void run();
	void segregate(size_t begin, size_t end,
	std::function<bool(const ConcatInputSection *,
	const ConcatInputSection *)>
	equals);
	size_t findBoundary(size_t begin, size_t end);
	void forEachClassRange(size_t begin, size_t end,
	std::function<void(size_t, size_t)> func);
	void forEachClass(std::function<void(size_t, size_t)> func);

	// ICF needs a copy of the inputs vector because its equivalence-class
	// segregation algorithm destroys the proper sequence.
	std::vector<ConcatInputSection *> icfInputs;
	};

	} // namespace macho			} // namespace macho
	} // namespace lld			} // namespace lld

	#endif			#endif

lld/MachO/ICF.cpp

//===- ICF.cpp ------------------------------------------------------------===//		//===- ICF.cpp ------------------------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "ICF.h"		#include "ICF.h"
#include "ConcatOutputSection.h"		#include "ConcatOutputSection.h"
#include "InputSection.h"		#include "InputSection.h"
#include "Symbols.h"		#include "Symbols.h"
		#include "UnwindInfoSection.h"

#include "llvm/Support/Parallel.h"		#include "llvm/Support/Parallel.h"
		#include "llvm/Support/TimeProfiler.h"

#include <atomic>		#include <atomic>

using namespace llvm;		using namespace llvm;
using namespace lld;		using namespace lld;
using namespace lld::macho;		using namespace lld::macho;

		class ICF {
		public:
		ICF(std::vector<ConcatInputSection *> &inputs);

		void run();
		void segregate(size_t begin, size_t end,
		std::function<bool(const ConcatInputSection *,
		const ConcatInputSection *)>
		equals);
		size_t findBoundary(size_t begin, size_t end);
		void forEachClassRange(size_t begin, size_t end,
		std::function<void(size_t, size_t)> func);
		void forEachClass(std::function<void(size_t, size_t)> func);

		// ICF needs a copy of the inputs vector because its equivalence-class
		// segregation algorithm destroys the proper sequence.
		std::vector<ConcatInputSection *> icfInputs;
		};

ICF::ICF(std::vector<ConcatInputSection *> &inputs) {		ICF::ICF(std::vector<ConcatInputSection *> &inputs) {
icfInputs.assign(inputs.begin(), inputs.end());		icfInputs.assign(inputs.begin(), inputs.end());
}		}

// ICF = Identical Code Folding		// ICF = Identical Code Folding
//		//
// We only fold __TEXT,__text, so this is really "code" folding, and not		// We only fold __TEXT,__text, so this is really "code" folding, and not
// "COMDAT" folding. String and scalar constant literals are deduplicated		// "COMDAT" folding. String and scalar constant literals are deduplicated
▲ Show 20 Lines • Show All 242 Lines • ▼ Show 20 Lines	while (begin < end) {

// If we created a group, we need to iterate the main loop again.		// If we created a group, we need to iterate the main loop again.
if (mid != end)		if (mid != end)
icfRepeat = true;		icfRepeat = true;

begin = mid;		begin = mid;
}		}
}		}

		template <class Ptr>
		DenseSet<const InputSection *> findFunctionsWithUnwindInfo() {
		DenseSet<const InputSection *> result;
		for (ConcatInputSection *isec : in.unwindInfo->getInputs()) {
		for (size_t i = 0; i < isec->relocs.size(); ++i) {
		Reloc &r = isec->relocs[i];
		assert(target->hasAttr(r.type, RelocAttrBits::UNSIGNED));
		if (r.offset % sizeof(CompactUnwindEntry<Ptr>) !=
		offsetof(CompactUnwindEntry<Ptr>, functionAddress))
		continue;
		result.insert(r.referent.get<InputSection *>());
		}
		}
		return result;
		}

		void macho::foldIdenticalSections() {
		TimeTraceScope timeScope("Fold Identical Code Sections");
		// The ICF equivalence-class segregation algorithm relies on pre-computed
		// hashes of InputSection::data for the ConcatOutputSection::inputs and all
		// sections referenced by their relocs. We could recursively traverse the
		// relocs to find every referenced InputSection, but that precludes easy
		// parallelization. Therefore, we hash every InputSection here where we have
		// them all accessible as simple vectors.
		std::vector<ConcatInputSection *> codeSections;

		// ICF can't fold functions with unwind info
		DenseSet<const InputSection *> functionsWithUnwindInfo =
		target->wordSize == 8 ? findFunctionsWithUnwindInfo<uint64_t>()
		: findFunctionsWithUnwindInfo<uint32_t>();

		// If an InputSection is ineligible for ICF, we give it a unique ID to force
		// it into an unfoldable singleton equivalence class. Begin the unique-ID
		// space at inputSections.size(), so that it will never intersect with
		// equivalence-class IDs which begin at 0. Since hashes & unique IDs never
		// coexist with equivalence-class IDs, this is not necessary, but might help
		// someone keep the numbers straight in case we ever need to debug the
		// ICF::segregate()
		uint64_t icfUniqueID = inputSections.size();
		for (ConcatInputSection *isec : inputSections) {
		bool isHashable = isCodeSection(isec) && !isec->shouldOmitFromOutput() &&
		!functionsWithUnwindInfo.contains(isec) &&
		isec->isHashableForICF();
		if (isHashable) {
		codeSections.push_back(isec);
		} else {
		isec->icfEqClass[0] = ++icfUniqueID;
		}
		}
		parallelForEach(codeSections,
		[](ConcatInputSection *isec) { isec->hashForICF(); });
		// Now that every input section is either hashed or marked as unique, run the
		// segregation algorithm to detect foldable subsections.
		ICF(codeSections).run();
		}

lld/MachO/InputSection.h

Show First 20 Lines • Show All 89 Lines • ▼ Show 20 Lines	public:

uint64_t getOffset(uint64_t off) const override { return outSecOff + off; }		uint64_t getOffset(uint64_t off) const override { return outSecOff + off; }
uint64_t getVA() const { return InputSection::getVA(0); }		uint64_t getVA() const { return InputSection::getVA(0); }
// ConcatInputSections are entirely live or dead, so the offset is irrelevant.		// ConcatInputSections are entirely live or dead, so the offset is irrelevant.
bool isLive(uint64_t off) const override { return live; }		bool isLive(uint64_t off) const override { return live; }
void markLive(uint64_t off) override { live = true; }		void markLive(uint64_t off) override { live = true; }
bool isCoalescedWeak() const { return wasCoalesced && numRefs == 0; }		bool isCoalescedWeak() const { return wasCoalesced && numRefs == 0; }
bool shouldOmitFromOutput() const { return !live \|\| isCoalescedWeak(); }		bool shouldOmitFromOutput() const { return !live \|\| isCoalescedWeak(); }
bool isHashableForICF(bool isText) const;		bool isHashableForICF() const;
void hashForICF();		void hashForICF();
void writeTo(uint8_t *buf);		void writeTo(uint8_t *buf);

void foldIdentical(ConcatInputSection *redundant);		void foldIdentical(ConcatInputSection *redundant);
InputSection *canonical() override {		InputSection *canonical() override {
return replacement ? replacement : this;		return replacement ? replacement : this;
}		}

static bool classof(const InputSection *isec) {		static bool classof(const InputSection *isec) {
return isec->kind() == ConcatKind;		return isec->kind() == ConcatKind;
}		}

// ICF can't fold functions with LSDA+personality
bool hasPersonality = false;
// Points to the surviving section after this one is folded by ICF		// Points to the surviving section after this one is folded by ICF
InputSection *replacement = nullptr;		InputSection *replacement = nullptr;
// Equivalence-class ID for ICF		// Equivalence-class ID for ICF
uint64_t icfEqClass[2] = {0, 0};		uint64_t icfEqClass[2] = {0, 0};

// With subsections_via_symbols, most symbols have their own InputSection,		// With subsections_via_symbols, most symbols have their own InputSection,
// and for weak symbols (e.g. from inline functions), only the		// and for weak symbols (e.g. from inline functions), only the
// InputSection from one translation unit will make it to the output,		// InputSection from one translation unit will make it to the output,
// while all copies in other translation units are coalesced into the		// while all copies in other translation units are coalesced into the
// first and not copied to the output.		// first and not copied to the output.
bool wasCoalesced = false;		bool wasCoalesced = false;
bool live = !config->deadStrip;		bool live = !config->deadStrip;
// How many symbols refer to this InputSection.		// How many symbols refer to this InputSection.
uint32_t numRefs = 0;		uint32_t numRefs = 0;
		// This variable has two usages. Initially, it represents the input order.
		// After assignAddresses is called, it represents the offset from the
		// beginning of the output section this section was assigned to.
uint64_t outSecOff = 0;		uint64_t outSecOff = 0;
};		};

// Helper functions to make it easy to sprinkle asserts.		// Helper functions to make it easy to sprinkle asserts.

inline bool shouldOmitFromOutput(InputSection *isec) {		inline bool shouldOmitFromOutput(InputSection *isec) {
return isa<ConcatInputSection>(isec) &&		return isa<ConcatInputSection>(isec) &&
cast<ConcatInputSection>(isec)->shouldOmitFromOutput();		cast<ConcatInputSection>(isec)->shouldOmitFromOutput();
▲ Show 20 Lines • Show All 116 Lines • ▼ Show 20 Lines
inline bool isWordLiteralSection(uint32_t flags) {		inline bool isWordLiteralSection(uint32_t flags) {
return sectionType(flags) == llvm::MachO::S_4BYTE_LITERALS \|\|		return sectionType(flags) == llvm::MachO::S_4BYTE_LITERALS \|\|
sectionType(flags) == llvm::MachO::S_8BYTE_LITERALS \|\|		sectionType(flags) == llvm::MachO::S_8BYTE_LITERALS \|\|
sectionType(flags) == llvm::MachO::S_16BYTE_LITERALS;		sectionType(flags) == llvm::MachO::S_16BYTE_LITERALS;
}		}

bool isCodeSection(const InputSection *);		bool isCodeSection(const InputSection *);

extern std::vector<InputSection *> inputSections;		extern std::vector<ConcatInputSection *> inputSections;

namespace section_names {		namespace section_names {

constexpr const char authGot[] = "__auth_got";		constexpr const char authGot[] = "__auth_got";
constexpr const char authPtr[] = "__auth_ptr";		constexpr const char authPtr[] = "__auth_ptr";
constexpr const char binding[] = "__binding";		constexpr const char binding[] = "__binding";
constexpr const char bitcodeBundle[] = "__bundle";		constexpr const char bitcodeBundle[] = "__bundle";
constexpr const char cString[] = "__cstring";		constexpr const char cString[] = "__cstring";
▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

lld/MachO/InputSection.cpp

	Show All 19 Lines
	#include "llvm/Support/xxhash.h"			#include "llvm/Support/xxhash.h"

	using namespace llvm;			using namespace llvm;
	using namespace llvm::MachO;			using namespace llvm::MachO;
	using namespace llvm::support;			using namespace llvm::support;
	using namespace lld;			using namespace lld;
	using namespace lld::macho;			using namespace lld::macho;

	std::vector<InputSection *> macho::inputSections;			std::vector<ConcatInputSection *> macho::inputSections;

	uint64_t InputSection::getFileSize() const {			uint64_t InputSection::getFileSize() const {
	return isZeroFill(flags) ? 0 : getSize();			return isZeroFill(flags) ? 0 : getSize();
	}			}

	uint64_t InputSection::getVA(uint64_t off) const {			uint64_t InputSection::getVA(uint64_t off) const {
	return parent->addr + getOffset(off);			return parent->addr + getOffset(off);
	}			}

	static uint64_t resolveSymbolVA(const Symbol *sym, uint8_t type) {			static uint64_t resolveSymbolVA(const Symbol *sym, uint8_t type) {
	const RelocAttrs &relocAttrs = target->getRelocAttrs(type);			const RelocAttrs &relocAttrs = target->getRelocAttrs(type);
	if (relocAttrs.hasAttr(RelocAttrBits::BRANCH))			if (relocAttrs.hasAttr(RelocAttrBits::BRANCH))
	return sym->resolveBranchVA();			return sym->resolveBranchVA();
	else if (relocAttrs.hasAttr(RelocAttrBits::GOT))			else if (relocAttrs.hasAttr(RelocAttrBits::GOT))
	return sym->resolveGotVA();			return sym->resolveGotVA();
	else if (relocAttrs.hasAttr(RelocAttrBits::TLV))			else if (relocAttrs.hasAttr(RelocAttrBits::TLV))
	return sym->resolveTlvVA();			return sym->resolveTlvVA();
	return sym->getVA();			return sym->getVA();
	}			}

	// ICF needs to hash any section that might potentially be duplicated so			// ICF needs to hash any section that might potentially be duplicated so
	// that it can match on content rather than identity.			// that it can match on content rather than identity.
	bool ConcatInputSection::isHashableForICF(bool isText) const {			bool ConcatInputSection::isHashableForICF() const {
	if (shouldOmitFromOutput())
	return false;
	switch (sectionType(flags)) {			switch (sectionType(flags)) {
	case S_REGULAR:			case S_REGULAR:
	if (isText)			return true;
	return !hasPersonality;
	// One might hope that we could hash __TEXT,__const subsections to fold
	// references to duplicated values, but alas, many tests fail.
	return false;
	case S_CSTRING_LITERALS:			case S_CSTRING_LITERALS:
	case S_4BYTE_LITERALS:			case S_4BYTE_LITERALS:
	case S_8BYTE_LITERALS:			case S_8BYTE_LITERALS:
	case S_16BYTE_LITERALS:			case S_16BYTE_LITERALS:
	case S_LITERAL_POINTERS:			case S_LITERAL_POINTERS:
	llvm_unreachable("found unexpected literal type in ConcatInputSection");			llvm_unreachable("found unexpected literal type in ConcatInputSection");
	case S_ZEROFILL:			case S_ZEROFILL:
	case S_GB_ZEROFILL:			case S_GB_ZEROFILL:
	▲ Show 20 Lines • Show All 175 Lines • Show Last 20 Lines

lld/MachO/MarkLive.cpp

Show First 20 Lines • Show All 95 Lines • ▼ Show 20 Lines	for (const InputFile *file : inputFiles)
if (auto *objFile = dyn_cast<ObjFile>(file))		if (auto *objFile = dyn_cast<ObjFile>(file))
for (Symbol *sym : objFile->symbols)		for (Symbol *sym : objFile->symbols)
if (auto *defined = dyn_cast_or_null<Defined>(sym))		if (auto *defined = dyn_cast_or_null<Defined>(sym))
if (!defined->isExternal() && defined->noDeadStrip)		if (!defined->isExternal() && defined->noDeadStrip)
addSym(defined);		addSym(defined);
if (auto *stubBinder =		if (auto *stubBinder =
dyn_cast_or_null<DylibSymbol>(symtab->find("dyld_stub_binder")))		dyn_cast_or_null<DylibSymbol>(symtab->find("dyld_stub_binder")))
addSym(stubBinder);		addSym(stubBinder);
for (InputSection *isec : inputSections) {		for (ConcatInputSection *isec : inputSections) {
// Sections marked no_dead_strip		// Sections marked no_dead_strip
if (isec->flags & S_ATTR_NO_DEAD_STRIP) {		if (isec->flags & S_ATTR_NO_DEAD_STRIP) {
assert(isa<ConcatInputSection>(isec));
enqueue(isec, 0);		enqueue(isec, 0);
continue;		continue;
}		}

// mod_init_funcs, mod_term_funcs sections		// mod_init_funcs, mod_term_funcs sections
if (sectionType(isec->flags) == S_MOD_INIT_FUNC_POINTERS \|\|		if (sectionType(isec->flags) == S_MOD_INIT_FUNC_POINTERS \|\|
sectionType(isec->flags) == S_MOD_TERM_FUNC_POINTERS) {		sectionType(isec->flags) == S_MOD_TERM_FUNC_POINTERS) {
assert(isa<ConcatInputSection>(isec));
enqueue(isec, 0);		enqueue(isec, 0);
continue;		continue;
}		}
		}

// Dead strip runs before UnwindInfoSection handling so we need to keep		// Dead strip runs before UnwindInfoSection handling so we need to keep
// __LD,__compact_unwind alive here.		// __LD,__compact_unwind alive here.
// But that section contains absolute references to __TEXT,__text and		// But that section contains absolute references to __TEXT,__text and
// keeps most code alive due to that. So we can't just enqueue() the		// keeps most code alive due to that. So we can't just enqueue() the
// section: We must skip the relocations for the functionAddress		// section: We must skip the relocations for the functionAddress
// in each CompactUnwindEntry.		// in each CompactUnwindEntry.
// See also scanEhFrameSection() in lld/ELF/MarkLive.cpp.		// See also scanEhFrameSection() in lld/ELF/MarkLive.cpp.
if (isec->segname == segment_names::ld &&		for (ConcatInputSection *isec : in.unwindInfo->getInputs()) {
isec->name == section_names::compactUnwind) {		isec->live = true;
auto concatIsec = cast<ConcatInputSection>(isec);
concatIsec->live = true;
const int compactUnwindEntrySize =		const int compactUnwindEntrySize =
target->wordSize == 8 ? sizeof(CompactUnwindEntry<uint64_t>)		target->wordSize == 8 ? sizeof(CompactUnwindEntry<uint64_t>)
: sizeof(CompactUnwindEntry<uint32_t>);		: sizeof(CompactUnwindEntry<uint32_t>);
for (const Reloc &r : isec->relocs) {		for (const Reloc &r : isec->relocs) {
// This is the relocation for the address of the function itself.		// This is the relocation for the address of the function itself.
// Ignore it, else these would keep everything alive.		// Ignore it, else these would keep everything alive.
if (r.offset % compactUnwindEntrySize == 0)		if (r.offset % compactUnwindEntrySize == 0)
continue;		continue;

if (auto s = r.referent.dyn_cast<Symbol >())		if (auto s = r.referent.dyn_cast<Symbol >())
addSym(s);		addSym(s);
else		else
enqueue(r.referent.get<InputSection *>(), r.addend);		enqueue(r.referent.get<InputSection *>(), r.addend);
}		}
continue;
}
}		}

do {		do {
// Mark things reachable from GC roots as live.		// Mark things reachable from GC roots as live.
while (!worklist.empty()) {		while (!worklist.empty()) {
ConcatInputSection *s = worklist.pop_back_val();		ConcatInputSection *s = worklist.pop_back_val();
assert(s->live && "We mark as live when pushing onto the worklist!");		assert(s->live && "We mark as live when pushing onto the worklist!");

// Mark all symbols listed in the relocation table for this section.		// Mark all symbols listed in the relocation table for this section.
for (const Reloc &r : s->relocs) {		for (const Reloc &r : s->relocs) {
if (auto s = r.referent.dyn_cast<Symbol >())		if (auto s = r.referent.dyn_cast<Symbol >())
addSym(s);		addSym(s);
else		else
enqueue(r.referent.get<InputSection *>(), r.addend);		enqueue(r.referent.get<InputSection *>(), r.addend);
}		}
}		}

// S_ATTR_LIVE_SUPPORT sections are live if they point _to_ a live section.		// S_ATTR_LIVE_SUPPORT sections are live if they point _to_ a live section.
// Process them in a second pass.		// Process them in a second pass.
for (InputSection *isec : inputSections) {		for (ConcatInputSection *isec : inputSections) {
if (!isa<ConcatInputSection>(isec))
continue;
auto concatIsec = cast<ConcatInputSection>(isec);
// FIXME: Check if copying all S_ATTR_LIVE_SUPPORT sections into a		// FIXME: Check if copying all S_ATTR_LIVE_SUPPORT sections into a
// separate vector and only walking that here is faster.		// separate vector and only walking that here is faster.
if (!(concatIsec->flags & S_ATTR_LIVE_SUPPORT) \|\| concatIsec->live)		if (!(isec->flags & S_ATTR_LIVE_SUPPORT) \|\| isec->live)
continue;		continue;

for (const Reloc &r : isec->relocs) {		for (const Reloc &r : isec->relocs) {
bool referentLive;		bool referentLive;
if (auto s = r.referent.dyn_cast<Symbol >())		if (auto s = r.referent.dyn_cast<Symbol >())
referentLive = s->isLive();		referentLive = s->isLive();
else		else
referentLive = r.referent.get<InputSection *>()->isLive(r.addend);		referentLive = r.referent.get<InputSection *>()->isLive(r.addend);
Show All 14 Lines

lld/MachO/SyntheticSections.h

Show First 20 Lines • Show All 551 Lines • ▼ Show 20 Lines
public:		public:
using UInt128 = std::pair<uint64_t, uint64_t>;		using UInt128 = std::pair<uint64_t, uint64_t>;
// I don't think the standard guarantees the size of a pair, so let's make		// I don't think the standard guarantees the size of a pair, so let's make
// sure it's exact -- that way we can construct it via `mmap`.		// sure it's exact -- that way we can construct it via `mmap`.
static_assert(sizeof(UInt128) == 16, "");		static_assert(sizeof(UInt128) == 16, "");

WordLiteralSection();		WordLiteralSection();
void addInput(WordLiteralInputSection *);		void addInput(WordLiteralInputSection *);
		void finalizeContents();
void writeTo(uint8_t *buf) const override;		void writeTo(uint8_t *buf) const override;

uint64_t getSize() const override {		uint64_t getSize() const override {
return literal16Map.size() * 16 + literal8Map.size() * 8 +		return literal16Map.size() * 16 + literal8Map.size() * 8 +
literal4Map.size() * 4;		literal4Map.size() * 4;
}		}

bool isNeeded() const override {		bool isNeeded() const override {
Show All 11 Lines	public:
}		}

uint64_t getLiteral4Offset(const uint8_t *buf) const {		uint64_t getLiteral4Offset(const uint8_t *buf) const {
return literal16Map.size() * 16 + literal8Map.size() * 8 +		return literal16Map.size() * 16 + literal8Map.size() * 8 +
literal4Map.at(reinterpret_cast<const uint32_t >(buf)) * 4;		literal4Map.at(reinterpret_cast<const uint32_t >(buf)) * 4;
}		}

private:		private:
		std::vector<WordLiteralInputSection *> inputs;

template <class T> struct Hasher {		template <class T> struct Hasher {
llvm::hash_code operator()(T v) const { return llvm::hash_value(v); }		llvm::hash_code operator()(T v) const { return llvm::hash_value(v); }
};		};
// We're using unordered_map instead of DenseMap here because we need to		// We're using unordered_map instead of DenseMap here because we need to
// support all possible integer values -- there are no suitable tombstone		// support all possible integer values -- there are no suitable tombstone
// values for DenseMap.		// values for DenseMap.
std::unordered_map<UInt128, uint64_t, Hasher<UInt128>> literal16Map;		std::unordered_map<UInt128, uint64_t, Hasher<UInt128>> literal16Map;
std::unordered_map<uint64_t, uint64_t> literal8Map;		std::unordered_map<uint64_t, uint64_t> literal8Map;
Show All 30 Lines

lld/MachO/SyntheticSections.cpp

Show First 20 Lines • Show All 769 Lines • ▼ Show 20 Lines	if (lastFile == nullptr \|\| lastFile != file) {
emitEndSourceStab();		emitEndSourceStab();
lastFile = file;		lastFile = file;

emitBeginSourceStab(file->compileUnit);		emitBeginSourceStab(file->compileUnit);
emitObjectFileStab(file);		emitObjectFileStab(file);
}		}

StabsEntry symStab;		StabsEntry symStab;
symStab.sect = defined->isec->parent->index;		symStab.sect = defined->isec->canonical()->parent->index;
		int3AuthorUnsubmitted Done Reply Inline Actions another benefit of moving ICF earlier: the lack of canonicalization is now obvious, since the `parent` point of coalesced InputSections is null. int3: another benefit of moving ICF earlier: the lack of canonicalization is now obvious, since the…
symStab.strx = stringTableSection.addString(defined->getName());		symStab.strx = stringTableSection.addString(defined->getName());
symStab.value = defined->getVA();		symStab.value = defined->getVA();

if (isCodeSection(isec)) {		if (isCodeSection(isec)) {
symStab.type = N_FUN;		symStab.type = N_FUN;
stabs.emplace_back(std::move(symStab));		stabs.emplace_back(std::move(symStab));
emitEndFunStab(defined);		emitEndFunStab(defined);
} else {		} else {
▲ Show 20 Lines • Show All 108 Lines • ▼ Show 20 Lines	if (auto *defined = dyn_cast<Defined>(entry.sym)) {
}		}

if (defined->isAbsolute()) {		if (defined->isAbsolute()) {
nList->n_type = scope \| N_ABS;		nList->n_type = scope \| N_ABS;
nList->n_sect = NO_SECT;		nList->n_sect = NO_SECT;
nList->n_value = defined->value;		nList->n_value = defined->value;
} else {		} else {
nList->n_type = scope \| N_SECT;		nList->n_type = scope \| N_SECT;
nList->n_sect = defined->isec->parent->index;		nList->n_sect = defined->isec->canonical()->parent->index;
// For the N_SECT symbol type, n_value is the address of the symbol		// For the N_SECT symbol type, n_value is the address of the symbol
nList->n_value = defined->getVA();		nList->n_value = defined->getVA();
}		}
nList->n_desc \|= defined->thumb ? N_ARM_THUMB_DEF : 0;		nList->n_desc \|= defined->thumb ? N_ARM_THUMB_DEF : 0;
nList->n_desc \|= defined->isExternalWeakDef() ? N_WEAK_DEF : 0;		nList->n_desc \|= defined->isExternalWeakDef() ? N_WEAK_DEF : 0;
nList->n_desc \|=		nList->n_desc \|=
defined->referencedDynamically ? REFERENCED_DYNAMICALLY : 0;		defined->referencedDynamically ? REFERENCED_DYNAMICALLY : 0;
} else if (auto *dysym = dyn_cast<DylibSymbol>(entry.sym)) {		} else if (auto *dysym = dyn_cast<DylibSymbol>(entry.sym)) {
▲ Show 20 Lines • Show All 338 Lines • ▼ Show 20 Lines
// our merged-literals section a different name.		// our merged-literals section a different name.
WordLiteralSection::WordLiteralSection()		WordLiteralSection::WordLiteralSection()
: SyntheticSection(segment_names::text, section_names::literals) {		: SyntheticSection(segment_names::text, section_names::literals) {
align = 16;		align = 16;
}		}

void WordLiteralSection::addInput(WordLiteralInputSection *isec) {		void WordLiteralSection::addInput(WordLiteralInputSection *isec) {
isec->parent = this;		isec->parent = this;
		inputs.push_back(isec);
		}

		void WordLiteralSection::finalizeContents() {
		for (WordLiteralInputSection *isec : inputs) {
// We do all processing of the InputSection here, so it will be effectively		// We do all processing of the InputSection here, so it will be effectively
// finalized.		// finalized.
isec->isFinal = true;		isec->isFinal = true;
const uint8_t *buf = isec->data.data();		const uint8_t *buf = isec->data.data();
switch (sectionType(isec->flags)) {		switch (sectionType(isec->flags)) {
case S_4BYTE_LITERALS: {		case S_4BYTE_LITERALS: {
for (size_t off = 0, e = isec->data.size(); off < e; off += 4) {		for (size_t off = 0, e = isec->data.size(); off < e; off += 4) {
if (!isec->isLive(off))		if (!isec->isLive(off))
continue;		continue;
uint32_t value = reinterpret_cast<const uint32_t >(buf + off);		uint32_t value = reinterpret_cast<const uint32_t >(buf + off);
literal4Map.emplace(value, literal4Map.size());		literal4Map.emplace(value, literal4Map.size());
}		}
break;		break;
}		}
case S_8BYTE_LITERALS: {		case S_8BYTE_LITERALS: {
for (size_t off = 0, e = isec->data.size(); off < e; off += 8) {		for (size_t off = 0, e = isec->data.size(); off < e; off += 8) {
if (!isec->isLive(off))		if (!isec->isLive(off))
continue;		continue;
uint64_t value = reinterpret_cast<const uint64_t >(buf + off);		uint64_t value = reinterpret_cast<const uint64_t >(buf + off);
literal8Map.emplace(value, literal8Map.size());		literal8Map.emplace(value, literal8Map.size());
}		}
break;		break;
}		}
case S_16BYTE_LITERALS: {		case S_16BYTE_LITERALS: {
for (size_t off = 0, e = isec->data.size(); off < e; off += 16) {		for (size_t off = 0, e = isec->data.size(); off < e; off += 16) {
if (!isec->isLive(off))		if (!isec->isLive(off))
continue;		continue;
UInt128 value = reinterpret_cast<const UInt128 >(buf + off);		UInt128 value = reinterpret_cast<const UInt128 >(buf + off);
literal16Map.emplace(value, literal16Map.size());		literal16Map.emplace(value, literal16Map.size());
}		}
break;		break;
}		}
default:		default:
llvm_unreachable("invalid literal section type");		llvm_unreachable("invalid literal section type");
}		}
}		}
		}

void WordLiteralSection::writeTo(uint8_t *buf) const {		void WordLiteralSection::writeTo(uint8_t *buf) const {
// Note that we don't attempt to do any endianness conversion in addInput(),		// Note that we don't attempt to do any endianness conversion in addInput(),
// so we don't do it here either -- just write out the original value,		// so we don't do it here either -- just write out the original value,
// byte-for-byte.		// byte-for-byte.
for (const auto &p : literal16Map)		for (const auto &p : literal16Map)
memcpy(buf + p.second * 16, &p.first, 16);		memcpy(buf + p.second * 16, &p.first, 16);
buf += literal16Map.size() * 16;		buf += literal16Map.size() * 16;
▲ Show 20 Lines • Show All 63 Lines • Show Last 20 Lines

lld/MachO/UnwindInfoSection.h

Show All 21 Lines	template <class Ptr> struct CompactUnwindEntry {
uint32_t functionLength;		uint32_t functionLength;
compact_unwind_encoding_t encoding;		compact_unwind_encoding_t encoding;
Ptr personality;		Ptr personality;
Ptr lsda;		Ptr lsda;
};		};

class UnwindInfoSection : public SyntheticSection {		class UnwindInfoSection : public SyntheticSection {
public:		public:
bool isNeeded() const override { return compactUnwindSection != nullptr; }		bool isNeeded() const override {
		return !compactUnwindSection->inputs.empty();
		}
uint64_t getSize() const override { return unwindInfoSize; }		uint64_t getSize() const override { return unwindInfoSize; }
virtual void prepareRelocations(ConcatInputSection *) = 0;		virtual void addInput(ConcatInputSection *) = 0;
		std::vector<ConcatInputSection *> getInputs() {
void setCompactUnwindSection(ConcatOutputSection *cuSection) {		return compactUnwindSection->inputs;
compactUnwindSection = cuSection;
}		}
		void prepareRelocations();

protected:		protected:
UnwindInfoSection()		UnwindInfoSection();
: SyntheticSection(segment_names::text, section_names::unwindInfo) {		virtual void prepareRelocations(ConcatInputSection *) = 0;
align = 4;
}

ConcatOutputSection *compactUnwindSection = nullptr;		ConcatOutputSection *compactUnwindSection;
uint64_t unwindInfoSize = 0;		uint64_t unwindInfoSize = 0;
};		};

UnwindInfoSection *makeUnwindInfoSection();		UnwindInfoSection *makeUnwindInfoSection();

} // namespace macho		} // namespace macho
} // namespace lld		} // namespace lld

#endif		#endif

lld/MachO/UnwindInfoSection.cpp

Show First 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	struct SecondLevelPage {
uint32_t kind;		uint32_t kind;
size_t entryIndex;		size_t entryIndex;
size_t entryCount;		size_t entryCount;
size_t byteCount;		size_t byteCount;
std::vector<compact_unwind_encoding_t> localEncodings;		std::vector<compact_unwind_encoding_t> localEncodings;
EncodingMap localEncodingIndexes;		EncodingMap localEncodingIndexes;
};		};

template <class Ptr> class UnwindInfoSectionImpl : public UnwindInfoSection {		template <class Ptr>
		class UnwindInfoSectionImpl final : public UnwindInfoSection {
public:		public:
void prepareRelocations(ConcatInputSection *) override;		void prepareRelocations(ConcatInputSection *) override;
		void addInput(ConcatInputSection *) override;
void finalize() override;		void finalize() override;
void writeTo(uint8_t *buf) const override;		void writeTo(uint8_t *buf) const override;

private:		private:
std::vector<std::pair<compact_unwind_encoding_t, size_t>> commonEncodings;		std::vector<std::pair<compact_unwind_encoding_t, size_t>> commonEncodings;
EncodingMap commonEncodingIndexes;		EncodingMap commonEncodingIndexes;
// Indices of personality functions within the GOT.		// Indices of personality functions within the GOT.
std::vector<uint32_t> personalities;		std::vector<uint32_t> personalities;
SmallDenseMap<std::pair<InputSection , uint64_t / addend />, Symbol >		SmallDenseMap<std::pair<InputSection , uint64_t / addend />, Symbol >
personalityTable;		personalityTable;
std::vector<unwind_info_section_header_lsda_index_entry> lsdaEntries;		std::vector<unwind_info_section_header_lsda_index_entry> lsdaEntries;
// Map of function offset (from the image base) to an index within the LSDA		// Map of function offset (from the image base) to an index within the LSDA
// array.		// array.
llvm::DenseMap<uint32_t, uint32_t> functionToLsdaIndex;		llvm::DenseMap<uint32_t, uint32_t> functionToLsdaIndex;
std::vector<CompactUnwindEntry<Ptr>> cuVector;		std::vector<CompactUnwindEntry<Ptr>> cuVector;
std::vector<CompactUnwindEntry<Ptr> *> cuPtrVector;		std::vector<CompactUnwindEntry<Ptr> *> cuPtrVector;
std::vector<SecondLevelPage> secondLevelPages;		std::vector<SecondLevelPage> secondLevelPages;
uint64_t level2PagesOffset = 0;		uint64_t level2PagesOffset = 0;
};		};

		UnwindInfoSection::UnwindInfoSection()
		: SyntheticSection(segment_names::text, section_names::unwindInfo) {
		align = 4;
		oontvooUnsubmitted Not Done Reply Inline Actions why "4"? is this not platform dependent? oontvoo: why "4"? is this not platform dependent?
		int3AuthorUnsubmitted Done Reply Inline Actions this is existing code, I think @gkm wrote this int3: this is existing code, I think @gkm wrote this
		compactUnwindSection =
		make<ConcatOutputSection>(section_names::compactUnwind);
		}

		void UnwindInfoSection::prepareRelocations() {
		for (ConcatInputSection *isec : compactUnwindSection->inputs)
		prepareRelocations(isec);
		}

		template <class Ptr>
		void UnwindInfoSectionImpl<Ptr>::addInput(ConcatInputSection *isec) {
		assert(isec->segname == segment_names::ld &&
		isec->name == section_names::compactUnwind);
		compactUnwindSection->addInput(isec);
		}

// Compact unwind relocations have different semantics, so we handle them in a		// Compact unwind relocations have different semantics, so we handle them in a
// separate code path from regular relocations. First, we do not wish to add		// separate code path from regular relocations. First, we do not wish to add
// rebase opcodes for __LD,__compact_unwind, because that section doesn't		// rebase opcodes for __LD,__compact_unwind, because that section doesn't
// actually end up in the final binary. Second, personality pointers always		// actually end up in the final binary. Second, personality pointers always
// reside in the GOT and must be treated specially.		// reside in the GOT and must be treated specially.
template <class Ptr>		template <class Ptr>
void UnwindInfoSectionImpl<Ptr>::prepareRelocations(ConcatInputSection *isec) {		void UnwindInfoSectionImpl<Ptr>::prepareRelocations(ConcatInputSection *isec) {
assert(isec->segname == segment_names::ld &&
isec->name == section_names::compactUnwind);
assert(!isec->shouldOmitFromOutput() &&		assert(!isec->shouldOmitFromOutput() &&
"__compact_unwind section should not be omitted");		"__compact_unwind section should not be omitted");

// FIXME: Make this skip relocations for CompactUnwindEntries that		// FIXME: Make this skip relocations for CompactUnwindEntries that
// point to dead-stripped functions. That might save some amount of		// point to dead-stripped functions. That might save some amount of
// work. But since there are usually just few personality functions		// work. But since there are usually just few personality functions
// that are referenced from many places, at least some of them likely		// that are referenced from many places, at least some of them likely
// live, it wouldn't reduce number of got entries.		// live, it wouldn't reduce number of got entries.
for (size_t i = 0; i < isec->relocs.size(); ++i) {		for (size_t i = 0; i < isec->relocs.size(); ++i) {
Reloc &r = isec->relocs[i];		Reloc &r = isec->relocs[i];
assert(target->hasAttr(r.type, RelocAttrBits::UNSIGNED));		assert(target->hasAttr(r.type, RelocAttrBits::UNSIGNED));
if (r.offset % sizeof(CompactUnwindEntry<Ptr>) !=		if (r.offset % sizeof(CompactUnwindEntry<Ptr>) !=
offsetof(CompactUnwindEntry<Ptr>, personality))		offsetof(CompactUnwindEntry<Ptr>, personality))
continue;		continue;

Reloc &rFunc = isec->relocs[++i];
assert(r.offset ==
rFunc.offset + offsetof(CompactUnwindEntry<Ptr>, personality));
auto *referentIsec =
cast<ConcatInputSection>(rFunc.referent.get<InputSection *>());
referentIsec->hasPersonality = true;

if (auto s = r.referent.dyn_cast<Symbol >()) {		if (auto s = r.referent.dyn_cast<Symbol >()) {
if (auto *undefined = dyn_cast<Undefined>(s)) {		if (auto *undefined = dyn_cast<Undefined>(s)) {
treatUndefinedSymbol(*undefined);		treatUndefinedSymbol(*undefined);
// treatUndefinedSymbol() can replace s with a DylibSymbol; re-check.		// treatUndefinedSymbol() can replace s with a DylibSymbol; re-check.
if (isa<Undefined>(s))		if (isa<Undefined>(s))
continue;		continue;
}		}
if (auto *defined = dyn_cast<Defined>(s)) {		if (auto *defined = dyn_cast<Defined>(s)) {
▲ Show 20 Lines • Show All 458 Lines • Show Last 20 Lines

lld/MachO/Writer.cpp

//===- Writer.cpp ---------------------------------------------------------===//

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

//===----------------------------------------------------------------------===//

#include "Writer.h"

#include "ConcatOutputSection.h"

#include "Config.h"

#include "ICF.h"

#include "InputFiles.h"

#include "InputSection.h"

#include "MapFile.h"

#include "OutputSection.h"

#include "OutputSegment.h"

#include "SymbolTable.h"

#include "Symbols.h"

#include "SyntheticSections.h"

Show All 26 Lines

class Writer {

public:

Writer() : buffer(errorHandler().outputBuffer) {}

void scanRelocations();

void scanSymbols();

template <class LP> void createOutputSections();

template <class LP> void createLoadCommands();

void foldIdenticalLiterals();

void foldIdenticalSections();

void finalizeAddresses();

void finalizeLinkEditSegment();

void assignAddresses(OutputSegment *);

void openFile();

void writeSections();

void writeUuid();

void writeCodeSignature();

▲ Show 20 Lines • Show All 522 Lines • ▼ Show 20 Lines

if (relocAttrs.hasAttr(RelocAttrBits::BRANCH)) {

// need of rebase opcodes.

if (!(isThreadLocalVariables(isec->flags) && isa<Defined>(sym)))

addNonLazyBindingEntries(sym, isec, r.offset, r.addend);

}

void Writer::scanRelocations() {

TimeTraceScope timeScope("Scan relocations");

for (InputSection *isec : inputSections) {

for (ConcatInputSection *isec : inputSections) {

if (!isa<ConcatInputSection>(isec))

if (isec->shouldOmitFromOutput())

continue;

auto concatIsec = cast<ConcatInputSection>(isec);

if (concatIsec->shouldOmitFromOutput())

continue;

if (concatIsec->segname == segment_names::ld) {

in.unwindInfo->prepareRelocations(concatIsec);

continue;

}

for (auto it = isec->relocs.begin(); it != isec->relocs.end(); ++it) {

Reloc &r = *it;

if (target->hasAttr(r.type, RelocAttrBits::SUBTRAHEND)) {

// Skip over the following UNSIGNED relocation -- it's just there as the

// minuend, and doesn't have the usual UNSIGNED semantics. We don't want

// to emit rebase opcodes for it.

it++;

continue;

}

if (auto *sym = r.referent.dyn_cast<Symbol *>()) {

if (auto *undefined = dyn_cast<Undefined>(sym))

treatUndefinedSymbol(*undefined);

// treatUndefinedSymbol() can replace sym with a DylibSymbol; re-check.

if (!isa<Undefined>(sym) && validateSymbolRelocation(sym, isec, r))

prepareSymbolRelocation(sym, isec, r);

} else {

assert(r.referent.is<InputSection *>());

// Canonicalize the referent so that later accesses in Writer won't

// have to worry about it. Perhaps we should do this for Defined::isec

// too...

auto *referentIsec = r.referent.get<InputSection *>();

oontvooUnsubmitted

Done

// too...

- auto referentIsec = r.referent.get<InputSection *>();

+ auto *referentIsec = r.referent.get<InputSection *>();

r.referent = referentIsec->canonical();

oontvoo:

r.referent = referentIsec->canonical();

if (!r.pcrel)

in.rebase->addEntry(isec, r.offset);

}

in.unwindInfo->prepareRelocations();

}

void Writer::scanSymbols() {

TimeTraceScope timeScope("Scan symbols");

for (const Symbol *sym : symtab->getSymbols()) {

if (const auto *defined = dyn_cast<Defined>(sym)) {

if (defined->overridesWeakDef && defined->isLive())

in.weakBinding->addNonWeakDefinition(defined);

▲ Show 20 Lines • Show All 249 Lines • ▼ Show 20 Lines

template <class LP> void Writer::createOutputSections() {

case MH_DYLIB:

case MH_BUNDLE:

break;

default:

llvm_unreachable("unhandled output file type");

}

// Then add input sections to output sections.

for (const auto &p : enumerate(inputSections)) {

for (ConcatInputSection *isec : inputSections) {

InputSection *isec = p.value();

if (isec->shouldOmitFromOutput())

OutputSection *osec;

if (auto *concatIsec = dyn_cast<ConcatInputSection>(isec)) {

if (concatIsec->shouldOmitFromOutput())

continue;

NamePair names = maybeRenameSection({isec->segname, isec->name});

ConcatOutputSection *&concatOsec = concatOutputSections[names];

ConcatOutputSection *&osec = concatOutputSections[names];

if (concatOsec == nullptr)

if (!osec)

oontvooUnsubmitted

Done

ConcatOutputSection *&osec = concatOutputSections[names];

- if (osec == nullptr)

+ if (!osec)

osec = make<ConcatOutputSection>(names.second);

nit: just for consistency

oontvoo: nit: just for consistency

concatOsec = make<ConcatOutputSection>(names.second);

osec = make<ConcatOutputSection>(names.second);

concatOsec->addInput(concatIsec);

osec->addInput(isec);

osec = concatOsec;

osec->inputOrder =

} else if (auto *cStringIsec = dyn_cast<CStringInputSection>(isec)) {

std::min(osec->inputOrder, static_cast<int>(isec->outSecOff));

in.cStringSection->addInput(cStringIsec);

osec = in.cStringSection;

} else if (auto *litIsec = dyn_cast<WordLiteralInputSection>(isec)) {

in.wordLiteralSection->addInput(litIsec);

osec = in.wordLiteralSection;

} else {

llvm_unreachable("unhandled InputSection type");

}

osec->inputOrder = std::min(osec->inputOrder, static_cast<int>(p.index()));

}

// Once all the inputs are added, we can finalize the output section

// properties and create the corresponding output segments.

for (const auto &it : concatOutputSections) {

StringRef segname = it.first.first;

ConcatOutputSection *osec = it.second;

if (segname == segment_names::ld) {

assert(segname != segment_names::ld);

assert(osec->name == section_names::compactUnwind);

in.unwindInfo->setCompactUnwindSection(osec);

} else {

getOrCreateOutputSegment(segname)->addOutputSection(osec);

}

for (SyntheticSection *ssec : syntheticSections) {

auto it = concatOutputSections.find({ssec->segname, ssec->name});

if (ssec->isNeeded()) {

if (it == concatOutputSections.end()) {

getOrCreateOutputSegment(ssec->segname)->addOutputSection(ssec);

} else {

fatal("section from " + toString(it->second->firstSection()->file) +

" conflicts with synthetic section " + ssec->segname + "," +

ssec->name);

}

// dyld requires __LINKEDIT segment to always exist (even if empty).

linkEditSegment = getOrCreateOutputSegment(segment_names::linkEdit);

}

void Writer::foldIdenticalLiterals() {

if (in.cStringSection)

in.cStringSection->finalizeContents();

// TODO: WordLiteralSection & CFStringSection should be finalized here too

}

void Writer::foldIdenticalSections() {

if (config->icfLevel == ICFLevel::none)

return;

ConcatOutputSection *textOutputSection = concatOutputSections.lookup(

maybeRenameSection({segment_names::text, section_names::text}));

if (textOutputSection == nullptr)

return;

TimeTraceScope timeScope("Fold Identical Code Sections");

// The ICF equivalence-class segregation algorithm relies on pre-computed

// hashes of InputSection::data for the ConcatOutputSection::inputs and all

// sections referenced by their relocs. We could recursively traverse the

// relocs to find every referenced InputSection, but that precludes easy

// parallelization. Therefore, we hash every InputSection here where we have

// them all accessible as a simple vector.

std::vector<ConcatInputSection *> hashable;

// If an InputSection is ineligible for ICF, we give it a unique ID to force

// it into an unfoldable singleton equivalence class. Begin the unique-ID

// space at inputSections.size(), so that it will never intersect with

// equivalence-class IDs which begin at 0. Since hashes & unique IDs never

// coexist with equivalence-class IDs, this is not necessary, but might help

// someone keep the numbers straight in case we ever need to debug the

// ICF::segregate()

uint64_t icfUniqueID = inputSections.size();

for (InputSection *isec : inputSections) {

if (auto *concatIsec = dyn_cast<ConcatInputSection>(isec)) {

if (concatIsec->isHashableForICF(isec->parent == textOutputSection))

hashable.push_back(concatIsec);

else

concatIsec->icfEqClass[0] = ++icfUniqueID;

}

// FIXME: hash literal sections here too?

parallelForEach(hashable,

[](ConcatInputSection *isec) { isec->hashForICF(); });

// Now that every input section is either hashed or marked as unique,

// run the segregation algorithm to detect foldable subsections

ICF(textOutputSection->inputs).run();

size_t oldSize = textOutputSection->inputs.size();

textOutputSection->eraseOmittedInputSections();

size_t newSize = textOutputSection->inputs.size();

log("ICF kept " + Twine(newSize) + " removed " + Twine(oldSize - newSize) +

" of " + Twine(oldSize));

}

void Writer::finalizeAddresses() {

TimeTraceScope timeScope("Finalize addresses");

uint64_t pageSize = target->getPageSize();

// Ensure that segments (and the sections they contain) are allocated

// addresses in ascending order, which dyld requires.

// Note that at this point, __LINKEDIT sections are empty, but we need to

// determine addresses of other segments/sections before generating its

▲ Show 20 Lines • Show All 115 Lines • ▼ Show 20 Lines

template <class LP> void Writer::run() {

if (config->entry && !isa<Undefined>(config->entry))

prepareBranchTarget(config->entry);

scanRelocations();

if (in.stubHelper->isNeeded())

in.stubHelper->setup();

scanSymbols();

createOutputSections<LP>();

// ICF assumes that all literals have been folded already, so we must run

// foldIdenticalLiterals before foldIdenticalSections.

foldIdenticalLiterals();

foldIdenticalSections();

// After this point, we create no new segments; HOWEVER, we might

// yet create branch-range extension thunks for architectures whose

// hardware call instructions have limited range, e.g., ARM(64).

// The thunks are created as InputSections interspersed among

// the ordinary __TEXT,_text InputSections.

sortSegmentsAndSections();

createLoadCommands<LP>();

finalizeAddresses();

▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[lld-macho] Move ICF earlier to avoid emitting redundant bindsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 356067

lld/MachO/ConcatOutputSection.h

lld/MachO/ConcatOutputSection.cpp

lld/MachO/Driver.cpp

lld/MachO/ICF.h

lld/MachO/ICF.cpp

lld/MachO/InputSection.h

lld/MachO/InputSection.cpp

lld/MachO/MarkLive.cpp

lld/MachO/SyntheticSections.h

lld/MachO/SyntheticSections.cpp

lld/MachO/UnwindInfoSection.h

lld/MachO/UnwindInfoSection.cpp

lld/MachO/Writer.cpp

[lld-macho] Move ICF earlier to avoid emitting redundant binds
ClosedPublic