This is an archive of the discontinued LLVM Phabricator instance.

Differential D66717

[ELF] Do not ICF two sections with different output sections (by SECTIONS commands)
Closed · Public

Authored by MaskRay on Aug 25 2019, 9:13 AM.

Details

Reviewers

grimar
pcc
peter.smith
ruiu
• espindola

Commits

rGd8bc6a48eaa9: [ELF] Do not ICF two sections with different output sections (by SECTIONS…
rLLD370635: [ELF] Do not ICF two sections with different output sections (by SECTIONS…
rL370635: [ELF] Do not ICF two sections with different output sections (by SECTIONS…

Summary

Fixes PR39418. Complements D47241 (the non-linker-script case).

processSectionCommands() assigns input sections to output sections.
ICF is called before it, so .text.foo and .text.bar may be folded even if
their output sections are made different by SECTIONS commands.

markLive<ELFT>()
doIcf<ELFT>()                      // During ICF, we don't know the output sections
writeResult()
  combineEhSections<ELFT>()
  script->processSectionCommands() // InputSection -> OutputSection assignment

This patch splits processSectionCommands() into processSectionCommands() and
processSymbolAssignments(), and moves processSectionCommands() before ICF:

markLive<ELFT>()
combineEhSections<ELFT>()
script->processSectionCommands()
doIcf<ELFT>()                      // should remove folded input sections
writeResult()
  script->processSymbolAssignments()
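
The effect of running output-section assignment before ICF can be sketched with a toy model. This is only an illustration under stated assumptions, not LLD's implementation: `ToySection`, `ToyRule`, `assignOutputSections`, and `foldIdentical` are hypothetical names, and exact name matching stands in for real SECTIONS pattern matching.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for an input section; not LLD's InputSection.
struct ToySection {
  std::string name;   // e.g. ".text.foo"
  std::string data;   // section contents
  std::string output; // assigned output section name, empty until assigned
  bool live;          // false once folded into another section
};

// Hypothetical stand-in for one SECTIONS rule: input name -> output name.
struct ToyRule {
  std::string inputName;
  std::string outputName;
};

// Step 1 (the role of processSectionCommands() in the summary):
// assign each input section to the first matching output section.
inline void assignOutputSections(std::vector<ToySection> &secs,
                                 const std::vector<ToyRule> &rules) {
  for (ToySection &s : secs)
    for (const ToyRule &r : rules)
      if (s.output.empty() && s.name == r.inputName)
        s.output = r.outputName;
}

// Step 2 (the role of doIcf() in the summary): fold sections with identical
// contents, but only if they were assigned to the same output section.
// Returns the number of sections folded away.
inline std::size_t foldIdentical(std::vector<ToySection> &secs) {
  std::size_t folded = 0;
  for (std::size_t i = 0; i < secs.size(); ++i) {
    if (!secs[i].live)
      continue;
    for (std::size_t j = i + 1; j < secs.size(); ++j) {
      if (secs[j].live && secs[j].data == secs[i].data &&
          secs[j].output == secs[i].output) {
        secs[j].live = false;
        ++folded;
      }
    }
  }
  return folded;
}
```

With rules that send `.text.foo` and `.text.bar` to different output sections, `foldIdentical` leaves both alive; with rules that send both to `.text`, one of them is folded.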

An alternative approach is to unfold a section sec in
processSectionCommands() when we find that sec and sec->repl belong to
different output sections. I feel this patch is superior because it
can fold more sections and the decoupling of
SectionCommand/SymbolAssignment gives flexibility:

- An ExprValue can't be evaluated before its section is assigned to an output section, so we can delete getOutputSectionVA and simplify another place where we had to check whether the output section is null. Moreover, a case in linkerscript/early-assign-symbol.s can now be handled.
- processSectionCommands/processSymbolAssignments can be freely moved around.

Diff Detail

Repository: rL LLVM

Event Timeline

MaskRay created this revision. Aug 25 2019, 9:13 AM

Herald added a reviewer: • espindola. Aug 25 2019, 9:13 AM

Herald added a project: Restricted Project.

Herald added subscribers: llvm-commits, kristof.beyls, arichardson and 2 others.

Harbormaster completed remote builds in B37245: Diff 217048. Aug 25 2019, 9:13 AM

MaskRay mentioned this in D66719: [ELF] Do not ICF two sections with different output sections when there is a linker script. Aug 25 2019, 9:54 AM

Add people subscribed to a previous approach, D54422

Delete a stale comment
Fix deletion of dead input section members
Improve linkerscript/icf-different-output-sections.s to catch the case

Harbormaster completed remote builds in B37252: Diff 217069. Aug 25 2019, 7:57 PM

Fix the call tree in the description

Simplify processSymbolAssignments

Harbormaster completed remote builds in B37255: Diff 217072. Aug 25 2019, 9:07 PM

MaskRay mentioned this in D66727: [ELF] EhFrameSection: postpone FDE liveness check to finalizeSections. Aug 25 2019, 10:43 PM

I realized that filtering EhInputSection and .ARM.exidx* is tricky.
Created D66727 as a prerequisite. This should make the overall diff smaller.

Harbormaster completed remote builds in B37258: Diff 217081. Aug 25 2019, 11:14 PM

Delete a stale .ARM.exidx* check.

I think the new processSymbolAssignments() gives us a simple workaround for https://bugs.llvm.org/show_bug.cgi?id=42506

% ld.lld -shared a.o -T a.x -Bsymbolic
ld.lld: error: can't create dynamic relocation R_AARCH64_ABS32 against symbol: __efistub_stext_offset in readonly segment; recompile object files with -fPIC or pass '-Wl,-z,notext' to allow text relocations in the output
>>> defined in a.x:1
>>> referenced by a.o:(.text+0x4)

If we call script->processSymbolAssignments() the second time, __efistub_stext_offset will be changed to an absolute value.

   // Change values of linker-script-defined symbols from placeholders (assigned
   // by declareSymbols) to actual definitions.
   script->processSymbolAssignments();
+  script->processSymbolAssignments();

% ld.lld -shared a.o -T a.x -Bsymbolic # works

If we say D66279 solves the value convergence problem, then adding another processSymbolAssignments() can be said to solve the type (absolute or non-absolute) convergence problem.

Harbormaster completed remote builds in B37259: Diff 217083. Aug 26 2019, 12:30 AM

Given the direction linker script support has been taking in LLD lately, this approach looks generally good to me.

test/ELF/linkerscript/early-assign-symbol.s
11 ↗	(On Diff #217083)	Since with this patch we can handle the case above, this comment needs either to be moved/removed and/or updated.
test/ELF/linkerscript/icf-different-output-sections.s
45 ↗	(On Diff #217083)	Seems you only need to have 2 .rodata sections for this test? I'd suggest testing/demonstrating the output from the 2 scripts here, probably:
(1) .rodata.foo : { *(.rodata.foo) } .rodata.bar : { *(.rodata.bar) }
(2) .rodata : { *(.rodata.*) }
I.e. the idea is to show in the test that we can put sections in a single output section or in multiple output sections, and that in case (2) the input sections will be folded and in case (1) they will not.

I like this new approach because it seems more modular than before.

ELF/ICF.cpp
503 ↗	(On Diff #217083)	This comment is perhaps a bit too terse. Can you explain in the comment why we are doing this?
ELF/Writer.cpp
155 ↗	(On Diff #217083)	Maybe this and `createSyntheticSections` should be moved to a new file if they are no longer a part of the writer?

MaskRay mentioned this in rL369890: [ELF] EhFrameSection: postpone FDE liveness check to finalizeSections. Aug 26 2019, 3:33 AM

MaskRay mentioned this in rG1681ceb2c422: [ELF] EhFrameSection: postpone FDE liveness check to finalizeSections.

MaskRay marked an inline comment as done. Aug 26 2019, 3:35 AM

MaskRay added inline comments.

ELF/Writer.cpp
155 ↗	(On Diff #217083)	Do you have a recommendation for what the new file should be called? While this is under review, I am experimenting with other reorderings this change will enable (e.g. moving addOrphanSections from Writer.cpp to between processSectionCommands/ICF). The definition of "Writer" is indeed unclear to me now...

Delete a redundant dyn_cast<OutputSection>
Add more comments to ICF<ELFT>::run()

Harbormaster completed remote builds in B37276: Diff 217113. Aug 26 2019, 4:32 AM

I agree that this is going in the right direction. I'm particularly happy about splitting out the symbol assignments.

ELF/LinkerScript.cpp
53 ↗	(On Diff #217113)	If we end up calling this with sec->getOutputSection() == nullptr we'll get a segfault that could be hard to track down. It may be worth preserving getOutputSectionVA with an assert non nullptr. Alternatively if we can guarantee that getOutputSection() is non nullptr then we could put an assert into getOutputSection().
ELF/Writer.cpp
155 ↗	(On Diff #217083)	I've not got a good answer right now, will try and have a think. May be worth going through the overall control flow and seeing what comes out. My high-level view of a linker control flow is:
- Load all content from files into an address independent representation.
- Do address independent transformations such as GC and ICF.
- Create synthetic InputSections.
- Layout the output file (InputSections -> OutputSections), although not necessarily assign addresses yet.
- Populate synthetic content such as the .plt and .got. It helps to have removed as much as possible prior to this point.
- Assign addresses.
- Perform address dependent transformations.
- Finalize addresses (we now have a logical final ELF file).
- Write ELF file.
At each stage we are adding information, permitting more transformations, but also restricting our freedom, as making changes invalidates the information. That is a control flow view of the linker, rather than a structural view of the components though. I think our Writer.cpp combines everything from creating synthetic InputSections down. If I were hazarding a guess at a new name for the moved functions I'd recommend something like layout.cpp. In any case it may be worth taking a step back to see what the options are.
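
The staged control flow described in the comment above can be sketched as an ordered enumeration. This is a hedged illustration only: the `Stage` labels paraphrase the reviewer's list and `infoAvailable` is a hypothetical helper; neither is an LLD identifier.

```cpp
#include <cassert>

// Hypothetical labels paraphrasing the stages listed in the review comment.
enum class Stage {
  LoadInputs,
  AddressIndependentTransforms, // e.g. GC and ICF
  CreateSyntheticSections,
  Layout,                       // InputSections -> OutputSections
  PopulateSyntheticContent,     // e.g. .plt and .got
  AssignAddresses,
  AddressDependentTransforms,
  FinalizeAddresses,
  WriteFile
};

// Information produced by stage `needs` is available only to stages that
// run strictly after it.
constexpr bool infoAvailable(Stage current, Stage needs) {
  return static_cast<int>(current) > static_cast<int>(needs);
}
```

Under this ordering, `infoAvailable(Stage::AddressIndependentTransforms, Stage::Layout)` is false: ICF would not yet know the output sections, which is the situation this patch addresses by running processSectionCommands() before ICF.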

Restore static uint64_t getOutputSectionVA(SectionBase *sec) { with an assert
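
The "assert plus fallback" pattern discussed here can be sketched in isolation. The types below are simplified, hypothetical stand-ins (`ToyOutputSection`, `ToySectionBase`, `toyOutputSectionVA`), not LLD's classes; the function body mirrors the shape of the restored helper in the diff.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical simplified types; LLD's real classes carry far more state.
struct ToyOutputSection {
  std::uint64_t addr;
};

struct ToySectionBase {
  ToyOutputSection *parent; // null when no output section has been assigned
  ToyOutputSection *getOutputSection() const { return parent; }
};

// In builds with assertions enabled, a missing output section is reported at
// the point of use. In builds with NDEBUG, the function falls back to 0
// instead of dereferencing a null pointer.
inline std::uint64_t toyOutputSectionVA(const ToySectionBase *sec) {
  ToyOutputSection *os = sec->getOutputSection();
  assert(os && "input section has no output section assigned");
  return os ? os->addr : 0;
}
```

The assert documents the invariant for developers, while the conditional keeps release builds from crashing in the cases where the invariant cannot yet be guaranteed.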

MaskRay added inline comments. Aug 27 2019, 7:40 AM

ELF/LinkerScript.cpp
53 ↗	(On Diff #217113)	We cannot guarantee getOutputSection() returns non nullptr. merge-gc-piece2.s and gc-sections-non-alloc-to-merge.s are tests where a MergeInputSection may have a null InputSection parent. I tried adding more asserts to SectionBase::getOutputSection but that seems hard. We probably should do some other layout changes before enforcing nonnullness in getOutputSection. Restored with an assert.
test/ELF/linkerscript/icf-different-output-sections.s
45 ↗	(On Diff #217083)	I think .rodata* are redundant. I will delete them. Will add another script for your point (1), and add another script for orphans.

Harbormaster completed remote builds in B37362: Diff 217403. Aug 27 2019, 7:40 AM

Shall we move forward with this change? 🤡

(BTW, Peter's comment at https://reviews.llvm.org/D66717#1646703 is great.)

I have no more comments/objections, I think it is good to go.

I'm happy to move forward with the change as well.

LGTM

I'm happy about this change too.

This revision is now accepted and ready to land. Sep 2 2019, 3:12 AM

Closed by commit rL370635: [ELF] Do not ICF two sections with different output sections (by SECTIONS… (authored by MaskRay). Sep 2 2019, 3:35 AM

This revision was automatically updated to reflect the committed changes.

MaskRay mentioned this in rL371216: Reland D66717 [ELF] Do not ICF two sections with different output sections (by…. Sep 6 2019, 8:57 AM

MaskRay mentioned this in rG8d30c1dcec2a: Reland D66717 [ELF] Do not ICF two sections with different output sections (by…. Sep 6 2019, 9:00 AM

MaskRay mentioned this in D76410: [ELF] Don't combine SHF_LINK_ORDER sections linking different output sections. Mar 18 2020, 10:53 PM

MaskRay mentioned this in rG9eb1f2d0ace3: [ELF] Remove a special case from ExprValue::getSectionOffset. NFC. Sep 15 2023, 5:46 PM

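Revision Contents

Path	Size
lld/trunk/ELF/Driver.cpp	20 lines
lld/trunk/ELF/ICF.cpp	17 lines
lld/trunk/ELF/LinkerScript.h	1 line
lld/trunk/ELF/LinkerScript.cpp	82 lines
lld/trunk/ELF/Writer.h	2 lines
lld/trunk/ELF/Writer.cpp	36 lines
lld/trunk/test/ELF/linkerscript/early-assign-symbol.s	11 lines
lld/trunk/test/ELF/linkerscript/icf-output-sections.s	46 lines
lld/trunk/test/ELF/linkerscript/subalign.s	12 lines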

Diff 218326

lld/trunk/ELF/Driver.cpp

[1,895 lines not shown] template <class ELFT> void LinkerDriver::link(opt::InputArgList &args) {
   replaceCommonSymbols();

   // Do size optimizations: garbage collection, merging of SHF_MERGE sections
   // and identical code folding.
   splitSections<ELFT>();
   markLive<ELFT>();
   demoteSharedSymbols();
   mergeSections();
+
+  // Make copies of any input sections that need to be copied into each
+  // partition.
+  copySectionsIntoPartitions();
+
+  // Create synthesized sections such as .got and .plt. This is called before
+  // processSectionCommands() so that they can be placed by SECTIONS commands.
+  createSyntheticSections<ELFT>();
+
+  // Some input sections that are used for exception handling need to be moved
+  // into synthetic sections. Do that now so that they aren't assigned to
+  // output sections in the usual way.
+  if (!config->relocatable)
+    combineEhSections();
+
+  // Create output sections described by SECTIONS commands.
+  script->processSectionCommands();
+
+  // Two input sections with different output sections should not be folded.
+  // ICF runs after processSectionCommands() so that we know the output sections.
   if (config->icf != ICFLevel::None) {
     findKeepUniqueSections<ELFT>(args);
     doIcf<ELFT>();
   }

   // Read the callgraph now that we know what was gced or icfed
   if (config->callGraphProfileSort) {
     if (auto *arg = args.getLastArg(OPT_call_graph_ordering_file))
       if (Optional<MemoryBufferRef> buffer = readFile(arg->getValue()))
         readCallGraph(*buffer);
     readCallGraphsFromObjectFiles<ELFT>();
   }

   // Write the result to the file.
   writeResult<ELFT>();
 }

lld/trunk/ELF/ICF.cpp

[68 lines not shown]
 // [1] Safe ICF: Pointer Safe and Unwinding aware Identical Code Folding
 // in the Gold Linker
 // http://static.googleusercontent.com/media/research.google.com/en//pubs/archive/36912.pdf
 //
 //===----------------------------------------------------------------------===//

 #include "ICF.h"
 #include "Config.h"
+#include "LinkerScript.h"
+#include "OutputSections.h"
 #include "SymbolTable.h"
 #include "Symbols.h"
 #include "SyntheticSections.h"
 #include "Writer.h"
 #include "lld/Common/Threads.h"
 #include "llvm/ADT/StringExtras.h"
 #include "llvm/BinaryFormat/ELF.h"
 #include "llvm/Object/ELF.h"
[214 lines not shown]
 // except relocation targets.
 template <class ELFT>
 bool ICF<ELFT>::equalsConstant(const InputSection *a, const InputSection *b) {
   if (a->numRelocations != b->numRelocations || a->flags != b->flags ||
       a->getSize() != b->getSize() || a->data() != b->data())
     return false;

   // If two sections have different output sections, we cannot merge them.
-  // FIXME: This doesn't do the right thing in the case where there is a linker
-  // script. We probably need to move output section assignment before ICF to
-  // get the correct behaviour here.
-  if (getOutputSectionName(a) != getOutputSectionName(b))
+  if (getOutputSectionName(a) != getOutputSectionName(b) ||
+      a->getParent() != b->getParent())
     return false;

   if (a->areRelocsRela)
     return constantEq(a, a->template relas<ELFT>(), b,
                       b->template relas<ELFT>());
   return constantEq(a, a->template rels<ELFT>(), b, b->template rels<ELFT>());
 }

[175 lines not shown] for (size_t i = begin + 1; i < end; ++i) {

       // At this point we know sections merged are fully identical and hence
       // we want to remove duplicate implicit dependencies such as link order
       // and relocation sections.
       for (InputSection *isec : sections[i]->dependentSections)
         isec->markDead();
     }
   });
+
+  // InputSectionDescription::sections is populated by processSectionCommands().
+  // ICF may fold some input sections assigned to output sections. Remove them.
+  for (BaseCommand *base : script->sectionCommands)
+    if (auto *sec = dyn_cast<OutputSection>(base))
+      for (BaseCommand *sub_base : sec->sectionCommands)
+        if (auto *isd = dyn_cast<InputSectionDescription>(sub_base))
+          llvm::erase_if(isd->sections,
+                         [](InputSection *isec) { return !isec->isLive(); });
 }

 // ICF entry point function.
 template <class ELFT> void elf::doIcf() { ICF<ELFT>().run(); }

 template void elf::doIcf<ELF32LE>();
 template void elf::doIcf<ELF32BE>();
 template void elf::doIcf<ELF64LE>();
 template void elf::doIcf<ELF64BE>();

lld/trunk/ELF/LinkerScript.h

[268 lines not shown] public:

   std::vector<PhdrEntry *> createPhdrs();
   bool needsInterpSection();

   bool shouldKeep(InputSectionBase *s);
   const Defined *assignAddresses();
   void allocateHeaders(std::vector<PhdrEntry *> &phdrs);
   void processSectionCommands();
+  void processSymbolAssignments();
   void declareSymbols();

   // Used to handle INSERT AFTER statements.
   void processInsertCommands();

   // SECTIONS command list.
   std::vector<BaseCommand *> sectionCommands;

[28 lines not shown]

lld/trunk/ELF/LinkerScript.cpp

[42 lines not shown]
 using namespace llvm::ELF;
 using namespace llvm::object;
 using namespace llvm::support::endian;
 using namespace lld;
 using namespace lld::elf;

 LinkerScript *elf::script;

-static uint64_t getOutputSectionVA(SectionBase *inputSec, StringRef loc) {
-  if (OutputSection *os = inputSec->getOutputSection())
-    return os->addr;
-  error(loc + ": unable to evaluate expression: input section " +
-        inputSec->name + " has no output section assigned");
-  return 0;
+static uint64_t getOutputSectionVA(SectionBase *sec) {
+  OutputSection *os = sec->getOutputSection();
+  assert(os && "input section has no output section assigned");
+  return os ? os->addr : 0;
 }

 uint64_t ExprValue::getValue() const {
   if (sec)
-    return alignTo(sec->getOffset(val) + getOutputSectionVA(sec, loc),
+    return alignTo(sec->getOffset(val) + getOutputSectionVA(sec),
                    alignment);
   return alignTo(val, alignment);
 }

 uint64_t ExprValue::getSecAddr() const {
   if (sec)
-    return sec->getOffset(0) + getOutputSectionVA(sec, loc);
+    return sec->getOffset(0) + getOutputSectionVA(sec);
   return 0;
 }

 uint64_t ExprValue::getSectionOffset() const {
   // If the alignment is trivial, we don't have to compute the full
   // value to know the offset. This allows this function to succeed in
   // cases where the output section is not yet known.
-  if (alignment == 1 && (!sec || !sec->getOutputSection()))
+  if (alignment == 1 && !sec)
     return val;
   return getValue() - getSecAddr();
 }

 OutputSection *LinkerScript::createOutputSection(StringRef name,
                                                  StringRef location) {
   OutputSection *&secRef = nameToOutputSection[name];
   OutputSection *sec;
[67 lines not shown] static bool shouldDefineSym(SymbolAssignment *cmd) {
   // If a symbol was in PROVIDE(), we need to define it only
   // when it is a referenced undefined symbol.
   Symbol *b = symtab->find(cmd->name);
   if (b && !b->isDefined())
     return true;
   return false;
 }

-// This function is called from processSectionCommands,
-// while we are fixing the output section layout.
+// Called by processSymbolAssignments() to assign definitions to
+// linker-script-defined symbols.
 void LinkerScript::addSymbol(SymbolAssignment *cmd) {
   if (!shouldDefineSym(cmd))
     return;

   // Define a symbol.
   ExprValue value = cmd->expression();
   SectionBase *sec = value.isAbsolute() ? nullptr : value.sec;
   uint8_t visibility = cmd->hidden ? STV_HIDDEN : STV_DEFAULT;
[303 lines not shown] for (BaseCommand *base : outCmd.sectionCommands) {
     if (auto *cmd = dyn_cast<InputSectionDescription>(base)) {
       cmd->sections = computeInputSections(cmd);
       ret.insert(ret.end(), cmd->sections.begin(), cmd->sections.end());
     }
   }
   return ret;
 }

+// Create output sections described by SECTIONS commands.
 void LinkerScript::processSectionCommands() {
-  // A symbol can be assigned before any section is mentioned in the linker
-  // script. In an DSO, the symbol values are addresses, so the only important
-  // section values are:
-  // * SHN_UNDEF
-  // * SHN_ABS
-  // * Any value meaning a regular section.
-  // To handle that, create a dummy aether section that fills the void before
-  // the linker scripts switches to another section. It has an index of one
-  // which will map to whatever the first actual section is.
-  aether = make<OutputSection>("", 0, SHF_ALLOC);
-  aether->sectionIndex = 1;
-
-  // Ctx captures the local AddressState and makes it accessible deliberately.
-  // This is needed as there are some cases where we cannot just
-  // thread the current state through to a lambda function created by the
-  // script parser.
-  auto deleter = std::make_unique<AddressState>();
-  ctx = deleter.get();
-  ctx->outSec = aether;
-
   size_t i = 0;
-  // Add input sections to output sections.
   for (BaseCommand *base : sectionCommands) {
-    // Handle symbol assignments outside of any output section.
-    if (auto *cmd = dyn_cast<SymbolAssignment>(base)) {
-      addSymbol(cmd);
-      continue;
-    }
-
     if (auto *sec = dyn_cast<OutputSection>(base)) {
       std::vector<InputSection *> v = createInputSectionList(*sec);

       // The output section name `/DISCARD/' is special.
       // Any input section assigned to it is discarded.
       if (sec->name == "/DISCARD/") {
         discard(v);
         sec->sectionCommands.clear();
[9 lines not shown] if (auto *sec = dyn_cast<OutputSection>(base)) {
       // way to "make it as if it wasn't present" is to make it empty.
       if (!matchConstraints(v, sec->constraint)) {
         for (InputSectionBase *s : v)
           s->assigned = false;
         sec->sectionCommands.clear();
         continue;
       }

-      // A directive may contain symbol definitions like this:
-      // ".foo : { ...; bar = .; }". Handle them.
-      for (BaseCommand *base : sec->sectionCommands)
-        if (auto *outCmd = dyn_cast<SymbolAssignment>(base))
-          addSymbol(outCmd);
-
       // Handle subalign (e.g. ".foo : SUBALIGN(32) { ... }"). If subalign
       // is given, input sections are aligned to that value, whether the
       // given value is larger or smaller than the original section alignment.
       if (sec->subalignExpr) {
         uint32_t subalign = sec->subalignExpr().getValue();
         for (InputSectionBase *s : v)
           s->alignment = subalign;
       }

-      // Add input sections to an output section.
+      // Some input sections may be removed from the list after ICF.
       for (InputSection *s : v)
         sec->addSection(s);

       sec->sectionIndex = i++;
       if (sec->noload)
         sec->type = SHT_NOBITS;
       if (sec->nonAlloc)
         sec->flags &= ~(uint64_t)SHF_ALLOC;
     }
   }
+}
+
+void LinkerScript::processSymbolAssignments() {
+  // Dot outside an output section still represents a relative address, whose
+  // sh_shndx should not be SHN_UNDEF or SHN_ABS. Create a dummy aether section
+  // that fills the void outside a section. It has an index of one, which is
+  // indistinguishable from any other regular section index.
+  aether = make<OutputSection>("", 0, SHF_ALLOC);
+  aether->sectionIndex = 1;
+
+  // ctx captures the local AddressState and makes it accessible deliberately.
+  // This is needed as there are some cases where we cannot just thread the
+  // current state through to a lambda function created by the script parser.
+  AddressState state;
+  ctx = &state;
+  ctx->outSec = aether;
+
+  for (BaseCommand *base : sectionCommands) {
+    if (auto *cmd = dyn_cast<SymbolAssignment>(base))
+      addSymbol(cmd);
+    else
+      for (BaseCommand *sub_base : cast<OutputSection>(base)->sectionCommands)
+        if (auto *cmd = dyn_cast<SymbolAssignment>(sub_base))
+          addSymbol(cmd);
+  }
+
   ctx = nullptr;
 }

 static OutputSection *findByName(ArrayRef<BaseCommand *> vec,
                                  StringRef name) {
   for (BaseCommand *base : vec)
     if (auto *sec = dyn_cast<OutputSection>(base))
       if (sec->name == name)
[637 lines not shown]

lld/trunk/ELF/Writer.h

[13 lines not shown]
 #include <cstdint>
 #include <memory>

 namespace lld {
 namespace elf {
 class InputFile;
 class OutputSection;
 class InputSectionBase;
+void copySectionsIntoPartitions();
+template <class ELFT> void createSyntheticSections();
 void combineEhSections();
 template <class ELFT> void writeResult();

 // This describes a program header entry.
 // Each contains type, access flags and range of output sections that will be
 // placed in it.
 struct PhdrEntry {
   PhdrEntry(unsigned type, unsigned flags) : p_type(type), p_flags(flags) {}
[33 lines not shown]

lld/trunk/ELF/Writer.cpp

[146 lines not shown] if (p->p_type != PT_LOAD)
       return false;
     if (!p->firstSec)
       return true;
     uint64_t size = p->lastSec->addr + p->lastSec->size - p->firstSec->addr;
     return size == 0;
   });
 }

-static void copySectionsIntoPartitions() {
+void elf::copySectionsIntoPartitions() {
   std::vector<InputSectionBase *> newSections;
   for (unsigned part = 2; part != partitions.size() + 1; ++part) {
     for (InputSectionBase *s : inputSections) {
       if (!(s->flags & SHF_ALLOC) || !s->isLive())
         continue;
       InputSectionBase *copy;
       if (s->type == SHT_NOTE)
         copy = make<InputSection>(cast<InputSection>(*s));
[139 lines not shown]
 static OutputSection *findSection(StringRef name, unsigned partition = 1) {
   for (BaseCommand *base : script->sectionCommands)
     if (auto *sec = dyn_cast<OutputSection>(base))
       if (sec->name == name && sec->partition == partition)
         return sec;
   return nullptr;
 }

-// Initialize Out members.
-template <class ELFT> static void createSyntheticSections() {
+template <class ELFT> void elf::createSyntheticSections() {
   // Initialize all pointers with NULL. This is needed because
   // you can call lld::elf::main more than once as a library.
   memset(&Out::first, 0, sizeof(Out));

   auto add = [](InputSectionBase *sec) { inputSections.push_back(sec); };

   in.shStrTab = make<StringTableSection>(".shstrtab", false);

[209 lines not shown] if (in.symTabShndx)
     add(in.symTabShndx);
   add(in.shStrTab);
   if (in.strTab)
     add(in.strTab);
 }

// The main function of the writer.		// The main function of the writer.
template <class ELFT> void Writer<ELFT>::run() {		template <class ELFT> void Writer<ELFT>::run() {
// Make copies of any input sections that need to be copied into each
// partition.
copySectionsIntoPartitions();

// Create linker-synthesized sections such as .got or .plt.
// Such sections are of type input section.
createSyntheticSections<ELFT>();

// Some input sections that are used for exception handling need to be moved
// into synthetic sections. Do that now so that they aren't assigned to
// output sections in the usual way.
if (!config->relocatable)
combineEhSections();

// We want to process linker script commands. When SECTIONS command
// is given we let it create sections.
script->processSectionCommands();

// Linker scripts controls how input sections are assigned to output sections.		// Linker scripts controls how input sections are assigned to output sections.
// Input sections that were not handled by scripts are called "orphans", and		// Input sections that were not handled by scripts are called "orphans", and
// they are assigned to output sections by the default rule. Process that.		// they are assigned to output sections by the default rule. Process that.
script->addOrphanSections();		script->addOrphanSections();

if (config->discard != DiscardPolicy::All)		if (config->discard != DiscardPolicy::All)
copyLocalSymbols();		copyLocalSymbols();

[1,168 lines not shown]	template <class ELFT> void Writer<ELFT>::finalizeSections() {
// pieces. The relocation scan uses those pieces, so this has to be		// pieces. The relocation scan uses those pieces, so this has to be
// earlier.		// earlier.
for (Partition &part : partitions)		for (Partition &part : partitions)
finalizeSynthetic(part.ehFrame);		finalizeSynthetic(part.ehFrame);

symtab->forEachSymbol(		symtab->forEachSymbol(
[](Symbol *s) { s->isPreemptible = computeIsPreemptible(*s); });		[](Symbol *s) { s->isPreemptible = computeIsPreemptible(*s); });

		// Change values of linker-script-defined symbols from placeholders (assigned
		// by declareSymbols) to actual definitions.
		script->processSymbolAssignments();

// Scan relocations. This must be done after every symbol is declared so that		// Scan relocations. This must be done after every symbol is declared so that
// we can correctly decide if a dynamic relocation is needed.		// we can correctly decide if a dynamic relocation is needed. This is called
		// after processSymbolAssignments() because it needs to know whether a
		// linker-script-defined symbol is absolute.
if (!config->relocatable) {		if (!config->relocatable) {
forEachRelSec(scanRelocations<ELFT>);		forEachRelSec(scanRelocations<ELFT>);
reportUndefinedSymbols<ELFT>();		reportUndefinedSymbols<ELFT>();
}		}

if (in.plt && in.plt->isNeeded())		if (in.plt && in.plt->isNeeded())
in.plt->addSymbols();		in.plt->addSymbols();
if (in.iplt && in.iplt->isNeeded())		if (in.iplt && in.iplt->isNeeded())
[980 lines not shown]	case BuildIdKind::Uuid:
break;		break;
default:		default:
llvm_unreachable("unknown BuildIdKind");		llvm_unreachable("unknown BuildIdKind");
}		}
for (Partition &part : partitions)		for (Partition &part : partitions)
part.buildId->writeBuildId(buildId);		part.buildId->writeBuildId(buildId);
}		}

		template void elf::createSyntheticSections<ELF32LE>();
		template void elf::createSyntheticSections<ELF32BE>();
		template void elf::createSyntheticSections<ELF64LE>();
		template void elf::createSyntheticSections<ELF64BE>();

template void elf::writeResult<ELF32LE>();		template void elf::writeResult<ELF32LE>();
template void elf::writeResult<ELF32BE>();		template void elf::writeResult<ELF32BE>();
template void elf::writeResult<ELF64LE>();		template void elf::writeResult<ELF64LE>();
template void elf::writeResult<ELF64BE>();		template void elf::writeResult<ELF64BE>();

lld/trunk/test/ELF/linkerscript/early-assign-symbol.s

	# REQUIRES: x86			# REQUIRES: x86
	# RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %s -o %t.o

	# RUN: echo "SECTIONS { aaa = foo | 1; .text : { *(.text*) } }" > %t3.script			## The definitions of symbol assignments may reference other symbols.
	# RUN: not ld.lld -o %t --script %t3.script %t.o 2>&1 | FileCheck %s			## Test we can handle them.

	# CHECK: error: {{.*}}.script:1: unable to evaluate expression: input section .text has no output section assigned			# RUN: echo "SECTIONS { aaa = foo | 1; .text : { *(.text*) } }" > %t3.script
				# RUN: ld.lld -o %t --script %t3.script %t.o
				# RUN: llvm-objdump -t %t | FileCheck --check-prefix=VAL1 %s

	# Simple cases that we can handle.			# VAL1: 0000000000000000 .text 00000000 foo
				# VAL1: 0000000000000001 .text 00000000 aaa

	# RUN: echo "SECTIONS { aaa = ABSOLUTE(foo - 1) + 1; .text : { *(.text*) } }" > %t.script			# RUN: echo "SECTIONS { aaa = ABSOLUTE(foo - 1) + 1; .text : { *(.text*) } }" > %t.script
	# RUN: ld.lld -o %t --script %t.script %t.o			# RUN: ld.lld -o %t --script %t.script %t.o
	# RUN: llvm-objdump -t %t | FileCheck --check-prefix=VAL %s			# RUN: llvm-objdump -t %t | FileCheck --check-prefix=VAL %s

	# RUN: echo "SECTIONS { aaa = 1 + ABSOLUTE(foo - 1); .text : { *(.text*) } }" > %t.script			# RUN: echo "SECTIONS { aaa = 1 + ABSOLUTE(foo - 1); .text : { *(.text*) } }" > %t.script
	# RUN: ld.lld -o %t --script %t.script %t.o			# RUN: ld.lld -o %t --script %t.script %t.o
	# RUN: llvm-objdump -t %t | FileCheck --check-prefix=VAL %s			# RUN: llvm-objdump -t %t | FileCheck --check-prefix=VAL %s
	[11 lines not shown]
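
The change in this test (an error before the patch, `aaa` resolving to 1 after it) follows from the ordering: a symbol defined relative to an input section can only be evaluated once that input section has an output section. A minimal sketch of that dependency, assuming a toy model in which `ToyScript`, `assignSection`, and `symbolVA` are hypothetical names and not lld APIs:

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <stdexcept>
#include <string>

// Hypothetical toy model of a linker script state; not lld's LinkerScript.
struct ToyScript {
  std::map<std::string, std::string> inputToOutput; // input -> output section
  std::map<std::string, uint64_t> outputAddr;       // output section -> address

  // Phase 1: record which output section an input section belongs to.
  void assignSection(const std::string &in, const std::string &out,
                     uint64_t addr) {
    inputToOutput[in] = out;
    outputAddr[out] = addr;
  }

  // Phase 2: compute the address of a symbol at `offset` inside `in`.
  // Throws if phase 1 has not handled `in` yet.
  uint64_t symbolVA(const std::string &in, uint64_t offset) const {
    auto it = inputToOutput.find(in);
    if (it == inputToOutput.end())
      throw std::runtime_error("input section " + in +
                               " has no output section assigned");
    return outputAddr.at(it->second) + offset;
  }
};
```

In this model, evaluating `aaa = foo | 1` before `assignSection(".text", ".text", 0)` throws, while evaluating it afterwards gives `symbolVA(".text", 0) | 1`, which is 1 and agrees with the VAL1 expectation.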

lld/trunk/test/ELF/linkerscript/icf-output-sections.s

				# REQUIRES: x86
				# RUN: llvm-mc -filetype=obj -triple=x86_64 %s -o %t.o
				# RUN: echo 'SECTIONS { .text : { *(.text*) } }' > %t1.script

				## Sections within the same output section can be freely folded.
				# RUN: ld.lld %t.o --script %t1.script --icf=all --print-icf-sections -o %t | FileCheck --check-prefix=ICF1 %s
				# RUN: llvm-readelf -S %t | FileCheck --check-prefix=SEC1 %s --implicit-check-not=.text

				# ICF1: selected section {{.*}}.o:(.text.foo0)
				# ICF1-NEXT: removing identical section {{.*}}.o:(.text.foo1)
				# ICF1-NEXT: removing identical section {{.*}}.o:(.text.bar0)
				# ICF1-NEXT: removing identical section {{.*}}.o:(.text.bar1)

				# SEC1: .text PROGBITS 0000000000000000 001000 000001

				## Sections with different output sections cannot be folded. Without the
				## linker script, .text.foo* and .text.bar* go to the same output section
				## .text and they will be folded.
				# RUN: echo 'SECTIONS { .text.foo : {*(.text.foo*)} .text.bar : {*(.text.bar*)} }' > %t2.script
				# RUN: ld.lld %t.o --script %t2.script --icf=all --print-icf-sections -o %t | FileCheck --check-prefix=ICF2 %s
				# RUN: llvm-readelf -S %t | FileCheck --check-prefix=SEC2 %s

				# ICF2: selected section {{.*}}.o:(.text.foo0)
				# ICF2-NEXT: removing identical section {{.*}}.o:(.text.foo1)
				# ICF2-NEXT: selected section {{.*}}.o:(.text.bar0)
				# ICF2-NEXT: removing identical section {{.*}}.o:(.text.bar1)

				# SEC2: .text.foo PROGBITS 0000000000000000 001000 000001
				# SEC2-NEXT: .text.bar PROGBITS 0000000000000001 001001 000001

				## .text.bar* are orphans that get assigned to .text.
				# RUN: echo 'SECTIONS { .text.foo : {*(.text.foo*)} }' > %t3.script
				# RUN: ld.lld %t.o --script %t3.script --icf=all --print-icf-sections -o %t | FileCheck --check-prefix=ICF2 %s
				# RUN: llvm-readelf -S %t | FileCheck --check-prefix=SEC3 %s

				# SEC3: .text.foo PROGBITS 0000000000000000 001000 000001
				# SEC3-NEXT: .text PROGBITS 0000000000000004 001004 000001

				.section .text.foo0,"ax"
				ret
				.section .text.foo1,"ax"
				ret
				.section .text.bar0,"ax"
				ret
				.section .text.bar1,"ax"
				ret
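
The ICF1 and ICF2 expectations in this test can be reproduced with a small model in which two sections are folded only when both their contents and their output section match. This is a sketch under stated assumptions: `ToySection` and `foldIdentical` are hypothetical names, and real ICF in lld compares more than raw contents (for example flags and relocations), so this is not the patch's implementation.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for an input section; not lld's InputSection.
struct ToySection {
  std::string name;       // e.g. ".text.foo0"
  std::string contents;   // raw bytes, compared for equality
  std::string output;     // output section chosen by the SECTIONS command
  const ToySection *repl; // representative section after folding
};

// Folds sections that have identical contents AND the same output section.
// The first section seen for each (output, contents) key becomes the
// representative. Returns how many sections were folded into another one.
inline int foldIdentical(std::vector<ToySection> &secs) {
  std::map<std::pair<std::string, std::string>, const ToySection *> leaders;
  int folded = 0;
  for (ToySection &s : secs) {
    auto res = leaders.emplace(std::make_pair(s.output, s.contents), &s);
    s.repl = res.first->second; // either itself or an earlier section
    if (!res.second)
      ++folded;
  }
  return folded;
}
```

With all four `ret` sections in one `.text` output section, the model keeps `.text.foo0` and folds the other three, as in ICF1. With `.text.foo*` and `.text.bar*` in different output sections, it keeps one representative per output section, as in ICF2.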

lld/trunk/test/ELF/linkerscript/subalign.s

	[17 lines not shown]
	# RUN: echo "SECTIONS { .aaa : SUBALIGN(1) { *(.aaa.*) } }" > %t2.script			# RUN: echo "SECTIONS { .aaa : SUBALIGN(1) { *(.aaa.*) } }" > %t2.script
	# RUN: ld.lld -o %t2 --script %t2.script %t1.o			# RUN: ld.lld -o %t2 --script %t2.script %t1.o
	# RUN: llvm-objdump -s %t2 | FileCheck -check-prefix=SUBALIGN %s			# RUN: llvm-objdump -s %t2 | FileCheck -check-prefix=SUBALIGN %s
	# SUBALIGN: Contents of section .aaa:			# SUBALIGN: Contents of section .aaa:
	# SUBALIGN: 01000000 00000000 02000000 00000000			# SUBALIGN: 01000000 00000000 02000000 00000000
	# SUBALIGN: 03000000 00000000 04000000 00000000			# SUBALIGN: 03000000 00000000 04000000 00000000

	## Test we do not assert or crash when dot(.) is used inside SUBALIGN.			## Test we do not assert or crash when dot(.) is used inside SUBALIGN.
	## ld.bfd does not allow to use dot in such expressions, our behavior is			## Value of dot is undefined. Some versions of ld.bfd do not allow to use dot
	## different for simplicity of implementation. Value of dot is undefined.			## in such expressions.
	# RUN: echo "SECTIONS { . = 0x32; .aaa : SUBALIGN(.) { *(.aaa*) } }" > %t3.script			# RUN: echo "SECTIONS { . = 0x32; .aaa : SUBALIGN(.) { *(.aaa*) } }" > %t3.script
	# RUN: ld.lld %t1.o --script %t3.script -o %t3			# RUN: not ld.lld %t1.o --script %t3.script -o /dev/null 2>&1 | FileCheck --check-prefix=ERR1 %s
	# RUN: llvm-objdump -s %t3 > /dev/null			# ERR1: {{.*}}.script:1: unable to get location counter value

	## Test we are able to link with zero alignment, this is consistent with bfd 2.26.1.			## Test we are able to link with zero alignment, this is consistent with bfd 2.26.1.
	# RUN: echo "SECTIONS { .aaa : SUBALIGN(0) { *(.aaa*) } }" > %t4.script			# RUN: echo "SECTIONS { .aaa : SUBALIGN(0) { *(.aaa*) } }" > %t4.script
	# RUN: ld.lld %t1.o --script %t4.script -o %t4			# RUN: ld.lld %t1.o --script %t4.script -o %t4
	# RUN: llvm-objdump -s %t4 | FileCheck -check-prefix=SUBALIGN %s			# RUN: llvm-objdump -s %t4 | FileCheck -check-prefix=SUBALIGN %s

	## Test we fail gracefuly when alignment value is not a power of 2.			## Test we fail gracefuly when alignment value is not a power of 2.
	# RUN: echo "SECTIONS { .aaa : SUBALIGN(3) { *(.aaa*) } }" > %t5.script			# RUN: echo "SECTIONS { .aaa : SUBALIGN(3) { *(.aaa*) } }" > %t5.script
	# RUN: not ld.lld %t1.o --script %t5.script -o /dev/null 2>&1 | FileCheck -check-prefix=ERR %s			# RUN: not ld.lld %t1.o --script %t5.script -o /dev/null 2>&1 | FileCheck --check-prefix=ERR2 %s
	# ERR: {{.*}}.script:1: alignment must be power of 2			# ERR2: {{.*}}.script:1: alignment must be power of 2

	.global _start			.global _start
	_start:			_start:
	nop			nop

	.section .aaa.1, "a"			.section .aaa.1, "a"
	.align 16			.align 16
	.quad 1			.quad 1
	[12 lines not shown]