This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
ELF/
-
LinkerScript.h
1/3
LinkerScript.cpp
3/6
Writer.cpp
-
test/ELF/linkerscript/
-
ELF/
-
linkerscript/
-
expr-converge.test

Differential D66279

[ELF] Make LinkerScript::assignAddresses iterative
ClosedPublic

Authored by MaskRay on Aug 15 2019, 2:22 AM.

Download Raw Diff

Details

Reviewers

grimar
peter.smith
ruiu
• espindola

Commits

rGdebcac9fef21: [ELF] Make LinkerScript::assignAddresses iterative
rL369889: [ELF] Make LinkerScript::assignAddresses iterative
rLLD369889: [ELF] Make LinkerScript::assignAddresses iterative

Summary

PR42990. For SECTIONS { b = a; . = 0xff00 + (a >> 8); a = .; },
we currently set st_value(a)=0xff00 while st_value(b)=0xffff.

The following call tree demonstrates the problem:

link<ELF64LE>(Args);
  Script->declareSymbols(); // insert a and b as absolute Defined
  Writer<ELFT>().run();
    Script->processSectionCommands();
      addSymbol(cmd);       // a and b are re-inserted. LinkerScript::getSymbolValue
                            // is lazily called by subsequent evaluation
    finalizeSections();
      forEachRelSec(scanRelocations<ELFT>);
        processRelocAux     // another problem PR42506, not affected by this patch
      finalizeAddressDependentContent(); // loop executed once
        script->assignAddresses(); // a = 0, b = 0xff00
    script->assignAddresses(); // a = 0xff00, _end = 0xffff

We need another assignAddresses() to finalize the value of a.

This patch

modifies assignAddress() to track the original section/value of each symbol and return a symbol whose section/value has changed.
moves the post-finalizeSections assignAddress() inside the loop of finalizeAddressDependentContent() and makes it iterative. Symbol assignment may not converge so we make a few attempts before bailing out.

Note, assignAddresses() must be called at least twice. The penultimate
call finalized section addresses while the last finalized symbol values.
It is somewhat obscure because there was no comment.
linkerscript/addr-zero.test tests this.

Diff Detail

Repository

rLLD LLVM Linker

Build Status

Buildable 36814
Build 36813: arc lint + arc unit

Event Timeline

MaskRay created this revision.Aug 15 2019, 2:22 AM

Herald added a reviewer: • espindola. · View Herald TranscriptAug 15 2019, 2:22 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, arichardson, emaste. · View Herald Transcript

Harbormaster completed remote builds in B36814: Diff 215346.Aug 15 2019, 2:26 AM

I think that this is close. I've got some concerns about the interaction of convergence limits, and it may be worth investigating if it is possible to make a separate symbol update pass that runs after addresses have stabilized.

I think it will be worth adding a test to make sure that the bail out does indeed occur. Something like:

sym1 = sym2
sym2 = sym3
sym3 = sym4
sym4 = sym5
sym6 = sym7
sym8 = .

I'm happy to work on a thunk creation test that takes quite a few iterations to converge.

ELF/Writer.cpp
1571	The thunks code already has a converge limit of 10, I think it would be possible to create a test case that had a single assignment say sym = . that was after a code section that would vary in size due to Thunk creation. If the code section converged within 10 but not within 5 then we'd get an error. I think if the symbol convergence test is within this loop then we should probably extract the convergence test out of ThunkCreation, or at least have some way of synchronizing all the convergence limits. Writing a thunk test that doesn't converge isn't difficult, writing one that takes many iterations to convergence is somewhat tedious. I can help write one as I've done a few before.
1595	To follow on from the previous comment about convergence errors in Thunks, if we choose to keep the convergence tests separate so we know which part isn't converging (if that is even possible) then it might be worth moving the tries and error message into script and assignAddresses() to keep this loop clean.
1602	An argument could be made that testing for convergence of symbols could/should be done after the final addresses have stabilised. I don't think that there is a legal linker script that could be made where assigning to symbols in a linker script should affect the convergence of thunks, patches and partitions. Doing the symbol assignment with stable addresses would avoid the clashing convergence limit problem. My initial thought was to see if just a symbol assignment update could be run here, without going through the complexity of assignAddresses() but that would definitely need some thought.

MaskRay marked an inline comment as done.Aug 15 2019, 3:27 AM

MaskRay added inline comments.

ELF/Writer.cpp
1571	This is a bit different. The return value of `script->assignAddresses();` does not affect `changed`, i.e. before thunks converge (`changed` becomes false) `tries` should not decrease. `script->assignAddresses` is executed at least twice, without or with this patch. This is a bit awkward, but if I change the logic here to: for (;;) assignAddresses(); createThunks(); if (!changed) break; test/ELF/linkerscript/addr-zero.test added by @grimar's D55550 will break. foo = ADDR(.text) - ABSOLUTE(ADDR(.text)); Ideally we don't have to call `assignAddresses` twice. Writing a thunk test that doesn't converge isn't difficult, writing one that takes many iterations to convergence is somewhat tedious. Yes, a test involving both non-trivial symbol assignment and non-trivial thunk will be very useful... Thanks in advance.

Add a non-convergence test expr-converge2.test and improve expr-converge.test

Harbormaster completed remote builds in B36824: Diff 215364.Aug 15 2019, 3:35 AM

Improve test
Add a comment why the two calls of assignAddresses() are written this way.

Harbormaster completed remote builds in B36825: Diff 215374.Aug 15 2019, 5:50 AM

bug fix: return sym; -> changed = sym;

Harbormaster completed remote builds in B36826: Diff 215377.Aug 15 2019, 5:55 AM

Investigated a bit how ld.bfd works.

SECTIONS {
  . = 0x1000;
  a = b + 1; b = c + 1; c = d + 1; d = e + 1; e = f + 1; f = g + 1; g = h + 1; h = .;
}
// ld/ld-new -T a.lds a.o -o a
// undefined symbol `b' referenced in expression

SECTIONS {
  . = 0x1000;
  a = b + 1; b = c + 1; c = d + 1; d = e + 1; e = f + 1; f = g + 1; g = .;
}
// st_value(a) is incorrect 6, not 0x1006.

// st_value(a) is correct (0x1005) with one less variable.

In either case, lang_do_assignments ran at least 3 times.

lang_do_assignments (lang_mark_phase_enum); // in ld/ldlang.c:lang_process
lang_do_assignments (lang_assigning_phase_enum); // in ld/ldlang.c:lang_relax_sections
lang_do_assignments (lang_final_phase_enum); // in ld/ldlang.c:lang_process

The initial 3 calls allow it to compute a slightly longer chain, though a new lang_do_assignments (lang_assigning_phase_enum); only allows it to compute the case with one more variable. I observe that the aarch64 port calls lang_do_assignments 4 times. This is because ld/emultempl/elf-generic.em:gld${EMULATION_NAME}_map_segments (bfd_boolean need_layout) calls lang_relax_sections (need_layout); twice.

In lang_relax_sections, the symbol assignment phase is followed by lang_size_sections. Our script->assignAddresses() should match its behavior well.

void
lang_relax_sections (bfd_boolean need_layout)
{
...
  if (need_layout)
    {
      /* Final extra sizing to report errors.  */
      lang_do_assignments (lang_assigning_phase_enum);
      lang_reset_memory_regions ();
      lang_size_sections (NULL, TRUE);
    }
}

In finalizeAddressDependentContent(), we could extract the symbol assignment iteration and do it separately after the layout is fixed, but that would require more code.

peter.smith mentioned this in D66346: [LLD][ELF][ARM] Add a test that maxes out the thunk convergence limit.Aug 16 2019, 6:44 AM

I've created D66346 which maxes out the permitted Thunk convergence limit. I've done it as Arm rather than AArch64 as the range limits are lower on Arm. This should be sufficient to test that if there is any interaction between convergence limits. Apologies for the delay in putting it together.

Change tests to adapt D66346

Harbormaster completed remote builds in B36887: Diff 215626.Aug 16 2019, 9:43 AM

ruiu added inline comments.Aug 19 2019, 1:00 AM

ELF/Writer.cpp
1576	I believe this line is executed only when you give pathetic object files to the linker, but we still probably have to use `error` to make it work better for the embedding use case.

fatal("thunk creation not converged");

+ error("thunk creation not converged");
+ break;

Harbormaster completed remote builds in B36936: Diff 215830.Aug 19 2019, 1:26 AM

Thanks for the update. Have been thinking in the back of my mind if there were any other way of solving the problem without needing to iterate. The only thing I can think of right now is some kind of analysis that, for want of a better word, topologically sorts expressions so that there are no forward references, with an error message if this isn't possible. This could get complicated as there is limited scope to move or reorder assignments in OutputSections. Other than that I've not got any more comments at the moment.

ELF/LinkerScript.cpp
1067	It might be worth extracting this and the check below into functions. Something like: oldValues = getSymbolAssignmentValues(sectionCommands); ... return findChangedSymbol(oldValues); Not a strong opinion though.
ELF/Writer.cpp
1570–1571	This comment won't be true anymore, depending on linker scripts.

Delete a stale comment
Extract out getSymbolAssignmentValues

Harbormaster completed remote builds in B36944: Diff 215853.Aug 19 2019, 3:30 AM

In D66279#1634954, @peter.smith wrote:

Thanks for the update. Have been thinking in the back of my mind if there were any other way of solving the problem without needing to iterate. The only thing I can think of right now is some kind of analysis that, for want of a better word, topologically sorts expressions so that there are no forward references, with an error message if this isn't possible. This could get complicated as there is limited scope to move or reorder assignments in OutputSections. Other than that I've not got any more comments at the moment.

Unfortunately linker scripts are not in a single static assignment form... = and += can appear multiple times. Such analysis will be difficult. According to my experiments (see prior comments), ld.bfd's evaluation strategy is quite similar to: run assignment statements one by one, but repeat a few times, where the repetition count depends on layout convergence. It doesn't wait for symbol assignments to converge, but it evaluates the assignment statements more than we current do. Thus it can correctly evaluate some simple forward declaration forms. I think the approach we use in this patch should capture its strategy quite well. This is probably sufficient for most forward declarations we may encounter in practice.

ELF/LinkerScript.cpp
1067	I'll do: oldValues = getSymbolAssignmentValues(sectionCommands); ... for (auto &it : oldValues) { /// this part is not too complex now, so i'll inline it ... } return changed;

🎡Ready to roll!

I'm happy with this as it stands. It should help prevent problems coming from alterations to the linux kernel's linker script and also help others migrating from ld.bfd. With luck we'll be able to build on this to fix the symbol type problem in https://bugs.llvm.org/show_bug.cgi?id=42506

nickdesaulniers added a subscriber: nickdesaulniers.Aug 21 2019, 11:58 PM

@ruiu 🎡

🤔

MaskRay mentioned this in D66717: [ELF] Do not ICF two sections with different output sections (by SECTIONS commands).Aug 26 2019, 12:30 AM

LGTM

ELF/LinkerScript.cpp
1054	This needs a function comment.

This revision is now accepted and ready to land.Aug 26 2019, 2:47 AM

Move getSymbolAssignmentValues above because it might be used by processSymbolAssignments in the future.

add a function-level comment

Add getChangedSymbolAssignment

Harbormaster completed remote builds in B37269: Diff 217102.Aug 26 2019, 3:09 AM

vector -> const vector &

Harbormaster completed remote builds in B37270: Diff 217103.Aug 26 2019, 3:21 AM

Closed by commit rL369889: [ELF] Make LinkerScript::assignAddresses iterative (authored by MaskRay). · Explain WhyAug 26 2019, 3:23 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

ELF/

LinkerScript.h

2 lines

LinkerScript.cpp

22 lines

Writer.cpp

23 lines

test/

ELF/

linkerscript/

expr-converge.test

13 lines

Diff 215346

ELF/LinkerScript.h

Show First 20 Lines • Show All 265 Lines • ▼ Show 20 Lines	public:
void addOrphanSections();		void addOrphanSections();
void adjustSectionsBeforeSorting();		void adjustSectionsBeforeSorting();
void adjustSectionsAfterSorting();		void adjustSectionsAfterSorting();

std::vector<PhdrEntry *> createPhdrs();		std::vector<PhdrEntry *> createPhdrs();
bool needsInterpSection();		bool needsInterpSection();

bool shouldKeep(InputSectionBase *s);		bool shouldKeep(InputSectionBase *s);
void assignAddresses();		const Defined *assignAddresses();
void allocateHeaders(std::vector<PhdrEntry *> &phdrs);		void allocateHeaders(std::vector<PhdrEntry *> &phdrs);
void processSectionCommands();		void processSectionCommands();
void declareSymbols();		void declareSymbols();

// Used to handle INSERT AFTER statements.		// Used to handle INSERT AFTER statements.
void processInsertCommands();		void processInsertCommands();

// SECTIONS command list.		// SECTIONS command list.
Show All 30 Lines

ELF/LinkerScript.cpp

Show First 20 Lines • Show All 1,045 Lines • ▼ Show 20 Lines	static uint64_t getInitialDot() {
// The sections with -T<section> have been sorted in order of ascending		// The sections with -T<section> have been sorted in order of ascending
// address. We must lower startAddr if the lowest -T<section address> as		// address. We must lower startAddr if the lowest -T<section address> as
// calls to setDot() must be monotonically increasing.		// calls to setDot() must be monotonically increasing.
for (auto &kv : config->sectionStartMap)		for (auto &kv : config->sectionStartMap)
startAddr = std::min(startAddr, kv.second);		startAddr = std::min(startAddr, kv.second);
return std::min(startAddr, target->getImageBase() + elf::getHeaderSize());		return std::min(startAddr, target->getImageBase() + elf::getHeaderSize());
}		}

// Here we assign addresses as instructed by linker script SECTIONS		// Here we assign addresses as instructed by linker script SECTIONS
		ruiuUnsubmitted Not Done Reply Inline Actions This needs a function comment. ruiu: This needs a function comment.
// sub-commands. Doing that allows us to use final VA values, so here		// sub-commands. Doing that allows us to use final VA values, so here
// we also handle rest commands like symbol assignments and ASSERTs.		// we also handle rest commands like symbol assignments and ASSERTs.
void LinkerScript::assignAddresses() {		// Returns a symbol that has changed its section or value, or nullptr if no
		// symbol has changed.
		const Defined *LinkerScript::assignAddresses() {
dot = getInitialDot();		dot = getInitialDot();

auto deleter = std::make_unique<AddressState>();		auto deleter = std::make_unique<AddressState>();
ctx = deleter.get();		ctx = deleter.get();
errorOnMissingSection = true;		errorOnMissingSection = true;
switchTo(aether);		switchTo(aether);

		DenseMap<const Defined , std::pair<SectionBase , uint64_t>> old;
		peter.smithUnsubmitted Not Done Reply Inline Actions It might be worth extracting this and the check below into functions. Something like: oldValues = getSymbolAssignmentValues(sectionCommands); ... return findChangedSymbol(oldValues); Not a strong opinion though. peter.smith: It might be worth extracting this and the check below into functions. Something like: ```…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions I'll do: oldValues = getSymbolAssignmentValues(sectionCommands); ... for (auto &it : oldValues) { /// this part is not too complex now, so i'll inline it ... } return changed; MaskRay: I'll do: ``` oldValues = getSymbolAssignmentValues(sectionCommands); ... for (auto &it…
		for (BaseCommand *base : sectionCommands)
		if (auto *cmd = dyn_cast<SymbolAssignment>(base))
		if (cmd->sym)
		old.try_emplace(cmd->sym,
		std::make_pair(cmd->sym->section, cmd->sym->value));

for (BaseCommand *base : sectionCommands) {		for (BaseCommand *base : sectionCommands) {
if (auto *cmd = dyn_cast<SymbolAssignment>(base)) {		if (auto *cmd = dyn_cast<SymbolAssignment>(base)) {
cmd->addr = dot;		cmd->addr = dot;
assignSymbol(cmd, false);		assignSymbol(cmd, false);
cmd->size = dot - cmd->addr;		cmd->size = dot - cmd->addr;
continue;		continue;
}		}
assignOffsets(cast<OutputSection>(base));		assignOffsets(cast<OutputSection>(base));
}		}
ctx = nullptr;		ctx = nullptr;

		// Return the lexicographical smallest (for determinism) Defined whose
		// section/value has changed.
		const Defined *changed = nullptr;
		for (auto &it : old) {
		const Defined *sym = it.first;
		if ((sym->section != it.second.first \|\| sym->value != it.second.second) &&
		(!changed \|\| sym->getName() < changed->getName()))
		return sym;
		}
		return nullptr;
}		}

// Creates program headers as instructed by PHDRS linker script command.		// Creates program headers as instructed by PHDRS linker script command.
std::vector<PhdrEntry *> LinkerScript::createPhdrs() {		std::vector<PhdrEntry *> LinkerScript::createPhdrs() {
std::vector<PhdrEntry *> ret;		std::vector<PhdrEntry *> ret;

// Process PHDRS and FILEHDR keywords because they are not		// Process PHDRS and FILEHDR keywords because they are not
// real output sections and cannot be added in the following loop.		// real output sections and cannot be added in the following loop.
▲ Show 20 Lines • Show All 84 Lines • Show Last 20 Lines

ELF/Writer.cpp

Show First 20 Lines • Show All 574 Lines • ▼ Show 20 Lines	template <class ELFT> void Writer<ELFT>::run() {
// completes section contents. For example, we need to add strings		// completes section contents. For example, we need to add strings
// to the string table, and add entries to .got and .plt.		// to the string table, and add entries to .got and .plt.
// finalizeSections does that.		// finalizeSections does that.
finalizeSections();		finalizeSections();
checkExecuteOnly();		checkExecuteOnly();
if (errorCount())		if (errorCount())
return;		return;

script->assignAddresses();

// If -compressed-debug-sections is specified, we need to compress		// If -compressed-debug-sections is specified, we need to compress
// .debug_* sections. Do it right now because it changes the size of		// .debug_* sections. Do it right now because it changes the size of
// output sections.		// output sections.
for (OutputSection *sec : outputSections)		for (OutputSection *sec : outputSections)
sec->maybeCompress<ELFT>();		sec->maybeCompress<ELFT>();

script->allocateHeaders(mainPart->phdrs);		script->allocateHeaders(mainPart->phdrs);

▲ Show 20 Lines • Show All 969 Lines • ▼ Show 20 Lines

// We need to generate and finalize the content that depends on the address of		// We need to generate and finalize the content that depends on the address of
// InputSections. As the generation of the content may also alter InputSection		// InputSections. As the generation of the content may also alter InputSection
// addresses we must converge to a fixed point. We do that here. See the comment		// addresses we must converge to a fixed point. We do that here. See the comment
// in Writer<ELFT>::finalizeSections().		// in Writer<ELFT>::finalizeSections().
template <class ELFT> void Writer<ELFT>::finalizeAddressDependentContent() {		template <class ELFT> void Writer<ELFT>::finalizeAddressDependentContent() {
ThunkCreator tc;		ThunkCreator tc;
AArch64Err843419Patcher a64p;		AArch64Err843419Patcher a64p;
		script->assignAddresses();

// For some targets, like x86, this loop iterates only once.		// For some targets, like x86, this loop iterates only once.
		int tries = 5;
		peter.smithUnsubmitted Not Done Reply Inline Actions The thunks code already has a converge limit of 10, I think it would be possible to create a test case that had a single assignment say sym = . that was after a code section that would vary in size due to Thunk creation. If the code section converged within 10 but not within 5 then we'd get an error. I think if the symbol convergence test is within this loop then we should probably extract the convergence test out of ThunkCreation, or at least have some way of synchronizing all the convergence limits. Writing a thunk test that doesn't converge isn't difficult, writing one that takes many iterations to convergence is somewhat tedious. I can help write one as I've done a few before. peter.smith: The thunks code already has a converge limit of 10, I think it would be possible to create a…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions This is a bit different. The return value of `script->assignAddresses();` does not affect `changed`, i.e. before thunks converge (`changed` becomes false) `tries` should not decrease. `script->assignAddresses` is executed at least twice, without or with this patch. This is a bit awkward, but if I change the logic here to: for (;;) assignAddresses(); createThunks(); if (!changed) break; test/ELF/linkerscript/addr-zero.test added by @grimar's D55550 will break. foo = ADDR(.text) - ABSOLUTE(ADDR(.text)); Ideally we don't have to call `assignAddresses` twice. Writing a thunk test that doesn't converge isn't difficult, writing one that takes many iterations to convergence is somewhat tedious. Yes, a test involving both non-trivial symbol assignment and non-trivial thunk will be very useful... Thanks in advance. MaskRay: This is a bit different. The return value of `script->assignAddresses();` does not affect…
		peter.smithUnsubmitted Done Reply Inline Actions This comment won't be true anymore, depending on linker scripts. peter.smith: This comment won't be true anymore, depending on linker scripts.
for (;;) {		for (;;) {
bool changed = false;		bool changed = target->needsThunks && tc.createThunks(outputSections);

script->assignAddresses();

if (target->needsThunks)
changed \|= tc.createThunks(outputSections);

if (config->fixCortexA53Errata843419) {		if (config->fixCortexA53Errata843419) {
if (changed)		if (changed)
		ruiuUnsubmitted Done Reply Inline Actions I believe this line is executed only when you give pathetic object files to the linker, but we still probably have to use `error` to make it work better for the embedding use case. ruiu: I believe this line is executed only when you give pathetic object files to the linker, but we…
script->assignAddresses();		script->assignAddresses();
changed \|= a64p.createFixes();		changed \|= a64p.createFixes();
}		}

if (in.mipsGot)		if (in.mipsGot)
in.mipsGot->updateAllocSize();		in.mipsGot->updateAllocSize();

for (Partition &part : partitions) {		for (Partition &part : partitions) {
changed \|= part.relaDyn->updateAllocSize();		changed \|= part.relaDyn->updateAllocSize();
if (part.relrDyn)		if (part.relrDyn)
changed \|= part.relrDyn->updateAllocSize();		changed \|= part.relrDyn->updateAllocSize();
}		}

if (!changed)		const Defined *changedSym = script->assignAddresses();
return;		if (!changed) {
		if (!changedSym)
		break;
		if (--tries == 0) {
		errorOrWarn("assignment to symbol " + toString(*changedSym) +
		peter.smithUnsubmitted Not Done Reply Inline Actions To follow on from the previous comment about convergence errors in Thunks, if we choose to keep the convergence tests separate so we know which part isn't converging (if that is even possible) then it might be worth moving the tries and error message into script and assignAddresses() to keep this loop clean. peter.smith: To follow on from the previous comment about convergence errors in Thunks, if we choose to keep…
		" does not converge");
		break;
		}
		}
}		}
}		}

		peter.smithUnsubmitted Not Done Reply Inline Actions An argument could be made that testing for convergence of symbols could/should be done after the final addresses have stabilised. I don't think that there is a legal linker script that could be made where assigning to symbols in a linker script should affect the convergence of thunks, patches and partitions. Doing the symbol assignment with stable addresses would avoid the clashing convergence limit problem. My initial thought was to see if just a symbol assignment update could be run here, without going through the complexity of assignAddresses() but that would definitely need some thought. peter.smith: An argument could be made that testing for convergence of symbols could/should be done after…
static void finalizeSynthetic(SyntheticSection *sec) {		static void finalizeSynthetic(SyntheticSection *sec) {
if (sec && sec->isNeeded() && sec->getParent())		if (sec && sec->isNeeded() && sec->getParent())
sec->finalizeContents();		sec->finalizeContents();
}		}

// In order to allow users to manipulate linker-synthesized sections,		// In order to allow users to manipulate linker-synthesized sections,
// we had to add synthetic sections to the input section list early,		// we had to add synthetic sections to the input section list early,
// even before we make decisions whether they are needed. This allows		// even before we make decisions whether they are needed. This allows
▲ Show 20 Lines • Show All 1,074 Lines • Show Last 20 Lines

test/ELF/linkerscript/expr-converge.test

This file was added.

				# REQUIRES: x86
				# RUN: llvm-mc -filetype=obj -triple=x86_64 /dev/null -o %t.o
				# RUN: ld.lld %t.o -T %s -o %t
				# RUN: llvm-nm %t \| FileCheck %s

				# CHECK: 000000000000ffff T a
				# CHECK: 000000000000ffff T b

				a = b;
				SECTIONS {
				. = 0xffff + (b >> 8) - 0xff;
				b = .;
				}