This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/trunk/
-
trunk/
-
ELF/
-
Writer.cpp
-
test/ELF/linkerscript/
-
ELF/
-
linkerscript/
-
addr-zero.test

Differential D55550

[LLD][ELF] - Fix the different behavior of the linker script symbols on different platforms.
ClosedPublic

Authored by grimar on Dec 11 2018, 2:16 AM.

Download Raw Diff

Details

Reviewers

ruiu
psmith
• espindola

Commits

rGda49faf15e88: [LLD][ELF] - Fix the different behavior of the linker script symbols on…
rLLD358646: [LLD][ELF] - Fix the different behavior of the linker script symbols on…
rL358646: [LLD][ELF] - Fix the different behavior of the linker script symbols on…

Summary

This should help make D55423 be even more simple and also
fixes the broken behavior shown in one of our test cases
for some targets, like x86-64.

The issue occurs when the forward declarations are used in the script.
One of the samples is:

SECTIONS {
  foo = ADDR(.text) - ABSOLUTE(ADDR(.text));
};

In that case, we have a broken output when output target does
not use thunks. That happens because thunks creating code
calls Script->assignAddresses() at least once one more time,
what fixups the values.

In this patch, I generalize and rename maybeAddThunks to be used
by all targets.

Diff Detail

Repository: rL LLVM

Event Timeline

grimar created this revision.Dec 11 2018, 2:16 AM

Herald added a reviewer: • espindola. · View Herald TranscriptDec 11 2018, 2:16 AM

Herald added subscribers: arichardson, emaste, srhines. · View Herald Transcript

grimar mentioned this in D55423: [LLD][ELF] - A fix for "linker script assignment loses relative nature of section" bug..Dec 11 2018, 2:20 AM

Overall I'm happy with this change as I think it is simpler than adding another call to Writer<ELFT>::run(). The one remaining thought is whether Writer<ELFT>::run() can be moved into finalizeAddressDependentContent() as there shouldn't be anything after that function that changes address. Could we move the remaining Writer<ELFT>::run() to the end of finalizeAddressDependentContent() ?

In D55550#1326818, @peter.smith wrote:

Overall I'm happy with this change as I think it is simpler than adding another call to Writer<ELFT>::run(). The one remaining thought is whether Writer<ELFT>::run() can be moved into finalizeAddressDependentContent() as there shouldn't be anything after that function that changes address. Could we move the remaining Writer<ELFT>::run() to the end of finalizeAddressDependentContent() ?

I am not sure honestly. I like that we do not spread and have all the finalizeSynthetic in one place (in one method) and I
like that finalizeAddressDependentContent is an isolated piece where the iteration magic happens.
I would probably try to avoid adding the additional code into it. This patch only removes the code from there
and I think it is good.

But If we decide to do something like this, I would suggest doing it in a different patch.

nickdesaulniers added a subscriber: nickdesaulniers.Dec 11 2018, 11:15 AM

nathanchance added a subscriber: nathanchance.Dec 11 2018, 11:18 AM

I think this should be landed because at least it simplifies the code. Ping.

E5ten added a subscriber: E5ten.Apr 4 2019, 6:10 PM

Herald added a subscriber: MaskRay. · View Herald TranscriptApr 4 2019, 6:10 PM

Ping

As mentioned above, I'm in favour of making this change.

Rui, I am a bit tired of uncertainty in some LLD patches like this one. Could you please either confirm you're not going to accept this, so I'll abandon it. Or if we can proceed with that somehow, please say your word.

smeenai added a subscriber: smeenai.Apr 16 2019, 9:47 PM

In D55550#1466482, @grimar wrote:

Rui, I am a bit tired of uncertainty in some LLD patches like this one. Could you please either confirm you're not going to accept this, so I'll abandon it. Or if we can proceed with that somehow, please say your word.

Somewhat off-topic; I'm wondering if it is worth revisiting this in a wider context to see if we can find a better way of resolving these compatibility problems. At EuroLLVM there was an interesting discussion in the binutils BOF/Round-table on how important compatibility with GNU tools was. The general consensus was that it was particularly important in projects that had to support both sets of tools and that rewriting inputs to work around limitations in one or other of the tools was a significant headache. The overall philosophy to be taken forward was for GNU compatible option names and output wherever possible, i.e. unless the GNU behaviour was impossible to implement or was crazy anyway. I think that it would be a reasonable to come up with a similar position for LLD, perhaps an RFC to LLVM-DEV might be a way forward.

With the linux kernel attempting to use LLD, first with AArch64 and Arm as part of Android, but there are people trying with x86_64 as well, simultaneously there is an increased interest from embedded systems as LLD is naturally part of the Clang/LLVM binutils. I think we are so close to be able to support these projects, but we'll have a long tail of small inconsistencies, like this one that we should fix. It would be good to come up with a common process/design to fix these that didn't get stuck on reviews.

LGTM

Changes to the linker script processor oftentimes has subtle implications and in order to review such change I have to stop and think. But for this patch I took too much time. Apologies for the delay.

Somewhat off-topic; I'm wondering if it is worth revisiting this in a wider context to see if we can find a better way of resolving these compatibility problems. At EuroLLVM there was an interesting discussion in the binutils BOF/Round-table on how important compatibility with GNU tools was. The general consensus was that it was particularly important in projects that had to support both sets of tools and that rewriting inputs to work around limitations in one or other of the tools was a significant headache. The overall philosophy to be taken forward was for GNU compatible option names and output wherever possible, i.e. unless the GNU behaviour was impossible to implement or was crazy anyway. I think that it would be a reasonable to come up with a similar position for LLD, perhaps an RFC to LLVM-DEV might be a way forward.

I think that's a fair argument and we should take that stance; as long as it's not impossible to implement nor too crazy, we should make lld compatible with GNU bfd linker whenever possible at least in terms of linker script compatibility. I don't personally like the linker script and perhaps the ELF file format itself (I dislike the fact that segments and sections are orthogonal in the ELF format), but in many cases a linker script just does its job, and without that some kind of task is not doable. In practice, GNU compatibility is very important.

One of the reasons why it is not easy to review linker script code change is because it's complicated. I don't feel I fully understand the existing code of the linker script processor. It grows organically, and in many cases a small issue can be resolved by adding another call of assignOffsets(). It would for sure fixes a problem, but it is sometimes hard to say whether that's correct or not. Each stage doesn't seem clearly separated, and it is sometimes not clear whether doing something in some stage is the right thing or not. Is this due to the nature of the complexity of the linker script? I don't think so -- we didn't understand the linker script well when we started implementing it a few years ago, and the way how we organized code in the few years was still there. But now we understand it well including corner cases. I want to sit down and think hard about how to (re)organize the code. However, that idea seems the cause of slow review of linker script patches (I always started thinking how this should really be organized). I'll defer the idea of refactoring and let me review linker script changes more casually. I'm sorry about the delay, again.

ELF/Writer.cpp
1503 ↗	(On Diff #177672)	Can you mention that for x86 this loop iterates only once?

This revision is now accepted and ready to land.Apr 17 2019, 11:50 PM

Thanks!

Closed by commit rL358646: [LLD][ELF] - Fix the different behavior of the linker script symbols on… (authored by grimar). · Explain WhyApr 18 2019, 1:18 AM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. · View Herald TranscriptApr 18 2019, 1:18 AM

MaskRay mentioned this in D66279: [ELF] Make LinkerScript::assignAddresses iterative.Aug 15 2019, 3:27 AM

Revision Contents

Path

Size

lld/

trunk/

ELF/

Writer.cpp

51 lines

test/

ELF/

linkerscript/

addr-zero.test

14 lines

Diff 195684

lld/trunk/ELF/Writer.cpp

Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	public:
void run();		void run();

private:		private:
void copyLocalSymbols();		void copyLocalSymbols();
void addSectionSymbols();		void addSectionSymbols();
void forEachRelSec(llvm::function_ref<void(InputSectionBase &)> Fn);		void forEachRelSec(llvm::function_ref<void(InputSectionBase &)> Fn);
void sortSections();		void sortSections();
void resolveShfLinkOrder();		void resolveShfLinkOrder();
void maybeAddThunks();		void finalizeAddressDependentContent();
void sortInputSections();		void sortInputSections();
void finalizeSections();		void finalizeSections();
void checkExecuteOnly();		void checkExecuteOnly();
void setReservedSymbolSections();		void setReservedSymbolSections();

std::vector<PhdrEntry *> createPhdrs();		std::vector<PhdrEntry *> createPhdrs();
void removeEmptyPTLoad();		void removeEmptyPTLoad();
void addPhdrForSection(std::vector<PhdrEntry *> &Phdrs, unsigned ShType,		void addPhdrForSection(std::vector<PhdrEntry *> &Phdrs, unsigned ShType,
▲ Show 20 Lines • Show All 1,388 Lines • ▼ Show 20 Lines	for (OutputSection *Sec : OutputSections) {

std::stable_sort(Sections.begin(), Sections.end(), compareByFilePosition);		std::stable_sort(Sections.begin(), Sections.end(), compareByFilePosition);

for (int I = 0, N = Sections.size(); I < N; ++I)		for (int I = 0, N = Sections.size(); I < N; ++I)
*ScriptSections[I] = Sections[I];		*ScriptSections[I] = Sections[I];
}		}
}		}

// For most RISC ISAs, we need to generate content that depends on the address		// We need to generate and finalize the content that depends on the address of
// of InputSections. For example some architectures such as AArch64 use small		// InputSections. As the generation of the content may also alter InputSection
// displacements for jump instructions that is the linker's responsibility for		// addresses we must converge to a fixed point. We do that here. See the comment
// creating range extension thunks for. As the generation of the content may		// in Writer<ELFT>::finalizeSections().
// also alter InputSection addresses we must converge to a fixed point.		template <class ELFT> void Writer<ELFT>::finalizeAddressDependentContent() {
template <class ELFT> void Writer<ELFT>::maybeAddThunks() {
if (!Target->NeedsThunks && !Config->AndroidPackDynRelocs &&
!Config->RelrPackDynRelocs)
return;

ThunkCreator TC;		ThunkCreator TC;
AArch64Err843419Patcher A64P;		AArch64Err843419Patcher A64P;

		// For some targets, like x86, this loop iterates only once.
for (;;) {		for (;;) {
bool Changed = false;		bool Changed = false;

Script->assignAddresses();		Script->assignAddresses();

if (Target->NeedsThunks)		if (Target->NeedsThunks)
Changed \|= TC.createThunks(OutputSections);		Changed \|= TC.createThunks(OutputSections);

▲ Show 20 Lines • Show All 267 Lines • ▼ Show 20 Lines	template <class ELFT> void Writer<ELFT>::finalizeSections() {

if (!Script->HasSectionsCommand && !Config->Relocatable)		if (!Script->HasSectionsCommand && !Config->Relocatable)
fixSectionAlignments();		fixSectionAlignments();

// SHFLinkOrder processing must be processed after relative section placements are		// SHFLinkOrder processing must be processed after relative section placements are
// known but before addresses are allocated.		// known but before addresses are allocated.
resolveShfLinkOrder();		resolveShfLinkOrder();

// Jump instructions in many ISAs have small displacements, and therefore they		// This is used to:
// cannot jump to arbitrary addresses in memory. For example, RISC-V JAL		// 1) Create "thunks":
// instruction can target only +-1 MiB from PC. It is a linker's		// Jump instructions in many ISAs have small displacements, and therefore
// responsibility to create and insert small pieces of code between sections		// they cannot jump to arbitrary addresses in memory. For example, RISC-V
// to extend the ranges if jump targets are out of range. Such code pieces are		// JAL instruction can target only +-1 MiB from PC. It is a linker's
// called "thunks".		// responsibility to create and insert small pieces of code between
//		// sections to extend the ranges if jump targets are out of range. Such
// We add thunks at this stage. We couldn't do this before this point because		// code pieces are called "thunks".
// this is the earliest point where we know sizes of sections and their		//
// layouts (that are needed to determine if jump targets are in range).		// We add thunks at this stage. We couldn't do this before this point
maybeAddThunks();		// because this is the earliest point where we know sizes of sections and
		// their layouts (that are needed to determine if jump targets are in
		// range).
		//
		// 2) Update the sections. We need to generate content that depends on the
		// address of InputSections. For example, MIPS GOT section content or
		// android packed relocations sections content.
		//
		// 3) Assign the final values for the linker script symbols. Linker scripts
		// sometimes using forward symbol declarations. We want to set the correct
		// values. They also might change after adding the thunks.
		finalizeAddressDependentContent();

// maybeAddThunks may have added local symbols to the static symbol table.		// finalizeAddressDependentContent may have added local symbols to the static symbol table.
finalizeSynthetic(In.SymTab);		finalizeSynthetic(In.SymTab);
finalizeSynthetic(In.PPC64LongBranchTarget);		finalizeSynthetic(In.PPC64LongBranchTarget);

// Fill other section headers. The dynamic table is finalized		// Fill other section headers. The dynamic table is finalized
// at the end because some tags like RELSZ depend on result		// at the end because some tags like RELSZ depend on result
// of finalizing other sections.		// of finalizing other sections.
for (OutputSection *Sec : OutputSections)		for (OutputSection *Sec : OutputSections)
Sec->finalize();		Sec->finalize();
▲ Show 20 Lines • Show All 803 Lines • Show Last 20 Lines

lld/trunk/test/ELF/linkerscript/addr-zero.test

	# REQUIRES: x86			# REQUIRES: x86, aarch64
	# RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux /dev/null -o %t.o			# RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux /dev/null -o %t.o
	# RUN: ld.lld -o %t.so --script %s %t.o -shared			# RUN: ld.lld -o %t.so --script %s %t.o -shared
	# RUN: llvm-readobj --symbols %t.so \| FileCheck %s			# RUN: llvm-readobj --symbols %t.so \| FileCheck %s

	# Test that the script creates a non absolute symbol with value			## Test that the script creates a non absolute symbol with value
	# 0 I.E., a symbol that refers to the load address.			## 0 I.E., a symbol that refers to the load address.

	# CHECK: Symbol {			# CHECK: Symbol {
	# CHECK: Name: foo			# CHECK: Name: foo
	# CHECK-NEXT: Value: 0x70			# CHECK-NEXT: Value: 0x0
	# CHECK-NEXT: Size: 0			# CHECK-NEXT: Size: 0
	# CHECK-NEXT: Binding: Global			# CHECK-NEXT: Binding: Global
	# CHECK-NEXT: Type: None			# CHECK-NEXT: Type: None
	# CHECK-NEXT: Other: 0			# CHECK-NEXT: Other: 0
	# CHECK-NEXT: Section: .text			# CHECK-NEXT: Section: .text
	# CHECK-NEXT: }			# CHECK-NEXT: }

				## Because of a bug we had a different behavior (different symbol 'foo' value)
				## on a platforms that might use thunks, like AArch64. Check that issue is fixed.
				# RUN: llvm-mc -filetype=obj -triple=aarch64-linux-gnux /dev/null -o %t.o
				# RUN: ld.lld -o %t.so --script %s %t.o -shared
				# RUN: llvm-readobj --symbols %t.so \| FileCheck %s

	SECTIONS {			SECTIONS {
	foo = ADDR(.text) - ABSOLUTE(ADDR(.text));			foo = ADDR(.text) - ABSOLUTE(ADDR(.text));
	};			};