This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
ELF/
7/9
LinkerScript.cpp
-
Writer.cpp
-
test/ELF/
-
ELF/
-
basic-aarch64.s
-
basic-i386.s
-
basic-ppc.s
-
basic-sparcv9.s
-
ttext-tdata-tbss.s

Differential D67325

[ELF] Map the ELF header at imageBase
ClosedPublic

Authored by MaskRay on Sep 7 2019, 10:50 PM.

Download Raw Diff

Details

Reviewers

grimar
peter.smith
ruiu
• espindola

Commits

rG06bb7dfbd445: [ELF] Map the ELF header at imageBase
rLLD371957: [ELF] Map the ELF header at imageBase
rL371957: [ELF] Map the ELF header at imageBase

Summary

If there is no readonly section, we map:

The ELF header at imageBase+maxPageSize
Program headers at imageBase+maxPageSize+sizeof(Ehdr)
The first section .text at imageBase+maxPageSize+sizeof(Ehdr)+sizeof(program headers)

Due to the interaction between Writer<ELFT>::fixSectionAlignments and
LinkerScript::allocateHeaders,
alignDown(p_vaddr(R PT_LOAD)) = alignDown(p_vaddr(RX PT_LOAD)).
The RX PT_LOAD will override the R PT_LOAD at runtime, which is not ideal:

// PHDR at 0x401034, should be 0x400034
  PHDR           0x000034 0x00401034 0x00401034 0x000a0 0x000a0 R   0x4
// R PT_LOAD contains just Ehdr and program headers.
// At 0x401000, should be 0x400000
  LOAD           0x000000 0x00401000 0x00401000 0x000d4 0x000d4 R   0x1000
  LOAD           0x0000d4 0x004010d4 0x004010d4 0x00001 0x00001 R E 0x1000

createPhdrs allocates the headers to the R PT_LOAD.
fixSectionAlignments assigns imageBase+maxPageSize+sizeof(Ehdr)+sizeof(program headers) (formula: alignTo(dot, maxPageSize) + dot % config->maxPageSize) to addrExpr of .text
allocateHeaders computes the minimum address among SHF_ALLOC sections, i.e. addr(.text)
allocateHeaders sets address of ELF header to addr(.text)-sizeof(Ehdr)-sizeof(program headers) = imageBase+maxPageSize

The main observation is that when the SECTIONS command is not used, we
don't have to call allocateHeaders. This requires an assumption that
the presence of PT_PHDR and addresses of headers can be decided
regardless of address information.

This may seem natural because dot is not manipulated by a linker script.
The other thing is that we have to drop the special rule for -T<section>
in getInitialDot. If -Ttext is smaller than the image base, the headers
will not be allocated with the old behavior (allocateHeaders is called)
but always allocated with the new behavior.

The behavior change is not a problem. Whether and where headers are
allocated can vary among linkers, or ld.bfd across different versions
(--enable-separate-code or not). It is thus advised to use a linker
script with the PHDRS command to have a consistent behavior across
linkers. If PT_PHDR is needed, an explicit --image-base can be a simpler
alternative.

Diff Detail

Repository

rLLD LLVM Linker

Build Status

Buildable 38038
Build 38037: arc lint + arc unit

Event Timeline

MaskRay created this revision.Sep 7 2019, 10:50 PM

Herald added a reviewer: • espindola. · View Herald TranscriptSep 7 2019, 10:50 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, jrtc27, fedor.sergeev and 5 others. · View Herald Transcript

Harbormaster completed remote builds in B37896: Diff 219256.Sep 7 2019, 10:53 PM

Herald added a subscriber: • wuzish. · View Herald TranscriptSep 7 2019, 10:53 PM

Place -> Map (I'm not sure the right term to use, but "map" may be a better term to describe that this applies to p_vaddr, not p_offset)

Add some comments.

Harbormaster completed remote builds in B37897: Diff 219264.Sep 8 2019, 5:59 AM

MaskRay added a parent revision: D67324: [ELF] nmagic or omagic: don't allocate PT_PHDR or PF_R PT_LOAD for the !hasPhdrsCommands case.Sep 8 2019, 11:53 PM

I'm struggling a bit with this one at the moment. I've put some comments in but I still need to work this one through. As I understand it we have today:

allocateHeaders() that decides whether the program headers are allocated or not and assigns the headers addresses. This is the same for both linkerscripts and non-linkerscripts.
fixSectionAlignments is performed only for the non-linkerscript case just prior to the final address allocation. It adds expressions to the OutputSections so that it will be aligned properly.

There is a clash between allocateHeaders and fixSectionAlignments and your change resolves this by only running allocateHeaders in the linkerscript. I can see that this would work, but I think that this could be at the expense of missing some of the logic in allocateHeaders, and leaving a point to diverge in the future. As this is making me a bit nervous at the moment I'd like to try and understand the conflict a bit better and whether there is a better way of doing this (there may not be).

ELF/LinkerScript.cpp
1013–1014	I think this comment would be stale with your proposed change.
1018	Given that allocateHeaders is only called once, it may be worth moving to the callsite in Writer.cpp if (script->hasSectionsCommand) script->allocateHeaders(mainPart->phdrs)
1069	Delay may not be the right word as assignAddresses() is called multiple times and I think allocateHeaders() is called early, most likely before assignAddresses(). Perhaps: // With a linker script assignment of addresses to headers is covered by allocateHeaders().
1072	I think that this would get nmagic and omagic wrong. In this case the ELF Headers and Program Headers are not allocated, even with the default linker script. I'd expect a linker script to be used with nmagic and omagic, but this would be a change in behaviour from ld.bfd and older versions of LLD.

Address review comments

Harbormaster completed remote builds in B37963: Diff 219550.Sep 10 2019, 8:23 AM

There is a clash between allocateHeaders and fixSectionAlignments and your change resolves this by only running allocateHeaders in the linkerscript. I can see that this would work, but I think that this could be at the expense of missing some of the logic in allocateHeaders, and leaving a point to diverge in the future. As this is making me a bit nervous at the moment I'd like to try and understand the conflict a bit better and whether there is a better way of doing this (there may not be).

The main observation of the change is that if we can drop the -T<section> special rule (as we can see, ld.bfd's rule is quite involved; our current behavior already diverges; lld users can get similar behaviors with explicit --image-base), we can skip the whole allocateHeaders logic for the !hasSectionsCommand case.

For the default !hasSectionsCommand case, assignAddresses takes care of assigning addresses to headers, and there should be no case that the ELF header or program headers can be discarded. ld.bfd can discard them for the cases like -Ttext=0 but we can argue they are involved cases, and users should use a linker script to get more predictable behavior. (I can hardly find -Ttext=0 cases in the wild. I think they are either 1) --oformat binary (unaffected by this change) 2) used with SECTIONS command (unaffected by this change) 3) we fail to meet their requirement because our different layout. So the behavior for 3) may not matter much.

ELF/LinkerScript.cpp
1013–1014	When using a linker script, we also check if the headers are covered by the output section. This allows omitting the headers by not leaving enough space for them in the linker script; this pattern is common in embedded systems. I think the two sentences were stale before this change. I'll delete them.
1072	In the nmagic and omagic cases, PT_PHDR and PT_INTERP are not allocated, but other program headers can still exist. Several tests check their behaviors, e.g. nmagic.s, segments.s, magic-page-combo-warn.s. This change does not need an update of them.

I'll take another look tomorrow. Thanks for the clarifications.

ELF/LinkerScript.cpp
1011	From a double check, it seems like unconditional isn't true any more. In createPhdrs() nmagic and omagic don't add the ELF Headers into the loadable segment, looks like I missed updating the comment when adding omagic and nmagic. Perhaps update the comment to: // When the SECTIONS command is used, try to find an address for the file and // program headers output sections, which can be added to the first PT_LOAD segment // when program headers are created.
1072	What I was worrying about was whether the headers and program headers were mapped into the first PT_LOAD as this was something that allocateHeaders() took account of. However it looks like this isn't a problem as createPhdrs() doesn't "unconditionally" add them when either omagic or nmagic is added.

Update a comment

ELF/LinkerScript.cpp
1011	t seems like unconditional isn't true any more. In createPhdrs() nmagic and omagic don't add the ELF Headers into the loadable segment nmagic and omagic were changed in D67324. Applied the suggested comments.

Harbormaster completed remote builds in B37993: Diff 219656.Sep 10 2019, 9:26 PM

Apologies for the delay in responding. I've gone through in a bit more detail. Just to make sure I understand and hopefully give an alternative explanation to anyone else following:

Create an example with no .rodata, like basic-aarch64 for example:

.text
        .globl _start
        .type _start, %function
_start: 
        ret

createPhdrs() will create and allocate the ELF to the RO program header as it is the first PT_LOAD.
InputSections are assigned to OutputSections, there are no InputSections assigned to the RO program segment so it contains no SHF_ALLOC sections.
fixSectionAlignments assigns alignTo(script->getDot(), config->maxPageSize) + (script->getDot() % config->maxPageSize) to addrExpr for both the ELF Header SyntheticSection (RO program segment) and .text (Executable program segment)
AssignAddresses sets initial dot to 0x200000 (AArch64) + sizeof(headers) = 0x200120
AssignAddresses sets base address of .text to 0x210120 (as .text is SHF_ALLOC), the addrExpr for the ELF Header SyntheticSection (RO program header) is never used.
allocateHeaders searches for the first SHF_ALLOC section and finds .text as there are no SHF_ALLOC sections in the RO program segment.
allocateHeaders sets the ELF header to the address of the .text section
The RO program segment now overlaps the Executable program segment as the ELF Header is in the Executable program segment.

My understanding of the proposed solution is to not call allocateHeaders() and instead, effectively, force them into the RO program segment as we always know it will be first. This permits some simplification of the code. I think your comments on -Ttext aren't directly related to the code change, more of an observation. Let me know if I have that right?

One downside of this approach, albeit not a regression from the old behaviour, is that there is always an RO program segment. An alternative fix would be to detect that a program segment only contained the ELF Header and Program Header and move them to the first program segment that contained a SHF_ALLOC section. This would simplify the output when there is no .rodata to something like (AArch64 values simulated by producing a file with just rodata):

PHDR           0x000040 0x0000000000200040 0x0000000000200040 0x0000e0 0x0000e0 R   0x8
LOAD           0x000000 0x0000000000200000 0x0000000000200000 0x000124 0x000124 RE 0x10000

Any thoughts?

For the bit about whether PT_PHDR should be generated. The ELF spec says in the PT_PHDR description:

Moreover, it may occur only if the program header table is part of the memory image of the program. If it is present, it must precede any loadable segment entry. See ``Program Interpreter'' below for more information.

ld.bfd seems to generate PT_PHDR if a PT_INTERP is needed. When I made a simple a.o from int main(void) { return 0; } I got a PT_PHDR from ld.bfd It is possible that in your example:

ld.bfd -o a -Ttext=0x3000 a.o => no PT_PHDR
ld.bfd -o a -Ttext=0x3000 a.o dummy.so (--enable-separate-code [1]) => place .note.gnu.property at 0x4000e8.

your a.o might not have enough in it to generate a PT_INTERP so no PT_PHDR.

I think that as long as our default linker script without -nmagic or -omagic can allocate headers which I think your change ensures I think it is fine for LLD to generate PT_PHDR. I consider not producing it due to no PT_INTERP an optimization.

In D67325#1666249, @peter.smith wrote:

Apologies for the delay in responding. I've gone through in a bit more detail. Just to make sure I understand and hopefully give an alternative explanation to anyone else following:

No worry. Thanks for the detailed explanation!

createPhdrs() will create and allocate the ELF to the RO program header as it is the first PT_LOAD.

InputSections are assigned to OutputSections, there are no InputSections assigned to the RO program segment so it contains no SHF_ALLOC sections.

fixSectionAlignments assigns alignTo(script->getDot(), config->maxPageSize) + (script->getDot() % config->maxPageSize) to addrExpr for both the ELF Header SyntheticSection (RO program segment) and .text (Executable program segment)

AssignAddresses sets initial dot to 0x200000 (AArch64) + sizeof(headers) = 0x200120

AssignAddresses sets base address of .text to 0x210120 (as .text is SHF_ALLOC), the addrExpr for the ELF Header SyntheticSection (RO program header) is never used.

allocateHeaders searches for the first SHF_ALLOC section and finds .text as there are no SHF_ALLOC sections in the RO program segment.

allocateHeaders sets the ELF header to the address of the .text section

The RO program segment now overlaps the Executable program segment as the ELF Header is in the Executable program segment.

I'll update the description with some information here.

My understanding of the proposed solution is to not call allocateHeaders() and instead, effectively, force them into the RO program segment as we always know it will be first. This permits some simplification of the code. I think your comments on -Ttext aren't directly related to the code change, more of an observation. Let me know if I have that right?

Yes. The main observation is that when the SECTIONS command is not used, we
don't have to call allocateHeaders. This requires an assumption that the
presence of PT_PHDR and addresses of headers can be decided regardless of
address information. You are right. -T<section> is a consequence of the assumption.

One downside of this approach, albeit not a regression from the old behaviour, is that there is always an RO program segment. An alternative fix would be to detect that a program segment only contained the ELF Header and Program Header and move them to the first program segment that contained a SHF_ALLOC section. This would simplify the output when there is no .rodata to something like (AArch64 values simulated by producing a file with just rodata):
PHDR           0x000040 0x0000000000200040 0x0000000000200040 0x0000e0 0x0000e0 R   0x8
LOAD           0x000000 0x0000000000200000 0x0000000000200000 0x000124 0x000124 RE 0x10000
Any thoughts?

The RO PT_LOAD is allocated w/o or w/ this change. We can apply the optimization, with the change to createPhdrs:

--- a/ELF/Writer.cpp
+++ b/ELF/Writer.cpp
@@ -2116,3 +2116,6 @@ std::vector<PhdrEntry *> Writer<ELFT>::createPhdrs(Partition &part) {
         sec == relroEnd) {
-      load = addHdr(PT_LOAD, newFlags);
+      if (load && load->lastSec == Out::programHeaders && (newFlags & PF_R))
+        load->p_flags = newFlags;
+      else
+        load = addHdr(PT_LOAD, newFlags);
       flags = newFlags;

It will affect 125 tests, though, so I'm not sure whether we want to apply this optimization.
A user can use --no-rosegment to ensure the RO PT_LOAD does not exist.

For the bit about whether PT_PHDR should be generated. The ELF spec says in the PT_PHDR description:
Moreover, it may occur only if the program header table is part of the memory image of the program. If it is present, it must precede any loadable segment entry. See ``Program Interpreter'' below for more information.
ld.bfd seems to generate PT_PHDR if a PT_INTERP is needed. When I made a simple a.o from int main(void) { return 0; } I got a PT_PHDR from ld.bfd It is possible that in your example:
ld.bfd -o a -Ttext=0x3000 a.o => no PT_PHDR
ld.bfd -o a -Ttext=0x3000 a.o dummy.so (--enable-separate-code [1]) => place .note.gnu.property at 0x4000e8.
your a.o might not have enough in it to generate a PT_INTERP so no PT_PHDR.

I think that as long as our default linker script without -nmagic or -omagic can allocate headers which I think your change ensures I think it is fine for LLD to generate PT_PHDR. I consider not producing it due to no PT_INTERP an optimization.

Confirmed that PT_PHDR will be created if .interp (PT_INTERP) exists. The ld.bfd examples are not directly related to the code change. I'll replace them with a brief mention of the problem.

Update description

Harbormaster completed remote builds in B38038: Diff 219849.Sep 11 2019, 11:16 PM

One downside of this approach, albeit not a regression from the old behaviour, is that there is always an RO program segment. An alternative fix would be to detect that a program segment only contained the ELF Header and Program Header and move them to the first program segment that contained a SHF_ALLOC section. This would simplify the output when there is no .rodata to something like (AArch64 values simulated by producing a file with just rodata):
PHDR           0x000040 0x0000000000200040 0x0000000000200040 0x0000e0 0x0000e0 R   0x8
LOAD           0x000000 0x0000000000200000 0x0000000000200000 0x000124 0x000124 RE 0x10000
Any thoughts?
The RO PT_LOAD is allocated w/o or w/ this change. We can apply the optimization, with the change to createPhdrs:
--- a/ELF/Writer.cpp
+++ b/ELF/Writer.cpp
@@ -2116,3 +2116,6 @@ std::vector<PhdrEntry *> Writer<ELFT>::createPhdrs(Partition &part) {
         sec == relroEnd) {
-      load = addHdr(PT_LOAD, newFlags);
+      if (load && load->lastSec == Out::programHeaders && (newFlags & PF_R))
+        load->p_flags = newFlags;
+      else
+        load = addHdr(PT_LOAD, newFlags);
       flags = newFlags;
It will affect 125 tests, though, so I'm not sure whether we want to apply this optimization.
A user can use --no-rosegment to ensure the RO PT_LOAD does not exist.

I think the number of people that have no rodata at all in a executable or shared object with allocated headers will be sufficiently small that it isn't worth updating 125 tests for. I'm happy enough with the change now that I understand it better. I think it will be worth waiting a few days to see if there are any more opinions, and if there aren't any go ahead with this next week.

MaskRay mentioned this in D67482: [ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_X86_64.Sep 12 2019, 1:41 AM

This revision was not accepted when it landed; it landed in state Needs Review.Sep 16 2019, 12:05 AM

Closed by commit rL371957: [ELF] Map the ELF header at imageBase (authored by MaskRay). · Explain Why

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

ELF/

LinkerScript.cpp

43 lines

Writer.cpp

3 lines

test/

ELF/

8 lines

8 lines

8 lines

8 lines

16 lines

Diff 219849

ELF/LinkerScript.cpp

Show First 20 Lines • Show All 1,001 Lines • ▼ Show 20 Lines	static uint64_t computeBase(uint64_t min, bool allocateHeaders) {
// If there is no SECTIONS or if the linkerscript is explicit about program		// If there is no SECTIONS or if the linkerscript is explicit about program
// headers, do our best to allocate them.		// headers, do our best to allocate them.
if (!script->hasSectionsCommand \|\| allocateHeaders)		if (!script->hasSectionsCommand \|\| allocateHeaders)
return 0;		return 0;
// Otherwise only allocate program headers if that would not add a page.		// Otherwise only allocate program headers if that would not add a page.
return alignDown(min, config->maxPageSize);		return alignDown(min, config->maxPageSize);
}		}

// Try to find an address for the file and program headers output sections,		// When the SECTIONS command is used, try to find an address for the file and
// which were unconditionally added to the first PT_LOAD segment earlier.		// program headers output sections, which can be added to the first PT_LOAD
		peter.smithUnsubmitted Not Done Reply Inline Actions From a double check, it seems like unconditional isn't true any more. In createPhdrs() nmagic and omagic don't add the ELF Headers into the loadable segment, looks like I missed updating the comment when adding omagic and nmagic. Perhaps update the comment to: // When the SECTIONS command is used, try to find an address for the file and // program headers output sections, which can be added to the first PT_LOAD segment // when program headers are created. peter.smith: From a double check, it seems like unconditional isn't true any more. In createPhdrs() nmagic…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions t seems like unconditional isn't true any more. In createPhdrs() nmagic and omagic don't add the ELF Headers into the loadable segment nmagic and omagic were changed in D67324. Applied the suggested comments. MaskRay: > t seems like unconditional isn't true any more. In createPhdrs() nmagic and omagic don't add…
//		// segment when program headers are created.
// When using the default layout, we check if the headers fit below the first		//
// allocated section. When using a linker script, we also check if the headers		// We check if the headers fit below the first allocated section. If there isn't
		peter.smithUnsubmitted Done Reply Inline Actions I think this comment would be stale with your proposed change. peter.smith: I think this comment would be stale with your proposed change.
		MaskRayAuthorUnsubmitted Done Reply Inline Actions When using a linker script, we also check if the headers are covered by the output section. This allows omitting the headers by not leaving enough space for them in the linker script; this pattern is common in embedded systems. I think the two sentences were stale before this change. I'll delete them. MaskRay: > When using a linker script, we also check if the headers are covered by the output section.
// are covered by the output section. This allows omitting the headers by not		// enough space for these sections, we'll remove them from the PT_LOAD segment,
// leaving enough space for them in the linker script; this pattern is common		// and we'll also remove the PT_PHDR segment.
// in embedded systems.
//
// If there isn't enough space for these sections, we'll remove them from the
// PT_LOAD segment, and we'll also remove the PT_PHDR segment.
void LinkerScript::allocateHeaders(std::vector<PhdrEntry *> &phdrs) {		void LinkerScript::allocateHeaders(std::vector<PhdrEntry *> &phdrs) {
uint64_t min = std::numeric_limits<uint64_t>::max();		uint64_t min = std::numeric_limits<uint64_t>::max();
		peter.smithUnsubmitted Done Reply Inline Actions Given that allocateHeaders is only called once, it may be worth moving to the callsite in Writer.cpp if (script->hasSectionsCommand) script->allocateHeaders(mainPart->phdrs) peter.smith: Given that allocateHeaders is only called once, it may be worth moving to the callsite in…
for (OutputSection *sec : outputSections)		for (OutputSection *sec : outputSections)
if (sec->flags & SHF_ALLOC)		if (sec->flags & SHF_ALLOC)
min = std::min<uint64_t>(min, sec->addr);		min = std::min<uint64_t>(min, sec->addr);

auto it = llvm::find_if(		auto it = llvm::find_if(
phdrs, [](const PhdrEntry *e) { return e->p_type == PT_LOAD; });		phdrs, [](const PhdrEntry *e) { return e->p_type == PT_LOAD; });
if (it == phdrs.end())		if (it == phdrs.end())
return;		return;
Show All 27 Lines

LinkerScript::AddressState::AddressState() {		LinkerScript::AddressState::AddressState() {
for (auto &mri : script->memoryRegions) {		for (auto &mri : script->memoryRegions) {
MemoryRegion *mr = mri.second;		MemoryRegion *mr = mri.second;
mr->curPos = mr->origin;		mr->curPos = mr->origin;
}		}
}		}

static uint64_t getInitialDot() {
// By default linker scripts use an initial value of 0 for '.',
// but prefer -image-base if set.
if (script->hasSectionsCommand)
return config->imageBase ? *config->imageBase : 0;

uint64_t startAddr = UINT64_MAX;
// The sections with -T<section> have been sorted in order of ascending
// address. We must lower startAddr if the lowest -T<section address> as
// calls to setDot() must be monotonically increasing.
for (auto &kv : config->sectionStartMap)
startAddr = std::min(startAddr, kv.second);
return std::min(startAddr, target->getImageBase() + elf::getHeaderSize());
}

// Here we assign addresses as instructed by linker script SECTIONS		// Here we assign addresses as instructed by linker script SECTIONS
// sub-commands. Doing that allows us to use final VA values, so here		// sub-commands. Doing that allows us to use final VA values, so here
// we also handle rest commands like symbol assignments and ASSERTs.		// we also handle rest commands like symbol assignments and ASSERTs.
// Returns a symbol that has changed its section or value, or nullptr if no		// Returns a symbol that has changed its section or value, or nullptr if no
// symbol has changed.		// symbol has changed.
const Defined *LinkerScript::assignAddresses() {		const Defined *LinkerScript::assignAddresses() {
dot = getInitialDot();		if (script->hasSectionsCommand) {
		// With a linker script, assignment of addresses to headers is covered by
		peter.smithUnsubmitted Done Reply Inline Actions Delay may not be the right word as assignAddresses() is called multiple times and I think allocateHeaders() is called early, most likely before assignAddresses(). Perhaps: // With a linker script assignment of addresses to headers is covered by allocateHeaders(). peter.smith: Delay may not be the right word as assignAddresses() is called multiple times and I think…
		// allocateHeaders().
		dot = config->imageBase.getValueOr(0);
		} else {
		peter.smithUnsubmitted Not Done Reply Inline Actions I think that this would get nmagic and omagic wrong. In this case the ELF Headers and Program Headers are not allocated, even with the default linker script. I'd expect a linker script to be used with nmagic and omagic, but this would be a change in behaviour from ld.bfd and older versions of LLD. peter.smith: I think that this would get nmagic and omagic wrong. In this case the ELF Headers and Program…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions In the nmagic and omagic cases, PT_PHDR and PT_INTERP are not allocated, but other program headers can still exist. Several tests check their behaviors, e.g. nmagic.s, segments.s, magic-page-combo-warn.s. This change does not need an update of them. MaskRay: In the nmagic and omagic cases, PT_PHDR and PT_INTERP are not allocated, but other program…
		peter.smithUnsubmitted Done Reply Inline Actions What I was worrying about was whether the headers and program headers were mapped into the first PT_LOAD as this was something that allocateHeaders() took account of. However it looks like this isn't a problem as createPhdrs() doesn't "unconditionally" add them when either omagic or nmagic is added. peter.smith: What I was worrying about was whether the headers and program headers were mapped into the…
		// Assign addresses to headers right now.
		dot = target->getImageBase();
		Out::elfHeader->addr = dot;
		Out::programHeaders->addr = dot + Out::elfHeader->size;
		dot += getHeaderSize();
		}

auto deleter = std::make_unique<AddressState>();		auto deleter = std::make_unique<AddressState>();
ctx = deleter.get();		ctx = deleter.get();
errorOnMissingSection = true;		errorOnMissingSection = true;
switchTo(aether);		switchTo(aether);

SymbolAssignmentMap oldValues = getSymbolAssignmentValues(sectionCommands);		SymbolAssignmentMap oldValues = getSymbolAssignmentValues(sectionCommands);
for (BaseCommand *base : sectionCommands) {		for (BaseCommand *base : sectionCommands) {
▲ Show 20 Lines • Show All 103 Lines • Show Last 20 Lines

ELF/Writer.cpp

Show First 20 Lines • Show All 549 Lines • ▼ Show 20 Lines	if (errorCount())
return;		return;

// If -compressed-debug-sections is specified, we need to compress		// If -compressed-debug-sections is specified, we need to compress
// .debug_* sections. Do it right now because it changes the size of		// .debug_* sections. Do it right now because it changes the size of
// output sections.		// output sections.
for (OutputSection *sec : outputSections)		for (OutputSection *sec : outputSections)
sec->maybeCompress<ELFT>();		sec->maybeCompress<ELFT>();

		if (script->hasSectionsCommand)
script->allocateHeaders(mainPart->phdrs);		script->allocateHeaders(mainPart->phdrs);

// Remove empty PT_LOAD to avoid causing the dynamic linker to try to mmap a		// Remove empty PT_LOAD to avoid causing the dynamic linker to try to mmap a
// 0 sized region. This has to be done late since only after assignAddresses		// 0 sized region. This has to be done late since only after assignAddresses
// we know the size of the sections.		// we know the size of the sections.
for (Partition &part : partitions)		for (Partition &part : partitions)
removeEmptyPTLoad(part.phdrs);		removeEmptyPTLoad(part.phdrs);

if (!config->oFormatBinary)		if (!config->oFormatBinary)
▲ Show 20 Lines • Show All 2,162 Lines • Show Last 20 Lines

test/ELF/basic-aarch64.s

	Show First 20 Lines • Show All 153 Lines • ▼ Show 20 Lines
	# CHECK-NEXT: Other: 0			# CHECK-NEXT: Other: 0
	# CHECK-NEXT: Section: .text			# CHECK-NEXT: Section: .text
	# CHECK-NEXT: }			# CHECK-NEXT: }
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: ProgramHeaders [			# CHECK-NEXT: ProgramHeaders [
	# CHECK-NEXT: ProgramHeader {			# CHECK-NEXT: ProgramHeader {
	# CHECK-NEXT: Type: PT_PHDR (0x6)			# CHECK-NEXT: Type: PT_PHDR (0x6)
	# CHECK-NEXT: Offset: 0x40			# CHECK-NEXT: Offset: 0x40
	# CHECK-NEXT: VirtualAddress: 0x210040			# CHECK-NEXT: VirtualAddress: 0x200040
	# CHECK-NEXT: PhysicalAddress: 0x210040			# CHECK-NEXT: PhysicalAddress: 0x200040
	# CHECK-NEXT: FileSize: 224			# CHECK-NEXT: FileSize: 224
	# CHECK-NEXT: MemSize: 224			# CHECK-NEXT: MemSize: 224
	# CHECK-NEXT: Flags [ (0x4)			# CHECK-NEXT: Flags [ (0x4)
	# CHECK-NEXT: PF_R (0x4)			# CHECK-NEXT: PF_R (0x4)
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: Alignment: 8			# CHECK-NEXT: Alignment: 8
	# CHECK-NEXT: }			# CHECK-NEXT: }
	# CHECK-NEXT: ProgramHeader {			# CHECK-NEXT: ProgramHeader {
	# CHECK-NEXT: Type: PT_LOAD (0x1)			# CHECK-NEXT: Type: PT_LOAD (0x1)
	# CHECK-NEXT: Offset: 0x0			# CHECK-NEXT: Offset: 0x0
	# CHECK-NEXT: VirtualAddress: 0x210000			# CHECK-NEXT: VirtualAddress: 0x200000
	# CHECK-NEXT: PhysicalAddress: 0x210000			# CHECK-NEXT: PhysicalAddress: 0x200000
	# CHECK-NEXT: FileSize: 288			# CHECK-NEXT: FileSize: 288
	# CHECK-NEXT: MemSize: 288			# CHECK-NEXT: MemSize: 288
	# CHECK-NEXT: Flags [			# CHECK-NEXT: Flags [
	# CHECK-NEXT: PF_R			# CHECK-NEXT: PF_R
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: Alignment: 65536			# CHECK-NEXT: Alignment: 65536
	# CHECK-NEXT: }			# CHECK-NEXT: }
	# CHECK-NEXT: ProgramHeader {			# CHECK-NEXT: ProgramHeader {
	Show All 26 Lines

test/ELF/basic-i386.s

	Show First 20 Lines • Show All 123 Lines • ▼ Show 20 Lines
	# CHECK-NEXT: AddressAlignment: 1			# CHECK-NEXT: AddressAlignment: 1
	# CHECK-NEXT: EntrySize: 0			# CHECK-NEXT: EntrySize: 0
	# CHECK-NEXT: }			# CHECK-NEXT: }
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: ProgramHeaders [			# CHECK-NEXT: ProgramHeaders [
	# CHECK-NEXT: ProgramHeader {			# CHECK-NEXT: ProgramHeader {
	# CHECK-NEXT: Type: PT_PHDR (0x6)			# CHECK-NEXT: Type: PT_PHDR (0x6)
	# CHECK-NEXT: Offset: 0x34			# CHECK-NEXT: Offset: 0x34
	# CHECK-NEXT: VirtualAddress: 0x401034			# CHECK-NEXT: VirtualAddress: 0x400034
	# CHECK-NEXT: PhysicalAddress: 0x401034			# CHECK-NEXT: PhysicalAddress: 0x400034
	# CHECK-NEXT: FileSize: 128			# CHECK-NEXT: FileSize: 128
	# CHECK-NEXT: MemSize: 128			# CHECK-NEXT: MemSize: 128
	# CHECK-NEXT: Flags [ (0x4)			# CHECK-NEXT: Flags [ (0x4)
	# CHECK-NEXT: PF_R (0x4)			# CHECK-NEXT: PF_R (0x4)
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: Alignment: 4			# CHECK-NEXT: Alignment: 4
	# CHECK-NEXT: }			# CHECK-NEXT: }
	# CHECK-NEXT: ProgramHeader {			# CHECK-NEXT: ProgramHeader {
	# CHECK-NEXT: Type: PT_LOAD (0x1)			# CHECK-NEXT: Type: PT_LOAD (0x1)
	# CHECK-NEXT: Offset: 0x0			# CHECK-NEXT: Offset: 0x0
	# CHECK-NEXT: VirtualAddress: 0x401000			# CHECK-NEXT: VirtualAddress: 0x400000
	# CHECK-NEXT: PhysicalAddress: 0x401000			# CHECK-NEXT: PhysicalAddress: 0x400000
	# CHECK-NEXT: FileSize: 180			# CHECK-NEXT: FileSize: 180
	# CHECK-NEXT: MemSize: 180			# CHECK-NEXT: MemSize: 180
	# CHECK-NEXT: Flags [			# CHECK-NEXT: Flags [
	# CHECK-NEXT: PF_R			# CHECK-NEXT: PF_R
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: Alignment: 4096			# CHECK-NEXT: Alignment: 4096
	# CHECK-NEXT: }			# CHECK-NEXT: }
	# CHECK-NEXT: ProgramHeader {			# CHECK-NEXT: ProgramHeader {
	Show All 26 Lines

test/ELF/basic-ppc.s

	Show First 20 Lines • Show All 137 Lines • ▼ Show 20 Lines
	// CHECK-NEXT: 0000: 00			// CHECK-NEXT: 0000: 00
	// CHECK-NEXT: )			// CHECK-NEXT: )
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: ProgramHeaders [			// CHECK-NEXT: ProgramHeaders [
	// CHECK-NEXT: ProgramHeader {			// CHECK-NEXT: ProgramHeader {
	// CHECK-NEXT: Type: PT_PHDR (0x6)			// CHECK-NEXT: Type: PT_PHDR (0x6)
	// CHECK-NEXT: Offset: 0x34			// CHECK-NEXT: Offset: 0x34
	// CHECK-NEXT: VirtualAddress: 0x10010034			// CHECK-NEXT: VirtualAddress: 0x10000034
	// CHECK-NEXT: PhysicalAddress: 0x10010034			// CHECK-NEXT: PhysicalAddress: 0x10000034
	// CHECK-NEXT: FileSize: 128			// CHECK-NEXT: FileSize: 128
	// CHECK-NEXT: MemSize: 128			// CHECK-NEXT: MemSize: 128
	// CHECK-NEXT: Flags [ (0x4)			// CHECK-NEXT: Flags [ (0x4)
	// CHECK-NEXT: PF_R (0x4)			// CHECK-NEXT: PF_R (0x4)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Alignment: 4			// CHECK-NEXT: Alignment: 4
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: ProgramHeader {			// CHECK-NEXT: ProgramHeader {
	// CHECK-NEXT: Type: PT_LOAD (0x1)			// CHECK-NEXT: Type: PT_LOAD (0x1)
	// CHECK-NEXT: Offset: 0x0			// CHECK-NEXT: Offset: 0x0
	// CHECK-NEXT: VirtualAddress: 0x10010000			// CHECK-NEXT: VirtualAddress: 0x10000000
	// CHECK-NEXT: PhysicalAddress: 0x10010000			// CHECK-NEXT: PhysicalAddress: 0x10000000
	// CHECK-NEXT: FileSize: 180			// CHECK-NEXT: FileSize: 180
	// CHECK-NEXT: MemSize: 180			// CHECK-NEXT: MemSize: 180
	// CHECK-NEXT: Flags [ (0x4)			// CHECK-NEXT: Flags [ (0x4)
	// CHECK-NEXT: PF_R (0x4)			// CHECK-NEXT: PF_R (0x4)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Alignment: 65536			// CHECK-NEXT: Alignment: 65536
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: ProgramHeader {			// CHECK-NEXT: ProgramHeader {
	Show All 26 Lines

test/ELF/basic-sparcv9.s

	Show First 20 Lines • Show All 144 Lines • ▼ Show 20 Lines
	# CHECK-NEXT: Other: 0			# CHECK-NEXT: Other: 0
	# CHECK-NEXT: Section: .text			# CHECK-NEXT: Section: .text
	# CHECK-NEXT: }			# CHECK-NEXT: }
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: ProgramHeaders [			# CHECK-NEXT: ProgramHeaders [
	# CHECK-NEXT: ProgramHeader {			# CHECK-NEXT: ProgramHeader {
	# CHECK-NEXT: Type: PT_PHDR (0x6)			# CHECK-NEXT: Type: PT_PHDR (0x6)
	# CHECK-NEXT: Offset: 0x40			# CHECK-NEXT: Offset: 0x40
	# CHECK-NEXT: VirtualAddress: 0x200040			# CHECK-NEXT: VirtualAddress: 0x100040
	# CHECK-NEXT: PhysicalAddress: 0x200040			# CHECK-NEXT: PhysicalAddress: 0x100040
	# CHECK-NEXT: FileSize: 224			# CHECK-NEXT: FileSize: 224
	# CHECK-NEXT: MemSize: 224			# CHECK-NEXT: MemSize: 224
	# CHECK-NEXT: Flags [ (0x4)			# CHECK-NEXT: Flags [ (0x4)
	# CHECK-NEXT: PF_R (0x4)			# CHECK-NEXT: PF_R (0x4)
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: Alignment: 8			# CHECK-NEXT: Alignment: 8
	# CHECK-NEXT: }			# CHECK-NEXT: }
	# CHECK-NEXT: ProgramHeader {			# CHECK-NEXT: ProgramHeader {
	# CHECK-NEXT: Type: PT_LOAD (0x1)			# CHECK-NEXT: Type: PT_LOAD (0x1)
	# CHECK-NEXT: Offset: 0x0			# CHECK-NEXT: Offset: 0x0
	# CHECK-NEXT: VirtualAddress: 0x200000			# CHECK-NEXT: VirtualAddress: 0x100000
	# CHECK-NEXT: PhysicalAddress: 0x200000			# CHECK-NEXT: PhysicalAddress: 0x100000
	# CHECK-NEXT: FileSize: 288			# CHECK-NEXT: FileSize: 288
	# CHECK-NEXT: MemSize: 288			# CHECK-NEXT: MemSize: 288
	# CHECK-NEXT: Flags [			# CHECK-NEXT: Flags [
	# CHECK-NEXT: PF_R			# CHECK-NEXT: PF_R
	# CHECK-NEXT: ]			# CHECK-NEXT: ]
	# CHECK-NEXT: Alignment: 1048576			# CHECK-NEXT: Alignment: 1048576
	# CHECK-NEXT: }			# CHECK-NEXT: }
	# CHECK-NEXT: ProgramHeader {			# CHECK-NEXT: ProgramHeader {
	Show All 26 Lines

test/ELF/ttext-tdata-tbss.s

	# REQUIRES: x86			# REQUIRES: x86
	# RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %s -o %t.o

	## Show what regular output gives to us.			## Show what regular output gives to us.
	# RUN: ld.lld %t.o -o %t1			# RUN: ld.lld %t.o -o %t1
	# RUN: llvm-readelf -S -l %t1 \| FileCheck %s			# RUN: llvm-readelf -S -l %t1 \| FileCheck %s
	# CHECK: .rodata PROGBITS 0000000000200158 000158 000008			# CHECK: .rodata PROGBITS 0000000000200158 000158 000008
	# CHECK-NEXT: .text PROGBITS 0000000000201000 001000 000001			# CHECK-NEXT: .text PROGBITS 0000000000201000 001000 000001
	# CHECK-NEXT: .aw PROGBITS 0000000000202000 002000 000008			# CHECK-NEXT: .aw PROGBITS 0000000000202000 002000 000008
	# CHECK-NEXT: .data PROGBITS 0000000000202008 002008 000008			# CHECK-NEXT: .data PROGBITS 0000000000202008 002008 000008
	# CHECK-NEXT: .bss NOBITS 0000000000202010 002010 000008			# CHECK-NEXT: .bss NOBITS 0000000000202010 002010 000008
	# CHECK: Type			# CHECK: Type
	# CHECK-NEXT: PHDR			# CHECK-NEXT: PHDR
	# CHECK-NEXT: LOAD 0x000000 0x0000000000200000			# CHECK-NEXT: LOAD 0x000000 0x0000000000200000

	## With .text at 0 there is no space to allocate the headers.			## If -Ttext is smaller than the image base (which defaults to 0x200000 for -no-pie),
				## the headers will still be allocated, but mapped at a higher address,
				## which may look strange.
	# RUN: ld.lld -Ttext 0x0 -Tdata 0x4000 -Tbss 0x8000 %t.o -o %t2			# RUN: ld.lld -Ttext 0x0 -Tdata 0x4000 -Tbss 0x8000 %t.o -o %t2
	# RUN: llvm-readelf -S -l %t2 \| FileCheck %s --check-prefix=USER1			# RUN: llvm-readelf -S -l %t2 \| FileCheck %s --check-prefix=USER1
	# USER1: .text PROGBITS 0000000000000000 001000 000001			# USER1: .text PROGBITS 0000000000000000 001000 000001
	# USER1-NEXT: .data PROGBITS 0000000000004000 002000 000008			# USER1-NEXT: .data PROGBITS 0000000000004000 002000 000008
	# USER1-NEXT: .bss NOBITS 0000000000008000 002008 000008			# USER1-NEXT: .bss NOBITS 0000000000008000 002008 000008
	# USER1-NEXT: .rodata PROGBITS 0000000000009000 003000 000008			# USER1-NEXT: .rodata PROGBITS 0000000000009000 003000 000008
	# USER1-NEXT: .aw PROGBITS 000000000000a000 004000 000008			# USER1-NEXT: .aw PROGBITS 000000000000a000 004000 000008
	# USER1: Type			# USER1: Type
				# USER1-NEXT: PHDR 0x000040 0x0000000000200040
				# USER1-NEXT: LOAD 0x000000 0x0000000000200000
	# USER1-NEXT: LOAD 0x001000 0x0000000000000000			# USER1-NEXT: LOAD 0x001000 0x0000000000000000

	## With .text at 0x1000 there is space to allocate the headers.			## Specify --image-base to make program headers look normal.
	# RUN: ld.lld -Ttext 0x1000 -Tdata 0x4000 -Tbss 0x8000 %t.o -o %t3			# RUN: ld.lld --image-base=0 -Ttext 0x1000 -Tdata 0x4000 -Tbss 0x8000 %t.o -o %t3
	# RUN: llvm-readelf -S -l %t3 \| FileCheck %s --check-prefix=USER2			# RUN: llvm-readelf -S -l %t3 \| FileCheck %s --check-prefix=USER2
	# USER2: .text PROGBITS 0000000000001000 001000 000001			# USER2: .text PROGBITS 0000000000001000 001000 000001
	# USER2-NEXT: .data PROGBITS 0000000000004000 002000 000008			# USER2-NEXT: .data PROGBITS 0000000000004000 002000 000008
	# USER2-NEXT: .bss NOBITS 0000000000008000 002008 000008			# USER2-NEXT: .bss NOBITS 0000000000008000 002008 000008
	# USER2-NEXT: .rodata PROGBITS 0000000000009000 003000 000008			# USER2-NEXT: .rodata PROGBITS 0000000000009000 003000 000008
	# USER2-NEXT: .aw PROGBITS 000000000000a000 004000 000008			# USER2-NEXT: .aw PROGBITS 000000000000a000 004000 000008
	# USER2: Type			# USER2: Type
	# USER2-NEXT: PHDR			# USER2-NEXT: PHDR 0x000040 0x0000000000000040
	# USER2-NEXT: LOAD 0x000000 0x0000000000000000			# USER2-NEXT: LOAD 0x000000 0x0000000000000000
				# USER2-NEXT: LOAD 0x001000 0x0000000000001000

	## With .text well above 200000 we don't need to change the image base			## With .text well above 200000 we don't need to change the image base
	# RUN: ld.lld -Ttext 0x201000 %t.o -o %t4			# RUN: ld.lld -Ttext 0x201000 %t.o -o %t4
	# RUN: llvm-readelf -S -l %t4 \| FileCheck %s --check-prefix=USER3			# RUN: llvm-readelf -S -l %t4 \| FileCheck %s --check-prefix=USER3
	# USER3: .text PROGBITS 0000000000201000 001000 000001			# USER3: .text PROGBITS 0000000000201000 001000 000001
	# USER3-NEX: .rodata PROGBITS 0000000000202000 002000 000008			# USER3-NEX: .rodata PROGBITS 0000000000202000 002000 000008
	# USER3-NEX: .aw PROGBITS 0000000000203000 003000 000008			# USER3-NEX: .aw PROGBITS 0000000000203000 003000 000008
	# USER3-NEX: .data PROGBITS 0000000000203008 003008 000008			# USER3-NEX: .data PROGBITS 0000000000203008 003008 000008
	# USER3-NEX: .bss NOBITS 0000000000203010 003010 000008			# USER3-NEX: .bss NOBITS 0000000000203010 003010 000008
	# USER3: Type			# USER3: Type
	# USER3-NEXT: PHDR			# USER3-NEXT: PHDR 0x000040 0x0000000000200040
	# USER3-NEXT: LOAD 0x000000 0x0000000000200000			# USER3-NEXT: LOAD 0x000000 0x0000000000200000
				# USER3-NEXT: LOAD 0x001000 0x0000000000201000

	.text			.text
	.globl _start			.globl _start
	_start:			_start:
	nop			nop

	.section .rodata,"a"			.section .rodata,"a"
	.quad 0			.quad 0
	Show All 9 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[ELF] Map the ELF header at imageBaseClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 219849

ELF/LinkerScript.cpp

ELF/Writer.cpp

test/ELF/basic-aarch64.s

test/ELF/basic-i386.s

test/ELF/basic-ppc.s

test/ELF/basic-sparcv9.s

test/ELF/ttext-tdata-tbss.s

[ELF] Map the ELF header at imageBase
ClosedPublic