This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/
-
ELF/
-
Arch/
25/28
RISCV.cpp
4/4
InputSection.h
-
InputSection.cpp
-
Relocations.h
-
Relocations.cpp
1/1
Target.h
2/2
Writer.cpp
-
test/ELF/
-
ELF/
-
riscv-relax-align-rvc.s
-
riscv-relax-align.s
-
riscv-reloc-align.s

Differential D127581

[ELF] Relax R_RISCV_ALIGN
ClosedPublic

Authored by MaskRay on Jun 11 2022, 4:51 PM.

Download Raw Diff

Details

Reviewers

gkm
luismarques
jrtc27
kito-cheng
peter.smith
MaskRay

Commits

rG6611d58f5bbc: [ELF] Relax R_RISCV_ALIGN

Summary

Alternative to D125036. Implement R_RISCV_ALIGN relaxation so that we can handle
-mrelax object files (i.e. -mno-relax is no longer needed) and creates a
framework for future relaxation.

relaxAux is placed in a union with InputSectionBase::jumpInstrMod, storing
auxiliary information for relaxation. In the first pass, relaxAux is allocated.
The main data structure is relocDeltas: when referencing relocations[i], the
actual offset is r_offset - (i ? relocDeltas[i-1] : 0).

relaxOnce performs one relaxation pass. It computes relocDeltas for all text
section. Then, adjust st_value/st_size for symbols relative to this section
based on SymbolAnchor. bytesDropped is set so that assignAddresses knows
that the size has changed.

Run relaxOnce in the finalizeAddressDependentContent loop to wait for
convergence of text sections and other address dependent sections (e.g.
SHT_RELR). Note: extrating relaxOnce into a separate loop works for many cases
but has issues in some linker script edge cases.

After convergence, compute section contents: shrink the NOP sequence of each
R_RISCV_ALIGN as appropriate. Instead of deleting bytes, we run a sequence of
memcpy on the content delimitered by relocation locations. For R_RISCV_ALIGN let
the next memcpy skip the desired number of bytes. Section content computation is
parallelizable, but let's ensure the implementation is mature before
optimizations. Technically we can save a copy if we interleave some code with
OutputSection::writeTo, but let's not pollute the generic code (we don't have
templated relocation resolving, so using conditions can impose overhead to
non-RISCV.)

Tested:
make ARCH=riscv CROSS_COMPILE=riscv64-linux-gnu- LLVM=1 defconfig all
built kernel is bootable.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

MaskRay created this revision.Jun 11 2022, 4:51 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 11 2022, 4:51 PM

Herald added subscribers: sunshaoce, VincentWu, luke957 and 30 others. · View Herald Transcript

MaskRay requested review of this revision.Jun 11 2022, 4:51 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 11 2022, 4:51 PM

Herald added subscribers: llvm-commits, • pcwang-thead, eopXD. · View Herald Transcript

Harbormaster completed remote builds in B169267: Diff 436170.Jun 11 2022, 5:05 PM

MaskRay added reviewers: gkm, luismarques, jrtc27, kito-cheng.Jun 12 2022, 2:12 AM

Harbormaster completed remote builds in B169290: Diff 436196.Jun 12 2022, 2:23 AM

CoelacanthusHex added a subscriber: CoelacanthusHex.Jun 12 2022, 2:24 AM

felixonmars added a subscriber: felixonmars.Jun 12 2022, 10:37 AM

R_RISCV_CALL patch will follow

MaskRay mentioned this in D127611: [ELF] Relax R_RISCV_CALL and R_RISCV_CALL_PLT.Jun 12 2022, 11:05 PM

Harbormaster completed remote builds in B169359: Diff 436277.Jun 12 2022, 11:15 PM

MaskRay mentioned this in D125036: [RISCV] Alignment relaxation.Jun 12 2022, 11:27 PM

simplify

MaskRay added a child revision: D127611: [ELF] Relax R_RISCV_CALL and R_RISCV_CALL_PLT.Jun 13 2022, 12:02 AM

Harbormaster completed remote builds in B169364: Diff 436282.Jun 13 2022, 12:11 AM

MaskRay edited the summary of this revision. (Show Details)Jun 14 2022, 12:22 AM

adjust test. add more comments

Herald added a subscriber: mgrang. · View Herald TranscriptJun 14 2022, 1:08 AM

Harbormaster completed remote builds in B169649: Diff 436688.Jun 14 2022, 1:25 AM

Handle --gc-sections

Harbormaster completed remote builds in B169784: Diff 436867.Jun 14 2022, 12:46 PM

luismarques added inline comments.Jun 15 2022, 12:08 PM

lld/ELF/LinkerScript.cpp
949–950 ↗	(On Diff #436867)	Document return
1326 ↗	(On Diff #436867)	We don't have test coverage for this.

Remove unneeded assignAddress change

MaskRay marked 2 inline comments as done.Jun 15 2022, 12:38 PM

MaskRay added inline comments.

lld/ELF/LinkerScript.cpp
949–950 ↗	(On Diff #436867)	I think assignOffsets does not need a change. Removed
1326 ↗	(On Diff #436867)	I removed this change. If we exit the loop with relaxOnce => true assignAddresses # address updated with correct section size information relaxOnce => false # instruction relaxation output does not change It has converged.

Harbormaster completed remote builds in B170083: Diff 437298.Jun 15 2022, 1:51 PM

CoelacanthusHex added a subscriber: Xeonacid.Jun 17 2022, 1:30 AM

(I intended to only bother Peter after some RISC-V folks made some verification, but I guess the two steps can be parallel. This works for some configurations I know, including @compnerd's usage.)
I have asked a distrubotr @felixonmars for testing as well.

Do you really need a plain union? How about a PointerUnion or something llvmy std::variant?

In D127581#3594812, @tschuett wrote:

Do you really need a plain union? How about a PointerUnion or something llvmy std::variant?

The size is very important for memory usage. 8 or 16 bytes here may cost 1% maximum RSS.
Since PointerUnion abstraction seems unnecessary, I'd want to avoid it.

In D127581#3595064, @MaskRay wrote:

In D127581#3594812, @tschuett wrote:

Do you really need a plain union? How about a PointerUnion or something llvmy std::variant?

The size is very important for memory usage. 8 or 16 bytes here may cost 1% maximum RSS.
Since PointerUnion abstraction seems unnecessary, I'd want to avoid it.

I would expect the size of the pointer union to be 8 bytes. My main motivation is that unions are odd.

In D127581#3595417, @tschuett wrote:

In D127581#3595064, @MaskRay wrote:

In D127581#3594812, @tschuett wrote:

Do you really need a plain union? How about a PointerUnion or something llvmy std::variant?

The size is very important for memory usage. 8 or 16 bytes here may cost 1% maximum RSS.
Since PointerUnion abstraction seems unnecessary, I'd want to avoid it.

I would expect the size of the pointer union to be 8 bytes. My main motivation is that unions are odd.

PointerUnion will not increase the size, but the additional abstraction is unnecessary. Both features are used in very specific context and there is no need to worry about misuse (and if there is a misuse, it will fail immediately). I'd also want to avoid the cost clearing the least significant bit.

I haven't looked at the code yet, but I can confirm that I was able to build and boot FreeBSD for RISCV64 with this patch and the FreeBSD patch below to remove -mno-relax (I could have just defined the riscv-relaxations linker feature, but I wanted to make sure that the code is actually built with relaxations):

diff --git a/share/mk/bsd.lib.mk b/share/mk/bsd.lib.mk
index 36d91ea019f3..b37ebe39ecf4 100644
--- a/share/mk/bsd.lib.mk
+++ b/share/mk/bsd.lib.mk
@@ -123,8 +123,8 @@ CXXFLAGS+= ${DEBUG_FILES_CFLAGS}
 CTFFLAGS+= -g
 .endif
 
-.if ${MACHINE_CPUARCH} == "riscv" && ${LINKER_FEATURES:Mriscv-relaxations} == ""
-CFLAGS += -mno-relax
+.if ${MACHINE_CPUARCH} == "riscv"
+CFLAGS += -mrelax
 .endif
 
 .include <bsd.libnames.mk>
diff --git a/share/mk/bsd.prog.mk b/share/mk/bsd.prog.mk
index 6b8da09edaf0..3743250c2c87 100644
--- a/share/mk/bsd.prog.mk
+++ b/share/mk/bsd.prog.mk
@@ -84,8 +84,8 @@ CXXFLAGS+= -ftrivial-auto-var-init=pattern
 # bsd.sanitizer.mk is not installed, so don't require it (e.g. for ports).
 .sinclude "bsd.sanitizer.mk"
 
-.if ${MACHINE_CPUARCH} == "riscv" && ${LINKER_FEATURES:Mriscv-relaxations} == ""
-CFLAGS += -mno-relax
+.if ${MACHINE_CPUARCH} == "riscv"
+CFLAGS += -mrelax
 .endif
 
 .if defined(CRUNCH_CFLAGS)
diff --git a/share/mk/bsd.sys.mk b/share/mk/bsd.sys.mk
index 221e8b028479..f6266d7b991b 100644
--- a/share/mk/bsd.sys.mk
+++ b/share/mk/bsd.sys.mk
@@ -83,6 +83,13 @@ CWARNFLAGS.clang+=   -Wno-unused-const-variable
 .if ${COMPILER_TYPE} == "clang" && ${COMPILER_VERSION} >= 130000
 CWARNFLAGS.clang+=     -Wno-error=unused-but-set-variable
 .endif
+.if ${COMPILER_TYPE} == "clang" && ${COMPILER_VERSION} >= 150000
+CWARNFLAGS.clang+=     -Wno-deprecated-non-prototype
+CWARNFLAGS.clang+=     -Wno-unreachable-code-generic-assoc
+CWARNFLAGS.clang+=     -Wno-strict-prototypes
+CWARNFLAGS.clang+=     -Wno-error=unused-but-set-parameter
+CWARNFLAGS.clang+=     -Wno-error=implicit-function-declaration
+.endif
 .endif # WARNS <= 6
 .if ${WARNS} <= 3
 CWARNFLAGS.clang+=     -Wno-tautological-compare -Wno-unused-value\
diff --git a/stand/defs.mk b/stand/defs.mk
index e9c97f7720ab..753bf39ced31 100644
--- a/stand/defs.mk
+++ b/stand/defs.mk
@@ -175,9 +175,7 @@ CFLAGS+=    -fPIC
 
 # Some RISC-V linkers have support for relaxations, while some (lld) do not
 # yet. If this is the case we inhibit the compiler from emitting relaxations.
-.if ${LINKER_FEATURES:Mriscv-relaxations} == ""
-CFLAGS+=       -mno-relax
-.endif
+CFLAGS+=       -mrelax
 
 # The boot loader build uses dd status=none, where possible, for reproducible
 # build output (since performance varies from run to run). Trouble is that
diff --git a/sys/conf/kern.mk b/sys/conf/kern.mk
index b86149ab4618..2c82ff97b88e 100644
--- a/sys/conf/kern.mk
+++ b/sys/conf/kern.mk
@@ -34,6 +34,9 @@ NO_WUNUSED_BUT_SET_VARIABLE=  -Wno-unused-but-set-variable
 .if ${COMPILER_VERSION} >= 140000
 NO_WBITWISE_INSTEAD_OF_LOGICAL=        -Wno-bitwise-instead-of-logical
 .endif
+.if ${COMPILER_VERSION} >= 150000
+CWARNFLAGS+=   -Wno-strict-prototypes -Wno-error=unused-but-set-variable
+.endif
 # Several other warnings which might be useful in some cases, but not severe
 # enough to error out the whole kernel build.  Display them anyway, so there is
 # some incentive to fix them eventually.
@@ -151,9 +154,7 @@ CFLAGS.clang+=      -mcmodel=medium
 CFLAGS.gcc+=   -mcmodel=medany
 INLINE_LIMIT?= 8000
 
-.if ${LINKER_FEATURES:Mriscv-relaxations} == ""
-CFLAGS+=       -mno-relax
-.endif
+CFLAGS+=       -mrelax
 .endif
 
 #

I'll leave the RISCV details to the experts. General approach looks good to me.

lld/ELF/InputSection.h
102	It looks like the definitions are only used in RISCV.cpp as only a pointer is used in the union below. Could these be forward declared here? I could be missing some use though.
223	update comment for relaxAux? I assume that the union is because we don't have relaxation and basic block sections simultaneously?
lld/ELF/Target.h
92	Will be worth a comment like `needsThunk` to describe what this does, just in case another architecture chooses to do RiscV like relaxations.
lld/ELF/Writer.cpp
1639	Although more source changes. Would it be cleaner to have the passes variable here, and pass it into createThunks as a parameter?

gkm added inline comments.Jun 20 2022, 1:27 PM

lld/ELF/Arch/RISCV.cpp
592	`InputSectionBase::bytesDropped` is merely `uint8_t`, and feels vulnerable to overflow. The comment on the decl says it is intended for basic-block sections, for which 8 bits is reasonable, but this new use, it might be inadequate. Perhaps `uint16_t` ?

comments

Harbormaster completed remote builds in B170952: Diff 438522.Jun 20 2022, 7:01 PM

kito-cheng added inline comments.Jun 21 2022, 4:40 AM

lld/ELF/Arch/RISCV.cpp
609	I hit overflow here as @gkm concern, and the fixed by changing `bytesDropped` to `uint16_t` (yeah, I tested the uint8_t version), maybe we can put an `assert (delta <= numeric_limits<uint16_t>::max());` here to make sure this could catch earlier? I saw there are assertions for `byteDropped` in other place, so I think that should be reasonable? [kitoc@xxxx llvm-project]$ grpe bytesDropped * -R ... lld/ELF/InputSection.h: uint8_t bytesDropped = 0; lld/ELF/InputSection.h: assert(bytesDropped + num < 256); lld/ELF/InputSection.h: bytesDropped += num; lld/ELF/InputSection.h: assert(bytesDropped >= num); lld/ELF/InputSection.h: bytesDropped -= num; ... Gonna run second round of testing.

luismarques added inline comments.Jun 21 2022, 6:18 AM

lld/ELF/Arch/RISCV.cpp
553–561	Can't we skip this for the first pass?
562–564	No test coverage for this?

luismarques added inline comments.Jun 21 2022, 8:18 AM

lld/ELF/Arch/RISCV.cpp
562–564	Nevermind. D127611.

gkm added inline comments.Jun 21 2022, 9:00 AM

lld/ELF/InputSection.h
161

add // namespace

lld/ELF/Arch/RISCV.cpp
553–561	This code takes nearly no time. I don't think we should special case the first pass.
592	Thanks!
609	push_back/drop_back is a code problem of the basic block sections feature. I don't intend to touch the functions for this patch. Changed RISCV.cpp:611 instead.
lld/ELF/InputSection.h
161	push_back/drop_back is a code problem of the basic block sections feature. I don't intend to touch the functions for this patch. Changed RISCV.cpp:611 instead.
lld/ELF/Writer.cpp
1639	ThunkCreator::pass needs to be retained, otherwise `uint32_t pass` needs to be threading though most of its member functions.

Harbormaster completed remote builds in B171234: Diff 438897.Jun 21 2022, 8:17 PM

Thanks for the updates, I don't have any more comments. Happy for someone comfortable with RISCV to approve when they are happy with the details.

I have used this patch to compile and link to the software below. Generally, there are no obvious problems with this patch.

curl-7.82.0:
- command: ./configure CC=/path/to/clang LDFLAGS="-fuse-ld=lld --ld-path=/path/to/ld.lld -mrelax" --without-ssl
- result：The compiler and linker works fine and all the tests that are provided have passed. Simply running is fine.

bash-5.1:
- command: ./configure CC_FOR_BUILD=/path/to/clang LDFLAGS_FOR_BUILD="-fuse-ld=lld --ld-path=/path/to/ld.lld -mrelax" CC=/path/to/clang LDFLAGS="-fuse-ld=lld --ld-path=/path/to/ld.lld -mrelax"
- result：The compiler and linker works fine and part of the tests that are provided has passed. Failed test not related to this patch. Simply running is fine.

vim 8.2.5:
- command: ./configure CC=/path/to/clang LDFLAGS="-fuse-ld=lld --ld-path=/path/to/ld.lld -mrelax
- result：The compiler and linker works fine and part of the tests that are provided has passed. Failed test not related to this patch. Simply running is fine.

libevent:
- command: ./configure CC=/path/to/clang LDFLAGS="-fuse-ld=lld --ld-path=/path/to/ld.lld -mrelax"
- result: The compiler and linker works fine and all the tests that are provided have passed.

tmux:
- command: ./configure CC=/path/to/clang LDFLAGS="-fuse-ld=lld --ld-path=/path/to/ld.lld -mrelax"
- result: The compiler and linker works fine. run make check have no error. Simply running is fine.

Tested with this patch with LLVM testsuite and internal testsuite, and no failure :)

Thanks for all the suggestions and testing! I think the approach implemented in this patch series is what we should use. I'll wait a week and push the two...

In D127581#3594793, @MaskRay wrote:

I have asked a distrubotr @felixonmars for testing as well.

Thanks for the patch. We (@Arch) have done some extended testing with for example the Firefox browser, and everything looks good so far.

MaskRay accepted this revision.Jun 23 2022, 2:45 PM

This revision is now accepted and ready to land.Jun 23 2022, 2:45 PM

Ping @jrtc27

jrtc27 added inline comments.Jul 3 2022, 9:39 AM

lld/ELF/Arch/RISCV.cpp
276	ALIGN is not a hint, it's a requirement (as opposed to R_RISCV_RELAX, which is a true hint). I would suggest separating these two notions at the RelExpr level, then the config->relax check to skip R_RISCV_RELAX in the CALL(_PLT) patch can instead be done at the generic level rather than in the target.
490	Should this not be Elf_Sword or similar? ELFCLASS64 _can_ overflow this, even if you really really really shouldn't.
493	A getter that returns nothing is odd, saveSymbolAnchors/recordSymbolAnchors/initSymbolAnchors/similar?
499	Do you not just want an `int32_t *` (or smart pointer) given it's either 0 or relocations.size() elements? SmallVector adds overhead as it tracks both size and capacity, but we don't need any dynamic behaviour (beyond "does not exist" (null) and "exists with relocations.size() elements").
522	Does this not need to be stable_sort to guarantee the zero-sized symbols have their anchors in the right order? Also, does the order of A's end vs B's start matter for this implementation? That should be documented (and, ideally, why).
539	This seems like it belongs in generic code?
553	Is this actually the original st_value? If you interleave relaxation with other adjustment of st_value, won't delta stay the same but then the "unrelaxed" st_value will be different?
569	It might be nice to hoist this out to a separate function, it's quite nested here and this is the bit people care about editing to add new relaxations, so separating the "do the relaxations" part from all the tracking infrastructure would help there.
652	I don't know if it's a requirement that, say, 6 padding bytes be emitted as `nop; c.nop` rather than `c.nop; nop`. Does binutils make this assumption?

comments

lld/ELF/Arch/RISCV.cpp
276	The R_RISCV_ALIGN case label is specific to RISC-V. The pass is done after scanRelocations, so the code does not fit into the generic code. RelExpr can be split for ALIGN and RELAX but its necessity isn't that high. If just that `HINT` is a bit of misnomer I can rename it separately to `R_RELAX` or `R_RELAX_OR_ALIGN`.
490	There are precedents using int32_t in many places. Elf_Sword is not used. Since relocations aren't that many, just switched to int64_t.
499	changed to std::unique_ptr<int64_t[]>
522	llvm::sort is fine. The previous comment explains it. Improved the comment a bit.
539	This is RISC-V specific. Unless another supported architecture adds relaxation, this can stay here.
553	For most symbols, this is the original st_value. Linker script symbol assignments may rewrite st_value (and does not care about the original value). The code should be fine.
652	I vaguely remember that GNU ld seems to use `nop; c.nop` (prefer long to short). This code should match its behavior.

Harbormaster completed remote builds in B173468: Diff 441971.Jul 3 2022, 12:14 PM

MaskRay added inline comments.Jul 3 2022, 9:35 PM

lld/ELF/Arch/RISCV.cpp
490	Hmm. I guess the original int32_t or uint32_t may be better.

Thank for the thorough comments.
I'll push this in few days if there is no further request.

Rebase after a getInputSections optimization

Harbormaster completed remote builds in B173809: Diff 442448.Jul 5 2022, 11:57 PM

This revision was landed with ongoing or failed builds.Jul 7 2022, 10:16 AM

Closed by commit rG6611d58f5bbc: [ELF] Relax R_RISCV_ALIGN (authored by MaskRay). · Explain Why

This revision was automatically updated to reflect the committed changes.

MaskRay added a commit: rG6611d58f5bbc: [ELF] Relax R_RISCV_ALIGN.

MaskRay mentioned this in rG75e551e5d830: [ELF] Relax R_RISCV_CALL and R_RISCV_CALL_PLT.Jul 7 2022, 10:18 AM

Thanks for implementing this! Tested the kernel patch works well.
https://lore.kernel.org/llvm/20220710071117.446112-1-maskray@google.com/

jrtc27 mentioned this in D77694: [WIP][RISCV][ELF] Linker relaxation support.Aug 9 2022, 11:46 AM

luismarques added inline comments.Jan 19 2023, 6:13 AM

lld/ELF/Arch/RISCV.cpp
598	@MaskRay I ran into this error when building LLVM with LLD in a RISC-V host. I guess we actually need an int32?

Herald added a subscriber: luke. · View Herald TranscriptJan 19 2023, 6:13 AM

MaskRay added inline comments.Jan 19 2023, 7:47 PM

lld/ELF/Arch/RISCV.cpp
598	This will increase the size of InputSection which we should try to avoid (memory usage increase). I am on a trip so cannot investigate it closely. It will help if you can ask the author of `--optimize-bb-jumps` whether it is still used. Removing `nopFiller` will make room for delta.

luismarques added a subscriber: tmsriram.Jan 20 2023, 7:44 AM

luismarques added inline comments.

lld/ELF/Arch/RISCV.cpp
598	It will help if you can ask the author of `--optimize-bb-jumps` whether it is still used. Removing `nopFiller` will make room for delta. @tmsriram any comments?

Revision Contents

Path

Size

lld/

ELF/

Arch/

149 lines

32 lines

4 lines

1 line

4 lines

2 lines

2 lines

test/

ELF/

riscv-relax-align-rvc.s

68 lines

riscv-relax-align.s

109 lines

riscv-reloc-align.s

Diff 436170

lld/ELF/Arch/RISCV.cpp

//===- RISCV.cpp ----------------------------------------------------------===//		//===- RISCV.cpp ----------------------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "InputFiles.h"		#include "InputFiles.h"
		#include "OutputSections.h"
#include "Symbols.h"		#include "Symbols.h"
#include "SyntheticSections.h"		#include "SyntheticSections.h"
#include "Target.h"		#include "Target.h"

using namespace llvm;		using namespace llvm;
using namespace llvm::object;		using namespace llvm::object;
using namespace llvm::support::endian;		using namespace llvm::support::endian;
using namespace llvm::ELF;		using namespace llvm::ELF;
Show All 13 Lines	public:
void writePltHeader(uint8_t *buf) const override;		void writePltHeader(uint8_t *buf) const override;
void writePlt(uint8_t *buf, const Symbol &sym,		void writePlt(uint8_t *buf, const Symbol &sym,
uint64_t pltEntryAddr) const override;		uint64_t pltEntryAddr) const override;
RelType getDynRel(RelType type) const override;		RelType getDynRel(RelType type) const override;
RelExpr getRelExpr(RelType type, const Symbol &s,		RelExpr getRelExpr(RelType type, const Symbol &s,
const uint8_t *loc) const override;		const uint8_t *loc) const override;
void relocate(uint8_t *loc, const Relocation &rel,		void relocate(uint8_t *loc, const Relocation &rel,
uint64_t val) const override;		uint64_t val) const override;
		void relaxSections() const override;
};		};

} // end anonymous namespace		} // end anonymous namespace

const uint64_t dtpOffset = 0x800;		const uint64_t dtpOffset = 0x800;

enum Op {		enum Op {
ADDI = 0x13,		ADDI = 0x13,
▲ Show 20 Lines • Show All 219 Lines • ▼ Show 20 Lines	RelExpr RISCV::getRelExpr(const RelType type, const Symbol &s,
case R_RISCV_TPREL_HI20:		case R_RISCV_TPREL_HI20:
case R_RISCV_TPREL_LO12_I:		case R_RISCV_TPREL_LO12_I:
case R_RISCV_TPREL_LO12_S:		case R_RISCV_TPREL_LO12_S:
return R_TPREL;		return R_TPREL;
case R_RISCV_RELAX:		case R_RISCV_RELAX:
case R_RISCV_TPREL_ADD:		case R_RISCV_TPREL_ADD:
return R_NONE;		return R_NONE;
case R_RISCV_ALIGN:		case R_RISCV_ALIGN:
// Not just a hint; always padded to the worst-case number of NOPs, so may		return R_RELAX_HINT;
		jrtc27Unsubmitted Not Done Reply Inline Actions ALIGN is not a hint, it's a requirement (as opposed to R_RISCV_RELAX, which is a true hint). I would suggest separating these two notions at the RelExpr level, then the config->relax check to skip R_RISCV_RELAX in the CALL(_PLT) patch can instead be done at the generic level rather than in the target. jrtc27: ALIGN is not a hint, it's a requirement (as opposed to R_RISCV_RELAX, which is a true hint). I…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions The R_RISCV_ALIGN case label is specific to RISC-V. The pass is done after scanRelocations, so the code does not fit into the generic code. RelExpr can be split for ALIGN and RELAX but its necessity isn't that high. If just that `HINT` is a bit of misnomer I can rename it separately to `R_RELAX` or `R_RELAX_OR_ALIGN`. MaskRay: The R_RISCV_ALIGN case label is specific to RISC-V. The pass is done after scanRelocations, so…
// not currently be aligned, and without linker relaxation support we can't
// delete NOPs to realign.
errorOrWarn(getErrorLocation(loc) + "relocation R_RISCV_ALIGN requires "
"unimplemented linker relaxation; recompile with -mno-relax");
return R_NONE;
default:		default:
error(getErrorLocation(loc) + "unknown relocation (" + Twine(type) +		error(getErrorLocation(loc) + "unknown relocation (" + Twine(type) +
") against symbol " + toString(s));		") against symbol " + toString(s));
return R_NONE;		return R_NONE;
}		}
}		}

// Extract bits V[Begin:End], where range is inclusive, and Begin must be < 63.		// Extract bits V[Begin:End], where range is inclusive, and Begin must be < 63.
▲ Show 20 Lines • Show All 183 Lines • ▼ Show 20 Lines	void RISCV::relocate(uint8_t *loc, const Relocation &rel, uint64_t val) const {
case R_RISCV_RELAX:		case R_RISCV_RELAX:
return; // Ignored (for now)		return; // Ignored (for now)

default:		default:
llvm_unreachable("unknown relocation");		llvm_unreachable("unknown relocation");
}		}
}		}

		static void getSymbolAnchors() {
		for (OutputSection *osec : outputSections) {
		if (!(osec->flags & SHF_EXECINSTR))
		continue;
		for (InputSection sec : getInputSections(osec)) {
		sec->relaxAux = make<RISCVRelaxAux>();
		sec->relaxAux->relocDeltas.resize(sec->relocations.size());
		}
		}
		// Store anchors (st_value and st_value+st_size) for symbols relatived to text
		// sections.
		for (InputFile *file : objectFiles)
		for (Symbol *sym : file->getSymbols())
		if (auto *d = dyn_cast<Defined>(sym))
		if (auto *sec = dyn_cast_or_null<InputSection>(d->section))
		jrtc27Unsubmitted Done Reply Inline Actions Should this not be Elf_Sword or similar? ELFCLASS64 _can_ overflow this, even if you really really really shouldn't. jrtc27: Should this not be Elf_Sword or similar? ELFCLASS64 _can_ overflow this, even if you really…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions There are precedents using int32_t in many places. Elf_Sword is not used. Since relocations aren't that many, just switched to int64_t. MaskRay: There are precedents using int32_t in many places. Elf_Sword is not used. Since relocations…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions Hmm. I guess the original int32_t or uint32_t may be better. MaskRay: Hmm. I guess the original int32_t or uint32_t may be better.
		if (sec->flags & SHF_EXECINSTR) {
		sec->relaxAux->anchors.push_back({d->value, d, false});
		sec->relaxAux->anchors.push_back({d->value + d->size, d, true});
		jrtc27Unsubmitted Done Reply Inline Actions A getter that returns nothing is odd, saveSymbolAnchors/recordSymbolAnchors/initSymbolAnchors/similar? jrtc27: A getter that returns nothing is odd, saveSymbolAnchors/recordSymbolAnchors/initSymbolAnchors/s…
		}
		// Sort anchors by offset.
		for (OutputSection *osec : outputSections) {
		if (!(osec->flags & SHF_EXECINSTR))
		continue;
		for (InputSection sec : getInputSections(osec)) {
		jrtc27Unsubmitted Done Reply Inline Actions Do you not just want an `int32_t ` (or smart pointer) given it's either 0 or relocations.size() elements? SmallVector adds overhead as it tracks both size and capacity, but we don't need any dynamic behaviour (beyond "does not exist" (null) and "exists with relocations.size() elements"). jrtc27:* Do you not just want an `int32_t *` (or smart pointer) given it's either 0 or relocations.size…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions changed to std::unique_ptr<int64_t[]> MaskRay: changed to std::unique_ptr<int64_t[]>
		llvm::stable_sort(sec->relaxAux->anchors,
		[](auto &a, auto &b) { return a.offset < b.offset; });
		}
		}
		}

		// Do a relaxation pass and return true if we changed something. When relaxing
		// just R_RISCV_ALIGN, relocDeltas is only changed once. For call and load/store
		// R_RISCV_RELAX, code shrinkage may reduce other displacements sufficiently to
		// become eligible for relaxation. Code shrinkage may increase displacement to a
		// call/load/store target at a higher fixed address, invalidating an earlier
		// relaxation which must now be undone. Any change in section sizes can have
		// cascading effect and require another relaxation pass.
		static bool relaxOnce() {
		bool changed = false;
		for (OutputSection *osec : outputSections) {
		if (!(osec->flags & SHF_EXECINSTR))
		continue;
		for (InputSection sec : getInputSections(osec)) {
		auto &aux = *sec->relaxAux;
		int32_t delta = 0;
		for (auto &it : llvm::enumerate(sec->relocations)) {
		const Relocation &r = it.value();
		jrtc27Unsubmitted Done Reply Inline Actions Does this not need to be stable_sort to guarantee the zero-sized symbols have their anchors in the right order? Also, does the order of A's end vs B's start matter for this implementation? That should be documented (and, ideally, why). jrtc27: Does this not need to be stable_sort to guarantee the zero-sized symbols have their anchors in…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions llvm::sort is fine. The previous comment explains it. Improved the comment a bit. MaskRay: llvm::sort is fine. The previous comment explains it. Improved the comment a bit.
		// r.offset is adjusted by delta as previous relocations may have
		// removed content.
		const uint64_t loc = sec->outSecOff + r.offset + delta;
		int32_t &cur = aux.relocDeltas[it.index()];
		switch (r.type) {
		case R_RISCV_ALIGN: {
		const uint64_t nextLoc = loc + r.addend;
		const uint64_t align = PowerOf2Ceil(r.addend + 2);
		// Adjust delta by needed_nops_bytes - nops_bytes.
		delta += static_cast<int32_t>(((loc + align - 1) & -align) - nextLoc);
		assert(delta <= 0 && "R_RISCV_ALIGN needs expanding the content");
		break;
		}
		default:
		// TODO: handle call/jump/load/store/addr-arithmetic relaxation
		break;
		}
		jrtc27Unsubmitted Done Reply Inline Actions This seems like it belongs in generic code? jrtc27: This seems like it belongs in generic code?
		MaskRayAuthorUnsubmitted Done Reply Inline Actions This is RISC-V specific. Unless another supported architecture adds relaxation, this can stay here. MaskRay: This is RISC-V specific. Unless another supported architecture adds relaxation, this can stay…
		if (delta != cur) {
		cur = delta;
		changed = true;
		}
		}
		}
		}
		return changed;
		}

		void RISCV::relaxSections() const {
		if (config->relocatable)
		return;

		jrtc27Unsubmitted Done Reply Inline Actions Is this actually the original st_value? If you interleave relaxation with other adjustment of st_value, won't delta stay the same but then the "unrelaxed" st_value will be different? jrtc27: Is this actually the original st_value? If you interleave relaxation with other adjustment of…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions For most symbols, this is the original st_value. Linker script symbol assignments may rewrite st_value (and does not care about the original value). The code should be fine. MaskRay: For most symbols, this is the original st_value. Linker script symbol assignments may rewrite…
		int pass = 1;
		getSymbolAnchors();
		while (relaxOnce()) {
		script->assignAddresses();
		if (++pass >= 10) {
		errorOrWarn("relaxation does not converge");
		return;
		}
		luismarquesUnsubmitted Done Reply Inline Actions Can't we skip this for the first pass? luismarques: Can't we skip this for the first pass?
		MaskRayAuthorUnsubmitted Done Reply Inline Actions This code takes nearly no time. I don't think we should special case the first pass. MaskRay: This code takes nearly no time. I don't think we should special case the first pass.
		}
		log("relaxation passes: " + Twine(pass));
		if (pass == 1)
		luismarquesUnsubmitted Done Reply Inline Actions No test coverage for this? luismarques: No test coverage for this?
		luismarquesUnsubmitted Done Reply Inline Actions Nevermind. D127611. luismarques: Nevermind. D127611.
		return;

		for (OutputSection *osec : outputSections) {
		if (!(osec->flags & SHF_EXECINSTR))
		continue;
		jrtc27Unsubmitted Done Reply Inline Actions It might be nice to hoist this out to a separate function, it's quite nested here and this is the bit people care about editing to add new relaxations, so separating the "do the relaxations" part from all the tracking infrastructure would help there. jrtc27: It might be nice to hoist this out to a separate function, it's quite nested here and this is…
		for (InputSection sec : getInputSections(osec)) {
		RISCVRelaxAux &aux = *sec->relaxAux;
		if (aux.relocDeltas.empty())
		continue;

		// Delete ranges.
		auto &rels = sec->relocations;
		ArrayRef<uint8_t> old = sec->rawData;
		size_t newSize = old.size() + aux.relocDeltas.back();
		uint8_t *p = context().bAlloc.Allocate<uint8_t>(newSize);
		uint64_t offset = 0;
		int32_t delta = 0;
		sec->rawData = makeArrayRef(p, newSize);
		for (size_t i = 0, e = sec->relocations.size(); i != e; ++i) {
		int32_t inc = aux.relocDeltas[i] - delta;
		delta = aux.relocDeltas[i];
		if (inc == 0)
		continue;
		const Relocation &r = rels[i];
		memcpy(p, old.data() + offset, r.offset - offset);
		p += r.offset - offset;
		offset = r.offset - inc;
		}
		gkmUnsubmitted Done Reply Inline Actions `InputSectionBase::bytesDropped` is merely `uint8_t`, and feels vulnerable to overflow. The comment on the decl says it is intended for basic-block sections, for which 8 bits is reasonable, but this new use, it might be inadequate. Perhaps `uint16_t` ? gkm: `InputSectionBase::bytesDropped` is merely `uint8_t`, and feels vulnerable to overflow. The…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions Thanks! MaskRay: Thanks!
		memcpy(p, old.data() + offset, old.size() - offset);

		// Decrease symbol values and relocation offsets.
		size_t j = 0;
		delta = 0;
		for (SymbolAnchor &sa : aux.anchors) {
		luismarquesUnsubmitted Not Done Reply Inline Actions @MaskRay I ran into this error when building LLVM with LLD in a RISC-V host. I guess we actually need an int32? luismarques: @MaskRay I ran into this error when building LLVM with LLD in a RISC-V host. I guess we…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions This will increase the size of InputSection which we should try to avoid (memory usage increase). I am on a trip so cannot investigate it closely. It will help if you can ask the author of `--optimize-bb-jumps` whether it is still used. Removing `nopFiller` will make room for delta. MaskRay: This will increase the size of InputSection which we should try to avoid (memory usage…
		luismarquesUnsubmitted Not Done Reply Inline Actions It will help if you can ask the author of `--optimize-bb-jumps` whether it is still used. Removing `nopFiller` will make room for delta. @tmsriram any comments? luismarques: > It will help if you can ask the author of `--optimize-bb-jumps` whether it is still used.
		for (; j < rels.size() && rels[j].offset < sa.offset; ++j) {
		rels[j].offset += delta;
		delta = aux.relocDeltas[j];
		}
		if (sa.end)
		sa.d->size = sa.offset + delta - sa.d->value;
		else
		sa.d->value += delta;
		}
		for (; j < rels.size(); ++j) {
		rels[j].offset += delta;
		kito-chengUnsubmitted Done Reply Inline Actions I hit overflow here as @gkm concern, and the fixed by changing `bytesDropped` to `uint16_t` (yeah, I tested the uint8_t version), maybe we can put an `assert (delta <= numeric_limits<uint16_t>::max());` here to make sure this could catch earlier? I saw there are assertions for `byteDropped` in other place, so I think that should be reasonable? [kitoc@xxxx llvm-project]$ grpe bytesDropped * -R ... lld/ELF/InputSection.h: uint8_t bytesDropped = 0; lld/ELF/InputSection.h: assert(bytesDropped + num < 256); lld/ELF/InputSection.h: bytesDropped += num; lld/ELF/InputSection.h: assert(bytesDropped >= num); lld/ELF/InputSection.h: bytesDropped -= num; ... Gonna run second round of testing. kito-cheng: I hit overflow here as @gkm concern, and the fixed by changing `bytesDropped` to `uint16_t`…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions push_back/drop_back is a code problem of the basic block sections feature. I don't intend to touch the functions for this patch. Changed RISCV.cpp:611 instead. MaskRay: push_back/drop_back is a code problem of the basic block sections feature. I don't intend to…
		delta = aux.relocDeltas[j];
		}
		}
		}
		}

TargetInfo *elf::getRISCVTargetInfo() {		TargetInfo *elf::getRISCVTargetInfo() {
static RISCV target;		static RISCV target;
return &target;		return &target;
}		}
		jrtc27Unsubmitted Done Reply Inline Actions I don't know if it's a requirement that, say, 6 padding bytes be emitted as `nop; c.nop` rather than `c.nop; nop`. Does binutils make this assumption? jrtc27: I don't know if it's a requirement that, say, 6 padding bytes be emitted as `nop; c.nop` rather…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions I vaguely remember that GNU ld seems to use `nop; c.nop` (prefer long to short). This code should match its behavior. MaskRay: I vaguely remember that GNU ld seems to use `nop; c.nop` (prefer long to short). This code…

lld/ELF/InputSection.h

//===- InputSection.h -------------------------------------------*- C++ -*-===//

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

//===----------------------------------------------------------------------===//

#ifndef LLD_ELF_INPUT_SECTION_H

#define LLD_ELF_INPUT_SECTION_H

#include "Relocations.h"

#include "lld/Common/CommonLinkerContext.h"

#include "lld/Common/LLVM.h"

#include "lld/Common/Memory.h"

#include "llvm/ADT/CachedHashString.h"

#include "llvm/ADT/DenseSet.h"

#include "llvm/ADT/TinyPtrVector.h"

#include "llvm/Object/ELF.h"

namespace lld {

namespace elf {

▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines

protected:

constexpr SectionBase(Kind sectionKind, StringRef name, uint64_t flags,

uint32_t entsize, uint32_t alignment, uint32_t type,

uint32_t info, uint32_t link)

: name(name), sectionKind(sectionKind), bss(false), keepUnique(false),

alignment(alignment), flags(flags), entsize(entsize), type(type),

link(link), info(info) {}

};

struct SymbolAnchor {

peter.smithUnsubmitted

Done

It looks like the definitions are only used in RISCV.cpp as only a pointer is used in the union below. Could these be forward declared here?

I could be missing some use though.

peter.smith: It looks like the definitions are only used in RISCV.cpp as only a pointer is used in the union…

uint64_t offset;

Defined *d;

bool end; // true for the anchor of st_value+st_size

};

// Auxiliary information for RISC-V linker relaxation, attached to an

// InputSection.

struct RISCVRelaxAux {

// This records symbol start and end offsets which will be adjusted according

// to the nearest relocDeltas element.

SmallVector<SymbolAnchor, 0> anchors;

// For relocations[i], the actual offset is r_offset + relocDeltas[i-1].

SmallVector<int32_t, 0> relocDeltas;

};

// This corresponds to a section of an input file.

class InputSectionBase : public SectionBase {

public:

template <class ELFT>

InputSectionBase(ObjFile<ELFT> &file, const typename ELFT::Shdr &header,

StringRef name, Kind sectionKind);

InputSectionBase(InputFile *file, uint64_t flags, uint32_t type,

Show All 27 Lines

public:

// and shrinking a section.

uint8_t bytesDropped = 0;

// Whether the section needs to be padded with a NOP filler due to

// deleteFallThruJmpInsn.

bool nopFiller = false;

void drop_back(unsigned num) {

assert(bytesDropped + num < 256);

gkmUnsubmitted

Done

void drop_back(unsigned num) {

- assert(bytesDropped + num < 256);

+ assert(bytesDropped + num < numeric_limits<uint16_t>::max());

bytesDropped += num;

gkm:

MaskRayAuthorUnsubmitted

Done

push_back/drop_back is a code problem of the basic block sections feature. I don't intend to touch the functions for this patch.
Changed RISCV.cpp:611 instead.

MaskRay: push_back/drop_back is a code problem of the basic block sections feature. I don't intend to…

bytesDropped += num;

}

void push_back(uint64_t num) {

assert(bytesDropped >= num);

bytesDropped -= num;

}

▲ Show 20 Lines • Show All 44 Lines • ▼ Show 20 Lines

static uint64_t getRelocTargetVA(const InputFile *File, RelType Type,

int64_t A, uint64_t P, const Symbol &Sym,

RelExpr Expr);

// The native ELF reloc data type is not very convenient to handle.

// So we convert ELF reloc records to our own records in Relocations.cpp.

// This vector contains such "cooked" relocations.

SmallVector<Relocation, 0> relocations;

union {

// These are modifiers to jump instructions that are necessary when basic

peter.smithUnsubmitted

Done

update comment for relaxAux? I assume that the union is because we don't have relaxation and basic block sections simultaneously?

peter.smith: update comment for relaxAux? I assume that the union is because we don't have relaxation and…

// block sections are enabled. Basic block sections creates opportunities to

// block sections are enabled. Basic block sections creates opportunities

// relax jump instructions at basic block boundaries after reordering the

// to relax jump instructions at basic block boundaries after reordering the

// basic blocks.

JumpInstrMod *jumpInstrMod = nullptr;

RISCVRelaxAux *relaxAux;

};

// A function compiled with -fsplit-stack calling a function

// compiled without -fsplit-stack needs its prologue adjusted. Find

// such functions and adjust their prologues. This is very similar

// to relocation. See https://gcc.gnu.org/wiki/SplitStacks for more

// information.

template <typename ELFT>

void adjustSplitStackFunctionPrologues(uint8_t *buf, uint8_t *end);

▲ Show 20 Lines • Show All 189 Lines • Show Last 20 Lines

lld/ELF/InputSection.cpp

Show First 20 Lines • Show All 615 Lines • ▼ Show 20 Lines	uint64_t InputSectionBase::getRelocTargetVA(const InputFile *file, RelType type,
case R_ABS:		case R_ABS:
case R_DTPREL:		case R_DTPREL:
case R_RELAX_TLS_LD_TO_LE_ABS:		case R_RELAX_TLS_LD_TO_LE_ABS:
case R_RELAX_GOT_PC_NOPIC:		case R_RELAX_GOT_PC_NOPIC:
case R_RISCV_ADD:		case R_RISCV_ADD:
return sym.getVA(a);		return sym.getVA(a);
case R_ADDEND:		case R_ADDEND:
return a;		return a;
		case R_RELAX_HINT:
		return 0;
case R_ARM_SBREL:		case R_ARM_SBREL:
return sym.getVA(a) - getARMStaticBase(sym);		return sym.getVA(a) - getARMStaticBase(sym);
case R_GOT:		case R_GOT:
case R_RELAX_TLS_GD_TO_IE_ABS:		case R_RELAX_TLS_GD_TO_IE_ABS:
return sym.getGotVA() + a;		return sym.getGotVA() + a;
case R_GOTONLY_PC:		case R_GOTONLY_PC:
return in.got->getVA() + a - p;		return in.got->getVA() + a - p;
case R_GOTPLTONLY_PC:		case R_GOTPLTONLY_PC:
▲ Show 20 Lines • Show All 349 Lines • ▼ Show 20 Lines	for (size_t i = 0, size = relocations.size(); i != size; ++i) {
if (auto *sec = dyn_cast<InputSection>(this))		if (auto *sec = dyn_cast<InputSection>(this))
secAddr += sec->outSecOff;		secAddr += sec->outSecOff;
const uint64_t addrLoc = secAddr + offset;		const uint64_t addrLoc = secAddr + offset;
const uint64_t targetVA =		const uint64_t targetVA =
SignExtend64(getRelocTargetVA(file, rel.type, rel.addend, addrLoc,		SignExtend64(getRelocTargetVA(file, rel.type, rel.addend, addrLoc,
*rel.sym, rel.expr),		*rel.sym, rel.expr),
bits);		bits);
switch (rel.expr) {		switch (rel.expr) {
		case R_RELAX_HINT:
		continue;
case R_RELAX_GOT_PC:		case R_RELAX_GOT_PC:
case R_RELAX_GOT_PC_NOPIC:		case R_RELAX_GOT_PC_NOPIC:
target.relaxGot(bufLoc, rel, targetVA);		target.relaxGot(bufLoc, rel, targetVA);
break;		break;
case R_AARCH64_GOT_PAGE_PC:		case R_AARCH64_GOT_PAGE_PC:
if (i + 1 < size && aarch64relaxer.tryRelaxAdrpLdr(		if (i + 1 < size && aarch64relaxer.tryRelaxAdrpLdr(
rel, relocations[i + 1], secAddr, buf)) {		rel, relocations[i + 1], secAddr, buf)) {
++i;		++i;
▲ Show 20 Lines • Show All 464 Lines • Show Last 20 Lines

lld/ELF/Relocations.h

Show All 40 Lines	enum RelExpr {
R_GOTPLT,		R_GOTPLT,
R_GOTPLTREL,		R_GOTPLTREL,
R_GOTREL,		R_GOTREL,
R_NONE,		R_NONE,
R_PC,		R_PC,
R_PLT,		R_PLT,
R_PLT_PC,		R_PLT_PC,
R_PLT_GOTPLT,		R_PLT_GOTPLT,
		R_RELAX_HINT,
R_RELAX_GOT_PC,		R_RELAX_GOT_PC,
R_RELAX_GOT_PC_NOPIC,		R_RELAX_GOT_PC_NOPIC,
R_RELAX_TLS_GD_TO_IE,		R_RELAX_TLS_GD_TO_IE,
R_RELAX_TLS_GD_TO_IE_ABS,		R_RELAX_TLS_GD_TO_IE_ABS,
R_RELAX_TLS_GD_TO_IE_GOT_OFF,		R_RELAX_TLS_GD_TO_IE_GOT_OFF,
R_RELAX_TLS_GD_TO_IE_GOTPLT,		R_RELAX_TLS_GD_TO_IE_GOTPLT,
R_RELAX_TLS_GD_TO_LE,		R_RELAX_TLS_GD_TO_LE,
R_RELAX_TLS_GD_TO_LE_NEG,		R_RELAX_TLS_GD_TO_LE_NEG,
▲ Show 20 Lines • Show All 161 Lines • Show Last 20 Lines

lld/ELF/Relocations.cpp

	Show First 20 Lines • Show All 950 Lines • ▼ Show 20 Lines
	// will return true for such relocation.			// will return true for such relocation.
	//			//
	// If this function returns false, that means we need to emit a			// If this function returns false, that means we need to emit a
	// dynamic relocation so that the relocation will be fixed at load-time.			// dynamic relocation so that the relocation will be fixed at load-time.
	bool RelocationScanner::isStaticLinkTimeConstant(RelExpr e, RelType type,			bool RelocationScanner::isStaticLinkTimeConstant(RelExpr e, RelType type,
	const Symbol &sym,			const Symbol &sym,
	uint64_t relOff) const {			uint64_t relOff) const {
	// These expressions always compute a constant			// These expressions always compute a constant
	if (oneof<R_GOTPLT, R_GOT_OFF, R_MIPS_GOT_LOCAL_PAGE, R_MIPS_GOTREL,			if (oneof<R_GOTPLT, R_GOT_OFF, R_RELAX_HINT, R_MIPS_GOT_LOCAL_PAGE,
	R_MIPS_GOT_OFF, R_MIPS_GOT_OFF32, R_MIPS_GOT_GP_PC,			R_MIPS_GOTREL, R_MIPS_GOT_OFF, R_MIPS_GOT_OFF32, R_MIPS_GOT_GP_PC,
	R_AARCH64_GOT_PAGE_PC, R_GOT_PC, R_GOTONLY_PC, R_GOTPLTONLY_PC,			R_AARCH64_GOT_PAGE_PC, R_GOT_PC, R_GOTONLY_PC, R_GOTPLTONLY_PC,
	R_PLT_PC, R_PLT_GOTPLT, R_PPC32_PLTREL, R_PPC64_CALL_PLT,			R_PLT_PC, R_PLT_GOTPLT, R_PPC32_PLTREL, R_PPC64_CALL_PLT,
	R_PPC64_RELAX_TOC, R_RISCV_ADD, R_AARCH64_GOT_PAGE>(e))			R_PPC64_RELAX_TOC, R_RISCV_ADD, R_AARCH64_GOT_PAGE>(e))
	return true;			return true;

	// These never do, except if the entire file is position dependent or if			// These never do, except if the entire file is position dependent or if
	// only the low bits are used.			// only the low bits are used.
	if (e == R_GOT \|\| e == R_PLT)			if (e == R_GOT \|\| e == R_PLT)
	▲ Show 20 Lines • Show All 1,259 Lines • Show Last 20 Lines

lld/ELF/Target.h

Show First 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	virtual bool inBranchRange(RelType type, uint64_t src,
uint64_t dst) const;		uint64_t dst) const;

virtual void relocate(uint8_t *loc, const Relocation &rel,		virtual void relocate(uint8_t *loc, const Relocation &rel,
uint64_t val) const = 0;		uint64_t val) const = 0;
void relocateNoSym(uint8_t *loc, RelType type, uint64_t val) const {		void relocateNoSym(uint8_t *loc, RelType type, uint64_t val) const {
relocate(loc, Relocation{R_NONE, type, 0, 0, nullptr}, val);		relocate(loc, Relocation{R_NONE, type, 0, 0, nullptr}, val);
}		}

virtual void applyJumpInstrMod(uint8_t *loc, JumpModType type,		virtual void applyJumpInstrMod(uint8_t *loc, JumpModType type,
		peter.smithUnsubmitted Done Reply Inline Actions Will be worth a comment like `needsThunk` to describe what this does, just in case another architecture chooses to do RiscV like relaxations. peter.smith: Will be worth a comment like `needsThunk` to describe what this does, just in case another…
JumpModType val) const {}		JumpModType val) const {}

		virtual void relaxSections() const {}

virtual ~TargetInfo();		virtual ~TargetInfo();

// This deletes a jump insn at the end of the section if it is a fall thru to		// This deletes a jump insn at the end of the section if it is a fall thru to
// the next section. Further, if there is a conditional jump and a direct		// the next section. Further, if there is a conditional jump and a direct
// jump consecutively, it tries to flip the conditional jump to convert the		// jump consecutively, it tries to flip the conditional jump to convert the
// direct jump into a fall thru and delete it. Returns true if a jump		// direct jump into a fall thru and delete it. Returns true if a jump
// instruction can be deleted.		// instruction can be deleted.
virtual bool deleteFallThruJmpInsn(InputSection &is, InputFile *file,		virtual bool deleteFallThruJmpInsn(InputSection &is, InputFile *file,
▲ Show 20 Lines • Show All 223 Lines • Show Last 20 Lines

lld/ELF/Writer.cpp

Show First 20 Lines • Show All 1,624 Lines • ▼ Show 20 Lines	template <class ELFT> void Writer<ELFT>::finalizeAddressDependentContent() {
for (Partition &part : partitions)		for (Partition &part : partitions)
finalizeSynthetic(part.armExidx.get());		finalizeSynthetic(part.armExidx.get());
resolveShfLinkOrder();		resolveShfLinkOrder();

// Converts call x@GDPLT to call __tls_get_addr		// Converts call x@GDPLT to call __tls_get_addr
if (config->emachine == EM_HEXAGON)		if (config->emachine == EM_HEXAGON)
hexagonTLSSymbolUpdate(outputSections);		hexagonTLSSymbolUpdate(outputSections);

		target->relaxSections();

int assignPasses = 0;		int assignPasses = 0;
for (;;) {		for (;;) {
bool changed = target->needsThunks && tc.createThunks(outputSections);		bool changed = target->needsThunks && tc.createThunks(outputSections);

// With Thunk Size much smaller than branch range we expect to		// With Thunk Size much smaller than branch range we expect to
		peter.smithUnsubmitted Done Reply Inline Actions Although more source changes. Would it be cleaner to have the passes variable here, and pass it into createThunks as a parameter? peter.smith: Although more source changes. Would it be cleaner to have the passes variable here, and pass it…
		MaskRayAuthorUnsubmitted Done Reply Inline Actions ThunkCreator::pass needs to be retained, otherwise `uint32_t pass` needs to be threading though most of its member functions. MaskRay: ThunkCreator::pass needs to be retained, otherwise `uint32_t pass` needs to be threading though…
// converge quickly; if we get to 15 something has gone wrong.		// converge quickly; if we get to 15 something has gone wrong.
if (changed && tc.pass >= 15) {		if (changed && tc.pass >= 15) {
error("thunk creation not converged");		error("thunk creation not converged");
break;		break;
}		}

if (config->fixCortexA53Errata843419) {		if (config->fixCortexA53Errata843419) {
if (changed)		if (changed)
▲ Show 20 Lines • Show All 1,338 Lines • Show Last 20 Lines

lld/test/ELF/riscv-relax-align-rvc.s

This file was added.

				# REQUIRES: riscv

				# RUN: rm -rf %t && mkdir %t && cd %t

				# RUN: llvm-mc -filetype=obj -triple=riscv32 -mattr=+c,+relax %s -o 32.o
				# RUN: ld.lld -Ttext=0x10000 32.o -o 32
				# RUN: llvm-objdump -td --no-show-raw-insn -M no-aliases 32 \| FileCheck %s
				## R_RISCV_ALIGN is handled regarldess of --no-relax.
				# RUN: ld.lld -Ttext=0x10000 --no-relax 32.o -o 32.norelax
				# RUN: llvm-objdump -td --no-show-raw-insn -M no-aliases 32.norelax \| FileCheck %s

				# RUN: llvm-mc -filetype=obj -triple=riscv64 -mattr=+c,+relax %s -o 64.o
				# RUN: ld.lld -Ttext=0x10000 64.o -o 64
				# RUN: llvm-objdump -td --no-show-raw-insn -M no-aliases 64 \| FileCheck %s
				# RUN: ld.lld -Ttext=0x10000 --no-relax 64.o -o 64.norelax
				# RUN: llvm-objdump -td --no-show-raw-insn -M no-aliases 64.norelax \| FileCheck %s

				# CHECK-DAG: 00010002 l .text {{0*}}1e a
				# CHECK-DAG: 00010010 l .text {{0*}}22 b
				# CHECK-DAG: 00010012 l .text {{0*}}1e c
				# CHECK-DAG: 00010000 g .text {{0*}}34 _start

				# CHECK: <_start>:
				# CHECK-NEXT: c.addi a0, 1
				# CHECK-EMPTY:
				# CHECK-NEXT: <a>:
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: c.nop
				# CHECK-EMPTY:
				# CHECK-NEXT: <b>:
				# CHECK-NEXT: c.addi a0, 2
				# CHECK-EMPTY:
				# CHECK-NEXT: <c>:
				# CHECK-NEXT: c.addi a0, 3
				# CHECK-NEXT: c.unimp
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: c.nop
				# CHECK-NEXT: c.addi a0, 4
				# CHECK-NEXT: c.addi a0, 5
				# CHECK-NEXT: c.unimp
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: c.nop
				# CHECK-NEXT: c.addi a0, 6
				# CHECK-NEXT: c.addi a0, 7

				.global _start
				_start:
				c.addi a0, 1
				a:
				.balign 16
				b:
				c.addi a0, 2
				c:
				c.addi a0, 3
				.balign 32
				.size a, . - a
				c.addi a0, 4
				c.addi a0, 5
				.balign 16
				.size c, . - c
				c.addi a0, 6
				.size b, . - b
				c.addi a0, 7
				.size _start, . - _start

lld/test/ELF/riscv-relax-align.s

This file was added.

				# REQUIRES: riscv
				## Test that we can handle R_RISCV_ALIGN.

				# RUN: rm -rf %t && mkdir %t && cd %t

				# RUN: llvm-mc -filetype=obj -triple=riscv32 -mattr=+relax %s -o 32.o
				# RUN: ld.lld -Ttext=0x10000 32.o -o 32
				# RUN: llvm-objdump -td --no-show-raw-insn -M no-aliases 32 \| FileCheck %s
				## R_RISCV_ALIGN is handled regarldess of --no-relax.
				# RUN: ld.lld -Ttext=0x10000 --no-relax 32.o -o 32.norelax
				# RUN: llvm-objdump -td --no-show-raw-insn -M no-aliases 32.norelax \| FileCheck %s

				# RUN: llvm-mc -filetype=obj -triple=riscv64 -mattr=+relax %s -o 64.o
				# RUN: ld.lld -Ttext=0x10000 64.o -o 64
				# RUN: llvm-objdump -td --no-show-raw-insn -M no-aliases 64 \| FileCheck %s
				# RUN: ld.lld -Ttext=0x10000 --no-relax 64.o -o 64.norelax
				# RUN: llvm-objdump -td --no-show-raw-insn -M no-aliases 64.norelax \| FileCheck %s

				# CHECK-DAG: 00010004 l .text {{0*}}1c a
				# CHECK-DAG: 00010008 l .text {{0*}}28 b
				# CHECK-DAG: 00010014 l .text {{0*}}20 c
				# CHECK-DAG: 00010000 g .text {{0*}}38 _start

				# CHECK: <_start>:
				# CHECK-NEXT: addi a0, a0, 1
				# CHECK-EMPTY:
				# CHECK-NEXT: <a>:
				# CHECK-NEXT: addi a0, a0, 2
				# CHECK-EMPTY:
				# CHECK-NEXT: <b>:
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: 10010: addi a0, a0, 3
				# CHECK-EMPTY:
				# CHECK-NEXT: <c>:
				# CHECK-NEXT: addi a0, a0, 4
				# CHECK-NEXT: addi a0, a0, 5
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: 10020: addi a0, a0, 6
				# CHECK-NEXT: addi a0, a0, 7
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: 10030: addi a0, a0, 8
				# CHECK-NEXT: addi a0, a0, 9
				# CHECK: <e>:
				# CHECK-NEXT: addi a0, a0, 1
				# CHECK-EMPTY:
				# CHECK-NEXT: <f>:
				# CHECK-NEXT: 10044: addi a0, a0, 2
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: 10060: addi a0, a0, 3

				## _start-0x10070 = 0x10000-0x10070 = -112
				# CHECK: <.L1>:
				# CHECK-NEXT: 10070: auipc a0, 0
				# CHECK-NEXT: addi a0, a0, -112
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: addi zero, zero, 0
				# CHECK-NEXT: auipc a0, 0
				# CHECK-NEXT: addi a0, a0, -112

				.global _start
				_start:
				addi a0, a0, 1
				a:
				addi a0, a0, 2
				b:
				.balign 16
				addi a0, a0, 3
				c:
				addi a0, a0, 4
				addi a0, a0, 5
				.balign 32
				.size a, . - a
				addi a0, a0, 6
				addi a0, a0, 7
				.balign 16
				.size b, . - b
				addi a0, a0, 8
				.size c, . - c
				addi a0, a0, 9
				.size _start, . - _start

				## Test another text section.
				.section .text2,"ax",@progbits
				d:
				e:
				addi a0, a0, 1
				f:
				addi a0, a0, 2
				.balign 32
				.size d, . - d
				addi a0, a0, 3
				.size e, . - e
				.size f, . - f

				.section .pcrel,"ax",@progbits
				.L1:
				auipc a0, %pcrel_hi(_start)
				addi a0, a0, %pcrel_lo(.L1)
				.balign 16
				.L2:
				auipc a0, %pcrel_hi(_start)
				addi a0, a0, %pcrel_lo(.L1)

lld/test/ELF/riscv-reloc-align.s

This file was deleted.

	# REQUIRES: riscv

	# RUN: llvm-mc -filetype=obj -triple=riscv32 -mattr=+relax %s -o %t.o
	# RUN: not ld.lld %t.o -o /dev/null 2>&1 \| FileCheck %s

	# CHECK: relocation R_RISCV_ALIGN requires unimplemented linker relaxation

	.global _start
	_start:
	nop
	.balign 8
	nop

This is an archive of the discontinued LLVM Phabricator instance.

[ELF] Relax R_RISCV_ALIGNClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 436170

lld/ELF/Arch/RISCV.cpp

lld/ELF/InputSection.h

lld/ELF/InputSection.cpp

lld/ELF/Relocations.h

lld/ELF/Relocations.cpp

lld/ELF/Target.h

lld/ELF/Writer.cpp

lld/test/ELF/riscv-relax-align-rvc.s

lld/test/ELF/riscv-relax-align.s

lld/test/ELF/riscv-reloc-align.s

[ELF] Relax R_RISCV_ALIGN
ClosedPublic