This is an archive of the discontinued LLVM Phabricator instance.


Differential D21225

[LLD][ARM] Add initial support for Thumb for ARMv7A
Closed, Public

Authored by peter.smith on Jun 10 2016, 4:48 AM.


Details

Reviewers

zatrazz
ruiu
rafael

Commits

rGd56b171ee965: [lld][ELF] Support for R_ARM_THM_JUMP8

Summary

Add support for the R_ARM_THM relocations used in the objects present
in arm-linux-gnueabihf-gcc. These are:

R_ARM_THM_CALL
R_ARM_THM_JUMP11
R_ARM_THM_JUMP19
R_ARM_THM_JUMP24
R_ARM_THM_MOVT_ABS
R_ARM_THM_MOVW_ABS_NC

Interworking between ARM and Thumb is partially supported with BLX. The R_ARM_CALL relocation for ARM instructions and R_ARM_THM_CALL relocation for Thumb instructions will write out a BL or BLX depending on the state of the Target.

Assumptions:

Target processor has BLX instruction and extended range of Thumb 4-byte Branch instructions (true for ARMv7a).
In relocateOne, if (Val & 0x1) == 1 the target is Thumb; if it is 0 the target is ARM. This will hold for objects that comply with the ABI for the ARM architecture.

This is sufficient interworking support for a Thumb hello world to work with a recent arm-linux-gnueabihf distribution (such as Linaro GCC 5.3-2016.02).
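
As an illustration of the BL/BLX selection described above, here is a minimal C++14 sketch for the ARM-state R_ARM_CALL case. The helper name encodeArmCall is hypothetical and is not part of the patch. It assumes Val is the already computed branch offset with bit 0 set for a Thumb target, and it leaves out the overflow check.

```cpp
#include <cstdint>

// Hypothetical helper, not taken from the patch. Insn is the original
// instruction word (a BL or BLX), Val the branch offset whose bit 0 marks a
// Thumb target. Returns the instruction word to write back.
constexpr uint32_t encodeArmCall(uint32_t Insn, int64_t Val) {
  const uint32_t V = static_cast<uint32_t>(Val);
  if (V & 1) {
    // Thumb target: BLX is 1111 101H imm24, with H carrying bit 1 of Val.
    return 0xfa000000u | ((V & 2) << 23) | ((V >> 2) & 0x00ffffffu);
  }
  // ARM target: a BLX in the input becomes an unconditional BL (0xeb);
  // an existing BL keeps its condition field.
  if ((Insn & 0xfe000000u) == 0xfa000000u)
    Insn = 0xeb000000u;
  return (Insn & ~0x00ffffffu) | ((V >> 2) & 0x00ffffffu);
}

// Thumb target, H clear and H set.
static_assert(encodeArmCall(0xeb000000u, 0x101) == 0xfa000040u, "BLX, H=0");
static_assert(encodeArmCall(0xeb000000u, 0x103) == 0xfb000040u, "BLX, H=1");
// ARM target reached from a BLX: rewritten to BL.
static_assert(encodeArmCall(0xfa000000u, 0x100) == 0xeb000040u, "BL");
```

The static_asserts double as a usage example: the same offset yields a BLX or a BL purely on the basis of bit 0.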

Limitations:

No interworking thunks for R_ARM_JUMP24, R_ARM_THM_JUMP24, R_ARM_THM_JUMP19 and the deprecated R_ARM_PLT32 and R_ARM_PC24 relocations, as the instructions cannot be written out as a BLX and need a state-change thunk.
No range extension thunks. The R_ARM_JUMP24 and R_ARM_THM_CALL relocations have a range of 16 MB.
No regression test in the test suite for R_ARM_THM_JUMP11. llvm-mc does not emit a relocation for B.N, which is sensible and permissible under the ABI as the range of the branch is too small to be usefully relocated. However, gnu-as does produce the relocation, and there is at least one instance in the arm-linux-gnueabihf libraries, so it may be present in input objects.

References:

The Thumb instruction encodings are unfortunately not pretty: the 4-byte instructions are encoded as a consecutive pair of halfwords. The encodings are defined in the ARM Architecture Reference Manual: http://infocenter.arm.com/help/topic/com.arm.doc.ddi0406c/index.html (requires free registration)
The relocations are defined in ELF for the ARM Architecture: http://infocenter.arm.com/help/topic/com.arm.doc.ihi0044f/IHI0044F_aaelf.pdf (no registration required)
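
To make the "consecutive pair of halfwords" point concrete, the following C++14 sketch encodes and decodes the Thumb BL (encoding T1) offset S:I1:I2:imm10:imm11:0. The struct and function names are hypothetical. The decoder here sign-extends from bit 24 and is written independently of the patch's getImplicitAddend, so treat it as an illustration of the encoding rather than as the patch's code.

```cpp
#include <cstdint>

// Hypothetical container for the two halfwords of a 4-byte Thumb branch.
struct ThumbBranch {
  uint16_t Hi; // first halfword in memory
  uint16_t Lo; // second halfword in memory
};

// Encode a BL (T1) with the given even offset, assumed to fit in 25 bits.
constexpr ThumbBranch encodeThumbBL(int32_t Offset) {
  const uint32_t V = static_cast<uint32_t>(Offset);
  const uint32_t S = (V >> 24) & 1;
  const uint32_t I1 = (V >> 23) & 1;
  const uint32_t I2 = (V >> 22) & 1;
  // I1 = NOT(J1 EOR S) and I2 = NOT(J2 EOR S), so the inverse is symmetric.
  const uint32_t J1 = (I1 ^ S) ^ 1;
  const uint32_t J2 = (I2 ^ S) ^ 1;
  return {static_cast<uint16_t>(0xf000u | (S << 10) | ((V >> 12) & 0x03ffu)),
          static_cast<uint16_t>(0xd000u | (J1 << 13) | (J2 << 11) |
                                ((V >> 1) & 0x07ffu))};
}

// Recover the signed offset from the two halfwords.
constexpr int32_t decodeThumbBL(ThumbBranch B) {
  const uint32_t S = (B.Hi >> 10) & 1u;
  const uint32_t I1 = (((B.Lo >> 13) & 1u) ^ S) ^ 1u;
  const uint32_t I2 = (((B.Lo >> 11) & 1u) ^ S) ^ 1u;
  const uint32_t V = (S << 24) | (I1 << 23) | (I2 << 22) |
                     ((B.Hi & 0x03ffu) << 12) | ((B.Lo & 0x07ffu) << 1);
  // Sign-extend from bit 24 using 64-bit arithmetic to avoid overflow.
  return static_cast<int32_t>(static_cast<int64_t>(V ^ 0x01000000u) -
                              0x01000000);
}

static_assert(encodeThumbBL(0).Hi == 0xf000 && encodeThumbBL(0).Lo == 0xf800,
              "BL with zero offset");
static_assert(encodeThumbBL(-4).Hi == 0xf7ff && encodeThumbBL(-4).Lo == 0xfffe,
              "BL to itself");
static_assert(decodeThumbBL(encodeThumbBL(-4)) == -4, "round trip");
```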

I will look into interworking thunks next.

Diff Detail

Event Timeline

peter.smith updated this revision to Diff 60336. Jun 10 2016, 4:48 AM

peter.smith retitled this revision to [LLD][ARM] Add initial support for Thumb for ARMv7A.

peter.smith updated this object.

peter.smith added reviewers: ruiu, rafael, zatrazz.

peter.smith added a subscriber: rengolin.

Herald added a subscriber: aemerson. Jun 10 2016, 4:48 AM

ruiu added inline comments. Jun 10 2016, 10:58 AM

ELF/Target.cpp
1557	Target is Thumb. Use a BLX. ?
1558	'0' -> '1'?
1564	Remove `else` after `break`.
1567	It is not 0xeb but 0xeb000000, no? If so, please add a test to cover this path.
1596	I'd just write 1 instead of 0x1 (and so is 2 and 0x2)
1678–1679	This is probably a personal preference, but I'd name Hi and Lo because they are shorter.

Updated patch to address review comments:

Corrected ARM BL encoding.
Added test cases for ARM BLX to ARM and Thumb BLX to Thumb.
Some name and comment changes.

Thank you for the comments. I've updated with a new revision.

ELF/Target.cpp
1557	I've gone for. // If bit 0 of Val is 1 the target is Thumb, we must select a BLX.
1558	Yes in our case the bottom bit of Val is '1'. I've made the change. I took imm24:H:0 from the Architecture reference manual as that is how the immediate is processed by the CPU.
1564	Ok, will remove.
1567	You are correct, it should be 0xeb000000. Apologies for missing this one and test cases. The BLX from Thumb to Thumb is also missing a test case, although the encoding is ok in that case. I will add test cases to arm-blx.s and arm-thumb-blx.s that tests that a BLX from ARM state to ARM state results in a correctly encoded BL instruction.
1596	Ok, will make that change.
1678–1679	Changed to Hi and Lo, with only 80 columns and long expressions it makes sense.

ruiu added inline comments. Jun 13 2016, 3:05 PM

ELF/Target.cpp
1561	nit: indentation
1565	I think you want to check if the most significant byte is 0xFA. So this expression needs to be (read32le(Loc) & 0xff000000) == 0xfa000000 (or equivalently (read32le(Loc) >> 24) == 0xfa), no?
1600	Can we use alignTo here? Val = alignTo(Val, 4);
1695–1696	Format

Updated for review comments:

Alter mask when matching BLX instruction
Use alignTo instead of raw expression
clang-format run over patch

What code sequence causes gas to produce a R_ARM_THM_JUMP11? Maybe include a .o test with a comment about how it was created?

ELF/Target.cpp
1577	So the jump 11 target has one implicit 0 at the end but jump 19 has 2 (hence checkInt<12> and checkInt<21>)
1599	Not sure this part of the comment ("It is equivalent to") adds a lot of value.
1675	This is the same as SignExtend64<11> without the shift, no?
1680	And this can use SignExtend64<19>?
test/ELF/Inputs/arm-thumb-blx-targets.s
11	Can you change the name of the sections to include the relocation it is testing?

rafael added inline comments. Jun 14 2016, 5:51 AM

ELF/Target.cpp
1472	So, are all of these really R_PLT_PC and not R_PC? If the target symbol is not preemptible we don't create a plt entry and just use the target address. But if the target symbol is preemptible we will create a plt and that will have a call in it. That will break code that is not expecting the link register to change, no?

Trying to get phab to send email.

Hello,

I've managed to get emails from phabricator, will take a look straight away.

Thanks for the comments.

Peter

I've put some answers inline. Will post an update when I've written a test for R_ARM_JUMP11. I was under the impression that binary objects were frowned upon. Will aim to have an updated patch tomorrow.

To get an R_ARM_JUMP11 the narrow encoding of B needs to be forced with B.N.
.text
.syntax unified
.thumb
B.N external_symbol

ELF/Target.cpp
1472	The normal ARM PLT entries don't corrupt the link register. The lazy loading entry saves the link register on entry, to be restored by _dl_runtime_resolve, so it is safe to branch to a PLT entry. Arguably R_ARM_THM_JUMP11 is not R_PLT_PC, as a linker is not required to generate a PLT entry for it. The range of the branch is likely to be too short to reach the PLT entry, so I think that could be split out. At present, interworking that requires thunks, like a Thumb branch to an ARM PLT entry, is broken, as the B can't be changed to a BLX. I'm hoping to address this case with basic interworking thunks in a forthcoming review. From ELF for the ARM Architecture: R_ARM_CALL, R_ARM_THM_CALL, R_ARM_JUMP24, R_ARM_THM_JUMP24 and R_ARM_THM_JUMP19 may be subject to PLT generation. R_ARM_PC24 and R_ARM_PLT32 are deprecated but still occur in the old library I was using to test the ARM port. These relocation types pre-date the BLX instruction, so they can be used on both BL and B instructions, hence the mapping to R_ARM_JUMP24. They were split into two relocation types: R_ARM_CALL (unconditional BL and BLX) and R_ARM_JUMP24 (conditional BL and B).
1561	Thanks for spotting, now fixed.
1565	Not quite the most significant byte, although I think that 0xfa000000 == 0xfa000000 is not quite right either. Reading from the most significant bit, the BLX encoding is 1111 101H (where H is part of the immediate, so it can be 0 or 1) and the BL encoding is cond 1011, where cond is not 1111 (0xf is never). ARM reused the never space to encode more instructions. Given that R_ARM_CALL can only be used on a BL or BLX, and assuming it is correctly used by the object producer, it would be sufficient to use 0xf0000000 == 0xf0000000. I have a weak preference to match as many of the fixed bits in the BLX instruction as possible, as it helps match what is written in the architecture manual. This should make the condition 0xfe000000 == 0xfa000000. If you prefer 0xf0000000 == 0xf0000000, which is enough to discriminate BLX from BL, I'm happy to change.
1577	Yes. It is unfortunately not always easy to tell from the number in the relocation what overflow check is needed. My understanding, and distant memory from the time, was that the number represents the size of the immediate in the instruction. From the ARM Architecture Reference Manual: Thumb B.N (for R_ARM_THM_JUMP11): imm32 = SignExtend(imm11:'0', 32); Thumb B<cond>.W (for R_ARM_THM_JUMP19): imm32 = SignExtend(S:J2:J1:imm6:imm11:'0', 32). I make S:J2:J1:imm6:imm11 20 bits and not 19, but S could be considered the sign and not part of the 19.
1599	I can remove it if it isn't helpful. The reason I put it in was to avoid confusion with the pseudocode for BLX in the ARM Architecture Reference Manual: if targetInstrSet == InstrSet_ARM then targetAddress = Align(PC,4) + imm32; where Align(PC, 4) rounds down and not up like alignTo.
1600	Yes we can. Will use that in future wherever I can.
1675	Yes I think you're right there. Will test to make sure and change.
1680	Yes I think so, with the appropriate modifications to the shifts.
1695–1696	I've run clang-format on the file and used its indentation.
test/ELF/Inputs/arm-thumb-blx-targets.s
11	Yes, will make the change.
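
The mask discussion for line 1565 can be checked with a few constant expressions. This is a sketch only; the three predicate names are hypothetical and exist purely to compare the candidate masks mentioned in the thread against sample instruction words.

```cpp
#include <cstdint>

// Matches all fixed bits of ARM BLX (immediate): 1111 101H imm24.
constexpr bool matchesBlxFixedBits(uint32_t Insn) {
  return (Insn & 0xfe000000u) == 0xfa000000u;
}

// Compares the whole top byte against 0xfa, so H is treated as fixed.
constexpr bool topByteIsFa(uint32_t Insn) {
  return (Insn & 0xff000000u) == 0xfa000000u;
}

// Tests only the condition field for 1111 (the "never" space).
constexpr bool condIsNever(uint32_t Insn) {
  return (Insn & 0xf0000000u) == 0xf0000000u;
}

// BLX with H = 0 and H = 1 both match the 0xfe000000 mask.
static_assert(matchesBlxFixedBits(0xfa000000u), "BLX, H=0");
static_assert(matchesBlxFixedBits(0xfb000000u), "BLX, H=1");
// The whole-byte comparison misses a BLX whose H bit is set.
static_assert(!topByteIsFa(0xfb000000u), "H=1 is not 0xfa");
// An unconditional BL (cond = 1110) is rejected by both remaining tests.
static_assert(!matchesBlxFixedBits(0xeb000000u), "BL is not BLX");
static_assert(!condIsNever(0xeb000000u), "BL has a real condition");
```

Under the stated assumption that R_ARM_CALL is only ever applied to a BL or BLX, condIsNever and matchesBlxFixedBits agree; the wider mask simply mirrors more of the encoding in the manual.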

Almost looking good. A few nits.

ELF/Target.cpp
1565	`0xfe000000 == 0xfa000000` seems fine to me.
1569	nit: indent with two more spaces.
1599	I'd remove that part "It is equivalent to..." since the code is now obvious with alignTo().
1604	nit: two more spaces.

This revision is now accepted and ready to land. Jun 14 2016, 9:15 AM

Added patch to address Rafael and Rui's last comments:

Added test case for R_ARM_THM_JUMP11. I will add a binary version of the Inputs/arm-thumb-narrow-branch.s assembled by the GNU assembler as well as the source.
Formatting and comment clean ups.
R_ARM_THM_JUMP11 uses R_PC not R_PLT_PC as a linker does not have to generate a PLT entry for it according to ELF for the ARM architecture.
I have not changed the SignExtend for R_ARM_THM_JUMP11 and R_ARM_THM_JUMP19, as the shift is important to the correctness of the expression. The shift can be moved outside the SignExtend, but I think having it inside is a closer mapping to the architecture description.
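
For readers relating the relocation names to the overflow checks in the diff (checkInt<12>, checkInt<21> and checkInt<25>), here is a sketch of the underlying range test. fitsInt is a hypothetical stand-in written for this example; it only models the "fits in a signed N-bit field" condition and does not report errors.

```cpp
#include <cstdint>

// True if V is representable as a signed N-bit two's complement value.
template <unsigned N> constexpr bool fitsInt(int64_t V) {
  return V >= -(int64_t(1) << (N - 1)) && V < (int64_t(1) << (N - 1));
}

// R_ARM_THM_JUMP11: imm11:'0' is 12 bits, roughly +-2 KiB.
static_assert(fitsInt<12>(2046) && fitsInt<12>(-2048), "in range");
static_assert(!fitsInt<12>(2048), "out of range");
// R_ARM_THM_JUMP19: S:J2:J1:imm6:imm11:'0' is 21 bits, roughly +-1 MiB.
static_assert(fitsInt<21>(-1048576) && !fitsInt<21>(1048576), "JUMP19");
// R_ARM_THM_CALL / R_ARM_THM_JUMP24: 25 bits, roughly +-16 MiB.
static_assert(fitsInt<25>(16777214) && !fitsInt<25>(16777216), "CALL");
```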

I'll wait till at least tomorrow morning for any more comments.

LGTM

LGTM too.

Thank you.

Committed revision 272881

ELF/Target.cpp
1569	Ok done. The original is where clang-format puts it, but I prefer the indented form as well.
1599	Ok, I've removed that part.
1604	Ok done. The original is where clang-format puts it, but I prefer the indented form as well.
1675	Turns out that it isn't quite the same as the shift is needed to multiply by two. It is the same as (SignExtend64<11>(read16le(Buf) & 0x07ff) << 1) but I'm not sure it is worth making that change so I've left it as it is.
1680	As with R_ARM_THM_JUMP11, the shift << 1 can be moved outside the SignExtend64<19> like so:
	(SignExtend64<19>(((Hi & 0x0400) << 9) | // S
	((Lo & 0x0800) << 7) | // J2
	((Lo & 0x2000) << 4) | // J1
	((Hi & 0x003f) << 11) | // imm6
	(Lo & 0x07ff)) << 1)
	I've not made this change as I think the original is easier to relate to the instruction description in the reference manual.
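
The equivalence discussed for line 1675 (shift inside versus outside the sign extension) can be verified exhaustively for the 11-bit B.N immediate. In this sketch signExtend64 is a local helper assumed to behave like the SignExtend64<B> used in the diff, and the two addend functions are hypothetical names. The multiplication by two stands in for the outer << 1 so that no negative value is left-shifted.

```cpp
#include <cstdint>

// Local stand-in: sign-extend the low B bits of X to 64 bits.
template <unsigned B> constexpr int64_t signExtend64(uint64_t X) {
  return static_cast<int64_t>(X << (64 - B)) >> (64 - B);
}

// Shift inside the sign extension, as the diff writes it.
constexpr int64_t addendShiftInside(uint16_t Insn) {
  return signExtend64<12>(uint64_t(Insn & 0x07ff) << 1);
}

// Shift (as a multiplication by two) outside the sign extension.
constexpr int64_t addendShiftOutside(uint16_t Insn) {
  return signExtend64<11>(Insn & 0x07ff) * 2;
}

// Compare both forms for every possible imm11 value.
constexpr bool agreeForAllImm11() {
  for (uint32_t I = 0; I < 0x800; ++I)
    if (addendShiftInside(static_cast<uint16_t>(I)) !=
        addendShiftOutside(static_cast<uint16_t>(I)))
      return false;
  return true;
}

static_assert(agreeForAllImm11(), "both forms give the same addend");
// 0xe7fe is a B.N that branches to itself: addend -4.
static_assert(addendShiftInside(0xe7fe) == -4, "b.n to self");
```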

phosek added a commit: rGd56b171ee965: [lld][ELF] Support for R_ARM_THM_JUMP8. Nov 11 2021, 9:07 AM

Herald added subscribers: MaskRay, kristof.beyls, emaste. Nov 11 2021, 9:07 AM

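Revision Contents

Path	Size
ELF/Target.cpp	125 lines
test/ELF/Inputs/arm-thumb-blx-targets.s	36 lines
test/ELF/Inputs/arm-thumb-narrow-branch.o	(binary)
test/ELF/Inputs/arm-thumb-narrow-branch.s	18 lines
test/ELF/Inputs/far-arm-thumb-abs.s	24 lines
test/ELF/arm-blx.s	113 lines
test/ELF/arm-mov-relocs.s	6 lines
test/ELF/arm-thumb-blx.s	85 lines
test/ELF/arm-thumb-branch-error.s	19 lines
test/ELF/arm-thumb-branch.s	59 lines
test/ELF/arm-thumb-narrow-branch-check.s	72 lines
test/ELF/arm-thumb-plt-reloc.s	101 lines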

Diff 60809

ELF/Target.cpp

... 1,454 lines hidden ...
ARMTargetInfo::ARMTargetInfo() {
PltEntrySize = 16;		PltEntrySize = 16;
PltZeroSize = 20;		PltZeroSize = 20;
}		}

RelExpr ARMTargetInfo::getRelExpr(uint32_t Type, const SymbolBody &S) const {		RelExpr ARMTargetInfo::getRelExpr(uint32_t Type, const SymbolBody &S) const {
switch (Type) {		switch (Type) {
default:		default:
return R_ABS;		return R_ABS;
		case R_ARM_THM_JUMP11:
		return R_PC;
case R_ARM_CALL:		case R_ARM_CALL:
case R_ARM_JUMP24:		case R_ARM_JUMP24:
case R_ARM_PC24:		case R_ARM_PC24:
case R_ARM_PLT32:		case R_ARM_PLT32:
		case R_ARM_THM_JUMP19:
		case R_ARM_THM_JUMP24:
		case R_ARM_THM_CALL:
return R_PLT_PC;		return R_PLT_PC;
case R_ARM_GOTOFF32:		case R_ARM_GOTOFF32:
// (S + A) - GOT_ORG		// (S + A) - GOT_ORG
return R_GOTREL;		return R_GOTREL;
case R_ARM_GOT_BREL:		case R_ARM_GOT_BREL:
// GOT(S) + A - GOT_ORG		// GOT(S) + A - GOT_ORG
return R_GOT_OFF;		return R_GOT_OFF;
case R_ARM_GOT_PREL:		case R_ARM_GOT_PREL:
// GOT(S) + - GOT_ORG		// GOT(S) + - GOT_ORG
... 65 lines hidden ...
void ARMTargetInfo::relocateOne(uint8_t *Loc, uint32_t Type,
case R_ARM_REL32:		case R_ARM_REL32:
write32le(Loc, Val);		write32le(Loc, Val);
break;		break;
case R_ARM_PREL31:		case R_ARM_PREL31:
checkInt<31>(Val, Type);		checkInt<31>(Val, Type);
write32le(Loc, (read32le(Loc) & 0x80000000) | (Val & ~0x80000000));		write32le(Loc, (read32le(Loc) & 0x80000000) | (Val & ~0x80000000));
break;		break;
case R_ARM_CALL:		case R_ARM_CALL:
		// R_ARM_CALL is used for BL and BLX instructions, depending on the
		// value of bit 0 of Val, we must select a BL or BLX instruction
		if (Val & 1) {
		// If bit 0 of Val is 1 the target is Thumb, we must select a BLX.
		// The BLX encoding is 0xfa:H:imm24 where Val = imm24:H:'1'
		checkInt<26>(Val, Type);
		write32le(Loc, 0xfa000000 | // opcode
		((Val & 2) << 23) | // H
		((Val >> 2) & 0x00ffffff)); // imm24
		break;
		}
		if ((read32le(Loc) & 0xfe000000) == 0xfa000000)
		// BLX (always unconditional) instruction to an ARM Target, select an
		// unconditional BL.
		write32le(Loc, 0xeb000000 | (read32le(Loc) & 0x00ffffff));
		// fall through as BL encoding is shared with B
case R_ARM_JUMP24:		case R_ARM_JUMP24:
case R_ARM_PC24:		case R_ARM_PC24:
case R_ARM_PLT32:		case R_ARM_PLT32:
checkInt<26>(Val, Type);		checkInt<26>(Val, Type);
write32le(Loc, (read32le(Loc) & ~0x00ffffff) | ((Val >> 2) & 0x00ffffff));		write32le(Loc, (read32le(Loc) & ~0x00ffffff) | ((Val >> 2) & 0x00ffffff));
break;		break;
		case R_ARM_THM_JUMP11:
		checkInt<12>(Val, Type);
		write16le(Loc, (read32le(Loc) & 0xf800) | ((Val >> 1) & 0x07ff));
		break;
		case R_ARM_THM_JUMP19:
		// Encoding T3: Val = S:J2:J1:imm6:imm11:0
		checkInt<21>(Val, Type);
		write16le(Loc,
		(read16le(Loc) & 0xfbc0) | // opcode cond
		((Val >> 10) & 0x0400) | // S
		((Val >> 12) & 0x003f)); // imm6
		write16le(Loc + 2,
		0x8000 | // opcode
		((Val >> 8) & 0x0800) | // J2
		((Val >> 5) & 0x2000) | // J1
		((Val >> 1) & 0x07ff)); // imm11
		break;
		case R_ARM_THM_CALL:
		// R_ARM_THM_CALL is used for BL and BLX instructions, depending on the
		// value of bit 0 of Val, we must select a BL or BLX instruction
		if ((Val & 1) == 0) {
		// Ensure BLX destination is 4-byte aligned. As BLX instruction may
		// only be two byte aligned. This must be done before overflow check
		Val = alignTo(Val, 4);
		}
		// Bit 12 is 0 for BLX, 1 for BL
		write16le(Loc + 2, (read16le(Loc + 2) & ~0x1000) | (Val & 1) << 12);
		// Fall through as rest of encoding is the same as B.W
		case R_ARM_THM_JUMP24:
		// Encoding B T4, BL T1, BLX T2: Val = S:I1:I2:imm10:imm11:0
		// FIXME: Use of I1 and I2 require v6T2ops
		checkInt<25>(Val, Type);
		write16le(Loc,
		0xf000 | // opcode
		((Val >> 14) & 0x0400) | // S
		((Val >> 12) & 0x03ff)); // imm10
		write16le(Loc + 2,
		(read16le(Loc + 2) & 0xd000) | // opcode
		(((~(Val >> 10)) ^ (Val >> 11)) & 0x2000) | // J1
		(((~(Val >> 11)) ^ (Val >> 13)) & 0x0800) | // J2
		((Val >> 1) & 0x07ff)); // imm11
		break;
case R_ARM_MOVW_ABS_NC:		case R_ARM_MOVW_ABS_NC:
write32le(Loc, (read32le(Loc) & ~0x000f0fff) | ((Val & 0xf000) << 4) |		write32le(Loc, (read32le(Loc) & ~0x000f0fff) | ((Val & 0xf000) << 4) |
(Val & 0x0fff));		(Val & 0x0fff));
break;		break;
case R_ARM_MOVT_ABS:		case R_ARM_MOVT_ABS:
checkUInt<32>(Val, Type);		checkUInt<32>(Val, Type);
write32le(Loc, (read32le(Loc) & ~0x000f0fff) |		write32le(Loc, (read32le(Loc) & ~0x000f0fff) |
(((Val >> 16) & 0xf000) << 4) | ((Val >> 16) & 0xfff));		(((Val >> 16) & 0xf000) << 4) | ((Val >> 16) & 0xfff));
break;		break;
		case R_ARM_THM_MOVT_ABS:
		// Encoding T1: A = imm4:i:imm3:imm8
		checkUInt<32>(Val, Type);
		write16le(Loc,
		0xf2c0 | // opcode
		((Val >> 17) & 0x0400) | // i
		((Val >> 28) & 0x000f)); // imm4
		write16le(Loc + 2,
		(read16le(Loc + 2) & 0x8f00) | // opcode
		((Val >> 12) & 0x7000) | // imm3
		((Val >> 16) & 0x00ff)); // imm8
		break;
		case R_ARM_THM_MOVW_ABS_NC:
		// Encoding T3: A = imm4:i:imm3:imm8
		write16le(Loc,
		0xf240 | // opcode
		((Val >> 1) & 0x0400) | // i
		((Val >> 12) & 0x000f)); // imm4
		write16le(Loc + 2,
		(read16le(Loc + 2) & 0x8f00) | // opcode
		((Val << 4) & 0x7000) | // imm3
		(Val & 0x00ff)); // imm8
		break;
default:		default:
fatal("unrecognized reloc " + Twine(Type));		fatal("unrecognized reloc " + Twine(Type));
}		}
}		}

uint64_t ARMTargetInfo::getImplicitAddend(const uint8_t *Buf,		uint64_t ARMTargetInfo::getImplicitAddend(const uint8_t *Buf,
uint32_t Type) const {		uint32_t Type) const {
switch (Type) {		switch (Type) {
default:		default:
return 0;		return 0;
case R_ARM_ABS32:		case R_ARM_ABS32:
case R_ARM_BASE_PREL:		case R_ARM_BASE_PREL:
case R_ARM_GOTOFF32:		case R_ARM_GOTOFF32:
case R_ARM_GOT_BREL:		case R_ARM_GOT_BREL:
case R_ARM_GOT_PREL:		case R_ARM_GOT_PREL:
case R_ARM_REL32:		case R_ARM_REL32:
return SignExtend64<32>(read32le(Buf));		return SignExtend64<32>(read32le(Buf));
case R_ARM_PREL31:		case R_ARM_PREL31:
return SignExtend64<31>(read32le(Buf));		return SignExtend64<31>(read32le(Buf));
case R_ARM_CALL:		case R_ARM_CALL:
case R_ARM_JUMP24:		case R_ARM_JUMP24:
case R_ARM_PC24:		case R_ARM_PC24:
case R_ARM_PLT32:		case R_ARM_PLT32:
return SignExtend64<26>((read32le(Buf) & 0x00ffffff) << 2);		return SignExtend64<26>((read32le(Buf) & 0x00ffffff) << 2);
case R_ARM_MOVW_ABS_NC:		case R_ARM_THM_JUMP11:
case R_ARM_MOVT_ABS: {		return SignExtend64<12>((read16le(Buf) & 0x07ff) << 1);
		case R_ARM_THM_JUMP19: {
		// Encoding T3: A = S:J2:J1:imm6:imm11:0
		uint16_t Hi = read16le(Buf);
		uint16_t Lo = read16le(Buf + 2);
		return SignExtend64<20>(((Hi & 0x0400) << 10) | // S
		((Lo & 0x0800) << 8) | // J2
		((Lo & 0x2000) << 5) | // J1
		((Hi & 0x003f) << 12) | // imm6
		((Lo & 0x07ff) << 1)); // imm11:0
		}
		case R_ARM_THM_JUMP24:
		case R_ARM_THM_CALL: {
		// Encoding B T4, BL T1, BLX T2: A = S:I1:I2:imm10:imm11:0
		// I1 = NOT(J1 EOR S), I2 = NOT(J2 EOR S)
		// FIXME: I1 and I2 require v6T2ops
		uint16_t Hi = read16le(Buf);
		uint16_t Lo = read16le(Buf + 2);
		return SignExtend64<24>(((Hi & 0x0400) << 14) | // S
		(~((Lo ^ (Hi << 3)) << 10) & 0x00800000) | // I1
		(~((Lo ^ (Hi << 1)) << 11) & 0x00400000) | // I2
		((Hi & 0x003ff) << 12) | // imm10
		((Lo & 0x007ff) << 1)); // imm11:0
		}
// ELF for the ARM Architecture 4.6.1.1 the implicit addend for MOVW and		// ELF for the ARM Architecture 4.6.1.1 the implicit addend for MOVW and
// MOVT is in the range -32768 <= A < 32768		// MOVT is in the range -32768 <= A < 32768
		case R_ARM_MOVW_ABS_NC:
		case R_ARM_MOVT_ABS: {
uint64_t Val = read32le(Buf) & 0x000f0fff;		uint64_t Val = read32le(Buf) & 0x000f0fff;
return SignExtend64<16>(((Val & 0x000f0000) >> 4) | (Val & 0x00fff));		return SignExtend64<16>(((Val & 0x000f0000) >> 4) | (Val & 0x00fff));
}		}
		case R_ARM_THM_MOVW_ABS_NC:
		case R_ARM_THM_MOVT_ABS: {
		// Encoding T3: A = imm4:i:imm3:imm8
		uint16_t Hi = read16le(Buf);
		uint16_t Lo = read16le(Buf + 2);
		return SignExtend64<16>(((Hi & 0x000f) << 12) | // imm4
		((Hi & 0x0400) << 1) | // i
		((Lo & 0x7000) >> 4) | // imm3
		(Lo & 0x00ff)); // imm8
		}
}		}
}		}

template <class ELFT> MipsTargetInfo<ELFT>::MipsTargetInfo() {		template <class ELFT> MipsTargetInfo<ELFT>::MipsTargetInfo() {
GotPltHeaderEntriesNum = 2;		GotPltHeaderEntriesNum = 2;
PageSize = 65536;		PageSize = 65536;
PltEntrySize = 16;		PltEntrySize = 16;
PltZeroSize = 32;		PltZeroSize = 32;
... 310 lines hidden ...

test/ELF/Inputs/arm-thumb-blx-targets.s

This file was added.

				.syntax unified
				.arm
				.section .R_ARM_CALL24_callee_low, "ax",%progbits
				.align 2
				.globl callee_low
				.type callee_low,%function
				callee_low:
				bx lr

				.section .R_ARM_CALL24_callee_thumb_low, "ax",%progbits
				.balign 0x100
				rafael: Can you change the name of the sections to include the relocation it is testing?
				peter.smith: Yes, will make the change.
				.thumb
				.type callee_thumb_low,%function
				.globl callee_thumb_low
				callee_thumb_low:
				bx lr

				.section .R_ARM_CALL24_callee_high, "ax",%progbits
				.balign 0x100
				.arm
				.globl callee_high
				.type callee_high,%function
				callee_high:
				bx lr

				.section .R_ARM_CALL24_callee_thumb_high, "ax",%progbits
				.balign 0x100
				.thumb
				.type callee_thumb_high,%function
				.globl callee_thumb_high
				callee_thumb_high:
				bx lr

				.globl blx_far
				.type blx_far, %function
				blx_far = 0x1010018

test/ELF/Inputs/arm-thumb-narrow-branch.o

This binary file was added.

test/ELF/Inputs/arm-thumb-narrow-branch.s

This file was added.

				// This input must be assembled by the GNU assembler, as llvm-mc does not emit
				// the R_ARM_THM_JUMP11 relocation for a Thumb narrow branch. This is permissible
				// by the ABI for the ARM architecture as the range of the Thumb narrow branch
				// is short enough (+- 2048 bytes) that widespread use would be impractical.
				//
				// The test case will use a precompiled object arm-thumb-narrow-branch.o
				.syntax unified
				.section .caller, "ax",%progbits
				.thumb
				.align 2
				.type callers,%function
				.globl callers
				callers:
				b.n callee_low_far
				b.n callee_low
				b.n callee_high
				b.n callee_high_far
				bx lr

test/ELF/Inputs/far-arm-thumb-abs.s

This file was added.

				.global far_cond
				.type far_cond,%function
				far_cond = 0x110023
				.global far_uncond
				.type far_uncond,%function
				far_uncond = 0x101001b

				.global too_far1
				.type too_far1,%function
				too_far1 = 0x1020005
				.global too_far2
				.type too_far2,%function
				too_far2 = 0x1020009
				.global too_far3
				.type too_far3,%function
				too_far3 = 0x12000d

				.global blx_far
				.type blx_far, %function
				blx_far = 0x2010025

				.global blx_far2
				.type blx_far2, %function
				blx_far2 = 0x2010029

test/ELF/arm-blx.s

This file was added.

				// RUN: llvm-mc -filetype=obj -triple=armv7a-none-linux-gnueabi %s -o %t
				// RUN: llvm-mc -filetype=obj -triple=armv7a-none-linux-gnueabi %S/Inputs/far-arm-thumb-abs.s -o %tfar
				// RUN: echo "SECTIONS { \
				// RUN: .callee1 : { *(.callee_low) } \
				// RUN: .callee2 : { *(.callee_arm_low) } \
				// RUN: .caller : { *(.text) } \
				// RUN: .callee3 : { *(.callee_high) } \
				// RUN: .callee4 : { *(.callee_arm_high) } } " > %t.script
				// RUN: ld.lld --script %t.script %t %tfar -o %t2 2>&1
				// RUN: llvm-objdump -d -triple=armv7a-none-linux-gnueabi %t2 | FileCheck -check-prefix=CHECK-ARM %s
				// RUN: llvm-objdump -d -triple=thumbv7a-none-linux-gnueabi %t2 | FileCheck -check-prefix=CHECK-THUMB %s
				// REQUIRES: arm

				// Test BLX instruction is chosen for ARM BL/BLX instruction and Thumb callee
				// Using two callees to ensure at least one has 2-byte alignment.
				.syntax unified
				.thumb
				.section .callee_low, "ax",%progbits
				.align 2
				.type callee_low,%function
				callee_low:
				bx lr
				.type callee_low2, %function
				callee_low2:
				bx lr

				.section .callee_arm_low, "ax",%progbits
				.arm
				.balign 0x100
				.type callee_arm_low,%function
				.align 2
				callee_arm_low:
				bx lr

				.section .text, "ax",%progbits
				.arm
				.globl _start
				.balign 0x10000
				.type _start,%function
				_start:
				bl callee_low
				blx callee_low
				bl callee_low2
				blx callee_low2
				bl callee_high
				blx callee_high
				bl callee_high2
				blx callee_high2
				bl blx_far
				blx blx_far2
				// blx to ARM instruction should be written as a BL
				bl callee_arm_low
				blx callee_arm_low
				bl callee_arm_high
				blx callee_arm_high
				bx lr

				.section .callee_high, "ax",%progbits
				.balign 0x100
				.thumb
				.type callee_high,%function
				callee_high:
				bx lr
				.type callee_high2,%function
				callee_high2:
				bx lr

				.section .callee_arm_high, "ax",%progbits
				.arm
				.balign 0x100
				.type callee_arm_high,%function
				callee_arm_high:
				bx lr

				// CHECK-THUMB: Disassembly of section .callee1:
				// CHECK-THUMB-NEXT: callee_low:
				// CHECK-THUMB-NEXT: b4: 70 47 bx lr
				// CHECK-THUMB: callee_low2:
				// CHECK-THUMB-NEXT: b6: 70 47 bx lr

				// CHECK-ARM: Disassembly of section .callee2:
				// CHECK-ARM-NEXT: callee_arm_low:
				// CHECK-ARM-NEXT: 100: 1e ff 2f e1 bx lr

				// CHECK-ARM: Disassembly of section .caller:
				// CHECK-ARM-NEXT: _start:
				// CHECK-ARM-NEXT: 10000: 2b c0 ff fa blx #-65364 <callee_low>
				// CHECK-ARM-NEXT: 10004: 2a c0 ff fa blx #-65368 <callee_low>
				// CHECK-ARM-NEXT: 10008: 29 c0 ff fb blx #-65370 <callee_low2>
				// CHECK-ARM-NEXT: 1000c: 28 c0 ff fb blx #-65374 <callee_low2>
				// CHECK-ARM-NEXT: 10010: 3a 00 00 fa blx #232 <callee_high>
				// CHECK-ARM-NEXT: 10014: 39 00 00 fa blx #228 <callee_high>
				// CHECK-ARM-NEXT: 10018: 38 00 00 fb blx #226 <callee_high2>
				// CHECK-ARM-NEXT: 1001c: 37 00 00 fb blx #222 <callee_high2>
				// 10020 + 1FFFFFC + 8 = 0x2010024 = blx_far
				// CHECK-ARM-NEXT: 10020: ff ff 7f fa blx #33554428
				// 10024 + 1FFFFFC + 8 = 0x2010028 = blx_far2
				// CHECK-ARM-NEXT: 10024: ff ff 7f fa blx #33554428
				// CHECK-ARM-NEXT: 10028: 34 c0 ff eb bl #-65328 <callee_arm_low>
				// CHECK-ARM-NEXT: 1002c: 33 c0 ff eb bl #-65332 <callee_arm_low>
				// CHECK-ARM-NEXT: 10030: 72 00 00 eb bl #456 <callee_arm_high>
				// CHECK-ARM-NEXT: 10034: 71 00 00 eb bl #452 <callee_arm_high>
				// CHECK-ARM-NEXT: 10038: 1e ff 2f e1 bx lr

				// CHECK-THUMB: Disassembly of section .callee3:
				// CHECK-THUMB: callee_high:
				// CHECK-THUMB-NEXT: 10100: 70 47 bx lr
				// CHECK-THUMB: callee_high2:
				// CHECK-THUMB-NEXT: 10102: 70 47 bx lr

				// CHECK-ARM: Disassembly of section .callee4:
				// CHECK-ARM-NEXT: callee_arm_high:
				// CHECK-ARM-NEXT: 10200: 1e ff 2f e1 bx lr
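
The address arithmetic in the check comments above (for example `10020 + 1FFFFFC + 8 = 0x2010024`) can be reproduced with a small decoder. This is a rough sketch under stated assumptions: `armBlxTarget` is a hypothetical helper that is not part of the patch, and it only handles the ARM BLX (immediate) encoding, where the H bit supplies bit 1 of the offset.

```cpp
#include <cstdint>

// Hypothetical helper: destination of an ARM-state BLX (immediate) located at
// Addr. The branch is relative to PC = Addr + 8 and the offset is imm24:H:0.
uint64_t armBlxTarget(uint64_t Addr, uint32_t Insn) {
  int64_t Imm24 = Insn & 0x00ffffff;
  if (Imm24 & 0x800000) // sign-extend the 24-bit field
    Imm24 -= 0x1000000;
  int64_t H = (Insn >> 24) & 1; // selects a 2-byte aligned Thumb target
  return Addr + 8 + Imm24 * 4 + H * 2;
}
```

With the bytes from the listing read as little-endian words, `armBlxTarget(0x10020, 0xfa7fffff)` gives 0x2010024 (blx_far without its Thumb bit) and `armBlxTarget(0x10008, 0xfbffc029)` gives 0xb6 (callee_low2).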

test/ELF/arm-mov-relocs.s

				// RUN: llvm-mc -filetype=obj -triple=armv7a-unknown-linux-gnueabi %s -o %t
				// RUN: ld.lld %t -o %t2
				// RUN: llvm-objdump -d %t2 -triple=armv7a-unknown-linux-gnueabi | FileCheck %s
				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-unknown-linux-gnueabi %s -o %t3
				// RUN: ld.lld %t3 -o %t4
				// RUN: llvm-objdump -d %t4 -triple=thumbv7a-unknown-linux-gnueabi | FileCheck %s
				// REQUIRES: arm

				// Test the R_ARM_MOVW_ABS_NC and R_ARM_MOVT_ABS relocations as well as
				// the R_ARM_THM_MOVW_ABS_NC and R_ARM_THM_MOVT_ABS relocations.
				.syntax unified
				.globl _start
				_start:
				.section .R_ARM_MOVW_ABS_NC, "ax",%progbits
				movw r0, :lower16:label
				movw r1, :lower16:label1
				movw r2, :lower16:label2 + 4
				movw r3, :lower16:label3

test/ELF/arm-thumb-blx.s

This file was added.

				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-none-linux-gnueabi %s -o %t
				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-none-linux-gnueabi %S/Inputs/arm-thumb-blx-targets.s -o %ttarget
				// RUN: echo "SECTIONS { \
				// RUN: .R_ARM_CALL24_callee1 : { *(.R_ARM_CALL24_callee_low) } \
				// RUN: .R_ARM_CALL24_callee2 : { *(.R_ARM_CALL24_callee_thumb_low) } \
				// RUN: .caller : { *(.text) } \
				// RUN: .R_ARM_CALL24_callee3 : { *(.R_ARM_CALL24_callee_high) } \
				// RUN: .R_ARM_CALL24_callee4 : { *(.R_ARM_CALL24_callee_thumb_high) } } " > %t.script
				// RUN: ld.lld --script %t.script %t %ttarget -o %t2 2>&1
				// RUN: llvm-objdump -d -triple=thumbv7a-none-linux-gnueabi %t2 | FileCheck -check-prefix=CHECK-THUMB %s
				// RUN: llvm-objdump -d -triple=armv7a-none-linux-gnueabi %t2 | FileCheck -check-prefix=CHECK-ARM %s
				// REQUIRES: arm
				// Test that a BLX instruction is chosen for a Thumb BL/BLX instruction with an
				// ARM callee. 2-byte nops are used to test the PC-rounding behaviour, as the
				// target of a BLX from a 2-byte aligned address is defined as
				// Align(PC,4) + immediate:00
				// FIXME: llvm-mc has problems assembling BLX unless the destination is
				// external. The targets of the BL and BLX instructions are in arm-thumb-blx-targets.s
				.syntax unified
				.section .text, "ax",%progbits
				.thumb
				.globl _start
				.balign 0x10000
				.type _start,%function
				_start:
				blx callee_low
				nop
				bl callee_low
				nop
				blx callee_high
				nop
				bl callee_high
				nop
				blx blx_far
				nop
				bl blx_far
				nop
				// Expect BLX to thumb target to be written out as a BL
				blx callee_thumb_low
				nop
				blx callee_thumb_high
				bx lr

				// CHECK-ARM: Disassembly of section .R_ARM_CALL24_callee1:
				// CHECK-ARM-NEXT: callee_low:
				// CHECK-ARM-NEXT: b4: 1e ff 2f e1 bx lr

				// CHECK-THUMB: Disassembly of section .R_ARM_CALL24_callee2:
				// CHECK-THUMB-NEXT: callee_thumb_low:
				// CHECK-THUMB-NEXT: 100: 70 47 bx lr

				// CHECK-THUMB: Disassembly of section .caller:
				// CHECK-THUMB: _start:
				// Align(0x10000,4) - 0xff50 (65360) + 4 = 0xb4 = callee_low
				// CHECK-THUMB-NEXT: 10000: f0 f7 58 e8 blx #-65360
				// CHECK-THUMB-NEXT: 10004: 00 bf nop
				// Align(0x10006,4) - 0xff54 (65364) + 4 = 0xb4 = callee_low
				// CHECK-THUMB-NEXT: 10006: f0 f7 56 e8 blx #-65364
				// CHECK-THUMB-NEXT: 1000a: 00 bf nop
				// Align(0x1000c,4) + 0xf0 (240) + 4 = 0x10100 = callee_high
				// CHECK-THUMB-NEXT: 1000c: 00 f0 78 e8 blx #240
				// CHECK-THUMB-NEXT: 10010: 00 bf nop
				// Align(0x10012,4) + 0xec (236) + 4 = 0x10100 = callee_high
				// CHECK-THUMB-NEXT: 10012: 00 f0 76 e8 blx #236
				// CHECK-THUMB-NEXT: 10016: 00 bf nop
				// Align(0x10018,4) + 0xfffffc (16777212) = 0x1010018 = blx_far
				// CHECK-THUMB-NEXT: 10018: ff f3 fe c7 blx #16777212
				// CHECK-THUMB-NEXT: 1001c: 00 bf nop
				// Align(0x1001e,4) + 0xfffff8 (16777208) = 0x1010018 = blx_far
				// CHECK-THUMB-NEXT: 1001e: ff f3 fc c7 blx #16777208
				// CHECK-THUMB-NEXT: 10022: 00 bf nop
				// 10024 - 0xff28 (65320) + 4 = 0x100 = callee_thumb_low
				// CHECK-THUMB-NEXT: 10024: f0 f7 6c f8 bl #-65320
				// CHECK-THUMB-NEXT: 10028: 00 bf nop
				// 1002a + 0x1d2 (466) + 4 = 0x10200 = callee_thumb_high
				// CHECK-THUMB-NEXT: 1002a: 00 f0 e9 f8 bl #466
				// CHECK-THUMB-NEXT: 1002e: 70 47 bx lr


				// CHECK-ARM: Disassembly of section .R_ARM_CALL24_callee3:
				// CHECK-ARM-NEXT: callee_high:
				// CHECK-ARM-NEXT: 10100: 1e ff 2f e1 bx lr

				// CHECK-THUMB: Disassembly of section .R_ARM_CALL24_callee4:
				// CHECK-THUMB-NEXT: callee_thumb_high:
				// CHECK-THUMB-NEXT: 10200: 70 47 bx lr
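
The `Align(PC,4)` arithmetic in the comments above can be sketched as a standalone decoder for the Thumb BL (T1) and BLX (T2) encodings. This is an illustration under stated assumptions, not code from the patch: `thumbCallTarget` is a hypothetical name, and the instruction is passed in as its two halfwords in the order they appear in memory.

```cpp
#include <cstdint>

// Hypothetical helper: destination of a Thumb BL or BLX located at Addr.
// Offset = SignExtend(S:I1:I2:imm10:imm11:0), with I1 = NOT(J1 EOR S) and
// I2 = NOT(J2 EOR S). For BLX the base is Align(PC,4), for BL it is PC.
uint64_t thumbCallTarget(uint64_t Addr, uint16_t Hi, uint16_t Lo) {
  uint32_t S = (Hi >> 10) & 1;
  uint32_t J1 = (Lo >> 13) & 1;
  uint32_t J2 = (Lo >> 11) & 1;
  uint32_t I1 = !(J1 ^ S);
  uint32_t I2 = !(J2 ^ S);
  int64_t Offset = (S << 24) | (I1 << 23) | (I2 << 22) |
                   ((Hi & 0x3ffu) << 12) | ((Lo & 0x7ffu) << 1);
  if (S) // sign-extend from bit 24
    Offset -= int64_t{1} << 25;
  uint64_t PC = Addr + 4;               // Thumb PC reads as instruction + 4
  bool IsBlx = ((Lo >> 12) & 1) == 0;   // bit 12 clear selects BLX
  if (IsBlx)
    PC &= ~uint64_t{3};                 // Align(PC, 4)
  return PC + Offset;
}
```

Taking the BLX at 0x10006 (halfwords 0xf7f0, 0xe856), the PC of 0x1000a is rounded down to 0x10008 before the offset of -65364 is applied, which lands on callee_low at 0xb4. The BL at 0x1002a (halfwords 0xf000, 0xf8e9) is not rounded and reaches callee_thumb_high at 0x10200.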

test/ELF/arm-thumb-branch-error.s

This file was added.

				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-none-linux-gnueabi %s -o %t
				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-none-linux-gnueabi %S/Inputs/far-arm-thumb-abs.s -o %tfar
				// RUN: not ld.lld %t %tfar -o %t2 2>&1 | FileCheck %s
				// REQUIRES: arm
				.syntax unified
				.section .text, "ax",%progbits
				.globl _start
				.balign 0x10000
				.type _start,%function
				_start:
				// The addresses of the too_far symbols are just out of range of the Thumb
				// branches that reference them: BL and B.W reach +-16 MiB, B<cond>.W +-1 MiB
				bl too_far1
				b too_far2
				beq.w too_far3

				// CHECK: R_ARM_THM_CALL out of range
				// CHECK-NEXT: R_ARM_THM_JUMP24 out of range
				// CHECK-NEXT: R_ARM_THM_JUMP19 out of range

test/ELF/arm-thumb-branch.s

This file was added.

				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-none-linux-gnueabi %s -o %t
				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-none-linux-gnueabi %S/Inputs/far-arm-thumb-abs.s -o %tfar
				// RUN: echo "SECTIONS { \
				// RUN: .callee1 : { *(.callee_low) } \
				// RUN: .caller : { *(.text) } \
				// RUN: .callee2 : { *(.callee_high) } } " > %t.script
				// RUN: ld.lld --script %t.script %t %tfar -o %t2 2>&1
				// RUN: llvm-objdump -d -triple=thumbv7a-none-linux-gnueabi %t2 | FileCheck %s
				// REQUIRES: arm

				.syntax unified
				.thumb
				.section .callee_low, "ax",%progbits
				.align 2
				.type callee_low,%function
				callee_low:
				bx lr

				.section .text, "ax",%progbits
				.globl _start
				.balign 0x10000
				.type _start,%function
				_start:
				bl callee_low
				b callee_low
				beq callee_low
				bl callee_high
				b callee_high
				bne callee_high
				bl far_uncond
				b far_uncond
				bgt far_cond
				bx lr

				.section .callee_high, "ax",%progbits
				.align 2
				.type callee_high,%function
				callee_high:
				bx lr

				// CHECK: Disassembly of section .callee1:
				// CHECK-NEXT: callee_low:
				// CHECK-NEXT: b4: 70 47 bx lr
				// CHECK-NEXT: Disassembly of section .caller:
				// CHECK-NEXT: _start:
				// CHECK-NEXT: 10000: f0 f7 58 f8 bl #-65360
				// CHECK-NEXT: 10004: f0 f7 56 b8 b.w #-65364
				// CHECK-NEXT: 10008: 30 f4 54 a8 beq.w #-65368
				// CHECK-NEXT: 1000c: 00 f0 0c f8 bl #24
				// CHECK-NEXT: 10010: 00 f0 0a b8 b.w #20
				// CHECK-NEXT: 10014: 40 f0 08 80 bne.w #16
				// CHECK-NEXT: 10018: ff f3 ff d7 bl #16777214
				// CHECK-NEXT: 1001c: ff f3 fd 97 b.w #16777210
				// CHECK-NEXT: 10020: 3f f3 ff af bgt.w #1048574
				// CHECK-NEXT: 10024: 70 47 bx lr
				// CHECK-NEXT: 10026: 00 00 movs r0, r0
				// CHECK-NEXT: Disassembly of section .callee2:
				// CHECK-NEXT: callee_high:
				// CHECK-NEXT: 10028: 70 47 bx lr

test/ELF/arm-thumb-narrow-branch-check.s

This file was added.

				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-none-linux-gnueabi %s -o %t
				// RUN: echo "SECTIONS { \
				// RUN: .R_ARM_PC11_1 : { *(.R_ARM_PC11_1) } \
				// RUN: .caller : { *(.caller) } \
				// RUN: .R_ARM_PC11_2 : { *(.R_ARM_PC11_2) } \
				// RUN: .text : { *(.text) } } " > %t.script
				// RUN: ld.lld --script %t.script %t %S/Inputs/arm-thumb-narrow-branch.o -o %t2 2>&1
				// RUN: llvm-objdump -d -triple=thumbv7a-none-linux-gnueabi %t2 | FileCheck %s
				// REQUIRES: arm

				// Test the R_ARM_THM_JUMP11 relocation, which is used with the narrow encoding
				// of B.N. The source of these relocations is a binary file,
				// arm-thumb-narrow-branch.o, which has been assembled with the GNU assembler.
				// llvm-mc doesn't emit the relocation, as the range of +-2048 bytes is too
				// small to be practically useful for out of section branches.
				.syntax unified

				.global callee_low_far
				.type callee_low_far,%function
				callee_low_far = 0x809

				.section .R_ARM_PC11_1,"ax",%progbits
				.thumb
				.balign 0x1000
				.type callee_low,%function
				.globl callee_low
				callee_low:
				bx lr

				.text
				.align 2
				.thumb
				.globl _start
				.type _start, %function
				_start:
				bl callers
				bx lr

				.section .R_ARM_PC11_2,"ax",%progbits
				.thumb
				.align 2
				.type callee_high,%function
				.globl callee_high
				callee_high:
				bx lr

				.global callee_high_far
				.type callee_high_far,%function
				callee_high_far = 0x180d

				// CHECK: Disassembly of section .R_ARM_PC11_1:
				// CHECK-NEXT: callee_low:
				// CHECK-NEXT: 1000: 70 47 bx lr
				// CHECK-NEXT: Disassembly of section .caller:
				// CHECK-NEXT: callers:
				// 1004 - 0x800 (2048) + 4 = 0x808 = callee_low_far
				// CHECK-NEXT: 1004: 00 e4 b #-2048
				// 1006 - 0xa (10) + 4 = 0x1000 = callee_low
				// CHECK-NEXT: 1006: fb e7 b #-10
				// 1008 + 4 + 4 = 0x1010 = callee_high
				// CHECK-NEXT: 1008: 02 e0 b #4
				// 100a + 0x7fe (2046) + 4 = 0x180c = callee_high_far
				// CHECK-NEXT: 100a: ff e3 b #2046
				// CHECK-NEXT: 100c: 70 47 bx lr
				// CHECK-NEXT: 100e: 00 bf nop
				// CHECK-NEXT: Disassembly of section .R_ARM_PC11_2:
				// CHECK-NEXT: callee_high:
				// CHECK-NEXT: 1010: 70 47 bx lr
				// CHECK-NEXT: Disassembly of section .text:
				// CHECK-NEXT: _start:
				// CHECK-NEXT: 1014: ff f7 f6 ff bl #-20
				// CHECK-NEXT: 1018: 70 47 bx lr
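
The narrow-branch targets worked out in the comments above follow from the 11-bit immediate of the 16-bit B encoding. A minimal sketch, assuming the unconditional encoding T2 and using the hypothetical name `thumbNarrowBranchTarget` (not part of the patch):

```cpp
#include <cstdint>

// Hypothetical helper: destination of a 16-bit Thumb B (encoding T2) at Addr.
// Offset = SignExtend(imm11:0), relative to PC = Addr + 4.
uint64_t thumbNarrowBranchTarget(uint64_t Addr, uint16_t Insn) {
  int64_t Offset = (Insn & 0x7ff) << 1;
  if (Offset & 0x800) // sign-extend the 12-bit value
    Offset -= 0x1000;
  return Addr + 4 + Offset;
}
```

The four branches in `callers` decode to 0x808, 0x1000, 0x1010 and 0x180c, which are callee_low_far, callee_low, callee_high and callee_high_far with the Thumb bit cleared. The first and last sit at the extremes of the range, -2048 and +2046 bytes.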

test/ELF/arm-thumb-plt-reloc.s

This file was added.

				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-none-linux-gnueabi %p/Inputs/arm-plt-reloc.s -o %t1
				// RUN: llvm-mc -filetype=obj -triple=thumbv7a-none-linux-gnueabi %s -o %t2
				// RUN: ld.lld %t1 %t2 -o %t
				// RUN: llvm-objdump -triple=thumbv7a-none-linux-gnueabi -d %t | FileCheck %s
				// RUN: ld.lld -shared %t1 %t2 -o %t3
				// RUN: llvm-objdump -triple=thumbv7a-none-linux-gnueabi -d %t3 | FileCheck -check-prefix=DSOTHUMB %s
				// RUN: llvm-objdump -triple=armv7a-none-linux-gnueabi -d %t3 | FileCheck -check-prefix=DSOARM %s
				// RUN: llvm-readobj -s -r %t3 | FileCheck -check-prefix=DSOREL %s
				// REQUIRES: arm
				//
				// Test PLT entry generation
				.syntax unified
				.text
				.align 2
				.globl _start
				.type _start,%function
				_start:
				// FIXME: Interworking is currently only supported for BL via BLX. Once
				// interworking thunks are available for b.w and b<cond>.w, this test can be
				// altered to cover the different forms of interworking.
				bl func1
				bl func2
				bl func3

				// Executable, expect no PLT
				// CHECK: Disassembly of section .text:
				// CHECK-NEXT: func1:
				// CHECK-NEXT: 11000: 70 47 bx lr
				// CHECK: func2:
				// CHECK-NEXT: 11002: 70 47 bx lr
				// CHECK: func3:
				// CHECK-NEXT: 11004: 70 47 bx lr
				// CHECK-NEXT: 11006: 00 00 movs r0, r0
				// CHECK: _start:
				// 11008 + 4 -12 = 0x11000 = func1
				// CHECK-NEXT: 11008: ff f7 fa ff bl #-12
				// 1100c + 4 -14 = 0x11002 = func2
				// CHECK-NEXT: 1100c: ff f7 f9 ff bl #-14
				// 11010 + 4 -16 = 0x11004 = func3
				// CHECK-NEXT: 11010: ff f7 f8 ff bl #-16

				// Expect PLT entries as symbols can be preempted
				// .text is Thumb and .plt is ARM, llvm-objdump can currently only disassemble
				// as ARM or Thumb. Work around by disassembling twice.
				// DSOTHUMB: Disassembly of section .text:
				// DSOTHUMB: func1:
				// DSOTHUMB-NEXT: 1000: 70 47 bx lr
				// DSOTHUMB: func2:
				// DSOTHUMB-NEXT: 1002: 70 47 bx lr
				// DSOTHUMB: func3:
				// DSOTHUMB-NEXT: 1004: 70 47 bx lr
				// DSOTHUMB-NEXT: 1006: 00 00 movs r0, r0
				// DSOTHUMB: _start:
				// 0x1008 + 0x28 + 4 = 0x1034 = PLT func1
				// DSOTHUMB-NEXT: 1008: 00 f0 14 e8 blx #40
				// 0x100c + 0x34 + 4 = 0x1044 = PLT func2
				// DSOTHUMB-NEXT: 100c: 00 f0 1a e8 blx #52
				// 0x1010 + 0x40 + 4 = 0x1054 = PLT func3
				// DSOTHUMB-NEXT: 1010: 00 f0 20 e8 blx #64
				// DSOARM: Disassembly of section .plt:
				// DSOARM: .plt:
				// DSOARM-NEXT: 1020: 04 e0 2d e5 str lr, [sp, #-4]!
				// DSOARM-NEXT: 1024: 04 e0 9f e5 ldr lr, [pc, #4]
				// DSOARM-NEXT: 1028: 0e e0 8f e0 add lr, pc, lr
				// DSOARM-NEXT: 102c: 08 f0 be e5 ldr pc, [lr, #8]!
				// DSOARM-NEXT: 1030: d0 1f 00 00
				// 0x1028 + 8 + 1fd0 = 0x3000
				// DSOARM-NEXT: 1034: 04 c0 9f e5 ldr r12, [pc, #4]
				// DSOARM-NEXT: 1038: 0f c0 8c e0 add r12, r12, pc
				// DSOARM-NEXT: 103c: 00 f0 9c e5 ldr pc, [r12]
				// DSOARM-NEXT: 1040: cc 1f 00 00
				// 0x1038 + 8 + 1fcc = 0x300c
				// DSOARM-NEXT: 1044: 04 c0 9f e5 ldr r12, [pc, #4]
				// DSOARM-NEXT: 1048: 0f c0 8c e0 add r12, r12, pc
				// DSOARM-NEXT: 104c: 00 f0 9c e5 ldr pc, [r12]
				// DSOARM-NEXT: 1050: c0 1f 00 00
				// 0x1048 + 8 + 1fc0 = 0x3010
				// DSOARM-NEXT: 1054: 04 c0 9f e5 ldr r12, [pc, #4]
				// DSOARM-NEXT: 1058: 0f c0 8c e0 add r12, r12, pc
				// DSOARM-NEXT: 105c: 00 f0 9c e5 ldr pc, [r12]
				// DSOARM-NEXT: 1060: b4 1f 00 00
				// 0x1058 + 8 + 1fb4 = 0x3014

				// DSOREL: Name: .got.plt
				// DSOREL-NEXT: Type: SHT_PROGBITS
				// DSOREL-NEXT: Flags [
				// DSOREL-NEXT: SHF_ALLOC
				// DSOREL-NEXT: SHF_WRITE
				// DSOREL-NEXT: ]
				// DSOREL-NEXT: Address: 0x3000
				// DSOREL-NEXT: Offset:
				// DSOREL-NEXT: Size: 24
				// DSOREL-NEXT: Link:
				// DSOREL-NEXT: Info:
				// DSOREL-NEXT: AddressAlignment: 4
				// DSOREL-NEXT: EntrySize:
				// DSOREL: Relocations [
				// DSOREL-NEXT: Section (4) .rel.plt {
				// DSOREL-NEXT: 0x300C R_ARM_JUMP_SLOT func1 0x0
				// DSOREL-NEXT: 0x3010 R_ARM_JUMP_SLOT func2 0x0
				// DSOREL-NEXT: 0x3014 R_ARM_JUMP_SLOT func3 0x0