This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
ELF/
1/1
InputSection.cpp
4/4
Writer.cpp
-
test/ELF/
-
ELF/
-
basic-ppc64.s
-
ppc64-abs64-dyn.s
-
ppc64-bsymbolic-toc-restore.s
-
ppc64-call-reach.s
-
ppc64-dq.s
-
ppc64-dtprel.s
-
ppc64-entry-point.s
-
ppc64-error-missaligned-dq.s
-
ppc64-error-missaligned-ds.s
-
ppc64-func-entry-points.s
-
ppc64-ifunc.s
-
ppc64-local-dynamic.s
-
ppc64-long-branch-localentry-offset.s
-
ppc64-long-branch.s
-
ppc64-plt-stub.s
-
ppc64-rel-calls.s
-
ppc64-relocs.s
-
ppc64-shared-long_branch.s
-
ppc64-tls-gd.s
-
ppc64-tls-ie.s
-
ppc64-tls-vaddr-align.s
-
ppc64-toc-addis-nop-lqsq.s
-
ppc64-toc-addis-nop.s
-
ppc64-toc-rel.s
-
ppc64-toc-relax-constants.s
-
ppc64-toc-relax-jumptable.s
-
ppc64-toc-relax.s
-
ppc64-toc-restore-recursive-call.s
-
ppc64-toc-restore.s
-
ppc64-weak-undef-call.s
-
relro-copyrel-bss-script.s

Differential D64906

[ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges
ClosedPublic

Authored by MaskRay on Jul 18 2019, 12:51 AM.

Download Raw Diff

Details

Reviewers

• espindola
grimar
mcgrathr
pcc
peter.smith
phosek
ruiu
dalias
nsz
sfertile

Commits

rG01c7f4b60669: [ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges
rLLD369343: [ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges
rL369343: [ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges

Summary

This change affects the non-linker script case (precisely, when the
SECTIONS command is not used). It deletes 3 alignments at PT_LOAD
boundaries for the default case: the size of a powerpc64 binary can be
decreased by at most 192kb. The technique can be ported to other
targets.

Let me demonstrate the idea with a maxPageSize=65536 example:

When assigning the address to the first output section of a new PT_LOAD,
if the end p_vaddr of the previous PT_LOAD is 0x10020, we advance to
the next multiple of maxPageSize: 0x20000. The new PT_LOAD will thus
have p_vaddr=0x20000. Because p_offset and p_vaddr are congruent modulo
maxPageSize, p_offset will be 0x20000, leaving a p_offset gap [0x10020,
0x20000) in the output.

Alternatively, if we advance the position to 0x20020, the new PT_LOAD
will have p_vaddr=0x20020. We can pick either 0x10020 or 0x20020 for p_offset!
Obviously 0x10020 is the choice because it leaves no gap.
At runtime, p_vaddr will be rounded down by pagesize
(65536 if pagesize=maxPageSize). This PT_LOAD will load additional
initial contents from p_offset ranges [0x10000,0x10020), which will also
be loaded by the previous PT_LOAD. This is fine if -z noseparate-code is
in effect or if we are not transiting between executable and
non-executable segments.

ld.bfd -z noseparate-code leverages this technique to keep output small.
This patch implements the technique in lld, which is mostly effective on
targets with large defaultMaxPageSize (AArch64/MIPS/PPC: 65536). The 3
removed alignments can save almost 3*65536 bytes.

Two places that rely on p_vaddr%pagesize = 0 have to be updated.

We used to round p_memsz(PT_GNU_RELRO) up to commonPageSize (defaults to 4096 on all targets). Now p_vaddr%commonPageSize may be non-zero. The updated formula takes account of that factor.
Our TP offsets formulae are only correct if p_vaddr%p_align = 0. Fix them. See the updated comments in InputSection.cpp for details.

On targets that we enable the technique (only PPC64 now), we can potentially make p_vaddr(PT_TLS)%p_align(PT_TLS) != 0 if sh_addralign(.tdata) < sh_addralign(.tbss)

This exposes many problems in ld.so implementations, especially the offsets of dynamic TLS blocks. Known issues:

FreeBSD 13.0-CURRENT rtld-elf (i386/amd64/powerpc/arm64) glibc (HEAD) i386 and x86_64 https://sourceware.org/bugzilla/show_bug.cgi?id=24606 musl<=1.1.22 on TLS Variant I architectures (aarch64/powerpc64/...)

So, force p_vaddr%p_align = 0 by rounding dot up to p_align(PT_TLS).

The technique will be enabled (with updated tests) for other targets in
subsequent patches.

Diff Detail

Repository

rLLD LLVM Linker

Build Status

Buildable 35421
Build 35420: arc lint + arc unit

Event Timeline

MaskRay created this revision.Jul 18 2019, 12:51 AM

Herald added a reviewer: • espindola. · View Herald TranscriptJul 18 2019, 12:51 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, kbarton, arichardson and 2 others. · View Herald Transcript

Harbormaster completed remote builds in B35226: Diff 210497.Jul 18 2019, 12:52 AM

Herald added a subscriber: • wuzish. · View Herald TranscriptJul 18 2019, 12:52 AM

MaskRay mentioned this in D64903: [ELF] Add -z separate-code and pad the last page of last PF_X PT_LOAD with traps only if -z separate-code is specified.Jul 18 2019, 12:53 AM

Fix ppc64 tests

Herald added a subscriber: jsji. · View Herald TranscriptJul 18 2019, 9:16 AM

Harbormaster completed remote builds in B35274: Diff 210598.Jul 18 2019, 9:16 AM

MaskRay added a child revision: D64930: [ELF][AArch64] Allow PT_LOAD to have overlapping p_offset ranges.Jul 18 2019, 9:28 AM

Ready for review.

Harbormaster completed remote builds in B35279: Diff 210604.Jul 18 2019, 9:31 AM

Add descriptions

Herald added subscribers: atanasyan, kristof.beyls, javed.absar, sdardis. · View Herald TranscriptJul 18 2019, 7:11 PM

Harbormaster completed remote builds in B35327: Diff 210725.Jul 18 2019, 7:13 PM

MaskRay edited the summary of this revision. (Show Details)Jul 18 2019, 7:20 PM

Delete an unnecessary change in computeFileOffset

Harbormaster completed remote builds in B35328: Diff 210728.Jul 18 2019, 7:50 PM

Simplify p_memsz of PT_GNU_RELRO

Harbormaster completed remote builds in B35329: Diff 210729.Jul 18 2019, 7:54 PM

MaskRay marked an inline comment as done.Jul 18 2019, 8:28 PM

MaskRay added inline comments.

ELF/Writer.cpp
2378–2379	@pcc I think after D29242, we may lose PROT_READ protection of the last page of PT_GNU_RELRO. (This patch should keep the behavior unchanged.) glibc/musl essentially do: start = p_vaddr & -pagesize mprotect(start, (p_vaddr+p_memsz & -pagesize) - start, PROT_READ) The last page is unprotected if maxPageSize > pagesize >= commonPageSize. I guess that may be why ld.bfd aligns the end of PT_GNU_RELRO, instead of its start.

Discussed with dalias, let me summary the findings (this goes beyond the scope of this patch...):

FreeBSD rtld.c is wrong: its last page of PT_GNU_RELRO may not be protected:

// if p_vaddr%pagesz != 0, relro_size may not cover the last page.
obj->relro_page = obj->relocbase + trunc_page(ph->p_vaddr);
obj->relro_size = round_page(ph->p_memsz);

ld.bfd seems to align the end of PT_GNU_RELRO to common-page-size (can be observed with -z relro -z max-page-size=0x200000 -z common-page-size=0x4000). This may leave a gap before the RW. Moreover, if common-page-size < runtime pagesz <= max-page-size, the last page may not be protected. lld's current two RW approach (D58892) perfectly avoids the waste. This also has the implication that runtime pagesz cannot be larger than common-page-size.

The status quo (D29242) seems the best we can do. We cannot round p_memsz up to max-page-size. If RW(relro(...)) RW(non-relro(empty)), and runtime pagesz < max-page-size, mprotect invoked by ld.so (glibc/elf/dl-reloc.c:_dl_protect_relro and musl/ldso/dynlink.c:reloc_all) will have an out-of-range len. According to POSIX, [ENOMEM] Addresses in the range [addr,addr+len) are invalid for the address space of a process, or specify one or more pages which are not mapped.

If PT_GNU_RELRO could be redesigned, completely dropping p->p_memsz = alignTo(p->p_memsz, config->commonPageSize); and letting ld.so handle round-up would be better. We keep it so that (common-page-size = runtime pagesz) systems will get proper protection of the last page.

The best thing is to keep the lld behavior unchanged, and let ld.so round up p_memsz.

Don't force alignment for relro. It is unnecessary

Harbormaster completed remote builds in B35341: Diff 210780.Jul 19 2019, 2:28 AM

As I understand it this will only affect the non-linker script case as fixSectionAlignments() will only be called when there is no script->hasSectionsCommand. It will be worth making this clear in the description. I expect to get equivalent behaviour in the linker script we'd need to alter the implementation of DATA_SEGMENT_ALIGN in ScriptParser.cpp.

I would like to do a bit more research about RELRO, as I can't see from this patch alone. I think it is fine if RELRO is double mapped into an RO page. However if RELRO is adjacent to RW segments I think it could be a bad idea to have something like

VA [0x10000, 0x10020)	.data.rel.ro	PA [0x10000, 0x10020)
VA [0x20020, ...)	.data	PA [0x10020, ...)

As in theory (I'm not sure about how this works in the OS/loader so I could have this wrong) if the physical contents of .data was mapped RW from 0x10000 -> 0x20000 we'd have an ability to write to the .data.rel.ro via .data.

Is there some other part of the code that prevents this or does some other mechanism in the loader/OS prevent this from happening?

ELF/Writer.cpp
2210–2211	Does this comment need updating. What does page aligned mean now? Moreover does the comment about PT_GNU_RELRO make sense?
2215–2240	I suggest prev or prevPhdr rather than last, or perhaps lastSeen. At a glance last on its own can imply the final Phdr.
2220	Suggest: "When -z separate-code is used we must not have any overlap in pages between an executable segment and a non-executable segment. We align to the next maximum page size boundary on transitions between executable and non-executable segments.

I would like to do a bit more research about RELRO, as I can't see from this patch alone. I think it is fine if RELRO is double mapped into an RO page. However if RELRO is adjacent to RW segments I think it could be a bad idea to have something like

VA [0x10000, 0x10020) .data.rel.ro PA [0x10000, 0x10020)

VA [0x20020, ...) .data PA [0x10020, ...)

As in theory (I'm not sure about how this works in the OS/loader so I could have this wrong) if the physical contents of .data was mapped RW from 0x10000 -> 0x20000 we'd have an ability to write to the .data.rel.ro via .data.

Is there some other part of the code that prevents this or does some other mechanism in the loader/OS prevent this from happening?

To answer my own question https://sourceware.org/binutils/docs-2.32/ld/Builtin-Functions.html has DATA_SEGMENT_RELRO_END which mentions:

DATA_SEGMENT_ALIGN is padded so that exp + offset is aligned to the commonpagesize argument given to DATA_SEGMENT_ALIGN

There is also the comment in DATA_SEGMENT_ALIGN

commonpagesize should be less or equal to maxpagesize and should be the system page size the object wants to be optimized for while still running on system page sizes up to maxpagesize. Note however that ‘-z relro’ protection will not be effective if the system page size is larger than commonpagesize.

So this implies that if you are on a linux distro with a 64k page size and you want full relro protection you must increase the common page size to match the max page size.

Address review comments

Harbormaster completed remote builds in B35348: Diff 210798.Jul 19 2019, 4:33 AM

MaskRay updated this revision to Diff 210814.Jul 19 2019, 6:14 AM

MaskRay edited the summary of this revision. (Show Details)

This comment was removed by MaskRay.

I would like to do a bit more research about RELRO, as I can't see from this patch alone. I think it is fine if RELRO is double mapped into an RO page.

Yes, the RELRO region may be double mapped.

As in theory (I'm not sure about how this works in the OS/loader so I could have this wrong) if the physical contents of .data was mapped RW from 0x10000 -> 0x20000 we'd have an ability to write to the .data.rel.ro via .data.

This should not be a concern. PT_LOAD segments are mapped with the MAP_PRIVATE flag. The contents are copy-on-write and not shared between two maps:

MAP_PRIVATE
   Create  a  private copy-on-write mapping.  Updates to the mapping are not visible to other pro‐
   cesses mapping the same file, and are not carried  through  to  the  underlying  file.   It  is
   unspecified  whether  changes  made to the file after the mmap() call are visible in the mapped
   region.

To answer my own question https://sourceware.org/binutils/docs-2.32/ld/Builtin-Functions.html has DATA_SEGMENT_RELRO_END which mentions:

There is also the comment in DATA_SEGMENT_ALIGN

Thanks for the reference! I see that document partly answered my point 5 above (https://reviews.llvm.org/D64906#1592854). Their choice is to avoid maxpagesize alignment at the end of PT_GNU_RELRO, but there can still be a commonpagesize alignment at DATA_SEGMENT_ALIGN. The downside is that the last page may not be protected (common case on PPC: commonpagesize=4k, pagesz=maxpagesize=64k).

Since D58892, we have two RW segments: RW(relro) + RW(non-relro). By allowing double mapped RELRO contents, we don't have an alignment. (Our .got and .got.plt are separate - which has been the case for a long time. There seems no issue with it.)

MaskRay marked 3 inline comments as done.Jul 19 2019, 6:17 AM

Harbormaster completed remote builds in B35358: Diff 210814.Jul 19 2019, 6:17 AM

MaskRay mentioned this in D64930: [ELF][AArch64] Allow PT_LOAD to have overlapping p_offset ranges.Jul 19 2019, 9:13 AM

Align the RW PT_LOAD which includes PT_TLS, and add ppc64-tls-vaddr-align.s

A stage2 check-llvm check-clang check-lld passed before this revision. But
when I test this patch (the x86_64 version) in our internal code base, I
noticed PT_TLS may not satisfy p_vaddr%p_align=0. This update fixes the issue.

Harbormaster completed remote builds in B35417: Diff 210956.Jul 20 2019, 5:31 AM

Fix TP offset computation if p_vaddr % p_align != 0

Harbormaster completed remote builds in B35421: Diff 210964.Jul 20 2019, 10:05 AM

Fix gcc 8 -Wparentheses

Passed stage 2 check-llvm check-clang check-lld on a powerpc64le machine

Harbormaster completed remote builds in B35444: Diff 211021.Jul 21 2019, 10:52 PM

Comment about static TLS blocks Variants 1 and 2.

Herald added a subscriber: PkmX. · View Herald TranscriptJul 23 2019, 8:41 PM

MaskRay marked an inline comment as done.Jul 23 2019, 8:44 PM

MaskRay added inline comments.

ELF/InputSection.cpp
611	@peter.smith Moved the comment here because this mostly applies on other Variant I targets (PPC,RISC-V,...).

Harbormaster completed remote builds in B35544: Diff 211406.Jul 23 2019, 8:44 PM

Reword the comment about TLS

Harbormaster completed remote builds in B35547: Diff 211410.Jul 23 2019, 9:22 PM

rprichard added a subscriber: rprichard.Jul 29 2019, 4:30 PM

Herald added subscribers: s.egerton, simoncook. · View Herald TranscriptJul 29 2019, 4:30 PM

Add a p_vaddr%p_align = 0 hack to work around some ld.so bugs

Harbormaster completed remote builds in B35863: Diff 212493.Jul 30 2019, 7:48 PM

Edit a comment: FreeBSD rtld-elf amd64 has the same bug as glibc(i386 x86-64) https://sourceware.org/bugzilla/show_bug.cgi?id=24606

Harbormaster completed remote builds in B35864: Diff 212494.Jul 30 2019, 8:03 PM

MaskRay mentioned this in rL367537: [ELF] Add -z separate-code and pad the last page of last PF_X PT_LOAD with….Aug 1 2019, 2:58 AM

MaskRay mentioned this in rG5391f158c236: [ELF] Add -z separate-code and pad the last page of last PF_X PT_LOAD with….

D64903 is committed. This patch is ready now. It can delete 3 alignments at PT_LOAD boundaries (because -z noseparate-code is the default): it can decrease the size of a powerpc64 executable/shared object by at most 192kb (it can save more than 96kb on average).

I'm happy to go ahead as this is a pre-requisite for supporting AArch64.

Update description to mention ld.so bugs (why we have to add a workaround)

Herald added a subscriber: krytarowski. · View Herald TranscriptAug 1 2019, 3:39 AM

Harbormaster completed remote builds in B35953: Diff 212775.Aug 1 2019, 3:39 AM

MaskRay edited the summary of this revision. (Show Details)Aug 1 2019, 3:42 AM

Move AArch64 TLS formula here

Harbormaster completed remote builds in B35955: Diff 212779.Aug 1 2019, 4:32 AM

🥳It would be nice to get this in. Recently I learned that Brandon Bergren at FreeBSD used a local patch to fix the powerpc64 size regression caused by lld:

  // We need 64K pages (at least under glibc/Linux, the loader won't
  // set different permissions on a finer granularity than that).
-  defaultMaxPageSize = 65536;
+ defaultMaxPageSize = 4096;

(It works for them because they use 4k pagesize, but a general fix (this patch) will be preferable.) D64906 is also a prerequisite for aarch64 support, which many more people may want.

MaskRay added a child revision: D65865: [ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_386.Aug 7 2019, 5:55 AM

It turns out I need the proper fix after all in my local tree, because lately I have been working on getting a working Petitboot loader binary, and that means I'm technically cross compiling code for ppc64le Linux. So yeah, it would be very nice to get this in.

MaskRay added a reviewer: sfertile.Aug 13 2019, 4:30 AM

LGTM

This revision is now accepted and ready to land.Aug 20 2019, 12:48 AM

Herald added a subscriber: steven.zhang. · View Herald TranscriptAug 20 2019, 12:48 AM

Rebase. 3 ppc tests have changed recently.

Harbormaster completed remote builds in B37002: Diff 216062.Aug 20 2019, 1:24 AM

Closed by commit rL369343: [ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges (authored by MaskRay). · Explain WhyAug 20 2019, 1:35 AM

This revision was automatically updated to reflect the committed changes.

MaskRay mentioned this in rL369344: [ELF][AArch64] Allow PT_LOAD to have overlapping p_offset ranges.

MaskRay mentioned this in rGf66b767abe5e: [ELF][AArch64] Allow PT_LOAD to have overlapping p_offset ranges.

MaskRay mentioned this in D65865: [ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_386.Aug 20 2019, 1:40 AM

MaskRay mentioned this in rL369347: [ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_386.Aug 20 2019, 1:42 AM

MaskRay mentioned this in rG9c371309f38c: [ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_386.Aug 20 2019, 1:47 AM

MaskRay mentioned this in rL369351: [ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges on EM_PPC.Aug 20 2019, 2:19 AM

MaskRay mentioned this in rG12d83b427015: [ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges on EM_PPC.

MaskRay mentioned this in D66658: [ELF] Align the first section of a PT_LOAD even if its type is SHT_NOBITS.Aug 23 2019, 8:19 AM

MaskRay mentioned this in rGaf47d0021c7a: [ELF] Align the first section of a PT_LOAD even if its type is SHT_NOBITS.Aug 23 2019, 5:47 PM

MaskRay mentioned this in rL369828: [ELF] Align the first section of a PT_LOAD even if its type is SHT_NOBITS.

MaskRay mentioned this in D66749: [ELF][ARM] Allow PT_LOAD to have overlapping p_offset ranges on EM_ARM.Aug 26 2019, 8:52 AM

MaskRay mentioned this in rG024bf27ddfa6: [ELF][ARM] Allow PT_LOAD to have overlapping p_offset ranges on EM_ARM.Aug 27 2019, 5:09 AM

MaskRay mentioned this in rL370049: [ELF][ARM] Allow PT_LOAD to have overlapping p_offset ranges on EM_ARM.Aug 27 2019, 5:14 AM

MaskRay mentioned this in rL370192: [ELF][RISCV] Allow PT_LOAD to have overlapping p_offset ranges on EM_RISCV.Aug 28 2019, 5:06 AM

MaskRay mentioned this in rG523f999acf6f: [ELF][RISCV] Allow PT_LOAD to have overlapping p_offset ranges on EM_RISCV.

MaskRay mentioned this in D67090: [llvm-objcopy][llvm-strip] Support --only-keep-debug.Sep 3 2019, 9:17 AM

hansw mentioned this in rG501ad1d7ba8f: Merging r369828: --------------------------------------------------------------….Sep 6 2019, 4:18 AM

hans mentioned this in rL371196: Merging r369828:.Sep 6 2019, 4:18 AM

atanasyan mentioned this in rL371554: [mips] Allow PT_LOAD to have overlapping p_offset ranges on EM_MIPS.Sep 10 2019, 1:21 PM

atanasyan mentioned this in rG6c6f5a998452: [mips] Allow PT_LOAD to have overlapping p_offset ranges on EM_MIPS.

MaskRay mentioned this in D67481: [ELF] Add -z separate-loadable-segments to complement separate-code and noseparate-code.Sep 12 2019, 1:15 AM

MaskRay mentioned this in D67482: [ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_X86_64.Sep 12 2019, 1:37 AM

MaskRay mentioned this in rL371958: [ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_X86_64.Sep 16 2019, 12:06 AM

MaskRay mentioned this in rGd4306e90cb18: [ELF][X86] Allow PT_LOAD to have overlapping p_offset ranges on EM_X86_64.

MaskRay mentioned this in D67605: [ELF][Hexagon] Allow PT_LOAD to have overlapping p_offset ranges on EM_HEXAGON.Sep 16 2019, 12:23 AM

MaskRay mentioned this in rG4816e516e5ca: [ELF][Hexagon] Allow PT_LOAD to have overlapping p_offset ranges on EM_HEXAGON.Sep 16 2019, 7:46 PM

MaskRay mentioned this in rL372059: [ELF][Hexagon] Allow PT_LOAD to have overlapping p_offset ranges on EM_HEXAGON.

MaskRay mentioned this in rL372807: [ELF] Add -z separate-loadable-segments to complement separate-code and….Sep 24 2019, 8:39 PM

MaskRay mentioned this in rG0264950697e5: [ELF] Add -z separate-loadable-segments to complement separate-code and….

p->p_memsz = alignTo(p->p_offset + p->p_memsz, config->commonPageSize) - p->p_offset;

I think the whole rounding step is questionable, not simply this change to it. As far as I can tell from researching this, the rounding down that occurs is for the starting address to place RELRO on a page boundary. The size of RELRO does not get rounded down, so rounding it up here by any amount risks making more data read-only than is necessary, which can lead to seg faults.

In D64906#1711802, @troyj wrote:
p->p_memsz = alignTo(p->p_offset + p->p_memsz, config->commonPageSize) - p->p_offset;
I think the whole rounding step is questionable, not simply this change to it. As far as I can tell from researching this, the rounding down that occurs is for the starting address to place RELRO on a page boundary. The size of RELRO does not get rounded down, so rounding it up here by any amount risks making more data read-only than is necessary, which can lead to seg faults.

p->p_memsz = alignTo(p->p_offset + p->p_memsz, config->commonPageSize) -
             p->p_offset;

is necessary. In GNU ld, the last page of RELRO may not be protected as documented. To make that page protected on all of glibc/musl/FreeBSD libc, the change like https://reviews.llvm.org/D28267 is needed. Please also read https://reviews.llvm.org/D64906#1592854

If you cannot use -z norelro, you may try -z separate-code or -z separate-loadable-segments (D67481)

Respectfully, I've read all of that plus https://www.airs.com/blog/archives/189, and we've arrived at different conclusions. I'm fine with maintaining a local patch; I just wanted to point it out in case it was useful to others upstream.

In D64906#1712768, @troyj wrote:

Respectfully, I've read all of that plus https://www.airs.com/blog/archives/189, and we've arrived at different conclusions. I'm fine with maintaining a local patch; I just wanted to point it out in case it was useful to others upstream.

Can you be more specific about how this conflicts with the blog post? Out of curiosity I want to learn why your software needs a local patch. I am 90% certain that that specific software makes unfounded assumption about the section/segment layout. GNU ld places PT_GNU_RELRO starting at a maxpagesize boundary and ending at a commonpagesize boundary. The last page may be unprotected. I agree that https://reviews.llvm.org/D29242 is not very ideal but it does not matter in practice: if runtime pagesize is smaller than commonpagesize(4096 on all targets but SPARCV9 that are supported by lld; 4096 on most targets supported by GNU ld), it can segfault.

There some some other differences, e.g. lld has .bss.rel.ro when no SECTIONS command is used. This was motivated by An Evil Copy: How the Loader Betrays You (I really dislike the misleading and exaggerated title) GNU ld and gold haven't implemented this. 2 RW schemes play well with PT_GNU_RELRO. etc

Can you be more specific about how this conflicts with the blog post?

The blog post says "Note that the current dynamic linker code will only work correctly if the PT_GNU_RELRO segment starts on a page boundary. This is because the dynamic linker rounds the p_vaddr field down to the previous page boundary." The lld code comment says "musl/glibc ld.so rounds the size down" and then proceeds to round the size up in what appears to be a countermeasure. So the blog post is talking about the starting address of the segment, but the lld code is rounding the size, not the starting address. For programs that I have linked, the starting address appears to already be on a page boundary, so no rounding is required there, and apply any rounding to the size results in an error because too much of the RW data segment gets marked RO.

Out of curiosity I want to learn why your software needs a local patch. I am 90% certain that that specific software makes unfounded assumption about the section/segment layout.

Yes, it makes assumptions that may be unfounded but happen to match ld.bfd and ld.gold behavior. Specifically, it assumes that the linker creates a single RW segment like ld.bfd and ld.gold, and that the starting address of that RW segment matches the starting address of the RELRO segment. I'm not aware of any rule that ld.lld violates by creating more than one RW segment, but it is inconvenient that it does not match the behavior of the other linkers. My local patch stops splitting the RW segment so that there is only one segment and then removes the rounding of the RELRO size. IF I leave the rounding in place, then the entire RW segment ends up being covered by the RELRO, which is bad because then the program can't write to any of its data. With the rounding removed, the emitted layout is much closer to the one emitted by ld.bfd and ld.gold.

The last page may be unprotected.

The last page being unprotected is better than incorrectly making the first page of writable data be RELRO. The former may miss identifying some programming errors or possibly open a security hole, but the latter certainly leads to the program crashing.

A better way might be to nudge the start of the writable data to begin later in the RW segment so that an integral number of initial pages can be RELRO, but ld.lld doesn't seem to do that and I'm not familiar with the code enough to add that myself. Hence, I'm settling for possibly not protecting all of the RO data until I know of a way to do the above.

In D64906#1717096, @troyj wrote:

Can you be more specific about how this conflicts with the blog post?

The blog post says "Note that the current dynamic linker code will only work correctly if the PT_GNU_RELRO segment starts on a page boundary. This is because the dynamic linker rounds the p_vaddr field down to the previous page boundary." The lld code comment says "musl/glibc ld.so rounds the size down" and then proceeds to round the size up in what appears to be a countermeasure. So the blog post is talking about the starting address of the segment, but the lld code is rounding the size, not the starting address. For programs that I have linked, the starting address appears to already be on a page boundary, so no rounding is required there, and apply any rounding to the size results in an error because too much of the RW data segment gets marked RO.

It is best not to interpret this as a doctrine. musl does mprotect(laddr(p, p->relro_start), p->relro_end-p->relro_start, PROT_READ) where relro_end is computed as (ph->p_vaddr + ph->p_memsz) & -PAGE_SIZE (glibc is similar). The changed formula matches how musl/glibc mprotect PT_GNU_RELRO. FreeBSD rounds the size up. I have said it can be problematic in one of my previous comments and reiterated in my previous comment to your question.

"PT_GNU_RELRO segment starts on a page boundary" does not preclude the possibility that p_vaddr%maxpagesize!=0.

Out of curiosity I want to learn why your software needs a local patch. I am 90% certain that that specific software makes unfounded assumption about the section/segment layout.

Yes, it makes assumptions that may be unfounded but happen to match ld.bfd and ld.gold behavior. Specifically, it assumes that the linker creates a single RW segment like ld.bfd and ld.gold, and that the starting address of that RW segment matches the starting address of the RELRO segment. I'm not aware of any rule that ld.lld violates by creating more than one RW segment, but it is inconvenient that it does not match the behavior of the other linkers. My local patch stops splitting the RW segment so that there is only one segment and then removes the rounding of the RELRO size. IF I leave the rounding in place, then the entire RW segment ends up being covered by the RELRO, which is bad because then the program can't write to any of its data. With the rounding removed, the emitted layout is much closer to the one emitted by ld.bfd and ld.gold.

Your software may need -z separate-loadable-segments, as I mentioned in my previous comments to your question. We still have 2 RW, but they don't have overlapping p_offset, and they can actually be merged into one. Put it in another way, after mprotect'ing PT_GNU_RELRO, the memory mapping layout is not different from the case when there is only one RW. If your program does not parse its program header, there should be no runtime perceivable behavior differences. Merging 2 RW into one can be seen as an optimization, and lld does not support it. I think it is fine because a program header just costs sizeof(Elf64_Phdr)=56 bytes on ELF64.

The last page may be unprotected.

The last page being unprotected is better than incorrectly making the first page of writable data be RELRO. The former may miss identifying some programming errors or possibly open a security hole, but the latter certainly leads to the program crashing.

A better way might be to nudge the start of the writable data to begin later in the RW segment so that an integral number of initial pages can be RELRO, but ld.lld doesn't seem to do that and I'm not familiar with the code enough to add that myself. Hence, I'm settling for possibly not protecting all of the RO data until I know of a way to do the above.

The current layout is R RX RW(RELRO) RW(non-RELRO). Android folks proposed an alternative layout in August https://lists.llvm.org/pipermail/llvm-dev/2019-August/134801.html You can find David Chisnall's and my replies. I am not convinced it improves things.

If your program does not parse its program header, there should be no runtime perceivable behavior differences.

It parses the program header. That's why it knows that there is more than one RW. It's looking for a single RW so that it can mremap its data, but it's confused when it finds more than one RW.

Merging 2 RW into one can be seen as an optimization, and lld does not support it.

Well....hold on. lld used to emit one RW prior to that patch, and now it doesn't. My issue is that I'm fine reverting the patch locally, but then RELRO overlaps the entire RW due to the rounding. So far I've tried eliminating the rounding, but now I'm trying it with the rounding restored and attempting to insert a dummy section prior to .data. to push .data. up to the next page boundary. I'm trying to figure that part out now, but it's not clear if I need to create it early as another synthetic section or if I can just create one in Writer.cpp.

In D64906#1718052, @troyj wrote:

If your program does not parse its program header, there should be no runtime perceivable behavior differences.

It parses the program header. That's why it knows that there is more than one RW. It's looking for a single RW so that it can mremap its data, but it's confused when it finds more than one RW.

Merging 2 RW into one can be seen as an optimization, and lld does not support it.

Well....hold on. lld used to emit one RW prior to that patch, and now it doesn't. My issue is that I'm fine reverting the patch locally, but then RELRO overlaps the entire RW due to the rounding.

My feeling is one RW partially overlapping PT_GNU_RELRO can be more problematic. Because lld does not have code to align a middle section in a segment to maxpagesize. "overlaps the entire RW due to the rounding" is indeed the problem you may face. I think the current lld segment layout is better than GNU linkers'.

So far I've tried eliminating the rounding, but now I'm trying it with the rounding restored and attempting to insert a dummy section prior to .data. to push .data. up to the next page boundary. I'm trying to figure that part out now, but it's not clear if I need to create it early as another synthetic section or if I can just create one in Writer.cpp.

I suggest that you update the software that parses RW. If you really want to keep a local lld patch that restores the original behavior (which I consider inferior), in Writer.cpp:fixSectionAlignments, sets addrExpr to align to the maxpagesize for the first section after the last PT_GNU_RELRO section.

I suggest that you update the software that parses RW. If you really want to keep a local lld patch that restores the original behavior (which I consider inferior) ...

Thank you for the advice. I understand from your perspective it is inferior, so I'm not trying to submit an upstream patch for this. I'm dealing with released software that is not receiving any further updates but will need to coexist with a new toolchain that uses lld. I can modify lld; I can't modify the other software.

Revision Contents

Path

Size

ELF/

InputSection.cpp

17 lines

Writer.cpp

49 lines

test/

ELF/

basic-ppc64.s

62 lines

ppc64-abs64-dyn.s

8 lines

ppc64-bsymbolic-toc-restore.s

2 lines

26 lines

6 lines

10 lines

16 lines

ppc64-error-missaligned-dq.s

4 lines

ppc64-error-missaligned-ds.s

4 lines

ppc64-func-entry-points.s

22 lines

ppc64-ifunc.s

42 lines

ppc64-local-dynamic.s

4 lines

ppc64-long-branch-localentry-offset.s

2 lines

12 lines

12 lines

20 lines

54 lines

ppc64-shared-long_branch.s

15 lines

ppc64-tls-gd.s

16 lines

ppc64-tls-ie.s

8 lines

ppc64-tls-vaddr-align.s

30 lines

ppc64-toc-addis-nop-lqsq.s

1 line

ppc64-toc-addis-nop.s

73 lines

ppc64-toc-rel.s

15 lines

ppc64-toc-relax-constants.s

16 lines

ppc64-toc-relax-jumptable.s

8 lines

ppc64-toc-relax.s

30 lines

ppc64-toc-restore-recursive-call.s

6 lines

ppc64-toc-restore.s

22 lines

ppc64-weak-undef-call.s

6 lines

relro-copyrel-bss-script.s

19 lines

Diff 210964

ELF/InputSection.cpp

	Show First 20 Lines • Show All 602 Lines • ▼ Show 20 Lines

	// A TLS symbol's virtual address is relative to the TLS segment. Add a			// A TLS symbol's virtual address is relative to the TLS segment. Add a
	// target-specific adjustment to produce a thread-pointer-relative offset.			// target-specific adjustment to produce a thread-pointer-relative offset.
	static int64_t getTlsTpOffset(const Symbol &s) {			static int64_t getTlsTpOffset(const Symbol &s) {
	// On targets that support TLSDESC, _TLS_MODULE_BASE_@tpoff = 0.			// On targets that support TLSDESC, _TLS_MODULE_BASE_@tpoff = 0.
	if (&s == ElfSym::tlsModuleBase)			if (&s == ElfSym::tlsModuleBase)
	return 0;			return 0;

				elf::PhdrEntry *tls = Out::tlsPhdr;
				MaskRayAuthorUnsubmitted Done Reply Inline Actions @peter.smith Moved the comment here because this mostly applies on other Variant I targets (PPC,RISC-V,...). MaskRay: @peter.smith Moved the comment here because this mostly applies on other Variant I targets…
	switch (config->emachine) {			switch (config->emachine) {
	case EM_ARM:			case EM_ARM:
	case EM_AARCH64:			case EM_AARCH64:
	// Variant 1. The thread pointer points to a TCB with a fixed 2-word size,			// Variant 1. The thread pointer points to a TCB with a fixed 2-word size,
	// followed by a variable amount of alignment padding, followed by the TLS			// followed by a variable amount of alignment padding, followed by the TLS
	// segment.			// segment.
	return s.getVA(0) + alignTo(config->wordsize * 2, Out::tlsPhdr->p_align);			return s.getVA(0) + alignTo(config->wordsize * 2, Out::tlsPhdr->p_align);
	case EM_386:			case EM_386:
	case EM_X86_64:			case EM_X86_64:
	// Variant 2. The TLS segment is located just before the thread pointer.			// Variant 2. The end of PT_TLS (p_vaddr+p_memsz) rounded up to p_align has
	return s.getVA(0) - alignTo(Out::tlsPhdr->p_memsz, Out::tlsPhdr->p_align);			// TP offset 0.
				return s.getVA(0) - tls->p_memsz -
				(-tls->p_vaddr - tls->p_memsz & tls->p_align - 1);
	case EM_PPC:			case EM_PPC:
	case EM_PPC64:			case EM_PPC64:
	// The thread pointer points to a fixed offset from the start of the			// Variant 1. p_vaddr rounded down to p_align has TP offset -0x7000.
	// executable's TLS segment. An offset of 0x7000 allows a signed 16-bit			// The start of PT_TLS (p_vaddr) has TP offset (p_vaddr%p_align - 0x7000).
	// offset to reach 0x1000 of TCB/thread-library data and 0xf000 of the			// An offset of 0x7000 allows a signed 16-bit offset to reach 0x1000 of
	// program's TLS segment.			// TCB/thread-library data and 0xf000 of the program's TLS segment.
	return s.getVA(0) - 0x7000;			return s.getVA(0) + (tls->p_vaddr & tls->p_align - 1) - 0x7000;
	case EM_RISCV:			case EM_RISCV:
	return s.getVA(0);			return s.getVA(0);
	default:			default:
	llvm_unreachable("unhandled Config->EMachine");			llvm_unreachable("unhandled Config->EMachine");
	}			}
	}			}

	static uint64_t getRelocTargetVA(const InputFile *file, RelType type, int64_t a,			static uint64_t getRelocTargetVA(const InputFile *file, RelType type, int64_t a,
	▲ Show 20 Lines • Show All 700 Lines • Show Last 20 Lines

ELF/Writer.cpp

Show First 20 Lines • Show All 2,201 Lines • ▼ Show 20 Lines	void Writer<ELFT>::addPhdrForSection(Partition &part, unsigned shType,
});		});
if (i == outputSections.end())		if (i == outputSections.end())
return;		return;

PhdrEntry *entry = make<PhdrEntry>(pType, pFlags);		PhdrEntry *entry = make<PhdrEntry>(pType, pFlags);
entry->add(*i);		entry->add(*i);
part.phdrs.push_back(entry);		part.phdrs.push_back(entry);
}		}

// The first section of each PT_LOAD, the first section in PT_GNU_RELRO and the		// Place the first section of each PT_LOAD to a different page (of maxPageSize).
		peter.smithUnsubmitted Done Reply Inline Actions Does this comment need updating. What does page aligned mean now? Moreover does the comment about PT_GNU_RELRO make sense? peter.smith: Does this comment need updating. What does page aligned mean now? Moreover does the comment…
// first section after PT_GNU_RELRO have to be page aligned so that the dynamic		// This is achieved by assigning an alignment expression to addrExpr of each
// linker can set the permissions.		// such section.
template <class ELFT> void Writer<ELFT>::fixSectionAlignments() {		template <class ELFT> void Writer<ELFT>::fixSectionAlignments() {
auto pageAlign = [](OutputSection *cmd) {		const PhdrEntry *prev;
if (cmd && !cmd->addrExpr)		auto pageAlign = [&](const PhdrEntry *p) {
cmd->addrExpr = [=] {		OutputSection *cmd = p->firstSec;
		if (cmd && !cmd->addrExpr) {
		// Prefer advancing to align(dot, maxPageSize) + dot%maxPageSize to avoid
		// padding in the file contents.
		peter.smithUnsubmitted Done Reply Inline Actions Suggest: "When -z separate-code is used we must not have any overlap in pages between an executable segment and a non-executable segment. We align to the next maximum page size boundary on transitions between executable and non-executable segments. peter.smith: Suggest: "When -z separate-code is used we must not have any overlap in pages between an…
		//
		// When -z separate-code is used we must not have any overlap in pages
		// between an executable segment and a non-executable segment. We align to
		// the next maximum page size boundary on transitions between executable
		// and non-executable segments.
		//
		// TODO Enable this technique on all targets.
		bool enable = config->emachine == EM_PPC64;

		if (!enable \|\| (config->zSeparateCode && prev &&
		(prev->p_flags & PF_X) != (p->p_flags & PF_X)))
		cmd->addrExpr = [] {
return alignTo(script->getDot(), config->maxPageSize);		return alignTo(script->getDot(), config->maxPageSize);
};		};
		else
		cmd->addrExpr = [] {
		return alignTo(script->getDot(), config->maxPageSize) +
		script->getDot() % config->maxPageSize;
		};
		}
		peter.smithUnsubmitted Done Reply Inline Actions I suggest prev or prevPhdr rather than last, or perhaps lastSeen. At a glance last on its own can imply the final Phdr. peter.smith: I suggest prev or prevPhdr rather than last, or perhaps lastSeen. At a glance last on its own…
};		};

for (Partition &part : partitions) {		for (Partition &part : partitions) {
		prev = nullptr;
for (const PhdrEntry *p : part.phdrs)		for (const PhdrEntry *p : part.phdrs)
if (p->p_type == PT_LOAD && p->firstSec)		if (p->p_type == PT_LOAD && p->firstSec) {
pageAlign(p->firstSec);		pageAlign(p);
		prev = p;
		}
}		}
}		}

// Compute an in-file position for a given section. The file offset must be the		// Compute an in-file position for a given section. The file offset must be the
// same with its virtual address modulo the page size, so that the loader can		// same with its virtual address modulo the page size, so that the loader can
// load executables without any address adjustment.		// load executables without any address adjustment.
static uint64_t computeFileOffset(OutputSection *os, uint64_t off) {		static uint64_t computeFileOffset(OutputSection *os, uint64_t off) {
// File offsets are not significant for .bss sections. By convention, we keep		// File offsets are not significant for .bss sections. By convention, we keep
▲ Show 20 Lines • Show All 109 Lines • ▼ Show 20 Lines	if (first) {
if (!p->hasLMA)		if (!p->hasLMA)
p->p_paddr = first->getLMA();		p->p_paddr = first->getLMA();
}		}

if (p->p_type == PT_LOAD) {		if (p->p_type == PT_LOAD) {
p->p_align = std::max<uint64_t>(p->p_align, config->maxPageSize);		p->p_align = std::max<uint64_t>(p->p_align, config->maxPageSize);
} else if (p->p_type == PT_GNU_RELRO) {		} else if (p->p_type == PT_GNU_RELRO) {
p->p_align = 1;		p->p_align = 1;
// The glibc dynamic loader rounds the size down, so we need to round up		// musl/glibc ld.so rounds the size down, so we need to round up
// to protect the last page. This is a no-op on FreeBSD which always		// to protect the last page. This is a no-op on FreeBSD which always
// rounds up.		// rounds up.
p->p_memsz = alignTo(p->p_memsz, config->commonPageSize);		p->p_memsz = alignTo(p->p_offset + p->p_memsz, config->commonPageSize) -
		p->p_offset;
		MaskRayAuthorUnsubmitted Done Reply Inline Actions @pcc I think after D29242, we may lose PROT_READ protection of the last page of PT_GNU_RELRO. (This patch should keep the behavior unchanged.) glibc/musl essentially do: start = p_vaddr & -pagesize mprotect(start, (p_vaddr+p_memsz & -pagesize) - start, PROT_READ) The last page is unprotected if maxPageSize > pagesize >= commonPageSize. I guess that may be why ld.bfd aligns the end of PT_GNU_RELRO, instead of its start. MaskRay: @pcc I think after D29242, we may lose PROT_READ protection of the last page of PT_GNU_RELRO.
}		}
}		}
}		}

// A helper struct for checkSectionOverlap.		// A helper struct for checkSectionOverlap.
namespace {		namespace {
struct SectionOffset {		struct SectionOffset {
OutputSection *sec;		OutputSection *sec;
▲ Show 20 Lines • Show All 325 Lines • Show Last 20 Lines

test/ELF/basic-ppc64.s

	Show All 27 Lines
	// CHECK-NEXT: FileVersion: 1			// CHECK-NEXT: FileVersion: 1
	// CHECK-NEXT: OS/ABI: SystemV (0x0)			// CHECK-NEXT: OS/ABI: SystemV (0x0)
	// CHECK-NEXT: ABIVersion: 0			// CHECK-NEXT: ABIVersion: 0
	// CHECK-NEXT: Unused: (00 00 00 00 00 00 00)			// CHECK-NEXT: Unused: (00 00 00 00 00 00 00)
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: Type: SharedObject (0x3)			// CHECK-NEXT: Type: SharedObject (0x3)
	// CHECK-NEXT: Machine: EM_PPC64 (0x15)			// CHECK-NEXT: Machine: EM_PPC64 (0x15)
	// CHECK-NEXT: Version: 1			// CHECK-NEXT: Version: 1
	// CHECK-NEXT: Entry: 0x10000			// CHECK-NEXT: Entry: 0x1022C
	// CHECK-NEXT: ProgramHeaderOffset: 0x40			// CHECK-NEXT: ProgramHeaderOffset: 0x40
	// CHECK-NEXT: SectionHeaderOffset: 0x200F8			// CHECK-NEXT: SectionHeaderOffset: 0x330
	// CHECK-NEXT: Flags [ (0x2)			// CHECK-NEXT: Flags [ (0x2)
	// CHECK-NEXT: 0x2			// CHECK-NEXT: 0x2
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: HeaderSize: 64			// CHECK-NEXT: HeaderSize: 64
	// CHECK-NEXT: ProgramHeaderEntrySize: 56			// CHECK-NEXT: ProgramHeaderEntrySize: 56
	// CHECK-NEXT: ProgramHeaderCount: 7			// CHECK-NEXT: ProgramHeaderCount: 7
	// CHECK-NEXT: SectionHeaderEntrySize: 64			// CHECK-NEXT: SectionHeaderEntrySize: 64
	// CHECK-NEXT: SectionHeaderCount: 11			// CHECK-NEXT: SectionHeaderCount: 11
	▲ Show 20 Lines • Show All 75 Lines • ▼ Show 20 Lines
	// CHECK-NEXT: Section {			// CHECK-NEXT: Section {
	// CHECK-NEXT: Index: 4			// CHECK-NEXT: Index: 4
	// CHECK-NEXT: Name: .text (23)			// CHECK-NEXT: Name: .text (23)
	// CHECK-NEXT: Type: SHT_PROGBITS (0x1)			// CHECK-NEXT: Type: SHT_PROGBITS (0x1)
	// CHECK-NEXT: Flags [ (0x6)			// CHECK-NEXT: Flags [ (0x6)
	// CHECK-NEXT: SHF_ALLOC (0x2)			// CHECK-NEXT: SHF_ALLOC (0x2)
	// CHECK-NEXT: SHF_EXECINSTR (0x4)			// CHECK-NEXT: SHF_EXECINSTR (0x4)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Address: 0x10000			// CHECK-NEXT: Address: 0x1022C
	// CHECK-NEXT: Offset: 0x10000			// CHECK-NEXT: Offset: 0x22C
	// CHECK-NEXT: Size: 12			// CHECK-NEXT: Size: 12
	// CHECK-NEXT: Link: 0			// CHECK-NEXT: Link: 0
	// CHECK-NEXT: Info: 0			// CHECK-NEXT: Info: 0
	// CHECK-NEXT: AddressAlignment: 4			// CHECK-NEXT: AddressAlignment: 4
	// CHECK-NEXT: EntrySize: 0			// CHECK-NEXT: EntrySize: 0
	// CHECK-NEXT: SectionData (			// CHECK-NEXT: SectionData (
	// LE-NEXT: 0000: 01000038 37006038 02000044			// LE-NEXT: 0000: 01000038 37006038 02000044
	// BE-NEXT: 0000: 38000001 38600037 44000002			// BE-NEXT: 0000: 38000001 38600037 44000002
	// CHECK-NEXT: )			// CHECK-NEXT: )
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: Section {			// CHECK-NEXT: Section {
	// CHECK-NEXT: Index: 5			// CHECK-NEXT: Index: 5
	// CHECK-NEXT: Name: .dynamic (29)			// CHECK-NEXT: Name: .dynamic (29)
	// CHECK-NEXT: Type: SHT_DYNAMIC (0x6)			// CHECK-NEXT: Type: SHT_DYNAMIC (0x6)
	// CHECK-NEXT: Flags [ (0x3)			// CHECK-NEXT: Flags [ (0x3)
	// CHECK-NEXT: SHF_ALLOC (0x2)			// CHECK-NEXT: SHF_ALLOC (0x2)
	// CHECK-NEXT: SHF_WRITE (0x1)			// CHECK-NEXT: SHF_WRITE (0x1)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Address: 0x20000			// CHECK-NEXT: Address: 0x20238
	// CHECK-NEXT: Offset: 0x20000			// CHECK-NEXT: Offset: 0x238
	// CHECK-NEXT: Size: 96			// CHECK-NEXT: Size: 96
	// CHECK-NEXT: Link: 3			// CHECK-NEXT: Link: 3
	// CHECK-NEXT: Info: 0			// CHECK-NEXT: Info: 0
	// CHECK-NEXT: AddressAlignment: 8			// CHECK-NEXT: AddressAlignment: 8
	// CHECK-NEXT: EntrySize: 16			// CHECK-NEXT: EntrySize: 16
	// CHECK-NEXT: SectionData (			// CHECK-NEXT: SectionData (
	// LE-NEXT: 0000: 06000000 00000000 00020000 00000000 \|			// LE-NEXT: 0000: 06000000 00000000 00020000 00000000 \|
	// LE-NEXT: 0010: 0B000000 00000000 18000000 00000000 \|			// LE-NEXT: 0010: 0B000000 00000000 18000000 00000000 \|
	Show All 12 Lines
	// CHECK-NEXT: Section {			// CHECK-NEXT: Section {
	// CHECK-NEXT: Index: 6			// CHECK-NEXT: Index: 6
	// CHECK-NEXT: Name: .branch_lt (38)			// CHECK-NEXT: Name: .branch_lt (38)
	// CHECK-NEXT: Type: SHT_NOBITS (0x8)			// CHECK-NEXT: Type: SHT_NOBITS (0x8)
	// CHECK-NEXT: Flags [ (0x3)			// CHECK-NEXT: Flags [ (0x3)
	// CHECK-NEXT: SHF_ALLOC (0x2)			// CHECK-NEXT: SHF_ALLOC (0x2)
	// CHECK-NEXT: SHF_WRITE (0x1)			// CHECK-NEXT: SHF_WRITE (0x1)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Address: 0x30000			// CHECK-NEXT: Address: 0x30298
	// CHECK-NEXT: Offset: 0x20060			// CHECK-NEXT: Offset: 0x298
	// CHECK-NEXT: Size: 0			// CHECK-NEXT: Size: 0
	// CHECK-NEXT: Link: 0			// CHECK-NEXT: Link: 0
	// CHECK-NEXT: Info: 0			// CHECK-NEXT: Info: 0
	// CHECK-NEXT: AddressAlignment: 8			// CHECK-NEXT: AddressAlignment: 8
	// CHECK-NEXT: EntrySize: 0			// CHECK-NEXT: EntrySize: 0
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: Section {			// CHECK-NEXT: Section {
	// CHECK-NEXT: Index: 7			// CHECK-NEXT: Index: 7
	// CHECK-NEXT: Name: .comment (49)			// CHECK-NEXT: Name: .comment (49)
	// CHECK-NEXT: Type: SHT_PROGBITS (0x1)			// CHECK-NEXT: Type: SHT_PROGBITS (0x1)
	// CHECK-NEXT: Flags [ (0x30)			// CHECK-NEXT: Flags [ (0x30)
	// CHECK-NEXT: SHF_MERGE (0x10)			// CHECK-NEXT: SHF_MERGE (0x10)
	// CHECK-NEXT: SHF_STRINGS (0x20)			// CHECK-NEXT: SHF_STRINGS (0x20)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Address: 0x0			// CHECK-NEXT: Address: 0x0
	// CHECK-NEXT: Offset: 0x20060			// CHECK-NEXT: Offset: 0x298
	// CHECK-NEXT: Size: 8			// CHECK-NEXT: Size: 8
	// CHECK-NEXT: Link: 0			// CHECK-NEXT: Link: 0
	// CHECK-NEXT: Info: 0			// CHECK-NEXT: Info: 0
	// CHECK-NEXT: AddressAlignment: 1			// CHECK-NEXT: AddressAlignment: 1
	// CHECK-NEXT: EntrySize: 1			// CHECK-NEXT: EntrySize: 1
	// CHECK-NEXT: SectionData (			// CHECK-NEXT: SectionData (
	// CHECK-NEXT: 0000: 4C4C4420 312E3000 \|LLD 1.0.\|			// CHECK-NEXT: 0000: 4C4C4420 312E3000 \|LLD 1.0.\|
	// CHECK-NEXT: )			// CHECK-NEXT: )
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: Section {			// CHECK-NEXT: Section {
	// CHECK-NEXT: Index: 8			// CHECK-NEXT: Index: 8
	// CHECK-NEXT: Name: .symtab (58)			// CHECK-NEXT: Name: .symtab (58)
	// CHECK-NEXT: Type: SHT_SYMTAB (0x2)			// CHECK-NEXT: Type: SHT_SYMTAB (0x2)
	// CHECK-NEXT: Flags [ (0x0)			// CHECK-NEXT: Flags [ (0x0)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Address: 0x0			// CHECK-NEXT: Address: 0x0
	// CHECK-NEXT: Offset: 0x20068			// CHECK-NEXT: Offset: 0x2A0
	// CHECK-NEXT: Size: 48			// CHECK-NEXT: Size: 48
	// CHECK-NEXT: Link: 10			// CHECK-NEXT: Link: 10
	// CHECK-NEXT: Info: 2			// CHECK-NEXT: Info: 2
	// CHECK-NEXT: AddressAlignment: 8			// CHECK-NEXT: AddressAlignment: 8
	// CHECK-NEXT: EntrySize: 24			// CHECK-NEXT: EntrySize: 24
	// CHECK-NEXT: SectionData (			// CHECK-NEXT: SectionData (
	// LE-NEXT: 0000: 00000000 00000000 00000000 00000000 \|................\|			// LE-NEXT: 0000: 00000000 00000000 00000000 00000000
	// LE-NEXT: 0010: 00000000 00000000 01000000 00020500 \|................\|			// LE-NEXT: 0010: 00000000 00000000 01000000 00020500
	// LE-NEXT: 0020: 00000200 00000000 00000000 00000000 \|................\|			// LE-NEXT: 0020: 38020200 00000000 00000000 00000000
	// BE-NEXT: 0000: 00000000 00000000 00000000 00000000 \|................\|			// BE-NEXT: 0000: 00000000 00000000 00000000 00000000
	// BE-NEXT: 0010: 00000000 00000000 00000001 00020005 \|................\|			// BE-NEXT: 0010: 00000000 00000000 00000001 00020005
	// BE-NEXT: 0020: 00000000 00020000 00000000 00000000 \|................\|			// BE-NEXT: 0020: 00000000 00020238 00000000 00000000
	// CHECK-NEXT: )			// CHECK-NEXT: )
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: Section {			// CHECK-NEXT: Section {
	// CHECK-NEXT: Index: 9			// CHECK-NEXT: Index: 9
	// CHECK-NEXT: Name: .shstrtab (66)			// CHECK-NEXT: Name: .shstrtab (66)
	// CHECK-NEXT: Type: SHT_STRTAB (0x3)			// CHECK-NEXT: Type: SHT_STRTAB (0x3)
	// CHECK-NEXT: Flags [ (0x0)			// CHECK-NEXT: Flags [ (0x0)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Address: 0x0			// CHECK-NEXT: Address: 0x0
	// CHECK-NEXT: Offset: 0x20098			// CHECK-NEXT: Offset: 0x2D0
	// CHECK-NEXT: Size: 84			// CHECK-NEXT: Size: 84
	// CHECK-NEXT: Link: 0			// CHECK-NEXT: Link: 0
	// CHECK-NEXT: Info: 0			// CHECK-NEXT: Info: 0
	// CHECK-NEXT: AddressAlignment: 1			// CHECK-NEXT: AddressAlignment: 1
	// CHECK-NEXT: EntrySize: 0			// CHECK-NEXT: EntrySize: 0
	// CHECK-NEXT: SectionData (			// CHECK-NEXT: SectionData (
	// CHECK-NEXT: 0000: 002E6479 6E73796D 002E6861 7368002E \|..dynsym..hash..\|			// CHECK-NEXT: 0000: 002E6479 6E73796D 002E6861 7368002E \|..dynsym..hash..\|
	// CHECK-NEXT: 0010: 64796E73 7472002E 74657874 002E6479 \|dynstr..text..dy\|			// CHECK-NEXT: 0010: 64796E73 7472002E 74657874 002E6479 \|dynstr..text..dy\|
	// CHECK-NEXT: 0020: 6E616D69 63002E62 72616E63 685F6C74 \|namic..branch_lt\|			// CHECK-NEXT: 0020: 6E616D69 63002E62 72616E63 685F6C74 \|namic..branch_lt\|
	// CHECK-NEXT: 0030: 002E636F 6D6D656E 74002E73 796D7461 \|..comment..symta\|			// CHECK-NEXT: 0030: 002E636F 6D6D656E 74002E73 796D7461 \|..comment..symta\|
	// CHECK-NEXT: 0040: 62002E73 68737472 74616200 2E737472 \|b..shstrtab..str\|			// CHECK-NEXT: 0040: 62002E73 68737472 74616200 2E737472 \|b..shstrtab..str\|
	// CHECK-NEXT: 0050: 74616200 \|tab.\|			// CHECK-NEXT: 0050: 74616200 \|tab.\|
	// CHECK-NEXT: )			// CHECK-NEXT: )
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: Section {			// CHECK-NEXT: Section {
	// CHECK-NEXT: Index: 10			// CHECK-NEXT: Index: 10
	// CHECK-NEXT: Name: .strtab (76)			// CHECK-NEXT: Name: .strtab (76)
	// CHECK-NEXT: Type: SHT_STRTAB (0x3)			// CHECK-NEXT: Type: SHT_STRTAB (0x3)
	// CHECK-NEXT: Flags [ (0x0)			// CHECK-NEXT: Flags [ (0x0)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Address: 0x0			// CHECK-NEXT: Address: 0x0
	// CHECK-NEXT: Offset: 0x200EC			// CHECK-NEXT: Offset: 0x324
	// CHECK-NEXT: Size: 10			// CHECK-NEXT: Size: 10
	// CHECK-NEXT: Link: 0			// CHECK-NEXT: Link: 0
	// CHECK-NEXT: Info: 0			// CHECK-NEXT: Info: 0
	// CHECK-NEXT: AddressAlignment: 1			// CHECK-NEXT: AddressAlignment: 1
	// CHECK-NEXT: EntrySize: 0			// CHECK-NEXT: EntrySize: 0
	// CHECK-NEXT: SectionData (			// CHECK-NEXT: SectionData (
	// CHECK-NEXT: 0000: 005F4459 4E414D49 4300 \|._DYNAMIC.\|			// CHECK-NEXT: 0000: 005F4459 4E414D49 4300 \|._DYNAMIC.\|
	// CHECK-NEXT: )			// CHECK-NEXT: )
	Show All 21 Lines
	// CHECK-NEXT: MemSize: 553			// CHECK-NEXT: MemSize: 553
	// CHECK-NEXT: Flags [ (0x4)			// CHECK-NEXT: Flags [ (0x4)
	// CHECK-NEXT: PF_R (0x4)			// CHECK-NEXT: PF_R (0x4)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Alignment: 65536			// CHECK-NEXT: Alignment: 65536
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: ProgramHeader {			// CHECK-NEXT: ProgramHeader {
	// CHECK-NEXT: Type: PT_LOAD (0x1)			// CHECK-NEXT: Type: PT_LOAD (0x1)
	// CHECK-NEXT: Offset: 0x10000			// CHECK-NEXT: Offset: 0x22C
	// CHECK-NEXT: VirtualAddress: 0x10000			// CHECK-NEXT: VirtualAddress: 0x1022C
	// CHECK-NEXT: PhysicalAddress: 0x10000			// CHECK-NEXT: PhysicalAddress: 0x1022C
	// CHECK-NEXT: FileSize: 12			// CHECK-NEXT: FileSize: 12
	// CHECK-NEXT: MemSize: 12			// CHECK-NEXT: MemSize: 12
	// CHECK-NEXT: Flags [ (0x5)			// CHECK-NEXT: Flags [ (0x5)
	// CHECK-NEXT: PF_R (0x4)			// CHECK-NEXT: PF_R (0x4)
	// CHECK-NEXT: PF_X (0x1)			// CHECK-NEXT: PF_X (0x1)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Alignment: 65536			// CHECK-NEXT: Alignment: 65536
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: ProgramHeader {			// CHECK-NEXT: ProgramHeader {
	// CHECK-NEXT: Type: PT_LOAD (0x1)			// CHECK-NEXT: Type: PT_LOAD (0x1)
	// CHECK-NEXT: Offset: 0x20000			// CHECK-NEXT: Offset: 0x238
	// CHECK-NEXT: VirtualAddress: 0x20000			// CHECK-NEXT: VirtualAddress: 0x20238
	// CHECK-NEXT: PhysicalAddress: 0x20000			// CHECK-NEXT: PhysicalAddress: 0x20238
	// CHECK-NEXT: FileSize: 96			// CHECK-NEXT: FileSize: 96
	// CHECK-NEXT: MemSize: 96			// CHECK-NEXT: MemSize: 96
	// CHECK-NEXT: Flags [ (0x6)			// CHECK-NEXT: Flags [ (0x6)
	// CHECK-NEXT: PF_R (0x4)			// CHECK-NEXT: PF_R (0x4)
	// CHECK-NEXT: PF_W (0x2)			// CHECK-NEXT: PF_W (0x2)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Alignment: 65536			// CHECK-NEXT: Alignment: 65536
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: ProgramHeader {			// CHECK-NEXT: ProgramHeader {
	// CHECK-NEXT: Type: PT_DYNAMIC (0x2)			// CHECK-NEXT: Type: PT_DYNAMIC (0x2)
	// CHECK-NEXT: Offset: 0x20000			// CHECK-NEXT: Offset: 0x238
	// CHECK-NEXT: VirtualAddress: 0x20000			// CHECK-NEXT: VirtualAddress: 0x20238
	// CHECK-NEXT: PhysicalAddress: 0x20000			// CHECK-NEXT: PhysicalAddress: 0x20238
	// CHECK-NEXT: FileSize: 96			// CHECK-NEXT: FileSize: 96
	// CHECK-NEXT: MemSize: 96			// CHECK-NEXT: MemSize: 96
	// CHECK-NEXT: Flags [ (0x6)			// CHECK-NEXT: Flags [ (0x6)
	// CHECK-NEXT: PF_R (0x4)			// CHECK-NEXT: PF_R (0x4)
	// CHECK-NEXT: PF_W (0x2)			// CHECK-NEXT: PF_W (0x2)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Alignment: 8			// CHECK-NEXT: Alignment: 8
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: ProgramHeader {			// CHECK-NEXT: ProgramHeader {
	// CHECK-NEXT: Type: PT_GNU_RELRO (0x6474E552)			// CHECK-NEXT: Type: PT_GNU_RELRO (0x6474E552)
	// CHECK-NEXT: Offset: 0x20000			// CHECK-NEXT: Offset: 0x238
	// CHECK-NEXT: VirtualAddress: 0x20000			// CHECK-NEXT: VirtualAddress: 0x20238
	// CHECK-NEXT: PhysicalAddress: 0x20000			// CHECK-NEXT: PhysicalAddress: 0x20238
	// CHECK-NEXT: FileSize: 96			// CHECK-NEXT: FileSize: 96
	// CHECK-NEXT: MemSize: 4096			// CHECK-NEXT: MemSize: 3528
	// CHECK-NEXT: Flags [ (0x4)			// CHECK-NEXT: Flags [ (0x4)
	// CHECK-NEXT: PF_R (0x4)			// CHECK-NEXT: PF_R (0x4)
	// CHECK-NEXT: ]			// CHECK-NEXT: ]
	// CHECK-NEXT: Alignment: 1			// CHECK-NEXT: Alignment: 1
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: ProgramHeader {			// CHECK-NEXT: ProgramHeader {
	// CHECK-NEXT: Type: PT_GNU_STACK (0x6474E551)			// CHECK-NEXT: Type: PT_GNU_STACK (0x6474E551)
	// CHECK-NEXT: Offset: 0x0			// CHECK-NEXT: Offset: 0x0
	Show All 11 Lines

test/ELF/ppc64-abs64-dyn.s

	# REQUIRES: ppc			# REQUIRES: ppc
	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	# RUN: ld.lld -shared %t.o -o %t.so			# RUN: ld.lld -shared %t.o -o %t.so
	# RUN: llvm-readobj -r %t.so \| FileCheck %s			# RUN: llvm-readobj -r %t.so \| FileCheck %s

	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	# RUN: ld.lld -shared %t.o -o %t.so			# RUN: ld.lld -shared %t.o -o %t.so
	# RUN: llvm-readobj -r %t.so \| FileCheck %s			# RUN: llvm-readobj -r %t.so \| FileCheck %s

	## Test that we create R_PPC64_RELATIVE for R_PPC64_ADDR64 to non-preemptable			## Test that we create R_PPC64_RELATIVE for R_PPC64_ADDR64 to non-preemptable
	## symbols and R_PPC64_TOC in writable sections.			## symbols and R_PPC64_TOC in writable sections.

	## FIXME the addend for offset 0x20000 should be TOC base+0x8000+1, not 0x80001.			## FIXME the addend for offset 0x20000 should be TOC base+0x8000+1, not 0x80001.
	# CHECK: .rela.dyn {			# CHECK: .rela.dyn {
	# CHECK-NEXT: 0x20000 R_PPC64_RELATIVE - 0x8001			# CHECK-NEXT: 0x303B0 R_PPC64_RELATIVE - 0x8001
	# CHECK-NEXT: 0x20008 R_PPC64_RELATIVE - 0x20001			# CHECK-NEXT: 0x303B8 R_PPC64_RELATIVE - 0x303B1
	# CHECK-NEXT: 0x20010 R_PPC64_ADDR64 external 0x1			# CHECK-NEXT: 0x303C0 R_PPC64_ADDR64 external 0x1
	# CHECK-NEXT: 0x20018 R_PPC64_ADDR64 global 0x1			# CHECK-NEXT: 0x303C8 R_PPC64_ADDR64 global 0x1
	# CHECK-NEXT: }			# CHECK-NEXT: }

	.data			.data
	.globl global			.globl global
	global:			global:
	local:			local:

	.quad .TOC.@tocbase + 1			.quad .TOC.@tocbase + 1
	.quad local + 1			.quad local + 1
	.quad external + 1			.quad external + 1
	.quad global + 1			.quad global + 1

test/ELF/ppc64-bsymbolic-toc-restore.s

	Show First 20 Lines • Show All 57 Lines • ▼ Show 20 Lines
	# CHECK-NEXT: add 3, 3, 31			# CHECK-NEXT: add 3, 3, 31
	# CHECK-NEXT: addi 1, 1, 32			# CHECK-NEXT: addi 1, 1, 32
	# CHECK-NEXT: ld 0, -16(1)			# CHECK-NEXT: ld 0, -16(1)
	# CHECK-NEXT: mtlr 0			# CHECK-NEXT: mtlr 0
	# CHECK-NEXT: blr			# CHECK-NEXT: blr
	# CHECK-EMPTY:			# CHECK-EMPTY:
	# CHECK-NEXT: def:			# CHECK-NEXT: def:
	# CHECK-NEXT: addis 2, 12, 2			# CHECK-NEXT: addis 2, 12, 2
	# CHECK-NEXT: addi 2, 2, -32616			# CHECK-NEXT: addi 2, 2, -32456
	# CHECK-NEXT: li 3, 55			# CHECK-NEXT: li 3, 55
	# CHECK-NEXT: blr			# CHECK-NEXT: blr

test/ELF/ppc64-call-reach.s

	# REQUIRES: ppc			# REQUIRES: ppc

	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	# RUN: ld.lld --defsym callee=0x12010010 --defsym tail_callee=0x12010020 \			# RUN: ld.lld --defsym callee=0x12010010 --defsym tail_callee=0x12010020 \
	# RUN: %t.o -o %t			# RUN: -z separate-code %t.o -o %t
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s
	# RUN: ld.lld --defsym callee=0x12010010 --defsym tail_callee=0x12010020 \			# RUN: ld.lld --defsym callee=0x12010010 --defsym tail_callee=0x12010020 \
	# RUN: %t.o -o %t			# RUN: -z separate-code %t.o -o %t
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s
	# RUN: ld.lld --defsym callee=0xE010014 --defsym tail_callee=0xE010024 \			# RUN: ld.lld --defsym callee=0xE010014 --defsym tail_callee=0xE010024 \
	# RUN: %t.o -o %t			# RUN: -z separate-code %t.o -o %t
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=NEGOFFSET %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=NEGOFFSET %s
	# RUN: ld.lld --defsym callee=0x12010018 --defsym tail_callee=0x12010028 \			# RUN: ld.lld --defsym callee=0x12010018 --defsym tail_callee=0x12010028 \
	# RUN: %t.o -o %t			# RUN: -z separate-code %t.o -o %t
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=THUNK %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=THUNK %s
	# RUN: llvm-readelf --sections %t \| FileCheck --check-prefix=BRANCHLT %s			# RUN: llvm-readelf --sections %t \| FileCheck --check-prefix=BRANCHLT %s
	# RUN: not ld.lld --defsym callee=0x1001002D --defsym tail_callee=0x1001002F \			# RUN: not ld.lld --defsym callee=0x1001002D --defsym tail_callee=0x1001002F \
	# RUN: %t.o -o %t 2>&1 \| FileCheck --check-prefix=MISSALIGNED %s			# RUN: -z separate-code %t.o -o %t 2>&1 \| FileCheck --check-prefix=MISSALIGNED %s

	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	# RUN: ld.lld --defsym callee=0x12010010 --defsym tail_callee=0x12010020 \			# RUN: ld.lld --defsym callee=0x12010010 --defsym tail_callee=0x12010020 \
	# RUN: %t.o -o %t			# RUN: -z separate-code %t.o -o %t
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s
	# RUN: ld.lld --defsym callee=0x12010010 --defsym tail_callee=0x12010020 \			# RUN: ld.lld --defsym callee=0x12010010 --defsym tail_callee=0x12010020 \
	# RUN: %t.o -o %t			# RUN: -z separate-code %t.o -o %t
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s
	# RUN: ld.lld --defsym callee=0xE010014 --defsym tail_callee=0xE010024 \			# RUN: ld.lld --defsym callee=0xE010014 --defsym tail_callee=0xE010024 \
	# RUN: %t.o -o %t			# RUN: -z separate-code %t.o -o %t
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=NEGOFFSET %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=NEGOFFSET %s
	# RUN: ld.lld --defsym callee=0x12010018 --defsym tail_callee=0x12010028 \			# RUN: ld.lld --defsym callee=0x12010018 --defsym tail_callee=0x12010028 \
	# RUN: %t.o -o %t			# RUN: -z separate-code %t.o -o %t
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=THUNK %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=THUNK %s
	# RUN: llvm-readelf --sections %t \| FileCheck --check-prefix=BRANCHLT %s			# RUN: llvm-readelf --sections %t \| FileCheck --check-prefix=BRANCHLT %s
	# RUN: not ld.lld --defsym callee=0x1001002D --defsym tail_callee=0x1001002F \			# RUN: not ld.lld --defsym callee=0x1001002D --defsym tail_callee=0x1001002F \
	# RUN: %t.o -o %t 2>&1 \| FileCheck --check-prefix=MISSALIGNED %s			# RUN: -z separate-code %t.o -o %t 2>&1 \| FileCheck --check-prefix=MISSALIGNED %s

	# MISSALIGNED: ld.lld: error: {{.*}}.o:(.text+0x14): improper alignment for relocation R_PPC64_REL24: 0x19 is not aligned to 4 bytes			# MISSALIGNED: ld.lld: error: {{.*}}.o:(.text+0x14): improper alignment for relocation R_PPC64_REL24: 0x19 is not aligned to 4 bytes
	# MISSALIGNED: ld.lld: error: {{.*}}.o:(.text+0x24): improper alignment for relocation R_PPC64_REL24: 0xB is not aligned to 4 bytes			# MISSALIGNED: ld.lld: error: {{.*}}.o:(.text+0x24): improper alignment for relocation R_PPC64_REL24: 0xB is not aligned to 4 bytes

	.global test			.global test
	.p2align 4			.p2align 4
	.type test,@function			.type test,@function
	test:			test:
	Show All 23 Lines

	# THUNK-LABEL: test:			# THUNK-LABEL: test:
	# THUNK: 10010014: bl .+20			# THUNK: 10010014: bl .+20
	# THUNK: 10010024: b .+20			# THUNK: 10010024: b .+20

	# .branch_lt[0]			# .branch_lt[0]
	# THUNK-LABEL: __long_branch_callee:			# THUNK-LABEL: __long_branch_callee:
	# THUNK-NEXT: 10010028: addis 12, 2, 1			# THUNK-NEXT: 10010028: addis 12, 2, 1
	# THUNK-NEXT: ld 12, -32768(12)			# THUNK-NEXT: ld 12, -32760(12)
	# THUNK-NEXT: mtctr 12			# THUNK-NEXT: mtctr 12
	# THUNK-NEXT: bctr			# THUNK-NEXT: bctr

	# .branch_lt[1]			# .branch_lt[1]
	# THUNK-LABEL: __long_branch_tail_callee:			# THUNK-LABEL: __long_branch_tail_callee:
	# THUNK-NEXT: 10010038: addis 12, 2, 1			# THUNK-NEXT: 10010038: addis 12, 2, 1
	# THUNK-NEXT: ld 12, -32760(12)			# THUNK-NEXT: ld 12, -32752(12)
	# THUNK-NEXT: mtctr 12			# THUNK-NEXT: mtctr 12
	# THUNK-NEXT: bctr			# THUNK-NEXT: bctr

	# The offset from the TOC to the .branch_lt section is (-1 << 16) - 32768.			# The offset from the TOC to the .branch_lt section is (-1 << 16) - 32768.
	# Name Type Address Off Size			# Name Type Address Off Size
	# BRANCHLT: .got PROGBITS 0000000010020000 020000 000008			# BRANCHLT: .got PROGBITS 0000000010020000 020000 000008
	# BRANCHLT: .branch_lt PROGBITS 0000000010030000 030000 000010			# BRANCHLT: .branch_lt PROGBITS 0000000010030008 020008 000010
	# BRANCHLT-NOT: .plt			# BRANCHLT-NOT: .plt

test/ELF/ppc64-dq.s

Show All 22 Lines	.Llep:
stxv 3, qword@toc@l(3)		stxv 3, qword@toc@l(3)
blr		blr

.comm qword, 16, 16		.comm qword, 16, 16

# Verify that we don't overwrite any of the extended opcode bits on a DQ form		# Verify that we don't overwrite any of the extended opcode bits on a DQ form
# instruction.		# instruction.
# CHECK-LABEL: test		# CHECK-LABEL: test
# CHECK: lxv 3, -32768(3)		# CHECK: addis 3, 2, 1
# CHECK: stxv 3, -32768(3)		# CHECK-NEXT: lxv 3, -32752(3)
		# CHECK-NEXT: addis 3, 2, 1
		# CHECK-NEXT: stxv 3, -32752(3)

test/ELF/ppc64-dtprel.s

	Show First 20 Lines • Show All 131 Lines • ▼ Show 20 Lines
	// InputRelocs: R_PPC64_DTPREL64 {{[0-9a-f]+}} i + 8000			// InputRelocs: R_PPC64_DTPREL64 {{[0-9a-f]+}} i + 8000

	// Expect a single dynamic relocation in the '.rela.dyn section for the module id.			// Expect a single dynamic relocation in the '.rela.dyn section for the module id.
	// OutputRelocs: Relocation section '.rela.dyn' at offset 0x{{[0-9a-f]+}} contains 1 entries:			// OutputRelocs: Relocation section '.rela.dyn' at offset 0x{{[0-9a-f]+}} contains 1 entries:
	// OutputRelocs-NEXT: Offset Info Type Symbol's Value Symbol's Name + Addend			// OutputRelocs-NEXT: Offset Info Type Symbol's Value Symbol's Name + Addend
	// OutputRelocs-NEXT: R_PPC64_DTPMOD64			// OutputRelocs-NEXT: R_PPC64_DTPMOD64


	// The got entry for i is at .got+8*3 = 0x420510			// The got entry for i is at .got+8*1 = 0x4209e0
	// i@dtprel = 1024 - 0x8000 = -31744 = 0xffffffffffff8400			// i@dtprel = 1024 - 0x8000 = -31744 = 0xffffffffffff8400
	// HEX-LE: section '.got':			// HEX-LE: section '.got':
	// HEX-LE-NEXT: 4204f8 f8844200 00000000 00000000 00000000			// HEX-LE-NEXT: 4209d8 d8894200 00000000 00000000 00000000
	// HEX-LE-NEXT: 420508 00000000 00000000			// HEX-LE-NEXT: 4209e8 00000000 00000000

	// HEX-BE: section '.got':			// HEX-BE: section '.got':
	// HEX-BE-NEXT: 4204f8 00000000 004284f8 00000000 00000000			// HEX-BE-NEXT: 4209d8 00000000 004289d8 00000000 00000000
	// HEX-BE-NEXT: 420508 00000000 00000000			// HEX-BE-NEXT: 4209e8 00000000 00000000

	// Dis: test:			// Dis: test:
	// Dis: addi 4, 3, -31744			// Dis: addi 4, 3, -31744
	// Dis-NEXT: lwa 4, -31744(3)			// Dis-NEXT: lwa 4, -31744(3)

	// #k@dtprel(1024 + 4 + 1024 * 1024 * 4) = 0x400404			// #k@dtprel(1024 + 4 + 1024 * 1024 * 4) = 0x400404

	// #highesta(k@dtprel) --> ((0x400404 - 0x8000 + 0x8000) >> 48) & 0xffff = 0			// #highesta(k@dtprel) --> ((0x400404 - 0x8000 + 0x8000) >> 48) & 0xffff = 0
	Show All 19 Lines

test/ELF/ppc64-entry-point.s

Show All 28 Lines	.Lfunc_gep0:

# exit 55		# exit 55
li 0, 1		li 0, 1
li 3, 55		li 3, 55
sc		sc
.Lfunc_end0:		.Lfunc_end0:
.size _start, .Lfunc_end0-.Lfunc_begin0		.size _start, .Lfunc_end0-.Lfunc_begin0

# NM-DAG: 0000000010028000 d .TOC.		# NM-DAG: 00000000100281f0 d .TOC.
# NM-DAG: 0000000010010000 T _start		# NM-DAG: 00000000100101d0 T _start

# 0x10010000 = (4097<<16) + 0		# 0x100101d0 = (4097<<16) + 464
# CHECK: 10010000: lis 4, 4097		# CHECK: 100101d0: lis 4, 4097
# CHECK-NEXT: 10010004: addi 4, 4, 0		# CHECK-NEXT: 100101d4: addi 4, 4, 464
# .TOC. - _start = (2<<16) - 32768		# .TOC. - _start = (2<<16) - 32736
# CHECK-NEXT: 10010008: lis 5, 2		# CHECK-NEXT: 100101d8: lis 5, 2
# CHECK-NEXT: 1001000c: addi 5, 5, -32768		# CHECK-NEXT: 100101dc: addi 5, 5, -32736

test/ELF/ppc64-error-missaligned-dq.s

	# REQUIRES: ppc			# REQUIRES: ppc
	#			#
	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	# RUN: not ld.lld %t.o -o %t 2>&1 \| FileCheck %s			# RUN: not ld.lld %t.o -o %t 2>&1 \| FileCheck %s

	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	# RUN: not ld.lld %t.o -o %t 2>&1 \| FileCheck %s			# RUN: not ld.lld %t.o -o %t 2>&1 \| FileCheck %s

	# CHECK: improper alignment for relocation R_PPC64_TOC16_LO_DS: 0x8001 is not aligned to 16 bytes			# CHECK: improper alignment for relocation R_PPC64_TOC16_LO_DS: 0x8009 is not aligned to 16 bytes

	.global test			.global test
	.p2align 4			.p2align 4
	.type test,@function			.type test,@function
	test:			test:
	.Lgep:			.Lgep:
	addis 2, 12, .TOC.-.Lgep@ha			addis 2, 12, .TOC.-.Lgep@ha
	addi 2, 2, .TOC.-.Lgep@l			addi 2, 2, .TOC.-.Lgep@l
	.Llep:			.Llep:
	.localentry test, .Llep-.Lgep			.localentry test, .Llep-.Lgep
	addis 3, 2, qword@toc@ha			addis 3, 2, qword@toc@ha
	lxv 3, qword@toc@l(3)			lxv 3, qword@toc@l(3)
	blr			blr

				.p2align 4
	.comm pad, 1, 1			.comm pad, 1, 1
	.comm qword, 16, 1			.comm qword, 16, 1

test/ELF/ppc64-error-missaligned-ds.s

	# REQUIRES: ppc			# REQUIRES: ppc

	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	# RUN: not ld.lld %t.o -o %t 2>&1 \| FileCheck %s			# RUN: not ld.lld %t.o -o %t 2>&1 \| FileCheck %s

	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	# RUN: not ld.lld %t.o -o %t 2>&1 \| FileCheck %s			# RUN: not ld.lld %t.o -o %t 2>&1 \| FileCheck %s

	# CHECK: improper alignment for relocation R_PPC64_TOC16_LO_DS: 0x8001 is not aligned to 4 bytes			# CHECK: improper alignment for relocation R_PPC64_TOC16_LO_DS: 0x8009 is not aligned to 4 bytes

	.global test			.global test
	.p2align 4			.p2align 4
	.type test,@function			.type test,@function
	test:			test:
	.Lgep:			.Lgep:
	addis 2, 12, .TOC.-.Lgep@ha			addis 2, 12, .TOC.-.Lgep@ha
	addi 2, 2, .TOC.-.Lgep@l			addi 2, 2, .TOC.-.Lgep@l
	.Llep:			.Llep:
	.localentry test, .Llep-.Lgep			.localentry test, .Llep-.Lgep
	addis 3, 2, word@toc@ha			addis 3, 2, word@toc@ha
	lwa 3, word@toc@l(3)			lwa 3, word@toc@l(3)
	blr			blr

				.p2align 4
	.comm pad, 1, 1			.comm pad, 1, 1
	.comm word, 4, 1			.comm word, 4, 1

test/ELF/ppc64-func-entry-points.s

	// REQUIRES: ppc			// REQUIRES: ppc

	// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-func-global-entry.s -o %t2.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-func-global-entry.s -o %t2.o
	// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-func-local-entry.s -o %t3.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-func-local-entry.s -o %t3.o
	// RUN: ld.lld -dynamic-linker /lib64/ld64.so.2 %t.o %t2.o %t3.o -o %t			// RUN: ld.lld -dynamic-linker /lib64/ld64.so.2 %t.o %t2.o %t3.o -o %t
	// RUN: llvm-objdump -d %t \| FileCheck %s			// RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s

	// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-func-global-entry.s -o %t2.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-func-global-entry.s -o %t2.o
	// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-func-local-entry.s -o %t3.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-func-local-entry.s -o %t3.o
	// RUN: ld.lld -dynamic-linker /lib64/ld64.so.2 %t.o %t2.o %t3.o -o %t			// RUN: ld.lld -dynamic-linker /lib64/ld64.so.2 %t.o %t2.o %t3.o -o %t
	// RUN: llvm-objdump -d %t \| FileCheck %s			// RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s

	.text			.text
	.abiversion 2			.abiversion 2
	.globl _start # -- Begin function _start			.globl _start # -- Begin function _start
	.p2align 4			.p2align 4
	.type _start,@function			.type _start,@function
	_start: # @_start			_start: # @_start
	.Lfunc_begin0:			.Lfunc_begin0:
	▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines
	glob:			glob:
	.long 10 # 0xa			.long 10 # 0xa
	.size glob, 4			.size glob, 4

	# Check that foo_external_diff has a global entry point and we branch to			# Check that foo_external_diff has a global entry point and we branch to
	# foo_external_diff+8. Also check that foo_external_same has no global entry			# foo_external_diff+8. Also check that foo_external_same has no global entry
	# point and we branch to start of foo_external_same.			# point and we branch to start of foo_external_same.

	// CHECK: _start:			// CHECK-LABEL: _start:
	// CHECK: 10010020: {{.*}} bl .+144			// CHECK: 100101f0: bl .+144
	// CHECK: 10010034: {{.*}} bl .+84			// CHECK: 10010204: bl .+84
	// CHECK: foo_external_diff:			// CHECK-LABEL: foo_external_diff:
	// CHECK-NEXT: 10010080: {{.*}} addis 2, 12, 1			// CHECK-NEXT: 10010250: addis 2, 12, 2
	// CHECK-NEXT: 10010084: {{.*}} addi 2, 2, 32640			// CHECK-NEXT: 10010254: addi 2, 2, -32696
	// CHECK-NEXT: 10010088: {{.*}} addis 5, 2, 1			// CHECK-NEXT: 10010258: addis 5, 2, 1
	// CHECK: foo_external_same:			// CHECK-LABEL: foo_external_same:
	// CHECK-NEXT: 100100b0: {{.*}} add 3, 4, 3			// CHECK-NEXT: 10010280: add 3, 4, 3

test/ELF/ppc64-ifunc.s

	# REQUIRES: ppc			# REQUIRES: ppc

	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	# RUN: ld.lld %t.o -o %t			# RUN: ld.lld %t.o -o %t
	# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s			# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s
	# RUN: llvm-readelf -S %t \| FileCheck --check-prefix=SECTIONS %s			# RUN: llvm-readelf -S %t \| FileCheck --check-prefix=SECTIONS %s
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s
	# RUN: llvm-readelf -r %t \| FileCheck --check-prefix=DYNREL %s			# RUN: llvm-readelf -r %t \| FileCheck --check-prefix=DYNREL %s

	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	# RUN: ld.lld %t.o -o %t			# RUN: ld.lld %t.o -o %t
	# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s			# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s
	# RUN: llvm-readelf -S %t \| FileCheck --check-prefix=SECTIONS %s			# RUN: llvm-readelf -S %t \| FileCheck --check-prefix=SECTIONS %s
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s
	# RUN: llvm-readelf -r %t \| FileCheck --check-prefix=DYNREL %s			# RUN: llvm-readelf -r %t \| FileCheck --check-prefix=DYNREL %s

	# NM-DAG: 0000000010028000 d .TOC.			# NM-DAG: 0000000010028248 d .TOC.
	# NM-DAG: 0000000010010000 T ifunc			# NM-DAG: 00000000100101f8 T ifunc
	# NM-DAG: 0000000010010004 T ifunc2			# NM-DAG: 00000000100101fc T ifunc2

	# SECTIONS: .plt NOBITS 0000000010030000			# SECTIONS: .plt NOBITS 0000000010030250 000250 000010 00 WA 0 0 8

	# __plt_ifunc - . = 0x10010020 - 0x10010010 = 16			# __plt_ifunc - . = 0x10010218 - 0x10010208 = 16
	# __plt_ifunc2 - . = 0x10010044 - 0x10010018 = 28			# __plt_ifunc2 - . = 0x1001022c - 0x10010210 = 28
	# CHECK: _start:			# CHECK: _start:
	# CHECK-NEXT: addis 2, 12, 1			# CHECK-NEXT: addis 2, 12, 2
	# CHECK-NEXT: addi 2, 2, 32760			# CHECK-NEXT: addi 2, 2, -32696
	# CHECK-NEXT: 10010010: bl .+16			# CHECK-NEXT: 10010208: bl .+16
	# CHECK-NEXT: ld 2, 24(1)			# CHECK-NEXT: ld 2, 24(1)
	# CHECK-NEXT: 10010018: bl .+28			# CHECK-NEXT: 10010210: bl .+28
	# CHECK-NEXT: ld 2, 24(1)			# CHECK-NEXT: ld 2, 24(1)

	# .plt[0] - .TOC. = 0x10030000 - 0x10028000 = (1<<16) - 32768			# .plt[0] - .TOC. = 0x10030250 - 0x10028248 = (1<<16) - 32760
	# CHECK: __plt_ifunc:			# CHECK: __plt_ifunc:
	# CHECK-NEXT: std 2, 24(1)			# CHECK-NEXT: std 2, 24(1)
	# CHECK-NEXT: addis 12, 2, 1			# CHECK-NEXT: addis 12, 2, 1
	# CHECK-NEXT: ld 12, -32768(12)			# CHECK-NEXT: ld 12, -32760(12)
	# CHECK-NEXT: mtctr 12			# CHECK-NEXT: mtctr 12
	# CHECK-NEXT: bctr			# CHECK-NEXT: bctr

	# .plt[1] - .TOC. = 0x10030000+8 - 0x10028000 = (1<<16) - 32760			# .plt[1] - .TOC. = 0x10030250+8 - 0x10028248 = (1<<16) - 32752
	# CHECK: __plt_ifunc2:			# CHECK: __plt_ifunc2:
	# CHECK-NEXT: std 2, 24(1)			# CHECK-NEXT: std 2, 24(1)
	# CHECK-NEXT: addis 12, 2, 1			# CHECK-NEXT: addis 12, 2, 1
	# CHECK-NEXT: ld 12, -32760(12)			# CHECK-NEXT: ld 12, -32752(12)
	# CHECK-NEXT: mtctr 12			# CHECK-NEXT: mtctr 12
	# CHECK-NEXT: bctr			# CHECK-NEXT: bctr

	# Check that we emit 2 R_PPC64_IRELATIVE.			# Check that we emit 2 R_PPC64_IRELATIVE.
	# DYNREL: R_PPC64_IRELATIVE 10010000			# DYNREL: R_PPC64_IRELATIVE 100101f8
	# DYNREL: R_PPC64_IRELATIVE 10010004			# DYNREL: R_PPC64_IRELATIVE 100101fc

	.type ifunc STT_GNU_IFUNC			.type ifunc STT_GNU_IFUNC
	.globl ifunc			.globl ifunc
	ifunc:			ifunc:
	nop			nop

	.type ifunc2 STT_GNU_IFUNC			.type ifunc2 STT_GNU_IFUNC
	.globl ifunc2			.globl ifunc2
	Show All 16 Lines

test/ELF/ppc64-local-dynamic.s

	// REQUIRES: ppc			// REQUIRES: ppc

	// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	// RUN: ld.lld -shared %t.o -o %t.so			// RUN: ld.lld -shared %t.o -z separate-code -o %t.so
	// RUN: llvm-readelf -r %t.o \| FileCheck --check-prefix=InputRelocs %s			// RUN: llvm-readelf -r %t.o \| FileCheck --check-prefix=InputRelocs %s
	// RUN: llvm-readelf -r %t.so \| FileCheck --check-prefix=OutputRelocs %s			// RUN: llvm-readelf -r %t.so \| FileCheck --check-prefix=OutputRelocs %s
	// RUN: llvm-objdump --section-headers %t.so \| FileCheck --check-prefix=CheckGot %s			// RUN: llvm-objdump --section-headers %t.so \| FileCheck --check-prefix=CheckGot %s
	// RUN: llvm-objdump -d %t.so \| FileCheck --check-prefix=Dis %s			// RUN: llvm-objdump -d %t.so \| FileCheck --check-prefix=Dis %s

	// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	// RUN: ld.lld -shared %t.o -o %t.so			// RUN: ld.lld -shared %t.o -z separate-code -o %t.so
	// RUN: llvm-readelf -r %t.o \| FileCheck --check-prefix=InputRelocs %s			// RUN: llvm-readelf -r %t.o \| FileCheck --check-prefix=InputRelocs %s
	// RUN: llvm-readelf -r %t.so \| FileCheck --check-prefix=OutputRelocs %s			// RUN: llvm-readelf -r %t.so \| FileCheck --check-prefix=OutputRelocs %s
	// RUN: llvm-objdump --section-headers %t.so \| FileCheck --check-prefix=CheckGot %s			// RUN: llvm-objdump --section-headers %t.so \| FileCheck --check-prefix=CheckGot %s
	// RUN: llvm-objdump -d %t.so \| FileCheck --check-prefix=Dis %s			// RUN: llvm-objdump -d %t.so \| FileCheck --check-prefix=Dis %s

	.text			.text
	.abiversion 2			.abiversion 2
	.globl test			.globl test
	▲ Show 20 Lines • Show All 109 Lines • Show Last 20 Lines

test/ELF/ppc64-long-branch-localentry-offset.s

	# REQUIRES: ppc			# REQUIRES: ppc

	# RUN: llvm-mc -filetype=obj -triple=ppc64le %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=ppc64le %s -o %t.o
	# RUN: ld.lld %t.o -o %t			# RUN: ld.lld %t.o -z separate-code -o %t
	# RUN: llvm-nm %t \| FileCheck %s			# RUN: llvm-nm %t \| FileCheck %s

	# CHECK-DAG: 0000000010010000 t __long_branch_callee			# CHECK-DAG: 0000000010010000 t __long_branch_callee
	# CHECK-DAG: 0000000010010010 T _start			# CHECK-DAG: 0000000010010010 T _start
	# CHECK-DAG: 0000000012010008 T callee			# CHECK-DAG: 0000000012010008 T callee

	# The bl instruction jumps to the local entry. The distance requires a long branch stub:			# The bl instruction jumps to the local entry. The distance requires a long branch stub:
	# localentry(callee) - _start = 0x12010008+8 - 0x10010010 = 0x2000000			# localentry(callee) - _start = 0x12010008+8 - 0x10010010 = 0x2000000
	Show All 18 Lines

test/ELF/ppc64-long-branch.s

	# REQUIRES: ppc			# REQUIRES: ppc

	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	# RUN: ld.lld --no-toc-optimize %t.o -o %t			# RUN: ld.lld --no-toc-optimize -z separate-code %t.o -o %t
	# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s			# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s
	# RUN: llvm-readelf -x .branch_lt %t \| FileCheck %s -check-prefix=BRANCH-LE			# RUN: llvm-readelf -x .branch_lt %t \| FileCheck %s -check-prefix=BRANCH-LE
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s

	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	# RUN: ld.lld --no-toc-optimize %t.o -o %t			# RUN: ld.lld --no-toc-optimize -z separate-code %t.o -o %t
	# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s			# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s
	# RUN: llvm-readelf -x .branch_lt %t \| FileCheck %s -check-prefix=BRANCH-BE			# RUN: llvm-readelf -x .branch_lt %t \| FileCheck %s -check-prefix=BRANCH-BE
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s

	.text			.text
	.abiversion 2			.abiversion 2
	.protected callee			.protected callee
	.globl callee			.globl callee
	▲ Show 20 Lines • Show All 58 Lines • ▼ Show 20 Lines
	# __long_branch_callee - . = 0x12010050 - 0x12010034 = 20			# __long_branch_callee - . = 0x12010050 - 0x12010034 = 20
	# __long_branch_callee is not a PLT call stub. Calling it does not need TOC			# __long_branch_callee is not a PLT call stub. Calling it does not need TOC
	# restore, so it doesn't have to be followed by a nop.			# restore, so it doesn't have to be followed by a nop.
	# CHECK: _start:			# CHECK: _start:
	# CHECK: 12010034: bl .+20			# CHECK: 12010034: bl .+20
	# CHECK: 12010038: bl .+16			# CHECK: 12010038: bl .+16

	# BRANCH-LE: section '.branch_lt':			# BRANCH-LE: section '.branch_lt':
	# BRANCH-LE-NEXT: 0x12030008 08000110 00000000			# BRANCH-LE-NEXT: 0x12030018 08000110 00000000
	# BRANCH-BE: section '.branch_lt':			# BRANCH-BE: section '.branch_lt':
	# BRANCH-BE-NEXT: 0x12030008 00000000 10010008			# BRANCH-BE-NEXT: 0x12030018 00000000 10010008

	# .branch_lt - .TOC. = 0x12030008 - 0x12028000 = (1<<16) - 32760			# .branch_lt - .TOC. = 0x12030018 - 0x12028000 = (1<<16) - 32744
	# CHECK: __long_branch_callee:			# CHECK: __long_branch_callee:
	# CHECK-NEXT: 12010048: addis 12, 2, 1			# CHECK-NEXT: 12010048: addis 12, 2, 1
	# CHECK-NEXT: ld 12, -32760(12)			# CHECK-NEXT: ld 12, -32744(12)
	# CHECK-NEXT: mtctr 12			# CHECK-NEXT: mtctr 12
	# CHECK-NEXT: bctr			# CHECK-NEXT: bctr

test/ELF/ppc64-plt-stub.s

	Show All 9 Lines
	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o
	# RUN: ld.lld -shared %t2.o -soname=t2.so -o %t2.so			# RUN: ld.lld -shared %t2.o -soname=t2.so -o %t2.so
	# RUN: ld.lld %t.o %t2.so -o %t			# RUN: ld.lld %t.o %t2.so -o %t
	# RUN: llvm-readelf -S -d %t \| FileCheck --check-prefix=SEC %s			# RUN: llvm-readelf -S -d %t \| FileCheck --check-prefix=SEC %s
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s

	## DT_PLTGOT points to .plt			## DT_PLTGOT points to .plt
	# SEC: .plt NOBITS 0000000010030000 030000 000018			# SEC: .plt NOBITS 00000000100303e8 0003e8 000018
	# SEC: 0x0000000000000003 (PLTGOT) 0x10030000			# SEC: 0x0000000000000003 (PLTGOT) 0x100303e8

	## .plt[0] holds the address of _dl_runtime_resolve.			## .plt[0] holds the address of _dl_runtime_resolve.
	## .plt[1] holds the link map.			## .plt[1] holds the link map.
	## The JMP_SLOT relocation is stored at .plt[2]			## The JMP_SLOT relocation is stored at .plt[2]
	# RELOC: 0x10030010 R_PPC64_JMP_SLOT foo 0x0			# RELOC: 0x10030010 R_PPC64_JMP_SLOT foo 0x0

	# CHECK: _start:			# CHECK: _start:
	# CHECK: 10010008: bl .+16			# CHECK: 10010298: bl .+16

	# CHECK-LABEL: 0000000010010018 __plt_foo:			# CHECK-LABEL: 00000000100102a8 __plt_foo:
	# CHECK-NEXT: std 2, 24(1)			# CHECK-NEXT: std 2, 24(1)
	# CHECK-NEXT: addis 12, 2, 0			# CHECK-NEXT: addis 12, 2, 1
	# CHECK-NEXT: ld 12, 32560(12)			# CHECK-NEXT: ld 12, -32744(12)
	# CHECK-NEXT: mtctr 12			# CHECK-NEXT: mtctr 12
	# CHECK-NEXT: bctr			# CHECK-NEXT: bctr


	.text			.text
	.abiversion 2			.abiversion 2
	.globl _start			.globl _start
	.p2align 4			.p2align 4
	Show All 13 Lines

test/ELF/ppc64-rel-calls.s

	# REQUIRES: ppc			# REQUIRES: ppc

	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t
	# RUN: ld.lld %t -o %t2			# RUN: ld.lld %t -o %t2
	# RUN: llvm-objdump -d %t2 \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t2 \| FileCheck %s

	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t
	# RUN: ld.lld %t -o %t2			# RUN: ld.lld %t -o %t2
	# RUN: llvm-objdump -d %t2 \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn %t2 \| FileCheck %s

	# CHECK: Disassembly of section .text:			# CHECK: Disassembly of section .text:
	# CHECK-EMPTY:			# CHECK-EMPTY:

	.text			.text
	.global _start			.global _start
	_start:			_start:
	.Lfoo:			.Lfoo:
	li 0,1			li 0,1
	li 3,42			li 3,42
	sc			sc

	# CHECK: 10010000: {{.*}} li 0, 1			# CHECK: 10010158: li 0, 1
	# CHECK: 10010004: {{.*}} li 3, 42			# CHECK: 1001015c: li 3, 42
	# CHECK: 10010008: {{.*}} sc			# CHECK: 10010160: sc

	.global bar			.global bar
	bar:			bar:
	bl _start			bl _start
	nop			nop
	bl .Lfoo			bl .Lfoo
	nop			nop
	blr			blr

	# CHECK: 1001000c: {{.*}} bl .-12			# CHECK: 10010164: bl .-12
	# CHECK: 10010010: {{.*}} nop			# CHECK-NEXT: nop
	# CHECK: 10010014: {{.*}} bl .-20			# CHECK-NEXT: 1001016c: bl .-20
	# CHECK: 10010018: {{.*}} nop			# CHECK-NEXT: nop
	# CHECK: 1001001c: {{.*}} blr			# CHECK-NEXT: blr

test/ELF/ppc64-relocs.s

	Show All 27 Lines
	.quad 22, 37, 89, 47			.quad 22, 37, 89, 47
	.LC0:			.LC0:
	.tc .LJTI0_0[TC],.LJTI0_0			.tc .LJTI0_0[TC],.LJTI0_0

	.section .R_PPC64_TOC16_LO_DS,"ax",@progbits			.section .R_PPC64_TOC16_LO_DS,"ax",@progbits
	ld 1, .L1@toc@l(2)			ld 1, .L1@toc@l(2)

	# CHECK-LABEL: Disassembly of section .R_PPC64_TOC16_LO_DS:			# CHECK-LABEL: Disassembly of section .R_PPC64_TOC16_LO_DS:
	# CHECK: 1001000c: ld 1, -32768(2)			# CHECK: 10010220: ld 1, -32768(2)

	.section .R_PPC64_TOC16_LO,"ax",@progbits			.section .R_PPC64_TOC16_LO,"ax",@progbits
	addi 1, 2, .L1@toc@l			addi 1, 2, .L1@toc@l

	# CHECK-LABEL: Disassembly of section .R_PPC64_TOC16_LO:			# CHECK-LABEL: Disassembly of section .R_PPC64_TOC16_LO:
	# CHECK: 10010010: addi 1, 2, -32768			# CHECK: 10010224: addi 1, 2, -32768

	.section .R_PPC64_TOC16_HI,"ax",@progbits			.section .R_PPC64_TOC16_HI,"ax",@progbits
	addis 1, 2, .L1@toc@h			addis 1, 2, .L1@toc@h

	# CHECK-LABEL: Disassembly of section .R_PPC64_TOC16_HI:			# CHECK-LABEL: Disassembly of section .R_PPC64_TOC16_HI:
	# CHECK: 10010014: addis 1, 2, -1			# CHECK: 10010228: addis 1, 2, -1

	.section .R_PPC64_TOC16_HA,"ax",@progbits			.section .R_PPC64_TOC16_HA,"ax",@progbits
	addis 1, 2, .L1@toc@ha			addis 1, 2, .L1@toc@ha

	# CHECK-LABEL: Disassembly of section .R_PPC64_TOC16_HA:			# CHECK-LABEL: Disassembly of section .R_PPC64_TOC16_HA:
	# CHECK: 10010018: addis 1, 2, 0			# CHECK: 1001022c: addis 1, 2, 0

	.section .R_PPC64_REL24,"ax",@progbits			.section .R_PPC64_REL24,"ax",@progbits
	b 1f			b 1f
	1:			1:

	# CHECK-LABEL: Disassembly of section .R_PPC64_REL24:			# CHECK-LABEL: Disassembly of section .R_PPC64_REL24:
	# CHECK: 1001001c: b .+4			# CHECK: 10010230: b .+4

	.section .R_PPC64_REL14,"ax",@progbits			.section .R_PPC64_REL14,"ax",@progbits
	beq 1f			beq 1f
	1:			1:

	# CHECK-LABEL: Disassembly of section .R_PPC64_REL14:			# CHECK-LABEL: Disassembly of section .R_PPC64_REL14:
	# CHECK: 10010020: bt 2, .+4			# CHECK: 10010234: bt 2, .+4

	.section .R_PPC64_ADDR16_LO,"ax",@progbits			.section .R_PPC64_ADDR16_LO,"ax",@progbits
	li 1, .Lfoo@l			li 1, .Lfoo@l

	# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_LO:			# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_LO:
	# CHECK: 10010024: li 1, 0			# CHECK: 10010238: li 1, 532

	.section .R_PPC64_ADDR16_HI,"ax",@progbits			.section .R_PPC64_ADDR16_HI,"ax",@progbits
	li 1, .Lfoo@h			li 1, .Lfoo@h

	# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HI:			# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HI:
	# CHECK: 10010028: li 1, 4097			# CHECK: 1001023c: li 1, 4097

	.section .R_PPC64_ADDR16_HA,"ax",@progbits			.section .R_PPC64_ADDR16_HA,"ax",@progbits
	li 1, .Lfoo@ha			li 1, .Lfoo@ha

	# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HA:			# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HA:
	# CHECK: 1001002c: li 1, 4097			# CHECK: 10010240: li 1, 4097

	.section .R_PPC64_ADDR16_HIGHER,"ax",@progbits			.section .R_PPC64_ADDR16_HIGHER,"ax",@progbits
	li 1, .Lfoo@higher			li 1, .Lfoo@higher

	# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HIGHER:			# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HIGHER:
	# CHECK: 10010030: li 1, 0			# CHECK: 10010244: li 1, 0

	.section .R_PPC64_ADDR16_HIGHERA,"ax",@progbits			.section .R_PPC64_ADDR16_HIGHERA,"ax",@progbits
	li 1, .Lfoo@highera			li 1, .Lfoo@highera

	# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HIGHERA:			# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HIGHERA:
	# CHECK: 10010034: li 1, 0			# CHECK: 10010248: li 1, 0

	.section .R_PPC64_ADDR16_HIGHEST,"ax",@progbits			.section .R_PPC64_ADDR16_HIGHEST,"ax",@progbits
	li 1, .Lfoo@highest			li 1, .Lfoo@highest

	# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HIGHEST:			# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HIGHEST:
	# CHECK: 10010038: li 1, 0			# CHECK: 1001024c: li 1, 0

	.section .R_PPC64_ADDR16_HIGHESTA,"ax",@progbits			.section .R_PPC64_ADDR16_HIGHESTA,"ax",@progbits
	li 1, .Lfoo@highesta			li 1, .Lfoo@highesta

	# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HIGHESTA:			# CHECK-LABEL: Disassembly of section .R_PPC64_ADDR16_HIGHESTA:
	# CHECK: 1001003c: li 1, 0			# CHECK: 10010250: li 1, 0

	.section .R_PPC64_REL32, "ax",@progbits			.section .R_PPC64_REL32, "ax",@progbits
	addis 5, 2, .LC0@toc@ha			addis 5, 2, .LC0@toc@ha
	ld 5, .LC0@toc@l(5)			ld 5, .LC0@toc@l(5)
	.LBB0_2:			.LBB0_2:
	add 3, 3, 4			add 3, 3, 4

	# DATALE: '.rodata':			# DATALE: '.rodata':
	# DATALE: 0x100001c8 80fe0000			# DATALE: 0x100001c8 94000100

	# DATABE: '.rodata':			# DATABE: '.rodata':
	# DATABE: 0x100001c8 0000fe80			# DATABE: 0x100001c8 00010094

	# Address of rodata + value stored at rodata entry			# Address of rodata + value stored at rodata entry
	# should equal address of LBB0_2.			# should equal address of LBB0_2.
	# 0x10000190 + 0xfeb4 = 0x10010044			# 0x100001c8 + 0x10094 = 0x10010258
	# CHECK-LABEL: Disassembly of section .R_PPC64_REL32:			# CHECK-LABEL: Disassembly of section .R_PPC64_REL32:
	# CHECK: 10010040: addis 5, 2, 0			# CHECK: 10010254: addis 5, 2, 0
	# CHECK: 10010044: ld 5, -32736(5)			# CHECK: 10010258: ld 5, -32736(5)
	# CHECK: 10010048: add 3, 3, 4			# CHECK: 1001025c: add 3, 3, 4

	.section .R_PPC64_REL64, "ax",@progbits			.section .R_PPC64_REL64, "ax",@progbits
	.cfi_startproc			.cfi_startproc
	.cfi_personality 148, __foo			.cfi_personality 148, __foo
	li 0, 1			li 0, 1
	li 3, 55			li 3, 55
	sc			sc
	.cfi_endproc			.cfi_endproc
	__foo:			__foo:
	li 3,0			li 3,0

	.section .R_PPC64_TOC,"a",@progbits			.section .R_PPC64_TOC,"a",@progbits
	.quad .TOC.@tocbase			.quad .TOC.@tocbase

	# SEC: .got PROGBITS 0000000010020000			# SEC: .got PROGBITS 0000000010020270

	## tocbase = .got+0x8000 = 0x10028000			## tocbase = .got+0x8000 = 0x10028270
	# DATALE-LABEL: section '.R_PPC64_TOC':			# DATALE-LABEL: section '.R_PPC64_TOC':
	# DATALE: 00800210 00000000			# DATALE: 70820210 00000000

	# DATABE-LABEL: section '.R_PPC64_TOC':			# DATABE-LABEL: section '.R_PPC64_TOC':
	# DATABE: 00000000 10028000			# DATABE: 00000000 10028270

	# Check that the personality (relocated by R_PPC64_REL64) in the .eh_frame			# Check that the personality (relocated by R_PPC64_REL64) in the .eh_frame
	# equals the address of __foo.			# equals the address of __foo.
	# 0x100001ea + 0xfe6e = 0x10010058			# 0x100001ea + 0x010082 = 0x1001026c
	# DATALE: section '.eh_frame':			# DATALE: section '.eh_frame':
	# DATALE: 0x100001e8 {{....}}6efe			# DATALE: 0x100001e8 {{....}}8200 01{{......}}

	# DATABE: section '.eh_frame':			# DATABE: section '.eh_frame':
	# DATABE: 0x100001e8 {{[0-9a-f]+ [0-9a-f]+}} fe6e{{....}}			# DATABE: 0x100001e8 {{[0-9a-f]+ [0-9a-f]+}} 00821{{...}}

	# CHECK: __foo			# CHECK: __foo
	# CHECK-NEXT: 10010058: li 3, 0			# CHECK-NEXT: 1001026c: li 3, 0

test/ELF/ppc64-shared-long_branch.s

	# REQUIRES: ppc			# REQUIRES: ppc

	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	# RUN: ld.lld --no-toc-optimize -shared %t.o -o %t			# RUN: ld.lld --no-toc-optimize -shared -z separate-code %t.o -o %t
	# RUN: llvm-objdump -d -start-address=0x10000 -stop-address=0x10018 %t \| FileCheck %s -check-prefix=CALLEE_DUMP			# RUN: llvm-objdump -d -start-address=0x10000 -stop-address=0x10018 %t \| FileCheck %s -check-prefix=CALLEE_DUMP
	# RUN: llvm-objdump -d -start-address=0x2010020 -stop-address=0x2010070 %t \| FileCheck %s -check-prefix=CALLER_DUMP			# RUN: llvm-objdump -d -start-address=0x2010020 -stop-address=0x2010070 %t \| FileCheck %s -check-prefix=CALLER_DUMP
	# RUN: llvm-readelf --sections %t \| FileCheck %s -check-prefix=SECTIONS			# RUN: llvm-readelf --sections %t \| FileCheck %s -check-prefix=SECTIONS
	# RUN: llvm-readelf --relocations %t \| FileCheck %s -check-prefix=DYNRELOC			# RUN: llvm-readelf --relocations %t \| FileCheck %s -check-prefix=DYNRELOC


	# _start calls protected function callee. Since callee is protected no plt stub			# _start calls protected function callee. Since callee is protected no plt stub
	# is needed. The binary however has been padded out with space so that the call			# is needed. The binary however has been padded out with space so that the call
	▲ Show 20 Lines • Show All 74 Lines • ▼ Show 20 Lines
	# CALLEE_DUMP: 10004: {{.*}} addi 2, 2, -32528			# CALLEE_DUMP: 10004: {{.*}} addi 2, 2, -32528
	# CALLEE_DUMP: 10008: {{.*}} addis 4, 2, 0			# CALLEE_DUMP: 10008: {{.*}} addis 4, 2, 0

	# Verify the address of _start, and the call to the long-branch thunk.			# Verify the address of _start, and the call to the long-branch thunk.
	# CALLER_DUMP: _start:			# CALLER_DUMP: _start:
	# CALLER_DUMP: 2010020: {{.*}} addis 2, 12, 2			# CALLER_DUMP: 2010020: {{.*}} addis 2, 12, 2
	# CALLER_DUMP: 2010038: {{.*}} bl .+56			# CALLER_DUMP: 2010038: {{.*}} bl .+56

	# Verify the thunks contents: TOC-pointer + offset = .branch_lt[0]			## .branch_lt[0] - .TOC. =
	# 0x20280F0 + 32560 = 0x2030020
	# CALLER_DUMP: __long_branch_callee:			# CALLER_DUMP: __long_branch_callee:
	# CALLER_DUMP: 2010060: {{.*}} addis 12, 2, 0			# CALLER_DUMP: 2010060: {{.*}} addis 12, 2, 1
	# CALLER_DUMP: 2010064: {{.*}} ld 12, 32560(12)			# CALLER_DUMP: 2010064: {{.*}} ld 12, -32712(12)
	# CALLER_DUMP: 2010068: {{.*}} mtctr 12			# CALLER_DUMP: 2010068: {{.*}} mtctr 12
	# CALLER_DUMP: 201006c: {{.*}} bctr			# CALLER_DUMP: 201006c: {{.*}} bctr

	# .got section is at address 0x20300f0 so TOC pointer points to 0x20400F0.			# .got section is at address 0x20300f0 so TOC pointer points to 0x20400F0.
	# .plt section has a 2 entry header and a single entry for the long branch.			# .plt section has a 2 entry header and a single entry for the long branch.
	# [Nr] Name Type Address Off Size			# [Nr] Name Type Address Off Size
	# SECTIONS: [10] .got PROGBITS 00000000020200f0 20200f0 000008			# SECTIONS: [10] .got PROGBITS 00000000020200f0 20200f0 000008
	# SECTIONS: [13] .plt NOBITS 0000000002030008 2030008 000018			# SECTIONS: [13] .plt NOBITS 0000000002030110 2020110 000018
	# SECTIONS: [14] .branch_lt NOBITS 0000000002030020 2030008 000008			# SECTIONS: [14] .branch_lt NOBITS 0000000002030128 2020110 000008

	# There is a relative dynamic relocation for (.plt + 16 bytes), with a base			# There is a relative dynamic relocation for (.plt + 16 bytes), with a base
	# address equal to callees local entry point (0x10000 + 8).			# address equal to callees local entry point (0x10000 + 8).
	# DYNRELOC: Relocation section '.rela.dyn' at offset 0x{{[0-9a-f]+}} contains 3 entries:			# DYNRELOC: Relocation section '.rela.dyn' at offset 0x{{[0-9a-f]+}} contains 3 entries:
	# DYNRELOC: Offset Info Type Symbol's Value			# DYNRELOC: Offset Info Type Symbol's Value
	# DYNRELOC: 0000000002030020 0000000000000016 R_PPC64_RELATIVE 10008			# DYNRELOC: 0000000002030128 0000000000000016 R_PPC64_RELATIVE 10008

test/ELF/ppc64-tls-gd.s

	Show All 10 Lines
	# RUN: llvm-readelf -r %t \| FileCheck --check-prefix=NOREL %s			# RUN: llvm-readelf -r %t \| FileCheck --check-prefix=NOREL %s
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=LE %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=LE %s

	# RUN: ld.lld %t.o %t1.so -o %t			# RUN: ld.lld %t.o %t1.so -o %t
	# RUN: llvm-readobj -r %t \| FileCheck --check-prefix=IE-REL %s			# RUN: llvm-readobj -r %t \| FileCheck --check-prefix=IE-REL %s
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=IE %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=IE %s

	# GD-REL: .rela.dyn {			# GD-REL: .rela.dyn {
	# GD-REL-NEXT: 0x200F0 R_PPC64_DTPMOD64 a 0x0			# GD-REL-NEXT: 0x20540 R_PPC64_DTPMOD64 a 0x0
	# GD-REL-NEXT: 0x200F8 R_PPC64_DTPREL64 a 0x0			# GD-REL-NEXT: 0x20548 R_PPC64_DTPREL64 a 0x0
	# GD-REL-NEXT: 0x20100 R_PPC64_DTPMOD64 b 0x0			# GD-REL-NEXT: 0x20550 R_PPC64_DTPMOD64 b 0x0
	# GD-REL-NEXT: 0x20108 R_PPC64_DTPREL64 b 0x0			# GD-REL-NEXT: 0x20558 R_PPC64_DTPREL64 b 0x0
	# GD-REL-NEXT: 0x20110 R_PPC64_DTPMOD64 c 0x0			# GD-REL-NEXT: 0x20560 R_PPC64_DTPMOD64 c 0x0
	# GD-REL-NEXT: 0x20118 R_PPC64_DTPREL64 c 0x0			# GD-REL-NEXT: 0x20568 R_PPC64_DTPREL64 c 0x0
	# GD-REL-NEXT: }			# GD-REL-NEXT: }

	## &DTPMOD(a) - .TOC. = &.got[0] - (.got+0x8000) = -32768			## &DTPMOD(a) - .TOC. = &.got[0] - (.got+0x8000) = -32768
	# GD: addis 3, 2, 0			# GD: addis 3, 2, 0
	# GD-NEXT: addi 3, 3, -32768			# GD-NEXT: addi 3, 3, -32768
	# GD-NEXT: bl .+40			# GD-NEXT: bl .+40
	# GD-NEXT: ld 2, 24(1)			# GD-NEXT: ld 2, 24(1)

	Show All 21 Lines
	# LE-NEXT: nop			# LE-NEXT: nop
	# LE-NEXT: addi 3, 3, -28660			# LE-NEXT: addi 3, 3, -28660
	## c@tprel = st_value(c)-0x7000 = -28656			## c@tprel = st_value(c)-0x7000 = -28656
	# LE-NEXT: addis 3, 13, 0			# LE-NEXT: addis 3, 13, 0
	# LE-NEXT: nop			# LE-NEXT: nop
	# LE-NEXT: addi 3, 3, -28656			# LE-NEXT: addi 3, 3, -28656

	# IE-REL: .rela.dyn {			# IE-REL: .rela.dyn {
	# IE-REL-NEXT: 0x100200C0 R_PPC64_TPREL64 b 0x0			# IE-REL-NEXT: 0x10020418 R_PPC64_TPREL64 b 0x0
	# IE-REL-NEXT: 0x100200C8 R_PPC64_TPREL64 c 0x0			# IE-REL-NEXT: 0x10020420 R_PPC64_TPREL64 c 0x0
	# IE-REL-NEXT: }			# IE-REL-NEXT: }

	## a is relaxed to use LE.			## a is relaxed to use LE.
	## a@tprel = st_value(a)-0x7000 = -28664			## a@tprel = st_value(a)-0x7000 = -28664
	# IE: nop			# IE: nop
	# IE-NEXT: addis 3, 13, 0			# IE-NEXT: addis 3, 13, 0
	# IE-NEXT: nop			# IE-NEXT: nop
	# IE-NEXT: addi 3, 3, -28664			# IE-NEXT: addi 3, 3, -28664
	Show All 31 Lines

test/ELF/ppc64-tls-ie.s

	Show All 17 Lines
	# RUN: llvm-readobj -r %t.so \| FileCheck --check-prefix=IE-REL %s			# RUN: llvm-readobj -r %t.so \| FileCheck --check-prefix=IE-REL %s
	# RUN: llvm-objdump -d --no-show-raw-insn %t.so \| FileCheck --check-prefix=IE %s			# RUN: llvm-objdump -d --no-show-raw-insn %t.so \| FileCheck --check-prefix=IE %s
	## IE -> LE			## IE -> LE
	# RUN: ld.lld %t.o -o %t			# RUN: ld.lld %t.o -o %t
	# RUN: llvm-readelf -r %t \| FileCheck --check-prefix=NOREL %s			# RUN: llvm-readelf -r %t \| FileCheck --check-prefix=NOREL %s
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=LE %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=LE %s

	# IE-REL: .rela.dyn {			# IE-REL: .rela.dyn {
	# IE-REL-NEXT: 0x200B0 R_PPC64_TPREL64 c 0x0			# IE-REL-NEXT: 0x204A0 R_PPC64_TPREL64 c 0x0
	# IE-REL-NEXT: 0x200C0 R_PPC64_TPREL64 i 0x0			# IE-REL-NEXT: 0x204B0 R_PPC64_TPREL64 i 0x0
	# IE-REL-NEXT: 0x200C8 R_PPC64_TPREL64 l 0x0			# IE-REL-NEXT: 0x204B8 R_PPC64_TPREL64 l 0x0
	# IE-REL-NEXT: 0x200B8 R_PPC64_TPREL64 s 0x0			# IE-REL-NEXT: 0x204A8 R_PPC64_TPREL64 s 0x0
	# IE-REL-NEXT: }			# IE-REL-NEXT: }

	# INPUT-REL: R_PPC64_GOT_TPREL16_HA c 0x0			# INPUT-REL: R_PPC64_GOT_TPREL16_HA c 0x0
	# INPUT-REL: R_PPC64_GOT_TPREL16_LO_DS c 0x0			# INPUT-REL: R_PPC64_GOT_TPREL16_LO_DS c 0x0
	# INPUT-REL: R_PPC64_TLS c 0x0			# INPUT-REL: R_PPC64_TLS c 0x0
	## &.got[0] - .TOC. = -32768			## &.got[0] - .TOC. = -32768
	# IE-LABEL: test1:			# IE-LABEL: test1:
	# IE-NEXT: addis 3, 2, 0			# IE-NEXT: addis 3, 2, 0
	▲ Show 20 Lines • Show All 134 Lines • Show Last 20 Lines

test/ELF/ppc64-tls-vaddr-align.s

This file was added.

				# REQUIRES: ppc

				# RUN: llvm-mc -filetype=obj -triple=powerpc64le %s -o %t.o
				# RUN: ld.lld %t.o -o %t
				# RUN: llvm-readelf -S -l %t \| FileCheck --check-prefix=SEC %s
				# RUN: llvm-objdump -d %t \| FileCheck --check-prefix=DIS %s

				# SEC: Name Type Address Off Size ES Flg Lk Inf Al
				# SEC: .tdata PROGBITS 0000000010020204 000204 000001 00 WAT 0 0 1
				# SEC: .tbss NOBITS 0000000010020300 000205 000008 00 WAT 0 0 256

				# SEC: Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align
				# SEC: TLS 0x000204 0x0000000010020204 0x0000000010020204 0x000001 0x000104 R 0x100

				## TP offset computation is a bit tricky if p_vaddr%p_align != 0.
				## p_vaddr rounded down to p_align has TP offset -0x7000.
				## The first address of PT_TLS (p_vaddr) has TP offset (p_vaddr%p_align - 0x7000).

				## a@tprel = st_value(a) + p_vaddr%p_align - 0x7000 = 0x10020300-0x10020204 + 4 - 0x7000 = -28420
				# DIS: ld 3, -28416(13)

				ld 3, a@tprel(13)

				.section .tdata,"awT"
				.byte 0

				.section .tbss,"awT"
				.p2align 8
				a:
				.quad 0

test/ELF/ppc64-toc-addis-nop-lqsq.s

	# REQUIRES: ppc			# REQUIRES: ppc
				# XFAIL: *

	# RUN: llvm-readelf -relocations --wide %p/Inputs/ppc64le-quadword-ldst.o \| FileCheck --check-prefix=QuadInputRelocs %s			# RUN: llvm-readelf -relocations --wide %p/Inputs/ppc64le-quadword-ldst.o \| FileCheck --check-prefix=QuadInputRelocs %s

	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o
	# RUN: ld.lld -shared %t2.o -o %t2.so			# RUN: ld.lld -shared %t2.o -o %t2.so

	# RUN: ld.lld %t2.so %p/Inputs/ppc64le-quadword-ldst.o -o %t			# RUN: ld.lld %t2.so %p/Inputs/ppc64le-quadword-ldst.o -o %t
	# RUN: llvm-objdump -d %t \| FileCheck --check-prefix=Dis %s			# RUN: llvm-objdump -d %t \| FileCheck --check-prefix=Dis %s
	▲ Show 20 Lines • Show All 63 Lines • Show Last 20 Lines

test/ELF/ppc64-toc-addis-nop.s

# REQUIRES: ppc		# REQUIRES: ppc

# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o		# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
# RUN: llvm-readelf -r %t.o \| FileCheck --check-prefix=InputRelocs %s		# RUN: llvm-readelf -r %t.o \| FileCheck --check-prefix=InputRelocs %s

# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o		# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o
# RUN: ld.lld -shared %t2.o -o %t2.so		# RUN: ld.lld -shared -soname=t2.so %t2.o -o %t2.so

		## Place all sections in the same segment so that .text and .TOC. are on the same page.
		# RUN: echo 'PHDRS { all PT_LOAD; }' > %t.script
#		#
# RUN: ld.lld %t2.so %t.o -o %t		# RUN: ld.lld %t2.so %t.o -T %t.script -o %t
# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=Dis %s		# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=Dis %s
#		#
# RUN: ld.lld --no-toc-optimize %t2.so %t.o -o %t		# RUN: ld.lld %t2.so %t.o -T %t.script --no-toc-optimize -o %t
# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=NoOpt %s		# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefix=NoOpt %s

# InputRelocs: Relocation section '.rela.text'		# InputRelocs: Relocation section '.rela.text'
# InputRelocs: R_PPC64_TOC16_HA		# InputRelocs: R_PPC64_TOC16_HA
# InputRelocs: R_PPC64_TOC16_LO		# InputRelocs: R_PPC64_TOC16_LO
# InputRelocs: R_PPC64_TOC16_LO_DS		# InputRelocs: R_PPC64_TOC16_LO_DS


Show All 13 Lines	.Lbytes_lep:
lbz 3, byteLd@toc@l(3)		lbz 3, byteLd@toc@l(3)
addis 4, 2, byteSt@toc@ha		addis 4, 2, byteSt@toc@ha
stb 3, byteSt@toc@l(4)		stb 3, byteSt@toc@l(4)
blr		blr
# Dis-LABEL: bytes:		# Dis-LABEL: bytes:
# Dis-NEXT: addis		# Dis-NEXT: addis
# Dis-NEXT: addi		# Dis-NEXT: addi
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: lbz 3, 32624(2)		# Dis-NEXT: lbz 3, -32752(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: stb 3, 32625(2)		# Dis-NEXT: stb 3, -32751(2)
# Dis-NEXT: blr		# Dis-NEXT: blr

# NoOpt-LABEL: bytes:		# NoOpt-LABEL: bytes:
# NoOpt-NEXT: addis		# NoOpt-NEXT: addis
# NoOpt-NEXT: addi		# NoOpt-NEXT: addi
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: lbz 3, 32624(3)		# NoOpt-NEXT: lbz 3, -32752(3)
# NoOpt-NEXT: addis 4, 2, 0		# NoOpt-NEXT: addis 4, 2, 0
# NoOpt-NEXT: stb 3, 32625(4)		# NoOpt-NEXT: stb 3, -32751(4)
# NoOpt-NEXT: blr		# NoOpt-NEXT: blr

.global halfs		.global halfs
.p2align 4		.p2align 4
.type halfs,@function		.type halfs,@function
halfs:		halfs:
.Lhalfs_gep:		.Lhalfs_gep:
addis 2, 12, .TOC.-.Lhalfs_gep@ha		addis 2, 12, .TOC.-.Lhalfs_gep@ha
addi 2, 2, .TOC.-.Lhalfs_gep@l		addi 2, 2, .TOC.-.Lhalfs_gep@l
.Lhalfs_lep:		.Lhalfs_lep:
.localentry halfs, .Lhalfs_lep-.Lhalfs_gep		.localentry halfs, .Lhalfs_lep-.Lhalfs_gep
addis 3, 2, halfLd@toc@ha		addis 3, 2, halfLd@toc@ha
lhz 3, halfLd@toc@l(3)		lhz 3, halfLd@toc@l(3)
addis 4, 2, halfLd@toc@ha		addis 4, 2, halfLd@toc@ha
lha 4, halfLd@toc@l(4)		lha 4, halfLd@toc@l(4)
addis 5, 2, halfSt@toc@ha		addis 5, 2, halfSt@toc@ha
sth 4, halfSt@toc@l(5)		sth 4, halfSt@toc@l(5)
blr		blr
# Dis-LABEL: halfs:		# Dis-LABEL: halfs:
# Dis-NEXT: addis		# Dis-NEXT: addis
# Dis-NEXT: addi		# Dis-NEXT: addi
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: lhz 3, 32626(2)		# Dis-NEXT: lhz 3, -32750(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: lha 4, 32626(2)		# Dis-NEXT: lha 4, -32750(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: sth 4, 32628(2)		# Dis-NEXT: sth 4, -32748(2)
# Dis-NEXT: blr		# Dis-NEXT: blr

# NoOpt-LABEL: halfs:		# NoOpt-LABEL: halfs:
# NoOpt-NEXT: addis		# NoOpt-NEXT: addis
# NoOpt-NEXT: addi		# NoOpt-NEXT: addi
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: lhz 3, 32626(3)		# NoOpt-NEXT: lhz 3, -32750(3)
# NoOpt-NEXT: addis 4, 2, 0		# NoOpt-NEXT: addis 4, 2, 0
# NoOpt-NEXT: lha 4, 32626(4)		# NoOpt-NEXT: lha 4, -32750(4)
# NoOpt-NEXT: addis 5, 2, 0		# NoOpt-NEXT: addis 5, 2, 0
# NoOpt-NEXT: sth 4, 32628(5)		# NoOpt-NEXT: sth 4, -32748(5)
# NoOpt-NEXT: blr		# NoOpt-NEXT: blr


.global words		.global words
.p2align 4		.p2align 4
.type words,@function		.type words,@function
words:		words:
.Lwords_gep:		.Lwords_gep:
addis 2, 12, .TOC.-.Lwords_gep@ha		addis 2, 12, .TOC.-.Lwords_gep@ha
addi 2, 2, .TOC.-.Lwords_gep@l		addi 2, 2, .TOC.-.Lwords_gep@l
.Lwords_lep:		.Lwords_lep:
.localentry words, .Lwords_lep-.Lwords_gep		.localentry words, .Lwords_lep-.Lwords_gep
addis 3, 2, wordLd@toc@ha		addis 3, 2, wordLd@toc@ha
lwz 3, wordLd@toc@l(3)		lwz 3, wordLd@toc@l(3)
addis 4, 2, wordLd@toc@ha		addis 4, 2, wordLd@toc@ha
lwa 4, wordLd@toc@l(4)		lwa 4, wordLd@toc@l(4)
addis 5, 2, wordSt@toc@ha		addis 5, 2, wordSt@toc@ha
stw 4, wordSt@toc@l(5)		stw 4, wordSt@toc@l(5)
blr		blr
# Dis-LABEL: words		# Dis-LABEL: words
# Dis-NEXT: addis		# Dis-NEXT: addis
# Dis-NEXT: addi		# Dis-NEXT: addi
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: lwz 3, 32632(2)		# Dis-NEXT: lwz 3, -32744(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: lwa 4, 32632(2)		# Dis-NEXT: lwa 4, -32744(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: stw 4, 32636(2)		# Dis-NEXT: stw 4, -32740(2)
# Dis-NEXT: blr		# Dis-NEXT: blr

# NoOpt-LABEL: words		# NoOpt-LABEL: words
# NoOpt-NEXT: addis		# NoOpt-NEXT: addis
# NoOpt-NEXT: addi		# NoOpt-NEXT: addi
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: lwz 3, 32632(3)		# NoOpt-NEXT: lwz 3, -32744(3)
# NoOpt-NEXT: addis 4, 2, 0		# NoOpt-NEXT: addis 4, 2, 0
# NoOpt-NEXT: lwa 4, 32632(4)		# NoOpt-NEXT: lwa 4, -32744(4)
# NoOpt-NEXT: addis 5, 2, 0		# NoOpt-NEXT: addis 5, 2, 0
# NoOpt-NEXT: stw 4, 32636(5)		# NoOpt-NEXT: stw 4, -32740(5)
# NoOpt-NEXT: blr		# NoOpt-NEXT: blr

.global doublewords		.global doublewords
.p2align 4		.p2align 4
.type doublewords,@function		.type doublewords,@function
doublewords:		doublewords:
.Ldoublewords_gep:		.Ldoublewords_gep:
addis 2, 12, .TOC.-.Ldoublewords_gep@ha		addis 2, 12, .TOC.-.Ldoublewords_gep@ha
addi 2, 2, .TOC.-.Ldoublewords_gep@l		addi 2, 2, .TOC.-.Ldoublewords_gep@l
.Ldoublewords_lep:		.Ldoublewords_lep:
.localentry doublewords, .Ldoublewords_lep-.Ldoublewords_gep		.localentry doublewords, .Ldoublewords_lep-.Ldoublewords_gep
addis 3, 2, dwordLd@toc@ha		addis 3, 2, dwordLd@toc@ha
ld 3, dwordLd@toc@l(3)		ld 3, dwordLd@toc@l(3)
addis 4, 2, dwordSt@toc@ha		addis 4, 2, dwordSt@toc@ha
std 3, dwordSt@toc@l(4)		std 3, dwordSt@toc@l(4)
blr		blr

# Dis-LABEL: doublewords		# Dis-LABEL: doublewords
# Dis-NEXT: addis		# Dis-NEXT: addis
# Dis-NEXT: addi		# Dis-NEXT: addi
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: ld 3, 32640(2)		# Dis-NEXT: ld 3, -32736(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: std 3, 32648(2)		# Dis-NEXT: std 3, -32728(2)
# Dis-NEXT: blr		# Dis-NEXT: blr

# NoOpt-LABEL: doublewords		# NoOpt-LABEL: doublewords
# NoOpt-NEXT: addis		# NoOpt-NEXT: addis
# NoOpt-NEXT: addi		# NoOpt-NEXT: addi
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: ld 3, 32640(3)		# NoOpt-NEXT: ld 3, -32736(3)
# NoOpt-NEXT: addis 4, 2, 0		# NoOpt-NEXT: addis 4, 2, 0
# NoOpt-NEXT: std 3, 32648(4)		# NoOpt-NEXT: std 3, -32728(4)
# NoOpt-NEXT: blr		# NoOpt-NEXT: blr

.global vec_dq		.global vec_dq
.p2align 4		.p2align 4
.type vec_dq,@function		.type vec_dq,@function
vec_dq:		vec_dq:
.Lvec_dq_gep:		.Lvec_dq_gep:
addis 2, 12, .TOC.-.Lvec_dq_gep@ha		addis 2, 12, .TOC.-.Lvec_dq_gep@ha
addi 2, 2, .TOC.-.Lvec_dq_gep@l		addi 2, 2, .TOC.-.Lvec_dq_gep@l
.Lvec_dq_lep:		.Lvec_dq_lep:
.localentry vec_dq, .Lvec_dq_lep-.Lvec_dq_gep		.localentry vec_dq, .Lvec_dq_lep-.Lvec_dq_gep
addis 3, 2, vecLd@toc@ha		addis 3, 2, vecLd@toc@ha
lxv 3, vecLd@toc@l(3)		lxv 3, vecLd@toc@l(3)
addis 3, 2, vecSt@toc@ha		addis 3, 2, vecSt@toc@ha
stxv 3, vecSt@toc@l(3)		stxv 3, vecSt@toc@l(3)
blr		blr

# Dis-LABEL: vec_dq:		# Dis-LABEL: vec_dq:
# Dis-NEXT: addis		# Dis-NEXT: addis
# Dis-NEXT: addi		# Dis-NEXT: addi
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: lxv 3, 32656(2)		# Dis-NEXT: lxv 3, -32720(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: stxv 3, 32672(2)		# Dis-NEXT: stxv 3, -32704(2)
# Dis-NEXT: blr		# Dis-NEXT: blr

# NoOpt-LABEL: vec_dq:		# NoOpt-LABEL: vec_dq:
# NoOpt-NEXT: addis		# NoOpt-NEXT: addis
# NoOpt-NEXT: addi		# NoOpt-NEXT: addi
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: lxv 3, 32656(3)		# NoOpt-NEXT: lxv 3, -32720(3)
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: stxv 3, 32672(3)		# NoOpt-NEXT: stxv 3, -32704(3)
# NoOpt-NEXT: blr		# NoOpt-NEXT: blr

.global vec_ds		.global vec_ds
.p2align 4		.p2align 4
.type vec_ds,@function		.type vec_ds,@function
vec_ds:		vec_ds:
.Lvec_ds_gep:		.Lvec_ds_gep:
addis 2, 12, .TOC.-.Lvec_ds_gep@ha		addis 2, 12, .TOC.-.Lvec_ds_gep@ha
addi 2, 2, .TOC.-.Lvec_ds_gep@l		addi 2, 2, .TOC.-.Lvec_ds_gep@l
.Lvec_ds_lep:		.Lvec_ds_lep:
.localentry vec_ds, .Lvec_dq_lep-.Lvec_dq_gep		.localentry vec_ds, .Lvec_dq_lep-.Lvec_dq_gep
addis 3, 2, vecLd@toc@ha		addis 3, 2, vecLd@toc@ha
lxsd 3, vecLd@toc@l(3)		lxsd 3, vecLd@toc@l(3)
addis 3, 2, vecSt@toc@ha		addis 3, 2, vecSt@toc@ha
stxsd 3, vecSt@toc@l(3)		stxsd 3, vecSt@toc@l(3)
addis 3, 2, vecLd@toc@ha		addis 3, 2, vecLd@toc@ha
lxssp 3, vecLd@toc@l(3)		lxssp 3, vecLd@toc@l(3)
addis 3, 2, vecSt@toc@ha		addis 3, 2, vecSt@toc@ha
stxssp 3, vecSt@toc@l(3)		stxssp 3, vecSt@toc@l(3)
blr		blr
# Dis-LABEL: vec_ds:		# Dis-LABEL: vec_ds:
# Dis-NEXT: addis		# Dis-NEXT: addis
# Dis-NEXT: addi		# Dis-NEXT: addi
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: lxsd 3, 32656(2)		# Dis-NEXT: lxsd 3, -32720(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: stxsd 3, 32672(2)		# Dis-NEXT: stxsd 3, -32704(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: lxssp 3, 32656(2)		# Dis-NEXT: lxssp 3, -32720(2)
# Dis-NEXT: nop		# Dis-NEXT: nop
# Dis-NEXT: stxssp 3, 32672(2)		# Dis-NEXT: stxssp 3, -32704(2)
# Dis-NEXT: blr		# Dis-NEXT: blr

# NoOpt-LABEL: vec_ds:		# NoOpt-LABEL: vec_ds:
# NoOpt-NEXT: addis		# NoOpt-NEXT: addis
# NoOpt-NEXT: addi		# NoOpt-NEXT: addi
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: lxsd 3, 32656(3)		# NoOpt-NEXT: lxsd 3, -32720(3)
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: stxsd 3, 32672(3)		# NoOpt-NEXT: stxsd 3, -32704(3)
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: lxssp 3, 32656(3)		# NoOpt-NEXT: lxssp 3, -32720(3)
# NoOpt-NEXT: addis 3, 2, 0		# NoOpt-NEXT: addis 3, 2, 0
# NoOpt-NEXT: stxssp 3, 32672(3)		# NoOpt-NEXT: stxssp 3, -32704(3)
# NoOpt-NEXT: blr		# NoOpt-NEXT: blr


.global byteLd		.global byteLd
.lcomm byteLd, 1, 1		.lcomm byteLd, 1, 1

.global byteSt		.global byteSt
.lcomm byteSt, 1, 1		.lcomm byteSt, 1, 1
Show All 24 Lines

test/ELF/ppc64-toc-rel.s

	Show First 20 Lines • Show All 55 Lines • ▼ Show 20 Lines

	# RELOCS-BE: Relocations [			# RELOCS-BE: Relocations [
	# RELOCS-BE-NEXT: .rela.text {			# RELOCS-BE-NEXT: .rela.text {
	# RELOCS-BE: 0xA R_PPC64_TOC16_HA global_a 0x0			# RELOCS-BE: 0xA R_PPC64_TOC16_HA global_a 0x0
	# RELOCS-BE: 0xE R_PPC64_TOC16_LO global_a 0x0			# RELOCS-BE: 0xE R_PPC64_TOC16_LO global_a 0x0

	# The .TOC. symbol represents the TOC base address: .got + 0x8000 = 0x10028000,			# The .TOC. symbol represents the TOC base address: .got + 0x8000 = 0x10028000,
	# which is stored in the first entry of .got			# which is stored in the first entry of .got
	# NM: 0000000010028000 d .TOC.			# NM: 00000000100281e8 d .TOC.
	# NM: 0000000010030000 D global_a			# NM: 00000000100301f0 D global_a
	# HEX-LE: section '.got':			# HEX-LE: section '.got':
	# HEX-LE-NEXT: 0x10020000 00800210 00000000			# HEX-LE-NEXT: 0x100201e8 e8810210 00000000
	# HEX-BE: section '.got':			# HEX-BE: section '.got':
	# HEX-BE-NEXT: 0x10020000 00000000 10028000			# HEX-BE-NEXT: 0x100201e8 00000000 100281e8

	# r2 stores the TOC base address. To access global_a with r3, it			# r2 stores the TOC base address. To access global_a with r3, it
	# computes the address with TOC plus an offset.			# computes the address with TOC plus an offset.
	# The offset global_a - .TOC. = 0x10030000 - 0x10028000 = 0x8000			# global_a - .TOC. = 0x100301f0 - 0x100281e8 = (1 << 16) - 32760
	# gets materialized as (1 << 16) - 32768.
	# CHECK: _start:			# CHECK: _start:
	# CHECK: 10010008: addis 3, 2, 1			# CHECK: 100101d0: addis 3, 2, 1
	# CHECK-NEXT: 1001000c: addi 3, 3, -32768			# CHECK-NEXT: 100101d4: addi 3, 3, -32760

test/ELF/ppc64-toc-relax-constants.s

	# REQUIRES: ppc			# REQUIRES: ppc

	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unkown-linux %p/Inputs/ppc64-toc-relax-shared.s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unkown-linux %p/Inputs/ppc64-toc-relax-shared.s -o %t.o
	# RUN: ld.lld -shared %t.o -o %t.so			# RUN: ld.lld -shared -soname=t.so %t.o -o %t.so
	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t1.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t1.o
	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-toc-relax.s -o %t2.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-toc-relax.s -o %t2.o
	# RUN: llvm-readobj -r %t1.o \| FileCheck --check-prefix=RELOCS %s			# RUN: llvm-readobj -r %t1.o \| FileCheck --check-prefix=RELOCS %s
	# RUN: ld.lld %t1.o %t2.o %t.so -o %t			# RUN: ld.lld %t1.o %t2.o %t.so -o %t
	# RUN: llvm-readelf -S %t \| FileCheck --check-prefix=SECTIONS %s			# RUN: llvm-readelf -S %t \| FileCheck --check-prefix=SECTIONS %s
	# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s			# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s
	# RUN: llvm-objdump -D %t \| FileCheck %s			# RUN: llvm-objdump -D %t \| FileCheck %s

	# In most cases, .toc contains exclusively addresses relocated by R_PPC64_ADDR16.			# In most cases, .toc contains exclusively addresses relocated by R_PPC64_ADDR16.
	# Rarely .toc contain constants or variables.			# Rarely .toc contain constants or variables.
	# Test we can still perform toc-indirect to toc-relative relaxation.			# Test we can still perform toc-indirect to toc-relative relaxation.

	# RELOCS: .rela.text {			# RELOCS: .rela.text {
	# RELOCS-NEXT: 0x0 R_PPC64_TOC16_HA .toc 0x0			# RELOCS-NEXT: 0x0 R_PPC64_TOC16_HA .toc 0x0
	# RELOCS-NEXT: 0x4 R_PPC64_TOC16_LO_DS .toc 0x0			# RELOCS-NEXT: 0x4 R_PPC64_TOC16_LO_DS .toc 0x0
	# RELOCS-NEXT: 0x8 R_PPC64_TOC16_HA .toc 0x8			# RELOCS-NEXT: 0x8 R_PPC64_TOC16_HA .toc 0x8
	# RELOCS-NEXT: 0xC R_PPC64_TOC16_LO_DS .toc 0x8			# RELOCS-NEXT: 0xC R_PPC64_TOC16_LO_DS .toc 0x8
	# RELOCS-NEXT: 0x10 R_PPC64_TOC16_HA .toc 0x10			# RELOCS-NEXT: 0x10 R_PPC64_TOC16_HA .toc 0x10
	# RELOCS-NEXT: 0x14 R_PPC64_TOC16_LO_DS .toc 0x10			# RELOCS-NEXT: 0x14 R_PPC64_TOC16_LO_DS .toc 0x10
	# RELOCS-NEXT: }			# RELOCS-NEXT: }

	# SECTIONS: .got PROGBITS 0000000010020090			# SECTIONS: .got PROGBITS 00000000100202f8
	# SECTIONS: .toc PROGBITS 0000000010020090			# SECTIONS: .toc PROGBITS 00000000100202f8

	# NM: 0000000010030000 D default			# NM: 0000000010030310 D default

	# .LCONST1 is .toc[0].			# .LCONST1 is .toc[0].
	# .LCONST1 - (.got+0x8000) = 0x10020090 - (0x10020090+0x8000) = -32768			# .LCONST1 - (.got+0x8000) = 0x10020350 - (0x10020350+0x8000) = -32768
	# CHECK: nop			# CHECK: nop
	# CHECK: lwa 3, -32768(2)			# CHECK: lwa 3, -32768(2)
	addis 3, 2, .LCONST1@toc@ha			addis 3, 2, .LCONST1@toc@ha
	lwa 3, .LCONST1@toc@l(3)			lwa 3, .LCONST1@toc@l(3)

	# .LCONST2 is .toc[1]			# .LCONST2 is .toc[1]
	# .LCONST2 - (.got+0x8000) = 0x10020098 - (0x10020090+0x8000) = -32760			# .LCONST2 - (.got+0x8000) = 0x10020358 - (0x10020350+0x8000) = -32760
	# CHECK: nop			# CHECK: nop
	# CHECK: ld 4, -32760(2)			# CHECK: ld 4, -32760(2)
	addis 4, 2, .LCONST2@toc@ha			addis 4, 2, .LCONST2@toc@ha
	ld 4, .LCONST2@toc@l(4)			ld 4, .LCONST2@toc@l(4)

	# .Ldefault is .toc[2]. `default` is not preemptable when producing an executable.			# .Ldefault is .toc[2]. `default` is not preemptable when producing an executable.
	# After toc-indirection to toc-relative relaxation, it is loaded using an			# After toc-indirection to toc-relative relaxation, it is loaded using an
	# offset relative to r2.			# offset relative to r2.
	# CHECK: nop			# CHECK: addis 5, 2, 1
	# CHECK: addi 5, 2, 32624			# CHECK: addi 5, 5, -32744
	# CHECK: lwa 5, 0(5)			# CHECK: lwa 5, 0(5)
	addis 5, 2, .Ldefault@toc@ha			addis 5, 2, .Ldefault@toc@ha
	ld 5, .Ldefault@toc@l(5)			ld 5, .Ldefault@toc@l(5)
	lwa 5, 0(5)			lwa 5, 0(5)

	.section .toc,"aw",@progbits			.section .toc,"aw",@progbits
	.LCONST1:			.LCONST1:
	.quad 11			.quad 11
	.LCONST2:			.LCONST2:
	.quad 22			.quad 22
	.Ldefault:			.Ldefault:
	.tc default[TC],default			.tc default[TC],default

test/ELF/ppc64-toc-relax-jumptable.s

	Show All 11 Lines
	# RUN: llvm-objdump -d %t \| FileCheck --check-prefixes=CHECK %s			# RUN: llvm-objdump -d %t \| FileCheck --check-prefixes=CHECK %s

	# .LJT is a local symbol (non-preemptable).			# .LJT is a local symbol (non-preemptable).
	# Test we can perform the toc-indirect to toc-relative relaxation.			# Test we can perform the toc-indirect to toc-relative relaxation.

	# SECTIONS: .rodata PROGBITS 00000000100001c8			# SECTIONS: .rodata PROGBITS 00000000100001c8

	# HEX-LE: section '.toc':			# HEX-LE: section '.toc':
	# HEX-LE-NEXT: 10020008 c8010010 00000000			# HEX-LE-NEXT: 10020228 c8010010 00000000

	# HEX-BE: section '.toc':			# HEX-BE: section '.toc':
	# HEX-BE-NEXT: 10020008 00000000 100001c8			# HEX-BE-NEXT: 10020228 00000000 100001c8

	# CHECK-LABEL: _start			# CHECK-LABEL: _start
	# CHECK: clrldi 3, 3, 62			# CHECK: clrldi 3, 3, 62
	# CHECK-NEXT: addis 4, 2, -2			# CHECK-NEXT: addis 4, 2, -3
	# CHECK-NEXT: addi 4, 4, -32312			# CHECK-NEXT: addi 4, 4, 32680
	# CHECK-NEXT: sldi 3, 3, 2			# CHECK-NEXT: sldi 3, 3, 2

	.text			.text
	.global _start			.global _start
	.type _start, @function			.type _start, @function
	_start:			_start:
	.Lstart_gep:			.Lstart_gep:
	addis 2, 12, .TOC.-.Lstart_gep@ha			addis 2, 12, .TOC.-.Lstart_gep@ha
	Show All 37 Lines

test/ELF/ppc64-toc-relax.s

	# REQUIRES: ppc			# REQUIRES: ppc
	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-toc-relax-shared.s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-toc-relax-shared.s -o %t.o
	# RUN: ld.lld -shared %t.o -o %t.so			# RUN: ld.lld -shared -soname=t.so %t.o -o %t.so
	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t1.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t1.o
	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-toc-relax.s -o %t2.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-toc-relax.s -o %t2.o
	# RUN: llvm-readobj -r %t1.o \| FileCheck --check-prefixes=RELOCS-LE,RELOCS %s			# RUN: llvm-readobj -r %t1.o \| FileCheck --check-prefixes=RELOCS-LE,RELOCS %s
	# RUN: ld.lld %t1.o %t2.o %t.so -o %t			# RUN: ld.lld %t1.o %t2.o %t.so -o %t
				# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefixes=COMMON,EXE %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefixes=COMMON,EXE %s

	# RUN: ld.lld -shared %t1.o %t2.o %t.so -o %t2.so			# RUN: ld.lld -shared %t1.o %t2.o %t.so -o %t2.so
	# RUN: llvm-objdump -d --no-show-raw-insn %t2.so \| FileCheck --check-prefixes=COMMON,SHARED %s			# RUN: llvm-objdump -d --no-show-raw-insn %t2.so \| FileCheck --check-prefixes=COMMON,SHARED %s

	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-toc-relax-shared.s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-toc-relax-shared.s -o %t.o
	# RUN: ld.lld -shared %t.o -o %t.so			# RUN: ld.lld -shared -soname=t.so %t.o -o %t.so
	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t1.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t1.o
	# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-toc-relax.s -o %t2.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-toc-relax.s -o %t2.o
	# RUN: llvm-readobj -r %t1.o \| FileCheck --check-prefixes=RELOCS-BE,RELOCS %s			# RUN: llvm-readobj -r %t1.o \| FileCheck --check-prefixes=RELOCS-BE,RELOCS %s
	# RUN: ld.lld %t1.o %t2.o %t.so -o %t			# RUN: ld.lld %t1.o %t2.o %t.so -o %t
				# RUN: llvm-nm %t \| FileCheck --check-prefix=NM %s
	# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefixes=COMMON,EXE %s			# RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck --check-prefixes=COMMON,EXE %s

	# RUN: ld.lld -shared %t1.o %t2.o %t.so -o %t2.so			# RUN: ld.lld -shared %t1.o %t2.o %t.so -o %t2.so
	# RUN: llvm-objdump -d --no-show-raw-insn %t2.so \| FileCheck --check-prefixes=COMMON,SHARED %s			# RUN: llvm-objdump -d --no-show-raw-insn %t2.so \| FileCheck --check-prefixes=COMMON,SHARED %s

	# RELOCS-LE: .rela.text {			# RELOCS-LE: .rela.text {
	# RELOCS-LE-NEXT: 0x0 R_PPC64_TOC16_HA .toc 0x0			# RELOCS-LE-NEXT: 0x0 R_PPC64_TOC16_HA .toc 0x0
	# RELOCS-LE-NEXT: 0x4 R_PPC64_TOC16_LO_DS .toc 0x0			# RELOCS-LE-NEXT: 0x4 R_PPC64_TOC16_LO_DS .toc 0x0
	Show All 18 Lines

	# RELOCS: .rela.toc {			# RELOCS: .rela.toc {
	# RELOCS-NEXT: 0x0 R_PPC64_ADDR64 hidden 0x0			# RELOCS-NEXT: 0x0 R_PPC64_ADDR64 hidden 0x0
	# RELOCS-NEXT: 0x8 R_PPC64_ADDR64 hidden2 0x0			# RELOCS-NEXT: 0x8 R_PPC64_ADDR64 hidden2 0x0
	# RELOCS-NEXT: 0x10 R_PPC64_ADDR64 shared 0x0			# RELOCS-NEXT: 0x10 R_PPC64_ADDR64 shared 0x0
	# RELOCS-NEXT: 0x18 R_PPC64_ADDR64 default 0x0			# RELOCS-NEXT: 0x18 R_PPC64_ADDR64 default 0x0
	# RELOCS-NEXT: }			# RELOCS-NEXT: }

	# NM-DAG: 0000000010030000 D default			# NM-DAG: 00000000100303a0 D default
	# NM-DAG: 0000000010030000 d hidden			# NM-DAG: 00000000100303a0 d hidden
	# NM-DAG: 0000000010040000 d hidden2			# NM-DAG: 00000000100403a0 d hidden2

	# 'hidden' is non-preemptable. It is relaxed.			# 'hidden' is non-preemptable. It is relaxed.
	# address(hidden) - (.got+0x8000) = 0x10030000 - (0x100200c0+0x8000) = 32576			# address(hidden) - (.got+0x8000) = 0x100303a0 - (0x10020380+0x8000) = (1<<16) - 32736
	# COMMON: nop			# COMMON: addis 3, 2, 1
	# COMMON: addi 3, 2, 32576			# COMMON: addi 3, 3, -32736
	# COMMON: lwa 3, 0(3)			# COMMON: lwa 3, 0(3)
	addis 3, 2, .Lhidden@toc@ha			addis 3, 2, .Lhidden@toc@ha
	ld 3, .Lhidden@toc@l(3)			ld 3, .Lhidden@toc@l(3)
	lwa 3, 0(3)			lwa 3, 0(3)

	# address(hidden2) - (.got+0x8000) = 0x10040000 - (0x100200c0+0x8000) = (1<<16)+32576			# address(hidden2) - (.got+0x8000) = 0x100403a0 - (0x10020380+0x8000) = (2<<16) - 32736
	# COMMON: addis 3, 2, 1			# COMMON: addis 3, 2, 2
	# COMMON: addi 3, 3, 32576			# COMMON: addi 3, 3, -32736
	# COMMON: lwa 3, 0(3)			# COMMON: lwa 3, 0(3)
	addis 3, 2, .Lhidden2@toc@ha			addis 3, 2, .Lhidden2@toc@ha
	ld 3, .Lhidden2@toc@l(3)			ld 3, .Lhidden2@toc@l(3)
	lwa 3, 0(3)			lwa 3, 0(3)

	# 'shared' is not defined in an object file. Its definition is determined at			# 'shared' is not defined in an object file. Its definition is determined at
	# runtime by the dynamic linker, so the extra indirection cannot be relaxed.			# runtime by the dynamic linker, so the extra indirection cannot be relaxed.
	# The first addis can still be relaxed to nop, though.			# The first addis can still be relaxed to nop, though.
	# COMMON: nop			# COMMON: nop
	# COMMON: ld 4, -32752(2)			# COMMON: ld 4, -32752(2)
	# COMMON: lwa 4, 0(4)			# COMMON: lwa 4, 0(4)
	addis 4, 2, .Lshared@toc@ha			addis 4, 2, .Lshared@toc@ha
	ld 4, .Lshared@toc@l(4)			ld 4, .Lshared@toc@l(4)
	lwa 4, 0(4)			lwa 4, 0(4)

	# 'default' has default visibility. It is non-preemptable when producing an executable.			# 'default' has default visibility. It is non-preemptable when producing an executable.
	# address(default) - (.got+0x8000) = 0x10030000 - (0x100200c0+0x8000) = 32576			# address(default) - (.got+0x8000) = 0x100303a0 - (0x10020380+0x8000) = (1<<16) - 32736
	# EXE: nop			# EXE: addis 5, 2, 1
	# EXE: addi 5, 2, 32576			# EXE: addi 5, 5, -32736
	# EXE: lwa 5, 0(5)			# EXE: lwa 5, 0(5)

	# SHARED: nop			# SHARED: nop
	# SHARED: ld 5, -32744(2)			# SHARED: ld 5, -32744(2)
	# SHARED: lwa 5, 0(5)			# SHARED: lwa 5, 0(5)
	addis 5, 2, .Ldefault@toc@ha			addis 5, 2, .Ldefault@toc@ha
	ld 5, .Ldefault@toc@l(5)			ld 5, .Ldefault@toc@l(5)
	lwa 5, 0(5)			lwa 5, 0(5)
	Show All 10 Lines

test/ELF/ppc64-toc-restore-recursive-call.s

	# REQUIRES: ppc			# REQUIRES: ppc

	# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			# RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	# RUN: ld.lld -shared %t.o -o %t.so			# RUN: ld.lld -shared %t.o -o %t.so
	# RUN: llvm-objdump -d --no-show-raw-insn -r %t.so \| FileCheck %s			# RUN: llvm-objdump -d --no-show-raw-insn -r %t.so \| FileCheck %s

	# For a recursive call that is interposable the linker calls the plt-stub rather			# For a recursive call that is interposable the linker calls the plt-stub rather
	# then calling the function directly. Since the call is through a plt stub and			# then calling the function directly. Since the call is through a plt stub and
	# might be interposed with a different definition at runtime, a toc-restore is			# might be interposed with a different definition at runtime, a toc-restore is
	# required to follow the call.			# required to follow the call.

	# The decision to use a plt-stub for the recursive call is not one I feel			# The decision to use a plt-stub for the recursive call is not one I feel
	# strongly about either way. It was done because it matches what bfd and gold do			# strongly about either way. It was done because it matches what bfd and gold do
	# for recursive calls as well as keeps the logic for recursive calls consistent			# for recursive calls as well as keeps the logic for recursive calls consistent
	# with non-recursive calls.			# with non-recursive calls.

	# CHECK-LABEL: 0000000000010000 recursive_func:			# CHECK-LABEL: 0000000000010290 recursive_func:
	# CHECK: 10028: bl .+32			# CHECK: 102b8: bl .+32
	# CHECK-NEXT: ld 2, 24(1)			# CHECK-NEXT: ld 2, 24(1)

	# CHECK-LABEL: 0000000000010048 __plt_recursive_func:			# CHECK-LABEL: 00000000000102d8 __plt_recursive_func:

	.abiversion 2			.abiversion 2
	.section ".text"			.section ".text"
	.p2align 2			.p2align 2
	.global recursive_func			.global recursive_func
	.type recursive_func, @function			.type recursive_func, @function
	recursive_func:			recursive_func:
	.Lrf_gep:			.Lrf_gep:
	Show All 22 Lines

test/ELF/ppc64-toc-restore.s

	// REQUIRES: ppc			// REQUIRES: ppc

	// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %s -o %t.o
	// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o
	// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-func.s -o %t3.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64le-unknown-linux %p/Inputs/ppc64-func.s -o %t3.o
	// RUN: ld.lld -shared %t2.o -o %t2.so			// RUN: ld.lld -shared -soname=t2.so %t2.o -o %t2.so
	// RUN: ld.lld %t.o %t2.so %t3.o -o %t			// RUN: ld.lld %t.o %t2.so %t3.o -o %t
	// RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			// RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s

	// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %s -o %t.o
	// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/shared-ppc64.s -o %t2.o
	// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-func.s -o %t3.o			// RUN: llvm-mc -filetype=obj -triple=powerpc64-unknown-linux %p/Inputs/ppc64-func.s -o %t3.o
	// RUN: ld.lld -shared %t2.o -o %t2.so			// RUN: ld.lld -shared -soname=t2.so %t2.o -o %t2.so
	// RUN: ld.lld %t.o %t2.so %t3.o -o %t			// RUN: ld.lld %t.o %t2.so %t3.o -o %t
	// RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s			// RUN: llvm-objdump -d --no-show-raw-insn %t \| FileCheck %s

	.text			.text
	.abiversion 2			.abiversion 2
	.global bar_local			.global bar_local
	bar_local:			bar_local:
	li 3, 2			li 3, 2
	blr			blr

	# Calling external function foo in a shared object needs a nop.			# Calling external function foo in a shared object needs a nop.
	# Calling local function bar_local doe not need a nop.			# Calling local function bar_local doe not need a nop.
	.global _start			.global _start
	_start:			_start:
	bl foo			bl foo
	nop			nop
	bl bar_local			bl bar_local
	// CHECK-LABEL: _start:			// CHECK-LABEL: _start:
	// CHECK-NEXT: 10010008: bl .+64			// CHECK-NEXT: 100102c8: bl .+64
	// CHECK-NEXT: 1001000c: ld 2, 24(1)			// CHECK-NEXT: 100102cc: ld 2, 24(1)
	// CHECK-NEXT: 10010010: bl .-16			// CHECK-NEXT: 100102d0: bl .-16
	// CHECK-EMPTY:			// CHECK-EMPTY:

	# Calling a function in another object file which will have same			# Calling a function in another object file which will have same
	# TOC base does not need a nop. If nop present, do not rewrite to			# TOC base does not need a nop. If nop present, do not rewrite to
	# a toc restore			# a toc restore
	.global diff_object			.global diff_object
	_diff_object:			_diff_object:
	bl foo_not_shared			bl foo_not_shared
	bl foo_not_shared			bl foo_not_shared
	nop			nop
	// CHECK-LABEL: _diff_object:			// CHECK-LABEL: _diff_object:
	// CHECK-NEXT: 10010014: bl .+28			// CHECK-NEXT: 100102d4: bl .+28
	// CHECK-NEXT: 10010018: bl .+24			// CHECK-NEXT: 100102d8: bl .+24
	// CHECK-NEXT: 1001001c: nop			// CHECK-NEXT: 100102dc: nop

	# Branching to a local function does not need a nop			# Branching to a local function does not need a nop
	.global noretbranch			.global noretbranch
	noretbranch:			noretbranch:
	b bar_local			b bar_local
	// CHECK-LABEL: noretbranch:			// CHECK-LABEL: noretbranch:
	// CHECK: 10010020: b .+67108832			// CHECK: 100102e0: b .+67108832
	// CHECK-EMPTY:			// CHECK-EMPTY:

	// This should come last to check the end-of-buffer condition.			// This should come last to check the end-of-buffer condition.
	.global last			.global last
	last:			last:
	bl foo			bl foo
	nop			nop
	// CHECK-LABEL: last:			// CHECK-LABEL: last:
	// CHECK-NEXT: 10010024: bl .+36			// CHECK-NEXT: 100102e4: bl .+36
	// CHECK-NEXT: 10010028: ld 2, 24(1)			// CHECK-NEXT: 100102e8: ld 2, 24(1)

test/ELF/ppc64-weak-undef-call.s

Show All 18 Lines	_start:
blr		blr

.weak weakfunc		.weak weakfunc

# It does not really matter how we fixup the bl, if at all, because it needs to		# It does not really matter how we fixup the bl, if at all, because it needs to
# be unreachable. But, we should link successfully. We should not, however,		# be unreachable. But, we should link successfully. We should not, however,
# generate a .plt entry (this would be wasted space). For now, we do nothing		# generate a .plt entry (this would be wasted space). For now, we do nothing
# (leaving the zero relative offset present in the input).		# (leaving the zero relative offset present in the input).
# CHECK: 10010000: bl .+0		# CHECK: 10010158: bl .+0
# CHECK: 10010004: nop		# CHECK: 1001015c: nop
# CHECK: 10010008: blr		# CHECK: 10010160: blr

test/ELF/relro-copyrel-bss-script.s

	// REQUIRES: x86			// REQUIRES: x86
	// RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %p/Inputs/shared.s -o %t.o			// RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %p/Inputs/shared.s -o %t.o
	// RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %p/Inputs/copy-in-shared.s -o %t2.o			// RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %p/Inputs/copy-in-shared.s -o %t2.o
	// RUN: ld.lld -shared %t.o %t2.o -o %t.so			// RUN: ld.lld -shared -soname=t.so %t.o %t2.o -z separate-code -o %t.so

	// A linker script that will map .bss.rel.ro into .bss.			// A linker script that will map .bss.rel.ro into .bss.
	// RUN: echo "SECTIONS { \			// RUN: echo "SECTIONS { \
	// RUN: .bss : { (.bss) (.bss.*) } \			// RUN: .bss : { (.bss) (.bss.*) } \
	// RUN: } " > %t.script			// RUN: } " > %t.script

	// RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %s -o %t3.o			// RUN: llvm-mc -filetype=obj -triple=x86_64-unknown-linux %s -o %t3.o
	// RUN: ld.lld %t3.o %t.so -z relro -o %t --script=%t.script 2>&1			// RUN: ld.lld %t3.o %t.so -z relro -z separate-code -o %t --script=%t.script 2>&1
	// RUN: llvm-readobj --program-headers %t \| FileCheck %s			// RUN: llvm-readelf -l %t \| FileCheck %s
	.section .text, "ax", @progbits			.section .text, "ax", @progbits
	.global bar			.global bar
	.global foo			.global foo
	.global _start			.global _start
	_start:			_start:
	callq bar			callq bar
	// Will produce .bss.rel.ro that will match in .bss, this will lose			// Will produce .bss.rel.ro that will match in .bss, this will lose
	// the relro property of the copy relocation.			// the relro property of the copy relocation.
	.quad foo			.quad foo

	// Non relro bss			// Non relro bss
	.bss			.bss
	// make large enough to affect PT_GNU_RELRO MemSize if this was marked			// make large enough to affect PT_GNU_RELRO MemSize if this was marked
	// as relro.			// as relro.
	.space 0x2000			.space 0x2000

	// CHECK: Type: PT_GNU_RELRO (0x6474E552)			// CHECK: Type Offset VirtAddr PhysAddr FileSiz MemSiz Flg Align
	// CHECK-NEXT: Offset:			// CHECK: GNU_RELRO 0x002150 0x0000000000000150 0x0000000000000150 0x000100 0x000eb0 R 0x1
	// CHECK-NEXT: VirtualAddress:
	// CHECK-NEXT: PhysicalAddress:
	// CHECK-NEXT: FileSize:
	// CHECK-NEXT: MemSize: 4096
	// CHECK-NEXT: Flags [ (0x4)
	// CHECK-NEXT: PF_R (0x4)
	// CHECK-NEXT: ]
	// CHECK-NEXT: Alignment: 1
	// CHECK-NEXT: }

This is an archive of the discontinued LLVM Phabricator instance.

[ELF][PPC] Allow PT_LOAD to have overlapping p_offset rangesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 210964

ELF/InputSection.cpp

ELF/Writer.cpp

test/ELF/basic-ppc64.s

test/ELF/ppc64-abs64-dyn.s

test/ELF/ppc64-bsymbolic-toc-restore.s

test/ELF/ppc64-call-reach.s

test/ELF/ppc64-dq.s

test/ELF/ppc64-dtprel.s

test/ELF/ppc64-entry-point.s

test/ELF/ppc64-error-missaligned-dq.s

test/ELF/ppc64-error-missaligned-ds.s

test/ELF/ppc64-func-entry-points.s

test/ELF/ppc64-ifunc.s

test/ELF/ppc64-local-dynamic.s

test/ELF/ppc64-long-branch-localentry-offset.s

test/ELF/ppc64-long-branch.s

test/ELF/ppc64-plt-stub.s

test/ELF/ppc64-rel-calls.s

test/ELF/ppc64-relocs.s

test/ELF/ppc64-shared-long_branch.s

test/ELF/ppc64-tls-gd.s

test/ELF/ppc64-tls-ie.s

test/ELF/ppc64-tls-vaddr-align.s

test/ELF/ppc64-toc-addis-nop-lqsq.s

test/ELF/ppc64-toc-addis-nop.s

test/ELF/ppc64-toc-rel.s

test/ELF/ppc64-toc-relax-constants.s

test/ELF/ppc64-toc-relax-jumptable.s

test/ELF/ppc64-toc-relax.s

test/ELF/ppc64-toc-restore-recursive-call.s

test/ELF/ppc64-toc-restore.s

test/ELF/ppc64-weak-undef-call.s

test/ELF/relro-copyrel-bss-script.s

[ELF][PPC] Allow PT_LOAD to have overlapping p_offset ranges
ClosedPublic