This patch adds the target call back relaxTlsIeToLe to support TLS relaxation from initial exec to local exec model.
Details
Diff Detail
Event Timeline
lld/ELF/Arch/PPC64.cpp | ||
---|---|---|
190 | In lld we do not take the approach that "every constant should be defined as variable" but this is perhaps a bit cross the line. Don't you have constants for these values? |
lld/ELF/Arch/PPC64.cpp | ||
---|---|---|
25 | nit: add a blank line before any multi-line class, struct, enum, etc definitions/declarations. It needs a comment. Is this a instruction number? | |
186 | format? | |
187–188 | Can you please clang-format? | |
205–207 | Which does this mean, we expect the three sequential instructions or one of them? | |
lld/ELF/Relocations.h | ||
47 | I don't know if you really need to add a new type. IE→LE relaxation has already implemented for some other architectures. I wonder if you can do the same thing as others. | |
lld/test/ELF/ppc64-tls-ie-le.s | ||
2 | How did you create this file? It seems a bit too long. Can you reduce it by hand? Also, splitting to smaller test files might improve maintainability. |
lld/ELF/Arch/PPC64.cpp | ||
---|---|---|
205–207 | These may not be sequential but are all part of the initial exec sequence and we expect each one of them to be converted into a new instruction. addis r9, r2, x@got@tprel@ha ld r9, x@got@tprel@l(r9) are building the offset of x relative to the thread pointer by reading it from the got. add r9, r9, x@tls completes the address by replacing x@tls with r13 which is the thread pointer. We will do the following replacements as started by the V2 ABI: addis r9, r2, x@got@tprel@ha changes into nop ld r9, x@got@tprel@l(r9) changes into addis r9, r13, x@tprel@ha add r9, r9, x@tls changes into addi r9, r9, x@tprel@l This new sequence now calculates the offset of x relative to the thread pointer rather than reading it from the got. | |
lld/test/ELF/ppc64-tls-ie-le.s | ||
2 | I created this file by hand. It is composed of multiple small tests that are all related so I think it would be better if we kept them in one file. The small tests are meant to test the same relocation relaxation which has multiple options for the instruction: lbzx -> lbz lhzx -> lhz lwzx -> lwz ldx -> ld stbx -> stb sthx -> sth stwx -> stw stdx -> std add -> addi |
lld/ELF/Relocations.h | ||
---|---|---|
47 | I added this new type because I wanted a relocation which would get ignored similar to how R_HINT and R_NONE are, except for when handling TLS relaxations. I added R_TLS_IE_HINT as the RelExp for R_PPC64_TLS. add r9, r9, x@tls into addi r9, r9, x@tprel@l Leaving it as R_HINT causes us to skip calling relaxTlsGdToLe with R_PPC64_TLS and we are unable to do the above change since there is an early exit in scanReloc for R_HINT. If I use the existing relocation types used for initial exec, like R_GOT_OFF, I would be evaluating it in getRelocTargetVA, just to never use it. Adding this new type allows me to skip getRelocTargetVA for R_PPC64_TLS, while being able to handle it in the TLS relaxations. |
lld/ELF/Relocations.h | ||
---|---|---|
47 | Maybe R_TLSIE_HINT instead to match R_TLSLD_HINT added in other patch. |
lld/ELF/Arch/PPC64.cpp | ||
---|---|---|
207 | nit: error messages should start with a lowercase letter. | |
238 | nit: remove {} since we don't need to create a new scope for local variable for this case. | |
249 | unsigned -> uint32_t | |
251 | U -> u | |
252–253 | unsigned -> uint32_t. | |
lld/ELF/Relocations.h | ||
83 | Does this compile? R_TLSIE_HINT is added but other files refer R_TLS_IE_HINT. |
Just a question. Does this patch support the optimization that fills in the GOT slot when a group of initial-exec relocations are known to be link-time constants?
__attribute__((tls_model("initial-exec"))) static __thread DTLS dtls;
48: 00 00 62 3c addis r3,r2,0 48: R_PPC64_GOT_TPREL16_HA _ZN11__sanitizerL4dtlsE 4c: 00 00 63 e8 ld r3,0(r3) 4c: R_PPC64_GOT_TPREL16_LO_DS _ZN11__sanitizerL4dtlsE 50: 14 6a 83 7c add r4,r3,r13 50: R_PPC64_TLS _ZN11__sanitizerL4dtlsE
https://github.com/llvm-mirror/lld/tree/master/ELF/Relocations.cpp#L747
template <class ELFT> static void addGotEntry(Symbol &Sym) { ... bool IsLinkTimeConstant = !Sym.IsPreemptible && (!Config->Pic || isAbsolute(Sym)); /// if R_PPC64_GOT_TPREL16_HA R_PPC64_GOT_TPREL16_LO_DS are link-time constants, they can be filled but `Target->GotRel` should not be used here as it is a 64-bit value. /// Target->GotRel is R_PPC64_GLOB_DAT but I think it should not be used. if (IsLinkTimeConstant) { InX::Got->Relocations.push_back({Expr, Target->GotRel, Off, 0, &Sym}); return; }
For the example of:
attribute((tls_model("initial-exec")))
static __thread DTLS dtls;
We handle the TLS relocations for R_PPC64_GOT_TPREL16_HA, R_PPC64_GOT_TPREL16_LO_DS, R_PPC64_TLS before we check for needsGot.
When handling these relocations, they are optimized into the local exec model. This model does not require any GOT slots, so there isn't anything to optimize for the GOT slots.
lld/ELF/Arch/PPC64.cpp | ||
---|---|---|
26 | Do we need to relax vector load/stores as well? | |
222 | A brief description of what x@tls means would be useful here. | |
225 | getDFormOp already lists all the instructions we want to convert form X-form to D-form. We can probably condense this comment to not list them all explicitly. The last instruction in the initial exec sequence has multiple variations that need to be handled. If we are building an address it will use an add instruction, if we are accessing memory it will use any of the X-form indexed load or store instructions | |
257 | We should have a default case with an llvm_unreachable, and do we need to handle R_PPC64_GOT_TPREL16_DS and R_PPC64_GOT_TPREL16_HI? | |
lld/ELF/InputSection.cpp | ||
515 | Sort alphabetically. |
nit: add a blank line before any multi-line class, struct, enum, etc definitions/declarations.
It needs a comment. Is this a instruction number?