This is an archive of the discontinued LLVM Phabricator instance.

Implement indirect branch generation in position independent code for the RISC-V target
ClosedPublic

Authored by msizanoen1 on Jul 29 2020, 3:25 AM.

Diff Detail

Event Timeline

msizanoen1 created this revision.Jul 29 2020, 3:25 AM
msizanoen1 created this object with edit policy "msizanoen1 (msizanoen1)".
msizanoen1 requested review of this revision.Jul 29 2020, 3:25 AM
msizanoen1 changed the edit policy from "msizanoen1 (msizanoen1)" to "Subscribers".Jul 29 2020, 3:28 AM

Context: this is part of the patch which fixes an issue seen when building rust-analyzer for riscv64 linux hosts. I think this patch also fixes an issue seen by the Zig toolchain developers, who were trying to compile the LLVM NVPTX backend for riscv64 linux hosts.

@msizanoen1 Please can you upload the entire patch that you proposed in the rust-lang thread (including the parts about reordering branch relaxation), and also ensure you include all the context using git diff -U999999 there are more instructions here: https://llvm.org/docs/Phabricator.html#requesting-a-review-via-the-web-interface

Thanks for getting the patch prepared so quickly!

msizanoen1 updated this revision to Diff 281515.
msizanoen1 retitled this revision from Implement indirect branch generation in position independent code for the RISC-V target to Implement indirect branch generation in position independent code for the RISC-V target and reorder BranchRelax before RISCVExpandPseudo.
msizanoen1 retitled this revision from Implement indirect branch generation in position independent code for the RISC-V target and reorder BranchRelax before RISCVExpandPseudo to Implement indirect branch generation in position independent code for the RISC-V target and reorder BranchRelaxation to be run after RISCVExpandPseudo.Jul 29 2020, 4:23 AM
jrtc27 requested changes to this revision.Jul 29 2020, 5:03 AM
jrtc27 added inline comments.
llvm/lib/Target/RISCV/RISCVTargetMachine.cpp
35–37 ↗(On Diff #281515)

Do we really want to be able to turn this off? It's only used when needed, and I don't see why you'd ever want to force code to fail to compile that could easily be fixed up by the compiler to work. Do note that the relaxation pass does double-duty, doing *both* direct -> indirect relaxation (rare) but *also* conditional direct -> unconditional direct relaxation which isn't that rare given the 13-bit signed displacement available for conditional branches (compared with JAL's 21-bit signed displacement). For example, I know FreeBSD kernels hits the latter case in a few places. Though even if this option only affected the indirect relaxation I'd still argue your compiler shouldn't provide an option that does nothing to affect CodeGen other than artificially stopping it from compiling certain programs.

182–184 ↗(On Diff #281515)

This should not matter. All the pseudoinstructions are supposed to have correct lengths for their expanded forms (see https://reviews.llvm.org/D77443), so please revert this hunk. That also means you can use PseudoLLA in insertIndirectBranch rather than having to duplicate the expansion of it.

This revision now requires changes to proceed.Jul 29 2020, 5:03 AM

(the fact that FreeBSD kernel builds could trigger relaxation is what exposed the bug fixed in https://reviews.llvm.org/D77443)

msizanoen1 retitled this revision from Implement indirect branch generation in position independent code for the RISC-V target and reorder BranchRelaxation to be run after RISCVExpandPseudo to Implement indirect branch generation in position independent code for the RISC-V target.
jrtc27 requested changes to this revision.Jul 29 2020, 5:28 AM

Please avoid duplication. Something like this should be better in that regard, both for duplicating between the if/else branches and duplicating with the existing AUIPC-based expansions elsewhere.

llvm/lib/Target/RISCV/RISCVInstrInfo.cpp
397–418
This revision now requires changes to proceed.Jul 29 2020, 5:28 AM

Although I feel PseudoJump might be a sensible choice even for position-dependent code? By this point we know the branch target is more than 2 MiB away, and I assume that neither LUI/JALR nor AUIPC/JALR will be either relaxable at link time (branch target would need to be a low address) or compressible. Plus AUIPC/JALR is more like JAL in being PC-relative, so it "feels" more "faithful".

This still need to be backported to rust-lang/llvm-project fork which is still on LLVM 10 so using PseudoJump is not an option.

jrtc27 requested changes to this revision.Jul 29 2020, 6:03 AM

This still need to be backported to rust-lang/llvm-project fork which is still on LLVM 10 so using PseudoJump is not an option.

The Rust community is free to do what it likes in its fork, but their restrictions should not cause us as upstream to commit lower-quality code. There's also nothing stopping them back-porting https://reviews.llvm.org/D73178 (and https://reviews.llvm.org/D77443, which they'll need now too). So please, let's keep the code in LLVM upstream as clean and concise as possible.

This revision now requires changes to proceed.Jul 29 2020, 6:03 AM
jrtc27 added inline comments.Jul 29 2020, 7:01 AM
llvm/lib/Target/RISCV/RISCVInstrInfo.cpp
399

I don't know how much it matters, but we know the register is dead here.

msizanoen1 marked an inline comment as done.

This looks good to me now, let's see what others think. But please update llvm/test/CodeGen/RISCV/branch-relaxation.ll to reflect the change in CodeGen from LUI/JALR to AUIPC/JALR, and add PIC RUN lines. Plus whilst you're there I guess it wouldn't hurt to add RV64 variants too.

(and my ; TODO: Extend simm12's MCOperandPredicate so the jalr zero is printed as a jr. then goes away, not because that's been fixed, but because by using PseudoJump we no longer expose it... perhaps that comment should be relocated to RISCVInstrInfo.td's definition of simm12, but it's a minor stylistic thing, it doesn't really matter beyond not being the canonical alias)

jrtc27 added inline comments.Jul 29 2020, 7:42 AM
llvm/test/CodeGen/RISCV/branch-relaxation.ll
7

You've lost the check for the non-PIC one and didn't check that the PIC one actually assembles (it really should, but you never know; the point is that unless you emit a .o the fixups are never actually computed and thus you won't get errors if they end up overflowing).

llvm/test/CodeGen/RISCV/branch-relaxation.ll
2–5
6–8
msizanoen1 updated this revision to Diff 281594.
msizanoen1 marked an inline comment as done.
jrtc27 added inline comments.Jul 29 2020, 8:01 AM
llvm/test/CodeGen/RISCV/branch-relaxation.ll
6–8

This isn't quite what I suggested, as it's too long a line without wrapping more of the arguments to the next line.

asb added a comment.Jul 30 2020, 5:37 AM

@jrtc27 are you happy with this patch now? Thanks for your review on this and thanks @msizanoen1 for providing this fix.

I think @jrtc27's suggestion about deleting/moving the TODO line in test code still needs to be addressed.

In D84833#2184491, @asb wrote:

@jrtc27 are you happy with this patch now? Thanks for your review on this and thanks @msizanoen1 for providing this fix.

I think @jrtc27's suggestion about deleting/moving the TODO line in test code still needs to be addressed.

Other than the TODO, yes (and hopefully you agree with my suggestions here!). I would also like to see branch-relaxation.ll grow RV64 RUN lines for completeness, even if it's probably a bit redundant (and creates a slight explosion..), but I can do that as a follow-up patch.

Other than the TODO, yes (and hopefully you agree with my suggestions here!). I would also like to see branch-relaxation.ll grow RV64 RUN lines for completeness, even if it's probably a bit redundant (and creates a slight explosion..), but I can do that as a follow-up patch.

This patch still seems to need some additional work. When I apply it and run the tests I get a crash for the branch-relaxation.ll test:

*** Bad machine code: MBB has unexpected successors which are not branch targets, fallthrough, EHPads, or inlineasm_br targets. ***
- function:    relax_jal
- basic block: %bb.3  (0x5621c4161a48)

Other than the TODO, yes (and hopefully you agree with my suggestions here!). I would also like to see branch-relaxation.ll grow RV64 RUN lines for completeness, even if it's probably a bit redundant (and creates a slight explosion..), but I can do that as a follow-up patch.

This patch still seems to need some additional work. When I apply it and run the tests I get a crash for the branch-relaxation.ll test:

*** Bad machine code: MBB has unexpected successors which are not branch targets, fallthrough, EHPads, or inlineasm_br targets. ***
- function:    relax_jal
- basic block: %bb.3  (0x5621c4161a48)

Ah, that's because the TableGen definition is broken (works fine for asm, but not code gen):

let isCall = 0, isBarrier = 0, isCodeGenOnly = 0, hasSideEffects = 0,
    mayStore = 0, mayLoad = 0 in

I _believe_ we need:

isBranch = 1, isTerminator = 1, isBarrier = 1

?

I _believe_ we need:

isBranch = 1, isTerminator = 1, isBarrier = 1

?

Sounds about right. (I don't know offhand in what circumstances you have different values for isTerminator and isBarrier).
With that change we now run into an llvm_unreachable in RISCVInstrInfo::isBranchOffsetInRange, maybe it's just implementing that missing case.

jrtc27 requested changes to this revision.Jul 30 2020, 7:42 AM

Given @luismarques's comment, have you now actually run the tests (and I mean all RISC-V tests, not just branch-relaxation.ll, in case anything has been missed)? Perhaps I shouldn't take it for granted that people run tests before submitting patches, including revisions (though I'd still verify myself before committing anything), but you really should as otherwise it just wastes our time having to tell you they're broken when you could just run them yourself.

llvm/lib/Target/RISCV/RISCVInstrInfo.td
1015–1017

Group the branch/terminator-related bits together and avoid unnecessary line breaks

This revision now requires changes to proceed.Jul 30 2020, 7:42 AM

I _believe_ we need:

isBranch = 1, isTerminator = 1, isBarrier = 1

?

Sounds about right. (I don't know offhand in what circumstances you have different values for isTerminator and isBarrier).

My guess would be barrier implies terminator but that you can't really express that in TableGen (though maybe clever use of ? and backend tweaks could do something ad-hoc) so for simplicity you just have to specify both. The other way round certainly doesn't hold; conditional branches can terminate a basic block and fall through to the next one.

With that change we now run into an llvm_unreachable in RISCVInstrInfo::isBranchOffsetInRange, maybe it's just implementing that missing case.

Yeah, looks like it. The easy answer is to do isIntN(32, BrOffset), though that's not strictly correct on RV64 due to the implicit + 0x800 in the HI/LO relocations, so should really be isIntN(32, SignExtend64(BrOffset + 0x800, XLen)) I think, and is basically what LLD does for checking the range of HI20 values (LLD also left-shifts and does an isInt<20> instead, but they're equivalent). The same also goes for the assertion in insertIndirectBranch. I won't ask why we use isInt<X>(...) there but isIntN(X, ...) in isBranchOffsetInRange... :)

jrtc27 added inline comments.Jul 30 2020, 8:17 PM
llvm/lib/Target/RISCV/RISCVInstrInfo.cpp
428

Isn't STI a RISCVSubtarget, so we can just call getXLen?

445

Nowhere else in the tree casts for uses of SignExtend64 like this (e.g. MipsMCExpr.cpp), even if strictly there could be signed integer overflow here, but where you've put the cast is too late. If you're going to cast anywhere, do it on BrOffset itself, but I'd probably just match what everyone else does and drop it althogether.

llvm/lib/Target/RISCV/RISCVInstrInfo.td
1015–1017

Please combine the two short lines like I said

asb added a comment.Aug 6 2020, 6:28 AM

@jrtc27 are you happy with this patch in its current form or are there outstanding issues to be addressed?

jrtc27 requested changes to this revision.Aug 6 2020, 6:31 AM
jrtc27 added inline comments.
llvm/lib/Target/RISCV/RISCVInstrInfo.td
1017

It's not an indirect branch? The code sequence is identical to PseudoTAIL, just with a non-hard-coded temporary register. Any microarchitecture that does macro-op fusion will recognise this sequence as a PC-relative direct jump.

This revision now requires changes to proceed.Aug 6 2020, 6:31 AM
msizanoen1 updated this revision to Diff 283597.Aug 6 2020, 6:55 AM
msizanoen1 updated this revision to Diff 283598.Aug 6 2020, 7:03 AM
jrtc27 accepted this revision.Aug 6 2020, 7:15 AM

Subject to the tests all passing, I think this is now good.

This revision is now accepted and ready to land.Aug 6 2020, 7:15 AM

@lenary I don't have commit access, can you commit this?

Sorry, I was away. It's on my list to get to today!