This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/
-
llvm/
-
CodeGen/
-
MachineBasicBlock.h
5/7
MachineInstr.h
-
Support/
1/3
TargetOpcodes.def
-
Target/
4/4
Target.td
-
lib/
-
CodeGen/
-
DetectDeadLanes.cpp
2/3
ExpandPostRAPseudos.cpp
-
GlobalISel/
-
CombinerHelper.cpp
-
GISelKnownBits.cpp
-
InstructionSelect.cpp
-
MachineIRBuilder.cpp
-
Utils.cpp
-
MachineBasicBlock.cpp
-
MachineInstr.cpp
4/8
MachineSink.cpp
1/2
MachineVerifier.cpp
-
PeepholeOptimizer.cpp
-
ReachingDefAnalysis.cpp
-
RegAllocFast.cpp
-
SelectionDAG/
2/4
InstrEmitter.cpp
-
ScheduleDAGSDNodes.cpp
-
SelectionDAGBuilder.cpp
-
Target/
-
AArch64/
-
AArch64CallLowering.cpp
-
AArch64FastISel.cpp
-
AArch64InstrInfo.cpp
-
AArch64InstructionSelector.cpp
-
AArch64RegisterBankInfo.cpp
-
AMDGPU/
-
SIISelLowering.cpp
-
SIInstrInfo.cpp
-
Hexagon/
-
BitTracker.cpp
-
HexagonBitSimplify.cpp
-
HexagonFrameLowering.cpp
-
HexagonGenPredicate.cpp
-
HexagonHardwareLoops.cpp
-
HexagonISelDAGToDAGHVX.cpp
-
HexagonInstrInfo.cpp
-
HexagonMachineScheduler.cpp
-
HexagonNewValueJump.cpp
-
HexagonSplitDouble.cpp
-
RDFCopy.cpp
-
Mips/
-
MipsRegisterBankInfo.cpp
-
MipsSEFrameLowering.cpp
-
NVPTX/
-
NVPTXReplaceImageHandles.cpp
-
X86/
-
X86DomainReassignment.cpp
-
X86FlagsCopyLowering.cpp
-
X86FloatingPoint.cpp
-
X86InstrInfo.cpp
-
test/CodeGen/
-
CodeGen/
-
AArch64/
-
callbr-asm-label.ll
-
SystemZ/
-
asm-20.ll
-
X86/
-
callbr-asm-label-addr.ll
-
callbr-asm-outputs-tcopy-spilling.ll
-
callbr-asm-outputs.ll

Differential D75098

Add TCOPY, a terminator form of the COPY instr
Needs ReviewPublic

Authored by void on Feb 24 2020, 6:47 PM.

Download Raw Diff

Details

Reviewers

jyknight
hfinkel
MaskRay
lattner
qcolombet

Summary

The INLINEASM_BR instruction's a terminator, and the code generator
doesn't allow non-terminator instructions after a terminator. This is an
issue when an INLINEASM_BR defines a physical register. We can't place
the copies of the physical registers into virtual registers in the
fallthrough block because physical registers can't be marked as
"live-in" until after register allocation.

To get around this issue, we introduce a new pseudo-instruction, TCOPY,
that's identical to the COPY instruction, but is a terminator. With it
we're able to copy the physical registers to virtual registers without
needing to place the copies in a fallthrough block:

bb.1:
  INLINEASM_BR &"" ..., implicit-def $esi, $1:[regdef],...
  %9:gr32 = TCOPY $esi

bb.2:
; predecessors: %bb.1
  successors: %bb.3(0x80000000); %bb.3(100.00%)

  %0:gr32 = COPY %9:gr32
  JMP_1 %bb.3

The TCOPY is converted to a normal COPY after register allocation, when
we have live variable information and live-ins are allowed on basic
blocks.

The fast register allocator behaves a bit differently because everything
is spilled before the end of a basic block. Therefore, we allow a store
of a physical register after the INLINEASM_BR when optimizations are
disabled.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

void created this revision.Feb 24 2020, 6:47 PM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 24 2020, 6:47 PM

Herald added subscribers: llvm-commits, hiraditya, MatzeB. · View Herald Transcript

Harbormaster completed remote builds in B47179: Diff 246359.Feb 24 2020, 6:52 PM

This will be useful for AMDGPU, we currently have a set of _term mov instructions for this purpose.

llvm/include/llvm/Support/TargetOpcodes.def
101	s/terminal/terminator
llvm/include/llvm/Target/Target.td
1131–1132	This shouldn't need isBranch or isIndirectBranch
llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
179	I would expect this to not special case INLINEASM_BR. This should be any value defined by a terminator instruction

arsenm added inline comments.Feb 24 2020, 7:14 PM

llvm/include/llvm/Support/TargetOpcodes.def
101	COPY_TERM or COPY_TERMINATOR would be better since it isn't really a branch

lkail added a subscriber: lkail.Feb 24 2020, 7:40 PM

nickdesaulniers added inline comments.Feb 25 2020, 11:31 AM

llvm/include/llvm/CodeGen/MachineInstr.h
1132	Does this save a few `getOpcode()` calls? The rest of this CL explicitly compares the opcode against `COPY_BR`? I just worry if this might mess up existing callsites of `isCopy` when `getOpcode() == TargetOpcode::COPY_BR;`.
llvm/include/llvm/Target/Target.td
1133	`COPY` also sets `hasNoSchedulingInfo` to `0`. Should `COPY_BR` do that as well?
llvm/lib/CodeGen/ExpandPostRAPseudos.cpp
198	what? if we could use a range based for loop for the `MBB`'s, surely we can for `MI`.
llvm/lib/CodeGen/MachineSink.cpp
1395	Does it make sense to "sink" register info invalidation into `performSink()`? (pun intended) Since it's already checking the `MI`'s opcode? Or are the two call sites of `performSink()` problematic? The implementation of `invalidateLiveness()` looks pretty cheap, IMO.
llvm/lib/CodeGen/MachineVerifier.cpp
829	this is hard to read. Is there a nice way to simply this? Maybe negate, and fold into parent `if`?

void marked 7 inline comments as done.Feb 25 2020, 1:10 PM

void added inline comments.

llvm/include/llvm/CodeGen/MachineInstr.h
1132	It's more than just a few. `isCopy()` is used pretty extensively. I want the `COPY_BR` (or `COPY_TERM`) to be exactly like a normal `COPY`, with the only exception that it can come after a terminator.
llvm/include/llvm/Support/TargetOpcodes.def
101	How about `TCOPY`?
llvm/include/llvm/Target/Target.td
1133	Possibly, though I had trouble finding where to place its scheduling info. It seems like `COPY` is special and TableGen adds it by default. I couldn't find the place to do that for `COPY_BR`, but I'll take another look.

void added inline comments.Feb 25 2020, 1:10 PM

llvm/lib/CodeGen/ExpandPostRAPseudos.cpp
198	The `++mi` below made me worried, and the fact that MI could be erased.
llvm/lib/CodeGen/MachineSink.cpp
1395	Possibly. If we're going to make `COPY_TERM` (or whatever name we settle on) not specific to `INLINEASM_BR`, then we'll probably need to modify this to accept any case where we sink copies after a terminator. We'll come back to this after other comments settle.
llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
179	Does this include unconditional jump terminators?

arsenm added inline comments.Feb 25 2020, 1:16 PM

llvm/include/llvm/Target/Target.td
1133	Last time I looked at adding a copy variant, the right solution was just isAsCheapAsAMove = 1. Otherwise it would have required adding the COPY_BR to every target's COPY handling
llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
179	I'm not sure how you could construct a sensible unconditional jump that would require a copy after it. Seems like something to check in the verifier

Also, if INLINEASM_BR is a terminator, but we want TCOPY to be a terminator, should this patch make INLINEASM_BR NOT a terminator? Or would that be a follow up patch after creating TCOPY?

Also, none of the tests have TCOPY (or w/e). I feel like any patch adding a new instructions at w/e level of IR should have some test that the instruction exists.

Also, I don't quite follow what the changes to the tests are testing for with this change.

llvm/lib/CodeGen/ExpandPostRAPseudos.cpp
198	Ah sorry, I missed that. I'm not sure what's meant by `erased`, but I'd be curious if a `continue` could not be used instead of manual iterator advancement.

In D75098#1892171, @nickdesaulniers wrote:

Also, if INLINEASM_BR is a terminator, but we want TCOPY to be a terminator, should this patch make INLINEASM_BR NOT a terminator? Or would that be a follow up patch after creating TCOPY?

INLINEASM_BR being a terminator makes a lot of things "just work" for the code generator. I'm not 100% convinced that it *should* be a terminator, but if we decided to make it a normal instruction, then I think we'd have to touch way too much code to make it worthwhile.

Also, none of the tests have TCOPY (or w/e). I feel like any patch adding a new instructions at w/e level of IR should have some test that the instruction exists.

The tests are the current "asm goto with outputs" tests.

Also, I don't quite follow what the changes to the tests are testing for with this change.

The test changes are testing that we can compile at -O0.

Don't mark TCOPY as a branch. Also s/COPY_BR/TCOPY/.

Harbormaster completed remote builds in B47264: Diff 246611.Feb 25 2020, 5:48 PM

void retitled this revision from Add COPY_BR, a terminator form of the COPY instr to Add TCOPY, a terminator form of the COPY instr.Feb 25 2020, 5:52 PM

void edited the summary of this revision. (Show Details)

void marked 5 inline comments as done.Feb 25 2020, 5:56 PM

void added inline comments.

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
179	My comment wasn't very good. When converting something to a TCOPY, we need to figure out which terminators are candidates for having a TCOPY after them. Or perhaps better, what condition requires a TCOPY instead of a regular COPY.

Liveness is handled by the PostRA machine sink pass, so there's no need to
invalidate it. Also simplify the logic in the machine verifier that allows
stack dumps after a terminator.

nickdesaulniers added inline comments.Feb 27 2020, 1:35 PM

llvm/lib/CodeGen/MachineSink.cpp
841	pass as `const&`, or make `performSink` a private method.

Fix accidental change inclusion in the verifier.

Harbormaster failed remote builds in B47501: Diff 247090!Feb 27 2020, 2:28 PM

void marked an inline comment as done.Feb 27 2020, 2:35 PM

void added inline comments.

llvm/lib/CodeGen/MachineSink.cpp
841	Two different passes use this function so it can't be made a private method. I"m not sure why passing it as "const&" is better than a "const*"...

Harbormaster failed remote builds in B47509: Diff 247103!Feb 27 2020, 2:51 PM

Reformat and satisfy clang-tidy.

nickdesaulniers added inline comments.Feb 27 2020, 3:08 PM

llvm/lib/CodeGen/MachineSink.cpp
841	Generally, passing by const reference indicates that a parameter is strictly an input, as opposed to both input AND output, which is why you don't see a mix of pointers and references in this function signature.

Harbormaster completed remote builds in B47516: Diff 247115.Feb 27 2020, 3:34 PM

void marked an inline comment as done.Feb 27 2020, 4:02 PM

void added inline comments.

llvm/lib/CodeGen/MachineSink.cpp
841	A 'const*' doesn't allow modifications either. Also note that none of the references in this function signature are 'const', and are probably passed by reference because they aren't pointers (`SuccToSinkto` notwithstanding) in the originating function. Converting this to a reference is not useful.

arsenm added inline comments.Feb 28 2020, 7:52 AM

llvm/include/llvm/CodeGen/MachineInstr.h
1132	I think this is potentially hazardous. I'm worried about the peephole optimizer doing things like folding a copy into a tcopy and losing the terminator bit. Can you add some testcases with coalescable tcopy pairs for PeepholeOptimizer and the register coalescer?

arsenm added a reviewer: qcolombet.Feb 28 2020, 7:58 AM

Add a verification step to ensure a COPY doesn't follow a TCOPY.
Comment that the predicate for determining whether to use a TCOPY or COPY should be improved, but at this time we don't have enough information to determine the best criteria.

Harbormaster failed remote builds in B47826: Diff 247719!Mar 2 2020, 2:09 PM

void added inline comments.Mar 2 2020, 3:26 PM

llvm/include/llvm/CodeGen/MachineInstr.h
1132	I'll craft some tests, though note that TCOPY should only be after a terminator, so a COPY should never be merged with it. (I'll add a verifier check to ensure that TCOPY doesn't happen before a terminator.)

Add TCOPY to some switch statements that handle COPY

Harbormaster completed remote builds in B48743: Diff 249489.Mar 10 2020, 2:13 PM

void marked an inline comment as done.Mar 10 2020, 4:59 PM

void added inline comments.

llvm/include/llvm/CodeGen/MachineInstr.h
1132	@arsenm I'm struggling a bit to come up with tests that will exercise the peephole optimizer and register coalescer. Do you have any advice on how to do this?

Rebase and update testcase.

Friendly ping. :-)

Harbormaster completed remote builds in B49260: Diff 250443.Mar 15 2020, 4:05 PM

Another friendly ping. :-)

nickdesaulniers removed a reviewer: nickdesaulniers.Mar 18 2020, 4:07 PM

nickdesaulniers added a subscriber: nickdesaulniers.

arsenm added inline comments.Mar 30 2020, 4:26 PM

llvm/include/llvm/CodeGen/MachineInstr.h
1132	You shouldn't need a special verifier check, it should be caught by the normal terminator before non-terminator check. I mean something like %1 = COPY %0 %2 = TCOPY %1 and then maybe mix in some subregistsers? The worry is the COPY will somehow replace the TCOPY

efriedma mentioned this in D78586: [MachineVerifier] Add more checks for registers in live-in lists..Apr 21 2020, 1:22 PM

efriedma mentioned this in D77849: [calcspillweights] mark LiveIntervals from INLINEASM_BR defs as not spillable.Apr 27 2020, 8:12 PM

nickdesaulniers mentioned this in D79055: [LiveVariables] Mark PhysReg implicit-def MachineOperands of INLINEASM_BR as LiveOut.Apr 28 2020, 4:56 PM

void marked an inline comment as done.May 4 2020, 3:12 AM

void added inline comments.

llvm/include/llvm/CodeGen/MachineInstr.h
1132	This isn't an issue as far as I can see (at least the kind of example you're referring to). A TCOPY can become a COPY if appropriate. The only time it can't become a COPY is if it occurs after a terminator, in which case it should be caught by current verifier checks. I don't think a COPY should become a TCOPY though.

Add support to MIR parsing.
Rebase current change.

Harbormaster failed remote builds in B55615: Diff 261771!May 4 2020, 4:14 AM

arsenm added inline comments.May 4 2020, 7:43 AM

llvm/lib/CodeGen/MIRParser/MIRParser.cpp
312–342 ↗	(On Diff #261771)	This is an unrelated change
llvm/lib/CodeGen/MachineSink.cpp
858–861	I don't see this captured in a test?
llvm/lib/CodeGen/MachineVerifier.cpp
615–616	This also looks like a separate change

I don't think introducing the TCOPY opcode is a valid solution.
Let's assume the copy doesn't get coalesced, that means it needs to be executed after the inlineasm branch. And this is not going to happen if it sits in the same basic block right after the jump.

What happens if you put the copy at the beginning of each successor?
You mentioned that this is not possible because the live-ins wouldn't be set, but I would expect that this information would be correctly computed when building the liveness information. I.e., I don't expect this to be a problem.

we allow a store of a physical register after the INLINEASM_BR when optimizations are disabled.

That sounds wrong.

Could we step back a little bit, what is the semantic of INLINEASM_BR?

This revision now requires changes to proceed.May 4 2020, 9:56 AM

In D75098#2017980, @qcolombet wrote:

I don't think introducing the TCOPY opcode is a valid solution.
Let's assume the copy doesn't get coalesced, that means it needs to be executed after the inlineasm branch. And this is not going to happen if it sits in the same basic block right after the jump.

What happens if you put the copy at the beginning of each successor?
You mentioned that this is not possible because the live-ins wouldn't be set, but I would expect that this information would be correctly computed when building the liveness information. I.e., I don't expect this to be a problem.

we allow a store of a physical register after the INLINEASM_BR when optimizations are disabled.

That sounds wrong.

Could we step back a little bit, what is the semantic of INLINEASM_BR?

[I'm sorry if some of this is basic. I just want to give a clear description of what's going on and why.]

INLINEASM_BR is an attempt to replicate the behavior of "asm goto". As such, it has one default destination (more-or-less the fallthrough block), and one or more indirect destinations. If the indirect destinations aren't taken, the resulting control flow is to fall out of the bottom of the ASM block. The callbr instruction is a terminator so that it can model this behavior as best as it can (which isn't all that great with LLVM's IR, but doable (see "invoke")). Once it's converted to INLINEASM_BR the behavior obviously doesn't change, but now that we have outputs, we need to figure out how to handle them given the constraints of MIR (i.e. you can't have non-terminals after a terminal, and live-ins are only expected to be correct after register allocation).

Because of how callbr is modeled, INLINEASM_BR is also a terminator. So during ISEL the values that are defined by INLINEASM_BR need to be spilled before jumping to the default block. Because of the constraint against having non-terminators after a terminator, we artificially modify the code after ISEL's finished building the block, placing any spills into a "copy" block (that's between the INLINEASM_BR and its default destination), and adding any registers defined by INLINEASM_BR as "live-ins" to the copy block. This is bad, because it violates the constraints, but was necessary at the time because we had no way to avoid it.

[Note that the outputs from an INLINEASM_BR are valid only on the default branch. It's very difficult to have them on the indirect branches without making the resulting code bad---both in style and in performance.]

Okay, so that's the behavior of INLINEASM_BR and some of the issues we came across that led us to here. I think TCOPY is a good solution because the default behavior of INLINEASM_BR is to fall out the bottom of the ASM block, so it would "naturally" execute the TCOPY instructions. And since we only ensure that the values are valid on the default branch, the COPYs will always be executed when needed (note a block containing an INLINEASM_BR should either exit to the default via a fallthrough or unconditional jump). It's still possible to split the block directly after the INLINEASM_BR and convert the TCOPYs into regular COPYs, in fact I would encourage us to do that at the appropriate point. It could remove a lot of the code in ScheduleDAGSDNodes.cpp.

What happens if you put the copy at the beginning of each successor?

I found that this didn't work. CodeGen wants registers to be spilled at the end of a block. The spill slot is recorded and used later on when accessing the value. If we just add a COPY into the default block it won't have the necessary information available to load the correct value. We would have to add the information as a "live-in" on the default block, but that's a constraint violation, wash-rinse-repeat.

Thanks Bill for the detailed explanations. A couple more questions.

CodeGen wants registers to be spilled at the end of a block.

That's fast reg alloc only. What is happening with the other allocators?

The spill slot is recorded and used later on when accessing the value. If we just add a COPY into the default block it won't have the necessary information available to load the correct value.

Why is that? (Why doesn't it have the necessary information to load the correct value.)

I feel that we are trying to work around a bug/limitation in the fast register allocator where physreg are assumed not to cross basic block boundaries. I wonder if it wouldn't be easier to fix fast reg alloc if that is the problem.

Out-of-curiosity, how do we deal with invoke? I would expect we have the exact same problems.

In D75098#2021573, @qcolombet wrote:

Thanks Bill for the detailed explanations. A couple more questions.

CodeGen wants registers to be spilled at the end of a block.

That's fast reg alloc only. What is happening with the other allocators?

They don't appear to use the correct registers for the defined values.

The spill slot is recorded and used later on when accessing the value. If we just add a COPY into the default block it won't have the necessary information available to load the correct value.

Why is that? (Why doesn't it have the necessary information to load the correct value.)

I feel that we are trying to work around a bug/limitation in the fast register allocator where physreg are assumed not to cross basic block boundaries. I wonder if it wouldn't be easier to fix fast reg alloc if that is the problem.

It's not so much working around a bug/limitation in any of the register allocators. What I'm most concerned about is having to specify live-ins to MBBs before register allocation. That invariant was a huge sticking point in the review process for this feature. We're currently violating it because it "seems to work", but I don't want to rely on that.

Because of the live-ins restriction, all live physical registers at the end of an MBB need to be "spilled", or at least recorded in a way so that other blocks can access the information without having their live-ins set. (This is my understanding, it may be inaccurate). The SelectionDAGBuilder::CopyToExportRegsIfNeeded() function is one of the ways this is done.

Out-of-curiosity, how do we deal with invoke? I would expect we have the exact same problems.

Invoke is turned into a regular call with exception handling stuff around it. E.g.:

$ cat ex.cpp:
void g();
int f() {
  try {
    g();
  } catch (int&) {
    return 37;
  }
  return 1;
}

$ clang++ -mllvm -print-after-all ex.cpp -c -o /dev/null
...
bb.0 (%ir-block.0):
  successors: %bb.1, %bb.2

  EH_LABEL <mcsymbol .Ltmp0>
  ADJCALLSTACKDOWN64 0, 0, 0, implicit-def dead $rsp, implicit-def dead $eflags, implicit-def dead $ssp, implicit $rsp, implicit $ssp
  CALL64pcrel32 @_Z1gv, <regmask $bh $bl $bp $bph $bpl $bx $ebp $ebx $hbp $hbx $rbp $rbx $r12 $r13 $r14 $r15 $r12b $r13b $r14b $r15b $r12bh $r13bh $r14bh $r15bh $r12d $r13d $r14d $r15d $r12w $r13w $r14w $r15w $r12wh and 3 more...>, implicit $rsp, implicit $ssp, implicit-def $rsp, implicit-def $ssp
  ADJCALLSTACKUP64 0, 0, implicit-def dead $rsp, implicit-def dead $eflags, implicit-def dead $ssp, implicit $rsp, implicit $ssp
  EH_LABEL <mcsymbol .Ltmp1>
  JMP_1 %bb.1
...

Remove some of the hackery in the SDNode scheduler that splits below an
INLINEASM_BR instruction. It's not needed with TCOPY.

Herald added subscribers: kerbowa, atanasyan, jrtc27 and 5 others. · View Herald TranscriptMay 8 2020, 4:34 AM

Harbormaster failed remote builds in B56151: Diff 262856!May 8 2020, 5:19 AM

Allow a TCOPY to sink, since it's exactly like a COPY.

Harbormaster failed remote builds in B56199: Diff 262957!May 8 2020, 3:04 PM

void marked an inline comment as done.May 8 2020, 4:36 PM

void added inline comments.

llvm/lib/CodeGen/MachineSink.cpp
858–861	This will happen now that we correctly mark `TCOPY` as sinkable. I'll see if I can craft an MIR test to explicitly do this.

Spill TCOPY in FastRA right before the TCOPY instr instead of before the
terminators.

Harbormaster failed remote builds in B56221: Diff 263000!May 9 2020, 3:09 AM

Fix formatting and clang-tidy warnings.

Harbormaster failed remote builds in B56228: Diff 263010!May 9 2020, 4:45 AM

Revert bad formatting that sneaked (snuck?) in.

Harbormaster failed remote builds in B56261: Diff 263063!May 10 2020, 4:15 AM

In D75098#2026425, @void wrote:

In D75098#2021573, @qcolombet wrote:

Thanks Bill for the detailed explanations. A couple more questions.

CodeGen wants registers to be spilled at the end of a block.

That's fast reg alloc only. What is happening with the other allocators?

They don't appear to use the correct registers for the defined values.

I should probably explain a bit better. With a normal INLINEASM instruction, any defined registers are spilled directly afterwards. This is one reason why I want to do the same for INLINEASM_BR with the TCOPY. There could be other ways to achieve the same thing, but I think it would involve placing a larger burden on the code generation passes than needs be.

I think the better fix here is to make INLINEASM_BR _not_ be a terminator -- just like the CALL involved in an invoke is not.

That change turned out to be a bit complicated, and while I had started taking a stab at that a couple months ago, I got distracted by many other things going on...However, I've now dusted off that experiment and got it into a useful state, and will send out a review for this tomorrow, so we can consider both the options.

In D75098#2030777, @jyknight wrote:

I think the better fix here is to make INLINEASM_BR _not_ be a terminator -- just like the CALL involved in an invoke is not.

That change turned out to be a bit complicated, and while I had started taking a stab at that a couple months ago, I got distracted by many other things going on...However, I've now dusted off that experiment and got it into a useful state, and will send out a review for this tomorrow, so we can consider both the options.

I've thought about the same thing. I'm right now running up against an issue that Nick pointed out. Having a non-branch as a terminator works until the register allocator wants to spill. Then it could insert a copy where the TCOPY is, which messes up analyzeBranch. I'm interested to see your patch.

In D75098#2030777, @jyknight wrote:

I think the better fix here is to make INLINEASM_BR _not_ be a terminator -- just like the CALL involved in an invoke is not.

That change turned out to be a bit complicated, and while I had started taking a stab at that a couple months ago, I got distracted by many other things going on...However, I've now dusted off that experiment and got it into a useful state, and will send out a review for this tomorrow, so we can consider both the options.

In D75098#2032980, @void wrote:

In D75098#2030777, @jyknight wrote:

I think the better fix here is to make INLINEASM_BR _not_ be a terminator -- just like the CALL involved in an invoke is not.

That change turned out to be a bit complicated, and while I had started taking a stab at that a couple months ago, I got distracted by many other things going on...However, I've now dusted off that experiment and got it into a useful state, and will send out a review for this tomorrow, so we can consider both the options.

I've thought about the same thing. I'm right now running up against an issue that Nick pointed out. Having a non-branch as a terminator works until the register allocator wants to spill. Then it could insert a copy where the TCOPY is, which messes up analyzeBranch. I'm interested to see your patch.

Regardless of what happens to INLINEASM_BR, I think we still need a terminator copy

Regardless of what happens to INLINEASM_BR, I think we still need a terminator copy

What for?
The whole concept is bogus to me because if the previous instructions are really terminators (as in they branch to somewhere else), then this copy would never be executed.

In D75098#2049787, @qcolombet wrote:

Regardless of what happens to INLINEASM_BR, I think we still need a terminator copy

What for?
The whole concept is bogus to me because if the previous instructions are really terminators (as in they branch to somewhere else), then this copy would never be executed.

Terminator does not imply a branch, it only means glued to the end of the block. It's the only way to get correct spill code placement around exec mask writes. We need a copy to the exec mask, but spill code needs to be inserted before that point. Currently we get unnecessary copies since we use a raw move instructions for this purpose, when we could have coalesced a copy

In D75098#2049787, @qcolombet wrote:

Regardless of what happens to INLINEASM_BR, I think we still need a terminator copy

What for?
The whole concept is bogus to me because if the previous instructions are really terminators (as in they branch to somewhere else), then this copy would never be executed.

That's not necessarily so, because you can have multiple branching terminators then there must be some that branch conditionally (allowing other code to exist between the jumps), otherwise there wouldn't be a need for multiple terminators.

What's the status of this? I'm still interested in terminator copies

In D75098#2170382, @arsenm wrote:

What's the status of this? I'm still interested in terminator copies

When we last left our adventurers, there was an issue with live range splitting. In some cases it could emit a spill after terminators, which failed validation.

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

MachineBasicBlock.h

21 lines

MachineInstr.h

3 lines

Support/

TargetOpcodes.def

12 lines

Target/

Target.td

8 lines

lib/

CodeGen/

DetectDeadLanes.cpp

3 lines

ExpandPostRAPseudos.cpp

35 lines

GlobalISel/

CombinerHelper.cpp

2 lines

GISelKnownBits.cpp

6 lines

InstructionSelect.cpp

2 lines

MachineIRBuilder.cpp

1 line

Utils.cpp

3 lines

MachineBasicBlock.cpp

25 lines

MachineInstr.cpp

2 lines

MachineSink.cpp

22 lines

MachineVerifier.cpp

14 lines

PeepholeOptimizer.cpp

1 line

ReachingDefAnalysis.cpp

2 lines

RegAllocFast.cpp

6 lines

SelectionDAG/

InstrEmitter.cpp

26 lines

ScheduleDAGSDNodes.cpp

54 lines

SelectionDAGBuilder.cpp

1 line

Target/

AArch64/

AArch64CallLowering.cpp

2 lines

AArch64FastISel.cpp

5 lines

AArch64InstrInfo.cpp

12 lines

AArch64InstructionSelector.cpp

5 lines

AArch64RegisterBankInfo.cpp

11 lines

AMDGPU/

SIISelLowering.cpp

3 lines

SIInstrInfo.cpp

2 lines

Hexagon/

BitTracker.cpp

3 lines

HexagonBitSimplify.cpp

8 lines

HexagonFrameLowering.cpp

1 line

HexagonGenPredicate.cpp

8 lines

HexagonHardwareLoops.cpp

1 line

HexagonISelDAGToDAGHVX.cpp

7 lines

HexagonInstrInfo.cpp

4 lines

HexagonMachineScheduler.cpp

2 lines

HexagonNewValueJump.cpp

5 lines

HexagonSplitDouble.cpp

5 lines

RDFCopy.cpp

3 lines

Mips/

MipsRegisterBankInfo.cpp

8 lines

MipsSEFrameLowering.cpp

1 line

NVPTX/

NVPTXReplaceImageHandles.cpp

3 lines

X86/

X86DomainReassignment.cpp

2 lines

X86FlagsCopyLowering.cpp

12 lines

X86FloatingPoint.cpp

3 lines

X86InstrInfo.cpp

1 line

test/

CodeGen/

AArch64/

callbr-asm-label.ll

19 lines

SystemZ/

asm-20.ll

1 line

X86/

callbr-asm-label-addr.ll

11 lines

callbr-asm-outputs-tcopy-spilling.ll

74 lines

callbr-asm-outputs.ll

2 lines

Diff 263063

llvm/include/llvm/CodeGen/MachineBasicBlock.h

Show First 20 Lines • Show All 164 Lines • ▼ Show 20 Lines	private:
MBBSectionID SectionID{0};		MBBSectionID SectionID{0};

// Indicate that this basic block begins a section.		// Indicate that this basic block begins a section.
bool IsBeginSection = false;		bool IsBeginSection = false;

// Indicate that this basic block ends a section.		// Indicate that this basic block ends a section.
bool IsEndSection = false;		bool IsEndSection = false;

/// Default target of the callbr of a basic block.
bool InlineAsmBrDefaultTarget = false;

/// List of indirect targets of the callbr of a basic block.		/// List of indirect targets of the callbr of a basic block.
SmallPtrSet<const MachineBasicBlock*, 4> InlineAsmBrIndirectTargets;		SmallPtrSet<const MachineBasicBlock*, 4> InlineAsmBrIndirectTargets;

/// since getSymbol is a relatively heavy-weight operation, the symbol		/// since getSymbol is a relatively heavy-weight operation, the symbol
/// is only computed once and is cached.		/// is only computed once and is cached.
mutable MCSymbol *CachedMCSymbol = nullptr;		mutable MCSymbol *CachedMCSymbol = nullptr;

/// Used during basic block sections to mark the end of a basic block.		/// Used during basic block sections to mark the end of a basic block.
▲ Show 20 Lines • Show All 295 Lines • ▼ Show 20 Lines	bool isInlineAsmBrIndirectTarget(const MachineBasicBlock *Tgt) const {
return InlineAsmBrIndirectTargets.count(Tgt);		return InlineAsmBrIndirectTargets.count(Tgt);
}		}

/// Indicates if this is the indirect dest of an INLINEASM_BR.		/// Indicates if this is the indirect dest of an INLINEASM_BR.
void addInlineAsmBrIndirectTarget(const MachineBasicBlock *Tgt) {		void addInlineAsmBrIndirectTarget(const MachineBasicBlock *Tgt) {
InlineAsmBrIndirectTargets.insert(Tgt);		InlineAsmBrIndirectTargets.insert(Tgt);
}		}

/// Transfers indirect targets to INLINEASM_BR's copy block.		/// Returns the default destination of an INLINEASM_BR instruction.
void transferInlineAsmBrIndirectTargets(MachineBasicBlock *CopyBB) {		MachineBasicBlock *getInlineAsmBrDefaultTarget();
for (auto *Target : InlineAsmBrIndirectTargets)
CopyBB->addInlineAsmBrIndirectTarget(Target);
return InlineAsmBrIndirectTargets.clear();
}

/// Returns true if this is the default dest of an INLINEASM_BR.
bool isInlineAsmBrDefaultTarget() const {
return InlineAsmBrDefaultTarget;
}

/// Indicates if this is the default deft of an INLINEASM_BR.
void setInlineAsmBrDefaultTarget() {
InlineAsmBrDefaultTarget = true;
}

/// Returns true if it is legal to hoist instructions into this block.		/// Returns true if it is legal to hoist instructions into this block.
bool isLegalToHoistInto() const;		bool isLegalToHoistInto() const;

// Code Layout methods.		// Code Layout methods.

/// Move 'this' block before or after the specified block. This only moves		/// Move 'this' block before or after the specified block. This only moves
/// the block, it does not modify the CFG or adjust potential fall-throughs at		/// the block, it does not modify the CFG or adjust potential fall-throughs at
▲ Show 20 Lines • Show All 567 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/MachineInstr.h

Show First 20 Lines • Show All 1,122 Lines • ▼ Show 20 Lines	bool isRegSequence() const {
return getOpcode() == TargetOpcode::REG_SEQUENCE;		return getOpcode() == TargetOpcode::REG_SEQUENCE;
}		}

bool isBundle() const {		bool isBundle() const {
return getOpcode() == TargetOpcode::BUNDLE;		return getOpcode() == TargetOpcode::BUNDLE;
}		}

bool isCopy() const {		bool isCopy() const {
return getOpcode() == TargetOpcode::COPY;		return getOpcode() == TargetOpcode::COPY \|\|
		getOpcode() == TargetOpcode::TCOPY;
		nickdesaulniersUnsubmitted Done Reply Inline Actions Does this save a few `getOpcode()` calls? The rest of this CL explicitly compares the opcode against `COPY_BR`? I just worry if this might mess up existing callsites of `isCopy` when `getOpcode() == TargetOpcode::COPY_BR;`. nickdesaulniers: Does this save a few `getOpcode()` calls? The rest of this CL explicitly compares the opcode…
		voidAuthorUnsubmitted Done Reply Inline Actions It's more than just a few. `isCopy()` is used pretty extensively. I want the `COPY_BR` (or `COPY_TERM`) to be exactly like a normal `COPY`, with the only exception that it can come after a terminator. void: It's more than just a few. `isCopy()` is used pretty extensively. I want the `COPY_BR` (or…
		arsenmUnsubmitted Not Done Reply Inline Actions I think this is potentially hazardous. I'm worried about the peephole optimizer doing things like folding a copy into a tcopy and losing the terminator bit. Can you add some testcases with coalescable tcopy pairs for PeepholeOptimizer and the register coalescer? arsenm: I think this is potentially hazardous. I'm worried about the peephole optimizer doing things…
		voidAuthorUnsubmitted Done Reply Inline Actions I'll craft some tests, though note that TCOPY should only be after a terminator, so a COPY should never be merged with it. (I'll add a verifier check to ensure that TCOPY doesn't happen before a terminator.) void: I'll craft some tests, though note that TCOPY should only be after a terminator, so a COPY…
		voidAuthorUnsubmitted Done Reply Inline Actions @arsenm I'm struggling a bit to come up with tests that will exercise the peephole optimizer and register coalescer. Do you have any advice on how to do this? void: @arsenm I'm struggling a bit to come up with tests that will exercise the peephole optimizer…
		arsenmUnsubmitted Not Done Reply Inline Actions You shouldn't need a special verifier check, it should be caught by the normal terminator before non-terminator check. I mean something like %1 = COPY %0 %2 = TCOPY %1 and then maybe mix in some subregistsers? The worry is the COPY will somehow replace the TCOPY arsenm: You shouldn't need a special verifier check, it should be caught by the normal terminator…
		voidAuthorUnsubmitted Done Reply Inline Actions This isn't an issue as far as I can see (at least the kind of example you're referring to). A TCOPY can become a COPY if appropriate. The only time it can't become a COPY is if it occurs after a terminator, in which case it should be caught by current verifier checks. I don't think a COPY should become a TCOPY though. void: This isn't an issue as far as I can see (at least the kind of example you're referring to). A…
}		}

bool isFullCopy() const {		bool isFullCopy() const {
return isCopy() && !getOperand(0).getSubReg() && !getOperand(1).getSubReg();		return isCopy() && !getOperand(0).getSubReg() && !getOperand(1).getSubReg();
}		}

bool isExtractSubreg() const {		bool isExtractSubreg() const {
return getOpcode() == TargetOpcode::EXTRACT_SUBREG;		return getOpcode() == TargetOpcode::EXTRACT_SUBREG;
▲ Show 20 Lines • Show All 615 Lines • Show Last 20 Lines

llvm/include/llvm/Support/TargetOpcodes.def

	Show First 20 Lines • Show All 66 Lines • ▼ Show 20 Lines
	HANDLE_TARGET_OPCODE(SUBREG_TO_REG)			HANDLE_TARGET_OPCODE(SUBREG_TO_REG)

	/// COPY_TO_REGCLASS - This instruction is a placeholder for a plain			/// COPY_TO_REGCLASS - This instruction is a placeholder for a plain
	/// register-to-register copy into a specific register class. This is only			/// register-to-register copy into a specific register class. This is only
	/// used between instruction selection and MachineInstr creation, before			/// used between instruction selection and MachineInstr creation, before
	/// virtual registers have been created for all the instructions, and it's			/// virtual registers have been created for all the instructions, and it's
	/// only needed in cases where the register classes implied by the			/// only needed in cases where the register classes implied by the
	/// instructions are insufficient. It is emitted as a COPY MachineInstr.			/// instructions are insufficient. It is emitted as a COPY MachineInstr.
	HANDLE_TARGET_OPCODE(COPY_TO_REGCLASS)			HANDLE_TARGET_OPCODE(COPY_TO_REGCLASS)

	/// DBG_VALUE - a mapping of the llvm.dbg.value intrinsic			/// DBG_VALUE - a mapping of the llvm.dbg.value intrinsic
	HANDLE_TARGET_OPCODE(DBG_VALUE)			HANDLE_TARGET_OPCODE(DBG_VALUE)

	/// DBG_LABEL - a mapping of the llvm.dbg.label intrinsic			/// DBG_LABEL - a mapping of the llvm.dbg.label intrinsic
	HANDLE_TARGET_OPCODE(DBG_LABEL)			HANDLE_TARGET_OPCODE(DBG_LABEL)

	/// REG_SEQUENCE - This variadic instruction is used to form a register that			/// REG_SEQUENCE - This variadic instruction is used to form a register that
	/// represents a consecutive sequence of sub-registers. It's used as a			/// represents a consecutive sequence of sub-registers. It's used as a
	/// register coalescing / allocation aid and must be eliminated before code			/// register coalescing / allocation aid and must be eliminated before code
	/// emission.			/// emission.
	// In SDNode form, the first operand encodes the register class created by			// In SDNode form, the first operand encodes the register class created by
	// the REG_SEQUENCE, while each subsequent pair names a vreg + subreg index			// the REG_SEQUENCE, while each subsequent pair names a vreg + subreg index
	// pair. Once it has been lowered to a MachineInstr, the regclass operand			// pair. Once it has been lowered to a MachineInstr, the regclass operand
	// is no longer present.			// is no longer present.
	/// e.g. v1027 = REG_SEQUENCE v1024, 3, v1025, 4, v1026, 5			/// e.g. v1027 = REG_SEQUENCE v1024, 3, v1025, 4, v1026, 5
	/// After register coalescing references of v1024 should be replace with			/// After register coalescing references of v1024 should be replace with
	/// v1027:3, v1025 with v1027:4, etc.			/// v1027:3, v1025 with v1027:4, etc.
	HANDLE_TARGET_OPCODE(REG_SEQUENCE)			HANDLE_TARGET_OPCODE(REG_SEQUENCE)

	/// COPY - Target-independent register copy. This instruction can also be			/// COPY - Target-independent register copy. This instruction can also be
	/// used to copy between subregisters of virtual registers.			/// used to copy between subregisters of virtual registers.
	HANDLE_TARGET_OPCODE(COPY)			HANDLE_TARGET_OPCODE(COPY)

				/// TCOPY - This instruction is the terminator version of COPY. The purpose
				/// is to allow copies from terminators to be properly represented (e.g. an
				arsenmUnsubmitted Not Done Reply Inline Actions s/terminal/terminator arsenm: s/terminal/terminator
				arsenmUnsubmitted Not Done Reply Inline Actions COPY_TERM or COPY_TERMINATOR would be better since it isn't really a branch arsenm: COPY_TERM or COPY_TERMINATOR would be better since it isn't really a branch
				voidAuthorUnsubmitted Done Reply Inline Actions How about `TCOPY`? void: How about `TCOPY`?
				/// INLINEASM_BR that defines a physical register) without having
				/// to introduce "live-ins" for physical registers before register allocation.
				HANDLE_TARGET_OPCODE(TCOPY)

	/// BUNDLE - This instruction represents an instruction bundle. Instructions			/// BUNDLE - This instruction represents an instruction bundle. Instructions
	/// which immediately follow a BUNDLE instruction which are marked with			/// which immediately follow a BUNDLE instruction which are marked with
	/// 'InsideBundle' flag are inside the bundle.			/// 'InsideBundle' flag are inside the bundle.
	HANDLE_TARGET_OPCODE(BUNDLE)			HANDLE_TARGET_OPCODE(BUNDLE)

	/// Lifetime markers.			/// Lifetime markers.
	HANDLE_TARGET_OPCODE(LIFETIME_START)			HANDLE_TARGET_OPCODE(LIFETIME_START)
	HANDLE_TARGET_OPCODE(LIFETIME_END)			HANDLE_TARGET_OPCODE(LIFETIME_END)
	▲ Show 20 Lines • Show All 544 Lines • Show Last 20 Lines

llvm/include/llvm/Target/Target.td

	Show First 20 Lines • Show All 1,115 Lines • ▼ Show 20 Lines
	def COPY : StandardPseudoInstruction {			def COPY : StandardPseudoInstruction {
	let OutOperandList = (outs unknown:$dst);			let OutOperandList = (outs unknown:$dst);
	let InOperandList = (ins unknown:$src);			let InOperandList = (ins unknown:$src);
	let AsmString = "";			let AsmString = "";
	let hasSideEffects = 0;			let hasSideEffects = 0;
	let isAsCheapAsAMove = 1;			let isAsCheapAsAMove = 1;
	let hasNoSchedulingInfo = 0;			let hasNoSchedulingInfo = 0;
	}			}
				def TCOPY : StandardPseudoInstruction {
				let OutOperandList = (outs unknown:$dst);
				let InOperandList = (ins unknown:$src);
				let AsmString = "";
				let hasSideEffects = 0;
				let isAsCheapAsAMove = 1;
				let isTerminator = 1;
				}
	def BUNDLE : StandardPseudoInstruction {			def BUNDLE : StandardPseudoInstruction {
				arsenmUnsubmitted Done Reply Inline Actions This shouldn't need isBranch or isIndirectBranch arsenm: This shouldn't need isBranch or isIndirectBranch
	let OutOperandList = (outs);			let OutOperandList = (outs);
				nickdesaulniersUnsubmitted Done Reply Inline Actions `COPY` also sets `hasNoSchedulingInfo` to `0`. Should `COPY_BR` do that as well? nickdesaulniers: `COPY` also sets `hasNoSchedulingInfo` to `0`. Should `COPY_BR` do that as well?
				voidAuthorUnsubmitted Done Reply Inline Actions Possibly, though I had trouble finding where to place its scheduling info. It seems like `COPY` is special and TableGen adds it by default. I couldn't find the place to do that for `COPY_BR`, but I'll take another look. void: Possibly, though I had trouble finding where to place its scheduling info. It seems like `COPY`…
				arsenmUnsubmitted Done Reply Inline Actions Last time I looked at adding a copy variant, the right solution was just isAsCheapAsAMove = 1. Otherwise it would have required adding the COPY_BR to every target's COPY handling arsenm: Last time I looked at adding a copy variant, the right solution was just isAsCheapAsAMove = 1.
	let InOperandList = (ins variable_ops);			let InOperandList = (ins variable_ops);
	let AsmString = "BUNDLE";			let AsmString = "BUNDLE";
	let hasSideEffects = 0;			let hasSideEffects = 0;
	}			}
	def LIFETIME_START : StandardPseudoInstruction {			def LIFETIME_START : StandardPseudoInstruction {
	let OutOperandList = (outs);			let OutOperandList = (outs);
	let InOperandList = (ins i32imm:$id);			let InOperandList = (ins i32imm:$id);
	let AsmString = "LIFETIME_START";			let AsmString = "LIFETIME_START";
	▲ Show 20 Lines • Show All 522 Lines • Show Last 20 Lines

llvm/lib/CodeGen/DetectDeadLanes.cpp

Show First 20 Lines • Show All 134 Lines • ▼ Show 20 Lines
/// Returns true if \p MI will get lowered to a series of COPY instructions.		/// Returns true if \p MI will get lowered to a series of COPY instructions.
/// We call this a COPY-like instruction.		/// We call this a COPY-like instruction.
static bool lowersToCopies(const MachineInstr &MI) {		static bool lowersToCopies(const MachineInstr &MI) {
// Note: We could support instructions with MCInstrDesc::isRegSequenceLike(),		// Note: We could support instructions with MCInstrDesc::isRegSequenceLike(),
// isExtractSubRegLike(), isInsertSubregLike() in the future even though they		// isExtractSubRegLike(), isInsertSubregLike() in the future even though they
// are not lowered to a COPY.		// are not lowered to a COPY.
switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::PHI:		case TargetOpcode::PHI:
case TargetOpcode::INSERT_SUBREG:		case TargetOpcode::INSERT_SUBREG:
case TargetOpcode::REG_SEQUENCE:		case TargetOpcode::REG_SEQUENCE:
case TargetOpcode::EXTRACT_SUBREG:		case TargetOpcode::EXTRACT_SUBREG:
return true;		return true;
}		}
return false;		return false;
}		}
▲ Show 20 Lines • Show All 79 Lines • ▼ Show 20 Lines	LaneBitmask DetectDeadLanes::transferUsedLanes(const MachineInstr &MI,
LaneBitmask UsedLanes,		LaneBitmask UsedLanes,
const MachineOperand &MO) const {		const MachineOperand &MO) const {
unsigned OpNum = MI.getOperandNo(&MO);		unsigned OpNum = MI.getOperandNo(&MO);
assert(lowersToCopies(MI) &&		assert(lowersToCopies(MI) &&
DefinedByCopy[Register::virtReg2Index(MI.getOperand(0).getReg())]);		DefinedByCopy[Register::virtReg2Index(MI.getOperand(0).getReg())]);

switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::PHI:		case TargetOpcode::PHI:
return UsedLanes;		return UsedLanes;
case TargetOpcode::REG_SEQUENCE: {		case TargetOpcode::REG_SEQUENCE: {
assert(OpNum % 2 == 1);		assert(OpNum % 2 == 1);
unsigned SubIdx = MI.getOperand(OpNum + 1).getImm();		unsigned SubIdx = MI.getOperand(OpNum + 1).getImm();
return TRI->reverseComposeSubRegIndexLaneMask(SubIdx, UsedLanes);		return TRI->reverseComposeSubRegIndexLaneMask(SubIdx, UsedLanes);
}		}
case TargetOpcode::INSERT_SUBREG: {		case TargetOpcode::INSERT_SUBREG: {
▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	LaneBitmask DetectDeadLanes::transferDefinedLanes(const MachineOperand &Def,
}		}
case TargetOpcode::EXTRACT_SUBREG: {		case TargetOpcode::EXTRACT_SUBREG: {
unsigned SubIdx = MI.getOperand(2).getImm();		unsigned SubIdx = MI.getOperand(2).getImm();
assert(OpNum == 1 && "EXTRACT_SUBREG must have one register operand only");		assert(OpNum == 1 && "EXTRACT_SUBREG must have one register operand only");
DefinedLanes = TRI->reverseComposeSubRegIndexLaneMask(SubIdx, DefinedLanes);		DefinedLanes = TRI->reverseComposeSubRegIndexLaneMask(SubIdx, DefinedLanes);
break;		break;
}		}
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::PHI:		case TargetOpcode::PHI:
break;		break;
default:		default:
llvm_unreachable("function must be called with COPY-like instruction");		llvm_unreachable("function must be called with COPY-like instruction");
}		}

assert(Def.getSubReg() == 0 &&		assert(Def.getSubReg() == 0 &&
"Should not have subregister defs in machine SSA phase");		"Should not have subregister defs in machine SSA phase");
▲ Show 20 Lines • Show All 251 Lines • Show Last 20 Lines

llvm/lib/CodeGen/ExpandPostRAPseudos.cpp

Show All 27 Lines
#define DEBUG_TYPE "postrapseudos"		#define DEBUG_TYPE "postrapseudos"

namespace {		namespace {
struct ExpandPostRA : public MachineFunctionPass {		struct ExpandPostRA : public MachineFunctionPass {
private:		private:
const TargetRegisterInfo *TRI;		const TargetRegisterInfo *TRI;
const TargetInstrInfo *TII;		const TargetInstrInfo *TII;

		MachineBasicBlock *TCopyDestBlock;

public:		public:
static char ID; // Pass identification, replacement for typeid		static char ID; // Pass identification, replacement for typeid
ExpandPostRA() : MachineFunctionPass(ID) {}		ExpandPostRA() : MachineFunctionPass(ID) {}

void getAnalysisUsage(AnalysisUsage &AU) const override {		void getAnalysisUsage(AnalysisUsage &AU) const override {
AU.setPreservesCFG();		AU.setPreservesCFG();
AU.addPreservedID(MachineLoopInfoID);		AU.addPreservedID(MachineLoopInfoID);
AU.addPreservedID(MachineDominatorsID);		AU.addPreservedID(MachineDominatorsID);
▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	bool ExpandPostRA::LowerSubregToReg(MachineInstr *MI) {
}		}

LLVM_DEBUG(dbgs() << '\n');		LLVM_DEBUG(dbgs() << '\n');
MBB->erase(MI);		MBB->erase(MI);
return true;		return true;
}		}

bool ExpandPostRA::LowerCopy(MachineInstr *MI) {		bool ExpandPostRA::LowerCopy(MachineInstr *MI) {

if (MI->allDefsAreDead()) {		if (MI->allDefsAreDead()) {
LLVM_DEBUG(dbgs() << "dead copy: " << *MI);		LLVM_DEBUG(dbgs() << "dead copy: " << *MI);
MI->setDesc(TII->get(TargetOpcode::KILL));		MI->setDesc(TII->get(TargetOpcode::KILL));
LLVM_DEBUG(dbgs() << "replaced by: " << *MI);		LLVM_DEBUG(dbgs() << "replaced by: " << *MI);
return true;		return true;
}		}

MachineOperand &DstMO = MI->getOperand(0);		MachineOperand &DstMO = MI->getOperand(0);
Show All 13 Lines	if (SrcMO.isUndef() \|\| MI->getNumOperands() > 2) {
return true;		return true;
}		}
// Vanilla identity copy.		// Vanilla identity copy.
MI->eraseFromParent();		MI->eraseFromParent();
return true;		return true;
}		}

LLVM_DEBUG(dbgs() << "real copy: " << *MI);		LLVM_DEBUG(dbgs() << "real copy: " << *MI);
TII->copyPhysReg(*MI->getParent(), MI, MI->getDebugLoc(),		MachineBasicBlock *CopyBlock = MI->getParent();
DstMO.getReg(), SrcMO.getReg(), SrcMO.isKill());		MachineBasicBlock::iterator MII(MI);
		if (MI->getOpcode() == TargetOpcode::TCOPY) {
		CopyBlock = TCopyDestBlock;
		MII = TCopyDestBlock->getFirstTerminator();
		}
		TII->copyPhysReg(*CopyBlock, MII, MI->getDebugLoc(), DstMO.getReg(),
		SrcMO.getReg(), SrcMO.isKill());

if (MI->getNumOperands() > 2)		if (MI->getNumOperands() > 2)
TransferImplicitOperands(MI);		TransferImplicitOperands(MI);
LLVM_DEBUG({		LLVM_DEBUG({
MachineBasicBlock::iterator dMI = MI;		MachineBasicBlock::iterator dMI = MI;
dbgs() << "replaced by: " << *(--dMI);		dbgs() << "replaced by: " << *(--dMI);
});		});
MI->eraseFromParent();		MI->eraseFromParent();
return true;		return true;
}		}

/// runOnMachineFunction - Reduce subregister inserts and extracts to register		/// runOnMachineFunction - Reduce subregister inserts and extracts to register
/// copies.		/// copies.
///		///
bool ExpandPostRA::runOnMachineFunction(MachineFunction &MF) {		bool ExpandPostRA::runOnMachineFunction(MachineFunction &MF) {
LLVM_DEBUG(dbgs() << "Machine Function\n"		LLVM_DEBUG(dbgs() << "Machine Function\n"
<< "******** EXPANDING POST-RA PSEUDO INSTRS ********\n"		<< "******** EXPANDING POST-RA PSEUDO INSTRS ********\n"
<< "********** Function: " << MF.getName() << '\n');		<< "********** Function: " << MF.getName() << '\n');
TRI = MF.getSubtarget().getRegisterInfo();		TRI = MF.getSubtarget().getRegisterInfo();
TII = MF.getSubtarget().getInstrInfo();		TII = MF.getSubtarget().getInstrInfo();

bool MadeChange = false;		bool MadeChange = false;

for (MachineFunction::iterator mbbi = MF.begin(), mbbe = MF.end();		for (auto &MBB : MF) {
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions what? if we could use a range based for loop for the `MBB`'s, surely we can for `MI`. nickdesaulniers: what? if we could use a range based for loop for the `MBB`'s, surely we can for `MI`.
		voidAuthorUnsubmitted Done Reply Inline Actions The `++mi` below made me worried, and the fact that MI could be erased. void: The `++mi` below made me worried, and the fact that MI could be erased.
		nickdesaulniersUnsubmitted Done Reply Inline Actions Ah sorry, I missed that. I'm not sure what's meant by `erased`, but I'd be curious if a `continue` could not be used instead of manual iterator advancement. nickdesaulniers: Ah sorry, I missed that. I'm not sure what's meant by `erased`, but I'd be curious if a…
mbbi != mbbe; ++mbbi) {		for (auto MII = MBB.begin(), ME = MBB.end(); MII != ME;) {
for (MachineBasicBlock::iterator mi = mbbi->begin(), me = mbbi->end();		MachineInstr &MI = *MII;
mi != me;) {
MachineInstr &MI = *mi;
// Advance iterator here because MI may be erased.		// Advance iterator here because MI may be erased.
++mi;		++MII;

// Only expand pseudos.		// Only expand pseudos.
if (!MI.isPseudo())		if (!MI.isPseudo())
continue;		continue;

// Give targets a chance to expand even standard pseudos.		// Give targets a chance to expand even standard pseudos.
if (TII->expandPostRAPseudo(MI)) {		if (TII->expandPostRAPseudo(MI)) {
MadeChange = true;		MadeChange = true;
continue;		continue;
}		}

// Expand standard pseudos.		// Expand standard pseudos.
switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
		case TargetOpcode::INLINEASM_BR: {
		MachineBasicBlock::iterator Next(MI);
		Next = detail::next_or_end(Next, MBB.end());
		if (Next == MBB.end() \|\| Next->getOpcode() != TargetOpcode::TCOPY)
		break;

		// Find the destination for any TCOPY instructions to sink into.
		TCopyDestBlock = MBB.getInlineAsmBrDefaultTarget();
		assert(TCopyDestBlock && "Cannot find default dest block for callbr!");
		break;
		}
case TargetOpcode::SUBREG_TO_REG:		case TargetOpcode::SUBREG_TO_REG:
MadeChange \|= LowerSubregToReg(&MI);		MadeChange \|= LowerSubregToReg(&MI);
break;		break;
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
MadeChange \|= LowerCopy(&MI);		MadeChange \|= LowerCopy(&MI);
break;		break;
case TargetOpcode::DBG_VALUE:		case TargetOpcode::DBG_VALUE:
continue;		continue;
case TargetOpcode::INSERT_SUBREG:		case TargetOpcode::INSERT_SUBREG:
case TargetOpcode::EXTRACT_SUBREG:		case TargetOpcode::EXTRACT_SUBREG:
llvm_unreachable("Sub-register pseudos should have been eliminated.");		llvm_unreachable("Sub-register pseudos should have been eliminated.");
}		}
}		}
}		}

return MadeChange;		return MadeChange;
}		}

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp

	Show First 20 Lines • Show All 67 Lines • ▼ Show 20 Lines
	bool CombinerHelper::tryCombineCopy(MachineInstr &MI) {			bool CombinerHelper::tryCombineCopy(MachineInstr &MI) {
	if (matchCombineCopy(MI)) {			if (matchCombineCopy(MI)) {
	applyCombineCopy(MI);			applyCombineCopy(MI);
	return true;			return true;
	}			}
	return false;			return false;
	}			}
	bool CombinerHelper::matchCombineCopy(MachineInstr &MI) {			bool CombinerHelper::matchCombineCopy(MachineInstr &MI) {
	if (MI.getOpcode() != TargetOpcode::COPY)			if (!MI.isCopy())
	return false;			return false;
	Register DstReg = MI.getOperand(0).getReg();			Register DstReg = MI.getOperand(0).getReg();
	Register SrcReg = MI.getOperand(1).getReg();			Register SrcReg = MI.getOperand(1).getReg();
	return canReplaceReg(DstReg, SrcReg, MRI);			return canReplaceReg(DstReg, SrcReg, MRI);
	}			}
	void CombinerHelper::applyCombineCopy(MachineInstr &MI) {			void CombinerHelper::applyCombineCopy(MachineInstr &MI) {
	Register DstReg = MI.getOperand(0).getReg();			Register DstReg = MI.getOperand(0).getReg();
	Register SrcReg = MI.getOperand(1).getReg();			Register SrcReg = MI.getOperand(1).getReg();
	▲ Show 20 Lines • Show All 1,538 Lines • Show Last 20 Lines

llvm/lib/CodeGen/GlobalISel/GISelKnownBits.cpp

Show First 20 Lines • Show All 151 Lines • ▼ Show 20 Lines	void GISelKnownBits::computeKnownBitsImpl(Register R, KnownBits &Known,
KnownBits Known2;		KnownBits Known2;

switch (Opcode) {		switch (Opcode) {
default:		default:
TL.computeKnownBitsForTargetInstr(*this, R, Known, DemandedElts, MRI,		TL.computeKnownBitsForTargetInstr(*this, R, Known, DemandedElts, MRI,
Depth);		Depth);
break;		break;
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::G_PHI:		case TargetOpcode::G_PHI:
case TargetOpcode::PHI: {		case TargetOpcode::PHI: {
Known.One = APInt::getAllOnesValue(BitWidth);		Known.One = APInt::getAllOnesValue(BitWidth);
Known.Zero = APInt::getAllOnesValue(BitWidth);		Known.Zero = APInt::getAllOnesValue(BitWidth);
// Destination registers should not have subregisters at this		// Destination registers should not have subregisters at this
// point of the pipeline, otherwise the main live-range will be		// point of the pipeline, otherwise the main live-range will be
// defined more than once, which is against SSA.		// defined more than once, which is against SSA.
assert(MI.getOperand(0).getSubReg() == 0 && "Is this code in SSA?");		assert(MI.getOperand(0).getSubReg() == 0 && "Is this code in SSA?");
Show All 18 Lines	for (unsigned Idx = 1; Idx < MI.getNumOperands(); Idx += 2) {
// register class.		// register class.
//		//
// We can't use NoSubRegister by name as it's defined by each target but		// We can't use NoSubRegister by name as it's defined by each target but
// it's always defined to be 0 by tablegen.		// it's always defined to be 0 by tablegen.
if (SrcReg.isVirtual() && Src.getSubReg() == 0 /NoSubRegister/ &&		if (SrcReg.isVirtual() && Src.getSubReg() == 0 /NoSubRegister/ &&
MRI.getType(SrcReg).isValid()) {		MRI.getType(SrcReg).isValid()) {
// For COPYs we don't do anything, don't increase the depth.		// For COPYs we don't do anything, don't increase the depth.
computeKnownBitsImpl(SrcReg, Known2, DemandedElts,		computeKnownBitsImpl(SrcReg, Known2, DemandedElts,
Depth + (Opcode != TargetOpcode::COPY));		Depth + !MI.isCopy());
Known.One &= Known2.One;		Known.One &= Known2.One;
Known.Zero &= Known2.Zero;		Known.Zero &= Known2.Zero;
// If we reach a point where we don't know anything		// If we reach a point where we don't know anything
// just stop looking through the operands.		// just stop looking through the operands.
if (Known.One == 0 && Known.Zero == 0)		if (Known.One == 0 && Known.Zero == 0)
break;		break;
} else {		} else {
// We know nothing.		// We know nothing.
▲ Show 20 Lines • Show All 227 Lines • ▼ Show 20 Lines	unsigned GISelKnownBits::computeNumSignBits(Register R,
// type constraint. This is unlikely to occur except by looking through copies		// type constraint. This is unlikely to occur except by looking through copies
// but it is possible for the initial register being queried to be in this		// but it is possible for the initial register being queried to be in this
// state.		// state.
if (!DstTy.isValid())		if (!DstTy.isValid())
return 1;		return 1;

unsigned FirstAnswer = 1;		unsigned FirstAnswer = 1;
switch (Opcode) {		switch (Opcode) {
case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY: {
MachineOperand &Src = MI.getOperand(1);		MachineOperand &Src = MI.getOperand(1);
if (Src.getReg().isVirtual() && Src.getSubReg() == 0 &&		if (Src.getReg().isVirtual() && Src.getSubReg() == 0 &&
MRI.getType(Src.getReg()).isValid()) {		MRI.getType(Src.getReg()).isValid()) {
// Don't increment Depth for this one since we didn't do any work.		// Don't increment Depth for this one since we didn't do any work.
return computeNumSignBits(Src.getReg(), DemandedElts, Depth);		return computeNumSignBits(Src.getReg(), DemandedElts, Depth);
}		}

return 1;		return 1;
▲ Show 20 Lines • Show All 65 Lines • Show Last 20 Lines

llvm/lib/CodeGen/GlobalISel/InstructionSelect.cpp

Show First 20 Lines • Show All 159 Lines • ▼ Show 20 Lines	for (auto MII = std::prev(MBB.end()), Begin = MBB.begin(); !ReachedBegin;) {
// Select this instruction.		// Select this instruction.
MachineInstr &MI = *MII;		MachineInstr &MI = *MII;

// And have our iterator point to the next instruction, if there is one.		// And have our iterator point to the next instruction, if there is one.
if (MII == Begin)		if (MII == Begin)
ReachedBegin = true;		ReachedBegin = true;
else		else
--MII;		--MII;
if (MI.getOpcode() != TargetOpcode::COPY)		if (!MI.isCopy())
continue;		continue;
Register SrcReg = MI.getOperand(1).getReg();		Register SrcReg = MI.getOperand(1).getReg();
Register DstReg = MI.getOperand(0).getReg();		Register DstReg = MI.getOperand(0).getReg();
if (Register::isVirtualRegister(SrcReg) &&		if (Register::isVirtualRegister(SrcReg) &&
Register::isVirtualRegister(DstReg)) {		Register::isVirtualRegister(DstReg)) {
auto SrcRC = MRI.getRegClass(SrcReg);		auto SrcRC = MRI.getRegClass(SrcReg);
auto DstRC = MRI.getRegClass(DstReg);		auto DstRC = MRI.getRegClass(DstReg);
if (SrcRC == DstRC) {		if (SrcRC == DstRC) {
▲ Show 20 Lines • Show All 88 Lines • Show Last 20 Lines

llvm/lib/CodeGen/GlobalISel/MachineIRBuilder.cpp

Show First 20 Lines • Show All 1,001 Lines • ▼ Show 20 Lines	MachineInstrBuilder MachineIRBuilder::buildInstr(unsigned Opc,
case TargetOpcode::G_BITCAST: {		case TargetOpcode::G_BITCAST: {
assert(DstOps.size() == 1 && "Invalid Dst");		assert(DstOps.size() == 1 && "Invalid Dst");
assert(SrcOps.size() == 1 && "Invalid Srcs");		assert(SrcOps.size() == 1 && "Invalid Srcs");
assert(DstOps[0].getLLTTy(*getMRI()).getSizeInBits() ==		assert(DstOps[0].getLLTTy(*getMRI()).getSizeInBits() ==
SrcOps[0].getLLTTy(*getMRI()).getSizeInBits() && "invalid bitcast");		SrcOps[0].getLLTTy(*getMRI()).getSizeInBits() && "invalid bitcast");
break;		break;
}		}
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
assert(DstOps.size() == 1 && "Invalid Dst");		assert(DstOps.size() == 1 && "Invalid Dst");
// If the caller wants to add a subreg source it has to be done separately		// If the caller wants to add a subreg source it has to be done separately
// so we may not have any SrcOps at this point yet.		// so we may not have any SrcOps at this point yet.
break;		break;
case TargetOpcode::G_FCMP:		case TargetOpcode::G_FCMP:
case TargetOpcode::G_ICMP: {		case TargetOpcode::G_ICMP: {
assert(DstOps.size() == 1 && "Invalid Dst Operands");		assert(DstOps.size() == 1 && "Invalid Dst Operands");
assert(SrcOps.size() == 3 && "Invalid Src Operands");		assert(SrcOps.size() == 3 && "Invalid Src Operands");
▲ Show 20 Lines • Show All 157 Lines • Show Last 20 Lines

llvm/lib/CodeGen/GlobalISel/Utils.cpp

Show First 20 Lines • Show All 285 Lines • ▼ Show 20 Lines	while ((MI = MRI.getVRegDef(VReg)) && !IsConstantOpcode(MI->getOpcode()) &&
case TargetOpcode::G_SEXT:		case TargetOpcode::G_SEXT:
case TargetOpcode::G_ZEXT:		case TargetOpcode::G_ZEXT:
SeenOpcodes.push_back(std::make_pair(		SeenOpcodes.push_back(std::make_pair(
MI->getOpcode(),		MI->getOpcode(),
MRI.getType(MI->getOperand(0).getReg()).getSizeInBits()));		MRI.getType(MI->getOperand(0).getReg()).getSizeInBits()));
VReg = MI->getOperand(1).getReg();		VReg = MI->getOperand(1).getReg();
break;		break;
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
VReg = MI->getOperand(1).getReg();		VReg = MI->getOperand(1).getReg();
if (Register::isPhysicalRegister(VReg))		if (Register::isPhysicalRegister(VReg))
return None;		return None;
break;		break;
case TargetOpcode::G_INTTOPTR:		case TargetOpcode::G_INTTOPTR:
VReg = MI->getOperand(1).getReg();		VReg = MI->getOperand(1).getReg();
break;		break;
default:		default:
▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines

static llvm::Optional<DefinitionAndSourceRegister>		static llvm::Optional<DefinitionAndSourceRegister>
getDefSrcRegIgnoringCopies(Register Reg, const MachineRegisterInfo &MRI) {		getDefSrcRegIgnoringCopies(Register Reg, const MachineRegisterInfo &MRI) {
Register DefSrcReg = Reg;		Register DefSrcReg = Reg;
auto *DefMI = MRI.getVRegDef(Reg);		auto *DefMI = MRI.getVRegDef(Reg);
auto DstTy = MRI.getType(DefMI->getOperand(0).getReg());		auto DstTy = MRI.getType(DefMI->getOperand(0).getReg());
if (!DstTy.isValid())		if (!DstTy.isValid())
return None;		return None;
while (DefMI->getOpcode() == TargetOpcode::COPY) {		while (DefMI->isCopy()) {
Register SrcReg = DefMI->getOperand(1).getReg();		Register SrcReg = DefMI->getOperand(1).getReg();
auto SrcTy = MRI.getType(SrcReg);		auto SrcTy = MRI.getType(SrcReg);
if (!SrcTy.isValid() \|\| SrcTy != DstTy)		if (!SrcTy.isValid() \|\| SrcTy != DstTy)
break;		break;
DefMI = MRI.getVRegDef(SrcReg);		DefMI = MRI.getVRegDef(SrcReg);
DefSrcReg = SrcReg;		DefSrcReg = SrcReg;
}		}
return DefinitionAndSourceRegister{DefMI, DefSrcReg};		return DefinitionAndSourceRegister{DefMI, DefSrcReg};
▲ Show 20 Lines • Show All 195 Lines • Show Last 20 Lines

llvm/lib/CodeGen/MachineBasicBlock.cpp

	Show First 20 Lines • Show All 265 Lines • ▼ Show 20 Lines

	bool MachineBasicBlock::hasEHPadSuccessor() const {			bool MachineBasicBlock::hasEHPadSuccessor() const {
	for (const_succ_iterator I = succ_begin(), E = succ_end(); I != E; ++I)			for (const_succ_iterator I = succ_begin(), E = succ_end(); I != E; ++I)
	if ((*I)->isEHPad())			if ((*I)->isEHPad())
	return true;			return true;
	return false;			return false;
	}			}

				MachineBasicBlock *MachineBasicBlock::getInlineAsmBrDefaultTarget() {
				if (llvm::none_of(terminators(), [](const MachineInstr &Term) {
				return Term.getOpcode() == TargetOpcode::INLINEASM_BR;
				}))
				return nullptr;

				MachineBasicBlock *DefaultTarget = nullptr;
				for (auto Succ : successors())
				if (!isInlineAsmBrIndirectTarget(Succ)) {
				DefaultTarget = Succ;
				break;
				}
				if (!DefaultTarget) {
				const auto &Br = back();
				if (Br.isUnconditionalBranch()) {
				for (const MachineOperand &MO : Br.operands())
				if (MO.isMBB()) {
				DefaultTarget = MO.getMBB();
				break;
				}
				}
				}
				return DefaultTarget;
				}

	#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)			#if !defined(NDEBUG) \|\| defined(LLVM_ENABLE_DUMP)
	LLVM_DUMP_METHOD void MachineBasicBlock::dump() const {			LLVM_DUMP_METHOD void MachineBasicBlock::dump() const {
	print(dbgs());			print(dbgs());
	}			}
	#endif			#endif

	bool MachineBasicBlock::isLegalToHoistInto() const {			bool MachineBasicBlock::isLegalToHoistInto() const {
	if (isReturnBlock() \|\| hasEHPadSuccessor())			if (isReturnBlock() \|\| hasEHPadSuccessor())
	▲ Show 20 Lines • Show All 1,204 Lines • Show Last 20 Lines

llvm/lib/CodeGen/MachineInstr.cpp

Show First 20 Lines • Show All 1,186 Lines • ▼ Show 20 Lines	bool MachineInstr::isSafeToMove(AAResults *AA, bool &SawStore) const {
// volatiles, but it is required for atomic loads. It is not allowed to move		// volatiles, but it is required for atomic loads. It is not allowed to move
// a load across an atomic load with Ordering > Monotonic.		// a load across an atomic load with Ordering > Monotonic.
if (mayStore() \|\| isCall() \|\| isPHI() \|\|		if (mayStore() \|\| isCall() \|\| isPHI() \|\|
(mayLoad() && hasOrderedMemoryRef())) {		(mayLoad() && hasOrderedMemoryRef())) {
SawStore = true;		SawStore = true;
return false;		return false;
}		}

if (isPosition() \|\| isDebugInstr() \|\| isTerminator() \|\|		if (isPosition() \|\| isDebugInstr() \|\| (isTerminator() && !isCopy()) \|\|
mayRaiseFPException() \|\| hasUnmodeledSideEffects())		mayRaiseFPException() \|\| hasUnmodeledSideEffects())
return false;		return false;

// See if this instruction does a load. If so, we have to guarantee that the		// See if this instruction does a load. If so, we have to guarantee that the
// loaded value doesn't change between the load and the its intended		// loaded value doesn't change between the load and the its intended
// destination. The check for isInvariantLoad gives the targe the chance to		// destination. The check for isInvariantLoad gives the targe the chance to
// classify the load as always returning a constant, e.g. a constant pool		// classify the load as always returning a constant, e.g. a constant pool
// load.		// load.
▲ Show 20 Lines • Show All 1,010 Lines • Show Last 20 Lines

llvm/lib/CodeGen/MachineSink.cpp

Show First 20 Lines • Show All 832 Lines • ▼ Show 20 Lines	static bool attemptDebugCopyProp(MachineInstr &SinkInst, MachineInstr &DbgMI) {
DbgMI.getOperand(0).setReg(SrcMO->getReg());		DbgMI.getOperand(0).setReg(SrcMO->getReg());
DbgMI.getOperand(0).setSubReg(SrcMO->getSubReg());		DbgMI.getOperand(0).setSubReg(SrcMO->getSubReg());
return true;		return true;
}		}

/// Sink an instruction and its associated debug instructions.		/// Sink an instruction and its associated debug instructions.
static void performSink(MachineInstr &MI, MachineBasicBlock &SuccToSinkTo,		static void performSink(MachineInstr &MI, MachineBasicBlock &SuccToSinkTo,
MachineBasicBlock::iterator InsertPos,		MachineBasicBlock::iterator InsertPos,
		const TargetInstrInfo *TII,
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions pass as `const&`, or make `performSink` a private method. nickdesaulniers: pass as `const&`, or make `performSink` a private method.
		voidAuthorUnsubmitted Done Reply Inline Actions Two different passes use this function so it can't be made a private method. I"m not sure why passing it as "const&" is better than a "const"... void:* Two different passes use this function so it can't be made a private method. I"m not sure why…
		nickdesaulniersUnsubmitted Not Done Reply Inline Actions Generally, passing by const reference indicates that a parameter is strictly an input, as opposed to both input AND output, which is why you don't see a mix of pointers and references in this function signature. nickdesaulniers: Generally, passing by const reference indicates that a parameter is strictly an input, as…
		voidAuthorUnsubmitted Done Reply Inline Actions A 'const' doesn't allow modifications either. Also note that none of the references in this function signature are 'const', and are probably passed by reference because they aren't pointers (`SuccToSinkto` notwithstanding) in the originating function. Converting this to a reference is not useful. void:* A 'const*' doesn't allow modifications either. Also note that none of the references in this…
SmallVectorImpl<MachineInstr *> &DbgValuesToSink) {		SmallVectorImpl<MachineInstr *> &DbgValuesToSink) {

// If we cannot find a location to use (merge with), then we erase the debug		// If we cannot find a location to use (merge with), then we erase the debug
// location to prevent debug-info driven tools from potentially reporting		// location to prevent debug-info driven tools from potentially reporting
// wrong location information.		// wrong location information.
if (!SuccToSinkTo.empty() && InsertPos != SuccToSinkTo.end())		if (!SuccToSinkTo.empty() && InsertPos != SuccToSinkTo.end())
MI.setDebugLoc(DILocation::getMergedLocation(MI.getDebugLoc(),		MI.setDebugLoc(DILocation::getMergedLocation(MI.getDebugLoc(),
InsertPos->getDebugLoc()));		InsertPos->getDebugLoc()));
else		else
MI.setDebugLoc(DebugLoc());		MI.setDebugLoc(DebugLoc());

// Move the instruction.		// Move the instruction.
MachineBasicBlock *ParentBlock = MI.getParent();		MachineBasicBlock *ParentBlock = MI.getParent();
SuccToSinkTo.splice(InsertPos, ParentBlock, MI,		SuccToSinkTo.splice(InsertPos, ParentBlock, MI,
++MachineBasicBlock::iterator(MI));		++MachineBasicBlock::iterator(MI));

		// The copy no longer needs to be a terminator, so convert it to a normal
		// COPY.
		if (MI.getOpcode() == TargetOpcode::TCOPY)
		MI.setDesc(TII->get(TargetOpcode::COPY));
		arsenmUnsubmitted Not Done Reply Inline Actions I don't see this captured in a test? arsenm: I don't see this captured in a test?
		voidAuthorUnsubmitted Done Reply Inline Actions This will happen now that we correctly mark `TCOPY` as sinkable. I'll see if I can craft an MIR test to explicitly do this. void: This will happen now that we correctly mark `TCOPY` as sinkable. I'll see if I can craft an MIR…

// Sink a copy of debug users to the insert position. Mark the original		// Sink a copy of debug users to the insert position. Mark the original
// DBG_VALUE location as 'undef', indicating that any earlier variable		// DBG_VALUE location as 'undef', indicating that any earlier variable
// location should be terminated as we've optimised away the value at this		// location should be terminated as we've optimised away the value at this
// point.		// point.
for (SmallVectorImpl<MachineInstr *>::iterator DBI = DbgValuesToSink.begin(),		for (auto *DbgMI : DbgValuesToSink) {
DBE = DbgValuesToSink.end();		MachineInstr *NewDbgMI = DbgMI->getMF()->CloneMachineInstr(DbgMI);
DBI != DBE; ++DBI) {
MachineInstr DbgMI = DBI;
MachineInstr NewDbgMI = DbgMI->getMF()->CloneMachineInstr(DBI);
SuccToSinkTo.insert(InsertPos, NewDbgMI);		SuccToSinkTo.insert(InsertPos, NewDbgMI);

if (!attemptDebugCopyProp(MI, *DbgMI))		if (!attemptDebugCopyProp(MI, *DbgMI))
DbgMI->getOperand(0).setReg(0);		DbgMI->getOperand(0).setReg(0);
}		}
}		}

/// SinkInstruction - Determine whether it is safe to sink the specified machine		/// SinkInstruction - Determine whether it is safe to sink the specified machine
/// instruction out of its current block into a successor.		/// instruction out of its current block into a successor.
bool MachineSinking::SinkInstruction(MachineInstr &MI, bool &SawStore,		bool MachineSinking::SinkInstruction(MachineInstr &MI, bool &SawStore,
AllSuccsCache &AllSuccessors) {		AllSuccsCache &AllSuccessors) {
// Don't sink instructions that the target prefers not to sink.		// Don't sink instructions that the target prefers not to sink.
if (!TII->shouldSink(MI))		if (!TII->shouldSink(MI))
return false;		return false;

// Check if it's safe to move the instruction.		// Check if it's safe to move the instruction.
if (!MI.isSafeToMove(AA, SawStore))		if (!MI.isSafeToMove(AA, SawStore))
return false;		return false;

// Convergent operations may not be made control-dependent on additional		// Convergent operations may not be made control-dependent on additional
// values.		// values.
if (MI.isConvergent())		if (MI.isConvergent())
return false;		return false;

		// Sink TCOPY instructions after register allocation to avoid mucking with
		// live-ins.
		if (MI.getOpcode() == TargetOpcode::TCOPY)
		return false;

// Don't break implicit null checks. This is a performance heuristic, and not		// Don't break implicit null checks. This is a performance heuristic, and not
// required for correctness.		// required for correctness.
if (SinkingPreventsImplicitNullCheck(MI, TII, TRI))		if (SinkingPreventsImplicitNullCheck(MI, TII, TRI))
return false;		return false;

// FIXME: This should include support for sinking instructions within the		// FIXME: This should include support for sinking instructions within the
// block they are currently in to shorten the live ranges. We often get		// block they are currently in to shorten the live ranges. We often get
// instructions sunk into the top of a large block, but it would be better to		// instructions sunk into the top of a large block, but it would be better to
▲ Show 20 Lines • Show All 110 Lines • ▼ Show 20 Lines	bool MachineSinking::SinkInstruction(MachineInstr &MI, bool &SawStore,
}		}

// After sinking, some debug users may not be dominated any more. If possible,		// After sinking, some debug users may not be dominated any more. If possible,
// copy-propagate their operands. As it's expensive, don't do this if there's		// copy-propagate their operands. As it's expensive, don't do this if there's
// no debuginfo in the program.		// no debuginfo in the program.
if (MI.getMF()->getFunction().getSubprogram() && MI.isCopy())		if (MI.getMF()->getFunction().getSubprogram() && MI.isCopy())
SalvageUnsunkDebugUsersOfCopy(MI, SuccToSinkTo);		SalvageUnsunkDebugUsersOfCopy(MI, SuccToSinkTo);

performSink(MI, *SuccToSinkTo, InsertPos, DbgUsersToSink);		performSink(MI, *SuccToSinkTo, InsertPos, TII, DbgUsersToSink);

// Conservatively, clear any kill flags, since it's possible that they are no		// Conservatively, clear any kill flags, since it's possible that they are no
// longer correct.		// longer correct.
// Note that we have to clear the kill flags for any register this instruction		// Note that we have to clear the kill flags for any register this instruction
// uses as we may sink over another instruction which currently kills the		// uses as we may sink over another instruction which currently kills the
// used registers.		// used registers.
for (MachineOperand &MO : MI.operands()) {		for (MachineOperand &MO : MI.operands()) {
if (MO.isReg() && MO.isUse())		if (MO.isReg() && MO.isUse())
▲ Show 20 Lines • Show All 346 Lines • ▼ Show 20 Lines	for (auto I = CurBB.rbegin(), E = CurBB.rend(); I != E;) {
}		}
DbgValsToSink.insert(DbgValsToSink.begin(), DbgValsToSinkSet.begin(),		DbgValsToSink.insert(DbgValsToSink.begin(), DbgValsToSinkSet.begin(),
DbgValsToSinkSet.end());		DbgValsToSinkSet.end());

// Clear the kill flag if SrcReg is killed between MI and the end of the		// Clear the kill flag if SrcReg is killed between MI and the end of the
// block.		// block.
clearKillFlags(MI, CurBB, UsedOpsInCopy, UsedRegUnits, TRI);		clearKillFlags(MI, CurBB, UsedOpsInCopy, UsedRegUnits, TRI);
MachineBasicBlock::iterator InsertPos = SuccBB->getFirstNonPHI();		MachineBasicBlock::iterator InsertPos = SuccBB->getFirstNonPHI();
performSink(MI, SuccBB, InsertPos, DbgValsToSink);		performSink(MI, SuccBB, InsertPos, TII, DbgValsToSink);
updateLiveIn(MI, SuccBB, UsedOpsInCopy, DefedRegsInCopy);		updateLiveIn(MI, SuccBB, UsedOpsInCopy, DefedRegsInCopy);

Changed = true;		Changed = true;
++NumPostRACopySink;		++NumPostRACopySink;
}		}
return Changed;		return Changed;
}		}

		nickdesaulniersUnsubmitted Not Done Reply Inline Actions Does it make sense to "sink" register info invalidation into `performSink()`? (pun intended) Since it's already checking the `MI`'s opcode? Or are the two call sites of `performSink()` problematic? The implementation of `invalidateLiveness()` looks pretty cheap, IMO. nickdesaulniers: Does it make sense to "sink" register info invalidation into `performSink()`? (pun intended)…
		voidAuthorUnsubmitted Done Reply Inline Actions Possibly. If we're going to make `COPY_TERM` (or whatever name we settle on) not specific to `INLINEASM_BR`, then we'll probably need to modify this to accept any case where we sink copies after a terminator. We'll come back to this after other comments settle. void: Possibly. If we're going to make `COPY_TERM` (or whatever name we settle on) not specific to…
bool PostRAMachineSinking::runOnMachineFunction(MachineFunction &MF) {		bool PostRAMachineSinking::runOnMachineFunction(MachineFunction &MF) {
if (skipFunction(MF.getFunction()))		if (skipFunction(MF.getFunction()))
return false;		return false;

bool Changed = false;		bool Changed = false;
const TargetRegisterInfo *TRI = MF.getSubtarget().getRegisterInfo();		const TargetRegisterInfo *TRI = MF.getSubtarget().getRegisterInfo();
const TargetInstrInfo *TII = MF.getSubtarget().getInstrInfo();		const TargetInstrInfo *TII = MF.getSubtarget().getInstrInfo();

ModifiedRegUnits.init(*TRI);		ModifiedRegUnits.init(*TRI);
UsedRegUnits.init(*TRI);		UsedRegUnits.init(*TRI);
for (auto &BB : MF)		for (auto &BB : MF)
Changed \|= tryToSinkCopy(BB, MF, TRI, TII);		Changed \|= tryToSinkCopy(BB, MF, TRI, TII);

return Changed;		return Changed;
}		}

llvm/lib/CodeGen/MachineVerifier.cpp

Show First 20 Lines • Show All 584 Lines • ▼ Show 20 Lines	MachineVerifier::visitMachineBasicBlockBefore(const MachineBasicBlock *MBB) {
FirstNonPHI = nullptr;		FirstNonPHI = nullptr;

if (!MF->getProperties().hasProperty(		if (!MF->getProperties().hasProperty(
MachineFunctionProperties::Property::NoPHIs) && MRI->tracksLiveness()) {		MachineFunctionProperties::Property::NoPHIs) && MRI->tracksLiveness()) {
// If this block has allocatable physical registers live-in, check that		// If this block has allocatable physical registers live-in, check that
// it is an entry block or landing pad.		// it is an entry block or landing pad.
for (const auto &LI : MBB->liveins()) {		for (const auto &LI : MBB->liveins()) {
if (isAllocatable(LI.PhysReg) && !MBB->isEHPad() &&		if (isAllocatable(LI.PhysReg) && !MBB->isEHPad() &&
!MBB->isInlineAsmBrDefaultTarget() &&
MBB->getIterator() != MBB->getParent()->begin()) {		MBB->getIterator() != MBB->getParent()->begin()) {
report("MBB has allocatable live-in, but isn't entry or landing-pad.", MBB);		report("MBB has allocatable live-in, but isn't entry or landing-pad.", MBB);
report_context(LI.PhysReg);		report_context(LI.PhysReg);
}		}
}		}
}		}

// Count the number of landing pad successors.		// Count the number of landing pad successors.
SmallPtrSet<const MachineBasicBlock*, 4> LandingPadSuccs;		SmallPtrSet<const MachineBasicBlock*, 4> LandingPadSuccs;
for (const auto *succ : MBB->successors()) {		for (const auto *succ : MBB->successors()) {
if (succ->isEHPad())		if (succ->isEHPad())
LandingPadSuccs.insert(succ);		LandingPadSuccs.insert(succ);
if (!FunctionBlocks.count(succ))		if (!FunctionBlocks.count(succ))
report("MBB has successor that isn't part of the function.", MBB);		report("MBB has successor that isn't part of the function.", MBB);
if (!MBBInfoMap[succ].Preds.count(MBB)) {		if (!MBBInfoMap[succ].Preds.count(MBB)) {
report("Inconsistent CFG", MBB);		report("Inconsistent CFG", MBB);
errs() << "MBB is not in the predecessor list of the successor "		errs() << "MBB is not in the predecessor list of the successor "
<< printMBBReference(*succ) << ".\n";		<< printMBBReference(*succ) << ".\n";
}		}
}		}

// Count the number of INLINEASM_BR indirect target successors.		// Count the number of INLINEASM_BR indirect target successors.
SmallPtrSet<const MachineBasicBlock*, 4> IndirectTargetSuccs;		SmallPtrSet<const MachineBasicBlock*, 4> IndirectTargetSuccs;
for (const auto *succ : MBB->successors()) {		for (const auto *succ : MBB->successors()) {
		arsenmUnsubmitted Not Done Reply Inline Actions This also looks like a separate change arsenm: This also looks like a separate change
if (MBB->isInlineAsmBrIndirectTarget(succ))		if (MBB->isInlineAsmBrIndirectTarget(succ))
IndirectTargetSuccs.insert(succ);		IndirectTargetSuccs.insert(succ);
if (!FunctionBlocks.count(succ))		if (!FunctionBlocks.count(succ))
report("MBB has successor that isn't part of the function.", MBB);		report("MBB has successor that isn't part of the function.", MBB);
if (!MBBInfoMap[succ].Preds.count(MBB)) {		if (!MBBInfoMap[succ].Preds.count(MBB)) {
report("Inconsistent CFG", MBB);		report("Inconsistent CFG", MBB);
errs() << "MBB is not in the predecessor list of the successor "		errs() << "MBB is not in the predecessor list of the successor "
<< printMBBReference(*succ) << ".\n";		<< printMBBReference(*succ) << ".\n";
▲ Show 20 Lines • Show All 196 Lines • ▼ Show 20 Lines	void MachineVerifier::visitMachineBundleBefore(const MachineInstr *MI) {
if (MI->isTerminator() && !TII->isPredicated(*MI)) {		if (MI->isTerminator() && !TII->isPredicated(*MI)) {
if (!FirstTerminator)		if (!FirstTerminator)
FirstTerminator = MI;		FirstTerminator = MI;
} else if (FirstTerminator) {		} else if (FirstTerminator) {
report("Non-terminator instruction after the first terminator", MI);		report("Non-terminator instruction after the first terminator", MI);
errs() << "First terminator was:\t" << *FirstTerminator;		errs() << "First terminator was:\t" << *FirstTerminator;
}		}
}		}

		nickdesaulniersUnsubmitted Done Reply Inline Actions this is hard to read. Is there a nice way to simply this? Maybe negate, and fold into parent `if`? nickdesaulniers: this is hard to read. Is there a nice way to simply this? Maybe negate, and fold into parent…
// The operands on an INLINEASM instruction must follow a template.		// The operands on an INLINEASM instruction must follow a template.
// Verify that the flag operands make sense.		// Verify that the flag operands make sense.
void MachineVerifier::verifyInlineAsm(const MachineInstr *MI) {		void MachineVerifier::verifyInlineAsm(const MachineInstr *MI) {
// The first two operands on INLINEASM are the asm string and global flags.		// The first two operands on INLINEASM are the asm string and global flags.
if (MI->getNumOperands() < 2) {		if (MI->getNumOperands() < 2) {
report("Too few operands on inline asm", MI);		report("Too few operands on inline asm", MI);
return;		return;
}		}
▲ Show 20 Lines • Show All 659 Lines • ▼ Show 20 Lines	void MachineVerifier::visitMachineInstrBefore(const MachineInstr *MI) {
}		}

StringRef ErrorInfo;		StringRef ErrorInfo;
if (!TII->verifyInstruction(*MI, ErrorInfo))		if (!TII->verifyInstruction(*MI, ErrorInfo))
report(ErrorInfo.data(), MI);		report(ErrorInfo.data(), MI);

// Verify properties of various specific instruction types		// Verify properties of various specific instruction types
switch (MI->getOpcode()) {		switch (MI->getOpcode()) {
		case TargetOpcode::TCOPY: {
		MachineBasicBlock::const_iterator MII(MI), MIE = MI->getParent()->end();
		for (; MII != MIE; ++MII) {
		if (MII->getOpcode() != TargetOpcode::COPY)
		continue;
		report("TCOPY and COPY instructions are intermixed", &*MII);
		errs() << "- TCOPY instruction: ";
		if (Indexes && Indexes->hasIndex(*MI))
		errs() << Indexes->getInstructionIndex(*MI) << '\t';
		MI->print(errs(), /SkipOpers=/true);
		}
		LLVM_FALLTHROUGH;
		}
case TargetOpcode::COPY: {		case TargetOpcode::COPY: {
if (foundErrors)		if (foundErrors)
break;		break;
const MachineOperand &DstOp = MI->getOperand(0);		const MachineOperand &DstOp = MI->getOperand(0);
const MachineOperand &SrcOp = MI->getOperand(1);		const MachineOperand &SrcOp = MI->getOperand(1);
LLT DstTy = MRI->getType(DstOp.getReg());		LLT DstTy = MRI->getType(DstOp.getReg());
LLT SrcTy = MRI->getType(SrcOp.getReg());		LLT SrcTy = MRI->getType(SrcOp.getReg());
if (SrcTy.isValid() && DstTy.isValid()) {		if (SrcTy.isValid() && DstTy.isValid()) {
▲ Show 20 Lines • Show All 1,453 Lines • Show Last 20 Lines

llvm/lib/CodeGen/PeepholeOptimizer.cpp

Show First 20 Lines • Show All 1,088 Lines • ▼ Show 20 Lines	static Rewriter *getCopyRewriter(MachineInstr &MI, const TargetInstrInfo &TII) {
if (MI.isBitcast() \|\| MI.isRegSequenceLike() \|\| MI.isInsertSubregLike() \|\|		if (MI.isBitcast() \|\| MI.isRegSequenceLike() \|\| MI.isInsertSubregLike() \|\|
MI.isExtractSubregLike())		MI.isExtractSubregLike())
return new UncoalescableRewriter(MI);		return new UncoalescableRewriter(MI);

switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
default:		default:
return nullptr;		return nullptr;
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
return new CopyRewriter(MI);		return new CopyRewriter(MI);
case TargetOpcode::INSERT_SUBREG:		case TargetOpcode::INSERT_SUBREG:
return new InsertSubregRewriter(MI);		return new InsertSubregRewriter(MI);
case TargetOpcode::EXTRACT_SUBREG:		case TargetOpcode::EXTRACT_SUBREG:
return new ExtractSubregRewriter(MI, TII);		return new ExtractSubregRewriter(MI, TII);
case TargetOpcode::REG_SEQUENCE:		case TargetOpcode::REG_SEQUENCE:
return new RegSequenceRewriter(MI);		return new RegSequenceRewriter(MI);
}		}
▲ Show 20 Lines • Show All 1,016 Lines • Show Last 20 Lines

llvm/lib/CodeGen/ReachingDefAnalysis.cpp

Show First 20 Lines • Show All 506 Lines • ▼ Show 20 Lines	for (auto &MO : Last->operands())
if (isValidRegDefOf(MO, PhysReg))		if (isValidRegDefOf(MO, PhysReg))
return Last;		return Last;

return Def < 0 ? nullptr : getInstFromId(MBB, Def);		return Def < 0 ? nullptr : getInstFromId(MBB, Def);
}		}

static bool mayHaveSideEffects(MachineInstr &MI) {		static bool mayHaveSideEffects(MachineInstr &MI) {
return MI.mayLoadOrStore() \|\| MI.mayRaiseFPException() \|\|		return MI.mayLoadOrStore() \|\| MI.mayRaiseFPException() \|\|
MI.hasUnmodeledSideEffects() \|\| MI.isTerminator() \|\|		MI.hasUnmodeledSideEffects() \|\| (MI.isTerminator() && !MI.isCopy()) \|\|
MI.isCall() \|\| MI.isBarrier() \|\| MI.isBranch() \|\| MI.isReturn();		MI.isCall() \|\| MI.isBarrier() \|\| MI.isBranch() \|\| MI.isReturn();
}		}

// Can we safely move 'From' to just before 'To'? To satisfy this, 'From' must		// Can we safely move 'From' to just before 'To'? To satisfy this, 'From' must
// not define a register that is used by any instructions, after and including,		// not define a register that is used by any instructions, after and including,
// 'To'. These instructions also must not redefine any of Froms operands.		// 'To'. These instructions also must not redefine any of Froms operands.
template<typename Iterator>		template<typename Iterator>
bool ReachingDefAnalysis::isSafeToMove(MachineInstr *From,		bool ReachingDefAnalysis::isSafeToMove(MachineInstr *From,
▲ Show 20 Lines • Show All 148 Lines • Show Last 20 Lines

llvm/lib/CodeGen/RegAllocFast.cpp

Show First 20 Lines • Show All 76 Lines • ▼ Show 20 Lines	private:
/// Basic block currently being allocated.		/// Basic block currently being allocated.
MachineBasicBlock *MBB;		MachineBasicBlock *MBB;

/// Maps virtual regs to the frame index where these values are spilled.		/// Maps virtual regs to the frame index where these values are spilled.
IndexedMap<int, VirtReg2IndexFunctor> StackSlotForVirtReg;		IndexedMap<int, VirtReg2IndexFunctor> StackSlotForVirtReg;

/// Everything we know about a live virtual register.		/// Everything we know about a live virtual register.
struct LiveReg {		struct LiveReg {
		MachineInstr *OrigMI = nullptr; ///< Instr the LiveReg comes from.
MachineInstr *LastUse = nullptr; ///< Last instr to use reg.		MachineInstr *LastUse = nullptr; ///< Last instr to use reg.
Register VirtReg; ///< Virtual register number.		Register VirtReg; ///< Virtual register number.
MCPhysReg PhysReg = 0; ///< Currently held here.		MCPhysReg PhysReg = 0; ///< Currently held here.
unsigned short LastOpNum = 0; ///< OpNum on LastUse.		unsigned short LastOpNum = 0; ///< OpNum on LastUse.
bool Dirty = false; ///< Register needs spill.		bool Dirty = false; ///< Register needs spill.

explicit LiveReg(Register VirtReg) : VirtReg(VirtReg) {}		explicit LiveReg(Register VirtReg) : VirtReg(VirtReg) {}

▲ Show 20 Lines • Show All 345 Lines • ▼ Show 20 Lines	if (LiveVirtRegs.empty())
return;		return;
// The LiveRegMap is keyed by an unsigned (the virtreg number), so the order		// The LiveRegMap is keyed by an unsigned (the virtreg number), so the order
// of spilling here is deterministic, if arbitrary.		// of spilling here is deterministic, if arbitrary.
for (LiveReg &LR : LiveVirtRegs) {		for (LiveReg &LR : LiveVirtRegs) {
if (!LR.PhysReg)		if (!LR.PhysReg)
continue;		continue;
if (OnlyLiveOut && !mayLiveOut(LR.VirtReg))		if (OnlyLiveOut && !mayLiveOut(LR.VirtReg))
continue;		continue;
spillVirtReg(MI, LR);		spillVirtReg(LR.OrigMI->getOpcode() == TargetOpcode::TCOPY ? LR.OrigMI : MI,
		LR);
}		}
LiveVirtRegs.clear();		LiveVirtRegs.clear();
}		}

/// Handle the direct use of a physical register. Check that the register is		/// Handle the direct use of a physical register. Check that the register is
/// not used by a virtreg. Kill the physreg, marking it free. This may add		/// not used by a virtreg. Kill the physreg, marking it free. This may add
/// implicit kills to MO->getParent() and invalidate MO.		/// implicit kills to MO->getParent() and invalidate MO.
void RegAllocFast::usePhysReg(MachineOperand &MO) {		void RegAllocFast::usePhysReg(MachineOperand &MO) {
▲ Show 20 Lines • Show All 338 Lines • ▼ Show 20 Lines	if (!LRI->PhysReg) {
allocVirtReg(MI, *LRI, Hint);		allocVirtReg(MI, *LRI, Hint);
} else if (LRI->LastUse) {		} else if (LRI->LastUse) {
// Redefining a live register - kill at the last use, unless it is this		// Redefining a live register - kill at the last use, unless it is this
// instruction defining VirtReg multiple times.		// instruction defining VirtReg multiple times.
if (LRI->LastUse != &MI \|\| LRI->LastUse->getOperand(LRI->LastOpNum).isUse())		if (LRI->LastUse != &MI \|\| LRI->LastUse->getOperand(LRI->LastOpNum).isUse())
addKillFlag(*LRI);		addKillFlag(*LRI);
}		}
assert(LRI->PhysReg && "Register not assigned");		assert(LRI->PhysReg && "Register not assigned");
		LRI->OrigMI = &MI;
LRI->LastUse = &MI;		LRI->LastUse = &MI;
LRI->LastOpNum = OpNum;		LRI->LastOpNum = OpNum;
LRI->Dirty = true;		LRI->Dirty = true;
markRegUsedInInstr(LRI->PhysReg);		markRegUsedInInstr(LRI->PhysReg);
return LRI->PhysReg;		return LRI->PhysReg;
}		}

/// Make sure VirtReg is available in a physreg and return it.		/// Make sure VirtReg is available in a physreg and return it.
Show All 30 Lines	if (!LRI->PhysReg) {
// This would cause a second reload of %x into a different register.		// This would cause a second reload of %x into a different register.
LLVM_DEBUG(dbgs() << "Clearing clean kill: " << MO << '\n');		LLVM_DEBUG(dbgs() << "Clearing clean kill: " << MO << '\n');
MO.setIsKill(false);		MO.setIsKill(false);
} else if (MO.isDead()) {		} else if (MO.isDead()) {
LLVM_DEBUG(dbgs() << "Clearing clean dead: " << MO << '\n');		LLVM_DEBUG(dbgs() << "Clearing clean dead: " << MO << '\n');
MO.setIsDead(false);		MO.setIsDead(false);
}		}
assert(LRI->PhysReg && "Register not assigned");		assert(LRI->PhysReg && "Register not assigned");
		LRI->OrigMI = &MI;
LRI->LastUse = &MI;		LRI->LastUse = &MI;
LRI->LastOpNum = OpNum;		LRI->LastOpNum = OpNum;
markRegUsedInInstr(LRI->PhysReg);		markRegUsedInInstr(LRI->PhysReg);
return *LRI;		return *LRI;
}		}

/// Changes operand OpNum in MI the refer the PhysReg, considering subregs. This		/// Changes operand OpNum in MI the refer the PhysReg, considering subregs. This
/// may invalidate any operand pointers. Return true if the operand kills its		/// may invalidate any operand pointers. Return true if the operand kills its
▲ Show 20 Lines • Show All 475 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp

Show First 20 Lines • Show All 169 Lines • ▼ Show 20 Lines	EmitCopyFromReg(SDNode *Node, unsigned ResNo, bool IsClone, bool IsCloned,

// If all uses are reading from the src physical register and copying the		// If all uses are reading from the src physical register and copying the
// register is either impossible or very expensive, then don't create a copy.		// register is either impossible or very expensive, then don't create a copy.
if (MatchReg && SrcRC->getCopyCost() < 0) {		if (MatchReg && SrcRC->getCopyCost() < 0) {
VRBase = SrcReg;		VRBase = SrcReg;
} else {		} else {
// Create the reg, emit the copy.		// Create the reg, emit the copy.
VRBase = MRI->createVirtualRegister(DstRC);		VRBase = MRI->createVirtualRegister(DstRC);
BuildMI(*MBB, InsertPos, Node->getDebugLoc(), TII->get(TargetOpcode::COPY),
VRBase).addReg(SrcReg);		// FIXME: The predicate to determine whether an instruction is a COPY or
		arsenmUnsubmitted Not Done Reply Inline Actions I would expect this to not special case INLINEASM_BR. This should be any value defined by a terminator instruction arsenm: I would expect this to not special case INLINEASM_BR. This should be any value defined by a…
		voidAuthorUnsubmitted Done Reply Inline Actions Does this include unconditional jump terminators? void: Does this include unconditional jump terminators?
		arsenmUnsubmitted Not Done Reply Inline Actions I'm not sure how you could construct a sensible unconditional jump that would require a copy after it. Seems like something to check in the verifier arsenm: I'm not sure how you could construct a sensible unconditional jump that would require a copy…
		voidAuthorUnsubmitted Done Reply Inline Actions My comment wasn't very good. When converting something to a TCOPY, we need to figure out which terminators are candidates for having a TCOPY after them. Or perhaps better, what condition requires a TCOPY instead of a regular COPY. void: My comment wasn't very good. When converting something to a TCOPY, we need to figure out which…
		// TCOPY should be generic. At this time though the criteria isn't
		// well-known except for INLINEASM_BR instructions.
		unsigned TgtOpc =
		llvm::any_of(MBB->terminators(),
		[](const MachineInstr &Term) {
		return Term.getOpcode() == TargetOpcode::INLINEASM_BR;
		})
		? TargetOpcode::TCOPY
		: TargetOpcode::COPY;
		BuildMI(*MBB, InsertPos, Node->getDebugLoc(), TII->get(TgtOpc), VRBase)
		.addReg(SrcReg);
}		}

SDValue Op(Node, ResNo);		SDValue Op(Node, ResNo);
if (IsClone)		if (IsClone)
VRBaseMap.erase(Op);		VRBaseMap.erase(Op);
bool isNew = VRBaseMap.insert(std::make_pair(Op, VRBase)).second;		bool isNew = VRBaseMap.insert(std::make_pair(Op, VRBase)).second;
(void)isNew; // Silence compiler warning.		(void)isNew; // Silence compiler warning.
assert(isNew && "Node emitted out of order - early");		assert(isNew && "Node emitted out of order - early");
▲ Show 20 Lines • Show All 817 Lines • ▼ Show 20 Lines	case ISD::CopyToReg: {
if (RegisterSDNode *R = dyn_cast<RegisterSDNode>(SrcVal))		if (RegisterSDNode *R = dyn_cast<RegisterSDNode>(SrcVal))
SrcReg = R->getReg();		SrcReg = R->getReg();
else		else
SrcReg = getVR(SrcVal, VRBaseMap);		SrcReg = getVR(SrcVal, VRBaseMap);

if (SrcReg == DestReg) // Coalesced away the copy? Ignore.		if (SrcReg == DestReg) // Coalesced away the copy? Ignore.
break;		break;

BuildMI(*MBB, InsertPos, Node->getDebugLoc(), TII->get(TargetOpcode::COPY),		unsigned TgtOpc =
DestReg).addReg(SrcReg);		llvm::any_of(MBB->terminators(),
		[](const MachineInstr &Term) {
		return Term.getOpcode() == TargetOpcode::INLINEASM_BR;
		})
		? TargetOpcode::TCOPY
		: TargetOpcode::COPY;
		BuildMI(*MBB, InsertPos, Node->getDebugLoc(), TII->get(TgtOpc), DestReg)
		.addReg(SrcReg);
break;		break;
}		}
case ISD::CopyFromReg: {		case ISD::CopyFromReg: {
unsigned SrcReg = cast<RegisterSDNode>(Node->getOperand(1))->getReg();		unsigned SrcReg = cast<RegisterSDNode>(Node->getOperand(1))->getReg();
EmitCopyFromReg(Node, 0, IsClone, IsCloned, SrcReg, VRBaseMap);		EmitCopyFromReg(Node, 0, IsClone, IsCloned, SrcReg, VRBaseMap);
break;		break;
}		}
case ISD::EH_LABEL:		case ISD::EH_LABEL:
▲ Show 20 Lines • Show All 143 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp

Show First 20 Lines • Show All 1,022 Lines • ▼ Show 20 Lines	for (const auto &InstrOrder : Orders) {
}		}
if (DLI == DLE)		if (DLI == DLE)
break;		break;

LastOrder = Order;		LastOrder = Order;
}		}
}		}

// Split after an INLINEASM_BR block with outputs. This allows us to keep the		// Split after an INLINEASM_BR block with outputs. This gives us a place to
// copy to/from register instructions from being between two terminator		// store output values.
// instructions, which causes the machine instruction verifier agita.		auto InlineAsmBr = llvm::find_if(BB->terminators(), [](MachineInstr &term) {
auto TI = llvm::find_if(*BB, [](const MachineInstr &MI){		return term.getOpcode() == TargetOpcode::INLINEASM_BR;
return MI.getOpcode() == TargetOpcode::INLINEASM_BR;
});		});
auto SplicePt = TI != BB->end() ? std::next(TI) : BB->end();		auto TermIter = detail::next_or_end(InlineAsmBr, BB->end());
if (TI != BB->end() && SplicePt != BB->end() &&		if (InlineAsmBr != BB->end() && TermIter != BB->end() &&
TI->getOpcode() == TargetOpcode::INLINEASM_BR &&		TermIter->getOpcode() == TargetOpcode::TCOPY) {
SplicePt->getOpcode() == TargetOpcode::COPY) {		do {
MachineBasicBlock *FallThrough = BB->getFallThrough();		++TermIter;
if (!FallThrough)		} while (TermIter != BB->end() &&
for (const MachineOperand &MO : BB->back().operands())		TermIter->getOpcode() == TargetOpcode::TCOPY);
if (MO.isMBB()) {
FallThrough = MO.getMBB();		MachineBasicBlock *DefaultTarget = BB->getInlineAsmBrDefaultTarget();
break;		assert(DefaultTarget && "Cannot find default dest block for callbr!");
}
assert(FallThrough && "Cannot find default dest block for callbr!");

MachineBasicBlock *CopyBB = MF.CreateMachineBasicBlock(BB->getBasicBlock());		MachineBasicBlock *CopyBB = MF.CreateMachineBasicBlock(BB->getBasicBlock());
MachineFunction::iterator BBI(*BB);		MachineFunction::iterator BBI(*BB);
MF.insert(++BBI, CopyBB);		MF.insert(++BBI, CopyBB);
		if (TermIter != BB->end())
		CopyBB->splice(CopyBB->begin(), BB, TermIter, BB->end());

CopyBB->splice(CopyBB->begin(), BB, SplicePt, BB->end());		CopyBB->addSuccessor(DefaultTarget, BranchProbability::getOne());
CopyBB->setInlineAsmBrDefaultTarget();		BB->removeSuccessor(DefaultTarget);

CopyBB->addSuccessor(FallThrough, BranchProbability::getOne());
BB->removeSuccessor(FallThrough);
BB->addSuccessor(CopyBB, BranchProbability::getOne());		BB->addSuccessor(CopyBB, BranchProbability::getOne());

// Mark all physical registers defined in the original block as being live
// on entry to the copy block.
for (const auto &MI : *CopyBB)
for (const MachineOperand &MO : MI.operands())
if (MO.isReg()) {
Register reg = MO.getReg();
if (Register::isPhysicalRegister(reg)) {
CopyBB->addLiveIn(reg);
break;
}
}

CopyBB->normalizeSuccProbs();		CopyBB->normalizeSuccProbs();
BB->normalizeSuccProbs();		BB->normalizeSuccProbs();

BB->transferInlineAsmBrIndirectTargets(CopyBB);

InsertPos = CopyBB->end();		InsertPos = CopyBB->end();
return CopyBB;		return CopyBB;
}		}

InsertPos = Emitter.getInsertPos();		InsertPos = Emitter.getInsertPos();
return Emitter.getBlock();		return Emitter.getBlock();
}		}

/// Return the basic block label.		/// Return the basic block label.
std::string ScheduleDAGSDNodes::getDAGName() const {		std::string ScheduleDAGSDNodes::getDAGName() const {
return "sunit-dag." + BB->getFullName();		return "sunit-dag." + BB->getFullName();
}		}

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,859 Lines • ▼ Show 20 Lines	assert(!I.hasOperandBundlesOtherThan(
"Cannot lower callbrs with arbitrary operand bundles yet!");		"Cannot lower callbrs with arbitrary operand bundles yet!");

assert(I.isInlineAsm() && "Only know how to handle inlineasm callbr");		assert(I.isInlineAsm() && "Only know how to handle inlineasm callbr");
visitInlineAsm(I);		visitInlineAsm(I);
CopyToExportRegsIfNeeded(&I);		CopyToExportRegsIfNeeded(&I);

// Retrieve successors.		// Retrieve successors.
MachineBasicBlock *Return = FuncInfo.MBBMap[I.getDefaultDest()];		MachineBasicBlock *Return = FuncInfo.MBBMap[I.getDefaultDest()];
Return->setInlineAsmBrDefaultTarget();

// Update successor info.		// Update successor info.
addSuccessorWithProb(CallBrMBB, Return, BranchProbability::getOne());		addSuccessorWithProb(CallBrMBB, Return, BranchProbability::getOne());
for (unsigned i = 0, e = I.getNumIndirectDests(); i < e; ++i) {		for (unsigned i = 0, e = I.getNumIndirectDests(); i < e; ++i) {
MachineBasicBlock *Target = FuncInfo.MBBMap[I.getIndirectDest(i)];		MachineBasicBlock *Target = FuncInfo.MBBMap[I.getIndirectDest(i)];
addSuccessorWithProb(CallBrMBB, Target, BranchProbability::getZero());		addSuccessorWithProb(CallBrMBB, Target, BranchProbability::getZero());
CallBrMBB->addInlineAsmBrIndirectTarget(Target);		CallBrMBB->addInlineAsmBrIndirectTarget(Target);
}		}
▲ Show 20 Lines • Show All 7,693 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64CallLowering.cpp

Show First 20 Lines • Show All 646 Lines • ▼ Show 20 Lines	if (OutInfo.Regs.size() > 1) {
dbgs() << "... Cannot handle arguments in multiple registers.\n");		dbgs() << "... Cannot handle arguments in multiple registers.\n");
return false;		return false;
}		}

// Check if we copy the register, walking through copies from virtual		// Check if we copy the register, walking through copies from virtual
// registers. Note that getDefIgnoringCopies does not ignore copies from		// registers. Note that getDefIgnoringCopies does not ignore copies from
// physical registers.		// physical registers.
MachineInstr *RegDef = getDefIgnoringCopies(OutInfo.Regs[0], MRI);		MachineInstr *RegDef = getDefIgnoringCopies(OutInfo.Regs[0], MRI);
if (!RegDef \|\| RegDef->getOpcode() != TargetOpcode::COPY) {		if (!RegDef \|\| !RegDef->isCopy()) {
LLVM_DEBUG(		LLVM_DEBUG(
dbgs()		dbgs()
<< "... Parameter was not copied into a VReg, cannot tail call.\n");		<< "... Parameter was not copied into a VReg, cannot tail call.\n");
return false;		return false;
}		}

// Got a copy. Verify that it's the same as the register we want.		// Got a copy. Verify that it's the same as the register we want.
Register CopyRHS = RegDef->getOperand(1).getReg();		Register CopyRHS = RegDef->getOperand(1).getReg();
▲ Show 20 Lines • Show All 380 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64FastISel.cpp

Show First 20 Lines • Show All 4,541 Lines • ▼ Show 20 Lines	bool AArch64FastISel::optimizeIntExtLoad(const Instruction *I, MVT RetVT,
MachineInstr *MI = MRI.getUniqueVRegDef(Reg);		MachineInstr *MI = MRI.getUniqueVRegDef(Reg);
if (!MI)		if (!MI)
return false;		return false;

// Check if the correct load instruction has been emitted - SelectionDAG might		// Check if the correct load instruction has been emitted - SelectionDAG might
// have emitted a zero-extending load, but we need a sign-extending load.		// have emitted a zero-extending load, but we need a sign-extending load.
bool IsZExt = isa<ZExtInst>(I);		bool IsZExt = isa<ZExtInst>(I);
const auto *LoadMI = MI;		const auto *LoadMI = MI;
if (LoadMI->getOpcode() == TargetOpcode::COPY &&		if (LoadMI->isCopy() &&
LoadMI->getOperand(1).getSubReg() == AArch64::sub_32) {		LoadMI->getOperand(1).getSubReg() == AArch64::sub_32) {
Register LoadReg = MI->getOperand(1).getReg();		Register LoadReg = MI->getOperand(1).getReg();
LoadMI = MRI.getUniqueVRegDef(LoadReg);		LoadMI = MRI.getUniqueVRegDef(LoadReg);
assert(LoadMI && "Expected valid instruction");		assert(LoadMI && "Expected valid instruction");
}		}
if (!(IsZExt && isZExtLoad(LoadMI)) && !(!IsZExt && isSExtLoad(LoadMI)))		if (!(IsZExt && isZExtLoad(LoadMI)) && !(!IsZExt && isSExtLoad(LoadMI)))
return false;		return false;

// Nothing to be done.		// Nothing to be done.
if (RetVT != MVT::i64 \|\| SrcVT > MVT::i32) {		if (RetVT != MVT::i64 \|\| SrcVT > MVT::i32) {
updateValueMap(I, Reg);		updateValueMap(I, Reg);
return true;		return true;
}		}

if (IsZExt) {		if (IsZExt) {
unsigned Reg64 = createResultReg(&AArch64::GPR64RegClass);		unsigned Reg64 = createResultReg(&AArch64::GPR64RegClass);
BuildMI(*FuncInfo.MBB, FuncInfo.InsertPt, DbgLoc,		BuildMI(*FuncInfo.MBB, FuncInfo.InsertPt, DbgLoc,
TII.get(AArch64::SUBREG_TO_REG), Reg64)		TII.get(AArch64::SUBREG_TO_REG), Reg64)
.addImm(0)		.addImm(0)
.addReg(Reg, getKillRegState(true))		.addReg(Reg, getKillRegState(true))
.addImm(AArch64::sub_32);		.addImm(AArch64::sub_32);
Reg = Reg64;		Reg = Reg64;
} else {		} else {
assert((MI->getOpcode() == TargetOpcode::COPY &&		assert((MI->isCopy() && MI->getOperand(1).getSubReg() == AArch64::sub_32) &&
MI->getOperand(1).getSubReg() == AArch64::sub_32) &&
"Expected copy instruction");		"Expected copy instruction");
Reg = MI->getOperand(1).getReg();		Reg = MI->getOperand(1).getReg();
MachineBasicBlock::iterator I(MI);		MachineBasicBlock::iterator I(MI);
removeDeadCode(I, std::next(I));		removeDeadCode(I, std::next(I));
}		}
updateValueMap(I, Reg);		updateValueMap(I, Reg);
return true;		return true;
}		}
▲ Show 20 Lines • Show All 664 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp

Show First 20 Lines • Show All 713 Lines • ▼ Show 20 Lines	bool AArch64InstrInfo::isAsCheapAsAMove(const MachineInstr &MI) const {
if (Subtarget.hasZeroCycleZeroingFP()) {		if (Subtarget.hasZeroCycleZeroingFP()) {
if (Opcode == AArch64::FMOVH0 \|\|		if (Opcode == AArch64::FMOVH0 \|\|
Opcode == AArch64::FMOVS0 \|\|		Opcode == AArch64::FMOVS0 \|\|
Opcode == AArch64::FMOVD0)		Opcode == AArch64::FMOVD0)
return true;		return true;
}		}

if (Subtarget.hasZeroCycleZeroingGP()) {		if (Subtarget.hasZeroCycleZeroingGP()) {
if (Opcode == TargetOpcode::COPY &&		if (MI.isCopy() && (MI.getOperand(1).getReg() == AArch64::WZR \|\|
(MI.getOperand(1).getReg() == AArch64::WZR \|\|
MI.getOperand(1).getReg() == AArch64::XZR))		MI.getOperand(1).getReg() == AArch64::XZR))
return true;		return true;
}		}

// Secondly, check cases specific to sub-targets.		// Secondly, check cases specific to sub-targets.

if (Subtarget.hasExynosCheapAsMoveHandling()) {		if (Subtarget.hasExynosCheapAsMoveHandling()) {
if (isExynosCheapAsMove(MI))		if (isExynosCheapAsMove(MI))
return true;		return true;
▲ Show 20 Lines • Show All 878 Lines • ▼ Show 20 Lines	if (MI.getOperand(1).isImm() && MI.getOperand(1).getImm() == 0) {
return true;		return true;
}		}
break;		break;
case AArch64::ANDWri: // and Rd, Rzr, #imm		case AArch64::ANDWri: // and Rd, Rzr, #imm
return MI.getOperand(1).getReg() == AArch64::WZR;		return MI.getOperand(1).getReg() == AArch64::WZR;
case AArch64::ANDXri:		case AArch64::ANDXri:
return MI.getOperand(1).getReg() == AArch64::XZR;		return MI.getOperand(1).getReg() == AArch64::XZR;
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
return MI.getOperand(1).getReg() == AArch64::WZR;		return MI.getOperand(1).getReg() == AArch64::WZR;
}		}
return false;		return false;
}		}

// Return true if this instruction simply renames a general register without		// Return true if this instruction simply renames a general register without
// modifying bits.		// modifying bits.
bool AArch64InstrInfo::isGPRCopy(const MachineInstr &MI) {		bool AArch64InstrInfo::isGPRCopy(const MachineInstr &MI) {
switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
default:		default:
break;		break;
case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY: {
// GPR32 copies will by lowered to ORRXrs		// GPR32 copies will by lowered to ORRXrs
Register DstReg = MI.getOperand(0).getReg();		Register DstReg = MI.getOperand(0).getReg();
return (AArch64::GPR32RegClass.contains(DstReg) \|\|		return (AArch64::GPR32RegClass.contains(DstReg) \|\|
AArch64::GPR64RegClass.contains(DstReg));		AArch64::GPR64RegClass.contains(DstReg));
}		}
case AArch64::ORRXrs: // orr Xd, Xzr, Xm (LSL #0)		case AArch64::ORRXrs: // orr Xd, Xzr, Xm (LSL #0)
if (MI.getOperand(1).getReg() == AArch64::XZR) {		if (MI.getOperand(1).getReg() == AArch64::XZR) {
assert(MI.getDesc().getNumOperands() == 4 &&		assert(MI.getDesc().getNumOperands() == 4 &&
Show All 13 Lines
}		}

// Return true if this instruction simply renames a general register without		// Return true if this instruction simply renames a general register without
// modifying bits.		// modifying bits.
bool AArch64InstrInfo::isFPRCopy(const MachineInstr &MI) {		bool AArch64InstrInfo::isFPRCopy(const MachineInstr &MI) {
switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
default:		default:
break;		break;
case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY: {
// FPR64 copies will by lowered to ORR.16b		// FPR64 copies will by lowered to ORR.16b
Register DstReg = MI.getOperand(0).getReg();		Register DstReg = MI.getOperand(0).getReg();
return (AArch64::FPR64RegClass.contains(DstReg) \|\|		return (AArch64::FPR64RegClass.contains(DstReg) \|\|
AArch64::FPR128RegClass.contains(DstReg));		AArch64::FPR128RegClass.contains(DstReg));
}		}
case AArch64::ORRv16i8:		case AArch64::ORRv16i8:
if (MI.getOperand(1).getReg() == MI.getOperand(2).getReg()) {		if (MI.getOperand(1).getReg() == MI.getOperand(2).getReg()) {
assert(MI.getDesc().getNumOperands() == 3 && MI.getOperand(0).isReg() &&		assert(MI.getDesc().getNumOperands() == 3 && MI.getOperand(0).isReg() &&
▲ Show 20 Lines • Show All 5,103 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64InstructionSelector.cpp

Show First 20 Lines • Show All 3,867 Lines • ▼ Show 20 Lines	if (!MRI.hasOneNonDBGUse(CondDefReg)) {
if (UI.getOpcode() != TargetOpcode::G_SELECT)		if (UI.getOpcode() != TargetOpcode::G_SELECT)
return false;		return false;
}		}
}		}

// We can skip over G_TRUNC since the condition is 1-bit.		// We can skip over G_TRUNC since the condition is 1-bit.
// Truncating/extending can have no impact on the value.		// Truncating/extending can have no impact on the value.
unsigned Opc = CondDef->getOpcode();		unsigned Opc = CondDef->getOpcode();
if (Opc != TargetOpcode::COPY && Opc != TargetOpcode::G_TRUNC)		if (!CondDef->isCopy() && Opc != TargetOpcode::G_TRUNC)
break;		break;

// Can't see past copies from physregs.		// Can't see past copies from physregs.
if (Opc == TargetOpcode::COPY &&		if (CondDef->isCopy() &&
Register::isPhysicalRegister(CondDef->getOperand(1).getReg()))		Register::isPhysicalRegister(CondDef->getOperand(1).getReg()))
return false;		return false;

CondDef = MRI.getVRegDef(CondDef->getOperand(1).getReg());		CondDef = MRI.getVRegDef(CondDef->getOperand(1).getReg());
}		}

// Is the condition defined by a compare?		// Is the condition defined by a compare?
if (!CondDef)		if (!CondDef)
▲ Show 20 Lines • Show All 1,601 Lines • ▼ Show 20 Lines	bool AArch64InstructionSelector::isDef32(const MachineInstr &MI) const {
// Only return true if we know the operation will zero-out the high half of		// Only return true if we know the operation will zero-out the high half of
// the 64-bit register. Truncates can be subregister copies, which don't		// the 64-bit register. Truncates can be subregister copies, which don't
// zero out the high bits. Copies and other copy-like instructions can be		// zero out the high bits. Copies and other copy-like instructions can be
// fed by truncates, or could be lowered as subregister copies.		// fed by truncates, or could be lowered as subregister copies.
switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
default:		default:
return true;		return true;
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::G_BITCAST:		case TargetOpcode::G_BITCAST:
case TargetOpcode::G_TRUNC:		case TargetOpcode::G_TRUNC:
case TargetOpcode::G_PHI:		case TargetOpcode::G_PHI:
return false;		return false;
}		}
}		}


▲ Show 20 Lines • Show All 96 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64RegisterBankInfo.cpp

Show First 20 Lines • Show All 461 Lines • ▼ Show 20 Lines	#endif // End NDEBUG.

return getInstructionMapping(DefaultMappingID, 1,		return getInstructionMapping(DefaultMappingID, 1,
getValueMapping(RBIdx, Size), NumOperands);		getValueMapping(RBIdx, Size), NumOperands);
}		}

bool AArch64RegisterBankInfo::hasFPConstraints(		bool AArch64RegisterBankInfo::hasFPConstraints(
const MachineInstr &MI, const MachineRegisterInfo &MRI,		const MachineInstr &MI, const MachineRegisterInfo &MRI,
const TargetRegisterInfo &TRI) const {		const TargetRegisterInfo &TRI) const {
unsigned Op = MI.getOpcode();

// Do we have an explicit floating point instruction?		// Do we have an explicit floating point instruction?
if (isPreISelGenericFloatingPointOpcode(Op))		if (isPreISelGenericFloatingPointOpcode(MI.getOpcode()))
return true;		return true;

// No. Check if we have a copy-like instruction. If we do, then we could		// No. Check if we have a copy-like instruction. If we do, then we could
// still be fed by floating point instructions.		// still be fed by floating point instructions.
if (Op != TargetOpcode::COPY && !MI.isPHI())		if (!MI.isCopy() && !MI.isPHI())
return false;		return false;

// MI is copy-like. Return true if it outputs an FPR.		// MI is copy-like. Return true if it outputs an FPR.
return getRegBank(MI.getOperand(0).getReg(), MRI, TRI) ==		return getRegBank(MI.getOperand(0).getReg(), MRI, TRI) ==
&AArch64::FPRRegBank;		&AArch64::FPRRegBank;
}		}

bool AArch64RegisterBankInfo::onlyUsesFP(const MachineInstr &MI,		bool AArch64RegisterBankInfo::onlyUsesFP(const MachineInstr &MI,
Show All 26 Lines
}		}

const RegisterBankInfo::InstructionMapping &		const RegisterBankInfo::InstructionMapping &
AArch64RegisterBankInfo::getInstrMapping(const MachineInstr &MI) const {		AArch64RegisterBankInfo::getInstrMapping(const MachineInstr &MI) const {
const unsigned Opc = MI.getOpcode();		const unsigned Opc = MI.getOpcode();

// Try the default logic for non-generic instructions that are either copies		// Try the default logic for non-generic instructions that are either copies
// or already have some operands assigned to banks.		// or already have some operands assigned to banks.
if ((Opc != TargetOpcode::COPY && !isPreISelGenericOpcode(Opc)) \|\|		if ((!MI.isCopy() && !isPreISelGenericOpcode(Opc)) \|\|
Opc == TargetOpcode::G_PHI) {		Opc == TargetOpcode::G_PHI) {
const RegisterBankInfo::InstructionMapping &Mapping =		const RegisterBankInfo::InstructionMapping &Mapping =
getInstrMappingImpl(MI);		getInstrMappingImpl(MI);
if (Mapping.isValid())		if (Mapping.isValid())
return Mapping;		return Mapping;
}		}

const MachineFunction &MF = *MI.getParent()->getParent();		const MachineFunction &MF = *MI.getParent()->getParent();
Show All 34 Lines	AArch64RegisterBankInfo::getInstrMapping(const MachineInstr &MI) const {
case TargetOpcode::G_ASHR: {		case TargetOpcode::G_ASHR: {
LLT ShiftAmtTy = MRI.getType(MI.getOperand(2).getReg());		LLT ShiftAmtTy = MRI.getType(MI.getOperand(2).getReg());
LLT SrcTy = MRI.getType(MI.getOperand(1).getReg());		LLT SrcTy = MRI.getType(MI.getOperand(1).getReg());
if (ShiftAmtTy.getSizeInBits() == 64 && SrcTy.getSizeInBits() == 32)		if (ShiftAmtTy.getSizeInBits() == 64 && SrcTy.getSizeInBits() == 32)
return getInstructionMapping(DefaultMappingID, 1,		return getInstructionMapping(DefaultMappingID, 1,
&ValMappings[Shift64Imm], 3);		&ValMappings[Shift64Imm], 3);
return getSameKindOfOperandsMapping(MI);		return getSameKindOfOperandsMapping(MI);
}		}
case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY: {
Register DstReg = MI.getOperand(0).getReg();		Register DstReg = MI.getOperand(0).getReg();
Register SrcReg = MI.getOperand(1).getReg();		Register SrcReg = MI.getOperand(1).getReg();
// Check if one of the register is not a generic register.		// Check if one of the register is not a generic register.
if ((Register::isPhysicalRegister(DstReg) \|\|		if ((Register::isPhysicalRegister(DstReg) \|\|
!MRI.getType(DstReg).isValid()) \|\|		!MRI.getType(DstReg).isValid()) \|\|
(Register::isPhysicalRegister(SrcReg) \|\|		(Register::isPhysicalRegister(SrcReg) \|\|
!MRI.getType(SrcReg).isValid())) {		!MRI.getType(SrcReg).isValid())) {
const RegisterBank *DstRB = getRegBank(DstReg, MRI, TRI);		const RegisterBank *DstRB = getRegBank(DstReg, MRI, TRI);
▲ Show 20 Lines • Show All 277 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/SIISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,814 Lines • ▼ Show 20 Lines	case AMDGPU::SI_INIT_EXEC_FROM_INPUT: {
MachineInstr FirstMI = &BB->begin();		MachineInstr FirstMI = &BB->begin();
MachineRegisterInfo &MRI = MF->getRegInfo();		MachineRegisterInfo &MRI = MF->getRegInfo();
Register InputReg = MI.getOperand(0).getReg();		Register InputReg = MI.getOperand(0).getReg();
Register CountReg = MRI.createVirtualRegister(&AMDGPU::SGPR_32RegClass);		Register CountReg = MRI.createVirtualRegister(&AMDGPU::SGPR_32RegClass);
bool Found = false;		bool Found = false;

// Move the COPY of the input reg to the beginning, so that we can use it.		// Move the COPY of the input reg to the beginning, so that we can use it.
for (auto I = BB->begin(); I != &MI; I++) {		for (auto I = BB->begin(); I != &MI; I++) {
if (I->getOpcode() != TargetOpcode::COPY \|\|		if (!I->isCopy() \|\| I->getOperand(0).getReg() != InputReg)
I->getOperand(0).getReg() != InputReg)
continue;		continue;

if (I == FirstMI) {		if (I == FirstMI) {
FirstMI = &*++BB->begin();		FirstMI = &*++BB->begin();
} else {		} else {
I->removeFromParent();		I->removeFromParent();
BB->insert(FirstMI, &*I);		BB->insert(FirstMI, &*I);
}		}
▲ Show 20 Lines • Show All 7,345 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/SIInstrInfo.cpp

Show First 20 Lines • Show All 796 Lines • ▼ Show 20 Lines	void SIInstrInfo::copyPhysReg(MachineBasicBlock &MBB,

for (unsigned Idx = 0; Idx < SubIndices.size(); ++Idx) {		for (unsigned Idx = 0; Idx < SubIndices.size(); ++Idx) {
unsigned SubIdx;		unsigned SubIdx;
if (Forward)		if (Forward)
SubIdx = SubIndices[Idx];		SubIdx = SubIndices[Idx];
else		else
SubIdx = SubIndices[SubIndices.size() - Idx - 1];		SubIdx = SubIndices[SubIndices.size() - Idx - 1];

if (Opcode == TargetOpcode::COPY) {		if (Opcode == TargetOpcode::COPY \|\| Opcode == TargetOpcode::TCOPY) {
copyPhysReg(MBB, MI, DL, RI.getSubReg(DestReg, SubIdx),		copyPhysReg(MBB, MI, DL, RI.getSubReg(DestReg, SubIdx),
RI.getSubReg(SrcReg, SubIdx), KillSrc);		RI.getSubReg(SrcReg, SubIdx), KillSrc);
continue;		continue;
}		}

MachineInstrBuilder Builder = BuildMI(MBB, MI, DL,		MachineInstrBuilder Builder = BuildMI(MBB, MI, DL,
get(Opcode), RI.getSubReg(DestReg, SubIdx));		get(Opcode), RI.getSubReg(DestReg, SubIdx));

▲ Show 20 Lines • Show All 6,123 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/BitTracker.cpp

Show First 20 Lines • Show All 733 Lines • ▼ Show 20 Lines	case TargetOpcode::REG_SEQUENCE: {
uint16_t W = getRegBitWidth(RD);		uint16_t W = getRegBitWidth(RD);
RegisterCell Res(W);		RegisterCell Res(W);
Res.insert(RegisterCell::ref(getCell(RS, Inputs)), mask(RD.Reg, SS));		Res.insert(RegisterCell::ref(getCell(RS, Inputs)), mask(RD.Reg, SS));
Res.insert(RegisterCell::ref(getCell(RT, Inputs)), mask(RD.Reg, ST));		Res.insert(RegisterCell::ref(getCell(RT, Inputs)), mask(RD.Reg, ST));
putCell(RD, Res, Outputs);		putCell(RD, Res, Outputs);
break;		break;
}		}

case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY: {
// COPY can transfer a smaller register into a wider one.		// COPY can transfer a smaller register into a wider one.
// If that is the case, fill the remaining high bits with 0.		// If that is the case, fill the remaining high bits with 0.
RegisterRef RD = MI.getOperand(0);		RegisterRef RD = MI.getOperand(0);
RegisterRef RS = MI.getOperand(1);		RegisterRef RS = MI.getOperand(1);
assert(RD.Sub == 0);		assert(RD.Sub == 0);
uint16_t WD = getRegBitWidth(RD);		uint16_t WD = getRegBitWidth(RD);
uint16_t WS = getRegBitWidth(RS);		uint16_t WS = getRegBitWidth(RS);
assert(WD >= WS);		assert(WD >= WS);
▲ Show 20 Lines • Show All 398 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/HexagonBitSimplify.cpp

Show First 20 Lines • Show All 1,302 Lines • ▼ Show 20 Lines	bool RedundantInstrElimination::processBlock(MachineBasicBlock &B,
if (!BT.reached(&B))		if (!BT.reached(&B))
return false;		return false;
bool Changed = false;		bool Changed = false;

for (auto I = B.begin(), E = B.end(), NextI = I; I != E; ++I) {		for (auto I = B.begin(), E = B.end(), NextI = I; I != E; ++I) {
NextI = std::next(I);		NextI = std::next(I);
MachineInstr MI = &I;		MachineInstr MI = &I;

if (MI->getOpcode() == TargetOpcode::COPY)		if (MI->isCopy())
continue;		continue;
if (MI->isPHI() \|\| MI->hasUnmodeledSideEffects() \|\| MI->isInlineAsm())		if (MI->isPHI() \|\| MI->hasUnmodeledSideEffects() \|\| MI->isInlineAsm())
continue;		continue;
unsigned NumD = MI->getDesc().getNumDefs();		unsigned NumD = MI->getDesc().getNumDefs();
if (NumD != 1)		if (NumD != 1)
continue;		continue;

BitTracker::RegisterRef RD = MI->getOperand(0);		BitTracker::RegisterRef RD = MI->getOperand(0);
▲ Show 20 Lines • Show All 327 Lines • ▼ Show 20 Lines	bool CopyGeneration::processBlock(MachineBasicBlock &B,
}		}

return Changed;		return Changed;
}		}

bool CopyPropagation::isCopyReg(unsigned Opc, bool NoConv) {		bool CopyPropagation::isCopyReg(unsigned Opc, bool NoConv) {
switch (Opc) {		switch (Opc) {
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::REG_SEQUENCE:		case TargetOpcode::REG_SEQUENCE:
case Hexagon::A4_combineir:		case Hexagon::A4_combineir:
case Hexagon::A4_combineri:		case Hexagon::A4_combineri:
return true;		return true;
case Hexagon::A2_tfr:		case Hexagon::A2_tfr:
case Hexagon::A2_tfrp:		case Hexagon::A2_tfrp:
case Hexagon::A2_combinew:		case Hexagon::A2_combinew:
case Hexagon::V6_vcombine:		case Hexagon::V6_vcombine:
return NoConv;		return NoConv;
default:		default:
break;		break;
}		}
return false;		return false;
}		}

bool CopyPropagation::propagateRegCopy(MachineInstr &MI) {		bool CopyPropagation::propagateRegCopy(MachineInstr &MI) {
bool Changed = false;		bool Changed = false;
unsigned Opc = MI.getOpcode();		unsigned Opc = MI.getOpcode();
BitTracker::RegisterRef RD = MI.getOperand(0);		BitTracker::RegisterRef RD = MI.getOperand(0);
assert(MI.getOperand(0).getSubReg() == 0);		assert(MI.getOperand(0).getSubReg() == 0);

switch (Opc) {		switch (Opc) {
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case Hexagon::A2_tfr:		case Hexagon::A2_tfr:
case Hexagon::A2_tfrp: {		case Hexagon::A2_tfrp: {
BitTracker::RegisterRef RS = MI.getOperand(1);		BitTracker::RegisterRef RS = MI.getOperand(1);
if (!HBS::isTransparentCopy(RD, RS, MRI))		if (!HBS::isTransparentCopy(RD, RS, MRI))
break;		break;
if (RS.Sub != 0)		if (RS.Sub != 0)
Changed = HBS::replaceRegWithSub(RD.Reg, RS.Reg, RS.Sub, MRI);		Changed = HBS::replaceRegWithSub(RD.Reg, RS.Reg, RS.Sub, MRI);
else		else
▲ Show 20 Lines • Show All 1,022 Lines • ▼ Show 20 Lines	bool BitSimplification::processBlock(MachineBasicBlock &B,
RegisterSet AVB = AVs;		RegisterSet AVB = AVs;
RegisterSet Defs;		RegisterSet Defs;

for (auto I = B.begin(), E = B.end(); I != E; ++I, AVB.insert(Defs)) {		for (auto I = B.begin(), E = B.end(); I != E; ++I, AVB.insert(Defs)) {
MachineInstr MI = &I;		MachineInstr MI = &I;
Defs.clear();		Defs.clear();
HBS::getInstrDefs(*MI, Defs);		HBS::getInstrDefs(*MI, Defs);

unsigned Opc = MI->getOpcode();		if (MI->isCopy() \|\| MI->getOpcode() == TargetOpcode::REG_SEQUENCE)
if (Opc == TargetOpcode::COPY \|\| Opc == TargetOpcode::REG_SEQUENCE)
continue;		continue;

if (MI->mayStore()) {		if (MI->mayStore()) {
bool T = genStoreUpperHalf(MI);		bool T = genStoreUpperHalf(MI);
T = T \|\| genStoreImmediate(MI);		T = T \|\| genStoreImmediate(MI);
Changed \|= T;		Changed \|= T;
continue;		continue;
}		}
▲ Show 20 Lines • Show All 256 Lines • ▼ Show 20 Lines	bool HexagonLoopRescheduling::isConst(unsigned Reg) const {
return true;		return true;
}		}

bool HexagonLoopRescheduling::isBitShuffle(const MachineInstr *MI,		bool HexagonLoopRescheduling::isBitShuffle(const MachineInstr *MI,
unsigned DefR) const {		unsigned DefR) const {
unsigned Opc = MI->getOpcode();		unsigned Opc = MI->getOpcode();
switch (Opc) {		switch (Opc) {
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case Hexagon::S2_lsr_i_r:		case Hexagon::S2_lsr_i_r:
case Hexagon::S2_asr_i_r:		case Hexagon::S2_asr_i_r:
case Hexagon::S2_asl_i_r:		case Hexagon::S2_asl_i_r:
case Hexagon::S2_lsr_i_p:		case Hexagon::S2_lsr_i_p:
case Hexagon::S2_asr_i_p:		case Hexagon::S2_asr_i_p:
case Hexagon::S2_asl_i_p:		case Hexagon::S2_asl_i_p:
case Hexagon::S2_insert:		case Hexagon::S2_insert:
case Hexagon::A2_or:		case Hexagon::A2_or:
▲ Show 20 Lines • Show All 380 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/HexagonFrameLowering.cpp

Show First 20 Lines • Show All 2,085 Lines • ▼ Show 20 Lines	for (auto &B : MF) {
MachineBasicBlock::iterator NextI;		MachineBasicBlock::iterator NextI;
for (auto I = B.begin(), E = B.end(); I != E; I = NextI) {		for (auto I = B.begin(), E = B.end(); I != E; I = NextI) {
MachineInstr MI = &I;		MachineInstr MI = &I;
NextI = std::next(I);		NextI = std::next(I);
unsigned Opc = MI->getOpcode();		unsigned Opc = MI->getOpcode();

switch (Opc) {		switch (Opc) {
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
Changed \|= expandCopy(B, I, MRI, HII, NewRegs);		Changed \|= expandCopy(B, I, MRI, HII, NewRegs);
break;		break;
case Hexagon::STriw_pred:		case Hexagon::STriw_pred:
case Hexagon::STriw_ctr:		case Hexagon::STriw_ctr:
Changed \|= expandStoreInt(B, I, MRI, HII, NewRegs);		Changed \|= expandStoreInt(B, I, MRI, HII, NewRegs);
break;		break;
case Hexagon::LDriw_pred:		case Hexagon::LDriw_pred:
case Hexagon::LDriw_ctr:		case Hexagon::LDriw_ctr:
▲ Show 20 Lines • Show All 634 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/HexagonGenPredicate.cpp

Show First 20 Lines • Show All 206 Lines • ▼ Show 20 Lines	void HexagonGenPredicate::collectPredicateGPR(MachineFunction &MF) {
for (MachineFunction::iterator A = MF.begin(), Z = MF.end(); A != Z; ++A) {		for (MachineFunction::iterator A = MF.begin(), Z = MF.end(); A != Z; ++A) {
MachineBasicBlock &B = *A;		MachineBasicBlock &B = *A;
for (MachineBasicBlock::iterator I = B.begin(), E = B.end(); I != E; ++I) {		for (MachineBasicBlock::iterator I = B.begin(), E = B.end(); I != E; ++I) {
MachineInstr MI = &I;		MachineInstr MI = &I;
unsigned Opc = MI->getOpcode();		unsigned Opc = MI->getOpcode();
switch (Opc) {		switch (Opc) {
case Hexagon::C2_tfrpr:		case Hexagon::C2_tfrpr:
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
if (isPredReg(MI->getOperand(1).getReg())) {		if (isPredReg(MI->getOperand(1).getReg())) {
RegisterSubReg RD = MI->getOperand(0);		RegisterSubReg RD = MI->getOperand(0);
if (Register::isVirtualRegister(RD.R))		if (Register::isVirtualRegister(RD.R))
PredGPRs.insert(RD);		PredGPRs.insert(RD);
}		}
break;		break;
}		}
}		}
Show All 27 Lines	RegisterSubReg HexagonGenPredicate::getPredRegFor(const RegisterSubReg &Reg) {
RegToRegMap::iterator F = G2P.find(Reg);		RegToRegMap::iterator F = G2P.find(Reg);
if (F != G2P.end())		if (F != G2P.end())
return F->second;		return F->second;

LLVM_DEBUG(dbgs() << __func__ << ": " << PrintRegister(Reg, *TRI));		LLVM_DEBUG(dbgs() << __func__ << ": " << PrintRegister(Reg, *TRI));
MachineInstr *DefI = MRI->getVRegDef(Reg.R);		MachineInstr *DefI = MRI->getVRegDef(Reg.R);
assert(DefI);		assert(DefI);
unsigned Opc = DefI->getOpcode();		unsigned Opc = DefI->getOpcode();
if (Opc == Hexagon::C2_tfrpr \|\| Opc == TargetOpcode::COPY) {		if (Opc == Hexagon::C2_tfrpr \|\| DefI->isCopy()) {
assert(DefI->getOperand(0).isDef() && DefI->getOperand(1).isUse());		assert(DefI->getOperand(0).isDef() && DefI->getOperand(1).isUse());
RegisterSubReg PR = DefI->getOperand(1);		RegisterSubReg PR = DefI->getOperand(1);
G2P.insert(std::make_pair(Reg, PR));		G2P.insert(std::make_pair(Reg, PR));
LLVM_DEBUG(dbgs() << " -> " << PrintRegister(PR, *TRI) << '\n');		LLVM_DEBUG(dbgs() << " -> " << PrintRegister(PR, *TRI) << '\n');
return PR;		return PR;
}		}

MachineBasicBlock &B = *DefI->getParent();		MachineBasicBlock &B = *DefI->getParent();
▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	bool HexagonGenPredicate::isScalarPred(RegisterSubReg PredReg) {
while (!WorkQ.empty()) {		while (!WorkQ.empty()) {
RegisterSubReg PR = WorkQ.front();		RegisterSubReg PR = WorkQ.front();
WorkQ.pop();		WorkQ.pop();
const MachineInstr *DefI = MRI->getVRegDef(PR.R);		const MachineInstr *DefI = MRI->getVRegDef(PR.R);
if (!DefI)		if (!DefI)
return false;		return false;
unsigned DefOpc = DefI->getOpcode();		unsigned DefOpc = DefI->getOpcode();
switch (DefOpc) {		switch (DefOpc) {
case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - case TargetOpcode::COPY: - case TargetOpcode::TCOPY: { - const TargetRegisterClass PredRC = &Hexagon::PredRegsRegClass; - if (MRI->getRegClass(PR.R) != PredRC) - return false; - // If it is a copy between two predicate registers, fall through. - LLVM_FALLTHROUGH; - } + case TargetOpcode::COPY: + case TargetOpcode::TCOPY: { 6 diff lines are omitted. See full diff. Lint: Pre-merge checks:* clang-format: please reformat the code ``` - case TargetOpcode::COPY: - case…
		case TargetOpcode::TCOPY: {
const TargetRegisterClass *PredRC = &Hexagon::PredRegsRegClass;		const TargetRegisterClass *PredRC = &Hexagon::PredRegsRegClass;
if (MRI->getRegClass(PR.R) != PredRC)		if (MRI->getRegClass(PR.R) != PredRC)
return false;		return false;
// If it is a copy between two predicate registers, fall through.		// If it is a copy between two predicate registers, fall through.
LLVM_FALLTHROUGH;		LLVM_FALLTHROUGH;
}		}
case Hexagon::C2_and:		case Hexagon::C2_and:
case Hexagon::C2_andn:		case Hexagon::C2_andn:
▲ Show 20 Lines • Show All 120 Lines • ▼ Show 20 Lines	bool HexagonGenPredicate::eliminatePredCopies(MachineFunction &MF) {
// PredR2 = PredR1		// PredR2 = PredR1
// Such sequences can be generated when a copy-into-pred is generated from		// Such sequences can be generated when a copy-into-pred is generated from
// a gpr register holding a result of a convertible instruction. After		// a gpr register holding a result of a convertible instruction. After
// the convertible instruction is converted, its predicate result will be		// the convertible instruction is converted, its predicate result will be
// copied back into the original gpr.		// copied back into the original gpr.

for (MachineBasicBlock &MBB : MF) {		for (MachineBasicBlock &MBB : MF) {
for (MachineInstr &MI : MBB) {		for (MachineInstr &MI : MBB) {
if (MI.getOpcode() != TargetOpcode::COPY)		if (!MI.isCopy())
continue;		continue;
RegisterSubReg DR = MI.getOperand(0);		RegisterSubReg DR = MI.getOperand(0);
RegisterSubReg SR = MI.getOperand(1);		RegisterSubReg SR = MI.getOperand(1);
if (!Register::isVirtualRegister(DR.R))		if (!Register::isVirtualRegister(DR.R))
continue;		continue;
if (!Register::isVirtualRegister(SR.R))		if (!Register::isVirtualRegister(SR.R))
continue;		continue;
if (MRI->getRegClass(DR.R) != PredRC)		if (MRI->getRegClass(DR.R) != PredRC)
▲ Show 20 Lines • Show All 63 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/HexagonHardwareLoops.cpp

Show First 20 Lines • Show All 1,510 Lines • ▼ Show 20 Lines	bool HexagonHardwareLoops::checkForImmediate(const MachineOperand &MO,

Register R = MO.getReg();		Register R = MO.getReg();
if (!Register::isVirtualRegister(R))		if (!Register::isVirtualRegister(R))
return false;		return false;
MachineInstr *DI = MRI->getVRegDef(R);		MachineInstr *DI = MRI->getVRegDef(R);
unsigned DOpc = DI->getOpcode();		unsigned DOpc = DI->getOpcode();
switch (DOpc) {		switch (DOpc) {
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case Hexagon::A2_tfrsi:		case Hexagon::A2_tfrsi:
case Hexagon::A2_tfrpi:		case Hexagon::A2_tfrpi:
case Hexagon::CONST32:		case Hexagon::CONST32:
case Hexagon::CONST64:		case Hexagon::CONST64:
// Call recursively to avoid an extra check whether operand(1) is		// Call recursively to avoid an extra check whether operand(1) is
// indeed an immediate (it could be a global address, for example),		// indeed an immediate (it could be a global address, for example),
// plus we can handle COPY at the same time.		// plus we can handle COPY at the same time.
if (!checkForImmediate(DI->getOperand(1), TV))		if (!checkForImmediate(DI->getOperand(1), TV))
▲ Show 20 Lines • Show All 481 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/HexagonISelDAGToDAGHVX.cpp

Show First 20 Lines • Show All 1,013 Lines • ▼ Show 20 Lines	for (const OpRef &R : Node.Ops) {
unsigned Sub = (Part == OpRef::LoHalf) ? Hexagon::vsub_lo		unsigned Sub = (Part == OpRef::LoHalf) ? Hexagon::vsub_lo
: Hexagon::vsub_hi;		: Hexagon::vsub_hi;
Op = DAG.getTargetExtractSubreg(Sub, dl, HalfTy, Op);		Op = DAG.getTargetExtractSubreg(Sub, dl, HalfTy, Op);
}		}
Ops.push_back(Op);		Ops.push_back(Op);
} // for (Node : Results)		} // for (Node : Results)

assert(Node.Ty != MVT::Other);		assert(Node.Ty != MVT::Other);
SDNode *ResN = (Node.Opc == TargetOpcode::COPY)		SDNode *ResN =
		(Node.Opc == TargetOpcode::COPY \|\| Node.Opc == TargetOpcode::TCOPY)
? Ops.front().getNode()		? Ops.front().getNode()
: DAG.getMachineNode(Node.Opc, dl, Node.Ty, Ops);		: DAG.getMachineNode(Node.Opc, dl, Node.Ty, Ops);
Output.push_back(SDValue(ResN, 0));		Output.push_back(SDValue(ResN, 0));
}		}

SDNode *OutN = Output.back().getNode();		SDNode *OutN = Output.back().getNode();
SDNode *InpN = Results.InpNode;		SDNode *InpN = Results.InpNode;
DEBUG_WITH_TYPE("isel", {		DEBUG_WITH_TYPE("isel", {
dbgs() << "Generated node:\n";		dbgs() << "Generated node:\n";
OutN->dumpr(&DAG);		OutN->dumpr(&DAG);
▲ Show 20 Lines • Show All 1,206 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/HexagonInstrInfo.cpp

Show First 20 Lines • Show All 1,025 Lines • ▼ Show 20 Lines	auto UseAligned = [&] (const MachineInstr &MI, unsigned NeedAlign) {
if (MI.memoperands().empty())		if (MI.memoperands().empty())
return false;		return false;
return all_of(MI.memoperands(), [NeedAlign](const MachineMemOperand *MMO) {		return all_of(MI.memoperands(), [NeedAlign](const MachineMemOperand *MMO) {
return MMO->getAlign() >= NeedAlign;		return MMO->getAlign() >= NeedAlign;
});		});
};		};

switch (Opc) {		switch (Opc) {
case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - case TargetOpcode::COPY: - case TargetOpcode::TCOPY: { - MachineOperand &MD = MI.getOperand(0); - MachineOperand &MS = MI.getOperand(1); - MachineBasicBlock::iterator MBBI = MI.getIterator(); - if (MD.getReg() != MS.getReg() && !MS.isUndef()) { - copyPhysReg(MBB, MI, DL, MD.getReg(), MS.getReg(), MS.isKill()); - std::prev(MBBI)->copyImplicitOps(MBB.getParent(), MI); - } - MBB.erase(MBBI); 9 diff lines are omitted. See full diff. Lint: Pre-merge checks:* clang-format: please reformat the code ``` - case TargetOpcode::COPY: - case TargetOpcode…
		case TargetOpcode::TCOPY: {
MachineOperand &MD = MI.getOperand(0);		MachineOperand &MD = MI.getOperand(0);
MachineOperand &MS = MI.getOperand(1);		MachineOperand &MS = MI.getOperand(1);
MachineBasicBlock::iterator MBBI = MI.getIterator();		MachineBasicBlock::iterator MBBI = MI.getIterator();
if (MD.getReg() != MS.getReg() && !MS.isUndef()) {		if (MD.getReg() != MS.getReg() && !MS.isUndef()) {
copyPhysReg(MBB, MI, DL, MD.getReg(), MS.getReg(), MS.isKill());		copyPhysReg(MBB, MI, DL, MD.getReg(), MS.getReg(), MS.isKill());
std::prev(MBBI)->copyImplicitOps(*MBB.getParent(), MI);		std::prev(MBBI)->copyImplicitOps(*MBB.getParent(), MI);
}		}
MBB.erase(MBBI);		MBB.erase(MBBI);
return true;		return true;
}		}
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code + MBB.erase(MBBI); + return true; + } Lint: Pre-merge checks: clang-format: please reformat the code ``` + MBB.erase(MBBI); + return true; + } ```
case Hexagon::PS_aligna:		case Hexagon::PS_aligna:
BuildMI(MBB, MI, DL, get(Hexagon::A2_andir), MI.getOperand(0).getReg())		BuildMI(MBB, MI, DL, get(Hexagon::A2_andir), MI.getOperand(0).getReg())
.addReg(HRI.getFrameRegister())		.addReg(HRI.getFrameRegister())
.addImm(-MI.getOperand(1).getImm());		.addImm(-MI.getOperand(1).getImm());
MBB.erase(MI);		MBB.erase(MI);
return true;		return true;
case Hexagon::V6_vassignp: {		case Hexagon::V6_vassignp: {
Register SrcReg = MI.getOperand(1).getReg();		Register SrcReg = MI.getOperand(1).getReg();
▲ Show 20 Lines • Show All 1,301 Lines • ▼ Show 20 Lines
bool HexagonInstrInfo::isLateResultInstr(const MachineInstr &MI) const {		bool HexagonInstrInfo::isLateResultInstr(const MachineInstr &MI) const {
switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
case TargetOpcode::EXTRACT_SUBREG:		case TargetOpcode::EXTRACT_SUBREG:
case TargetOpcode::INSERT_SUBREG:		case TargetOpcode::INSERT_SUBREG:
case TargetOpcode::SUBREG_TO_REG:		case TargetOpcode::SUBREG_TO_REG:
case TargetOpcode::REG_SEQUENCE:		case TargetOpcode::REG_SEQUENCE:
case TargetOpcode::IMPLICIT_DEF:		case TargetOpcode::IMPLICIT_DEF:
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::INLINEASM:		case TargetOpcode::INLINEASM:
case TargetOpcode::PHI:		case TargetOpcode::PHI:
return false;		return false;
default:		default:
break;		break;
}		}

unsigned SchedClass = MI.getDesc().getSchedClass();		unsigned SchedClass = MI.getDesc().getSchedClass();
▲ Show 20 Lines • Show All 2,284 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/HexagonMachineScheduler.cpp

Show First 20 Lines • Show All 105 Lines • ▼ Show 20 Lines	if (!ResourcesModel->canReserveResources(*SU->getInstr()))
return false;		return false;
break;		break;
case TargetOpcode::EXTRACT_SUBREG:		case TargetOpcode::EXTRACT_SUBREG:
case TargetOpcode::INSERT_SUBREG:		case TargetOpcode::INSERT_SUBREG:
case TargetOpcode::SUBREG_TO_REG:		case TargetOpcode::SUBREG_TO_REG:
case TargetOpcode::REG_SEQUENCE:		case TargetOpcode::REG_SEQUENCE:
case TargetOpcode::IMPLICIT_DEF:		case TargetOpcode::IMPLICIT_DEF:
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::INLINEASM:		case TargetOpcode::INLINEASM:
case TargetOpcode::INLINEASM_BR:		case TargetOpcode::INLINEASM_BR:
break;		break;
}		}

MachineBasicBlock *MBB = SU->getInstr()->getParent();		MachineBasicBlock *MBB = SU->getInstr()->getParent();
auto &QST = MBB->getParent()->getSubtarget<HexagonSubtarget>();		auto &QST = MBB->getParent()->getSubtarget<HexagonSubtarget>();
const auto &QII = *QST.getInstrInfo();		const auto &QII = *QST.getInstrInfo();
Show All 40 Lines	bool VLIWResourceModel::reserveResources(SUnit *SU, bool IsTop) {
case TargetOpcode::INSERT_SUBREG:		case TargetOpcode::INSERT_SUBREG:
case TargetOpcode::SUBREG_TO_REG:		case TargetOpcode::SUBREG_TO_REG:
case TargetOpcode::REG_SEQUENCE:		case TargetOpcode::REG_SEQUENCE:
case TargetOpcode::IMPLICIT_DEF:		case TargetOpcode::IMPLICIT_DEF:
case TargetOpcode::KILL:		case TargetOpcode::KILL:
case TargetOpcode::CFI_INSTRUCTION:		case TargetOpcode::CFI_INSTRUCTION:
case TargetOpcode::EH_LABEL:		case TargetOpcode::EH_LABEL:
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::INLINEASM:		case TargetOpcode::INLINEASM:
case TargetOpcode::INLINEASM_BR:		case TargetOpcode::INLINEASM_BR:
break;		break;
}		}
Packet.push_back(SU);		Packet.push_back(SU);

#ifndef NDEBUG		#ifndef NDEBUG
LLVM_DEBUG(dbgs() << "Packet[" << TotalPackets << "]:\n");		LLVM_DEBUG(dbgs() << "Packet[" << TotalPackets << "]:\n");
▲ Show 20 Lines • Show All 825 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/HexagonNewValueJump.cpp

Show First 20 Lines • Show All 213 Lines • ▼ Show 20 Lines	if (!afterRA) {
// KILL sets kill flag on the opcode. It also sets up a		// KILL sets kill flag on the opcode. It also sets up a
// single register, out of pair.		// single register, out of pair.
// %d0 = S2_lsr_r_p killed %d0, killed %r2		// %d0 = S2_lsr_r_p killed %d0, killed %r2
// %r0 = KILL %r0, implicit killed %d0		// %r0 = KILL %r0, implicit killed %d0
// %p0 = C2_cmpeqi killed %r0, 0		// %p0 = C2_cmpeqi killed %r0, 0
// PHI can be anything after RA.		// PHI can be anything after RA.
// COPY can remateriaze things in between feeder, compare and nvj.		// COPY can remateriaze things in between feeder, compare and nvj.
if (MII->getOpcode() == TargetOpcode::KILL \|\|		if (MII->getOpcode() == TargetOpcode::KILL \|\|
MII->getOpcode() == TargetOpcode::PHI \|\|		MII->getOpcode() == TargetOpcode::PHI \|\| MII->isCopy())
MII->getOpcode() == TargetOpcode::COPY)
return false;		return false;

// The following pseudo Hexagon instructions sets "use" and "def"		// The following pseudo Hexagon instructions sets "use" and "def"
// of registers by individual passes in the backend. At this time,		// of registers by individual passes in the backend. At this time,
// we don't know the scope of usage and definitions of these		// we don't know the scope of usage and definitions of these
// instructions.		// instructions.
if (MII->getOpcode() == Hexagon::LDriw_pred \|\|		if (MII->getOpcode() == Hexagon::LDriw_pred \|\|
MII->getOpcode() == Hexagon::STriw_pred)		MII->getOpcode() == Hexagon::STriw_pred)
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines	if (cmpReg1 == cmpOp2)
return false;		return false;

// Make sure that the second register is not from COPY		// Make sure that the second register is not from COPY
// at machine code level, we don't need this, but if we decide		// at machine code level, we don't need this, but if we decide
// to move new value jump prior to RA, we would be needing this.		// to move new value jump prior to RA, we would be needing this.
MachineRegisterInfo &MRI = MF.getRegInfo();		MachineRegisterInfo &MRI = MF.getRegInfo();
if (secondReg && !Register::isPhysicalRegister(cmpOp2)) {		if (secondReg && !Register::isPhysicalRegister(cmpOp2)) {
MachineInstr *def = MRI.getVRegDef(cmpOp2);		MachineInstr *def = MRI.getVRegDef(cmpOp2);
if (def->getOpcode() == TargetOpcode::COPY)		if (def->isCopy())
return false;		return false;
}		}
}		}

// Walk the instructions after the compare (predicate def) to the jump,		// Walk the instructions after the compare (predicate def) to the jump,
// and satisfy the following conditions.		// and satisfy the following conditions.
++II;		++II;
for (MachineBasicBlock::iterator localII = II; localII != end; ++localII) {		for (MachineBasicBlock::iterator localII = II; localII != end; ++localII) {
▲ Show 20 Lines • Show All 423 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/HexagonSplitDouble.cpp

Show First 20 Lines • Show All 166 Lines • ▼ Show 20 Lines	bool HexagonSplitDoubleRegs::isFixedInstr(const MachineInstr *MI) const {

unsigned Opc = MI->getOpcode();		unsigned Opc = MI->getOpcode();
switch (Opc) {		switch (Opc) {
default:		default:
return true;		return true;

case TargetOpcode::PHI:		case TargetOpcode::PHI:
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
break;		break;

case Hexagon::L2_loadrd_io:		case Hexagon::L2_loadrd_io:
// Not handling stack stores (only reg-based addresses).		// Not handling stack stores (only reg-based addresses).
if (MI->getOperand(1).isReg())		if (MI->getOperand(1).isReg())
break;		break;
return true;		return true;
case Hexagon::S2_storerd_io:		case Hexagon::S2_storerd_io:
▲ Show 20 Lines • Show All 134 Lines • ▼ Show 20 Lines	int32_t HexagonSplitDoubleRegs::profit(const MachineInstr *MI) const {
unsigned Opc = MI->getOpcode();		unsigned Opc = MI->getOpcode();
switch (Opc) {		switch (Opc) {
case TargetOpcode::PHI:		case TargetOpcode::PHI:
for (const auto &Op : MI->operands())		for (const auto &Op : MI->operands())
if (!Op.getSubReg())		if (!Op.getSubReg())
return 0;		return 0;
return 10;		return 10;
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
if (MI->getOperand(1).getSubReg() != 0)		if (MI->getOperand(1).getSubReg() != 0)
return 10;		return 10;
return 0;		return 0;

case Hexagon::L2_loadrd_io:		case Hexagon::L2_loadrd_io:
case Hexagon::S2_storerd_io:		case Hexagon::S2_storerd_io:
return -1;		return -1;
case Hexagon::L2_loadrd_pi:		case Hexagon::L2_loadrd_pi:
▲ Show 20 Lines • Show All 664 Lines • ▼ Show 20 Lines	bool HexagonSplitDoubleRegs::splitInstr(MachineInstr *MI,
using namespace Hexagon;		using namespace Hexagon;

LLVM_DEBUG(dbgs() << "Splitting: " << *MI);		LLVM_DEBUG(dbgs() << "Splitting: " << *MI);
bool Split = false;		bool Split = false;
unsigned Opc = MI->getOpcode();		unsigned Opc = MI->getOpcode();

switch (Opc) {		switch (Opc) {
case TargetOpcode::PHI:		case TargetOpcode::PHI:
case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY: {
Register DstR = MI->getOperand(0).getReg();		Register DstR = MI->getOperand(0).getReg();
if (MRI->getRegClass(DstR) == DoubleRC) {		if (MRI->getRegClass(DstR) == DoubleRC) {
createHalfInstr(Opc, MI, PairMap, isub_lo);		createHalfInstr(Opc, MI, PairMap, isub_lo);
createHalfInstr(Opc, MI, PairMap, isub_hi);		createHalfInstr(Opc, MI, PairMap, isub_hi);
Split = true;		Split = true;
}		}
break;		break;
}		}
▲ Show 20 Lines • Show All 226 Lines • Show Last 20 Lines

llvm/lib/Target/Hexagon/RDFCopy.cpp

	Show All 34 Lines
	#ifndef NDEBUG			#ifndef NDEBUG
	static cl::opt<unsigned> CpLimit("rdf-cp-limit", cl::init(0), cl::Hidden);			static cl::opt<unsigned> CpLimit("rdf-cp-limit", cl::init(0), cl::Hidden);
	static unsigned CpCount = 0;			static unsigned CpCount = 0;
	#endif			#endif

	bool CopyPropagation::interpretAsCopy(const MachineInstr *MI, EqualityMap &EM) {			bool CopyPropagation::interpretAsCopy(const MachineInstr *MI, EqualityMap &EM) {
	unsigned Opc = MI->getOpcode();			unsigned Opc = MI->getOpcode();
	switch (Opc) {			switch (Opc) {
	case TargetOpcode::COPY: {			case TargetOpcode::COPY:
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - case TargetOpcode::COPY: - case TargetOpcode::TCOPY: { - const MachineOperand &Dst = MI->getOperand(0); - const MachineOperand &Src = MI->getOperand(1); - RegisterRef DstR = DFG.makeRegRef(Dst.getReg(), Dst.getSubReg()); - RegisterRef SrcR = DFG.makeRegRef(Src.getReg(), Src.getSubReg()); - assert(Register::isPhysicalRegister(DstR.Reg)); - assert(Register::isPhysicalRegister(SrcR.Reg)); - const TargetRegisterInfo &TRI = DFG.getTRI(); - if (TRI.getMinimalPhysRegClass(DstR.Reg) != 20 diff lines are omitted. See full diff. Lint: Pre-merge checks: clang-format: please reformat the code ``` - case TargetOpcode::COPY: - case TargetOpcode…
				case TargetOpcode::TCOPY: {
	const MachineOperand &Dst = MI->getOperand(0);			const MachineOperand &Dst = MI->getOperand(0);
	const MachineOperand &Src = MI->getOperand(1);			const MachineOperand &Src = MI->getOperand(1);
	RegisterRef DstR = DFG.makeRegRef(Dst.getReg(), Dst.getSubReg());			RegisterRef DstR = DFG.makeRegRef(Dst.getReg(), Dst.getSubReg());
	RegisterRef SrcR = DFG.makeRegRef(Src.getReg(), Src.getSubReg());			RegisterRef SrcR = DFG.makeRegRef(Src.getReg(), Src.getSubReg());
	assert(Register::isPhysicalRegister(DstR.Reg));			assert(Register::isPhysicalRegister(DstR.Reg));
	assert(Register::isPhysicalRegister(SrcR.Reg));			assert(Register::isPhysicalRegister(SrcR.Reg));
	const TargetRegisterInfo &TRI = DFG.getTRI();			const TargetRegisterInfo &TRI = DFG.getTRI();
	if (TRI.getMinimalPhysRegClass(DstR.Reg) !=			if (TRI.getMinimalPhysRegClass(DstR.Reg) !=
	▲ Show 20 Lines • Show All 162 Lines • Show Last 20 Lines

llvm/lib/Target/Mips/MipsRegisterBankInfo.cpp

Show First 20 Lines • Show All 179 Lines • ▼ Show 20 Lines

void MipsRegisterBankInfo::AmbiguousRegDefUseContainer::addDefUses(		void MipsRegisterBankInfo::AmbiguousRegDefUseContainer::addDefUses(
Register Reg, const MachineRegisterInfo &MRI) {		Register Reg, const MachineRegisterInfo &MRI) {
assert(!MRI.getType(Reg).isPointer() &&		assert(!MRI.getType(Reg).isPointer() &&
"Pointers are gprb, they should not be considered as ambiguous.\n");		"Pointers are gprb, they should not be considered as ambiguous.\n");
for (MachineInstr &UseMI : MRI.use_instructions(Reg)) {		for (MachineInstr &UseMI : MRI.use_instructions(Reg)) {
MachineInstr *NonCopyInstr = skipCopiesOutgoing(&UseMI);		MachineInstr *NonCopyInstr = skipCopiesOutgoing(&UseMI);
// Copy with many uses.		// Copy with many uses.
if (NonCopyInstr->getOpcode() == TargetOpcode::COPY &&		if (NonCopyInstr->isCopy() &&
!Register::isPhysicalRegister(NonCopyInstr->getOperand(0).getReg()))		!Register::isPhysicalRegister(NonCopyInstr->getOperand(0).getReg()))
addDefUses(NonCopyInstr->getOperand(0).getReg(), MRI);		addDefUses(NonCopyInstr->getOperand(0).getReg(), MRI);
else		else
DefUses.push_back(skipCopiesOutgoing(&UseMI));		DefUses.push_back(skipCopiesOutgoing(&UseMI));
}		}
}		}

void MipsRegisterBankInfo::AmbiguousRegDefUseContainer::addUseDef(		void MipsRegisterBankInfo::AmbiguousRegDefUseContainer::addUseDef(
Register Reg, const MachineRegisterInfo &MRI) {		Register Reg, const MachineRegisterInfo &MRI) {
assert(!MRI.getType(Reg).isPointer() &&		assert(!MRI.getType(Reg).isPointer() &&
"Pointers are gprb, they should not be considered as ambiguous.\n");		"Pointers are gprb, they should not be considered as ambiguous.\n");
MachineInstr *DefMI = MRI.getVRegDef(Reg);		MachineInstr *DefMI = MRI.getVRegDef(Reg);
UseDefs.push_back(skipCopiesIncoming(DefMI));		UseDefs.push_back(skipCopiesIncoming(DefMI));
}		}

MachineInstr *		MachineInstr *
MipsRegisterBankInfo::AmbiguousRegDefUseContainer::skipCopiesOutgoing(		MipsRegisterBankInfo::AmbiguousRegDefUseContainer::skipCopiesOutgoing(
MachineInstr *MI) const {		MachineInstr *MI) const {
const MachineFunction &MF = *MI->getParent()->getParent();		const MachineFunction &MF = *MI->getParent()->getParent();
const MachineRegisterInfo &MRI = MF.getRegInfo();		const MachineRegisterInfo &MRI = MF.getRegInfo();
MachineInstr *Ret = MI;		MachineInstr *Ret = MI;
while (Ret->getOpcode() == TargetOpcode::COPY &&		while (Ret->isCopy() &&
!Register::isPhysicalRegister(Ret->getOperand(0).getReg()) &&		!Register::isPhysicalRegister(Ret->getOperand(0).getReg()) &&
MRI.hasOneUse(Ret->getOperand(0).getReg())) {		MRI.hasOneUse(Ret->getOperand(0).getReg())) {
Ret = &(*MRI.use_instr_begin(Ret->getOperand(0).getReg()));		Ret = &(*MRI.use_instr_begin(Ret->getOperand(0).getReg()));
}		}
return Ret;		return Ret;
}		}

MachineInstr *		MachineInstr *
MipsRegisterBankInfo::AmbiguousRegDefUseContainer::skipCopiesIncoming(		MipsRegisterBankInfo::AmbiguousRegDefUseContainer::skipCopiesIncoming(
MachineInstr *MI) const {		MachineInstr *MI) const {
const MachineFunction &MF = *MI->getParent()->getParent();		const MachineFunction &MF = *MI->getParent()->getParent();
const MachineRegisterInfo &MRI = MF.getRegInfo();		const MachineRegisterInfo &MRI = MF.getRegInfo();
MachineInstr *Ret = MI;		MachineInstr *Ret = MI;
while (Ret->getOpcode() == TargetOpcode::COPY &&		while (Ret->isCopy() &&
!Register::isPhysicalRegister(Ret->getOperand(1).getReg()))		!Register::isPhysicalRegister(Ret->getOperand(1).getReg()))
Ret = MRI.getVRegDef(Ret->getOperand(1).getReg());		Ret = MRI.getVRegDef(Ret->getOperand(1).getReg());
return Ret;		return Ret;
}		}

MipsRegisterBankInfo::AmbiguousRegDefUseContainer::AmbiguousRegDefUseContainer(		MipsRegisterBankInfo::AmbiguousRegDefUseContainer::AmbiguousRegDefUseContainer(
const MachineInstr *MI) {		const MachineInstr *MI) {
assert(isAmbiguous(MI->getOpcode()) &&		assert(isAmbiguous(MI->getOpcode()) &&
▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	while (!AdjacentInstrs.empty()) {
if (isDefUse ? isFloatingPointOpcodeUse(AdjMI->getOpcode())		if (isDefUse ? isFloatingPointOpcodeUse(AdjMI->getOpcode())
: isFloatingPointOpcodeDef(AdjMI->getOpcode())) {		: isFloatingPointOpcodeDef(AdjMI->getOpcode())) {
setTypes(MI, InstType::FloatingPoint);		setTypes(MI, InstType::FloatingPoint);
return true;		return true;
}		}

// Determine InstType from register bank of phys register that is		// Determine InstType from register bank of phys register that is
// 'isDefUse ? def : use' of this copy.		// 'isDefUse ? def : use' of this copy.
if (AdjMI->getOpcode() == TargetOpcode::COPY) {		if (AdjMI->isCopy()) {
setTypesAccordingToPhysicalRegister(MI, AdjMI, isDefUse ? 0 : 1);		setTypesAccordingToPhysicalRegister(MI, AdjMI, isDefUse ? 0 : 1);
return true;		return true;
}		}

// Defaults to integer instruction. Small registers in G_MERGE (uses) and		// Defaults to integer instruction. Small registers in G_MERGE (uses) and
// G_UNMERGE (defs) will always be gprb.		// G_UNMERGE (defs) will always be gprb.
if ((!isDefUse && AdjMI->getOpcode() == TargetOpcode::G_UNMERGE_VALUES) \|\|		if ((!isDefUse && AdjMI->getOpcode() == TargetOpcode::G_UNMERGE_VALUES) \|\|
(isDefUse && AdjMI->getOpcode() == TargetOpcode::G_MERGE_VALUES) \|\|		(isDefUse && AdjMI->getOpcode() == TargetOpcode::G_MERGE_VALUES) \|\|
▲ Show 20 Lines • Show All 439 Lines • Show Last 20 Lines

llvm/lib/Target/Mips/MipsSEFrameLowering.cpp

Show First 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	case Mips::ExtractElementF64:
if (expandExtractElementF64(MBB, I, false))		if (expandExtractElementF64(MBB, I, false))
MBB.erase(I);		MBB.erase(I);
return false;		return false;
case Mips::ExtractElementF64_64:		case Mips::ExtractElementF64_64:
if (expandExtractElementF64(MBB, I, true))		if (expandExtractElementF64(MBB, I, true))
MBB.erase(I);		MBB.erase(I);
return false;		return false;
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
if (!expandCopy(MBB, I))		if (!expandCopy(MBB, I))
return false;		return false;
break;		break;
default:		default:
return false;		return false;
}		}

MBB.erase(I);		MBB.erase(I);
▲ Show 20 Lines • Show All 758 Lines • Show Last 20 Lines

llvm/lib/Target/NVPTX/NVPTXReplaceImageHandles.cpp

Show First 20 Lines • Show All 168 Lines • ▼ Show 20 Lines	case NVPTX::texsurf_handles: {
assert(TexHandleDef.getOperand(1).isGlobal() && "Load is not a global!");		assert(TexHandleDef.getOperand(1).isGlobal() && "Load is not a global!");
const GlobalValue *GV = TexHandleDef.getOperand(1).getGlobal();		const GlobalValue *GV = TexHandleDef.getOperand(1).getGlobal();
assert(GV->hasName() && "Global sampler must be named!");		assert(GV->hasName() && "Global sampler must be named!");
InstrsToRemove.insert(&TexHandleDef);		InstrsToRemove.insert(&TexHandleDef);
Idx = MFI->getImageHandleSymbolIndex(GV->getName().data());		Idx = MFI->getImageHandleSymbolIndex(GV->getName().data());
return true;		return true;
}		}
case NVPTX::nvvm_move_i64:		case NVPTX::nvvm_move_i64:
case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY: {
bool Res = findIndexForHandle(TexHandleDef.getOperand(1), MF, Idx);		bool Res = findIndexForHandle(TexHandleDef.getOperand(1), MF, Idx);
if (Res) {		if (Res) {
InstrsToRemove.insert(&TexHandleDef);		InstrsToRemove.insert(&TexHandleDef);
}		}
return Res;		return Res;
}		}
default:		default:
llvm_unreachable("Unknown instruction operating on handle");		llvm_unreachable("Unknown instruction operating on handle");
}		}
}		}

MachineFunctionPass *llvm::createNVPTXReplaceImageHandlesPass() {		MachineFunctionPass *llvm::createNVPTXReplaceImageHandlesPass() {
return new NVPTXReplaceImageHandles();		return new NVPTXReplaceImageHandles();
}		}

llvm/lib/Target/X86/X86DomainReassignment.cpp

Show First 20 Lines • Show All 229 Lines • ▼ Show 20 Lines	if (Register::isPhysicalRegister(SrcReg) &&
X86::GR16RegClass.contains(SrcReg)))		X86::GR16RegClass.contains(SrcReg)))
return false;		return false;

return true;		return true;
}		}

double getExtraCost(const MachineInstr *MI,		double getExtraCost(const MachineInstr *MI,
MachineRegisterInfo *MRI) const override {		MachineRegisterInfo *MRI) const override {
assert(MI->getOpcode() == TargetOpcode::COPY && "Expected a COPY");		assert(MI->isCopy() && "Expected a COPY");

for (auto &MO : MI->operands()) {		for (auto &MO : MI->operands()) {
// Physical registers will not be converted. Assume that converting the		// Physical registers will not be converted. Assume that converting the
// COPY to the destination domain will eventually result in a actual		// COPY to the destination domain will eventually result in a actual
// instruction.		// instruction.
if (Register::isPhysicalRegister(MO.getReg()))		if (Register::isPhysicalRegister(MO.getReg()))
return 1;		return 1;

▲ Show 20 Lines • Show All 551 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86FlagsCopyLowering.cpp

Show First 20 Lines • Show All 378 Lines • ▼ Show 20 Lines	bool X86FlagsCopyLoweringPass::runOnMachineFunction(MachineFunction &MF) {
// Collect the copies in RPO so that when there are chains where a copy is in		// Collect the copies in RPO so that when there are chains where a copy is in
// turn copied again we visit the first one first. This ensures we can find		// turn copied again we visit the first one first. This ensures we can find
// viable locations for testing the original EFLAGS that dominate all the		// viable locations for testing the original EFLAGS that dominate all the
// uses across complex CFGs.		// uses across complex CFGs.
SmallVector<MachineInstr *, 4> Copies;		SmallVector<MachineInstr *, 4> Copies;
ReversePostOrderTraversal<MachineFunction *> RPOT(&MF);		ReversePostOrderTraversal<MachineFunction *> RPOT(&MF);
for (MachineBasicBlock *MBB : RPOT)		for (MachineBasicBlock *MBB : RPOT)
for (MachineInstr &MI : *MBB)		for (MachineInstr &MI : *MBB)
if (MI.getOpcode() == TargetOpcode::COPY &&		if (MI.isCopy() && MI.getOperand(0).getReg() == X86::EFLAGS)
MI.getOperand(0).getReg() == X86::EFLAGS)
Copies.push_back(&MI);		Copies.push_back(&MI);

for (MachineInstr *CopyI : Copies) {		for (MachineInstr *CopyI : Copies) {
MachineBasicBlock &MBB = *CopyI->getParent();		MachineBasicBlock &MBB = *CopyI->getParent();

MachineOperand &VOp = CopyI->getOperand(1);		MachineOperand &VOp = CopyI->getOperand(1);
assert(VOp.isReg() &&		assert(VOp.isReg() &&
"The input to the copy for EFLAGS should always be a register!");		"The input to the copy for EFLAGS should always be a register!");
MachineInstr &CopyDefI = *MRI->getVRegDef(VOp.getReg());		MachineInstr &CopyDefI = *MRI->getVRegDef(VOp.getReg());
if (CopyDefI.getOpcode() != TargetOpcode::COPY) {		if (!CopyDefI.isCopy()) {
// FIXME: The big likely candidate here are PHI nodes. We could in theory		// FIXME: The big likely candidate here are PHI nodes. We could in theory
// handle PHI nodes, but it gets really, really hard. Insanely hard. Hard		// handle PHI nodes, but it gets really, really hard. Insanely hard. Hard
// enough that it is probably better to change every other part of LLVM		// enough that it is probably better to change every other part of LLVM
// to avoid creating them. The issue is that once we have PHIs we won't		// to avoid creating them. The issue is that once we have PHIs we won't
// know which original EFLAGS value we need to capture with our setCCs		// know which original EFLAGS value we need to capture with our setCCs
// below. The end result will be computing a complete set of setCCs that		// below. The end result will be computing a complete set of setCCs that
// we might want, computing them in every place where we copy out of		// we might want, computing them in every place where we copy out of
// EFLAGS and then doing SSA formation on all of them to insert necessary		// EFLAGS and then doing SSA formation on all of them to insert necessary
▲ Show 20 Lines • Show All 212 Lines • ▼ Show 20 Lines	do {

// Otherwise we can just rewrite in-place.		// Otherwise we can just rewrite in-place.
if (X86::getCondFromCMov(MI) != X86::COND_INVALID) {		if (X86::getCondFromCMov(MI) != X86::COND_INVALID) {
rewriteCMov(TestMBB, TestPos, TestLoc, MI, FlagUse, CondRegs);		rewriteCMov(TestMBB, TestPos, TestLoc, MI, FlagUse, CondRegs);
} else if (getCondFromFCMOV(MI.getOpcode()) != X86::COND_INVALID) {		} else if (getCondFromFCMOV(MI.getOpcode()) != X86::COND_INVALID) {
rewriteFCMov(TestMBB, TestPos, TestLoc, MI, FlagUse, CondRegs);		rewriteFCMov(TestMBB, TestPos, TestLoc, MI, FlagUse, CondRegs);
} else if (X86::getCondFromSETCC(MI) != X86::COND_INVALID) {		} else if (X86::getCondFromSETCC(MI) != X86::COND_INVALID) {
rewriteSetCC(TestMBB, TestPos, TestLoc, MI, FlagUse, CondRegs);		rewriteSetCC(TestMBB, TestPos, TestLoc, MI, FlagUse, CondRegs);
} else if (MI.getOpcode() == TargetOpcode::COPY) {		} else if (MI.isCopy()) {
rewriteCopy(MI, *FlagUse, CopyDefI);		rewriteCopy(MI, *FlagUse, CopyDefI);
} else {		} else {
// We assume all other instructions that use flags also def them.		// We assume all other instructions that use flags also def them.
assert(MI.findRegisterDefOperand(X86::EFLAGS) &&		assert(MI.findRegisterDefOperand(X86::EFLAGS) &&
"Expected a def of EFLAGS for this instruction!");		"Expected a def of EFLAGS for this instruction!");

// NB!!! Several arithmetic instructions only partially update		// NB!!! Several arithmetic instructions only partially update
// flags. Theoretically, we could generate MI code sequences that		// flags. Theoretically, we could generate MI code sequences that
▲ Show 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	for (MachineInstr *CopyI : Copies) {

// FIXME: Mark the last use of EFLAGS before the copy's def as a kill if		// FIXME: Mark the last use of EFLAGS before the copy's def as a kill if
// the copy's def operand is itself a kill.		// the copy's def operand is itself a kill.
}		}

#ifndef NDEBUG		#ifndef NDEBUG
for (MachineBasicBlock &MBB : MF)		for (MachineBasicBlock &MBB : MF)
for (MachineInstr &MI : MBB)		for (MachineInstr &MI : MBB)
if (MI.getOpcode() == TargetOpcode::COPY &&		if (MI.isCopy() && (MI.getOperand(0).getReg() == X86::EFLAGS \|\|
(MI.getOperand(0).getReg() == X86::EFLAGS \|\|
MI.getOperand(1).getReg() == X86::EFLAGS)) {		MI.getOperand(1).getReg() == X86::EFLAGS)) {
LLVM_DEBUG(dbgs() << "ERROR: Found a COPY involving EFLAGS: ";		LLVM_DEBUG(dbgs() << "ERROR: Found a COPY involving EFLAGS: ";
MI.dump());		MI.dump());
llvm_unreachable("Unlowered EFLAGS copy!");		llvm_unreachable("Unlowered EFLAGS copy!");
}		}
#endif		#endif

return true;		return true;
}		}
▲ Show 20 Lines • Show All 257 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86FloatingPoint.cpp

Show First 20 Lines • Show All 1,452 Lines • ▼ Show 20 Lines	void FPS::handleSpecialFP(MachineBasicBlock::iterator &Inst) {

if (MI.isReturn()) {		if (MI.isReturn()) {
handleReturn(Inst);		handleReturn(Inst);
return;		return;
}		}

switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
default: llvm_unreachable("Unknown SpecialFP instruction!");		default: llvm_unreachable("Unknown SpecialFP instruction!");
case TargetOpcode::COPY: {		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY: {
// We handle three kinds of copies: FP <- FP, FP <- ST, and ST <- FP.		// We handle three kinds of copies: FP <- FP, FP <- ST, and ST <- FP.
const MachineOperand &MO1 = MI.getOperand(1);		const MachineOperand &MO1 = MI.getOperand(1);
const MachineOperand &MO0 = MI.getOperand(0);		const MachineOperand &MO0 = MI.getOperand(0);
bool KillsSrc = MI.killsRegister(MO1.getReg());		bool KillsSrc = MI.killsRegister(MO1.getReg());

// FP <- FP copy.		// FP <- FP copy.
unsigned DstFP = getFPReg(MO0);		unsigned DstFP = getFPReg(MO0);
unsigned SrcFP = getFPReg(MO1);		unsigned SrcFP = getFPReg(MO1);
▲ Show 20 Lines • Show All 261 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86InstrInfo.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 138 Lines • ▼ Show 20 Lines	bool X86InstrInfo::isDataInvariant(MachineInstr &MI) {
switch (MI.getOpcode()) {		switch (MI.getOpcode()) {
default:		default:
// By default, assume that the instruction is not data invariant.		// By default, assume that the instruction is not data invariant.
return false;		return false;

// Some target-independent operations that trivially lower to data-invariant		// Some target-independent operations that trivially lower to data-invariant
// instructions.		// instructions.
case TargetOpcode::COPY:		case TargetOpcode::COPY:
		case TargetOpcode::TCOPY:
case TargetOpcode::INSERT_SUBREG:		case TargetOpcode::INSERT_SUBREG:
case TargetOpcode::SUBREG_TO_REG:		case TargetOpcode::SUBREG_TO_REG:
return true;		return true;

// On x86 it is believed that imul is constant time w.r.t. the loaded data.		// On x86 it is believed that imul is constant time w.r.t. the loaded data.
// However, they set flags and are perhaps the most surprisingly constant		// However, they set flags and are perhaps the most surprisingly constant
// time operations so we call them out here separately.		// time operations so we call them out here separately.
case X86::IMUL16rr:		case X86::IMUL16rr:
▲ Show 20 Lines • Show All 8,762 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/callbr-asm-label.ll

	; RUN: llc < %s -mtriple=aarch64-linux-gnu \| FileCheck %s			; RUN: llc < %s -mtriple=aarch64-linux-gnu \| FileCheck %s
				; RUN: llc < %s -mtriple=aarch64-linux-gnu -O0 \| FileCheck %s --check-prefix=CHECK-O0

	@X = common local_unnamed_addr global i32 0, align 4			@X = common local_unnamed_addr global i32 0, align 4

	define i32 @test1() {			define i32 @test1() {
	; CHECK-LABEL: test1:			; CHECK-LABEL: test1:
	; CHECK: .word b			; CHECK: .word b
	; CHECK-NEXT: .word .Ltmp0			; CHECK-NEXT: .word .Ltmp0
	; CHECK-LABEL: .LBB0_1: // %cleanup			; CHECK-LABEL: .LBB0_1: // %cleanup
	; CHECK-LABEL: .Ltmp0:			; CHECK-LABEL: .Ltmp0:
	; CHECK-LABEL: .LBB0_2: // %indirect			; CHECK-LABEL: .LBB0_2: // %indirect

				; CHECK-O0-LABEL: test1:
				; CHECK-O0: .word b
				; CHECK-O0-NEXT: .word .Ltmp1
				; CHECK-O0-LABEL: .Ltmp1:
				; CHECK-O0-LABEL: .LBB0_1: // %indirect
				; CHECK-O0-LABEL: .LBB0_2: // %cleanup
	entry:			entry:
	callbr void asm sideeffect "1:\0A\09.word b, ${0:l}\0A\09", "X"(i8* blockaddress(@test1, %indirect))			callbr void asm sideeffect "1:\0A\09.word b, ${0:l}\0A\09", "X"(i8* blockaddress(@test1, %indirect))
	to label %cleanup [label %indirect]			to label %cleanup [label %indirect]

	indirect:			indirect:
	br label %cleanup			br label %cleanup

	cleanup:			cleanup:
	%retval.0 = phi i32 [ 1, %indirect ], [ 0, %entry ]			%retval.0 = phi i32 [ 1, %indirect ], [ 0, %entry ]
	ret i32 %retval.0			ret i32 %retval.0
	}			}

	define void @test2() {			define void @test2() {
	; CHECK-LABEL: test2:			; CHECK-LABEL: test2:
				; CHECK-O0-LABEL: test2:
	entry:			entry:
	%0 = load i32, i32* @X, align 4			%0 = load i32, i32* @X, align 4
	%and = and i32 %0, 1			%and = and i32 %0, 1
	%tobool = icmp eq i32 %and, 0			%tobool = icmp eq i32 %and, 0
	br i1 %tobool, label %if.end10, label %if.then			br i1 %tobool, label %if.end10, label %if.then

	if.then:			if.then:
	; CHECK: .word b			; CHECK: .word b
	; CHECK-NEXT: .word .Ltmp2			; CHECK-NEXT: .word .Ltmp2
	; CHECK-LABEL: .Ltmp2:			; CHECK-LABEL: .Ltmp2:
	; CHECK-NEXT: .LBB1_3: // %if.end6			; CHECK-NEXT: .LBB1_3: // %if.end6

				; CHECK-O0: .word b
				; CHECK-O0-NEXT: .word .Ltmp3
				; CHECK-O0-LABEL: .Ltmp3:
				; CHECK-O0-NEXT: .LBB1_3: // %if.end6
	callbr void asm sideeffect "1:\0A\09.word b, ${0:l}\0A\09", "X"(i8* blockaddress(@test2, %if.end6))			callbr void asm sideeffect "1:\0A\09.word b, ${0:l}\0A\09", "X"(i8* blockaddress(@test2, %if.end6))
	to label %if.then4 [label %if.end6]			to label %if.then4 [label %if.end6]

	if.then4:			if.then4:
	%call5 = tail call i32 bitcast (i32 (...)* @g to i32 ()*)()			%call5 = tail call i32 bitcast (i32 (...)* @g to i32 ()*)()
	br label %if.end6			br label %if.end6

	if.end6:			if.end6:
	%.pre = load i32, i32* @X, align 4			%.pre = load i32, i32* @X, align 4
	%.pre13 = and i32 %.pre, 1			%.pre13 = and i32 %.pre, 1
	%phitmp = icmp eq i32 %.pre13, 0			%phitmp = icmp eq i32 %.pre13, 0
	br i1 %phitmp, label %if.end10, label %if.then9			br i1 %phitmp, label %if.end10, label %if.then9

	if.then9:			if.then9:
	; CHECK-LABEL: .Ltmp4:			; CHECK-LABEL: .Ltmp4:
	; CHECK-NEXT: .LBB1_5: // %l_yes			; CHECK-NEXT: .LBB1_5: // %l_yes

				; CHECK-O0-LABEL: .Ltmp5:
				; CHECK-O0-NEXT: .LBB1_6: // %l_yes
	callbr void asm sideeffect "", "X"(i8* blockaddress(@test2, %l_yes))			callbr void asm sideeffect "", "X"(i8* blockaddress(@test2, %l_yes))
	to label %if.end10 [label %l_yes]			to label %if.end10 [label %l_yes]

	if.end10:			if.end10:
	br label %l_yes			br label %l_yes

	l_yes:			l_yes:
	ret void			ret void
	}			}

	declare i32 @g(...)			declare i32 @g(...)

llvm/test/CodeGen/SystemZ/asm-20.ll

	; Test that asm goto can be compiled.			; Test that asm goto can be compiled.
	;			;
	; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14			; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z14 -O0

	define i32 @c() {			define i32 @c() {
	entry:			entry:
	callbr void asm sideeffect "j d", "X"(i8* blockaddress(@c, %d))			callbr void asm sideeffect "j d", "X"(i8* blockaddress(@c, %d))
	to label %asm.fallthrough [label %d]			to label %asm.fallthrough [label %d]

	asm.fallthrough: ; preds = %entry			asm.fallthrough: ; preds = %entry
	br label %d			br label %d

	d: ; preds = %asm.fallthrough, %entry			d: ; preds = %asm.fallthrough, %entry
	ret i32 undef			ret i32 undef
	}			}

llvm/test/CodeGen/X86/callbr-asm-label-addr.ll

	; RUN: llc < %s -mtriple=x86_64-unknown-linux-gnu \| FileCheck %s			; RUN: llc < %s -mtriple=x86_64-unknown-linux-gnu \| FileCheck %s
				; RUN: llc < %s -mtriple=x86_64-unknown-linux-gnu -O0 \| FileCheck %s --check-prefix=CHECK-O0

	define i32 @test1(i32 %x) {			define i32 @test1(i32 %x) {
	; CHECK-LABEL: test1:			; CHECK-LABEL: test1:
	; CHECK: .quad .Ltmp0			; CHECK: .quad .Ltmp0
	; CHECK-NEXT: .quad .Ltmp1			; CHECK-NEXT: .quad .Ltmp1
	; CHECK-LABEL: .Ltmp1:			; CHECK-LABEL: .Ltmp1:
	; CHECK-LABEL: .LBB0_1: # %bar			; CHECK-LABEL: .LBB0_1: # %bar
	; CHECK-NEXT: callq foo			; CHECK-NEXT: callq foo
	; CHECK-LABEL: .Ltmp0:			; CHECK-LABEL: .Ltmp0:
	; CHECK-NEXT: # %bb.2: # %baz			; CHECK-NEXT: # %bb.2: # %baz

				; CHECK-O0-LABEL: test1:
				; CHECK-O0: .quad .Ltmp0
				; CHECK-O0-NEXT: .quad .Ltmp1
				; CHECK-O0-LABEL: .Ltmp1:
				; CHECK-O0-LABEL: .LBB0_2: # %bar
				; CHECK-O0-NEXT: movl
				; CHECK-O0-NEXT: callq foo
				; CHECK-O0-LABEL: .Ltmp0:
				; CHECK-O0-NEXT: # %bb.3: # %baz
	entry:			entry:
	callbr void asm sideeffect ".quad ${0:l}\0A\09.quad ${1:l}", "i,X,~{dirflag},~{fpsr},~{flags}"(i8* blockaddress(@test1, %baz), i8* blockaddress(@test1, %bar))			callbr void asm sideeffect ".quad ${0:l}\0A\09.quad ${1:l}", "i,X,~{dirflag},~{fpsr},~{flags}"(i8* blockaddress(@test1, %baz), i8* blockaddress(@test1, %bar))
	to label %asm.fallthrough [label %bar]			to label %asm.fallthrough [label %bar]

	asm.fallthrough:			asm.fallthrough:
	br label %bar			br label %bar

	bar:			bar:
	Show All 11 Lines

llvm/test/CodeGen/X86/callbr-asm-outputs-tcopy-spilling.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=x86_64-unknown-linux-gnu -O0 \| FileCheck %s --check-prefix=CHECK

				%struct.kernel_rseq = type { i32, i32, i8*, i32, [12 x i8] }

				@__rseq_abi = external thread_local global %struct.kernel_rseq, align 32

				define i32 @test1(i8* %percpu_data, i64 %lock_value) #0 {
				; CHECK-LABEL: test1:
				; CHECK: # %bb.0: # %entry
				; CHECK-NEXT: pushq %rbp
				; CHECK-NEXT: .cfi_def_cfa_offset 16
				; CHECK-NEXT: .cfi_offset %rbp, -16
				; CHECK-NEXT: movq %rsp, %rbp
				; CHECK-NEXT: .cfi_def_cfa_register %rbp
				; CHECK-NEXT: movq %rdi, -16(%rbp)
				; CHECK-NEXT: movq %rsi, -24(%rbp)
				; CHECK-NEXT: movq -16(%rbp), %rax
				; CHECK-NEXT: movq -24(%rbp), %rcx
				; CHECK-NEXT: movq __rseq_abi@{{.*}}(%rip), %rdx
				; CHECK-NEXT: movq %fs:0, %rsi
				; CHECK-NEXT: leaq 8(%rsi,%rdx), %rdi
				; CHECK-NEXT: leaq 4(%rsi,%rdx), %rdx
				; CHECK-NEXT: #APP
				; CHECK-NEXT: .Ltmp1:
				; CHECK-NEXT: leaq __rseq_cs_RseqFunction_PerCpuTryLock_0(%rip), %rsi
				; CHECK-NEXT: movq %rsi, (%rdi)
				; CHECK-NEXT: .Ltmp2:
				; CHECK-NEXT: movl (%rdx), %r8d
				; CHECK-NEXT: movl %r8d, %esi
				; CHECK-NEXT: shlq $12, %rsi
				; CHECK-NEXT: addq %rax, %rsi
				; CHECK-NEXT: cmpq $0, (%rsi)
				; CHECK-NEXT: jne .Ltmp0
				; CHECK-NEXT: movq %rcx, (%rsi)
				; CHECK-NEXT: .Ltmp3:
				; CHECK-NEXT: #NO_APP
				; CHECK-NEXT: movl %r8d, -40(%rbp) # 4-byte Spill
				; CHECK-NEXT: movq %rsi, -48(%rbp) # 8-byte Spill
				; CHECK-NEXT: # %bb.4: # %entry
				; CHECK-NEXT: jmp .LBB0_1
				entry:
				%retval = alloca i32, align 4
				%percpu_data.addr = alloca i8*, align 8
				%lock_value.addr = alloca i64, align 8
				%scratch = alloca i64, align 8
				%cpu = alloca i32, align 4
				store i8* %percpu_data, i8** %percpu_data.addr, align 8
				store i64 %lock_value, i64* %lock_value.addr, align 8
				%0 = load i8, i8* %percpu_data.addr, align 8
				%1 = load i64, i64* %lock_value.addr, align 8
				%2 = callbr { i64, i32 } asm "3:\0Alea __rseq_cs_RseqFunction_PerCpuTryLock_${:uid}(%rip), $0\0Amov $0, ($2)\0A4:\0Amov ($3), $1\0Amov $1, ${0:k}\0Ashl $5, $0\0Aadd $6, $0\0Acmpq $$0, ($0)\0Ajne ${8:l}\0Amov $7, ($0)\0A5:", "=&r,=&r,r,r,n,n,r,r,X,~{cc},~{memory},~{dirflag},~{fpsr},~{flags}"(i8** getelementptr inbounds (%struct.kernel_rseq, %struct.kernel_rseq* @__rseq_abi, i32 0, i32 2), i32* getelementptr inbounds (%struct.kernel_rseq, %struct.kernel_rseq* @__rseq_abi, i32 0, i32 1), i32 1392848979, i32 12, i8* %0, i64 %1, i8* blockaddress(@test1, %fail_contended)) #1
				to label %asm.fallthrough [label %fail_contended]

				asm.fallthrough: ; preds = %entry
				%asmresult = extractvalue { i64, i32 } %2, 0
				%asmresult1 = extractvalue { i64, i32 } %2, 1
				store i64 %asmresult, i64* %scratch, align 8
				store i32 %asmresult1, i32* %cpu, align 4
				%3 = load i32, i32* %cpu, align 4
				store i32 %3, i32* %retval, align 4
				br label %return

				fail_contended: ; preds = %entry
				store i32 -1, i32* %retval, align 4
				br label %return

				return: ; preds = %fail_contended, %asm.fallthrough
				%4 = load i32, i32* %retval, align 4
				ret i32 %4
				}

				attributes #0 = { noinline nounwind optnone uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="all" "less-precise-fpmad"="false" "min-legal-vector-width"="0" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
				attributes #1 = { nounwind }

llvm/test/CodeGen/X86/callbr-asm-outputs.ll

	Show First 20 Lines • Show All 45 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: jge .LBB1_3			; CHECK-NEXT: jge .LBB1_3
	; CHECK-NEXT: # %bb.1: # %if.then			; CHECK-NEXT: # %bb.1: # %if.then
	; CHECK-NEXT: #APP			; CHECK-NEXT: #APP
	; CHECK-NEXT: testl %esi, %esi			; CHECK-NEXT: testl %esi, %esi
	; CHECK-NEXT: testl %edi, %esi			; CHECK-NEXT: testl %edi, %esi
	; CHECK-NEXT: jne .Ltmp1			; CHECK-NEXT: jne .Ltmp1
	; CHECK-NEXT: #NO_APP			; CHECK-NEXT: #NO_APP
	; CHECK-NEXT: .LBB1_2: # %if.then			; CHECK-NEXT: .LBB1_2: # %if.then
				; CHECK-NEXT: addl %esi, %edi
	; CHECK-NEXT: movl %edi, %eax			; CHECK-NEXT: movl %edi, %eax
	; CHECK-NEXT: addl %esi, %eax
	; CHECK-NEXT: .Ltmp2: # Block address taken			; CHECK-NEXT: .Ltmp2: # Block address taken
	; CHECK-NEXT: .LBB1_6: # %return			; CHECK-NEXT: .LBB1_6: # %return
	; CHECK-NEXT: popl %esi			; CHECK-NEXT: popl %esi
	; CHECK-NEXT: .cfi_def_cfa_offset 8			; CHECK-NEXT: .cfi_def_cfa_offset 8
	; CHECK-NEXT: popl %edi			; CHECK-NEXT: popl %edi
	; CHECK-NEXT: .cfi_def_cfa_offset 4			; CHECK-NEXT: .cfi_def_cfa_offset 4
	; CHECK-NEXT: retl			; CHECK-NEXT: retl
	; CHECK-NEXT: .LBB1_3: # %if.else			; CHECK-NEXT: .LBB1_3: # %if.else
	▲ Show 20 Lines • Show All 144 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Add TCOPY, a terminator form of the COPY instrNeeds ReviewPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 263063

llvm/include/llvm/CodeGen/MachineBasicBlock.h

llvm/include/llvm/CodeGen/MachineInstr.h

llvm/include/llvm/Support/TargetOpcodes.def

llvm/include/llvm/Target/Target.td

llvm/lib/CodeGen/DetectDeadLanes.cpp

llvm/lib/CodeGen/ExpandPostRAPseudos.cpp

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp

llvm/lib/CodeGen/GlobalISel/GISelKnownBits.cpp

llvm/lib/CodeGen/GlobalISel/InstructionSelect.cpp

llvm/lib/CodeGen/GlobalISel/MachineIRBuilder.cpp

llvm/lib/CodeGen/GlobalISel/Utils.cpp

llvm/lib/CodeGen/MachineBasicBlock.cpp

llvm/lib/CodeGen/MachineInstr.cpp

llvm/lib/CodeGen/MachineSink.cpp

llvm/lib/CodeGen/MachineVerifier.cpp

llvm/lib/CodeGen/PeepholeOptimizer.cpp

llvm/lib/CodeGen/ReachingDefAnalysis.cpp

llvm/lib/CodeGen/RegAllocFast.cpp

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp

llvm/lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/Target/AArch64/AArch64CallLowering.cpp

llvm/lib/Target/AArch64/AArch64FastISel.cpp

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp

llvm/lib/Target/AArch64/AArch64InstructionSelector.cpp

llvm/lib/Target/AArch64/AArch64RegisterBankInfo.cpp

llvm/lib/Target/AMDGPU/SIISelLowering.cpp

llvm/lib/Target/AMDGPU/SIInstrInfo.cpp

llvm/lib/Target/Hexagon/BitTracker.cpp

llvm/lib/Target/Hexagon/HexagonBitSimplify.cpp

llvm/lib/Target/Hexagon/HexagonFrameLowering.cpp

llvm/lib/Target/Hexagon/HexagonGenPredicate.cpp

llvm/lib/Target/Hexagon/HexagonHardwareLoops.cpp

llvm/lib/Target/Hexagon/HexagonISelDAGToDAGHVX.cpp

llvm/lib/Target/Hexagon/HexagonInstrInfo.cpp

llvm/lib/Target/Hexagon/HexagonMachineScheduler.cpp

llvm/lib/Target/Hexagon/HexagonNewValueJump.cpp

llvm/lib/Target/Hexagon/HexagonSplitDouble.cpp

llvm/lib/Target/Hexagon/RDFCopy.cpp

llvm/lib/Target/Mips/MipsRegisterBankInfo.cpp

llvm/lib/Target/Mips/MipsSEFrameLowering.cpp

llvm/lib/Target/NVPTX/NVPTXReplaceImageHandles.cpp

llvm/lib/Target/X86/X86DomainReassignment.cpp

llvm/lib/Target/X86/X86FlagsCopyLowering.cpp

llvm/lib/Target/X86/X86FloatingPoint.cpp

llvm/lib/Target/X86/X86InstrInfo.cpp

llvm/test/CodeGen/AArch64/callbr-asm-label.ll

llvm/test/CodeGen/SystemZ/asm-20.ll

llvm/test/CodeGen/X86/callbr-asm-label-addr.ll

llvm/test/CodeGen/X86/callbr-asm-outputs-tcopy-spilling.ll

llvm/test/CodeGen/X86/callbr-asm-outputs.ll

Add TCOPY, a terminator form of the COPY instr
Needs ReviewPublic