This is an archive of the discontinued LLVM Phabricator instance.

Change the INLINEASM_BR MachineInstr to be a non-terminating instruction.
ClosedPublic

Authored by jyknight on May 12 2020, 9:58 AM.

Details

Summary

Before this instruction supported output values, it fit fairly
naturally as a terminator. However, being a terminator while also
supporting outputs causes some trouble, as the physreg->vreg COPY
operations cannot be in the same block.

Modeling it as a non-terminator allows it to be handled the same way
as invoke is handled already.
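For illustration, a hedged sketch of the IR shape involved (the function, labels, and constraint string here are invented, mirroring the style of the test case later in this thread): a callbr with an output returns its result in a physical register, so ISel must insert a physreg-to-vreg COPY immediately after the INLINEASM_BR, and a terminator cannot be followed by ordinary instructions in the same block.

```llvm
define i32 @sketch(i32 %x) {
entry:
  ; The "=r" output arrives in a physical register; ISel must emit a
  ; physreg->vreg COPY after INLINEASM_BR, in this same MachineBasicBlock.
  %res = callbr i32 asm "# payload elided", "=r,r,X"(i32 %x,
             i8* blockaddress(@sketch, %indirect))
         to label %fallthrough [label %indirect]

fallthrough:                      ; default destination, like invoke's normal dest
  ret i32 %res

indirect:                         ; like invoke's unwind dest; %res is not valid here
  ret i32 -1
}
```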

Most of the changes here were created by auditing all the existing
users of MachineBasicBlock::isEHPad() and
MachineBasicBlock::hasEHPadSuccessor(), and adding calls to
isInlineAsmBrIndirectTarget or hasInlineAsmBr, as appropriate.

Diff Detail

Event Timeline

jyknight created this revision. May 12 2020, 9:58 AM
Herald added a project: Restricted Project. · View Herald Transcript · May 12 2020, 9:58 AM
nickdesaulniers added a comment (edited). May 12 2020, 11:51 AM

An interesting approach. It certainly seems to simplify some of the special handling. I'm happy that it seems to delete much of the existing code. Thank you for taking the time to write up this patch.

I don't fully understand what "shrink wrapping" is, or the changes to BranchFolding, but the rest of the patch looks pretty good to me. This obviously has implications for asm goto without outputs, so I'd like to run this through a couple kernel builds to ensure we haven't regressed anything. From there, I can test kernel builds that use outputs.

Three other things to check:

  1. check we don't spill post terminators (https://reviews.llvm.org/D78166). Probably no issue there.
  2. check that BranchFolder::RemoveDeadBlock isn't removing any MachineBasicBlock that have their address taken. (https://reviews.llvm.org/D78234) tries to do this, but we've seen cases where asm goto w/ outputs results in the indirect successor being removed by BranchFolder. We have a case from the kernel that triggered the above two patches, and is still a problem for https://reviews.llvm.org/D75098 that I can send you.
  3. That live ins match live outs (https://reviews.llvm.org/D78586). Probably no issue there.
llvm/include/llvm/CodeGen/MachineBasicBlock.h
486

I think @efriedma has been noting that the API at the MIR level doesn't feel symmetric with the LLVM IR level.

https://reviews.llvm.org/D78234#1987870 and
https://reviews.llvm.org/D78234#1989382

In LLVM IR, you have BasicBlock::HasAddressTaken, but at the MIR level the operands are still BlockAddress (which reference a Function and BasicBlock, two LLVM IR level concepts). It's too bad we don't lower these to just MachineBasicBlocks (or a new MachineBlockAddress) as operands, and have equivalent machinery for detecting whether a MachineBasicBlock has its address taken.

llvm/include/llvm/Target/Target.td
1023

Delete

llvm/lib/CodeGen/BranchFolding.cpp
1705

That's neat, I didn't know you could use initializer lists as ranges for range-based for loops.
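For readers unfamiliar with the idiom, a standalone sketch (names are illustrative): a braced initializer list can serve directly as the range expression of a range-based for loop, which is handy for visiting a small fixed set of objects such as the two successor blocks in BranchFolding.

```cpp
#include <vector>

// The braced list deduces to std::initializer_list<int>, which provides
// begin()/end(), so the loop can iterate over it directly.
std::vector<int> gatherViaInitList(int a, int b, int c) {
  std::vector<int> seen;
  for (int v : {a, b, c})
    seen.push_back(v);
  return seen;
}
```

gatherViaInitList(1, 2, 3) yields {1, 2, 3}, visiting the values in the order written.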

llvm/lib/CodeGen/MachineBasicBlock.cpp
281–284
return any_of(successors(), [](const MachineBasicBlock *Succ) {
  return Succ->isInlineAsmBrIndirectTarget();
});

or better yet, why not be precise and iterate terminators() checking the getOpcode() == TargetOpcode::INLINEASM_BR?

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.h
410

Unused?

llvm/lib/CodeGen/SplitKit.cpp
100

Is this assignment ok to remove?

llvm/lib/Target/Hexagon/BitTracker.cpp
959–960
DefaultToAll |= B.hasInlineAsmBr();
llvm/test/CodeGen/ARM/ifcvt-diamond-unanalyzable-common.mir
66

INLINEASM_BR just disappears from this test?

llvm/test/CodeGen/X86/callbr-asm-outputs.ll
60

does this -1 get overwritten immediately on the next instruction?

An interesting approach. It certainly seems to simplify some of the special handling. I'm happy that it seems to delete much of the existing code. Thank you for taking the time to write up this patch.

This is funny. The original asm goto patches started off with just using INLINEASM and trying to follow exception handling. We switched to INLINEASM_BR as a terminator to "simplify" things.

kparzysz added inline comments. May 12 2020, 12:45 PM
llvm/lib/Target/Hexagon/BitTracker.cpp
958

It would defeat the purpose of this function. It calculates the set of possible targets for each branch, given the updated register states (i.e. branch conditions), which can be a proper subset of the set of targets listed in the branch.

959–960

Ok.

llvm/lib/Target/Hexagon/HexagonConstPropagation.cpp
758

Same reason as in BitTracker.

760

Ok.

817

...

820

Ok.

825

Ok.

void added a comment. May 12 2020, 6:59 PM

I think we should rename INLINEASM_BR to something that doesn't involve the _BR bit, since it's no longer a branch.

In general, I'm okay with this approach. (I thought we shouldn't make it a terminator in the first place.)

llvm/lib/CodeGen/MachineBasicBlock.cpp
281–284

It's no longer a terminator, but could be written:

return any_of(*this, [](const MachineInstr &MI) {
  return MI.getOpcode() == TargetOpcode::INLINEASM_BR;
});
288

Is this correct? We should be able to hoist into a block with INLINEASM_BR if it's coming from the default target.

llvm/lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp
1031

Yay!

With this series applied (all 3), I can build and boot an x86 defconfig Linux kernel that makes use of asm goto with outputs.

without outputs, just vanilla asm goto:

x86 allyesconfig builds (we don't try to boot allyesconfigs).

aarch32 defconfig kernels build and boot.

aarch64 defconfig kernels build, but they've regressed booting. No panic, just a dead machine. That will take a while to debug.

void added a comment. May 21 2020, 1:04 PM

Quick ping to ask about the status of this change. :-)

jyknight marked an inline comment as done. May 21 2020, 3:05 PM

In the review on the prerequisite patch, it turned out there's another thing that needs to be fixed before this one -- disambiguating whether the end of a block falls through or is actually unreachable.

I'm working on creating a patch to do that, after which I'll get back to this patch to address the comments here.

jyknight updated this revision to Diff 270500. Jun 12 2020, 1:08 PM

Just rebase. (Doesn't address review comments)

void added inline comments. Jun 12 2020, 11:18 PM
llvm/test/CodeGen/X86/callbr-asm-outputs.ll
60

It's the same on line 52. Is it a problem with the PHI elimination pass?

void added a comment. Jun 14 2020, 9:32 PM

I think you need the following patch. We shouldn't allow instructions to be rescheduled after an INLINEASM_BR, because all values whose definitions dominate the INLINEASM_BR are assumed to be valid on the indirect branch.

diff --git a/llvm/lib/CodeGen/MachineScheduler.cpp b/llvm/lib/CodeGen/MachineScheduler.cpp
index 0f21c97a30f..8cbbd79b661 100644
--- a/llvm/lib/CodeGen/MachineScheduler.cpp
+++ b/llvm/lib/CodeGen/MachineScheduler.cpp
@@ -443,7 +443,8 @@ static bool isSchedBoundary(MachineBasicBlock::iterator MI,
                             MachineBasicBlock *MBB,
                             MachineFunction *MF,
                             const TargetInstrInfo *TII) {
-  return MI->isCall() || TII->isSchedulingBoundary(*MI, MBB, *MF);
+  return MI->isCall() || TII->isSchedulingBoundary(*MI, MBB, *MF) ||
+      (MI->isInlineAsm() && MI->getOpcode() == TargetOpcode::INLINEASM_BR);
 }
 
 /// A region of an MBB for scheduling.
diff --git a/llvm/test/CodeGen/X86/callbr-asm-outputs.ll b/llvm/test/CodeGen/X86/callbr-asm-outputs.ll
index 61baa31074e..a4447bc15f1 100644
--- a/llvm/test/CodeGen/X86/callbr-asm-outputs.ll
+++ b/llvm/test/CodeGen/X86/callbr-asm-outputs.ll
@@ -41,6 +41,7 @@ define i32 @test2(i32 %out1, i32 %out2) {
 ; CHECK-NEXT:    .cfi_offset %edi, -8
 ; CHECK-NEXT:    movl {{[0-9]+}}(%esp), %edi
 ; CHECK-NEXT:    movl {{[0-9]+}}(%esp), %esi
+; CHECK-NEXT:    movl $-1, %eax
 ; CHECK-NEXT:    cmpl %edi, %esi
 ; CHECK-NEXT:    jge .LBB1_2
 ; CHECK-NEXT:  # %bb.1: # %if.then
@@ -49,7 +50,6 @@ define i32 @test2(i32 %out1, i32 %out2) {
 ; CHECK-NEXT:    testl %edi, %esi
 ; CHECK-NEXT:    jne .Ltmp1
 ; CHECK-NEXT:    #NO_APP
-; CHECK-NEXT:    movl $-1, %eax
 ; CHECK-NEXT:    jmp .LBB1_3
 ; CHECK-NEXT:  .LBB1_2: # %if.else
 ; CHECK-NEXT:    #APP
@@ -57,7 +57,6 @@ define i32 @test2(i32 %out1, i32 %out2) {
 ; CHECK-NEXT:    testl %esi, %edi
 ; CHECK-NEXT:    jne .Ltmp2
 ; CHECK-NEXT:    #NO_APP
-; CHECK-NEXT:    movl $-1, %eax
 ; CHECK-NEXT:  .LBB1_3:
 ; CHECK-NEXT:    movl %esi, %eax
 ; CHECK-NEXT:    addl %edi, %eax

@nathanchance tested this (version Diff 270500) and also observed boot failures for mainline Linux (v5.8-rc1) on aarch64, as well as for the aarch64 5.4 LTS stable kernel and the WSL2 kernel for x86_64. I have only briefly started debugging; there were some unrelated issues with RELR relocations and PAC/BTI aarch64 ISA extensions.

@void 's additional diff fixes the arm64 mainline boot issue I was observing. Maybe @nathanchance can help verify that, with that additional diff applied atop this patch, we're in good shape?

@void 's additional diff fixes the arm64 mainline boot issue I was observing. Maybe @nathanchance can help verify that, with that additional diff applied atop this patch, we're in good shape?

Unfortunately, my test case is still broken. With https://gist.github.com/nathanchance/f69a0281d63d6e72a1449d2f5a98636b on top of 1cfdda57fa63dd6d770ecb4411bd4d2b59e78544, this kernel boots (v5.7):

$ make -skj"$(nproc)" LLVM=1 O=out/x86_64 distclean defconfig bzImage
...

$ ../../cbl/github/boot-utils/boot-qemu.sh -a x86_64 -k out/x86_64 -t 30s |& grep -A1 qemu-system-x86_64
+ timeout --foreground 30s unbuffer qemu-system-x86_64 -cpu host -d unimp,guest_errors -enable-kvm -smp 64 -append 'console=ttyS0 ' -display none -initrd /home/nathan/cbl/github/boot-utils/images/x86_64/rootfs.cpio -kernel /home/nathan/src/linux/out/x86_64/arch/x86_64/boot/bzImage -m 512m -nodefaults -serial mon:stdio
[    0.000000] Linux version 5.7.0 (nathan@ubuntu-n2-xlarge-x86) (ClangBuiltLinux clang version 11.0.0 (https://github.com/llvm/llvm-project 07bde6d2c4646c888b1011aa079bbaaa250f79b8), LLD 11.0.0 (https://github.com/llvm/llvm-project 07bde6d2c4646c888b1011aa079bbaaa250f79b8)) #1 SMP Fri Jun 19 22:07:10 MST 2020

but this one does not:

$ make -skj"$(nproc)" KCFLAGS=-march=znver2 LLVM=1 LOCALVERSION=-znver2 O=out/x86_64 distclean defconfig bzImage
...

$ ../../cbl/github/boot-utils/boot-qemu.sh -a x86_64 -k out/x86_64 -t 30s |& grep -A1 qemu-system-x86_64
+ timeout --foreground 30s unbuffer qemu-system-x86_64 -cpu host -d unimp,guest_errors -enable-kvm -smp 64 -append 'console=ttyS0 ' -display none -initrd /home/nathan/cbl/github/boot-utils/images/x86_64/rootfs.cpio -kernel /home/nathan/src/linux/out/x86_64/arch/x86_64/boot/bzImage -m 512m -nodefaults -serial mon:stdio
+ RET=124

At 1cfdda57fa63dd6d770ecb4411bd4d2b59e78544, both kernels boot without any issue.

$ make -skj"$(nproc)" LLVM=1 O=out/x86_64 distclean defconfig bzImage
...

$ ../../cbl/github/boot-utils/boot-qemu.sh -a x86_64 -k out/x86_64 -t 30s |& grep -A1 qemu-system-x86_64
+ timeout --foreground 30s unbuffer qemu-system-x86_64 -cpu host -d unimp,guest_errors -enable-kvm -smp 64 -append 'console=ttyS0 ' -display none -initrd /home/nathan/cbl/github/boot-utils/images/x86_64/rootfs.cpio -kernel /home/nathan/src/linux/out/x86_64/arch/x86_64/boot/bzImage -m 512m -nodefaults -serial mon:stdio
[    0.000000] Linux version 5.7.0 (nathan@ubuntu-n2-xlarge-x86) (ClangBuiltLinux clang version 11.0.0 (https://github.com/llvm/llvm-project 1cfdda57fa63dd6d770ecb4411bd4d2b59e78544), LLD 11.0.0 (https://github.com/llvm/llvm-project 1cfdda57fa63dd6d770ecb4411bd4d2b59e78544)) #1 SMP Fri Jun 19 22:02:11 MST 2020
$ make -skj"$(nproc)" KCFLAGS=-march=znver2 LLVM=1 LOCALVERSION=-znver2 O=out/x86_64 distclean defconfig bzImage
...

$ ../../cbl/github/boot-utils/boot-qemu.sh -a x86_64 -k out/x86_64 -t 30s |& grep -A1 qemu-system-x86_64
+ timeout --foreground 30s unbuffer qemu-system-x86_64 -cpu host -d unimp,guest_errors -enable-kvm -smp 64 -append 'console=ttyS0 ' -display none -initrd /home/nathan/cbl/github/boot-utils/images/x86_64/rootfs.cpio -kernel /home/nathan/src/linux/out/x86_64/arch/x86_64/boot/bzImage -m 512m -nodefaults -serial mon:stdio
[    0.000000] Linux version 5.7.0-znver2 (nathan@ubuntu-n2-xlarge-x86) (ClangBuiltLinux clang version 11.0.0 (https://github.com/llvm/llvm-project 1cfdda57fa63dd6d770ecb4411bd4d2b59e78544), LLD 11.0.0 (https://github.com/llvm/llvm-project 1cfdda57fa63dd6d770ecb4411bd4d2b59e78544)) #1 SMP Fri Jun 19 22:00:23 MST 2020
void added a comment. Jun 21 2020, 4:27 AM

@nathanchance & @nickdesaulniers: I forgot the post scheduler. Try the patch below.

diff --git a/llvm/lib/CodeGen/MachineScheduler.cpp b/llvm/lib/CodeGen/MachineScheduler.cpp
index 0f21c97a30f..e28d25bbfae 100644
--- a/llvm/lib/CodeGen/MachineScheduler.cpp
+++ b/llvm/lib/CodeGen/MachineScheduler.cpp
@@ -443,7 +443,8 @@ static bool isSchedBoundary(MachineBasicBlock::iterator MI,
                             MachineBasicBlock *MBB,
                             MachineFunction *MF,
                             const TargetInstrInfo *TII) {
-  return MI->isCall() || TII->isSchedulingBoundary(*MI, MBB, *MF);
+  return MI->isCall() || MI->getOpcode() == TargetOpcode::INLINEASM_BR ||
+      TII->isSchedulingBoundary(*MI, MBB, *MF);
 }
 
 /// A region of an MBB for scheduling.
diff --git a/llvm/lib/CodeGen/PostRASchedulerList.cpp b/llvm/lib/CodeGen/PostRASchedulerList.cpp
index b85f00a61ea..1a2a1d753cd 100644
--- a/llvm/lib/CodeGen/PostRASchedulerList.cpp
+++ b/llvm/lib/CodeGen/PostRASchedulerList.cpp
@@ -338,7 +338,8 @@ bool PostRAScheduler::runOnMachineFunction(MachineFunction &Fn) {
       // Calls are not scheduling boundaries before register allocation, but
       // post-ra we don't gain anything by scheduling across calls since we
       // don't need to worry about register pressure.
-      if (MI.isCall() || TII->isSchedulingBoundary(MI, &MBB, Fn)) {
+      if (MI.isCall() || MI.getOpcode() == TargetOpcode::INLINEASM_BR ||
+          TII->isSchedulingBoundary(MI, &MBB, Fn)) {
         Scheduler.enterRegion(&MBB, I, Current, CurrentCount - Count);
         Scheduler.setEndIndex(CurrentCount);
         Scheduler.schedule();
diff --git a/llvm/test/CodeGen/X86/callbr-asm-outputs.ll b/llvm/test/CodeGen/X86/callbr-asm-outputs.ll
index 61baa31074e..a4447bc15f1 100644
--- a/llvm/test/CodeGen/X86/callbr-asm-outputs.ll
+++ b/llvm/test/CodeGen/X86/callbr-asm-outputs.ll
@@ -41,6 +41,7 @@ define i32 @test2(i32 %out1, i32 %out2) {
 ; CHECK-NEXT:    .cfi_offset %edi, -8
 ; CHECK-NEXT:    movl {{[0-9]+}}(%esp), %edi
 ; CHECK-NEXT:    movl {{[0-9]+}}(%esp), %esi
+; CHECK-NEXT:    movl $-1, %eax
 ; CHECK-NEXT:    cmpl %edi, %esi
 ; CHECK-NEXT:    jge .LBB1_2
 ; CHECK-NEXT:  # %bb.1: # %if.then
@@ -49,7 +50,6 @@ define i32 @test2(i32 %out1, i32 %out2) {
 ; CHECK-NEXT:    testl %edi, %esi
 ; CHECK-NEXT:    jne .Ltmp1
 ; CHECK-NEXT:    #NO_APP
-; CHECK-NEXT:    movl $-1, %eax
 ; CHECK-NEXT:    jmp .LBB1_3
 ; CHECK-NEXT:  .LBB1_2: # %if.else
 ; CHECK-NEXT:    #APP
@@ -57,7 +57,6 @@ define i32 @test2(i32 %out1, i32 %out2) {
 ; CHECK-NEXT:    testl %esi, %edi
 ; CHECK-NEXT:    jne .Ltmp2
 ; CHECK-NEXT:    #NO_APP
-; CHECK-NEXT:    movl $-1, %eax
 ; CHECK-NEXT:  .LBB1_3:
 ; CHECK-NEXT:    movl %esi, %eax
 ; CHECK-NEXT:    addl %edi, %eax

@void that diff on top of this revision resolves the issue I was seeing, thanks!

+ timeout --foreground 30s unbuffer qemu-system-x86_64 -cpu host -d unimp,guest_errors -enable-kvm -smp 64 -append 'console=ttyS0 ' -display none -initrd /home/nathan/cbl/github/boot-utils/images/x86_64/rootfs.cpio -kernel /home/nathan/src/linux/out/x86_64/arch/x86_64/boot/bzImage -m 512m -nodefaults -serial mon:stdio
[    0.000000] Linux version 5.7.0-znver2 (nathan@ubuntu-n2-xlarge-x86) (ClangBuiltLinux clang version 11.0.0 (https://github.com/llvm/llvm-project a3c12c2214b98696d2f465d9413d91dfcd02160c), LLD 11.0.0 (https://github.com/llvm/llvm-project a3c12c2214b98696d2f465d9413d91dfcd02160c)) #1 SMP Sun Jun 21 12:00:34 MST 2020

@void that diff on top of this revision resolves the issue I was seeing, thanks!

That's great! @void is there a test case we can add for that post RA schedule, so that we don't regress that? @jyknight would you mind rolling @void 's changes up into this and rebasing, please? I'm curious if there's anything we can do to combine isSchedBoundary and isSchedulingBoundary?

void added a comment. Jun 22 2020, 3:10 PM

@void that diff on top of this revision resolves the issue I was seeing, thanks!

That's great! @void is there a test case we can add for that post RA schedule, so that we don't regress that? @jyknight would you mind rolling @void 's changes up into this and rebasing, please? I'm curious if there's anything we can do to combine isSchedBoundary and isSchedulingBoundary?

I'm working on getting the test case ready.

@void that diff on top of this revision resolves the issue I was seeing, thanks!

That's great! @void is there a test case we can add for that post RA schedule, so that we don't regress that? @jyknight would you mind rolling @void 's changes up into this and rebasing, please? I'm curious if there's anything we can do to combine isSchedBoundary and isSchedulingBoundary?

Here's the testcase:

$ more llvm/test/CodeGen/X86/callbr-asm-instr-scheduling.ll
; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
; RUN: llc -mtriple=x86_64-unknown-linux-gnu -verify-machineinstrs -mcpu=znver2 -O2 -frame-pointer=none < %s | FileCheck %s

; Make sure that instructions aren't scheduled after the "callbr". In the
; example below, we don't want the "shrxq" through "leaq" instructions to be
; moved after the "callbr".

%struct.cpuinfo_x86 = type { i8, i8, i8, i8, i32, [3 x i32], i8, i8, i8, i8, i32, i32, %union.anon.83, [16 x i8], [64 x i8], i32, i32, i32, i32, i32, i32, i64, i16, i16, i16, i16, i16, i16, i16, i16, i16, i16, i16, i32, i8, i8 }
%union.anon.83 = type { i64, [72 x i8] }
%struct.pgd_t = type { i64 }
%struct.p4d_t = type { i64 }
%struct.pud_t = type { i64 }

@boot_cpu_data = external dso_local global %struct.cpuinfo_x86, align 8
@page_offset_base = external dso_local local_unnamed_addr global i64, align 8
@pgdir_shift = external dso_local local_unnamed_addr global i32, align 4
@__force_order = external dso_local global i64, align 8
@ptrs_per_p4d = external dso_local local_unnamed_addr global i32, align 4

define i64 @early_ioremap_pmd(i64 %addr) {
; CHECK-LABEL: early_ioremap_pmd:
; CHECK:       # %bb.0: # %entry
; CHECK-NEXT:    #APP
; CHECK-NEXT:    movq %cr3, %rax
; CHECK-EMPTY:
; CHECK-NEXT:    #NO_APP
; CHECK-NEXT:    movabsq $9223372036854771712, %rdx # imm = 0x7FFFFFFFFFFFF000
; CHECK-NEXT:    andq %rax, %rdx
; CHECK-NEXT:    movb {{.*}}(%rip), %al
; CHECK-NEXT:    movq {{.*}}(%rip), %rcx
; CHECK-NEXT:    shrxq %rax, %rdi, %rax
; CHECK-NEXT:    addq %rcx, %rdx
; CHECK-NEXT:    andl $511, %eax # imm = 0x1FF
; CHECK-NEXT:    leaq (%rdx,%rax,8), %rax
; CHECK-NEXT:    #APP
; CHECK-NEXT:  .Ltmp2:
; CHECK-NEXT:    jmp .Ltmp3
; CHECK-NEXT:  .Ltmp4:
; CHECK-NEXT:    .zero (-(((.Ltmp5-.Ltmp6)-(.Ltmp4-.Ltmp2))>0))*((.Ltmp5-.Ltmp6)-(.Ltmp4-.Ltmp2)),144
; CHECK-NEXT:  .Ltmp7:
entry:
  %0 = tail call i64 asm sideeffect "mov %cr3,$0\0A\09", "=r,=*m,~{dirflag},~{fpsr},~{flags}"(i64* nonnull @__force_order)
  %and.i = and i64 %0, 9223372036854771712
  %1 = load i64, i64* @page_offset_base, align 8
  %add = add i64 %and.i, %1
  %2 = inttoptr i64 %add to %struct.pgd_t*
  %3 = load i32, i32* @pgdir_shift, align 4
  %sh_prom = zext i32 %3 to i64
  %shr = lshr i64 %addr, %sh_prom
  %and = and i64 %shr, 511
  %arrayidx = getelementptr %struct.pgd_t, %struct.pgd_t* %2, i64 %and
  callbr void asm sideeffect "1: jmp 6f\0A2:\0A.skip -(((5f-4f) - (2b-1b)) > 0) * ((5f-4f) - (2b-1b)),0x90\0A3:\0A.section .altinstructions,\22a\22\0A .long 1b - .\0A .long 4f - .\0A .word ${1:P}\0A .byte 3b - 1b\0A .byte 5f - 4f\0A .byte 3b - 2b\0A.previous\0A.section .altinstr_replacement,\22ax\22\0A4: jmp ${5:l}\0A5:\0A.previous\0A.section .altinstructions,\22a\22\0A .long 1b - .\0A .long 0\0A .word ${0:P}\0A .byte 3b - 1b\0A .byte 0\0A .byte 0\0A.previous\0A.section .altinstr_aux,\22ax\22\0A6:\0A testb $2,$3\0A jnz ${4:l}\0A jmp ${5:l}\0A.previous\0A", "i,i,i,*m,X,X,~{dirflag},~{fpsr},~{flags}"(i16 528, i32 117, i32 1, i8* getelementptr inbounds (%struct.cpuinfo_x86, %struct.cpuinfo_x86* @boot_cpu_data, i64 0, i32 12, i32 1, i64 58), i8* blockaddress(@early_ioremap_pmd, %if.end.i), i8* blockaddress(@early_ioremap_pmd, %if.then.i))
          to label %_static_cpu_has.exit.thread.i [label %if.end.i, label %if.then.i]

_static_cpu_has.exit.thread.i:                    ; preds = %entry
  br label %if.end.i

if.then.i:                                        ; preds = %entry
  %4 = bitcast %struct.pgd_t* %arrayidx to %struct.p4d_t*
  br label %p4d_offset.exit

if.end.i:                                         ; preds = %_static_cpu_has.exit.thread.i, %entry
  %coerce.dive.i = getelementptr inbounds %struct.pgd_t, %struct.pgd_t* %arrayidx, i64 0, i32 0
  %5 = load i64, i64* %coerce.dive.i, align 8
  %6 = inttoptr i64 %5 to %struct.p4d_t*
  %7 = load i32, i32* @ptrs_per_p4d, align 4
  %sub.i.i = add i32 %7, 33554431
  %8 = and i32 %sub.i.i, 33554431
  %and.i1.i = zext i32 %8 to i64
  %add.ptr.i = getelementptr %struct.p4d_t, %struct.p4d_t* %6, i64 %and.i1.i
  br label %p4d_offset.exit

p4d_offset.exit:                                  ; preds = %if.end.i, %if.then.i
  %retval.0.i = phi %struct.p4d_t* [ %add.ptr.i, %if.end.i ], [ %4, %if.then.i ]
  %coerce.dive.i12 = getelementptr inbounds %struct.p4d_t, %struct.p4d_t* %retval.0.i, i64 0, i32 0
  %9 = load i64, i64* %coerce.dive.i12, align 8
  %and.i.i13 = and i64 %9, 4503599627366400
  %add.i.i14 = add i64 %and.i.i13, %1
  %10 = inttoptr i64 %add.i.i14 to %struct.pud_t*
  %coerce.dive.i16 = getelementptr %struct.pud_t, %struct.pud_t* %10, i64 511, i32 0
  %11 = load i64, i64* %coerce.dive.i16, align 8
  %tobool.i.i.i = icmp slt i64 %11, 0
  %..i.i.i = select i1 %tobool.i.i.i, i64 4503598553628672, i64 4503599627366400
  ret i64 %..i.i.i
}
jyknight updated this revision to Diff 273860. Jun 26 2020, 4:34 PM
jyknight marked 23 inline comments as done.

Differences in new patch, since previous one:

  • Folded in void's change (but I made it in the isSchedulingBoundary() functions, rather than the two callers).
  • Marked the INLINEASM_BR instruction as always having unmodeled side-effects, rather than only when explicitly marked so. (Not to fix any known miscompilation -- clang always marked it as such anyhow.)
  • Removed FIXME comments, per review comments.

After working on the previously mentioned unreachable-block-end cleanup for a bit (not done yet), and thinking about this patch more, I actually think it's okay to go ahead with this change now. The potential problem I was worried about: if you have an INLINEASM_BR in a basic block whose end is unreachable, and that block is followed by a basic block which is one of the indirect targets of the INLINEASM_BR, the code will think the first block falls through, when it does not.

I'm no longer concerned about this now, for two reasons:

  1. I currently _suspect_ it's not actually possible for this to happen, because of the way we restrict handling/merging blocks. It will never be created that way (because callbr is a terminator in IR), and I currently believe we'll never merge the blocks to create that situation, because the inlineasm_br block has multiple successors. I am not certain it can never happen, but at least I've not been able to induce it to occur. (This relates to another worry I had previously -- that we might end up with a block that has both a Call instruction that could throw, and an inlineasm_br. Or, for that matter, two inlineasm_br instructions in the same MachineBasicBlock. But, even though it's not a terminator in MI, I believe this will not occur.)
  2. Even if there _is_ some way that might happen, the badness that might occur is limited. Assuming there's a possible fallthrough, when it is in fact impossible, and the block is only an exceptional successor, seems unlikely to be a problem. We may insert an extraneous jump upon block rearrangement, for example, but that would not be truly problematic.

So, I think this is ready for another round of review and then to submit.

Thanks!

llvm/include/llvm/CodeGen/MachineBasicBlock.h
486

(we do actually have MachineBasicBlock::hasAddressTaken().)

I believe the change I have here resolves efriedma's concerns you've linked, by removing the mapping of source-block to target-block. It now overestimates, but should be conservatively correct, even in the face of instructions being moved/split/etc, because the _target_ cannot be so transformed.

llvm/lib/CodeGen/MachineBasicBlock.cpp
281–284

I'm worried that iterating over all the instructions in the block to retrieve this information would be bad for performance.

I don't think the overestimating here is likely to be a major optimization issue, but if it turns out to be, I'd like to revisit in a future patch.

288

I think it's correct -- in that it returns false strictly more often than it needs to. Just the same as we could sometimes hoist into a block which has an invoke in it, we could sometimes do so for an inlineasm_br.

llvm/lib/CodeGen/SplitKit.cpp
100

The code below, which is invoked when LIP.second is set, simply decides which of LIP.first or LIP.second to return. Letting it take the !LIP.second codepath has the same effect, with marginally less work.

llvm/lib/Target/Hexagon/BitTracker.cpp
958

Thanks for the explanation.

llvm/test/CodeGen/ARM/ifcvt-diamond-unanalyzable-common.mir
66

I changed the test to a different mechanism of testing unanalyzable branch sequences, since INLINEASM_BR doesn't trigger the problem anymore.

void accepted this revision. Jun 26 2020, 6:06 PM

Thanks! :-)

This revision is now accepted and ready to land. Jun 26 2020, 6:06 PM
nickdesaulniers marked an inline comment as done. Jun 29 2020, 2:18 PM

The only other concern I have is whether we have enough test coverage of the scheduling boundary changes. (Does removing the added checks there cause any existing or new test from the change to fail? If not, that seems like a lack of test coverage.) In the case of PostRASchedulerList, that implies a MIR test (writing MIR tests isn't something I'd wish on my worst enemy though).

llvm/include/llvm/CodeGen/MachineBasicBlock.h
477

Sorry to bikeshed the name of this method, but I find it currently doesn't match the implementation well.

I'd take hasInlineAsmBr to mean "this MachineBasicBlock has an INLINEASM_BR MCInst." Instead, the implementation is checking whether any of the successors of this MachineBasicBlock is the indirect target of an INLINEASM_BR MCInst. Those two aren't the same thing. In fact, you could have a MachineBasicBlock where the current implementation of hasInlineAsmBr would return true, and yet the MachineBasicBlock does not actually contain an INLINEASM_BR MCInst.

I know that's what you're alluding to in the comment, but I think the method should be named hasInlineAsmBrSuccessors or something of the sort. Maybe MaybeHasInlineAsmBr since it sounds like you'd rather it not be precise by scanning all instructions for the relevant opcode? (Couldn't ISEL mark these when creating the MBB? I guess having a bool member is tricky, since transforms would have to update that correctly. In this case, rematerializing the value by rescanning is simpler, and harder to get wrong.)

I guess the name sounds precise, though the comment and implementation denote that it's not, and that's what I'm most hung up on.

llvm/lib/CodeGen/BranchFolding.cpp
1710

was removing the !isEHPad check intentional?

llvm/lib/CodeGen/MachineBasicBlock.cpp
281–284

This could at least use a range-for:

for (const MachineBasicBlock *Succ : successors())
  if (Succ->isInlineAsmBrIndirectTarget())
    return true;
return false;
llvm/lib/CodeGen/MachineVerifier.cpp
584

should the report string be updated, too, to mention INLINEASM_BR fallthrough/default target?

llvm/lib/CodeGen/SplitKit.cpp
108

I wonder why we don't use a reverse const iterator here?

llvm/lib/Target/ARM/ARMBaseInstrInfo.cpp
2020 ↗(On Diff #273860)

I think there's one more override of this method that was missed: AArch64InstrInfo::isSchedulingBoundary in llvm/lib/Target/AArch64/AArch64InstrInfo.cpp? Ah, never mind, it defers to TargetInstrInfo::isSchedulingBoundary first.

llvm/test/CodeGen/X86/callbr-asm-outputs.ll
33

@void 's earlier comments mentioned modifications to this test case's movl $-1, %eax. Are those still needed?

void added a comment. Jun 29 2020, 2:32 PM

The only other concern I have is whether we have enough test coverage of the scheduling boundary changes. (Does removing the added checks there cause any existing or new test from the change to fail? If not, that seems like a lack of test coverage.) In the case of PostRASchedulerList, that implies a MIR test (writing MIR tests isn't something I'd wish on my worst enemy though).

The reason it wasn't failing before is that INLINEASM_BR was a terminator, and the boundary check automatically caught it. The existing test, llvm/test/CodeGen/X86/callbr-asm-outputs.ll, did fail though. So at least we're covered there.

Don't get me wrong, I'm always up for adding more tests, but I think this change is okay in that regard. :-)

void added inline comments. Jun 29 2020, 2:34 PM
llvm/test/CodeGen/X86/callbr-asm-outputs.ll
33

No, those are taken care of by the scheduling boundary check.

jyknight marked 14 inline comments as done. Jun 29 2020, 4:12 PM
jyknight added inline comments.
llvm/include/llvm/CodeGen/MachineBasicBlock.h
477

Renamed to mayHaveInlineAsmBr. I don't like "hasInlineAsmBrSuccessors", because the important property is whether there is such an instruction in the block. That the current implementation tests this by looking at the successors is just an implementation detail.

It would be nice for it to be precise, but as I mentioned in the other thread, it's probably not really worth the expense.
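The shape of the final query can be sketched with a toy model (this `Block` struct is a stand-in for illustration, not LLVM's MachineBasicBlock API): the result over-approximates "contains an INLINEASM_BR" but, assuming indirect targets are registered correctly, never misses a block that does contain one.

```cpp
#include <algorithm>
#include <vector>

struct Block {
  bool isIndirectTarget = false;    // set when some INLINEASM_BR targets us
  std::vector<const Block *> succs; // successor blocks

  // Conservative query: true if any successor is an indirect target, so it
  // can return true for a block holding no INLINEASM_BR at all, but it is
  // never false for a block whose INLINEASM_BR targets are registered.
  bool mayHaveInlineAsmBr() const {
    return std::any_of(succs.begin(), succs.end(), [](const Block *s) {
      return s->isIndirectTarget;
    });
  }
};
```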

llvm/lib/CodeGen/BranchFolding.cpp
1710

Yes, we no longer need to filter out EH pads, because it's now only looking at the analyzeBranch results CurFBB/CurTBB, not all successors.

llvm/lib/CodeGen/MachineVerifier.cpp
584

Maybe it should've before, but this change _removed_ the allowance for that.

llvm/lib/CodeGen/SplitKit.cpp
108

Who knows. =) Changed.

jyknight updated this revision to Diff 274281. Jun 29 2020, 4:12 PM
jyknight marked 4 inline comments as done.

Minor fixups.

nickdesaulniers accepted this revision. Jun 29 2020, 5:16 PM
This revision was automatically updated to reflect the committed changes.
MatzeB added a comment (edited). Oct 4 2021, 11:15 AM

So effectively we have an extended basic-block model in the LLVM backend now (blocks have 1 entry, but multiple exits that can be in the middle of the block). I know this was already the case for exception control-flow, but this here cemented it further. With that not being documented anywhere, and being different from LLVM IR/middleend, we are probably asking for trouble...

See also: https://bugs.llvm.org/show_bug.cgi?id=27659

We used to consider the invoke mechanism more of a bug/misdesign. But I guess without anyone cleaning this up it was just a matter of time for other uses like this to do the same :-(

void added a comment (edited). Oct 4 2021, 12:26 PM

So effectively we have an extended basic-block model in the LLVM backend now (blocks have 1 entry, but multiple exits that can be in the middle of the block). I know this was already the case for exception control-flow, but this here cemented it further. With that not being documented anywhere, and being different from LLVM IR/middleend, we are probably asking for trouble...

We took great pains to ensure that nothing could be moved past it, so that the semantics won't change and we can reload registers, etc.. Not being able to 100% model the control flow of an assembly block already existed with current assembly blocks. There's no guarantee that a given assembly block won't have branches, or loops, or calls, or exits. None of which is modeled in MIR.

We took great pains to ensure that nothing could be moved past it, so that the semantics won't change and we can reload registers, etc.

Yes, I see that this is going through great pain to ensure things still work. But this all feels to me like you are making a form of EBBs work here. There is a way to jump out of the basic block without having all instructions in it executed. There must be some cases where COPY or SPILL instructions end up after the branch now, which brings us into this trouble.

  • I realize that this was painful to model since LLVM was never prepared to have terminator instructions produce values. I guess we lack the time+expertise to fix this after the fact now :-(
  • The design was already messed up before INLINEASM_BR for exception handling.

So I don't know what to do here except rant and predict random bugs popping up over the years because intuition about how basic blocks work is broken.

Not being able to 100% model the control flow of an assembly block already existed with current assembly blocks. There's no guarantee that a given assembly block won't have branches, or loops, or calls, or exits. None of which is modeled in MIR.

Control flow within an assembly instruction is not a problem; similar to how a CALL instruction can also trigger arbitrary control flow, we can ignore that in the modeling since we have a guarantee that control eventually continues after the instruction.

Any thoughts on how to fix this without regressing support for asm goto w/ outputs?

void added a comment.Oct 4 2021, 1:16 PM

We took great pains to ensure that nothing could be moved past it, so that the semantics won't change and we can reload registers, etc.

Yes, I see that this is going through great pain to ensure things still work. But this all feels to me like you are making a form of EBBs work here. There is a way to jump out of the basic block without having all instructions in it executed. There must be some cases where COPY or SPILL instructions end up after the branch now, which brings us into this trouble.

This is a general issue with "asm goto", not just with "asm goto with outputs". Someone could stomp on registers, memory, and the like and leap to BFE without the necessary COPY / SPILL instructions to clean things up. We skirt around some of these issues by saying that outputs aren't valid on the indirect branch.

There's a reason why "asm goto" wasn't implemented in LLVM until we were forced to do it...

  • I realize that this was painful to model since LLVM was never prepared to have terminator instructions produce values. I guess we lack the time+expertise to fix this after the fact now :-(
  • The design was already messed up before INLINEASM_BR for exception handling.

So I don't know what to do here except rant and predict random bugs popping up over the years because intuition about how basic blocks work is broken.

One method to perhaps address your concerns would be to create a "terminating" COPY instruction, which would allow reloading registers after the ASM block. There were some sticky issues with it, and going this route was much easier. (Yes, I know that's not the best reason to do something, but...) However, the only issue a hypothetical terminating COPY would solve is instructions invalidly moving past the INLINEASM_BR.

I would *love* to fix this. Chris claims that MLIR would solve all of this. (nudge...nudge) :-)

MatzeB added a comment.Oct 4 2021, 1:40 PM

This is a general issue with "asm goto", not just with "asm goto with outputs". Someone could stomp on registers, memory, and the like and leap to BFE without the necessary COPY / SPILL instructions to clean things up. We skirt around some of these issues by saying that outputs aren't valid on the indirect branch.

Not sure I follow here... if someone stomps on registers in inline-asm (without declaring that in the inputs/outputs) then it's obviously a bug in the input and not something the compiler can fix.

Any thoughts on how to fix this without regressing support for asm goto w/ outputs?

For the record: I am not advocating reverting any patches at this point; I am obviously a year too late for the review, and, asm-goto aside, the same problem exists for exception control flow too. I just feel that we should be aware of what we are doing and call things by their name :)

So unless I am missing something here the whole problem is about terminator instructions producing values. Correct me if I am missing some other aspect of the discussion.

So about supporting outputs in terminator instructions: the tricky part is that regalloc typically expects to be able to place things after the definition of a value (typically spills or COPYs). The reason this is tricky is that for a terminator you are forced to place those things into the successor block(s) instead. Placing the operations into the successor blocks can in turn be tricky if the successors have multiple predecessors ("critical edges"), because then we can end up wrongfully executing the instructions when coming from the other predecessors. So that would require either being in a position to split the critical edges as needed, or having the register allocator be robust enough to deal with spills/COPYs being executed in more situations than desired in the case of critical edges. If I understand the asm-goto construct correctly, the asm labels have enough abstraction that breaking critical edges is always possible (you are forced to list all possible targets in the ASM instruction, the compiler can replace one block with another, and you cannot jump to arbitrary computed destinations, right?).

MatzeB added a comment.Oct 4 2021, 1:44 PM

One method to perhaps address your concerns would be to create a "terminating" COPY instruction, which would allow reloading registers after the ASM block. There were some sticky issues with it, and going this route was much easier. (Yes, I know that's not the best reason to do something, but...) However, the only issue a hypothetical terminating COPY would solve is instructions invalidly moving past the INLINEASM_BR.

But that still would mean we end up with terminator instructions that assign registers, so we're still in this regalloc grey-area because that is technically not supported.

void added a comment.Oct 4 2021, 1:56 PM

One method to perhaps address your concerns would be to create a "terminating" COPY instruction, which would allow reloading registers after the ASM block. There were some sticky issues with it, and going this route was much easier. (Yes, I know that's not the best reason to do something, but...) However, the only issue a hypothetical terminating COPY would solve is instructions invalidly moving past the INLINEASM_BR.

But that still would mean we end up with terminator instructions that assign registers, so we're still in this regalloc grey-area because that is technically not supported.

Yup! You identified the issue we faced. :-)

The design was already messed up before INLINEASM_BR for exception handling.

To me, this is the main point.

Given the existing handling for invoke, I still believe that using the same underlying mechanisms for inlineasm_br was the best implementation choice, because inlineasm_br behaves almost exactly like an invoke with a user-defined custom calling convention.

If you can come up with a proposal to handle invoke differently, then I expect inlineasm_br should fit easily into that.