This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Always split blocks for si_end_cf
Accepted · Public

Authored by arsenm on Mar 5 2023, 12:47 PM.

Details

Reviewers

rampitec
critson
foad
nhaehnle
cdevadas

Group Reviewers

Restricted Project

Summary

This fixes incorrect VGPR spill placement with fastregalloc.
If the first regalloc run inserted SGPR spills for values used for the
exec mask, those instructions would break the block prolog. The
second regalloc run would then incorrectly insert VGPR spills before
the point where exec was set up.

Fixes #61083
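
To make the failure mode in the summary concrete, here is a minimal standalone C++ sketch. It is a toy model, not LLVM code: it assumes a much-simplified prolog rule (only a leading run of exec writes counts as the block prolog), and the names `Kind`, `skipProlog`, `reloadLandsAfterExecWrites`, `unsplitBlock`, and `splitTailBlock` are hypothetical. It only illustrates why an SGPR reload ahead of the exec restore pushes a later VGPR reload in front of the exec write, and why moving the exec write into a terminator of a separate block avoids that.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy instruction categories; illustrative only, not LLVM opcodes.
enum class Kind { ExecWrite, SGPRReload, VGPRReload, VectorOp };

struct Inst {
  Kind K;
  std::string Text;
};

using Block = std::vector<Inst>;

// Assumed, simplified prolog rule: the prolog is the leading run of exec
// writes. Returns the index of the first non-prolog instruction.
std::size_t skipProlog(const Block &B) {
  std::size_t I = 0;
  while (I < B.size() && B[I].K == Kind::ExecWrite)
    ++I;
  return I;
}

// Inserts a VGPR reload at the prolog boundary and reports whether every
// exec write in the block precedes the inserted reload.
bool reloadLandsAfterExecWrites(Block B) {
  const std::size_t Pos = skipProlog(B);
  assert(Pos <= B.size());
  B.insert(B.begin() + static_cast<std::ptrdiff_t>(Pos),
           Inst{Kind::VGPRReload, "vgpr reload"});
  for (std::size_t I = Pos + 1; I < B.size(); ++I)
    if (B[I].K == Kind::ExecWrite)
      return false;
  return true;
}

// Unsplit block: an SGPR reload feeding the exec restore comes first, so the
// prolog scan stops immediately at index 0.
Block unsplitBlock() {
  Block B;
  B.push_back(Inst{Kind::SGPRReload, "sgpr reload of saved mask"});
  B.push_back(Inst{Kind::ExecWrite, "s_or exec, exec, mask"});
  B.push_back(Inst{Kind::VectorOp, "buffer_load"});
  return B;
}

// Tail block after splitting: the exec write is now the terminator of the
// preceding block, so only the vector code remains here.
Block splitTailBlock() {
  Block B;
  B.push_back(Inst{Kind::VectorOp, "buffer_load"});
  return B;
}
```

In this model, `reloadLandsAfterExecWrites(unsplitBlock())` is false because the reload is placed at index 0, ahead of the exec write, while `reloadLandsAfterExecWrites(splitTailBlock())` is true because the tail block contains no exec write at all.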

Diff Detail

Event Timeline

arsenm created this revision. Mar 5 2023, 12:47 PM

Herald added a project: Restricted Project. Mar 5 2023, 12:47 PM

Herald added subscribers: kosarev, StephenFan, kerbowa and 8 others.

arsenm requested review of this revision. Mar 5 2023, 12:47 PM

Herald added a project: Restricted Project. Mar 5 2023, 12:47 PM

Herald added a subscriber: wdng.

arsenm added a parent revision: D145323: AMDGPU: Fix LiveVariables verifier error for values defined before SI_END_CF. Mar 5 2023, 12:47 PM

Harbormaster completed remote builds in B217447: Diff 502465. Mar 5 2023, 12:48 PM

Unfortunately this interferes with WQM mode change insertion.
You can see this in the reordered s_or + s_and instruction pairs.
I guess this was always a risk with block splitting.

Seems like we need to modify the WQM pass to handle terminators that modify exec.

llvm/test/CodeGen/AMDGPU/collapse-endcf.ll
266	Is this reordering fixing the bug mentioned in the description? (Exec mask is restored before buffer_load, rather than after.)

Mztea928 added a child revision: D145351: [cmake] Export component info needed to determine which libraries are in libLLVM.so.. Mar 5 2023, 10:16 PM

I was assuming WQM needs to split blocks more aggressively itself to avoid the same problems

llvm/test/CodeGen/AMDGPU/collapse-endcf.ll
266	Yes, previously we would only correctly handle spills used for the exec source value, not other spills

arsenm mentioned this in D87543: AMDGPU: Always split si_end_cf blocks. Mar 30 2023, 7:13 PM

I know we've talked about this in the past but I'm not a fan because it's sort of difficult to justify with a "clean" foundational semantics story.

I think a clean story would be:

Every basic block has a canonical exec mask
Basic blocks can have prologs and epilogs in which the canonical exec mask doesn't apply. (SI_END_CF is the prolog case.)

If SI_END_CF is split off, the new basic block containing the SI_END_CF still doesn't have a canonical exec mask that makes sense.

Yes, what I'm thinking of as a clean story would require changing the register allocator a little.

In D145329#4236105, @nhaehnle wrote:

Yes, what I'm thinking of as a clean story would require changing the register allocator a little.

OK, so we'll leave things broken for at least 5 more years. Terminators and phis are the only insertion points we can reasonably expect to work today, so I think we just need to do this. We could explore better options whenever we get to trying to explicitly track both CFGs in the IR at the same time. Right now I think teaching more passes to understand consecutive fall-through blocks is the lower barrier to entry.

My intention is to sit down and try to make WQM work with this.
It is a non-trivial change, so no promises, but I will try to look at it in the next few days.

In D145329#4245383, @critson wrote:

My intention is to sit down and try to make WQM work with this.
It is a non-trivial change, so no promises, but I will try to look at it in the next few days.

ping on this

critson mentioned this in D151797: [AMDGPU] WQM: Allow insertion of exact mode transition as terminator. May 31 2023, 5:06 AM

Sorry for the delay. I believe D151797 should allow this to proceed.

I will also look at adding block splitting to WQM, but I need to take care with this as it can have some unintended effects on reg alloc.
In general we have not had to worry about spilling correctness in graphics for pixel shaders (where WQM occurs) as spilling is generally a no-go for having good performance in graphics.

critson mentioned this in rG2e87ed80b23a: [AMDGPU] WQM: Allow insertion of exact mode transition as terminator. Jun 1 2023, 10:01 PM

Rebase on top of wqm patches

Harbormaster completed remote builds in B236907: Diff 528807. Jun 6 2023, 6:44 AM

LGTM w.r.t. WQM test changes.

I will run some local testing for verification, but feel free to proceed with this.

This revision is now accepted and ready to land. Jun 8 2023, 10:04 PM

ping @arsenm could you finalize this patch and close https://github.com/llvm/llvm-project/issues/61083?

In D145329#4410519, @ye-luo wrote:

ping @arsenm could you finalize this patch and close https://github.com/llvm/llvm-project/issues/61083?

Testing found a new crash in blender so I need to look at this more

In D145329#4411300, @arsenm wrote:

In D145329#4410519, @ye-luo wrote:

ping @arsenm could you finalize this patch and close https://github.com/llvm/llvm-project/issues/61083?

Testing found a new crash in blender so I need to look at this more

Any progress?

In D145329#4442143, @ye-luo wrote:

In D145329#4411300, @arsenm wrote:

In D145329#4410519, @ye-luo wrote:

ping @arsenm could you finalize this patch and close https://github.com/llvm/llvm-project/issues/61083?

Testing found a new crash in blender so I need to look at this more

Any progress?

I'm having a really hard time getting the last version of Blender that supports OpenCL to build. However, I have found a few machine verifier errors from its kernels, and fixing those will hopefully also fix the precheck crash

The good news: the blender failure has seemingly disappeared (I'd assume it's related to the spill patches but I'm not going to bother tracking down why)
The bad news: now a few rocBLAS tests fail

Revision Contents

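Path	Size
llvm/lib/Target/AMDGPU/SILowerControlFlow.cpp	40 lines
llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.wqm.demote.ll	36 lines
llvm/test/CodeGen/AMDGPU/block-should-not-be-in-alive-blocks.mir	12 lines
llvm/test/CodeGen/AMDGPU/branch-folding-implicit-def-subreg.ll	151 lines
llvm/test/CodeGen/AMDGPU/collapse-endcf.ll	102 lines
llvm/test/CodeGen/AMDGPU/collapse-endcf.mir	165 lines
llvm/test/CodeGen/AMDGPU/control-flow-fastregalloc.ll	10 lines
llvm/test/CodeGen/AMDGPU/global-atomics-fp.ll	248 lines
llvm/test/CodeGen/AMDGPU/mubuf-legalize-operands-non-ptr-intrinsics.ll	8 lines
llvm/test/CodeGen/AMDGPU/mubuf-legalize-operands.ll	8 lines
llvm/test/CodeGen/AMDGPU/tuple-allocation-failure.ll	214 lines
llvm/test/CodeGen/AMDGPU/uniform-phi-with-undef.ll	1 line
llvm/test/CodeGen/AMDGPU/vgpr-spill-placement-issue61083.ll	9 lines
llvm/test/CodeGen/AMDGPU/wave32.ll	56 lines
llvm/test/CodeGen/AMDGPU/wwm-reserved-spill.ll	8 lines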

Diff 528807

llvm/lib/Target/AMDGPU/SILowerControlFlow.cpp

[471 lines hidden]	do {
B = Succ;		B = Succ;
} while (true);		} while (true);
}		}

MachineBasicBlock *SILowerControlFlow::emitEndCf(MachineInstr &MI) {		MachineBasicBlock *SILowerControlFlow::emitEndCf(MachineInstr &MI) {
MachineBasicBlock &MBB = *MI.getParent();		MachineBasicBlock &MBB = *MI.getParent();
const DebugLoc &DL = MI.getDebugLoc();		const DebugLoc &DL = MI.getDebugLoc();

MachineBasicBlock::iterator InsPt = MBB.begin();		MachineBasicBlock::iterator InsPt = MI;

// If we have instructions that aren't prolog instructions, split the block
// and emit a terminator instruction. This ensures correct spill placement.
// FIXME: We should unconditionally split the block here.
bool NeedBlockSplit = false;
Register DataReg = MI.getOperand(0).getReg();		Register DataReg = MI.getOperand(0).getReg();
for (MachineBasicBlock::iterator I = InsPt, E = MI.getIterator();
I != E; ++I) {
if (I->modifiesRegister(DataReg, TRI)) {
NeedBlockSplit = true;
break;
}
}

unsigned Opcode = OrOpc;		// If we have instructions that aren't prolog instructions, split the block
MachineBasicBlock *SplitBB = &MBB;		// and emit a terminator instruction. This ensures correct spill placement
if (NeedBlockSplit) {		// relative to exec writes.
SplitBB = MBB.splitAt(MI, /*UpdateLiveIns*/true, LIS);		MachineBasicBlock *SplitBB = MBB.splitAt(MI, /*UpdateLiveIns=*/true, LIS);
if (MDT && SplitBB != &MBB) {		if (MDT && SplitBB != &MBB) {
MachineDomTreeNode *MBBNode = (*MDT)[&MBB];		MachineDomTreeNode *MBBNode = (*MDT)[&MBB];
SmallVector<MachineDomTreeNode *> Children(MBBNode->begin(),		SmallVector<MachineDomTreeNode *> Children(MBBNode->begin(),
MBBNode->end());		MBBNode->end());
MachineDomTreeNode *SplitBBNode = MDT->addNewBlock(SplitBB, &MBB);		MachineDomTreeNode *SplitBBNode = MDT->addNewBlock(SplitBB, &MBB);
for (MachineDomTreeNode *Child : Children)		for (MachineDomTreeNode *Child : Children)
MDT->changeImmediateDominator(Child, SplitBBNode);		MDT->changeImmediateDominator(Child, SplitBBNode);
}		}
Opcode = OrTermrOpc;
InsPt = MI;
}

MachineInstr *NewMI =		MachineInstr *NewMI =
BuildMI(MBB, InsPt, DL, TII->get(Opcode), Exec)		BuildMI(MBB, InsPt, DL, TII->get(OrTermrOpc), Exec)
.addReg(Exec)		.addReg(Exec)
.add(MI.getOperand(0));		.add(MI.getOperand(0));
if (LV) {		if (LV) {
LV->replaceKillInstruction(DataReg, MI, *NewMI);		LV->replaceKillInstruction(DataReg, MI, *NewMI);

if (SplitBB != &MBB) {		if (SplitBB != &MBB) {
// Track the set of registers defined in the original block so we don't		// Track the set of registers defined in the original block so we don't
// accidentally add the original block to AliveBlocks. AliveBlocks only		// accidentally add the original block to AliveBlocks. AliveBlocks only
[435 lines hidden]
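
The dominator tree fix-up in the SILowerControlFlow.cpp hunk above can be illustrated with a small standalone C++ sketch. This is a toy model under stated assumptions, not the LLVM API: the tree is a plain map from block name to immediate dominator, and `IDomMap` and `reparentAfterSplit` are hypothetical names. It also assumes the split block takes over all successors of the original block, so that every former dominator-tree child of the original block is now dominated by the split block. The order of operations mirrors the patch: snapshot the children of the original block, register the split block under it, then move each child.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Toy dominator tree: maps each block name to the name of its immediate
// dominator. The root has no entry.
using IDomMap = std::map<std::string, std::string>;

// After splitting `Orig` so that `Split` holds its tail, make `Split` a child
// of `Orig` and re-parent all former children of `Orig` under `Split`.
void reparentAfterSplit(IDomMap &IDom, const std::string &Orig,
                        const std::string &Split) {
  assert(IDom.count(Split) == 0 && "split block must be new to the tree");

  // Snapshot the children first, so the newly added split block is not
  // mistaken for one of the children that need to move.
  std::vector<std::string> Children;
  for (const auto &Entry : IDom)
    if (Entry.second == Orig)
      Children.push_back(Entry.first);

  IDom[Split] = Orig;
  for (const std::string &Child : Children)
    IDom[Child] = Split;
}
```

For example, with `bb.1` and `bb.7` immediately dominated by `bb.5`, calling `reparentAfterSplit(IDom, "bb.5", "bb.8")` leaves `bb.8` under `bb.5` and moves both `bb.1` and `bb.7` under `bb.8`.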

llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.wqm.demote.ll

	[879 lines hidden]
	}			}

	define amdgpu_ps void @wqm_deriv_loop(<2 x float> %input, float %arg, i32 %index, i32 %limit) {			define amdgpu_ps void @wqm_deriv_loop(<2 x float> %input, float %arg, i32 %index, i32 %limit) {
	; SI-LABEL: wqm_deriv_loop:			; SI-LABEL: wqm_deriv_loop:
	; SI: ; %bb.0: ; %.entry			; SI: ; %bb.0: ; %.entry
	; SI-NEXT: s_mov_b64 s[0:1], exec			; SI-NEXT: s_mov_b64 s[0:1], exec
	; SI-NEXT: s_wqm_b64 exec, exec			; SI-NEXT: s_wqm_b64 exec, exec
	; SI-NEXT: v_cvt_i32_f32_e32 v0, v0			; SI-NEXT: v_cvt_i32_f32_e32 v0, v0
	; SI-NEXT: s_mov_b32 s4, 0			; SI-NEXT: s_mov_b32 s6, 0
	; SI-NEXT: v_cmp_ne_u32_e32 vcc, 0, v0			; SI-NEXT: v_cmp_ne_u32_e32 vcc, 0, v0
	; SI-NEXT: s_and_saveexec_b64 s[2:3], vcc			; SI-NEXT: s_and_saveexec_b64 s[2:3], vcc
	; SI-NEXT: s_xor_b64 s[2:3], exec, s[2:3]			; SI-NEXT: s_xor_b64 s[4:5], exec, s[2:3]
	; SI-NEXT: s_cbranch_execz .LBB7_3			; SI-NEXT: s_cbranch_execz .LBB7_3
	; SI-NEXT: ; %bb.1: ; %.demote0			; SI-NEXT: ; %bb.1: ; %.demote0
	; SI-NEXT: s_andn2_b64 s[0:1], s[0:1], exec			; SI-NEXT: s_andn2_b64 s[0:1], s[0:1], exec
	; SI-NEXT: s_cbranch_scc0 .LBB7_9			; SI-NEXT: s_cbranch_scc0 .LBB7_9
	; SI-NEXT: ; %bb.2: ; %.demote0			; SI-NEXT: ; %bb.2: ; %.demote0
	; SI-NEXT: s_wqm_b64 s[6:7], s[0:1]			; SI-NEXT: s_wqm_b64 s[2:3], s[0:1]
	; SI-NEXT: s_and_b64 exec, exec, s[6:7]			; SI-NEXT: s_and_b64 exec, exec, s[2:3]
	; SI-NEXT: .LBB7_3: ; %.continue0.preheader			; SI-NEXT: .LBB7_3: ; %.continue0.preheader
	; SI-NEXT: s_or_b64 exec, exec, s[2:3]
	; SI-NEXT: s_mov_b64 s[2:3], 0			; SI-NEXT: s_mov_b64 s[2:3], 0
	; SI-NEXT: v_mov_b32_e32 v0, s4			; SI-NEXT: s_or_b64 exec, exec, s[4:5]
				; SI-NEXT: v_mov_b32_e32 v0, s6
	; SI-NEXT: s_branch .LBB7_5			; SI-NEXT: s_branch .LBB7_5
	; SI-NEXT: .LBB7_4: ; %.continue1			; SI-NEXT: .LBB7_4: ; %.continue1
	; SI-NEXT: ; in Loop: Header=BB7_5 Depth=1			; SI-NEXT: ; in Loop: Header=BB7_5 Depth=1
	; SI-NEXT: s_or_b64 exec, exec, s[4:5]			; SI-NEXT: s_or_b64 exec, exec, s[4:5]
	; SI-NEXT: v_add_u32_e32 v0, vcc, 1, v0			; SI-NEXT: v_add_u32_e32 v0, vcc, 1, v0
	; SI-NEXT: v_cmp_ge_i32_e32 vcc, v0, v1			; SI-NEXT: v_cmp_ge_i32_e32 vcc, v0, v1
	; SI-NEXT: s_or_b64 s[2:3], vcc, s[2:3]			; SI-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
	; SI-NEXT: s_andn2_b64 exec, exec, s[2:3]			; SI-NEXT: s_andn2_b64 exec, exec, s[2:3]
	[35 lines hidden]
	; SI-NEXT: exp null off, off, off, off done vm			; SI-NEXT: exp null off, off, off, off done vm
	; SI-NEXT: s_endpgm			; SI-NEXT: s_endpgm
	;			;
	; GFX9-LABEL: wqm_deriv_loop:			; GFX9-LABEL: wqm_deriv_loop:
	; GFX9: ; %bb.0: ; %.entry			; GFX9: ; %bb.0: ; %.entry
	; GFX9-NEXT: s_mov_b64 s[0:1], exec			; GFX9-NEXT: s_mov_b64 s[0:1], exec
	; GFX9-NEXT: s_wqm_b64 exec, exec			; GFX9-NEXT: s_wqm_b64 exec, exec
	; GFX9-NEXT: v_cvt_i32_f32_e32 v0, v0			; GFX9-NEXT: v_cvt_i32_f32_e32 v0, v0
	; GFX9-NEXT: s_mov_b32 s4, 0			; GFX9-NEXT: s_mov_b32 s6, 0
	; GFX9-NEXT: v_cmp_ne_u32_e32 vcc, 0, v0			; GFX9-NEXT: v_cmp_ne_u32_e32 vcc, 0, v0
	; GFX9-NEXT: s_and_saveexec_b64 s[2:3], vcc			; GFX9-NEXT: s_and_saveexec_b64 s[2:3], vcc
	; GFX9-NEXT: s_xor_b64 s[2:3], exec, s[2:3]			; GFX9-NEXT: s_xor_b64 s[4:5], exec, s[2:3]
	; GFX9-NEXT: s_cbranch_execz .LBB7_3			; GFX9-NEXT: s_cbranch_execz .LBB7_3
	; GFX9-NEXT: ; %bb.1: ; %.demote0			; GFX9-NEXT: ; %bb.1: ; %.demote0
	; GFX9-NEXT: s_andn2_b64 s[0:1], s[0:1], exec			; GFX9-NEXT: s_andn2_b64 s[0:1], s[0:1], exec
	; GFX9-NEXT: s_cbranch_scc0 .LBB7_9			; GFX9-NEXT: s_cbranch_scc0 .LBB7_9
	; GFX9-NEXT: ; %bb.2: ; %.demote0			; GFX9-NEXT: ; %bb.2: ; %.demote0
	; GFX9-NEXT: s_wqm_b64 s[6:7], s[0:1]			; GFX9-NEXT: s_wqm_b64 s[2:3], s[0:1]
	; GFX9-NEXT: s_and_b64 exec, exec, s[6:7]			; GFX9-NEXT: s_and_b64 exec, exec, s[2:3]
	; GFX9-NEXT: .LBB7_3: ; %.continue0.preheader			; GFX9-NEXT: .LBB7_3: ; %.continue0.preheader
	; GFX9-NEXT: s_or_b64 exec, exec, s[2:3]
	; GFX9-NEXT: s_mov_b64 s[2:3], 0			; GFX9-NEXT: s_mov_b64 s[2:3], 0
	; GFX9-NEXT: v_mov_b32_e32 v0, s4			; GFX9-NEXT: s_or_b64 exec, exec, s[4:5]
				; GFX9-NEXT: v_mov_b32_e32 v0, s6
	; GFX9-NEXT: s_branch .LBB7_5			; GFX9-NEXT: s_branch .LBB7_5
	; GFX9-NEXT: .LBB7_4: ; %.continue1			; GFX9-NEXT: .LBB7_4: ; %.continue1
	; GFX9-NEXT: ; in Loop: Header=BB7_5 Depth=1			; GFX9-NEXT: ; in Loop: Header=BB7_5 Depth=1
	; GFX9-NEXT: s_or_b64 exec, exec, s[4:5]			; GFX9-NEXT: s_or_b64 exec, exec, s[4:5]
	; GFX9-NEXT: v_add_u32_e32 v0, 1, v0			; GFX9-NEXT: v_add_u32_e32 v0, 1, v0
	; GFX9-NEXT: v_cmp_ge_i32_e32 vcc, v0, v1			; GFX9-NEXT: v_cmp_ge_i32_e32 vcc, v0, v1
	; GFX9-NEXT: s_or_b64 s[2:3], vcc, s[2:3]			; GFX9-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
	; GFX9-NEXT: s_andn2_b64 exec, exec, s[2:3]			; GFX9-NEXT: s_andn2_b64 exec, exec, s[2:3]
	[98 lines hidden]
	; GFX10-32-NEXT: exp null off, off, off, off done vm			; GFX10-32-NEXT: exp null off, off, off, off done vm
	; GFX10-32-NEXT: s_endpgm			; GFX10-32-NEXT: s_endpgm
	;			;
	; GFX10-64-LABEL: wqm_deriv_loop:			; GFX10-64-LABEL: wqm_deriv_loop:
	; GFX10-64: ; %bb.0: ; %.entry			; GFX10-64: ; %bb.0: ; %.entry
	; GFX10-64-NEXT: s_mov_b64 s[0:1], exec			; GFX10-64-NEXT: s_mov_b64 s[0:1], exec
	; GFX10-64-NEXT: s_wqm_b64 exec, exec			; GFX10-64-NEXT: s_wqm_b64 exec, exec
	; GFX10-64-NEXT: v_cvt_i32_f32_e32 v0, v0			; GFX10-64-NEXT: v_cvt_i32_f32_e32 v0, v0
	; GFX10-64-NEXT: s_mov_b32 s4, 0			; GFX10-64-NEXT: s_mov_b32 s6, 0
	; GFX10-64-NEXT: v_cmp_ne_u32_e32 vcc, 0, v0			; GFX10-64-NEXT: v_cmp_ne_u32_e32 vcc, 0, v0
	; GFX10-64-NEXT: s_and_saveexec_b64 s[2:3], vcc			; GFX10-64-NEXT: s_and_saveexec_b64 s[2:3], vcc
	; GFX10-64-NEXT: s_xor_b64 s[2:3], exec, s[2:3]			; GFX10-64-NEXT: s_xor_b64 s[4:5], exec, s[2:3]
	; GFX10-64-NEXT: s_cbranch_execz .LBB7_3			; GFX10-64-NEXT: s_cbranch_execz .LBB7_3
	; GFX10-64-NEXT: ; %bb.1: ; %.demote0			; GFX10-64-NEXT: ; %bb.1: ; %.demote0
	; GFX10-64-NEXT: s_andn2_b64 s[0:1], s[0:1], exec			; GFX10-64-NEXT: s_andn2_b64 s[0:1], s[0:1], exec
	; GFX10-64-NEXT: s_cbranch_scc0 .LBB7_9			; GFX10-64-NEXT: s_cbranch_scc0 .LBB7_9
	; GFX10-64-NEXT: ; %bb.2: ; %.demote0			; GFX10-64-NEXT: ; %bb.2: ; %.demote0
	; GFX10-64-NEXT: s_wqm_b64 s[6:7], s[0:1]			; GFX10-64-NEXT: s_wqm_b64 s[2:3], s[0:1]
	; GFX10-64-NEXT: s_and_b64 exec, exec, s[6:7]			; GFX10-64-NEXT: s_and_b64 exec, exec, s[2:3]
	; GFX10-64-NEXT: .LBB7_3: ; %.continue0.preheader			; GFX10-64-NEXT: .LBB7_3: ; %.continue0.preheader
	; GFX10-64-NEXT: s_or_b64 exec, exec, s[2:3]
	; GFX10-64-NEXT: v_mov_b32_e32 v0, s4
	; GFX10-64-NEXT: s_mov_b64 s[2:3], 0			; GFX10-64-NEXT: s_mov_b64 s[2:3], 0
				; GFX10-64-NEXT: s_or_b64 exec, exec, s[4:5]
				; GFX10-64-NEXT: v_mov_b32_e32 v0, s6
	; GFX10-64-NEXT: s_branch .LBB7_5			; GFX10-64-NEXT: s_branch .LBB7_5
	; GFX10-64-NEXT: .LBB7_4: ; %.continue1			; GFX10-64-NEXT: .LBB7_4: ; %.continue1
	; GFX10-64-NEXT: ; in Loop: Header=BB7_5 Depth=1			; GFX10-64-NEXT: ; in Loop: Header=BB7_5 Depth=1
	; GFX10-64-NEXT: s_or_b64 exec, exec, s[4:5]			; GFX10-64-NEXT: s_or_b64 exec, exec, s[4:5]
	; GFX10-64-NEXT: v_add_nc_u32_e32 v0, 1, v0			; GFX10-64-NEXT: v_add_nc_u32_e32 v0, 1, v0
	; GFX10-64-NEXT: v_cmp_ge_i32_e32 vcc, v0, v1			; GFX10-64-NEXT: v_cmp_ge_i32_e32 vcc, v0, v1
	; GFX10-64-NEXT: s_or_b64 s[2:3], vcc, s[2:3]			; GFX10-64-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
	; GFX10-64-NEXT: s_andn2_b64 exec, exec, s[2:3]			; GFX10-64-NEXT: s_andn2_b64 exec, exec, s[2:3]
	[91 lines hidden]

llvm/test/CodeGen/AMDGPU/block-should-not-be-in-alive-blocks.mir

[28 lines hidden]	body: |
; CHECK-NEXT: S_CBRANCH_EXECZ %bb.5, implicit $exec		; CHECK-NEXT: S_CBRANCH_EXECZ %bb.5, implicit $exec
; CHECK-NEXT: S_BRANCH %bb.2		; CHECK-NEXT: S_BRANCH %bb.2
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: bb.1:		; CHECK-NEXT: bb.1:
; CHECK-NEXT: successors: %bb.7(0x80000000)		; CHECK-NEXT: successors: %bb.7(0x80000000)
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: [[S_LOAD_DWORDX2_IMM:%[0-9]+]]:sreg_64_xexec = S_LOAD_DWORDX2_IMM killed [[COPY]], 0, 0 :: (dereferenceable invariant load (s64), align 16, addrspace 4)		; CHECK-NEXT: [[S_LOAD_DWORDX2_IMM:%[0-9]+]]:sreg_64_xexec = S_LOAD_DWORDX2_IMM killed [[COPY]], 0, 0 :: (dereferenceable invariant load (s64), align 16, addrspace 4)
; CHECK-NEXT: [[V_ADD_CO_U32_e64_:%[0-9]+]]:vgpr_32, [[V_ADD_CO_U32_e64_1:%[0-9]+]]:sreg_32_xm0_xexec = V_ADD_CO_U32_e64 [[S_LOAD_DWORDX2_IMM]].sub0, killed %15, 0, implicit $exec		; CHECK-NEXT: [[V_ADD_CO_U32_e64_:%[0-9]+]]:vgpr_32, [[V_ADD_CO_U32_e64_1:%[0-9]+]]:sreg_32_xm0_xexec = V_ADD_CO_U32_e64 [[S_LOAD_DWORDX2_IMM]].sub0, killed %15, 0, implicit $exec
; CHECK-NEXT: %7:vgpr_32, dead %8:sreg_32_xm0_xexec = V_ADDC_U32_e64 0, killed [[S_LOAD_DWORDX2_IMM]].sub1, killed [[V_ADD_CO_U32_e64_1]], 0, implicit $exec		; CHECK-NEXT: [[V_ADDC_U32_e64_:%[0-9]+]]:vgpr_32, dead [[V_ADDC_U32_e64_1:%[0-9]+]]:sreg_32_xm0_xexec = V_ADDC_U32_e64 0, killed [[S_LOAD_DWORDX2_IMM]].sub1, killed [[V_ADD_CO_U32_e64_1]], 0, implicit $exec
; CHECK-NEXT: [[REG_SEQUENCE:%[0-9]+]]:vreg_64 = REG_SEQUENCE killed [[V_ADD_CO_U32_e64_]], %subreg.sub0, killed %7, %subreg.sub1		; CHECK-NEXT: [[REG_SEQUENCE:%[0-9]+]]:vreg_64 = REG_SEQUENCE killed [[V_ADD_CO_U32_e64_]], %subreg.sub0, killed [[V_ADDC_U32_e64_]], %subreg.sub1
; CHECK-NEXT: [[GLOBAL_LOAD_UBYTE:%[0-9]+]]:vgpr_32 = GLOBAL_LOAD_UBYTE killed [[REG_SEQUENCE]], 0, 0, implicit $exec :: (load (s8), addrspace 1)		; CHECK-NEXT: [[GLOBAL_LOAD_UBYTE:%[0-9]+]]:vgpr_32 = GLOBAL_LOAD_UBYTE killed [[REG_SEQUENCE]], 0, 0, implicit $exec :: (load (s8), addrspace 1)
; CHECK-NEXT: [[V_MOV_B:%[0-9]+]]:vreg_64 = V_MOV_B64_PSEUDO 0, implicit $exec		; CHECK-NEXT: [[V_MOV_B:%[0-9]+]]:vreg_64 = V_MOV_B64_PSEUDO 0, implicit $exec
; CHECK-NEXT: GLOBAL_STORE_BYTE killed [[V_MOV_B]], killed [[GLOBAL_LOAD_UBYTE]], 0, 0, implicit $exec :: (store (s8), addrspace 1)		; CHECK-NEXT: GLOBAL_STORE_BYTE killed [[V_MOV_B]], killed [[GLOBAL_LOAD_UBYTE]], 0, 0, implicit $exec :: (store (s8), addrspace 1)
; CHECK-NEXT: S_BRANCH %bb.7		; CHECK-NEXT: S_BRANCH %bb.7
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: bb.2:		; CHECK-NEXT: bb.2:
; CHECK-NEXT: successors: %bb.4(0x40000000), %bb.3(0x40000000)		; CHECK-NEXT: successors: %bb.4(0x40000000), %bb.3(0x40000000)
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: S_CBRANCH_SCC0 %bb.4, implicit undef $scc		; CHECK-NEXT: S_CBRANCH_SCC0 %bb.4, implicit undef $scc
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: bb.3:		; CHECK-NEXT: bb.3:
; CHECK-NEXT: successors: %bb.6(0x80000000)		; CHECK-NEXT: successors: %bb.6(0x80000000)
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: S_BRANCH %bb.6		; CHECK-NEXT: S_BRANCH %bb.6
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: bb.4:		; CHECK-NEXT: bb.4:
; CHECK-NEXT: successors: %bb.6(0x80000000)		; CHECK-NEXT: successors: %bb.6(0x80000000)
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: [[V_MOV_B1:%[0-9]+]]:vreg_64 = V_MOV_B64_PSEUDO 0, implicit $exec		; CHECK-NEXT: [[V_MOV_B1:%[0-9]+]]:vreg_64 = V_MOV_B64_PSEUDO 0, implicit $exec
; CHECK-NEXT: dead %13:vgpr_32 = GLOBAL_LOAD_UBYTE killed [[V_MOV_B1]], 0, 0, implicit $exec :: (load (s8), addrspace 1)		; CHECK-NEXT: dead [[GLOBAL_LOAD_UBYTE1:%[0-9]+]]:vgpr_32 = GLOBAL_LOAD_UBYTE killed [[V_MOV_B1]], 0, 0, implicit $exec :: (load (s8), addrspace 1)
; CHECK-NEXT: S_BRANCH %bb.6		; CHECK-NEXT: S_BRANCH %bb.6
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: bb.5:		; CHECK-NEXT: bb.5:
; CHECK-NEXT: successors: %bb.1(0x40000000), %bb.7(0x40000000)		; CHECK-NEXT: successors: %bb.1(0x40000000), %bb.7(0x40000000)
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: [[S_OR_SAVEEXEC_B32_:%[0-9]+]]:sreg_32 = S_OR_SAVEEXEC_B32 killed [[S_XOR_B32_]], implicit-def $exec, implicit-def $scc, implicit $exec		; CHECK-NEXT: [[S_OR_SAVEEXEC_B32_:%[0-9]+]]:sreg_32 = S_OR_SAVEEXEC_B32 killed [[S_XOR_B32_]], implicit-def $exec, implicit-def $scc, implicit $exec
; CHECK-NEXT: [[COPY4:%[0-9]+]]:vgpr_32 = COPY killed [[COPY2]]		; CHECK-NEXT: [[COPY4:%[0-9]+]]:vgpr_32 = COPY killed [[COPY2]]
; CHECK-NEXT: [[S_AND_B32_1:%[0-9]+]]:sreg_32 = S_AND_B32 $exec_lo, [[S_OR_SAVEEXEC_B32_]], implicit-def $scc		; CHECK-NEXT: [[S_AND_B32_1:%[0-9]+]]:sreg_32 = S_AND_B32 $exec_lo, [[S_OR_SAVEEXEC_B32_]], implicit-def $scc
; CHECK-NEXT: $exec_lo = S_XOR_B32_term $exec_lo, [[S_AND_B32_1]], implicit-def $scc		; CHECK-NEXT: $exec_lo = S_XOR_B32_term $exec_lo, [[S_AND_B32_1]], implicit-def $scc
; CHECK-NEXT: S_CBRANCH_EXECZ %bb.7, implicit $exec		; CHECK-NEXT: S_CBRANCH_EXECZ %bb.7, implicit $exec
; CHECK-NEXT: S_BRANCH %bb.1		; CHECK-NEXT: S_BRANCH %bb.1
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: bb.6:		; CHECK-NEXT: bb.6:
; CHECK-NEXT: successors: %bb.5(0x80000000)		; CHECK-NEXT: successors: %bb.5(0x80000000)
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: [[DEF:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF		; CHECK-NEXT: [[DEF:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
; CHECK-NEXT: S_BRANCH %bb.5		; CHECK-NEXT: S_BRANCH %bb.5
; CHECK-NEXT: {{ $}}		; CHECK-NEXT: {{ $}}
; CHECK-NEXT: bb.7:		; CHECK-NEXT: bb.7:
; CHECK-NEXT: $exec_lo = S_OR_B32 $exec_lo, killed [[S_AND_B32_1]], implicit-def $scc		; CHECK-NEXT: successors: %bb.8(0x80000000)
		; CHECK-NEXT: {{ $}}
		; CHECK-NEXT: $exec_lo = S_OR_B32_term $exec_lo, killed [[S_AND_B32_1]], implicit-def $scc
		; CHECK-NEXT: {{ $}}
		; CHECK-NEXT: bb.8:
; CHECK-NEXT: S_ENDPGM 0		; CHECK-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.2(0x40000000), %bb.5(0x40000000)		successors: %bb.2(0x40000000), %bb.5(0x40000000)
liveins: $vgpr0, $sgpr4_sgpr5		liveins: $vgpr0, $sgpr4_sgpr5

%0:sgpr_64 = COPY $sgpr4_sgpr5		%0:sgpr_64 = COPY $sgpr4_sgpr5
%1:vgpr_32 = COPY $vgpr0		%1:vgpr_32 = COPY $vgpr0
%2:sreg_32 = V_CMP_NE_U32_e64 0, %1, implicit $exec		%2:sreg_32 = V_CMP_NE_U32_e64 0, %1, implicit $exec
[48 lines hidden]

llvm/test/CodeGen/AMDGPU/branch-folding-implicit-def-subreg.ll

[26 lines hidden]	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: renamable $sgpr30_sgpr31 = S_XOR_B64 killed renamable $sgpr18_sgpr19, -1, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr30_sgpr31 = S_XOR_B64 killed renamable $sgpr18_sgpr19, -1, implicit-def dead $scc
; GFX90A-NEXT: renamable $vgpr3 = V_MOV_B32_e32 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr3 = V_MOV_B32_e32 0, implicit $exec
; GFX90A-NEXT: renamable $vgpr2 = DS_READ_B32_gfx9 renamable $vgpr3, 0, 0, implicit $exec :: (load (s32) from `ptr addrspace(3) null`, align 8, addrspace 3)		; GFX90A-NEXT: renamable $vgpr2 = DS_READ_B32_gfx9 renamable $vgpr3, 0, 0, implicit $exec :: (load (s32) from `ptr addrspace(3) null`, align 8, addrspace 3)
; GFX90A-NEXT: renamable $sgpr18_sgpr19 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr18_sgpr19 = S_MOV_B64 0
; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, renamable $sgpr28_sgpr29, implicit-def dead $scc		; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, renamable $sgpr28_sgpr29, implicit-def dead $scc
; GFX90A-NEXT: S_CBRANCH_VCCZ %bb.2, implicit $vcc		; GFX90A-NEXT: S_CBRANCH_VCCZ %bb.2, implicit $vcc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.1.bb103:		; GFX90A-NEXT: bb.1.bb103:
; GFX90A-NEXT: successors: %bb.58(0x40000000), %bb.2(0x40000000)		; GFX90A-NEXT: successors: %bb.59(0x40000000), %bb.2(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x00000000000000FF, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000FF, $vgpr2_vgpr3:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x00000000000000FF, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000FF, $vgpr2_vgpr3:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr34_sgpr35 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr34_sgpr35 = S_MOV_B64 0
; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, renamable $sgpr30_sgpr31, implicit-def dead $scc		; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, renamable $sgpr30_sgpr31, implicit-def dead $scc
; GFX90A-NEXT: $vgpr24 = IMPLICIT_DEF		; GFX90A-NEXT: $vgpr24 = IMPLICIT_DEF
; GFX90A-NEXT: $agpr0 = IMPLICIT_DEF		; GFX90A-NEXT: $vgpr12 = IMPLICIT_DEF
; GFX90A-NEXT: $vgpr26 = IMPLICIT_DEF		; GFX90A-NEXT: $vgpr26 = IMPLICIT_DEF
; GFX90A-NEXT: $vgpr20 = IMPLICIT_DEF		; GFX90A-NEXT: $vgpr20 = IMPLICIT_DEF
; GFX90A-NEXT: $vgpr22 = IMPLICIT_DEF		; GFX90A-NEXT: $vgpr22 = IMPLICIT_DEF
; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.58, implicit $vcc		; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.59, implicit $vcc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.2:		; GFX90A-NEXT: bb.2:
; GFX90A-NEXT: successors: %bb.3(0x80000000)		; GFX90A-NEXT: successors: %bb.3(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr24, $sgpr33, $vgpr31, $agpr0, $vgpr26, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8, $sgpr9, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr58, $sgpr59, $sgpr20_sgpr21_sgpr22, $sgpr24_sgpr25_sgpr26, $sgpr26_sgpr27, $vgpr2, $vgpr3, $vgpr20, $vgpr22		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr24, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8, $sgpr9, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr58, $sgpr59, $sgpr20_sgpr21_sgpr22, $sgpr24_sgpr25_sgpr26, $sgpr26_sgpr27, $vgpr2, $vgpr3, $vgpr12, $vgpr26, $vgpr20, $vgpr22
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr23 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr23 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr21 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr21 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr23 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr23 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr25 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr25 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr27 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr27 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr36_sgpr37 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr36_sgpr37 = S_MOV_B64 0
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.3.Flow17:		; GFX90A-NEXT: bb.3.Flow17:
; GFX90A-NEXT: successors: %bb.4(0x40000000), %bb.57(0x40000000)		; GFX90A-NEXT: successors: %bb.4(0x40000000), %bb.58(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $sgpr23, $sgpr33, $vgpr31, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000FF, $vgpr2_vgpr3:0x000000000000000F, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $sgpr23, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000FF, $vgpr2_vgpr3:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000F, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr4 = V_AND_B32_e32 1023, $vgpr31, implicit $exec		; GFX90A-NEXT: renamable $vgpr4 = V_AND_B32_e32 1023, $vgpr31, implicit $exec
; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, killed renamable $sgpr34_sgpr35, implicit-def dead $scc		; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, killed renamable $sgpr34_sgpr35, implicit-def dead $scc
; GFX90A-NEXT: S_CBRANCH_VCCZ %bb.57, implicit $vcc		; GFX90A-NEXT: S_CBRANCH_VCCZ %bb.58, implicit $vcc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.4.bb15:		; GFX90A-NEXT: bb.4.bb15:
; GFX90A-NEXT: successors: %bb.35(0x40000000), %bb.5(0x40000000)		; GFX90A-NEXT: successors: %bb.35(0x40000000), %bb.5(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr36_sgpr37, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000FF, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr18_sgpr19		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr36_sgpr37, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000FF, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr18_sgpr19
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr0_vgpr1 = V_LSHLREV_B64_e64 2, $vgpr2_vgpr3, implicit $exec		; GFX90A-NEXT: renamable $vgpr0_vgpr1 = V_LSHLREV_B64_e64 2, $vgpr2_vgpr3, implicit $exec
; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr25, implicit $exec		; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr25, implicit $exec
; GFX90A-NEXT: renamable $vgpr46, renamable $vcc = V_ADD_CO_U32_e64 $sgpr24, $vgpr0, 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr46, renamable $vcc = V_ADD_CO_U32_e64 $sgpr24, $vgpr0, 0, implicit $exec
[32 lines not shown]	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: renamable $vgpr42_vgpr43 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr42_vgpr43 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.6.Flow20:		; GFX90A-NEXT: bb.6.Flow20:
; GFX90A-NEXT: successors: %bb.7(0x80000000)		; GFX90A-NEXT: successors: %bb.7(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, 
$vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr21 = COPY renamable $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr21 = COPY renamable $sgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr20 = COPY $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr20 = COPY $sgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr23 = COPY $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr23 = COPY $sgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr22 = COPY $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr22 = COPY $sgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr25 = COPY $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr25 = COPY $sgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr24 = COPY $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr24 = COPY $sgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr27 = COPY $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr27 = COPY $sgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr26 = COPY $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr26 = COPY $sgpr17, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.7.Flow19:		; GFX90A-NEXT: bb.7.Flow19:
; GFX90A-NEXT: successors: %bb.62(0x40000000), %bb.8(0x40000000)		; GFX90A-NEXT: successors: %bb.62(0x40000000), %bb.8(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, 
$vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr58_sgpr59 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr58_sgpr59 = S_MOV_B64 0
; GFX90A-NEXT: $sgpr24_sgpr25 = S_AND_SAVEEXEC_B64 $sgpr36_sgpr37, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr24_sgpr25 = S_AND_SAVEEXEC_B64 $sgpr36_sgpr37, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.62, implicit $exec		; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.62, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.8.Flow32:		; GFX90A-NEXT: bb.8.Flow32:
; GFX90A-NEXT: successors: %bb.9(0x40000000), %bb.10(0x40000000)		; GFX90A-NEXT: successors: %bb.9(0x40000000), %bb.10(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr58_sgpr59, $vgpr0_vgpr1:0x000000000000000F, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr58_sgpr59, $vgpr0_vgpr1:0x000000000000000F, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
[252 lines not shown]	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: renamable $vgpr44_vgpr45 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr44_vgpr45 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: $sgpr24_sgpr25 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr24_sgpr25 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.37, implicit $exec		; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.37, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.36.Flow21:		; GFX90A-NEXT: bb.36.Flow21:
; GFX90A-NEXT: successors: %bb.6(0x80000000)		; GFX90A-NEXT: successors: %bb.6(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000F, 
$vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr24_sgpr25, implicit-def $scc		; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr24_sgpr25, implicit-def $scc
; GFX90A-NEXT: S_BRANCH %bb.6		; GFX90A-NEXT: S_BRANCH %bb.6
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.37.bb27:		; GFX90A-NEXT: bb.37.bb27:
; GFX90A-NEXT: successors: %bb.39(0x40000000), %bb.38(0x40000000)		; GFX90A-NEXT: successors: %bb.39(0x40000000), %bb.38(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr18_sgpr19, $sgpr56_sgpr57, $sgpr54_sgpr55, $sgpr52_sgpr53, $sgpr50_sgpr51, $sgpr48_sgpr49, $sgpr46_sgpr47, $sgpr44_sgpr45, $sgpr42_sgpr43		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr18_sgpr19, $sgpr56_sgpr57, $sgpr54_sgpr55, $sgpr52_sgpr53, $sgpr50_sgpr51, $sgpr48_sgpr49, $sgpr46_sgpr47, $sgpr44_sgpr45, $sgpr42_sgpr43
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
[13 lines not shown]	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: renamable $vgpr56_vgpr57 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr56_vgpr57 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: $sgpr38_sgpr39 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr38_sgpr39 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.39, implicit $exec		; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.39, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.38.Flow22:		; GFX90A-NEXT: bb.38.Flow22:
; GFX90A-NEXT: successors: %bb.36(0x80000000)		; GFX90A-NEXT: successors: %bb.36(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, 
$vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr38_sgpr39, implicit-def $scc		; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr38_sgpr39, implicit-def $scc
; GFX90A-NEXT: renamable $sgpr38_sgpr39 = S_XOR_B64 $exec, -1, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr38_sgpr39 = S_XOR_B64 $exec, -1, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr40_sgpr41 = S_AND_B64 killed renamable $sgpr40_sgpr41, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr40_sgpr41 = S_AND_B64 killed renamable $sgpr40_sgpr41, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr42_sgpr43 = S_AND_B64 killed renamable $sgpr42_sgpr43, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr42_sgpr43 = S_AND_B64 killed renamable $sgpr42_sgpr43, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr44_sgpr45 = S_AND_B64 killed renamable $sgpr44_sgpr45, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr44_sgpr45 = S_AND_B64 killed renamable $sgpr44_sgpr45, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_AND_B64 killed renamable $sgpr46_sgpr47, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_AND_B64 killed renamable $sgpr46_sgpr47, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_AND_B64 killed renamable $sgpr48_sgpr49, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_AND_B64 killed renamable $sgpr48_sgpr49, $exec, implicit-def dead $scc
[26 lines not shown]	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: renamable $vgpr58_vgpr59 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr58_vgpr59 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: $sgpr40_sgpr41 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr40_sgpr41 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.41, implicit $exec		; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.41, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.40.Flow23:		; GFX90A-NEXT: bb.40.Flow23:
; GFX90A-NEXT: successors: %bb.38(0x80000000)		; GFX90A-NEXT: successors: %bb.38(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, 
$vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr40_sgpr41, implicit-def $scc		; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr40_sgpr41, implicit-def $scc
; GFX90A-NEXT: renamable $sgpr40_sgpr41 = S_XOR_B64 $exec, -1, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr40_sgpr41 = S_XOR_B64 $exec, -1, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr42_sgpr43 = S_AND_B64 killed renamable $sgpr42_sgpr43, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr42_sgpr43 = S_AND_B64 killed renamable $sgpr42_sgpr43, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr44_sgpr45 = S_AND_B64 killed renamable $sgpr44_sgpr45, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr44_sgpr45 = S_AND_B64 killed renamable $sgpr44_sgpr45, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_AND_B64 killed renamable $sgpr46_sgpr47, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_AND_B64 killed renamable $sgpr46_sgpr47, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_AND_B64 killed renamable $sgpr48_sgpr49, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_AND_B64 killed renamable $sgpr48_sgpr49, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_AND_B64 killed renamable $sgpr50_sgpr51, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_AND_B64 killed renamable $sgpr50_sgpr51, $exec, implicit-def dead $scc
[28 lines not shown]	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: renamable $vgpr60_vgpr61 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr60_vgpr61 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: $sgpr42_sgpr43 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr42_sgpr43 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.46, implicit $exec		; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.46, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.42.Flow24:		; GFX90A-NEXT: bb.42.Flow24:
; GFX90A-NEXT: successors: %bb.40(0x80000000)		; GFX90A-NEXT: successors: %bb.40(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr62_sgpr63, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr62_sgpr63, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, 
$vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr42_sgpr43, implicit-def $scc		; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr42_sgpr43, implicit-def $scc
; GFX90A-NEXT: renamable $vgpr59 = COPY killed renamable $vgpr20, implicit $exec		; GFX90A-NEXT: renamable $vgpr59 = COPY killed renamable $vgpr20, implicit $exec
; GFX90A-NEXT: renamable $sgpr42_sgpr43 = S_XOR_B64 $exec, -1, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr42_sgpr43 = S_XOR_B64 $exec, -1, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr44_sgpr45 = S_AND_B64 killed renamable $sgpr44_sgpr45, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr44_sgpr45 = S_AND_B64 killed renamable $sgpr44_sgpr45, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_AND_B64 killed renamable $sgpr62_sgpr63, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_AND_B64 killed renamable $sgpr62_sgpr63, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_AND_B64 killed renamable $sgpr48_sgpr49, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_AND_B64 killed renamable $sgpr48_sgpr49, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_AND_B64 killed renamable $sgpr50_sgpr51, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_AND_B64 killed renamable $sgpr50_sgpr51, $exec, implicit-def dead $scc
(11 unchanged lines not shown)	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr33, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr44_sgpr45, $sgpr52_sgpr53, $sgpr56_sgpr57, $sgpr54_sgpr55, $sgpr46_sgpr47		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr33, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr44_sgpr45, $sgpr52_sgpr53, $sgpr56_sgpr57, $sgpr54_sgpr55, $sgpr46_sgpr47
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: S_BITCMP1_B32 killed renamable $sgpr33, 16, implicit-def $scc		; GFX90A-NEXT: S_BITCMP1_B32 killed renamable $sgpr33, 16, implicit-def $scc
; GFX90A-NEXT: renamable $sgpr64_sgpr65 = S_CSELECT_B64 -1, 0, implicit killed $scc		; GFX90A-NEXT: renamable $sgpr64_sgpr65 = S_CSELECT_B64 -1, 0, implicit killed $scc
; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_XOR_B64 renamable $sgpr64_sgpr65, -1, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_XOR_B64 renamable $sgpr64_sgpr65, -1, implicit-def dead $scc
; GFX90A-NEXT: renamable $vgpr62 = V_ADD_CO_U32_e32 6144, $vgpr40, implicit-def $vcc, implicit $exec		; GFX90A-NEXT: renamable $vgpr62 = V_ADD_CO_U32_e32 6144, $vgpr40, implicit-def $vcc, implicit $exec
; GFX90A-NEXT: renamable $vgpr63, dead renamable $vcc = V_ADDC_U32_e64 0, $vgpr41, killed $vcc, 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr63, dead renamable $vcc = V_ADDC_U32_e64 0, $vgpr41, killed $vcc, 0, implicit $exec
; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, renamable $sgpr48_sgpr49, implicit-def dead $scc		; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, renamable $sgpr48_sgpr49, implicit-def dead $scc
; GFX90A-NEXT: $agpr0 = IMPLICIT_DEF		; GFX90A-NEXT: $vgpr12 = IMPLICIT_DEF
; GFX90A-NEXT: $vgpr14 = IMPLICIT_DEF		; GFX90A-NEXT: $vgpr14 = IMPLICIT_DEF
; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.48, implicit $vcc		; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.48, implicit $vcc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.44:		; GFX90A-NEXT: bb.44:
; GFX90A-NEXT: successors: %bb.45(0x80000000)		; GFX90A-NEXT: successors: %bb.45(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr58, $vgpr57, $vgpr20, $vgpr61, $vgpr31, $vgpr63, $agpr0, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8, $sgpr9, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $vgpr40, $vgpr62, $vgpr60, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22, $sgpr22_sgpr23, $sgpr24_sgpr25_sgpr26, $sgpr26_sgpr27, $vgpr56, $vgpr47, $vgpr2, $vgpr3, $vgpr4, $vgpr46, $vgpr45, $vgpr44, $vgpr43, $vgpr42, $vgpr41, $vgpr14		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr58, $vgpr57, $vgpr20, $vgpr61, $vgpr31, $vgpr63, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8, $sgpr9, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $vgpr40, $vgpr62, $vgpr60, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22, $sgpr22_sgpr23, $sgpr24_sgpr25_sgpr26, $sgpr26_sgpr27, $vgpr56, $vgpr47, $vgpr2, $vgpr3, $vgpr4, $vgpr46, $vgpr45, $vgpr44, $vgpr43, $vgpr42, $vgpr41, $vgpr12, $vgpr14
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = COPY renamable $sgpr36_sgpr37		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = COPY renamable $sgpr36_sgpr37
; GFX90A-NEXT: renamable $vgpr10_vgpr11 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr10_vgpr11 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr8_vgpr9 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr8_vgpr9 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr6_vgpr7 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr6_vgpr7 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr0_vgpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr0_vgpr1 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_MOV_B64 0
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.45.Flow26:		; GFX90A-NEXT: bb.45.Flow26:
; GFX90A-NEXT: successors: %bb.47(0x80000000)		; GFX90A-NEXT: successors: %bb.47(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, 
$vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr60_sgpr61 = S_XOR_B64 $exec, -1, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr60_sgpr61 = S_XOR_B64 $exec, -1, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr70_sgpr71 = S_AND_B64 killed renamable $sgpr44_sgpr45, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr70_sgpr71 = S_AND_B64 killed renamable $sgpr44_sgpr45, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr68_sgpr69 = S_AND_B64 killed renamable $sgpr46_sgpr47, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr68_sgpr69 = S_AND_B64 killed renamable $sgpr46_sgpr47, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr66_sgpr67 = S_AND_B64 killed renamable $sgpr48_sgpr49, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr66_sgpr67 = S_AND_B64 killed renamable $sgpr48_sgpr49, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_AND_B64 killed renamable $sgpr54_sgpr55, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_AND_B64 killed renamable $sgpr54_sgpr55, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr56_sgpr57 = S_AND_B64 killed renamable $sgpr56_sgpr57, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr56_sgpr57 = S_AND_B64 killed renamable $sgpr56_sgpr57, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_AND_B64 killed renamable $sgpr52_sgpr53, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_AND_B64 killed renamable $sgpr52_sgpr53, $exec, implicit-def dead $scc
(25 unchanged lines not shown)	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: renamable $vgpr62_vgpr63 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr62_vgpr63 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: $sgpr18_sgpr19 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr18_sgpr19 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.43, implicit $exec		; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.43, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.47.Flow25:		; GFX90A-NEXT: bb.47.Flow25:
; GFX90A-NEXT: successors: %bb.42(0x80000000)		; GFX90A-NEXT: successors: %bb.42(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr46_sgpr47, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr64_sgpr65, $sgpr66_sgpr67, $sgpr68_sgpr69, $sgpr70_sgpr71, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr46_sgpr47, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr64_sgpr65, $sgpr66_sgpr67, $sgpr68_sgpr69, $sgpr70_sgpr71, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, 
$vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr18_sgpr19, implicit-def $scc		; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr18_sgpr19, implicit-def $scc
; GFX90A-NEXT: renamable $sgpr44_sgpr45 = S_XOR_B64 $exec, -1, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr44_sgpr45 = S_XOR_B64 $exec, -1, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr62_sgpr63 = S_AND_B64 killed renamable $sgpr60_sgpr61, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr62_sgpr63 = S_AND_B64 killed renamable $sgpr60_sgpr61, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_AND_B64 killed renamable $sgpr70_sgpr71, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_AND_B64 killed renamable $sgpr70_sgpr71, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_AND_B64 killed renamable $sgpr68_sgpr69, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_AND_B64 killed renamable $sgpr68_sgpr69, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_AND_B64 killed renamable $sgpr66_sgpr67, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_AND_B64 killed renamable $sgpr66_sgpr67, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_AND_B64 killed renamable $sgpr54_sgpr55, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_AND_B64 killed renamable $sgpr54_sgpr55, $exec, implicit-def dead $scc
(14 unchanged lines not shown)	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: bb.49:		; GFX90A-NEXT: bb.49:
; GFX90A-NEXT: successors: %bb.44(0x80000000)		; GFX90A-NEXT: successors: %bb.44(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr52_sgpr53, $sgpr56_sgpr57, $sgpr54_sgpr55		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr52_sgpr53, $sgpr56_sgpr57, $sgpr54_sgpr55
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_MOV_B64 -1		; GFX90A-NEXT: renamable $sgpr46_sgpr47 = S_MOV_B64 -1
; GFX90A-NEXT: S_BRANCH %bb.44		; GFX90A-NEXT: S_BRANCH %bb.44
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.50.bb68:		; GFX90A-NEXT: bb.50.bb68:
; GFX90A-NEXT: successors: %bb.54(0x40000000), %bb.51(0x40000000)		; GFX90A-NEXT: successors: %bb.55(0x40000000), %bb.51(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr48_sgpr49, $sgpr58_sgpr59:0x000000000000000F, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr46_sgpr47, $sgpr52_sgpr53, $sgpr56_sgpr57, $sgpr54_sgpr55		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr48_sgpr49, $sgpr58_sgpr59:0x000000000000000F, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr46_sgpr47, $sgpr52_sgpr53, $sgpr56_sgpr57, $sgpr54_sgpr55
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr0_vgpr1 = V_LSHLREV_B64_e64 3, $vgpr4_vgpr5, implicit $exec		; GFX90A-NEXT: renamable $vgpr0_vgpr1 = V_LSHLREV_B64_e64 3, $vgpr4_vgpr5, implicit $exec
; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, killed renamable $sgpr48_sgpr49, implicit-def dead $scc		; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, killed renamable $sgpr48_sgpr49, implicit-def dead $scc
; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.54, implicit $vcc		; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.55, implicit $vcc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.51:		; GFX90A-NEXT: bb.51:
; GFX90A-NEXT: successors: %bb.45(0x80000000)		; GFX90A-NEXT: successors: %bb.45(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr52_sgpr53, $sgpr56_sgpr57, $sgpr54_sgpr55		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr52_sgpr53, $sgpr56_sgpr57, $sgpr54_sgpr55
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_MOV_B64 -1		; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_MOV_B64 -1
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = COPY renamable $sgpr36_sgpr37		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = COPY renamable $sgpr36_sgpr37
; GFX90A-NEXT: renamable $vgpr10_vgpr11 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr10_vgpr11 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr8_vgpr9 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr8_vgpr9 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr6_vgpr7 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr6_vgpr7 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: S_BRANCH %bb.45		; GFX90A-NEXT: S_BRANCH %bb.45
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.52.bb80:		; GFX90A-NEXT: bb.52.bb80:
; GFX90A-NEXT: successors: %bb.59(0x40000000), %bb.53(0x40000000)		; GFX90A-NEXT: successors: %bb.60(0x40000000), %bb.53(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr58_sgpr59:0x000000000000000F, $sgpr60_sgpr61, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr58_sgpr59:0x000000000000000F, $sgpr60_sgpr61, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr17 = S_BFE_U32 renamable $sgpr20, 65560, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr17 = S_BFE_U32 renamable $sgpr20, 65560, implicit-def dead $scc
; GFX90A-NEXT: S_CMP_EQ_U32 killed renamable $sgpr17, 0, implicit-def $scc		; GFX90A-NEXT: S_CMP_EQ_U32 killed renamable $sgpr17, 0, implicit-def $scc
; GFX90A-NEXT: renamable $vgpr8 = V_ADD_CO_U32_e32 4096, $vgpr0, implicit-def $vcc, implicit $exec		; GFX90A-NEXT: renamable $vgpr8 = V_ADD_CO_U32_e32 4096, $vgpr0, implicit-def $vcc, implicit $exec
; GFX90A-NEXT: renamable $vgpr9, dead renamable $vcc = V_ADDC_U32_e64 0, $vgpr1, killed $vcc, 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr9, dead renamable $vcc = V_ADDC_U32_e64 0, $vgpr1, killed $vcc, 0, implicit $exec
; GFX90A-NEXT: S_CBRANCH_SCC1 %bb.59, implicit killed $scc		; GFX90A-NEXT: S_CBRANCH_SCC1 %bb.60, implicit killed $scc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.53:		; GFX90A-NEXT: bb.53:
; GFX90A-NEXT: successors: %bb.61(0x80000000)		; GFX90A-NEXT: successors: %bb.54(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr60_sgpr61, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr60_sgpr61, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_MOV_B64 0
; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_MOV_B64 -1		; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_MOV_B64 -1
; GFX90A-NEXT: renamable $sgpr62_sgpr63 = COPY renamable $sgpr36_sgpr37		; GFX90A-NEXT: renamable $sgpr62_sgpr63 = COPY renamable $sgpr36_sgpr37
; GFX90A-NEXT: renamable $vgpr10_vgpr11 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr10_vgpr11 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: S_BRANCH %bb.61
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.54.bb73:		; GFX90A-NEXT: bb.54.Flow30:
; GFX90A-NEXT: successors: %bb.52(0x40000000), %bb.55(0x40000000)		; GFX90A-NEXT: successors: %bb.56(0x80000000)
		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr60_sgpr61, $sgpr62_sgpr63, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
		; GFX90A-NEXT: {{ $}}
		; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_XOR_B64 $exec, -1, implicit-def dead $scc
		; GFX90A-NEXT: renamable $sgpr56_sgpr57 = S_AND_B64 killed renamable $sgpr52_sgpr53, $exec, implicit-def dead $scc
		; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_AND_B64 killed renamable $sgpr50_sgpr51, $exec, implicit-def dead $scc
		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_ANDN2_B64 renamable $sgpr36_sgpr37, $exec, implicit-def dead $scc
		; GFX90A-NEXT: renamable $sgpr58_sgpr59 = S_AND_B64 killed renamable $sgpr62_sgpr63, $exec, implicit-def dead $scc
		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_OR_B64 killed renamable $sgpr50_sgpr51, killed renamable $sgpr58_sgpr59, implicit-def dead $scc
		; GFX90A-NEXT: S_BRANCH %bb.56
		; GFX90A-NEXT: {{ $}}
		; GFX90A-NEXT: bb.55.bb73:
		; GFX90A-NEXT: successors: %bb.52(0x40000000), %bb.56(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr58_sgpr59:0x000000000000000F, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr52_sgpr53, $sgpr56_sgpr57		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr58_sgpr59:0x000000000000000F, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003F, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3, $sgpr52_sgpr53, $sgpr56_sgpr57
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr5 = GLOBAL_LOAD_UBYTE renamable $vgpr0_vgpr1, 2048, 0, implicit $exec :: (load (s8) from %ir.i74, addrspace 1)		; GFX90A-NEXT: renamable $vgpr5 = GLOBAL_LOAD_UBYTE renamable $vgpr0_vgpr1, 2048, 0, implicit $exec :: (load (s8) from %ir.i74, addrspace 1)
; GFX90A-NEXT: renamable $vgpr6 = V_ADD_CO_U32_e32 2048, $vgpr0, implicit-def $vcc, implicit $exec		; GFX90A-NEXT: renamable $vgpr6 = V_ADD_CO_U32_e32 2048, $vgpr0, implicit-def $vcc, implicit $exec
; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_MOV_B64 0
; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_MOV_B64 -1		; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_MOV_B64 -1
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = COPY renamable $sgpr36_sgpr37		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = COPY renamable $sgpr36_sgpr37
; GFX90A-NEXT: renamable $vgpr7, dead renamable $vcc = V_ADDC_U32_e64 0, $vgpr1, killed $vcc, 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr7, dead renamable $vcc = V_ADDC_U32_e64 0, $vgpr1, killed $vcc, 0, implicit $exec
; GFX90A-NEXT: renamable $vcc = V_CMP_EQ_U16_e64 0, killed $vgpr5, implicit $exec		; GFX90A-NEXT: renamable $vcc = V_CMP_EQ_U16_e64 0, killed $vgpr5, implicit $exec
; GFX90A-NEXT: renamable $vgpr10_vgpr11 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr10_vgpr11 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr8_vgpr9 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr8_vgpr9 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $agpr1 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $sgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: $sgpr60_sgpr61 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr60_sgpr61 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.52, implicit $exec		; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.52, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.55.Flow29:		; GFX90A-NEXT: bb.56.Flow29:
; GFX90A-NEXT: successors: %bb.45(0x80000000)		; GFX90A-NEXT: successors: %bb.45(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr60_sgpr61, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, 
$vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr60_sgpr61, implicit-def $scc		; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr60_sgpr61, implicit-def $scc
; GFX90A-NEXT: S_BRANCH %bb.45		; GFX90A-NEXT: S_BRANCH %bb.45
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.56.bb90:		; GFX90A-NEXT: bb.57.bb90:
; GFX90A-NEXT: successors: %bb.60(0x80000000)		; GFX90A-NEXT: successors: %bb.61(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr52_sgpr53, $sgpr58_sgpr59:0x000000000000000F, $sgpr60_sgpr61, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr52_sgpr53, $sgpr58_sgpr59:0x000000000000000F, $sgpr60_sgpr61, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, 
$vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr54 = V_CNDMASK_B32_e64 0, 0, 0, 1, killed $sgpr64_sgpr65, implicit $exec		; GFX90A-NEXT: renamable $vgpr54 = V_CNDMASK_B32_e64 0, 0, 0, 1, killed $sgpr64_sgpr65, implicit $exec
; GFX90A-NEXT: renamable $vgpr5 = V_MOV_B32_e32 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr5 = V_MOV_B32_e32 0, implicit $exec
; GFX90A-NEXT: renamable $vgpr16_vgpr17 = DS_READ_B64_gfx9 killed renamable $vgpr5, 0, 0, implicit $exec :: (load (s64) from `ptr addrspace(3) null`, addrspace 3)		; GFX90A-NEXT: renamable $vgpr16_vgpr17 = DS_READ_B64_gfx9 killed renamable $vgpr5, 0, 0, implicit $exec :: (load (s64) from `ptr addrspace(3) null`, addrspace 3)
; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr21, implicit $exec		; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr21, implicit $exec
; GFX90A-NEXT: renamable $vgpr18_vgpr19 = DS_READ_B64_gfx9 killed renamable $vgpr5, 0, 0, implicit $exec :: (load (s64) from %ir.7, addrspace 3)		; GFX90A-NEXT: renamable $vgpr18_vgpr19 = DS_READ_B64_gfx9 killed renamable $vgpr5, 0, 0, implicit $exec :: (load (s64) from %ir.7, addrspace 3)
; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr22, implicit $exec		; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr22, implicit $exec
; GFX90A-NEXT: renamable $vgpr14_vgpr15 = DS_READ_B64_gfx9 killed renamable $vgpr5, 0, 0, implicit $exec :: (load (s64) from %ir.8, addrspace 3)		; GFX90A-NEXT: renamable $vgpr14_vgpr15 = DS_READ_B64_gfx9 killed renamable $vgpr5, 0, 0, implicit $exec :: (load (s64) from %ir.8, addrspace 3)
; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr58, implicit $exec		; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr58, implicit $exec
; GFX90A-NEXT: renamable $vgpr13 = V_ALIGNBIT_B32_e64 killed $sgpr59, killed $vgpr5, 1, implicit $exec		; GFX90A-NEXT: renamable $vgpr13 = V_ALIGNBIT_B32_e64 killed $sgpr59, killed $vgpr5, 1, implicit $exec
; GFX90A-NEXT: renamable $vgpr30 = V_ALIGNBIT_B32_e64 $vgpr19, $vgpr18, 1, implicit $exec		; GFX90A-NEXT: renamable $vgpr30 = V_ALIGNBIT_B32_e64 $vgpr19, $vgpr18, 1, implicit $exec
; GFX90A-NEXT: renamable $vgpr19 = V_CNDMASK_B32_e64 0, 0, 0, 1, $sgpr12_sgpr13, implicit $exec		; GFX90A-NEXT: renamable $vgpr19 = V_CNDMASK_B32_e64 0, 0, 0, 1, $sgpr12_sgpr13, implicit $exec
; GFX90A-NEXT: renamable $vgpr17 = V_ALIGNBIT_B32_e64 $vgpr17, $vgpr16, 1, implicit $exec		; GFX90A-NEXT: renamable $vgpr17 = V_ALIGNBIT_B32_e64 $vgpr17, $vgpr16, 1, implicit $exec
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_XOR_B64 $exec, -1, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_XOR_B64 $exec, -1, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr62_sgpr63 = S_OR_B64 renamable $sgpr36_sgpr37, $exec, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr62_sgpr63 = S_OR_B64 renamable $sgpr36_sgpr37, $exec, implicit-def dead $scc
; GFX90A-NEXT: S_BRANCH %bb.60		; GFX90A-NEXT: S_BRANCH %bb.61
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.57:		; GFX90A-NEXT: bb.58:
; GFX90A-NEXT: successors: %bb.7(0x80000000)		; GFX90A-NEXT: successors: %bb.7(0x80000000)
; GFX90A-NEXT: liveins: $exec:0x000000000000000F, $sgpr14, $sgpr15, $sgpr16, $sgpr17:0x0000000000000003, $sgpr23:0x0000000000000003, $vgpr31, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr36_sgpr37, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $exec:0x000000000000000F, $sgpr14, $sgpr15, $sgpr16, $sgpr17:0x0000000000000003, $sgpr23:0x0000000000000003, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr36_sgpr37, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr12_vgpr13:0x000000000000000F, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr17 = COPY killed renamable $sgpr23, implicit $exec		; GFX90A-NEXT: renamable $vgpr17 = COPY killed renamable $sgpr23, implicit $exec
; GFX90A-NEXT: renamable $vgpr19 = COPY killed renamable $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr19 = COPY killed renamable $sgpr17, implicit $exec
; GFX90A-NEXT: renamable $sgpr56_sgpr57 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr56_sgpr57 = S_MOV_B64 0
; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_MOV_B64 0
; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_MOV_B64 0
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_MOV_B64 0
; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr48_sgpr49 = S_MOV_B64 0
[18 unchanged lines not shown]	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: renamable $vgpr30 = COPY renamable $vgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr30 = COPY renamable $vgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr18 = COPY renamable $vgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr18 = COPY renamable $vgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr54 = COPY renamable $vgpr19, implicit $exec		; GFX90A-NEXT: renamable $vgpr54 = COPY renamable $vgpr19, implicit $exec
; GFX90A-NEXT: renamable $vgpr15 = COPY renamable $vgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr15 = COPY renamable $vgpr17, implicit $exec
; GFX90A-NEXT: renamable $vgpr14 = COPY renamable $vgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr14 = COPY renamable $vgpr17, implicit $exec
; GFX90A-NEXT: renamable $sgpr34_sgpr35 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr34_sgpr35 = S_MOV_B64 0
; GFX90A-NEXT: S_BRANCH %bb.7		; GFX90A-NEXT: S_BRANCH %bb.7
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.58.bb105:		; GFX90A-NEXT: bb.59.bb105:
; GFX90A-NEXT: successors: %bb.3(0x80000000)		; GFX90A-NEXT: successors: %bb.3(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x00000000000000FF, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000FF, $vgpr2_vgpr3:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $sgpr33, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr58_sgpr59:0x000000000000000F, $sgpr20_sgpr21_sgpr22_sgpr23:0x00000000000000FF, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000FF, $vgpr2_vgpr3:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr0 = V_MOV_B32_e32 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr0 = V_MOV_B32_e32 0, implicit $exec
; GFX90A-NEXT: renamable $vgpr24_vgpr25 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from `ptr addrspace(3) null`, addrspace 3)		; GFX90A-NEXT: renamable $vgpr24_vgpr25 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from `ptr addrspace(3) null`, addrspace 3)
; GFX90A-NEXT: renamable $vgpr0 = COPY renamable $sgpr23, implicit $exec		; GFX90A-NEXT: renamable $vgpr0 = COPY renamable $sgpr23, implicit $exec
; GFX90A-NEXT: renamable $vgpr22_vgpr23 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from %ir.434, addrspace 3)		; GFX90A-NEXT: renamable $vgpr22_vgpr23 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from %ir.434, addrspace 3)
; GFX90A-NEXT: renamable $vgpr0 = COPY renamable $sgpr21, implicit $exec		; GFX90A-NEXT: renamable $vgpr0 = COPY renamable $sgpr21, implicit $exec
; GFX90A-NEXT: renamable $vgpr20_vgpr21 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from %ir.7, addrspace 3)		; GFX90A-NEXT: renamable $vgpr20_vgpr21 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from %ir.7, addrspace 3)
; GFX90A-NEXT: renamable $vgpr0 = COPY killed renamable $sgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr0 = COPY killed renamable $sgpr17, implicit $exec
; GFX90A-NEXT: renamable $agpr0_agpr1 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from %ir.435, addrspace 3)		; GFX90A-NEXT: renamable $vgpr12_vgpr13 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from %ir.435, addrspace 3)
; GFX90A-NEXT: renamable $vgpr0 = COPY renamable $sgpr22, implicit $exec		; GFX90A-NEXT: renamable $vgpr0 = COPY renamable $sgpr22, implicit $exec
; GFX90A-NEXT: renamable $vgpr26_vgpr27 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from %ir.8, addrspace 3)		; GFX90A-NEXT: renamable $vgpr26_vgpr27 = DS_READ_B64_gfx9 killed renamable $vgpr0, 0, 0, implicit $exec :: (load (s64) from %ir.8, addrspace 3)
; GFX90A-NEXT: renamable $sgpr36_sgpr37 = S_MOV_B64 -1		; GFX90A-NEXT: renamable $sgpr36_sgpr37 = S_MOV_B64 -1
; GFX90A-NEXT: renamable $sgpr23 = S_MOV_B32 0		; GFX90A-NEXT: renamable $sgpr23 = S_MOV_B32 0
; GFX90A-NEXT: renamable $sgpr17 = S_MOV_B32 0		; GFX90A-NEXT: renamable $sgpr17 = S_MOV_B32 0
; GFX90A-NEXT: S_BRANCH %bb.3		; GFX90A-NEXT: S_BRANCH %bb.3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.59.bb85:		; GFX90A-NEXT: bb.60.bb85:
; GFX90A-NEXT: successors: %bb.56(0x40000000), %bb.60(0x40000000)		; GFX90A-NEXT: successors: %bb.57(0x40000000), %bb.61(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr58_sgpr59:0x000000000000000F, $sgpr60_sgpr61, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr20, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr58_sgpr59:0x000000000000000F, $sgpr60_sgpr61, $sgpr64_sgpr65, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr10 = V_OR_B32_e32 1, $vgpr8, implicit $exec		; GFX90A-NEXT: renamable $vgpr10 = V_OR_B32_e32 1, $vgpr8, implicit $exec
; GFX90A-NEXT: renamable $vgpr11 = COPY renamable $vgpr9, implicit $exec		; GFX90A-NEXT: renamable $vgpr11 = COPY renamable $vgpr9, implicit $exec
; GFX90A-NEXT: renamable $vgpr5 = FLAT_LOAD_UBYTE renamable $vgpr10_vgpr11, 0, 0, implicit $exec, implicit $flat_scr :: (load (s8) from %ir.i86)		; GFX90A-NEXT: renamable $vgpr5 = FLAT_LOAD_UBYTE renamable $vgpr10_vgpr11, 0, 0, implicit $exec, implicit $flat_scr :: (load (s8) from %ir.i86)
; GFX90A-NEXT: renamable $sgpr17 = S_MOV_B32 0		; GFX90A-NEXT: renamable $sgpr17 = S_MOV_B32 0
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_MOV_B64 -1		; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_MOV_B64 -1
; GFX90A-NEXT: renamable $vcc = V_CMP_EQ_U16_e64 0, killed $vgpr5, implicit $exec		; GFX90A-NEXT: renamable $vcc = V_CMP_EQ_U16_e64 0, killed $vgpr5, implicit $exec
; GFX90A-NEXT: renamable $sgpr62_sgpr63 = COPY renamable $sgpr36_sgpr37		; GFX90A-NEXT: renamable $sgpr62_sgpr63 = COPY renamable $sgpr36_sgpr37
; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr19 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr17 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr16 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr30 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr18 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr54 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr15 = IMPLICIT_DEF
; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF		; GFX90A-NEXT: renamable $vgpr13 = IMPLICIT_DEF
; GFX90A-NEXT: $sgpr52_sgpr53 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr52_sgpr53 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.56, implicit $exec		; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.57, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.60.Flow31:		; GFX90A-NEXT: bb.61.Flow31:
; GFX90A-NEXT: successors: %bb.61(0x80000000)		; GFX90A-NEXT: successors: %bb.54(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr60_sgpr61, $sgpr62_sgpr63, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000C, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr60_sgpr61, $sgpr62_sgpr63, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, 
$vgpr12_vgpr13:0x000000000000000C, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr52_sgpr53, implicit-def $scc		; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr52_sgpr53, implicit-def $scc
; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_MOV_B64 0
; GFX90A-NEXT: renamable $vgpr12 = COPY renamable $vgpr16, implicit $exec		; GFX90A-NEXT: renamable $vgpr12 = COPY renamable $vgpr16, implicit $exec
; GFX90A-NEXT: renamable $agpr0_agpr1 = COPY killed renamable $vgpr12_vgpr13, implicit $exec		; GFX90A-NEXT: S_BRANCH %bb.54
; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.61.Flow30:
; GFX90A-NEXT: successors: %bb.55(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $sgpr17, $vgpr17, $vgpr19, $vgpr20, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr60_sgpr61, $sgpr62_sgpr63, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x0000000000000003, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr54_sgpr55 = S_XOR_B64 $exec, -1, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr56_sgpr57 = S_AND_B64 killed renamable $sgpr52_sgpr53, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr52_sgpr53 = S_AND_B64 killed renamable $sgpr50_sgpr51, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_ANDN2_B64 renamable $sgpr36_sgpr37, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr58_sgpr59 = S_AND_B64 killed renamable $sgpr62_sgpr63, $exec, implicit-def dead $scc
; GFX90A-NEXT: renamable $sgpr50_sgpr51 = S_OR_B64 killed renamable $sgpr50_sgpr51, killed renamable $sgpr58_sgpr59, implicit-def dead $scc
; GFX90A-NEXT: S_BRANCH %bb.55
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.62.bb140:		; GFX90A-NEXT: bb.62.bb140:
; GFX90A-NEXT: successors: %bb.68(0x40000000), %bb.63(0x40000000)		; GFX90A-NEXT: successors: %bb.68(0x40000000), %bb.63(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr30_sgpr31, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, 
$vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr36_sgpr37 = S_MOV_B64 -1		; GFX90A-NEXT: renamable $sgpr36_sgpr37 = S_MOV_B64 -1
; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, killed renamable $sgpr30_sgpr31, implicit-def dead $scc		; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, killed renamable $sgpr30_sgpr31, implicit-def dead $scc
; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.68, implicit $vcc		; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.68, implicit $vcc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.63.Flow13:		; GFX90A-NEXT: bb.63.Flow13:
; GFX90A-NEXT: successors: %bb.64(0x40000000), %bb.66(0x40000000)		; GFX90A-NEXT: successors: %bb.64(0x40000000), %bb.66(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000C, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr36_sgpr37, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000C, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, 
$vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $vcc = S_ANDN2_B64 $exec, killed renamable $sgpr36_sgpr37, implicit-def dead $scc		; GFX90A-NEXT: $vcc = S_ANDN2_B64 $exec, killed renamable $sgpr36_sgpr37, implicit-def dead $scc
; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.66, implicit $vcc		; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.66, implicit $vcc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.64.bb159:		; GFX90A-NEXT: bb.64.bb159:
; GFX90A-NEXT: successors: %bb.67(0x40000000), %bb.65(0x40000000)		; GFX90A-NEXT: successors: %bb.67(0x40000000), %bb.65(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000C, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000C, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, 
$vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vcc = V_CMP_NE_U32_e64 0, killed $vgpr4, implicit $exec		; GFX90A-NEXT: renamable $vcc = V_CMP_NE_U32_e64 0, killed $vgpr4, implicit $exec
; GFX90A-NEXT: $sgpr12_sgpr13 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr12_sgpr13 = S_AND_SAVEEXEC_B64 $vcc, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: renamable $sgpr12_sgpr13 = S_XOR_B64 $exec, killed renamable $sgpr12_sgpr13, implicit-def dead $scc		; GFX90A-NEXT: renamable $sgpr12_sgpr13 = S_XOR_B64 $exec, killed renamable $sgpr12_sgpr13, implicit-def dead $scc
; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.67, implicit $exec		; GFX90A-NEXT: S_CBRANCH_EXECNZ %bb.67, implicit $exec
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.65.Flow10:		; GFX90A-NEXT: bb.65.Flow10:
; GFX90A-NEXT: successors: %bb.66(0x80000000)		; GFX90A-NEXT: successors: %bb.66(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $sgpr12_sgpr13 = S_ANDN2_SAVEEXEC_B64 $sgpr12_sgpr13, implicit-def $exec, implicit-def $scc, implicit $exec		; GFX90A-NEXT: $sgpr12_sgpr13 = S_ANDN2_SAVEEXEC_B64 $sgpr12_sgpr13, implicit-def $exec, implicit-def $scc, implicit $exec
; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr12_sgpr13, implicit-def $scc		; GFX90A-NEXT: $exec = S_OR_B64 $exec, killed renamable $sgpr12_sgpr13, implicit-def $scc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.66.Flow14:		; GFX90A-NEXT: bb.66.Flow14:
; GFX90A-NEXT: successors: %bb.8(0x80000000)		; GFX90A-NEXT: successors: %bb.8(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr31, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr58_sgpr59 = COPY $exec		; GFX90A-NEXT: renamable $sgpr58_sgpr59 = COPY $exec
; GFX90A-NEXT: S_BRANCH %bb.8		; GFX90A-NEXT: S_BRANCH %bb.8
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.67.bb161:		; GFX90A-NEXT: bb.67.bb161:
; GFX90A-NEXT: successors: %bb.65(0x80000000)		; GFX90A-NEXT: successors: %bb.65(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000C, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000C, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, 
$vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr23, killed $vgpr25, implicit $exec		; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr23, killed $vgpr25, implicit $exec
; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr2, killed $vgpr27, implicit $exec		; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr2, killed $vgpr27, implicit $exec
; GFX90A-NEXT: renamable $vgpr3 = COPY killed renamable $agpr1, implicit $exec		; GFX90A-NEXT: renamable $vgpr3 = V_OR_B32_e32 killed $vgpr13, killed $vgpr21, implicit $exec
; GFX90A-NEXT: renamable $vgpr3 = V_OR_B32_e32 killed $vgpr3, killed $vgpr21, implicit $exec
; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr3, killed $vgpr2, implicit $exec		; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr3, killed $vgpr2, implicit $exec
; GFX90A-NEXT: renamable $vgpr3 = V_MOV_B32_e32 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr3 = V_MOV_B32_e32 0, implicit $exec
; GFX90A-NEXT: renamable $vcc = V_CMP_EQ_U16_sdwa 0, killed $vgpr54, 0, $vgpr3, 0, 0, 6, implicit $exec		; GFX90A-NEXT: renamable $vcc = V_CMP_EQ_U16_sdwa 0, killed $vgpr54, 0, $vgpr3, 0, 0, 6, implicit $exec
; GFX90A-NEXT: renamable $vgpr2 = V_CNDMASK_B32_e64 0, 0, 0, killed $vgpr2, killed $vcc, implicit $exec		; GFX90A-NEXT: renamable $vgpr2 = V_CNDMASK_B32_e64 0, 0, 0, killed $vgpr2, killed $vcc, implicit $exec
; GFX90A-NEXT: renamable $vgpr4 = V_OR_B32_e32 killed $vgpr30, killed $vgpr15, implicit $exec		; GFX90A-NEXT: renamable $vgpr4 = V_OR_B32_e32 killed $vgpr30, killed $vgpr15, implicit $exec
; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr4, killed $vgpr2, implicit $exec		; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr4, killed $vgpr2, implicit $exec
; GFX90A-NEXT: renamable $vcc = V_CMP_EQ_U16_sdwa 0, killed $vgpr19, 0, $vgpr3, 0, 0, 6, implicit $exec		; GFX90A-NEXT: renamable $vcc = V_CMP_EQ_U16_sdwa 0, killed $vgpr19, 0, $vgpr3, 0, 0, 6, implicit $exec
; GFX90A-NEXT: renamable $vgpr2 = V_CNDMASK_B32_e64 0, 0, 0, killed $vgpr2, killed $vcc, implicit $exec		; GFX90A-NEXT: renamable $vgpr2 = V_CNDMASK_B32_e64 0, 0, 0, killed $vgpr2, killed $vcc, implicit $exec
; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr2, killed $vgpr17, implicit $exec		; GFX90A-NEXT: renamable $vgpr2 = V_OR_B32_e32 killed $vgpr2, killed $vgpr17, implicit $exec
; GFX90A-NEXT: DS_WRITE2_B32_gfx9 killed renamable $vgpr3, killed renamable $vgpr2, renamable $vgpr3, 0, 1, 0, implicit $exec :: (store (s64) into `ptr addrspace(3) null`, align 4, addrspace 3)		; GFX90A-NEXT: DS_WRITE2_B32_gfx9 killed renamable $vgpr3, killed renamable $vgpr2, renamable $vgpr3, 0, 1, 0, implicit $exec :: (store (s64) into `ptr addrspace(3) null`, align 4, addrspace 3)
; GFX90A-NEXT: S_BRANCH %bb.65		; GFX90A-NEXT: S_BRANCH %bb.65
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.68.bb174:		; GFX90A-NEXT: bb.68.bb174:
; GFX90A-NEXT: successors: %bb.72(0x40000000), %bb.69(0x40000000)		; GFX90A-NEXT: successors: %bb.72(0x40000000), %bb.69(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000F, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr28_sgpr29, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, 
$vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000F, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr20_vgpr21:0x000000000000000F, $vgpr22_vgpr23:0x000000000000000F, $vgpr24_vgpr25:0x000000000000000F, $vgpr26_vgpr27:0x000000000000000F, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr28 = V_OR_B32_e32 1, $vgpr26, implicit $exec		; GFX90A-NEXT: renamable $vgpr28 = V_OR_B32_e32 1, $vgpr26, implicit $exec
; GFX90A-NEXT: renamable $vgpr38 = V_OR_B32_e32 $vgpr28, $vgpr24, implicit $exec		; GFX90A-NEXT: renamable $vgpr38 = V_OR_B32_e32 $vgpr28, $vgpr24, implicit $exec
; GFX90A-NEXT: renamable $vgpr36 = V_OR_B32_e32 $vgpr38, $vgpr22, implicit $exec		; GFX90A-NEXT: renamable $vgpr36 = V_OR_B32_e32 $vgpr38, $vgpr22, implicit $exec
; GFX90A-NEXT: renamable $vgpr32 = V_CNDMASK_B32_e64 0, $vgpr36, 0, 0, $sgpr12_sgpr13, implicit $exec		; GFX90A-NEXT: renamable $vgpr32 = V_CNDMASK_B32_e64 0, $vgpr36, 0, 0, $sgpr12_sgpr13, implicit $exec
; GFX90A-NEXT: renamable $vgpr50 = V_OR_B32_e32 $vgpr32, $vgpr20, implicit $exec		; GFX90A-NEXT: renamable $vgpr50 = V_OR_B32_e32 $vgpr32, $vgpr20, implicit $exec
; GFX90A-NEXT: renamable $vgpr12_vgpr13 = COPY renamable $agpr0_agpr1, implicit $exec		; GFX90A-NEXT: renamable $vgpr48 = V_OR_B32_e32 $vgpr50, $vgpr12, implicit $exec
; GFX90A-NEXT: renamable $vgpr48 = V_OR_B32_e32 $vgpr50, killed $vgpr12, implicit $exec
; GFX90A-NEXT: renamable $vgpr34 = V_OR_B32_e32 $vgpr48, $vgpr14, implicit $exec		; GFX90A-NEXT: renamable $vgpr34 = V_OR_B32_e32 $vgpr48, $vgpr14, implicit $exec
; GFX90A-NEXT: renamable $vgpr52 = V_CNDMASK_B32_e64 0, 0, 0, $vgpr34, killed $sgpr12_sgpr13, implicit $exec		; GFX90A-NEXT: renamable $vgpr52 = V_CNDMASK_B32_e64 0, 0, 0, $vgpr34, killed $sgpr12_sgpr13, implicit $exec
; GFX90A-NEXT: renamable $sgpr12_sgpr13 = S_MOV_B64 -1		; GFX90A-NEXT: renamable $sgpr12_sgpr13 = S_MOV_B64 -1
; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, killed renamable $sgpr28_sgpr29, implicit-def dead $scc		; GFX90A-NEXT: renamable $vcc = S_AND_B64 $exec, killed renamable $sgpr28_sgpr29, implicit-def dead $scc
; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.72, implicit $vcc		; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.72, implicit $vcc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.69.Flow:		; GFX90A-NEXT: bb.69.Flow:
; GFX90A-NEXT: successors: %bb.70(0x40000000), %bb.71(0x40000000)		; GFX90A-NEXT: successors: %bb.70(0x40000000), %bb.71(0x40000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000C, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr28_vgpr29:0x0000000000000003, $vgpr32_vgpr33:0x0000000000000003, $vgpr34_vgpr35:0x0000000000000003, $vgpr36_vgpr37:0x0000000000000003, $vgpr38_vgpr39:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr48_vgpr49:0x0000000000000003, $vgpr50_vgpr51:0x0000000000000003, $vgpr52_vgpr53:0x0000000000000003, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr12_sgpr13, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, 
$vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000C, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr28_vgpr29:0x0000000000000003, $vgpr32_vgpr33:0x0000000000000003, $vgpr34_vgpr35:0x0000000000000003, $vgpr36_vgpr37:0x0000000000000003, $vgpr38_vgpr39:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr48_vgpr49:0x0000000000000003, $vgpr50_vgpr51:0x0000000000000003, $vgpr52_vgpr53:0x0000000000000003, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: $vcc = S_ANDN2_B64 $exec, killed renamable $sgpr12_sgpr13, implicit-def dead $scc		; GFX90A-NEXT: $vcc = S_ANDN2_B64 $exec, killed renamable $sgpr12_sgpr13, implicit-def dead $scc
; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.71, implicit $vcc		; GFX90A-NEXT: S_CBRANCH_VCCNZ %bb.71, implicit $vcc
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.70.bb186:		; GFX90A-NEXT: bb.70.bb186:
; GFX90A-NEXT: successors: %bb.71(0x80000000)		; GFX90A-NEXT: successors: %bb.71(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000C, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr28_vgpr29:0x0000000000000003, $vgpr32_vgpr33:0x0000000000000003, $vgpr34_vgpr35:0x0000000000000003, $vgpr36_vgpr37:0x0000000000000003, $vgpr38_vgpr39:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr48_vgpr49:0x0000000000000003, $vgpr50_vgpr51:0x0000000000000003, $vgpr52_vgpr53:0x0000000000000003, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, 
$vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000C, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr28_vgpr29:0x0000000000000003, $vgpr32_vgpr33:0x0000000000000003, $vgpr34_vgpr35:0x0000000000000003, $vgpr36_vgpr37:0x0000000000000003, $vgpr38_vgpr39:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr48_vgpr49:0x0000000000000003, $vgpr50_vgpr51:0x0000000000000003, $vgpr52_vgpr53:0x0000000000000003, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr2_vgpr3 = V_LSHLREV_B64_e64 3, killed $vgpr2_vgpr3, implicit $exec		; GFX90A-NEXT: renamable $vgpr2_vgpr3 = V_LSHLREV_B64_e64 3, killed $vgpr2_vgpr3, implicit $exec
; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr27, implicit $exec		; GFX90A-NEXT: renamable $vgpr5 = COPY renamable $sgpr27, implicit $exec
; GFX90A-NEXT: renamable $vgpr2, renamable $vcc = V_ADD_CO_U32_e64 killed $sgpr26, $vgpr2, 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr2, renamable $vcc = V_ADD_CO_U32_e64 killed $sgpr26, $vgpr2, 0, implicit $exec
; GFX90A-NEXT: renamable $vgpr3, dead renamable $vcc = V_ADDC_U32_e64 killed $vgpr5, killed $vgpr3, killed $vcc, 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr3, dead renamable $vcc = V_ADDC_U32_e64 killed $vgpr5, killed $vgpr3, killed $vcc, 0, implicit $exec
; GFX90A-NEXT: renamable $vgpr29 = V_MOV_B32_e32 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr29 = V_MOV_B32_e32 0, implicit $exec
; GFX90A-NEXT: renamable $vgpr39 = COPY renamable $vgpr29, implicit $exec		; GFX90A-NEXT: renamable $vgpr39 = COPY renamable $vgpr29, implicit $exec
; GFX90A-NEXT: renamable $vgpr37 = COPY renamable $vgpr29, implicit $exec		; GFX90A-NEXT: renamable $vgpr37 = COPY renamable $vgpr29, implicit $exec
[12 unchanged lines not shown]	define amdgpu_kernel void @f1(ptr addrspace(1) %arg, ptr addrspace(1) %arg1, i64 %arg2, i1 %arg3, i1 %arg4, i1 %arg5, i1 %arg6, ptr addrspace(3) %arg7, ptr addrspace(3) %arg8, ptr addrspace(3) %arg9, ptr addrspace(3) %arg10) {
; GFX90A-NEXT: DS_WRITE_B64_gfx9 renamable $vgpr29, killed renamable $vgpr32_vgpr33, 0, 0, implicit $exec :: (store (s64) into `ptr addrspace(3) null`, addrspace 3)		; GFX90A-NEXT: DS_WRITE_B64_gfx9 renamable $vgpr29, killed renamable $vgpr32_vgpr33, 0, 0, implicit $exec :: (store (s64) into `ptr addrspace(3) null`, addrspace 3)
; GFX90A-NEXT: DS_WRITE_B64_gfx9 killed renamable $vgpr5, killed renamable $vgpr52_vgpr53, 0, 0, implicit $exec :: (store (s64) into %ir.7, addrspace 3)		; GFX90A-NEXT: DS_WRITE_B64_gfx9 killed renamable $vgpr5, killed renamable $vgpr52_vgpr53, 0, 0, implicit $exec :: (store (s64) into %ir.7, addrspace 3)
; GFX90A-NEXT: DS_WRITE_B64_gfx9 killed renamable $vgpr29, killed renamable $vgpr34_vgpr35, 0, 0, implicit $exec :: (store (s64) into `ptr addrspace(3) null`, addrspace 3)		; GFX90A-NEXT: DS_WRITE_B64_gfx9 killed renamable $vgpr29, killed renamable $vgpr34_vgpr35, 0, 0, implicit $exec :: (store (s64) into `ptr addrspace(3) null`, addrspace 3)
; GFX90A-NEXT: BUFFER_STORE_DWORD_OFFSET killed renamable $vgpr3, $sgpr0_sgpr1_sgpr2_sgpr3, 0, 4, 0, 0, implicit $exec :: (store (s32) into `ptr addrspace(5) null` + 4, basealign 8, addrspace 5)		; GFX90A-NEXT: BUFFER_STORE_DWORD_OFFSET killed renamable $vgpr3, $sgpr0_sgpr1_sgpr2_sgpr3, 0, 4, 0, 0, implicit $exec :: (store (s32) into `ptr addrspace(5) null` + 4, basealign 8, addrspace 5)
; GFX90A-NEXT: BUFFER_STORE_DWORD_OFFSET killed renamable $vgpr2, $sgpr0_sgpr1_sgpr2_sgpr3, 0, 0, 0, 0, implicit $exec :: (store (s32) into `ptr addrspace(5) null`, align 8, addrspace 5)		; GFX90A-NEXT: BUFFER_STORE_DWORD_OFFSET killed renamable $vgpr2, $sgpr0_sgpr1_sgpr2_sgpr3, 0, 0, 0, 0, implicit $exec :: (store (s32) into `ptr addrspace(5) null`, align 8, addrspace 5)
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.71.Flow9:		; GFX90A-NEXT: bb.71.Flow9:
; GFX90A-NEXT: successors: %bb.63(0x80000000)		; GFX90A-NEXT: successors: %bb.63(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000C, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $vgpr0_vgpr1:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000C, $vgpr14_vgpr15:0x000000000000000C, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr56_vgpr57:0x000000000000000F, 
$vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $sgpr36_sgpr37 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr36_sgpr37 = S_MOV_B64 0
; GFX90A-NEXT: S_BRANCH %bb.63		; GFX90A-NEXT: S_BRANCH %bb.63
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: bb.72.bb196:		; GFX90A-NEXT: bb.72.bb196:
; GFX90A-NEXT: successors: %bb.69(0x80000000)		; GFX90A-NEXT: successors: %bb.69(0x80000000)
; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $agpr0_agpr1:0x000000000000000C, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, $sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr14_vgpr15:0x000000000000000C, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr28_vgpr29:0x0000000000000003, $vgpr32_vgpr33:0x0000000000000003, $vgpr34_vgpr35:0x0000000000000003, $vgpr36_vgpr37:0x0000000000000003, $vgpr38_vgpr39:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr48_vgpr49:0x0000000000000003, $vgpr50_vgpr51:0x0000000000000003, $vgpr52_vgpr53:0x0000000000000003, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3		; GFX90A-NEXT: liveins: $sgpr14, $sgpr15, $sgpr16, $vgpr17, $vgpr19, $vgpr30, $vgpr31, $vgpr54, $sgpr4_sgpr5, $sgpr6_sgpr7, $sgpr8_sgpr9:0x000000000000000F, $sgpr10_sgpr11, $sgpr18_sgpr19, $sgpr24_sgpr25, $sgpr34_sgpr35, $sgpr38_sgpr39, $sgpr40_sgpr41, $sgpr42_sgpr43, $sgpr44_sgpr45, $sgpr46_sgpr47, $sgpr48_sgpr49, $sgpr50_sgpr51, $sgpr52_sgpr53, $sgpr54_sgpr55, $sgpr56_sgpr57, $sgpr20_sgpr21_sgpr22_sgpr23:0x000000000000003C, 
$sgpr24_sgpr25_sgpr26_sgpr27:0x00000000000000F0, $vgpr0_vgpr1:0x000000000000000F, $vgpr2_vgpr3:0x000000000000000F, $vgpr4_vgpr5:0x0000000000000003, $vgpr6_vgpr7:0x000000000000000F, $vgpr8_vgpr9:0x000000000000000F, $vgpr10_vgpr11:0x000000000000000F, $vgpr12_vgpr13:0x000000000000000C, $vgpr14_vgpr15:0x000000000000000C, $vgpr16_vgpr17:0x0000000000000003, $vgpr18_vgpr19:0x0000000000000003, $vgpr20_vgpr21:0x000000000000000C, $vgpr22_vgpr23:0x000000000000000C, $vgpr24_vgpr25:0x000000000000000C, $vgpr26_vgpr27:0x000000000000000C, $vgpr28_vgpr29:0x0000000000000003, $vgpr32_vgpr33:0x0000000000000003, $vgpr34_vgpr35:0x0000000000000003, $vgpr36_vgpr37:0x0000000000000003, $vgpr38_vgpr39:0x0000000000000003, $vgpr40_vgpr41:0x000000000000000F, $vgpr42_vgpr43:0x000000000000000F, $vgpr44_vgpr45:0x000000000000000F, $vgpr46_vgpr47:0x000000000000000F, $vgpr48_vgpr49:0x0000000000000003, $vgpr50_vgpr51:0x0000000000000003, $vgpr52_vgpr53:0x0000000000000003, $vgpr56_vgpr57:0x000000000000000F, $vgpr58_vgpr59:0x000000000000000F, $vgpr60_vgpr61:0x000000000000000F, $vgpr62_vgpr63:0x000000000000000F, $sgpr0_sgpr1_sgpr2_sgpr3
; GFX90A-NEXT: {{ $}}		; GFX90A-NEXT: {{ $}}
; GFX90A-NEXT: renamable $vgpr5 = V_OR_B32_e32 $vgpr52, killed $vgpr18, implicit $exec		; GFX90A-NEXT: renamable $vgpr5 = V_OR_B32_e32 $vgpr52, killed $vgpr18, implicit $exec
		; GFX90A-NEXT: renamable $vgpr29 = COPY killed renamable $vgpr13, implicit $exec
; GFX90A-NEXT: renamable $vgpr12 = V_OR_B32_e32 killed $vgpr5, killed $vgpr16, implicit $exec		; GFX90A-NEXT: renamable $vgpr12 = V_OR_B32_e32 killed $vgpr5, killed $vgpr16, implicit $exec
; GFX90A-NEXT: renamable $vgpr13 = V_MOV_B32_e32 0, implicit $exec		; GFX90A-NEXT: renamable $vgpr13 = V_MOV_B32_e32 0, implicit $exec
; GFX90A-NEXT: DS_WRITE_B64_gfx9 killed renamable $vgpr13, renamable $vgpr12_vgpr13, 0, 0, implicit $exec :: (store (s64) into `ptr addrspace(3) null`, addrspace 3)		; GFX90A-NEXT: DS_WRITE_B64_gfx9 killed renamable $vgpr13, renamable $vgpr12_vgpr13, 0, 0, implicit $exec :: (store (s64) into `ptr addrspace(3) null`, addrspace 3)
		; GFX90A-NEXT: renamable $vgpr13 = COPY killed renamable $vgpr29, implicit $exec
; GFX90A-NEXT: renamable $sgpr12_sgpr13 = S_MOV_B64 0		; GFX90A-NEXT: renamable $sgpr12_sgpr13 = S_MOV_B64 0
; GFX90A-NEXT: S_BRANCH %bb.69		; GFX90A-NEXT: S_BRANCH %bb.69
bb:		bb:
%i = tail call i32 @llvm.amdgcn.workitem.id.x()		%i = tail call i32 @llvm.amdgcn.workitem.id.x()
%i11 = icmp eq i32 %i, 0		%i11 = icmp eq i32 %i, 0
%i12 = load i32, ptr addrspace(3) null, align 8		%i12 = load i32, ptr addrspace(3) null, align 8
%i13 = zext i32 %i12 to i64		%i13 = zext i32 %i12 to i64
%i14 = getelementptr i32, ptr addrspace(1) %arg, i64 %i13		%i14 = getelementptr i32, ptr addrspace(1) %arg, i64 %i13
[294 lines not shown]

llvm/test/CodeGen/AMDGPU/collapse-endcf.ll

	[55 lines not shown]
	; GCN-O0-NEXT: buffer_store_dword v2, off, s[8:11], 0 offset:4 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v2, off, s[8:11], 0 offset:4 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_mov_b32 s0, 1			; GCN-O0-NEXT: s_mov_b32 s0, 1
	; GCN-O0-NEXT: v_cmp_gt_u32_e64 s[2:3], v0, s0			; GCN-O0-NEXT: v_cmp_gt_u32_e64 s[2:3], v0, s0
	; GCN-O0-NEXT: s_mov_b64 s[0:1], exec			; GCN-O0-NEXT: s_mov_b64 s[0:1], exec
	; GCN-O0-NEXT: v_writelane_b32 v1, s0, 2			; GCN-O0-NEXT: v_writelane_b32 v1, s0, 2
	; GCN-O0-NEXT: v_writelane_b32 v1, s1, 3			; GCN-O0-NEXT: v_writelane_b32 v1, s1, 3
	; GCN-O0-NEXT: s_and_b64 s[0:1], s[0:1], s[2:3]			; GCN-O0-NEXT: s_and_b64 s[0:1], s[0:1], s[2:3]
	; GCN-O0-NEXT: s_mov_b64 exec, s[0:1]			; GCN-O0-NEXT: s_mov_b64 exec, s[0:1]
	; GCN-O0-NEXT: s_cbranch_execz .LBB0_4			; GCN-O0-NEXT: s_cbranch_execz .LBB0_5
	; GCN-O0-NEXT: ; %bb.1: ; %bb.outer.then			; GCN-O0-NEXT: ; %bb.1: ; %bb.outer.then
	; GCN-O0-NEXT: buffer_load_dword v0, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v0, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload
	; GCN-O0-NEXT: v_readlane_b32 s4, v1, 0			; GCN-O0-NEXT: v_readlane_b32 s4, v1, 0
	; GCN-O0-NEXT: v_readlane_b32 s5, v1, 1			; GCN-O0-NEXT: v_readlane_b32 s5, v1, 1
	; GCN-O0-NEXT: s_mov_b32 s2, 0xf000			; GCN-O0-NEXT: s_mov_b32 s2, 0xf000
	; GCN-O0-NEXT: s_mov_b32 s0, 0			; GCN-O0-NEXT: s_mov_b32 s0, 0
	; GCN-O0-NEXT: ; kill: def $sgpr0 killed $sgpr0 def $sgpr0_sgpr1			; GCN-O0-NEXT: ; kill: def $sgpr0 killed $sgpr0 def $sgpr0_sgpr1
	; GCN-O0-NEXT: s_mov_b32 s1, s2			; GCN-O0-NEXT: s_mov_b32 s1, s2
	[34 lines not shown]
	; GCN-O0-NEXT: s_mov_b32 s5, s2			; GCN-O0-NEXT: s_mov_b32 s5, s2
	; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3			; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3
	; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]
	; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64			; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64
	; GCN-O0-NEXT: .LBB0_3: ; %Flow			; GCN-O0-NEXT: .LBB0_3: ; %Flow
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 4			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 4
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 5			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 5
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
	; GCN-O0-NEXT: .LBB0_4: ; %bb.outer.end			; GCN-O0-NEXT: ; %bb.4: ; %Flow
				; GCN-O0-NEXT: .LBB0_5: ; %bb.outer.end
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
				; GCN-O0-NEXT: ; %bb.6: ; %bb.outer.end
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: v_mov_b32_e32 v2, 3			; GCN-O0-NEXT: v_mov_b32_e32 v2, 3
	; GCN-O0-NEXT: v_mov_b32_e32 v0, 0			; GCN-O0-NEXT: v_mov_b32_e32 v0, 0
	; GCN-O0-NEXT: s_mov_b32 m0, -1			; GCN-O0-NEXT: s_mov_b32 m0, -1
	; GCN-O0-NEXT: ds_write_b32 v0, v2			; GCN-O0-NEXT: ds_write_b32 v0, v2
	; GCN-O0-NEXT: s_endpgm			; GCN-O0-NEXT: s_endpgm
	bb:			bb:
	%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()			%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()
	[100 lines not shown]
	; GCN-O0-NEXT: v_mov_b32_e32 v2, 0			; GCN-O0-NEXT: v_mov_b32_e32 v2, 0
	; GCN-O0-NEXT: buffer_store_dword v2, v[3:4], s[4:7], 0 addr64			; GCN-O0-NEXT: buffer_store_dword v2, v[3:4], s[4:7], 0 addr64
	; GCN-O0-NEXT: v_cmp_ne_u32_e64 s[2:3], v0, s0			; GCN-O0-NEXT: v_cmp_ne_u32_e64 s[2:3], v0, s0
	; GCN-O0-NEXT: s_mov_b64 s[0:1], exec			; GCN-O0-NEXT: s_mov_b64 s[0:1], exec
	; GCN-O0-NEXT: v_writelane_b32 v1, s0, 4			; GCN-O0-NEXT: v_writelane_b32 v1, s0, 4
	; GCN-O0-NEXT: v_writelane_b32 v1, s1, 5			; GCN-O0-NEXT: v_writelane_b32 v1, s1, 5
	; GCN-O0-NEXT: s_and_b64 s[0:1], s[0:1], s[2:3]			; GCN-O0-NEXT: s_and_b64 s[0:1], s[0:1], s[2:3]
	; GCN-O0-NEXT: s_mov_b64 exec, s[0:1]			; GCN-O0-NEXT: s_mov_b64 exec, s[0:1]
	; GCN-O0-NEXT: s_cbranch_execz .LBB1_4			; GCN-O0-NEXT: s_cbranch_execz .LBB1_5
	; GCN-O0-NEXT: ; %bb.2: ; %bb.inner.then			; GCN-O0-NEXT: ; %bb.2: ; %bb.inner.then
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: buffer_load_dword v2, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v2, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 0			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 0
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 1			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 1
	; GCN-O0-NEXT: v_mov_b32_e32 v0, 1			; GCN-O0-NEXT: v_mov_b32_e32 v0, 1
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: v_add_i32_e64 v2, s[2:3], v2, v0			; GCN-O0-NEXT: v_add_i32_e64 v2, s[2:3], v2, v0
	; GCN-O0-NEXT: v_ashrrev_i32_e64 v4, 31, v2			; GCN-O0-NEXT: v_ashrrev_i32_e64 v4, 31, v2
	; GCN-O0-NEXT: ; kill: def $vgpr2 killed $vgpr2 def $vgpr2_vgpr3 killed $exec			; GCN-O0-NEXT: ; kill: def $vgpr2 killed $vgpr2 def $vgpr2_vgpr3 killed $exec
	; GCN-O0-NEXT: v_mov_b32_e32 v3, v4			; GCN-O0-NEXT: v_mov_b32_e32 v3, v4
	; GCN-O0-NEXT: s_mov_b32 s2, 2			; GCN-O0-NEXT: s_mov_b32 s2, 2
	; GCN-O0-NEXT: v_lshl_b64 v[2:3], v[2:3], s2			; GCN-O0-NEXT: v_lshl_b64 v[2:3], v[2:3], s2
	; GCN-O0-NEXT: s_mov_b32 s2, 0xf000			; GCN-O0-NEXT: s_mov_b32 s2, 0xf000
	; GCN-O0-NEXT: s_mov_b32 s4, 0			; GCN-O0-NEXT: s_mov_b32 s4, 0
	; GCN-O0-NEXT: ; kill: def $sgpr4 killed $sgpr4 def $sgpr4_sgpr5			; GCN-O0-NEXT: ; kill: def $sgpr4 killed $sgpr4 def $sgpr4_sgpr5
	; GCN-O0-NEXT: s_mov_b32 s5, s2			; GCN-O0-NEXT: s_mov_b32 s5, s2
	; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3			; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3
	; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]
	; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64			; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64
	; GCN-O0-NEXT: s_branch .LBB1_4			; GCN-O0-NEXT: s_branch .LBB1_5
	; GCN-O0-NEXT: .LBB1_3: ; %Flow			; GCN-O0-NEXT: .LBB1_3: ; %Flow
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
	; GCN-O0-NEXT: s_branch .LBB1_5			; GCN-O0-NEXT: ; %bb.4: ; %Flow
	; GCN-O0-NEXT: .LBB1_4: ; %bb.inner.end			; GCN-O0-NEXT: s_branch .LBB1_7
				; GCN-O0-NEXT: .LBB1_5: ; %bb.inner.end
				; GCN-O0-NEXT: v_readlane_b32 s0, v1, 4
				critson: Is this reordering fixing the bug mentioned in the description? (Exec mask is restored before buffer_load, rather than after.)
				arsenm (author): Yes. Previously we would only correctly handle spills used for the exec source value, not other spills.
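				The point of the exchange above can be illustrated with a toy per-lane model. This is a sketch under stated assumptions, not code from the patch: the four-lane width, the helper `vector_reload`, and all values are hypothetical. The only hardware behavior it assumes is that a vector reload writes just the lanes enabled in exec.

```python
# Toy model (hypothetical, not AMDGPU code): a vector register reload
# only writes lanes whose bit is set in the exec mask.
LANES = 4

def vector_reload(vgpr, stack_slot, exec_mask):
    """Return the register contents after a reload under exec_mask."""
    return [stack_slot[i] if (exec_mask >> i) & 1 else vgpr[i]
            for i in range(LANES)]

spilled = [10, 11, 12, 13]   # per-lane value spilled to the stack earlier
stale = [0, 0, 0, 0]         # register contents before the reload
saved_exec = 0b1111          # mask saved before the divergent region
exec_in_region = 0b0011      # narrowed mask still active at the join point

# Old order: reload first, restore exec afterwards.
old = vector_reload(stale, spilled, exec_in_region)

# New order: restore exec (exec | saved), then reload in the following block.
restored = exec_in_region | saved_exec
new = vector_reload(stale, spilled, restored)

print(old)  # [10, 11, 0, 0]   lanes 2 and 3 keep stale data
print(new)  # [10, 11, 12, 13] every lane is reloaded
```

				In this model, reloading before the mask is restored leaves the previously inactive lanes with stale data, which matches the ordering the reviewer asks about.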
				; GCN-O0-NEXT: v_readlane_b32 s1, v1, 5
				; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
				; GCN-O0-NEXT: ; %bb.6: ; %bb.inner.end
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: buffer_load_dword v2, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v2, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload
	; GCN-O0-NEXT: v_readlane_b32 s2, v1, 4
	; GCN-O0-NEXT: v_readlane_b32 s3, v1, 5
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[2:3]
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 0			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 0
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 1			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 1
	; GCN-O0-NEXT: v_mov_b32_e32 v0, 2			; GCN-O0-NEXT: v_mov_b32_e32 v0, 2
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: v_add_i32_e64 v2, s[2:3], v2, v0			; GCN-O0-NEXT: v_add_i32_e64 v2, s[2:3], v2, v0
	; GCN-O0-NEXT: v_ashrrev_i32_e64 v4, 31, v2			; GCN-O0-NEXT: v_ashrrev_i32_e64 v4, 31, v2
	; GCN-O0-NEXT: ; kill: def $vgpr2 killed $vgpr2 def $vgpr2_vgpr3 killed $exec			; GCN-O0-NEXT: ; kill: def $vgpr2 killed $vgpr2 def $vgpr2_vgpr3 killed $exec
	; GCN-O0-NEXT: v_mov_b32_e32 v3, v4			; GCN-O0-NEXT: v_mov_b32_e32 v3, v4
	; GCN-O0-NEXT: v_lshl_b64 v[2:3], v[2:3], v0			; GCN-O0-NEXT: v_lshl_b64 v[2:3], v[2:3], v0
	; GCN-O0-NEXT: s_mov_b32 s2, 0xf000			; GCN-O0-NEXT: s_mov_b32 s2, 0xf000
	; GCN-O0-NEXT: s_mov_b32 s4, 0			; GCN-O0-NEXT: s_mov_b32 s4, 0
	; GCN-O0-NEXT: ; kill: def $sgpr4 killed $sgpr4 def $sgpr4_sgpr5			; GCN-O0-NEXT: ; kill: def $sgpr4 killed $sgpr4 def $sgpr4_sgpr5
	; GCN-O0-NEXT: s_mov_b32 s5, s2			; GCN-O0-NEXT: s_mov_b32 s5, s2
	; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3			; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3
	; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]
	; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64			; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64
	; GCN-O0-NEXT: s_branch .LBB1_3			; GCN-O0-NEXT: s_branch .LBB1_3
	; GCN-O0-NEXT: .LBB1_5: ; %bb.outer.end			; GCN-O0-NEXT: .LBB1_7: ; %bb.outer.end
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: v_mov_b32_e32 v2, 3			; GCN-O0-NEXT: v_mov_b32_e32 v2, 3
	; GCN-O0-NEXT: v_mov_b32_e32 v0, 0			; GCN-O0-NEXT: v_mov_b32_e32 v0, 0
	; GCN-O0-NEXT: s_mov_b32 m0, -1			; GCN-O0-NEXT: s_mov_b32 m0, -1
	; GCN-O0-NEXT: ds_write_b32 v0, v2			; GCN-O0-NEXT: ds_write_b32 v0, v2
	; GCN-O0-NEXT: s_endpgm			; GCN-O0-NEXT: s_endpgm
	bb:			bb:
	%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()			%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()
	[104 lines not shown]
	; GCN-O0-NEXT: buffer_store_dword v2, v[3:4], s[0:3], 0 addr64			; GCN-O0-NEXT: buffer_store_dword v2, v[3:4], s[0:3], 0 addr64
	; GCN-O0-NEXT: s_mov_b32 s0, 1			; GCN-O0-NEXT: s_mov_b32 s0, 1
	; GCN-O0-NEXT: v_cmp_gt_u32_e64 s[2:3], v0, s0			; GCN-O0-NEXT: v_cmp_gt_u32_e64 s[2:3], v0, s0
	; GCN-O0-NEXT: s_mov_b64 s[0:1], exec			; GCN-O0-NEXT: s_mov_b64 s[0:1], exec
	; GCN-O0-NEXT: v_writelane_b32 v1, s0, 2			; GCN-O0-NEXT: v_writelane_b32 v1, s0, 2
	; GCN-O0-NEXT: v_writelane_b32 v1, s1, 3			; GCN-O0-NEXT: v_writelane_b32 v1, s1, 3
	; GCN-O0-NEXT: s_and_b64 s[0:1], s[0:1], s[2:3]			; GCN-O0-NEXT: s_and_b64 s[0:1], s[0:1], s[2:3]
	; GCN-O0-NEXT: s_mov_b64 exec, s[0:1]			; GCN-O0-NEXT: s_mov_b64 exec, s[0:1]
	; GCN-O0-NEXT: s_cbranch_execz .LBB2_6			; GCN-O0-NEXT: s_cbranch_execz .LBB2_7
	; GCN-O0-NEXT: ; %bb.1: ; %bb.outer.then			; GCN-O0-NEXT: ; %bb.1: ; %bb.outer.then
	; GCN-O0-NEXT: buffer_load_dword v0, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v0, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload
	; GCN-O0-NEXT: s_mov_b32 s0, 2			; GCN-O0-NEXT: s_mov_b32 s0, 2
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: v_cmp_ne_u32_e64 s[0:1], v0, s0			; GCN-O0-NEXT: v_cmp_ne_u32_e64 s[0:1], v0, s0
	; GCN-O0-NEXT: s_mov_b64 s[2:3], exec			; GCN-O0-NEXT: s_mov_b64 s[2:3], exec
	; GCN-O0-NEXT: s_and_b64 s[0:1], s[2:3], s[0:1]			; GCN-O0-NEXT: s_and_b64 s[0:1], s[2:3], s[0:1]
	; GCN-O0-NEXT: s_xor_b64 s[2:3], s[0:1], s[2:3]			; GCN-O0-NEXT: s_xor_b64 s[2:3], s[0:1], s[2:3]
	[51 lines not shown]
	; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3			; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3
	; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]
	; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64			; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64
	; GCN-O0-NEXT: s_branch .LBB2_2			; GCN-O0-NEXT: s_branch .LBB2_2
	; GCN-O0-NEXT: .LBB2_5: ; %Flow1			; GCN-O0-NEXT: .LBB2_5: ; %Flow1
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 6			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 6
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 7			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 7
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
	; GCN-O0-NEXT: .LBB2_6: ; %bb.outer.end			; GCN-O0-NEXT: ; %bb.6: ; %Flow1
				; GCN-O0-NEXT: .LBB2_7: ; %bb.outer.end
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
				; GCN-O0-NEXT: ; %bb.8: ; %bb.outer.end
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: v_mov_b32_e32 v2, 3			; GCN-O0-NEXT: v_mov_b32_e32 v2, 3
	; GCN-O0-NEXT: v_mov_b32_e32 v0, 0			; GCN-O0-NEXT: v_mov_b32_e32 v0, 0
	; GCN-O0-NEXT: s_mov_b32 m0, -1			; GCN-O0-NEXT: s_mov_b32 m0, -1
	; GCN-O0-NEXT: ds_write_b32 v0, v2			; GCN-O0-NEXT: ds_write_b32 v0, v2
	; GCN-O0-NEXT: s_endpgm			; GCN-O0-NEXT: s_endpgm
	bb:			bb:
	%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()			%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()
	[138 lines not shown]
	; GCN-O0-NEXT: .LBB3_1: ; %Flow2			; GCN-O0-NEXT: .LBB3_1: ; %Flow2
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 0			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 0
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 1			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 1
	; GCN-O0-NEXT: s_or_saveexec_b64 s[0:1], s[0:1]			; GCN-O0-NEXT: s_or_saveexec_b64 s[0:1], s[0:1]
	; GCN-O0-NEXT: s_and_b64 s[0:1], exec, s[0:1]			; GCN-O0-NEXT: s_and_b64 s[0:1], exec, s[0:1]
	; GCN-O0-NEXT: v_writelane_b32 v1, s0, 2			; GCN-O0-NEXT: v_writelane_b32 v1, s0, 2
	; GCN-O0-NEXT: v_writelane_b32 v1, s1, 3			; GCN-O0-NEXT: v_writelane_b32 v1, s1, 3
	; GCN-O0-NEXT: s_xor_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_xor_b64 exec, exec, s[0:1]
	; GCN-O0-NEXT: s_cbranch_execz .LBB3_8			; GCN-O0-NEXT: s_cbranch_execz .LBB3_10
	; GCN-O0-NEXT: ; %bb.2: ; %bb.outer.then			; GCN-O0-NEXT: ; %bb.2: ; %bb.outer.then
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: buffer_load_dword v0, off, s[8:11], 0 offset:12 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v0, off, s[8:11], 0 offset:12 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v3, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v3, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v4, off, s[8:11], 0 offset:8 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v4, off, s[8:11], 0 offset:8 ; 4-byte Folded Reload
	; GCN-O0-NEXT: s_mov_b32 s0, 0xf000			; GCN-O0-NEXT: s_mov_b32 s0, 0xf000
	; GCN-O0-NEXT: s_mov_b32 s2, 0			; GCN-O0-NEXT: s_mov_b32 s2, 0
	; GCN-O0-NEXT: s_mov_b32 s4, s2			; GCN-O0-NEXT: s_mov_b32 s4, s2
	; GCN-O0-NEXT: s_mov_b32 s5, s0			; GCN-O0-NEXT: s_mov_b32 s5, s0
	; GCN-O0-NEXT: s_mov_b32 s0, s2			; GCN-O0-NEXT: s_mov_b32 s0, s2
	; GCN-O0-NEXT: s_mov_b32 s1, s2			; GCN-O0-NEXT: s_mov_b32 s1, s2
	; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3			; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3
	; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]
	; GCN-O0-NEXT: v_mov_b32_e32 v2, 1			; GCN-O0-NEXT: v_mov_b32_e32 v2, 1
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v2, v[3:4], s[0:3], 0 addr64 offset:4			; GCN-O0-NEXT: buffer_store_dword v2, v[3:4], s[0:3], 0 addr64 offset:4
	; GCN-O0-NEXT: s_mov_b32 s0, 2			; GCN-O0-NEXT: s_mov_b32 s0, 2
	; GCN-O0-NEXT: v_cmp_eq_u32_e64 s[2:3], v0, s0			; GCN-O0-NEXT: v_cmp_eq_u32_e64 s[2:3], v0, s0
	; GCN-O0-NEXT: s_mov_b64 s[0:1], exec			; GCN-O0-NEXT: s_mov_b64 s[0:1], exec
	; GCN-O0-NEXT: v_writelane_b32 v1, s0, 4			; GCN-O0-NEXT: v_writelane_b32 v1, s0, 4
	; GCN-O0-NEXT: v_writelane_b32 v1, s1, 5			; GCN-O0-NEXT: v_writelane_b32 v1, s1, 5
	; GCN-O0-NEXT: s_and_b64 s[0:1], s[0:1], s[2:3]			; GCN-O0-NEXT: s_and_b64 s[0:1], s[0:1], s[2:3]
	; GCN-O0-NEXT: s_mov_b64 exec, s[0:1]			; GCN-O0-NEXT: s_mov_b64 exec, s[0:1]
	; GCN-O0-NEXT: s_cbranch_execz .LBB3_7			; GCN-O0-NEXT: s_cbranch_execz .LBB3_8
	; GCN-O0-NEXT: ; %bb.3: ; %bb.inner.then			; GCN-O0-NEXT: ; %bb.3: ; %bb.inner.then
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: buffer_load_dword v2, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v2, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v3, off, s[8:11], 0 offset:8 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v3, off, s[8:11], 0 offset:8 ; 4-byte Folded Reload
	; GCN-O0-NEXT: s_mov_b32 s0, 0xf000			; GCN-O0-NEXT: s_mov_b32 s0, 0xf000
	; GCN-O0-NEXT: s_mov_b32 s2, 0			; GCN-O0-NEXT: s_mov_b32 s2, 0
	; GCN-O0-NEXT: s_mov_b32 s4, s2			; GCN-O0-NEXT: s_mov_b32 s4, s2
	; GCN-O0-NEXT: s_mov_b32 s5, s0			; GCN-O0-NEXT: s_mov_b32 s5, s0
	; GCN-O0-NEXT: s_mov_b32 s0, s2			; GCN-O0-NEXT: s_mov_b32 s0, s2
	; GCN-O0-NEXT: s_mov_b32 s1, s2			; GCN-O0-NEXT: s_mov_b32 s1, s2
	; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3			; GCN-O0-NEXT: ; kill: def $sgpr0_sgpr1 killed $sgpr0_sgpr1 def $sgpr0_sgpr1_sgpr2_sgpr3
	; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]
	; GCN-O0-NEXT: v_mov_b32_e32 v0, 2			; GCN-O0-NEXT: v_mov_b32_e32 v0, 2
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64 offset:8			; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64 offset:8
	; GCN-O0-NEXT: s_branch .LBB3_7			; GCN-O0-NEXT: s_branch .LBB3_8
	; GCN-O0-NEXT: .LBB3_4: ; %bb.outer.else			; GCN-O0-NEXT: .LBB3_4: ; %bb.outer.else
	; GCN-O0-NEXT: buffer_load_dword v0, off, s[8:11], 0 offset:12 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v0, off, s[8:11], 0 offset:12 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v3, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v3, off, s[8:11], 0 offset:4 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v4, off, s[8:11], 0 offset:8 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v4, off, s[8:11], 0 offset:8 ; 4-byte Folded Reload
	; GCN-O0-NEXT: s_mov_b32 s1, 0xf000			; GCN-O0-NEXT: s_mov_b32 s1, 0xf000
	; GCN-O0-NEXT: s_mov_b32 s0, 0			; GCN-O0-NEXT: s_mov_b32 s0, 0
	; GCN-O0-NEXT: s_mov_b32 s2, s0			; GCN-O0-NEXT: s_mov_b32 s2, s0
	; GCN-O0-NEXT: s_mov_b32 s3, s1			; GCN-O0-NEXT: s_mov_b32 s3, s1
	[26 lines not shown]
	; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[2:3], s[4:5]
	; GCN-O0-NEXT: v_mov_b32_e32 v0, 4			; GCN-O0-NEXT: v_mov_b32_e32 v0, 4
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64 offset:16			; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64 offset:16
	; GCN-O0-NEXT: .LBB3_6: ; %Flow			; GCN-O0-NEXT: .LBB3_6: ; %Flow
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 6			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 6
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 7			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 7
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
				; GCN-O0-NEXT: ; %bb.7: ; %Flow
	; GCN-O0-NEXT: s_branch .LBB3_1			; GCN-O0-NEXT: s_branch .LBB3_1
	; GCN-O0-NEXT: .LBB3_7: ; %Flow1			; GCN-O0-NEXT: .LBB3_8: ; %Flow1
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 4			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 4
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 5			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 5
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
	; GCN-O0-NEXT: .LBB3_8: ; %bb.outer.end			; GCN-O0-NEXT: ; %bb.9: ; %Flow1
				; GCN-O0-NEXT: .LBB3_10: ; %bb.outer.end
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
				; GCN-O0-NEXT: ; %bb.11: ; %bb.outer.end
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: v_mov_b32_e32 v2, 3			; GCN-O0-NEXT: v_mov_b32_e32 v2, 3
	; GCN-O0-NEXT: v_mov_b32_e32 v0, 0			; GCN-O0-NEXT: v_mov_b32_e32 v0, 0
	; GCN-O0-NEXT: s_mov_b32 m0, -1			; GCN-O0-NEXT: s_mov_b32 m0, -1
	; GCN-O0-NEXT: ds_write_b32 v0, v2			; GCN-O0-NEXT: ds_write_b32 v0, v2
	; GCN-O0-NEXT: s_endpgm			; GCN-O0-NEXT: s_endpgm
	bb:			bb:
	%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()			%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()
	[89 lines not shown]
	; GCN-O0-NEXT: s_mov_b32 s4, 2			; GCN-O0-NEXT: s_mov_b32 s4, 2
	; GCN-O0-NEXT: v_lshl_b64 v[2:3], v[2:3], s4			; GCN-O0-NEXT: v_lshl_b64 v[2:3], v[2:3], s4
	; GCN-O0-NEXT: v_mov_b32_e32 v0, 0			; GCN-O0-NEXT: v_mov_b32_e32 v0, 0
	; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64			; GCN-O0-NEXT: buffer_store_dword v0, v[2:3], s[0:3], 0 addr64
	; GCN-O0-NEXT: .LBB4_2: ; %bb.end			; GCN-O0-NEXT: .LBB4_2: ; %bb.end
	; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2			; GCN-O0-NEXT: v_readlane_b32 s0, v1, 2
	; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3			; GCN-O0-NEXT: v_readlane_b32 s1, v1, 3
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[0:1]
				; GCN-O0-NEXT: ; %bb.3: ; %bb.end
	; GCN-O0-NEXT: s_waitcnt vmcnt(0) expcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0) expcnt(0)
	; GCN-O0-NEXT: s_barrier			; GCN-O0-NEXT: s_barrier
	; GCN-O0-NEXT: s_endpgm			; GCN-O0-NEXT: s_endpgm
	bb:			bb:
	%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()			%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()
	%tmp1 = icmp ugt i32 %tmp, 1			%tmp1 = icmp ugt i32 %tmp, 1
	br i1 %tmp1, label %bb.then, label %bb.end			br i1 %tmp1, label %bb.then, label %bb.end

	[112 lines not shown]
	; GCN-O0-NEXT: v_writelane_b32 v1, s7, 1			; GCN-O0-NEXT: v_writelane_b32 v1, s7, 1
	; GCN-O0-NEXT: s_mov_b64 s[6:7], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[6:7], s[4:5]
	; GCN-O0-NEXT: v_writelane_b32 v1, s6, 2			; GCN-O0-NEXT: v_writelane_b32 v1, s6, 2
	; GCN-O0-NEXT: v_writelane_b32 v1, s7, 3			; GCN-O0-NEXT: v_writelane_b32 v1, s7, 3
	; GCN-O0-NEXT: s_andn2_b64 exec, exec, s[4:5]			; GCN-O0-NEXT: s_andn2_b64 exec, exec, s[4:5]
	; GCN-O0-NEXT: s_cbranch_execnz .LBB5_1			; GCN-O0-NEXT: s_cbranch_execnz .LBB5_1
	; GCN-O0-NEXT: ; %bb.2: ; %bb2			; GCN-O0-NEXT: ; %bb.2: ; %bb2
	; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1			; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
	; GCN-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 ; 4-byte Folded Reload
	; GCN-O0-NEXT: v_readlane_b32 s4, v1, 6			; GCN-O0-NEXT: v_readlane_b32 s4, v1, 6
	; GCN-O0-NEXT: v_readlane_b32 s5, v1, 7			; GCN-O0-NEXT: v_readlane_b32 s5, v1, 7
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[4:5]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[4:5]
				; GCN-O0-NEXT: ; %bb.3: ; %bb2
				; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
				; GCN-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 ; 4-byte Folded Reload
	; GCN-O0-NEXT: s_mov_b32 s6, 0			; GCN-O0-NEXT: s_mov_b32 s6, 0
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: v_cmp_ne_u32_e64 s[4:5], v0, s6			; GCN-O0-NEXT: v_cmp_ne_u32_e64 s[4:5], v0, s6
	; GCN-O0-NEXT: v_cmp_eq_u32_e64 s[6:7], v0, s6			; GCN-O0-NEXT: v_cmp_eq_u32_e64 s[6:7], v0, s6
	; GCN-O0-NEXT: v_writelane_b32 v1, s4, 8			; GCN-O0-NEXT: v_writelane_b32 v1, s4, 8
	; GCN-O0-NEXT: v_writelane_b32 v1, s5, 9			; GCN-O0-NEXT: v_writelane_b32 v1, s5, 9
	; GCN-O0-NEXT: s_mov_b32 s4, 0			; GCN-O0-NEXT: s_mov_b32 s4, 0
	; GCN-O0-NEXT: s_mov_b32 s8, s4			; GCN-O0-NEXT: s_mov_b32 s8, s4
	(9 lines not shown)
	; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:8 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:8 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:12 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:12 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:16 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:16 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_mov_b64 s[4:5], exec			; GCN-O0-NEXT: s_mov_b64 s[4:5], exec
	; GCN-O0-NEXT: v_writelane_b32 v1, s4, 10			; GCN-O0-NEXT: v_writelane_b32 v1, s4, 10
	; GCN-O0-NEXT: v_writelane_b32 v1, s5, 11			; GCN-O0-NEXT: v_writelane_b32 v1, s5, 11
	; GCN-O0-NEXT: s_and_b64 s[4:5], s[4:5], s[6:7]			; GCN-O0-NEXT: s_and_b64 s[4:5], s[4:5], s[6:7]
	; GCN-O0-NEXT: s_mov_b64 exec, s[4:5]			; GCN-O0-NEXT: s_mov_b64 exec, s[4:5]
	; GCN-O0-NEXT: s_cbranch_execz .LBB5_5			; GCN-O0-NEXT: s_cbranch_execz .LBB5_6
	; GCN-O0-NEXT: ; %bb.3: ; %bb4			; GCN-O0-NEXT: ; %bb.4: ; %bb4
	; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1			; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
	; GCN-O0-NEXT: ; implicit-def: $sgpr4			; GCN-O0-NEXT: ; implicit-def: $sgpr4
	; GCN-O0-NEXT: v_mov_b32_e32 v0, s4			; GCN-O0-NEXT: v_mov_b32_e32 v0, s4
	; GCN-O0-NEXT: buffer_load_dword v0, v0, s[0:3], 0 offen			; GCN-O0-NEXT: buffer_load_dword v0, v0, s[0:3], 0 offen
	; GCN-O0-NEXT: s_mov_b32 s4, 0			; GCN-O0-NEXT: s_mov_b32 s4, 0
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: v_cmp_lt_f32_e64 s[6:7], v0, s4			; GCN-O0-NEXT: v_cmp_lt_f32_e64 s[6:7], v0, s4
	; GCN-O0-NEXT: s_mov_b32 s8, s4			; GCN-O0-NEXT: s_mov_b32 s8, s4
	(10 lines not shown)
	; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:24 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:24 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:28 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:28 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:32 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:32 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_mov_b64 s[4:5], exec			; GCN-O0-NEXT: s_mov_b64 s[4:5], exec
	; GCN-O0-NEXT: v_writelane_b32 v1, s4, 12			; GCN-O0-NEXT: v_writelane_b32 v1, s4, 12
	; GCN-O0-NEXT: v_writelane_b32 v1, s5, 13			; GCN-O0-NEXT: v_writelane_b32 v1, s5, 13
	; GCN-O0-NEXT: s_and_b64 s[4:5], s[4:5], s[6:7]			; GCN-O0-NEXT: s_and_b64 s[4:5], s[4:5], s[6:7]
	; GCN-O0-NEXT: s_mov_b64 exec, s[4:5]			; GCN-O0-NEXT: s_mov_b64 exec, s[4:5]
	; GCN-O0-NEXT: s_cbranch_execz .LBB5_6			; GCN-O0-NEXT: s_cbranch_execz .LBB5_8
	; GCN-O0-NEXT: ; %bb.4: ; %bb8			; GCN-O0-NEXT: ; %bb.5: ; %bb8
	; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1			; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
	; GCN-O0-NEXT: s_mov_b32 s10, 0			; GCN-O0-NEXT: s_mov_b32 s10, 0
	; GCN-O0-NEXT: ; implicit-def: $sgpr4			; GCN-O0-NEXT: ; implicit-def: $sgpr4
	; GCN-O0-NEXT: ; implicit-def: $sgpr5			; GCN-O0-NEXT: ; implicit-def: $sgpr5
	; GCN-O0-NEXT: ; implicit-def: $sgpr9			; GCN-O0-NEXT: ; implicit-def: $sgpr9
	; GCN-O0-NEXT: ; implicit-def: $sgpr5			; GCN-O0-NEXT: ; implicit-def: $sgpr5
	; GCN-O0-NEXT: ; implicit-def: $sgpr8			; GCN-O0-NEXT: ; implicit-def: $sgpr8
	; GCN-O0-NEXT: ; implicit-def: $sgpr5			; GCN-O0-NEXT: ; implicit-def: $sgpr5
	; GCN-O0-NEXT: ; kill: def $sgpr4 killed $sgpr4 def $sgpr4_sgpr5_sgpr6_sgpr7			; GCN-O0-NEXT: ; kill: def $sgpr4 killed $sgpr4 def $sgpr4_sgpr5_sgpr6_sgpr7
	; GCN-O0-NEXT: s_mov_b32 s5, s10			; GCN-O0-NEXT: s_mov_b32 s5, s10
	; GCN-O0-NEXT: s_mov_b32 s6, s9			; GCN-O0-NEXT: s_mov_b32 s6, s9
	; GCN-O0-NEXT: s_mov_b32 s7, s8			; GCN-O0-NEXT: s_mov_b32 s7, s8
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: v_mov_b32_e32 v2, s4			; GCN-O0-NEXT: v_mov_b32_e32 v2, s4
	; GCN-O0-NEXT: v_mov_b32_e32 v3, s5			; GCN-O0-NEXT: v_mov_b32_e32 v3, s5
	; GCN-O0-NEXT: v_mov_b32_e32 v4, s6			; GCN-O0-NEXT: v_mov_b32_e32 v4, s6
	; GCN-O0-NEXT: v_mov_b32_e32 v5, s7			; GCN-O0-NEXT: v_mov_b32_e32 v5, s7
	; GCN-O0-NEXT: buffer_store_dword v2, off, s[0:3], s32 offset:20 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v2, off, s[0:3], s32 offset:20 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:24 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:24 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:28 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:28 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:32 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:32 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_branch .LBB5_6			; GCN-O0-NEXT: s_branch .LBB5_8
	; GCN-O0-NEXT: .LBB5_5: ; %Flow2			; GCN-O0-NEXT: .LBB5_6: ; %Flow2
				; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
				; GCN-O0-NEXT: v_readlane_b32 s4, v1, 10
				; GCN-O0-NEXT: v_readlane_b32 s5, v1, 11
				; GCN-O0-NEXT: s_or_b64 exec, exec, s[4:5]
				; GCN-O0-NEXT: ; %bb.7: ; %Flow2
	; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1			; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:4 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:4 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v3, off, s[0:3], s32 offset:8 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v3, off, s[0:3], s32 offset:8 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 offset:12 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 offset:12 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v5, off, s[0:3], s32 offset:16 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v5, off, s[0:3], s32 offset:16 ; 4-byte Folded Reload
	; GCN-O0-NEXT: v_readlane_b32 s4, v1, 10
	; GCN-O0-NEXT: v_readlane_b32 s5, v1, 11
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[4:5]
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v2, off, s[0:3], s32 offset:36 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v2, off, s[0:3], s32 offset:36 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:40 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:40 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:44 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:44 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:48 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:48 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_branch .LBB5_7			; GCN-O0-NEXT: s_branch .LBB5_10
	; GCN-O0-NEXT: .LBB5_6: ; %Flow			; GCN-O0-NEXT: .LBB5_8: ; %Flow
				; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
				; GCN-O0-NEXT: v_readlane_b32 s4, v1, 12
				; GCN-O0-NEXT: v_readlane_b32 s5, v1, 13
				; GCN-O0-NEXT: s_or_b64 exec, exec, s[4:5]
				; GCN-O0-NEXT: ; %bb.9: ; %Flow
	; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1			; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:20 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:20 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v3, off, s[0:3], s32 offset:24 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v3, off, s[0:3], s32 offset:24 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 offset:28 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 offset:28 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v5, off, s[0:3], s32 offset:32 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v5, off, s[0:3], s32 offset:32 ; 4-byte Folded Reload
	; GCN-O0-NEXT: v_readlane_b32 s4, v1, 12
	; GCN-O0-NEXT: v_readlane_b32 s5, v1, 13
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[4:5]
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v2, off, s[0:3], s32 offset:4 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v2, off, s[0:3], s32 offset:4 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:8 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:8 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:12 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:12 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:16 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:16 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_branch .LBB5_5			; GCN-O0-NEXT: s_branch .LBB5_6
	; GCN-O0-NEXT: .LBB5_7: ; %bb10			; GCN-O0-NEXT: .LBB5_10: ; %bb10
	; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1			; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
	; GCN-O0-NEXT: v_readlane_b32 s6, v1, 8			; GCN-O0-NEXT: v_readlane_b32 s6, v1, 8
	; GCN-O0-NEXT: v_readlane_b32 s7, v1, 9			; GCN-O0-NEXT: v_readlane_b32 s7, v1, 9
	; GCN-O0-NEXT: s_mov_b64 s[4:5], -1			; GCN-O0-NEXT: s_mov_b64 s[4:5], -1
	; GCN-O0-NEXT: v_writelane_b32 v1, s4, 14			; GCN-O0-NEXT: v_writelane_b32 v1, s4, 14
	; GCN-O0-NEXT: v_writelane_b32 v1, s5, 15			; GCN-O0-NEXT: v_writelane_b32 v1, s5, 15
	; GCN-O0-NEXT: s_mov_b64 s[4:5], exec			; GCN-O0-NEXT: s_mov_b64 s[4:5], exec
	; GCN-O0-NEXT: v_writelane_b32 v1, s4, 16			; GCN-O0-NEXT: v_writelane_b32 v1, s4, 16
	; GCN-O0-NEXT: v_writelane_b32 v1, s5, 17			; GCN-O0-NEXT: v_writelane_b32 v1, s5, 17
	; GCN-O0-NEXT: s_and_b64 s[4:5], s[4:5], s[6:7]			; GCN-O0-NEXT: s_and_b64 s[4:5], s[4:5], s[6:7]
	; GCN-O0-NEXT: s_mov_b64 exec, s[4:5]			; GCN-O0-NEXT: s_mov_b64 exec, s[4:5]
	; GCN-O0-NEXT: s_cbranch_execz .LBB5_9			; GCN-O0-NEXT: s_cbranch_execz .LBB5_12
	; GCN-O0-NEXT: ; %bb.8: ; %Flow1			; GCN-O0-NEXT: ; %bb.11: ; %Flow1
	; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1			; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
	; GCN-O0-NEXT: s_mov_b64 s[4:5], 0			; GCN-O0-NEXT: s_mov_b64 s[4:5], 0
	; GCN-O0-NEXT: s_xor_b64 s[4:5], exec, -1			; GCN-O0-NEXT: s_xor_b64 s[4:5], exec, -1
	; GCN-O0-NEXT: v_writelane_b32 v1, s4, 14			; GCN-O0-NEXT: v_writelane_b32 v1, s4, 14
	; GCN-O0-NEXT: v_writelane_b32 v1, s5, 15			; GCN-O0-NEXT: v_writelane_b32 v1, s5, 15
	; GCN-O0-NEXT: .LBB5_9: ; %Flow3			; GCN-O0-NEXT: .LBB5_12: ; %Flow3
				; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
				; GCN-O0-NEXT: v_readlane_b32 s4, v1, 16
				; GCN-O0-NEXT: v_readlane_b32 s5, v1, 17
				; GCN-O0-NEXT: s_or_b64 exec, exec, s[4:5]
				; GCN-O0-NEXT: ; %bb.13: ; %Flow3
	; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1			; GCN-O0-NEXT: ; in Loop: Header=BB5_1 Depth=1
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:36 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:36 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v3, off, s[0:3], s32 offset:40 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v3, off, s[0:3], s32 offset:40 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 offset:44 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 offset:44 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v5, off, s[0:3], s32 offset:48 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v5, off, s[0:3], s32 offset:48 ; 4-byte Folded Reload
	; GCN-O0-NEXT: v_readlane_b32 s8, v1, 16
	; GCN-O0-NEXT: v_readlane_b32 s9, v1, 17
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[8:9]
	; GCN-O0-NEXT: v_readlane_b32 s6, v1, 4			; GCN-O0-NEXT: v_readlane_b32 s6, v1, 4
	; GCN-O0-NEXT: v_readlane_b32 s7, v1, 5			; GCN-O0-NEXT: v_readlane_b32 s7, v1, 5
	; GCN-O0-NEXT: v_readlane_b32 s4, v1, 14			; GCN-O0-NEXT: v_readlane_b32 s4, v1, 14
	; GCN-O0-NEXT: v_readlane_b32 s5, v1, 15			; GCN-O0-NEXT: v_readlane_b32 s5, v1, 15
	; GCN-O0-NEXT: s_and_b64 s[4:5], exec, s[4:5]			; GCN-O0-NEXT: s_and_b64 s[4:5], exec, s[4:5]
	; GCN-O0-NEXT: s_or_b64 s[4:5], s[4:5], s[6:7]			; GCN-O0-NEXT: s_or_b64 s[4:5], s[4:5], s[6:7]
	; GCN-O0-NEXT: s_mov_b64 s[6:7], 0			; GCN-O0-NEXT: s_mov_b64 s[6:7], 0
	; GCN-O0-NEXT: s_mov_b64 s[8:9], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[8:9], s[4:5]
	; GCN-O0-NEXT: v_writelane_b32 v1, s8, 0			; GCN-O0-NEXT: v_writelane_b32 v1, s8, 0
	; GCN-O0-NEXT: v_writelane_b32 v1, s9, 1			; GCN-O0-NEXT: v_writelane_b32 v1, s9, 1
	; GCN-O0-NEXT: v_writelane_b32 v1, s6, 2			; GCN-O0-NEXT: v_writelane_b32 v1, s6, 2
	; GCN-O0-NEXT: v_writelane_b32 v1, s7, 3			; GCN-O0-NEXT: v_writelane_b32 v1, s7, 3
	; GCN-O0-NEXT: s_mov_b64 s[6:7], s[4:5]			; GCN-O0-NEXT: s_mov_b64 s[6:7], s[4:5]
	; GCN-O0-NEXT: v_writelane_b32 v1, s6, 18			; GCN-O0-NEXT: v_writelane_b32 v1, s6, 18
	; GCN-O0-NEXT: v_writelane_b32 v1, s7, 19			; GCN-O0-NEXT: v_writelane_b32 v1, s7, 19
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v2, off, s[0:3], s32 offset:52 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v2, off, s[0:3], s32 offset:52 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:56 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v3, off, s[0:3], s32 offset:56 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:60 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v4, off, s[0:3], s32 offset:60 ; 4-byte Folded Spill
	; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:64 ; 4-byte Folded Spill			; GCN-O0-NEXT: buffer_store_dword v5, off, s[0:3], s32 offset:64 ; 4-byte Folded Spill
	; GCN-O0-NEXT: s_andn2_b64 exec, exec, s[4:5]			; GCN-O0-NEXT: s_andn2_b64 exec, exec, s[4:5]
	; GCN-O0-NEXT: s_cbranch_execnz .LBB5_1			; GCN-O0-NEXT: s_cbranch_execnz .LBB5_1
	; GCN-O0-NEXT: ; %bb.10: ; %bb12			; GCN-O0-NEXT: ; %bb.14: ; %bb12
	; GCN-O0-NEXT: v_readlane_b32 s4, v1, 18			; GCN-O0-NEXT: v_readlane_b32 s4, v1, 18
	; GCN-O0-NEXT: v_readlane_b32 s5, v1, 19			; GCN-O0-NEXT: v_readlane_b32 s5, v1, 19
	; GCN-O0-NEXT: s_or_b64 exec, exec, s[4:5]			; GCN-O0-NEXT: s_or_b64 exec, exec, s[4:5]
	; GCN-O0-NEXT: ; %bb.11: ; %bb12			; GCN-O0-NEXT: ; %bb.15: ; %bb12
	; GCN-O0-NEXT: s_waitcnt expcnt(0)			; GCN-O0-NEXT: s_waitcnt expcnt(0)
	; GCN-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:52 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:52 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v3, off, s[0:3], s32 offset:56 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v3, off, s[0:3], s32 offset:56 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 offset:60 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 offset:60 ; 4-byte Folded Reload
	; GCN-O0-NEXT: buffer_load_dword v5, off, s[0:3], s32 offset:64 ; 4-byte Folded Reload			; GCN-O0-NEXT: buffer_load_dword v5, off, s[0:3], s32 offset:64 ; 4-byte Folded Reload
	; GCN-O0-NEXT: s_waitcnt vmcnt(0)			; GCN-O0-NEXT: s_waitcnt vmcnt(0)
	; GCN-O0-NEXT: v_mov_b32_e32 v0, v5			; GCN-O0-NEXT: v_mov_b32_e32 v0, v5
	; GCN-O0-NEXT: ; implicit-def: $sgpr4			; GCN-O0-NEXT: ; implicit-def: $sgpr4
	(65 lines not shown)

llvm/test/CodeGen/AMDGPU/collapse-endcf.mir

(12 lines not shown)	body: |
; GCN-NEXT: successors: %bb.1(0x40000000), %bb.4(0x40000000)		; GCN-NEXT: successors: %bb.1(0x40000000), %bb.4(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], undef %1:sreg_64, implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], undef %1:sreg_64, implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.4, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.4, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.1:		; GCN-NEXT: bb.1:
; GCN-NEXT: successors: %bb.2(0x40000000), %bb.4(0x40000000)		; GCN-NEXT: successors: %bb.2(0x40000000), %bb.5(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], undef %3:sreg_64, implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], undef %3:sreg_64, implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.4, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.5, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.2:		; GCN-NEXT: bb.2:
		; GCN-NEXT: successors: %bb.5(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.5:
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
		; GCN-NEXT: DBG_VALUE
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: successors: %bb.6(0x80000000)
		; GCN-NEXT: {{ $}}
; GCN-NEXT: DBG_VALUE		; GCN-NEXT: DBG_VALUE
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.6:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.4		successors: %bb.1, %bb.4

%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

bb.1:		bb.1:
successors: %bb.2, %bb.3		successors: %bb.2, %bb.3
(41 lines not shown)	body: |
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: successors: %bb.5(0x80000000)		; GCN-NEXT: successors: %bb.5(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.5:		; GCN-NEXT: bb.5:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: successors: %bb.6(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.6:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

bb.1:		bb.1:
successors: %bb.2, %bb.3		successors: %bb.2, %bb.3

%2:sreg_64 = SI_IF undef %3:sreg_64, %bb.3, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%2:sreg_64 = SI_IF undef %3:sreg_64, %bb.3, implicit-def dead $exec, implicit-def dead $scc, implicit $exec
(40 lines not shown)	body: |
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: successors: %bb.5(0x80000000)		; GCN-NEXT: successors: %bb.5(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: DBG_VALUE		; GCN-NEXT: DBG_VALUE
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.5:		; GCN-NEXT: bb.5:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: successors: %bb.6(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.6:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.4		successors: %bb.1, %bb.4

%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

bb.1:		bb.1:
successors: %bb.2, %bb.3		successors: %bb.2, %bb.3
(45 lines not shown)	body: |
; GCN-NEXT: bb.3:		; GCN-NEXT: bb.3:
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[DEF:%[0-9]+]]:sgpr_32 = IMPLICIT_DEF		; GCN-NEXT: [[DEF:%[0-9]+]]:sgpr_32 = IMPLICIT_DEF
; GCN-NEXT: [[S_BREV_B32_:%[0-9]+]]:sgpr_32 = S_BREV_B32 [[DEF]]		; GCN-NEXT: [[S_BREV_B32_:%[0-9]+]]:sgpr_32 = S_BREV_B32 [[DEF]]
; GCN-NEXT: KILL [[DEF]]		; GCN-NEXT: KILL [[DEF]]
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: successors: %bb.5(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.5:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.4		successors: %bb.1, %bb.4
liveins: $vgpr0, $sgpr0_sgpr1		liveins: $vgpr0, $sgpr0_sgpr1

%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

bb.1:		bb.1:
(28 lines not shown)	body: |
; GCN-NEXT: successors: %bb.1(0x40000000), %bb.4(0x40000000)		; GCN-NEXT: successors: %bb.1(0x40000000), %bb.4(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], undef %1:sreg_64, implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], undef %1:sreg_64, implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.4, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.4, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.1:		; GCN-NEXT: bb.1:
; GCN-NEXT: successors: %bb.2(0x40000000), %bb.3(0x40000000)		; GCN-NEXT: successors: %bb.2(0x40000000), %bb.5(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], undef %3:sreg_64, implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], undef %3:sreg_64, implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.3, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.5, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.2:		; GCN-NEXT: bb.2:
; GCN-NEXT: successors: %bb.3(0x80000000)		; GCN-NEXT: successors: %bb.5(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.3:		; GCN-NEXT: bb.5:
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[DEF:%[0-9]+]]:sgpr_32 = IMPLICIT_DEF		; GCN-NEXT: [[DEF:%[0-9]+]]:sgpr_32 = IMPLICIT_DEF
; GCN-NEXT: [[S_BREV_B32_:%[0-9]+]]:sgpr_32 = S_BREV_B32 [[DEF]]		; GCN-NEXT: [[S_BREV_B32_:%[0-9]+]]:sgpr_32 = S_BREV_B32 [[DEF]]
; GCN-NEXT: KILL [[DEF]]		; GCN-NEXT: KILL [[DEF]]
; GCN-NEXT: [[COPY2:%[0-9]+]]:sgpr_32 = COPY [[S_BREV_B32_]]		; GCN-NEXT: [[COPY2:%[0-9]+]]:sgpr_32 = COPY [[S_BREV_B32_]]
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: successors: %bb.6(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.6:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.4		successors: %bb.1, %bb.4

%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

bb.1:		bb.1:
successors: %bb.2, %bb.3		successors: %bb.2, %bb.3
(39 lines not shown)	body: |
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.3, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.3, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.2:		; GCN-NEXT: bb.2:
; GCN-NEXT: successors: %bb.3(0x80000000)		; GCN-NEXT: successors: %bb.3(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.3:		; GCN-NEXT: bb.3:
		; GCN-NEXT: successors: %bb.5(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY1]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.5:
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY1]], implicit-def $scc
; GCN-NEXT: [[S_BREV_B64_:%[0-9]+]]:sreg_64 = S_BREV_B64 $exec		; GCN-NEXT: [[S_BREV_B64_:%[0-9]+]]:sreg_64 = S_BREV_B64 $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: successors: %bb.6(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.6:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.4		successors: %bb.1, %bb.4

%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

bb.1:		bb.1:
successors: %bb.2, %bb.3		successors: %bb.2, %bb.3
(37 lines not shown)	body: |
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.3, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.3, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.2:		; GCN-NEXT: bb.2:
; GCN-NEXT: successors: %bb.3(0x80000000)		; GCN-NEXT: successors: %bb.3(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.3:		; GCN-NEXT: bb.3:
		; GCN-NEXT: successors: %bb.5(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY1]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.5:
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY1]], implicit-def $scc
; GCN-NEXT: [[COPY2:%[0-9]+]]:vgpr_32 = COPY [[DEF]].sub2		; GCN-NEXT: [[COPY2:%[0-9]+]]:vgpr_32 = COPY [[DEF]].sub2
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: successors: %bb.6(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.6:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.4		successors: %bb.1, %bb.4

%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

bb.1:		bb.1:
successors: %bb.2, %bb.3		successors: %bb.2, %bb.3
(24 lines not shown)	body: |
; GCN-NEXT: successors: %bb.1(0x40000000), %bb.4(0x40000000)		; GCN-NEXT: successors: %bb.1(0x40000000), %bb.4(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], undef %1:sreg_64, implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], undef %1:sreg_64, implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.4, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.4, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.1:		; GCN-NEXT: bb.1:
; GCN-NEXT: successors: %bb.2(0x40000000), %bb.5(0x40000000)		; GCN-NEXT: successors: %bb.2(0x40000000), %bb.6(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], undef %3:sreg_64, implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], undef %3:sreg_64, implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.5, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.6, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.2:		; GCN-NEXT: bb.2:
		; GCN-NEXT: successors: %bb.6(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.6:
; GCN-NEXT: successors: %bb.5(0x80000000)		; GCN-NEXT: successors: %bb.5(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
		; GCN-NEXT: S_BRANCH %bb.5
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.4:
		; GCN-NEXT: successors: %bb.7(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.7:
		; GCN-NEXT: S_ENDPGM 0
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.5:		; GCN-NEXT: bb.5:
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: S_BRANCH %bb.4		; GCN-NEXT: S_BRANCH %bb.4
; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc
; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.4		successors: %bb.1, %bb.4

%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

bb.1:		bb.1:
successors: %bb.2, %bb.3		successors: %bb.2, %bb.3

body: |
; GCN: bb.0:		; GCN: bb.0:
; GCN-NEXT: successors: %bb.1(0x80000000)		; GCN-NEXT: successors: %bb.1(0x80000000)
; GCN-NEXT: liveins: $vgpr0		; GCN-NEXT: liveins: $vgpr0
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[COPY:%[0-9]+]]:vgpr_32 = COPY $vgpr0		; GCN-NEXT: [[COPY:%[0-9]+]]:vgpr_32 = COPY $vgpr0
; GCN-NEXT: [[V_CMP_LT_U32_e64_:%[0-9]+]]:sreg_64 = V_CMP_LT_U32_e64 1, [[COPY]], implicit $exec		; GCN-NEXT: [[V_CMP_LT_U32_e64_:%[0-9]+]]:sreg_64 = V_CMP_LT_U32_e64 1, [[COPY]], implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.1:		; GCN-NEXT: bb.1:
		; GCN-NEXT: successors: %bb.2(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[V_CMP_LT_U32_e64_]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.2:
; GCN-NEXT: successors: %bb.1(0x80000000)		; GCN-NEXT: successors: %bb.1(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: $exec = S_OR_B64 $exec, [[V_CMP_LT_U32_e64_]], implicit-def $scc
; GCN-NEXT: S_BRANCH %bb.1		; GCN-NEXT: S_BRANCH %bb.1
bb.0:		bb.0:
successors: %bb.1		successors: %bb.1
liveins: $vgpr0		liveins: $vgpr0

%0:vgpr_32 = COPY $vgpr0		%0:vgpr_32 = COPY $vgpr0
%2:sreg_64 = V_CMP_LT_U32_e64 1, %0, implicit $exec		%2:sreg_64 = V_CMP_LT_U32_e64 1, %0, implicit $exec

body: |
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: successors: %bb.5(0x80000000)		; GCN-NEXT: successors: %bb.5(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.5:		; GCN-NEXT: bb.5:
; GCN-NEXT: successors: %bb.6(0x80000000)		; GCN-NEXT: successors: %bb.6(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY1]], implicit-def $scc		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY1]], implicit-def $scc
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.6:		; GCN-NEXT: bb.6:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[S_AND_B64_1]], implicit-def $scc		; GCN-NEXT: successors: %bb.7(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[S_AND_B64_1]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.7:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.2		successors: %bb.1, %bb.2

%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.2, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.2, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

bb.1:		bb.1:
successors: %bb.2		successors: %bb.2
body: |
; GCN-NEXT: bb.2:		; GCN-NEXT: bb.2:
; GCN-NEXT: successors: %bb.6(0x80000000)		; GCN-NEXT: successors: %bb.6(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: S_BRANCH %bb.6		; GCN-NEXT: S_BRANCH %bb.6
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.3:		; GCN-NEXT: bb.3:
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: successors: %bb.5(0x80000000)		; GCN-NEXT: successors: %bb.5(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: $exec = S_OR_B64 $exec, %2, implicit-def $scc		; GCN-NEXT: $exec = S_OR_B64_term $exec, %2, implicit-def $scc
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.5:		; GCN-NEXT: bb.5:
; GCN-NEXT: successors: %bb.6(0x80000000)		; GCN-NEXT: successors: %bb.6(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.6:		; GCN-NEXT: bb.6:
; GCN-NEXT: successors: %bb.4(0x40000000), %bb.0(0x40000000)		; GCN-NEXT: successors: %bb.4(0x40000000), %bb.0(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
body: |
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.2:		; GCN-NEXT: bb.2:
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: successors: %bb.5(0x80000000)		; GCN-NEXT: successors: %bb.5(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.5:		; GCN-NEXT: bb.5:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.4		successors: %bb.1, %bb.4

%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec		%0:sreg_64 = SI_IF undef %1:sreg_64, %bb.4, implicit-def dead $exec, implicit-def dead $scc, implicit $exec

body: |
; GCN-NEXT: successors: %bb.1(0x40000000), %bb.4(0x40000000)		; GCN-NEXT: successors: %bb.1(0x40000000), %bb.4(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], undef %1:sreg_64, implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], undef %1:sreg_64, implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.4, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.4, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.1:		; GCN-NEXT: bb.1:
; GCN-NEXT: successors: %bb.2(0x40000000), %bb.5(0x40000000)		; GCN-NEXT: successors: %bb.2(0x40000000), %bb.7(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], undef %3:sreg_64, implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], undef %3:sreg_64, implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.5, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.7, implicit $exec
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.2:		; GCN-NEXT: bb.2:
		; GCN-NEXT: successors: %bb.7(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.7:
; GCN-NEXT: successors: %bb.5(0x80000000)		; GCN-NEXT: successors: %bb.5(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: S_BRANCH %bb.5		; GCN-NEXT: S_BRANCH %bb.5
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.5:		; GCN-NEXT: bb.5:
; GCN-NEXT: successors: %bb.6(0x80000000)		; GCN-NEXT: successors: %bb.6(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.6:		; GCN-NEXT: bb.6:
; GCN-NEXT: successors: %bb.4(0x80000000)		; GCN-NEXT: successors: %bb.4(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: S_BRANCH %bb.4		; GCN-NEXT: S_BRANCH %bb.4
bb.0:		bb.0:
successors: %bb.1, %bb.4		successors: %bb.1, %bb.4

body: |
; GCN-NEXT: [[V_CMP_EQ_U32_e64_:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF]], implicit $exec		; GCN-NEXT: [[V_CMP_EQ_U32_e64_:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF]], implicit $exec
; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], [[V_CMP_EQ_U32_e64_]], implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY]], [[V_CMP_EQ_U32_e64_]], implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.14, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.14, implicit $exec
; GCN-NEXT: S_BRANCH %bb.1		; GCN-NEXT: S_BRANCH %bb.1
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.1:		; GCN-NEXT: bb.1:
; GCN-NEXT: successors: %bb.2(0x40000000), %bb.14(0x40000000)		; GCN-NEXT: successors: %bb.2(0x40000000), %bb.16(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[DEF1:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF		; GCN-NEXT: [[DEF1:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
; GCN-NEXT: [[V_CMP_EQ_U32_e64_1:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF1]], implicit $exec		; GCN-NEXT: [[V_CMP_EQ_U32_e64_1:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF1]], implicit $exec
; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY1:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], killed [[V_CMP_EQ_U32_e64_1]], implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_1:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY1]], killed [[V_CMP_EQ_U32_e64_1]], implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_1]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.14, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.16, implicit $exec
; GCN-NEXT: S_BRANCH %bb.2		; GCN-NEXT: S_BRANCH %bb.2
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.2:		; GCN-NEXT: bb.2:
; GCN-NEXT: successors: %bb.3(0x40000000), %bb.7(0x40000000)		; GCN-NEXT: successors: %bb.3(0x40000000), %bb.7(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[DEF2:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF		; GCN-NEXT: [[DEF2:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
; GCN-NEXT: [[V_CMP_EQ_U32_e64_2:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF2]], implicit $exec		; GCN-NEXT: [[V_CMP_EQ_U32_e64_2:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF2]], implicit $exec
; GCN-NEXT: [[COPY2:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY2:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_2:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY2]], killed [[V_CMP_EQ_U32_e64_2]], implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_2:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY2]], killed [[V_CMP_EQ_U32_e64_2]], implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_2]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_2]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.7, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.7, implicit $exec
; GCN-NEXT: S_BRANCH %bb.3		; GCN-NEXT: S_BRANCH %bb.3
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.3:		; GCN-NEXT: bb.3:
; GCN-NEXT: successors: %bb.4(0x40000000), %bb.7(0x40000000)		; GCN-NEXT: successors: %bb.4(0x40000000), %bb.15(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[DEF3:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF		; GCN-NEXT: [[DEF3:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
; GCN-NEXT: [[V_CMP_EQ_U32_e64_3:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF3]], implicit $exec		; GCN-NEXT: [[V_CMP_EQ_U32_e64_3:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF3]], implicit $exec
; GCN-NEXT: [[COPY3:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY3:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_3:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY3]], killed [[V_CMP_EQ_U32_e64_3]], implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_3:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY3]], killed [[V_CMP_EQ_U32_e64_3]], implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_3]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_3]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.7, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.15, implicit $exec
; GCN-NEXT: S_BRANCH %bb.4		; GCN-NEXT: S_BRANCH %bb.4
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.4:		; GCN-NEXT: bb.4:
		; GCN-NEXT: successors: %bb.15(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: S_BRANCH %bb.15
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.15:
; GCN-NEXT: successors: %bb.7(0x80000000)		; GCN-NEXT: successors: %bb.7(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: S_BRANCH %bb.7		; GCN-NEXT: S_BRANCH %bb.7
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.16:
		; GCN-NEXT: successors: %bb.14(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: S_BRANCH %bb.14
		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.7:		; GCN-NEXT: bb.7:
		; GCN-NEXT: successors: %bb.17(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY2]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.17:
; GCN-NEXT: successors: %bb.8(0x80000000)		; GCN-NEXT: successors: %bb.8(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY2]], implicit-def $scc
; GCN-NEXT: S_BRANCH %bb.8		; GCN-NEXT: S_BRANCH %bb.8
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.8:		; GCN-NEXT: bb.8:
; GCN-NEXT: successors: %bb.9(0x80000000)		; GCN-NEXT: successors: %bb.9(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: S_BRANCH %bb.9		; GCN-NEXT: S_BRANCH %bb.9
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.9:		; GCN-NEXT: bb.9:
; GCN-NEXT: successors: %bb.11(0x40000000), %bb.12(0x40000000)		; GCN-NEXT: successors: %bb.11(0x40000000), %bb.12(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[DEF4:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF		; GCN-NEXT: [[DEF4:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
; GCN-NEXT: [[V_CMP_EQ_U32_e64_4:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF4]], implicit $exec		; GCN-NEXT: [[V_CMP_EQ_U32_e64_4:%[0-9]+]]:sreg_64 = V_CMP_EQ_U32_e64 0, killed [[DEF4]], implicit $exec
; GCN-NEXT: [[COPY4:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec		; GCN-NEXT: [[COPY4:%[0-9]+]]:sreg_64 = COPY $exec, implicit-def $exec
; GCN-NEXT: [[S_AND_B64_4:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY4]], killed [[V_CMP_EQ_U32_e64_4]], implicit-def dead $scc		; GCN-NEXT: [[S_AND_B64_4:%[0-9]+]]:sreg_64 = S_AND_B64 [[COPY4]], killed [[V_CMP_EQ_U32_e64_4]], implicit-def dead $scc
; GCN-NEXT: [[S_XOR_B64_:%[0-9]+]]:sreg_64 = S_XOR_B64 [[S_AND_B64_4]], [[COPY4]], implicit-def dead $scc		; GCN-NEXT: [[S_XOR_B64_:%[0-9]+]]:sreg_64 = S_XOR_B64 [[S_AND_B64_4]], [[COPY4]], implicit-def dead $scc
; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_4]]		; GCN-NEXT: $exec = S_MOV_B64_term killed [[S_AND_B64_4]]
; GCN-NEXT: S_CBRANCH_EXECZ %bb.12, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.12, implicit $exec
; GCN-NEXT: S_BRANCH %bb.11		; GCN-NEXT: S_BRANCH %bb.11
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.10:		; GCN-NEXT: bb.10:
; GCN-NEXT: successors: %bb.14(0x80000000)		; GCN-NEXT: successors: %bb.18(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: S_BRANCH %bb.14		; GCN-NEXT: S_BRANCH %bb.18
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.11:		; GCN-NEXT: bb.11:
; GCN-NEXT: successors: %bb.12(0x80000000)		; GCN-NEXT: successors: %bb.12(0x80000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: S_BRANCH %bb.12		; GCN-NEXT: S_BRANCH %bb.12
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.12:		; GCN-NEXT: bb.12:
; GCN-NEXT: successors: %bb.10(0x40000000), %bb.14(0x40000000)		; GCN-NEXT: successors: %bb.10(0x40000000), %bb.18(0x40000000)
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
; GCN-NEXT: [[S_OR_SAVEEXEC_B64_:%[0-9]+]]:sreg_64 = S_OR_SAVEEXEC_B64 [[S_XOR_B64_]], implicit-def $exec, implicit-def $scc, implicit $exec		; GCN-NEXT: [[S_OR_SAVEEXEC_B64_:%[0-9]+]]:sreg_64 = S_OR_SAVEEXEC_B64 [[S_XOR_B64_]], implicit-def $exec, implicit-def $scc, implicit $exec
; GCN-NEXT: [[S_AND_B64_5:%[0-9]+]]:sreg_64 = S_AND_B64 $exec, [[S_OR_SAVEEXEC_B64_]], implicit-def $scc		; GCN-NEXT: [[S_AND_B64_5:%[0-9]+]]:sreg_64 = S_AND_B64 $exec, [[S_OR_SAVEEXEC_B64_]], implicit-def $scc
; GCN-NEXT: $exec = S_XOR_B64_term $exec, [[S_AND_B64_5]], implicit-def $scc		; GCN-NEXT: $exec = S_XOR_B64_term $exec, [[S_AND_B64_5]], implicit-def $scc
; GCN-NEXT: S_CBRANCH_EXECZ %bb.14, implicit $exec		; GCN-NEXT: S_CBRANCH_EXECZ %bb.18, implicit $exec
; GCN-NEXT: S_BRANCH %bb.10		; GCN-NEXT: S_BRANCH %bb.10
; GCN-NEXT: {{ $}}		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.18:
		; GCN-NEXT: successors: %bb.16(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: S_BRANCH %bb.16
		; GCN-NEXT: {{ $}}
; GCN-NEXT: bb.14:		; GCN-NEXT: bb.14:
; GCN-NEXT: $exec = S_OR_B64 $exec, [[COPY]], implicit-def $scc		; GCN-NEXT: successors: %bb.19(0x80000000)
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: $exec = S_OR_B64_term $exec, [[COPY]], implicit-def $scc
		; GCN-NEXT: {{ $}}
		; GCN-NEXT: bb.19:
; GCN-NEXT: S_ENDPGM 0		; GCN-NEXT: S_ENDPGM 0
bb.0:		bb.0:
successors: %bb.1, %bb.14		successors: %bb.1, %bb.14

%0:vgpr_32 = IMPLICIT_DEF		%0:vgpr_32 = IMPLICIT_DEF
%1:sreg_64 = V_CMP_EQ_U32_e64 0, killed %0:vgpr_32, implicit $exec		%1:sreg_64 = V_CMP_EQ_U32_e64 0, killed %0:vgpr_32, implicit $exec
%2:sreg_64 = SI_IF %1:sreg_64, %bb.14, implicit-def $exec, implicit-def dead $scc, implicit $exec		%2:sreg_64 = SI_IF %1:sreg_64, %bb.14, implicit-def $exec, implicit-def dead $scc, implicit $exec
S_BRANCH %bb.1		S_BRANCH %bb.1

llvm/test/CodeGen/AMDGPU/control-flow-fastregalloc.ll



	; Spill val register			; Spill val register
	; GCN: v_add_i32_e32 [[VAL:v[0-9]+]], vcc, [[RELOAD_LOAD0]], [[LOAD1]]			; GCN: v_add_i32_e32 [[VAL:v[0-9]+]], vcc, [[RELOAD_LOAD0]], [[LOAD1]]
	; GCN: buffer_store_dword [[VAL]], off, s[0:3], 0 offset:[[VAL_OFFSET:[0-9]+]] ; 4-byte Folded Spill			; GCN: buffer_store_dword [[VAL]], off, s[0:3], 0 offset:[[VAL_OFFSET:[0-9]+]] ; 4-byte Folded Spill

	; VMEM: [[ENDIF]]:			; VMEM: [[ENDIF]]:

	; Restore val
	; GCN: buffer_load_dword [[RELOAD_VAL:v[0-9]+]], off, s[0:3], 0 offset:[[VAL_OFFSET]] ; 4-byte Folded Reload

	; Reload and restore exec mask			; Reload and restore exec mask
	; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], [[SPILL_VGPR]], [[SAVEEXEC_LO_LANE]]			; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], [[SPILL_VGPR]], [[SAVEEXEC_LO_LANE]]
	; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], [[SPILL_VGPR]], [[SAVEEXEC_HI_LANE]]			; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], [[SPILL_VGPR]], [[SAVEEXEC_HI_LANE]]

	; VMEM: buffer_load_dword v[[V_RELOAD_SAVEEXEC:[0-9]+]], off, s[0:3], 0 offset:[[V_EXEC_SPILL_OFFSET]] ; 4-byte Folded Reload			; VMEM: buffer_load_dword v[[V_RELOAD_SAVEEXEC:[0-9]+]], off, s[0:3], 0 offset:[[V_EXEC_SPILL_OFFSET]] ; 4-byte Folded Reload
	; VMEM: s_waitcnt vmcnt(0)			; VMEM: s_waitcnt vmcnt(0)
	; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 0			; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 0
	; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 1			; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 1

	; GCN: s_or_b64 exec, exec, s[[[S_RELOAD_SAVEEXEC_LO]]:[[S_RELOAD_SAVEEXEC_HI]]]			; GCN: s_or_b64 exec, exec, s[[[S_RELOAD_SAVEEXEC_LO]]:[[S_RELOAD_SAVEEXEC_HI]]]
				; Restore val
				; GCN: buffer_load_dword [[RELOAD_VAL:v[0-9]+]], off, s[0:3], 0 offset:[[VAL_OFFSET]] ; 4-byte Folded Reload


	; GCN: flat_store_dword v{{\[[0-9]+:[0-9]+\]}}, [[RELOAD_VAL]]			; GCN: flat_store_dword v{{\[[0-9]+:[0-9]+\]}}, [[RELOAD_VAL]]
	define amdgpu_kernel void @divergent_if_endif(ptr addrspace(1) %out) #0 {			define amdgpu_kernel void @divergent_if_endif(ptr addrspace(1) %out) #0 {
	entry:			entry:
	%tid = call i32 @llvm.amdgcn.workitem.id.x()			%tid = call i32 @llvm.amdgcn.workitem.id.x()
	%load0 = load volatile i32, ptr addrspace(3) undef			%load0 = load volatile i32, ptr addrspace(3) undef
	%cmp0 = icmp eq i32 %tid, 0			%cmp0 = icmp eq i32 %tid, 0
	br i1 %cmp0, label %if, label %endif			br i1 %cmp0, label %if, label %endif
	; VMEM: buffer_store_dword			; VMEM: buffer_store_dword
	; VMEM: buffer_store_dword			; VMEM: buffer_store_dword
	; GCN: buffer_store_dword v[[VAL_LOOP_RELOAD]], off, s[0:3], 0 offset:{{[0-9]+}} ; 4-byte Folded Spill			; GCN: buffer_store_dword v[[VAL_LOOP_RELOAD]], off, s[0:3], 0 offset:{{[0-9]+}} ; 4-byte Folded Spill
	; GCN-NEXT: s_cbranch_scc1 [[LOOP]]			; GCN-NEXT: s_cbranch_scc1 [[LOOP]]

	; GCN: buffer_store_dword v[[VAL_LOOP_RELOAD]], off, s[0:3], 0 offset:[[VAL_SUB_OFFSET:[0-9]+]] ; 4-byte Folded Spill			; GCN: buffer_store_dword v[[VAL_LOOP_RELOAD]], off, s[0:3], 0 offset:[[VAL_SUB_OFFSET:[0-9]+]] ; 4-byte Folded Spill

	; GCN: [[END]]:			; GCN: [[END]]:
	; GCN: buffer_load_dword v[[VAL_END:[0-9]+]], off, s[0:3], 0 offset:[[VAL_SUB_OFFSET]] ; 4-byte Folded Reload
	; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], [[SPILL_VGPR]], [[SAVEEXEC_LO_LANE]]			; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], [[SPILL_VGPR]], [[SAVEEXEC_LO_LANE]]
	; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], [[SPILL_VGPR]], [[SAVEEXEC_HI_LANE]]			; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], [[SPILL_VGPR]], [[SAVEEXEC_HI_LANE]]

	; VMEM: buffer_load_dword v[[V_RELOAD_SAVEEXEC:[0-9]+]], off, s[0:3], 0 offset:[[V_EXEC_SPILL_OFFSET]] ; 4-byte Folded Reload			; VMEM: buffer_load_dword v[[V_RELOAD_SAVEEXEC:[0-9]+]], off, s[0:3], 0 offset:[[V_EXEC_SPILL_OFFSET]] ; 4-byte Folded Reload
	; VMEM: s_waitcnt vmcnt(0)			; VMEM: s_waitcnt vmcnt(0)
	; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 0			; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 0
	; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 1			; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 1

	; GCN: s_or_b64 exec, exec, s[[[S_RELOAD_SAVEEXEC_LO]]:[[S_RELOAD_SAVEEXEC_HI]]]			; GCN: s_or_b64 exec, exec, s[[[S_RELOAD_SAVEEXEC_LO]]:[[S_RELOAD_SAVEEXEC_HI]]]
				; GCN: buffer_load_dword v[[VAL_END:[0-9]+]], off, s[0:3], 0 offset:[[VAL_SUB_OFFSET]] ; 4-byte Folded Reload

	; GCN: flat_store_dword v{{\[[0-9]+:[0-9]+\]}}, v[[VAL_END]]			; GCN: flat_store_dword v{{\[[0-9]+:[0-9]+\]}}, v[[VAL_END]]
	define amdgpu_kernel void @divergent_loop(ptr addrspace(1) %out) #0 {			define amdgpu_kernel void @divergent_loop(ptr addrspace(1) %out) #0 {
	entry:			entry:
	%tid = call i32 @llvm.amdgcn.workitem.id.x()			%tid = call i32 @llvm.amdgcn.workitem.id.x()
	%load0 = load volatile i32, ptr addrspace(3) null			%load0 = load volatile i32, ptr addrspace(3) null
	%cmp0 = icmp eq i32 %tid, 0			%cmp0 = icmp eq i32 %tid, 0
	br i1 %cmp0, label %loop, label %end			br i1 %cmp0, label %loop, label %end

	; GCN: [[ELSE]]: ; %else			; GCN: [[ELSE]]: ; %else
	; GCN: buffer_load_dword v[[LOAD0_RELOAD:[0-9]+]], off, s[0:3], 0 offset:[[LOAD0_OFFSET]] ; 4-byte Folded Reload			; GCN: buffer_load_dword v[[LOAD0_RELOAD:[0-9]+]], off, s[0:3], 0 offset:[[LOAD0_OFFSET]] ; 4-byte Folded Reload
	; GCN: v_sub_i32_e32 v[[LOAD0_RELOAD]], vcc, v[[LOAD0_RELOAD]], v{{[0-9]+}}			; GCN: v_sub_i32_e32 v[[LOAD0_RELOAD]], vcc, v[[LOAD0_RELOAD]], v{{[0-9]+}}
	; GCN: buffer_store_dword v[[LOAD0_RELOAD]], off, s[0:3], 0 offset:[[FLOW_RESULT_OFFSET:[0-9]+]] ; 4-byte Folded Spill			; GCN: buffer_store_dword v[[LOAD0_RELOAD]], off, s[0:3], 0 offset:[[FLOW_RESULT_OFFSET:[0-9]+]] ; 4-byte Folded Spill
	; GCN-NEXT: s_branch [[FLOW]]			; GCN-NEXT: s_branch [[FLOW]]

	; GCN: [[ENDIF]]:			; GCN: [[ENDIF]]:
	; GCN: buffer_load_dword v[[RESULT:[0-9]+]], off, s[0:3], 0 offset:[[RESULT_OFFSET]] ; 4-byte Folded Reload
	; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], [[SPILL_VGPR]], [[FLOW_SAVEEXEC_LO_LANE]]			; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], [[SPILL_VGPR]], [[FLOW_SAVEEXEC_LO_LANE]]
	; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], [[SPILL_VGPR]], [[FLOW_SAVEEXEC_HI_LANE]]			; VGPR: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], [[SPILL_VGPR]], [[FLOW_SAVEEXEC_HI_LANE]]


	; VMEM: buffer_load_dword v[[V_RELOAD_SAVEEXEC:[0-9]+]], off, s[0:3], 0 offset:[[FLOW_SAVEEXEC_OFFSET]] ; 4-byte Folded Reload			; VMEM: buffer_load_dword v[[V_RELOAD_SAVEEXEC:[0-9]+]], off, s[0:3], 0 offset:[[FLOW_SAVEEXEC_OFFSET]] ; 4-byte Folded Reload
	; VMEM: s_waitcnt vmcnt(0)			; VMEM: s_waitcnt vmcnt(0)
	; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 0			; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_LO:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 0
	; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 1			; VMEM: v_readlane_b32 s[[S_RELOAD_SAVEEXEC_HI:[0-9]+]], v[[V_RELOAD_SAVEEXEC]], 1

	; GCN: s_or_b64 exec, exec, s[[[S_RELOAD_SAVEEXEC_LO]]:[[S_RELOAD_SAVEEXEC_HI]]]			; GCN: s_or_b64 exec, exec, s[[[S_RELOAD_SAVEEXEC_LO]]:[[S_RELOAD_SAVEEXEC_HI]]]
				; GCN: buffer_load_dword v[[RESULT:[0-9]+]], off, s[0:3], 0 offset:[[RESULT_OFFSET]] ; 4-byte Folded Reload

	; GCN: flat_store_dword v{{\[[0-9]+:[0-9]+\]}}, v[[RESULT]]			; GCN: flat_store_dword v{{\[[0-9]+:[0-9]+\]}}, v[[RESULT]]
	define amdgpu_kernel void @divergent_if_else_endif(ptr addrspace(1) %out) #0 {			define amdgpu_kernel void @divergent_if_else_endif(ptr addrspace(1) %out) #0 {
	entry:			entry:
	%tid = call i32 @llvm.amdgcn.workitem.id.x()			%tid = call i32 @llvm.amdgcn.workitem.id.x()
	%load0 = load volatile i32, ptr addrspace(3) null			%load0 = load volatile i32, ptr addrspace(3) null
	%cmp0 = icmp eq i32 %tid, 0			%cmp0 = icmp eq i32 %tid, 0
	br i1 %cmp0, label %if, label %else			br i1 %cmp0, label %if, label %else

llvm/test/CodeGen/AMDGPU/global-atomics-fp.ll

; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py		; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
; RUN: llc -march=amdgcn -mcpu=gfx900 -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GCN,GFX900 %s		; RUN: llc -march=amdgcn -mcpu=gfx900 -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GCN,GFX900 %s
; RUN: llc -march=amdgcn -mcpu=gfx908 -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GCN,GFX908 %s		; RUN: llc -march=amdgcn -mcpu=gfx908 -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GCN,GFX908 %s
; RUN: llc -march=amdgcn -mcpu=gfx90a -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GCN,GFX90A %s		; RUN: llc -march=amdgcn -mcpu=gfx90a -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GCN,GFX90A %s
; RUN: llc -march=amdgcn -mcpu=gfx1010 -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GCN,GFX10 %s		; RUN: llc -march=amdgcn -mcpu=gfx1010 -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GCN,GFX10 %s
; RUN: llc -march=amdgcn -mcpu=gfx1100 -amdgpu-enable-delay-alu=0 -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GFX11 %s		; RUN: llc -march=amdgcn -mcpu=gfx1100 -amdgpu-enable-delay-alu=0 -verify-machineinstrs < %s | FileCheck -enable-var-scope -check-prefixes=GFX11 %s

define amdgpu_kernel void @global_atomic_fadd_ret_f32(ptr addrspace(1) %ptr) #0 {		define amdgpu_kernel void @global_atomic_fadd_ret_f32(ptr addrspace(1) %ptr) #0 {
; GFX900-LABEL: global_atomic_fadd_ret_f32:		; GFX900-LABEL: global_atomic_fadd_ret_f32:
; GFX900: ; %bb.0:		; GFX900: ; %bb.0:
; GFX900-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX900-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX900-NEXT: s_mov_b64 s[2:3], 0		; GFX900-NEXT: s_mov_b64 s[2:3], 0
; GFX900-NEXT: v_mov_b32_e32 v0, 0		; GFX900-NEXT: v_mov_b32_e32 v1, 0
; GFX900-NEXT: s_waitcnt lgkmcnt(0)		; GFX900-NEXT: s_waitcnt lgkmcnt(0)
; GFX900-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX900-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX900-NEXT: s_waitcnt lgkmcnt(0)		; GFX900-NEXT: s_waitcnt lgkmcnt(0)
; GFX900-NEXT: v_mov_b32_e32 v1, s4		; GFX900-NEXT: v_mov_b32_e32 v0, s4
; GFX900-NEXT: .LBB0_1: ; %atomicrmw.start		; GFX900-NEXT: .LBB0_1: ; %atomicrmw.start
; GFX900-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX900-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX900-NEXT: v_mov_b32_e32 v2, v1		; GFX900-NEXT: v_mov_b32_e32 v3, v0
; GFX900-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX900-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX900-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX900-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX900-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX900-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX900-NEXT: s_waitcnt vmcnt(0)		; GFX900-NEXT: s_waitcnt vmcnt(0)
; GFX900-NEXT: buffer_wbinvl1_vol		; GFX900-NEXT: buffer_wbinvl1_vol
; GFX900-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GFX900-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX900-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX900-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX900-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX900-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX900-NEXT: s_cbranch_execnz .LBB0_1		; GFX900-NEXT: s_cbranch_execnz .LBB0_1
; GFX900-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX900-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX900-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX900-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX900-NEXT: global_store_dword v[0:1], v1, off		; GFX900-NEXT: global_store_dword v[0:1], v0, off
; GFX900-NEXT: s_endpgm		; GFX900-NEXT: s_endpgm
;		;
; GFX908-LABEL: global_atomic_fadd_ret_f32:		; GFX908-LABEL: global_atomic_fadd_ret_f32:
; GFX908: ; %bb.0:		; GFX908: ; %bb.0:
; GFX908-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX908-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX908-NEXT: s_mov_b64 s[2:3], 0		; GFX908-NEXT: s_mov_b64 s[2:3], 0
; GFX908-NEXT: v_mov_b32_e32 v0, 0		; GFX908-NEXT: v_mov_b32_e32 v1, 0
; GFX908-NEXT: s_waitcnt lgkmcnt(0)		; GFX908-NEXT: s_waitcnt lgkmcnt(0)
; GFX908-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX908-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX908-NEXT: s_waitcnt lgkmcnt(0)		; GFX908-NEXT: s_waitcnt lgkmcnt(0)
; GFX908-NEXT: v_mov_b32_e32 v1, s4		; GFX908-NEXT: v_mov_b32_e32 v0, s4
; GFX908-NEXT: .LBB0_1: ; %atomicrmw.start		; GFX908-NEXT: .LBB0_1: ; %atomicrmw.start
; GFX908-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX908-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX908-NEXT: v_mov_b32_e32 v2, v1		; GFX908-NEXT: v_mov_b32_e32 v3, v0
; GFX908-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX908-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX908-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX908-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX908-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX908-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX908-NEXT: s_waitcnt vmcnt(0)		; GFX908-NEXT: s_waitcnt vmcnt(0)
; GFX908-NEXT: buffer_wbinvl1_vol		; GFX908-NEXT: buffer_wbinvl1_vol
; GFX908-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GFX908-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX908-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX908-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX908-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX908-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX908-NEXT: s_cbranch_execnz .LBB0_1		; GFX908-NEXT: s_cbranch_execnz .LBB0_1
; GFX908-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX908-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX908-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX908-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX908-NEXT: global_store_dword v[0:1], v1, off		; GFX908-NEXT: global_store_dword v[0:1], v0, off
; GFX908-NEXT: s_endpgm		; GFX908-NEXT: s_endpgm
;		;
; GFX90A-LABEL: global_atomic_fadd_ret_f32:		; GFX90A-LABEL: global_atomic_fadd_ret_f32:
; GFX90A: ; %bb.0:		; GFX90A: ; %bb.0:
; GFX90A-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX90A-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX90A-NEXT: s_mov_b64 s[2:3], 0		; GFX90A-NEXT: s_mov_b64 s[2:3], 0
; GFX90A-NEXT: v_mov_b32_e32 v0, 0		; GFX90A-NEXT: v_mov_b32_e32 v1, 0
; GFX90A-NEXT: s_waitcnt lgkmcnt(0)		; GFX90A-NEXT: s_waitcnt lgkmcnt(0)
; GFX90A-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX90A-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX90A-NEXT: s_waitcnt lgkmcnt(0)		; GFX90A-NEXT: s_waitcnt lgkmcnt(0)
; GFX90A-NEXT: v_mov_b32_e32 v1, s4		; GFX90A-NEXT: v_mov_b32_e32 v0, s4
; GFX90A-NEXT: .LBB0_1: ; %atomicrmw.start		; GFX90A-NEXT: .LBB0_1: ; %atomicrmw.start
; GFX90A-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX90A-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX90A-NEXT: v_mov_b32_e32 v3, v1		; GFX90A-NEXT: v_mov_b32_e32 v3, v0
; GFX90A-NEXT: v_add_f32_e32 v2, 4.0, v3		; GFX90A-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX90A-NEXT: buffer_wbl2		; GFX90A-NEXT: buffer_wbl2
; GFX90A-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX90A-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX90A-NEXT: global_atomic_cmpswap v1, v0, v[2:3], s[0:1] glc		; GFX90A-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX90A-NEXT: s_waitcnt vmcnt(0)		; GFX90A-NEXT: s_waitcnt vmcnt(0)
; GFX90A-NEXT: buffer_invl2		; GFX90A-NEXT: buffer_invl2
; GFX90A-NEXT: buffer_wbinvl1_vol		; GFX90A-NEXT: buffer_wbinvl1_vol
; GFX90A-NEXT: v_cmp_eq_u32_e32 vcc, v1, v3		; GFX90A-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX90A-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX90A-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX90A-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX90A-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX90A-NEXT: s_cbranch_execnz .LBB0_1		; GFX90A-NEXT: s_cbranch_execnz .LBB0_1
; GFX90A-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX90A-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX90A-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX90A-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX90A-NEXT: global_store_dword v[0:1], v1, off		; GFX90A-NEXT: global_store_dword v[0:1], v0, off
; GFX90A-NEXT: s_endpgm		; GFX90A-NEXT: s_endpgm
;		;
; GFX10-LABEL: global_atomic_fadd_ret_f32:		; GFX10-LABEL: global_atomic_fadd_ret_f32:
; GFX10: ; %bb.0:		; GFX10: ; %bb.0:
; GFX10-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX10-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX10-NEXT: v_mov_b32_e32 v0, 0		; GFX10-NEXT: v_mov_b32_e32 v1, 0
; GFX10-NEXT: s_waitcnt lgkmcnt(0)		; GFX10-NEXT: s_waitcnt lgkmcnt(0)
; GFX10-NEXT: s_load_dword s2, s[0:1], 0x0		; GFX10-NEXT: s_load_dword s2, s[0:1], 0x0
; GFX10-NEXT: s_waitcnt lgkmcnt(0)		; GFX10-NEXT: s_waitcnt lgkmcnt(0)
; GFX10-NEXT: v_mov_b32_e32 v1, s2		; GFX10-NEXT: v_mov_b32_e32 v0, s2
; GFX10-NEXT: s_mov_b32 s2, 0		; GFX10-NEXT: s_mov_b32 s2, 0
; GFX10-NEXT: .LBB0_1: ; %atomicrmw.start		; GFX10-NEXT: .LBB0_1: ; %atomicrmw.start
; GFX10-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX10-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX10-NEXT: v_mov_b32_e32 v2, v1		; GFX10-NEXT: v_mov_b32_e32 v3, v0
; GFX10-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX10-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX10-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX10-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX10-NEXT: s_waitcnt_vscnt null, 0x0		; GFX10-NEXT: s_waitcnt_vscnt null, 0x0
; GFX10-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX10-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX10-NEXT: s_waitcnt vmcnt(0)		; GFX10-NEXT: s_waitcnt vmcnt(0)
; GFX10-NEXT: buffer_gl0_inv		; GFX10-NEXT: buffer_gl0_inv
; GFX10-NEXT: buffer_gl1_inv		; GFX10-NEXT: buffer_gl1_inv
; GFX10-NEXT: v_cmp_eq_u32_e32 vcc_lo, v1, v2		; GFX10-NEXT: v_cmp_eq_u32_e32 vcc_lo, v0, v3
; GFX10-NEXT: s_or_b32 s2, vcc_lo, s2		; GFX10-NEXT: s_or_b32 s2, vcc_lo, s2
; GFX10-NEXT: s_andn2_b32 exec_lo, exec_lo, s2		; GFX10-NEXT: s_andn2_b32 exec_lo, exec_lo, s2
; GFX10-NEXT: s_cbranch_execnz .LBB0_1		; GFX10-NEXT: s_cbranch_execnz .LBB0_1
; GFX10-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX10-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX10-NEXT: s_or_b32 exec_lo, exec_lo, s2		; GFX10-NEXT: s_or_b32 exec_lo, exec_lo, s2
; GFX10-NEXT: global_store_dword v[0:1], v1, off		; GFX10-NEXT: global_store_dword v[0:1], v0, off
; GFX10-NEXT: s_endpgm		; GFX10-NEXT: s_endpgm
;		;
; GFX11-LABEL: global_atomic_fadd_ret_f32:		; GFX11-LABEL: global_atomic_fadd_ret_f32:
; GFX11: ; %bb.0:		; GFX11: ; %bb.0:
; GFX11-NEXT: s_load_b64 s[0:1], s[0:1], 0x24		; GFX11-NEXT: s_load_b64 s[0:1], s[0:1], 0x24
; GFX11-NEXT: v_mov_b32_e32 v0, 0		; GFX11-NEXT: v_mov_b32_e32 v1, 0
; GFX11-NEXT: s_waitcnt lgkmcnt(0)		; GFX11-NEXT: s_waitcnt lgkmcnt(0)
; GFX11-NEXT: s_load_b32 s2, s[0:1], 0x0		; GFX11-NEXT: s_load_b32 s2, s[0:1], 0x0
; GFX11-NEXT: s_waitcnt lgkmcnt(0)		; GFX11-NEXT: s_waitcnt lgkmcnt(0)
; GFX11-NEXT: v_mov_b32_e32 v1, s2		; GFX11-NEXT: v_mov_b32_e32 v0, s2
; GFX11-NEXT: s_mov_b32 s2, 0		; GFX11-NEXT: s_mov_b32 s2, 0
; GFX11-NEXT: .LBB0_1: ; %atomicrmw.start		; GFX11-NEXT: .LBB0_1: ; %atomicrmw.start
; GFX11-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX11-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX11-NEXT: v_mov_b32_e32 v2, v1		; GFX11-NEXT: v_mov_b32_e32 v3, v0
; GFX11-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX11-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX11-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX11-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX11-NEXT: s_waitcnt_vscnt null, 0x0		; GFX11-NEXT: s_waitcnt_vscnt null, 0x0
; GFX11-NEXT: global_atomic_cmpswap_b32 v1, v0, v[1:2], s[0:1] glc		; GFX11-NEXT: global_atomic_cmpswap_b32 v0, v1, v[2:3], s[0:1] glc
; GFX11-NEXT: s_waitcnt vmcnt(0)		; GFX11-NEXT: s_waitcnt vmcnt(0)
; GFX11-NEXT: buffer_gl0_inv		; GFX11-NEXT: buffer_gl0_inv
; GFX11-NEXT: buffer_gl1_inv		; GFX11-NEXT: buffer_gl1_inv
; GFX11-NEXT: v_cmp_eq_u32_e32 vcc_lo, v1, v2		; GFX11-NEXT: v_cmp_eq_u32_e32 vcc_lo, v0, v3
; GFX11-NEXT: s_or_b32 s2, vcc_lo, s2		; GFX11-NEXT: s_or_b32 s2, vcc_lo, s2
; GFX11-NEXT: s_and_not1_b32 exec_lo, exec_lo, s2		; GFX11-NEXT: s_and_not1_b32 exec_lo, exec_lo, s2
; GFX11-NEXT: s_cbranch_execnz .LBB0_1		; GFX11-NEXT: s_cbranch_execnz .LBB0_1
; GFX11-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX11-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX11-NEXT: s_or_b32 exec_lo, exec_lo, s2		; GFX11-NEXT: s_or_b32 exec_lo, exec_lo, s2
; GFX11-NEXT: global_store_b32 v[0:1], v1, off		; GFX11-NEXT: global_store_b32 v[0:1], v0, off
; GFX11-NEXT: s_sendmsg sendmsg(MSG_DEALLOC_VGPRS)		; GFX11-NEXT: s_sendmsg sendmsg(MSG_DEALLOC_VGPRS)
; GFX11-NEXT: s_endpgm		; GFX11-NEXT: s_endpgm
%result = atomicrmw fadd ptr addrspace(1) %ptr, float 4.0 seq_cst		%result = atomicrmw fadd ptr addrspace(1) %ptr, float 4.0 seq_cst
store float %result, ptr addrspace(1) undef		store float %result, ptr addrspace(1) undef
ret void		ret void
}		}

define amdgpu_kernel void @global_atomic_fadd_ret_f32_ieee(ptr addrspace(1) %ptr) #2 {		define amdgpu_kernel void @global_atomic_fadd_ret_f32_ieee(ptr addrspace(1) %ptr) #2 {
; GFX900-LABEL: global_atomic_fadd_ret_f32_ieee:		; GFX900-LABEL: global_atomic_fadd_ret_f32_ieee:
; GFX900: ; %bb.0:		; GFX900: ; %bb.0:
; GFX900-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX900-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX900-NEXT: s_mov_b64 s[2:3], 0		; GFX900-NEXT: s_mov_b64 s[2:3], 0
; GFX900-NEXT: v_mov_b32_e32 v0, 0		; GFX900-NEXT: v_mov_b32_e32 v1, 0
; GFX900-NEXT: s_waitcnt lgkmcnt(0)		; GFX900-NEXT: s_waitcnt lgkmcnt(0)
; GFX900-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX900-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX900-NEXT: s_waitcnt lgkmcnt(0)		; GFX900-NEXT: s_waitcnt lgkmcnt(0)
; GFX900-NEXT: v_mov_b32_e32 v1, s4		; GFX900-NEXT: v_mov_b32_e32 v0, s4
; GFX900-NEXT: .LBB1_1: ; %atomicrmw.start		; GFX900-NEXT: .LBB1_1: ; %atomicrmw.start
; GFX900-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX900-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX900-NEXT: v_mov_b32_e32 v2, v1		; GFX900-NEXT: v_mov_b32_e32 v3, v0
; GFX900-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX900-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX900-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX900-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX900-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX900-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX900-NEXT: s_waitcnt vmcnt(0)		; GFX900-NEXT: s_waitcnt vmcnt(0)
; GFX900-NEXT: buffer_wbinvl1_vol		; GFX900-NEXT: buffer_wbinvl1_vol
; GFX900-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GFX900-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX900-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX900-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX900-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX900-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX900-NEXT: s_cbranch_execnz .LBB1_1		; GFX900-NEXT: s_cbranch_execnz .LBB1_1
; GFX900-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX900-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX900-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX900-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX900-NEXT: global_store_dword v[0:1], v1, off		; GFX900-NEXT: global_store_dword v[0:1], v0, off
; GFX900-NEXT: s_endpgm		; GFX900-NEXT: s_endpgm
;		;
; GFX908-LABEL: global_atomic_fadd_ret_f32_ieee:		; GFX908-LABEL: global_atomic_fadd_ret_f32_ieee:
; GFX908: ; %bb.0:		; GFX908: ; %bb.0:
; GFX908-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX908-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX908-NEXT: s_mov_b64 s[2:3], 0		; GFX908-NEXT: s_mov_b64 s[2:3], 0
; GFX908-NEXT: v_mov_b32_e32 v0, 0		; GFX908-NEXT: v_mov_b32_e32 v1, 0
; GFX908-NEXT: s_waitcnt lgkmcnt(0)		; GFX908-NEXT: s_waitcnt lgkmcnt(0)
; GFX908-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX908-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX908-NEXT: s_waitcnt lgkmcnt(0)		; GFX908-NEXT: s_waitcnt lgkmcnt(0)
; GFX908-NEXT: v_mov_b32_e32 v1, s4		; GFX908-NEXT: v_mov_b32_e32 v0, s4
; GFX908-NEXT: .LBB1_1: ; %atomicrmw.start		; GFX908-NEXT: .LBB1_1: ; %atomicrmw.start
; GFX908-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX908-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX908-NEXT: v_mov_b32_e32 v2, v1		; GFX908-NEXT: v_mov_b32_e32 v3, v0
; GFX908-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX908-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX908-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX908-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX908-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX908-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX908-NEXT: s_waitcnt vmcnt(0)		; GFX908-NEXT: s_waitcnt vmcnt(0)
; GFX908-NEXT: buffer_wbinvl1_vol		; GFX908-NEXT: buffer_wbinvl1_vol
; GFX908-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GFX908-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX908-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX908-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX908-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX908-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX908-NEXT: s_cbranch_execnz .LBB1_1		; GFX908-NEXT: s_cbranch_execnz .LBB1_1
; GFX908-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX908-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX908-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX908-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX908-NEXT: global_store_dword v[0:1], v1, off		; GFX908-NEXT: global_store_dword v[0:1], v0, off
; GFX908-NEXT: s_endpgm		; GFX908-NEXT: s_endpgm
;		;
; GFX90A-LABEL: global_atomic_fadd_ret_f32_ieee:		; GFX90A-LABEL: global_atomic_fadd_ret_f32_ieee:
; GFX90A: ; %bb.0:		; GFX90A: ; %bb.0:
; GFX90A-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX90A-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX90A-NEXT: v_mov_b32_e32 v0, 0		; GFX90A-NEXT: v_mov_b32_e32 v0, 0
; GFX90A-NEXT: v_mov_b32_e32 v1, 4.0		; GFX90A-NEXT: v_mov_b32_e32 v1, 4.0
; GFX90A-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX90A-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX90A-NEXT: global_atomic_add_f32 v0, v0, v1, s[0:1] glc		; GFX90A-NEXT: global_atomic_add_f32 v0, v0, v1, s[0:1] glc
; GFX90A-NEXT: s_waitcnt vmcnt(0)		; GFX90A-NEXT: s_waitcnt vmcnt(0)
; GFX90A-NEXT: buffer_wbinvl1_vol		; GFX90A-NEXT: buffer_wbinvl1_vol
; GFX90A-NEXT: global_store_dword v[0:1], v0, off		; GFX90A-NEXT: global_store_dword v[0:1], v0, off
; GFX90A-NEXT: s_endpgm		; GFX90A-NEXT: s_endpgm
;		;
; GFX10-LABEL: global_atomic_fadd_ret_f32_ieee:		; GFX10-LABEL: global_atomic_fadd_ret_f32_ieee:
; GFX10: ; %bb.0:		; GFX10: ; %bb.0:
; GFX10-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX10-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX10-NEXT: v_mov_b32_e32 v0, 0		; GFX10-NEXT: v_mov_b32_e32 v1, 0
; GFX10-NEXT: s_waitcnt lgkmcnt(0)		; GFX10-NEXT: s_waitcnt lgkmcnt(0)
; GFX10-NEXT: s_load_dword s2, s[0:1], 0x0		; GFX10-NEXT: s_load_dword s2, s[0:1], 0x0
; GFX10-NEXT: s_waitcnt lgkmcnt(0)		; GFX10-NEXT: s_waitcnt lgkmcnt(0)
; GFX10-NEXT: v_mov_b32_e32 v1, s2		; GFX10-NEXT: v_mov_b32_e32 v0, s2
; GFX10-NEXT: s_mov_b32 s2, 0		; GFX10-NEXT: s_mov_b32 s2, 0
; GFX10-NEXT: .LBB1_1: ; %atomicrmw.start		; GFX10-NEXT: .LBB1_1: ; %atomicrmw.start
; GFX10-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX10-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX10-NEXT: v_mov_b32_e32 v2, v1		; GFX10-NEXT: v_mov_b32_e32 v3, v0
; GFX10-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX10-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX10-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX10-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX10-NEXT: s_waitcnt_vscnt null, 0x0		; GFX10-NEXT: s_waitcnt_vscnt null, 0x0
; GFX10-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX10-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX10-NEXT: s_waitcnt vmcnt(0)		; GFX10-NEXT: s_waitcnt vmcnt(0)
; GFX10-NEXT: buffer_gl0_inv		; GFX10-NEXT: buffer_gl0_inv
; GFX10-NEXT: buffer_gl1_inv		; GFX10-NEXT: buffer_gl1_inv
; GFX10-NEXT: v_cmp_eq_u32_e32 vcc_lo, v1, v2		; GFX10-NEXT: v_cmp_eq_u32_e32 vcc_lo, v0, v3
; GFX10-NEXT: s_or_b32 s2, vcc_lo, s2		; GFX10-NEXT: s_or_b32 s2, vcc_lo, s2
; GFX10-NEXT: s_andn2_b32 exec_lo, exec_lo, s2		; GFX10-NEXT: s_andn2_b32 exec_lo, exec_lo, s2
; GFX10-NEXT: s_cbranch_execnz .LBB1_1		; GFX10-NEXT: s_cbranch_execnz .LBB1_1
; GFX10-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX10-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX10-NEXT: s_or_b32 exec_lo, exec_lo, s2		; GFX10-NEXT: s_or_b32 exec_lo, exec_lo, s2
; GFX10-NEXT: global_store_dword v[0:1], v1, off		; GFX10-NEXT: global_store_dword v[0:1], v0, off
; GFX10-NEXT: s_endpgm		; GFX10-NEXT: s_endpgm
;		;
; GFX11-LABEL: global_atomic_fadd_ret_f32_ieee:		; GFX11-LABEL: global_atomic_fadd_ret_f32_ieee:
; GFX11: ; %bb.0:		; GFX11: ; %bb.0:
; GFX11-NEXT: s_load_b64 s[0:1], s[0:1], 0x24		; GFX11-NEXT: s_load_b64 s[0:1], s[0:1], 0x24
; GFX11-NEXT: v_dual_mov_b32 v0, 0 :: v_dual_mov_b32 v1, 4.0		; GFX11-NEXT: v_dual_mov_b32 v0, 0 :: v_dual_mov_b32 v1, 4.0
; GFX11-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX11-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX11-NEXT: s_waitcnt_vscnt null, 0x0		; GFX11-NEXT: s_waitcnt_vscnt null, 0x0
[185 lines collapsed in the diff view]
; GFX11-NEXT: s_endpgm
ret void		ret void
}		}

define amdgpu_kernel void @global_atomic_fadd_ret_f32_agent(ptr addrspace(1) %ptr) #0 {		define amdgpu_kernel void @global_atomic_fadd_ret_f32_agent(ptr addrspace(1) %ptr) #0 {
; GFX900-LABEL: global_atomic_fadd_ret_f32_agent:		; GFX900-LABEL: global_atomic_fadd_ret_f32_agent:
; GFX900: ; %bb.0:		; GFX900: ; %bb.0:
; GFX900-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX900-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX900-NEXT: s_mov_b64 s[2:3], 0		; GFX900-NEXT: s_mov_b64 s[2:3], 0
; GFX900-NEXT: v_mov_b32_e32 v0, 0		; GFX900-NEXT: v_mov_b32_e32 v1, 0
; GFX900-NEXT: s_waitcnt lgkmcnt(0)		; GFX900-NEXT: s_waitcnt lgkmcnt(0)
; GFX900-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX900-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX900-NEXT: s_waitcnt lgkmcnt(0)		; GFX900-NEXT: s_waitcnt lgkmcnt(0)
; GFX900-NEXT: v_mov_b32_e32 v1, s4		; GFX900-NEXT: v_mov_b32_e32 v0, s4
; GFX900-NEXT: .LBB4_1: ; %atomicrmw.start		; GFX900-NEXT: .LBB4_1: ; %atomicrmw.start
; GFX900-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX900-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX900-NEXT: v_mov_b32_e32 v2, v1		; GFX900-NEXT: v_mov_b32_e32 v3, v0
; GFX900-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX900-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX900-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX900-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX900-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX900-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX900-NEXT: s_waitcnt vmcnt(0)		; GFX900-NEXT: s_waitcnt vmcnt(0)
; GFX900-NEXT: buffer_wbinvl1_vol		; GFX900-NEXT: buffer_wbinvl1_vol
; GFX900-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GFX900-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX900-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX900-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX900-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX900-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX900-NEXT: s_cbranch_execnz .LBB4_1		; GFX900-NEXT: s_cbranch_execnz .LBB4_1
; GFX900-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX900-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX900-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX900-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX900-NEXT: global_store_dword v[0:1], v1, off		; GFX900-NEXT: global_store_dword v[0:1], v0, off
; GFX900-NEXT: s_endpgm		; GFX900-NEXT: s_endpgm
;		;
; GFX908-LABEL: global_atomic_fadd_ret_f32_agent:		; GFX908-LABEL: global_atomic_fadd_ret_f32_agent:
; GFX908: ; %bb.0:		; GFX908: ; %bb.0:
; GFX908-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX908-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX908-NEXT: s_mov_b64 s[2:3], 0		; GFX908-NEXT: s_mov_b64 s[2:3], 0
; GFX908-NEXT: v_mov_b32_e32 v0, 0		; GFX908-NEXT: v_mov_b32_e32 v1, 0
; GFX908-NEXT: s_waitcnt lgkmcnt(0)		; GFX908-NEXT: s_waitcnt lgkmcnt(0)
; GFX908-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX908-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX908-NEXT: s_waitcnt lgkmcnt(0)		; GFX908-NEXT: s_waitcnt lgkmcnt(0)
; GFX908-NEXT: v_mov_b32_e32 v1, s4		; GFX908-NEXT: v_mov_b32_e32 v0, s4
; GFX908-NEXT: .LBB4_1: ; %atomicrmw.start		; GFX908-NEXT: .LBB4_1: ; %atomicrmw.start
; GFX908-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX908-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX908-NEXT: v_mov_b32_e32 v2, v1		; GFX908-NEXT: v_mov_b32_e32 v3, v0
; GFX908-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX908-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX908-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX908-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX908-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX908-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX908-NEXT: s_waitcnt vmcnt(0)		; GFX908-NEXT: s_waitcnt vmcnt(0)
; GFX908-NEXT: buffer_wbinvl1_vol		; GFX908-NEXT: buffer_wbinvl1_vol
; GFX908-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GFX908-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX908-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX908-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX908-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX908-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX908-NEXT: s_cbranch_execnz .LBB4_1		; GFX908-NEXT: s_cbranch_execnz .LBB4_1
; GFX908-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX908-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX908-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX908-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX908-NEXT: global_store_dword v[0:1], v1, off		; GFX908-NEXT: global_store_dword v[0:1], v0, off
; GFX908-NEXT: s_endpgm		; GFX908-NEXT: s_endpgm
;		;
; GFX90A-LABEL: global_atomic_fadd_ret_f32_agent:		; GFX90A-LABEL: global_atomic_fadd_ret_f32_agent:
; GFX90A: ; %bb.0:		; GFX90A: ; %bb.0:
; GFX90A-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX90A-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX90A-NEXT: v_mov_b32_e32 v0, 0		; GFX90A-NEXT: v_mov_b32_e32 v0, 0
; GFX90A-NEXT: v_mov_b32_e32 v1, 4.0		; GFX90A-NEXT: v_mov_b32_e32 v1, 4.0
; GFX90A-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX90A-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX90A-NEXT: global_atomic_add_f32 v0, v0, v1, s[0:1] glc		; GFX90A-NEXT: global_atomic_add_f32 v0, v0, v1, s[0:1] glc
; GFX90A-NEXT: s_waitcnt vmcnt(0)		; GFX90A-NEXT: s_waitcnt vmcnt(0)
; GFX90A-NEXT: buffer_wbinvl1_vol		; GFX90A-NEXT: buffer_wbinvl1_vol
; GFX90A-NEXT: global_store_dword v[0:1], v0, off		; GFX90A-NEXT: global_store_dword v[0:1], v0, off
; GFX90A-NEXT: s_endpgm		; GFX90A-NEXT: s_endpgm
;		;
; GFX10-LABEL: global_atomic_fadd_ret_f32_agent:		; GFX10-LABEL: global_atomic_fadd_ret_f32_agent:
; GFX10: ; %bb.0:		; GFX10: ; %bb.0:
; GFX10-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX10-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX10-NEXT: v_mov_b32_e32 v0, 0		; GFX10-NEXT: v_mov_b32_e32 v1, 0
; GFX10-NEXT: s_waitcnt lgkmcnt(0)		; GFX10-NEXT: s_waitcnt lgkmcnt(0)
; GFX10-NEXT: s_load_dword s2, s[0:1], 0x0		; GFX10-NEXT: s_load_dword s2, s[0:1], 0x0
; GFX10-NEXT: s_waitcnt lgkmcnt(0)		; GFX10-NEXT: s_waitcnt lgkmcnt(0)
; GFX10-NEXT: v_mov_b32_e32 v1, s2		; GFX10-NEXT: v_mov_b32_e32 v0, s2
; GFX10-NEXT: s_mov_b32 s2, 0		; GFX10-NEXT: s_mov_b32 s2, 0
; GFX10-NEXT: .LBB4_1: ; %atomicrmw.start		; GFX10-NEXT: .LBB4_1: ; %atomicrmw.start
; GFX10-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX10-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX10-NEXT: v_mov_b32_e32 v2, v1		; GFX10-NEXT: v_mov_b32_e32 v3, v0
; GFX10-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX10-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX10-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX10-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX10-NEXT: s_waitcnt_vscnt null, 0x0		; GFX10-NEXT: s_waitcnt_vscnt null, 0x0
; GFX10-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX10-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX10-NEXT: s_waitcnt vmcnt(0)		; GFX10-NEXT: s_waitcnt vmcnt(0)
; GFX10-NEXT: buffer_gl0_inv		; GFX10-NEXT: buffer_gl0_inv
; GFX10-NEXT: buffer_gl1_inv		; GFX10-NEXT: buffer_gl1_inv
; GFX10-NEXT: v_cmp_eq_u32_e32 vcc_lo, v1, v2		; GFX10-NEXT: v_cmp_eq_u32_e32 vcc_lo, v0, v3
; GFX10-NEXT: s_or_b32 s2, vcc_lo, s2		; GFX10-NEXT: s_or_b32 s2, vcc_lo, s2
; GFX10-NEXT: s_andn2_b32 exec_lo, exec_lo, s2		; GFX10-NEXT: s_andn2_b32 exec_lo, exec_lo, s2
; GFX10-NEXT: s_cbranch_execnz .LBB4_1		; GFX10-NEXT: s_cbranch_execnz .LBB4_1
; GFX10-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX10-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX10-NEXT: s_or_b32 exec_lo, exec_lo, s2		; GFX10-NEXT: s_or_b32 exec_lo, exec_lo, s2
; GFX10-NEXT: global_store_dword v[0:1], v1, off		; GFX10-NEXT: global_store_dword v[0:1], v0, off
; GFX10-NEXT: s_endpgm		; GFX10-NEXT: s_endpgm
;		;
; GFX11-LABEL: global_atomic_fadd_ret_f32_agent:		; GFX11-LABEL: global_atomic_fadd_ret_f32_agent:
; GFX11: ; %bb.0:		; GFX11: ; %bb.0:
; GFX11-NEXT: s_load_b64 s[0:1], s[0:1], 0x24		; GFX11-NEXT: s_load_b64 s[0:1], s[0:1], 0x24
; GFX11-NEXT: v_dual_mov_b32 v0, 0 :: v_dual_mov_b32 v1, 4.0		; GFX11-NEXT: v_dual_mov_b32 v0, 0 :: v_dual_mov_b32 v1, 4.0
; GFX11-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX11-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX11-NEXT: s_waitcnt_vscnt null, 0x0		; GFX11-NEXT: s_waitcnt_vscnt null, 0x0
[9 lines collapsed in the diff view]
; GFX11-NEXT: s_endpgm
ret void		ret void
}		}

define amdgpu_kernel void @global_atomic_fadd_ret_f32_system(ptr addrspace(1) %ptr) #0 {		define amdgpu_kernel void @global_atomic_fadd_ret_f32_system(ptr addrspace(1) %ptr) #0 {
; GFX900-LABEL: global_atomic_fadd_ret_f32_system:		; GFX900-LABEL: global_atomic_fadd_ret_f32_system:
; GFX900: ; %bb.0:		; GFX900: ; %bb.0:
; GFX900-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX900-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX900-NEXT: s_mov_b64 s[2:3], 0		; GFX900-NEXT: s_mov_b64 s[2:3], 0
; GFX900-NEXT: v_mov_b32_e32 v0, 0		; GFX900-NEXT: v_mov_b32_e32 v1, 0
; GFX900-NEXT: s_waitcnt lgkmcnt(0)		; GFX900-NEXT: s_waitcnt lgkmcnt(0)
; GFX900-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX900-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX900-NEXT: s_waitcnt lgkmcnt(0)		; GFX900-NEXT: s_waitcnt lgkmcnt(0)
; GFX900-NEXT: v_mov_b32_e32 v1, s4		; GFX900-NEXT: v_mov_b32_e32 v0, s4
; GFX900-NEXT: .LBB5_1: ; %atomicrmw.start		; GFX900-NEXT: .LBB5_1: ; %atomicrmw.start
; GFX900-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX900-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX900-NEXT: v_mov_b32_e32 v2, v1		; GFX900-NEXT: v_mov_b32_e32 v3, v0
; GFX900-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX900-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX900-NEXT: s_waitcnt vmcnt(0)		; GFX900-NEXT: s_waitcnt vmcnt(0)
; GFX900-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX900-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX900-NEXT: s_waitcnt vmcnt(0)		; GFX900-NEXT: s_waitcnt vmcnt(0)
; GFX900-NEXT: buffer_wbinvl1_vol		; GFX900-NEXT: buffer_wbinvl1_vol
; GFX900-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GFX900-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX900-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX900-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX900-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX900-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX900-NEXT: s_cbranch_execnz .LBB5_1		; GFX900-NEXT: s_cbranch_execnz .LBB5_1
; GFX900-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX900-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX900-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX900-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX900-NEXT: global_store_dword v[0:1], v1, off		; GFX900-NEXT: global_store_dword v[0:1], v0, off
; GFX900-NEXT: s_endpgm		; GFX900-NEXT: s_endpgm
;		;
; GFX908-LABEL: global_atomic_fadd_ret_f32_system:		; GFX908-LABEL: global_atomic_fadd_ret_f32_system:
; GFX908: ; %bb.0:		; GFX908: ; %bb.0:
; GFX908-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX908-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX908-NEXT: s_mov_b64 s[2:3], 0		; GFX908-NEXT: s_mov_b64 s[2:3], 0
; GFX908-NEXT: v_mov_b32_e32 v0, 0		; GFX908-NEXT: v_mov_b32_e32 v1, 0
; GFX908-NEXT: s_waitcnt lgkmcnt(0)		; GFX908-NEXT: s_waitcnt lgkmcnt(0)
; GFX908-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX908-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX908-NEXT: s_waitcnt lgkmcnt(0)		; GFX908-NEXT: s_waitcnt lgkmcnt(0)
; GFX908-NEXT: v_mov_b32_e32 v1, s4		; GFX908-NEXT: v_mov_b32_e32 v0, s4
; GFX908-NEXT: .LBB5_1: ; %atomicrmw.start		; GFX908-NEXT: .LBB5_1: ; %atomicrmw.start
; GFX908-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX908-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX908-NEXT: v_mov_b32_e32 v2, v1		; GFX908-NEXT: v_mov_b32_e32 v3, v0
; GFX908-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX908-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX908-NEXT: s_waitcnt vmcnt(0)		; GFX908-NEXT: s_waitcnt vmcnt(0)
; GFX908-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX908-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX908-NEXT: s_waitcnt vmcnt(0)		; GFX908-NEXT: s_waitcnt vmcnt(0)
; GFX908-NEXT: buffer_wbinvl1_vol		; GFX908-NEXT: buffer_wbinvl1_vol
; GFX908-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GFX908-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX908-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX908-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX908-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX908-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX908-NEXT: s_cbranch_execnz .LBB5_1		; GFX908-NEXT: s_cbranch_execnz .LBB5_1
; GFX908-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX908-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX908-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX908-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX908-NEXT: global_store_dword v[0:1], v1, off		; GFX908-NEXT: global_store_dword v[0:1], v0, off
; GFX908-NEXT: s_endpgm		; GFX908-NEXT: s_endpgm
;		;
; GFX90A-LABEL: global_atomic_fadd_ret_f32_system:		; GFX90A-LABEL: global_atomic_fadd_ret_f32_system:
; GFX90A: ; %bb.0:		; GFX90A: ; %bb.0:
; GFX90A-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX90A-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX90A-NEXT: s_mov_b64 s[2:3], 0		; GFX90A-NEXT: s_mov_b64 s[2:3], 0
; GFX90A-NEXT: v_mov_b32_e32 v0, 0		; GFX90A-NEXT: v_mov_b32_e32 v1, 0
; GFX90A-NEXT: s_waitcnt lgkmcnt(0)		; GFX90A-NEXT: s_waitcnt lgkmcnt(0)
; GFX90A-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX90A-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX90A-NEXT: s_waitcnt lgkmcnt(0)		; GFX90A-NEXT: s_waitcnt lgkmcnt(0)
; GFX90A-NEXT: v_mov_b32_e32 v1, s4		; GFX90A-NEXT: v_mov_b32_e32 v0, s4
; GFX90A-NEXT: .LBB5_1: ; %atomicrmw.start		; GFX90A-NEXT: .LBB5_1: ; %atomicrmw.start
; GFX90A-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX90A-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX90A-NEXT: v_mov_b32_e32 v3, v1		; GFX90A-NEXT: v_mov_b32_e32 v3, v0
; GFX90A-NEXT: v_add_f32_e32 v2, 4.0, v3		; GFX90A-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX90A-NEXT: buffer_wbl2		; GFX90A-NEXT: buffer_wbl2
; GFX90A-NEXT: s_waitcnt vmcnt(0)		; GFX90A-NEXT: s_waitcnt vmcnt(0)
; GFX90A-NEXT: global_atomic_cmpswap v1, v0, v[2:3], s[0:1] glc		; GFX90A-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX90A-NEXT: s_waitcnt vmcnt(0)		; GFX90A-NEXT: s_waitcnt vmcnt(0)
; GFX90A-NEXT: buffer_invl2		; GFX90A-NEXT: buffer_invl2
; GFX90A-NEXT: buffer_wbinvl1_vol		; GFX90A-NEXT: buffer_wbinvl1_vol
; GFX90A-NEXT: v_cmp_eq_u32_e32 vcc, v1, v3		; GFX90A-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX90A-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX90A-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX90A-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX90A-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX90A-NEXT: s_cbranch_execnz .LBB5_1		; GFX90A-NEXT: s_cbranch_execnz .LBB5_1
; GFX90A-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX90A-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX90A-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX90A-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX90A-NEXT: global_store_dword v[0:1], v1, off		; GFX90A-NEXT: global_store_dword v[0:1], v0, off
; GFX90A-NEXT: s_endpgm		; GFX90A-NEXT: s_endpgm
;		;
; GFX10-LABEL: global_atomic_fadd_ret_f32_system:		; GFX10-LABEL: global_atomic_fadd_ret_f32_system:
; GFX10: ; %bb.0:		; GFX10: ; %bb.0:
; GFX10-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX10-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX10-NEXT: v_mov_b32_e32 v0, 0		; GFX10-NEXT: v_mov_b32_e32 v1, 0
; GFX10-NEXT: s_waitcnt lgkmcnt(0)		; GFX10-NEXT: s_waitcnt lgkmcnt(0)
; GFX10-NEXT: s_load_dword s2, s[0:1], 0x0		; GFX10-NEXT: s_load_dword s2, s[0:1], 0x0
; GFX10-NEXT: s_waitcnt lgkmcnt(0)		; GFX10-NEXT: s_waitcnt lgkmcnt(0)
; GFX10-NEXT: v_mov_b32_e32 v1, s2		; GFX10-NEXT: v_mov_b32_e32 v0, s2
; GFX10-NEXT: s_mov_b32 s2, 0		; GFX10-NEXT: s_mov_b32 s2, 0
; GFX10-NEXT: .LBB5_1: ; %atomicrmw.start		; GFX10-NEXT: .LBB5_1: ; %atomicrmw.start
; GFX10-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX10-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX10-NEXT: v_mov_b32_e32 v2, v1		; GFX10-NEXT: v_mov_b32_e32 v3, v0
; GFX10-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX10-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX10-NEXT: s_waitcnt vmcnt(0)		; GFX10-NEXT: s_waitcnt vmcnt(0)
; GFX10-NEXT: s_waitcnt_vscnt null, 0x0		; GFX10-NEXT: s_waitcnt_vscnt null, 0x0
; GFX10-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX10-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX10-NEXT: s_waitcnt vmcnt(0)		; GFX10-NEXT: s_waitcnt vmcnt(0)
; GFX10-NEXT: buffer_gl0_inv		; GFX10-NEXT: buffer_gl0_inv
; GFX10-NEXT: buffer_gl1_inv		; GFX10-NEXT: buffer_gl1_inv
; GFX10-NEXT: v_cmp_eq_u32_e32 vcc_lo, v1, v2		; GFX10-NEXT: v_cmp_eq_u32_e32 vcc_lo, v0, v3
; GFX10-NEXT: s_or_b32 s2, vcc_lo, s2		; GFX10-NEXT: s_or_b32 s2, vcc_lo, s2
; GFX10-NEXT: s_andn2_b32 exec_lo, exec_lo, s2		; GFX10-NEXT: s_andn2_b32 exec_lo, exec_lo, s2
; GFX10-NEXT: s_cbranch_execnz .LBB5_1		; GFX10-NEXT: s_cbranch_execnz .LBB5_1
; GFX10-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX10-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX10-NEXT: s_or_b32 exec_lo, exec_lo, s2		; GFX10-NEXT: s_or_b32 exec_lo, exec_lo, s2
; GFX10-NEXT: global_store_dword v[0:1], v1, off		; GFX10-NEXT: global_store_dword v[0:1], v0, off
; GFX10-NEXT: s_endpgm		; GFX10-NEXT: s_endpgm
;		;
; GFX11-LABEL: global_atomic_fadd_ret_f32_system:		; GFX11-LABEL: global_atomic_fadd_ret_f32_system:
; GFX11: ; %bb.0:		; GFX11: ; %bb.0:
; GFX11-NEXT: s_load_b64 s[0:1], s[0:1], 0x24		; GFX11-NEXT: s_load_b64 s[0:1], s[0:1], 0x24
; GFX11-NEXT: v_mov_b32_e32 v0, 0		; GFX11-NEXT: v_mov_b32_e32 v1, 0
; GFX11-NEXT: s_waitcnt lgkmcnt(0)		; GFX11-NEXT: s_waitcnt lgkmcnt(0)
; GFX11-NEXT: s_load_b32 s2, s[0:1], 0x0		; GFX11-NEXT: s_load_b32 s2, s[0:1], 0x0
; GFX11-NEXT: s_waitcnt lgkmcnt(0)		; GFX11-NEXT: s_waitcnt lgkmcnt(0)
; GFX11-NEXT: v_mov_b32_e32 v1, s2		; GFX11-NEXT: v_mov_b32_e32 v0, s2
; GFX11-NEXT: s_mov_b32 s2, 0		; GFX11-NEXT: s_mov_b32 s2, 0
; GFX11-NEXT: .LBB5_1: ; %atomicrmw.start		; GFX11-NEXT: .LBB5_1: ; %atomicrmw.start
; GFX11-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX11-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX11-NEXT: v_mov_b32_e32 v2, v1		; GFX11-NEXT: v_mov_b32_e32 v3, v0
; GFX11-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX11-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX11-NEXT: s_waitcnt vmcnt(0)		; GFX11-NEXT: s_waitcnt vmcnt(0)
; GFX11-NEXT: s_waitcnt_vscnt null, 0x0		; GFX11-NEXT: s_waitcnt_vscnt null, 0x0
; GFX11-NEXT: global_atomic_cmpswap_b32 v1, v0, v[1:2], s[0:1] glc		; GFX11-NEXT: global_atomic_cmpswap_b32 v0, v1, v[2:3], s[0:1] glc
; GFX11-NEXT: s_waitcnt vmcnt(0)		; GFX11-NEXT: s_waitcnt vmcnt(0)
; GFX11-NEXT: buffer_gl0_inv		; GFX11-NEXT: buffer_gl0_inv
; GFX11-NEXT: buffer_gl1_inv		; GFX11-NEXT: buffer_gl1_inv
; GFX11-NEXT: v_cmp_eq_u32_e32 vcc_lo, v1, v2		; GFX11-NEXT: v_cmp_eq_u32_e32 vcc_lo, v0, v3
; GFX11-NEXT: s_or_b32 s2, vcc_lo, s2		; GFX11-NEXT: s_or_b32 s2, vcc_lo, s2
; GFX11-NEXT: s_and_not1_b32 exec_lo, exec_lo, s2		; GFX11-NEXT: s_and_not1_b32 exec_lo, exec_lo, s2
; GFX11-NEXT: s_cbranch_execnz .LBB5_1		; GFX11-NEXT: s_cbranch_execnz .LBB5_1
; GFX11-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX11-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX11-NEXT: s_or_b32 exec_lo, exec_lo, s2		; GFX11-NEXT: s_or_b32 exec_lo, exec_lo, s2
; GFX11-NEXT: global_store_b32 v[0:1], v1, off		; GFX11-NEXT: global_store_b32 v[0:1], v0, off
; GFX11-NEXT: s_sendmsg sendmsg(MSG_DEALLOC_VGPRS)		; GFX11-NEXT: s_sendmsg sendmsg(MSG_DEALLOC_VGPRS)
; GFX11-NEXT: s_endpgm		; GFX11-NEXT: s_endpgm
%result = atomicrmw fadd ptr addrspace(1) %ptr, float 4.0 syncscope("one-as") seq_cst		%result = atomicrmw fadd ptr addrspace(1) %ptr, float 4.0 syncscope("one-as") seq_cst
store float %result, ptr addrspace(1) undef		store float %result, ptr addrspace(1) undef
ret void		ret void
}		}

define amdgpu_kernel void @global_atomic_fadd_ret_f32_wrong_subtarget(ptr addrspace(1) %ptr) #1 {		define amdgpu_kernel void @global_atomic_fadd_ret_f32_wrong_subtarget(ptr addrspace(1) %ptr) #1 {
; GCN-LABEL: global_atomic_fadd_ret_f32_wrong_subtarget:		; GCN-LABEL: global_atomic_fadd_ret_f32_wrong_subtarget:
; GCN: ; %bb.0:		; GCN: ; %bb.0:
; GCN-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GCN-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GCN-NEXT: s_mov_b64 s[2:3], 0		; GCN-NEXT: s_mov_b64 s[2:3], 0
; GCN-NEXT: v_mov_b32_e32 v0, 0		; GCN-NEXT: v_mov_b32_e32 v1, 0
; GCN-NEXT: s_waitcnt lgkmcnt(0)		; GCN-NEXT: s_waitcnt lgkmcnt(0)
; GCN-NEXT: s_load_dword s4, s[0:1], 0x0		; GCN-NEXT: s_load_dword s4, s[0:1], 0x0
; GCN-NEXT: s_waitcnt lgkmcnt(0)		; GCN-NEXT: s_waitcnt lgkmcnt(0)
; GCN-NEXT: v_mov_b32_e32 v1, s4		; GCN-NEXT: v_mov_b32_e32 v0, s4
; GCN-NEXT: .LBB6_1: ; %atomicrmw.start		; GCN-NEXT: .LBB6_1: ; %atomicrmw.start
; GCN-NEXT: ; =>This Inner Loop Header: Depth=1		; GCN-NEXT: ; =>This Inner Loop Header: Depth=1
; GCN-NEXT: v_mov_b32_e32 v2, v1		; GCN-NEXT: v_mov_b32_e32 v3, v0
; GCN-NEXT: v_add_f32_e32 v1, 4.0, v2		; GCN-NEXT: v_add_f32_e32 v2, 4.0, v3
; GCN-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GCN-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GCN-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GCN-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GCN-NEXT: s_waitcnt vmcnt(0)		; GCN-NEXT: s_waitcnt vmcnt(0)
; GCN-NEXT: buffer_wbinvl1_vol		; GCN-NEXT: buffer_wbinvl1_vol
; GCN-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GCN-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GCN-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GCN-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GCN-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GCN-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GCN-NEXT: s_cbranch_execnz .LBB6_1		; GCN-NEXT: s_cbranch_execnz .LBB6_1
; GCN-NEXT: ; %bb.2: ; %atomicrmw.end		; GCN-NEXT: ; %bb.2: ; %atomicrmw.end
; GCN-NEXT: s_or_b64 exec, exec, s[2:3]		; GCN-NEXT: s_or_b64 exec, exec, s[2:3]
; GCN-NEXT: global_store_dword v[0:1], v1, off		; GCN-NEXT: global_store_dword v[0:1], v0, off
; GCN-NEXT: s_endpgm		; GCN-NEXT: s_endpgm
;		;
; GFX11-LABEL: global_atomic_fadd_ret_f32_wrong_subtarget:		; GFX11-LABEL: global_atomic_fadd_ret_f32_wrong_subtarget:
; GFX11: ; %bb.0:		; GFX11: ; %bb.0:
; GFX11-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24		; GFX11-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
; GFX11-NEXT: s_mov_b64 s[2:3], 0		; GFX11-NEXT: s_mov_b64 s[2:3], 0
; GFX11-NEXT: v_mov_b32_e32 v0, 0		; GFX11-NEXT: v_mov_b32_e32 v1, 0
; GFX11-NEXT: s_waitcnt lgkmcnt(0)		; GFX11-NEXT: s_waitcnt lgkmcnt(0)
; GFX11-NEXT: s_load_dword s4, s[0:1], 0x0		; GFX11-NEXT: s_load_dword s4, s[0:1], 0x0
; GFX11-NEXT: s_waitcnt lgkmcnt(0)		; GFX11-NEXT: s_waitcnt lgkmcnt(0)
; GFX11-NEXT: v_mov_b32_e32 v1, s4		; GFX11-NEXT: v_mov_b32_e32 v0, s4
; GFX11-NEXT: .LBB6_1: ; %atomicrmw.start		; GFX11-NEXT: .LBB6_1: ; %atomicrmw.start
; GFX11-NEXT: ; =>This Inner Loop Header: Depth=1		; GFX11-NEXT: ; =>This Inner Loop Header: Depth=1
; GFX11-NEXT: v_mov_b32_e32 v2, v1		; GFX11-NEXT: v_mov_b32_e32 v3, v0
; GFX11-NEXT: v_add_f32_e32 v1, 4.0, v2		; GFX11-NEXT: v_add_f32_e32 v2, 4.0, v3
; GFX11-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)		; GFX11-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
; GFX11-NEXT: global_atomic_cmpswap v1, v0, v[1:2], s[0:1] glc		; GFX11-NEXT: global_atomic_cmpswap v0, v1, v[2:3], s[0:1] glc
; GFX11-NEXT: s_waitcnt vmcnt(0)		; GFX11-NEXT: s_waitcnt vmcnt(0)
; GFX11-NEXT: buffer_wbinvl1_vol		; GFX11-NEXT: buffer_wbinvl1_vol
; GFX11-NEXT: v_cmp_eq_u32_e32 vcc, v1, v2		; GFX11-NEXT: v_cmp_eq_u32_e32 vcc, v0, v3
; GFX11-NEXT: s_or_b64 s[2:3], vcc, s[2:3]		; GFX11-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
; GFX11-NEXT: s_andn2_b64 exec, exec, s[2:3]		; GFX11-NEXT: s_andn2_b64 exec, exec, s[2:3]
; GFX11-NEXT: s_cbranch_execnz .LBB6_1		; GFX11-NEXT: s_cbranch_execnz .LBB6_1
; GFX11-NEXT: ; %bb.2: ; %atomicrmw.end		; GFX11-NEXT: ; %bb.2: ; %atomicrmw.end
; GFX11-NEXT: s_or_b64 exec, exec, s[2:3]		; GFX11-NEXT: s_or_b64 exec, exec, s[2:3]
; GFX11-NEXT: global_store_dword v[0:1], v1, off		; GFX11-NEXT: global_store_dword v[0:1], v0, off
; GFX11-NEXT: s_endpgm		; GFX11-NEXT: s_endpgm
%result = atomicrmw fadd ptr addrspace(1) %ptr, float 4.0 syncscope("agent") seq_cst		%result = atomicrmw fadd ptr addrspace(1) %ptr, float 4.0 syncscope("agent") seq_cst
store float %result, ptr addrspace(1) undef		store float %result, ptr addrspace(1) undef
ret void		ret void
}		}

define amdgpu_kernel void @global_atomic_fadd_noret_f32_wrong_subtarget(ptr addrspace(1) %ptr) #1 {		define amdgpu_kernel void @global_atomic_fadd_noret_f32_wrong_subtarget(ptr addrspace(1) %ptr) #1 {
; GCN-LABEL: global_atomic_fadd_noret_f32_wrong_subtarget:		; GCN-LABEL: global_atomic_fadd_noret_f32_wrong_subtarget:

llvm/test/CodeGen/AMDGPU/mubuf-legalize-operands-non-ptr-intrinsics.ll

	; W64-O0-NEXT: ; %bb.7:			; W64-O0-NEXT: ; %bb.7:
	; W64-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:68 ; 4-byte Folded Reload			; W64-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:68 ; 4-byte Folded Reload
	; W64-O0-NEXT: v_readlane_b32 s4, v8, 13			; W64-O0-NEXT: v_readlane_b32 s4, v8, 13
	; W64-O0-NEXT: v_readlane_b32 s5, v8, 14			; W64-O0-NEXT: v_readlane_b32 s5, v8, 14
	; W64-O0-NEXT: s_mov_b64 exec, s[4:5]			; W64-O0-NEXT: s_mov_b64 exec, s[4:5]
	; W64-O0-NEXT: s_waitcnt vmcnt(0)			; W64-O0-NEXT: s_waitcnt vmcnt(0)
	; W64-O0-NEXT: buffer_store_dword v0, off, s[0:3], s32 offset:60 ; 4-byte Folded Spill			; W64-O0-NEXT: buffer_store_dword v0, off, s[0:3], s32 offset:60 ; 4-byte Folded Spill
	; W64-O0-NEXT: .LBB2_8: ; %bb2			; W64-O0-NEXT: .LBB2_8: ; %bb2
	; W64-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:20 ; 4-byte Folded Reload
	; W64-O0-NEXT: s_nop 0
	; W64-O0-NEXT: buffer_load_dword v1, off, s[0:3], s32 offset:24 ; 4-byte Folded Reload
	; W64-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:60 ; 4-byte Folded Reload
	; W64-O0-NEXT: v_readlane_b32 s4, v8, 10			; W64-O0-NEXT: v_readlane_b32 s4, v8, 10
	; W64-O0-NEXT: v_readlane_b32 s5, v8, 11			; W64-O0-NEXT: v_readlane_b32 s5, v8, 11
	; W64-O0-NEXT: s_or_b64 exec, exec, s[4:5]			; W64-O0-NEXT: s_or_b64 exec, exec, s[4:5]
				; W64-O0-NEXT: ; %bb.9: ; %bb2
				; W64-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:20 ; 4-byte Folded Reload
				; W64-O0-NEXT: buffer_load_dword v1, off, s[0:3], s32 offset:24 ; 4-byte Folded Reload
				; W64-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:60 ; 4-byte Folded Reload
	; W64-O0-NEXT: s_waitcnt vmcnt(0)			; W64-O0-NEXT: s_waitcnt vmcnt(0)
	; W64-O0-NEXT: global_store_dword v[0:1], v2, off			; W64-O0-NEXT: global_store_dword v[0:1], v2, off
	; W64-O0-NEXT: s_waitcnt vmcnt(0)			; W64-O0-NEXT: s_waitcnt vmcnt(0)
	; W64-O0-NEXT: s_xor_saveexec_b64 s[4:5], -1			; W64-O0-NEXT: s_xor_saveexec_b64 s[4:5], -1
	; W64-O0-NEXT: buffer_load_dword v8, off, s[0:3], s32 offset:72 ; 4-byte Folded Reload			; W64-O0-NEXT: buffer_load_dword v8, off, s[0:3], s32 offset:72 ; 4-byte Folded Reload
	; W64-O0-NEXT: s_mov_b64 exec, s[4:5]			; W64-O0-NEXT: s_mov_b64 exec, s[4:5]
	; W64-O0-NEXT: s_waitcnt vmcnt(0)			; W64-O0-NEXT: s_waitcnt vmcnt(0)
	; W64-O0-NEXT: s_setpc_b64 s[30:31]			; W64-O0-NEXT: s_setpc_b64 s[30:31]

llvm/test/CodeGen/AMDGPU/mubuf-legalize-operands.ll

	; W64-O0-NEXT: ; %bb.7:			; W64-O0-NEXT: ; %bb.7:
	; W64-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:92 ; 4-byte Folded Reload			; W64-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:92 ; 4-byte Folded Reload
	; W64-O0-NEXT: v_readlane_b32 s4, v8, 13			; W64-O0-NEXT: v_readlane_b32 s4, v8, 13
	; W64-O0-NEXT: v_readlane_b32 s5, v8, 14			; W64-O0-NEXT: v_readlane_b32 s5, v8, 14
	; W64-O0-NEXT: s_mov_b64 exec, s[4:5]			; W64-O0-NEXT: s_mov_b64 exec, s[4:5]
	; W64-O0-NEXT: s_waitcnt vmcnt(0)			; W64-O0-NEXT: s_waitcnt vmcnt(0)
	; W64-O0-NEXT: buffer_store_dword v0, off, s[0:3], s32 offset:68 ; 4-byte Folded Spill			; W64-O0-NEXT: buffer_store_dword v0, off, s[0:3], s32 offset:68 ; 4-byte Folded Spill
	; W64-O0-NEXT: .LBB2_8: ; %bb2			; W64-O0-NEXT: .LBB2_8: ; %bb2
	; W64-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:4 ; 4-byte Folded Reload
	; W64-O0-NEXT: s_nop 0
	; W64-O0-NEXT: buffer_load_dword v1, off, s[0:3], s32 offset:8 ; 4-byte Folded Reload
	; W64-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:68 ; 4-byte Folded Reload
	; W64-O0-NEXT: v_readlane_b32 s4, v8, 10			; W64-O0-NEXT: v_readlane_b32 s4, v8, 10
	; W64-O0-NEXT: v_readlane_b32 s5, v8, 11			; W64-O0-NEXT: v_readlane_b32 s5, v8, 11
	; W64-O0-NEXT: s_or_b64 exec, exec, s[4:5]			; W64-O0-NEXT: s_or_b64 exec, exec, s[4:5]
				; W64-O0-NEXT: ; %bb.9: ; %bb2
				; W64-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:4 ; 4-byte Folded Reload
				; W64-O0-NEXT: buffer_load_dword v1, off, s[0:3], s32 offset:8 ; 4-byte Folded Reload
				; W64-O0-NEXT: buffer_load_dword v2, off, s[0:3], s32 offset:68 ; 4-byte Folded Reload
	; W64-O0-NEXT: s_waitcnt vmcnt(0)			; W64-O0-NEXT: s_waitcnt vmcnt(0)
	; W64-O0-NEXT: global_store_dword v[0:1], v2, off			; W64-O0-NEXT: global_store_dword v[0:1], v2, off
	; W64-O0-NEXT: s_waitcnt vmcnt(0)			; W64-O0-NEXT: s_waitcnt vmcnt(0)
	; W64-O0-NEXT: s_xor_saveexec_b64 s[4:5], -1			; W64-O0-NEXT: s_xor_saveexec_b64 s[4:5], -1
	; W64-O0-NEXT: buffer_load_dword v8, off, s[0:3], s32 offset:96 ; 4-byte Folded Reload			; W64-O0-NEXT: buffer_load_dword v8, off, s[0:3], s32 offset:96 ; 4-byte Folded Reload
	; W64-O0-NEXT: s_mov_b64 exec, s[4:5]			; W64-O0-NEXT: s_mov_b64 exec, s[4:5]
	; W64-O0-NEXT: s_waitcnt vmcnt(0)			; W64-O0-NEXT: s_waitcnt vmcnt(0)
	; W64-O0-NEXT: s_setpc_b64 s[30:31]			; W64-O0-NEXT: s_setpc_b64 s[30:31]

llvm/test/CodeGen/AMDGPU/tuple-allocation-failure.ll

	; GLOBALNESS1-NEXT: v_cmp_eq_u32_e32 vcc, 1, v0			; GLOBALNESS1-NEXT: v_cmp_eq_u32_e32 vcc, 1, v0
	; GLOBALNESS1-NEXT: v_cndmask_b32_e64 v3, 0, 1, vcc			; GLOBALNESS1-NEXT: v_cndmask_b32_e64 v3, 0, 1, vcc
	; GLOBALNESS1-NEXT: v_cmp_eq_u32_e32 vcc, 0, v0			; GLOBALNESS1-NEXT: v_cmp_eq_u32_e32 vcc, 0, v0
	; GLOBALNESS1-NEXT: v_cndmask_b32_e64 v0, 0, 1, vcc			; GLOBALNESS1-NEXT: v_cndmask_b32_e64 v0, 0, 1, vcc
	; GLOBALNESS1-NEXT: v_cmp_ne_u32_e64 s[54:55], 1, v1			; GLOBALNESS1-NEXT: v_cmp_ne_u32_e64 s[54:55], 1, v1
	; GLOBALNESS1-NEXT: v_cmp_ne_u32_e64 s[56:57], 1, v2			; GLOBALNESS1-NEXT: v_cmp_ne_u32_e64 s[56:57], 1, v2
	; GLOBALNESS1-NEXT: v_cmp_ne_u32_e64 s[58:59], 1, v3			; GLOBALNESS1-NEXT: v_cmp_ne_u32_e64 s[58:59], 1, v3
	; GLOBALNESS1-NEXT: v_cmp_ne_u32_e64 s[60:61], 1, v0			; GLOBALNESS1-NEXT: v_cmp_ne_u32_e64 s[60:61], 1, v0
	; GLOBALNESS1-NEXT: s_branch .LBB1_4			; GLOBALNESS1-NEXT: s_branch .LBB1_3
	; GLOBALNESS1-NEXT: .LBB1_1: ; %bb70.i			; GLOBALNESS1-NEXT: .LBB1_1: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], -1
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[60:61]			; GLOBALNESS1-NEXT: ; implicit-def: $vgpr0_vgpr1
	; GLOBALNESS1-NEXT: s_cbranch_vccz .LBB1_29			; GLOBALNESS1-NEXT: .LBB1_2: ; %Flow28
	; GLOBALNESS1-NEXT: .LBB1_2: ; %Flow15			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1
	; GLOBALNESS1-NEXT: s_or_b64 exec, exec, s[4:5]
	; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], 0
	; GLOBALNESS1-NEXT: ; implicit-def: $sgpr4_sgpr5
	; GLOBALNESS1-NEXT: .LBB1_3: ; %Flow28
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[6:7]			; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[6:7]
	; GLOBALNESS1-NEXT: v_pk_mov_b32 v[44:45], v[0:1], v[0:1] op_sel:[0,1]			; GLOBALNESS1-NEXT: v_pk_mov_b32 v[44:45], v[0:1], v[0:1] op_sel:[0,1]
	; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_30			; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_30
	; GLOBALNESS1-NEXT: .LBB1_4: ; %bb5			; GLOBALNESS1-NEXT: .LBB1_3: ; %bb5
	; GLOBALNESS1-NEXT: ; =>This Loop Header: Depth=1			; GLOBALNESS1-NEXT: ; =>This Loop Header: Depth=1
	; GLOBALNESS1-NEXT: ; Child Loop BB1_15 Depth 2			; GLOBALNESS1-NEXT: ; Child Loop BB1_15 Depth 2
	; GLOBALNESS1-NEXT: v_pk_mov_b32 v[0:1], s[74:75], s[74:75] op_sel:[0,1]			; GLOBALNESS1-NEXT: v_pk_mov_b32 v[0:1], s[74:75], s[74:75] op_sel:[0,1]
	; GLOBALNESS1-NEXT: flat_load_dword v43, v[0:1]			; GLOBALNESS1-NEXT: flat_load_dword v43, v[0:1]
	; GLOBALNESS1-NEXT: s_add_u32 s8, s38, 40			; GLOBALNESS1-NEXT: s_add_u32 s8, s38, 40
	; GLOBALNESS1-NEXT: buffer_store_dword v40, off, s[0:3], 0			; GLOBALNESS1-NEXT: buffer_store_dword v40, off, s[0:3], 0
	; GLOBALNESS1-NEXT: flat_load_dword v46, v[0:1]			; GLOBALNESS1-NEXT: flat_load_dword v46, v[0:1]
	; GLOBALNESS1-NEXT: s_addc_u32 s9, s39, 0			; GLOBALNESS1-NEXT: s_addc_u32 s9, s39, 0
	; GLOBALNESS1-NEXT: s_mov_b64 s[4:5], s[40:41]			; GLOBALNESS1-NEXT: s_mov_b64 s[4:5], s[40:41]
	; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], s[36:37]			; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], s[36:37]
	; GLOBALNESS1-NEXT: s_mov_b64 s[10:11], s[34:35]			; GLOBALNESS1-NEXT: s_mov_b64 s[10:11], s[34:35]
	; GLOBALNESS1-NEXT: s_mov_b32 s12, s72			; GLOBALNESS1-NEXT: s_mov_b32 s12, s72
	; GLOBALNESS1-NEXT: s_mov_b32 s13, s71			; GLOBALNESS1-NEXT: s_mov_b32 s13, s71
	; GLOBALNESS1-NEXT: s_mov_b32 s14, s70			; GLOBALNESS1-NEXT: s_mov_b32 s14, s70
	; GLOBALNESS1-NEXT: v_mov_b32_e32 v31, v42			; GLOBALNESS1-NEXT: v_mov_b32_e32 v31, v42
	; GLOBALNESS1-NEXT: s_waitcnt lgkmcnt(0)			; GLOBALNESS1-NEXT: s_waitcnt lgkmcnt(0)
	; GLOBALNESS1-NEXT: s_swappc_b64 s[30:31], s[76:77]			; GLOBALNESS1-NEXT: s_swappc_b64 s[30:31], s[76:77]
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[46:47]			; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[46:47]
	; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], -1			; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], -1
	; GLOBALNESS1-NEXT: ; implicit-def: $sgpr4_sgpr5			; GLOBALNESS1-NEXT: ; implicit-def: $sgpr4_sgpr5
	; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_8			; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_8
	; GLOBALNESS1-NEXT: ; %bb.5: ; %NodeBlock			; GLOBALNESS1-NEXT: ; %bb.4: ; %NodeBlock
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: s_cmp_lt_i32 s79, 1			; GLOBALNESS1-NEXT: s_cmp_lt_i32 s79, 1
	; GLOBALNESS1-NEXT: s_cbranch_scc1 .LBB1_7			; GLOBALNESS1-NEXT: s_cbranch_scc1 .LBB1_6
	; GLOBALNESS1-NEXT: ; %bb.6: ; %LeafBlock12			; GLOBALNESS1-NEXT: ; %bb.5: ; %LeafBlock12
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: s_cmp_lg_u32 s79, 1			; GLOBALNESS1-NEXT: s_cmp_lg_u32 s79, 1
	; GLOBALNESS1-NEXT: s_mov_b64 s[4:5], -1			; GLOBALNESS1-NEXT: s_mov_b64 s[4:5], -1
	; GLOBALNESS1-NEXT: s_cselect_b64 s[6:7], -1, 0			; GLOBALNESS1-NEXT: s_cselect_b64 s[6:7], -1, 0
	; GLOBALNESS1-NEXT: s_cbranch_execnz .LBB1_8			; GLOBALNESS1-NEXT: s_cbranch_execz .LBB1_7
	; GLOBALNESS1-NEXT: s_branch .LBB1_23			; GLOBALNESS1-NEXT: s_branch .LBB1_8
	; GLOBALNESS1-NEXT: .LBB1_7: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: .LBB1_6: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], 0			; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], 0
	; GLOBALNESS1-NEXT: ; implicit-def: $sgpr4_sgpr5			; GLOBALNESS1-NEXT: ; implicit-def: $sgpr4_sgpr5
	; GLOBALNESS1-NEXT: s_branch .LBB1_23			; GLOBALNESS1-NEXT: .LBB1_7: ; %LeafBlock
				; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
				; GLOBALNESS1-NEXT: s_cmp_lg_u32 s79, 0
				; GLOBALNESS1-NEXT: s_mov_b64 s[4:5], 0
				; GLOBALNESS1-NEXT: s_cselect_b64 s[6:7], -1, 0
	; GLOBALNESS1-NEXT: .LBB1_8: ; %Flow25			; GLOBALNESS1-NEXT: .LBB1_8: ; %Flow25
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[6:7]			; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[6:7]
	; GLOBALNESS1-NEXT: s_cbranch_vccz .LBB1_24			; GLOBALNESS1-NEXT: s_cbranch_vccz .LBB1_1
	; GLOBALNESS1-NEXT: .LBB1_9: ; %baz.exit.i			; GLOBALNESS1-NEXT: ; %bb.9: ; %baz.exit.i
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: v_pk_mov_b32 v[2:3], 0, 0			; GLOBALNESS1-NEXT: v_pk_mov_b32 v[2:3], 0, 0
	; GLOBALNESS1-NEXT: flat_load_dword v0, v[2:3]			; GLOBALNESS1-NEXT: flat_load_dword v0, v[2:3]
	; GLOBALNESS1-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)			; GLOBALNESS1-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
	; GLOBALNESS1-NEXT: v_cmp_gt_i32_e64 s[62:63], 0, v0			; GLOBALNESS1-NEXT: v_cmp_gt_i32_e64 s[62:63], 0, v0
	; GLOBALNESS1-NEXT: v_mov_b32_e32 v0, 0			; GLOBALNESS1-NEXT: v_mov_b32_e32 v0, 0
	; GLOBALNESS1-NEXT: v_mov_b32_e32 v1, 0x3ff00000			; GLOBALNESS1-NEXT: v_mov_b32_e32 v1, 0x3ff00000
	; GLOBALNESS1-NEXT: s_and_saveexec_b64 s[80:81], s[62:63]			; GLOBALNESS1-NEXT: s_and_saveexec_b64 s[80:81], s[62:63]
	; GLOBALNESS1-NEXT: s_cbranch_execz .LBB1_26			; GLOBALNESS1-NEXT: s_cbranch_execz .LBB1_24
	; GLOBALNESS1-NEXT: ; %bb.10: ; %bb33.i			; GLOBALNESS1-NEXT: ; %bb.10: ; %bb33.i
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: global_load_dwordx2 v[0:1], v[2:3], off			; GLOBALNESS1-NEXT: global_load_dwordx2 v[0:1], v[2:3], off
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[54:55]			; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[54:55]
	; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_12			; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_12
	; GLOBALNESS1-NEXT: ; %bb.11: ; %bb39.i			; GLOBALNESS1-NEXT: ; %bb.11: ; %bb39.i
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: v_mov_b32_e32 v41, v40			; GLOBALNESS1-NEXT: v_mov_b32_e32 v41, v40
	; GLOBALNESS1-NEXT: v_pk_mov_b32 v[2:3], 0, 0			; GLOBALNESS1-NEXT: v_pk_mov_b32 v[2:3], 0, 0
	; GLOBALNESS1-NEXT: global_store_dwordx2 v[2:3], v[40:41], off			; GLOBALNESS1-NEXT: global_store_dwordx2 v[2:3], v[40:41], off
	; GLOBALNESS1-NEXT: .LBB1_12: ; %bb44.lr.ph.i			; GLOBALNESS1-NEXT: .LBB1_12: ; %bb44.lr.ph.i
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: v_cmp_ne_u32_e32 vcc, 0, v46			; GLOBALNESS1-NEXT: v_cmp_ne_u32_e32 vcc, 0, v46
	; GLOBALNESS1-NEXT: v_cndmask_b32_e32 v2, 0, v43, vcc			; GLOBALNESS1-NEXT: v_cndmask_b32_e32 v2, 0, v43, vcc
	; GLOBALNESS1-NEXT: s_waitcnt vmcnt(0)			; GLOBALNESS1-NEXT: s_waitcnt vmcnt(0)
	; GLOBALNESS1-NEXT: v_cmp_nlt_f64_e64 s[64:65], 0, v[0:1]			; GLOBALNESS1-NEXT: v_cmp_nlt_f64_e64 s[64:65], 0, v[0:1]
	; GLOBALNESS1-NEXT: v_cmp_eq_u32_e64 s[66:67], 0, v2			; GLOBALNESS1-NEXT: v_cmp_eq_u32_e64 s[66:67], 0, v2
	; GLOBALNESS1-NEXT: s_branch .LBB1_15			; GLOBALNESS1-NEXT: s_branch .LBB1_15
	; GLOBALNESS1-NEXT: .LBB1_13: ; %Flow16			; GLOBALNESS1-NEXT: .LBB1_13: ; %Flow16
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_15 Depth=2			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_15 Depth=2
	; GLOBALNESS1-NEXT: s_or_b64 exec, exec, s[4:5]			; GLOBALNESS1-NEXT: s_or_b64 exec, exec, s[4:5]
	; GLOBALNESS1-NEXT: .LBB1_14: ; %bb63.i			; GLOBALNESS1-NEXT: .LBB1_14: ; %bb63.i
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_15 Depth=2			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_15 Depth=2
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[52:53]			; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[52:53]
	; GLOBALNESS1-NEXT: s_cbranch_vccz .LBB1_25			; GLOBALNESS1-NEXT: s_cbranch_vccz .LBB1_23
	; GLOBALNESS1-NEXT: .LBB1_15: ; %bb44.i			; GLOBALNESS1-NEXT: .LBB1_15: ; %bb44.i
	; GLOBALNESS1-NEXT: ; Parent Loop BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; Parent Loop BB1_3 Depth=1
	; GLOBALNESS1-NEXT: ; => This Inner Loop Header: Depth=2			; GLOBALNESS1-NEXT: ; => This Inner Loop Header: Depth=2
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[48:49]			; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[48:49]
	; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_14			; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_14
	; GLOBALNESS1-NEXT: ; %bb.16: ; %bb46.i			; GLOBALNESS1-NEXT: ; %bb.16: ; %bb46.i
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_15 Depth=2			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_15 Depth=2
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[50:51]			; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[50:51]
	; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_14			; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_14
	; GLOBALNESS1-NEXT: ; %bb.17: ; %bb50.i			; GLOBALNESS1-NEXT: ; %bb.17: ; %bb50.i
	[37 lines not shown]
	; GLOBALNESS1-NEXT: s_swappc_b64 s[30:31], s[76:77]			; GLOBALNESS1-NEXT: s_swappc_b64 s[30:31], s[76:77]
	; GLOBALNESS1-NEXT: s_and_saveexec_b64 s[4:5], s[66:67]			; GLOBALNESS1-NEXT: s_and_saveexec_b64 s[4:5], s[66:67]
	; GLOBALNESS1-NEXT: s_cbranch_execz .LBB1_13			; GLOBALNESS1-NEXT: s_cbranch_execz .LBB1_13
	; GLOBALNESS1-NEXT: ; %bb.22: ; %bb62.i			; GLOBALNESS1-NEXT: ; %bb.22: ; %bb62.i
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_15 Depth=2			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_15 Depth=2
	; GLOBALNESS1-NEXT: v_mov_b32_e32 v41, v40			; GLOBALNESS1-NEXT: v_mov_b32_e32 v41, v40
	; GLOBALNESS1-NEXT: global_store_dwordx2 v[46:47], v[40:41], off			; GLOBALNESS1-NEXT: global_store_dwordx2 v[46:47], v[40:41], off
	; GLOBALNESS1-NEXT: s_branch .LBB1_13			; GLOBALNESS1-NEXT: s_branch .LBB1_13
	; GLOBALNESS1-NEXT: .LBB1_23: ; %LeafBlock			; GLOBALNESS1-NEXT: .LBB1_23: ; %Flow23
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: s_cmp_lg_u32 s79, 0
	; GLOBALNESS1-NEXT: s_mov_b64 s[4:5], 0
	; GLOBALNESS1-NEXT: s_cselect_b64 s[6:7], -1, 0
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[6:7]
	; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_9
	; GLOBALNESS1-NEXT: .LBB1_24: ; in Loop: Header=BB1_4 Depth=1
	; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], -1
	; GLOBALNESS1-NEXT: ; implicit-def: $vgpr0_vgpr1
	; GLOBALNESS1-NEXT: s_branch .LBB1_3
	; GLOBALNESS1-NEXT: .LBB1_25: ; %Flow23
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1
	; GLOBALNESS1-NEXT: v_pk_mov_b32 v[0:1], 0, 0			; GLOBALNESS1-NEXT: v_pk_mov_b32 v[0:1], 0, 0
	; GLOBALNESS1-NEXT: .LBB1_26: ; %Flow24			; GLOBALNESS1-NEXT: .LBB1_24: ; %Flow24
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: s_or_b64 exec, exec, s[80:81]			; GLOBALNESS1-NEXT: s_or_b64 exec, exec, s[80:81]
	; GLOBALNESS1-NEXT: s_and_saveexec_b64 s[4:5], s[62:63]			; GLOBALNESS1-NEXT: s_and_saveexec_b64 s[4:5], s[62:63]
	; GLOBALNESS1-NEXT: s_cbranch_execz .LBB1_2			; GLOBALNESS1-NEXT: s_cbranch_execz .LBB1_29
	; GLOBALNESS1-NEXT: ; %bb.27: ; %bb67.i			; GLOBALNESS1-NEXT: ; %bb.25: ; %bb67.i
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[58:59]			; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[58:59]
	; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_1			; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_27
	; GLOBALNESS1-NEXT: ; %bb.28: ; %bb69.i			; GLOBALNESS1-NEXT: ; %bb.26: ; %bb69.i
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: v_mov_b32_e32 v41, v40			; GLOBALNESS1-NEXT: v_mov_b32_e32 v41, v40
	; GLOBALNESS1-NEXT: v_pk_mov_b32 v[2:3], 0, 0			; GLOBALNESS1-NEXT: v_pk_mov_b32 v[2:3], 0, 0
	; GLOBALNESS1-NEXT: global_store_dwordx2 v[2:3], v[40:41], off			; GLOBALNESS1-NEXT: global_store_dwordx2 v[2:3], v[40:41], off
	; GLOBALNESS1-NEXT: s_branch .LBB1_1			; GLOBALNESS1-NEXT: .LBB1_27: ; %bb70.i
	; GLOBALNESS1-NEXT: .LBB1_29: ; %bb73.i			; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS1-NEXT: s_and_b64 vcc, exec, s[60:61]
				; GLOBALNESS1-NEXT: s_cbranch_vccnz .LBB1_29
				; GLOBALNESS1-NEXT: ; %bb.28: ; %bb73.i
				; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS1-NEXT: v_mov_b32_e32 v41, v40			; GLOBALNESS1-NEXT: v_mov_b32_e32 v41, v40
	; GLOBALNESS1-NEXT: v_pk_mov_b32 v[2:3], 0, 0			; GLOBALNESS1-NEXT: v_pk_mov_b32 v[2:3], 0, 0
	; GLOBALNESS1-NEXT: global_store_dwordx2 v[2:3], v[40:41], off			; GLOBALNESS1-NEXT: global_store_dwordx2 v[2:3], v[40:41], off
				; GLOBALNESS1-NEXT: .LBB1_29: ; %Flow15
				; GLOBALNESS1-NEXT: ; in Loop: Header=BB1_3 Depth=1
				; GLOBALNESS1-NEXT: s_or_b64 exec, exec, s[4:5]
				; GLOBALNESS1-NEXT: s_mov_b64 s[6:7], 0
				; GLOBALNESS1-NEXT: ; implicit-def: $sgpr4_sgpr5
	; GLOBALNESS1-NEXT: s_branch .LBB1_2			; GLOBALNESS1-NEXT: s_branch .LBB1_2
	; GLOBALNESS1-NEXT: .LBB1_30: ; %loop.exit.guard			; GLOBALNESS1-NEXT: .LBB1_30: ; %loop.exit.guard
	; GLOBALNESS1-NEXT: s_andn2_b64 vcc, exec, s[4:5]			; GLOBALNESS1-NEXT: s_andn2_b64 vcc, exec, s[4:5]
	; GLOBALNESS1-NEXT: s_mov_b64 s[4:5], -1			; GLOBALNESS1-NEXT: s_mov_b64 s[4:5], -1
	; GLOBALNESS1-NEXT: s_cbranch_vccz .LBB1_32			; GLOBALNESS1-NEXT: s_cbranch_vccz .LBB1_32
	; GLOBALNESS1-NEXT: ; %bb.31: ; %bb7.i.i			; GLOBALNESS1-NEXT: ; %bb.31: ; %bb7.i.i
	; GLOBALNESS1-NEXT: s_add_u32 s8, s38, 40			; GLOBALNESS1-NEXT: s_add_u32 s8, s38, 40
	; GLOBALNESS1-NEXT: s_addc_u32 s9, s39, 0			; GLOBALNESS1-NEXT: s_addc_u32 s9, s39, 0
	[91 lines not shown]
	; GLOBALNESS0-NEXT: v_cmp_eq_u32_e32 vcc, 1, v0			; GLOBALNESS0-NEXT: v_cmp_eq_u32_e32 vcc, 1, v0
	; GLOBALNESS0-NEXT: v_cndmask_b32_e64 v3, 0, 1, vcc			; GLOBALNESS0-NEXT: v_cndmask_b32_e64 v3, 0, 1, vcc
	; GLOBALNESS0-NEXT: v_cmp_eq_u32_e32 vcc, 0, v0			; GLOBALNESS0-NEXT: v_cmp_eq_u32_e32 vcc, 0, v0
	; GLOBALNESS0-NEXT: v_cndmask_b32_e64 v0, 0, 1, vcc			; GLOBALNESS0-NEXT: v_cndmask_b32_e64 v0, 0, 1, vcc
	; GLOBALNESS0-NEXT: v_cmp_ne_u32_e64 s[54:55], 1, v1			; GLOBALNESS0-NEXT: v_cmp_ne_u32_e64 s[54:55], 1, v1
	; GLOBALNESS0-NEXT: v_cmp_ne_u32_e64 s[56:57], 1, v2			; GLOBALNESS0-NEXT: v_cmp_ne_u32_e64 s[56:57], 1, v2
	; GLOBALNESS0-NEXT: v_cmp_ne_u32_e64 s[58:59], 1, v3			; GLOBALNESS0-NEXT: v_cmp_ne_u32_e64 s[58:59], 1, v3
	; GLOBALNESS0-NEXT: v_cmp_ne_u32_e64 s[60:61], 1, v0			; GLOBALNESS0-NEXT: v_cmp_ne_u32_e64 s[60:61], 1, v0
	; GLOBALNESS0-NEXT: s_branch .LBB1_4			; GLOBALNESS0-NEXT: s_branch .LBB1_3
	; GLOBALNESS0-NEXT: .LBB1_1: ; %bb70.i			; GLOBALNESS0-NEXT: .LBB1_1: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], -1
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[60:61]			; GLOBALNESS0-NEXT: ; implicit-def: $vgpr0_vgpr1
	; GLOBALNESS0-NEXT: s_cbranch_vccz .LBB1_29			; GLOBALNESS0-NEXT: .LBB1_2: ; %Flow28
	; GLOBALNESS0-NEXT: .LBB1_2: ; %Flow15			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1
	; GLOBALNESS0-NEXT: s_or_b64 exec, exec, s[4:5]
	; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], 0
	; GLOBALNESS0-NEXT: ; implicit-def: $sgpr4_sgpr5
	; GLOBALNESS0-NEXT: .LBB1_3: ; %Flow28
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[6:7]			; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[6:7]
	; GLOBALNESS0-NEXT: v_pk_mov_b32 v[44:45], v[0:1], v[0:1] op_sel:[0,1]			; GLOBALNESS0-NEXT: v_pk_mov_b32 v[44:45], v[0:1], v[0:1] op_sel:[0,1]
	; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_30			; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_30
	; GLOBALNESS0-NEXT: .LBB1_4: ; %bb5			; GLOBALNESS0-NEXT: .LBB1_3: ; %bb5
	; GLOBALNESS0-NEXT: ; =>This Loop Header: Depth=1			; GLOBALNESS0-NEXT: ; =>This Loop Header: Depth=1
	; GLOBALNESS0-NEXT: ; Child Loop BB1_15 Depth 2			; GLOBALNESS0-NEXT: ; Child Loop BB1_15 Depth 2
	; GLOBALNESS0-NEXT: v_pk_mov_b32 v[0:1], s[76:77], s[76:77] op_sel:[0,1]			; GLOBALNESS0-NEXT: v_pk_mov_b32 v[0:1], s[76:77], s[76:77] op_sel:[0,1]
	; GLOBALNESS0-NEXT: flat_load_dword v43, v[0:1]			; GLOBALNESS0-NEXT: flat_load_dword v43, v[0:1]
	; GLOBALNESS0-NEXT: s_add_u32 s8, s38, 40			; GLOBALNESS0-NEXT: s_add_u32 s8, s38, 40
	; GLOBALNESS0-NEXT: buffer_store_dword v40, off, s[0:3], 0			; GLOBALNESS0-NEXT: buffer_store_dword v40, off, s[0:3], 0
	; GLOBALNESS0-NEXT: flat_load_dword v46, v[0:1]			; GLOBALNESS0-NEXT: flat_load_dword v46, v[0:1]
	; GLOBALNESS0-NEXT: s_addc_u32 s9, s39, 0			; GLOBALNESS0-NEXT: s_addc_u32 s9, s39, 0
	; GLOBALNESS0-NEXT: s_mov_b64 s[4:5], s[40:41]			; GLOBALNESS0-NEXT: s_mov_b64 s[4:5], s[40:41]
	; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], s[36:37]			; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], s[36:37]
	; GLOBALNESS0-NEXT: s_mov_b64 s[10:11], s[34:35]			; GLOBALNESS0-NEXT: s_mov_b64 s[10:11], s[34:35]
	; GLOBALNESS0-NEXT: s_mov_b32 s12, s70			; GLOBALNESS0-NEXT: s_mov_b32 s12, s70
	; GLOBALNESS0-NEXT: s_mov_b32 s13, s69			; GLOBALNESS0-NEXT: s_mov_b32 s13, s69
	; GLOBALNESS0-NEXT: s_mov_b32 s14, s68			; GLOBALNESS0-NEXT: s_mov_b32 s14, s68
	; GLOBALNESS0-NEXT: v_mov_b32_e32 v31, v42			; GLOBALNESS0-NEXT: v_mov_b32_e32 v31, v42
	; GLOBALNESS0-NEXT: s_waitcnt lgkmcnt(0)			; GLOBALNESS0-NEXT: s_waitcnt lgkmcnt(0)
	; GLOBALNESS0-NEXT: s_swappc_b64 s[30:31], s[78:79]			; GLOBALNESS0-NEXT: s_swappc_b64 s[30:31], s[78:79]
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[46:47]			; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[46:47]
	; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], -1			; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], -1
	; GLOBALNESS0-NEXT: ; implicit-def: $sgpr4_sgpr5			; GLOBALNESS0-NEXT: ; implicit-def: $sgpr4_sgpr5
	; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_8			; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_8
	; GLOBALNESS0-NEXT: ; %bb.5: ; %NodeBlock			; GLOBALNESS0-NEXT: ; %bb.4: ; %NodeBlock
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: s_cmp_lt_i32 s75, 1			; GLOBALNESS0-NEXT: s_cmp_lt_i32 s75, 1
	; GLOBALNESS0-NEXT: s_cbranch_scc1 .LBB1_7			; GLOBALNESS0-NEXT: s_cbranch_scc1 .LBB1_6
	; GLOBALNESS0-NEXT: ; %bb.6: ; %LeafBlock12			; GLOBALNESS0-NEXT: ; %bb.5: ; %LeafBlock12
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: s_cmp_lg_u32 s75, 1			; GLOBALNESS0-NEXT: s_cmp_lg_u32 s75, 1
	; GLOBALNESS0-NEXT: s_mov_b64 s[4:5], -1			; GLOBALNESS0-NEXT: s_mov_b64 s[4:5], -1
	; GLOBALNESS0-NEXT: s_cselect_b64 s[6:7], -1, 0			; GLOBALNESS0-NEXT: s_cselect_b64 s[6:7], -1, 0
	; GLOBALNESS0-NEXT: s_cbranch_execnz .LBB1_8			; GLOBALNESS0-NEXT: s_cbranch_execz .LBB1_7
	; GLOBALNESS0-NEXT: s_branch .LBB1_23			; GLOBALNESS0-NEXT: s_branch .LBB1_8
	; GLOBALNESS0-NEXT: .LBB1_7: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: .LBB1_6: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], 0			; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], 0
	; GLOBALNESS0-NEXT: ; implicit-def: $sgpr4_sgpr5			; GLOBALNESS0-NEXT: ; implicit-def: $sgpr4_sgpr5
	; GLOBALNESS0-NEXT: s_branch .LBB1_23			; GLOBALNESS0-NEXT: .LBB1_7: ; %LeafBlock
				; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
				; GLOBALNESS0-NEXT: s_cmp_lg_u32 s75, 0
				; GLOBALNESS0-NEXT: s_mov_b64 s[4:5], 0
				; GLOBALNESS0-NEXT: s_cselect_b64 s[6:7], -1, 0
	; GLOBALNESS0-NEXT: .LBB1_8: ; %Flow25			; GLOBALNESS0-NEXT: .LBB1_8: ; %Flow25
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[6:7]			; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[6:7]
	; GLOBALNESS0-NEXT: s_cbranch_vccz .LBB1_24			; GLOBALNESS0-NEXT: s_cbranch_vccz .LBB1_1
	; GLOBALNESS0-NEXT: .LBB1_9: ; %baz.exit.i			; GLOBALNESS0-NEXT: ; %bb.9: ; %baz.exit.i
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: v_pk_mov_b32 v[2:3], 0, 0			; GLOBALNESS0-NEXT: v_pk_mov_b32 v[2:3], 0, 0
	; GLOBALNESS0-NEXT: flat_load_dword v0, v[2:3]			; GLOBALNESS0-NEXT: flat_load_dword v0, v[2:3]
	; GLOBALNESS0-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)			; GLOBALNESS0-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
	; GLOBALNESS0-NEXT: v_cmp_gt_i32_e64 s[62:63], 0, v0			; GLOBALNESS0-NEXT: v_cmp_gt_i32_e64 s[62:63], 0, v0
	; GLOBALNESS0-NEXT: v_mov_b32_e32 v0, 0			; GLOBALNESS0-NEXT: v_mov_b32_e32 v0, 0
	; GLOBALNESS0-NEXT: v_mov_b32_e32 v1, 0x3ff00000			; GLOBALNESS0-NEXT: v_mov_b32_e32 v1, 0x3ff00000
	; GLOBALNESS0-NEXT: s_and_saveexec_b64 s[80:81], s[62:63]			; GLOBALNESS0-NEXT: s_and_saveexec_b64 s[80:81], s[62:63]
	; GLOBALNESS0-NEXT: s_cbranch_execz .LBB1_26			; GLOBALNESS0-NEXT: s_cbranch_execz .LBB1_24
	; GLOBALNESS0-NEXT: ; %bb.10: ; %bb33.i			; GLOBALNESS0-NEXT: ; %bb.10: ; %bb33.i
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: global_load_dwordx2 v[0:1], v[2:3], off			; GLOBALNESS0-NEXT: global_load_dwordx2 v[0:1], v[2:3], off
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[54:55]			; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[54:55]
	; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_12			; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_12
	; GLOBALNESS0-NEXT: ; %bb.11: ; %bb39.i			; GLOBALNESS0-NEXT: ; %bb.11: ; %bb39.i
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: v_mov_b32_e32 v41, v40			; GLOBALNESS0-NEXT: v_mov_b32_e32 v41, v40
	; GLOBALNESS0-NEXT: v_pk_mov_b32 v[2:3], 0, 0			; GLOBALNESS0-NEXT: v_pk_mov_b32 v[2:3], 0, 0
	; GLOBALNESS0-NEXT: global_store_dwordx2 v[2:3], v[40:41], off			; GLOBALNESS0-NEXT: global_store_dwordx2 v[2:3], v[40:41], off
	; GLOBALNESS0-NEXT: .LBB1_12: ; %bb44.lr.ph.i			; GLOBALNESS0-NEXT: .LBB1_12: ; %bb44.lr.ph.i
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: v_cmp_ne_u32_e32 vcc, 0, v46			; GLOBALNESS0-NEXT: v_cmp_ne_u32_e32 vcc, 0, v46
	; GLOBALNESS0-NEXT: v_cndmask_b32_e32 v2, 0, v43, vcc			; GLOBALNESS0-NEXT: v_cndmask_b32_e32 v2, 0, v43, vcc
	; GLOBALNESS0-NEXT: s_waitcnt vmcnt(0)			; GLOBALNESS0-NEXT: s_waitcnt vmcnt(0)
	; GLOBALNESS0-NEXT: v_cmp_nlt_f64_e64 s[64:65], 0, v[0:1]			; GLOBALNESS0-NEXT: v_cmp_nlt_f64_e64 s[64:65], 0, v[0:1]
	; GLOBALNESS0-NEXT: v_cmp_eq_u32_e64 s[66:67], 0, v2			; GLOBALNESS0-NEXT: v_cmp_eq_u32_e64 s[66:67], 0, v2
	; GLOBALNESS0-NEXT: s_branch .LBB1_15			; GLOBALNESS0-NEXT: s_branch .LBB1_15
	; GLOBALNESS0-NEXT: .LBB1_13: ; %Flow16			; GLOBALNESS0-NEXT: .LBB1_13: ; %Flow16
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_15 Depth=2			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_15 Depth=2
	; GLOBALNESS0-NEXT: s_or_b64 exec, exec, s[4:5]			; GLOBALNESS0-NEXT: s_or_b64 exec, exec, s[4:5]
	; GLOBALNESS0-NEXT: .LBB1_14: ; %bb63.i			; GLOBALNESS0-NEXT: .LBB1_14: ; %bb63.i
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_15 Depth=2			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_15 Depth=2
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[52:53]			; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[52:53]
	; GLOBALNESS0-NEXT: s_cbranch_vccz .LBB1_25			; GLOBALNESS0-NEXT: s_cbranch_vccz .LBB1_23
	; GLOBALNESS0-NEXT: .LBB1_15: ; %bb44.i			; GLOBALNESS0-NEXT: .LBB1_15: ; %bb44.i
	; GLOBALNESS0-NEXT: ; Parent Loop BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; Parent Loop BB1_3 Depth=1
	; GLOBALNESS0-NEXT: ; => This Inner Loop Header: Depth=2			; GLOBALNESS0-NEXT: ; => This Inner Loop Header: Depth=2
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[48:49]			; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[48:49]
	; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_14			; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_14
	; GLOBALNESS0-NEXT: ; %bb.16: ; %bb46.i			; GLOBALNESS0-NEXT: ; %bb.16: ; %bb46.i
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_15 Depth=2			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_15 Depth=2
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[50:51]			; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[50:51]
	; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_14			; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_14
	; GLOBALNESS0-NEXT: ; %bb.17: ; %bb50.i			; GLOBALNESS0-NEXT: ; %bb.17: ; %bb50.i
	[37 lines not shown]
	; GLOBALNESS0-NEXT: s_swappc_b64 s[30:31], s[78:79]			; GLOBALNESS0-NEXT: s_swappc_b64 s[30:31], s[78:79]
	; GLOBALNESS0-NEXT: s_and_saveexec_b64 s[4:5], s[66:67]			; GLOBALNESS0-NEXT: s_and_saveexec_b64 s[4:5], s[66:67]
	; GLOBALNESS0-NEXT: s_cbranch_execz .LBB1_13			; GLOBALNESS0-NEXT: s_cbranch_execz .LBB1_13
	; GLOBALNESS0-NEXT: ; %bb.22: ; %bb62.i			; GLOBALNESS0-NEXT: ; %bb.22: ; %bb62.i
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_15 Depth=2			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_15 Depth=2
	; GLOBALNESS0-NEXT: v_mov_b32_e32 v41, v40			; GLOBALNESS0-NEXT: v_mov_b32_e32 v41, v40
	; GLOBALNESS0-NEXT: global_store_dwordx2 v[46:47], v[40:41], off			; GLOBALNESS0-NEXT: global_store_dwordx2 v[46:47], v[40:41], off
	; GLOBALNESS0-NEXT: s_branch .LBB1_13			; GLOBALNESS0-NEXT: s_branch .LBB1_13
	; GLOBALNESS0-NEXT: .LBB1_23: ; %LeafBlock			; GLOBALNESS0-NEXT: .LBB1_23: ; %Flow23
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: s_cmp_lg_u32 s75, 0
	; GLOBALNESS0-NEXT: s_mov_b64 s[4:5], 0
	; GLOBALNESS0-NEXT: s_cselect_b64 s[6:7], -1, 0
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[6:7]
	; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_9
	; GLOBALNESS0-NEXT: .LBB1_24: ; in Loop: Header=BB1_4 Depth=1
	; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], -1
	; GLOBALNESS0-NEXT: ; implicit-def: $vgpr0_vgpr1
	; GLOBALNESS0-NEXT: s_branch .LBB1_3
	; GLOBALNESS0-NEXT: .LBB1_25: ; %Flow23
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1
	; GLOBALNESS0-NEXT: v_pk_mov_b32 v[0:1], 0, 0			; GLOBALNESS0-NEXT: v_pk_mov_b32 v[0:1], 0, 0
	; GLOBALNESS0-NEXT: .LBB1_26: ; %Flow24			; GLOBALNESS0-NEXT: .LBB1_24: ; %Flow24
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: s_or_b64 exec, exec, s[80:81]			; GLOBALNESS0-NEXT: s_or_b64 exec, exec, s[80:81]
	; GLOBALNESS0-NEXT: s_and_saveexec_b64 s[4:5], s[62:63]			; GLOBALNESS0-NEXT: s_and_saveexec_b64 s[4:5], s[62:63]
	; GLOBALNESS0-NEXT: s_cbranch_execz .LBB1_2			; GLOBALNESS0-NEXT: s_cbranch_execz .LBB1_29
	; GLOBALNESS0-NEXT: ; %bb.27: ; %bb67.i			; GLOBALNESS0-NEXT: ; %bb.25: ; %bb67.i
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[58:59]			; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[58:59]
	; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_1			; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_27
	; GLOBALNESS0-NEXT: ; %bb.28: ; %bb69.i			; GLOBALNESS0-NEXT: ; %bb.26: ; %bb69.i
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: v_mov_b32_e32 v41, v40			; GLOBALNESS0-NEXT: v_mov_b32_e32 v41, v40
	; GLOBALNESS0-NEXT: v_pk_mov_b32 v[2:3], 0, 0			; GLOBALNESS0-NEXT: v_pk_mov_b32 v[2:3], 0, 0
	; GLOBALNESS0-NEXT: global_store_dwordx2 v[2:3], v[40:41], off			; GLOBALNESS0-NEXT: global_store_dwordx2 v[2:3], v[40:41], off
	; GLOBALNESS0-NEXT: s_branch .LBB1_1			; GLOBALNESS0-NEXT: .LBB1_27: ; %bb70.i
	; GLOBALNESS0-NEXT: .LBB1_29: ; %bb73.i			; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_4 Depth=1			; GLOBALNESS0-NEXT: s_and_b64 vcc, exec, s[60:61]
				; GLOBALNESS0-NEXT: s_cbranch_vccnz .LBB1_29
				; GLOBALNESS0-NEXT: ; %bb.28: ; %bb73.i
				; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
	; GLOBALNESS0-NEXT: v_mov_b32_e32 v41, v40			; GLOBALNESS0-NEXT: v_mov_b32_e32 v41, v40
	; GLOBALNESS0-NEXT: v_pk_mov_b32 v[2:3], 0, 0			; GLOBALNESS0-NEXT: v_pk_mov_b32 v[2:3], 0, 0
	; GLOBALNESS0-NEXT: global_store_dwordx2 v[2:3], v[40:41], off			; GLOBALNESS0-NEXT: global_store_dwordx2 v[2:3], v[40:41], off
				; GLOBALNESS0-NEXT: .LBB1_29: ; %Flow15
				; GLOBALNESS0-NEXT: ; in Loop: Header=BB1_3 Depth=1
				; GLOBALNESS0-NEXT: s_or_b64 exec, exec, s[4:5]
				; GLOBALNESS0-NEXT: s_mov_b64 s[6:7], 0
				; GLOBALNESS0-NEXT: ; implicit-def: $sgpr4_sgpr5
	; GLOBALNESS0-NEXT: s_branch .LBB1_2			; GLOBALNESS0-NEXT: s_branch .LBB1_2
	; GLOBALNESS0-NEXT: .LBB1_30: ; %loop.exit.guard			; GLOBALNESS0-NEXT: .LBB1_30: ; %loop.exit.guard
	; GLOBALNESS0-NEXT: s_andn2_b64 vcc, exec, s[4:5]			; GLOBALNESS0-NEXT: s_andn2_b64 vcc, exec, s[4:5]
	; GLOBALNESS0-NEXT: s_mov_b64 s[4:5], -1			; GLOBALNESS0-NEXT: s_mov_b64 s[4:5], -1
	; GLOBALNESS0-NEXT: s_cbranch_vccz .LBB1_32			; GLOBALNESS0-NEXT: s_cbranch_vccz .LBB1_32
	; GLOBALNESS0-NEXT: ; %bb.31: ; %bb7.i.i			; GLOBALNESS0-NEXT: ; %bb.31: ; %bb7.i.i
	; GLOBALNESS0-NEXT: s_add_u32 s8, s38, 40			; GLOBALNESS0-NEXT: s_add_u32 s8, s38, 40
	; GLOBALNESS0-NEXT: s_addc_u32 s9, s39, 0			; GLOBALNESS0-NEXT: s_addc_u32 s9, s39, 0
	[145 lines not shown]

llvm/test/CodeGen/AMDGPU/uniform-phi-with-undef.ll

	[26 lines not shown]
	; GCN-NEXT: v_mul_f32_e64 v4, v3, v2			; GCN-NEXT: v_mul_f32_e64 v4, v3, v2
	; GCN-NEXT: v_fma_f32 v5, -v1, v4, v3			; GCN-NEXT: v_fma_f32 v5, -v1, v4, v3
	; GCN-NEXT: v_fmac_f32_e64 v4, v5, v2			; GCN-NEXT: v_fmac_f32_e64 v4, v5, v2
	; GCN-NEXT: v_fma_f32 v1, -v1, v4, v3			; GCN-NEXT: v_fma_f32 v1, -v1, v4, v3
	; GCN-NEXT: v_div_fmas_f32 v1, v1, v2, v4			; GCN-NEXT: v_div_fmas_f32 v1, v1, v2, v4
	; GCN-NEXT: v_div_fixup_f32 v0, v1, s2, v0			; GCN-NEXT: v_div_fixup_f32 v0, v1, s2, v0
	; GCN-NEXT: .LBB0_2: ; %end			; GCN-NEXT: .LBB0_2: ; %end
	; GCN-NEXT: s_or_b32 exec_lo, exec_lo, s1			; GCN-NEXT: s_or_b32 exec_lo, exec_lo, s1
				; GCN-NEXT: ; %bb.3: ; %end
	; GCN-NEXT: v_add_f32_e64 v0, v0, s0			; GCN-NEXT: v_add_f32_e64 v0, v0, s0
	; GCN-NEXT: ; return to shader part epilog			; GCN-NEXT: ; return to shader part epilog
	entry:			entry:
	%cc = icmp slt i32 %y, %x			%cc = icmp slt i32 %y, %x
	br i1 %cc, label %if, label %end			br i1 %cc, label %if, label %end

	if:			if:
	%v.if = fdiv float %v, 2.0			%v.if = fdiv float %v, 2.0
	[10 lines not shown]

llvm/test/CodeGen/AMDGPU/vgpr-spill-placement-issue61083.ll

	[26 lines not shown]
	; CHECK-NEXT: s_mov_b64 s[4:5], exec			; CHECK-NEXT: s_mov_b64 s[4:5], exec
	; CHECK-NEXT: v_writelane_b32 v1, s4, 0			; CHECK-NEXT: v_writelane_b32 v1, s4, 0
	; CHECK-NEXT: v_writelane_b32 v1, s5, 1			; CHECK-NEXT: v_writelane_b32 v1, s5, 1
	; CHECK-NEXT: s_and_b64 s[4:5], s[4:5], s[6:7]			; CHECK-NEXT: s_and_b64 s[4:5], s[4:5], s[6:7]
	; CHECK-NEXT: s_mov_b64 exec, s[4:5]			; CHECK-NEXT: s_mov_b64 exec, s[4:5]
	; CHECK-NEXT: s_cbranch_execz .LBB0_2			; CHECK-NEXT: s_cbranch_execz .LBB0_2
	; CHECK-NEXT: ; %bb.1: ; %bb193			; CHECK-NEXT: ; %bb.1: ; %bb193
	; CHECK-NEXT: .LBB0_2: ; %bb194			; CHECK-NEXT: .LBB0_2: ; %bb194
	; CHECK-NEXT: buffer_load_dword v0, off, s[0:3], 0 offset:4 ; 4-byte Folded Reload
	; CHECK-NEXT: v_readlane_b32 s4, v1, 0			; CHECK-NEXT: v_readlane_b32 s4, v1, 0
	; CHECK-NEXT: v_readlane_b32 s5, v1, 1			; CHECK-NEXT: v_readlane_b32 s5, v1, 1
	; CHECK-NEXT: s_or_b64 exec, exec, s[4:5]			; CHECK-NEXT: s_or_b64 exec, exec, s[4:5]
				; CHECK-NEXT: ; %bb.3: ; %bb194
				; CHECK-NEXT: buffer_load_dword v0, off, s[0:3], 0 offset:4 ; 4-byte Folded Reload
	; CHECK-NEXT: s_mov_b32 s4, 0			; CHECK-NEXT: s_mov_b32 s4, 0
	; CHECK-NEXT: s_waitcnt vmcnt(0)			; CHECK-NEXT: s_waitcnt vmcnt(0)
	; CHECK-NEXT: v_cmp_ne_u16_e64 s[4:5], v0, s4			; CHECK-NEXT: v_cmp_ne_u16_e64 s[4:5], v0, s4
	; CHECK-NEXT: s_and_b64 vcc, exec, s[4:5]			; CHECK-NEXT: s_and_b64 vcc, exec, s[4:5]
	; CHECK-NEXT: s_cbranch_vccnz .LBB0_4			; CHECK-NEXT: s_cbranch_vccnz .LBB0_5
	; CHECK-NEXT: ; %bb.3: ; %bb201			; CHECK-NEXT: ; %bb.4: ; %bb201
	; CHECK-NEXT: buffer_load_dword v2, off, s[0:3], 0 offset:4 ; 4-byte Folded Reload			; CHECK-NEXT: buffer_load_dword v2, off, s[0:3], 0 offset:4 ; 4-byte Folded Reload
	; CHECK-NEXT: s_getpc_b64 s[4:5]			; CHECK-NEXT: s_getpc_b64 s[4:5]
	; CHECK-NEXT: s_add_u32 s4, s4, V2@rel32@lo+4			; CHECK-NEXT: s_add_u32 s4, s4, V2@rel32@lo+4
	; CHECK-NEXT: s_addc_u32 s5, s5, V2@rel32@hi+12			; CHECK-NEXT: s_addc_u32 s5, s5, V2@rel32@hi+12
	; CHECK-NEXT: v_mov_b32_e32 v0, 0			; CHECK-NEXT: v_mov_b32_e32 v0, 0
	; CHECK-NEXT: s_waitcnt vmcnt(0)			; CHECK-NEXT: s_waitcnt vmcnt(0)
	; CHECK-NEXT: global_store_short v0, v2, s[4:5]			; CHECK-NEXT: global_store_short v0, v2, s[4:5]
	; CHECK-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)			; CHECK-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
	; CHECK-NEXT: s_barrier			; CHECK-NEXT: s_barrier
	; CHECK-NEXT: s_trap 2			; CHECK-NEXT: s_trap 2
	; CHECK-NEXT: ; divergent unreachable			; CHECK-NEXT: ; divergent unreachable
	; CHECK-NEXT: .LBB0_4: ; %UnifiedReturnBlock			; CHECK-NEXT: .LBB0_5: ; %UnifiedReturnBlock
	; CHECK-NEXT: s_endpgm			; CHECK-NEXT: s_endpgm
	bb:			bb:
	%i10 = tail call i32 @llvm.amdgcn.workitem.id.x()			%i10 = tail call i32 @llvm.amdgcn.workitem.id.x()
	%i13 = tail call align 4 dereferenceable(64) ptr addrspace(4) @llvm.amdgcn.dispatch.ptr()			%i13 = tail call align 4 dereferenceable(64) ptr addrspace(4) @llvm.amdgcn.dispatch.ptr()
	%i14 = getelementptr i8, ptr addrspace(4) %i13, i64 4			%i14 = getelementptr i8, ptr addrspace(4) %i13, i64 4
	%i15 = load i16, ptr addrspace(4) %i14, align 4			%i15 = load i16, ptr addrspace(4) %i14, align 4
	%i22 = icmp eq i32 %i10, 0			%i22 = icmp eq i32 %i10, 0
	store i8 0, ptr addrspace(3) @Q			store i8 0, ptr addrspace(3) @Q
	[25 lines not shown]

llvm/test/CodeGen/AMDGPU/wave32.ll

	[350 lines not shown]
	endif:			endif:
	ret void			ret void
	}			}

	define amdgpu_kernel void @test_loop_with_if(ptr addrspace(1) %arg) #0 {			define amdgpu_kernel void @test_loop_with_if(ptr addrspace(1) %arg) #0 {
	; GFX1032-LABEL: test_loop_with_if:			; GFX1032-LABEL: test_loop_with_if:
	; GFX1032: ; %bb.0: ; %bb			; GFX1032: ; %bb.0: ; %bb
	; GFX1032-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24			; GFX1032-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
	; GFX1032-NEXT: v_mov_b32_e32 v1, 0			; GFX1032-NEXT: v_mov_b32_e32 v3, 0
	; GFX1032-NEXT: s_mov_b32 s2, 0			; GFX1032-NEXT: s_mov_b32 s2, 0
	; GFX1032-NEXT: ; implicit-def: $vgpr2_vgpr3			; GFX1032-NEXT: ; implicit-def: $vgpr1_vgpr2
	; GFX1032-NEXT: s_branch .LBB10_2			; GFX1032-NEXT: s_branch .LBB10_2
	; GFX1032-NEXT: .LBB10_1: ; %bb13			; GFX1032-NEXT: .LBB10_1: ; %bb13
	; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1032-NEXT: s_waitcnt_depctr 0xffe3			; GFX1032-NEXT: s_waitcnt_depctr 0xffe3
	; GFX1032-NEXT: s_or_b32 exec_lo, exec_lo, s4			; GFX1032-NEXT: s_or_b32 exec_lo, exec_lo, s4
	; GFX1032-NEXT: v_cmp_lt_i32_e32 vcc_lo, 0xfe, v4			; GFX1032-NEXT: v_cmp_lt_i32_e32 vcc_lo, 0xfe, v4
	; GFX1032-NEXT: v_add_nc_u32_e32 v1, 1, v4			; GFX1032-NEXT: v_add_nc_u32_e32 v3, 1, v4
	; GFX1032-NEXT: s_or_b32 s2, vcc_lo, s2			; GFX1032-NEXT: s_or_b32 s2, vcc_lo, s2
	; GFX1032-NEXT: s_andn2_b32 exec_lo, exec_lo, s2			; GFX1032-NEXT: s_andn2_b32 exec_lo, exec_lo, s2
	; GFX1032-NEXT: s_cbranch_execz .LBB10_8			; GFX1032-NEXT: s_cbranch_execz .LBB10_8
	; GFX1032-NEXT: .LBB10_2: ; %bb2			; GFX1032-NEXT: .LBB10_2: ; %bb2
	; GFX1032-NEXT: ; =>This Inner Loop Header: Depth=1			; GFX1032-NEXT: ; =>This Inner Loop Header: Depth=1
	; GFX1032-NEXT: v_cmp_ge_i32_e64 s4, v1, v0			; GFX1032-NEXT: v_cmp_ge_i32_e64 s4, v3, v0
	; GFX1032-NEXT: v_cmp_lt_i32_e32 vcc_lo, v1, v0			; GFX1032-NEXT: v_cmp_lt_i32_e32 vcc_lo, v3, v0
	; GFX1032-NEXT: s_mov_b32 s3, 0			; GFX1032-NEXT: s_mov_b32 s3, 0
	; GFX1032-NEXT: s_and_saveexec_b32 s5, vcc_lo			; GFX1032-NEXT: s_and_saveexec_b32 s5, vcc_lo
	; GFX1032-NEXT: s_cbranch_execz .LBB10_4			; GFX1032-NEXT: s_cbranch_execz .LBB10_4
	; GFX1032-NEXT: ; %bb.3: ; %bb5			; GFX1032-NEXT: ; %bb.3: ; %bb5
	; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1032-NEXT: v_ashrrev_i32_e32 v2, 31, v1			; GFX1032-NEXT: v_ashrrev_i32_e32 v4, 31, v3
	; GFX1032-NEXT: s_andn2_b32 s4, s4, exec_lo			; GFX1032-NEXT: s_andn2_b32 s4, s4, exec_lo
	; GFX1032-NEXT: s_mov_b32 s3, exec_lo			; GFX1032-NEXT: s_mov_b32 s3, exec_lo
	; GFX1032-NEXT: v_lshlrev_b64 v[2:3], 2, v[1:2]			; GFX1032-NEXT: v_lshlrev_b64 v[1:2], 2, v[3:4]
	; GFX1032-NEXT: s_waitcnt lgkmcnt(0)			; GFX1032-NEXT: s_waitcnt lgkmcnt(0)
	; GFX1032-NEXT: v_add_co_u32 v2, vcc_lo, s0, v2			; GFX1032-NEXT: v_add_co_u32 v1, vcc_lo, s0, v1
	; GFX1032-NEXT: v_add_co_ci_u32_e32 v3, vcc_lo, s1, v3, vcc_lo			; GFX1032-NEXT: v_add_co_ci_u32_e32 v2, vcc_lo, s1, v2, vcc_lo
	; GFX1032-NEXT: global_load_dword v4, v[2:3], off			; GFX1032-NEXT: global_load_dword v4, v[1:2], off
	; GFX1032-NEXT: s_waitcnt vmcnt(0)			; GFX1032-NEXT: s_waitcnt vmcnt(0)
	; GFX1032-NEXT: v_cmp_gt_i32_e32 vcc_lo, 11, v4			; GFX1032-NEXT: v_cmp_gt_i32_e32 vcc_lo, 11, v4
	; GFX1032-NEXT: s_and_b32 s6, vcc_lo, exec_lo			; GFX1032-NEXT: s_and_b32 s6, vcc_lo, exec_lo
	; GFX1032-NEXT: s_or_b32 s4, s4, s6			; GFX1032-NEXT: s_or_b32 s4, s4, s6
	; GFX1032-NEXT: .LBB10_4: ; %Flow			; GFX1032-NEXT: .LBB10_4: ; %Flow
	; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1032-NEXT: s_or_b32 exec_lo, exec_lo, s5			; GFX1032-NEXT: s_or_b32 exec_lo, exec_lo, s5
	; GFX1032-NEXT: ; implicit-def: $vgpr4			; GFX1032-NEXT: ; implicit-def: $vgpr4
	; GFX1032-NEXT: s_and_saveexec_b32 s5, s4			; GFX1032-NEXT: s_and_saveexec_b32 s5, s4
	; GFX1032-NEXT: s_xor_b32 s4, exec_lo, s5			; GFX1032-NEXT: s_xor_b32 s4, exec_lo, s5
	; GFX1032-NEXT: ; %bb.5: ; %bb11			; GFX1032-NEXT: ; %bb.5: ; %bb11
	; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1032-NEXT: v_lshrrev_b32_e32 v4, 31, v1			; GFX1032-NEXT: v_lshrrev_b32_e32 v4, 31, v3
	; GFX1032-NEXT: s_andn2_b32 s3, s3, exec_lo			; GFX1032-NEXT: s_andn2_b32 s3, s3, exec_lo
	; GFX1032-NEXT: v_add_nc_u32_e32 v4, v1, v4			; GFX1032-NEXT: v_add_nc_u32_e32 v4, v3, v4
	; GFX1032-NEXT: v_ashrrev_i32_e32 v4, 1, v4			; GFX1032-NEXT: v_ashrrev_i32_e32 v4, 1, v4
	; GFX1032-NEXT: ; %bb.6: ; %Flow1			; GFX1032-NEXT: ; %bb.6: ; %Flow1
	; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1032-NEXT: s_or_b32 exec_lo, exec_lo, s4			; GFX1032-NEXT: s_or_b32 exec_lo, exec_lo, s4
	; GFX1032-NEXT: s_and_saveexec_b32 s4, s3			; GFX1032-NEXT: s_and_saveexec_b32 s4, s3
	; GFX1032-NEXT: s_cbranch_execz .LBB10_1			; GFX1032-NEXT: s_cbranch_execz .LBB10_1
	; GFX1032-NEXT: ; %bb.7: ; %bb10			; GFX1032-NEXT: ; %bb.7: ; %bb10
	; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1032-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1032-NEXT: v_mov_b32_e32 v4, v1			; GFX1032-NEXT: v_mov_b32_e32 v4, v3
	; GFX1032-NEXT: global_store_dword v[2:3], v0, off			; GFX1032-NEXT: global_store_dword v[1:2], v0, off
	; GFX1032-NEXT: s_branch .LBB10_1			; GFX1032-NEXT: s_branch .LBB10_1
	; GFX1032-NEXT: .LBB10_8: ; %bb1			; GFX1032-NEXT: .LBB10_8: ; %bb1
	; GFX1032-NEXT: s_endpgm			; GFX1032-NEXT: s_endpgm
	;			;
	; GFX1064-LABEL: test_loop_with_if:			; GFX1064-LABEL: test_loop_with_if:
	; GFX1064: ; %bb.0: ; %bb			; GFX1064: ; %bb.0: ; %bb
	; GFX1064-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24			; GFX1064-NEXT: s_load_dwordx2 s[0:1], s[0:1], 0x24
	; GFX1064-NEXT: v_mov_b32_e32 v1, 0			; GFX1064-NEXT: v_mov_b32_e32 v3, 0
	; GFX1064-NEXT: s_mov_b64 s[2:3], 0			; GFX1064-NEXT: s_mov_b64 s[2:3], 0
	; GFX1064-NEXT: ; implicit-def: $vgpr2_vgpr3			; GFX1064-NEXT: ; implicit-def: $vgpr1_vgpr2
	; GFX1064-NEXT: s_branch .LBB10_2			; GFX1064-NEXT: s_branch .LBB10_2
	; GFX1064-NEXT: .LBB10_1: ; %bb13			; GFX1064-NEXT: .LBB10_1: ; %bb13
	; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1064-NEXT: s_waitcnt_depctr 0xffe3			; GFX1064-NEXT: s_waitcnt_depctr 0xffe3
	; GFX1064-NEXT: s_or_b64 exec, exec, s[6:7]			; GFX1064-NEXT: s_or_b64 exec, exec, s[6:7]
	; GFX1064-NEXT: v_cmp_lt_i32_e32 vcc, 0xfe, v4			; GFX1064-NEXT: v_cmp_lt_i32_e32 vcc, 0xfe, v4
	; GFX1064-NEXT: v_add_nc_u32_e32 v1, 1, v4			; GFX1064-NEXT: v_add_nc_u32_e32 v3, 1, v4
	; GFX1064-NEXT: s_or_b64 s[2:3], vcc, s[2:3]			; GFX1064-NEXT: s_or_b64 s[2:3], vcc, s[2:3]
	; GFX1064-NEXT: s_andn2_b64 exec, exec, s[2:3]			; GFX1064-NEXT: s_andn2_b64 exec, exec, s[2:3]
	; GFX1064-NEXT: s_cbranch_execz .LBB10_8			; GFX1064-NEXT: s_cbranch_execz .LBB10_8
	; GFX1064-NEXT: .LBB10_2: ; %bb2			; GFX1064-NEXT: .LBB10_2: ; %bb2
	; GFX1064-NEXT: ; =>This Inner Loop Header: Depth=1			; GFX1064-NEXT: ; =>This Inner Loop Header: Depth=1
	; GFX1064-NEXT: v_cmp_ge_i32_e64 s[6:7], v1, v0			; GFX1064-NEXT: v_cmp_ge_i32_e64 s[6:7], v3, v0
	; GFX1064-NEXT: v_cmp_lt_i32_e32 vcc, v1, v0			; GFX1064-NEXT: v_cmp_lt_i32_e32 vcc, v3, v0
	; GFX1064-NEXT: s_mov_b64 s[4:5], 0			; GFX1064-NEXT: s_mov_b64 s[4:5], 0
	; GFX1064-NEXT: s_and_saveexec_b64 s[8:9], vcc			; GFX1064-NEXT: s_and_saveexec_b64 s[8:9], vcc
	; GFX1064-NEXT: s_cbranch_execz .LBB10_4			; GFX1064-NEXT: s_cbranch_execz .LBB10_4
	; GFX1064-NEXT: ; %bb.3: ; %bb5			; GFX1064-NEXT: ; %bb.3: ; %bb5
	; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1064-NEXT: v_ashrrev_i32_e32 v2, 31, v1			; GFX1064-NEXT: v_ashrrev_i32_e32 v4, 31, v3
	; GFX1064-NEXT: s_andn2_b64 s[6:7], s[6:7], exec			; GFX1064-NEXT: s_andn2_b64 s[6:7], s[6:7], exec
	; GFX1064-NEXT: s_mov_b64 s[4:5], exec			; GFX1064-NEXT: s_mov_b64 s[4:5], exec
	; GFX1064-NEXT: v_lshlrev_b64 v[2:3], 2, v[1:2]			; GFX1064-NEXT: v_lshlrev_b64 v[1:2], 2, v[3:4]
	; GFX1064-NEXT: s_waitcnt lgkmcnt(0)			; GFX1064-NEXT: s_waitcnt lgkmcnt(0)
	; GFX1064-NEXT: v_add_co_u32 v2, vcc, s0, v2			; GFX1064-NEXT: v_add_co_u32 v1, vcc, s0, v1
	; GFX1064-NEXT: v_add_co_ci_u32_e32 v3, vcc, s1, v3, vcc			; GFX1064-NEXT: v_add_co_ci_u32_e32 v2, vcc, s1, v2, vcc
	; GFX1064-NEXT: global_load_dword v4, v[2:3], off			; GFX1064-NEXT: global_load_dword v4, v[1:2], off
	; GFX1064-NEXT: s_waitcnt vmcnt(0)			; GFX1064-NEXT: s_waitcnt vmcnt(0)
	; GFX1064-NEXT: v_cmp_gt_i32_e32 vcc, 11, v4			; GFX1064-NEXT: v_cmp_gt_i32_e32 vcc, 11, v4
	; GFX1064-NEXT: s_and_b64 s[10:11], vcc, exec			; GFX1064-NEXT: s_and_b64 s[10:11], vcc, exec
	; GFX1064-NEXT: s_or_b64 s[6:7], s[6:7], s[10:11]			; GFX1064-NEXT: s_or_b64 s[6:7], s[6:7], s[10:11]
	; GFX1064-NEXT: .LBB10_4: ; %Flow			; GFX1064-NEXT: .LBB10_4: ; %Flow
	; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1064-NEXT: s_or_b64 exec, exec, s[8:9]			; GFX1064-NEXT: s_or_b64 exec, exec, s[8:9]
	; GFX1064-NEXT: ; implicit-def: $vgpr4			; GFX1064-NEXT: ; implicit-def: $vgpr4
	; GFX1064-NEXT: s_and_saveexec_b64 s[8:9], s[6:7]			; GFX1064-NEXT: s_and_saveexec_b64 s[8:9], s[6:7]
	; GFX1064-NEXT: s_xor_b64 s[6:7], exec, s[8:9]			; GFX1064-NEXT: s_xor_b64 s[6:7], exec, s[8:9]
	; GFX1064-NEXT: ; %bb.5: ; %bb11			; GFX1064-NEXT: ; %bb.5: ; %bb11
	; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1064-NEXT: v_lshrrev_b32_e32 v4, 31, v1			; GFX1064-NEXT: v_lshrrev_b32_e32 v4, 31, v3
	; GFX1064-NEXT: s_andn2_b64 s[4:5], s[4:5], exec			; GFX1064-NEXT: s_andn2_b64 s[4:5], s[4:5], exec
	; GFX1064-NEXT: v_add_nc_u32_e32 v4, v1, v4			; GFX1064-NEXT: v_add_nc_u32_e32 v4, v3, v4
	; GFX1064-NEXT: v_ashrrev_i32_e32 v4, 1, v4			; GFX1064-NEXT: v_ashrrev_i32_e32 v4, 1, v4
	; GFX1064-NEXT: ; %bb.6: ; %Flow1			; GFX1064-NEXT: ; %bb.6: ; %Flow1
	; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1064-NEXT: s_or_b64 exec, exec, s[6:7]			; GFX1064-NEXT: s_or_b64 exec, exec, s[6:7]
	; GFX1064-NEXT: s_and_saveexec_b64 s[6:7], s[4:5]			; GFX1064-NEXT: s_and_saveexec_b64 s[6:7], s[4:5]
	; GFX1064-NEXT: s_cbranch_execz .LBB10_1			; GFX1064-NEXT: s_cbranch_execz .LBB10_1
	; GFX1064-NEXT: ; %bb.7: ; %bb10			; GFX1064-NEXT: ; %bb.7: ; %bb10
	; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1			; GFX1064-NEXT: ; in Loop: Header=BB10_2 Depth=1
	; GFX1064-NEXT: v_mov_b32_e32 v4, v1			; GFX1064-NEXT: v_mov_b32_e32 v4, v3
	; GFX1064-NEXT: global_store_dword v[2:3], v0, off			; GFX1064-NEXT: global_store_dword v[1:2], v0, off
	; GFX1064-NEXT: s_branch .LBB10_1			; GFX1064-NEXT: s_branch .LBB10_1
	; GFX1064-NEXT: .LBB10_8: ; %bb1			; GFX1064-NEXT: .LBB10_8: ; %bb1
	; GFX1064-NEXT: s_endpgm			; GFX1064-NEXT: s_endpgm
	bb:			bb:
	%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()			%tmp = tail call i32 @llvm.amdgcn.workitem.id.x()
	br label %bb2			br label %bb2

	bb1:			bb1:
	[2,486 lines not shown]

llvm/test/CodeGen/AMDGPU/wwm-reserved-spill.ll

	[205 lines not shown]
	; GFX9-O0-NEXT: s_not_b64 exec, exec			; GFX9-O0-NEXT: s_not_b64 exec, exec
	; GFX9-O0-NEXT: s_or_saveexec_b64 s[34:35], -1			; GFX9-O0-NEXT: s_or_saveexec_b64 s[34:35], -1
	; GFX9-O0-NEXT: v_mov_b32_dpp v1, v2 row_bcast:31 row_mask:0xc bank_mask:0xf			; GFX9-O0-NEXT: v_mov_b32_dpp v1, v2 row_bcast:31 row_mask:0xc bank_mask:0xf
	; GFX9-O0-NEXT: v_add_u32_e64 v1, v2, v1			; GFX9-O0-NEXT: v_add_u32_e64 v1, v2, v1
	; GFX9-O0-NEXT: s_mov_b64 exec, s[34:35]			; GFX9-O0-NEXT: s_mov_b64 exec, s[34:35]
	; GFX9-O0-NEXT: v_mov_b32_e32 v0, v1			; GFX9-O0-NEXT: v_mov_b32_e32 v0, v1
	; GFX9-O0-NEXT: buffer_store_dword v0, off, s[0:3], s32 ; 4-byte Folded Spill			; GFX9-O0-NEXT: buffer_store_dword v0, off, s[0:3], s32 ; 4-byte Folded Spill
	; GFX9-O0-NEXT: .LBB1_2: ; %merge			; GFX9-O0-NEXT: .LBB1_2: ; %merge
				; GFX9-O0-NEXT: v_readlane_b32 s34, v3, 4
				; GFX9-O0-NEXT: v_readlane_b32 s35, v3, 5
				; GFX9-O0-NEXT: s_or_b64 exec, exec, s[34:35]
				; GFX9-O0-NEXT: ; %bb.3: ; %merge
	; GFX9-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:4 ; 4-byte Folded Reload			; GFX9-O0-NEXT: buffer_load_dword v0, off, s[0:3], s32 offset:4 ; 4-byte Folded Reload
	; GFX9-O0-NEXT: s_nop 0
	; GFX9-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 ; 4-byte Folded Reload			; GFX9-O0-NEXT: buffer_load_dword v4, off, s[0:3], s32 ; 4-byte Folded Reload
	; GFX9-O0-NEXT: v_readlane_b32 s36, v3, 4
	; GFX9-O0-NEXT: v_readlane_b32 s37, v3, 5
	; GFX9-O0-NEXT: s_or_b64 exec, exec, s[36:37]
	; GFX9-O0-NEXT: v_readlane_b32 s38, v3, 0			; GFX9-O0-NEXT: v_readlane_b32 s38, v3, 0
	; GFX9-O0-NEXT: v_readlane_b32 s39, v3, 1			; GFX9-O0-NEXT: v_readlane_b32 s39, v3, 1
	; GFX9-O0-NEXT: v_readlane_b32 s34, v3, 2			; GFX9-O0-NEXT: v_readlane_b32 s34, v3, 2
	; GFX9-O0-NEXT: v_readlane_b32 s35, v3, 3			; GFX9-O0-NEXT: v_readlane_b32 s35, v3, 3
	; GFX9-O0-NEXT: s_waitcnt vmcnt(0)			; GFX9-O0-NEXT: s_waitcnt vmcnt(0)
	; GFX9-O0-NEXT: v_cmp_eq_u32_e64 s[36:37], v0, v4			; GFX9-O0-NEXT: v_cmp_eq_u32_e64 s[36:37], v0, v4
	; GFX9-O0-NEXT: v_cndmask_b32_e64 v0, 0, 1, s[36:37]			; GFX9-O0-NEXT: v_cndmask_b32_e64 v0, 0, 1, s[36:37]
	; GFX9-O0-NEXT: s_mov_b32 s36, 1			; GFX9-O0-NEXT: s_mov_b32 s36, 1
	[1,150 lines not shown]

Revision Contents

Diff 528807

llvm/lib/Target/AMDGPU/SILowerControlFlow.cpp

llvm/test/CodeGen/AMDGPU/GlobalISel/llvm.amdgcn.wqm.demote.ll

llvm/test/CodeGen/AMDGPU/block-should-not-be-in-alive-blocks.mir

llvm/test/CodeGen/AMDGPU/branch-folding-implicit-def-subreg.ll

llvm/test/CodeGen/AMDGPU/collapse-endcf.ll

llvm/test/CodeGen/AMDGPU/collapse-endcf.mir

llvm/test/CodeGen/AMDGPU/control-flow-fastregalloc.ll

llvm/test/CodeGen/AMDGPU/global-atomics-fp.ll

llvm/test/CodeGen/AMDGPU/mubuf-legalize-operands-non-ptr-intrinsics.ll

llvm/test/CodeGen/AMDGPU/mubuf-legalize-operands.ll

llvm/test/CodeGen/AMDGPU/tuple-allocation-failure.ll

llvm/test/CodeGen/AMDGPU/uniform-phi-with-undef.ll

llvm/test/CodeGen/AMDGPU/vgpr-spill-placement-issue61083.ll

llvm/test/CodeGen/AMDGPU/wave32.ll

llvm/test/CodeGen/AMDGPU/wwm-reserved-spill.ll
