This is an archive of the discontinued LLVM Phabricator instance.

[RA] Split a virtual register in cold blocks if it is not assigned preferred physical register
ClosedPublic

Authored by Carrot on Jul 27 2023, 2:40 PM.

Details

Summary

If a virtual register is not assigned its preferred physical register, some COPY instructions will be turned into real register move instructions. In this case we can try to split the virtual register in colder blocks; if successful, the original COPY instructions can be deleted, and the new COPY instructions in the colder blocks will be generated as register move instructions. This results in fewer dynamic register move instructions executed.

The new test case split-reg-with-hint.ll gives an example: without this patch the hot path contains 24 instructions; with this patch it contains only 4.

Diff Detail

Event Timeline

Carrot created this revision.Jul 27 2023, 2:40 PM
Herald added a reviewer: MaskRay. · View Herald Transcript
Herald added a project: Restricted Project. · View Herald Transcript
Carrot requested review of this revision.Jul 27 2023, 2:40 PM
Herald added a project: Restricted Project. · View Herald TranscriptJul 27 2023, 2:40 PM

Interesting! This is showing some neat improvements enabling more shrink-wrapping in the test-cases.

Though I suspect changes of the splitting algorithm could trigger regressions (given it's all heuristics). So I would feel better if this change was backed up by more statistics. (Something like compiling llvm-test-suite and collecting regalloc stats or remarks?) and/or report any other benchmarks you did?

arsenm added a subscriber: arsenm.Jul 27 2023, 2:52 PM
arsenm added inline comments.
llvm/test/CodeGen/AMDGPU/ran-out-of-sgprs-allocation-failure.mir
105

This is a huge improvement (though this heuristic is probably not the reason it should have split)

MatzeB added a reviewer: wmi.Jul 27 2023, 2:55 PM
MatzeB added inline comments.Jul 27 2023, 3:04 PM
llvm/test/DebugInfo/X86/live-debug-values.ll
31–32

uhoh...

Interesting! This is showing some neat improvements enabling more shrink-wrapping in the test-cases.

Though I suspect changes of the splitting algorithm could trigger regressions (given it's all heuristics). So I would feel better if this change was backed up by more statistics. (Something like compiling llvm-test-suite and collecting regalloc stats or remarks?) and/or report any other benchmarks you did?

Regressions are indeed observed, e.g. in ARM/fptosi-sat-scalar.ll.
Internal analysis shows it has a big impact on tcmalloc and protobuf parsing. I will run it on SPEC INT.

llvm/test/CodeGen/AMDGPU/ran-out-of-sgprs-allocation-failure.mir
105

Thank you for the confirmation; I really don't know what it is doing.

llvm/test/DebugInfo/X86/live-debug-values.ll
31–32

I guess the debug information is not maintained well when live range splitting occurs.

Carrot added inline comments.Jul 27 2023, 3:29 PM
llvm/test/CodeGen/AMDGPU/ran-out-of-sgprs-allocation-failure.mir
105

I really don't know what it is doing.

lkail added a subscriber: lkail.Jul 27 2023, 10:18 PM
Carrot updated this revision to Diff 547287.Aug 4 2023, 11:30 AM

Improve the cost computation when the VirtReg is assigned a non-hint physical register; this fixed several regressions.

qcolombet requested changes to this revision.Aug 8 2023, 4:26 AM
qcolombet added inline comments.
llvm/lib/CodeGen/RegAllocGreedy.cpp
2404

Splitting decisions are supposed to be taken later (trySplit), this breaks the general algorithm's flow.

This revision now requires changes to proceed.Aug 8 2023, 4:26 AM
qcolombet added inline comments.Aug 8 2023, 5:16 AM
llvm/lib/CodeGen/RegAllocGreedy.cpp
2404

Concretely, I'm asking if you could do this kind of splitting in the splitting heuristic.
You may want to add a new stage in the algorithm (and that'll come at a compile time cost.)

Carrot added inline comments.Aug 8 2023, 10:33 AM
llvm/lib/CodeGen/RegAllocGreedy.cpp
2404

The problem is that we need to decide between a physical register and a split. If we move this to trySplit, we miss the possibility of assigning a physical register to the whole virtual register. trySplit compares splits against different physical registers; here we need to compare a split against a non-split.
Function tryAssignCSRFirstTime does a similar thing. trySplitAroundHintReg is even more special because it splits against only one physical register.

Interesting! This is showing some neat improvements enabling more shrink-wrapping in the test-cases.

Though I suspect changes of the splitting algorithm could trigger regressions (given it's all heuristics). So I would feel better if this change was backed up by more statistics. (Something like compiling llvm-test-suite and collecting regalloc stats or remarks?) and/or report any other benchmarks you did?

Following are the regalloc stats when compiling SPECINT2006 with FDO:

        stats                                            old       new
Number of copies inserted for splitting                103020    121524
Number of splits finished                               19248     24346
Number of split global live ranges                      11068     16305
Number of new live ranges queued                       157359    172528
Number of rematerialized defs for splitting              2756      5536
Number of splits that were simple                        8605     12145

Number of registers assigned                           649170    663979
Number of instructions deleted by DCE                   83051     84678
Number of interferences evicted                         21432     21320
Number of folded stack accesses                         10992     10914
Number of folded loads                                    623       624
Number of live ranges fractured by DCE                    406       390
Number of identity moves eliminated after rewriting    232947    240607
Number of dead lane conflicts tested                     7451      7503
Number of dead lane conflicts resolved                   3943      3957
Number of split local live ranges                        2280      2230
Number of instructions rematerialized                  122540    126312
Number of instructions re-materialized                  93828     94520
Number of reloads inserted                              50031     51344
Number of reloads removed                                 827       787
Number of rematerialized defs for spilling              25833     26237
Number of shrinkToUses called                           94489     95202
Number of spilled snippets                               1083      1100
Number of spill slots allocated                         15944     16053
Number of spilled live ranges                           28650     28776
Number of spills inserted                               22428     22448
Number of spills removed                                  878       778
Number of registers unassigned                          26406     26478
Number of instruction commuting performed                 640       650
Number of cross class joins performed                  159091    161155
Number of copies extended                                  15        14
Number of interval joins performed                     508329    506998
Number of register classes inflated                         4         4
Number of single use loads folded after DCE                32        32

The first 6 stats are much larger than the old values; these are direct results of the new splitting. The other stats are not significantly impacted.

The runtime performance of SPECINT2006 on my desktop is

without this patch
400.perlbench                                    9770        227       43.1 *
401.bzip2                                        9650        370       26.1 *
403.gcc                                          8050        194       41.6 *
429.mcf                                          9120        198       46.1 *
445.gobmk                                       10490        342       30.7 *
456.hmmer                                        9330        246       37.9 *
458.sjeng                                       12100        372       32.5 *
462.libquantum                                  20720        297       69.9 *
464.h264ref                                     22130        328       67.5 *
471.omnetpp                                      6250        233       26.8 *
473.astar                                        7020        294       23.9 *
483.xalancbmk                                    6900        159       43.4 *
 Est. SPECint(R)_base2006           Not Run
 Est. SPECint2006                                                      38.5


with this patch
400.perlbench                                    9770        227       43.0 *
401.bzip2                                        9650        373       25.8 *
403.gcc                                          8050        191       42.1 *
429.mcf                                          9120        200       45.6 *
445.gobmk                                       10490        346       30.3 *
456.hmmer                                        9330        249       37.5 *
458.sjeng                                       12100        376       32.2 *
462.libquantum                                  20720        310       66.8 *
464.h264ref                                     22130        324       68.2 *
471.omnetpp                                      6250        232       27.0 *
473.astar                                        7020        280       25.1 *
483.xalancbmk                                    6900        158       43.7 *
 Est. SPECint(R)_base2006           Not Run
 Est. SPECint2006                                                      38.5

The final scores are the same, but 462.libquantum and 473.astar show big differences. I double-checked them.

462.libquantum
It has a big variation from run to run. Because this patch mainly impacts the dynamic number of move instructions, I checked the executed instruction counts for the two versions; they are 2314346841712 vs 2314346834848, basically no difference. I also checked the hottest 6 functions; they are the same in both versions. So there is no regression in 462.libquantum.

473.astar
This benchmark has a very stable run time, and the performance difference is consistently reproduced. I also checked the dynamic instruction numbers; the new version executes fewer instructions. So this improvement is real.

input                    old             new
BigLakes2048.cfg    334603128498    322906773354
rivers.cfg          663093493368    653682859787

Ping.
473.astar is improved by 5%.

qcolombet added inline comments.Sep 13 2023, 8:50 AM
llvm/lib/CodeGen/RegAllocGreedy.cpp
2404

Function tryAssignCSRFirstTime does a similar thing.

True, but I feel it is justified there because these physregs are not free to use the first time.
I don't see this argument applying here.

if we move it to trySplit, we will miss the possibility of assigning a physical register to the whole virtual register.

Hmm, I don't get that part, we're going to split in trySplitAroundHintReg, how do we get to assign the same physreg to the whole vreg?

we need to compare split against non-split.

I don't get that part either.
For a given vreg, the first time we get through trySplit, we compare non-split against split.

I'm not totally opposed to the patch, but I still fail to see why it doesn't fit in the regular splitting mechanism and hence why we need a special mechanism for this.

Carrot added inline comments.Sep 13 2023, 10:52 AM
llvm/lib/CodeGen/RegAllocGreedy.cpp
2404

Function tryAssignCSRFirstTime does a similar thing.

True, but I feel it is justified there because these physregs are not free to use the first time.
I don't see this argument applying here.

The key point there is that we need to compare the cost of using a CSR (for the first time) against the cost of splitting the virtual register. Then we can decide whether to split the virtual register or assign a single physical register to it.

if we move it to trySplit, we will miss the possibility of assigning a physical register to the whole virtual register.

Hmm, I don't get that part, we're going to split in trySplitAroundHintReg, how do we get to assign the same physreg to the whole vreg?

We already have a physical register before calling trySplitAroundHintReg. If trySplitAroundHintReg can't find a split better than the non-hint physical register, it returns false, and the caller tryAssign can simply return that physical register for the whole vreg.

In trySplitAroundHintReg, we first compute the cost of using a non-hint physical register for the whole vreg, then pass that cost to calculateRegionSplitCostAroundReg. calculateRegionSplitCostAroundReg only records split candidates with a lower cost, so trySplitAroundHintReg effectively compares using a non-hint physical register for the whole vreg against splitting the vreg around the hint register.

we need to compare split against non-split.

I don't get that part either.
For a given vreg, the first time we get through trySplit, we compare non-split against split.

Sorry for the confusion about "non-split". I mean comparing a split against using a single physical register.

I'm not totally opposed to the patch, but I still fail to see why it doesn't fit in the regular splitting mechanism and hence why we need a special mechanism for this.

qcolombet accepted this revision.Sep 13 2023, 11:58 PM

Thanks for the clarifications @Carrot

llvm/lib/CodeGen/RegAllocGreedy.cpp
2404

Got it. Makes sense.

Now, I understand the comment around trySplitAroundHintReg.

This revision is now accepted and ready to land.Sep 13 2023, 11:58 PM
This revision was landed with ongoing or failed builds.Sep 15 2023, 12:55 PM
This revision was automatically updated to reflect the committed changes.
skan added a subscriber: skan.Sep 15 2023, 11:49 PM
skan added a subscriber: wxiao3.Sep 15 2023, 11:56 PM
skan added inline comments.
llvm/test/CodeGen/X86/ragreedy-hoist-spill.ll
78

@wxiao3 @pengfei It seems to be a regression for x86.

Carrot added inline comments.Sep 16 2023, 12:55 AM
llvm/test/CodeGen/X86/ragreedy-hoist-spill.ll
78

BB6 has a higher frequency than BB7. If the branch in BB6 has a 50% taken probability, the final numbers of executed instructions are the same.

fhahn added a subscriber: fhahn.Sep 22 2023, 7:16 AM

It looks like this is causing a ~10% performance regression on some of our workloads on ARM64 macOS. Working on a reproducer.

This patch also seems to cause near-infinite compile times in some cases (I don't know if they actually finish, as the process gets killed via OOM). Trying to get a reproducer now.

@Carrot created an issue with the hang reproducer: https://github.com/llvm/llvm-project/issues/67188

Thanks for the report!
While I'm investigating, you can use -split-threshold-for-reg-with-hint=0 to unblock your work.

anna added a subscriber: anna.EditedSep 22 2023, 1:09 PM

@Carrot created an issue with the hang reproducer: https://github.com/llvm/llvm-project/issues/67188

Thanks for the report!
While I'm investigating, you can use -split-threshold-for-reg-with-hint=0 to unblock your work.

Hi! If it takes too long, can you please check in that change or revert this one, since we have a broken ToT at the moment?

hans added a subscriber: hans.Oct 3 2023, 1:38 AM

We're seeing binary size increases in Chromium, in particular for Android and Fuchsia where size is critical: https://crbug.com/1488374
Is that an inherent property of this change, and could it be scaled back for optsize functions for example?

Carrot added a comment.Oct 3 2023, 8:32 AM

We're seeing binary size increases in Chromium, in particular for Android and Fuchsia where size is critical: https://crbug.com/1488374
Is that an inherent property of this change, and could it be scaled back for optsize functions for example?

This change is driven by the costs of using a single physical register versus splitting a register. Like most cost computations in RA, these are based on the weighted number of instructions (the dynamic number of instructions, i.e. performance); the static number of instructions is not considered. So its impact on code size is random. You can use -split-threshold-for-reg-with-hint=0 to disable this optimization.

hans added a comment.Oct 4 2023, 3:54 AM

We're seeing binary size increases in Chromium, in particular for Android and Fuchsia where size is critical: https://crbug.com/1488374
Is that an inherent property of this change, and could it be scaled back for optsize functions for example?

This change is driven by the costs of using a single physical register versus splitting a register. Like most cost computations in RA, these are based on the weighted number of instructions (the dynamic number of instructions, i.e. performance); the static number of instructions is not considered. So its impact on code size is random. You can use -split-threshold-for-reg-with-hint=0 to disable this optimization.

It does consistently increase binary size (where we measured, which is Android and Fuchsia), so I wouldn't call it random. We are using -split-threshold-for-reg-with-hint=0 to disable this for now, but we really don't want to use internal compiler flags, and I don't think we can expect users to do so in general.

Did you look at binary size in your benchmarking? If we can't make it size neutral, I think we should consider turning it off in optsize functions.

We're seeing binary size increases in Chromium, in particular for Android and Fuchsia where size is critical: https://crbug.com/1488374
Is that an inherent property of this change, and could it be scaled back for optsize functions for example?

This change is driven by the costs of using a single physical register versus splitting a register. Like most cost computations in RA, these are based on the weighted number of instructions (the dynamic number of instructions, i.e. performance); the static number of instructions is not considered. So its impact on code size is random. You can use -split-threshold-for-reg-with-hint=0 to disable this optimization.

It does consistently increase binary size (where we measured, which is Android and Fuchsia), so I wouldn't call it random. We are using -split-threshold-for-reg-with-hint=0 to disable this for now, but we really don't want to use internal compiler flags, and I don't think we can expect users to do so in general.

Did you look at binary size in your benchmarking? If we can't make it size neutral, I think we should consider turning it off in optsize functions.

I thought about it more. There are two typical cases that can change size.

  1. One COPY instruction between a virtual register and a physical register in a hot block constructs a hint for the virtual register. If they interfere in two or more cold blocks, we can split the virtual register in those cold blocks, which increases code size.
  2. Two or more COPY instructions between the same virtual register and physical register in hot blocks construct a hint for the virtual register. If they interfere in only one cold block, we can split the virtual register in that cold block, which decreases code size.

Case 1 is more common than case 2, so the patch is more likely to increase size. It is therefore reasonable to disable it for optsize functions; I will send a patch for that.