This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/
-
CodeGen/
1/2
MachineOutliner.cpp
1/3
MachineVerifier.cpp
-
Target/
-
AArch64/
2/5
AArch64InstrInfo.cpp
-
X86/
-
X86InstrInfo.cpp
-
test/CodeGen/AArch64/
-
CodeGen/
-
AArch64/
-
machine-outliner-cfi.mir

Differential D126930

Fix interaction of CFI instructions with MachineOutliner.
ClosedPublic

Authored by efriedma on Jun 2 2022, 3:50 PM.

Download Raw Diff

Details

Reviewers

paquette
AndrewLitteken
smeenai
kyulee

Commits

rG0ff51d5dde29: Fix interaction of CFI instructions with MachineOutliner.

Summary

Fix interaction of CFI instructions with MachineOutliner.

When checking if a candidate contains a CFI instruction, actually iterate over all of the instructions, instead of stopping halfway through.
Make sure copied CFI instructions refer to the correct instruction.

Fixes https://github.com/llvm/llvm-project/issues/55842

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

efriedma created this revision.Jun 2 2022, 3:50 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 2 2022, 3:50 PM

Herald added subscribers: jsji, pengfei, hiraditya. · View Herald Transcript

efriedma requested review of this revision.Jun 2 2022, 3:50 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 2 2022, 3:50 PM

efriedma retitled this revision from Fix interaction of CFI with MachineOutliner. to Fix interaction of CFI instructions with MachineOutliner..Jun 2 2022, 3:51 PM

efriedma edited the summary of this revision. (Show Details)

Thank you!

Is it possible or worthwhile to have a test case?

Should be possible to add a testcase, I think. Testcases are a bit tricky to write, though.

Need to think about the MachineOutliner.cpp change a bit more; it's broken.

I guess if we're tail-calling, and we grab all the CFI instructions, it should be okay to just clone all of them? But that's making me doubt the way CFI instructions are being counted; I don't think it's safe to say that getFrameInstructions().size() is actually the number of CFI instructions in the function. It is if you're writing MIR testcases for MachineOutliner, because of the way serialization works, but I don't think that works in general.

If we fix that check, and fix MachineOutliner.cpp to properly clone the instructions, then I guess everything works? But I'd have to spend a day writing testcases.

Harbormaster completed remote builds in B167638: Diff 433910.Jun 2 2022, 5:12 PM

Updated according to my best understanding of how this is supposed to work. Not really confident I'm actually getting this right, but seems like an improvement. Modified some tests to cover the new changes.

Harbormaster completed remote builds in B167653: Diff 433932.Jun 2 2022, 6:55 PM

kyulee added a subscriber: kyulee.Jun 4 2022, 8:54 AM

kyulee added inline comments.

llvm/lib/CodeGen/MachineOutliner.cpp
674–680	I think it's somewhat inefficient by cloning the entire instruction, removing this and then adding one back. CFIIndex should be reassigned for the new function (because the old function might optimize or shuffle it), so we will create new CFI instruction while adding CFIIndex to the new function. Can we just do this way? // Don't keep debug information for outlined instructions. auto DL = DebugLoc(); if (I->isCFIInstruction()) { unsigned CFIIndex = I->getOperand(0).getCFIIndex(); MCCFIInstruction CFI = Instrs[CFIIndex]; BuildMI(MBB, MBB.end(), DL, TII.get(TargetOpcode::CFI_INSTRUCTION)) .addCFIIndex(MF.addFrameInst(CFI)); } else { MachineInstr NewMI = MF.CloneMachineInstr(&I); NewMI->dropMemRefs(MF); NewMI->setDebugLoc(DL); MBB.insert(MBB.end(), NewMI); }
llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
7016	I think this is a nice catch. We should check the entire instructions of the candidate which will be outlined, instead of looking the range of index to the hashes.
7030	I'm just a bit confused about this code. Are you checking a function with a single block? Even that, I don't think walking all the instruction in the block adds value -- do you see more outlining or improvement by doing this? The existing logic that just checks the size of CFIInstructions might be enough conservatively although it might include dead CFIs. If you see the following logic later, the machine outliner actually outlines a tail case only if CFIs are outlined.

efriedma added inline comments.Jun 6 2022, 9:40 AM

llvm/lib/CodeGen/MachineOutliner.cpp
674–680	The efficiency difference is unlikely to matter, but sure, this is probably more clear.
llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
7030	Probably the size of CFIInstructions is greater than or equal to the number of CFI instructions in the function (unless we clone a CFI instruction, which probably shouldn't happen). That said, I don't really want to trust the size of CFIInstructions. Any heuristic that depends on it will be very confusing to anyone looking at MIR dumps (since we print CFI instructions inline). The single basic block restriction isn't strictly necessary; I'll try to come up with a better way to write this.

kyulee added inline comments.Jun 6 2022, 9:58 AM

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
7030	Probably the size of CFIInstructions is greater than or equal to the number of CFI instructions in the function I think this is conservatively safe to use it in this context if that happens -- we're likely to bail-out outlining due to mis-matches in # of CFI instructions. I wonder how many cases we have, in practice. Any heuristic that depends on it will be very confusing to anyone looking at MIR dumps (since we print CFI instructions inline). Can you elaborate it? Probably I don't understand the problem clearly. Let say we have N as CFIInstructions' size while there are actually M CFI instructions in the function body (M <= N). Didn't M instances still have valid CFI Index, assigned to the function? Do you have an example to show this is the problem?

efriedma added inline comments.Jun 6 2022, 10:47 AM

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
7030	See the changed llvm/test/CodeGen/AArch64/machine-outliner-cfi-tail.mir ; without this change, we don't transform.

kyulee added inline comments.Jun 6 2022, 11:24 AM

llvm/test/CodeGen/AArch64/machine-outliner-cfi-tail.mir
37 ↗	(On Diff #433932)	I understand this will make a difference to outline it if we can prove this is dead and thus safely ignore it -- probably at the cost of walking IRs or blocks for all parent functions of candidates. Do you see this case often in practice or is it just synthesized for this testing purpose? Even if that is the case, I think this is irrelevant for this fix, and probably suggest another patch as an optimization (or improving it).

Address review comments.

I don't have any evidence the CFIInstructions.size() thing can cause a miscompile, so leaving out for now.

Thanks for fixing this!
LGTM.

kyulee accepted this revision.Jun 7 2022, 4:06 PM

This revision is now accepted and ready to land.Jun 7 2022, 4:06 PM

LGTM, thank you!

llvm/lib/CodeGen/MachineVerifier.cpp
2216	Unfortunately, this doesn't appear to be enough to catch this issue. If I undo the other changes in this diff and remove the assertion I added in D126919, the test case from https://github.com/llvm/llvm-project/issues/55842 still crashes with an unexpected instruction error instead of hitting this check.

smeenai added inline comments.Jun 7 2022, 4:38 PM

llvm/lib/CodeGen/MachineVerifier.cpp
2216	NVM, this was PEBKAC – I forgot to pass `-verify-machineinstrs` to llc. I imagine Clang does that automatically.

efriedma added inline comments.Jun 7 2022, 5:11 PM

llvm/lib/CodeGen/MachineVerifier.cpp
2216	verify-machineinstrs is expensive because it checks after every pass. So it's off by default in all tools.

Harbormaster completed remote builds in B168431: Diff 434981.Jun 7 2022, 5:12 PM

This revision was landed with ongoing or failed builds.Jun 10 2022, 1:38 PM

Closed by commit rG0ff51d5dde29: Fix interaction of CFI instructions with MachineOutliner. (authored by efriedma). · Explain Why

This revision was automatically updated to reflect the committed changes.

efriedma added a commit: rG0ff51d5dde29: Fix interaction of CFI instructions with MachineOutliner..

Revision Contents

Path

Size

llvm/

lib/

CodeGen/

MachineOutliner.cpp

19 lines

MachineVerifier.cpp

6 lines

Target/

AArch64/

AArch64InstrInfo.cpp

8 lines

X86/

X86InstrInfo.cpp

8 lines

test/

CodeGen/

AArch64/

machine-outliner-cfi.mir

81 lines

Diff 436031

llvm/lib/CodeGen/MachineOutliner.cpp

Show First 20 Lines • Show All 659 Lines • ▼ Show 20 Lines	MachineFunction *MachineOutliner::createOutlinedFunction(

MachineFunction *OriginalMF = FirstCand.front()->getMF();		MachineFunction *OriginalMF = FirstCand.front()->getMF();
const std::vector<MCCFIInstruction> &Instrs =		const std::vector<MCCFIInstruction> &Instrs =
OriginalMF->getFrameInstructions();		OriginalMF->getFrameInstructions();
for (auto I = FirstCand.front(), E = std::next(FirstCand.back()); I != E;		for (auto I = FirstCand.front(), E = std::next(FirstCand.back()); I != E;
++I) {		++I) {
if (I->isDebugInstr())		if (I->isDebugInstr())
continue;		continue;
MachineInstr NewMI = MF.CloneMachineInstr(&I);
		// Don't keep debug information for outlined instructions.
		auto DL = DebugLoc();
if (I->isCFIInstruction()) {		if (I->isCFIInstruction()) {
unsigned CFIIndex = NewMI->getOperand(0).getCFIIndex();		unsigned CFIIndex = I->getOperand(0).getCFIIndex();
MCCFIInstruction CFI = Instrs[CFIIndex];		MCCFIInstruction CFI = Instrs[CFIIndex];
(void)MF.addFrameInst(CFI);		BuildMI(MBB, MBB.end(), DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
}		.addCFIIndex(MF.addFrameInst(CFI));
		} else {
		MachineInstr NewMI = MF.CloneMachineInstr(&I);
NewMI->dropMemRefs(MF);		NewMI->dropMemRefs(MF);
		NewMI->setDebugLoc(DL);
// Don't keep debug information for outlined instructions.
NewMI->setDebugLoc(DebugLoc());
MBB.insert(MBB.end(), NewMI);		MBB.insert(MBB.end(), NewMI);
		kyuleeUnsubmitted Not Done Reply Inline Actions I think it's somewhat inefficient by cloning the entire instruction, removing this and then adding one back. CFIIndex should be reassigned for the new function (because the old function might optimize or shuffle it), so we will create new CFI instruction while adding CFIIndex to the new function. Can we just do this way? // Don't keep debug information for outlined instructions. auto DL = DebugLoc(); if (I->isCFIInstruction()) { unsigned CFIIndex = I->getOperand(0).getCFIIndex(); MCCFIInstruction CFI = Instrs[CFIIndex]; BuildMI(MBB, MBB.end(), DL, TII.get(TargetOpcode::CFI_INSTRUCTION)) .addCFIIndex(MF.addFrameInst(CFI)); } else { MachineInstr NewMI = MF.CloneMachineInstr(&I); NewMI->dropMemRefs(MF); NewMI->setDebugLoc(DL); MBB.insert(MBB.end(), NewMI); } kyulee: I think it's somewhat inefficient by cloning the entire instruction, removing this and then…
		efriedmaAuthorUnsubmitted Done Reply Inline Actions The efficiency difference is unlikely to matter, but sure, this is probably more clear. efriedma: The efficiency difference is unlikely to matter, but sure, this is probably more clear.
}		}
		}

// Set normal properties for a late MachineFunction.		// Set normal properties for a late MachineFunction.
MF.getProperties().reset(MachineFunctionProperties::Property::IsSSA);		MF.getProperties().reset(MachineFunctionProperties::Property::IsSSA);
MF.getProperties().set(MachineFunctionProperties::Property::NoPHIs);		MF.getProperties().set(MachineFunctionProperties::Property::NoPHIs);
MF.getProperties().set(MachineFunctionProperties::Property::NoVRegs);		MF.getProperties().set(MachineFunctionProperties::Property::NoVRegs);
MF.getProperties().set(MachineFunctionProperties::Property::TracksLiveness);		MF.getProperties().set(MachineFunctionProperties::Property::TracksLiveness);
MF.getRegInfo().freezeReservedRegs(MF);		MF.getRegInfo().freezeReservedRegs(MF);

▲ Show 20 Lines • Show All 399 Lines • Show Last 20 Lines

llvm/lib/CodeGen/MachineVerifier.cpp

Show First 20 Lines • Show All 58 Lines • ▼ Show 20 Lines
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/IR/InlineAsm.h"		#include "llvm/IR/InlineAsm.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/InitializePasses.h"		#include "llvm/InitializePasses.h"
#include "llvm/MC/LaneBitmask.h"		#include "llvm/MC/LaneBitmask.h"
#include "llvm/MC/MCAsmInfo.h"		#include "llvm/MC/MCAsmInfo.h"
		#include "llvm/MC/MCDwarf.h"
#include "llvm/MC/MCInstrDesc.h"		#include "llvm/MC/MCInstrDesc.h"
#include "llvm/MC/MCRegisterInfo.h"		#include "llvm/MC/MCRegisterInfo.h"
#include "llvm/MC/MCTargetOptions.h"		#include "llvm/MC/MCTargetOptions.h"
#include "llvm/Pass.h"		#include "llvm/Pass.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/LowLevelTypeImpl.h"		#include "llvm/Support/LowLevelTypeImpl.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
▲ Show 20 Lines • Show All 2,132 Lines • ▼ Show 20 Lines	if (LiveStks && LiveStks->hasInterval(MO->getIndex()) &&
}		}
if (stores && !LI.liveAt(Idx.getRegSlot())) {		if (stores && !LI.liveAt(Idx.getRegSlot())) {
report("Instruction stores to dead spill slot", MO, MONum);		report("Instruction stores to dead spill slot", MO, MONum);
errs() << "Live stack: " << LI << '\n';		errs() << "Live stack: " << LI << '\n';
}		}
}		}
break;		break;

		case MachineOperand::MO_CFIIndex:
		smeenaiUnsubmitted Not Done Reply Inline Actions Unfortunately, this doesn't appear to be enough to catch this issue. If I undo the other changes in this diff and remove the assertion I added in D126919, the test case from https://github.com/llvm/llvm-project/issues/55842 still crashes with an unexpected instruction error instead of hitting this check. smeenai: Unfortunately, this doesn't appear to be enough to catch this issue. If I undo the other…
		smeenaiUnsubmitted Not Done Reply Inline Actions NVM, this was PEBKAC – I forgot to pass `-verify-machineinstrs` to llc. I imagine Clang does that automatically. smeenai: NVM, this was PEBKAC – I forgot to pass `-verify-machineinstrs` to llc. I imagine Clang does…
		efriedmaAuthorUnsubmitted Done Reply Inline Actions verify-machineinstrs is expensive because it checks after every pass. So it's off by default in all tools. efriedma: verify-machineinstrs is expensive because it checks after every pass. So it's off by default…
		if (MO->getCFIIndex() >= MF->getFrameInstructions().size())
		report("CFI instruction has invalid index", MO, MONum);
		break;

default:		default:
break;		break;
}		}
}		}

void MachineVerifier::checkLivenessAtUse(const MachineOperand *MO,		void MachineVerifier::checkLivenessAtUse(const MachineOperand *MO,
unsigned MONum, SlotIndex UseIdx,		unsigned MONum, SlotIndex UseIdx,
const LiveRange &LR,		const LiveRange &LR,
▲ Show 20 Lines • Show All 1,129 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,006 Lines • ▼ Show 20 Lines	outliner::OutlinedFunction AArch64InstrInfo::getOutliningCandidateInfo(

bool HasBTI = any_of(RepeatedSequenceLocs, [](outliner::Candidate &C) {		bool HasBTI = any_of(RepeatedSequenceLocs, [](outliner::Candidate &C) {
return C.getMF()->getInfo<AArch64FunctionInfo>()->branchTargetEnforcement();		return C.getMF()->getInfo<AArch64FunctionInfo>()->branchTargetEnforcement();
});		});

// We check to see if CFI Instructions are present, and if they are		// We check to see if CFI Instructions are present, and if they are
// we find the number of CFI Instructions in the candidates.		// we find the number of CFI Instructions in the candidates.
unsigned CFICount = 0;		unsigned CFICount = 0;
MachineBasicBlock::iterator MBBI = RepeatedSequenceLocs[0].front();		for (auto &I : make_range(RepeatedSequenceLocs[0].front(),
for (unsigned Loc = RepeatedSequenceLocs[0].getStartIdx();		std::next(RepeatedSequenceLocs[0].back()))) {
		kyuleeUnsubmitted Not Done Reply Inline Actions I think this is a nice catch. We should check the entire instructions of the candidate which will be outlined, instead of looking the range of index to the hashes. kyulee: I think this is a nice catch. We should check the entire instructions of the candidate which…
Loc < RepeatedSequenceLocs[0].getEndIdx() + 1; Loc++) {		if (I.isCFIInstruction())
if (MBBI->isCFIInstruction())
CFICount++;		CFICount++;
MBBI++;
}		}

// We compare the number of found CFI Instructions to the number of CFI		// We compare the number of found CFI Instructions to the number of CFI
// instructions in the parent function for each candidate. We must check this		// instructions in the parent function for each candidate. We must check this
// since if we outline one of the CFI instructions in a function, we have to		// since if we outline one of the CFI instructions in a function, we have to
// outline them all for correctness. If we do not, the address offsets will be		// outline them all for correctness. If we do not, the address offsets will be
// incorrect between the two sections of the program.		// incorrect between the two sections of the program.
for (outliner::Candidate &C : RepeatedSequenceLocs) {		for (outliner::Candidate &C : RepeatedSequenceLocs) {
std::vector<MCCFIInstruction> CFIInstructions =		std::vector<MCCFIInstruction> CFIInstructions =
C.getMF()->getFrameInstructions();		C.getMF()->getFrameInstructions();

if (CFICount > 0 && CFICount != CFIInstructions.size())		if (CFICount > 0 && CFICount != CFIInstructions.size())
		kyuleeUnsubmitted Not Done Reply Inline Actions I'm just a bit confused about this code. Are you checking a function with a single block? Even that, I don't think walking all the instruction in the block adds value -- do you see more outlining or improvement by doing this? The existing logic that just checks the size of CFIInstructions might be enough conservatively although it might include dead CFIs. If you see the following logic later, the machine outliner actually outlines a tail case only if CFIs are outlined. kyulee: I'm just a bit confused about this code. Are you checking a function with a single block? Even…
		efriedmaAuthorUnsubmitted Done Reply Inline Actions Probably the size of CFIInstructions is greater than or equal to the number of CFI instructions in the function (unless we clone a CFI instruction, which probably shouldn't happen). That said, I don't really want to trust the size of CFIInstructions. Any heuristic that depends on it will be very confusing to anyone looking at MIR dumps (since we print CFI instructions inline). The single basic block restriction isn't strictly necessary; I'll try to come up with a better way to write this. efriedma: Probably the size of CFIInstructions is greater than or equal to the number of CFI instructions…
		kyuleeUnsubmitted Not Done Reply Inline Actions Probably the size of CFIInstructions is greater than or equal to the number of CFI instructions in the function I think this is conservatively safe to use it in this context if that happens -- we're likely to bail-out outlining due to mis-matches in # of CFI instructions. I wonder how many cases we have, in practice. Any heuristic that depends on it will be very confusing to anyone looking at MIR dumps (since we print CFI instructions inline). Can you elaborate it? Probably I don't understand the problem clearly. Let say we have N as CFIInstructions' size while there are actually M CFI instructions in the function body (M <= N). Didn't M instances still have valid CFI Index, assigned to the function? Do you have an example to show this is the problem? kyulee: > Probably the size of CFIInstructions is greater than or equal to the number of CFI…
		efriedmaAuthorUnsubmitted Done Reply Inline Actions See the changed llvm/test/CodeGen/AArch64/machine-outliner-cfi-tail.mir ; without this change, we don't transform. efriedma: See the changed llvm/test/CodeGen/AArch64/machine-outliner-cfi-tail.mir ; without this change…
return outliner::OutlinedFunction();		return outliner::OutlinedFunction();
}		}

// Returns true if an instructions is safe to fix up, false otherwise.		// Returns true if an instructions is safe to fix up, false otherwise.
auto IsSafeToFixup = [this, &TRI](MachineInstr &MI) {		auto IsSafeToFixup = [this, &TRI](MachineInstr &MI) {
if (MI.isCall())		if (MI.isCall())
return true;		return true;

▲ Show 20 Lines • Show All 974 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86InstrInfo.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 9,481 Lines • ▼ Show 20 Lines	unsigned SequenceSize =
if (MI.isDebugInstr() \|\| MI.isKill())		if (MI.isDebugInstr() \|\| MI.isKill())
return Sum;		return Sum;
return Sum + 1;		return Sum + 1;
});		});

// We check to see if CFI Instructions are present, and if they are		// We check to see if CFI Instructions are present, and if they are
// we find the number of CFI Instructions in the candidates.		// we find the number of CFI Instructions in the candidates.
unsigned CFICount = 0;		unsigned CFICount = 0;
MachineBasicBlock::iterator MBBI = RepeatedSequenceLocs[0].front();		for (auto &I : make_range(RepeatedSequenceLocs[0].front(),
for (unsigned Loc = RepeatedSequenceLocs[0].getStartIdx();		std::next(RepeatedSequenceLocs[0].back()))) {
Loc < RepeatedSequenceLocs[0].getEndIdx() + 1; Loc++) {		if (I.isCFIInstruction())
if (MBBI->isCFIInstruction())
CFICount++;		CFICount++;
MBBI++;
}		}

// We compare the number of found CFI Instructions to the number of CFI		// We compare the number of found CFI Instructions to the number of CFI
// instructions in the parent function for each candidate. We must check this		// instructions in the parent function for each candidate. We must check this
// since if we outline one of the CFI instructions in a function, we have to		// since if we outline one of the CFI instructions in a function, we have to
// outline them all for correctness. If we do not, the address offsets will be		// outline them all for correctness. If we do not, the address offsets will be
// incorrect between the two sections of the program.		// incorrect between the two sections of the program.
for (outliner::Candidate &C : RepeatedSequenceLocs) {		for (outliner::Candidate &C : RepeatedSequenceLocs) {
▲ Show 20 Lines • Show All 147 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/machine-outliner-cfi.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -mtriple=aarch64-apple-unknown -run-pass=machine-outliner -verify-machineinstrs %s -o - \| FileCheck %s			# RUN: llc -mtriple=aarch64-apple-unknown -run-pass=machine-outliner -verify-machineinstrs -enable-machine-outliner=always %s -o - \| FileCheck %s

	# Outlining CFI instructions is unsafe if it is not tail called, but otherwise,			# Outlining CFI instructions is unsafe if it is not tail called, but otherwise,
	# it requires fixups. Show that we don't include CFI instructions in non			# it requires fixups. Show that we don't include CFI instructions in non
	# tail call outlined sequences right now.			# tail call outlined sequences right now. Show that we count CFI instructions
				# correctly in the presence of debug info.

	--- \|			--- \|
	define void @foo() #0 { ret void }			define void @foo() #0 { ret void }
	define void @bar() #0 { ret void }			define void @bar() #0 { ret void }
	define void @baz() #0 { ret void }			define void @baz() #0 { ret void }
	attributes #0 = { noredzone }			attributes #0 = { noredzone }

	...			...
	---			---
	name: foo			name: foo
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $lr			liveins: $lr
	; CHECK-LABEL: name: foo			; CHECK-LABEL: name: foo
	; CHECK: liveins: $lr			; CHECK: liveins: $lr
	; CHECK: $w9 = ORRWri $wzr, 1			; CHECK-NEXT: {{ $}}
	; CHECK: $w10 = ORRWri $wzr, 2			; CHECK-NEXT: $w9 = ORRWri $wzr, 1
	; CHECK: $w11 = ORRWri $wzr, 3			; CHECK-NEXT: $w10 = ORRWri $wzr, 2
	; CHECK: frame-setup CFI_INSTRUCTION def_cfa $w29, 16			; CHECK-NEXT: $w11 = ORRWri $wzr, 3
	; CHECK: $w20 = ORRWri $wzr, 1			; CHECK-NEXT: DBG_VALUE $w9, $noreg
	; CHECK: TCRETURNdi @OUTLINED_FUNCTION_0, 0, implicit $sp, implicit-def $w12, implicit-def $w13, implicit-def $w14, implicit-def $w15, implicit $wzr, implicit $sp			; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: frame-setup CFI_INSTRUCTION def_cfa $w29, 16
				; CHECK-NEXT: $w20 = ORRWri $wzr, 1
				; CHECK-NEXT: TCRETURNdi @OUTLINED_FUNCTION_0, 0, implicit $sp, implicit-def $w12, implicit-def $w13, implicit-def $w14, implicit-def $w15, implicit $wzr, implicit $sp
	$w9 = ORRWri $wzr, 1			$w9 = ORRWri $wzr, 1
	$w10 = ORRWri $wzr, 2			$w10 = ORRWri $wzr, 2
	$w11 = ORRWri $wzr, 3			$w11 = ORRWri $wzr, 3
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
	frame-setup CFI_INSTRUCTION def_cfa $w29, 16			frame-setup CFI_INSTRUCTION def_cfa $w29, 16
	$w20 = ORRWri $wzr, 1			$w20 = ORRWri $wzr, 1
	$w12 = ORRWri $wzr, 1			$w12 = ORRWri $wzr, 1
	$w13 = ORRWri $wzr, 2			$w13 = ORRWri $wzr, 2
	$w14 = ORRWri $wzr, 3			$w14 = ORRWri $wzr, 3
	$w15 = ORRWri $wzr, 4			$w15 = ORRWri $wzr, 4
	RET undef $lr			RET undef $lr
	...			...
	---			---
	name: bar			name: bar
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $lr			liveins: $lr
	; CHECK-LABEL: name: bar			; CHECK-LABEL: name: bar
	; CHECK: liveins: $lr			; CHECK: liveins: $lr
	; CHECK: $w9 = ORRWri $wzr, 1			; CHECK-NEXT: {{ $}}
	; CHECK: $w10 = ORRWri $wzr, 2			; CHECK-NEXT: $w9 = ORRWri $wzr, 1
	; CHECK: $w11 = ORRWri $wzr, 3			; CHECK-NEXT: $w10 = ORRWri $wzr, 2
	; CHECK: frame-setup CFI_INSTRUCTION def_cfa $w29, 16			; CHECK-NEXT: $w11 = ORRWri $wzr, 3
	; CHECK: $w21 = ORRWri $wzr, 1			; CHECK-NEXT: DBG_VALUE $w9, $noreg
	; CHECK: TCRETURNdi @OUTLINED_FUNCTION_0, 0, implicit $sp, implicit-def $w12, implicit-def $w13, implicit-def $w14, implicit-def $w15, implicit $wzr, implicit $sp			; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: frame-setup CFI_INSTRUCTION def_cfa $w29, 16
				; CHECK-NEXT: $w21 = ORRWri $wzr, 1
				; CHECK-NEXT: TCRETURNdi @OUTLINED_FUNCTION_0, 0, implicit $sp, implicit-def $w12, implicit-def $w13, implicit-def $w14, implicit-def $w15, implicit $wzr, implicit $sp
	$w9 = ORRWri $wzr, 1			$w9 = ORRWri $wzr, 1
	$w10 = ORRWri $wzr, 2			$w10 = ORRWri $wzr, 2
	$w11 = ORRWri $wzr, 3			$w11 = ORRWri $wzr, 3
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
	frame-setup CFI_INSTRUCTION def_cfa $w29, 16			frame-setup CFI_INSTRUCTION def_cfa $w29, 16
	$w21 = ORRWri $wzr, 1			$w21 = ORRWri $wzr, 1
	$w12 = ORRWri $wzr, 1			$w12 = ORRWri $wzr, 1
	$w13 = ORRWri $wzr, 2			$w13 = ORRWri $wzr, 2
	$w14 = ORRWri $wzr, 3			$w14 = ORRWri $wzr, 3
	$w15 = ORRWri $wzr, 4			$w15 = ORRWri $wzr, 4
	RET undef $lr			RET undef $lr
	...			...
	---			---
	name: baz			name: baz
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $lr			liveins: $lr
	; CHECK-LABEL: name: baz			; CHECK-LABEL: name: baz
	; CHECK: liveins: $lr			; CHECK: liveins: $lr
	; CHECK: $w9 = ORRWri $wzr, 1			; CHECK-NEXT: {{ $}}
	; CHECK: $w10 = ORRWri $wzr, 2			; CHECK-NEXT: $w9 = ORRWri $wzr, 1
	; CHECK: $w11 = ORRWri $wzr, 3			; CHECK-NEXT: $w10 = ORRWri $wzr, 2
	; CHECK: frame-setup CFI_INSTRUCTION def_cfa $w29, 16			; CHECK-NEXT: $w11 = ORRWri $wzr, 3
	; CHECK: $w22 = ORRWri $wzr, 1			; CHECK-NEXT: DBG_VALUE $w9, $noreg
	; CHECK: TCRETURNdi @OUTLINED_FUNCTION_0, 0, implicit $sp, implicit-def $w12, implicit-def $w13, implicit-def $w14, implicit-def $w15, implicit $wzr, implicit $sp			; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: DBG_VALUE $w9, $noreg
				; CHECK-NEXT: frame-setup CFI_INSTRUCTION def_cfa $w29, 16
				; CHECK-NEXT: $w22 = ORRWri $wzr, 1
				; CHECK-NEXT: TCRETURNdi @OUTLINED_FUNCTION_0, 0, implicit $sp, implicit-def $w12, implicit-def $w13, implicit-def $w14, implicit-def $w15, implicit $wzr, implicit $sp
	$w9 = ORRWri $wzr, 1			$w9 = ORRWri $wzr, 1
	$w10 = ORRWri $wzr, 2			$w10 = ORRWri $wzr, 2
	$w11 = ORRWri $wzr, 3			$w11 = ORRWri $wzr, 3
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
				DBG_VALUE $w9, $noreg
	frame-setup CFI_INSTRUCTION def_cfa $w29, 16			frame-setup CFI_INSTRUCTION def_cfa $w29, 16
	$w22 = ORRWri $wzr, 1			$w22 = ORRWri $wzr, 1
	$w12 = ORRWri $wzr, 1			$w12 = ORRWri $wzr, 1
	$w13 = ORRWri $wzr, 2			$w13 = ORRWri $wzr, 2
	$w14 = ORRWri $wzr, 3			$w14 = ORRWri $wzr, 3
	$w15 = ORRWri $wzr, 4			$w15 = ORRWri $wzr, 4
	RET undef $lr			RET undef $lr