This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/CodeGen/
-
CodeGen/
6
MachineSink.cpp
-
test/CodeGen/
-
CodeGen/
-
AArch64/
-
O3-pipeline.ll
-
post-ra-machine-sink-spill.mir
-
PowerPC/
-
aggressive-anti-dep-breaker-subreg.ll
-
cc.ll
-
X86/
-
catchret-regmask.ll

Differential D45110

[PostRASink]Sink spill to a block reachable to reload
Needs ReviewPublic

Authored by junbuml on Mar 30 2018, 2:44 PM.

Download Raw Diff

Details

Reviewers

thegameg
sebpop
MatzeB
qcolombet
mcrosier
gberry
javed.absar

Summary

This change sink a spill into a successor, if reloads from the stack slot
are reachable only through the successor.

For the machine IR below, we will sink the store to %stack.0 in bb.0 into
bb.1 because there is no reload from stack.0 in bb.2.

bb.0:
  STRXui $x0, %stack.0, 0
  Bcc 11, %bb.2

bb.1:
  $x0 = LDRXui %stack.0, 0
  RET $x0

bb.2:
  $x0 = COPY $xzr
  RET   $x0

Diff Detail

Event Timeline

junbuml created this revision.Mar 30 2018, 2:44 PM

Herald added subscribers: javed.absar, mcrosier, qcolombet. · View Herald TranscriptMar 30 2018, 2:44 PM

junbuml edited the summary of this revision. (Show Details)Mar 30 2018, 2:46 PM

junbuml edited the summary of this revision. (Show Details)

junbuml updated this revision to Diff 140850.Apr 3 2018, 1:10 PM

junbuml retitled this revision from [PostRASink][WIP]Sink spill to a block reachable to reload to [PostRASink]Sink spill to a block reachable to reload.

junbuml added reviewers: thegameg, sebpop, MatzeB, qcolombet, mcrosier, gberry.

Herald added a subscriber: nemanjai. · View Herald TranscriptApr 3 2018, 1:10 PM

junbuml added a reviewer: rnk.Apr 3 2018, 1:11 PM

With/without this change, I collected llvm stats for spec2000/2006/2017 on AArch64. I observed +31.87% more loads from stores promoted in AArch64LoadStoreOptimizer pass, and minor improvement in shrink-wrapping in case spills are sunk from the entry.

Thanks for working on this.

I’m wondering why do we end up with spills placed like this in the first place?

My initial motivation case for this was when spilling the incoming argument register. This is somewhat related with the initial motivation of PostRASink pass because we do not sink the COPY for argument register before allocating. When RA try to spill the incoming argument register, it probably don't want to change the placement of the spill during RA, since sinking it down will extend the live range of the spilled value. So I do this after RA just like we did it for Copy in PostRASink pass.

Kindly ping?

Herald added a reviewer: javed.absar. · View Herald TranscriptApr 26 2018, 1:41 PM

rnk removed a reviewer: rnk.Apr 26 2018, 1:45 PM

Sorry for the wait, this looks very good! Did you see any compile time impact on this?

I'll check how the benchmarks are affected on my side next week and let you know!

lib/CodeGen/MachineSink.cpp
1288	From what I see, `hasLoadFromStackSlot` checks for `MachineMemoryOperand`s attached to the MI. It looks for things like `:: (load 8 from %stack.0)`, and if the load source is a fixed stack it grabs the FI. I wonder what happens if an instruction: MI.mayLoad == true any_of(MI.operands(), isFI) == true but MI.machinememoperands().empty() == true I've been confused about `MachineMemoryOperand`s for a while now, so I'm not sure if an instruction with no MMOs should act like a barrier or something else. Are there any other passes that rely on this? What do you think? I think if we can rely on MMOs it would be great, and we should document it somewhere. This shouldn't be blocking for this patch, but if you can document these assumptions it would be great.
1288	Slightly related, I've been wanting to add this to the MachineVerifier. It would be nice if mayLoad / mayStore instructions don't pass the verifier on some conditions like: has no MMO has MMO == FixedStack but no (fixed) FI operands has MMO != FixedStack but has FI operands probably more we can check here
1319–1320	Actually, why don't we sink across function calls? I would assume the regmasks and the implicit operands would be enough to keep it safe, but I might be wrong.

thegameg added inline comments.Apr 27 2018, 12:30 PM

lib/CodeGen/MachineSink.cpp
1288	Hmm actually, why not use `isLoadFromStackSlot` here as you do for stores?

Refactors this change based on current tip. Made the check for load from FI more conservative.

In my test for spec2000/2006/2017 on AArach64, I didn't see any compile time impact.

lib/CodeGen/MachineSink.cpp
1288	From what I see, hasLoadFromStackSlot checks for MachineMemoryOperands attached to the MI. It looks for things >like :: (load 8 from %stack.0), and if the load source is a fixed stack it grabs the FI. I wonder what happens if an instruction: MI.mayLoad == true any_of(MI.operands(), isFI) == true but MI.machinememoperands().empty() == true I've been confused about MachineMemoryOperands for a while now, so I'm not sure if an instruction with no MMOs >should act like a barrier or something else. Are there any other passes that rely on this? What do you think? I think if we can rely on MMOs it would be great, and we should document it somewhere. This shouldn't be blocking for this patch, but if you can document these assumptions it would be great. Thanks for bring this up. I can see similary code in InstructionStoresToFI() @MachineLICM. If MMO is empty, I conservatively assume an instruction load from some FI. Slightly related, I've been wanting to add this to the MachineVerifier. It would be nice if mayLoad / mayStore instructions don't pass the verifier on some conditions like: has no MMO has MMO == FixedStack but no (fixed) FI operands has MMO != FixedStack but has FI operands probably more we can check here I agree with this check and I will be happy to do it. I will update you for this.
1319–1320	I agree that we can use regmask and implicit operands, but I'm not perfectly clear if it's safe in all targets and calling conventions. I add FIXME about it. Another issue here is that this also skip DBG_VALUE, causing different code being generated with -g. A fix is being discussed in https://reviews.llvm.org/D45878.

Any chance the way FIs are tracked here might be prone to this bug ?

Revision Contents

Path

Size

lib/

CodeGen/

MachineSink.cpp

312 lines

test/

CodeGen/

AArch64/

O3-pipeline.ll

2 lines

post-ra-machine-sink-spill.mir

182 lines

PowerPC/

aggressive-anti-dep-breaker-subreg.ll

2 lines

cc.ll

4 lines

X86/

catchret-regmask.ll

4 lines

Diff 144753

lib/CodeGen/MachineSink.cpp

Show All 20 Lines
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/SparseBitVector.h"		#include "llvm/ADT/SparseBitVector.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/Analysis/AliasAnalysis.h"		#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/CodeGen/MachineBasicBlock.h"		#include "llvm/CodeGen/MachineBasicBlock.h"
#include "llvm/CodeGen/MachineBlockFrequencyInfo.h"		#include "llvm/CodeGen/MachineBlockFrequencyInfo.h"
#include "llvm/CodeGen/MachineBranchProbabilityInfo.h"		#include "llvm/CodeGen/MachineBranchProbabilityInfo.h"
#include "llvm/CodeGen/MachineDominators.h"		#include "llvm/CodeGen/MachineDominators.h"
		#include "llvm/CodeGen/MachineFrameInfo.h"
#include "llvm/CodeGen/MachineFunction.h"		#include "llvm/CodeGen/MachineFunction.h"
#include "llvm/CodeGen/MachineFunctionPass.h"		#include "llvm/CodeGen/MachineFunctionPass.h"
#include "llvm/CodeGen/MachineInstr.h"		#include "llvm/CodeGen/MachineInstr.h"
#include "llvm/CodeGen/MachineLoopInfo.h"		#include "llvm/CodeGen/MachineLoopInfo.h"
#include "llvm/CodeGen/MachineOperand.h"		#include "llvm/CodeGen/MachineOperand.h"
#include "llvm/CodeGen/MachinePostDominators.h"		#include "llvm/CodeGen/MachinePostDominators.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"		#include "llvm/CodeGen/MachineRegisterInfo.h"
#include "llvm/CodeGen/TargetInstrInfo.h"		#include "llvm/CodeGen/TargetInstrInfo.h"
Show All 36 Lines	cl::desc(
"speculative execution of up to 1 instruction to avoid branching to "		"speculative execution of up to 1 instruction to avoid branching to "
"splitted critical edge"),		"splitted critical edge"),
cl::init(40), cl::Hidden);		cl::init(40), cl::Hidden);

STATISTIC(NumSunk, "Number of machine instructions sunk");		STATISTIC(NumSunk, "Number of machine instructions sunk");
STATISTIC(NumSplit, "Number of critical edges split");		STATISTIC(NumSplit, "Number of critical edges split");
STATISTIC(NumCoalesces, "Number of copies coalesced");		STATISTIC(NumCoalesces, "Number of copies coalesced");
STATISTIC(NumPostRACopySink, "Number of copies sunk after RA");		STATISTIC(NumPostRACopySink, "Number of copies sunk after RA");
		STATISTIC(NumPostRASpillSink, "Number of spills sunk after RA");

namespace {		namespace {

class MachineSinking : public MachineFunctionPass {		class MachineSinking : public MachineFunctionPass {
const TargetInstrInfo *TII;		const TargetInstrInfo *TII;
const TargetRegisterInfo *TRI;		const TargetRegisterInfo *TRI;
MachineRegisterInfo *MRI; // Machine register information		MachineRegisterInfo *MRI; // Machine register information
MachineDominatorTree *DT; // Machine dominator tree		MachineDominatorTree *DT; // Machine dominator tree
▲ Show 20 Lines • Show All 846 Lines • ▼ Show 20 Lines
// %bb.2:		// %bb.2:
// %w0 = COPY %wzr		// %w0 = COPY %wzr
// RET %w0		// RET %w0
// As we sink %w19 (CSR in AArch64) into %bb.1, the shrink-wrapping pass will be		// As we sink %w19 (CSR in AArch64) into %bb.1, the shrink-wrapping pass will be
// able to see %bb.0 as a candidate.		// able to see %bb.0 as a candidate.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
namespace {		namespace {

		typedef DenseMap<int, SmallPtrSet<MachineBasicBlock *, 4>> StackUseMapTy;

class PostRAMachineSinking : public MachineFunctionPass {		class PostRAMachineSinking : public MachineFunctionPass {
public:		public:
bool runOnMachineFunction(MachineFunction &MF) override;		bool runOnMachineFunction(MachineFunction &MF) override;

static char ID;		static char ID;
PostRAMachineSinking() : MachineFunctionPass(ID) {}		PostRAMachineSinking() : MachineFunctionPass(ID) {}
StringRef getPassName() const override { return "PostRA Machine Sink"; }		StringRef getPassName() const override { return "PostRA Machine Sink"; }

void getAnalysisUsage(AnalysisUsage &AU) const override {		void getAnalysisUsage(AnalysisUsage &AU) const override {
AU.setPreservesCFG();		AU.setPreservesCFG();
		AU.addRequired<MachineDominatorTree>();
MachineFunctionPass::getAnalysisUsage(AU);		MachineFunctionPass::getAnalysisUsage(AU);
}		}

MachineFunctionProperties getRequiredProperties() const override {		MachineFunctionProperties getRequiredProperties() const override {
return MachineFunctionProperties().set(		return MachineFunctionProperties().set(
MachineFunctionProperties::Property::NoVRegs);		MachineFunctionProperties::Property::NoVRegs);
}		}

private:		private:
		const TargetRegisterInfo *TRI;
		const TargetInstrInfo *TII;
		const MachineFrameInfo *MFI;
		const MachineDominatorTree *MDT;

/// Track which register units have been modified and used.		/// Track which register units have been modified and used.
LiveRegUnits ModifiedRegUnits, UsedRegUnits;		LiveRegUnits ModifiedRegUnits, UsedRegUnits;

		/// Hold the uses of stack slots.
		StackUseMapTy StackUseMap;

		/// Find basic blocks where there are loads from stack slots.
		bool scanLoadFromStackSlot(MachineFunction &MF);

		/// Sink register spills close to their reloads.
		bool tryToSinkSpill(MachineInstr &MI, MachineBasicBlock &CurBB,
		SmallPtrSetImpl<MachineBasicBlock *> &SinkableBBs);

/// Sink Copy instructions unused in the same block close to their uses in		/// Sink Copy instructions unused in the same block close to their uses in
/// successors.		/// successors.
bool tryToSinkCopy(MachineBasicBlock &BB, MachineFunction &MF,		bool tryToSinkCopy(MachineInstr &MI, MachineBasicBlock &CurBB,
const TargetRegisterInfo TRI, const TargetInstrInfo TII);		SmallPtrSetImpl<MachineBasicBlock *> &SinkableBBs);

		/// Perform sinking for Copy and store to stack.
		bool tryToSink(MachineBasicBlock &BB, bool EnableSinkSpill);
};		};
} // namespace		} // namespace

char PostRAMachineSinking::ID = 0;		char PostRAMachineSinking::ID = 0;
char &llvm::PostRAMachineSinkingID = PostRAMachineSinking::ID;		char &llvm::PostRAMachineSinkingID = PostRAMachineSinking::ID;

INITIALIZE_PASS(PostRAMachineSinking, "postra-machine-sink",		INITIALIZE_PASS_BEGIN(PostRAMachineSinking, "postra-machine-sink",
"PostRA Machine Sink", false, false)		"PostRA Machine Sink", false, false)
		INITIALIZE_PASS_DEPENDENCY(MachineDominatorTree)
		INITIALIZE_PASS_END(PostRAMachineSinking, "postra-machine-sink",
		"PostRA Machine Sink", false, false)

		// Return true if there is potential path from FromMBB to any basic block which
		// contains uses of FI.
		static bool isReachableToUseOfFI(MachineBasicBlock *FromMBB,
		StackUseMapTy &StackUseMap, int FI,
		const MachineDominatorTree *MDT) {
		SmallPtrSet<MachineBasicBlock *, 4> &UseBBs = StackUseMap[FI];
		if (UseBBs.count(FromMBB))
		return true;

		if (any_of(UseBBs, [&](MachineBasicBlock *UseBB) {
		return MDT->dominates(FromMBB, UseBB);
		}))
		return true;

		// Limit the number of blocks we visit for compile times.
		// The default value (32) was chosen arbitrarily.
		unsigned Limit = 32;

		DenseSet<const MachineBasicBlock *> Visited;
		SmallVector<MachineBasicBlock *, 4> Worklist(FromMBB->succ_begin(),
		FromMBB->succ_end());
		while (!Worklist.empty()) {
		MachineBasicBlock *MBB = Worklist.pop_back_val();
		if (!Visited.insert(MBB).second)
		continue;
		if (UseBBs.count(MBB))
		return true;

		// If it reaches to the limit, conservatively return true (there is
		// potentially a path).
		if (!--Limit)
		return true;

		Worklist.append(MBB->succ_begin(), MBB->succ_end());
		}
		return false;
		}

		static MachineBasicBlock *
		getSingleReloadInSuccBB(MachineBasicBlock &CurBB, StackUseMapTy &StackUseMap,
		SmallPtrSetImpl<MachineBasicBlock *> &SinkableBBs,
		int FI, const MachineDominatorTree *MDT) {
		// TODO: We should track access to FI just like we track the def of register.
		if (StackUseMap[FI].count(&CurBB))
		return nullptr;

		MachineBasicBlock *SingleReachableSucc = nullptr;

		// Try to find a single sinkable successor from which all use of FI is
		// reachable.
		for (auto *SI : SinkableBBs) {
		if (SI == &CurBB)
		continue;
		if (isReachableToUseOfFI(SI, StackUseMap, FI, MDT)) {
		if (SingleReachableSucc)
		return nullptr;
		SingleReachableSucc = SI;
		}
		}
		if (!SingleReachableSucc)
		return nullptr;

		// Check if there is any other successor through which a use of FI is reachable.
		for (MachineBasicBlock *SI : CurBB.successors())
		if (!SinkableBBs.count(SI) && isReachableToUseOfFI(SI, StackUseMap, FI, MDT))
		return nullptr;

		return SingleReachableSucc;
		}

		bool PostRAMachineSinking::scanLoadFromStackSlot(MachineFunction &MF) {
		StackUseMap.clear();
		for (auto &MBB : MF)
		for (auto &MI : MBB) {
		for (const MachineOperand &MO : MI.operands()) {
		if (!MO.isFI())
		continue;
		int FI = MO.getIndex();
		if (!MFI->isSpillSlotObjectIndex(FI))
		continue;
		// If MMO is empty, conservatively assume that the instruction load from
		// some FI.
		if (MI.memoperands_empty())
		return false;
		for (MachineInstr::mmo_iterator o = MI.memoperands_begin(),
		oe = MI.memoperands_end();
		o != oe; ++o)
		if ((*o)->isLoad())
		if (const FixedStackPseudoSourceValue *Value =
		dyn_cast_or_null<FixedStackPseudoSourceValue>(
		(*o)->getPseudoValue()))
		if (Value->getFrameIndex() == FI)
		StackUseMap[FI].insert(&MBB);
		}
		}
		return true;
		}

static bool aliasWithRegsInLiveIn(MachineBasicBlock &MBB, unsigned Reg,		static bool aliasWithRegsInLiveIn(MachineBasicBlock &MBB, unsigned Reg,
const TargetRegisterInfo *TRI) {		const TargetRegisterInfo *TRI) {
LiveRegUnits LiveInRegUnits(*TRI);		LiveRegUnits LiveInRegUnits(*TRI);
LiveInRegUnits.addLiveIns(MBB);		LiveInRegUnits.addLiveIns(MBB);
return !LiveInRegUnits.available(Reg);		return !LiveInRegUnits.available(Reg);
}		}

Show All 35 Lines	MachineBasicBlock *BB =
getSingleLiveInSuccBB(CurBB, SinkableBBs, DefReg, TRI);		getSingleLiveInSuccBB(CurBB, SinkableBBs, DefReg, TRI);
if (!BB \|\| (SingleBB && SingleBB != BB))		if (!BB \|\| (SingleBB && SingleBB != BB))
return nullptr;		return nullptr;
SingleBB = BB;		SingleBB = BB;
}		}
return SingleBB;		return SingleBB;
}		}

static void clearKillFlags(MachineInstr *MI, MachineBasicBlock &CurBB,		static void clearKillFlags(MachineInstr &MI, MachineBasicBlock &CurBB,
SmallVectorImpl<unsigned> &UsedOpsInCopy,		SmallVectorImpl<unsigned> &UsedOpsInCopy,
LiveRegUnits &UsedRegUnits,		LiveRegUnits &UsedRegUnits,
const TargetRegisterInfo *TRI) {		const TargetRegisterInfo *TRI) {
for (auto U : UsedOpsInCopy) {		for (auto U : UsedOpsInCopy) {
MachineOperand &MO = MI->getOperand(U);		MachineOperand &MO = MI.getOperand(U);
unsigned SrcReg = MO.getReg();		unsigned SrcReg = MO.getReg();
if (!UsedRegUnits.available(SrcReg)) {		if (!UsedRegUnits.available(SrcReg)) {
MachineBasicBlock::iterator NI = std::next(MI->getIterator());		MachineBasicBlock::iterator NI = std::next(MI.getIterator());
for (MachineInstr &UI : make_range(NI, CurBB.end())) {		for (MachineInstr &UI : make_range(NI, CurBB.end())) {
if (UI.killsRegister(SrcReg, TRI)) {		if (UI.killsRegister(SrcReg, TRI)) {
UI.clearRegisterKills(SrcReg, TRI);		UI.clearRegisterKills(SrcReg, TRI);
MO.setIsKill(true);		MO.setIsKill(true);
break;		break;
}		}
}		}
}		}
}		}
}		}

static void updateLiveIn(MachineInstr MI, MachineBasicBlock SuccBB,		static void updateLiveIn(MachineInstr &MI, MachineBasicBlock *SuccBB,
SmallVectorImpl<unsigned> &UsedOpsInCopy,		SmallVectorImpl<unsigned> &UsedOpsInCopy,
SmallVectorImpl<unsigned> &DefedRegsInCopy) {		SmallVectorImpl<unsigned> &DefedRegsInCopy) {
for (auto DefReg : DefedRegsInCopy)		for (auto DefReg : DefedRegsInCopy)
SuccBB->removeLiveIn(DefReg);		SuccBB->removeLiveIn(DefReg);
for (auto U : UsedOpsInCopy) {		for (auto U : UsedOpsInCopy) {
unsigned Reg = MI->getOperand(U).getReg();		unsigned Reg = MI.getOperand(U).getReg();
if (!SuccBB->isLiveIn(Reg))		if (!SuccBB->isLiveIn(Reg))
SuccBB->addLiveIn(Reg);		SuccBB->addLiveIn(Reg);
}		}
}		}

static bool hasRegisterDependency(MachineInstr *MI,		static bool hasRegisterDependency(MachineInstr &MI,
SmallVectorImpl<unsigned> &UsedOpsInCopy,		SmallVectorImpl<unsigned> &UsedOpsInCopy,
SmallVectorImpl<unsigned> &DefedRegsInCopy,		SmallVectorImpl<unsigned> &DefedRegsInCopy,
LiveRegUnits &ModifiedRegUnits,		LiveRegUnits &ModifiedRegUnits,
LiveRegUnits &UsedRegUnits) {		LiveRegUnits &UsedRegUnits) {
bool HasRegDependency = false;		bool HasRegDependency = false;
for (unsigned i = 0, e = MI->getNumOperands(); i != e; ++i) {		for (unsigned i = 0, e = MI.getNumOperands(); i != e; ++i) {
MachineOperand &MO = MI->getOperand(i);		MachineOperand &MO = MI.getOperand(i);
if (!MO.isReg())		if (!MO.isReg())
continue;		continue;
unsigned Reg = MO.getReg();		unsigned Reg = MO.getReg();
if (!Reg)		if (!Reg)
continue;		continue;
if (MO.isDef()) {		if (MO.isDef()) {
if (!ModifiedRegUnits.available(Reg) \|\| !UsedRegUnits.available(Reg)) {		if (!ModifiedRegUnits.available(Reg) \|\| !UsedRegUnits.available(Reg)) {
HasRegDependency = true;		HasRegDependency = true;
Show All 11 Lines	if (MO.isDef()) {
break;		break;
}		}
UsedOpsInCopy.push_back(i);		UsedOpsInCopy.push_back(i);
}		}
}		}
return HasRegDependency;		return HasRegDependency;
}		}

bool PostRAMachineSinking::tryToSinkCopy(MachineBasicBlock &CurBB,		bool PostRAMachineSinking::tryToSinkSpill(MachineInstr &MI,
MachineFunction &MF,		MachineBasicBlock &CurBB,
const TargetRegisterInfo *TRI,		SmallPtrSetImpl<MachineBasicBlock *> &SinkableBBs) {
const TargetInstrInfo *TII) {		int FI;
SmallPtrSet<MachineBasicBlock *, 2> SinkableBBs;		unsigned StrValReg = TII->isStoreToStackSlot(MI, FI);
// FIXME: For now, we sink only to a successor which has a single predecessor		if (!StrValReg \|\| !MFI->isSpillSlotObjectIndex(FI))
// so that we can directly sink COPY instructions to the successor without		return false;
// adding any new block or branch instruction.
for (MachineBasicBlock *SI : CurBB.successors())
if (!SI->livein_empty() && SI->pred_size() == 1)
SinkableBBs.insert(SI);

if (SinkableBBs.empty())		// Track the operand index for use in Copy.
		SmallVector<unsigned, 2> UsedOpsInCopy;
		// Track the register number defed in Copy.
		SmallVector<unsigned, 2> DefedRegsInCopy;
		// Don't sink the spill if it would violate a register dependency.
		if (hasRegisterDependency(MI, UsedOpsInCopy, DefedRegsInCopy,
		ModifiedRegUnits, UsedRegUnits))
return false;		return false;

bool Changed = false;		MachineBasicBlock *SuccBB =
		getSingleReloadInSuccBB(CurBB, StackUseMap, SinkableBBs, FI, MDT);

// Track which registers have been modified and used between the end of the		if (!SuccBB)
// block and the current instruction.		return false;
ModifiedRegUnits.clear();
UsedRegUnits.clear();

for (auto I = CurBB.rbegin(), E = CurBB.rend(); I != E;) {		// Clear the kill flag if SrcReg is killed between MI and the end of the
MachineInstr MI = &I;		// block.
++I;		clearKillFlags(MI, CurBB, UsedOpsInCopy, UsedRegUnits, TRI);

// Do not move any instruction across function call.		// FIXME: We should collect debug values and sink them together.
if (MI->isCall())		MachineBasicBlock::iterator InsertPos = SuccBB->getFirstNonPHI();
return false;		SuccBB->splice(InsertPos, &CurBB, &MI);
		updateLiveIn(MI, SuccBB, UsedOpsInCopy, DefedRegsInCopy);

if (!MI->isCopy() \|\| !MI->getOperand(0).isRenamable()) {		++NumPostRASpillSink;
LiveRegUnits::accumulateUsedDefed(*MI, ModifiedRegUnits, UsedRegUnits,		return true;
TRI);
continue;
}		}

		bool PostRAMachineSinking::tryToSinkCopy(
		MachineInstr &MI, MachineBasicBlock &CurBB,
		SmallPtrSetImpl<MachineBasicBlock *> &SinkableBBs) {
		if (!MI.getOperand(0).isRenamable())
		return false;

// Track the operand index for use in Copy.		// Track the operand index for use in Copy.
SmallVector<unsigned, 2> UsedOpsInCopy;		SmallVector<unsigned, 2> UsedOpsInCopy;
// Track the register number defed in Copy.		// Track the register number defed in Copy.
SmallVector<unsigned, 2> DefedRegsInCopy;		SmallVector<unsigned, 2> DefedRegsInCopy;

// Don't sink the COPY if it would violate a register dependency.		// Don't sink the COPY if it would violate a register dependency.
if (hasRegisterDependency(MI, UsedOpsInCopy, DefedRegsInCopy,		if (hasRegisterDependency(MI, UsedOpsInCopy, DefedRegsInCopy,
ModifiedRegUnits, UsedRegUnits)) {		ModifiedRegUnits, UsedRegUnits))
LiveRegUnits::accumulateUsedDefed(*MI, ModifiedRegUnits, UsedRegUnits,		return false;
TRI);
continue;
}
assert((!UsedOpsInCopy.empty() && !DefedRegsInCopy.empty()) &&		assert((!UsedOpsInCopy.empty() && !DefedRegsInCopy.empty()) &&
"Unexpect SrcReg or DefReg");		"Unexpect SrcReg or DefReg");
MachineBasicBlock *SuccBB =		MachineBasicBlock *SuccBB =
getSingleLiveInSuccBB(CurBB, SinkableBBs, DefedRegsInCopy, TRI);		getSingleLiveInSuccBB(CurBB, SinkableBBs, DefedRegsInCopy, TRI);

// Don't sink if we cannot find a single sinkable successor in which Reg		// Don't sink if we cannot find a single sinkable successor in which Reg
// is live-in.		// is live-in.
if (!SuccBB) {		if (!SuccBB)
LiveRegUnits::accumulateUsedDefed(*MI, ModifiedRegUnits, UsedRegUnits,		return false;
TRI);
continue;
}
assert((SuccBB->pred_size() == 1 && *SuccBB->pred_begin() == &CurBB) &&		assert((SuccBB->pred_size() == 1 && *SuccBB->pred_begin() == &CurBB) &&
"Unexpected predecessor");

		"Unexpected predecessor");
// Clear the kill flag if SrcReg is killed between MI and the end of the		// Clear the kill flag if SrcReg is killed between MI and the end of the
// block.		// block.
clearKillFlags(MI, CurBB, UsedOpsInCopy, UsedRegUnits, TRI);		clearKillFlags(MI, CurBB, UsedOpsInCopy, UsedRegUnits, TRI);

MachineBasicBlock::iterator InsertPos = SuccBB->getFirstNonPHI();		MachineBasicBlock::iterator InsertPos = SuccBB->getFirstNonPHI();
		thegamegUnsubmitted Not Done Reply Inline Actions From what I see, `hasLoadFromStackSlot` checks for `MachineMemoryOperand`s attached to the MI. It looks for things like `:: (load 8 from %stack.0)`, and if the load source is a fixed stack it grabs the FI. I wonder what happens if an instruction: MI.mayLoad == true any_of(MI.operands(), isFI) == true but MI.machinememoperands().empty() == true I've been confused about `MachineMemoryOperand`s for a while now, so I'm not sure if an instruction with no MMOs should act like a barrier or something else. Are there any other passes that rely on this? What do you think? I think if we can rely on MMOs it would be great, and we should document it somewhere. This shouldn't be blocking for this patch, but if you can document these assumptions it would be great. thegameg: From what I see, `hasLoadFromStackSlot` checks for `MachineMemoryOperand`s attached to the MI.
		thegamegUnsubmitted Not Done Reply Inline Actions Hmm actually, why not use `isLoadFromStackSlot` here as you do for stores? thegameg: Hmm actually, why not use `isLoadFromStackSlot` here as you do for stores?
		thegamegUnsubmitted Not Done Reply Inline Actions Slightly related, I've been wanting to add this to the MachineVerifier. It would be nice if mayLoad / mayStore instructions don't pass the verifier on some conditions like: has no MMO has MMO == FixedStack but no (fixed) FI operands has MMO != FixedStack but has FI operands probably more we can check here thegameg: Slightly related, I've been wanting to add this to the MachineVerifier. It would be nice if…
		junbumlAuthorUnsubmitted Not Done Reply Inline Actions From what I see, hasLoadFromStackSlot checks for MachineMemoryOperands attached to the MI. It looks for things >like :: (load 8 from %stack.0), and if the load source is a fixed stack it grabs the FI. I wonder what happens if an instruction: MI.mayLoad == true any_of(MI.operands(), isFI) == true but MI.machinememoperands().empty() == true I've been confused about MachineMemoryOperands for a while now, so I'm not sure if an instruction with no MMOs >should act like a barrier or something else. Are there any other passes that rely on this? What do you think? I think if we can rely on MMOs it would be great, and we should document it somewhere. This shouldn't be blocking for this patch, but if you can document these assumptions it would be great. Thanks for bring this up. I can see similary code in InstructionStoresToFI() @MachineLICM. If MMO is empty, I conservatively assume an instruction load from some FI. Slightly related, I've been wanting to add this to the MachineVerifier. It would be nice if mayLoad / mayStore instructions don't pass the verifier on some conditions like: has no MMO has MMO == FixedStack but no (fixed) FI operands has MMO != FixedStack but has FI operands probably more we can check here I agree with this check and I will be happy to do it. I will update you for this. junbuml: >From what I see, hasLoadFromStackSlot checks for MachineMemoryOperands attached to the MI. It…
SuccBB->splice(InsertPos, &CurBB, MI);		SuccBB->splice(InsertPos, &CurBB, MI);
updateLiveIn(MI, SuccBB, UsedOpsInCopy, DefedRegsInCopy);		updateLiveIn(MI, SuccBB, UsedOpsInCopy, DefedRegsInCopy);

Changed = true;
++NumPostRACopySink;		++NumPostRACopySink;
		return true;
		}

		bool PostRAMachineSinking::tryToSink(MachineBasicBlock &CurBB,
		bool EnableSinkSpill) {
		SmallPtrSet<MachineBasicBlock *, 2> SinkableBBs;
		// FIXME: For now, we sink only to a successor which has a single predecessor
		// so that we can directly sink instructions to the successor without adding
		// any new block or branch instruction.
		for (MachineBasicBlock *SI : CurBB.successors())
		if (SI->pred_size() == 1)
		SinkableBBs.insert(SI);

		if (SinkableBBs.empty())
		return false;

		bool Changed = false;

		// Track which registers have been modified and used between the end of the
		// block and the current instruction.
		ModifiedRegUnits.clear();
		UsedRegUnits.clear();

		for (auto I = CurBB.rbegin(), E = CurBB.rend(); I != E;) {
		MachineInstr &MI = *I;
		++I;

		// FIXME: It would be safe to use regmask and implicit operands to across a
		thegamegUnsubmitted Not Done Reply Inline Actions Actually, why don't we sink across function calls? I would assume the regmasks and the implicit operands would be enough to keep it safe, but I might be wrong. thegameg: Actually, why don't we sink across function calls? I would assume the regmasks and the implicit…
		junbumlAuthorUnsubmitted Not Done Reply Inline Actions I agree that we can use regmask and implicit operands, but I'm not perfectly clear if it's safe in all targets and calling conventions. I add FIXME about it. Another issue here is that this also skip DBG_VALUE, causing different code being generated with -g. A fix is being discussed in https://reviews.llvm.org/D45878. junbuml: I agree that we can use regmask and implicit operands, but I'm not perfectly clear if it's safe…
		// function call. However, for now we do not allow moving any instruction
		// across function call as we are not perfectly clear if it's safe in all
		// targets and calling conventions.
		if (MI.isCall())
		return Changed;

		if (EnableSinkSpill && MI.mayStore()) {
		if (tryToSinkSpill(MI, CurBB, SinkableBBs)) {
		Changed = true;
		continue;
		}
		} else if (MI.isCopy()) {
		if (tryToSinkCopy(MI, CurBB, SinkableBBs)) {
		Changed = true;
		continue;
		}
		}
		LiveRegUnits::accumulateUsedDefed(MI, ModifiedRegUnits, UsedRegUnits,
		TRI);
}		}
return Changed;		return Changed;
}		}

bool PostRAMachineSinking::runOnMachineFunction(MachineFunction &MF) {		bool PostRAMachineSinking::runOnMachineFunction(MachineFunction &MF) {
bool Changed = false;		bool Changed = false;
const TargetRegisterInfo *TRI = MF.getSubtarget().getRegisterInfo();		TRI = MF.getSubtarget().getRegisterInfo();
const TargetInstrInfo *TII = MF.getSubtarget().getInstrInfo();		TII = MF.getSubtarget().getInstrInfo();
		MFI = &MF.getFrameInfo();
		MDT = &getAnalysis<MachineDominatorTree>();

ModifiedRegUnits.init(*TRI);		ModifiedRegUnits.init(*TRI);
UsedRegUnits.init(*TRI);		UsedRegUnits.init(*TRI);
for (auto &BB : MF)
Changed \|= tryToSinkCopy(BB, MF, TRI, TII);


		bool EnableSinkSpill = scanLoadFromStackSlot(MF);
		for (auto &BB : MF)
		Changed \|= tryToSink(BB, EnableSinkSpill);
return Changed;		return Changed;
}		}

test/CodeGen/AArch64/O3-pipeline.ll

	Show First 20 Lines • Show All 118 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: Machine Optimization Remark Emitter			; CHECK-NEXT: Machine Optimization Remark Emitter
	; CHECK-NEXT: Greedy Register Allocator			; CHECK-NEXT: Greedy Register Allocator
	; CHECK-NEXT: Virtual Register Rewriter			; CHECK-NEXT: Virtual Register Rewriter
	; CHECK-NEXT: Stack Slot Coloring			; CHECK-NEXT: Stack Slot Coloring
	; CHECK-NEXT: Machine Copy Propagation Pass			; CHECK-NEXT: Machine Copy Propagation Pass
	; CHECK-NEXT: Machine Loop Invariant Code Motion			; CHECK-NEXT: Machine Loop Invariant Code Motion
	; CHECK-NEXT: AArch64 Redundant Copy Elimination			; CHECK-NEXT: AArch64 Redundant Copy Elimination
	; CHECK-NEXT: A57 FP Anti-dependency breaker			; CHECK-NEXT: A57 FP Anti-dependency breaker
	; CHECK-NEXT: PostRA Machine Sink
	; CHECK-NEXT: MachineDominator Tree Construction			; CHECK-NEXT: MachineDominator Tree Construction
				; CHECK-NEXT: PostRA Machine Sink
	; CHECK-NEXT: Machine Natural Loop Construction			; CHECK-NEXT: Machine Natural Loop Construction
	; CHECK-NEXT: Machine Block Frequency Analysis			; CHECK-NEXT: Machine Block Frequency Analysis
	; CHECK-NEXT: MachinePostDominator Tree Construction			; CHECK-NEXT: MachinePostDominator Tree Construction
	; CHECK-NEXT: Shrink Wrapping analysis			; CHECK-NEXT: Shrink Wrapping analysis
	; CHECK-NEXT: Lazy Machine Block Frequency Analysis			; CHECK-NEXT: Lazy Machine Block Frequency Analysis
	; CHECK-NEXT: Machine Optimization Remark Emitter			; CHECK-NEXT: Machine Optimization Remark Emitter
	; CHECK-NEXT: Prologue/Epilogue Insertion & Frame Finalization			; CHECK-NEXT: Prologue/Epilogue Insertion & Frame Finalization
	; CHECK-NEXT: Control Flow Optimizer			; CHECK-NEXT: Control Flow Optimizer
	Show All 31 Lines

test/CodeGen/AArch64/post-ra-machine-sink-spill.mir

This file was added.

				# RUN: llc -mtriple=aarch64-none-linux-gnu -run-pass=postra-machine-sink -verify-machineinstrs -o - %s \| FileCheck %s

				---
				# Sink the store to stack.0 to %bb.1.
				# CHECK-LABEL: name: sinkspill1
				# CHECK-LABEL: bb.0:
				# CHECK-NOT: STRXui $x0, %stack.0
				# CHECK-LABEL: bb.1:
				# CHECK: liveins: $x0
				# CHECK: STRXui $x0, %stack.0
				name: sinkspill1
				tracksRegLiveness: true
				stack:
				- { id: 0, name: '', type: spill-slot, offset: 0, size: 8, alignment: 8,
				stack-id: 0, callee-saved-register: '', callee-saved-restored: true }
				body: \|
				bb.0:
				liveins: $x0 , $w1
				$w1 = SUBSWri $w1, 1, 0, implicit-def $nzcv
				STRXui $x0, %stack.0, 0 :: (store 8 into %stack.0)
				Bcc 11, %bb.1, implicit $nzcv
				B %bb.2

				bb.1:
				$x0 = LDRXui %stack.0, 0 :: (load 8 from %stack.0)
				RET $x0

				bb.2:
				$x0 = COPY $xzr
				RET $x0
				...

				---
				# Sink the store to stack.0 to %bb.2.
				# CHECK-LABEL: name: sinkspill2
				# CHECK-LABEL: bb.0:
				# CHECK-NOT: STRXui $x0, %stack.0
				# CHECK-LABEL: bb.2:
				# CHECK: liveins:{{.*}} $x0
				# CHECK: STRXui $x0, %stack.0
				name: sinkspill2
				tracksRegLiveness: true
				stack:
				- { id: 0, name: '', type: spill-slot, offset: 0, size: 8, alignment: 8,
				stack-id: 0, callee-saved-register: '', callee-saved-restored: true }
				body: \|
				bb.0:
				liveins: $x0, $w1
				$w1 = SUBSWri $w1, 1, 0, implicit-def $nzcv
				STRXui $x0, %stack.0, 0 :: (store 8 into %stack.0)
				Bcc 11, %bb.2, implicit $nzcv

				bb.1:
				liveins: $x0
				RET $x0

				bb.2:
				liveins: $x1
				$x0 = ADDXrr $x1, killed $x1

				bb.3:
				$x0 = LDRXui %stack.0, 0 :: (load 8 from %stack.0)
				RET $x0
				...

				---
				# Sink the store to stack.0 to %bb.3.
				# CHECK-LABEL: name: sinkspill3
				# CHECK-LABEL: bb.0:
				# CHECK-NOT: STRXui $x0, %stack.0
				# CHECK-LABEL: bb.2:
				# CHECK: liveins:{{.*}} $x0
				# CHECK-LABEL: bb.3:
				# CHECK: liveins:{{.*}} $x0
				# CHECK: STRXui $x0, %stack.0
				name: sinkspill3
				tracksRegLiveness: true
				stack:
				- { id: 0, name: '', type: spill-slot, offset: 0, size: 8, alignment: 8,
				stack-id: 0, callee-saved-register: '', callee-saved-restored: true }
				body: \|
				bb.0:
				liveins: $x0, $w1
				$w1 = SUBSWri $w1, 1, 0, implicit-def $nzcv
				STRXui $x0, %stack.0, 0 :: (store 8 into %stack.0)
				Bcc 11, %bb.2, implicit $nzcv

				bb.1:
				liveins: $x0
				RET $x0

				bb.2:
				liveins: $x1
				$x1 = ADDXrr $x1, killed $x1

				bb.3:
				$x0 = LDRXui %stack.0, 0 :: (load 8 from %stack.0)
				RET $x0
				...

				---
				# Keep the store to stack.0 in bb.0.
				# CHECK-LABEL: name: donotsinkspill1
				# CHECK-LABEL: bb.0:
				# CHECK: STRXui $x0, %stack.0
				name: donotsinkspill1
				tracksRegLiveness: true
				stack:
				- { id: 0, name: '', type: spill-slot, offset: 0, size: 8, alignment: 8,
				stack-id: 0, callee-saved-register: '', callee-saved-restored: true }
				body: \|
				bb.0:
				liveins: $x0 , $w1
				$w1 = SUBSWri $w1, 1, 0, implicit-def $nzcv
				STRXui $x0, %stack.0, 0 :: (store 8 into %stack.0)
				Bcc 11, %bb.1, implicit $nzcv
				B %bb.2

				bb.1:
				$x0 = LDRXui %stack.0, 0 :: (load 8 from %stack.0)
				RET $x0

				bb.2:
				$x0 = LDRXui %stack.0, 0 :: (load 8 from %stack.0)
				RET $x0
				...

				---
				# Keep the store to stack.0 in bb.0 due to register dependency on x0.
				# CHECK-LABEL: name: donotsinkspill2
				# CHECK-LABEL: bb.0:
				# CHECK: STRXui $x0, %stack.0
				name: donotsinkspill2
				tracksRegLiveness: true
				stack:
				- { id: 0, name: '', type: spill-slot, offset: 0, size: 8, alignment: 8,
				stack-id: 0, callee-saved-register: '', callee-saved-restored: true }
				body: \|
				bb.0:
				liveins: $x0 , $w1
				$w1 = SUBSWri $w1, 1, 0, implicit-def $nzcv
				STRXui $x0, %stack.0, 0 :: (store 8 into %stack.0)
				$x0 = ADDXrr $x1, killed $x1
				Bcc 11, %bb.1, implicit $nzcv
				B %bb.2

				bb.1:
				$x0 = LDRXui %stack.0, 0 :: (load 8 from %stack.0)
				RET $x0

				bb.2:
				$x0 = COPY $xzr
				RET $x0
				...

				---
				# Keep the store to stack.0 in bb.0 due to LDRXui in bb.0.
				# CHECK-LABEL: name: donotsinkspill3
				# CHECK-LABEL: bb.0:
				# CHECK: STRXui $x0, %stack.0
				name: donotsinkspill3
				tracksRegLiveness: true
				stack:
				- { id: 0, name: '', type: spill-slot, offset: 0, size: 8, alignment: 8,
				stack-id: 0, callee-saved-register: '', callee-saved-restored: true }
				body: \|
				bb.0:
				liveins: $x0 , $w1
				$w1 = SUBSWri $w1, 1, 0, implicit-def $nzcv
				STRXui $x0, %stack.0, 0 :: (store 8 into %stack.0)
				$x0 = LDRXui %stack.0, 0 :: (load 8 from %stack.0)
				Bcc 11, %bb.1, implicit $nzcv
				B %bb.2

				bb.1:
				$x0 = LDRXui %stack.0, 0 :: (load 8 from %stack.0)
				RET $x0

				bb.2:
				$x0 = COPY $xzr
				RET $x0
				...

test/CodeGen/PowerPC/aggressive-anti-dep-breaker-subreg.ll

	; RUN: llc -verify-machineinstrs %s -mtriple=powerpc64-unknown-linux-gnu -O2 -o - -optimize-regalloc=false -regalloc=fast \| FileCheck %s			; RUN: llc -verify-machineinstrs %s -mtriple=powerpc64-unknown-linux-gnu -O2 -o - -optimize-regalloc=false -regalloc=fast \| FileCheck %s

	declare void @func(i8*, i64, i64)			declare void @func(i8*, i64, i64)

	define void @test(i8* %context, i32** %elementArrayPtr, i32 %value) {			define void @test(i8* %context, i32** %elementArrayPtr, i32 %value) {
	entry:			entry:
	%cmp = icmp eq i32 %value, 0			%cmp = icmp eq i32 %value, 0
	br i1 %cmp, label %lreturn, label %lnext			br i1 %cmp, label %lreturn, label %lnext

	lnext:			lnext:
	%elementArray = load i32, i32* %elementArrayPtr, align 8			%elementArray = load i32, i32* %elementArrayPtr, align 8
	; CHECK: lwz [[LDREG:[0-9]+]], 124(1) # 4-byte Folded Reload
	; CHECK: # implicit-def: $x[[TEMPREG:[0-9]+]]			; CHECK: # implicit-def: $x[[TEMPREG:[0-9]+]]
				; CHECK: lwz [[LDREG:[0-9]+]], 124(1) # 4-byte Folded Reload
	%element = load i32, i32* %elementArray, align 4			%element = load i32, i32* %elementArray, align 4
	; CHECK: mr [[TEMPREG]], [[LDREG]]			; CHECK: mr [[TEMPREG]], [[LDREG]]
	; CHECK: clrldi 4, [[TEMPREG]], 32			; CHECK: clrldi 4, [[TEMPREG]], 32
	%element.ext = zext i32 %element to i64			%element.ext = zext i32 %element to i64
	%value.ext = zext i32 %value to i64			%value.ext = zext i32 %value to i64
	call void @func(i8* %context, i64 %value.ext, i64 %element.ext)			call void @func(i8* %context, i64 %value.ext, i64 %element.ext)
	br label %lreturn			br label %lreturn

	lreturn:			lreturn:
	ret void			ret void
	}			}

test/CodeGen/PowerPC/cc.ll

	Show All 20 Lines
	; CHECK: mfcr [[REG1:[0-9]+]]			; CHECK: mfcr [[REG1:[0-9]+]]
	; CHECK-DAG: cmpd			; CHECK-DAG: cmpd
	; CHECK-DAG: mfocrf [[REG2:[0-9]+]],			; CHECK-DAG: mfocrf [[REG2:[0-9]+]],
	; CHECK-DAG: stw [[REG1]], 8(1)			; CHECK-DAG: stw [[REG1]], 8(1)
	; CHECK-DAG: stw [[REG2]], -4(1)			; CHECK-DAG: stw [[REG2]], -4(1)

	; CHECK: sc			; CHECK: sc
	; CHECK: lwz [[REG3:[0-9]+]], -4(1)			; CHECK: lwz [[REG3:[0-9]+]], -4(1)
				; CHECK: lwz [[REG4:[0-9]+]], 8(1)
	; CHECK: mtocrf 128, [[REG3]]			; CHECK: mtocrf 128, [[REG3]]

	; CHECK: lwz [[REG4:[0-9]+]], 8(1)
	; CHECK-DAG: mtocrf 32, [[REG4]]			; CHECK-DAG: mtocrf 32, [[REG4]]
	; CHECK-DAG: mtocrf 16, [[REG4]]			; CHECK-DAG: mtocrf 16, [[REG4]]
	; CHECK-DAG: mtocrf 8, [[REG4]]			; CHECK-DAG: mtocrf 8, [[REG4]]
	; CHECK: blr			; CHECK: blr
	}			}

	define i64 @test2(i64 %a, i64 %b) {			define i64 @test2(i64 %a, i64 %b) {
	entry:			entry:
	Show All 14 Lines
	; CHECK: mfcr [[REG1:[0-9]+]]			; CHECK: mfcr [[REG1:[0-9]+]]
	; CHECK-DAG: cmpd			; CHECK-DAG: cmpd
	; CHECK-DAG: mfocrf [[REG2:[0-9]+]],			; CHECK-DAG: mfocrf [[REG2:[0-9]+]],
	; CHECK-DAG: stw [[REG1]], 8(1)			; CHECK-DAG: stw [[REG1]], 8(1)
	; CHECK-DAG: stw [[REG2]], -4(1)			; CHECK-DAG: stw [[REG2]], -4(1)

	; CHECK: sc			; CHECK: sc
	; CHECK: lwz [[REG3:[0-9]+]], -4(1)			; CHECK: lwz [[REG3:[0-9]+]], -4(1)
				; CHECK: lwz [[REG4:[0-9]+]], 8(1)
	; CHECK: mtocrf 128, [[REG3]]			; CHECK: mtocrf 128, [[REG3]]

	; CHECK: lwz [[REG4:[0-9]+]], 8(1)
	; CHECK-DAG: mtocrf 32, [[REG4]]			; CHECK-DAG: mtocrf 32, [[REG4]]
	; CHECK-DAG: mtocrf 16, [[REG4]]			; CHECK-DAG: mtocrf 16, [[REG4]]
	; CHECK-DAG: mtocrf 8, [[REG4]]			; CHECK-DAG: mtocrf 8, [[REG4]]
	; CHECK: blr			; CHECK: blr
	}			}

test/CodeGen/X86/catchret-regmask.ll

	Show First 20 Lines • Show All 56 Lines • ▼ Show 20 Lines

	return:			return:
	ret i8* %val			ret i8* %val
	}			}

	; CHECK-LABEL: spill_in_pad: # @spill_in_pad			; CHECK-LABEL: spill_in_pad: # @spill_in_pad
	; CHECK: callq throw			; CHECK: callq throw
	; CHECK: ud2			; CHECK: ud2
	; CHECK: movq -[[val_slot:[0-9]+]](%rbp), %rax # 8-byte Reload			; CHECK: movq %rax, -[[val_slot:[0-9]+]](%rbp) # 8-byte Spill
				; CHECK: movq -[[val_slot]](%rbp), %rax # 8-byte Reload
	; CHECK: retq			; CHECK: retq

	; CHECK: "?catch$3@?0?spill_in_pad@4HA":			; CHECK: "?catch$3@?0?spill_in_pad@4HA":
	; CHECK: callq getval			; CHECK: callq getval
	; CHECK: movq %rax, -[[val_slot]](%rbp) # 8-byte Spill
	; CHECK: retq			; CHECK: retq

	attributes #0 = { uwtable }			attributes #0 = { uwtable }