This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/
-
llvm/
-
CodeGen/
-
Passes.h
-
InitializePasses.h
-
lib/
-
CodeGen/
-
CodeGen.cpp
6
PrologEpilogInserter.cpp
-
Target/WebAssembly/
-
WebAssembly/
-
WebAssemblyTargetMachine.cpp

Differential D16481

Split PrologEpilogInserter into 2 parts
AbandonedPublic

Authored by dschuff on Jan 22 2016, 12:10 PM.

Download Raw Diff

Details

Reviewers

qcolombet

Summary

The base part finalizes frame layout, inserts prolog/epilog code, and
eliminates FrameIndex operands. It supports virtual registers but does not
support callee-saved registers or scavenging
The CSR-supporting part handles spilling of callee-saved registers and
scavenging, but does not support virtual registers.

Diff Detail

Event Timeline

dschuff updated this revision to Diff 45723.Jan 22 2016, 12:10 PM

dschuff retitled this revision from to Split PrologEpilogInserter into 2 parts.

dschuff updated this object.

dschuff added a reviewer: qcolombet.

dschuff added a subscriber: llvm-commits.

Herald added subscribers: dschuff, jfb. · View Herald TranscriptJan 22 2016, 12:10 PM

hfinkel added a subscriber: hfinkel.Feb 3 2016, 6:41 AM

hfinkel added inline comments.

lib/CodeGen/PrologEpilogInserter.cpp
264	Did we decide that we need to split the passes this way? It seems like the boilerplate from having two passes seems >= the difference between this runOnMachineFunction implementation and the simpler version.
356	Does having > INT_MAX FIs really work with just these changes?

dschuff added inline comments.Feb 4 2016, 1:43 PM

lib/CodeGen/PrologEpilogInserter.cpp
264	The motivation for this change was the really strong desire expressed to have clear separation at the pass level between passes that support virtual registers and passes that do not. (see also http://reviews.llvm.org/D16483). If we implement that, then we have to have a PEI pass that supports virtual registers. Of course we could just declare that this PEI pass supports virtual registers. From my own experience and auditing of the code, it seems to work fine for WebAssembly. The subsets that WebAssembly uses (i.e. the parts that I've broken out here into the base PEI) are the only parts that actually run when the target returns an empty set of CSRs; I've basically just made that distinction explicit in this change. I'd be interested in getting an opinion from @qcolombet based on this more concrete change.
356	No, this change was just to make the types (signed vs unsigned) consistent (see also line 664 below), and, if there's going to be a sentinel value, it might as well be the max for the actual type used (unsigned) instead of a different type.

dschuff mentioned this in D16483: Require MachineFunctionPasses to declare their support for virtual registers..Feb 4 2016, 1:45 PM

Hi Derek,

I am behind on reviews and I am still fighting to catching up.
I have one high level comment though, see the inlined comment.

Cheers,
-Quentin

lib/CodeGen/PrologEpilogInserter.cpp
77	It feels wrong to me that the high level class needs to have this field.

dschuff added inline comments.Mar 18 2016, 1:26 PM

lib/CodeGen/PrologEpilogInserter.cpp
77	I kind of agree. There are a couple of reasons: `TargetFrameLowering::processFunctionBeforeFrameFinalized()` is called by the base `PEI::finalizeFrame()` and it takes an RS pointer because this is the point where targets (PPC) create a stack slot for the RS to spill into. We could maybe work around that by adding a virtual method on the base class that calls `processFunctionBeforeFrameFinalized()` with a nullptr and with the real RS in the derived class. The base `PEI::calculateFrameObjectOffsets()` takes the scavenging spill slots into account when it lays out the stack frame, and uses the RS to find out where they are. (Likewise it also handles the CSR spill slots, which in principle the base class doesn't need to do). This seems trickier to work around. We could move the code that actually uses RS out to some other function but the logic of where it goes in the frame would still be local. The base `PEI::replaceFrameIndices()` keeps the RS up to date as it goes and passes RS to `TargetRegisterInfo::eliminateFrameIndex()` which is of course where it might actually be wanted by a target. The root sources of this are that the stack frame layout (which is implemented in this pass, and is always needed) has to be aware of both CSRs and scavenging; and the target hooks (which are also always used) might need RS in any case. We could try to move the RS/CSR frame layout code and the RS updating code out into some kind of other abstraction, but the RS class is itself essentially the abstraction that you'd want.

Another option might be to use a similar factoring (which splits the code that must deal with virtual registers out from the code that need not), but not actually create 2 different MachineFunctionPasses. We could say that the pass itself supports virtual registers, but we'd have better organization and clearer separation.

Hi Derek,

Another option might be to use a similar factoring (which splits the code that must deal with virtual registers out from the code that need not), but not actually create 2 different MachineFunctionPasses. We could say that the pass itself supports virtual registers, but we'd have better organization and clearer separation.

This may be the cleanest solution.

Could you prepare a patch for that direction?

Thanks,
-Quentin

http://reviews.llvm.org/D18366 has a change to tweak the organization and naming in PEI, with better-targeted asserts. We could also go a bit further and split the calls that are currently in runOnMachineFunction into different functions the way we have e.g. finalizeframe() here.

Abandoned in favor of http://reviews.llvm.org/D18366

Revision Contents

Path

Size

include/

llvm/

CodeGen/

Passes.h

6 lines

InitializePasses.h

1 line

lib/

CodeGen/

CodeGen.cpp

1 line

PrologEpilogInserter.cpp

185 lines

Target/

WebAssembly/

WebAssemblyTargetMachine.cpp

14 lines

Diff 45723

include/llvm/CodeGen/Passes.h

Show First 20 Lines • Show All 475 Lines • ▼ Show 20 Lines	/// MachineDominanaceFrontier - This pass is a machine dominators analysis pass.
/// Quadratic Prograaming (PBQP) based register allocator.		/// Quadratic Prograaming (PBQP) based register allocator.
///		///
FunctionPass *createDefaultPBQPRegisterAllocator();		FunctionPass *createDefaultPBQPRegisterAllocator();

/// PrologEpilogCodeInserter - This pass inserts prolog and epilog code,		/// PrologEpilogCodeInserter - This pass inserts prolog and epilog code,
/// and eliminates abstract frame references.		/// and eliminates abstract frame references.
extern char &PrologEpilogCodeInserterID;		extern char &PrologEpilogCodeInserterID;

		/// Prolog/Epilog inserter without CSRs - A version of PEI that does not use
		/// or support callee-saved registers or scavenging, but does support use of
		/// virtual registers instead of only physical registers. Targets can
		/// substitute it for the default PrologEpilogCodeInserterID.
		extern char &VRegPrologEpilogCodeInserterID;

/// ExpandPostRAPseudos - This pass expands pseudo instructions after		/// ExpandPostRAPseudos - This pass expands pseudo instructions after
/// register allocation.		/// register allocation.
extern char &ExpandPostRAPseudosID;		extern char &ExpandPostRAPseudosID;

/// createPostRAScheduler - This pass performs post register allocation		/// createPostRAScheduler - This pass performs post register allocation
/// scheduling.		/// scheduling.
extern char &PostRASchedulerID;		extern char &PostRASchedulerID;

▲ Show 20 Lines • Show All 194 Lines • Show Last 20 Lines

include/llvm/InitializePasses.h

	Show First 20 Lines • Show All 213 Lines • ▼ Show 20 Lines
	void initializeObjCARCAPElimPass(PassRegistry&);			void initializeObjCARCAPElimPass(PassRegistry&);
	void initializeObjCARCExpandPass(PassRegistry&);			void initializeObjCARCExpandPass(PassRegistry&);
	void initializeObjCARCContractPass(PassRegistry&);			void initializeObjCARCContractPass(PassRegistry&);
	void initializeObjCARCOptPass(PassRegistry&);			void initializeObjCARCOptPass(PassRegistry&);
	void initializePAEvalPass(PassRegistry &);			void initializePAEvalPass(PassRegistry &);
	void initializeOptimizePHIsPass(PassRegistry&);			void initializeOptimizePHIsPass(PassRegistry&);
	void initializePartiallyInlineLibCallsPass(PassRegistry&);			void initializePartiallyInlineLibCallsPass(PassRegistry&);
	void initializePEIPass(PassRegistry&);			void initializePEIPass(PassRegistry&);
				void initializePEIWithCSRsPass(PassRegistry&);
	void initializePHIEliminationPass(PassRegistry&);			void initializePHIEliminationPass(PassRegistry&);
	void initializePartialInlinerPass(PassRegistry&);			void initializePartialInlinerPass(PassRegistry&);
	void initializePeepholeOptimizerPass(PassRegistry&);			void initializePeepholeOptimizerPass(PassRegistry&);
	void initializePostDomOnlyPrinterPass(PassRegistry&);			void initializePostDomOnlyPrinterPass(PassRegistry&);
	void initializePostDomOnlyViewerPass(PassRegistry&);			void initializePostDomOnlyViewerPass(PassRegistry&);
	void initializePostDomPrinterPass(PassRegistry&);			void initializePostDomPrinterPass(PassRegistry&);
	void initializePostDomViewerPass(PassRegistry&);			void initializePostDomViewerPass(PassRegistry&);
	void initializePostDominatorTreePass(PassRegistry&);			void initializePostDominatorTreePass(PassRegistry&);
	▲ Show 20 Lines • Show All 87 Lines • Show Last 20 Lines

lib/CodeGen/CodeGen.cpp

Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	void llvm::initializeCodeGen(PassRegistry &Registry) {
initializeMachineLoopInfoPass(Registry);		initializeMachineLoopInfoPass(Registry);
initializeMachineModuleInfoPass(Registry);		initializeMachineModuleInfoPass(Registry);
initializeMachinePostDominatorTreePass(Registry);		initializeMachinePostDominatorTreePass(Registry);
initializeMachineSchedulerPass(Registry);		initializeMachineSchedulerPass(Registry);
initializeMachineSinkingPass(Registry);		initializeMachineSinkingPass(Registry);
initializeMachineVerifierPassPass(Registry);		initializeMachineVerifierPassPass(Registry);
initializeOptimizePHIsPass(Registry);		initializeOptimizePHIsPass(Registry);
initializePEIPass(Registry);		initializePEIPass(Registry);
		initializePEIWithCSRsPass(Registry);
initializePHIEliminationPass(Registry);		initializePHIEliminationPass(Registry);
initializePeepholeOptimizerPass(Registry);		initializePeepholeOptimizerPass(Registry);
initializePostMachineSchedulerPass(Registry);		initializePostMachineSchedulerPass(Registry);
initializePostRASchedulerPass(Registry);		initializePostRASchedulerPass(Registry);
initializeProcessImplicitDefsPass(Registry);		initializeProcessImplicitDefsPass(Registry);
initializeRegisterCoalescerPass(Registry);		initializeRegisterCoalescerPass(Registry);
initializeShrinkWrapPass(Registry);		initializeShrinkWrapPass(Registry);
initializeSlotIndexesPass(Registry);		initializeSlotIndexesPass(Registry);
Show All 19 Lines

lib/CodeGen/PrologEpilogInserter.cpp

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines
#include "llvm/Target/TargetSubtargetInfo.h"		#include "llvm/Target/TargetSubtargetInfo.h"
#include <climits>		#include <climits>

using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "pei"		#define DEBUG_TYPE "pei"

namespace {		namespace {
		// The PrologEpilogInserter base class performs frame finalization,
		// prolog/epilog code insertion, and FrameIndex elimination. It does not handle
		// callee-saved registers or scavenging, but it does support virtual registers
		// (i.e. it is suitable for virtual targets which do not use a register
		// allocator)
class PEI : public MachineFunctionPass {		class PEI : public MachineFunctionPass {
public:		public:
static char ID;		static char ID;
PEI() : MachineFunctionPass(ID) {		PEI() : PEI(ID) { initializePEIPass(*PassRegistry::getPassRegistry()); }
initializePEIPass(*PassRegistry::getPassRegistry());
}

void getAnalysisUsage(AnalysisUsage &AU) const override;		void getAnalysisUsage(AnalysisUsage &AU) const override;

/// runOnMachineFunction - Insert prolog/epilog code and replace abstract		/// runOnMachineFunction - Insert prolog/epilog code and replace abstract
/// frame indexes with appropriate references.		/// frame indexes with appropriate references.
///		///
bool runOnMachineFunction(MachineFunction &Fn) override;		bool runOnMachineFunction(MachineFunction &Fn) override;

private:		protected:
RegScavenger *RS;		PEI(char &SubclassID) : MachineFunctionPass(SubclassID) {}

		// Register scavenging is only supported in PEIWithCSRs and not in the base
		// class. However, an RS pointer (possibly null) must be passed to several
		// of the target hooks used in the base class and the logic for FI
		// replacement sometimes depends on it.
		RegScavenger *RS = nullptr;
		qcolombetUnsubmitted Not Done Reply Inline Actions It feels wrong to me that the high level class needs to have this field. qcolombet: It feels wrong to me that the high level class needs to have this field.
		dschuffAuthorUnsubmitted Not Done Reply Inline Actions I kind of agree. There are a couple of reasons: `TargetFrameLowering::processFunctionBeforeFrameFinalized()` is called by the base `PEI::finalizeFrame()` and it takes an RS pointer because this is the point where targets (PPC) create a stack slot for the RS to spill into. We could maybe work around that by adding a virtual method on the base class that calls `processFunctionBeforeFrameFinalized()` with a nullptr and with the real RS in the derived class. The base `PEI::calculateFrameObjectOffsets()` takes the scavenging spill slots into account when it lays out the stack frame, and uses the RS to find out where they are. (Likewise it also handles the CSR spill slots, which in principle the base class doesn't need to do). This seems trickier to work around. We could move the code that actually uses RS out to some other function but the logic of where it goes in the frame would still be local. The base `PEI::replaceFrameIndices()` keeps the RS up to date as it goes and passes RS to `TargetRegisterInfo::eliminateFrameIndex()` which is of course where it might actually be wanted by a target. The root sources of this are that the stack frame layout (which is implemented in this pass, and is always needed) has to be aware of both CSRs and scavenging; and the target hooks (which are also always used) might need RS in any case. We could try to move the RS/CSR frame layout code and the RS updating code out into some kind of other abstraction, but the RS class is itself essentially the abstraction that you'd want. dschuff: I kind of agree. There are a couple of reasons: 1. `TargetFrameLowering…

		// Flag to control whether to use the register scavenger to resolve
		// frame index materialization registers. Set according to
		// TRI->requiresFrameIndexScavenging() for the current function.
		bool FrameIndexVirtualScavenging = false;

// MinCSFrameIndex, MaxCSFrameIndex - Keeps the range of callee saved		// MinCSFrameIndex, MaxCSFrameIndex - Keeps the range of callee saved
// stack frame indexes.		// stack frame indexes.
unsigned MinCSFrameIndex, MaxCSFrameIndex;		unsigned MinCSFrameIndex = std::numeric_limits<unsigned>::max();
		unsigned MaxCSFrameIndex = 0;

// Save and Restore blocks of the current function. Typically there is a		// Save and Restore blocks of the current function. Typically there is a
// single save block, unless Windows EH funclets are involved.		// single save block, unless Windows EH funclets are involved.
SmallVector<MachineBasicBlock *, 1> SaveBlocks;		SmallVector<MachineBasicBlock *, 1> SaveBlocks;
SmallVector<MachineBasicBlock *, 4> RestoreBlocks;		SmallVector<MachineBasicBlock *, 4> RestoreBlocks;

// Flag to control whether to use the register scavenger to resolve		void init(MachineFunction &Fn);
// frame index materialization registers. Set according to
// TRI->requiresFrameIndexScavenging() for the current function.
bool FrameIndexVirtualScavenging;

void calculateSets(MachineFunction &Fn);		void calculateSets(MachineFunction &Fn);
void calculateCallsInformation(MachineFunction &Fn);		void calculateCallsInformation(MachineFunction &Fn);
void assignCalleeSavedSpillSlots(MachineFunction &Fn,
const BitVector &SavedRegs);		void finalizeFrame(MachineFunction &Fn);
void insertCSRSpillsAndRestores(MachineFunction &Fn);
void calculateFrameObjectOffsets(MachineFunction &Fn);		void calculateFrameObjectOffsets(MachineFunction &Fn);
void replaceFrameIndices(MachineFunction &Fn);		void replaceFrameIndices(MachineFunction &Fn);
void replaceFrameIndices(MachineBasicBlock *BB, MachineFunction &Fn,		void replaceFrameIndices(MachineBasicBlock *BB, MachineFunction &Fn,
int &SPAdj);		int &SPAdj);
void scavengeFrameVirtualRegs(MachineFunction &Fn);
void insertPrologEpilogCode(MachineFunction &Fn);		void insertPrologEpilogCode(MachineFunction &Fn);
};		};

		class PEIWithCSRs : public PEI {
		public:
		static char ID;
		PEIWithCSRs() : PEI(ID) {
		initializePEIWithCSRsPass(*PassRegistry::getPassRegistry());
		}
		/// runOnMachineFunction - Insert prolog/epilog code and replace abstract
		/// frame indexes with appropriate references.
		///
		bool runOnMachineFunction(MachineFunction &Fn) override;

		private:
		void assignCalleeSavedSpillSlots(MachineFunction &Fn,
		const BitVector &SavedRegs);
		void insertCSRSpillsAndRestores(MachineFunction &Fn);
		void scavengeFrameVirtualRegs(MachineFunction &Fn);
		};
} // namespace		} // namespace

char PEI::ID = 0;		char PEI::ID = 0;
char &llvm::PrologEpilogCodeInserterID = PEI::ID;		char PEIWithCSRs::ID = 0;
		char &llvm::PrologEpilogCodeInserterID = PEIWithCSRs::ID;
		char &llvm::VRegPrologEpilogCodeInserterID = PEI::ID;

static cl::opt<unsigned>		static cl::opt<unsigned>
WarnStackSize("warn-stack-size", cl::Hidden, cl::init((unsigned)-1),		WarnStackSize("warn-stack-size", cl::Hidden, cl::init((unsigned)-1),
cl::desc("Warn for stack size bigger than the given"		cl::desc("Warn for stack size bigger than the given"
" number"));		" number"));

INITIALIZE_PASS_BEGIN(PEI, "prologepilog",		INITIALIZE_PASS_BEGIN(PEI, "prologepilog-no-csrs",
		"Prologue/Epilogue Insertion (no CSRs)", false, false)
		INITIALIZE_PASS_DEPENDENCY(MachineLoopInfo)
		INITIALIZE_PASS_DEPENDENCY(MachineDominatorTree)
		INITIALIZE_PASS_DEPENDENCY(StackProtector)
		INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)
		INITIALIZE_PASS_END(
		PEI, "prologepilog-no-csrs",
		"Prologue/Epilogue Insertion & Frame Finalization (no CSRs)", false, false)

		INITIALIZE_PASS_BEGIN(PEIWithCSRs, "prologepilog",
"Prologue/Epilogue Insertion", false, false)		"Prologue/Epilogue Insertion", false, false)
INITIALIZE_PASS_DEPENDENCY(MachineLoopInfo)		INITIALIZE_PASS_DEPENDENCY(MachineLoopInfo)
INITIALIZE_PASS_DEPENDENCY(MachineDominatorTree)		INITIALIZE_PASS_DEPENDENCY(MachineDominatorTree)
INITIALIZE_PASS_DEPENDENCY(StackProtector)		INITIALIZE_PASS_DEPENDENCY(StackProtector)
INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)		INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)
INITIALIZE_PASS_END(PEI, "prologepilog",		INITIALIZE_PASS_END(PEIWithCSRs, "prologepilog",
"Prologue/Epilogue Insertion & Frame Finalization",		"Prologue/Epilogue Insertion & Frame Finalization", false,
false, false)		false)

STATISTIC(NumScavengedRegs, "Number of frame index regs scavenged");		STATISTIC(NumScavengedRegs, "Number of frame index regs scavenged");
STATISTIC(NumBytesStackSpace,		STATISTIC(NumBytesStackSpace,
"Number of bytes used for stack in all functions");		"Number of bytes used for stack in all functions");

void PEI::getAnalysisUsage(AnalysisUsage &AU) const {		void PEI::getAnalysisUsage(AnalysisUsage &AU) const {
AU.setPreservesCFG();		AU.setPreservesCFG();
AU.addPreserved<MachineLoopInfo>();		AU.addPreserved<MachineLoopInfo>();
Show All 32 Lines	for (MachineBasicBlock &MBB : Fn) {
if (MBB.isReturnBlock())		if (MBB.isReturnBlock())
RestoreBlocks.push_back(&MBB);		RestoreBlocks.push_back(&MBB);
}		}
}		}

/// StackObjSet - A set of stack object indexes		/// StackObjSet - A set of stack object indexes
typedef SmallSetVector<int, 8> StackObjSet;		typedef SmallSetVector<int, 8> StackObjSet;

/// runOnMachineFunction - Insert prolog/epilog code and replace abstract		void PEI::init(MachineFunction &Fn) {
/// frame indexes with appropriate references.
///
bool PEI::runOnMachineFunction(MachineFunction &Fn) {
const Function* F = Fn.getFunction();
const TargetRegisterInfo *TRI = Fn.getSubtarget().getRegisterInfo();
const TargetFrameLowering *TFI = Fn.getSubtarget().getFrameLowering();

assert(!Fn.getRegInfo().getNumVirtRegs() && "Regalloc must assign all vregs");

RS = TRI->requiresRegisterScavenging(Fn) ? new RegScavenger() : nullptr;
FrameIndexVirtualScavenging = TRI->requiresFrameIndexScavenging(Fn);

// Calculate the MaxCallFrameSize and AdjustsStack variables for the		// Calculate the MaxCallFrameSize and AdjustsStack variables for the
// function's frame information. Also eliminates call frame pseudo		// function's frame information. Also eliminates call frame pseudo
// instructions.		// instructions.
calculateCallsInformation(Fn);		calculateCallsInformation(Fn);

// Determine which of the registers in the callee save list should be saved.
BitVector SavedRegs;
TFI->determineCalleeSaves(Fn, SavedRegs, RS);

// Insert spill code for any callee saved registers that are modified.
assignCalleeSavedSpillSlots(Fn, SavedRegs);

// Determine placement of CSR spill/restore code:		// Determine placement of CSR spill/restore code:
// place all spills in the entry block, all restores in return blocks.		// place all spills in the entry block, all restores in return blocks.
calculateSets(Fn);		calculateSets(Fn);
		}

// Add the code to save and restore the callee saved registers.		/// runOnMachineFunction - Insert prolog/epilog code and replace abstract
if (!F->hasFnAttribute(Attribute::Naked))		/// frame indexes with appropriate references.
insertCSRSpillsAndRestores(Fn);		///
		bool PEI::runOnMachineFunction(MachineFunction &Fn) {
		assert(!RS && "Base PEI does not support scavenging");

		init(Fn);
		finalizeFrame(Fn);

		return true;
		}

		/// Finalize the frame layout, insert prolog and epilog code, and replace
		/// FrameIndex operands.
		void PEI::finalizeFrame(MachineFunction &Fn) {
		const Function *F = Fn.getFunction();
		const TargetFrameLowering *TFI = Fn.getSubtarget().getFrameLowering();

// Allow the target machine to make final modifications to the function		// Allow the target machine to make final modifications to the function
// before the frame layout is finalized.		// before the frame layout is finalized.
TFI->processFunctionBeforeFrameFinalized(Fn, RS);		TFI->processFunctionBeforeFrameFinalized(Fn, RS);

// Calculate actual frame offsets for all abstract stack objects...		// Calculate actual frame offsets for all abstract stack objects...
calculateFrameObjectOffsets(Fn);		calculateFrameObjectOffsets(Fn);

// Add prolog and epilog code to the function. This function is required		// Add prolog and epilog code to the function. This function is required
// to align the stack frame as necessary for any stack variables or		// to align the stack frame as necessary for any stack variables or
// called functions. Because of this, calculateCalleeSavedRegisters()		// called functions. Because of this, calculateCalleeSavedRegisters()
// must be called before this function in order to set the AdjustsStack		// must be called before this function in order to set the AdjustsStack
// and MaxCallFrameSize variables.		// and MaxCallFrameSize variables.
if (!F->hasFnAttribute(Attribute::Naked))		if (!F->hasFnAttribute(Attribute::Naked))
insertPrologEpilogCode(Fn);		insertPrologEpilogCode(Fn);

// Replace all MO_FrameIndex operands with physical register references		// Replace all MO_FrameIndex operands with physical register references
// and actual offsets.		// and actual offsets.
//		//
replaceFrameIndices(Fn);		replaceFrameIndices(Fn);

// If register scavenging is needed, as we've enabled doing it as a
// post-pass, scavenge the virtual registers that frame index elimination
// inserted.
if (TRI->requiresRegisterScavenging(Fn) && FrameIndexVirtualScavenging)
scavengeFrameVirtualRegs(Fn);

// Clear any vregs created by virtual scavenging.
Fn.getRegInfo().clearVirtRegs();

// Warn on stack size when we exceeds the given limit.		// Warn on stack size when we exceeds the given limit.
MachineFrameInfo *MFI = Fn.getFrameInfo();		MachineFrameInfo *MFI = Fn.getFrameInfo();
uint64_t StackSize = MFI->getStackSize();		uint64_t StackSize = MFI->getStackSize();
if (WarnStackSize.getNumOccurrences() > 0 && WarnStackSize < StackSize) {		if (WarnStackSize.getNumOccurrences() > 0 && WarnStackSize < StackSize) {
DiagnosticInfoStackSize DiagStackSize(*F, StackSize);		DiagnosticInfoStackSize DiagStackSize(*F, StackSize);
F->getContext().diagnose(DiagStackSize);		F->getContext().diagnose(DiagStackSize);
}		}

delete RS;
SaveBlocks.clear();		SaveBlocks.clear();
RestoreBlocks.clear();		RestoreBlocks.clear();
		}

		bool PEIWithCSRs::runOnMachineFunction(MachineFunction &Fn) {
		hfinkelUnsubmitted Not Done Reply Inline Actions Did we decide that we need to split the passes this way? It seems like the boilerplate from having two passes seems >= the difference between this runOnMachineFunction implementation and the simpler version. hfinkel: Did we decide that we need to split the passes this way? It seems like the boilerplate from…
		dschuffAuthorUnsubmitted Not Done Reply Inline Actions The motivation for this change was the really strong desire expressed to have clear separation at the pass level between passes that support virtual registers and passes that do not. (see also http://reviews.llvm.org/D16483). If we implement that, then we have to have a PEI pass that supports virtual registers. Of course we could just declare that this PEI pass supports virtual registers. From my own experience and auditing of the code, it seems to work fine for WebAssembly. The subsets that WebAssembly uses (i.e. the parts that I've broken out here into the base PEI) are the only parts that actually run when the target returns an empty set of CSRs; I've basically just made that distinction explicit in this change. I'd be interested in getting an opinion from @qcolombet based on this more concrete change. dschuff: The motivation for this change was the really strong desire expressed to have clear separation…
		const TargetFrameLowering *TFI = Fn.getSubtarget().getFrameLowering();
		const TargetRegisterInfo *TRI = Fn.getSubtarget().getRegisterInfo();
		RS = TRI->requiresRegisterScavenging(Fn) ? new RegScavenger() : nullptr;
		FrameIndexVirtualScavenging = TRI->requiresFrameIndexScavenging(Fn);

		assert(!Fn.getRegInfo().getNumVirtRegs() && "Regalloc must assign all vregs");

		init(Fn);

		// Determine which of the registers in the callee save list should be saved.
		BitVector SavedRegs;
		TFI->determineCalleeSaves(Fn, SavedRegs, RS);

		// Assign stack slots for any callee-saved registers that must be spilled.
		assignCalleeSavedSpillSlots(Fn, SavedRegs);

		// Add the code to save and restore the callee saved registers.
		if (!Fn.getFunction()->hasFnAttribute(Attribute::Naked))
		insertCSRSpillsAndRestores(Fn);

		finalizeFrame(Fn);

		// If register scavenging is needed, as we've enabled doing it as a
		// post-pass, scavenge the virtual registers that frame index elimination
		// inserted.
		if (RS && FrameIndexVirtualScavenging) scavengeFrameVirtualRegs(Fn);

		// Clear any vregs created by virtual scavenging.
		Fn.getRegInfo().clearVirtRegs();

		delete RS;
return true;		return true;
}		}

/// calculateCallsInformation - Calculate the MaxCallFrameSize and AdjustsStack		/// calculateCallsInformation - Calculate the MaxCallFrameSize and AdjustsStack
/// variables for the function's frame information and eliminate call frame		/// variables for the function's frame information and eliminate call frame
/// pseudo instructions.		/// pseudo instructions.
void PEI::calculateCallsInformation(MachineFunction &Fn) {		void PEI::calculateCallsInformation(MachineFunction &Fn) {
const TargetInstrInfo &TII = *Fn.getSubtarget().getInstrInfo();		const TargetInstrInfo &TII = *Fn.getSubtarget().getInstrInfo();
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines	for (std::vector<MachineBasicBlock::iterator>::iterator
// the target doesn't indicate otherwise, remove the call frame pseudos		// the target doesn't indicate otherwise, remove the call frame pseudos
// here. The sub/add sp instruction pairs are still inserted, but we don't		// here. The sub/add sp instruction pairs are still inserted, but we don't
// need to track the SP adjustment for frame index elimination.		// need to track the SP adjustment for frame index elimination.
if (TFI->canSimplifyCallFramePseudos(Fn))		if (TFI->canSimplifyCallFramePseudos(Fn))
TFI->eliminateCallFramePseudoInstr(Fn, *I->getParent(), I);		TFI->eliminateCallFramePseudoInstr(Fn, *I->getParent(), I);
}		}
}		}

void PEI::assignCalleeSavedSpillSlots(MachineFunction &F,		void PEIWithCSRs::assignCalleeSavedSpillSlots(MachineFunction &F,
const BitVector &SavedRegs) {		const BitVector &SavedRegs) {
// These are used to keep track the callee-save area. Initialize them.		// These are used to keep track the callee-save area. Initialize them.
MinCSFrameIndex = INT_MAX;		MinCSFrameIndex = std::numeric_limits<unsigned>::max();
		hfinkelUnsubmitted Not Done Reply Inline Actions Does having > INT_MAX FIs really work with just these changes? hfinkel: Does having > INT_MAX FIs really work with just these changes?
		dschuffAuthorUnsubmitted Not Done Reply Inline Actions No, this change was just to make the types (signed vs unsigned) consistent (see also line 664 below), and, if there's going to be a sentinel value, it might as well be the max for the actual type used (unsigned) instead of a different type. dschuff: No, this change was just to make the types (signed vs unsigned) consistent (see also line 664…
MaxCSFrameIndex = 0;		MaxCSFrameIndex = 0;

if (SavedRegs.empty())		if (SavedRegs.empty())
return;		return;

const TargetRegisterInfo *RegInfo = F.getSubtarget().getRegisterInfo();		const TargetRegisterInfo *RegInfo = F.getSubtarget().getRegisterInfo();
const MCPhysReg *CSRegs = RegInfo->getCalleeSavedRegs(&F);		const MCPhysReg *CSRegs = RegInfo->getCalleeSavedRegs(&F);

▲ Show 20 Lines • Show All 117 Lines • ▼ Show 20 Lines	for (MachineBasicBlock *MBB : Visited) {
MBB->addLiveIn(Reg);		MBB->addLiveIn(Reg);
}		}
}		}
}		}

/// insertCSRSpillsAndRestores - Insert spill and restore code for		/// insertCSRSpillsAndRestores - Insert spill and restore code for
/// callee saved registers used in the function.		/// callee saved registers used in the function.
///		///
void PEI::insertCSRSpillsAndRestores(MachineFunction &Fn) {		void PEIWithCSRs::insertCSRSpillsAndRestores(MachineFunction &Fn) {
// Get callee saved register information.		// Get callee saved register information.
MachineFrameInfo *MFI = Fn.getFrameInfo();		MachineFrameInfo *MFI = Fn.getFrameInfo();
const std::vector<CalleeSavedInfo> &CSI = MFI->getCalleeSavedInfo();		const std::vector<CalleeSavedInfo> &CSI = MFI->getCalleeSavedInfo();

MFI->setCalleeSavedInfoValid(true);		MFI->setCalleeSavedInfoValid(true);

// Early exit if no callee saved registers are modified!		// Early exit if no callee saved registers are modified!
if (CSI.empty())		if (CSI.empty())
▲ Show 20 Lines • Show All 157 Lines • ▼ Show 20 Lines	for (unsigned i = MinCSFrameIndex; i <= MaxCSFrameIndex; ++i) {

unsigned Align = MFI->getObjectAlignment(i);		unsigned Align = MFI->getObjectAlignment(i);
// Adjust to alignment boundary		// Adjust to alignment boundary
Offset = alignTo(Offset, Align, Skew);		Offset = alignTo(Offset, Align, Skew);

MFI->setObjectOffset(i, -Offset); // Set the computed offset		MFI->setObjectOffset(i, -Offset); // Set the computed offset
}		}
} else {		} else {
int MaxCSFI = MaxCSFrameIndex, MinCSFI = MinCSFrameIndex;		unsigned MaxCSFI = MaxCSFrameIndex, MinCSFI = MinCSFrameIndex;
for (int i = MaxCSFI; i >= MinCSFI ; --i) {		for (unsigned i = MaxCSFI; i >= MinCSFI; --i) {
unsigned Align = MFI->getObjectAlignment(i);		unsigned Align = MFI->getObjectAlignment(i);
// Adjust to alignment boundary		// Adjust to alignment boundary
Offset = alignTo(Offset, Align, Skew);		Offset = alignTo(Offset, Align, Skew);

MFI->setObjectOffset(i, Offset);		MFI->setObjectOffset(i, Offset);
Offset += MFI->getObjectSize(i);		Offset += MFI->getObjectSize(i);
}		}
}		}
▲ Show 20 Lines • Show All 346 Lines • ▼ Show 20 Lines

/// scavengeFrameVirtualRegs - Replace all frame index virtual registers		/// scavengeFrameVirtualRegs - Replace all frame index virtual registers
/// with physical registers. Use the register scavenger to find an		/// with physical registers. Use the register scavenger to find an
/// appropriate register to use.		/// appropriate register to use.
///		///
/// FIXME: Iterating over the instruction stream is unnecessary. We can simply		/// FIXME: Iterating over the instruction stream is unnecessary. We can simply
/// iterate over the vreg use list, which at this point only contains machine		/// iterate over the vreg use list, which at this point only contains machine
/// operands for which eliminateFrameIndex need a new scratch reg.		/// operands for which eliminateFrameIndex need a new scratch reg.
void		void PEIWithCSRs::scavengeFrameVirtualRegs(MachineFunction &Fn) {
PEI::scavengeFrameVirtualRegs(MachineFunction &Fn) {
// Run through the instructions and find any virtual registers.		// Run through the instructions and find any virtual registers.
for (MachineFunction::iterator BB = Fn.begin(),		for (MachineFunction::iterator BB = Fn.begin(),
E = Fn.end(); BB != E; ++BB) {		E = Fn.end(); BB != E; ++BB) {
RS->enterBasicBlock(&*BB);		RS->enterBasicBlock(&*BB);

int SPAdj = 0;		int SPAdj = 0;

// The instruction stream may change in the loop, so check BB->end()		// The instruction stream may change in the loop, so check BB->end()
Show All 35 Lines	for (MachineBasicBlock::iterator I = BB->begin(); I != BB->end(); ) {
unsigned ScratchReg = RS->scavengeRegister(RC, J, SPAdj);		unsigned ScratchReg = RS->scavengeRegister(RC, J, SPAdj);

++NumScavengedRegs;		++NumScavengedRegs;

// Replace this reference to the virtual register with the		// Replace this reference to the virtual register with the
// scratch register.		// scratch register.
assert (ScratchReg && "Missing scratch register!");		assert (ScratchReg && "Missing scratch register!");
Fn.getRegInfo().replaceRegWith(Reg, ScratchReg);		Fn.getRegInfo().replaceRegWith(Reg, ScratchReg);

// Because this instruction was processed by the RS before this		// Because this instruction was processed by the RS before this
// register was allocated, make sure that the RS now records the		// register was allocated, make sure that the RS now records the
// register as being used.		// register as being used.
RS->setRegUsed(ScratchReg);		RS->setRegUsed(ScratchReg);
}		}
}		}

// If the scavenger needed to use one of its spill slots, the		// If the scavenger needed to use one of its spill slots, the
Show All 21 Lines

lib/Target/WebAssembly/WebAssemblyTargetMachine.cpp

Show First 20 Lines • Show All 168 Lines • ▼ Show 20 Lines	void WebAssemblyPassConfig::addPreRegAlloc() {
if (getOptLevel() != CodeGenOpt::None)		if (getOptLevel() != CodeGenOpt::None)
addPass(createWebAssemblyStoreResults());		addPass(createWebAssemblyStoreResults());
}		}

void WebAssemblyPassConfig::addPostRegAlloc() {		void WebAssemblyPassConfig::addPostRegAlloc() {
// TODO: The following CodeGen passes don't currently support code containing		// TODO: The following CodeGen passes don't currently support code containing
// virtual registers. Consider removing their restrictions and re-enabling		// virtual registers. Consider removing their restrictions and re-enabling
// them.		// them.
//
// We use our own PrologEpilogInserter which is very slightly modified to		// Use the VReg version of PrologEpilogInserter which doesn't support CSRs
// tolerate virtual registers.		// but does support virtual registers.
disablePass(&PrologEpilogCodeInserterID);		// TODO: Delete WebAssemblyPEI code.
		substitutePass(&PrologEpilogCodeInserterID, &VRegPrologEpilogCodeInserterID);
// Fails with: should be run after register allocation.		// Fails with: should be run after register allocation.
disablePass(&MachineCopyPropagationID);		disablePass(&MachineCopyPropagationID);

// Mark registers as representing wasm's expression stack.		// Mark registers as representing wasm's expression stack.
addPass(createWebAssemblyRegStackify());		addPass(createWebAssemblyRegStackify());

// Run the register coloring pass to reduce the total number of registers.		// Run the register coloring pass to reduce the total number of registers.
addPass(createWebAssemblyRegColoring());		addPass(createWebAssemblyRegColoring());

TargetPassConfig::addPostRegAlloc();		TargetPassConfig::addPostRegAlloc();

// Run WebAssembly's version of the PrologEpilogInserter. Target-independent
// PEI runs after PostRegAlloc and after ShrinkWrap. Putting it here will run
// PEI before ShrinkWrap but otherwise in the same position in the order.
addPass(createWebAssemblyPEI());
}		}

void WebAssemblyPassConfig::addPreEmitPass() {		void WebAssemblyPassConfig::addPreEmitPass() {
TargetPassConfig::addPreEmitPass();		TargetPassConfig::addPreEmitPass();

// Put the CFG in structured form; insert BLOCK and LOOP markers.		// Put the CFG in structured form; insert BLOCK and LOOP markers.
addPass(createWebAssemblyCFGStackify());		addPass(createWebAssemblyCFGStackify());

Show All 10 Lines