This is an archive of the discontinued LLVM Phabricator instance.

[Greedy RegAlloc] Take into account the cost of local intervals when selecting split candidate.
ClosedPublic

Authored by myatsina on Dec 26 2017, 9:47 AM.

Download Raw Diff

Details

Reviewers

qcolombet
MatzeB
wmi
stoklund

Commits

rGcd5bc4a2cd52: Take into account the cost of local intervals when selecting split candidate.
rL323870: Take into account the cost of local intervals when selecting split candidate.

Summary

When selecting a split candidate for region splitting, the register allocator tries to predict which candidate will have the cheapest spill cost.
Global splitting may cause the creation of local intervals, and they might spill.

This patch makes RA take into account the spill cost of local split intervals in use blocks (we already take into account the spill cost in through blocks).
A flag ("-condsider-local-interval-cost") controls weather we do this advanced cost calculation (it's on by default for X86 target, off for the rest).

Diff Detail

Event Timeline

myatsina created this revision.Dec 26 2017, 9:47 AM

ping

Hi Marina,

Looks good to me. Couple of nitpicks below.
One question: What is the compile time impact of this new heuristic?

Cheers,
-Quentin

include/llvm/CodeGen/LiveRegMatrix.h
114	*segment
lib/CodeGen/LiveRegMatrix.cpp
211	Period at the end of comments
lib/CodeGen/RegAllocGreedy.cpp
1570	*interference
test/CodeGen/X86/bug26810.ll
28	Why is the new code sequence better?

This revision is now accepted and ready to land.Jan 10 2018, 9:32 AM

Hi Quentin,

Sorry it took me a while to get back to you.
I've measure the compile time influence on CTMark for X86 SKL with default compile flags and O1, O2, O3 optimization levels (O0 does not use Greedy RA).
The difference of this patch (and my previous patch regarding the bad evition chains) is between -0.92% and +0.8% for O1 and between -0.35% and +0.5% for O2, O3.
So compile time seems to not be seriously affected by this (or my previous) patch.

Thanks,
Marina

test/CodeGen/X86/bug26810.ll
28	If we look at the new sequence of the full loop, then it is not worse than the original one. Here the test is only matching a small part of the loop (because this test is meant to check some other sequence). Before patch: loop: MOVAPSmr ... SUBPDrr MOVAPSmr MOVAPSrm MULPDrm ADDPDrr ADD32ri8 ... jmp loop After patch: loop: ... SUBPDrr MOVAPSmr MULPDrm MOVAPSrm ADDPDrr MOVAPSrm ADD32ri8 ... jmp loop So the MOVAPSmr which was in the beginning of the loop was moved to the end of the loop.

Ah makes sense.

Thanks for the clarifications.

LGTM (again :)).

Closed by commit rL323870: Take into account the cost of local intervals when selecting split candidate. (authored by myatsina). · Explain WhyJan 31 2018, 5:32 AM

This revision was automatically updated to reflect the committed changes.

myatsina marked 3 inline comments as done.

I was looking at flags in this file and noticed there is a typo in the flag added here (condsider-local-interval-cost should be "consider-...", note the extraneous "d" in the flag). Also, there doesn't seem to be any test utilizing this flag. Should it be removed?

Herald added a project: Restricted Project. · View Herald TranscriptJun 25 2019, 9:39 AM

Revision Contents

Path

Size

include/

llvm/

CodeGen/

LiveRegMatrix.h

313 lines

lib/

CodeGen/

LiveRegMatrix.cpp

430 lines

RegAllocGreedy.cpp

6314 lines

test/

CodeGen/

X86/

bug26810.ll

3 lines

regalloc-advanced-split-cost.ll

89 lines

sad.ll

239 lines

Diff 128178

include/llvm/CodeGen/LiveRegMatrix.h

	//===- LiveRegMatrix.h - Track register interference ----------- C++ ----===//			//===- LiveRegMatrix.h - Track register interference ----------- C++ ----===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// The LiveRegMatrix analysis pass keeps track of virtual register interference			// The LiveRegMatrix analysis pass keeps track of virtual register interference
	// along two dimensions: Slot indexes and register units. The matrix is used by			// along two dimensions: Slot indexes and register units. The matrix is used by
	// register allocators to ensure that no interfering virtual registers get			// register allocators to ensure that no interfering virtual registers get
	// assigned to overlapping physical registers.			// assigned to overlapping physical registers.
	//			//
	// Register units are defined in MCRegisterInfo.h, they represent the smallest			// Register units are defined in MCRegisterInfo.h, they represent the smallest
	// unit of interference when dealing with overlapping physical registers. The			// unit of interference when dealing with overlapping physical registers. The
	// LiveRegMatrix is represented as a LiveIntervalUnion per register unit. When			// LiveRegMatrix is represented as a LiveIntervalUnion per register unit. When
	// a virtual register is assigned to a physical register, the live range for			// a virtual register is assigned to a physical register, the live range for
	// the virtual register is inserted into the LiveIntervalUnion for each regunit			// the virtual register is inserted into the LiveIntervalUnion for each regunit
	// in the physreg.			// in the physreg.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_CODEGEN_LIVEREGMATRIX_H			#ifndef LLVM_CODEGEN_LIVEREGMATRIX_H
	#define LLVM_CODEGEN_LIVEREGMATRIX_H			#define LLVM_CODEGEN_LIVEREGMATRIX_H

	#include "llvm/ADT/BitVector.h"			#include "llvm/ADT/BitVector.h"
	#include "llvm/CodeGen/LiveIntervalUnion.h"			#include "llvm/CodeGen/LiveIntervalUnion.h"
	#include "llvm/CodeGen/MachineFunctionPass.h"			#include "llvm/CodeGen/MachineFunctionPass.h"
	#include <memory>			#include <memory>

	namespace llvm {			namespace llvm {

	class AnalysisUsage;			class AnalysisUsage;
	class LiveInterval;			class LiveInterval;
	class LiveIntervals;			class LiveIntervals;
	class MachineFunction;			class MachineFunction;
	class TargetRegisterInfo;			class TargetRegisterInfo;
	class VirtRegMap;			class VirtRegMap;

	class LiveRegMatrix : public MachineFunctionPass {			class LiveRegMatrix : public MachineFunctionPass {
	const TargetRegisterInfo *TRI;			const TargetRegisterInfo *TRI;
	LiveIntervals *LIS;			LiveIntervals *LIS;
	VirtRegMap *VRM;			VirtRegMap *VRM;

	// UserTag changes whenever virtual registers have been modified.			// UserTag changes whenever virtual registers have been modified.
	unsigned UserTag = 0;			unsigned UserTag = 0;

	// The matrix is represented as a LiveIntervalUnion per register unit.			// The matrix is represented as a LiveIntervalUnion per register unit.
	LiveIntervalUnion::Allocator LIUAlloc;			LiveIntervalUnion::Allocator LIUAlloc;
	LiveIntervalUnion::Array Matrix;			LiveIntervalUnion::Array Matrix;

	// Cached queries per register unit.			// Cached queries per register unit.
	std::unique_ptr<LiveIntervalUnion::Query[]> Queries;			std::unique_ptr<LiveIntervalUnion::Query[]> Queries;

	// Cached register mask interference info.			// Cached register mask interference info.
	unsigned RegMaskTag = 0;			unsigned RegMaskTag = 0;
	unsigned RegMaskVirtReg = 0;			unsigned RegMaskVirtReg = 0;
	BitVector RegMaskUsable;			BitVector RegMaskUsable;

	// MachineFunctionPass boilerplate.			// MachineFunctionPass boilerplate.
	void getAnalysisUsage(AnalysisUsage &) const override;			void getAnalysisUsage(AnalysisUsage &) const override;
	bool runOnMachineFunction(MachineFunction &) override;			bool runOnMachineFunction(MachineFunction &) override;
	void releaseMemory() override;			void releaseMemory() override;

	public:			public:
	static char ID;			static char ID;

	LiveRegMatrix();			LiveRegMatrix();

	//===--------------------------------------------------------------------===//			//===--------------------------------------------------------------------===//
	// High-level interface.			// High-level interface.
	//===--------------------------------------------------------------------===//			//===--------------------------------------------------------------------===//
	//			//
	// Check for interference before assigning virtual registers to physical			// Check for interference before assigning virtual registers to physical
	// registers.			// registers.
	//			//

	/// Invalidate cached interference queries after modifying virtual register			/// Invalidate cached interference queries after modifying virtual register
	/// live ranges. Interference checks may return stale information unless			/// live ranges. Interference checks may return stale information unless
	/// caches are invalidated.			/// caches are invalidated.
	void invalidateVirtRegs() { ++UserTag; }			void invalidateVirtRegs() { ++UserTag; }

	enum InterferenceKind {			enum InterferenceKind {
	/// No interference, go ahead and assign.			/// No interference, go ahead and assign.
	IK_Free = 0,			IK_Free = 0,

	/// Virtual register interference. There are interfering virtual registers			/// Virtual register interference. There are interfering virtual registers
	/// assigned to PhysReg or its aliases. This interference could be resolved			/// assigned to PhysReg or its aliases. This interference could be resolved
	/// by unassigning those other virtual registers.			/// by unassigning those other virtual registers.
	IK_VirtReg,			IK_VirtReg,

	/// Register unit interference. A fixed live range is in the way, typically			/// Register unit interference. A fixed live range is in the way, typically
	/// argument registers for a call. This can't be resolved by unassigning			/// argument registers for a call. This can't be resolved by unassigning
	/// other virtual registers.			/// other virtual registers.
	IK_RegUnit,			IK_RegUnit,

	/// RegMask interference. The live range is crossing an instruction with a			/// RegMask interference. The live range is crossing an instruction with a
	/// regmask operand that doesn't preserve PhysReg. This typically means			/// regmask operand that doesn't preserve PhysReg. This typically means
	/// VirtReg is live across a call, and PhysReg isn't call-preserved.			/// VirtReg is live across a call, and PhysReg isn't call-preserved.
	IK_RegMask			IK_RegMask
	};			};

	/// Check for interference before assigning VirtReg to PhysReg.			/// Check for interference before assigning VirtReg to PhysReg.
	/// If this function returns IK_Free, it is legal to assign(VirtReg, PhysReg).			/// If this function returns IK_Free, it is legal to assign(VirtReg, PhysReg).
	/// When there is more than one kind of interference, the InterferenceKind			/// When there is more than one kind of interference, the InterferenceKind
	/// with the highest enum value is returned.			/// with the highest enum value is returned.
	InterferenceKind checkInterference(LiveInterval &VirtReg, unsigned PhysReg);			InterferenceKind checkInterference(LiveInterval &VirtReg, unsigned PhysReg);

	/// Assign VirtReg to PhysReg.			/// Check for interference in the segment [Start, End) that may prevent
	/// This will mark VirtReg's live range as occupied in the LiveRegMatrix and			/// assignment to PhysReg. If this function returns true, there is
	/// update VirtRegMap. The live range is expected to be available in PhysReg.			/// interference in the segmant [Start, End) of some other interval already
	void assign(LiveInterval &VirtReg, unsigned PhysReg);			/// assigned to PhysReg. If this function returns false, PhysReg is free at
				/// the segmant [Start, End).
				qcolombetUnsubmitted Done Reply Inline Actions segment qcolombet:* *segment
	/// Unassign VirtReg from its PhysReg.			bool checkInterference(SlotIndex Start, SlotIndex End, unsigned PhysReg);
	/// Assuming that VirtReg was previously assigned to a PhysReg, this undoes
	/// the assignment and updates VirtRegMap accordingly.			/// Assign VirtReg to PhysReg.
	void unassign(LiveInterval &VirtReg);			/// This will mark VirtReg's live range as occupied in the LiveRegMatrix and
				/// update VirtRegMap. The live range is expected to be available in PhysReg.
	/// Returns true if the given \p PhysReg has any live intervals assigned.			void assign(LiveInterval &VirtReg, unsigned PhysReg);
	bool isPhysRegUsed(unsigned PhysReg) const;
				/// Unassign VirtReg from its PhysReg.
	//===--------------------------------------------------------------------===//			/// Assuming that VirtReg was previously assigned to a PhysReg, this undoes
	// Low-level interface.			/// the assignment and updates VirtRegMap accordingly.
	//===--------------------------------------------------------------------===//			void unassign(LiveInterval &VirtReg);
	//
	// Provide access to the underlying LiveIntervalUnions.			/// Returns true if the given \p PhysReg has any live intervals assigned.
	//			bool isPhysRegUsed(unsigned PhysReg) const;

	/// Check for regmask interference only.			//===--------------------------------------------------------------------===//
	/// Return true if VirtReg crosses a regmask operand that clobbers PhysReg.			// Low-level interface.
	/// If PhysReg is null, check if VirtReg crosses any regmask operands.			//===--------------------------------------------------------------------===//
	bool checkRegMaskInterference(LiveInterval &VirtReg, unsigned PhysReg = 0);			//
				// Provide access to the underlying LiveIntervalUnions.
	/// Check for regunit interference only.			//
	/// Return true if VirtReg overlaps a fixed assignment of one of PhysRegs's
	/// register units.			/// Check for regmask interference only.
	bool checkRegUnitInterference(LiveInterval &VirtReg, unsigned PhysReg);			/// Return true if VirtReg crosses a regmask operand that clobbers PhysReg.
				/// If PhysReg is null, check if VirtReg crosses any regmask operands.
	/// Query a line of the assigned virtual register matrix directly.			bool checkRegMaskInterference(LiveInterval &VirtReg, unsigned PhysReg = 0);
	/// Use MCRegUnitIterator to enumerate all regunits in the desired PhysReg.
	/// This returns a reference to an internal Query data structure that is only			/// Check for regunit interference only.
	/// valid until the next query() call.			/// Return true if VirtReg overlaps a fixed assignment of one of PhysRegs's
	LiveIntervalUnion::Query &query(const LiveRange &LR, unsigned RegUnit);			/// register units.
				bool checkRegUnitInterference(LiveInterval &VirtReg, unsigned PhysReg);
	/// Directly access the live interval unions per regunit.
	/// This returns an array indexed by the regunit number.			/// Query a line of the assigned virtual register matrix directly.
	LiveIntervalUnion *getLiveUnions() { return &Matrix[0]; }			/// Use MCRegUnitIterator to enumerate all regunits in the desired PhysReg.
	};			/// This returns a reference to an internal Query data structure that is only
				/// valid until the next query() call.
	} // end namespace llvm			LiveIntervalUnion::Query &query(const LiveRange &LR, unsigned RegUnit);

	#endif // LLVM_CODEGEN_LIVEREGMATRIX_H			/// Directly access the live interval unions per regunit.
				/// This returns an array indexed by the regunit number.
				LiveIntervalUnion *getLiveUnions() { return &Matrix[0]; }
				};

				} // end namespace llvm

				#endif // LLVM_CODEGEN_LIVEREGMATRIX_H

lib/CodeGen/LiveRegMatrix.cpp

	//===- LiveRegMatrix.cpp - Track register interference --------------------===//			//===- LiveRegMatrix.cpp - Track register interference --------------------===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file defines the LiveRegMatrix analysis pass.			// This file defines the LiveRegMatrix analysis pass.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "llvm/CodeGen/LiveRegMatrix.h"			#include "llvm/CodeGen/LiveRegMatrix.h"
	#include "RegisterCoalescer.h"			#include "RegisterCoalescer.h"
	#include "llvm/ADT/Statistic.h"			#include "llvm/ADT/Statistic.h"
	#include "llvm/CodeGen/LiveInterval.h"			#include "llvm/CodeGen/LiveInterval.h"
	#include "llvm/CodeGen/LiveIntervalUnion.h"			#include "llvm/CodeGen/LiveIntervalUnion.h"
	#include "llvm/CodeGen/LiveIntervals.h"			#include "llvm/CodeGen/LiveIntervals.h"
	#include "llvm/CodeGen/MachineFunction.h"			#include "llvm/CodeGen/MachineFunction.h"
	#include "llvm/CodeGen/TargetRegisterInfo.h"			#include "llvm/CodeGen/TargetRegisterInfo.h"
	#include "llvm/CodeGen/TargetSubtargetInfo.h"			#include "llvm/CodeGen/TargetSubtargetInfo.h"
	#include "llvm/CodeGen/VirtRegMap.h"			#include "llvm/CodeGen/VirtRegMap.h"
	#include "llvm/MC/LaneBitmask.h"			#include "llvm/MC/LaneBitmask.h"
	#include "llvm/MC/MCRegisterInfo.h"			#include "llvm/MC/MCRegisterInfo.h"
	#include "llvm/Pass.h"			#include "llvm/Pass.h"
	#include "llvm/Support/Debug.h"			#include "llvm/Support/Debug.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"
	#include <cassert>			#include <cassert>

	using namespace llvm;			using namespace llvm;

	#define DEBUG_TYPE "regalloc"			#define DEBUG_TYPE "regalloc"

	STATISTIC(NumAssigned , "Number of registers assigned");			STATISTIC(NumAssigned , "Number of registers assigned");
	STATISTIC(NumUnassigned , "Number of registers unassigned");			STATISTIC(NumUnassigned , "Number of registers unassigned");

	char LiveRegMatrix::ID = 0;			char LiveRegMatrix::ID = 0;
	INITIALIZE_PASS_BEGIN(LiveRegMatrix, "liveregmatrix",			INITIALIZE_PASS_BEGIN(LiveRegMatrix, "liveregmatrix",
	"Live Register Matrix", false, false)			"Live Register Matrix", false, false)
	INITIALIZE_PASS_DEPENDENCY(LiveIntervals)			INITIALIZE_PASS_DEPENDENCY(LiveIntervals)
	INITIALIZE_PASS_DEPENDENCY(VirtRegMap)			INITIALIZE_PASS_DEPENDENCY(VirtRegMap)
	INITIALIZE_PASS_END(LiveRegMatrix, "liveregmatrix",			INITIALIZE_PASS_END(LiveRegMatrix, "liveregmatrix",
	"Live Register Matrix", false, false)			"Live Register Matrix", false, false)

	LiveRegMatrix::LiveRegMatrix() : MachineFunctionPass(ID) {}			LiveRegMatrix::LiveRegMatrix() : MachineFunctionPass(ID) {}

	void LiveRegMatrix::getAnalysisUsage(AnalysisUsage &AU) const {			void LiveRegMatrix::getAnalysisUsage(AnalysisUsage &AU) const {
	AU.setPreservesAll();			AU.setPreservesAll();
	AU.addRequiredTransitive<LiveIntervals>();			AU.addRequiredTransitive<LiveIntervals>();
	AU.addRequiredTransitive<VirtRegMap>();			AU.addRequiredTransitive<VirtRegMap>();
	MachineFunctionPass::getAnalysisUsage(AU);			MachineFunctionPass::getAnalysisUsage(AU);
	}			}

	bool LiveRegMatrix::runOnMachineFunction(MachineFunction &MF) {			bool LiveRegMatrix::runOnMachineFunction(MachineFunction &MF) {
	TRI = MF.getSubtarget().getRegisterInfo();			TRI = MF.getSubtarget().getRegisterInfo();
	LIS = &getAnalysis<LiveIntervals>();			LIS = &getAnalysis<LiveIntervals>();
	VRM = &getAnalysis<VirtRegMap>();			VRM = &getAnalysis<VirtRegMap>();

	unsigned NumRegUnits = TRI->getNumRegUnits();			unsigned NumRegUnits = TRI->getNumRegUnits();
	if (NumRegUnits != Matrix.size())			if (NumRegUnits != Matrix.size())
	Queries.reset(new LiveIntervalUnion::Query[NumRegUnits]);			Queries.reset(new LiveIntervalUnion::Query[NumRegUnits]);
	Matrix.init(LIUAlloc, NumRegUnits);			Matrix.init(LIUAlloc, NumRegUnits);

	// Make sure no stale queries get reused.			// Make sure no stale queries get reused.
	invalidateVirtRegs();			invalidateVirtRegs();
	return false;			return false;
	}			}

	void LiveRegMatrix::releaseMemory() {			void LiveRegMatrix::releaseMemory() {
	for (unsigned i = 0, e = Matrix.size(); i != e; ++i) {			for (unsigned i = 0, e = Matrix.size(); i != e; ++i) {
	Matrix[i].clear();			Matrix[i].clear();
	// No need to clear Queries here, since LiveIntervalUnion::Query doesn't			// No need to clear Queries here, since LiveIntervalUnion::Query doesn't
	// have anything important to clear and LiveRegMatrix's runOnFunction()			// have anything important to clear and LiveRegMatrix's runOnFunction()
	// does a std::unique_ptr::reset anyways.			// does a std::unique_ptr::reset anyways.
	}			}
	}			}

	template <typename Callable>			template <typename Callable>
	static bool foreachUnit(const TargetRegisterInfo *TRI,			static bool foreachUnit(const TargetRegisterInfo *TRI,
	LiveInterval &VRegInterval, unsigned PhysReg,			LiveInterval &VRegInterval, unsigned PhysReg,
	Callable Func) {			Callable Func) {
	if (VRegInterval.hasSubRanges()) {			if (VRegInterval.hasSubRanges()) {
	for (MCRegUnitMaskIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {			for (MCRegUnitMaskIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
	unsigned Unit = (*Units).first;			unsigned Unit = (*Units).first;
	LaneBitmask Mask = (*Units).second;			LaneBitmask Mask = (*Units).second;
	for (LiveInterval::SubRange &S : VRegInterval.subranges()) {			for (LiveInterval::SubRange &S : VRegInterval.subranges()) {
	if ((S.LaneMask & Mask).any()) {			if ((S.LaneMask & Mask).any()) {
	if (Func(Unit, S))			if (Func(Unit, S))
	return true;			return true;
	break;			break;
	}			}
	}			}
	}			}
	} else {			} else {
	for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {			for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
	if (Func(*Units, VRegInterval))			if (Func(*Units, VRegInterval))
	return true;			return true;
	}			}
	}			}
	return false;			return false;
	}			}

	void LiveRegMatrix::assign(LiveInterval &VirtReg, unsigned PhysReg) {			void LiveRegMatrix::assign(LiveInterval &VirtReg, unsigned PhysReg) {
	DEBUG(dbgs() << "assigning " << printReg(VirtReg.reg, TRI)			DEBUG(dbgs() << "assigning " << printReg(VirtReg.reg, TRI)
	<< " to " << printReg(PhysReg, TRI) << ':');			<< " to " << printReg(PhysReg, TRI) << ':');
	assert(!VRM->hasPhys(VirtReg.reg) && "Duplicate VirtReg assignment");			assert(!VRM->hasPhys(VirtReg.reg) && "Duplicate VirtReg assignment");
	VRM->assignVirt2Phys(VirtReg.reg, PhysReg);			VRM->assignVirt2Phys(VirtReg.reg, PhysReg);

	foreachUnit(TRI, VirtReg, PhysReg, [&](unsigned Unit,			foreachUnit(TRI, VirtReg, PhysReg, [&](unsigned Unit,
	const LiveRange &Range) {			const LiveRange &Range) {
	DEBUG(dbgs() << ' ' << printRegUnit(Unit, TRI) << ' ' << Range);			DEBUG(dbgs() << ' ' << printRegUnit(Unit, TRI) << ' ' << Range);
	Matrix[Unit].unify(VirtReg, Range);			Matrix[Unit].unify(VirtReg, Range);
	return false;			return false;
	});			});

	++NumAssigned;			++NumAssigned;
	DEBUG(dbgs() << '\n');			DEBUG(dbgs() << '\n');
	}			}

	void LiveRegMatrix::unassign(LiveInterval &VirtReg) {			void LiveRegMatrix::unassign(LiveInterval &VirtReg) {
	unsigned PhysReg = VRM->getPhys(VirtReg.reg);			unsigned PhysReg = VRM->getPhys(VirtReg.reg);
	DEBUG(dbgs() << "unassigning " << printReg(VirtReg.reg, TRI)			DEBUG(dbgs() << "unassigning " << printReg(VirtReg.reg, TRI)
	<< " from " << printReg(PhysReg, TRI) << ':');			<< " from " << printReg(PhysReg, TRI) << ':');
	VRM->clearVirt(VirtReg.reg);			VRM->clearVirt(VirtReg.reg);

	foreachUnit(TRI, VirtReg, PhysReg, [&](unsigned Unit,			foreachUnit(TRI, VirtReg, PhysReg, [&](unsigned Unit,
	const LiveRange &Range) {			const LiveRange &Range) {
	DEBUG(dbgs() << ' ' << printRegUnit(Unit, TRI));			DEBUG(dbgs() << ' ' << printRegUnit(Unit, TRI));
	Matrix[Unit].extract(VirtReg, Range);			Matrix[Unit].extract(VirtReg, Range);
	return false;			return false;
	});			});

	++NumUnassigned;			++NumUnassigned;
	DEBUG(dbgs() << '\n');			DEBUG(dbgs() << '\n');
	}			}

	bool LiveRegMatrix::isPhysRegUsed(unsigned PhysReg) const {			bool LiveRegMatrix::isPhysRegUsed(unsigned PhysReg) const {
	for (MCRegUnitIterator Unit(PhysReg, TRI); Unit.isValid(); ++Unit) {			for (MCRegUnitIterator Unit(PhysReg, TRI); Unit.isValid(); ++Unit) {
	if (!Matrix[*Unit].empty())			if (!Matrix[*Unit].empty())
	return true;			return true;
	}			}
	return false;			return false;
	}			}

	bool LiveRegMatrix::checkRegMaskInterference(LiveInterval &VirtReg,			bool LiveRegMatrix::checkRegMaskInterference(LiveInterval &VirtReg,
	unsigned PhysReg) {			unsigned PhysReg) {
	// Check if the cached information is valid.			// Check if the cached information is valid.
	// The same BitVector can be reused for all PhysRegs.			// The same BitVector can be reused for all PhysRegs.
	// We could cache multiple VirtRegs if it becomes necessary.			// We could cache multiple VirtRegs if it becomes necessary.
	if (RegMaskVirtReg != VirtReg.reg \|\| RegMaskTag != UserTag) {			if (RegMaskVirtReg != VirtReg.reg \|\| RegMaskTag != UserTag) {
	RegMaskVirtReg = VirtReg.reg;			RegMaskVirtReg = VirtReg.reg;
	RegMaskTag = UserTag;			RegMaskTag = UserTag;
	RegMaskUsable.clear();			RegMaskUsable.clear();
	LIS->checkRegMaskInterference(VirtReg, RegMaskUsable);			LIS->checkRegMaskInterference(VirtReg, RegMaskUsable);
	}			}

	// The BitVector is indexed by PhysReg, not register unit.			// The BitVector is indexed by PhysReg, not register unit.
	// Regmask interference is more fine grained than regunits.			// Regmask interference is more fine grained than regunits.
	// For example, a Win64 call can clobber %ymm8 yet preserve %xmm8.			// For example, a Win64 call can clobber %ymm8 yet preserve %xmm8.
	return !RegMaskUsable.empty() && (!PhysReg \|\| !RegMaskUsable.test(PhysReg));			return !RegMaskUsable.empty() && (!PhysReg \|\| !RegMaskUsable.test(PhysReg));
	}			}

	bool LiveRegMatrix::checkRegUnitInterference(LiveInterval &VirtReg,			bool LiveRegMatrix::checkRegUnitInterference(LiveInterval &VirtReg,
	unsigned PhysReg) {			unsigned PhysReg) {
	if (VirtReg.empty())			if (VirtReg.empty())
	return false;			return false;
	CoalescerPair CP(VirtReg.reg, PhysReg, *TRI);			CoalescerPair CP(VirtReg.reg, PhysReg, *TRI);

	bool Result = foreachUnit(TRI, VirtReg, PhysReg, [&](unsigned Unit,			bool Result = foreachUnit(TRI, VirtReg, PhysReg, [&](unsigned Unit,
	const LiveRange &Range) {			const LiveRange &Range) {
	const LiveRange &UnitRange = LIS->getRegUnit(Unit);			const LiveRange &UnitRange = LIS->getRegUnit(Unit);
	return Range.overlaps(UnitRange, CP, *LIS->getSlotIndexes());			return Range.overlaps(UnitRange, CP, *LIS->getSlotIndexes());
	});			});
	return Result;			return Result;
	}			}

	LiveIntervalUnion::Query &LiveRegMatrix::query(const LiveRange &LR,			LiveIntervalUnion::Query &LiveRegMatrix::query(const LiveRange &LR,
	unsigned RegUnit) {			unsigned RegUnit) {
	LiveIntervalUnion::Query &Q = Queries[RegUnit];			LiveIntervalUnion::Query &Q = Queries[RegUnit];
	Q.init(UserTag, LR, Matrix[RegUnit]);			Q.init(UserTag, LR, Matrix[RegUnit]);
	return Q;			return Q;
	}			}

	LiveRegMatrix::InterferenceKind			LiveRegMatrix::InterferenceKind
	LiveRegMatrix::checkInterference(LiveInterval &VirtReg, unsigned PhysReg) {			LiveRegMatrix::checkInterference(LiveInterval &VirtReg, unsigned PhysReg) {
	if (VirtReg.empty())			if (VirtReg.empty())
	return IK_Free;			return IK_Free;

	// Regmask interference is the fastest check.			// Regmask interference is the fastest check.
	if (checkRegMaskInterference(VirtReg, PhysReg))			if (checkRegMaskInterference(VirtReg, PhysReg))
	return IK_RegMask;			return IK_RegMask;

	// Check for fixed interference.			// Check for fixed interference.
	if (checkRegUnitInterference(VirtReg, PhysReg))			if (checkRegUnitInterference(VirtReg, PhysReg))
	return IK_RegUnit;			return IK_RegUnit;

	// Check the matrix for virtual register interference.			// Check the matrix for virtual register interference.
	bool Interference = foreachUnit(TRI, VirtReg, PhysReg,			bool Interference = foreachUnit(TRI, VirtReg, PhysReg,
	[&](unsigned Unit, const LiveRange &LR) {			[&](unsigned Unit, const LiveRange &LR) {
	return query(LR, Unit).checkInterference();			return query(LR, Unit).checkInterference();
	});			});
	if (Interference)			if (Interference)
	return IK_VirtReg;			return IK_VirtReg;

	return IK_Free;			return IK_Free;
	}			}

				bool LiveRegMatrix::checkInterference(SlotIndex Start, SlotIndex End,
				unsigned PhysReg) {
				// Construct artificial live range containing only one segment [Start, End)
				qcolombetUnsubmitted Done Reply Inline Actions Period at the end of comments qcolombet: Period at the end of comments
				VNInfo valno(0, Start);
				LiveRange::Segment Seg(Start, End, &valno);
				LiveRange LR;
				LR.addSegment(Seg);

				// Check for interference with that segment
				for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
				if (query(LR, *Units).checkInterference())
				return true;
				}
				return false;
				}

lib/CodeGen/RegAllocGreedy.cpp

	//===- RegAllocGreedy.cpp - greedy register allocator ---------------------===//			//===- RegAllocGreedy.cpp - greedy register allocator ---------------------===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file defines the RAGreedy function pass for register allocation in			// This file defines the RAGreedy function pass for register allocation in
	// optimized builds.			// optimized builds.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "AllocationOrder.h"			#include "AllocationOrder.h"
	#include "InterferenceCache.h"			#include "InterferenceCache.h"
	#include "LiveDebugVariables.h"			#include "LiveDebugVariables.h"
	#include "RegAllocBase.h"			#include "RegAllocBase.h"
	#include "SpillPlacement.h"			#include "SpillPlacement.h"
	#include "Spiller.h"			#include "Spiller.h"
	#include "SplitKit.h"			#include "SplitKit.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/BitVector.h"			#include "llvm/ADT/BitVector.h"
	#include "llvm/ADT/DenseMap.h"			#include "llvm/ADT/DenseMap.h"
	#include "llvm/ADT/IndexedMap.h"			#include "llvm/ADT/IndexedMap.h"
	#include "llvm/ADT/MapVector.h"			#include "llvm/ADT/MapVector.h"
	#include "llvm/ADT/SetVector.h"			#include "llvm/ADT/SetVector.h"
	#include "llvm/ADT/SmallPtrSet.h"			#include "llvm/ADT/SmallPtrSet.h"
	#include "llvm/ADT/SmallSet.h"			#include "llvm/ADT/SmallSet.h"
	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"
	#include "llvm/ADT/Statistic.h"			#include "llvm/ADT/Statistic.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"
	#include "llvm/Analysis/AliasAnalysis.h"			#include "llvm/Analysis/AliasAnalysis.h"
	#include "llvm/Analysis/OptimizationRemarkEmitter.h"			#include "llvm/Analysis/OptimizationRemarkEmitter.h"
	#include "llvm/CodeGen/CalcSpillWeights.h"			#include "llvm/CodeGen/CalcSpillWeights.h"
	#include "llvm/CodeGen/EdgeBundles.h"			#include "llvm/CodeGen/EdgeBundles.h"
	#include "llvm/CodeGen/LiveInterval.h"			#include "llvm/CodeGen/LiveInterval.h"
	#include "llvm/CodeGen/LiveIntervalUnion.h"			#include "llvm/CodeGen/LiveIntervalUnion.h"
	#include "llvm/CodeGen/LiveIntervals.h"			#include "llvm/CodeGen/LiveIntervals.h"
	#include "llvm/CodeGen/LiveRangeEdit.h"			#include "llvm/CodeGen/LiveRangeEdit.h"
	#include "llvm/CodeGen/LiveRegMatrix.h"			#include "llvm/CodeGen/LiveRegMatrix.h"
	#include "llvm/CodeGen/LiveStacks.h"			#include "llvm/CodeGen/LiveStacks.h"
	#include "llvm/CodeGen/MachineBasicBlock.h"			#include "llvm/CodeGen/MachineBasicBlock.h"
	#include "llvm/CodeGen/MachineBlockFrequencyInfo.h"			#include "llvm/CodeGen/MachineBlockFrequencyInfo.h"
	#include "llvm/CodeGen/MachineDominators.h"			#include "llvm/CodeGen/MachineDominators.h"
	#include "llvm/CodeGen/MachineFrameInfo.h"			#include "llvm/CodeGen/MachineFrameInfo.h"
	#include "llvm/CodeGen/MachineFunction.h"			#include "llvm/CodeGen/MachineFunction.h"
	#include "llvm/CodeGen/MachineFunctionPass.h"			#include "llvm/CodeGen/MachineFunctionPass.h"
	#include "llvm/CodeGen/MachineInstr.h"			#include "llvm/CodeGen/MachineInstr.h"
	#include "llvm/CodeGen/MachineLoopInfo.h"			#include "llvm/CodeGen/MachineLoopInfo.h"
	#include "llvm/CodeGen/MachineOperand.h"			#include "llvm/CodeGen/MachineOperand.h"
	#include "llvm/CodeGen/MachineOptimizationRemarkEmitter.h"			#include "llvm/CodeGen/MachineOptimizationRemarkEmitter.h"
	#include "llvm/CodeGen/MachineRegisterInfo.h"			#include "llvm/CodeGen/MachineRegisterInfo.h"
	#include "llvm/CodeGen/RegAllocRegistry.h"			#include "llvm/CodeGen/RegAllocRegistry.h"
	#include "llvm/CodeGen/RegisterClassInfo.h"			#include "llvm/CodeGen/RegisterClassInfo.h"
	#include "llvm/CodeGen/SlotIndexes.h"			#include "llvm/CodeGen/SlotIndexes.h"
	#include "llvm/CodeGen/TargetInstrInfo.h"			#include "llvm/CodeGen/TargetInstrInfo.h"
	#include "llvm/CodeGen/TargetRegisterInfo.h"			#include "llvm/CodeGen/TargetRegisterInfo.h"
	#include "llvm/CodeGen/TargetSubtargetInfo.h"			#include "llvm/CodeGen/TargetSubtargetInfo.h"
	#include "llvm/CodeGen/VirtRegMap.h"			#include "llvm/CodeGen/VirtRegMap.h"
	#include "llvm/IR/Function.h"			#include "llvm/IR/Function.h"
	#include "llvm/IR/LLVMContext.h"			#include "llvm/IR/LLVMContext.h"
	#include "llvm/MC/MCRegisterInfo.h"			#include "llvm/MC/MCRegisterInfo.h"
	#include "llvm/Pass.h"			#include "llvm/Pass.h"
	#include "llvm/Support/BlockFrequency.h"			#include "llvm/Support/BlockFrequency.h"
	#include "llvm/Support/BranchProbability.h"			#include "llvm/Support/BranchProbability.h"
	#include "llvm/Support/CommandLine.h"			#include "llvm/Support/CommandLine.h"
	#include "llvm/Support/Debug.h"			#include "llvm/Support/Debug.h"
	#include "llvm/Support/MathExtras.h"			#include "llvm/Support/MathExtras.h"
	#include "llvm/Support/Timer.h"			#include "llvm/Support/Timer.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"
	#include "llvm/Target/TargetMachine.h"			#include "llvm/Target/TargetMachine.h"
	#include <algorithm>			#include <algorithm>
	#include <cassert>			#include <cassert>
	#include <cstdint>			#include <cstdint>
	#include <memory>			#include <memory>
	#include <queue>			#include <queue>
	#include <tuple>			#include <tuple>
	#include <utility>			#include <utility>

	using namespace llvm;			using namespace llvm;

	#define DEBUG_TYPE "regalloc"			#define DEBUG_TYPE "regalloc"

	STATISTIC(NumGlobalSplits, "Number of split global live ranges");			STATISTIC(NumGlobalSplits, "Number of split global live ranges");
	STATISTIC(NumLocalSplits, "Number of split local live ranges");			STATISTIC(NumLocalSplits, "Number of split local live ranges");
	STATISTIC(NumEvicted, "Number of interferences evicted");			STATISTIC(NumEvicted, "Number of interferences evicted");

	static cl::opt<SplitEditor::ComplementSpillMode> SplitSpillMode(			static cl::opt<SplitEditor::ComplementSpillMode> SplitSpillMode(
	"split-spill-mode", cl::Hidden,			"split-spill-mode", cl::Hidden,
	cl::desc("Spill mode for splitting live ranges"),			cl::desc("Spill mode for splitting live ranges"),
	cl::values(clEnumValN(SplitEditor::SM_Partition, "default", "Default"),			cl::values(clEnumValN(SplitEditor::SM_Partition, "default", "Default"),
	clEnumValN(SplitEditor::SM_Size, "size", "Optimize for size"),			clEnumValN(SplitEditor::SM_Size, "size", "Optimize for size"),
	clEnumValN(SplitEditor::SM_Speed, "speed", "Optimize for speed")),			clEnumValN(SplitEditor::SM_Speed, "speed", "Optimize for speed")),
	cl::init(SplitEditor::SM_Speed));			cl::init(SplitEditor::SM_Speed));

	static cl::opt<unsigned>			static cl::opt<unsigned>
	LastChanceRecoloringMaxDepth("lcr-max-depth", cl::Hidden,			LastChanceRecoloringMaxDepth("lcr-max-depth", cl::Hidden,
	cl::desc("Last chance recoloring max depth"),			cl::desc("Last chance recoloring max depth"),
	cl::init(5));			cl::init(5));

	static cl::opt<unsigned> LastChanceRecoloringMaxInterference(			static cl::opt<unsigned> LastChanceRecoloringMaxInterference(
	"lcr-max-interf", cl::Hidden,			"lcr-max-interf", cl::Hidden,
	cl::desc("Last chance recoloring maximum number of considered"			cl::desc("Last chance recoloring maximum number of considered"
	" interference at a time"),			" interference at a time"),
	cl::init(8));			cl::init(8));

	static cl::opt<bool> ExhaustiveSearch(			static cl::opt<bool> ExhaustiveSearch(
	"exhaustive-register-search", cl::NotHidden,			"exhaustive-register-search", cl::NotHidden,
	cl::desc("Exhaustive Search for registers bypassing the depth "			cl::desc("Exhaustive Search for registers bypassing the depth "
	"and interference cutoffs of last chance recoloring"),			"and interference cutoffs of last chance recoloring"),
	cl::Hidden);			cl::Hidden);

	static cl::opt<bool> EnableLocalReassignment(			static cl::opt<bool> EnableLocalReassignment(
	"enable-local-reassign", cl::Hidden,			"enable-local-reassign", cl::Hidden,
	cl::desc("Local reassignment can yield better allocation decisions, but "			cl::desc("Local reassignment can yield better allocation decisions, but "
	"may be compile time intensive"),			"may be compile time intensive"),
	cl::init(false));			cl::init(false));

	static cl::opt<bool> EnableDeferredSpilling(			static cl::opt<bool> EnableDeferredSpilling(
	"enable-deferred-spilling", cl::Hidden,			"enable-deferred-spilling", cl::Hidden,
	cl::desc("Instead of spilling a variable right away, defer the actual "			cl::desc("Instead of spilling a variable right away, defer the actual "
	"code insertion to the end of the allocation. That way the "			"code insertion to the end of the allocation. That way the "
	"allocator might still find a suitable coloring for this "			"allocator might still find a suitable coloring for this "
	"variable because of other evicted variables."),			"variable because of other evicted variables."),
	cl::init(false));			cl::init(false));

	// FIXME: Find a good default for this flag and remove the flag.			// FIXME: Find a good default for this flag and remove the flag.
	static cl::opt<unsigned>			static cl::opt<unsigned>
	CSRFirstTimeCost("regalloc-csr-first-time-cost",			CSRFirstTimeCost("regalloc-csr-first-time-cost",
	cl::desc("Cost for first time use of callee-saved register."),			cl::desc("Cost for first time use of callee-saved register."),
	cl::init(0), cl::Hidden);			cl::init(0), cl::Hidden);

	static cl::opt<bool> ConsiderLocalIntervalCost(			static cl::opt<bool> ConsiderLocalIntervalCost(
	"condsider-local-interval-cost", cl::Hidden,			"condsider-local-interval-cost", cl::Hidden,
	cl::desc("Consider the cost of local intervals created by a split "			cl::desc("Consider the cost of local intervals created by a split "
	"candidate when choosing the best split candidate."),			"candidate when choosing the best split candidate."),
	cl::init(false));			cl::init(false));

	static RegisterRegAlloc greedyRegAlloc("greedy", "greedy register allocator",			static RegisterRegAlloc greedyRegAlloc("greedy", "greedy register allocator",
	createGreedyRegisterAllocator);			createGreedyRegisterAllocator);

	namespace {			namespace {

	class RAGreedy : public MachineFunctionPass,			class RAGreedy : public MachineFunctionPass,
	public RegAllocBase,			public RegAllocBase,
	private LiveRangeEdit::Delegate {			private LiveRangeEdit::Delegate {
	// Convenient shortcuts.			// Convenient shortcuts.
	using PQueue = std::priority_queue<std::pair<unsigned, unsigned>>;			using PQueue = std::priority_queue<std::pair<unsigned, unsigned>>;
	using SmallLISet = SmallPtrSet<LiveInterval *, 4>;			using SmallLISet = SmallPtrSet<LiveInterval *, 4>;
	using SmallVirtRegSet = SmallSet<unsigned, 16>;			using SmallVirtRegSet = SmallSet<unsigned, 16>;

	// context			// context
	MachineFunction *MF;			MachineFunction *MF;

	// Shortcuts to some useful interface.			// Shortcuts to some useful interface.
	const TargetInstrInfo *TII;			const TargetInstrInfo *TII;
	const TargetRegisterInfo *TRI;			const TargetRegisterInfo *TRI;
	RegisterClassInfo RCI;			RegisterClassInfo RCI;

	// analyses			// analyses
	SlotIndexes *Indexes;			SlotIndexes *Indexes;
	MachineBlockFrequencyInfo *MBFI;			MachineBlockFrequencyInfo *MBFI;
	MachineDominatorTree *DomTree;			MachineDominatorTree *DomTree;
	MachineLoopInfo *Loops;			MachineLoopInfo *Loops;
	MachineOptimizationRemarkEmitter *ORE;			MachineOptimizationRemarkEmitter *ORE;
	EdgeBundles *Bundles;			EdgeBundles *Bundles;
	SpillPlacement *SpillPlacer;			SpillPlacement *SpillPlacer;
	LiveDebugVariables *DebugVars;			LiveDebugVariables *DebugVars;
	AliasAnalysis *AA;			AliasAnalysis *AA;

	// state			// state
	std::unique_ptr<Spiller> SpillerInstance;			std::unique_ptr<Spiller> SpillerInstance;
	PQueue Queue;			PQueue Queue;
	unsigned NextCascade;			unsigned NextCascade;

	// Live ranges pass through a number of stages as we try to allocate them.			// Live ranges pass through a number of stages as we try to allocate them.
	// Some of the stages may also create new live ranges:			// Some of the stages may also create new live ranges:
	//			//
	// - Region splitting.			// - Region splitting.
	// - Per-block splitting.			// - Per-block splitting.
	// - Local splitting.			// - Local splitting.
	// - Spilling.			// - Spilling.
	//			//
	// Ranges produced by one of the stages skip the previous stages when they are			// Ranges produced by one of the stages skip the previous stages when they are
	// dequeued. This improves performance because we can skip interference checks			// dequeued. This improves performance because we can skip interference checks
	// that are unlikely to give any results. It also guarantees that the live			// that are unlikely to give any results. It also guarantees that the live
	// range splitting algorithm terminates, something that is otherwise hard to			// range splitting algorithm terminates, something that is otherwise hard to
	// ensure.			// ensure.
	enum LiveRangeStage {			enum LiveRangeStage {
	/// Newly created live range that has never been queued.			/// Newly created live range that has never been queued.
	RS_New,			RS_New,

	/// Only attempt assignment and eviction. Then requeue as RS_Split.			/// Only attempt assignment and eviction. Then requeue as RS_Split.
	RS_Assign,			RS_Assign,

	/// Attempt live range splitting if assignment is impossible.			/// Attempt live range splitting if assignment is impossible.
	RS_Split,			RS_Split,

	/// Attempt more aggressive live range splitting that is guaranteed to make			/// Attempt more aggressive live range splitting that is guaranteed to make
	/// progress. This is used for split products that may not be making			/// progress. This is used for split products that may not be making
	/// progress.			/// progress.
	RS_Split2,			RS_Split2,

	/// Live range will be spilled. No more splitting will be attempted.			/// Live range will be spilled. No more splitting will be attempted.
	RS_Spill,			RS_Spill,


	/// Live range is in memory. Because of other evictions, it might get moved			/// Live range is in memory. Because of other evictions, it might get moved
	/// in a register in the end.			/// in a register in the end.
	RS_Memory,			RS_Memory,

	/// There is nothing more we can do to this live range. Abort compilation			/// There is nothing more we can do to this live range. Abort compilation
	/// if it can't be assigned.			/// if it can't be assigned.
	RS_Done			RS_Done
	};			};

	// Enum CutOffStage to keep a track whether the register allocation failed			// Enum CutOffStage to keep a track whether the register allocation failed
	// because of the cutoffs encountered in last chance recoloring.			// because of the cutoffs encountered in last chance recoloring.
	// Note: This is used as bitmask. New value should be next power of 2.			// Note: This is used as bitmask. New value should be next power of 2.
	enum CutOffStage {			enum CutOffStage {
	// No cutoffs encountered			// No cutoffs encountered
	CO_None = 0,			CO_None = 0,

	// lcr-max-depth cutoff encountered			// lcr-max-depth cutoff encountered
	CO_Depth = 1,			CO_Depth = 1,

	// lcr-max-interf cutoff encountered			// lcr-max-interf cutoff encountered
	CO_Interf = 2			CO_Interf = 2
	};			};

	uint8_t CutOffInfo;			uint8_t CutOffInfo;

	#ifndef NDEBUG			#ifndef NDEBUG
	static const char *const StageName[];			static const char *const StageName[];
	#endif			#endif

	// RegInfo - Keep additional information about each live range.			// RegInfo - Keep additional information about each live range.
	struct RegInfo {			struct RegInfo {
	LiveRangeStage Stage = RS_New;			LiveRangeStage Stage = RS_New;

	// Cascade - Eviction loop prevention. See canEvictInterference().			// Cascade - Eviction loop prevention. See canEvictInterference().
	unsigned Cascade = 0;			unsigned Cascade = 0;

	RegInfo() = default;			RegInfo() = default;
	};			};

	IndexedMap<RegInfo, VirtReg2IndexFunctor> ExtraRegInfo;			IndexedMap<RegInfo, VirtReg2IndexFunctor> ExtraRegInfo;

	LiveRangeStage getStage(const LiveInterval &VirtReg) const {			LiveRangeStage getStage(const LiveInterval &VirtReg) const {
	return ExtraRegInfo[VirtReg.reg].Stage;			return ExtraRegInfo[VirtReg.reg].Stage;
	}			}

	void setStage(const LiveInterval &VirtReg, LiveRangeStage Stage) {			void setStage(const LiveInterval &VirtReg, LiveRangeStage Stage) {
	ExtraRegInfo.resize(MRI->getNumVirtRegs());			ExtraRegInfo.resize(MRI->getNumVirtRegs());
	ExtraRegInfo[VirtReg.reg].Stage = Stage;			ExtraRegInfo[VirtReg.reg].Stage = Stage;
	}			}

	template<typename Iterator>			template<typename Iterator>
	void setStage(Iterator Begin, Iterator End, LiveRangeStage NewStage) {			void setStage(Iterator Begin, Iterator End, LiveRangeStage NewStage) {
	ExtraRegInfo.resize(MRI->getNumVirtRegs());			ExtraRegInfo.resize(MRI->getNumVirtRegs());
	for (;Begin != End; ++Begin) {			for (;Begin != End; ++Begin) {
	unsigned Reg = *Begin;			unsigned Reg = *Begin;
	if (ExtraRegInfo[Reg].Stage == RS_New)			if (ExtraRegInfo[Reg].Stage == RS_New)
	ExtraRegInfo[Reg].Stage = NewStage;			ExtraRegInfo[Reg].Stage = NewStage;
	}			}
	}			}

	/// Cost of evicting interference.			/// Cost of evicting interference.
	struct EvictionCost {			struct EvictionCost {
	unsigned BrokenHints = 0; ///< Total number of broken hints.			unsigned BrokenHints = 0; ///< Total number of broken hints.
	float MaxWeight = 0; ///< Maximum spill weight evicted.			float MaxWeight = 0; ///< Maximum spill weight evicted.

	EvictionCost() = default;			EvictionCost() = default;

	bool isMax() const { return BrokenHints == ~0u; }			bool isMax() const { return BrokenHints == ~0u; }

	void setMax() { BrokenHints = ~0u; }			void setMax() { BrokenHints = ~0u; }

	void setBrokenHints(unsigned NHints) { BrokenHints = NHints; }			void setBrokenHints(unsigned NHints) { BrokenHints = NHints; }

	bool operator<(const EvictionCost &O) const {			bool operator<(const EvictionCost &O) const {
	return std::tie(BrokenHints, MaxWeight) <			return std::tie(BrokenHints, MaxWeight) <
	std::tie(O.BrokenHints, O.MaxWeight);			std::tie(O.BrokenHints, O.MaxWeight);
	}			}
	};			};

	/// EvictionTrack - Keeps track of past evictions in order to optimize region			/// EvictionTrack - Keeps track of past evictions in order to optimize region
	/// split decision.			/// split decision.
	class EvictionTrack {			class EvictionTrack {

	public:			public:
	using EvictorInfo =			using EvictorInfo =
	std::pair<unsigned /* evictor /, unsigned / physreg */>;			std::pair<unsigned /* evictor /, unsigned / physreg */>;
	using EvicteeInfo = llvm::MapVector<unsigned /* evictee */, EvictorInfo>;			using EvicteeInfo = llvm::MapVector<unsigned /* evictee */, EvictorInfo>;

	private:			private:
	/// Each Vreg that has been evicted in the last stage of selectOrSplit will			/// Each Vreg that has been evicted in the last stage of selectOrSplit will
	/// be mapped to the evictor Vreg and the PhysReg it was evicted from.			/// be mapped to the evictor Vreg and the PhysReg it was evicted from.
	EvicteeInfo Evictees;			EvicteeInfo Evictees;

	public:			public:
	/// \brief Clear all eviction information.			/// \brief Clear all eviction information.
	void clear() { Evictees.clear(); }			void clear() { Evictees.clear(); }

	/// \brief Clear eviction information for the given evictee Vreg.			/// \brief Clear eviction information for the given evictee Vreg.
	/// E.g. when Vreg get's a new allocation, the old eviction info is no			/// E.g. when Vreg get's a new allocation, the old eviction info is no
	/// longer relevant.			/// longer relevant.
	/// \param Evictee The evictee Vreg for whom we want to clear collected			/// \param Evictee The evictee Vreg for whom we want to clear collected
	/// eviction info.			/// eviction info.
	void clearEvicteeInfo(unsigned Evictee) { Evictees.erase(Evictee); }			void clearEvicteeInfo(unsigned Evictee) { Evictees.erase(Evictee); }

	/// \brief Track new eviction.			/// \brief Track new eviction.
	/// The Evictor vreg has evicted the Evictee vreg from Physreg.			/// The Evictor vreg has evicted the Evictee vreg from Physreg.
	/// \praram PhysReg The phisical register Evictee was evicted from.			/// \praram PhysReg The phisical register Evictee was evicted from.
	/// \praram Evictor The evictor Vreg that evicted Evictee.			/// \praram Evictor The evictor Vreg that evicted Evictee.
	/// \praram Evictee The evictee Vreg.			/// \praram Evictee The evictee Vreg.
	void addEviction(unsigned PhysReg, unsigned Evictor, unsigned Evictee) {			void addEviction(unsigned PhysReg, unsigned Evictor, unsigned Evictee) {
	Evictees[Evictee].first = Evictor;			Evictees[Evictee].first = Evictor;
	Evictees[Evictee].second = PhysReg;			Evictees[Evictee].second = PhysReg;
	}			}

	/// Return the Evictor Vreg which evicted Evictee Vreg from PhysReg.			/// Return the Evictor Vreg which evicted Evictee Vreg from PhysReg.
	/// \praram Evictee The evictee vreg.			/// \praram Evictee The evictee vreg.
	/// \return The Evictor vreg which evicted Evictee vreg from PhysReg. 0 if			/// \return The Evictor vreg which evicted Evictee vreg from PhysReg. 0 if
	/// nobody has evicted Evictee from PhysReg.			/// nobody has evicted Evictee from PhysReg.
	EvictorInfo getEvictor(unsigned Evictee) {			EvictorInfo getEvictor(unsigned Evictee) {
	if (Evictees.count(Evictee)) {			if (Evictees.count(Evictee)) {
	return Evictees[Evictee];			return Evictees[Evictee];
	}			}

	return EvictorInfo(0, 0);			return EvictorInfo(0, 0);
	}			}
	};			};

	// Keeps track of past evictions in order to optimize region split decision.			// Keeps track of past evictions in order to optimize region split decision.
	EvictionTrack LastEvicted;			EvictionTrack LastEvicted;

	// splitting state.			// splitting state.
	std::unique_ptr<SplitAnalysis> SA;			std::unique_ptr<SplitAnalysis> SA;
	std::unique_ptr<SplitEditor> SE;			std::unique_ptr<SplitEditor> SE;

	/// Cached per-block interference maps			/// Cached per-block interference maps
	InterferenceCache IntfCache;			InterferenceCache IntfCache;

	/// All basic blocks where the current register has uses.			/// All basic blocks where the current register has uses.
	SmallVector<SpillPlacement::BlockConstraint, 8> SplitConstraints;			SmallVector<SpillPlacement::BlockConstraint, 8> SplitConstraints;

	/// Global live range splitting candidate info.			/// Global live range splitting candidate info.
	struct GlobalSplitCandidate {			struct GlobalSplitCandidate {
	// Register intended for assignment, or 0.			// Register intended for assignment, or 0.
	unsigned PhysReg;			unsigned PhysReg;

	// SplitKit interval index for this candidate.			// SplitKit interval index for this candidate.
	unsigned IntvIdx;			unsigned IntvIdx;

	// Interference for PhysReg.			// Interference for PhysReg.
	InterferenceCache::Cursor Intf;			InterferenceCache::Cursor Intf;

	// Bundles where this candidate should be live.			// Bundles where this candidate should be live.
	BitVector LiveBundles;			BitVector LiveBundles;
	SmallVector<unsigned, 8> ActiveBlocks;			SmallVector<unsigned, 8> ActiveBlocks;

	void reset(InterferenceCache &Cache, unsigned Reg) {			void reset(InterferenceCache &Cache, unsigned Reg) {
	PhysReg = Reg;			PhysReg = Reg;
	IntvIdx = 0;			IntvIdx = 0;
	Intf.setPhysReg(Cache, Reg);			Intf.setPhysReg(Cache, Reg);
	LiveBundles.clear();			LiveBundles.clear();
	ActiveBlocks.clear();			ActiveBlocks.clear();
	}			}

	// Set B[i] = C for every live bundle where B[i] was NoCand.			// Set B[i] = C for every live bundle where B[i] was NoCand.
	unsigned getBundles(SmallVectorImpl<unsigned> &B, unsigned C) {			unsigned getBundles(SmallVectorImpl<unsigned> &B, unsigned C) {
	unsigned Count = 0;			unsigned Count = 0;
	for (unsigned i : LiveBundles.set_bits())			for (unsigned i : LiveBundles.set_bits())
	if (B[i] == NoCand) {			if (B[i] == NoCand) {
	B[i] = C;			B[i] = C;
	Count++;			Count++;
	}			}
	return Count;			return Count;
	}			}
	};			};

	/// Candidate info for each PhysReg in AllocationOrder.			/// Candidate info for each PhysReg in AllocationOrder.
	/// This vector never shrinks, but grows to the size of the largest register			/// This vector never shrinks, but grows to the size of the largest register
	/// class.			/// class.
	SmallVector<GlobalSplitCandidate, 32> GlobalCand;			SmallVector<GlobalSplitCandidate, 32> GlobalCand;

	enum : unsigned { NoCand = ~0u };			enum : unsigned { NoCand = ~0u };

	/// Candidate map. Each edge bundle is assigned to a GlobalCand entry, or to			/// Candidate map. Each edge bundle is assigned to a GlobalCand entry, or to
	/// NoCand which indicates the stack interval.			/// NoCand which indicates the stack interval.
	SmallVector<unsigned, 32> BundleCand;			SmallVector<unsigned, 32> BundleCand;

	/// Callee-save register cost, calculated once per machine function.			/// Callee-save register cost, calculated once per machine function.
	BlockFrequency CSRCost;			BlockFrequency CSRCost;

	/// Run or not the local reassignment heuristic. This information is			/// Run or not the local reassignment heuristic. This information is
	/// obtained from the TargetSubtargetInfo.			/// obtained from the TargetSubtargetInfo.
	bool EnableLocalReassign;			bool EnableLocalReassign;

	/// Enable or not the the consideration of the cost of local intervals created			/// Enable or not the the consideration of the cost of local intervals created
	/// by a split candidate when choosing the best split candidate.			/// by a split candidate when choosing the best split candidate.
	bool EnableAdvancedRASplitCost;			bool EnableAdvancedRASplitCost;

	/// Set of broken hints that may be reconciled later because of eviction.			/// Set of broken hints that may be reconciled later because of eviction.
	SmallSetVector<LiveInterval *, 8> SetOfBrokenHints;			SmallSetVector<LiveInterval *, 8> SetOfBrokenHints;

	public:			public:
	RAGreedy();			RAGreedy();

	/// Return the pass name.			/// Return the pass name.
	StringRef getPassName() const override { return "Greedy Register Allocator"; }			StringRef getPassName() const override { return "Greedy Register Allocator"; }

	/// RAGreedy analysis usage.			/// RAGreedy analysis usage.
	void getAnalysisUsage(AnalysisUsage &AU) const override;			void getAnalysisUsage(AnalysisUsage &AU) const override;
	void releaseMemory() override;			void releaseMemory() override;
	Spiller &spiller() override { return *SpillerInstance; }			Spiller &spiller() override { return *SpillerInstance; }
	void enqueue(LiveInterval *LI) override;			void enqueue(LiveInterval *LI) override;
	LiveInterval *dequeue() override;			LiveInterval *dequeue() override;
	unsigned selectOrSplit(LiveInterval&, SmallVectorImpl<unsigned>&) override;			unsigned selectOrSplit(LiveInterval&, SmallVectorImpl<unsigned>&) override;
	void aboutToRemoveInterval(LiveInterval &) override;			void aboutToRemoveInterval(LiveInterval &) override;

	/// Perform register allocation.			/// Perform register allocation.
	bool runOnMachineFunction(MachineFunction &mf) override;			bool runOnMachineFunction(MachineFunction &mf) override;

	MachineFunctionProperties getRequiredProperties() const override {			MachineFunctionProperties getRequiredProperties() const override {
	return MachineFunctionProperties().set(			return MachineFunctionProperties().set(
	MachineFunctionProperties::Property::NoPHIs);			MachineFunctionProperties::Property::NoPHIs);
	}			}

	static char ID;			static char ID;

	private:			private:
	unsigned selectOrSplitImpl(LiveInterval &, SmallVectorImpl<unsigned> &,			unsigned selectOrSplitImpl(LiveInterval &, SmallVectorImpl<unsigned> &,
	SmallVirtRegSet &, unsigned = 0);			SmallVirtRegSet &, unsigned = 0);

	bool LRE_CanEraseVirtReg(unsigned) override;			bool LRE_CanEraseVirtReg(unsigned) override;
	void LRE_WillShrinkVirtReg(unsigned) override;			void LRE_WillShrinkVirtReg(unsigned) override;
	void LRE_DidCloneVirtReg(unsigned, unsigned) override;			void LRE_DidCloneVirtReg(unsigned, unsigned) override;
	void enqueue(PQueue &CurQueue, LiveInterval *LI);			void enqueue(PQueue &CurQueue, LiveInterval *LI);
	LiveInterval *dequeue(PQueue &CurQueue);			LiveInterval *dequeue(PQueue &CurQueue);

	BlockFrequency calcSpillCost();			BlockFrequency calcSpillCost();
	bool addSplitConstraints(InterferenceCache::Cursor, BlockFrequency&);			bool addSplitConstraints(InterferenceCache::Cursor, BlockFrequency&);
	void addThroughConstraints(InterferenceCache::Cursor, ArrayRef<unsigned>);			void addThroughConstraints(InterferenceCache::Cursor, ArrayRef<unsigned>);
	void growRegion(GlobalSplitCandidate &Cand);			void growRegion(GlobalSplitCandidate &Cand);
	bool splitCanCauseEvictionChain(unsigned Evictee, GlobalSplitCandidate &Cand,			bool splitCanCauseEvictionChain(unsigned Evictee, GlobalSplitCandidate &Cand,
	unsigned BBNumber,			unsigned BBNumber,
	const AllocationOrder &Order);			const AllocationOrder &Order);
	BlockFrequency calcGlobalSplitCost(GlobalSplitCandidate &,			bool splitCanCauseLocalSpill(unsigned VirtRegToSplit,
	const AllocationOrder &Order,			GlobalSplitCandidate &Cand, unsigned BBNumber,
	bool *CanCauseEvictionChain);			const AllocationOrder &Order);
	bool calcCompactRegion(GlobalSplitCandidate&);			BlockFrequency calcGlobalSplitCost(GlobalSplitCandidate &,
	void splitAroundRegion(LiveRangeEdit&, ArrayRef<unsigned>);			const AllocationOrder &Order,
	void calcGapWeights(unsigned, SmallVectorImpl<float>&);			bool *CanCauseEvictionChain);
	unsigned canReassign(LiveInterval &VirtReg, unsigned PhysReg);			bool calcCompactRegion(GlobalSplitCandidate&);
	bool shouldEvict(LiveInterval &A, bool, LiveInterval &B, bool);			void splitAroundRegion(LiveRangeEdit&, ArrayRef<unsigned>);
	bool canEvictInterference(LiveInterval&, unsigned, bool, EvictionCost&);			void calcGapWeights(unsigned, SmallVectorImpl<float>&);
	bool canEvictInterferenceInRange(LiveInterval &VirtReg, unsigned PhysReg,			unsigned canReassign(LiveInterval &VirtReg, unsigned PhysReg);
	SlotIndex Start, SlotIndex End,			bool shouldEvict(LiveInterval &A, bool, LiveInterval &B, bool);
	EvictionCost &MaxCost);			bool canEvictInterference(LiveInterval&, unsigned, bool, EvictionCost&);
	unsigned getCheapestEvicteeWeight(const AllocationOrder &Order,			bool canEvictInterferenceInRange(LiveInterval &VirtReg, unsigned PhysReg,
	LiveInterval &VirtReg, SlotIndex Start,			SlotIndex Start, SlotIndex End,
	SlotIndex End, float *BestEvictWeight);			EvictionCost &MaxCost);
	void evictInterference(LiveInterval&, unsigned,			unsigned getCheapestEvicteeWeight(const AllocationOrder &Order,
	SmallVectorImpl<unsigned>&);			LiveInterval &VirtReg, SlotIndex Start,
	bool mayRecolorAllInterferences(unsigned PhysReg, LiveInterval &VirtReg,			SlotIndex End, float *BestEvictWeight);
	SmallLISet &RecoloringCandidates,			void evictInterference(LiveInterval&, unsigned,
	const SmallVirtRegSet &FixedRegisters);			SmallVectorImpl<unsigned>&);
				bool mayRecolorAllInterferences(unsigned PhysReg, LiveInterval &VirtReg,
	unsigned tryAssign(LiveInterval&, AllocationOrder&,			SmallLISet &RecoloringCandidates,
	SmallVectorImpl<unsigned>&);			const SmallVirtRegSet &FixedRegisters);
	unsigned tryEvict(LiveInterval&, AllocationOrder&,
	SmallVectorImpl<unsigned>&, unsigned = ~0u);			unsigned tryAssign(LiveInterval&, AllocationOrder&,
	unsigned tryRegionSplit(LiveInterval&, AllocationOrder&,			SmallVectorImpl<unsigned>&);
	SmallVectorImpl<unsigned>&);			unsigned tryEvict(LiveInterval&, AllocationOrder&,
	/// Calculate cost of region splitting.			SmallVectorImpl<unsigned>&, unsigned = ~0u);
	unsigned calculateRegionSplitCost(LiveInterval &VirtReg,			unsigned tryRegionSplit(LiveInterval&, AllocationOrder&,
	AllocationOrder &Order,			SmallVectorImpl<unsigned>&);
	BlockFrequency &BestCost,			/// Calculate cost of region splitting.
	unsigned &NumCands, bool IgnoreCSR,			unsigned calculateRegionSplitCost(LiveInterval &VirtReg,
	bool *CanCauseEvictionChain = nullptr);			AllocationOrder &Order,
	/// Perform region splitting.			BlockFrequency &BestCost,
	unsigned doRegionSplit(LiveInterval &VirtReg, unsigned BestCand,			unsigned &NumCands, bool IgnoreCSR,
	bool HasCompact,			bool *CanCauseEvictionChain = nullptr);
	SmallVectorImpl<unsigned> &NewVRegs);			/// Perform region splitting.
	/// Check other options before using a callee-saved register for the first			unsigned doRegionSplit(LiveInterval &VirtReg, unsigned BestCand,
	/// time.			bool HasCompact,
	unsigned tryAssignCSRFirstTime(LiveInterval &VirtReg, AllocationOrder &Order,			SmallVectorImpl<unsigned> &NewVRegs);
	unsigned PhysReg, unsigned &CostPerUseLimit,			/// Check other options before using a callee-saved register for the first
	SmallVectorImpl<unsigned> &NewVRegs);			/// time.
	void initializeCSRCost();			unsigned tryAssignCSRFirstTime(LiveInterval &VirtReg, AllocationOrder &Order,
	unsigned tryBlockSplit(LiveInterval&, AllocationOrder&,			unsigned PhysReg, unsigned &CostPerUseLimit,
	SmallVectorImpl<unsigned>&);			SmallVectorImpl<unsigned> &NewVRegs);
	unsigned tryInstructionSplit(LiveInterval&, AllocationOrder&,			void initializeCSRCost();
	SmallVectorImpl<unsigned>&);			unsigned tryBlockSplit(LiveInterval&, AllocationOrder&,
	unsigned tryLocalSplit(LiveInterval&, AllocationOrder&,			SmallVectorImpl<unsigned>&);
	SmallVectorImpl<unsigned>&);			unsigned tryInstructionSplit(LiveInterval&, AllocationOrder&,
	unsigned trySplit(LiveInterval&, AllocationOrder&,			SmallVectorImpl<unsigned>&);
	SmallVectorImpl<unsigned>&);			unsigned tryLocalSplit(LiveInterval&, AllocationOrder&,
	unsigned tryLastChanceRecoloring(LiveInterval &, AllocationOrder &,			SmallVectorImpl<unsigned>&);
	SmallVectorImpl<unsigned> &,			unsigned trySplit(LiveInterval&, AllocationOrder&,
	SmallVirtRegSet &, unsigned);			SmallVectorImpl<unsigned>&);
	bool tryRecoloringCandidates(PQueue &, SmallVectorImpl<unsigned> &,			unsigned tryLastChanceRecoloring(LiveInterval &, AllocationOrder &,
	SmallVirtRegSet &, unsigned);			SmallVectorImpl<unsigned> &,
	void tryHintRecoloring(LiveInterval &);			SmallVirtRegSet &, unsigned);
	void tryHintsRecoloring();			bool tryRecoloringCandidates(PQueue &, SmallVectorImpl<unsigned> &,
				SmallVirtRegSet &, unsigned);
	/// Model the information carried by one end of a copy.			void tryHintRecoloring(LiveInterval &);
	struct HintInfo {			void tryHintsRecoloring();
	/// The frequency of the copy.
	BlockFrequency Freq;			/// Model the information carried by one end of a copy.
	/// The virtual register or physical register.			struct HintInfo {
	unsigned Reg;			/// The frequency of the copy.
	/// Its currently assigned register.			BlockFrequency Freq;
	/// In case of a physical register Reg == PhysReg.			/// The virtual register or physical register.
	unsigned PhysReg;			unsigned Reg;
				/// Its currently assigned register.
	HintInfo(BlockFrequency Freq, unsigned Reg, unsigned PhysReg)			/// In case of a physical register Reg == PhysReg.
	: Freq(Freq), Reg(Reg), PhysReg(PhysReg) {}			unsigned PhysReg;
	};
	using HintsInfo = SmallVector<HintInfo, 4>;			HintInfo(BlockFrequency Freq, unsigned Reg, unsigned PhysReg)
				: Freq(Freq), Reg(Reg), PhysReg(PhysReg) {}
	BlockFrequency getBrokenHintFreq(const HintsInfo &, unsigned);			};
	void collectHintInfo(unsigned, HintsInfo &);			using HintsInfo = SmallVector<HintInfo, 4>;

	bool isUnusedCalleeSavedReg(unsigned PhysReg) const;			BlockFrequency getBrokenHintFreq(const HintsInfo &, unsigned);
				void collectHintInfo(unsigned, HintsInfo &);
	/// Compute and report the number of spills and reloads for a loop.
	void reportNumberOfSplillsReloads(MachineLoop *L, unsigned &Reloads,			bool isUnusedCalleeSavedReg(unsigned PhysReg) const;
	unsigned &FoldedReloads, unsigned &Spills,
	unsigned &FoldedSpills);			/// Compute and report the number of spills and reloads for a loop.
				void reportNumberOfSplillsReloads(MachineLoop *L, unsigned &Reloads,
	/// Report the number of spills and reloads for each loop.			unsigned &FoldedReloads, unsigned &Spills,
	void reportNumberOfSplillsReloads() {			unsigned &FoldedSpills);
	for (MachineLoop L : Loops) {
	unsigned Reloads, FoldedReloads, Spills, FoldedSpills;			/// Report the number of spills and reloads for each loop.
	reportNumberOfSplillsReloads(L, Reloads, FoldedReloads, Spills,			void reportNumberOfSplillsReloads() {
	FoldedSpills);			for (MachineLoop L : Loops) {
	}			unsigned Reloads, FoldedReloads, Spills, FoldedSpills;
	}			reportNumberOfSplillsReloads(L, Reloads, FoldedReloads, Spills,
	};			FoldedSpills);
				}
	} // end anonymous namespace			}
				};
	char RAGreedy::ID = 0;
	char &llvm::RAGreedyID = RAGreedy::ID;			} // end anonymous namespace

	INITIALIZE_PASS_BEGIN(RAGreedy, "greedy",			char RAGreedy::ID = 0;
	"Greedy Register Allocator", false, false)			char &llvm::RAGreedyID = RAGreedy::ID;
	INITIALIZE_PASS_DEPENDENCY(LiveDebugVariables)
	INITIALIZE_PASS_DEPENDENCY(SlotIndexes)			INITIALIZE_PASS_BEGIN(RAGreedy, "greedy",
	INITIALIZE_PASS_DEPENDENCY(LiveIntervals)			"Greedy Register Allocator", false, false)
	INITIALIZE_PASS_DEPENDENCY(RegisterCoalescer)			INITIALIZE_PASS_DEPENDENCY(LiveDebugVariables)
	INITIALIZE_PASS_DEPENDENCY(MachineScheduler)			INITIALIZE_PASS_DEPENDENCY(SlotIndexes)
	INITIALIZE_PASS_DEPENDENCY(LiveStacks)			INITIALIZE_PASS_DEPENDENCY(LiveIntervals)
	INITIALIZE_PASS_DEPENDENCY(MachineDominatorTree)			INITIALIZE_PASS_DEPENDENCY(RegisterCoalescer)
	INITIALIZE_PASS_DEPENDENCY(MachineLoopInfo)			INITIALIZE_PASS_DEPENDENCY(MachineScheduler)
	INITIALIZE_PASS_DEPENDENCY(VirtRegMap)			INITIALIZE_PASS_DEPENDENCY(LiveStacks)
	INITIALIZE_PASS_DEPENDENCY(LiveRegMatrix)			INITIALIZE_PASS_DEPENDENCY(MachineDominatorTree)
	INITIALIZE_PASS_DEPENDENCY(EdgeBundles)			INITIALIZE_PASS_DEPENDENCY(MachineLoopInfo)
	INITIALIZE_PASS_DEPENDENCY(SpillPlacement)			INITIALIZE_PASS_DEPENDENCY(VirtRegMap)
	INITIALIZE_PASS_DEPENDENCY(MachineOptimizationRemarkEmitterPass)			INITIALIZE_PASS_DEPENDENCY(LiveRegMatrix)
	INITIALIZE_PASS_END(RAGreedy, "greedy",			INITIALIZE_PASS_DEPENDENCY(EdgeBundles)
	"Greedy Register Allocator", false, false)			INITIALIZE_PASS_DEPENDENCY(SpillPlacement)
				INITIALIZE_PASS_DEPENDENCY(MachineOptimizationRemarkEmitterPass)
	#ifndef NDEBUG			INITIALIZE_PASS_END(RAGreedy, "greedy",
	const char *const RAGreedy::StageName[] = {			"Greedy Register Allocator", false, false)
	"RS_New",
	"RS_Assign",			#ifndef NDEBUG
	"RS_Split",			const char *const RAGreedy::StageName[] = {
	"RS_Split2",			"RS_New",
	"RS_Spill",			"RS_Assign",
	"RS_Memory",			"RS_Split",
	"RS_Done"			"RS_Split2",
	};			"RS_Spill",
	#endif			"RS_Memory",
				"RS_Done"
	// Hysteresis to use when comparing floats.			};
	// This helps stabilize decisions based on float comparisons.			#endif
	const float Hysteresis = (2007 / 2048.0f); // 0.97998046875
				// Hysteresis to use when comparing floats.
	FunctionPass* llvm::createGreedyRegisterAllocator() {			// This helps stabilize decisions based on float comparisons.
	return new RAGreedy();			const float Hysteresis = (2007 / 2048.0f); // 0.97998046875
	}
				FunctionPass* llvm::createGreedyRegisterAllocator() {
	RAGreedy::RAGreedy(): MachineFunctionPass(ID) {			return new RAGreedy();
	}			}

	void RAGreedy::getAnalysisUsage(AnalysisUsage &AU) const {			RAGreedy::RAGreedy(): MachineFunctionPass(ID) {
	AU.setPreservesCFG();			}
	AU.addRequired<MachineBlockFrequencyInfo>();
	AU.addPreserved<MachineBlockFrequencyInfo>();			void RAGreedy::getAnalysisUsage(AnalysisUsage &AU) const {
	AU.addRequired<AAResultsWrapperPass>();			AU.setPreservesCFG();
	AU.addPreserved<AAResultsWrapperPass>();			AU.addRequired<MachineBlockFrequencyInfo>();
	AU.addRequired<LiveIntervals>();			AU.addPreserved<MachineBlockFrequencyInfo>();
	AU.addPreserved<LiveIntervals>();			AU.addRequired<AAResultsWrapperPass>();
	AU.addRequired<SlotIndexes>();			AU.addPreserved<AAResultsWrapperPass>();
	AU.addPreserved<SlotIndexes>();			AU.addRequired<LiveIntervals>();
	AU.addRequired<LiveDebugVariables>();			AU.addPreserved<LiveIntervals>();
	AU.addPreserved<LiveDebugVariables>();			AU.addRequired<SlotIndexes>();
	AU.addRequired<LiveStacks>();			AU.addPreserved<SlotIndexes>();
	AU.addPreserved<LiveStacks>();			AU.addRequired<LiveDebugVariables>();
	AU.addRequired<MachineDominatorTree>();			AU.addPreserved<LiveDebugVariables>();
	AU.addPreserved<MachineDominatorTree>();			AU.addRequired<LiveStacks>();
	AU.addRequired<MachineLoopInfo>();			AU.addPreserved<LiveStacks>();
	AU.addPreserved<MachineLoopInfo>();			AU.addRequired<MachineDominatorTree>();
	AU.addRequired<VirtRegMap>();			AU.addPreserved<MachineDominatorTree>();
	AU.addPreserved<VirtRegMap>();			AU.addRequired<MachineLoopInfo>();
	AU.addRequired<LiveRegMatrix>();			AU.addPreserved<MachineLoopInfo>();
	AU.addPreserved<LiveRegMatrix>();			AU.addRequired<VirtRegMap>();
	AU.addRequired<EdgeBundles>();			AU.addPreserved<VirtRegMap>();
	AU.addRequired<SpillPlacement>();			AU.addRequired<LiveRegMatrix>();
	AU.addRequired<MachineOptimizationRemarkEmitterPass>();			AU.addPreserved<LiveRegMatrix>();
	MachineFunctionPass::getAnalysisUsage(AU);			AU.addRequired<EdgeBundles>();
	}			AU.addRequired<SpillPlacement>();
				AU.addRequired<MachineOptimizationRemarkEmitterPass>();
	//===----------------------------------------------------------------------===//			MachineFunctionPass::getAnalysisUsage(AU);
	// LiveRangeEdit delegate methods			}
	//===----------------------------------------------------------------------===//
				//===----------------------------------------------------------------------===//
	bool RAGreedy::LRE_CanEraseVirtReg(unsigned VirtReg) {			// LiveRangeEdit delegate methods
	LiveInterval &LI = LIS->getInterval(VirtReg);			//===----------------------------------------------------------------------===//
	if (VRM->hasPhys(VirtReg)) {
	Matrix->unassign(LI);			bool RAGreedy::LRE_CanEraseVirtReg(unsigned VirtReg) {
	aboutToRemoveInterval(LI);			LiveInterval &LI = LIS->getInterval(VirtReg);
	return true;			if (VRM->hasPhys(VirtReg)) {
	}			Matrix->unassign(LI);
	// Unassigned virtreg is probably in the priority queue.			aboutToRemoveInterval(LI);
	// RegAllocBase will erase it after dequeueing.			return true;
	// Nonetheless, clear the live-range so that the debug			}
	// dump will show the right state for that VirtReg.			// Unassigned virtreg is probably in the priority queue.
	LI.clear();			// RegAllocBase will erase it after dequeueing.
	return false;			// Nonetheless, clear the live-range so that the debug
	}			// dump will show the right state for that VirtReg.
				LI.clear();
	void RAGreedy::LRE_WillShrinkVirtReg(unsigned VirtReg) {			return false;
	if (!VRM->hasPhys(VirtReg))			}
	return;
				void RAGreedy::LRE_WillShrinkVirtReg(unsigned VirtReg) {
	// Register is assigned, put it back on the queue for reassignment.			if (!VRM->hasPhys(VirtReg))
	LiveInterval &LI = LIS->getInterval(VirtReg);			return;
	Matrix->unassign(LI);
	enqueue(&LI);			// Register is assigned, put it back on the queue for reassignment.
	}			LiveInterval &LI = LIS->getInterval(VirtReg);
				Matrix->unassign(LI);
	void RAGreedy::LRE_DidCloneVirtReg(unsigned New, unsigned Old) {			enqueue(&LI);
	// Cloning a register we haven't even heard about yet? Just ignore it.			}
	if (!ExtraRegInfo.inBounds(Old))
	return;			void RAGreedy::LRE_DidCloneVirtReg(unsigned New, unsigned Old) {
				// Cloning a register we haven't even heard about yet? Just ignore it.
	// LRE may clone a virtual register because dead code elimination causes it to			if (!ExtraRegInfo.inBounds(Old))
	// be split into connected components. The new components are much smaller			return;
	// than the original, so they should get a new chance at being assigned.
	// same stage as the parent.			// LRE may clone a virtual register because dead code elimination causes it to
	ExtraRegInfo[Old].Stage = RS_Assign;			// be split into connected components. The new components are much smaller
	ExtraRegInfo.grow(New);			// than the original, so they should get a new chance at being assigned.
	ExtraRegInfo[New] = ExtraRegInfo[Old];			// same stage as the parent.
	}			ExtraRegInfo[Old].Stage = RS_Assign;
				ExtraRegInfo.grow(New);
	void RAGreedy::releaseMemory() {			ExtraRegInfo[New] = ExtraRegInfo[Old];
	SpillerInstance.reset();			}
	ExtraRegInfo.clear();
	GlobalCand.clear();			void RAGreedy::releaseMemory() {
	}			SpillerInstance.reset();
				ExtraRegInfo.clear();
	void RAGreedy::enqueue(LiveInterval *LI) { enqueue(Queue, LI); }			GlobalCand.clear();
				}
	void RAGreedy::enqueue(PQueue &CurQueue, LiveInterval *LI) {
	// Prioritize live ranges by size, assigning larger ranges first.			void RAGreedy::enqueue(LiveInterval *LI) { enqueue(Queue, LI); }
	// The queue holds (size, reg) pairs.
	const unsigned Size = LI->getSize();			void RAGreedy::enqueue(PQueue &CurQueue, LiveInterval *LI) {
	const unsigned Reg = LI->reg;			// Prioritize live ranges by size, assigning larger ranges first.
	assert(TargetRegisterInfo::isVirtualRegister(Reg) &&			// The queue holds (size, reg) pairs.
	"Can only enqueue virtual registers");			const unsigned Size = LI->getSize();
	unsigned Prio;			const unsigned Reg = LI->reg;
				assert(TargetRegisterInfo::isVirtualRegister(Reg) &&
	ExtraRegInfo.grow(Reg);			"Can only enqueue virtual registers");
	if (ExtraRegInfo[Reg].Stage == RS_New)			unsigned Prio;
	ExtraRegInfo[Reg].Stage = RS_Assign;
				ExtraRegInfo.grow(Reg);
	if (ExtraRegInfo[Reg].Stage == RS_Split) {			if (ExtraRegInfo[Reg].Stage == RS_New)
	// Unsplit ranges that couldn't be allocated immediately are deferred until			ExtraRegInfo[Reg].Stage = RS_Assign;
	// everything else has been allocated.
	Prio = Size;			if (ExtraRegInfo[Reg].Stage == RS_Split) {
	} else if (ExtraRegInfo[Reg].Stage == RS_Memory) {			// Unsplit ranges that couldn't be allocated immediately are deferred until
	// Memory operand should be considered last.			// everything else has been allocated.
	// Change the priority such that Memory operand are assigned in			Prio = Size;
	// the reverse order that they came in.			} else if (ExtraRegInfo[Reg].Stage == RS_Memory) {
	// TODO: Make this a member variable and probably do something about hints.			// Memory operand should be considered last.
	static unsigned MemOp = 0;			// Change the priority such that Memory operand are assigned in
	Prio = MemOp++;			// the reverse order that they came in.
	} else {			// TODO: Make this a member variable and probably do something about hints.
	// Giant live ranges fall back to the global assignment heuristic, which			static unsigned MemOp = 0;
	// prevents excessive spilling in pathological cases.			Prio = MemOp++;
	bool ReverseLocal = TRI->reverseLocalAssignment();			} else {
	const TargetRegisterClass &RC = *MRI->getRegClass(Reg);			// Giant live ranges fall back to the global assignment heuristic, which
	bool ForceGlobal = !ReverseLocal &&			// prevents excessive spilling in pathological cases.
	(Size / SlotIndex::InstrDist) > (2 * RC.getNumRegs());			bool ReverseLocal = TRI->reverseLocalAssignment();
				const TargetRegisterClass &RC = *MRI->getRegClass(Reg);
	if (ExtraRegInfo[Reg].Stage == RS_Assign && !ForceGlobal && !LI->empty() &&			bool ForceGlobal = !ReverseLocal &&
	LIS->intervalIsInOneMBB(*LI)) {			(Size / SlotIndex::InstrDist) > (2 * RC.getNumRegs());
	// Allocate original local ranges in linear instruction order. Since they
	// are singly defined, this produces optimal coloring in the absence of			if (ExtraRegInfo[Reg].Stage == RS_Assign && !ForceGlobal && !LI->empty() &&
	// global interference and other constraints.			LIS->intervalIsInOneMBB(*LI)) {
	if (!ReverseLocal)			// Allocate original local ranges in linear instruction order. Since they
	Prio = LI->beginIndex().getInstrDistance(Indexes->getLastIndex());			// are singly defined, this produces optimal coloring in the absence of
	else {			// global interference and other constraints.
	// Allocating bottom up may allow many short LRGs to be assigned first			if (!ReverseLocal)
	// to one of the cheap registers. This could be much faster for very			Prio = LI->beginIndex().getInstrDistance(Indexes->getLastIndex());
	// large blocks on targets with many physical registers.			else {
	Prio = Indexes->getZeroIndex().getInstrDistance(LI->endIndex());			// Allocating bottom up may allow many short LRGs to be assigned first
	}			// to one of the cheap registers. This could be much faster for very
	Prio \|= RC.AllocationPriority << 24;			// large blocks on targets with many physical registers.
	} else {			Prio = Indexes->getZeroIndex().getInstrDistance(LI->endIndex());
	// Allocate global and split ranges in long->short order. Long ranges that			}
	// don't fit should be spilled (or split) ASAP so they don't create			Prio \|= RC.AllocationPriority << 24;
	// interference. Mark a bit to prioritize global above local ranges.			} else {
	Prio = (1u << 29) + Size;			// Allocate global and split ranges in long->short order. Long ranges that
	}			// don't fit should be spilled (or split) ASAP so they don't create
	// Mark a higher bit to prioritize global and local above RS_Split.			// interference. Mark a bit to prioritize global above local ranges.
	Prio \|= (1u << 31);			Prio = (1u << 29) + Size;
				}
	// Boost ranges that have a physical register hint.			// Mark a higher bit to prioritize global and local above RS_Split.
	if (VRM->hasKnownPreference(Reg))			Prio \|= (1u << 31);
	Prio \|= (1u << 30);
	}			// Boost ranges that have a physical register hint.
	// The virtual register number is a tie breaker for same-sized ranges.			if (VRM->hasKnownPreference(Reg))
	// Give lower vreg numbers higher priority to assign them first.			Prio \|= (1u << 30);
	CurQueue.push(std::make_pair(Prio, ~Reg));			}
	}			// The virtual register number is a tie breaker for same-sized ranges.
				// Give lower vreg numbers higher priority to assign them first.
	LiveInterval *RAGreedy::dequeue() { return dequeue(Queue); }			CurQueue.push(std::make_pair(Prio, ~Reg));
				}
	LiveInterval *RAGreedy::dequeue(PQueue &CurQueue) {
	if (CurQueue.empty())			LiveInterval *RAGreedy::dequeue() { return dequeue(Queue); }
	return nullptr;
	LiveInterval *LI = &LIS->getInterval(~CurQueue.top().second);			LiveInterval *RAGreedy::dequeue(PQueue &CurQueue) {
	CurQueue.pop();			if (CurQueue.empty())
	return LI;			return nullptr;
	}			LiveInterval *LI = &LIS->getInterval(~CurQueue.top().second);
				CurQueue.pop();
	//===----------------------------------------------------------------------===//			return LI;
	// Direct Assignment			}
	//===----------------------------------------------------------------------===//
				//===----------------------------------------------------------------------===//
	/// tryAssign - Try to assign VirtReg to an available register.			// Direct Assignment
	unsigned RAGreedy::tryAssign(LiveInterval &VirtReg,			//===----------------------------------------------------------------------===//
	AllocationOrder &Order,
	SmallVectorImpl<unsigned> &NewVRegs) {			/// tryAssign - Try to assign VirtReg to an available register.
	Order.rewind();			unsigned RAGreedy::tryAssign(LiveInterval &VirtReg,
	unsigned PhysReg;			AllocationOrder &Order,
	while ((PhysReg = Order.next()))			SmallVectorImpl<unsigned> &NewVRegs) {
	if (!Matrix->checkInterference(VirtReg, PhysReg))			Order.rewind();
	break;			unsigned PhysReg;
	if (!PhysReg \|\| Order.isHint())			while ((PhysReg = Order.next()))
	return PhysReg;			if (!Matrix->checkInterference(VirtReg, PhysReg))
				break;
	// PhysReg is available, but there may be a better choice.			if (!PhysReg \|\| Order.isHint())
				return PhysReg;
	// If we missed a simple hint, try to cheaply evict interference from the
	// preferred register.			// PhysReg is available, but there may be a better choice.
	if (unsigned Hint = MRI->getSimpleHint(VirtReg.reg))
	if (Order.isHint(Hint)) {			// If we missed a simple hint, try to cheaply evict interference from the
	DEBUG(dbgs() << "missed hint " << printReg(Hint, TRI) << '\n');			// preferred register.
	EvictionCost MaxCost;			if (unsigned Hint = MRI->getSimpleHint(VirtReg.reg))
	MaxCost.setBrokenHints(1);			if (Order.isHint(Hint)) {
	if (canEvictInterference(VirtReg, Hint, true, MaxCost)) {			DEBUG(dbgs() << "missed hint " << printReg(Hint, TRI) << '\n');
	evictInterference(VirtReg, Hint, NewVRegs);			EvictionCost MaxCost;
	return Hint;			MaxCost.setBrokenHints(1);
	}			if (canEvictInterference(VirtReg, Hint, true, MaxCost)) {
	// Record the missed hint, we may be able to recover			evictInterference(VirtReg, Hint, NewVRegs);
	// at the end if the surrounding allocation changed.			return Hint;
	SetOfBrokenHints.insert(&VirtReg);			}
	}			// Record the missed hint, we may be able to recover
				// at the end if the surrounding allocation changed.
	// Try to evict interference from a cheaper alternative.			SetOfBrokenHints.insert(&VirtReg);
	unsigned Cost = TRI->getCostPerUse(PhysReg);			}

	// Most registers have 0 additional cost.			// Try to evict interference from a cheaper alternative.
	if (!Cost)			unsigned Cost = TRI->getCostPerUse(PhysReg);
	return PhysReg;
				// Most registers have 0 additional cost.
	DEBUG(dbgs() << printReg(PhysReg, TRI) << " is available at cost " << Cost			if (!Cost)
	<< '\n');			return PhysReg;
	unsigned CheapReg = tryEvict(VirtReg, Order, NewVRegs, Cost);
	return CheapReg ? CheapReg : PhysReg;			DEBUG(dbgs() << printReg(PhysReg, TRI) << " is available at cost " << Cost
	}			<< '\n');
				unsigned CheapReg = tryEvict(VirtReg, Order, NewVRegs, Cost);
	//===----------------------------------------------------------------------===//			return CheapReg ? CheapReg : PhysReg;
	// Interference eviction			}
	//===----------------------------------------------------------------------===//
				//===----------------------------------------------------------------------===//
	unsigned RAGreedy::canReassign(LiveInterval &VirtReg, unsigned PrevReg) {			// Interference eviction
	AllocationOrder Order(VirtReg.reg, *VRM, RegClassInfo, Matrix);			//===----------------------------------------------------------------------===//
	unsigned PhysReg;
	while ((PhysReg = Order.next())) {			unsigned RAGreedy::canReassign(LiveInterval &VirtReg, unsigned PrevReg) {
	if (PhysReg == PrevReg)			AllocationOrder Order(VirtReg.reg, *VRM, RegClassInfo, Matrix);
	continue;			unsigned PhysReg;
				while ((PhysReg = Order.next())) {
	MCRegUnitIterator Units(PhysReg, TRI);			if (PhysReg == PrevReg)
	for (; Units.isValid(); ++Units) {			continue;
	// Instantiate a "subquery", not to be confused with the Queries array.
	LiveIntervalUnion::Query subQ(VirtReg, Matrix->getLiveUnions()[*Units]);			MCRegUnitIterator Units(PhysReg, TRI);
	if (subQ.checkInterference())			for (; Units.isValid(); ++Units) {
	break;			// Instantiate a "subquery", not to be confused with the Queries array.
	}			LiveIntervalUnion::Query subQ(VirtReg, Matrix->getLiveUnions()[*Units]);
	// If no units have interference, break out with the current PhysReg.			if (subQ.checkInterference())
	if (!Units.isValid())			break;
	break;			}
	}			// If no units have interference, break out with the current PhysReg.
	if (PhysReg)			if (!Units.isValid())
	DEBUG(dbgs() << "can reassign: " << VirtReg << " from "			break;
	<< printReg(PrevReg, TRI) << " to " << printReg(PhysReg, TRI)			}
	<< '\n');			if (PhysReg)
	return PhysReg;			DEBUG(dbgs() << "can reassign: " << VirtReg << " from "
	}			<< printReg(PrevReg, TRI) << " to " << printReg(PhysReg, TRI)
				<< '\n');
	/// shouldEvict - determine if A should evict the assigned live range B. The			return PhysReg;
	/// eviction policy defined by this function together with the allocation order			}
	/// defined by enqueue() decides which registers ultimately end up being split
	/// and spilled.			/// shouldEvict - determine if A should evict the assigned live range B. The
	///			/// eviction policy defined by this function together with the allocation order
	/// Cascade numbers are used to prevent infinite loops if this function is a			/// defined by enqueue() decides which registers ultimately end up being split
	/// cyclic relation.			/// and spilled.
	///			///
	/// @param A The live range to be assigned.			/// Cascade numbers are used to prevent infinite loops if this function is a
	/// @param IsHint True when A is about to be assigned to its preferred			/// cyclic relation.
	/// register.			///
	/// @param B The live range to be evicted.			/// @param A The live range to be assigned.
	/// @param BreaksHint True when B is already assigned to its preferred register.			/// @param IsHint True when A is about to be assigned to its preferred
	bool RAGreedy::shouldEvict(LiveInterval &A, bool IsHint,			/// register.
	LiveInterval &B, bool BreaksHint) {			/// @param B The live range to be evicted.
	bool CanSplit = getStage(B) < RS_Spill;			/// @param BreaksHint True when B is already assigned to its preferred register.
				bool RAGreedy::shouldEvict(LiveInterval &A, bool IsHint,
	// Be fairly aggressive about following hints as long as the evictee can be			LiveInterval &B, bool BreaksHint) {
	// split.			bool CanSplit = getStage(B) < RS_Spill;
	if (CanSplit && IsHint && !BreaksHint)
	return true;			// Be fairly aggressive about following hints as long as the evictee can be
				// split.
	if (A.weight > B.weight) {			if (CanSplit && IsHint && !BreaksHint)
	DEBUG(dbgs() << "should evict: " << B << " w= " << B.weight << '\n');			return true;
	return true;
	}			if (A.weight > B.weight) {
	return false;			DEBUG(dbgs() << "should evict: " << B << " w= " << B.weight << '\n');
	}			return true;
				}
	/// canEvictInterference - Return true if all interferences between VirtReg and			return false;
	/// PhysReg can be evicted.			}
	///
	/// @param VirtReg Live range that is about to be assigned.			/// canEvictInterference - Return true if all interferences between VirtReg and
	/// @param PhysReg Desired register for assignment.			/// PhysReg can be evicted.
	/// @param IsHint True when PhysReg is VirtReg's preferred register.			///
	/// @param MaxCost Only look for cheaper candidates and update with new cost			/// @param VirtReg Live range that is about to be assigned.
	/// when returning true.			/// @param PhysReg Desired register for assignment.
	/// @returns True when interference can be evicted cheaper than MaxCost.			/// @param IsHint True when PhysReg is VirtReg's preferred register.
	bool RAGreedy::canEvictInterference(LiveInterval &VirtReg, unsigned PhysReg,			/// @param MaxCost Only look for cheaper candidates and update with new cost
	bool IsHint, EvictionCost &MaxCost) {			/// when returning true.
	// It is only possible to evict virtual register interference.			/// @returns True when interference can be evicted cheaper than MaxCost.
	if (Matrix->checkInterference(VirtReg, PhysReg) > LiveRegMatrix::IK_VirtReg)			bool RAGreedy::canEvictInterference(LiveInterval &VirtReg, unsigned PhysReg,
	return false;			bool IsHint, EvictionCost &MaxCost) {
				// It is only possible to evict virtual register interference.
	bool IsLocal = LIS->intervalIsInOneMBB(VirtReg);			if (Matrix->checkInterference(VirtReg, PhysReg) > LiveRegMatrix::IK_VirtReg)
				return false;
	// Find VirtReg's cascade number. This will be unassigned if VirtReg was never
	// involved in an eviction before. If a cascade number was assigned, deny			bool IsLocal = LIS->intervalIsInOneMBB(VirtReg);
	// evicting anything with the same or a newer cascade number. This prevents
	// infinite eviction loops.			// Find VirtReg's cascade number. This will be unassigned if VirtReg was never
	//			// involved in an eviction before. If a cascade number was assigned, deny
	// This works out so a register without a cascade number is allowed to evict			// evicting anything with the same or a newer cascade number. This prevents
	// anything, and it can be evicted by anything.			// infinite eviction loops.
	unsigned Cascade = ExtraRegInfo[VirtReg.reg].Cascade;			//
	if (!Cascade)			// This works out so a register without a cascade number is allowed to evict
	Cascade = NextCascade;			// anything, and it can be evicted by anything.
				unsigned Cascade = ExtraRegInfo[VirtReg.reg].Cascade;
	EvictionCost Cost;			if (!Cascade)
	for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {			Cascade = NextCascade;
	LiveIntervalUnion::Query &Q = Matrix->query(VirtReg, *Units);
	// If there is 10 or more interferences, chances are one is heavier.			EvictionCost Cost;
	if (Q.collectInterferingVRegs(10) >= 10)			for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
	return false;			LiveIntervalUnion::Query &Q = Matrix->query(VirtReg, *Units);
				// If there is 10 or more interferences, chances are one is heavier.
	// Check if any interfering live range is heavier than MaxWeight.			if (Q.collectInterferingVRegs(10) >= 10)
	for (unsigned i = Q.interferingVRegs().size(); i; --i) {			return false;
	LiveInterval *Intf = Q.interferingVRegs()[i - 1];
	assert(TargetRegisterInfo::isVirtualRegister(Intf->reg) &&			// Check if any interfering live range is heavier than MaxWeight.
	"Only expecting virtual register interference from query");			for (unsigned i = Q.interferingVRegs().size(); i; --i) {
	// Never evict spill products. They cannot split or spill.			LiveInterval *Intf = Q.interferingVRegs()[i - 1];
	if (getStage(*Intf) == RS_Done)			assert(TargetRegisterInfo::isVirtualRegister(Intf->reg) &&
	return false;			"Only expecting virtual register interference from query");
	// Once a live range becomes small enough, it is urgent that we find a			// Never evict spill products. They cannot split or spill.
	// register for it. This is indicated by an infinite spill weight. These			if (getStage(*Intf) == RS_Done)
	// urgent live ranges get to evict almost anything.			return false;
	//			// Once a live range becomes small enough, it is urgent that we find a
	// Also allow urgent evictions of unspillable ranges from a strictly			// register for it. This is indicated by an infinite spill weight. These
	// larger allocation order.			// urgent live ranges get to evict almost anything.
	bool Urgent = !VirtReg.isSpillable() &&			//
	(Intf->isSpillable() \|\|			// Also allow urgent evictions of unspillable ranges from a strictly
	RegClassInfo.getNumAllocatableRegs(MRI->getRegClass(VirtReg.reg)) <			// larger allocation order.
	RegClassInfo.getNumAllocatableRegs(MRI->getRegClass(Intf->reg)));			bool Urgent = !VirtReg.isSpillable() &&
	// Only evict older cascades or live ranges without a cascade.			(Intf->isSpillable() \|\|
	unsigned IntfCascade = ExtraRegInfo[Intf->reg].Cascade;			RegClassInfo.getNumAllocatableRegs(MRI->getRegClass(VirtReg.reg)) <
	if (Cascade <= IntfCascade) {			RegClassInfo.getNumAllocatableRegs(MRI->getRegClass(Intf->reg)));
	if (!Urgent)			// Only evict older cascades or live ranges without a cascade.
	return false;			unsigned IntfCascade = ExtraRegInfo[Intf->reg].Cascade;
	// We permit breaking cascades for urgent evictions. It should be the			if (Cascade <= IntfCascade) {
	// last resort, though, so make it really expensive.			if (!Urgent)
	Cost.BrokenHints += 10;			return false;
	}			// We permit breaking cascades for urgent evictions. It should be the
	// Would this break a satisfied hint?			// last resort, though, so make it really expensive.
	bool BreaksHint = VRM->hasPreferredPhys(Intf->reg);			Cost.BrokenHints += 10;
	// Update eviction cost.			}
	Cost.BrokenHints += BreaksHint;			// Would this break a satisfied hint?
	Cost.MaxWeight = std::max(Cost.MaxWeight, Intf->weight);			bool BreaksHint = VRM->hasPreferredPhys(Intf->reg);
	// Abort if this would be too expensive.			// Update eviction cost.
	if (!(Cost < MaxCost))			Cost.BrokenHints += BreaksHint;
	return false;			Cost.MaxWeight = std::max(Cost.MaxWeight, Intf->weight);
	if (Urgent)			// Abort if this would be too expensive.
	continue;			if (!(Cost < MaxCost))
	// Apply the eviction policy for non-urgent evictions.			return false;
	if (!shouldEvict(VirtReg, IsHint, *Intf, BreaksHint))			if (Urgent)
	return false;			continue;
	// If !MaxCost.isMax(), then we're just looking for a cheap register.			// Apply the eviction policy for non-urgent evictions.
	// Evicting another local live range in this case could lead to suboptimal			if (!shouldEvict(VirtReg, IsHint, *Intf, BreaksHint))
	// coloring.			return false;
	if (!MaxCost.isMax() && IsLocal && LIS->intervalIsInOneMBB(*Intf) &&			// If !MaxCost.isMax(), then we're just looking for a cheap register.
	(!EnableLocalReassign \|\| !canReassign(*Intf, PhysReg))) {			// Evicting another local live range in this case could lead to suboptimal
	return false;			// coloring.
	}			if (!MaxCost.isMax() && IsLocal && LIS->intervalIsInOneMBB(*Intf) &&
	}			(!EnableLocalReassign \|\| !canReassign(*Intf, PhysReg))) {
	}			return false;
	MaxCost = Cost;			}
	return true;			}
	}			}
				MaxCost = Cost;
	/// \brief Return true if all interferences between VirtReg and PhysReg between			return true;
	/// Start and End can be evicted.			}
	///
	/// \param VirtReg Live range that is about to be assigned.			/// \brief Return true if all interferences between VirtReg and PhysReg between
	/// \param PhysReg Desired register for assignment.			/// Start and End can be evicted.
	/// \param Start Start of range to look for interferences.			///
	/// \param End End of range to look for interferences.			/// \param VirtReg Live range that is about to be assigned.
	/// \param MaxCost Only look for cheaper candidates and update with new cost			/// \param PhysReg Desired register for assignment.
	/// when returning true.			/// \param Start Start of range to look for interferences.
	/// \return True when interference can be evicted cheaper than MaxCost.			/// \param End End of range to look for interferences.
	bool RAGreedy::canEvictInterferenceInRange(LiveInterval &VirtReg,			/// \param MaxCost Only look for cheaper candidates and update with new cost
	unsigned PhysReg, SlotIndex Start,			/// when returning true.
	SlotIndex End,			/// \return True when interference can be evicted cheaper than MaxCost.
	EvictionCost &MaxCost) {			bool RAGreedy::canEvictInterferenceInRange(LiveInterval &VirtReg,
	EvictionCost Cost;			unsigned PhysReg, SlotIndex Start,
				SlotIndex End,
	for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {			EvictionCost &MaxCost) {
	LiveIntervalUnion::Query &Q = Matrix->query(VirtReg, *Units);			EvictionCost Cost;

	// Check if any interfering live range is heavier than MaxWeight.			for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
	for (unsigned i = Q.interferingVRegs().size(); i; --i) {			LiveIntervalUnion::Query &Q = Matrix->query(VirtReg, *Units);
	LiveInterval *Intf = Q.interferingVRegs()[i - 1];
				// Check if any interfering live range is heavier than MaxWeight.
	// Check if interference overlast the segment in interest.			for (unsigned i = Q.interferingVRegs().size(); i; --i) {
	if (!Intf->overlaps(Start, End))			LiveInterval *Intf = Q.interferingVRegs()[i - 1];
	continue;
				// Check if interference overlast the segment in interest.
	// Cannot evict non virtual reg interference.			if (!Intf->overlaps(Start, End))
	if (!TargetRegisterInfo::isVirtualRegister(Intf->reg))			continue;
	return false;
	// Never evict spill products. They cannot split or spill.			// Cannot evict non virtual reg interference.
	if (getStage(*Intf) == RS_Done)			if (!TargetRegisterInfo::isVirtualRegister(Intf->reg))
	return false;			return false;
				// Never evict spill products. They cannot split or spill.
	// Would this break a satisfied hint?			if (getStage(*Intf) == RS_Done)
	bool BreaksHint = VRM->hasPreferredPhys(Intf->reg);			return false;
	// Update eviction cost.
	Cost.BrokenHints += BreaksHint;			// Would this break a satisfied hint?
	Cost.MaxWeight = std::max(Cost.MaxWeight, Intf->weight);			bool BreaksHint = VRM->hasPreferredPhys(Intf->reg);
	// Abort if this would be too expensive.			// Update eviction cost.
	if (!(Cost < MaxCost))			Cost.BrokenHints += BreaksHint;
	return false;			Cost.MaxWeight = std::max(Cost.MaxWeight, Intf->weight);
	}			// Abort if this would be too expensive.
	}			if (!(Cost < MaxCost))
				return false;
	if (Cost.MaxWeight == 0)			}
	return false;			}

	MaxCost = Cost;			if (Cost.MaxWeight == 0)
	return true;			return false;
	}
				MaxCost = Cost;
	/// \brief Return tthe physical register that will be best			return true;
	/// candidate for eviction by a local split interval that will be created			}
	/// between Start and End.
	///			/// \brief Return tthe physical register that will be best
	/// \param Order The allocation order			/// candidate for eviction by a local split interval that will be created
	/// \param VirtReg Live range that is about to be assigned.			/// between Start and End.
	/// \param Start Start of range to look for interferences			///
	/// \param End End of range to look for interferences			/// \param Order The allocation order
	/// \param BestEvictweight The eviction cost of that eviction			/// \param VirtReg Live range that is about to be assigned.
	/// \return The PhysReg which is the best candidate for eviction and the			/// \param Start Start of range to look for interferences
	/// eviction cost in BestEvictweight			/// \param End End of range to look for interferences
	unsigned RAGreedy::getCheapestEvicteeWeight(const AllocationOrder &Order,			/// \param BestEvictweight The eviction cost of that eviction
	LiveInterval &VirtReg,			/// \return The PhysReg which is the best candidate for eviction and the
	SlotIndex Start, SlotIndex End,			/// eviction cost in BestEvictweight
	float *BestEvictweight) {			unsigned RAGreedy::getCheapestEvicteeWeight(const AllocationOrder &Order,
	EvictionCost BestEvictCost;			LiveInterval &VirtReg,
	BestEvictCost.setMax();			SlotIndex Start, SlotIndex End,
	BestEvictCost.MaxWeight = VirtReg.weight;			float *BestEvictweight) {
	unsigned BestEvicteePhys = 0;			EvictionCost BestEvictCost;
				BestEvictCost.setMax();
	// Go over all physical registers and find the best candidate for eviction			BestEvictCost.MaxWeight = VirtReg.weight;
	for (auto PhysReg : Order.getOrder()) {			unsigned BestEvicteePhys = 0;

	if (!canEvictInterferenceInRange(VirtReg, PhysReg, Start, End,			// Go over all physical registers and find the best candidate for eviction
	BestEvictCost))			for (auto PhysReg : Order.getOrder()) {
	continue;
				if (!canEvictInterferenceInRange(VirtReg, PhysReg, Start, End,
	// Best so far.			BestEvictCost))
	BestEvicteePhys = PhysReg;			continue;
	}
	*BestEvictweight = BestEvictCost.MaxWeight;			// Best so far.
	return BestEvicteePhys;			BestEvicteePhys = PhysReg;
	}			}
				*BestEvictweight = BestEvictCost.MaxWeight;
	/// evictInterference - Evict any interferring registers that prevent VirtReg			return BestEvicteePhys;
	/// from being assigned to Physreg. This assumes that canEvictInterference			}
	/// returned true.
	void RAGreedy::evictInterference(LiveInterval &VirtReg, unsigned PhysReg,			/// evictInterference - Evict any interferring registers that prevent VirtReg
	SmallVectorImpl<unsigned> &NewVRegs) {			/// from being assigned to Physreg. This assumes that canEvictInterference
	// Make sure that VirtReg has a cascade number, and assign that cascade			/// returned true.
	// number to every evicted register. These live ranges than then only be			void RAGreedy::evictInterference(LiveInterval &VirtReg, unsigned PhysReg,
	// evicted by a newer cascade, preventing infinite loops.			SmallVectorImpl<unsigned> &NewVRegs) {
	unsigned Cascade = ExtraRegInfo[VirtReg.reg].Cascade;			// Make sure that VirtReg has a cascade number, and assign that cascade
	if (!Cascade)			// number to every evicted register. These live ranges than then only be
	Cascade = ExtraRegInfo[VirtReg.reg].Cascade = NextCascade++;			// evicted by a newer cascade, preventing infinite loops.
				unsigned Cascade = ExtraRegInfo[VirtReg.reg].Cascade;
	DEBUG(dbgs() << "evicting " << printReg(PhysReg, TRI)			if (!Cascade)
	<< " interference: Cascade " << Cascade << '\n');			Cascade = ExtraRegInfo[VirtReg.reg].Cascade = NextCascade++;

	// Collect all interfering virtregs first.			DEBUG(dbgs() << "evicting " << printReg(PhysReg, TRI)
	SmallVector<LiveInterval*, 8> Intfs;			<< " interference: Cascade " << Cascade << '\n');
	for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
	LiveIntervalUnion::Query &Q = Matrix->query(VirtReg, *Units);			// Collect all interfering virtregs first.
	// We usually have the interfering VRegs cached so collectInterferingVRegs()			SmallVector<LiveInterval*, 8> Intfs;
	// should be fast, we may need to recalculate if when different physregs			for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
	// overlap the same register unit so we had different SubRanges queried			LiveIntervalUnion::Query &Q = Matrix->query(VirtReg, *Units);
	// against it.			// We usually have the interfering VRegs cached so collectInterferingVRegs()
	Q.collectInterferingVRegs();			// should be fast, we may need to recalculate if when different physregs
	ArrayRef<LiveInterval*> IVR = Q.interferingVRegs();			// overlap the same register unit so we had different SubRanges queried
	Intfs.append(IVR.begin(), IVR.end());			// against it.
	}			Q.collectInterferingVRegs();
				ArrayRef<LiveInterval*> IVR = Q.interferingVRegs();
	// Evict them second. This will invalidate the queries.			Intfs.append(IVR.begin(), IVR.end());
	for (unsigned i = 0, e = Intfs.size(); i != e; ++i) {			}
	LiveInterval *Intf = Intfs[i];
	// The same VirtReg may be present in multiple RegUnits. Skip duplicates.			// Evict them second. This will invalidate the queries.
	if (!VRM->hasPhys(Intf->reg))			for (unsigned i = 0, e = Intfs.size(); i != e; ++i) {
	continue;			LiveInterval *Intf = Intfs[i];
				// The same VirtReg may be present in multiple RegUnits. Skip duplicates.
	LastEvicted.addEviction(PhysReg, VirtReg.reg, Intf->reg);			if (!VRM->hasPhys(Intf->reg))
				continue;
	Matrix->unassign(*Intf);
	assert((ExtraRegInfo[Intf->reg].Cascade < Cascade \|\|			LastEvicted.addEviction(PhysReg, VirtReg.reg, Intf->reg);
	VirtReg.isSpillable() < Intf->isSpillable()) &&
	"Cannot decrease cascade number, illegal eviction");			Matrix->unassign(*Intf);
	ExtraRegInfo[Intf->reg].Cascade = Cascade;			assert((ExtraRegInfo[Intf->reg].Cascade < Cascade \|\|
	++NumEvicted;			VirtReg.isSpillable() < Intf->isSpillable()) &&
	NewVRegs.push_back(Intf->reg);			"Cannot decrease cascade number, illegal eviction");
	}			ExtraRegInfo[Intf->reg].Cascade = Cascade;
	}			++NumEvicted;
				NewVRegs.push_back(Intf->reg);
	/// Returns true if the given \p PhysReg is a callee saved register and has not			}
	/// been used for allocation yet.			}
	bool RAGreedy::isUnusedCalleeSavedReg(unsigned PhysReg) const {
	unsigned CSR = RegClassInfo.getLastCalleeSavedAlias(PhysReg);			/// Returns true if the given \p PhysReg is a callee saved register and has not
	if (CSR == 0)			/// been used for allocation yet.
	return false;			bool RAGreedy::isUnusedCalleeSavedReg(unsigned PhysReg) const {
				unsigned CSR = RegClassInfo.getLastCalleeSavedAlias(PhysReg);
	return !Matrix->isPhysRegUsed(PhysReg);			if (CSR == 0)
	}			return false;

	/// tryEvict - Try to evict all interferences for a physreg.			return !Matrix->isPhysRegUsed(PhysReg);
	/// @param VirtReg Currently unassigned virtual register.			}
	/// @param Order Physregs to try.
	/// @return Physreg to assign VirtReg, or 0.			/// tryEvict - Try to evict all interferences for a physreg.
	unsigned RAGreedy::tryEvict(LiveInterval &VirtReg,			/// @param VirtReg Currently unassigned virtual register.
	AllocationOrder &Order,			/// @param Order Physregs to try.
	SmallVectorImpl<unsigned> &NewVRegs,			/// @return Physreg to assign VirtReg, or 0.
	unsigned CostPerUseLimit) {			unsigned RAGreedy::tryEvict(LiveInterval &VirtReg,
	NamedRegionTimer T("evict", "Evict", TimerGroupName, TimerGroupDescription,			AllocationOrder &Order,
	TimePassesIsEnabled);			SmallVectorImpl<unsigned> &NewVRegs,
				unsigned CostPerUseLimit) {
	// Keep track of the cheapest interference seen so far.			NamedRegionTimer T("evict", "Evict", TimerGroupName, TimerGroupDescription,
	EvictionCost BestCost;			TimePassesIsEnabled);
	BestCost.setMax();
	unsigned BestPhys = 0;			// Keep track of the cheapest interference seen so far.
	unsigned OrderLimit = Order.getOrder().size();			EvictionCost BestCost;
				BestCost.setMax();
	// When we are just looking for a reduced cost per use, don't break any			unsigned BestPhys = 0;
	// hints, and only evict smaller spill weights.			unsigned OrderLimit = Order.getOrder().size();
	if (CostPerUseLimit < ~0u) {
	BestCost.BrokenHints = 0;			// When we are just looking for a reduced cost per use, don't break any
	BestCost.MaxWeight = VirtReg.weight;			// hints, and only evict smaller spill weights.
				if (CostPerUseLimit < ~0u) {
	// Check of any registers in RC are below CostPerUseLimit.			BestCost.BrokenHints = 0;
	const TargetRegisterClass *RC = MRI->getRegClass(VirtReg.reg);			BestCost.MaxWeight = VirtReg.weight;
	unsigned MinCost = RegClassInfo.getMinCost(RC);
	if (MinCost >= CostPerUseLimit) {			// Check of any registers in RC are below CostPerUseLimit.
	DEBUG(dbgs() << TRI->getRegClassName(RC) << " minimum cost = " << MinCost			const TargetRegisterClass *RC = MRI->getRegClass(VirtReg.reg);
	<< ", no cheaper registers to be found.\n");			unsigned MinCost = RegClassInfo.getMinCost(RC);
	return 0;			if (MinCost >= CostPerUseLimit) {
	}			DEBUG(dbgs() << TRI->getRegClassName(RC) << " minimum cost = " << MinCost
				<< ", no cheaper registers to be found.\n");
	// It is normal for register classes to have a long tail of registers with			return 0;
	// the same cost. We don't need to look at them if they're too expensive.			}
	if (TRI->getCostPerUse(Order.getOrder().back()) >= CostPerUseLimit) {
	OrderLimit = RegClassInfo.getLastCostChange(RC);			// It is normal for register classes to have a long tail of registers with
	DEBUG(dbgs() << "Only trying the first " << OrderLimit << " regs.\n");			// the same cost. We don't need to look at them if they're too expensive.
	}			if (TRI->getCostPerUse(Order.getOrder().back()) >= CostPerUseLimit) {
	}			OrderLimit = RegClassInfo.getLastCostChange(RC);
				DEBUG(dbgs() << "Only trying the first " << OrderLimit << " regs.\n");
	Order.rewind();			}
	while (unsigned PhysReg = Order.next(OrderLimit)) {			}
	if (TRI->getCostPerUse(PhysReg) >= CostPerUseLimit)
	continue;			Order.rewind();
	// The first use of a callee-saved register in a function has cost 1.			while (unsigned PhysReg = Order.next(OrderLimit)) {
	// Don't start using a CSR when the CostPerUseLimit is low.			if (TRI->getCostPerUse(PhysReg) >= CostPerUseLimit)
	if (CostPerUseLimit == 1 && isUnusedCalleeSavedReg(PhysReg)) {			continue;
	DEBUG(dbgs() << printReg(PhysReg, TRI) << " would clobber CSR "			// The first use of a callee-saved register in a function has cost 1.
	<< printReg(RegClassInfo.getLastCalleeSavedAlias(PhysReg), TRI)			// Don't start using a CSR when the CostPerUseLimit is low.
	<< '\n');			if (CostPerUseLimit == 1 && isUnusedCalleeSavedReg(PhysReg)) {
	continue;			DEBUG(dbgs() << printReg(PhysReg, TRI) << " would clobber CSR "
	}			<< printReg(RegClassInfo.getLastCalleeSavedAlias(PhysReg), TRI)
				<< '\n');
	if (!canEvictInterference(VirtReg, PhysReg, false, BestCost))			continue;
	continue;			}

	// Best so far.			if (!canEvictInterference(VirtReg, PhysReg, false, BestCost))
	BestPhys = PhysReg;			continue;

	// Stop if the hint can be used.			// Best so far.
	if (Order.isHint())			BestPhys = PhysReg;
	break;
	}			// Stop if the hint can be used.
				if (Order.isHint())
	if (!BestPhys)			break;
	return 0;			}

	evictInterference(VirtReg, BestPhys, NewVRegs);			if (!BestPhys)
	return BestPhys;			return 0;
	}
				evictInterference(VirtReg, BestPhys, NewVRegs);
	//===----------------------------------------------------------------------===//			return BestPhys;
	// Region Splitting			}
	//===----------------------------------------------------------------------===//
				//===----------------------------------------------------------------------===//
	/// addSplitConstraints - Fill out the SplitConstraints vector based on the			// Region Splitting
	/// interference pattern in Physreg and its aliases. Add the constraints to			//===----------------------------------------------------------------------===//
	/// SpillPlacement and return the static cost of this split in Cost, assuming
	/// that all preferences in SplitConstraints are met.			/// addSplitConstraints - Fill out the SplitConstraints vector based on the
	/// Return false if there are no bundles with positive bias.			/// interference pattern in Physreg and its aliases. Add the constraints to
	bool RAGreedy::addSplitConstraints(InterferenceCache::Cursor Intf,			/// SpillPlacement and return the static cost of this split in Cost, assuming
	BlockFrequency &Cost) {			/// that all preferences in SplitConstraints are met.
	ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();			/// Return false if there are no bundles with positive bias.
				bool RAGreedy::addSplitConstraints(InterferenceCache::Cursor Intf,
	// Reset interference dependent info.			BlockFrequency &Cost) {
	SplitConstraints.resize(UseBlocks.size());			ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();
	BlockFrequency StaticCost = 0;
	for (unsigned i = 0; i != UseBlocks.size(); ++i) {			// Reset interference dependent info.
	const SplitAnalysis::BlockInfo &BI = UseBlocks[i];			SplitConstraints.resize(UseBlocks.size());
	SpillPlacement::BlockConstraint &BC = SplitConstraints[i];			BlockFrequency StaticCost = 0;
				for (unsigned i = 0; i != UseBlocks.size(); ++i) {
	BC.Number = BI.MBB->getNumber();			const SplitAnalysis::BlockInfo &BI = UseBlocks[i];
	Intf.moveToBlock(BC.Number);			SpillPlacement::BlockConstraint &BC = SplitConstraints[i];
	BC.Entry = BI.LiveIn ? SpillPlacement::PrefReg : SpillPlacement::DontCare;
	BC.Exit = BI.LiveOut ? SpillPlacement::PrefReg : SpillPlacement::DontCare;			BC.Number = BI.MBB->getNumber();
	BC.ChangesValue = BI.FirstDef.isValid();			Intf.moveToBlock(BC.Number);
				BC.Entry = BI.LiveIn ? SpillPlacement::PrefReg : SpillPlacement::DontCare;
	if (!Intf.hasInterference())			BC.Exit = BI.LiveOut ? SpillPlacement::PrefReg : SpillPlacement::DontCare;
	continue;			BC.ChangesValue = BI.FirstDef.isValid();

	// Number of spill code instructions to insert.			if (!Intf.hasInterference())
	unsigned Ins = 0;			continue;

	// Interference for the live-in value.			// Number of spill code instructions to insert.
	if (BI.LiveIn) {			unsigned Ins = 0;
	if (Intf.first() <= Indexes->getMBBStartIdx(BC.Number)) {
	BC.Entry = SpillPlacement::MustSpill;			// Interference for the live-in value.
	++Ins;			if (BI.LiveIn) {
	} else if (Intf.first() < BI.FirstInstr) {			if (Intf.first() <= Indexes->getMBBStartIdx(BC.Number)) {
	BC.Entry = SpillPlacement::PrefSpill;			BC.Entry = SpillPlacement::MustSpill;
	++Ins;			++Ins;
	} else if (Intf.first() < BI.LastInstr) {			} else if (Intf.first() < BI.FirstInstr) {
	++Ins;			BC.Entry = SpillPlacement::PrefSpill;
	}			++Ins;
	}			} else if (Intf.first() < BI.LastInstr) {
				++Ins;
	// Interference for the live-out value.			}
	if (BI.LiveOut) {			}
	if (Intf.last() >= SA->getLastSplitPoint(BC.Number)) {
	BC.Exit = SpillPlacement::MustSpill;			// Interference for the live-out value.
	++Ins;			if (BI.LiveOut) {
	} else if (Intf.last() > BI.LastInstr) {			if (Intf.last() >= SA->getLastSplitPoint(BC.Number)) {
	BC.Exit = SpillPlacement::PrefSpill;			BC.Exit = SpillPlacement::MustSpill;
	++Ins;			++Ins;
	} else if (Intf.last() > BI.FirstInstr) {			} else if (Intf.last() > BI.LastInstr) {
	++Ins;			BC.Exit = SpillPlacement::PrefSpill;
	}			++Ins;
	}			} else if (Intf.last() > BI.FirstInstr) {
				++Ins;
	// Accumulate the total frequency of inserted spill code.			}
	while (Ins--)			}
	StaticCost += SpillPlacer->getBlockFrequency(BC.Number);
	}			// Accumulate the total frequency of inserted spill code.
	Cost = StaticCost;			while (Ins--)
				StaticCost += SpillPlacer->getBlockFrequency(BC.Number);
	// Add constraints for use-blocks. Note that these are the only constraints			}
	// that may add a positive bias, it is downhill from here.			Cost = StaticCost;
	SpillPlacer->addConstraints(SplitConstraints);
	return SpillPlacer->scanActiveBundles();			// Add constraints for use-blocks. Note that these are the only constraints
	}			// that may add a positive bias, it is downhill from here.
				SpillPlacer->addConstraints(SplitConstraints);
	/// addThroughConstraints - Add constraints and links to SpillPlacer from the			return SpillPlacer->scanActiveBundles();
	/// live-through blocks in Blocks.			}
	void RAGreedy::addThroughConstraints(InterferenceCache::Cursor Intf,
	ArrayRef<unsigned> Blocks) {			/// addThroughConstraints - Add constraints and links to SpillPlacer from the
	const unsigned GroupSize = 8;			/// live-through blocks in Blocks.
	SpillPlacement::BlockConstraint BCS[GroupSize];			void RAGreedy::addThroughConstraints(InterferenceCache::Cursor Intf,
	unsigned TBS[GroupSize];			ArrayRef<unsigned> Blocks) {
	unsigned B = 0, T = 0;			const unsigned GroupSize = 8;
				SpillPlacement::BlockConstraint BCS[GroupSize];
	for (unsigned i = 0; i != Blocks.size(); ++i) {			unsigned TBS[GroupSize];
	unsigned Number = Blocks[i];			unsigned B = 0, T = 0;
	Intf.moveToBlock(Number);
				for (unsigned i = 0; i != Blocks.size(); ++i) {
	if (!Intf.hasInterference()) {			unsigned Number = Blocks[i];
	assert(T < GroupSize && "Array overflow");			Intf.moveToBlock(Number);
	TBS[T] = Number;
	if (++T == GroupSize) {			if (!Intf.hasInterference()) {
	SpillPlacer->addLinks(makeArrayRef(TBS, T));			assert(T < GroupSize && "Array overflow");
	T = 0;			TBS[T] = Number;
	}			if (++T == GroupSize) {
	continue;			SpillPlacer->addLinks(makeArrayRef(TBS, T));
	}			T = 0;
				}
	assert(B < GroupSize && "Array overflow");			continue;
	BCS[B].Number = Number;			}

	// Interference for the live-in value.			assert(B < GroupSize && "Array overflow");
	if (Intf.first() <= Indexes->getMBBStartIdx(Number))			BCS[B].Number = Number;
	BCS[B].Entry = SpillPlacement::MustSpill;
	else			// Interference for the live-in value.
	BCS[B].Entry = SpillPlacement::PrefSpill;			if (Intf.first() <= Indexes->getMBBStartIdx(Number))
				BCS[B].Entry = SpillPlacement::MustSpill;
	// Interference for the live-out value.			else
	if (Intf.last() >= SA->getLastSplitPoint(Number))			BCS[B].Entry = SpillPlacement::PrefSpill;
	BCS[B].Exit = SpillPlacement::MustSpill;
	else			// Interference for the live-out value.
	BCS[B].Exit = SpillPlacement::PrefSpill;			if (Intf.last() >= SA->getLastSplitPoint(Number))
				BCS[B].Exit = SpillPlacement::MustSpill;
	if (++B == GroupSize) {			else
	SpillPlacer->addConstraints(makeArrayRef(BCS, B));			BCS[B].Exit = SpillPlacement::PrefSpill;
	B = 0;
	}			if (++B == GroupSize) {
	}			SpillPlacer->addConstraints(makeArrayRef(BCS, B));
				B = 0;
	SpillPlacer->addConstraints(makeArrayRef(BCS, B));			}
	SpillPlacer->addLinks(makeArrayRef(TBS, T));			}
	}
				SpillPlacer->addConstraints(makeArrayRef(BCS, B));
	void RAGreedy::growRegion(GlobalSplitCandidate &Cand) {			SpillPlacer->addLinks(makeArrayRef(TBS, T));
	// Keep track of through blocks that have not been added to SpillPlacer.			}
	BitVector Todo = SA->getThroughBlocks();
	SmallVectorImpl<unsigned> &ActiveBlocks = Cand.ActiveBlocks;			void RAGreedy::growRegion(GlobalSplitCandidate &Cand) {
	unsigned AddedTo = 0;			// Keep track of through blocks that have not been added to SpillPlacer.
	#ifndef NDEBUG			BitVector Todo = SA->getThroughBlocks();
	unsigned Visited = 0;			SmallVectorImpl<unsigned> &ActiveBlocks = Cand.ActiveBlocks;
	#endif			unsigned AddedTo = 0;
				#ifndef NDEBUG
	while (true) {			unsigned Visited = 0;
	ArrayRef<unsigned> NewBundles = SpillPlacer->getRecentPositive();			#endif
	// Find new through blocks in the periphery of PrefRegBundles.
	for (int i = 0, e = NewBundles.size(); i != e; ++i) {			while (true) {
	unsigned Bundle = NewBundles[i];			ArrayRef<unsigned> NewBundles = SpillPlacer->getRecentPositive();
	// Look at all blocks connected to Bundle in the full graph.			// Find new through blocks in the periphery of PrefRegBundles.
	ArrayRef<unsigned> Blocks = Bundles->getBlocks(Bundle);			for (int i = 0, e = NewBundles.size(); i != e; ++i) {
	for (ArrayRef<unsigned>::iterator I = Blocks.begin(), E = Blocks.end();			unsigned Bundle = NewBundles[i];
	I != E; ++I) {			// Look at all blocks connected to Bundle in the full graph.
	unsigned Block = *I;			ArrayRef<unsigned> Blocks = Bundles->getBlocks(Bundle);
	if (!Todo.test(Block))			for (ArrayRef<unsigned>::iterator I = Blocks.begin(), E = Blocks.end();
	continue;			I != E; ++I) {
	Todo.reset(Block);			unsigned Block = *I;
	// This is a new through block. Add it to SpillPlacer later.			if (!Todo.test(Block))
	ActiveBlocks.push_back(Block);			continue;
	#ifndef NDEBUG			Todo.reset(Block);
	++Visited;			// This is a new through block. Add it to SpillPlacer later.
	#endif			ActiveBlocks.push_back(Block);
	}			#ifndef NDEBUG
	}			++Visited;
	// Any new blocks to add?			#endif
	if (ActiveBlocks.size() == AddedTo)			}
	break;			}
				// Any new blocks to add?
	// Compute through constraints from the interference, or assume that all			if (ActiveBlocks.size() == AddedTo)
	// through blocks prefer spilling when forming compact regions.			break;
	auto NewBlocks = makeArrayRef(ActiveBlocks).slice(AddedTo);
	if (Cand.PhysReg)			// Compute through constraints from the interference, or assume that all
	addThroughConstraints(Cand.Intf, NewBlocks);			// through blocks prefer spilling when forming compact regions.
	else			auto NewBlocks = makeArrayRef(ActiveBlocks).slice(AddedTo);
	// Provide a strong negative bias on through blocks to prevent unwanted			if (Cand.PhysReg)
	// liveness on loop backedges.			addThroughConstraints(Cand.Intf, NewBlocks);
	SpillPlacer->addPrefSpill(NewBlocks, /* Strong= */ true);			else
	AddedTo = ActiveBlocks.size();			// Provide a strong negative bias on through blocks to prevent unwanted
				// liveness on loop backedges.
	// Perhaps iterating can enable more bundles?			SpillPlacer->addPrefSpill(NewBlocks, /* Strong= */ true);
	SpillPlacer->iterate();			AddedTo = ActiveBlocks.size();
	}
	DEBUG(dbgs() << ", v=" << Visited);			// Perhaps iterating can enable more bundles?
	}			SpillPlacer->iterate();
				}
	/// calcCompactRegion - Compute the set of edge bundles that should be live			DEBUG(dbgs() << ", v=" << Visited);
	/// when splitting the current live range into compact regions. Compact			}
	/// regions can be computed without looking at interference. They are the
	/// regions formed by removing all the live-through blocks from the live range.			/// calcCompactRegion - Compute the set of edge bundles that should be live
	///			/// when splitting the current live range into compact regions. Compact
	/// Returns false if the current live range is already compact, or if the			/// regions can be computed without looking at interference. They are the
	/// compact regions would form single block regions anyway.			/// regions formed by removing all the live-through blocks from the live range.
	bool RAGreedy::calcCompactRegion(GlobalSplitCandidate &Cand) {			///
	// Without any through blocks, the live range is already compact.			/// Returns false if the current live range is already compact, or if the
	if (!SA->getNumThroughBlocks())			/// compact regions would form single block regions anyway.
	return false;			bool RAGreedy::calcCompactRegion(GlobalSplitCandidate &Cand) {
				// Without any through blocks, the live range is already compact.
	// Compact regions don't correspond to any physreg.			if (!SA->getNumThroughBlocks())
	Cand.reset(IntfCache, 0);			return false;

	DEBUG(dbgs() << "Compact region bundles");			// Compact regions don't correspond to any physreg.
				Cand.reset(IntfCache, 0);
	// Use the spill placer to determine the live bundles. GrowRegion pretends
	// that all the through blocks have interference when PhysReg is unset.			DEBUG(dbgs() << "Compact region bundles");
	SpillPlacer->prepare(Cand.LiveBundles);
				// Use the spill placer to determine the live bundles. GrowRegion pretends
	// The static split cost will be zero since Cand.Intf reports no interference.			// that all the through blocks have interference when PhysReg is unset.
	BlockFrequency Cost;			SpillPlacer->prepare(Cand.LiveBundles);
	if (!addSplitConstraints(Cand.Intf, Cost)) {
	DEBUG(dbgs() << ", none.\n");			// The static split cost will be zero since Cand.Intf reports no interference.
	return false;			BlockFrequency Cost;
	}			if (!addSplitConstraints(Cand.Intf, Cost)) {
				DEBUG(dbgs() << ", none.\n");
	growRegion(Cand);			return false;
	SpillPlacer->finish();			}

	if (!Cand.LiveBundles.any()) {			growRegion(Cand);
	DEBUG(dbgs() << ", none.\n");			SpillPlacer->finish();
	return false;
	}			if (!Cand.LiveBundles.any()) {
				DEBUG(dbgs() << ", none.\n");
	DEBUG({			return false;
	for (int i : Cand.LiveBundles.set_bits())			}
	dbgs() << " EB#" << i;
	dbgs() << ".\n";			DEBUG({
	});			for (int i : Cand.LiveBundles.set_bits())
	return true;			dbgs() << " EB#" << i;
	}			dbgs() << ".\n";
				});
	/// calcSpillCost - Compute how expensive it would be to split the live range in			return true;
	/// SA around all use blocks instead of forming bundle regions.			}
	BlockFrequency RAGreedy::calcSpillCost() {
	BlockFrequency Cost = 0;			/// calcSpillCost - Compute how expensive it would be to split the live range in
	ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();			/// SA around all use blocks instead of forming bundle regions.
	for (unsigned i = 0; i != UseBlocks.size(); ++i) {			BlockFrequency RAGreedy::calcSpillCost() {
	const SplitAnalysis::BlockInfo &BI = UseBlocks[i];			BlockFrequency Cost = 0;
	unsigned Number = BI.MBB->getNumber();			ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();
	// We normally only need one spill instruction - a load or a store.			for (unsigned i = 0; i != UseBlocks.size(); ++i) {
	Cost += SpillPlacer->getBlockFrequency(Number);			const SplitAnalysis::BlockInfo &BI = UseBlocks[i];
				unsigned Number = BI.MBB->getNumber();
	// Unless the value is redefined in the block.			// We normally only need one spill instruction - a load or a store.
	if (BI.LiveIn && BI.LiveOut && BI.FirstDef)			Cost += SpillPlacer->getBlockFrequency(Number);
	Cost += SpillPlacer->getBlockFrequency(Number);
	}			// Unless the value is redefined in the block.
	return Cost;			if (BI.LiveIn && BI.LiveOut && BI.FirstDef)
	}			Cost += SpillPlacer->getBlockFrequency(Number);
				}
	/// \brief Check if splitting Evictee will create a local split interval in			return Cost;
	/// basic block number BBNumber that may cause a bad eviction chain. This is			}
	/// intended to prevent bad eviction sequences like:
	/// movl %ebp, 8(%esp) # 4-byte Spill			/// \brief Check if splitting Evictee will create a local split interval in
	/// movl %ecx, %ebp			/// basic block number BBNumber that may cause a bad eviction chain. This is
	/// movl %ebx, %ecx			/// intended to prevent bad eviction sequences like:
	/// movl %edi, %ebx			/// movl %ebp, 8(%esp) # 4-byte Spill
	/// movl %edx, %edi			/// movl %ecx, %ebp
	/// cltd			/// movl %ebx, %ecx
	/// idivl %esi			/// movl %edi, %ebx
	/// movl %edi, %edx			/// movl %edx, %edi
	/// movl %ebx, %edi			/// cltd
	/// movl %ecx, %ebx			/// idivl %esi
	/// movl %ebp, %ecx			/// movl %edi, %edx
	/// movl 16(%esp), %ebp # 4 - byte Reload			/// movl %ebx, %edi
	///			/// movl %ecx, %ebx
	/// Such sequences are created in 2 scenarios:			/// movl %ebp, %ecx
	///			/// movl 16(%esp), %ebp # 4 - byte Reload
	/// Scenario #1:			///
	/// %0 is evicted from physreg0 by %1.			/// Such sequences are created in 2 scenarios:
	/// Evictee %0 is intended for region splitting with split candidate			///
	/// physreg0 (the reg %0 was evicted from).			/// Scenario #1:
	/// Region splitting creates a local interval because of interference with the			/// %0 is evicted from physreg0 by %1.
	/// evictor %1 (normally region spliitting creates 2 interval, the "by reg"			/// Evictee %0 is intended for region splitting with split candidate
	/// and "by stack" intervals and local interval created when interference			/// physreg0 (the reg %0 was evicted from).
	/// occurs).			/// Region splitting creates a local interval because of interference with the
	/// One of the split intervals ends up evicting %2 from physreg1.			/// evictor %1 (normally region spliitting creates 2 interval, the "by reg"
	/// Evictee %2 is intended for region splitting with split candidate			/// and "by stack" intervals and local interval created when interference
	/// physreg1.			/// occurs).
	/// One of the split intervals ends up evicting %3 from physreg2, etc.			/// One of the split intervals ends up evicting %2 from physreg1.
	///			/// Evictee %2 is intended for region splitting with split candidate
	/// Scenario #2			/// physreg1.
	/// %0 is evicted from physreg0 by %1.			/// One of the split intervals ends up evicting %3 from physreg2, etc.
	/// %2 is evicted from physreg2 by %3 etc.			///
	/// Evictee %0 is intended for region splitting with split candidate			/// Scenario #2
	/// physreg1.			/// %0 is evicted from physreg0 by %1.
	/// Region splitting creates a local interval because of interference with the			/// %2 is evicted from physreg2 by %3 etc.
	/// evictor %1.			/// Evictee %0 is intended for region splitting with split candidate
	/// One of the split intervals ends up evicting back original evictor %1			/// physreg1.
	/// from physreg0 (the reg %0 was evicted from).			/// Region splitting creates a local interval because of interference with the
	/// Another evictee %2 is intended for region splitting with split candidate			/// evictor %1.
	/// physreg1.			/// One of the split intervals ends up evicting back original evictor %1
	/// One of the split intervals ends up evicting %3 from physreg2, etc.			/// from physreg0 (the reg %0 was evicted from).
	///			/// Another evictee %2 is intended for region splitting with split candidate
	/// \param Evictee The register considered to be split.			/// physreg1.
	/// \param Cand The split candidate that determines the physical register			/// One of the split intervals ends up evicting %3 from physreg2, etc.
	/// we are splitting for and the interferences.			///
	/// \param BBNumber The number of a BB for which the region split process will			/// \param Evictee The register considered to be split.
	/// create a local split interval.			/// \param Cand The split candidate that determines the physical register
	/// \param Order The phisical registers that may get evicted by a split			/// we are splitting for and the interferences.
	/// artifact of Evictee.			/// \param BBNumber The number of a BB for which the region split process will
	/// \return True if splitting Evictee may cause a bad eviction chain, false			/// create a local split interval.
	/// otherwise.			/// \param Order The physical registers that may get evicted by a split
	bool RAGreedy::splitCanCauseEvictionChain(unsigned Evictee,			/// artifact of Evictee.
	GlobalSplitCandidate &Cand,			/// \return True if splitting Evictee may cause a bad eviction chain, false
	unsigned BBNumber,			/// otherwise.
	const AllocationOrder &Order) {			bool RAGreedy::splitCanCauseEvictionChain(unsigned Evictee,
	EvictionTrack::EvictorInfo VregEvictorInfo = LastEvicted.getEvictor(Evictee);			GlobalSplitCandidate &Cand,
	unsigned Evictor = VregEvictorInfo.first;			unsigned BBNumber,
	unsigned PhysReg = VregEvictorInfo.second;			const AllocationOrder &Order) {
				EvictionTrack::EvictorInfo VregEvictorInfo = LastEvicted.getEvictor(Evictee);
	// No actual evictor.			unsigned Evictor = VregEvictorInfo.first;
	if (!Evictor \|\| !PhysReg)			unsigned PhysReg = VregEvictorInfo.second;
	return false;
				// No actual evictor.
	float MaxWeight = 0;			if (!Evictor \|\| !PhysReg)
	unsigned FutureEvictedPhysReg =			return false;
	getCheapestEvicteeWeight(Order, LIS->getInterval(Evictee),
	Cand.Intf.first(), Cand.Intf.last(), &MaxWeight);			float MaxWeight = 0;
				unsigned FutureEvictedPhysReg =
	// The bad eviction chain occurs when either the split candidate the the			getCheapestEvicteeWeight(Order, LIS->getInterval(Evictee),
	// evited reg or one of the split artifact will evict the evicting reg.			Cand.Intf.first(), Cand.Intf.last(), &MaxWeight);
	if ((PhysReg != Cand.PhysReg) && (PhysReg != FutureEvictedPhysReg))
	return false;			// The bad eviction chain occurs when either the split candidate is the
				// evicting reg or one of the split artifact will evict the evicting reg.
	Cand.Intf.moveToBlock(BBNumber);			if ((PhysReg != Cand.PhysReg) && (PhysReg != FutureEvictedPhysReg))
				return false;
	// Check to see if the Evictor contains interference (with Evictee) in the
	// given BB. If so, this interference caused the eviction of Evictee from			Cand.Intf.moveToBlock(BBNumber);
	// PhysReg. This suggest that we will create a local interval during the
	// region split to avoid this interference This local interval may cause a bad			// Check to see if the Evictor contains interference (with Evictee) in the
	// eviction chain.			// given BB. If so, this interference caused the eviction of Evictee from
	if (!LIS->hasInterval(Evictor))			// PhysReg. This suggest that we will create a local interval during the
	return false;			// region split to avoid this interference This local interval may cause a bad
	LiveInterval &EvictorLI = LIS->getInterval(Evictor);			// eviction chain.
	if (EvictorLI.FindSegmentContaining(Cand.Intf.first()) == EvictorLI.end())			if (!LIS->hasInterval(Evictor))
	return false;			return false;
				LiveInterval &EvictorLI = LIS->getInterval(Evictor);
	// Now, check to see if the local interval we will create is going to be			if (EvictorLI.FindSegmentContaining(Cand.Intf.first()) == EvictorLI.end())
	// expensive enough to evict somebody If so, this may cause a bad eviction			return false;
	// chain.
	VirtRegAuxInfo VRAI(MF, LIS, VRM, getAnalysis<MachineLoopInfo>(), *MBFI);			// Now, check to see if the local interval we will create is going to be
	float splitArtifactWeight =			// expensive enough to evict somebody. If so, this may cause a bad eviction
	VRAI.futureWeight(LIS->getInterval(Evictee),			// chain.
	Cand.Intf.first().getPrevIndex(), Cand.Intf.last());			VirtRegAuxInfo VRAI(MF, LIS, VRM, getAnalysis<MachineLoopInfo>(), *MBFI);
	if (splitArtifactWeight >= 0 && splitArtifactWeight < MaxWeight)			float splitArtifactWeight =
	return false;			VRAI.futureWeight(LIS->getInterval(Evictee),
				Cand.Intf.first().getPrevIndex(), Cand.Intf.last());
	return true;			if (splitArtifactWeight >= 0 && splitArtifactWeight < MaxWeight)
	}			return false;

	/// calcGlobalSplitCost - Return the global split cost of following the split			return true;
	/// pattern in LiveBundles. This cost should be added to the local cost of the			}
	/// interference pattern in SplitConstraints.
	///			/// \brief Check if splitting VirtRegToSplit will create a local split interval
	BlockFrequency RAGreedy::calcGlobalSplitCost(GlobalSplitCandidate &Cand,			/// in basic block number BBNumber that may cause a spill.
	const AllocationOrder &Order,			///
	bool *CanCauseEvictionChain) {			/// \param VirtRegToSplit The register considered to be split.
	BlockFrequency GlobalCost = 0;			/// \param Cand The split candidate that determines the physical
	const BitVector &LiveBundles = Cand.LiveBundles;			/// register we are splitting for and the interferences.
	unsigned VirtRegToSplit = SA->getParent().reg;			/// \param BBNumber The number of a BB for which the region split process
	ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();			/// will create a local split interval.
	for (unsigned i = 0; i != UseBlocks.size(); ++i) {			/// \param Order The physical registers that may get evicted by a
	const SplitAnalysis::BlockInfo &BI = UseBlocks[i];			/// split artifact of VirtRegToSplit.
	SpillPlacement::BlockConstraint &BC = SplitConstraints[i];			/// \return True if splitting VirtRegToSplit may cause a spill, false
	bool RegIn = LiveBundles[Bundles->getBundle(BC.Number, false)];			/// otherwise.
	bool RegOut = LiveBundles[Bundles->getBundle(BC.Number, true)];			bool RAGreedy::splitCanCauseLocalSpill(unsigned VirtRegToSplit,
	unsigned Ins = 0;			GlobalSplitCandidate &Cand,
				unsigned BBNumber,
	Cand.Intf.moveToBlock(BC.Number);			const AllocationOrder &Order) {
	// Check wheather a local interval is going to be created during the region			Cand.Intf.moveToBlock(BBNumber);
	// split.
	if (EnableAdvancedRASplitCost && CanCauseEvictionChain &&			// Check if the local interval will find a non interfereing assignment.
	Cand.Intf.hasInterference() && BI.LiveIn && BI.LiveOut && RegIn &&			for (auto PhysReg : Order.getOrder()) {
	RegOut) {			if (!Matrix->checkInterference(Cand.Intf.first().getPrevIndex(),
				Cand.Intf.last(), PhysReg))
	if (splitCanCauseEvictionChain(VirtRegToSplit, Cand, BC.Number, Order)) {			return false;
	// This interfernce cause our eviction from this assignment, we might			}
	// evict somebody else, add that cost.
	// See splitCanCauseEvictionChain for detailed description of scenarios.			// Check if the local interval will evict a cheaper interval.
	GlobalCost += SpillPlacer->getBlockFrequency(BC.Number);			float CheapestEvictWeight = 0;
	GlobalCost += SpillPlacer->getBlockFrequency(BC.Number);			unsigned FutureEvictedPhysReg = getCheapestEvicteeWeight(
				Order, LIS->getInterval(VirtRegToSplit), Cand.Intf.first(),
	*CanCauseEvictionChain = true;			Cand.Intf.last(), &CheapestEvictWeight);
	}
	}			// Have we found an interval that can be evicted?
				if (FutureEvictedPhysReg) {
	if (BI.LiveIn)			VirtRegAuxInfo VRAI(MF, LIS, VRM, getAnalysis<MachineLoopInfo>(), *MBFI);
	Ins += RegIn != (BC.Entry == SpillPlacement::PrefReg);			float splitArtifactWeight =
	if (BI.LiveOut)			VRAI.futureWeight(LIS->getInterval(VirtRegToSplit),
	Ins += RegOut != (BC.Exit == SpillPlacement::PrefReg);			Cand.Intf.first().getPrevIndex(), Cand.Intf.last());
	while (Ins--)			// Will the weight of the local interval be higher than the cheapest evictee
	GlobalCost += SpillPlacer->getBlockFrequency(BC.Number);			// weight? If so it will evict it and will not cause a spill.
	}			if (splitArtifactWeight >= 0 && splitArtifactWeight > CheapestEvictWeight)
				return false;
	for (unsigned i = 0, e = Cand.ActiveBlocks.size(); i != e; ++i) {			}
	unsigned Number = Cand.ActiveBlocks[i];
	bool RegIn = LiveBundles[Bundles->getBundle(Number, false)];			// The local interval is not able to find non interferening assignment and not
	bool RegOut = LiveBundles[Bundles->getBundle(Number, true)];			// able to evict a less worthy interval, therfore, it can cause a spill.
	if (!RegIn && !RegOut)			return true;
	continue;			}
	if (RegIn && RegOut) {
	// We need double spill code if this block has interference.			/// calcGlobalSplitCost - Return the global split cost of following the split
	Cand.Intf.moveToBlock(Number);			/// pattern in LiveBundles. This cost should be added to the local cost of the
	if (Cand.Intf.hasInterference()) {			/// interference pattern in SplitConstraints.
	GlobalCost += SpillPlacer->getBlockFrequency(Number);			///
	GlobalCost += SpillPlacer->getBlockFrequency(Number);			BlockFrequency RAGreedy::calcGlobalSplitCost(GlobalSplitCandidate &Cand,
				const AllocationOrder &Order,
	// Check wheather a local interval is going to be created during the			bool *CanCauseEvictionChain) {
	// region split.			BlockFrequency GlobalCost = 0;
	if (EnableAdvancedRASplitCost && CanCauseEvictionChain &&			const BitVector &LiveBundles = Cand.LiveBundles;
	splitCanCauseEvictionChain(VirtRegToSplit, Cand, Number, Order)) {			unsigned VirtRegToSplit = SA->getParent().reg;
	// This interfernce cause our eviction from this assignment, we might			ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();
	// evict somebody else, add that cost.			for (unsigned i = 0; i != UseBlocks.size(); ++i) {
	// See splitCanCauseEvictionChain for detailed description of			const SplitAnalysis::BlockInfo &BI = UseBlocks[i];
	// scenarios.			SpillPlacement::BlockConstraint &BC = SplitConstraints[i];
	GlobalCost += SpillPlacer->getBlockFrequency(Number);			bool RegIn = LiveBundles[Bundles->getBundle(BC.Number, false)];
	GlobalCost += SpillPlacer->getBlockFrequency(Number);			bool RegOut = LiveBundles[Bundles->getBundle(BC.Number, true)];
				unsigned Ins = 0;
	*CanCauseEvictionChain = true;
	}			Cand.Intf.moveToBlock(BC.Number);
	}			// Check wheather a local interval is going to be created during the region
	continue;			// split. Calculate adavanced spilt cost (cost of local intervals) if option
	}			// is enabled.
	// live-in / stack-out or stack-in live-out.			if (EnableAdvancedRASplitCost && Cand.Intf.hasInterference() && BI.LiveIn &&
	GlobalCost += SpillPlacer->getBlockFrequency(Number);			BI.LiveOut && RegIn && RegOut) {
	}
	return GlobalCost;			if (CanCauseEvictionChain &&
	}			splitCanCauseEvictionChain(VirtRegToSplit, Cand, BC.Number, Order)) {
				// This interfernce causes our eviction from this assignment, we might
	/// splitAroundRegion - Split the current live range around the regions			// evict somebody else and eventually someone will spill, add that cost.
	/// determined by BundleCand and GlobalCand.			// See splitCanCauseEvictionChain for detailed description of scenarios.
	///			GlobalCost += SpillPlacer->getBlockFrequency(BC.Number);
	/// Before calling this function, GlobalCand and BundleCand must be initialized			GlobalCost += SpillPlacer->getBlockFrequency(BC.Number);
	/// so each bundle is assigned to a valid candidate, or NoCand for the
	/// stack-bound bundles. The shared SA/SE SplitAnalysis and SplitEditor			*CanCauseEvictionChain = true;
	/// objects must be initialized for the current live range, and intervals
	/// created for the used candidates.			} else if (splitCanCauseLocalSpill(VirtRegToSplit, Cand, BC.Number,
	///			Order)) {
	/// @param LREdit The LiveRangeEdit object handling the current split.			// This interfernce causes local interval to spill, add that cost.
				qcolombetUnsubmitted Done Reply Inline Actions interference qcolombet:* *interference
	/// @param UsedCands List of used GlobalCand entries. Every BundleCand value			GlobalCost += SpillPlacer->getBlockFrequency(BC.Number);
	/// must appear in this list.			GlobalCost += SpillPlacer->getBlockFrequency(BC.Number);
	void RAGreedy::splitAroundRegion(LiveRangeEdit &LREdit,			}
	ArrayRef<unsigned> UsedCands) {			}
	// These are the intervals created for new global ranges. We may create more
	// intervals for local ranges.			if (BI.LiveIn)
	const unsigned NumGlobalIntvs = LREdit.size();			Ins += RegIn != (BC.Entry == SpillPlacement::PrefReg);
	DEBUG(dbgs() << "splitAroundRegion with " << NumGlobalIntvs << " globals.\n");			if (BI.LiveOut)
	assert(NumGlobalIntvs && "No global intervals configured");			Ins += RegOut != (BC.Exit == SpillPlacement::PrefReg);
				while (Ins--)
	// Isolate even single instructions when dealing with a proper sub-class.			GlobalCost += SpillPlacer->getBlockFrequency(BC.Number);
	// That guarantees register class inflation for the stack interval because it			}
	// is all copies.
	unsigned Reg = SA->getParent().reg;			for (unsigned i = 0, e = Cand.ActiveBlocks.size(); i != e; ++i) {
	bool SingleInstrs = RegClassInfo.isProperSubClass(MRI->getRegClass(Reg));			unsigned Number = Cand.ActiveBlocks[i];
				bool RegIn = LiveBundles[Bundles->getBundle(Number, false)];
	// First handle all the blocks with uses.			bool RegOut = LiveBundles[Bundles->getBundle(Number, true)];
	ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();			if (!RegIn && !RegOut)
	for (unsigned i = 0; i != UseBlocks.size(); ++i) {			continue;
	const SplitAnalysis::BlockInfo &BI = UseBlocks[i];			if (RegIn && RegOut) {
	unsigned Number = BI.MBB->getNumber();			// We need double spill code if this block has interference.
	unsigned IntvIn = 0, IntvOut = 0;			Cand.Intf.moveToBlock(Number);
	SlotIndex IntfIn, IntfOut;			if (Cand.Intf.hasInterference()) {
	if (BI.LiveIn) {			GlobalCost += SpillPlacer->getBlockFrequency(Number);
	unsigned CandIn = BundleCand[Bundles->getBundle(Number, false)];			GlobalCost += SpillPlacer->getBlockFrequency(Number);
	if (CandIn != NoCand) {
	GlobalSplitCandidate &Cand = GlobalCand[CandIn];			// Check wheather a local interval is going to be created during the
	IntvIn = Cand.IntvIdx;			// region split.
	Cand.Intf.moveToBlock(Number);			if (EnableAdvancedRASplitCost && CanCauseEvictionChain &&
	IntfIn = Cand.Intf.first();			splitCanCauseEvictionChain(VirtRegToSplit, Cand, Number, Order)) {
	}			// This interfernce cause our eviction from this assignment, we might
	}			// evict somebody else, add that cost.
	if (BI.LiveOut) {			// See splitCanCauseEvictionChain for detailed description of
	unsigned CandOut = BundleCand[Bundles->getBundle(Number, true)];			// scenarios.
	if (CandOut != NoCand) {			GlobalCost += SpillPlacer->getBlockFrequency(Number);
	GlobalSplitCandidate &Cand = GlobalCand[CandOut];			GlobalCost += SpillPlacer->getBlockFrequency(Number);
	IntvOut = Cand.IntvIdx;
	Cand.Intf.moveToBlock(Number);			*CanCauseEvictionChain = true;
	IntfOut = Cand.Intf.last();			}
	}			}
	}			continue;
				}
	// Create separate intervals for isolated blocks with multiple uses.			// live-in / stack-out or stack-in live-out.
	if (!IntvIn && !IntvOut) {			GlobalCost += SpillPlacer->getBlockFrequency(Number);
	DEBUG(dbgs() << printMBBReference(*BI.MBB) << " isolated.\n");			}
	if (SA->shouldSplitSingleBlock(BI, SingleInstrs))			return GlobalCost;
	SE->splitSingleBlock(BI);			}
	continue;
	}			/// splitAroundRegion - Split the current live range around the regions
				/// determined by BundleCand and GlobalCand.
	if (IntvIn && IntvOut)			///
	SE->splitLiveThroughBlock(Number, IntvIn, IntfIn, IntvOut, IntfOut);			/// Before calling this function, GlobalCand and BundleCand must be initialized
	else if (IntvIn)			/// so each bundle is assigned to a valid candidate, or NoCand for the
	SE->splitRegInBlock(BI, IntvIn, IntfIn);			/// stack-bound bundles. The shared SA/SE SplitAnalysis and SplitEditor
	else			/// objects must be initialized for the current live range, and intervals
	SE->splitRegOutBlock(BI, IntvOut, IntfOut);			/// created for the used candidates.
	}			///
				/// @param LREdit The LiveRangeEdit object handling the current split.
	// Handle live-through blocks. The relevant live-through blocks are stored in			/// @param UsedCands List of used GlobalCand entries. Every BundleCand value
	// the ActiveBlocks list with each candidate. We need to filter out			/// must appear in this list.
	// duplicates.			void RAGreedy::splitAroundRegion(LiveRangeEdit &LREdit,
	BitVector Todo = SA->getThroughBlocks();			ArrayRef<unsigned> UsedCands) {
	for (unsigned c = 0; c != UsedCands.size(); ++c) {			// These are the intervals created for new global ranges. We may create more
	ArrayRef<unsigned> Blocks = GlobalCand[UsedCands[c]].ActiveBlocks;			// intervals for local ranges.
	for (unsigned i = 0, e = Blocks.size(); i != e; ++i) {			const unsigned NumGlobalIntvs = LREdit.size();
	unsigned Number = Blocks[i];			DEBUG(dbgs() << "splitAroundRegion with " << NumGlobalIntvs << " globals.\n");
	if (!Todo.test(Number))			assert(NumGlobalIntvs && "No global intervals configured");
	continue;
	Todo.reset(Number);			// Isolate even single instructions when dealing with a proper sub-class.
				// That guarantees register class inflation for the stack interval because it
	unsigned IntvIn = 0, IntvOut = 0;			// is all copies.
	SlotIndex IntfIn, IntfOut;			unsigned Reg = SA->getParent().reg;
				bool SingleInstrs = RegClassInfo.isProperSubClass(MRI->getRegClass(Reg));
	unsigned CandIn = BundleCand[Bundles->getBundle(Number, false)];
	if (CandIn != NoCand) {			// First handle all the blocks with uses.
	GlobalSplitCandidate &Cand = GlobalCand[CandIn];			ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();
	IntvIn = Cand.IntvIdx;			for (unsigned i = 0; i != UseBlocks.size(); ++i) {
	Cand.Intf.moveToBlock(Number);			const SplitAnalysis::BlockInfo &BI = UseBlocks[i];
	IntfIn = Cand.Intf.first();			unsigned Number = BI.MBB->getNumber();
	}			unsigned IntvIn = 0, IntvOut = 0;
				SlotIndex IntfIn, IntfOut;
	unsigned CandOut = BundleCand[Bundles->getBundle(Number, true)];			if (BI.LiveIn) {
	if (CandOut != NoCand) {			unsigned CandIn = BundleCand[Bundles->getBundle(Number, false)];
	GlobalSplitCandidate &Cand = GlobalCand[CandOut];			if (CandIn != NoCand) {
	IntvOut = Cand.IntvIdx;			GlobalSplitCandidate &Cand = GlobalCand[CandIn];
	Cand.Intf.moveToBlock(Number);			IntvIn = Cand.IntvIdx;
	IntfOut = Cand.Intf.last();			Cand.Intf.moveToBlock(Number);
	}			IntfIn = Cand.Intf.first();
	if (!IntvIn && !IntvOut)			}
	continue;			}
	SE->splitLiveThroughBlock(Number, IntvIn, IntfIn, IntvOut, IntfOut);			if (BI.LiveOut) {
	}			unsigned CandOut = BundleCand[Bundles->getBundle(Number, true)];
	}			if (CandOut != NoCand) {
				GlobalSplitCandidate &Cand = GlobalCand[CandOut];
	++NumGlobalSplits;			IntvOut = Cand.IntvIdx;
				Cand.Intf.moveToBlock(Number);
	SmallVector<unsigned, 8> IntvMap;			IntfOut = Cand.Intf.last();
	SE->finish(&IntvMap);			}
	DebugVars->splitRegister(Reg, LREdit.regs(), *LIS);			}

	ExtraRegInfo.resize(MRI->getNumVirtRegs());			// Create separate intervals for isolated blocks with multiple uses.
	unsigned OrigBlocks = SA->getNumLiveBlocks();			if (!IntvIn && !IntvOut) {
				DEBUG(dbgs() << printMBBReference(*BI.MBB) << " isolated.\n");
	// Sort out the new intervals created by splitting. We get four kinds:			if (SA->shouldSplitSingleBlock(BI, SingleInstrs))
	// - Remainder intervals should not be split again.			SE->splitSingleBlock(BI);
	// - Candidate intervals can be assigned to Cand.PhysReg.			continue;
	// - Block-local splits are candidates for local splitting.			}
	// - DCE leftovers should go back on the queue.
	for (unsigned i = 0, e = LREdit.size(); i != e; ++i) {			if (IntvIn && IntvOut)
	LiveInterval &Reg = LIS->getInterval(LREdit.get(i));			SE->splitLiveThroughBlock(Number, IntvIn, IntfIn, IntvOut, IntfOut);
				else if (IntvIn)
	// Ignore old intervals from DCE.			SE->splitRegInBlock(BI, IntvIn, IntfIn);
	if (getStage(Reg) != RS_New)			else
	continue;			SE->splitRegOutBlock(BI, IntvOut, IntfOut);
				}
	// Remainder interval. Don't try splitting again, spill if it doesn't
	// allocate.			// Handle live-through blocks. The relevant live-through blocks are stored in
	if (IntvMap[i] == 0) {			// the ActiveBlocks list with each candidate. We need to filter out
	setStage(Reg, RS_Spill);			// duplicates.
	continue;			BitVector Todo = SA->getThroughBlocks();
	}			for (unsigned c = 0; c != UsedCands.size(); ++c) {
				ArrayRef<unsigned> Blocks = GlobalCand[UsedCands[c]].ActiveBlocks;
	// Global intervals. Allow repeated splitting as long as the number of live			for (unsigned i = 0, e = Blocks.size(); i != e; ++i) {
	// blocks is strictly decreasing.			unsigned Number = Blocks[i];
	if (IntvMap[i] < NumGlobalIntvs) {			if (!Todo.test(Number))
	if (SA->countLiveBlocks(&Reg) >= OrigBlocks) {			continue;
	DEBUG(dbgs() << "Main interval covers the same " << OrigBlocks			Todo.reset(Number);
	<< " blocks as original.\n");
	// Don't allow repeated splitting as a safe guard against looping.			unsigned IntvIn = 0, IntvOut = 0;
	setStage(Reg, RS_Split2);			SlotIndex IntfIn, IntfOut;
	}
	continue;			unsigned CandIn = BundleCand[Bundles->getBundle(Number, false)];
	}			if (CandIn != NoCand) {
				GlobalSplitCandidate &Cand = GlobalCand[CandIn];
	// Other intervals are treated as new. This includes local intervals created			IntvIn = Cand.IntvIdx;
	// for blocks with multiple uses, and anything created by DCE.			Cand.Intf.moveToBlock(Number);
	}			IntfIn = Cand.Intf.first();
				}
	if (VerifyEnabled)
	MF->verify(this, "After splitting live range around region");			unsigned CandOut = BundleCand[Bundles->getBundle(Number, true)];
	}			if (CandOut != NoCand) {
				GlobalSplitCandidate &Cand = GlobalCand[CandOut];
	unsigned RAGreedy::tryRegionSplit(LiveInterval &VirtReg, AllocationOrder &Order,			IntvOut = Cand.IntvIdx;
	SmallVectorImpl<unsigned> &NewVRegs) {			Cand.Intf.moveToBlock(Number);
	unsigned NumCands = 0;			IntfOut = Cand.Intf.last();
	BlockFrequency SpillCost = calcSpillCost();			}
	BlockFrequency BestCost;			if (!IntvIn && !IntvOut)
				continue;
	// Check if we can split this live range around a compact region.			SE->splitLiveThroughBlock(Number, IntvIn, IntfIn, IntvOut, IntfOut);
	bool HasCompact = calcCompactRegion(GlobalCand.front());			}
	if (HasCompact) {			}
	// Yes, keep GlobalCand[0] as the compact region candidate.
	NumCands = 1;			++NumGlobalSplits;
	BestCost = BlockFrequency::getMaxFrequency();
	} else {			SmallVector<unsigned, 8> IntvMap;
	// No benefit from the compact region, our fallback will be per-block			SE->finish(&IntvMap);
	// splitting. Make sure we find a solution that is cheaper than spilling.			DebugVars->splitRegister(Reg, LREdit.regs(), *LIS);
	BestCost = SpillCost;
	DEBUG(dbgs() << "Cost of isolating all blocks = ";			ExtraRegInfo.resize(MRI->getNumVirtRegs());
	MBFI->printBlockFreq(dbgs(), BestCost) << '\n');			unsigned OrigBlocks = SA->getNumLiveBlocks();
	}
				// Sort out the new intervals created by splitting. We get four kinds:
	bool CanCauseEvictionChain = false;			// - Remainder intervals should not be split again.
	unsigned BestCand =			// - Candidate intervals can be assigned to Cand.PhysReg.
	calculateRegionSplitCost(VirtReg, Order, BestCost, NumCands,			// - Block-local splits are candidates for local splitting.
	false /IgnoreCSR/, &CanCauseEvictionChain);			// - DCE leftovers should go back on the queue.
				for (unsigned i = 0, e = LREdit.size(); i != e; ++i) {
	// Split candidates with compact regions can cause a bad eviction sequence.			LiveInterval &Reg = LIS->getInterval(LREdit.get(i));
	// See splitCanCauseEvictionChain for detailed description of scenarios.
	// To avoid it, we need to comapre the cost with the spill cost and not the			// Ignore old intervals from DCE.
	// current max frequency.			if (getStage(Reg) != RS_New)
	if (HasCompact && (BestCost > SpillCost) && (BestCand != NoCand) &&			continue;
	CanCauseEvictionChain) {
	return 0;			// Remainder interval. Don't try splitting again, spill if it doesn't
	}			// allocate.
				if (IntvMap[i] == 0) {
	// No solutions found, fall back to single block splitting.			setStage(Reg, RS_Spill);
	if (!HasCompact && BestCand == NoCand)			continue;
	return 0;			}

	return doRegionSplit(VirtReg, BestCand, HasCompact, NewVRegs);			// Global intervals. Allow repeated splitting as long as the number of live
	}			// blocks is strictly decreasing.
				if (IntvMap[i] < NumGlobalIntvs) {
	unsigned RAGreedy::calculateRegionSplitCost(LiveInterval &VirtReg,			if (SA->countLiveBlocks(&Reg) >= OrigBlocks) {
	AllocationOrder &Order,			DEBUG(dbgs() << "Main interval covers the same " << OrigBlocks
	BlockFrequency &BestCost,			<< " blocks as original.\n");
	unsigned &NumCands, bool IgnoreCSR,			// Don't allow repeated splitting as a safe guard against looping.
	bool *CanCauseEvictionChain) {			setStage(Reg, RS_Split2);
	unsigned BestCand = NoCand;			}
	Order.rewind();			continue;
	while (unsigned PhysReg = Order.next()) {			}
	if (IgnoreCSR && isUnusedCalleeSavedReg(PhysReg))
	continue;			// Other intervals are treated as new. This includes local intervals created
				// for blocks with multiple uses, and anything created by DCE.
	// Discard bad candidates before we run out of interference cache cursors.			}
	// This will only affect register classes with a lot of registers (>32).
	if (NumCands == IntfCache.getMaxCursors()) {			if (VerifyEnabled)
	unsigned WorstCount = ~0u;			MF->verify(this, "After splitting live range around region");
	unsigned Worst = 0;			}
	for (unsigned i = 0; i != NumCands; ++i) {
	if (i == BestCand \|\| !GlobalCand[i].PhysReg)			unsigned RAGreedy::tryRegionSplit(LiveInterval &VirtReg, AllocationOrder &Order,
	continue;			SmallVectorImpl<unsigned> &NewVRegs) {
	unsigned Count = GlobalCand[i].LiveBundles.count();			unsigned NumCands = 0;
	if (Count < WorstCount) {			BlockFrequency SpillCost = calcSpillCost();
	Worst = i;			BlockFrequency BestCost;
	WorstCount = Count;
	}			// Check if we can split this live range around a compact region.
	}			bool HasCompact = calcCompactRegion(GlobalCand.front());
	--NumCands;			if (HasCompact) {
	GlobalCand[Worst] = GlobalCand[NumCands];			// Yes, keep GlobalCand[0] as the compact region candidate.
	if (BestCand == NumCands)			NumCands = 1;
	BestCand = Worst;			BestCost = BlockFrequency::getMaxFrequency();
	}			} else {
				// No benefit from the compact region, our fallback will be per-block
	if (GlobalCand.size() <= NumCands)			// splitting. Make sure we find a solution that is cheaper than spilling.
	GlobalCand.resize(NumCands+1);			BestCost = SpillCost;
	GlobalSplitCandidate &Cand = GlobalCand[NumCands];			DEBUG(dbgs() << "Cost of isolating all blocks = ";
	Cand.reset(IntfCache, PhysReg);			MBFI->printBlockFreq(dbgs(), BestCost) << '\n');
				}
	SpillPlacer->prepare(Cand.LiveBundles);
	BlockFrequency Cost;			bool CanCauseEvictionChain = false;
	if (!addSplitConstraints(Cand.Intf, Cost)) {			unsigned BestCand =
	DEBUG(dbgs() << printReg(PhysReg, TRI) << "\tno positive bundles\n");			calculateRegionSplitCost(VirtReg, Order, BestCost, NumCands,
	continue;			false /IgnoreCSR/, &CanCauseEvictionChain);
	}
	DEBUG(dbgs() << printReg(PhysReg, TRI) << "\tstatic = ";			// Split candidates with compact regions can cause a bad eviction sequence.
	MBFI->printBlockFreq(dbgs(), Cost));			// See splitCanCauseEvictionChain for detailed description of scenarios.
	if (Cost >= BestCost) {			// To avoid it, we need to comapre the cost with the spill cost and not the
	DEBUG({			// current max frequency.
	if (BestCand == NoCand)			if (HasCompact && (BestCost > SpillCost) && (BestCand != NoCand) &&
	dbgs() << " worse than no bundles\n";			CanCauseEvictionChain) {
	else			return 0;
	dbgs() << " worse than "			}
	<< printReg(GlobalCand[BestCand].PhysReg, TRI) << '\n';
	});			// No solutions found, fall back to single block splitting.
	continue;			if (!HasCompact && BestCand == NoCand)
	}			return 0;
	growRegion(Cand);
				return doRegionSplit(VirtReg, BestCand, HasCompact, NewVRegs);
	SpillPlacer->finish();			}

	// No live bundles, defer to splitSingleBlocks().			unsigned RAGreedy::calculateRegionSplitCost(LiveInterval &VirtReg,
	if (!Cand.LiveBundles.any()) {			AllocationOrder &Order,
	DEBUG(dbgs() << " no bundles.\n");			BlockFrequency &BestCost,
	continue;			unsigned &NumCands, bool IgnoreCSR,
	}			bool *CanCauseEvictionChain) {
				unsigned BestCand = NoCand;
	bool HasEvictionChain = false;			Order.rewind();
	Cost += calcGlobalSplitCost(Cand, Order, &HasEvictionChain);			while (unsigned PhysReg = Order.next()) {
	DEBUG({			if (IgnoreCSR && isUnusedCalleeSavedReg(PhysReg))
	dbgs() << ", total = "; MBFI->printBlockFreq(dbgs(), Cost)			continue;
	<< " with bundles";
	for (int i : Cand.LiveBundles.set_bits())			// Discard bad candidates before we run out of interference cache cursors.
	dbgs() << " EB#" << i;			// This will only affect register classes with a lot of registers (>32).
	dbgs() << ".\n";			if (NumCands == IntfCache.getMaxCursors()) {
	});			unsigned WorstCount = ~0u;
	if (Cost < BestCost) {			unsigned Worst = 0;
	BestCand = NumCands;			for (unsigned i = 0; i != NumCands; ++i) {
	BestCost = Cost;			if (i == BestCand \|\| !GlobalCand[i].PhysReg)
	// See splitCanCauseEvictionChain for detailed description of bad			continue;
	// eviction chain scenarios.			unsigned Count = GlobalCand[i].LiveBundles.count();
	if (CanCauseEvictionChain)			if (Count < WorstCount) {
	*CanCauseEvictionChain = HasEvictionChain;			Worst = i;
	}			WorstCount = Count;
	++NumCands;			}
	}			}
				--NumCands;
	if (CanCauseEvictionChain && BestCand != NoCand) {			GlobalCand[Worst] = GlobalCand[NumCands];
	// See splitCanCauseEvictionChain for detailed description of bad			if (BestCand == NumCands)
	// eviction chain scenarios.			BestCand = Worst;
	DEBUG(dbgs() << "Best split candidate of vreg "			}
	<< printReg(VirtReg.reg, TRI) << " may ");
	if (!(*CanCauseEvictionChain))			if (GlobalCand.size() <= NumCands)
	DEBUG(dbgs() << "not ");			GlobalCand.resize(NumCands+1);
	DEBUG(dbgs() << "cause bad eviction chain\n");			GlobalSplitCandidate &Cand = GlobalCand[NumCands];
	}			Cand.reset(IntfCache, PhysReg);

	return BestCand;			SpillPlacer->prepare(Cand.LiveBundles);
	}			BlockFrequency Cost;
				if (!addSplitConstraints(Cand.Intf, Cost)) {
	unsigned RAGreedy::doRegionSplit(LiveInterval &VirtReg, unsigned BestCand,			DEBUG(dbgs() << printReg(PhysReg, TRI) << "\tno positive bundles\n");
	bool HasCompact,			continue;
	SmallVectorImpl<unsigned> &NewVRegs) {			}
	SmallVector<unsigned, 8> UsedCands;			DEBUG(dbgs() << printReg(PhysReg, TRI) << "\tstatic = ";
	// Prepare split editor.			MBFI->printBlockFreq(dbgs(), Cost));
	LiveRangeEdit LREdit(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);			if (Cost >= BestCost) {
	SE->reset(LREdit, SplitSpillMode);			DEBUG({
				if (BestCand == NoCand)
	// Assign all edge bundles to the preferred candidate, or NoCand.			dbgs() << " worse than no bundles\n";
	BundleCand.assign(Bundles->getNumBundles(), NoCand);			else
				dbgs() << " worse than "
	// Assign bundles for the best candidate region.			<< printReg(GlobalCand[BestCand].PhysReg, TRI) << '\n';
	if (BestCand != NoCand) {			});
	GlobalSplitCandidate &Cand = GlobalCand[BestCand];			continue;
	if (unsigned B = Cand.getBundles(BundleCand, BestCand)) {			}
	UsedCands.push_back(BestCand);			growRegion(Cand);
	Cand.IntvIdx = SE->openIntv();
	DEBUG(dbgs() << "Split for " << printReg(Cand.PhysReg, TRI) << " in "			SpillPlacer->finish();
	<< B << " bundles, intv " << Cand.IntvIdx << ".\n");
	(void)B;			// No live bundles, defer to splitSingleBlocks().
	}			if (!Cand.LiveBundles.any()) {
	}			DEBUG(dbgs() << " no bundles.\n");
				continue;
	// Assign bundles for the compact region.			}
	if (HasCompact) {
	GlobalSplitCandidate &Cand = GlobalCand.front();			bool HasEvictionChain = false;
	assert(!Cand.PhysReg && "Compact region has no physreg");			Cost += calcGlobalSplitCost(Cand, Order, &HasEvictionChain);
	if (unsigned B = Cand.getBundles(BundleCand, 0)) {			DEBUG({
	UsedCands.push_back(0);			dbgs() << ", total = "; MBFI->printBlockFreq(dbgs(), Cost)
	Cand.IntvIdx = SE->openIntv();			<< " with bundles";
	DEBUG(dbgs() << "Split for compact region in " << B << " bundles, intv "			for (int i : Cand.LiveBundles.set_bits())
	<< Cand.IntvIdx << ".\n");			dbgs() << " EB#" << i;
	(void)B;			dbgs() << ".\n";
	}			});
	}			if (Cost < BestCost) {
				BestCand = NumCands;
	splitAroundRegion(LREdit, UsedCands);			BestCost = Cost;
	return 0;			// See splitCanCauseEvictionChain for detailed description of bad
	}			// eviction chain scenarios.
				if (CanCauseEvictionChain)
	//===----------------------------------------------------------------------===//			*CanCauseEvictionChain = HasEvictionChain;
	// Per-Block Splitting			}
	//===----------------------------------------------------------------------===//			++NumCands;
				}
	/// tryBlockSplit - Split a global live range around every block with uses. This
	/// creates a lot of local live ranges, that will be split by tryLocalSplit if			if (CanCauseEvictionChain && BestCand != NoCand) {
	/// they don't allocate.			// See splitCanCauseEvictionChain for detailed description of bad
	unsigned RAGreedy::tryBlockSplit(LiveInterval &VirtReg, AllocationOrder &Order,			// eviction chain scenarios.
	SmallVectorImpl<unsigned> &NewVRegs) {			DEBUG(dbgs() << "Best split candidate of vreg "
	assert(&SA->getParent() == &VirtReg && "Live range wasn't analyzed");			<< printReg(VirtReg.reg, TRI) << " may ");
	unsigned Reg = VirtReg.reg;			if (!(*CanCauseEvictionChain))
	bool SingleInstrs = RegClassInfo.isProperSubClass(MRI->getRegClass(Reg));			DEBUG(dbgs() << "not ");
	LiveRangeEdit LREdit(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);			DEBUG(dbgs() << "cause bad eviction chain\n");
	SE->reset(LREdit, SplitSpillMode);			}
	ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();
	for (unsigned i = 0; i != UseBlocks.size(); ++i) {			return BestCand;
	const SplitAnalysis::BlockInfo &BI = UseBlocks[i];			}
	if (SA->shouldSplitSingleBlock(BI, SingleInstrs))
	SE->splitSingleBlock(BI);			unsigned RAGreedy::doRegionSplit(LiveInterval &VirtReg, unsigned BestCand,
	}			bool HasCompact,
	// No blocks were split.			SmallVectorImpl<unsigned> &NewVRegs) {
	if (LREdit.empty())			SmallVector<unsigned, 8> UsedCands;
	return 0;			// Prepare split editor.
				LiveRangeEdit LREdit(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);
	// We did split for some blocks.			SE->reset(LREdit, SplitSpillMode);
	SmallVector<unsigned, 8> IntvMap;
	SE->finish(&IntvMap);			// Assign all edge bundles to the preferred candidate, or NoCand.
				BundleCand.assign(Bundles->getNumBundles(), NoCand);
	// Tell LiveDebugVariables about the new ranges.
	DebugVars->splitRegister(Reg, LREdit.regs(), *LIS);			// Assign bundles for the best candidate region.
				if (BestCand != NoCand) {
	ExtraRegInfo.resize(MRI->getNumVirtRegs());			GlobalSplitCandidate &Cand = GlobalCand[BestCand];
				if (unsigned B = Cand.getBundles(BundleCand, BestCand)) {
	// Sort out the new intervals created by splitting. The remainder interval			UsedCands.push_back(BestCand);
	// goes straight to spilling, the new local ranges get to stay RS_New.			Cand.IntvIdx = SE->openIntv();
	for (unsigned i = 0, e = LREdit.size(); i != e; ++i) {			DEBUG(dbgs() << "Split for " << printReg(Cand.PhysReg, TRI) << " in "
	LiveInterval &LI = LIS->getInterval(LREdit.get(i));			<< B << " bundles, intv " << Cand.IntvIdx << ".\n");
	if (getStage(LI) == RS_New && IntvMap[i] == 0)			(void)B;
	setStage(LI, RS_Spill);			}
	}			}

	if (VerifyEnabled)			// Assign bundles for the compact region.
	MF->verify(this, "After splitting live range around basic blocks");			if (HasCompact) {
	return 0;			GlobalSplitCandidate &Cand = GlobalCand.front();
	}			assert(!Cand.PhysReg && "Compact region has no physreg");
				if (unsigned B = Cand.getBundles(BundleCand, 0)) {
	//===----------------------------------------------------------------------===//			UsedCands.push_back(0);
	// Per-Instruction Splitting			Cand.IntvIdx = SE->openIntv();
	//===----------------------------------------------------------------------===//			DEBUG(dbgs() << "Split for compact region in " << B << " bundles, intv "
				<< Cand.IntvIdx << ".\n");
	/// Get the number of allocatable registers that match the constraints of \p Reg			(void)B;
	/// on \p MI and that are also in \p SuperRC.			}
	static unsigned getNumAllocatableRegsForConstraints(			}
	const MachineInstr MI, unsigned Reg, const TargetRegisterClass SuperRC,
	const TargetInstrInfo TII, const TargetRegisterInfo TRI,			splitAroundRegion(LREdit, UsedCands);
	const RegisterClassInfo &RCI) {			return 0;
	assert(SuperRC && "Invalid register class");			}

	const TargetRegisterClass *ConstrainedRC =			//===----------------------------------------------------------------------===//
	MI->getRegClassConstraintEffectForVReg(Reg, SuperRC, TII, TRI,			// Per-Block Splitting
	/* ExploreBundle */ true);			//===----------------------------------------------------------------------===//
	if (!ConstrainedRC)
	return 0;			/// tryBlockSplit - Split a global live range around every block with uses. This
	return RCI.getNumAllocatableRegs(ConstrainedRC);			/// creates a lot of local live ranges, that will be split by tryLocalSplit if
	}			/// they don't allocate.
				unsigned RAGreedy::tryBlockSplit(LiveInterval &VirtReg, AllocationOrder &Order,
	/// tryInstructionSplit - Split a live range around individual instructions.			SmallVectorImpl<unsigned> &NewVRegs) {
	/// This is normally not worthwhile since the spiller is doing essentially the			assert(&SA->getParent() == &VirtReg && "Live range wasn't analyzed");
	/// same thing. However, when the live range is in a constrained register			unsigned Reg = VirtReg.reg;
	/// class, it may help to insert copies such that parts of the live range can			bool SingleInstrs = RegClassInfo.isProperSubClass(MRI->getRegClass(Reg));
	/// be moved to a larger register class.			LiveRangeEdit LREdit(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);
	///			SE->reset(LREdit, SplitSpillMode);
	/// This is similar to spilling to a larger register class.			ArrayRef<SplitAnalysis::BlockInfo> UseBlocks = SA->getUseBlocks();
	unsigned			for (unsigned i = 0; i != UseBlocks.size(); ++i) {
	RAGreedy::tryInstructionSplit(LiveInterval &VirtReg, AllocationOrder &Order,			const SplitAnalysis::BlockInfo &BI = UseBlocks[i];
	SmallVectorImpl<unsigned> &NewVRegs) {			if (SA->shouldSplitSingleBlock(BI, SingleInstrs))
	const TargetRegisterClass *CurRC = MRI->getRegClass(VirtReg.reg);			SE->splitSingleBlock(BI);
	// There is no point to this if there are no larger sub-classes.			}
	if (!RegClassInfo.isProperSubClass(CurRC))			// No blocks were split.
	return 0;			if (LREdit.empty())
				return 0;
	// Always enable split spill mode, since we're effectively spilling to a
	// register.			// We did split for some blocks.
	LiveRangeEdit LREdit(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);			SmallVector<unsigned, 8> IntvMap;
	SE->reset(LREdit, SplitEditor::SM_Size);			SE->finish(&IntvMap);

	ArrayRef<SlotIndex> Uses = SA->getUseSlots();			// Tell LiveDebugVariables about the new ranges.
	if (Uses.size() <= 1)			DebugVars->splitRegister(Reg, LREdit.regs(), *LIS);
	return 0;
				ExtraRegInfo.resize(MRI->getNumVirtRegs());
	DEBUG(dbgs() << "Split around " << Uses.size() << " individual instrs.\n");
				// Sort out the new intervals created by splitting. The remainder interval
	const TargetRegisterClass *SuperRC =			// goes straight to spilling, the new local ranges get to stay RS_New.
	TRI->getLargestLegalSuperClass(CurRC, *MF);			for (unsigned i = 0, e = LREdit.size(); i != e; ++i) {
	unsigned SuperRCNumAllocatableRegs = RCI.getNumAllocatableRegs(SuperRC);			LiveInterval &LI = LIS->getInterval(LREdit.get(i));
	// Split around every non-copy instruction if this split will relax			if (getStage(LI) == RS_New && IntvMap[i] == 0)
	// the constraints on the virtual register.			setStage(LI, RS_Spill);
	// Otherwise, splitting just inserts uncoalescable copies that do not help			}
	// the allocation.
	for (unsigned i = 0; i != Uses.size(); ++i) {			if (VerifyEnabled)
	if (const MachineInstr *MI = Indexes->getInstructionFromIndex(Uses[i]))			MF->verify(this, "After splitting live range around basic blocks");
	if (MI->isFullCopy() \|\|			return 0;
	SuperRCNumAllocatableRegs ==			}
	getNumAllocatableRegsForConstraints(MI, VirtReg.reg, SuperRC, TII,
	TRI, RCI)) {			//===----------------------------------------------------------------------===//
	DEBUG(dbgs() << " skip:\t" << Uses[i] << '\t' << *MI);			// Per-Instruction Splitting
	continue;			//===----------------------------------------------------------------------===//
	}
	SE->openIntv();			/// Get the number of allocatable registers that match the constraints of \p Reg
	SlotIndex SegStart = SE->enterIntvBefore(Uses[i]);			/// on \p MI and that are also in \p SuperRC.
	SlotIndex SegStop = SE->leaveIntvAfter(Uses[i]);			static unsigned getNumAllocatableRegsForConstraints(
	SE->useIntv(SegStart, SegStop);			const MachineInstr MI, unsigned Reg, const TargetRegisterClass SuperRC,
	}			const TargetInstrInfo TII, const TargetRegisterInfo TRI,
				const RegisterClassInfo &RCI) {
	if (LREdit.empty()) {			assert(SuperRC && "Invalid register class");
	DEBUG(dbgs() << "All uses were copies.\n");
	return 0;			const TargetRegisterClass *ConstrainedRC =
	}			MI->getRegClassConstraintEffectForVReg(Reg, SuperRC, TII, TRI,
				/* ExploreBundle */ true);
	SmallVector<unsigned, 8> IntvMap;			if (!ConstrainedRC)
	SE->finish(&IntvMap);			return 0;
	DebugVars->splitRegister(VirtReg.reg, LREdit.regs(), *LIS);			return RCI.getNumAllocatableRegs(ConstrainedRC);
	ExtraRegInfo.resize(MRI->getNumVirtRegs());			}

	// Assign all new registers to RS_Spill. This was the last chance.			/// tryInstructionSplit - Split a live range around individual instructions.
	setStage(LREdit.begin(), LREdit.end(), RS_Spill);			/// This is normally not worthwhile since the spiller is doing essentially the
	return 0;			/// same thing. However, when the live range is in a constrained register
	}			/// class, it may help to insert copies such that parts of the live range can
				/// be moved to a larger register class.
	//===----------------------------------------------------------------------===//			///
	// Local Splitting			/// This is similar to spilling to a larger register class.
	//===----------------------------------------------------------------------===//			unsigned
				RAGreedy::tryInstructionSplit(LiveInterval &VirtReg, AllocationOrder &Order,
	/// calcGapWeights - Compute the maximum spill weight that needs to be evicted			SmallVectorImpl<unsigned> &NewVRegs) {
	/// in order to use PhysReg between two entries in SA->UseSlots.			const TargetRegisterClass *CurRC = MRI->getRegClass(VirtReg.reg);
	///			// There is no point to this if there are no larger sub-classes.
	/// GapWeight[i] represents the gap between UseSlots[i] and UseSlots[i+1].			if (!RegClassInfo.isProperSubClass(CurRC))
	///			return 0;
	void RAGreedy::calcGapWeights(unsigned PhysReg,
	SmallVectorImpl<float> &GapWeight) {			// Always enable split spill mode, since we're effectively spilling to a
	assert(SA->getUseBlocks().size() == 1 && "Not a local interval");			// register.
	const SplitAnalysis::BlockInfo &BI = SA->getUseBlocks().front();			LiveRangeEdit LREdit(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);
	ArrayRef<SlotIndex> Uses = SA->getUseSlots();			SE->reset(LREdit, SplitEditor::SM_Size);
	const unsigned NumGaps = Uses.size()-1;
				ArrayRef<SlotIndex> Uses = SA->getUseSlots();
	// Start and end points for the interference check.			if (Uses.size() <= 1)
	SlotIndex StartIdx =			return 0;
	BI.LiveIn ? BI.FirstInstr.getBaseIndex() : BI.FirstInstr;
	SlotIndex StopIdx =			DEBUG(dbgs() << "Split around " << Uses.size() << " individual instrs.\n");
	BI.LiveOut ? BI.LastInstr.getBoundaryIndex() : BI.LastInstr;
				const TargetRegisterClass *SuperRC =
	GapWeight.assign(NumGaps, 0.0f);			TRI->getLargestLegalSuperClass(CurRC, *MF);
				unsigned SuperRCNumAllocatableRegs = RCI.getNumAllocatableRegs(SuperRC);
	// Add interference from each overlapping register.			// Split around every non-copy instruction if this split will relax
	for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {			// the constraints on the virtual register.
	if (!Matrix->query(const_cast<LiveInterval&>(SA->getParent()), *Units)			// Otherwise, splitting just inserts uncoalescable copies that do not help
	.checkInterference())			// the allocation.
	continue;			for (unsigned i = 0; i != Uses.size(); ++i) {
				if (const MachineInstr *MI = Indexes->getInstructionFromIndex(Uses[i]))
	// We know that VirtReg is a continuous interval from FirstInstr to			if (MI->isFullCopy() \|\|
	// LastInstr, so we don't need InterferenceQuery.			SuperRCNumAllocatableRegs ==
	//			getNumAllocatableRegsForConstraints(MI, VirtReg.reg, SuperRC, TII,
	// Interference that overlaps an instruction is counted in both gaps			TRI, RCI)) {
	// surrounding the instruction. The exception is interference before			DEBUG(dbgs() << " skip:\t" << Uses[i] << '\t' << *MI);
	// StartIdx and after StopIdx.			continue;
	//			}
	LiveIntervalUnion::SegmentIter IntI =			SE->openIntv();
	Matrix->getLiveUnions()[*Units] .find(StartIdx);			SlotIndex SegStart = SE->enterIntvBefore(Uses[i]);
	for (unsigned Gap = 0; IntI.valid() && IntI.start() < StopIdx; ++IntI) {			SlotIndex SegStop = SE->leaveIntvAfter(Uses[i]);
	// Skip the gaps before IntI.			SE->useIntv(SegStart, SegStop);
	while (Uses[Gap+1].getBoundaryIndex() < IntI.start())			}
	if (++Gap == NumGaps)
	break;			if (LREdit.empty()) {
	if (Gap == NumGaps)			DEBUG(dbgs() << "All uses were copies.\n");
	break;			return 0;
				}
	// Update the gaps covered by IntI.
	const float weight = IntI.value()->weight;			SmallVector<unsigned, 8> IntvMap;
	for (; Gap != NumGaps; ++Gap) {			SE->finish(&IntvMap);
	GapWeight[Gap] = std::max(GapWeight[Gap], weight);			DebugVars->splitRegister(VirtReg.reg, LREdit.regs(), *LIS);
	if (Uses[Gap+1].getBaseIndex() >= IntI.stop())			ExtraRegInfo.resize(MRI->getNumVirtRegs());
	break;
	}			// Assign all new registers to RS_Spill. This was the last chance.
	if (Gap == NumGaps)			setStage(LREdit.begin(), LREdit.end(), RS_Spill);
	break;			return 0;
	}			}
	}
				//===----------------------------------------------------------------------===//
	// Add fixed interference.			// Local Splitting
	for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {			//===----------------------------------------------------------------------===//
	const LiveRange &LR = LIS->getRegUnit(*Units);
	LiveRange::const_iterator I = LR.find(StartIdx);			/// calcGapWeights - Compute the maximum spill weight that needs to be evicted
	LiveRange::const_iterator E = LR.end();			/// in order to use PhysReg between two entries in SA->UseSlots.
				///
	// Same loop as above. Mark any overlapped gaps as HUGE_VALF.			/// GapWeight[i] represents the gap between UseSlots[i] and UseSlots[i+1].
	for (unsigned Gap = 0; I != E && I->start < StopIdx; ++I) {			///
	while (Uses[Gap+1].getBoundaryIndex() < I->start)			void RAGreedy::calcGapWeights(unsigned PhysReg,
	if (++Gap == NumGaps)			SmallVectorImpl<float> &GapWeight) {
	break;			assert(SA->getUseBlocks().size() == 1 && "Not a local interval");
	if (Gap == NumGaps)			const SplitAnalysis::BlockInfo &BI = SA->getUseBlocks().front();
	break;			ArrayRef<SlotIndex> Uses = SA->getUseSlots();
				const unsigned NumGaps = Uses.size()-1;
	for (; Gap != NumGaps; ++Gap) {
	GapWeight[Gap] = huge_valf;			// Start and end points for the interference check.
	if (Uses[Gap+1].getBaseIndex() >= I->end)			SlotIndex StartIdx =
	break;			BI.LiveIn ? BI.FirstInstr.getBaseIndex() : BI.FirstInstr;
	}			SlotIndex StopIdx =
	if (Gap == NumGaps)			BI.LiveOut ? BI.LastInstr.getBoundaryIndex() : BI.LastInstr;
	break;
	}			GapWeight.assign(NumGaps, 0.0f);
	}
	}			// Add interference from each overlapping register.
				for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
	/// tryLocalSplit - Try to split VirtReg into smaller intervals inside its only			if (!Matrix->query(const_cast<LiveInterval&>(SA->getParent()), *Units)
	/// basic block.			.checkInterference())
	///			continue;
	unsigned RAGreedy::tryLocalSplit(LiveInterval &VirtReg, AllocationOrder &Order,
	SmallVectorImpl<unsigned> &NewVRegs) {			// We know that VirtReg is a continuous interval from FirstInstr to
	assert(SA->getUseBlocks().size() == 1 && "Not a local interval");			// LastInstr, so we don't need InterferenceQuery.
	const SplitAnalysis::BlockInfo &BI = SA->getUseBlocks().front();			//
				// Interference that overlaps an instruction is counted in both gaps
	// Note that it is possible to have an interval that is live-in or live-out			// surrounding the instruction. The exception is interference before
	// while only covering a single block - A phi-def can use undef values from			// StartIdx and after StopIdx.
	// predecessors, and the block could be a single-block loop.			//
	// We don't bother doing anything clever about such a case, we simply assume			LiveIntervalUnion::SegmentIter IntI =
	// that the interval is continuous from FirstInstr to LastInstr. We should			Matrix->getLiveUnions()[*Units] .find(StartIdx);
	// make sure that we don't do anything illegal to such an interval, though.			for (unsigned Gap = 0; IntI.valid() && IntI.start() < StopIdx; ++IntI) {
				// Skip the gaps before IntI.
	ArrayRef<SlotIndex> Uses = SA->getUseSlots();			while (Uses[Gap+1].getBoundaryIndex() < IntI.start())
	if (Uses.size() <= 2)			if (++Gap == NumGaps)
	return 0;			break;
	const unsigned NumGaps = Uses.size()-1;			if (Gap == NumGaps)
				break;
	DEBUG({
	dbgs() << "tryLocalSplit: ";			// Update the gaps covered by IntI.
	for (unsigned i = 0, e = Uses.size(); i != e; ++i)			const float weight = IntI.value()->weight;
	dbgs() << ' ' << Uses[i];			for (; Gap != NumGaps; ++Gap) {
	dbgs() << '\n';			GapWeight[Gap] = std::max(GapWeight[Gap], weight);
	});			if (Uses[Gap+1].getBaseIndex() >= IntI.stop())
				break;
	// If VirtReg is live across any register mask operands, compute a list of			}
	// gaps with register masks.			if (Gap == NumGaps)
	SmallVector<unsigned, 8> RegMaskGaps;			break;
	if (Matrix->checkRegMaskInterference(VirtReg)) {			}
	// Get regmask slots for the whole block.			}
	ArrayRef<SlotIndex> RMS = LIS->getRegMaskSlotsInBlock(BI.MBB->getNumber());
	DEBUG(dbgs() << RMS.size() << " regmasks in block:");			// Add fixed interference.
	// Constrain to VirtReg's live range.			for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
	unsigned ri = std::lower_bound(RMS.begin(), RMS.end(),			const LiveRange &LR = LIS->getRegUnit(*Units);
	Uses.front().getRegSlot()) - RMS.begin();			LiveRange::const_iterator I = LR.find(StartIdx);
	unsigned re = RMS.size();			LiveRange::const_iterator E = LR.end();
	for (unsigned i = 0; i != NumGaps && ri != re; ++i) {
	// Look for Uses[i] <= RMS <= Uses[i+1].			// Same loop as above. Mark any overlapped gaps as HUGE_VALF.
	assert(!SlotIndex::isEarlierInstr(RMS[ri], Uses[i]));			for (unsigned Gap = 0; I != E && I->start < StopIdx; ++I) {
	if (SlotIndex::isEarlierInstr(Uses[i+1], RMS[ri]))			while (Uses[Gap+1].getBoundaryIndex() < I->start)
	continue;			if (++Gap == NumGaps)
	// Skip a regmask on the same instruction as the last use. It doesn't			break;
	// overlap the live range.			if (Gap == NumGaps)
	if (SlotIndex::isSameInstr(Uses[i+1], RMS[ri]) && i+1 == NumGaps)			break;
	break;
	DEBUG(dbgs() << ' ' << RMS[ri] << ':' << Uses[i] << '-' << Uses[i+1]);			for (; Gap != NumGaps; ++Gap) {
	RegMaskGaps.push_back(i);			GapWeight[Gap] = huge_valf;
	// Advance ri to the next gap. A regmask on one of the uses counts in			if (Uses[Gap+1].getBaseIndex() >= I->end)
	// both gaps.			break;
	while (ri != re && SlotIndex::isEarlierInstr(RMS[ri], Uses[i+1]))			}
	++ri;			if (Gap == NumGaps)
	}			break;
	DEBUG(dbgs() << '\n');			}
	}			}
				}
	// Since we allow local split results to be split again, there is a risk of
	// creating infinite loops. It is tempting to require that the new live			/// tryLocalSplit - Try to split VirtReg into smaller intervals inside its only
	// ranges have less instructions than the original. That would guarantee			/// basic block.
	// convergence, but it is too strict. A live range with 3 instructions can be			///
	// split 2+3 (including the COPY), and we want to allow that.			unsigned RAGreedy::tryLocalSplit(LiveInterval &VirtReg, AllocationOrder &Order,
	//			SmallVectorImpl<unsigned> &NewVRegs) {
	// Instead we use these rules:			assert(SA->getUseBlocks().size() == 1 && "Not a local interval");
	//			const SplitAnalysis::BlockInfo &BI = SA->getUseBlocks().front();
	// 1. Allow any split for ranges with getStage() < RS_Split2. (Except for the
	// noop split, of course).			// Note that it is possible to have an interval that is live-in or live-out
	// 2. Require progress be made for ranges with getStage() == RS_Split2. All			// while only covering a single block - A phi-def can use undef values from
	// the new ranges must have fewer instructions than before the split.			// predecessors, and the block could be a single-block loop.
	// 3. New ranges with the same number of instructions are marked RS_Split2,			// We don't bother doing anything clever about such a case, we simply assume
	// smaller ranges are marked RS_New.			// that the interval is continuous from FirstInstr to LastInstr. We should
	//			// make sure that we don't do anything illegal to such an interval, though.
	// These rules allow a 3 -> 2+3 split once, which we need. They also prevent
	// excessive splitting and infinite loops.			ArrayRef<SlotIndex> Uses = SA->getUseSlots();
	//			if (Uses.size() <= 2)
	bool ProgressRequired = getStage(VirtReg) >= RS_Split2;			return 0;
				const unsigned NumGaps = Uses.size()-1;
	// Best split candidate.
	unsigned BestBefore = NumGaps;			DEBUG({
	unsigned BestAfter = 0;			dbgs() << "tryLocalSplit: ";
	float BestDiff = 0;			for (unsigned i = 0, e = Uses.size(); i != e; ++i)
				dbgs() << ' ' << Uses[i];
	const float blockFreq =			dbgs() << '\n';
	SpillPlacer->getBlockFrequency(BI.MBB->getNumber()).getFrequency() *			});
	(1.0f / MBFI->getEntryFreq());
	SmallVector<float, 8> GapWeight;			// If VirtReg is live across any register mask operands, compute a list of
				// gaps with register masks.
	Order.rewind();			SmallVector<unsigned, 8> RegMaskGaps;
	while (unsigned PhysReg = Order.next()) {			if (Matrix->checkRegMaskInterference(VirtReg)) {
	// Keep track of the largest spill weight that would need to be evicted in			// Get regmask slots for the whole block.
	// order to make use of PhysReg between UseSlots[i] and UseSlots[i+1].			ArrayRef<SlotIndex> RMS = LIS->getRegMaskSlotsInBlock(BI.MBB->getNumber());
	calcGapWeights(PhysReg, GapWeight);			DEBUG(dbgs() << RMS.size() << " regmasks in block:");
				// Constrain to VirtReg's live range.
	// Remove any gaps with regmask clobbers.			unsigned ri = std::lower_bound(RMS.begin(), RMS.end(),
	if (Matrix->checkRegMaskInterference(VirtReg, PhysReg))			Uses.front().getRegSlot()) - RMS.begin();
	for (unsigned i = 0, e = RegMaskGaps.size(); i != e; ++i)			unsigned re = RMS.size();
	GapWeight[RegMaskGaps[i]] = huge_valf;			for (unsigned i = 0; i != NumGaps && ri != re; ++i) {
				// Look for Uses[i] <= RMS <= Uses[i+1].
	// Try to find the best sequence of gaps to close.			assert(!SlotIndex::isEarlierInstr(RMS[ri], Uses[i]));
	// The new spill weight must be larger than any gap interference.			if (SlotIndex::isEarlierInstr(Uses[i+1], RMS[ri]))
				continue;
	// We will split before Uses[SplitBefore] and after Uses[SplitAfter].			// Skip a regmask on the same instruction as the last use. It doesn't
	unsigned SplitBefore = 0, SplitAfter = 1;			// overlap the live range.
				if (SlotIndex::isSameInstr(Uses[i+1], RMS[ri]) && i+1 == NumGaps)
	// MaxGap should always be max(GapWeight[SplitBefore..SplitAfter-1]).			break;
	// It is the spill weight that needs to be evicted.			DEBUG(dbgs() << ' ' << RMS[ri] << ':' << Uses[i] << '-' << Uses[i+1]);
	float MaxGap = GapWeight[0];			RegMaskGaps.push_back(i);
				// Advance ri to the next gap. A regmask on one of the uses counts in
	while (true) {			// both gaps.
	// Live before/after split?			while (ri != re && SlotIndex::isEarlierInstr(RMS[ri], Uses[i+1]))
	const bool LiveBefore = SplitBefore != 0 \|\| BI.LiveIn;			++ri;
	const bool LiveAfter = SplitAfter != NumGaps \|\| BI.LiveOut;			}
				DEBUG(dbgs() << '\n');
	DEBUG(dbgs() << printReg(PhysReg, TRI) << ' '			}
	<< Uses[SplitBefore] << '-' << Uses[SplitAfter]
	<< " i=" << MaxGap);			// Since we allow local split results to be split again, there is a risk of
				// creating infinite loops. It is tempting to require that the new live
	// Stop before the interval gets so big we wouldn't be making progress.			// ranges have less instructions than the original. That would guarantee
	if (!LiveBefore && !LiveAfter) {			// convergence, but it is too strict. A live range with 3 instructions can be
	DEBUG(dbgs() << " all\n");			// split 2+3 (including the COPY), and we want to allow that.
	break;			//
	}			// Instead we use these rules:
	// Should the interval be extended or shrunk?			//
	bool Shrink = true;			// 1. Allow any split for ranges with getStage() < RS_Split2. (Except for the
				// noop split, of course).
	// How many gaps would the new range have?			// 2. Require progress be made for ranges with getStage() == RS_Split2. All
	unsigned NewGaps = LiveBefore + SplitAfter - SplitBefore + LiveAfter;			// the new ranges must have fewer instructions than before the split.
				// 3. New ranges with the same number of instructions are marked RS_Split2,
	// Legally, without causing looping?			// smaller ranges are marked RS_New.
	bool Legal = !ProgressRequired \|\| NewGaps < NumGaps;			//
				// These rules allow a 3 -> 2+3 split once, which we need. They also prevent
	if (Legal && MaxGap < huge_valf) {			// excessive splitting and infinite loops.
	// Estimate the new spill weight. Each instruction reads or writes the			//
	// register. Conservatively assume there are no read-modify-write			bool ProgressRequired = getStage(VirtReg) >= RS_Split2;
	// instructions.
	//			// Best split candidate.
	// Try to guess the size of the new interval.			unsigned BestBefore = NumGaps;
	const float EstWeight = normalizeSpillWeight(			unsigned BestAfter = 0;
	blockFreq * (NewGaps + 1),			float BestDiff = 0;
	Uses[SplitBefore].distance(Uses[SplitAfter]) +
	(LiveBefore + LiveAfter) * SlotIndex::InstrDist,			const float blockFreq =
	1);			SpillPlacer->getBlockFrequency(BI.MBB->getNumber()).getFrequency() *
	// Would this split be possible to allocate?			(1.0f / MBFI->getEntryFreq());
	// Never allocate all gaps, we wouldn't be making progress.			SmallVector<float, 8> GapWeight;
	DEBUG(dbgs() << " w=" << EstWeight);
	if (EstWeight * Hysteresis >= MaxGap) {			Order.rewind();
	Shrink = false;			while (unsigned PhysReg = Order.next()) {
	float Diff = EstWeight - MaxGap;			// Keep track of the largest spill weight that would need to be evicted in
	if (Diff > BestDiff) {			// order to make use of PhysReg between UseSlots[i] and UseSlots[i+1].
	DEBUG(dbgs() << " (best)");			calcGapWeights(PhysReg, GapWeight);
	BestDiff = Hysteresis * Diff;
	BestBefore = SplitBefore;			// Remove any gaps with regmask clobbers.
	BestAfter = SplitAfter;			if (Matrix->checkRegMaskInterference(VirtReg, PhysReg))
	}			for (unsigned i = 0, e = RegMaskGaps.size(); i != e; ++i)
	}			GapWeight[RegMaskGaps[i]] = huge_valf;
	}
				// Try to find the best sequence of gaps to close.
	// Try to shrink.			// The new spill weight must be larger than any gap interference.
	if (Shrink) {
	if (++SplitBefore < SplitAfter) {			// We will split before Uses[SplitBefore] and after Uses[SplitAfter].
	DEBUG(dbgs() << " shrink\n");			unsigned SplitBefore = 0, SplitAfter = 1;
	// Recompute the max when necessary.
	if (GapWeight[SplitBefore - 1] >= MaxGap) {			// MaxGap should always be max(GapWeight[SplitBefore..SplitAfter-1]).
	MaxGap = GapWeight[SplitBefore];			// It is the spill weight that needs to be evicted.
	for (unsigned i = SplitBefore + 1; i != SplitAfter; ++i)			float MaxGap = GapWeight[0];
	MaxGap = std::max(MaxGap, GapWeight[i]);
	}			while (true) {
	continue;			// Live before/after split?
	}			const bool LiveBefore = SplitBefore != 0 \|\| BI.LiveIn;
	MaxGap = 0;			const bool LiveAfter = SplitAfter != NumGaps \|\| BI.LiveOut;
	}
				DEBUG(dbgs() << printReg(PhysReg, TRI) << ' '
	// Try to extend the interval.			<< Uses[SplitBefore] << '-' << Uses[SplitAfter]
	if (SplitAfter >= NumGaps) {			<< " i=" << MaxGap);
	DEBUG(dbgs() << " end\n");
	break;			// Stop before the interval gets so big we wouldn't be making progress.
	}			if (!LiveBefore && !LiveAfter) {
				DEBUG(dbgs() << " all\n");
	DEBUG(dbgs() << " extend\n");			break;
	MaxGap = std::max(MaxGap, GapWeight[SplitAfter++]);			}
	}			// Should the interval be extended or shrunk?
	}			bool Shrink = true;

	// Didn't find any candidates?			// How many gaps would the new range have?
	if (BestBefore == NumGaps)			unsigned NewGaps = LiveBefore + SplitAfter - SplitBefore + LiveAfter;
	return 0;
				// Legally, without causing looping?
	DEBUG(dbgs() << "Best local split range: " << Uses[BestBefore]			bool Legal = !ProgressRequired \|\| NewGaps < NumGaps;
	<< '-' << Uses[BestAfter] << ", " << BestDiff
	<< ", " << (BestAfter - BestBefore + 1) << " instrs\n");			if (Legal && MaxGap < huge_valf) {
				// Estimate the new spill weight. Each instruction reads or writes the
	LiveRangeEdit LREdit(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);			// register. Conservatively assume there are no read-modify-write
	SE->reset(LREdit);			// instructions.
				//
	SE->openIntv();			// Try to guess the size of the new interval.
	SlotIndex SegStart = SE->enterIntvBefore(Uses[BestBefore]);			const float EstWeight = normalizeSpillWeight(
	SlotIndex SegStop = SE->leaveIntvAfter(Uses[BestAfter]);			blockFreq * (NewGaps + 1),
	SE->useIntv(SegStart, SegStop);			Uses[SplitBefore].distance(Uses[SplitAfter]) +
	SmallVector<unsigned, 8> IntvMap;			(LiveBefore + LiveAfter) * SlotIndex::InstrDist,
	SE->finish(&IntvMap);			1);
	DebugVars->splitRegister(VirtReg.reg, LREdit.regs(), *LIS);			// Would this split be possible to allocate?
				// Never allocate all gaps, we wouldn't be making progress.
	// If the new range has the same number of instructions as before, mark it as			DEBUG(dbgs() << " w=" << EstWeight);
	// RS_Split2 so the next split will be forced to make progress. Otherwise,			if (EstWeight * Hysteresis >= MaxGap) {
	// leave the new intervals as RS_New so they can compete.			Shrink = false;
	bool LiveBefore = BestBefore != 0 \|\| BI.LiveIn;			float Diff = EstWeight - MaxGap;
	bool LiveAfter = BestAfter != NumGaps \|\| BI.LiveOut;			if (Diff > BestDiff) {
	unsigned NewGaps = LiveBefore + BestAfter - BestBefore + LiveAfter;			DEBUG(dbgs() << " (best)");
	if (NewGaps >= NumGaps) {			BestDiff = Hysteresis * Diff;
	DEBUG(dbgs() << "Tagging non-progress ranges: ");			BestBefore = SplitBefore;
	assert(!ProgressRequired && "Didn't make progress when it was required.");			BestAfter = SplitAfter;
	for (unsigned i = 0, e = IntvMap.size(); i != e; ++i)			}
	if (IntvMap[i] == 1) {			}
	setStage(LIS->getInterval(LREdit.get(i)), RS_Split2);			}
	DEBUG(dbgs() << printReg(LREdit.get(i)));
	}			// Try to shrink.
	DEBUG(dbgs() << '\n');			if (Shrink) {
	}			if (++SplitBefore < SplitAfter) {
	++NumLocalSplits;			DEBUG(dbgs() << " shrink\n");
				// Recompute the max when necessary.
	return 0;			if (GapWeight[SplitBefore - 1] >= MaxGap) {
	}			MaxGap = GapWeight[SplitBefore];
				for (unsigned i = SplitBefore + 1; i != SplitAfter; ++i)
	//===----------------------------------------------------------------------===//			MaxGap = std::max(MaxGap, GapWeight[i]);
	// Live Range Splitting			}
	//===----------------------------------------------------------------------===//			continue;
				}
	/// trySplit - Try to split VirtReg or one of its interferences, making it			MaxGap = 0;
	/// assignable.			}
	/// @return Physreg when VirtReg may be assigned and/or new NewVRegs.
	unsigned RAGreedy::trySplit(LiveInterval &VirtReg, AllocationOrder &Order,			// Try to extend the interval.
	SmallVectorImpl<unsigned>&NewVRegs) {			if (SplitAfter >= NumGaps) {
	// Ranges must be Split2 or less.			DEBUG(dbgs() << " end\n");
	if (getStage(VirtReg) >= RS_Spill)			break;
	return 0;			}

	// Local intervals are handled separately.			DEBUG(dbgs() << " extend\n");
	if (LIS->intervalIsInOneMBB(VirtReg)) {			MaxGap = std::max(MaxGap, GapWeight[SplitAfter++]);
	NamedRegionTimer T("local_split", "Local Splitting", TimerGroupName,			}
	TimerGroupDescription, TimePassesIsEnabled);			}
	SA->analyze(&VirtReg);
	unsigned PhysReg = tryLocalSplit(VirtReg, Order, NewVRegs);			// Didn't find any candidates?
	if (PhysReg \|\| !NewVRegs.empty())			if (BestBefore == NumGaps)
	return PhysReg;			return 0;
	return tryInstructionSplit(VirtReg, Order, NewVRegs);
	}			DEBUG(dbgs() << "Best local split range: " << Uses[BestBefore]
				<< '-' << Uses[BestAfter] << ", " << BestDiff
	NamedRegionTimer T("global_split", "Global Splitting", TimerGroupName,			<< ", " << (BestAfter - BestBefore + 1) << " instrs\n");
	TimerGroupDescription, TimePassesIsEnabled);
				LiveRangeEdit LREdit(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);
	SA->analyze(&VirtReg);			SE->reset(LREdit);

	// FIXME: SplitAnalysis may repair broken live ranges coming from the			SE->openIntv();
	// coalescer. That may cause the range to become allocatable which means that			SlotIndex SegStart = SE->enterIntvBefore(Uses[BestBefore]);
	// tryRegionSplit won't be making progress. This check should be replaced with			SlotIndex SegStop = SE->leaveIntvAfter(Uses[BestAfter]);
	// an assertion when the coalescer is fixed.			SE->useIntv(SegStart, SegStop);
	if (SA->didRepairRange()) {			SmallVector<unsigned, 8> IntvMap;
	// VirtReg has changed, so all cached queries are invalid.			SE->finish(&IntvMap);
	Matrix->invalidateVirtRegs();			DebugVars->splitRegister(VirtReg.reg, LREdit.regs(), *LIS);
	if (unsigned PhysReg = tryAssign(VirtReg, Order, NewVRegs))
	return PhysReg;			// If the new range has the same number of instructions as before, mark it as
	}			// RS_Split2 so the next split will be forced to make progress. Otherwise,
				// leave the new intervals as RS_New so they can compete.
	// First try to split around a region spanning multiple blocks. RS_Split2			bool LiveBefore = BestBefore != 0 \|\| BI.LiveIn;
	// ranges already made dubious progress with region splitting, so they go			bool LiveAfter = BestAfter != NumGaps \|\| BI.LiveOut;
	// straight to single block splitting.			unsigned NewGaps = LiveBefore + BestAfter - BestBefore + LiveAfter;
	if (getStage(VirtReg) < RS_Split2) {			if (NewGaps >= NumGaps) {
	unsigned PhysReg = tryRegionSplit(VirtReg, Order, NewVRegs);			DEBUG(dbgs() << "Tagging non-progress ranges: ");
	if (PhysReg \|\| !NewVRegs.empty())			assert(!ProgressRequired && "Didn't make progress when it was required.");
	return PhysReg;			for (unsigned i = 0, e = IntvMap.size(); i != e; ++i)
	}			if (IntvMap[i] == 1) {
				setStage(LIS->getInterval(LREdit.get(i)), RS_Split2);
	// Then isolate blocks.			DEBUG(dbgs() << printReg(LREdit.get(i)));
	return tryBlockSplit(VirtReg, Order, NewVRegs);			}
	}			DEBUG(dbgs() << '\n');
				}
	//===----------------------------------------------------------------------===//			++NumLocalSplits;
	// Last Chance Recoloring
	//===----------------------------------------------------------------------===//			return 0;
				}
	/// Return true if \p reg has any tied def operand.
	static bool hasTiedDef(MachineRegisterInfo *MRI, unsigned reg) {			//===----------------------------------------------------------------------===//
	for (const MachineOperand &MO : MRI->def_operands(reg))			// Live Range Splitting
	if (MO.isTied())			//===----------------------------------------------------------------------===//
	return true;
				/// trySplit - Try to split VirtReg or one of its interferences, making it
	return false;			/// assignable.
	}			/// @return Physreg when VirtReg may be assigned and/or new NewVRegs.
				unsigned RAGreedy::trySplit(LiveInterval &VirtReg, AllocationOrder &Order,
	/// mayRecolorAllInterferences - Check if the virtual registers that			SmallVectorImpl<unsigned>&NewVRegs) {
	/// interfere with \p VirtReg on \p PhysReg (or one of its aliases) may be			// Ranges must be Split2 or less.
	/// recolored to free \p PhysReg.			if (getStage(VirtReg) >= RS_Spill)
	/// When true is returned, \p RecoloringCandidates has been augmented with all			return 0;
	/// the live intervals that need to be recolored in order to free \p PhysReg
	/// for \p VirtReg.			// Local intervals are handled separately.
	/// \p FixedRegisters contains all the virtual registers that cannot be			if (LIS->intervalIsInOneMBB(VirtReg)) {
	/// recolored.			NamedRegionTimer T("local_split", "Local Splitting", TimerGroupName,
	bool			TimerGroupDescription, TimePassesIsEnabled);
	RAGreedy::mayRecolorAllInterferences(unsigned PhysReg, LiveInterval &VirtReg,			SA->analyze(&VirtReg);
	SmallLISet &RecoloringCandidates,			unsigned PhysReg = tryLocalSplit(VirtReg, Order, NewVRegs);
	const SmallVirtRegSet &FixedRegisters) {			if (PhysReg \|\| !NewVRegs.empty())
	const TargetRegisterClass *CurRC = MRI->getRegClass(VirtReg.reg);			return PhysReg;
				return tryInstructionSplit(VirtReg, Order, NewVRegs);
	for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {			}
	LiveIntervalUnion::Query &Q = Matrix->query(VirtReg, *Units);
	// If there is LastChanceRecoloringMaxInterference or more interferences,			NamedRegionTimer T("global_split", "Global Splitting", TimerGroupName,
	// chances are one would not be recolorable.			TimerGroupDescription, TimePassesIsEnabled);
	if (Q.collectInterferingVRegs(LastChanceRecoloringMaxInterference) >=
	LastChanceRecoloringMaxInterference && !ExhaustiveSearch) {			SA->analyze(&VirtReg);
	DEBUG(dbgs() << "Early abort: too many interferences.\n");
	CutOffInfo \|= CO_Interf;			// FIXME: SplitAnalysis may repair broken live ranges coming from the
	return false;			// coalescer. That may cause the range to become allocatable which means that
	}			// tryRegionSplit won't be making progress. This check should be replaced with
	for (unsigned i = Q.interferingVRegs().size(); i; --i) {			// an assertion when the coalescer is fixed.
	LiveInterval *Intf = Q.interferingVRegs()[i - 1];			if (SA->didRepairRange()) {
	// If Intf is done and sit on the same register class as VirtReg,			// VirtReg has changed, so all cached queries are invalid.
	// it would not be recolorable as it is in the same state as VirtReg.			Matrix->invalidateVirtRegs();
	// However, if VirtReg has tied defs and Intf doesn't, then			if (unsigned PhysReg = tryAssign(VirtReg, Order, NewVRegs))
	// there is still a point in examining if it can be recolorable.			return PhysReg;
	if (((getStage(*Intf) == RS_Done &&			}
	MRI->getRegClass(Intf->reg) == CurRC) &&
	!(hasTiedDef(MRI, VirtReg.reg) && !hasTiedDef(MRI, Intf->reg))) \|\|			// First try to split around a region spanning multiple blocks. RS_Split2
	FixedRegisters.count(Intf->reg)) {			// ranges already made dubious progress with region splitting, so they go
	DEBUG(dbgs() << "Early abort: the interference is not recolorable.\n");			// straight to single block splitting.
	return false;			if (getStage(VirtReg) < RS_Split2) {
	}			unsigned PhysReg = tryRegionSplit(VirtReg, Order, NewVRegs);
	RecoloringCandidates.insert(Intf);			if (PhysReg \|\| !NewVRegs.empty())
	}			return PhysReg;
	}			}
	return true;
	}			// Then isolate blocks.
				return tryBlockSplit(VirtReg, Order, NewVRegs);
	/// tryLastChanceRecoloring - Try to assign a color to \p VirtReg by recoloring			}
	/// its interferences.
	/// Last chance recoloring chooses a color for \p VirtReg and recolors every			//===----------------------------------------------------------------------===//
	/// virtual register that was using it. The recoloring process may recursively			// Last Chance Recoloring
	/// use the last chance recoloring. Therefore, when a virtual register has been			//===----------------------------------------------------------------------===//
	/// assigned a color by this mechanism, it is marked as Fixed, i.e., it cannot
	/// be last-chance-recolored again during this recoloring "session".			/// Return true if \p reg has any tied def operand.
	/// E.g.,			static bool hasTiedDef(MachineRegisterInfo *MRI, unsigned reg) {
	/// Let			for (const MachineOperand &MO : MRI->def_operands(reg))
	/// vA can use {R1, R2 }			if (MO.isTied())
	/// vB can use { R2, R3}			return true;
	/// vC can use {R1 }
	/// Where vA, vB, and vC cannot be split anymore (they are reloads for			return false;
	/// instance) and they all interfere.			}
	///
	/// vA is assigned R1			/// mayRecolorAllInterferences - Check if the virtual registers that
	/// vB is assigned R2			/// interfere with \p VirtReg on \p PhysReg (or one of its aliases) may be
	/// vC tries to evict vA but vA is already done.			/// recolored to free \p PhysReg.
	/// Regular register allocation fails.			/// When true is returned, \p RecoloringCandidates has been augmented with all
	///			/// the live intervals that need to be recolored in order to free \p PhysReg
	/// Last chance recoloring kicks in:			/// for \p VirtReg.
	/// vC does as if vA was evicted => vC uses R1.			/// \p FixedRegisters contains all the virtual registers that cannot be
	/// vC is marked as fixed.			/// recolored.
	/// vA needs to find a color.			bool
	/// None are available.			RAGreedy::mayRecolorAllInterferences(unsigned PhysReg, LiveInterval &VirtReg,
	/// vA cannot evict vC: vC is a fixed virtual register now.			SmallLISet &RecoloringCandidates,
	/// vA does as if vB was evicted => vA uses R2.			const SmallVirtRegSet &FixedRegisters) {
	/// vB needs to find a color.			const TargetRegisterClass *CurRC = MRI->getRegClass(VirtReg.reg);
	/// R3 is available.
	/// Recoloring => vC = R1, vA = R2, vB = R3			for (MCRegUnitIterator Units(PhysReg, TRI); Units.isValid(); ++Units) {
	///			LiveIntervalUnion::Query &Q = Matrix->query(VirtReg, *Units);
	/// \p Order defines the preferred allocation order for \p VirtReg.			// If there is LastChanceRecoloringMaxInterference or more interferences,
	/// \p NewRegs will contain any new virtual register that have been created			// chances are one would not be recolorable.
	/// (split, spill) during the process and that must be assigned.			if (Q.collectInterferingVRegs(LastChanceRecoloringMaxInterference) >=
	/// \p FixedRegisters contains all the virtual registers that cannot be			LastChanceRecoloringMaxInterference && !ExhaustiveSearch) {
	/// recolored.			DEBUG(dbgs() << "Early abort: too many interferences.\n");
	/// \p Depth gives the current depth of the last chance recoloring.			CutOffInfo \|= CO_Interf;
	/// \return a physical register that can be used for VirtReg or ~0u if none			return false;
	/// exists.			}
	unsigned RAGreedy::tryLastChanceRecoloring(LiveInterval &VirtReg,			for (unsigned i = Q.interferingVRegs().size(); i; --i) {
	AllocationOrder &Order,			LiveInterval *Intf = Q.interferingVRegs()[i - 1];
	SmallVectorImpl<unsigned> &NewVRegs,			// If Intf is done and sit on the same register class as VirtReg,
	SmallVirtRegSet &FixedRegisters,			// it would not be recolorable as it is in the same state as VirtReg.
	unsigned Depth) {			// However, if VirtReg has tied defs and Intf doesn't, then
	DEBUG(dbgs() << "Try last chance recoloring for " << VirtReg << '\n');			// there is still a point in examining if it can be recolorable.
	// Ranges must be Done.			if (((getStage(*Intf) == RS_Done &&
	assert((getStage(VirtReg) >= RS_Done \|\| !VirtReg.isSpillable()) &&			MRI->getRegClass(Intf->reg) == CurRC) &&
	"Last chance recoloring should really be last chance");			!(hasTiedDef(MRI, VirtReg.reg) && !hasTiedDef(MRI, Intf->reg))) \|\|
	// Set the max depth to LastChanceRecoloringMaxDepth.			FixedRegisters.count(Intf->reg)) {
	// We may want to reconsider that if we end up with a too large search space			DEBUG(dbgs() << "Early abort: the interference is not recolorable.\n");
	// for target with hundreds of registers.			return false;
	// Indeed, in that case we may want to cut the search space earlier.			}
	if (Depth >= LastChanceRecoloringMaxDepth && !ExhaustiveSearch) {			RecoloringCandidates.insert(Intf);
	DEBUG(dbgs() << "Abort because max depth has been reached.\n");			}
	CutOffInfo \|= CO_Depth;			}
	return ~0u;			return true;
	}			}

	// Set of Live intervals that will need to be recolored.			/// tryLastChanceRecoloring - Try to assign a color to \p VirtReg by recoloring
	SmallLISet RecoloringCandidates;			/// its interferences.
	// Record the original mapping virtual register to physical register in case			/// Last chance recoloring chooses a color for \p VirtReg and recolors every
	// the recoloring fails.			/// virtual register that was using it. The recoloring process may recursively
	DenseMap<unsigned, unsigned> VirtRegToPhysReg;			/// use the last chance recoloring. Therefore, when a virtual register has been
	// Mark VirtReg as fixed, i.e., it will not be recolored pass this point in			/// assigned a color by this mechanism, it is marked as Fixed, i.e., it cannot
	// this recoloring "session".			/// be last-chance-recolored again during this recoloring "session".
	FixedRegisters.insert(VirtReg.reg);			/// E.g.,
	SmallVector<unsigned, 4> CurrentNewVRegs;			/// Let
				/// vA can use {R1, R2 }
	Order.rewind();			/// vB can use { R2, R3}
	while (unsigned PhysReg = Order.next()) {			/// vC can use {R1 }
	DEBUG(dbgs() << "Try to assign: " << VirtReg << " to "			/// Where vA, vB, and vC cannot be split anymore (they are reloads for
	<< printReg(PhysReg, TRI) << '\n');			/// instance) and they all interfere.
	RecoloringCandidates.clear();			///
	VirtRegToPhysReg.clear();			/// vA is assigned R1
	CurrentNewVRegs.clear();			/// vB is assigned R2
				/// vC tries to evict vA but vA is already done.
	// It is only possible to recolor virtual register interference.			/// Regular register allocation fails.
	if (Matrix->checkInterference(VirtReg, PhysReg) >			///
	LiveRegMatrix::IK_VirtReg) {			/// Last chance recoloring kicks in:
	DEBUG(dbgs() << "Some interferences are not with virtual registers.\n");			/// vC does as if vA was evicted => vC uses R1.
				/// vC is marked as fixed.
	continue;			/// vA needs to find a color.
	}			/// None are available.
				/// vA cannot evict vC: vC is a fixed virtual register now.
	// Early give up on this PhysReg if it is obvious we cannot recolor all			/// vA does as if vB was evicted => vA uses R2.
	// the interferences.			/// vB needs to find a color.
	if (!mayRecolorAllInterferences(PhysReg, VirtReg, RecoloringCandidates,			/// R3 is available.
	FixedRegisters)) {			/// Recoloring => vC = R1, vA = R2, vB = R3
	DEBUG(dbgs() << "Some interferences cannot be recolored.\n");			///
	continue;			/// \p Order defines the preferred allocation order for \p VirtReg.
	}			/// \p NewRegs will contain any new virtual register that have been created
				/// (split, spill) during the process and that must be assigned.
	// RecoloringCandidates contains all the virtual registers that interfer			/// \p FixedRegisters contains all the virtual registers that cannot be
	// with VirtReg on PhysReg (or one of its aliases).			/// recolored.
	// Enqueue them for recoloring and perform the actual recoloring.			/// \p Depth gives the current depth of the last chance recoloring.
	PQueue RecoloringQueue;			/// \return a physical register that can be used for VirtReg or ~0u if none
	for (SmallLISet::iterator It = RecoloringCandidates.begin(),			/// exists.
	EndIt = RecoloringCandidates.end();			unsigned RAGreedy::tryLastChanceRecoloring(LiveInterval &VirtReg,
	It != EndIt; ++It) {			AllocationOrder &Order,
	unsigned ItVirtReg = (*It)->reg;			SmallVectorImpl<unsigned> &NewVRegs,
	enqueue(RecoloringQueue, *It);			SmallVirtRegSet &FixedRegisters,
	assert(VRM->hasPhys(ItVirtReg) &&			unsigned Depth) {
	"Interferences are supposed to be with allocated vairables");			DEBUG(dbgs() << "Try last chance recoloring for " << VirtReg << '\n');
				// Ranges must be Done.
	// Record the current allocation.			assert((getStage(VirtReg) >= RS_Done \|\| !VirtReg.isSpillable()) &&
	VirtRegToPhysReg[ItVirtReg] = VRM->getPhys(ItVirtReg);			"Last chance recoloring should really be last chance");
	// unset the related struct.			// Set the max depth to LastChanceRecoloringMaxDepth.
	Matrix->unassign(**It);			// We may want to reconsider that if we end up with a too large search space
	}			// for target with hundreds of registers.
				// Indeed, in that case we may want to cut the search space earlier.
	// Do as if VirtReg was assigned to PhysReg so that the underlying			if (Depth >= LastChanceRecoloringMaxDepth && !ExhaustiveSearch) {
	// recoloring has the right information about the interferes and			DEBUG(dbgs() << "Abort because max depth has been reached.\n");
	// available colors.			CutOffInfo \|= CO_Depth;
	Matrix->assign(VirtReg, PhysReg);			return ~0u;
				}
	// Save the current recoloring state.
	// If we cannot recolor all the interferences, we will have to start again			// Set of Live intervals that will need to be recolored.
	// at this point for the next physical register.			SmallLISet RecoloringCandidates;
	SmallVirtRegSet SaveFixedRegisters(FixedRegisters);			// Record the original mapping virtual register to physical register in case
	if (tryRecoloringCandidates(RecoloringQueue, CurrentNewVRegs,			// the recoloring fails.
	FixedRegisters, Depth)) {			DenseMap<unsigned, unsigned> VirtRegToPhysReg;
	// Push the queued vregs into the main queue.			// Mark VirtReg as fixed, i.e., it will not be recolored pass this point in
	for (unsigned NewVReg : CurrentNewVRegs)			// this recoloring "session".
	NewVRegs.push_back(NewVReg);			FixedRegisters.insert(VirtReg.reg);
	// Do not mess up with the global assignment process.			SmallVector<unsigned, 4> CurrentNewVRegs;
	// I.e., VirtReg must be unassigned.
	Matrix->unassign(VirtReg);			Order.rewind();
	return PhysReg;			while (unsigned PhysReg = Order.next()) {
	}			DEBUG(dbgs() << "Try to assign: " << VirtReg << " to "
				<< printReg(PhysReg, TRI) << '\n');
	DEBUG(dbgs() << "Fail to assign: " << VirtReg << " to "			RecoloringCandidates.clear();
	<< printReg(PhysReg, TRI) << '\n');			VirtRegToPhysReg.clear();
				CurrentNewVRegs.clear();
	// The recoloring attempt failed, undo the changes.
	FixedRegisters = SaveFixedRegisters;			// It is only possible to recolor virtual register interference.
	Matrix->unassign(VirtReg);			if (Matrix->checkInterference(VirtReg, PhysReg) >
				LiveRegMatrix::IK_VirtReg) {
	// For a newly created vreg which is also in RecoloringCandidates,			DEBUG(dbgs() << "Some interferences are not with virtual registers.\n");
	// don't add it to NewVRegs because its physical register will be restored
	// below. Other vregs in CurrentNewVRegs are created by calling			continue;
	// selectOrSplit and should be added into NewVRegs.			}
	for (SmallVectorImpl<unsigned>::iterator Next = CurrentNewVRegs.begin(),
	End = CurrentNewVRegs.end();			// Early give up on this PhysReg if it is obvious we cannot recolor all
	Next != End; ++Next) {			// the interferences.
	if (RecoloringCandidates.count(&LIS->getInterval(*Next)))			if (!mayRecolorAllInterferences(PhysReg, VirtReg, RecoloringCandidates,
	continue;			FixedRegisters)) {
	NewVRegs.push_back(*Next);			DEBUG(dbgs() << "Some interferences cannot be recolored.\n");
	}			continue;
				}
	for (SmallLISet::iterator It = RecoloringCandidates.begin(),
	EndIt = RecoloringCandidates.end();			// RecoloringCandidates contains all the virtual registers that interfer
	It != EndIt; ++It) {			// with VirtReg on PhysReg (or one of its aliases).
	unsigned ItVirtReg = (*It)->reg;			// Enqueue them for recoloring and perform the actual recoloring.
	if (VRM->hasPhys(ItVirtReg))			PQueue RecoloringQueue;
	Matrix->unassign(**It);			for (SmallLISet::iterator It = RecoloringCandidates.begin(),
	unsigned ItPhysReg = VirtRegToPhysReg[ItVirtReg];			EndIt = RecoloringCandidates.end();
	Matrix->assign(**It, ItPhysReg);			It != EndIt; ++It) {
	}			unsigned ItVirtReg = (*It)->reg;
	}			enqueue(RecoloringQueue, *It);
				assert(VRM->hasPhys(ItVirtReg) &&
	// Last chance recoloring did not worked either, give up.			"Interferences are supposed to be with allocated vairables");
	return ~0u;
	}			// Record the current allocation.
				VirtRegToPhysReg[ItVirtReg] = VRM->getPhys(ItVirtReg);
	/// tryRecoloringCandidates - Try to assign a new color to every register			// unset the related struct.
	/// in \RecoloringQueue.			Matrix->unassign(**It);
	/// \p NewRegs will contain any new virtual register created during the			}
	/// recoloring process.
	/// \p FixedRegisters[in/out] contains all the registers that have been			// Do as if VirtReg was assigned to PhysReg so that the underlying
	/// recolored.			// recoloring has the right information about the interferes and
	/// \return true if all virtual registers in RecoloringQueue were successfully			// available colors.
	/// recolored, false otherwise.			Matrix->assign(VirtReg, PhysReg);
	bool RAGreedy::tryRecoloringCandidates(PQueue &RecoloringQueue,
	SmallVectorImpl<unsigned> &NewVRegs,			// Save the current recoloring state.
	SmallVirtRegSet &FixedRegisters,			// If we cannot recolor all the interferences, we will have to start again
	unsigned Depth) {			// at this point for the next physical register.
	while (!RecoloringQueue.empty()) {			SmallVirtRegSet SaveFixedRegisters(FixedRegisters);
	LiveInterval *LI = dequeue(RecoloringQueue);			if (tryRecoloringCandidates(RecoloringQueue, CurrentNewVRegs,
	DEBUG(dbgs() << "Try to recolor: " << *LI << '\n');			FixedRegisters, Depth)) {
	unsigned PhysReg;			// Push the queued vregs into the main queue.
	PhysReg = selectOrSplitImpl(*LI, NewVRegs, FixedRegisters, Depth + 1);			for (unsigned NewVReg : CurrentNewVRegs)
	// When splitting happens, the live-range may actually be empty.			NewVRegs.push_back(NewVReg);
	// In that case, this is okay to continue the recoloring even			// Do not mess up with the global assignment process.
	// if we did not find an alternative color for it. Indeed,			// I.e., VirtReg must be unassigned.
	// there will not be anything to color for LI in the end.			Matrix->unassign(VirtReg);
	if (PhysReg == ~0u \|\| (!PhysReg && !LI->empty()))			return PhysReg;
	return false;			}

	if (!PhysReg) {			DEBUG(dbgs() << "Fail to assign: " << VirtReg << " to "
	assert(LI->empty() && "Only empty live-range do not require a register");			<< printReg(PhysReg, TRI) << '\n');
	DEBUG(dbgs() << "Recoloring of " << *LI << " succeeded. Empty LI.\n");
	continue;			// The recoloring attempt failed, undo the changes.
	}			FixedRegisters = SaveFixedRegisters;
	DEBUG(dbgs() << "Recoloring of " << *LI			Matrix->unassign(VirtReg);
	<< " succeeded with: " << printReg(PhysReg, TRI) << '\n');
				// For a newly created vreg which is also in RecoloringCandidates,
	Matrix->assign(*LI, PhysReg);			// don't add it to NewVRegs because its physical register will be restored
	FixedRegisters.insert(LI->reg);			// below. Other vregs in CurrentNewVRegs are created by calling
	}			// selectOrSplit and should be added into NewVRegs.
	return true;			for (SmallVectorImpl<unsigned>::iterator Next = CurrentNewVRegs.begin(),
	}			End = CurrentNewVRegs.end();
				Next != End; ++Next) {
	//===----------------------------------------------------------------------===//			if (RecoloringCandidates.count(&LIS->getInterval(*Next)))
	// Main Entry Point			continue;
	//===----------------------------------------------------------------------===//			NewVRegs.push_back(*Next);
				}
	unsigned RAGreedy::selectOrSplit(LiveInterval &VirtReg,
	SmallVectorImpl<unsigned> &NewVRegs) {			for (SmallLISet::iterator It = RecoloringCandidates.begin(),
	CutOffInfo = CO_None;			EndIt = RecoloringCandidates.end();
	LLVMContext &Ctx = MF->getFunction().getContext();			It != EndIt; ++It) {
	SmallVirtRegSet FixedRegisters;			unsigned ItVirtReg = (*It)->reg;
	unsigned Reg = selectOrSplitImpl(VirtReg, NewVRegs, FixedRegisters);			if (VRM->hasPhys(ItVirtReg))
	if (Reg == ~0U && (CutOffInfo != CO_None)) {			Matrix->unassign(**It);
	uint8_t CutOffEncountered = CutOffInfo & (CO_Depth \| CO_Interf);			unsigned ItPhysReg = VirtRegToPhysReg[ItVirtReg];
	if (CutOffEncountered == CO_Depth)			Matrix->assign(**It, ItPhysReg);
	Ctx.emitError("register allocation failed: maximum depth for recoloring "			}
	"reached. Use -fexhaustive-register-search to skip "			}
	"cutoffs");
	else if (CutOffEncountered == CO_Interf)			// Last chance recoloring did not worked either, give up.
	Ctx.emitError("register allocation failed: maximum interference for "			return ~0u;
	"recoloring reached. Use -fexhaustive-register-search "			}
	"to skip cutoffs");
	else if (CutOffEncountered == (CO_Depth \| CO_Interf))			/// tryRecoloringCandidates - Try to assign a new color to every register
	Ctx.emitError("register allocation failed: maximum interference and "			/// in \RecoloringQueue.
	"depth for recoloring reached. Use "			/// \p NewRegs will contain any new virtual register created during the
	"-fexhaustive-register-search to skip cutoffs");			/// recoloring process.
	}			/// \p FixedRegisters[in/out] contains all the registers that have been
	return Reg;			/// recolored.
	}			/// \return true if all virtual registers in RecoloringQueue were successfully
				/// recolored, false otherwise.
	/// Using a CSR for the first time has a cost because it causes push\|pop			bool RAGreedy::tryRecoloringCandidates(PQueue &RecoloringQueue,
	/// to be added to prologue\|epilogue. Splitting a cold section of the live			SmallVectorImpl<unsigned> &NewVRegs,
	/// range can have lower cost than using the CSR for the first time;			SmallVirtRegSet &FixedRegisters,
	/// Spilling a live range in the cold path can have lower cost than using			unsigned Depth) {
	/// the CSR for the first time. Returns the physical register if we decide			while (!RecoloringQueue.empty()) {
	/// to use the CSR; otherwise return 0.			LiveInterval *LI = dequeue(RecoloringQueue);
	unsigned RAGreedy::tryAssignCSRFirstTime(LiveInterval &VirtReg,			DEBUG(dbgs() << "Try to recolor: " << *LI << '\n');
	AllocationOrder &Order,			unsigned PhysReg;
	unsigned PhysReg,			PhysReg = selectOrSplitImpl(*LI, NewVRegs, FixedRegisters, Depth + 1);
	unsigned &CostPerUseLimit,			// When splitting happens, the live-range may actually be empty.
	SmallVectorImpl<unsigned> &NewVRegs) {			// In that case, this is okay to continue the recoloring even
	if (getStage(VirtReg) == RS_Spill && VirtReg.isSpillable()) {			// if we did not find an alternative color for it. Indeed,
	// We choose spill over using the CSR for the first time if the spill cost			// there will not be anything to color for LI in the end.
	// is lower than CSRCost.			if (PhysReg == ~0u \|\| (!PhysReg && !LI->empty()))
	SA->analyze(&VirtReg);			return false;
	if (calcSpillCost() >= CSRCost)
	return PhysReg;			if (!PhysReg) {
				assert(LI->empty() && "Only empty live-range do not require a register");
	// We are going to spill, set CostPerUseLimit to 1 to make sure that			DEBUG(dbgs() << "Recoloring of " << *LI << " succeeded. Empty LI.\n");
	// we will not use a callee-saved register in tryEvict.			continue;
	CostPerUseLimit = 1;			}
	return 0;			DEBUG(dbgs() << "Recoloring of " << *LI
	}			<< " succeeded with: " << printReg(PhysReg, TRI) << '\n');
	if (getStage(VirtReg) < RS_Split) {
	// We choose pre-splitting over using the CSR for the first time if			Matrix->assign(*LI, PhysReg);
	// the cost of splitting is lower than CSRCost.			FixedRegisters.insert(LI->reg);
	SA->analyze(&VirtReg);			}
	unsigned NumCands = 0;			return true;
	BlockFrequency BestCost = CSRCost; // Don't modify CSRCost.			}
	unsigned BestCand = calculateRegionSplitCost(VirtReg, Order, BestCost,
	NumCands, true /IgnoreCSR/);			//===----------------------------------------------------------------------===//
	if (BestCand == NoCand)			// Main Entry Point
	// Use the CSR if we can't find a region split below CSRCost.			//===----------------------------------------------------------------------===//
	return PhysReg;
				unsigned RAGreedy::selectOrSplit(LiveInterval &VirtReg,
	// Perform the actual pre-splitting.			SmallVectorImpl<unsigned> &NewVRegs) {
	doRegionSplit(VirtReg, BestCand, false/HasCompact/, NewVRegs);			CutOffInfo = CO_None;
	return 0;			LLVMContext &Ctx = MF->getFunction().getContext();
	}			SmallVirtRegSet FixedRegisters;
	return PhysReg;			unsigned Reg = selectOrSplitImpl(VirtReg, NewVRegs, FixedRegisters);
	}			if (Reg == ~0U && (CutOffInfo != CO_None)) {
				uint8_t CutOffEncountered = CutOffInfo & (CO_Depth \| CO_Interf);
	void RAGreedy::aboutToRemoveInterval(LiveInterval &LI) {			if (CutOffEncountered == CO_Depth)
	// Do not keep invalid information around.			Ctx.emitError("register allocation failed: maximum depth for recoloring "
	SetOfBrokenHints.remove(&LI);			"reached. Use -fexhaustive-register-search to skip "
	}			"cutoffs");
				else if (CutOffEncountered == CO_Interf)
	void RAGreedy::initializeCSRCost() {			Ctx.emitError("register allocation failed: maximum interference for "
	// We use the larger one out of the command-line option and the value report			"recoloring reached. Use -fexhaustive-register-search "
	// by TRI.			"to skip cutoffs");
	CSRCost = BlockFrequency(			else if (CutOffEncountered == (CO_Depth \| CO_Interf))
	std::max((unsigned)CSRFirstTimeCost, TRI->getCSRFirstUseCost()));			Ctx.emitError("register allocation failed: maximum interference and "
	if (!CSRCost.getFrequency())			"depth for recoloring reached. Use "
	return;			"-fexhaustive-register-search to skip cutoffs");
				}
	// Raw cost is relative to Entry == 2^14; scale it appropriately.			return Reg;
	uint64_t ActualEntry = MBFI->getEntryFreq();			}
	if (!ActualEntry) {
	CSRCost = 0;			/// Using a CSR for the first time has a cost because it causes push\|pop
	return;			/// to be added to prologue\|epilogue. Splitting a cold section of the live
	}			/// range can have lower cost than using the CSR for the first time;
	uint64_t FixedEntry = 1 << 14;			/// Spilling a live range in the cold path can have lower cost than using
	if (ActualEntry < FixedEntry)			/// the CSR for the first time. Returns the physical register if we decide
	CSRCost *= BranchProbability(ActualEntry, FixedEntry);			/// to use the CSR; otherwise return 0.
	else if (ActualEntry <= UINT32_MAX)			unsigned RAGreedy::tryAssignCSRFirstTime(LiveInterval &VirtReg,
	// Invert the fraction and divide.			AllocationOrder &Order,
	CSRCost /= BranchProbability(FixedEntry, ActualEntry);			unsigned PhysReg,
	else			unsigned &CostPerUseLimit,
	// Can't use BranchProbability in general, since it takes 32-bit numbers.			SmallVectorImpl<unsigned> &NewVRegs) {
	CSRCost = CSRCost.getFrequency() * (ActualEntry / FixedEntry);			if (getStage(VirtReg) == RS_Spill && VirtReg.isSpillable()) {
	}			// We choose spill over using the CSR for the first time if the spill cost
				// is lower than CSRCost.
	/// \brief Collect the hint info for \p Reg.			SA->analyze(&VirtReg);
	/// The results are stored into \p Out.			if (calcSpillCost() >= CSRCost)
	/// \p Out is not cleared before being populated.			return PhysReg;
	void RAGreedy::collectHintInfo(unsigned Reg, HintsInfo &Out) {
	for (const MachineInstr &Instr : MRI->reg_nodbg_instructions(Reg)) {			// We are going to spill, set CostPerUseLimit to 1 to make sure that
	if (!Instr.isFullCopy())			// we will not use a callee-saved register in tryEvict.
	continue;			CostPerUseLimit = 1;
	// Look for the other end of the copy.			return 0;
	unsigned OtherReg = Instr.getOperand(0).getReg();			}
	if (OtherReg == Reg) {			if (getStage(VirtReg) < RS_Split) {
	OtherReg = Instr.getOperand(1).getReg();			// We choose pre-splitting over using the CSR for the first time if
	if (OtherReg == Reg)			// the cost of splitting is lower than CSRCost.
	continue;			SA->analyze(&VirtReg);
	}			unsigned NumCands = 0;
	// Get the current assignment.			BlockFrequency BestCost = CSRCost; // Don't modify CSRCost.
	unsigned OtherPhysReg = TargetRegisterInfo::isPhysicalRegister(OtherReg)			unsigned BestCand = calculateRegionSplitCost(VirtReg, Order, BestCost,
	? OtherReg			NumCands, true /IgnoreCSR/);
	: VRM->getPhys(OtherReg);			if (BestCand == NoCand)
	// Push the collected information.			// Use the CSR if we can't find a region split below CSRCost.
	Out.push_back(HintInfo(MBFI->getBlockFreq(Instr.getParent()), OtherReg,			return PhysReg;
	OtherPhysReg));
	}			// Perform the actual pre-splitting.
	}			doRegionSplit(VirtReg, BestCand, false/HasCompact/, NewVRegs);
				return 0;
	/// \brief Using the given \p List, compute the cost of the broken hints if			}
	/// \p PhysReg was used.			return PhysReg;
	/// \return The cost of \p List for \p PhysReg.			}
	BlockFrequency RAGreedy::getBrokenHintFreq(const HintsInfo &List,
	unsigned PhysReg) {			void RAGreedy::aboutToRemoveInterval(LiveInterval &LI) {
	BlockFrequency Cost = 0;			// Do not keep invalid information around.
	for (const HintInfo &Info : List) {			SetOfBrokenHints.remove(&LI);
	if (Info.PhysReg != PhysReg)			}
	Cost += Info.Freq;
	}			void RAGreedy::initializeCSRCost() {
	return Cost;			// We use the larger one out of the command-line option and the value report
	}			// by TRI.
				CSRCost = BlockFrequency(
	/// \brief Using the register assigned to \p VirtReg, try to recolor			std::max((unsigned)CSRFirstTimeCost, TRI->getCSRFirstUseCost()));
	/// all the live ranges that are copy-related with \p VirtReg.			if (!CSRCost.getFrequency())
	/// The recoloring is then propagated to all the live-ranges that have			return;
	/// been recolored and so on, until no more copies can be coalesced or
	/// it is not profitable.			// Raw cost is relative to Entry == 2^14; scale it appropriately.
	/// For a given live range, profitability is determined by the sum of the			uint64_t ActualEntry = MBFI->getEntryFreq();
	/// frequencies of the non-identity copies it would introduce with the old			if (!ActualEntry) {
	/// and new register.			CSRCost = 0;
	void RAGreedy::tryHintRecoloring(LiveInterval &VirtReg) {			return;
	// We have a broken hint, check if it is possible to fix it by			}
	// reusing PhysReg for the copy-related live-ranges. Indeed, we evicted			uint64_t FixedEntry = 1 << 14;
	// some register and PhysReg may be available for the other live-ranges.			if (ActualEntry < FixedEntry)
	SmallSet<unsigned, 4> Visited;			CSRCost *= BranchProbability(ActualEntry, FixedEntry);
	SmallVector<unsigned, 2> RecoloringCandidates;			else if (ActualEntry <= UINT32_MAX)
	HintsInfo Info;			// Invert the fraction and divide.
	unsigned Reg = VirtReg.reg;			CSRCost /= BranchProbability(FixedEntry, ActualEntry);
	unsigned PhysReg = VRM->getPhys(Reg);			else
	// Start the recoloring algorithm from the input live-interval, then			// Can't use BranchProbability in general, since it takes 32-bit numbers.
	// it will propagate to the ones that are copy-related with it.			CSRCost = CSRCost.getFrequency() * (ActualEntry / FixedEntry);
	Visited.insert(Reg);			}
	RecoloringCandidates.push_back(Reg);
				/// \brief Collect the hint info for \p Reg.
	DEBUG(dbgs() << "Trying to reconcile hints for: " << printReg(Reg, TRI) << '('			/// The results are stored into \p Out.
	<< printReg(PhysReg, TRI) << ")\n");			/// \p Out is not cleared before being populated.
				void RAGreedy::collectHintInfo(unsigned Reg, HintsInfo &Out) {
	do {			for (const MachineInstr &Instr : MRI->reg_nodbg_instructions(Reg)) {
	Reg = RecoloringCandidates.pop_back_val();			if (!Instr.isFullCopy())
				continue;
	// We cannot recolor physical register.			// Look for the other end of the copy.
	if (TargetRegisterInfo::isPhysicalRegister(Reg))			unsigned OtherReg = Instr.getOperand(0).getReg();
	continue;			if (OtherReg == Reg) {
				OtherReg = Instr.getOperand(1).getReg();
	assert(VRM->hasPhys(Reg) && "We have unallocated variable!!");			if (OtherReg == Reg)
				continue;
	// Get the live interval mapped with this virtual register to be able			}
	// to check for the interference with the new color.			// Get the current assignment.
	LiveInterval &LI = LIS->getInterval(Reg);			unsigned OtherPhysReg = TargetRegisterInfo::isPhysicalRegister(OtherReg)
	unsigned CurrPhys = VRM->getPhys(Reg);			? OtherReg
	// Check that the new color matches the register class constraints and			: VRM->getPhys(OtherReg);
	// that it is free for this live range.			// Push the collected information.
	if (CurrPhys != PhysReg && (!MRI->getRegClass(Reg)->contains(PhysReg) \|\|			Out.push_back(HintInfo(MBFI->getBlockFreq(Instr.getParent()), OtherReg,
	Matrix->checkInterference(LI, PhysReg)))			OtherPhysReg));
	continue;			}
				}
	DEBUG(dbgs() << printReg(Reg, TRI) << '(' << printReg(CurrPhys, TRI)
	<< ") is recolorable.\n");			/// \brief Using the given \p List, compute the cost of the broken hints if
				/// \p PhysReg was used.
	// Gather the hint info.			/// \return The cost of \p List for \p PhysReg.
	Info.clear();			BlockFrequency RAGreedy::getBrokenHintFreq(const HintsInfo &List,
	collectHintInfo(Reg, Info);			unsigned PhysReg) {
	// Check if recoloring the live-range will increase the cost of the			BlockFrequency Cost = 0;
	// non-identity copies.			for (const HintInfo &Info : List) {
	if (CurrPhys != PhysReg) {			if (Info.PhysReg != PhysReg)
	DEBUG(dbgs() << "Checking profitability:\n");			Cost += Info.Freq;
	BlockFrequency OldCopiesCost = getBrokenHintFreq(Info, CurrPhys);			}
	BlockFrequency NewCopiesCost = getBrokenHintFreq(Info, PhysReg);			return Cost;
	DEBUG(dbgs() << "Old Cost: " << OldCopiesCost.getFrequency()			}
	<< "\nNew Cost: " << NewCopiesCost.getFrequency() << '\n');
	if (OldCopiesCost < NewCopiesCost) {			/// \brief Using the register assigned to \p VirtReg, try to recolor
	DEBUG(dbgs() << "=> Not profitable.\n");			/// all the live ranges that are copy-related with \p VirtReg.
	continue;			/// The recoloring is then propagated to all the live-ranges that have
	}			/// been recolored and so on, until no more copies can be coalesced or
	// At this point, the cost is either cheaper or equal. If it is			/// it is not profitable.
	// equal, we consider this is profitable because it may expose			/// For a given live range, profitability is determined by the sum of the
	// more recoloring opportunities.			/// frequencies of the non-identity copies it would introduce with the old
	DEBUG(dbgs() << "=> Profitable.\n");			/// and new register.
	// Recolor the live-range.			void RAGreedy::tryHintRecoloring(LiveInterval &VirtReg) {
	Matrix->unassign(LI);			// We have a broken hint, check if it is possible to fix it by
	Matrix->assign(LI, PhysReg);			// reusing PhysReg for the copy-related live-ranges. Indeed, we evicted
	}			// some register and PhysReg may be available for the other live-ranges.
	// Push all copy-related live-ranges to keep reconciling the broken			SmallSet<unsigned, 4> Visited;
	// hints.			SmallVector<unsigned, 2> RecoloringCandidates;
	for (const HintInfo &HI : Info) {			HintsInfo Info;
	if (Visited.insert(HI.Reg).second)			unsigned Reg = VirtReg.reg;
	RecoloringCandidates.push_back(HI.Reg);			unsigned PhysReg = VRM->getPhys(Reg);
	}			// Start the recoloring algorithm from the input live-interval, then
	} while (!RecoloringCandidates.empty());			// it will propagate to the ones that are copy-related with it.
	}			Visited.insert(Reg);
				RecoloringCandidates.push_back(Reg);
	/// \brief Try to recolor broken hints.
	/// Broken hints may be repaired by recoloring when an evicted variable			DEBUG(dbgs() << "Trying to reconcile hints for: " << printReg(Reg, TRI) << '('
	/// freed up a register for a larger live-range.			<< printReg(PhysReg, TRI) << ")\n");
	/// Consider the following example:
	/// BB1:			do {
	/// a =			Reg = RecoloringCandidates.pop_back_val();
	/// b =
	/// BB2:			// We cannot recolor physical register.
	/// ...			if (TargetRegisterInfo::isPhysicalRegister(Reg))
	/// = b			continue;
	/// = a
	/// Let us assume b gets split:			assert(VRM->hasPhys(Reg) && "We have unallocated variable!!");
	/// BB1:
	/// a =			// Get the live interval mapped with this virtual register to be able
	/// b =			// to check for the interference with the new color.
	/// BB2:			LiveInterval &LI = LIS->getInterval(Reg);
	/// c = b			unsigned CurrPhys = VRM->getPhys(Reg);
	/// ...			// Check that the new color matches the register class constraints and
	/// d = c			// that it is free for this live range.
	/// = d			if (CurrPhys != PhysReg && (!MRI->getRegClass(Reg)->contains(PhysReg) \|\|
	/// = a			Matrix->checkInterference(LI, PhysReg)))
	/// Because of how the allocation work, b, c, and d may be assigned different			continue;
	/// colors. Now, if a gets evicted later:
	/// BB1:			DEBUG(dbgs() << printReg(Reg, TRI) << '(' << printReg(CurrPhys, TRI)
	/// a =			<< ") is recolorable.\n");
	/// st a, SpillSlot
	/// b =			// Gather the hint info.
	/// BB2:			Info.clear();
	/// c = b			collectHintInfo(Reg, Info);
	/// ...			// Check if recoloring the live-range will increase the cost of the
	/// d = c			// non-identity copies.
	/// = d			if (CurrPhys != PhysReg) {
	/// e = ld SpillSlot			DEBUG(dbgs() << "Checking profitability:\n");
	/// = e			BlockFrequency OldCopiesCost = getBrokenHintFreq(Info, CurrPhys);
	/// This is likely that we can assign the same register for b, c, and d,			BlockFrequency NewCopiesCost = getBrokenHintFreq(Info, PhysReg);
	/// getting rid of 2 copies.			DEBUG(dbgs() << "Old Cost: " << OldCopiesCost.getFrequency()
	void RAGreedy::tryHintsRecoloring() {			<< "\nNew Cost: " << NewCopiesCost.getFrequency() << '\n');
	for (LiveInterval *LI : SetOfBrokenHints) {			if (OldCopiesCost < NewCopiesCost) {
	assert(TargetRegisterInfo::isVirtualRegister(LI->reg) &&			DEBUG(dbgs() << "=> Not profitable.\n");
	"Recoloring is possible only for virtual registers");			continue;
	// Some dead defs may be around (e.g., because of debug uses).			}
	// Ignore those.			// At this point, the cost is either cheaper or equal. If it is
	if (!VRM->hasPhys(LI->reg))			// equal, we consider this is profitable because it may expose
	continue;			// more recoloring opportunities.
	tryHintRecoloring(*LI);			DEBUG(dbgs() << "=> Profitable.\n");
	}			// Recolor the live-range.
	}			Matrix->unassign(LI);
				Matrix->assign(LI, PhysReg);
	unsigned RAGreedy::selectOrSplitImpl(LiveInterval &VirtReg,			}
	SmallVectorImpl<unsigned> &NewVRegs,			// Push all copy-related live-ranges to keep reconciling the broken
	SmallVirtRegSet &FixedRegisters,			// hints.
	unsigned Depth) {			for (const HintInfo &HI : Info) {
	unsigned CostPerUseLimit = ~0u;			if (Visited.insert(HI.Reg).second)
	// First try assigning a free register.			RecoloringCandidates.push_back(HI.Reg);
	AllocationOrder Order(VirtReg.reg, *VRM, RegClassInfo, Matrix);			}
	if (unsigned PhysReg = tryAssign(VirtReg, Order, NewVRegs)) {			} while (!RecoloringCandidates.empty());
	// If VirtReg got an assignment, the eviction info is no longre relevant.			}
	LastEvicted.clearEvicteeInfo(VirtReg.reg);
	// When NewVRegs is not empty, we may have made decisions such as evicting			/// \brief Try to recolor broken hints.
	// a virtual register, go with the earlier decisions and use the physical			/// Broken hints may be repaired by recoloring when an evicted variable
	// register.			/// freed up a register for a larger live-range.
	if (CSRCost.getFrequency() && isUnusedCalleeSavedReg(PhysReg) &&			/// Consider the following example:
	NewVRegs.empty()) {			/// BB1:
	unsigned CSRReg = tryAssignCSRFirstTime(VirtReg, Order, PhysReg,			/// a =
	CostPerUseLimit, NewVRegs);			/// b =
	if (CSRReg \|\| !NewVRegs.empty())			/// BB2:
	// Return now if we decide to use a CSR or create new vregs due to			/// ...
	// pre-splitting.			/// = b
	return CSRReg;			/// = a
	} else			/// Let us assume b gets split:
	return PhysReg;			/// BB1:
	}			/// a =
				/// b =
	LiveRangeStage Stage = getStage(VirtReg);			/// BB2:
	DEBUG(dbgs() << StageName[Stage]			/// c = b
	<< " Cascade " << ExtraRegInfo[VirtReg.reg].Cascade << '\n');			/// ...
				/// d = c
	// Try to evict a less worthy live range, but only for ranges from the primary			/// = d
	// queue. The RS_Split ranges already failed to do this, and they should not			/// = a
	// get a second chance until they have been split.			/// Because of how the allocation work, b, c, and d may be assigned different
	if (Stage != RS_Split)			/// colors. Now, if a gets evicted later:
	if (unsigned PhysReg =			/// BB1:
	tryEvict(VirtReg, Order, NewVRegs, CostPerUseLimit)) {			/// a =
	unsigned Hint = MRI->getSimpleHint(VirtReg.reg);			/// st a, SpillSlot
	// If VirtReg has a hint and that hint is broken record this			/// b =
	// virtual register as a recoloring candidate for broken hint.			/// BB2:
	// Indeed, since we evicted a variable in its neighborhood it is			/// c = b
	// likely we can at least partially recolor some of the			/// ...
	// copy-related live-ranges.			/// d = c
	if (Hint && Hint != PhysReg)			/// = d
	SetOfBrokenHints.insert(&VirtReg);			/// e = ld SpillSlot
	// If VirtReg eviction someone, the eviction info for it as an evictee is			/// = e
	// no longre relevant.			/// This is likely that we can assign the same register for b, c, and d,
	LastEvicted.clearEvicteeInfo(VirtReg.reg);			/// getting rid of 2 copies.
	return PhysReg;			void RAGreedy::tryHintsRecoloring() {
	}			for (LiveInterval *LI : SetOfBrokenHints) {
				assert(TargetRegisterInfo::isVirtualRegister(LI->reg) &&
	assert((NewVRegs.empty() \|\| Depth) && "Cannot append to existing NewVRegs");			"Recoloring is possible only for virtual registers");
				// Some dead defs may be around (e.g., because of debug uses).
	// The first time we see a live range, don't try to split or spill.			// Ignore those.
	// Wait until the second time, when all smaller ranges have been allocated.			if (!VRM->hasPhys(LI->reg))
	// This gives a better picture of the interference to split around.			continue;
	if (Stage < RS_Split) {			tryHintRecoloring(*LI);
	setStage(VirtReg, RS_Split);			}
	DEBUG(dbgs() << "wait for second round\n");			}
	NewVRegs.push_back(VirtReg.reg);
	return 0;			unsigned RAGreedy::selectOrSplitImpl(LiveInterval &VirtReg,
	}			SmallVectorImpl<unsigned> &NewVRegs,
				SmallVirtRegSet &FixedRegisters,
	if (Stage < RS_Spill) {			unsigned Depth) {
	// Try splitting VirtReg or interferences.			unsigned CostPerUseLimit = ~0u;
	unsigned NewVRegSizeBefore = NewVRegs.size();			// First try assigning a free register.
	unsigned PhysReg = trySplit(VirtReg, Order, NewVRegs);			AllocationOrder Order(VirtReg.reg, *VRM, RegClassInfo, Matrix);
	if (PhysReg \|\| (NewVRegs.size() - NewVRegSizeBefore)) {			if (unsigned PhysReg = tryAssign(VirtReg, Order, NewVRegs)) {
	// If VirtReg got split, the eviction info is no longre relevant.			// If VirtReg got an assignment, the eviction info is no longre relevant.
	LastEvicted.clearEvicteeInfo(VirtReg.reg);			LastEvicted.clearEvicteeInfo(VirtReg.reg);
	return PhysReg;			// When NewVRegs is not empty, we may have made decisions such as evicting
	}			// a virtual register, go with the earlier decisions and use the physical
	}			// register.
				if (CSRCost.getFrequency() && isUnusedCalleeSavedReg(PhysReg) &&
	// If we couldn't allocate a register from spilling, there is probably some			NewVRegs.empty()) {
	// invalid inline assembly. The base class will report it.			unsigned CSRReg = tryAssignCSRFirstTime(VirtReg, Order, PhysReg,
	if (Stage >= RS_Done \|\| !VirtReg.isSpillable())			CostPerUseLimit, NewVRegs);
	return tryLastChanceRecoloring(VirtReg, Order, NewVRegs, FixedRegisters,			if (CSRReg \|\| !NewVRegs.empty())
	Depth);			// Return now if we decide to use a CSR or create new vregs due to
				// pre-splitting.
	// Finally spill VirtReg itself.			return CSRReg;
	if (EnableDeferredSpilling && getStage(VirtReg) < RS_Memory) {			} else
	// TODO: This is experimental and in particular, we do not model			return PhysReg;
	// the live range splitting done by spilling correctly.			}
	// We would need a deep integration with the spiller to do the
	// right thing here. Anyway, that is still good for early testing.			LiveRangeStage Stage = getStage(VirtReg);
	setStage(VirtReg, RS_Memory);			DEBUG(dbgs() << StageName[Stage]
	DEBUG(dbgs() << "Do as if this register is in memory\n");			<< " Cascade " << ExtraRegInfo[VirtReg.reg].Cascade << '\n');
	NewVRegs.push_back(VirtReg.reg);
	} else {			// Try to evict a less worthy live range, but only for ranges from the primary
	NamedRegionTimer T("spill", "Spiller", TimerGroupName,			// queue. The RS_Split ranges already failed to do this, and they should not
	TimerGroupDescription, TimePassesIsEnabled);			// get a second chance until they have been split.
	LiveRangeEdit LRE(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);			if (Stage != RS_Split)
	spiller().spill(LRE);			if (unsigned PhysReg =
	setStage(NewVRegs.begin(), NewVRegs.end(), RS_Done);			tryEvict(VirtReg, Order, NewVRegs, CostPerUseLimit)) {
				unsigned Hint = MRI->getSimpleHint(VirtReg.reg);
	if (VerifyEnabled)			// If VirtReg has a hint and that hint is broken record this
	MF->verify(this, "After spilling");			// virtual register as a recoloring candidate for broken hint.
	}			// Indeed, since we evicted a variable in its neighborhood it is
				// likely we can at least partially recolor some of the
	// The live virtual register requesting allocation was spilled, so tell			// copy-related live-ranges.
	// the caller not to allocate anything during this round.			if (Hint && Hint != PhysReg)
	return 0;			SetOfBrokenHints.insert(&VirtReg);
	}			// If VirtReg eviction someone, the eviction info for it as an evictee is
				// no longre relevant.
	void RAGreedy::reportNumberOfSplillsReloads(MachineLoop *L, unsigned &Reloads,			LastEvicted.clearEvicteeInfo(VirtReg.reg);
	unsigned &FoldedReloads,			return PhysReg;
	unsigned &Spills,			}
	unsigned &FoldedSpills) {
	Reloads = 0;			assert((NewVRegs.empty() \|\| Depth) && "Cannot append to existing NewVRegs");
	FoldedReloads = 0;
	Spills = 0;			// The first time we see a live range, don't try to split or spill.
	FoldedSpills = 0;			// Wait until the second time, when all smaller ranges have been allocated.
				// This gives a better picture of the interference to split around.
	// Sum up the spill and reloads in subloops.			if (Stage < RS_Split) {
	for (MachineLoop SubLoop : L) {			setStage(VirtReg, RS_Split);
	unsigned SubReloads;			DEBUG(dbgs() << "wait for second round\n");
	unsigned SubFoldedReloads;			NewVRegs.push_back(VirtReg.reg);
	unsigned SubSpills;			return 0;
	unsigned SubFoldedSpills;			}

	reportNumberOfSplillsReloads(SubLoop, SubReloads, SubFoldedReloads,			if (Stage < RS_Spill) {
	SubSpills, SubFoldedSpills);			// Try splitting VirtReg or interferences.
	Reloads += SubReloads;			unsigned NewVRegSizeBefore = NewVRegs.size();
	FoldedReloads += SubFoldedReloads;			unsigned PhysReg = trySplit(VirtReg, Order, NewVRegs);
	Spills += SubSpills;			if (PhysReg \|\| (NewVRegs.size() - NewVRegSizeBefore)) {
	FoldedSpills += SubFoldedSpills;			// If VirtReg got split, the eviction info is no longre relevant.
	}			LastEvicted.clearEvicteeInfo(VirtReg.reg);
				return PhysReg;
	const MachineFrameInfo &MFI = MF->getFrameInfo();			}
	const TargetInstrInfo *TII = MF->getSubtarget().getInstrInfo();			}
	int FI;
				// If we couldn't allocate a register from spilling, there is probably some
	for (MachineBasicBlock *MBB : L->getBlocks())			// invalid inline assembly. The base class will report it.
	// Handle blocks that were not included in subloops.			if (Stage >= RS_Done \|\| !VirtReg.isSpillable())
	if (Loops->getLoopFor(MBB) == L)			return tryLastChanceRecoloring(VirtReg, Order, NewVRegs, FixedRegisters,
	for (MachineInstr &MI : *MBB) {			Depth);
	const MachineMemOperand *MMO;
				// Finally spill VirtReg itself.
	if (TII->isLoadFromStackSlot(MI, FI) && MFI.isSpillSlotObjectIndex(FI))			if (EnableDeferredSpilling && getStage(VirtReg) < RS_Memory) {
	++Reloads;			// TODO: This is experimental and in particular, we do not model
	else if (TII->hasLoadFromStackSlot(MI, MMO, FI) &&			// the live range splitting done by spilling correctly.
	MFI.isSpillSlotObjectIndex(FI))			// We would need a deep integration with the spiller to do the
	++FoldedReloads;			// right thing here. Anyway, that is still good for early testing.
	else if (TII->isStoreToStackSlot(MI, FI) &&			setStage(VirtReg, RS_Memory);
	MFI.isSpillSlotObjectIndex(FI))			DEBUG(dbgs() << "Do as if this register is in memory\n");
	++Spills;			NewVRegs.push_back(VirtReg.reg);
	else if (TII->hasStoreToStackSlot(MI, MMO, FI) &&			} else {
	MFI.isSpillSlotObjectIndex(FI))			NamedRegionTimer T("spill", "Spiller", TimerGroupName,
	++FoldedSpills;			TimerGroupDescription, TimePassesIsEnabled);
	}			LiveRangeEdit LRE(&VirtReg, NewVRegs, MF, LIS, VRM, this, &DeadRemats);
				spiller().spill(LRE);
	if (Reloads \|\| FoldedReloads \|\| Spills \|\| FoldedSpills) {			setStage(NewVRegs.begin(), NewVRegs.end(), RS_Done);
	using namespace ore;
				if (VerifyEnabled)
	ORE->emit([&]() {			MF->verify(this, "After spilling");
	MachineOptimizationRemarkMissed R(DEBUG_TYPE, "LoopSpillReload",			}
	L->getStartLoc(), L->getHeader());
	if (Spills)			// The live virtual register requesting allocation was spilled, so tell
	R << NV("NumSpills", Spills) << " spills ";			// the caller not to allocate anything during this round.
	if (FoldedSpills)			return 0;
	R << NV("NumFoldedSpills", FoldedSpills) << " folded spills ";			}
	if (Reloads)
	R << NV("NumReloads", Reloads) << " reloads ";			void RAGreedy::reportNumberOfSplillsReloads(MachineLoop *L, unsigned &Reloads,
	if (FoldedReloads)			unsigned &FoldedReloads,
	R << NV("NumFoldedReloads", FoldedReloads) << " folded reloads ";			unsigned &Spills,
	R << "generated in loop";			unsigned &FoldedSpills) {
	return R;			Reloads = 0;
	});			FoldedReloads = 0;
	}			Spills = 0;
	}			FoldedSpills = 0;

	bool RAGreedy::runOnMachineFunction(MachineFunction &mf) {			// Sum up the spill and reloads in subloops.
	DEBUG(dbgs() << "******** GREEDY REGISTER ALLOCATION ********\n"			for (MachineLoop SubLoop : L) {
	<< "********** Function: " << mf.getName() << '\n');			unsigned SubReloads;
				unsigned SubFoldedReloads;
	MF = &mf;			unsigned SubSpills;
	TRI = MF->getSubtarget().getRegisterInfo();			unsigned SubFoldedSpills;
	TII = MF->getSubtarget().getInstrInfo();
	RCI.runOnMachineFunction(mf);			reportNumberOfSplillsReloads(SubLoop, SubReloads, SubFoldedReloads,
				SubSpills, SubFoldedSpills);
	EnableLocalReassign = EnableLocalReassignment \|\|			Reloads += SubReloads;
	MF->getSubtarget().enableRALocalReassignment(			FoldedReloads += SubFoldedReloads;
	MF->getTarget().getOptLevel());			Spills += SubSpills;
				FoldedSpills += SubFoldedSpills;
	EnableAdvancedRASplitCost = ConsiderLocalIntervalCost \|\|			}
	MF->getSubtarget().enableAdvancedRASplitCost();
				const MachineFrameInfo &MFI = MF->getFrameInfo();
	if (VerifyEnabled)			const TargetInstrInfo *TII = MF->getSubtarget().getInstrInfo();
	MF->verify(this, "Before greedy register allocator");			int FI;

	RegAllocBase::init(getAnalysis<VirtRegMap>(),			for (MachineBasicBlock *MBB : L->getBlocks())
	getAnalysis<LiveIntervals>(),			// Handle blocks that were not included in subloops.
	getAnalysis<LiveRegMatrix>());			if (Loops->getLoopFor(MBB) == L)
	Indexes = &getAnalysis<SlotIndexes>();			for (MachineInstr &MI : *MBB) {
	MBFI = &getAnalysis<MachineBlockFrequencyInfo>();			const MachineMemOperand *MMO;
	DomTree = &getAnalysis<MachineDominatorTree>();
	ORE = &getAnalysis<MachineOptimizationRemarkEmitterPass>().getORE();			if (TII->isLoadFromStackSlot(MI, FI) && MFI.isSpillSlotObjectIndex(FI))
	SpillerInstance.reset(createInlineSpiller(this, MF, *VRM));			++Reloads;
	Loops = &getAnalysis<MachineLoopInfo>();			else if (TII->hasLoadFromStackSlot(MI, MMO, FI) &&
	Bundles = &getAnalysis<EdgeBundles>();			MFI.isSpillSlotObjectIndex(FI))
	SpillPlacer = &getAnalysis<SpillPlacement>();			++FoldedReloads;
	DebugVars = &getAnalysis<LiveDebugVariables>();			else if (TII->isStoreToStackSlot(MI, FI) &&
	AA = &getAnalysis<AAResultsWrapperPass>().getAAResults();			MFI.isSpillSlotObjectIndex(FI))
				++Spills;
	initializeCSRCost();			else if (TII->hasStoreToStackSlot(MI, MMO, FI) &&
				MFI.isSpillSlotObjectIndex(FI))
	calculateSpillWeightsAndHints(LIS, mf, VRM, Loops, *MBFI);			++FoldedSpills;
				}
	DEBUG(LIS->dump());
				if (Reloads \|\| FoldedReloads \|\| Spills \|\| FoldedSpills) {
	SA.reset(new SplitAnalysis(VRM, LIS, *Loops));			using namespace ore;
	SE.reset(new SplitEditor(SA, AA, LIS, VRM, DomTree, MBFI));
	ExtraRegInfo.clear();			ORE->emit([&]() {
	ExtraRegInfo.resize(MRI->getNumVirtRegs());			MachineOptimizationRemarkMissed R(DEBUG_TYPE, "LoopSpillReload",
	NextCascade = 1;			L->getStartLoc(), L->getHeader());
	IntfCache.init(MF, Matrix->getLiveUnions(), Indexes, LIS, TRI);			if (Spills)
	GlobalCand.resize(32); // This will grow as needed.			R << NV("NumSpills", Spills) << " spills ";
	SetOfBrokenHints.clear();			if (FoldedSpills)
	LastEvicted.clear();			R << NV("NumFoldedSpills", FoldedSpills) << " folded spills ";
				if (Reloads)
	allocatePhysRegs();			R << NV("NumReloads", Reloads) << " reloads ";
	tryHintsRecoloring();			if (FoldedReloads)
	postOptimization();			R << NV("NumFoldedReloads", FoldedReloads) << " folded reloads ";
	reportNumberOfSplillsReloads();			R << "generated in loop";
				return R;
	releaseMemory();			});
	return true;			}
	}			}

				bool RAGreedy::runOnMachineFunction(MachineFunction &mf) {
				DEBUG(dbgs() << "******** GREEDY REGISTER ALLOCATION ********\n"
				<< "********** Function: " << mf.getName() << '\n');

				MF = &mf;
				TRI = MF->getSubtarget().getRegisterInfo();
				TII = MF->getSubtarget().getInstrInfo();
				RCI.runOnMachineFunction(mf);

				EnableLocalReassign = EnableLocalReassignment \|\|
				MF->getSubtarget().enableRALocalReassignment(
				MF->getTarget().getOptLevel());

				EnableAdvancedRASplitCost = ConsiderLocalIntervalCost \|\|
				MF->getSubtarget().enableAdvancedRASplitCost();

				if (VerifyEnabled)
				MF->verify(this, "Before greedy register allocator");

				RegAllocBase::init(getAnalysis<VirtRegMap>(),
				getAnalysis<LiveIntervals>(),
				getAnalysis<LiveRegMatrix>());
				Indexes = &getAnalysis<SlotIndexes>();
				MBFI = &getAnalysis<MachineBlockFrequencyInfo>();
				DomTree = &getAnalysis<MachineDominatorTree>();
				ORE = &getAnalysis<MachineOptimizationRemarkEmitterPass>().getORE();
				SpillerInstance.reset(createInlineSpiller(this, MF, *VRM));
				Loops = &getAnalysis<MachineLoopInfo>();
				Bundles = &getAnalysis<EdgeBundles>();
				SpillPlacer = &getAnalysis<SpillPlacement>();
				DebugVars = &getAnalysis<LiveDebugVariables>();
				AA = &getAnalysis<AAResultsWrapperPass>().getAAResults();

				initializeCSRCost();

				calculateSpillWeightsAndHints(LIS, mf, VRM, Loops, *MBFI);

				DEBUG(LIS->dump());

				SA.reset(new SplitAnalysis(VRM, LIS, *Loops));
				SE.reset(new SplitEditor(SA, AA, LIS, VRM, DomTree, MBFI));
				ExtraRegInfo.clear();
				ExtraRegInfo.resize(MRI->getNumVirtRegs());
				NextCascade = 1;
				IntfCache.init(MF, Matrix->getLiveUnions(), Indexes, LIS, TRI);
				GlobalCand.resize(32); // This will grow as needed.
				SetOfBrokenHints.clear();
				LastEvicted.clear();

				allocatePhysRegs();
				tryHintsRecoloring();
				postOptimization();
				reportNumberOfSplillsReloads();

				releaseMemory();
				return true;
				}

test/CodeGen/X86/bug26810.ll

Context not available.
	; CHECK: bb.2.for.body:	; CHECK: bb.2.for.body:
	; CHECK: SUBPDrr	; CHECK: SUBPDrr
	; CHECK-NEXT: MOVAPSmr	; CHECK-NEXT: MOVAPSmr
	; CHECK-NEXT: MOVAPSrm
	; CHECK-NEXT: MULPDrm	; CHECK-NEXT: MULPDrm
		; CHECK-NEXT: MOVAPSrm
	; CHECK-NEXT: ADDPDrr	; CHECK-NEXT: ADDPDrr
		; CHECK-NEXT: MOVAPSmr
		qcolombetUnsubmitted Not Done Reply Inline Actions Why is the new code sequence better? qcolombet: Why is the new code sequence better?
		myatsinaAuthorUnsubmitted Not Done Reply Inline Actions If we look at the new sequence of the full loop, then it is not worse than the original one. Here the test is only matching a small part of the loop (because this test is meant to check some other sequence). Before patch: loop: MOVAPSmr ... SUBPDrr MOVAPSmr MOVAPSrm MULPDrm ADDPDrr ADD32ri8 ... jmp loop After patch: loop: ... SUBPDrr MOVAPSmr MULPDrm MOVAPSrm ADDPDrr MOVAPSrm ADD32ri8 ... jmp loop So the MOVAPSmr which was in the beginning of the loop was moved to the end of the loop. myatsina: If we look at the new sequence of the full loop, then it is not worse than the original one.
	; CHECK-NEXT: ADD32ri8	; CHECK-NEXT: ADD32ri8

	target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"	target datalayout = "e-m:x-p:32:32-i64:64-f80:32-n8:16:32-a:0:32-S32"
Context not available.

test/CodeGen/X86/regalloc-advanced-split-cost.ll

This file was added.

				; RUN: llc < %s -march=x86 -regalloc=greedy --debug-only=regalloc 2>&1 \| FileCheck %s

				; This test is meant to make sure that the weight of local intervals that are
				; created during split is taken into account when choosing the best candidate
				; register.
				; %shl is the interval that will be split.
				; The inline assembly calls interfere with %shl and make only 2 available split
				; candidates - %esi and %ebp.
				; The old code would have chosen %esi as the split candidate ignoring the fact
				; that this choice will cause the creation of a local interval that will have a
				; certain spill cost.
				; The new code choses %ebp as the split candidate as it has lower spill cost.

				; Make sure the split behaves as expected
				; CHECK: RS_Split Cascade 1
				; CHECK-NOT: %eax static =
				; CHECK: %eax no positive bundles
				; CHECK-NEXT: %ecx no positive bundles
				; CHECK-NEXT: %edx no positive bundles
				; CHECK-NEXT: %esi static =
				; CHECK-NEXT: %edi no positive bundles
				; CHECK-NEXT: %ebx no positive bundles
				; CHECK-NEXT: %ebp static =
				; CHECK: Split for %ebp

				; Function Attrs: nounwind
				define i32 @foo(i32* %array, i32 %cond1, i32 %val) local_unnamed_addr #0 {
				entry:
				%array.addr = alloca i32*, align 4
				store i32* %array, i32** %array.addr, align 4, !tbaa !3
				%0 = load i32, i32* %array, align 4, !tbaa !7
				%arrayidx1 = getelementptr inbounds i32, i32* %array, i32 1
				%1 = load i32, i32* %arrayidx1, align 4, !tbaa !7
				%arrayidx2 = getelementptr inbounds i32, i32* %array, i32 2
				%2 = load i32, i32* %arrayidx2, align 4, !tbaa !7
				%arrayidx3 = getelementptr inbounds i32, i32* %array, i32 3
				%3 = load i32, i32* %arrayidx3, align 4, !tbaa !7
				%arrayidx4 = getelementptr inbounds i32, i32* %array, i32 4
				%4 = load i32, i32* %arrayidx4, align 4, !tbaa !7
				%arrayidx6 = getelementptr inbounds i32, i32* %array, i32 %val
				%5 = load i32, i32* %arrayidx6, align 4, !tbaa !7
				%shl = shl i32 %5, 5
				%tobool = icmp eq i32 %cond1, 0
				br i1 %tobool, label %if.else, label %if.then

				if.then: ; preds = %entry
				%arrayidx7 = getelementptr inbounds i32, i32* %array, i32 6
				store i32 %shl, i32* %arrayidx7, align 4, !tbaa !7
				call void asm "nop", "=m,r,r,r,r,r,m,~{dirflag},~{fpsr},~{flags}"(i32 nonnull %array.addr, i32 %0, i32 %1, i32 %2, i32 %3, i32 %4, i32 nonnull %array.addr) #1, !srcloc !9
				%6 = load i32, i32* %array.addr, align 4, !tbaa !3
				%arrayidx8 = getelementptr inbounds i32, i32* %6, i32 7
				br label %if.end

				if.else: ; preds = %entry
				%arrayidx5 = getelementptr inbounds i32, i32* %array, i32 5
				%7 = load i32, i32* %arrayidx5, align 4, !tbaa !7
				%arrayidx9 = getelementptr inbounds i32, i32* %array, i32 8
				store i32 %shl, i32* %arrayidx9, align 4, !tbaa !7
				call void asm "nop", "=m,{ax},{bx},{cx},{dx},{di},{si},{ebp},m,~{dirflag},~{fpsr},~{flags}"(i32** nonnull %array.addr, i32 %0, i32 %1, i32 %2, i32 %3, i32 %4, i32 %7, i32* undef, i32** nonnull %array.addr) #1, !srcloc !10
				%8 = load i32, i32* %array.addr, align 4, !tbaa !3
				%arrayidx10 = getelementptr inbounds i32, i32* %8, i32 9
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				%arrayidx10.sink = phi i32* [ %arrayidx10, %if.else ], [ %arrayidx8, %if.then ]
				%9 = phi i32* [ %8, %if.else ], [ %6, %if.then ]
				store i32 %shl, i32* %arrayidx10.sink, align 4, !tbaa !7
				%10 = load i32, i32* %9, align 4, !tbaa !7
				%add = add nsw i32 %10, %shl
				ret i32 %add
				}

				attributes #0 = { nounwind "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="false" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="false" "stack-protector-buffer-size"="8" "target-features"="+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
				attributes #1 = { nounwind }

				!llvm.module.flags = !{!0, !1}
				!llvm.ident = !{!2}

				!0 = !{i32 1, !"NumRegisterParameters", i32 0}
				!1 = !{i32 1, !"wchar_size", i32 4}
				!2 = !{!"clang version 6.0.0"}
				!3 = !{!4, !4, i64 0}
				!4 = !{!"any pointer", !5, i64 0}
				!5 = !{!"omnipotent char", !6, i64 0}
				!6 = !{!"Simple C/C++ TBAA"}
				!7 = !{!8, !8, i64 0}
				!8 = !{!"int", !5, i64 0}
				!9 = !{i32 268}
				!10 = !{i32 390}

test/CodeGen/X86/sad.ll

Context not available.
	; SSE2: # %bb.0: # %entry	; SSE2: # %bb.0: # %entry
	; SSE2-NEXT: pxor %xmm12, %xmm12	; SSE2-NEXT: pxor %xmm12, %xmm12
	; SSE2-NEXT: movq $-1024, %rax # imm = 0xFC00	; SSE2-NEXT: movq $-1024, %rax # imm = 0xFC00
	; SSE2-NEXT: pxor %xmm13, %xmm13	; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: pxor %xmm6, %xmm6	; SSE2-NEXT: pxor %xmm6, %xmm6
	; SSE2-NEXT: pxor %xmm4, %xmm4	; SSE2-NEXT: pxor %xmm13, %xmm13
	; SSE2-NEXT: pxor %xmm3, %xmm3	; SSE2-NEXT: pxor %xmm0, %xmm0
	; SSE2-NEXT: pxor %xmm14, %xmm14	; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: pxor %xmm15, %xmm15	; SSE2-NEXT: pxor %xmm15, %xmm15
	; SSE2-NEXT: pxor %xmm1, %xmm1
	; SSE2-NEXT: pxor %xmm0, %xmm0	; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm14, %xmm14
	; SSE2-NEXT: .p2align 4, 0x90	; SSE2-NEXT: .p2align 4, 0x90
	; SSE2-NEXT: .LBB1_1: # %vector.body	; SSE2-NEXT: .LBB1_1: # %vector.body
	; SSE2-NEXT: # =>This Inner Loop Header: Depth=1	; SSE2-NEXT: # =>This Inner Loop Header: Depth=1
	; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm3, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm4, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa a+1040(%rax), %xmm8	; SSE2-NEXT: movdqa a+1040(%rax), %xmm8
	; SSE2-NEXT: movdqa a+1024(%rax), %xmm3	; SSE2-NEXT: movdqa a+1024(%rax), %xmm3
	; SSE2-NEXT: movdqa %xmm3, %xmm4	; SSE2-NEXT: movdqa %xmm3, %xmm4
Context not available.
	; SSE2-NEXT: psrad $31, %xmm6	; SSE2-NEXT: psrad $31, %xmm6
	; SSE2-NEXT: paddd %xmm6, %xmm7	; SSE2-NEXT: paddd %xmm6, %xmm7
	; SSE2-NEXT: pxor %xmm6, %xmm7	; SSE2-NEXT: pxor %xmm6, %xmm7
	; SSE2-NEXT: paddd %xmm7, %xmm13	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm6 # 16-byte Reload
		; SSE2-NEXT: paddd %xmm7, %xmm6
		; SSE2-NEXT: movdqa %xmm6, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm4, %xmm6	; SSE2-NEXT: movdqa %xmm4, %xmm6
	; SSE2-NEXT: psrad $31, %xmm6	; SSE2-NEXT: psrad $31, %xmm6
	; SSE2-NEXT: paddd %xmm6, %xmm4	; SSE2-NEXT: paddd %xmm6, %xmm4
	; SSE2-NEXT: pxor %xmm6, %xmm4	; SSE2-NEXT: pxor %xmm6, %xmm4
	; SSE2-NEXT: movdqa %xmm10, %xmm6	; SSE2-NEXT: movdqa %xmm10, %xmm6
	; SSE2-NEXT: paddd %xmm4, %xmm6	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm7 # 16-byte Reload
		; SSE2-NEXT: paddd %xmm4, %xmm7
		; SSE2-NEXT: movdqa %xmm7, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm1, %xmm4	; SSE2-NEXT: movdqa %xmm1, %xmm4
	; SSE2-NEXT: psrad $31, %xmm4	; SSE2-NEXT: psrad $31, %xmm4
	; SSE2-NEXT: paddd %xmm4, %xmm1	; SSE2-NEXT: paddd %xmm4, %xmm1
	; SSE2-NEXT: pxor %xmm4, %xmm1	; SSE2-NEXT: pxor %xmm4, %xmm1
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm4 # 16-byte Reload	; SSE2-NEXT: paddd %xmm1, %xmm6
	; SSE2-NEXT: paddd %xmm1, %xmm4
	; SSE2-NEXT: movdqa %xmm3, %xmm1	; SSE2-NEXT: movdqa %xmm3, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm3	; SSE2-NEXT: paddd %xmm1, %xmm3
	; SSE2-NEXT: pxor %xmm1, %xmm3	; SSE2-NEXT: pxor %xmm1, %xmm3
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm3, %xmm1	; SSE2-NEXT: paddd %xmm3, %xmm1
	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm3 # 16-byte Reload
	; SSE2-NEXT: movdqa %xmm5, %xmm1	; SSE2-NEXT: movdqa %xmm5, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm5	; SSE2-NEXT: paddd %xmm1, %xmm5
	; SSE2-NEXT: pxor %xmm1, %xmm5	; SSE2-NEXT: pxor %xmm1, %xmm5
	; SSE2-NEXT: paddd %xmm5, %xmm14	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
		; SSE2-NEXT: paddd %xmm5, %xmm1
		; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm0, %xmm1	; SSE2-NEXT: movdqa %xmm0, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm0	; SSE2-NEXT: paddd %xmm1, %xmm0
	; SSE2-NEXT: pxor %xmm1, %xmm0	; SSE2-NEXT: pxor %xmm1, %xmm0
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm0, %xmm15	; SSE2-NEXT: paddd %xmm0, %xmm15
	; SSE2-NEXT: movdqa %xmm2, %xmm0	; SSE2-NEXT: movdqa %xmm2, %xmm0
	; SSE2-NEXT: psrad $31, %xmm0	; SSE2-NEXT: psrad $31, %xmm0
	; SSE2-NEXT: paddd %xmm0, %xmm2	; SSE2-NEXT: paddd %xmm0, %xmm2
	; SSE2-NEXT: pxor %xmm0, %xmm2	; SSE2-NEXT: pxor %xmm0, %xmm2
	; SSE2-NEXT: paddd %xmm2, %xmm1	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload
		; SSE2-NEXT: paddd %xmm2, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm8, %xmm0	; SSE2-NEXT: movdqa %xmm8, %xmm0
	; SSE2-NEXT: psrad $31, %xmm0	; SSE2-NEXT: psrad $31, %xmm0
	; SSE2-NEXT: paddd %xmm0, %xmm8	; SSE2-NEXT: paddd %xmm0, %xmm8
	; SSE2-NEXT: pxor %xmm0, %xmm8	; SSE2-NEXT: pxor %xmm0, %xmm8
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload	; SSE2-NEXT: paddd %xmm8, %xmm14
	; SSE2-NEXT: paddd %xmm8, %xmm0
	; SSE2-NEXT: addq $4, %rax	; SSE2-NEXT: addq $4, %rax
	; SSE2-NEXT: jne .LBB1_1	; SSE2-NEXT: jne .LBB1_1
	; SSE2-NEXT: # %bb.2: # %middle.block	; SSE2-NEXT: # %bb.2: # %middle.block
	; SSE2-NEXT: paddd %xmm15, %xmm6	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm0, %xmm3	; SSE2-NEXT: paddd %xmm15, %xmm0
	; SSE2-NEXT: paddd %xmm6, %xmm3
	; SSE2-NEXT: paddd %xmm14, %xmm13	; SSE2-NEXT: paddd %xmm14, %xmm13
	; SSE2-NEXT: paddd %xmm1, %xmm4	; SSE2-NEXT: paddd %xmm0, %xmm13
	; SSE2-NEXT: paddd %xmm3, %xmm4	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm13, %xmm4	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Folded Reload
	; SSE2-NEXT: pshufd {{.*#+}} xmm0 = xmm4[2,3,0,1]	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm6 # 16-byte Folded Reload
	; SSE2-NEXT: paddd %xmm4, %xmm0	; SSE2-NEXT: paddd %xmm13, %xmm6
		; SSE2-NEXT: paddd %xmm0, %xmm6
		; SSE2-NEXT: pshufd {{.*#+}} xmm0 = xmm6[2,3,0,1]
		; SSE2-NEXT: paddd %xmm6, %xmm0
	; SSE2-NEXT: pshufd {{.*#+}} xmm1 = xmm0[1,1,2,3]	; SSE2-NEXT: pshufd {{.*#+}} xmm1 = xmm0[1,1,2,3]
	; SSE2-NEXT: paddd %xmm0, %xmm1	; SSE2-NEXT: paddd %xmm0, %xmm1
	; SSE2-NEXT: movd %xmm1, %eax	; SSE2-NEXT: movd %xmm1, %eax
Context not available.
	; SSE2-NEXT: subq $200, %rsp	; SSE2-NEXT: subq $200, %rsp
	; SSE2-NEXT: pxor %xmm14, %xmm14	; SSE2-NEXT: pxor %xmm14, %xmm14
	; SSE2-NEXT: movq $-1024, %rax # imm = 0xFC00	; SSE2-NEXT: movq $-1024, %rax # imm = 0xFC00
	; SSE2-NEXT: pxor %xmm15, %xmm15
	; SSE2-NEXT: pxor %xmm10, %xmm10
	; SSE2-NEXT: pxor %xmm3, %xmm3
	; SSE2-NEXT: pxor %xmm5, %xmm5
	; SSE2-NEXT: pxor %xmm13, %xmm13
	; SSE2-NEXT: pxor %xmm1, %xmm1
	; SSE2-NEXT: pxor %xmm8, %xmm8
	; SSE2-NEXT: pxor %xmm0, %xmm0	; SSE2-NEXT: pxor %xmm0, %xmm0
	; SSE2-NEXT: pxor %xmm2, %xmm2	; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: pxor %xmm11, %xmm11	; SSE2-NEXT: pxor %xmm0, %xmm0
	; SSE2-NEXT: pxor %xmm4, %xmm4	; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm4, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: pxor %xmm0, %xmm0
	; SSE2-NEXT: pxor %xmm7, %xmm7	; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm7, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: pxor %xmm0, %xmm0
	; SSE2-NEXT: pxor %xmm7, %xmm7	; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm7, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: pxor %xmm0, %xmm0
	; SSE2-NEXT: pxor %xmm7, %xmm7	; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm7, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: pxor %xmm0, %xmm0
	; SSE2-NEXT: pxor %xmm7, %xmm7	; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm7, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: pxor %xmm0, %xmm0
	; SSE2-NEXT: pxor %xmm7, %xmm7	; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm7, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, (%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: pxor %xmm0, %xmm0
		; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: .p2align 4, 0x90	; SSE2-NEXT: .p2align 4, 0x90
	; SSE2-NEXT: .LBB2_1: # %vector.body	; SSE2-NEXT: .LBB2_1: # %vector.body
	; SSE2-NEXT: # =>This Inner Loop Header: Depth=1	; SSE2-NEXT: # =>This Inner Loop Header: Depth=1
	; SSE2-NEXT: movdqa %xmm2, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm3, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm8, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm11, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm5, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm13, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm10, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm1, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm15, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movaps a+1040(%rax), %xmm0	; SSE2-NEXT: movaps a+1040(%rax), %xmm0
	; SSE2-NEXT: movaps %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: movaps %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa a+1024(%rax), %xmm12	; SSE2-NEXT: movdqa a+1024(%rax), %xmm12
	; SSE2-NEXT: movdqa a+1056(%rax), %xmm15	; SSE2-NEXT: movdqa a+1056(%rax), %xmm15
	; SSE2-NEXT: movdqa a+1072(%rax), %xmm4	; SSE2-NEXT: movdqa a+1072(%rax), %xmm4
Context not available.
	; SSE2-NEXT: movdqa %xmm0, %xmm3	; SSE2-NEXT: movdqa %xmm0, %xmm3
	; SSE2-NEXT: punpckhwd {{.*#+}} xmm3 = xmm3[4],xmm14[4],xmm3[5],xmm14[5],xmm3[6],xmm14[6],xmm3[7],xmm14[7]	; SSE2-NEXT: punpckhwd {{.*#+}} xmm3 = xmm3[4],xmm14[4],xmm3[5],xmm14[5],xmm3[6],xmm14[6],xmm3[7],xmm14[7]
	; SSE2-NEXT: psubd %xmm3, %xmm2	; SSE2-NEXT: psubd %xmm3, %xmm2
	; SSE2-NEXT: movdqa %xmm2, (%rsp) # 16-byte Spill	; SSE2-NEXT: movdqa %xmm2, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: punpcklwd {{.*#+}} xmm0 = xmm0[0],xmm14[0],xmm0[1],xmm14[1],xmm0[2],xmm14[2],xmm0[3],xmm14[3]	; SSE2-NEXT: punpcklwd {{.*#+}} xmm0 = xmm0[0],xmm14[0],xmm0[1],xmm14[1],xmm0[2],xmm14[2],xmm0[3],xmm14[3]
	; SSE2-NEXT: psubd %xmm0, %xmm15	; SSE2-NEXT: psubd %xmm0, %xmm15
	; SSE2-NEXT: movdqa %xmm7, %xmm0	; SSE2-NEXT: movdqa %xmm7, %xmm0
Context not available.
	; SSE2-NEXT: punpcklwd {{.*#+}} xmm3 = xmm3[0],xmm14[0],xmm3[1],xmm14[1],xmm3[2],xmm14[2],xmm3[3],xmm14[3]	; SSE2-NEXT: punpcklwd {{.*#+}} xmm3 = xmm3[0],xmm14[0],xmm3[1],xmm14[1],xmm3[2],xmm14[2],xmm3[3],xmm14[3]
	; SSE2-NEXT: psubd %xmm3, %xmm9	; SSE2-NEXT: psubd %xmm3, %xmm9
	; SSE2-NEXT: movdqa %xmm9, {{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: movdqa %xmm9, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm2 # 16-byte Reload	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm2 # 16-byte Reload
	; SSE2-NEXT: movdqa %xmm2, %xmm9	; SSE2-NEXT: movdqa %xmm2, %xmm9
	; SSE2-NEXT: punpcklbw {{.*#+}} xmm9 = xmm9[0],xmm14[0],xmm9[1],xmm14[1],xmm9[2],xmm14[2],xmm9[3],xmm14[3],xmm9[4],xmm14[4],xmm9[5],xmm14[5],xmm9[6],xmm14[6],xmm9[7],xmm14[7]	; SSE2-NEXT: punpcklbw {{.*#+}} xmm9 = xmm9[0],xmm14[0],xmm9[1],xmm14[1],xmm9[2],xmm14[2],xmm9[3],xmm14[3],xmm9[4],xmm14[4],xmm9[5],xmm14[5],xmm9[6],xmm14[6],xmm9[7],xmm14[7]
	; SSE2-NEXT: punpckhwd {{.*#+}} xmm0 = xmm0[4],xmm14[4],xmm0[5],xmm14[5],xmm0[6],xmm14[6],xmm0[7],xmm14[7]	; SSE2-NEXT: punpckhwd {{.*#+}} xmm0 = xmm0[4],xmm14[4],xmm0[5],xmm14[5],xmm0[6],xmm14[6],xmm0[7],xmm14[7]
Context not available.
	; SSE2-NEXT: punpckhwd {{.*#+}} xmm2 = xmm2[4],xmm14[4],xmm2[5],xmm14[5],xmm2[6],xmm14[6],xmm2[7],xmm14[7]	; SSE2-NEXT: punpckhwd {{.*#+}} xmm2 = xmm2[4],xmm14[4],xmm2[5],xmm14[5],xmm2[6],xmm14[6],xmm2[7],xmm14[7]
	; SSE2-NEXT: punpckhwd {{.*#+}} xmm13 = xmm13[4],xmm14[4],xmm13[5],xmm14[5],xmm13[6],xmm14[6],xmm13[7],xmm14[7]	; SSE2-NEXT: punpckhwd {{.*#+}} xmm13 = xmm13[4],xmm14[4],xmm13[5],xmm14[5],xmm13[6],xmm14[6],xmm13[7],xmm14[7]
	; SSE2-NEXT: psubd %xmm13, %xmm2	; SSE2-NEXT: psubd %xmm13, %xmm2
	; SSE2-NEXT: movdqa %xmm2, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: movdqa %xmm2, %xmm13
	; SSE2-NEXT: movdqa %xmm1, %xmm3	; SSE2-NEXT: movdqa %xmm1, %xmm3
	; SSE2-NEXT: psrad $31, %xmm3	; SSE2-NEXT: psrad $31, %xmm3
	; SSE2-NEXT: paddd %xmm3, %xmm1	; SSE2-NEXT: paddd %xmm3, %xmm1
Context not available.
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm6, %xmm1	; SSE2-NEXT: paddd %xmm6, %xmm1
	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm3 # 16-byte Reload
	; SSE2-NEXT: movdqa %xmm5, %xmm1	; SSE2-NEXT: movdqa %xmm5, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm5	; SSE2-NEXT: paddd %xmm1, %xmm5
	; SSE2-NEXT: pxor %xmm1, %xmm5	; SSE2-NEXT: pxor %xmm1, %xmm5
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload	; SSE2-NEXT: movdqa (%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm5, %xmm1	; SSE2-NEXT: paddd %xmm5, %xmm1
	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: movdqa %xmm1, (%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm6 # 16-byte Reload
	; SSE2-NEXT: movdqa %xmm4, %xmm1	; SSE2-NEXT: movdqa %xmm4, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm4	; SSE2-NEXT: paddd %xmm1, %xmm4
Context not available.
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm4, %xmm1	; SSE2-NEXT: paddd %xmm4, %xmm1
	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm13 # 16-byte Reload
	; SSE2-NEXT: movdqa %xmm8, %xmm1	; SSE2-NEXT: movdqa %xmm8, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm8	; SSE2-NEXT: paddd %xmm1, %xmm8
Context not available.
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm8, %xmm1	; SSE2-NEXT: paddd %xmm8, %xmm1
	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm5 # 16-byte Reload
	; SSE2-NEXT: movdqa %xmm11, %xmm1	; SSE2-NEXT: movdqa %xmm11, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm11	; SSE2-NEXT: paddd %xmm1, %xmm11
Context not available.
	; SSE2-NEXT: paddd %xmm11, %xmm1	; SSE2-NEXT: paddd %xmm11, %xmm1
	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm2 # 16-byte Reload	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm2 # 16-byte Reload
	; SSE2-NEXT: movdqa (%rsp), %xmm4 # 16-byte Reload	; SSE2-NEXT: movdqa %xmm2, %xmm1
	; SSE2-NEXT: movdqa %xmm4, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm4	; SSE2-NEXT: paddd %xmm1, %xmm2
	; SSE2-NEXT: pxor %xmm1, %xmm4	; SSE2-NEXT: pxor %xmm1, %xmm2
	; SSE2-NEXT: paddd %xmm4, %xmm3	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: movdqa %xmm3, %xmm11	; SSE2-NEXT: paddd %xmm2, %xmm1
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm3 # 16-byte Reload	; SSE2-NEXT: movdqa %xmm1, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm15, %xmm1	; SSE2-NEXT: movdqa %xmm15, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm15	; SSE2-NEXT: paddd %xmm1, %xmm15
	; SSE2-NEXT: pxor %xmm1, %xmm15	; SSE2-NEXT: pxor %xmm1, %xmm15
	; SSE2-NEXT: paddd %xmm15, %xmm2	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm4 # 16-byte Reload	; SSE2-NEXT: paddd %xmm15, %xmm1
	; SSE2-NEXT: movdqa %xmm4, %xmm1	; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill
		; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm2 # 16-byte Reload
		; SSE2-NEXT: movdqa %xmm2, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm4	; SSE2-NEXT: paddd %xmm1, %xmm2
	; SSE2-NEXT: pxor %xmm1, %xmm4	; SSE2-NEXT: pxor %xmm1, %xmm2
	; SSE2-NEXT: paddd %xmm4, %xmm6	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: movdqa %xmm6, %xmm15	; SSE2-NEXT: paddd %xmm2, %xmm1
		; SSE2-NEXT: movdqa %xmm1, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm10, %xmm1	; SSE2-NEXT: movdqa %xmm10, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm10	; SSE2-NEXT: paddd %xmm1, %xmm10
	; SSE2-NEXT: pxor %xmm1, %xmm10	; SSE2-NEXT: pxor %xmm1, %xmm10
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm10, %xmm1	; SSE2-NEXT: paddd %xmm10, %xmm1
	; SSE2-NEXT: movdqa %xmm1, %xmm10	; SSE2-NEXT: movdqa %xmm1, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm6 # 16-byte Reload	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm2 # 16-byte Reload
	; SSE2-NEXT: movdqa %xmm6, %xmm1	; SSE2-NEXT: movdqa %xmm2, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm6	; SSE2-NEXT: paddd %xmm1, %xmm2
	; SSE2-NEXT: pxor %xmm1, %xmm6	; SSE2-NEXT: pxor %xmm1, %xmm2
	; SSE2-NEXT: paddd %xmm6, %xmm3	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
		; SSE2-NEXT: paddd %xmm2, %xmm1
		; SSE2-NEXT: movdqa %xmm1, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm12, %xmm1	; SSE2-NEXT: movdqa %xmm12, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm12	; SSE2-NEXT: paddd %xmm1, %xmm12
	; SSE2-NEXT: pxor %xmm1, %xmm12	; SSE2-NEXT: pxor %xmm1, %xmm12
	; SSE2-NEXT: paddd %xmm12, %xmm5	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
		; SSE2-NEXT: paddd %xmm12, %xmm1
		; SSE2-NEXT: movdqa %xmm1, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm0, %xmm1	; SSE2-NEXT: movdqa %xmm0, %xmm1
	; SSE2-NEXT: psrad $31, %xmm1	; SSE2-NEXT: psrad $31, %xmm1
	; SSE2-NEXT: paddd %xmm1, %xmm0	; SSE2-NEXT: paddd %xmm1, %xmm0
	; SSE2-NEXT: pxor %xmm1, %xmm0	; SSE2-NEXT: pxor %xmm1, %xmm0
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm0, %xmm13	; SSE2-NEXT: paddd %xmm0, %xmm1
		; SSE2-NEXT: movdqa %xmm1, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm9, %xmm0	; SSE2-NEXT: movdqa %xmm9, %xmm0
	; SSE2-NEXT: psrad $31, %xmm0	; SSE2-NEXT: psrad $31, %xmm0
	; SSE2-NEXT: paddd %xmm0, %xmm9	; SSE2-NEXT: paddd %xmm0, %xmm9
	; SSE2-NEXT: pxor %xmm0, %xmm9	; SSE2-NEXT: pxor %xmm0, %xmm9
	; SSE2-NEXT: paddd %xmm9, %xmm1	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload
		; SSE2-NEXT: paddd %xmm9, %xmm0
		; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa %xmm7, %xmm0	; SSE2-NEXT: movdqa %xmm7, %xmm0
	; SSE2-NEXT: psrad $31, %xmm0	; SSE2-NEXT: psrad $31, %xmm0
	; SSE2-NEXT: paddd %xmm0, %xmm7	; SSE2-NEXT: paddd %xmm0, %xmm7
Context not available.
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm7, %xmm0	; SSE2-NEXT: paddd %xmm7, %xmm0
	; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill	; SSE2-NEXT: movdqa %xmm0, -{{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm8 # 16-byte Reload	; SSE2-NEXT: movdqa %xmm13, %xmm1
	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm7 # 16-byte Reload	; SSE2-NEXT: movdqa %xmm1, %xmm0
	; SSE2-NEXT: movdqa %xmm7, %xmm0
	; SSE2-NEXT: psrad $31, %xmm0	; SSE2-NEXT: psrad $31, %xmm0
	; SSE2-NEXT: paddd %xmm0, %xmm7	; SSE2-NEXT: paddd %xmm0, %xmm1
	; SSE2-NEXT: pxor %xmm0, %xmm7	; SSE2-NEXT: pxor %xmm0, %xmm1
	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm7, %xmm0	; SSE2-NEXT: paddd %xmm1, %xmm0
		; SSE2-NEXT: movdqa %xmm0, {{[0-9]+}}(%rsp) # 16-byte Spill
	; SSE2-NEXT: addq $4, %rax	; SSE2-NEXT: addq $4, %rax
	; SSE2-NEXT: jne .LBB2_1	; SSE2-NEXT: jne .LBB2_1
	; SSE2-NEXT: # %bb.2: # %middle.block	; SSE2-NEXT: # %bb.2: # %middle.block
	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm3 # 16-byte Folded Reload	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload
	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm8 # 16-byte Folded Reload
	; SSE2-NEXT: paddd %xmm3, %xmm8
	; SSE2-NEXT: paddd %xmm2, %xmm15
	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm13 # 16-byte Folded Reload
	; SSE2-NEXT: paddd %xmm8, %xmm13
	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm5 # 16-byte Folded Reload
	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Folded Reload	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Folded Reload
	; SSE2-NEXT: paddd %xmm5, %xmm0	; SSE2-NEXT: movdqa -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm11, %xmm10
	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Folded Reload	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Folded Reload
	; SSE2-NEXT: paddd %xmm0, %xmm1	; SSE2-NEXT: paddd %xmm0, %xmm1
	; SSE2-NEXT: paddd %xmm10, %xmm1	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm0 # 16-byte Reload
	; SSE2-NEXT: paddd %xmm13, %xmm1	; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm0 # 16-byte Folded Reload
	; SSE2-NEXT: paddd %xmm15, %xmm1	; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm3 # 16-byte Reload
		; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm3 # 16-byte Folded Reload
		; SSE2-NEXT: paddd %xmm1, %xmm3
		; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
		; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm1 # 16-byte Folded Reload
		; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm4 # 16-byte Reload
		; SSE2-NEXT: paddd -{{[0-9]+}}(%rsp), %xmm4 # 16-byte Folded Reload
		; SSE2-NEXT: paddd %xmm1, %xmm4
		; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm2 # 16-byte Reload
		; SSE2-NEXT: paddd {{[0-9]+}}(%rsp), %xmm2 # 16-byte Folded Reload
		; SSE2-NEXT: movdqa {{[0-9]+}}(%rsp), %xmm1 # 16-byte Reload
		; SSE2-NEXT: paddd (%rsp), %xmm1 # 16-byte Folded Reload
		; SSE2-NEXT: paddd %xmm4, %xmm1
		; SSE2-NEXT: paddd %xmm2, %xmm1
		; SSE2-NEXT: paddd %xmm3, %xmm1
		; SSE2-NEXT: paddd %xmm0, %xmm1
	; SSE2-NEXT: pshufd {{.*#+}} xmm0 = xmm1[2,3,0,1]	; SSE2-NEXT: pshufd {{.*#+}} xmm0 = xmm1[2,3,0,1]
	; SSE2-NEXT: paddd %xmm1, %xmm0	; SSE2-NEXT: paddd %xmm1, %xmm0
	; SSE2-NEXT: pshufd {{.*#+}} xmm1 = xmm0[1,1,2,3]	; SSE2-NEXT: pshufd {{.*#+}} xmm1 = xmm0[1,1,2,3]
Context not available.

This is an archive of the discontinued LLVM Phabricator instance.

[Greedy RegAlloc] Take into account the cost of local intervals when selecting split candidate.ClosedPublic

Details

Diff Detail

Event Timeline

Before patch:

After patch:

Revision Contents

Diff 128178

include/llvm/CodeGen/LiveRegMatrix.h

lib/CodeGen/LiveRegMatrix.cpp

lib/CodeGen/RegAllocGreedy.cpp

test/CodeGen/X86/bug26810.ll

Before patch:

After patch:

test/CodeGen/X86/regalloc-advanced-split-cost.ll

test/CodeGen/X86/sad.ll

[Greedy RegAlloc] Take into account the cost of local intervals when selecting split candidate.
ClosedPublic