This is an archive of the discontinued LLVM Phabricator instance.

[Greedy RegAlloc] Add logic to greedy reg alloc to avoid bad eviction chains
ClosedPublic

Authored by myatsina on Jul 24 2017, 2:01 PM.

Details

Summary

This fixes Bugzilla 26810:
https://bugs.llvm.org/show_bug.cgi?id=26810

This is intended to prevent sequences like:
movl %ebp, 8(%esp) # 4-byte Spill
movl %ecx, %ebp
movl %ebx, %ecx
movl %edi, %ebx
movl %edx, %edi
cltd
idivl %esi
movl %edi, %edx
movl %ebx, %edi
movl %ecx, %ebx
movl %ebp, %ecx
movl 16(%esp), %ebp # 4-byte Reload

Such sequences are created in 2 scenarios:

Scenario #1:
  1. vreg0 is evicted from physreg0 by vreg1.
  2. Evictee vreg0 is intended for region splitting with split candidate physreg0 (the register vreg0 was evicted from).
  3. Region splitting creates a local interval because of interference with the evictor vreg1 (normally region splitting creates two intervals, the "by reg" and "by stack" intervals; a local interval is created when interference occurs).
  4. One of the split intervals ends up evicting vreg2 from physreg1.
  5. Evictee vreg2 is intended for region splitting with split candidate physreg1.
  6. One of the split intervals ends up evicting vreg3 from physreg2, and so on, until someone spills.

Scenario #2:
  1. vreg0 is evicted from physreg0 by vreg1.
  2. vreg2 is evicted from physreg2 by vreg3, and so on.
  3. Evictee vreg0 is intended for region splitting with split candidate physreg1.
  4. Region splitting creates a local interval because of interference with the evictor vreg1.
  5. One of the split intervals ends up evicting the original evictor vreg1 back from physreg0 (the register vreg0 was evicted from).
  6. Another evictee vreg2 is intended for region splitting with split candidate physreg1.
  7. One of the split intervals ends up evicting vreg3 from physreg2, and so on, until someone spills.
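
In both scenarios, the telltale step is an eviction that merely reverses an earlier one. The following is a minimal, standalone sketch of how such "cat and mouse" evictions can be detected by remembering who evicted whom and from where; the class and member names are illustrative only and are not the patch's actual data structures.

#include <cstdint>
#include <unordered_map>
#include <utility>

// Illustrative IDs only; in LLVM these would be virtual/physical registers.
using VirtRegID = uint64_t;
using PhysRegID = uint16_t;

// Records, for every evicted virtual register, who evicted it and from which
// physical register. A split interval that tries to evict its own evictor
// from that same physical register is the start of a chain that ends in a
// spill.
class EvictionTrack {
  std::unordered_map<VirtRegID, std::pair<VirtRegID, PhysRegID>> Evictees;

public:
  void recordEviction(VirtRegID Evictee, VirtRegID Evictor, PhysRegID From) {
    Evictees[Evictee] = {Evictor, From};
  }

  // True if Candidate evicting Victim from PhysReg just reverses an earlier
  // eviction: Victim previously evicted Candidate from the same register.
  bool isBadEviction(VirtRegID Candidate, VirtRegID Victim,
                     PhysRegID PhysReg) const {
    auto It = Evictees.find(Candidate);
    return It != Evictees.end() && It->second.first == Victim &&
           It->second.second == PhysReg;
  }
};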

As compile time was a concern, I've added a flag to control whether we do cost calculations for the local intervals we expect to be created (it's on by default for the X86 target, off for the rest).
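
For reference, this kind of opt-in switch is typically expressed in LLVM with a cl::opt plus a target hook; the option name, description, and default shown below are assumptions for illustration rather than a quote of the patch.

#include "llvm/Support/CommandLine.h"

using namespace llvm;

// Hypothetical option name and description; a target (e.g. X86) would
// additionally opt in through a subtarget hook so the extra cost calculation
// stays off by default elsewhere.
static cl::opt<bool> ConsiderLocalIntervalCost(
    "consider-local-interval-cost", cl::Hidden,
    cl::desc("Consider the cost of local intervals created by a split "
             "candidate when choosing the best split candidate."),
    cl::init(false));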

Diff Detail

Repository
rL LLVM

Event Timeline

myatsina created this revision. Jul 24 2017, 2:01 PM
qcolombet requested changes to this revision. Jul 24 2017, 3:44 PM

Hi,

The greedy allocator is already very complicated and I am not sure the additional complexity of the eviction tracking is worth it.
Is it something that could be cleaned up in machine copy propagation? The problem is very local so that sounds doable.

I will have a closer look at the patch because fixing the problem from the start is obviously better than patching it up later, but given how rare this problem is, I really believe exploring other, less complex avenues is worthwhile.

Cheers,
Quentin

test/CodeGen/X86/bug26810.ll
1

Could you use a .mir test to make the test more robust?

3

That sounds wrong for a new test.

Testing should be positive as much as possible IMO.

This revision now requires changes to proceed. Jul 24 2017, 3:44 PM

Thank you for suggesting the machine copy propagation, I've started working on this direction, it definitely seems easier to implement it there.
On the other hand, if I understood correctly, one of the issues with the old LLVM register allocator (linear scan) was that it made a lot of decisions that the rewriter had to clean up afterwards, and Greedy was intended to avoid such decisions. I'm not sure whether this eviction chain falls into that category or not.

Thanks,
Marina

test/CodeGen/X86/bug26810.ll
1

Will do.

3

I wasn't very satisfied with this check either.
I'll make it into a positive test indeed.

I've checked the copy propagation pass feasibility -
I was able to catch a few new cases (probably because the weight increase I did in my Greedy patch wasn't high enough, but that's a heuristic and we might be able to tune it).
On the other hand, I'm now failing to catch all the cases that cross basic blocks, because this pass works at the basic-block level.

Based on this I think the solution should probably be kept in Greedy (+ possibly additional cleanup in the copy propagation pass).

Thanks,
Marina

Would a super block copy propagation pass work?
I believe the code in that pass should just work in such configuration.

When you say "super blocks", do you mean restructuring the CFG (using tail duplication) and making the common path linear so that it can be combined into one large basic block?
I haven't really seen this concept in LLVM (except for some "SuperBlock" in debug info, which seems to be unrelated), so if you have some references for me, that would be great.

If we're talking about somehow flattening and "ordering" the basic blocks, control flow, and loops, and scanning them looking for cross-block chains, then I don't think it's trivial.
It's not always legal to replace such chains (if someone uses or clobbers one of the registers in the middle of the chain, I can no longer do the replacement).
Here's one example: I cannot do the replacement "xmm0 = copy xmm3" + "xmm3 = copy xmm0", because if I reach bb2 from bb1 then xmm0 is part of the copy chain, but if I reach bb2 from bb3 it is not.

bb1:
xmm0 = copy xmm1
// fall through bb2

bb2:
xmm1 = copy xmm2
xmm2 = copy xmm3
...
xmm3 = copy xmm2
xmm2 = copy xmm1
xmm1 = copy xmm0
test
je bb3

bb3:
xmm0 = /* something */
test
je bb2

In order to properly identify this I need to do liveness analysis for each register suspected to be part of the copy chain. I need to check whether any clobber (or even a use of an "intermediate" value) can reach one of the BBs the chain is spread across; if so, I cannot do the replacement (a simplified sketch of this check follows the next point).

  • Also, I may have several suspected chains in parallel, which complicates things even more.
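
A simplified, standalone sketch of the legality check described above: given the registers of a suspected chain and every block any path of the chain passes through, bail out as soon as anything outside the chain defines or uses one of those registers. The types and names are illustrative, and computing the set of blocks on all chain paths from the CFG is precisely the non-trivial part.

#include <set>
#include <string>
#include <vector>

// One instruction, reduced to the registers it defines and uses.
struct Instr {
  std::set<std::string> Defs;
  std::set<std::string> Uses;
  bool IsChainCopy = false; // one of the copies forming the suspected chain
};

struct Block {
  std::vector<Instr> Instrs;
};

// Conservative legality check: the chain may only be rewritten if no
// instruction outside the chain touches any chain register on any path the
// chain spans.
bool chainIsRewritable(const std::vector<const Block *> &BlocksOnChainPaths,
                       const std::set<std::string> &ChainRegs) {
  for (const Block *B : BlocksOnChainPaths)
    for (const Instr &I : B->Instrs) {
      if (I.IsChainCopy)
        continue;
      for (const std::string &R : ChainRegs)
        if (I.Defs.count(R) || I.Uses.count(R))
          return false; // a use or clobber in the middle of the chain
    }
  return true;
}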

Please let me know if I understood the "super block copy propagation" correctly.

Thanks,
Marina

As far as I read (http://www.eecs.umich.edu/~mahlke/papers/1993/hwu_jsuper93.pdf), in order to create superblocks we need to identify traces using execution profile information and then do tail duplication to avoid multiple entrances.
According to the authors of this technique, the transformation by itself takes a significant amount of code and compile time.
I don't think this transformation is something we should do only for the sake of the machine copy propagation pass, as it adds significant complexity.
The decision to support this transformation, and the possible optimizations that could benefit from it, seems like an orthogonal discussion that is not directly related to the bad eviction chains I'm trying to solve.

Even if I do have some transformation to superblocks that have a single entry and multiple exits, I will still need to do liveness tracking over all possible paths to maintain correctness:

bb1:
xmm0 = copy xmm1
xmm1 = copy xmm2
xmm2 = copy xmm3
...

test
je bb3

bb2:
xmm3 = copy xmm2
xmm2 = copy xmm1
xmm1 = copy xmm0
return

bb3:
return

The path bb1->bb2 can benefit from the change, but it is not legal for me to make this change if a path like bb1->bb3 exists; I need to scan all paths.

For each suspected copy chain I will need to track a whole subtree of this superblock CFG, starting at the first copy of the chain.
I will need to make sure all possible paths from that first copy contain the whole chain and that no path clobbers one of the registers in the middle of that chain.
So I find myself doing some sort of liveness tracking here too.
I know my original solution added complexity to Greedy, but Greedy's decisions are the source of this issue, and it doesn't seem like we have an elegant way to clean up the consequences of those decisions when we're talking about cross-BB chains.

Thanks,
Marina

Have you had a chance to look at it yet?

Thanks,
Marina

Hi Marina,

Thanks for reminding me about this patch.

I was not able to look at it yet.
I will try to get to it in the next two weeks.

Cheers,
-Quentin

Hi Marina,

I had a quick look at the patch and I am not sure this is the right approach.
The patch tries to avoid splitting when it might be part of a bad eviction chain, but I would argue there is no such thing as a bad eviction chain. The evictions happened in order to relax the constraints on the allocation problem, and blocking the splitting won't help.
Now, unless Phabricator is not showing everything (it is acting weird since the patch is quite big), my understanding of the patch is that it does not actually prevent eviction chains; it just resorts to less fancy splitting heuristics, which happen to spread splitting decisions around instead of keeping them localized at region boundaries. Thus, we may still have eviction chains, but they may be harder to spot.

Generally speaking, split points are not bad. What is bad, though, is the fact that we make poor allocation decisions that prevent us from getting rid of them later. I would focus my effort on that front if I were you.
For instance, in the example from the PR, I believe the bunch of copies don't get coalesced because we choose the color for the less constrained live-range first. That is, if we were to allocate vreg79 first, then I believe vreg80 could use that as a hint, eliminating one of the copies. Same for vreg81 and vreg82, and so on.
However, what happens is that we allocate vreg80 first, and the allocation order makes it such that vreg79 won't be able to satisfy the hint, since what we pick interferes with what vreg79 can use. Given that we have the same structure before and after the split point, the live-ranges get allocated in the same order. Thus, the first mistake propagates through all the live-ranges (vreg79 prevents vreg81 from satisfying its hint, and so on).
For instance, if we were to delay vreg79, I believe we would satisfy all the hints.

There is a lot of speculation in what I described for the example from the PR. I will try to spend some time verifying if any of those changes would indeed fix this problem.
In particular, I believe the problem can be solved with some tweaks in:

  • TargetRegisterInfo::getRegAllocationHints (e.g., we could give a hint to vreg80 so that it avoids vreg79's interferences)
  • Priority when enqueueing live-ranges
  • Consider the cost of using a register that is going to create a broken hint down the road when assigning a color (similar idea to the first item)
  • Try to reconcile hints that are in the same region instead of one at a time

The last point is probably the one that will affect existing code the least and would thus probably be the easiest to qualify.

Cheers,
-Quentin

For the record, playing with the order helps, but to get the optimal (local) coloring here I would need to spend more time on the heuristic.
Again, the easiest fix is probably the hint reconciliation by region.
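
To make that last idea concrete, here is a minimal, standalone sketch of reconciling the copy hints of a whole region at once: collect the copy-connected live ranges and pick the physical register that the largest number of them can agree on. The data structures and names are illustrative and are not LLVM's APIs.

#include <cstdint>
#include <map>
#include <vector>

using PhysReg = uint16_t; // 0 means "no register"

// A member of a copy-connected chain of live ranges within one region.
struct HintedRange {
  bool Assigned;                // already given a physical register?
  PhysReg Current;              // valid only if Assigned
  std::vector<PhysReg> Allowed; // allocation candidates otherwise
};

// Pick the physical register that the most chain members could share, so the
// copies between them can later be coalesced away.
PhysReg reconcileHints(const std::vector<HintedRange> &Chain) {
  std::map<PhysReg, unsigned> Votes;
  for (const HintedRange &R : Chain) {
    if (R.Assigned)
      Votes[R.Current] += 1;
    else
      for (PhysReg P : R.Allowed)
        Votes[P] += 1;
  }
  PhysReg Best = 0;
  unsigned BestVotes = 0;
  for (const auto &V : Votes)
    if (V.second > BestVotes) {
      Best = V.first;
      BestVotes = V.second;
    }
  return Best;
}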

Hi Quentin,

I wouldn't say my patch tries to avoid splitting; rather, it tries to improve the calculation of the spill weight of split candidates:
When the register allocator decides to do a region split, it looks for the best physical register candidate for the split.
The best candidate is the one that will cause the minimal spill cost.
When calculating the spill cost of each candidate, the algorithm takes into account interference at the entry/exit of the basic blocks.
However, there may be interference local to a basic block, which later, during the split itself, will cause the creation of a new local interval (local to that basic block) on top of the "by reg" and "by stack" intervals that are created during the split.
The algorithm currently ignores the fact that this local interval may cause spills (and thus may increase the spill weight of this candidate for the split).

My solution is to try to predict whether this split candidate will cause the creation of local intervals and whether they in turn will cause spills, and to add their spill weights to the total weight.
By doing so, I try to make the spill weight calculation of each candidate more accurate and allow the algorithm to choose a more suitable candidate.

If a local interval is created then we have a few options for its allocation:

  1. The interval will be allocated to some free register: no additional spill cost needed.
  2. The interval may cause an eviction: in some cases this eviction is "bad" and guaranteed to cause a spill (it's "bad" when you're evicting the interval that evicted you, kind of like a cat and mouse game: somebody must lose here). In this patch I'm trying to predict whether the eviction is "bad" or not and to incorporate the spill weight of this interval (see the sketch after this list).
  3. The interval may spill: I've already encountered a case where the new local interval is in a hot loop and ends up spilling around all its uses; this spill cost wasn't considered when the candidate was chosen. I have a solution for this case which is based on parts of this patch.
  4. The interval may split: I guess there might be some spill cost to consider here as well, but I haven't explored this case yet.
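
As a rough illustration of how items 2 and 3 could feed into the candidate comparison, here is a simplified, standalone sketch; the struct and function names are made up for this example and are not the patch's actual interfaces.

#include <vector>

// Prediction for one local interval that a split candidate is expected to
// create inside a basic block.
struct LocalIntervalPrediction {
  bool CausesBadEviction; // would evict the interval that evicted us (item 2)
  bool MustSpill;         // e.g. sits in a hot loop, spills around uses (item 3)
  float SpillWeight;      // cost charged if it indeed ends up spilling
};

// Candidate cost = the usual entry/exit interference cost plus the spill
// weight of every predicted local interval that is expected to end in a spill.
float splitCandidateCost(float EntryExitCost,
                         const std::vector<LocalIntervalPrediction> &Locals) {
  float Cost = EntryExitCost;
  for (const LocalIntervalPrediction &LI : Locals)
    if (LI.CausesBadEviction || LI.MustSpill)
      Cost += LI.SpillWeight;
  return Cost;
}

The split candidate with the smallest such cost would then be preferred, which is what makes the weight calculation "more accurate" in the sense described above.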

I did see nice performance results with my current solution.
I will try to look into the hint reconciliation as well, but I do think that the current spill weight calculation of the split candidates is not accurate enough and we need to consider the effects of those local intervals.

Thanks,
Marina

gberry added a subscriber: gberry.Oct 11 2017, 1:16 PM
myatsina updated this revision to Diff 119158. Oct 16 2017, 7:41 AM
myatsina edited edge metadata.
myatsina edited the summary of this revision. (Show Details)

As compile time was a concern, I've added a flag to control whether we do cost calculations for the local intervals we expect to be created (it's on by default for the X86 target, off for the rest).
I've fixed the tests and some comments.

Do you have additional comments?

qcolombet accepted this revision. Oct 16 2017, 8:57 AM

Hi Marina,

Looks reasonable to me.
Do a second pass before committing to make sure everything follows the LLVM coding standards. I've highlighted a few problems.

I can help with that if you wish, but I figured you probably don't want to wait for me to do that :P.

Cheers,
-Quentin

lib/CodeGen/RegAllocGreedy.cpp
1

Check*

16

brief*

17

Use lower case for the first letter for methods.

24

Ditto

This revision is now accepted and ready to land. Oct 16 2017, 8:57 AM
This revision was automatically updated to reflect the committed changes.
myatsina marked 4 inline comments as done.
Herald added a project: Restricted Project. Nov 19 2020, 5:59 PM
Herald added a subscriber: pengfei.