This is an archive of the discontinued LLVM Phabricator instance.

Differential D15667

[MachineScheduler] Handle regmasks and allow calls to be rescheduled.
Needs Review · Public

Authored by jonpa on Dec 19 2015, 5:34 AM.

Details

Reviewers

MatzeB
atrick

Summary

I have experimented with rescheduling calls in MachineScheduler, and have a
patch I would like to offer for comments and review. As far as I could
understand from the comment above isSchedBoundary(), this is something that
may be wanted generally.

Or is it otherwise a good enough option to use the pre-RA (isel) scheduler
to handle this?

This patch adds:

Handling of regmasks while building the DAG, with a new ScheduleDAGInstrs
method 'addRegMaskDeps()'.

A new method 'clearCallsInDefsForReg()', factored out of addPhysRegDeps()
because addRegMaskDeps() needs the same code.

A new method SURegDefIsDead() that checks for a dead-def reg MO, and also
for a regmask, as opposed to calling registerDefIsDead(), which does not check
for the regmask.

A new SubtargetInfo virtual method 'MISchedRescheduleCalls()' and a new
CL option 'RescheduleCalls' to control the behaviour. Default is false, i.e.
no rescheduling of calls.
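
To illustrate the distinction drawn for SURegDefIsDead(), here is a minimal sketch of that check on simplified stand-in types. `Operand`, its fields, `makeDef`, `makeRegMask` and `regDefIsDead` are hypothetical names for illustration, not LLVM classes, and a regmask is modeled as a plain set of clobbered register numbers rather than LLVM's bitmask encoding.

```cpp
#include <set>
#include <utility>
#include <vector>

// Hypothetical stand-in for a machine operand (not the LLVM MachineOperand).
struct Operand {
  bool IsRegMask = false;
  std::set<unsigned> Clobbered; // used when IsRegMask: registers the mask clobbers
  unsigned Reg = 0;             // used otherwise: the register being defined
  bool IsDef = false;
  bool IsDead = false;
};

Operand makeDef(unsigned Reg, bool Dead) {
  Operand O;
  O.Reg = Reg;
  O.IsDef = true;
  O.IsDead = Dead;
  return O;
}

Operand makeRegMask(std::set<unsigned> Clobbered) {
  Operand O;
  O.IsRegMask = true;
  O.Clobbered = std::move(Clobbered);
  return O;
}

// True if the operands contain a dead def of Reg or a regmask clobbering it,
// and no live def of Reg.
bool regDefIsDead(const std::vector<Operand> &Ops, unsigned Reg) {
  bool HasDeadDef = false;
  for (const Operand &MO : Ops) {
    if (MO.IsRegMask) {
      if (MO.Clobbered.count(Reg))
        HasDeadDef = true;
    } else if (MO.IsDef && MO.Reg == Reg) {
      if (!MO.IsDead)
        return false; // a live def wins over any clobber
      HasDeadDef = true;
    }
  }
  return HasDeadDef;
}
```

In this model, a call whose regmask clobbers a register but which also carries a live def of it (a return value, say) is not reported as dead for that register.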

Diff Detail

Event Timeline

jonpa updated this revision to Diff 43302. Dec 19 2015, 5:34 AM

jonpa retitled this revision to [MachineScheduler] Handle regmasks and allow calls to be rescheduled.

jonpa updated this object.

jonpa added a subscriber: llvm-commits.

hfinkel added a reviewer: atrick. Feb 2 2016, 5:31 PM

The obvious high-level question is: What do we gain by scheduling calls, and what's the right way to model them? Are you just trying to reduce register pressure in the caller, or are there other benefits as well?

(I suspect there are gains to be had by scheduling calls to reduce register pressure, regardless of anything else, but it is not immediately clear from your patch that this is what would happen).

I noticed the comment phrasing including "currently" about the scheduler not handling calls, so I thought there might be some general interest.

For SystemZ this may be of interest also post-ra for the purpose of decoding, but that is just experimentation so far.

More scheduling freedom pre-ra would also be possible, and perhaps this would be good in some cases.

Otherwise, I am not really sure at the moment how needed this is.

In D15667#342810, @jonpa wrote:

I noticed the comment phrasing including "currently" about the scheduler not handling calls, so I thought there might be some general interest.

For SystemZ this may be of interest also post-ra for the purpose of decoding, but that is just experimentation so far.

More scheduling freedom pre-ra would also be possible, and perhaps this would be good in some cases.

Otherwise, I am not really sure at the moment how needed this is.

I believe there is general interest. I'm interested, however, in specifically what motivated you to look at this. Can you explain how decoding is relevant here?

Considering that different instructions may have different grouping rules, meaning that in some cases some instructions can only go in certain slots and so on, a greater freedom for the scheduler should generally be good. I think I became interested in this while looking at some functions with long chains of calls. But, as I said, this is just experimentation so far.

High level, I have the same questions as Hal. But I'm not opposed to adding the support so that targets can evaluate the possibility of call scheduling.

Please test correctness at least on x86 by running the InstructionShuffler with RescheduleCalls.

lib/CodeGen/MachineScheduler.cpp
401–406	Don't we still want to call `TII->isSchedulingBoundary` for calls? The target may treat some calls differently. How about this:
- Always honor the command line option if present (it's just for debugging/experimentation anyway): `if (RescheduleCalls.getNumOccurrences()) return !RescheduleCalls;`
- Otherwise simply defer to the target: `return TII->isSchedulingBoundary(MI, MBB, *MF);`
- By default, have `TargetInstrInfo::isSchedulingBoundary` return true for `isCall()`.
- Make sure any in-tree targets that override this method return true for `isCall()`.
- Post on llvm-dev explaining that the scheduler now works across calls. If you have an out-of-tree target that overrides `TargetInstrInfo::isSchedulingBoundary`, you now need to check `isCall` yourself.
lib/CodeGen/ScheduleDAGInstrs.cpp
324–329	I know this is copied code, but I don't understand why you break out of the loop here. Also, I don't understand why you don't just clear all calls in your case, since you're about to add the most recent call to the Defs set.
344–345	I think this is a redundant check (as it is in addPhysRegDataDeps).
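
The decision order suggested for lines 401–406 can be sketched on stand-in types. This is only an illustration under stated assumptions: `CmdLineFlag` models whether the option was given and its value, and `Instr`, `TargetHook` and `CallFriendlyTarget` are hypothetical simplifications, not the LLVM `cl::opt` or `TargetInstrInfo` interfaces.

```cpp
// Hypothetical model of a boolean command-line option: whether it was given
// on the command line, and its value if so.
struct CmdLineFlag {
  bool Given;
  bool Value;
};

// Hypothetical stand-in for a machine instruction.
struct Instr {
  bool IsCall;
  bool IsTerminator;
};

// Stand-in for the target hook. Following the suggestion, the default
// implementation treats calls as scheduling boundaries.
struct TargetHook {
  virtual ~TargetHook() = default;
  virtual bool isSchedulingBoundary(const Instr &MI) const {
    return MI.IsCall || MI.IsTerminator;
  }
};

// A target that opts in to scheduling across calls by overriding the hook.
struct CallFriendlyTarget : TargetHook {
  bool isSchedulingBoundary(const Instr &MI) const override {
    return MI.IsTerminator;
  }
};

bool isSchedBoundary(const Instr &MI, const TargetHook &TII,
                     CmdLineFlag RescheduleCalls) {
  // An explicitly given option always wins for (non-terminator) calls.
  if (MI.IsCall && !MI.IsTerminator && RescheduleCalls.Given)
    return !RescheduleCalls.Value;
  // Otherwise the target decides.
  return TII.isSchedulingBoundary(MI);
}
```

With this ordering the option acts purely as a debugging override, and a target changes the default only by overriding the hook.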

Please make sure we don't move reserved register definitions across calls. I'm not sure how those are reflected in the regmask.

Changed the isSchedBoundary() function per request so that TII is used for calls handling.

Changes in clearCallsInDefsForReg() per suggestions to simply erase all previous calls before adding the current one.
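
As a rough sketch of what erasing all previous calls for one register amounts to, the following uses `std::multimap` in place of LLVM's `Reg2SUnitsMap`. That substitution is an assumption made to keep the example self-contained, and the `SUnit` struct here is a hypothetical two-field stand-in.

```cpp
#include <map>

// Hypothetical stand-in for a scheduling unit.
struct SUnit {
  unsigned NodeNum;
  bool IsCall;
};

// Register number -> scheduling units recorded as defs of that register.
using Reg2SUnits = std::multimap<unsigned, const SUnit *>;

// Erase every call recorded as a def of Reg; non-call defs are kept.
void clearCallsInDefsForReg(Reg2SUnits &Defs, unsigned Reg) {
  auto Range = Defs.equal_range(Reg);
  for (auto I = Range.first; I != Range.second;) {
    if (I->second->IsCall)
      I = Defs.erase(I); // erase returns the next valid iterator
    else
      ++I;
  }
}
```

The caller would then insert the current call as the only call entry for that register, which keeps the per-register list from growing with the number of calls in the block.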

New method addCallDeps(), to implement deps for reserved registers. I was not sure if all calls are guaranteed to have a regmask operand - in that case this method could have been eliminated and handled in addRegMaskDeps().

-misched=shuffle passes on X86.

Regarding the current discussion of compile time: this patch will make DAGs bigger, and scheduling potentially slower. Perhaps this should be checked?

With the test-suite on X86 (with default scheduler), I got:

Results

IMPROVED : 251
REGRESSED : 234

Herald added a subscriber: MatzeB. Apr 12 2016, 5:05 AM

It would be nice to get this feature in. It's encouraging that misched=shuffle passes.

Matthias, could you review the register dependencies at call sites and comment on the regmask handling?

For this to be a real feature, we need test cases and a way for subtargets to opt-in.

MatzeB added inline comments. Apr 27 2016, 1:21 PM

lib/CodeGen/ScheduleDAGInstrs.cpp
324–329	+1 Also arguments to calls are not necessarily dead, they may reside in callee saved registers. Simply forgetting about all call-induced Defs seems invalid to me.
334–336	Have you actually tried without this code? If the call potentially reads/writes a reserved register then it must announce that with a use/def machine operand. If it does not, this is a bug that should be fixed in the target IMO.
360–361	Don't repeat the function name in the doxygen comment.
366	Variable names should start with uppercase letter.
367	Avoid indentation with `if (!MO.clobbersPhysReg(reg)) continue;`
371	Register masks already contain all registers, so it should not be necessary to iterate over the aliases.
378	Variable name should start with upper case letter. This call could be deferred until we know that `DefSU != SU`.
379	Isn't `defOp` equivalent to `!SURegDefIsDead()`?
380	Maybe introduce a variable `unsigned AliasReg = *Alias;` so we don't need to remind ourselves with a comment that `Alias` is a register...
386–387	see above, I would expect it to simply erase all previous Defs for this register `reg`.
609–610	`\p SU` and `\p Reg` is better doxygen style.
611	I'd tend to pass a `const MachineInstr &` instead of `const SUnit*` here (nullptr is invalid and MachineInstr instead of SUnit makes the function useful in more contexts).
612	No space before `(`.
613	upper case.
615	Please avoid `auto` in cases where the type is not obvious, we also tend to use the variable name `MO` for machine operands in most other loops.
618–623	Do we need to take register aliases into account here?
626	no braces.
1035–1036	You can add a `continue;` here
1053–1054	See my comment in addCallDeps(). I really think calls should not be special in any way regarding register dependencies...
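
The comments on lines 367 and 371 can be illustrated together. This is a minimal sketch, assuming the regmask layout that `MachineOperand::clobbersPhysReg` tests against (one bit per physical register, packed into 32-bit words, where a set bit means the register is preserved). The register numbers and the helper `clobberedRegs` are hypothetical.

```cpp
#include <cstdint>
#include <vector>

// A clear bit means the register is clobbered; a set bit means preserved.
bool clobbersPhysReg(const std::vector<uint32_t> &RegMask, unsigned PhysReg) {
  return !(RegMask[PhysReg / 32] & (1u << (PhysReg % 32)));
}

// Collect every clobbered register with one pass over the register numbers.
// Each register has its own bit, so no alias iteration is performed here.
// Register 0 is skipped because it conventionally means "no register".
std::vector<unsigned> clobberedRegs(const std::vector<uint32_t> &RegMask,
                                    unsigned NumRegs) {
  std::vector<unsigned> Out;
  for (unsigned Reg = 1; Reg < NumRegs; ++Reg) {
    if (!clobbersPhysReg(RegMask, Reg))
      continue; // early continue avoids nesting the loop body
    Out.push_back(Reg);
  }
  return Out;
}
```

Whether aliases still need separate handling when looking up earlier defs is the open question raised in the review; the sketch only shows that the mask itself can be queried per register.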

qiucf added a subscriber: qiucf. Mar 30 2021, 11:14 PM

Herald added a subscriber: javed.absar. Mar 30 2021, 11:14 PM

Revision Contents

Path	Size
include/llvm/CodeGen/ScheduleDAGInstrs.h	7 lines
lib/CodeGen/MachineScheduler.cpp	20 lines
lib/CodeGen/ScheduleDAGInstrs.cpp	126 lines

Diff 53384

include/llvm/CodeGen/ScheduleDAGInstrs.h

[305 lines hidden; context: public:]

   /// Return a label for the region of code covered by the DAG.
   std::string getDAGName() const override;

   /// \brief Fix register kill flags that scheduling has made invalid.
   void fixupKills(MachineBasicBlock *MBB);
 protected:
   void initSUnits();
+  void clearCallsInDefsForReg(unsigned Reg);
   void addPhysRegDataDeps(SUnit *SU, unsigned OperIdx);
+  void addCallDeps(SUnit *SU);
+  void addRegMaskDeps(SUnit *SU, unsigned OperIdx);
   void addPhysRegDeps(SUnit *SU, unsigned OperIdx);
   void addVRegDefDeps(SUnit *SU, unsigned OperIdx);
   void addVRegUseDeps(SUnit *SU, unsigned OperIdx);

+  /// Check if the MI of SU has a dead def of Reg, or if MI clobbers
+  /// it according to a regmask.
+  bool SURegDefIsDead(const SUnit *SU, unsigned Reg);
+
   /// \brief PostRA helper for rewriting kill flags.
   void startBlockForKills(MachineBasicBlock *BB);

   /// \brief Toggle a register operand kill flag.
   ///
   /// Other adjustments may be made to the instruction if necessary. Return
   /// true if the operand has been deleted, false if not.
   bool toggleKillFlag(MachineInstr *MI, MachineOperand &MO);
[30 lines hidden]

lib/CodeGen/MachineScheduler.cpp

[75 lines hidden]

 // Experimental heuristics
 static cl::opt<bool> EnableMacroFusion("misched-fusion", cl::Hidden,
   cl::desc("Enable scheduling for macro fusion."), cl::init(true));

 static cl::opt<bool> VerifyScheduling("verify-misched", cl::Hidden,
   cl::desc("Verify machine instrs before and after machine scheduling"));

+static cl::opt<bool> RescheduleCalls(
+  "resched-calls",
+  cl::desc("Don't treat calls as scheduling boundaries in the machine "
+           "instruction scheduling pass."), cl::init(true),
+  cl::Hidden);
+
 // DAG subtrees must have at least this many nodes.
 static const unsigned MinSubtreeSize = 8;

 // Pin the vtables to this file.
 void MachineSchedStrategy::anchor() {}
 void ScheduleDAGMutation::anchor() {}

 //===----------------------------------------------------------------------===//
[291 lines hidden; context: bool PostMachineScheduler::runOnMachineFunction(MachineFunction &mf) {]

   if (VerifyScheduling)
     MF->verify(this, "After post machine scheduling.");
   return true;
 }

 /// Return true of the given instruction should not be included in a scheduling
 /// region.
-///
-/// MachineScheduler does not currently support scheduling across calls. To
-/// handle calls, the DAG builder needs to be modified to create register
-/// anti/output dependencies on the registers clobbered by the call's regmask
-/// operand. In PreRA scheduling, the stack pointer adjustment already prevents
-/// scheduling across calls. In PostRA scheduling, we need the isCall to enforce
-/// the boundary, but there would be no benefit to postRA scheduling across
-/// calls this late anyway.
 static bool isSchedBoundary(MachineBasicBlock::iterator MI,
                             MachineBasicBlock *MBB,
                             MachineFunction *MF,
                             const TargetInstrInfo *TII) {
-  return MI->isCall() || TII->isSchedulingBoundary(MI, MBB, *MF);
+  // Calls rescheduling may be controlled by CL option.
+  if (MI->isCall() && !MI->isTerminator() && RescheduleCalls.getNumOccurrences())
+    return !RescheduleCalls;
+
+  return TII->isSchedulingBoundary(MI, MBB, *MF);
 }
atrick: Don't we still want to call TII->isSchedulingBoundary for calls? The target may treat some…

 /// Main driver for both MachineScheduler and PostMachineScheduler.
 void MachineSchedulerBase::scheduleRegions(ScheduleDAGInstrs &Scheduler,
                                            bool FixKillFlags) {
   const TargetInstrInfo *TII = MF->getSubtarget().getInstrInfo();

   // Visit all machine basic blocks.
   //
[3,079 lines hidden]

lib/CodeGen/ScheduleDAGInstrs.cpp

[306 lines hidden; context: for (Reg2SUnitsMap::iterator I = Uses.find(*Alias); I != Uses.end(); ++I) {]
                 UseOp));

       ST.adjustSchedDependency(SU, UseSU, Dep);
       UseSU->addPred(Dep);
     }
   }
 }

+void ScheduleDAGInstrs::clearCallsInDefsForReg(unsigned Reg) {
+  // Calls will not be reordered because of chain dependencies (see
+  // below). Since call operands are dead, calls may continue to be added
+  // to the DefList making dependence checking quadratic in the size of
+  // the block. Instead, we leave only one call at the back of the
+  // DefList, which will be added after this.
+  Reg2SUnitsMap::RangePair P = Defs.equal_range(Reg);
+  Reg2SUnitsMap::iterator B = P.first;
+  Reg2SUnitsMap::iterator I = P.second;
+  for (bool isBegin = I == B; !isBegin; /* empty */) {
+    isBegin = (--I) == B;
+    if (I->SU->isCall)
+      I = Defs.erase(I);
+  }
+}
atrick: I know this is copied code, but I don't understand why you break out of the loop here. Also, I…
MatzeB: +1 Also arguments to calls are not necessarily dead, they may reside in callee saved registers.

+void ScheduleDAGInstrs::addCallDeps(SUnit *SU) {
+  assert(SU->isCall);
+
+  // Make sure we don't move reserved register definitions across
+  // calls. (This would be unnecessary if they were guaranteed to
+  // always be part of a regmask operand on each call. Are they?)
MatzeB: Have you actually tried without this code. If the call potentially reads/writes a reserved…
+  for (unsigned reg = 1; reg < TRI->getNumRegs(); reg++) {
+    if (MRI.isReserved(reg)) {
+      // Add output depencencies on all reserved registers.
+      for (MCRegAliasIterator Alias(reg, TRI, true); Alias.isValid(); ++Alias) {
+        for (Reg2SUnitsMap::iterator I = Defs.find(*Alias); I != Defs.end(); ++I) {
+          SUnit *DefSU = I->SU;
+          if (DefSU == &ExitSU)
+            continue;
+
atrick: I think this is a redundant check (as it is in addPhysRegDataDeps).
+          bool defOp = DefSU->getInstr()->definesRegister(*Alias);
+          if (DefSU != SU && defOp) {
+            SDep Dep(SU, SDep::Output, /*Reg=*/*Alias);
+            DefSU->addPred(Dep);
+          }
+        }
+      }
+
+      clearCallsInDefsForReg(reg);
+      Defs.insert(PhysRegSUOper(SU, -1, reg));
+    }
+  }
+}

+/// addRegMaskDeps - Handle regmasks to be able to reschedule around
+/// calls.
MatzeB: Don't repeat the function name in the doxygen comment.
+void ScheduleDAGInstrs::addRegMaskDeps(SUnit *SU, unsigned OperIdx) {
+  MachineInstr *MI = SU->getInstr();
+  MachineOperand &MO = MI->getOperand(OperIdx);
+
+  for (unsigned reg = 1; reg < TRI->getNumRegs(); reg++) {
MatzeB: Variable names should start with uppercase letter.
+    if (MO.clobbersPhysReg(reg)) {
MatzeB: Avoid indentation with `if (!MO.clobbersPhysReg(reg)) continue;`
+      // Add output depencencies on all clobberd registers. Calls are
+      // expected to have register operands for in/out arguments, so
+      // they are not handled here.
+      for (MCRegAliasIterator Alias(reg, TRI, true); Alias.isValid(); ++Alias) {
MatzeB: register masks already contain all register, it should not be necessary to iterate over the…
+        for (Reg2SUnitsMap::iterator I = Defs.find(*Alias); I != Defs.end(); ++I) {
+          SUnit *DefSU = I->SU;
+          if (DefSU == &ExitSU)
+            continue;
+
+          // Don't add dependency to another dead def or another regmask.
+          bool defOp = DefSU->getInstr()->definesRegister(*Alias);
MatzeB: Variable name should start with upper case letter. This call could be deferred until we know…
+          if (DefSU != SU && defOp && !SURegDefIsDead(DefSU, *Alias)) {
MatzeB: Isn't `defOp` equivalent to `!SURegDefIsDead()`?
+            SDep Dep(SU, SDep::Output, /*Reg=*/*Alias);
MatzeB: Maybe introduce a variable `unsigned AliasReg = *Alias`; so we don't need to remind ourself…
+            DefSU->addPred(Dep);
+          }
+        }
+      }
+
+      if (SU->isCall)
+        clearCallsInDefsForReg(reg);
MatzeB: see above, I would expect it to simply erase all previous Defs for this register `reg`.
+      Defs.insert(PhysRegSUOper(SU, -1, reg));
+    }
+  }
+}

 /// addPhysRegDeps - Add register dependencies (data, anti, and output) from
 /// this SUnit to following instructions in the same scheduling region that
 /// depend the physical register referenced at OperIdx.
 void ScheduleDAGInstrs::addPhysRegDeps(SUnit *SU, unsigned OperIdx) {
   MachineInstr *MI = SU->getInstr();
   MachineOperand &MO = MI->getOperand(OperIdx);

   // Optionally add output and anti dependencies. For anti
   // dependencies we use a latency of 0 because for a multi-issue
   // target we want to allow the defining instruction to issue
   // in the same cycle as the using instruction.
   // TODO: Using a latency of 1 here for output dependencies assumes
   //       there's no cost for reusing registers.
   SDep::Kind Kind = MO.isUse() ? SDep::Anti : SDep::Output;
   for (MCRegAliasIterator Alias(MO.getReg(), TRI, true);
        Alias.isValid(); ++Alias) {
     if (!Defs.contains(*Alias))
       continue;
     for (Reg2SUnitsMap::iterator I = Defs.find(*Alias); I != Defs.end(); ++I) {
       SUnit *DefSU = I->SU;
       if (DefSU == &ExitSU)
         continue;
       if (DefSU != SU &&
           (Kind != SDep::Output || !MO.isDead() ||
-           !DefSU->getInstr()->registerDefIsDead(*Alias))) {
+           !SURegDefIsDead(DefSU, *Alias))) {
         if (Kind == SDep::Anti)
           DefSU->addPred(SDep(SU, Kind, /*Reg=*/*Alias));
         else {
           SDep Dep(SU, Kind, /*Reg=*/*Alias);
           Dep.setLatency(
             SchedModel.computeOutputLatency(MI, OperIdx, DefSU->getInstr()));
           DefSU->addPred(Dep);
         }
[13 lines hidden; context: void ScheduleDAGInstrs::addPhysRegDeps(SUnit *SU, unsigned OperIdx) {]
   else {
     addPhysRegDataDeps(SU, OperIdx);
     unsigned Reg = MO.getReg();

     // clear this register's use list
     if (Uses.contains(Reg))
       Uses.eraseAll(Reg);

-    if (!MO.isDead()) {
+    if (!MO.isDead())
       Defs.eraseAll(Reg);
-    } else if (SU->isCall) {
-      // Calls will not be reordered because of chain dependencies (see
-      // below). Since call operands are dead, calls may continue to be added
-      // to the DefList making dependence checking quadratic in the size of
-      // the block. Instead, we leave only one call at the back of the
-      // DefList.
-      Reg2SUnitsMap::RangePair P = Defs.equal_range(Reg);
-      Reg2SUnitsMap::iterator B = P.first;
-      Reg2SUnitsMap::iterator I = P.second;
-      for (bool isBegin = I == B; !isBegin; /* empty */) {
-        isBegin = (--I) == B;
-        if (!I->SU->isCall)
-          break;
-        I = Defs.erase(I);
-      }
-    }
+    else if (SU->isCall)
+      clearCallsInDefsForReg(Reg);

     // Defs are pushed in the order they are visited and never reordered.
     Defs.insert(PhysRegSUOper(SU, OperIdx, Reg));
   }
 }

 LaneBitmask ScheduleDAGInstrs::getLaneMaskForMO(const MachineOperand &MO) const
 {
[142 lines hidden; context: if ((PrevDefLaneMask & LaneMask) == 0)]
       continue;
     if (V2SU.SU == SU)
       continue;

     V2SU.SU->addPred(SDep(SU, SDep::Anti, Reg));
   }
 }

+/// Return true if SU has a dead register def operand of Reg, or a
+/// regmask that clobbers it, without having a live def of it as well.
MatzeB: `\p SU` and `\p Reg` is better doxygen style.
+bool ScheduleDAGInstrs::SURegDefIsDead(const SUnit *SU, unsigned Reg) {
MatzeB: I'd tend to pass a `const MachineInstr &` instead of `const SUnit*` here (nullptr is invalid…
+  assert (TRI->isPhysicalRegister(Reg));
MatzeB: No space before `(`.
+  bool hasDeadDef = false;
MatzeB: upper case.
+  MachineInstr *MI = SU->getInstr();
+  for (const auto &I : MI->operands()) {
MatzeB: Please avoid `auto` in cases where the type is not obvious, we also tend to use the variable…
+    if (I.isRegMask() && I.clobbersPhysReg(Reg))
+      hasDeadDef = true;
+    else if (I.isReg() && I.isDef() && I.getReg() == Reg) {
+      if (I.isDead())
+        hasDeadDef = true;
+      else
+        return false;
+    }
MatzeB: Do we need to take register aliases into account here?
+  }
+
+  return (hasDeadDef);
MatzeB: no braces.
+}

 /// Return true if MI is an instruction we are unable to reason about
 /// (like a call or something with unmodeled side effects).
 static inline bool isGlobalMemoryObject(AliasAnalysis *AA, MachineInstr *MI) {
   return MI->isCall() || MI->hasUnmodeledSideEffects() ||
          (MI->hasOrderedMemoryRef() && !MI->isInvariantLoad(AA));
 }

 /// This returns true if the two MIs need a chain edge between them.
[390 lines hidden; context: for (MachineBasicBlock::iterator MII = RegionEnd, MIE = RegionBegin;]
     assert(
         (CanHandleTerminators || (!MI->isTerminator() && !MI->isPosition())) &&
         "Cannot schedule terminators or labels!");

     // Add register-based dependencies (data, anti, and output).
     bool HasVRegDef = false;
     for (unsigned j = 0, n = MI->getNumOperands(); j != n; ++j) {
       const MachineOperand &MO = MI->getOperand(j);
+      if (MO.isRegMask())
+        addRegMaskDeps(SU, j);
MatzeB: You can add a `continue;` here
       if (!MO.isReg()) continue;
       unsigned Reg = MO.getReg();
       if (Reg == 0) continue;

       if (TRI->isPhysicalRegister(Reg))
         addPhysRegDeps(SU, j);
       else {
         if (MO.isDef()) {
           HasVRegDef = true;
           addVRegDefDeps(SU, j);
         }
         else if (MO.readsReg()) // ignore undef operands
           addVRegUseDeps(SU, j);
       }
     }

+    if (SU->isCall)
+      addCallDeps(SU);
MatzeB: See my comment in addCallDeps(). I really think calls should not be special in any way…
+
     // If we haven't seen any uses in this scheduling region, create a
     // dependence edge to ExitSU to model the live-out latency. This is required
     // for vreg defs with no in-region use, and prefetches with no vreg def.
     //
     // FIXME: NumDataSuccs would be more precise than NumSuccs here. This
     // check currently relies on being called before adding chain deps.
     if (SU->NumSuccs == 0 && SU->Latency > 1
         && (HasVRegDef || MI->mayLoad())) {
[726 lines hidden]