Diff 307111

llvm/lib/Target/ARM/ARMScheduleA57.td

	Show First 20 Lines • Show All 171 Lines • ▼ Show 20 Lines
	// RSB{S}, RSC{S}, SUB{S}, SBC{S}, TEQ, TST			// RSB{S}, RSC{S}, SUB{S}, SBC{S}, TEQ, TST

	def : InstRW<[A57Write_1cyc_1I], (instregex "tADDframe")>;			def : InstRW<[A57Write_1cyc_1I], (instregex "tADDframe")>;

	// shift by register, conditional or unconditional			// shift by register, conditional or unconditional
	// TODO: according to the doc, conditional uses I0/I1, unconditional uses M			// TODO: according to the doc, conditional uses I0/I1, unconditional uses M
	// Why more complex instruction uses more simple pipeline?			// Why more complex instruction uses more simple pipeline?
	// May be an error in doc.			// May be an error in doc.
	def A57WriteALUsi : SchedWriteVariant<[
	dmgreenUnsubmitted Not Done Reply Inline Actions This is "Move, shift by immed, no setflags" _and_ "Move, shift by immed, setflags"? I agree that the predicated pred should not matter, but there probably should be some difference between flag setting and not. I think the TODO above is referring to A57WriteALUSsr? I'm not sure why A57WriteALUsr is treated the same way though. From what I can see it should be using A57Write_1cyc_1. Is A57ReadALUsr worth keeping around? dmgreen: This is "Move, shift by immed, no setflags" _and_ "Move, shift by immed, setflags"? I agree…
	evgeny777AuthorUnsubmitted Done Reply Inline Actions Is A57ReadALUsr worth keeping around? I don't think it is, however I decided to keep it for now for testing purposes. I'm not sure why A57WriteALUsr is treated the same way though. From what I can see it should be using A57Write_1cyc_1. Why? From opt guide: ALU, shift by register, unconditional (same as above) 2 1 M ALU, shift by register, conditional (same as above) 2 1 I0/I1 evgeny777: > Is A57ReadALUsr worth keeping around? I don't think it is, however I decided to keep it for…
	dmgreenUnsubmitted Not Done Reply Inline Actions Hmm. Which opt guide is that from? I seem to see: Move, shift by immed, no setflags 1I Move, shift by immed, setflags 2M Move, shift by register, no setflags, unconditional 1I Move, shift by register, no setflags, conditional 2I Move, shift by register, setflags, unconditional 2M Move, shift by register, setflags, conditional 2I So the first 2 are currently in WriteALUsi (which should probably be split up to get it correct, but that is a separate issue. A better default is probably A57Write_1cyc_1I). A57WriteALUSsr is the last 2 and fits the opt guide correctly at least. A57WriteALUsr is the middle two, but should probably be using Pred:A57Write_2cyc_1I and NoPred: A57Write_1cyc_1I. dmgreen: Hmm. Which opt guide is that from? I seem to see: Move, shift by immed, no setflags 1I…
	evgeny777AuthorUnsubmitted Done Reply Inline Actions Hmm. Which opt guide is that from? I think we're both looking at the same one. I've copy-pasted from section 3.3 I seem to see: Right, but `Move, shift by register` (MOVsr) is bound to `WriteALU` (ARM version) and thumb version (t2MOVsr) is unbound. The following commands are currently bound to WriteALUsr: ADCrsr ADDrsr ANDrsr BICrsr EORrsr ORRrsr RSBrsr RSCrsr SBCrsr SUBrsr SXTAB SXTAB16 SXTAH UXTAB UXTAB16 UXTAH It seems ARM/Thumb instruction definition is incomplete and broken in many ways when it comes to scheduling. Patch however is more about simplifying adding new models, not fixing existing ones. evgeny777: > Hmm. Which opt guide is that from? I think we're both looking at the same one. I've copy…
	dmgreenUnsubmitted Not Done Reply Inline Actions Ah. I was thinking more about shifts than arithmetic operations. In that case, A57Write_2cyc_1M is probably a better default than A57Write_1cyc_1I. dmgreen: Ah. I was thinking more about shifts than arithmetic operations. In that case, A57Write_2cyc_1M…
	// lsl #2, lsl #1, or lsr #1.
	SchedVar<IsPredicatedPred, [A57Write_2cyc_1M]>,
	SchedVar<NoSchedPred, [A57Write_2cyc_1M]>
	]>;
	def A57WriteALUsr : SchedWriteVariant<[			def A57WriteALUsr : SchedWriteVariant<[
	SchedVar<IsPredicatedPred, [A57Write_2cyc_1I]>,			SchedVar<IsPredicatedPred, [A57Write_2cyc_1I]>,
	SchedVar<NoSchedPred, [A57Write_2cyc_1M]>			SchedVar<NoSchedPred, [A57Write_2cyc_1M]>
	]>;			]>;
	def A57WriteALUSsr : SchedWriteVariant<[			def A57WriteALUSsr : SchedWriteVariant<[
	SchedVar<IsPredicatedPred, [A57Write_2cyc_1I]>,			SchedVar<IsPredicatedPred, [A57Write_2cyc_1I]>,
	SchedVar<NoSchedPred, [A57Write_2cyc_1M]>			SchedVar<NoSchedPred, [A57Write_2cyc_1M]>
	]>;			]>;
	def A57ReadALUsr : SchedReadVariant<[			def A57ReadALUsr : SchedReadVariant<[
	SchedVar<IsPredicatedPred, [ReadDefault]>,			SchedVar<IsPredicatedPred, [ReadDefault]>,
	SchedVar<NoSchedPred, [ReadDefault]>			SchedVar<NoSchedPred, [ReadDefault]>
	]>;			]>;
	def : SchedAlias<WriteALUsi, A57WriteALUsi>;			def : SchedAlias<WriteALUsi, A57Write_2cyc_1M>;
	def : SchedAlias<WriteALUsr, A57WriteALUsr>;			def : SchedAlias<WriteALUsr, A57WriteALUsr>;
	def : SchedAlias<WriteALUSsr, A57WriteALUSsr>;			def : SchedAlias<WriteALUSsr, A57WriteALUSsr>;
	def : SchedAlias<ReadALUsr, A57ReadALUsr>;			def : SchedAlias<ReadALUsr, A57ReadALUsr>;

	def A57WriteCMPsr : SchedWriteVariant<[			def A57WriteCMPsr : SchedWriteVariant<[
	SchedVar<IsPredicatedPred, [A57Write_2cyc_1I]>,			SchedVar<IsPredicatedPred, [A57Write_2cyc_1I]>,
	SchedVar<NoSchedPred, [A57Write_2cyc_1M]>			SchedVar<NoSchedPred, [A57Write_2cyc_1M]>
	]>;			]>;
	▲ Show 20 Lines • Show All 1,296 Lines • Show Last 20 Lines

llvm/utils/TableGen/CodeGenSchedule.cpp

Show First 20 Lines • Show All 1,311 Lines • ▼ Show 20 Lines
struct PredTransition {		struct PredTransition {
// A predicate term is a conjunction of PredChecks.		// A predicate term is a conjunction of PredChecks.
SmallVector<PredCheck, 4> PredTerm;		SmallVector<PredCheck, 4> PredTerm;
SmallVector<SmallVector<unsigned,4>, 16> WriteSequences;		SmallVector<SmallVector<unsigned,4>, 16> WriteSequences;
SmallVector<SmallVector<unsigned,4>, 16> ReadSequences;		SmallVector<SmallVector<unsigned,4>, 16> ReadSequences;
SmallVector<unsigned, 4> ProcIndices;		SmallVector<unsigned, 4> ProcIndices;

PredTransition() = default;		PredTransition() = default;
PredTransition(ArrayRef<PredCheck> PT) {
PredTerm.assign(PT.begin(), PT.end());
ProcIndices.assign(1, 0);
}
PredTransition(ArrayRef<PredCheck> PT, ArrayRef<unsigned> PIds) {		PredTransition(ArrayRef<PredCheck> PT, ArrayRef<unsigned> PIds) {
PredTerm.assign(PT.begin(), PT.end());		PredTerm.assign(PT.begin(), PT.end());
ProcIndices.assign(PIds.begin(), PIds.end());		ProcIndices.assign(PIds.begin(), PIds.end());
}		}
};		};

// Encapsulate a set of partially constructed transitions.		// Encapsulate a set of partially constructed transitions.
// The results are built by repeated calls to substituteVariants.		// The results are built by repeated calls to substituteVariants.
Show All 12 Lines	public:
void substituteVariants(const PredTransition &Trans);		void substituteVariants(const PredTransition &Trans);

#ifndef NDEBUG		#ifndef NDEBUG
void dump() const;		void dump() const;
#endif		#endif

private:		private:
bool mutuallyExclusive(Record *PredDef, ArrayRef<PredCheck> Term);		bool mutuallyExclusive(Record *PredDef, ArrayRef<PredCheck> Term);
void getIntersectingVariants(		void getIntersectingVariants(const CodeGenSchedRW &SchedRW, unsigned TransIdx,
const CodeGenSchedRW &SchedRW, unsigned TransIdx,		std::vector<TransVariant> &IntersectingVariants,
		DenseMap<TransVariant, bool> &VarTracker);
		void addIntersectingVariant(unsigned TransIdx, TransVariant &Variant,
std::vector<TransVariant> &IntersectingVariants);		std::vector<TransVariant> &IntersectingVariants);
void pushVariant(const TransVariant &VInfo, bool IsRead);		void pushVariant(const TransVariant &VInfo, bool IsRead);
};		};

} // end anonymous namespace		} // end anonymous namespace

		template <> struct llvm::DenseMapInfo<TransVariant> {
		dmgreenUnsubmitted Not Done Reply Inline Actions Does this need to use a densemap? It seems to be being used to check whether the TransVariant have already been handled. Can it use a set or something simpler for that? dmgreen: Does this need to use a densemap? It seems to be being used to check whether the TransVariant…
		evgeny777AuthorUnsubmitted Done Reply Inline Actions Unfortunately it can't, because we not only need to check for same record definition, but also for same processor index (it's a bug in current implementation to ignore this). This is because same variant record may be shared between different processor models (like ReadAdrBase in ThunderX2T99 and ThunderX3T110) evgeny777: Unfortunately it can't, because we not only need to check for same record definition, but also…
		static inline TransVariant getEmptyKey() { return {nullptr, 0, 0, 0}; }
		static inline TransVariant getTombstoneKey() { return {nullptr, -1U, 0, 0}; }
		static unsigned getHashValue(const TransVariant &Val) { return Val.RWIdx; }
		static bool isEqual(const TransVariant &LHS, const TransVariant &RHS) {
		return LHS.VarOrSeqDef == RHS.VarOrSeqDef && LHS.ProcIdx == RHS.ProcIdx;
		}
		};

// Return true if this predicate is mutually exclusive with a PredTerm. This		// Return true if this predicate is mutually exclusive with a PredTerm. This
// degenerates into checking if the predicate is mutually exclusive with any		// degenerates into checking if the predicate is mutually exclusive with any
// predicate in the Term's conjunction.		// predicate in the Term's conjunction.
//		//
// All predicates associated with a given SchedRW are considered mutually		// All predicates associated with a given SchedRW are considered mutually
// exclusive. This should work even if the conditions expressed by the		// exclusive. This should work even if the conditions expressed by the
// predicates are not exclusive because the predicates for a given SchedWrite		// predicates are not exclusive because the predicates for a given SchedWrite
// are always checked in the order they are defined in the .td file. Later		// are always checked in the order they are defined in the .td file. Later
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	for (const PredTransition &PTI : Transitions) {
for (const SmallVectorImpl<unsigned> &RSI : PTI.ReadSequences)		for (const SmallVectorImpl<unsigned> &RSI : PTI.ReadSequences)
for (unsigned RI : RSI)		for (unsigned RI : RSI)
if (hasAliasedVariants(SchedModels.getSchedRead(RI), SchedModels))		if (hasAliasedVariants(SchedModels.getSchedRead(RI), SchedModels))
return true;		return true;
}		}
return false;		return false;
}		}

		void PredTransitions::addIntersectingVariant(
		unsigned TransIdx, TransVariant &Variant,
		std::vector<TransVariant> &IntersectingVariants) {
		if (Variant.VarOrSeqDef->isSubClassOf("SchedVar")) {
		Record *PredDef = Variant.VarOrSeqDef->getValueAsDef("Predicate");
		if (mutuallyExclusive(PredDef, TransVec[TransIdx].PredTerm))
		return;
		}
		if (IntersectingVariants.empty()) {
		// The first variant builds on the existing transition.
		Variant.TransVecIdx = TransIdx;
		IntersectingVariants.push_back(Variant);
		} else {
		// Push another copy of the current transition for more variants.
		Variant.TransVecIdx = TransVec.size();
		IntersectingVariants.push_back(Variant);
		TransVec.push_back(TransVec[TransIdx]);
		}
		}

// Populate IntersectingVariants with any variants or aliased sequences of the		// Populate IntersectingVariants with any variants or aliased sequences of the
// given SchedRW whose processor indices and predicates are not mutually		// given SchedRW whose processor indices and predicates are not mutually
// exclusive with the given transition.		// exclusive with the given transition.
void PredTransitions::getIntersectingVariants(		void PredTransitions::getIntersectingVariants(
const CodeGenSchedRW &SchedRW, unsigned TransIdx,		const CodeGenSchedRW &SchedRW, unsigned TransIdx,
std::vector<TransVariant> &IntersectingVariants) {		std::vector<TransVariant> &IntersectingVariants,
		DenseMap<TransVariant, bool> &VarTracker) {

bool GenericRW = false;		bool GenericRW = false;

std::vector<TransVariant> Variants;		std::vector<TransVariant> Variants;
if (SchedRW.HasVariants) {		if (SchedRW.HasVariants) {
unsigned VarProcIdx = 0;		unsigned VarProcIdx = 0;
if (SchedRW.TheDef->getValueInit("SchedModel")->isComplete()) {		if (SchedRW.TheDef->getValueInit("SchedModel")->isComplete()) {
Record *ModelDef = SchedRW.TheDef->getValueAsDef("SchedModel");		Record *ModelDef = SchedRW.TheDef->getValueAsDef("SchedModel");
Show All 31 Lines	void PredTransitions::getIntersectingVariants(
}		}
for (TransVariant &Variant : Variants) {		for (TransVariant &Variant : Variants) {
// Don't expand variants if the processor models don't intersect.		// Don't expand variants if the processor models don't intersect.
// A zero processor index means any processor.		// A zero processor index means any processor.
SmallVectorImpl<unsigned> &ProcIndices = TransVec[TransIdx].ProcIndices;		SmallVectorImpl<unsigned> &ProcIndices = TransVec[TransIdx].ProcIndices;
if (ProcIndices[0] && Variant.ProcIdx) {		if (ProcIndices[0] && Variant.ProcIdx) {
unsigned Cnt = std::count(ProcIndices.begin(), ProcIndices.end(),		unsigned Cnt = std::count(ProcIndices.begin(), ProcIndices.end(),
Variant.ProcIdx);		Variant.ProcIdx);
if (!Cnt)		if (!Cnt) {
		VarTracker.insert({Variant, false});
continue;		continue;
		}
if (Cnt > 1) {		if (Cnt > 1) {
const CodeGenProcModel &PM =		const CodeGenProcModel &PM =
*(SchedModels.procModelBegin() + Variant.ProcIdx);		*(SchedModels.procModelBegin() + Variant.ProcIdx);
PrintFatalError(Variant.VarOrSeqDef->getLoc(),		PrintFatalError(Variant.VarOrSeqDef->getLoc(),
"Multiple variants defined for processor " +		"Multiple variants defined for processor " +
PM.ModelName +		PM.ModelName +
" Ensure only one SchedAlias exists per RW.");		" Ensure only one SchedAlias exists per RW.");
}		}
}		}
if (Variant.VarOrSeqDef->isSubClassOf("SchedVar")) {		VarTracker[Variant] = true;
Record *PredDef = Variant.VarOrSeqDef->getValueAsDef("Predicate");		addIntersectingVariant(TransIdx, Variant, IntersectingVariants);
if (mutuallyExclusive(PredDef, TransVec[TransIdx].PredTerm))
continue;
}
if (IntersectingVariants.empty()) {
// The first variant builds on the existing transition.
Variant.TransVecIdx = TransIdx;
IntersectingVariants.push_back(Variant);
}
else {
// Push another copy of the current transition for more variants.
Variant.TransVecIdx = TransVec.size();
IntersectingVariants.push_back(Variant);
TransVec.push_back(TransVec[TransIdx]);
}
}		}
if (GenericRW && IntersectingVariants.empty()) {		if (GenericRW && IntersectingVariants.empty()) {
PrintFatalError(SchedRW.TheDef->getLoc(), "No variant of this type has "		PrintFatalError(SchedRW.TheDef->getLoc(), "No variant of this type has "
"a matching predicate on any processor");		"a matching predicate on any processor");
}		}
}		}

// Push the Reads/Writes selected by this variant onto the PredTransition		// Push the Reads/Writes selected by this variant onto the PredTransition
▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines
// RWSeq is a sequence of all Reads or all Writes for the next read or write		// RWSeq is a sequence of all Reads or all Writes for the next read or write
// operand. StartIdx is an index into TransVec where partial results		// operand. StartIdx is an index into TransVec where partial results
// starts. RWSeq must be applied to all transitions between StartIdx and the end		// starts. RWSeq must be applied to all transitions between StartIdx and the end
// of TransVec.		// of TransVec.
void PredTransitions::substituteVariantOperand(		void PredTransitions::substituteVariantOperand(
const SmallVectorImpl<unsigned> &RWSeq, bool IsRead, bool IsForAnyCPU,		const SmallVectorImpl<unsigned> &RWSeq, bool IsRead, bool IsForAnyCPU,
unsigned StartIdx) {		unsigned StartIdx) {

auto CollectAndAddVariants = [&](unsigned TransIdx,		auto PushVariants = [&](std::vector<TransVariant> &Variants, bool IsRead) {
const CodeGenSchedRW &SchedRW) {
// Distribute this partial PredTransition across intersecting variants.
// This will push a copies of TransVec[TransIdx] on the back of TransVec.
std::vector<TransVariant> IntersectingVariants;
getIntersectingVariants(SchedRW, TransIdx, IntersectingVariants);
// Now expand each variant on top of its copy of the transition.		// Now expand each variant on top of its copy of the transition.
for (const TransVariant &IV : IntersectingVariants)		for (const TransVariant &IV : Variants)
pushVariant(IV, IsRead);		pushVariant(IV, IsRead);
return !IntersectingVariants.empty();
};		};

// Visit each original RW within the current sequence.		// Visit each original RW within the current sequence.
for (SmallVectorImpl<unsigned>::const_iterator		for (SmallVectorImpl<unsigned>::const_iterator
RWI = RWSeq.begin(), RWE = RWSeq.end(); RWI != RWE; ++RWI) {		RWI = RWSeq.begin(), RWE = RWSeq.end(); RWI != RWE; ++RWI) {
const CodeGenSchedRW &SchedRW = SchedModels.getSchedRW(*RWI, IsRead);		const CodeGenSchedRW &SchedRW = SchedModels.getSchedRW(*RWI, IsRead);
// Push this RW on all partial PredTransitions or distribute variants.		// Push this RW on all partial PredTransitions or distribute variants.
// New PredTransitions may be pushed within this loop which should not be		// New PredTransitions may be pushed within this loop which should not be
// revisited (TransEnd must be loop invariant).		// revisited (TransEnd must be loop invariant).
bool HasAliases = false, WasPushed = false;		DenseMap<TransVariant, bool> VTracker;
for (unsigned TransIdx = StartIdx, TransEnd = TransVec.size();		for (unsigned TransIdx = StartIdx, TransEnd = TransVec.size();
TransIdx != TransEnd; ++TransIdx) {		TransIdx != TransEnd; ++TransIdx) {
// In the common case, push RW onto the current operand's sequence.		// In the common case, push RW onto the current operand's sequence.
if (!hasAliasedVariants(SchedRW, SchedModels)) {		if (!hasAliasedVariants(SchedRW, SchedModels)) {
if (IsRead)		if (IsRead)
TransVec[TransIdx].ReadSequences.back().push_back(*RWI);		TransVec[TransIdx].ReadSequences.back().push_back(*RWI);
else		else
TransVec[TransIdx].WriteSequences.back().push_back(*RWI);		TransVec[TransIdx].WriteSequences.back().push_back(*RWI);
continue;		continue;
}		}
HasAliases = true;		// Distribute this partial PredTransition across intersecting variants.
WasPushed \|= CollectAndAddVariants(TransIdx, SchedRW);		// This will push a copies of TransVec[TransIdx] on the back of TransVec.
		std::vector<TransVariant> IntersectingVariants;
		getIntersectingVariants(SchedRW, TransIdx, IntersectingVariants,
		VTracker);
		PushVariants(IntersectingVariants, IsRead);
}		}
if (IsRead && IsForAnyCPU && HasAliases && !WasPushed) {		if (IsRead && IsForAnyCPU) {
// If we're here this means that in some sched class:		// If we're here this means that in some sched class:
// a) We have read variant for CPU A		// a) We have read variant for CPU A
// b) We have write variant for CPU B		// b) We have write variant for CPU B
// b) We don't have write variant for CPU A		// b) We don't have write variant for CPU A
// d) We must expand all read/write variants (IsForAnyCPU is true)		// d) We must expand all read/write variants (IsForAnyCPU is true)
// e) We couldn't expand SchedRW because TransVec doesn't have		// e) We couldn't expand SchedRW or some of its variants, because
// any transition with compatible CPU ID.		// TransVec doesn't have any transition with compatible CPU ID.
// In such case we create new empty transition with zero (AnyCPU)		// In such case we create new empty transition with zero (AnyCPU)
// index.		// index.
TransVec.reserve(TransVec.size() + 1);		TransVec.reserve(TransVec.size() + 1);
TransVec.emplace_back(TransVec[StartIdx].PredTerm);		TransVec.emplace_back();
TransVec.back().ReadSequences.emplace_back();		TransVec.back().ReadSequences.emplace_back();
CollectAndAddVariants(TransVec.size() - 1, SchedRW);		std::vector<TransVariant> Variants;
		for (auto &P : VTracker)
		if (!P.second)
		addIntersectingVariant(TransVec.size() - 1, P.first, Variants);
		PushVariants(Variants, IsRead);
		// Remove empty transition, if we haven't found anything to push.
		if (Variants.empty())
		TransVec.pop_back();
}		}
}		}
}		}

// For each variant of a Read/Write in Trans, substitute the sequence of		// For each variant of a Read/Write in Trans, substitute the sequence of
// Read/Writes guarded by the variant. This is exponential in the number of		// Read/Writes guarded by the variant. This is exponential in the number of
// variant Read/Writes, but in practice detection of mutually exclusive		// variant Read/Writes, but in practice detection of mutually exclusive
// predicates should result in linear growth in the total number variants.		// predicates should result in linear growth in the total number variants.
▲ Show 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	static void dumpTransition(const CodeGenSchedModels &SchedModels,
LLVM_DEBUG(dbgs() << "Adding transition from " << FromSC.Name << "("		LLVM_DEBUG(dbgs() << "Adding transition from " << FromSC.Name << "("
<< FromSC.Index << ") to "		<< FromSC.Index << ") to "
<< SchedModels.getSchedClass(SCTrans.ToClassIdx).Name << "("		<< SchedModels.getSchedClass(SCTrans.ToClassIdx).Name << "("
<< SCTrans.ToClassIdx << ")"		<< SCTrans.ToClassIdx << ")"
<< " on processor indices: (";		<< " on processor indices: (";
dumpIdxVec(SCTrans.ProcIndices); dbgs() << ")\n");		dumpIdxVec(SCTrans.ProcIndices); dbgs() << ")\n");
}		}
// Create a new SchedClass for each variant found by inferFromRW. Pass		// Create a new SchedClass for each variant found by inferFromRW. Pass
static void inferFromTransitions(ArrayRef<PredTransition> LastTransitions,		static void inferFromTransitions(
unsigned FromClassIdx,		ArrayRef<PredTransition> LastTransitions,
CodeGenSchedModels &SchedModels) {		const SmallVectorImpl<SmallVector<unsigned, 4>> &InitialWrites,
		unsigned FromClassIdx, CodeGenSchedModels &SchedModels) {
// For each PredTransition, create a new CodeGenSchedTransition, which usually		// For each PredTransition, create a new CodeGenSchedTransition, which usually
// requires creating a new SchedClass.		// requires creating a new SchedClass.
for (ArrayRef<PredTransition>::iterator		for (ArrayRef<PredTransition>::iterator
I = LastTransitions.begin(), E = LastTransitions.end(); I != E; ++I) {		I = LastTransitions.begin(), E = LastTransitions.end(); I != E; ++I) {
IdxVec OperWritesVariant, OperReadsVariant;		IdxVec OperWritesVariant, OperReadsVariant;
addSequences(SchedModels, I->WriteSequences, OperWritesVariant, false);		addSequences(SchedModels, I->WriteSequences, OperWritesVariant, false);
addSequences(SchedModels, I->ReadSequences, OperReadsVariant, true);		addSequences(SchedModels, I->ReadSequences, OperReadsVariant, true);
CodeGenSchedTransition SCTrans;		CodeGenSchedTransition SCTrans;

// Transition should not contain processor indices already assigned to		// Transition should not contain processor indices already assigned to
// InstRWs in this scheduling class.		// InstRWs in this scheduling class.
const CodeGenSchedClass &FromSC = SchedModels.getSchedClass(FromClassIdx);		const CodeGenSchedClass &FromSC = SchedModels.getSchedClass(FromClassIdx);
llvm::copy_if(I->ProcIndices, std::back_inserter(SCTrans.ProcIndices),		llvm::copy_if(I->ProcIndices, std::back_inserter(SCTrans.ProcIndices),
[&FromSC](unsigned PIdx) {		[&FromSC](unsigned PIdx) {
return !FromSC.InstRWProcIndices.count(PIdx);		return !FromSC.InstRWProcIndices.count(PIdx);
});		});
if (SCTrans.ProcIndices.empty())		if (SCTrans.ProcIndices.empty())
continue;		continue;

		// Some sched classes may only have read variants. In such case we
		// populate writes from initially expanded sequences. We can do this,
		// because none of those writes is variant for any processor in
		// I->ProcIndices.
		if (OperWritesVariant.empty())
		addSequences(SchedModels, InitialWrites, OperWritesVariant, false);

		assert(!OperWritesVariant.empty() && "No writes in variant sched class");
SCTrans.ToClassIdx =		SCTrans.ToClassIdx =
SchedModels.addSchedClass(/ItinClassDef=/nullptr, OperWritesVariant,		SchedModels.addSchedClass(/ItinClassDef=/nullptr, OperWritesVariant,
OperReadsVariant, I->ProcIndices);		OperReadsVariant, I->ProcIndices);
dumpTransition(SchedModels, FromSC, SCTrans);		dumpTransition(SchedModels, FromSC, SCTrans);
// The final PredTerm is unique set of predicates guarding the transition.		// The final PredTerm is unique set of predicates guarding the transition.
RecVec Preds;		RecVec Preds;
transform(I->PredTerm, std::back_inserter(Preds),		transform(I->PredTerm, std::back_inserter(Preds),
[](const PredCheck &P) {		[](const PredCheck &P) {
Show All 36 Lines	for (unsigned ReadIdx : OperReads) {
expandRWSequence(ReadIdx, ReadSeq, /IsRead=/true);		expandRWSequence(ReadIdx, ReadSeq, /IsRead=/true);
LastTransitions[0].ReadSequences.emplace_back();		LastTransitions[0].ReadSequences.emplace_back();
SmallVectorImpl<unsigned> &Seq = LastTransitions[0].ReadSequences.back();		SmallVectorImpl<unsigned> &Seq = LastTransitions[0].ReadSequences.back();
Seq.append(ReadSeq.begin(), ReadSeq.end());		Seq.append(ReadSeq.begin(), ReadSeq.end());
LLVM_DEBUG(dbgs() << "("; dumpIdxVec(Seq); dbgs() << ") ");		LLVM_DEBUG(dbgs() << "("; dumpIdxVec(Seq); dbgs() << ") ");
}		}
LLVM_DEBUG(dbgs() << '\n');		LLVM_DEBUG(dbgs() << '\n');

		SmallVector<SmallVector<unsigned, 4>, 16> InitialWrites =
		LastTransitions[0].WriteSequences;
// Collect all PredTransitions for individual operands.		// Collect all PredTransitions for individual operands.
// Iterate until no variant writes remain.		// Iterate until no variant writes remain.
while (hasVariant(LastTransitions, *this)) {		while (hasVariant(LastTransitions, *this)) {
PredTransitions Transitions(*this);		PredTransitions Transitions(*this);
for (const PredTransition &Trans : LastTransitions)		for (const PredTransition &Trans : LastTransitions)
Transitions.substituteVariants(Trans);		Transitions.substituteVariants(Trans);
LLVM_DEBUG(Transitions.dump());		LLVM_DEBUG(Transitions.dump());
LastTransitions.swap(Transitions.TransVec);		LastTransitions.swap(Transitions.TransVec);
}		}
// If the first transition has no variants, nothing to do.		// If the first transition has no variants, nothing to do.
if (LastTransitions[0].PredTerm.empty())		if (LastTransitions[0].PredTerm.empty())
return;		return;

// WARNING: We are about to mutate the SchedClasses vector. Do not refer to		// WARNING: We are about to mutate the SchedClasses vector. Do not refer to
// OperWrites, OperReads, or ProcIndices after calling inferFromTransitions.		// OperWrites, OperReads, or ProcIndices after calling inferFromTransitions.
inferFromTransitions(LastTransitions, FromClassIdx, *this);		inferFromTransitions(LastTransitions, InitialWrites, FromClassIdx, *this);
}		}

// Check if any processor resource group contains all resource records in		// Check if any processor resource group contains all resource records in
// SubUnits.		// SubUnits.
bool CodeGenSchedModels::hasSuperGroup(RecVec &SubUnits, CodeGenProcModel &PM) {		bool CodeGenSchedModels::hasSuperGroup(RecVec &SubUnits, CodeGenProcModel &PM) {
for (unsigned i = 0, e = PM.ProcResourceDefs.size(); i < e; ++i) {		for (unsigned i = 0, e = PM.ProcResourceDefs.size(); i < e; ++i) {
if (!PM.ProcResourceDefs[i]->isSubClassOf("ProcResGroup"))		if (!PM.ProcResourceDefs[i]->isSubClassOf("ProcResGroup"))
continue;		continue;
▲ Show 20 Lines • Show All 494 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[TableGen][SchedModels] Fix read/write variant substitution #2
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 307111

llvm/lib/Target/ARM/ARMScheduleA57.td

llvm/utils/TableGen/CodeGenSchedule.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[TableGen][SchedModels] Fix read/write variant substitution #2ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 307111

llvm/lib/Target/ARM/ARMScheduleA57.td

llvm/utils/TableGen/CodeGenSchedule.cpp

[TableGen][SchedModels] Fix read/write variant substitution #2
ClosedPublic