Download Raw Diff

Details

Reviewers

spatel
RKSimon
uweigand
dmgreen

Commits

rGfb3ba3802188: [CostModel] Remove getExtCost

Summary

This has not been implemented by any backends, which appear to cover the functionality through getCastInstrCost. Sink what there is in the default implementation into BasicTTI.

Diff Detail

Event Timeline

samparker created this revision.Apr 27 2020, 6:06 AM

Herald added a project: Restricted Project. · View Herald TranscriptApr 27 2020, 6:06 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

samparker added a parent revision: D78918: [CostModel] Modify BasicTTI getCastInstrCost.Apr 27 2020, 6:07 AM

Harbormaster failed remote builds in B54789: Diff 260297!Apr 27 2020, 6:57 AM

RKSimon added inline comments.Apr 27 2020, 7:39 AM

llvm/include/llvm/CodeGen/BasicTTIImpl.h
724	This doesn't look as effective as the getTLI()->isExtLoad call in getExtCost - missing one use checks etc.

samparker mentioned this in D78937: [CostModel] Use isExtLoad in BasicTTI.Apr 27 2020, 8:57 AM

samparker marked an inline comment as done.

samparker added inline comments.

llvm/include/llvm/CodeGen/BasicTTIImpl.h
724	Thanks, I hadn't done that intentionally. I've uploaded that change in D78937 because it stops this patch from looking like an NFC. I'm not sure whether the vectorization changes are good/expected.

Ping.

samparker mentioned this in D78552: [TTI] Use getCastInstrCost for getUserCost Exts.May 4 2020, 3:19 AM

After turning on asserts, two inlining tests were failing for AArch64 and PowerPC. I've now updated their implementations of getCastInstrCost to pass the Instruction to the default implementation. I've also done the same for the ARM backend.

Herald added subscribers: kbarton, nemanjai. · View Herald TranscriptMay 4 2020, 8:08 AM

spatel mentioned this in D78997: [SLP] add another bailout for load-combine patterns.May 5 2020, 7:52 AM

samparker mentioned this in D79483: [CostModel] Replace getUserCost with getInstructionCost..May 6 2020, 5:27 AM

It may be better to separate out the "add I to getCastInstrCost" change, which is something I agree with but seems to deserve it's own patch to get the details correct.

llvm/test/Analysis/CostModel/ARM/cast.ll
1669 ↗	(On Diff #261824)	I don't think this should be free for Neon, as far as I understand. It doesn't fold the extend into the load like it would for MVE or scalar.

Herald added a subscriber: • wuzish. · View Herald TranscriptMay 6 2020, 11:39 PM

Yeah, fair enough.

Spun out ARM changes into D79561.

AArch64 change: D79562.

Rebased on top of the arm and aarch64 changes.

samparker retitled this revision from [CostModel] Remove getExtCost to [NFCI][CostModel] Remove getExtCost.May 7 2020, 3:54 AM

Rebased with the new aarch64 tests.

samparker mentioned this in D78547: [TTI] getUserCost to return getCastInstrCost.May 13 2020, 12:08 AM

samparker added a child revision: D79848: [CostModel] Unify getCastInstrCost.May 13 2020, 4:13 AM

ping.

LGTM

This revision is now accepted and ready to land.May 20 2020, 6:27 AM

dmgreen added inline comments.May 20 2020, 7:31 AM

llvm/include/llvm/CodeGen/BasicTTIImpl.h
717	I don't believe this will be correct from the vectorizer. It can pass a context instruction that has different types to the final IR due to it truncating at the same time it vectorizes. It is generally unsound to rely on I being accurate.

samparker marked an inline comment as done.May 20 2020, 7:45 AM

samparker added inline comments.

llvm/include/llvm/CodeGen/BasicTTIImpl.h
717	I'm sure there's going to be many cases like that, and the vectorizer can always choose to not pass the instruction.

dmgreen added inline comments.May 20 2020, 9:05 AM

llvm/include/llvm/CodeGen/BasicTTIImpl.h
717	Hmm. You may be correct there in the end. I was kind of hoping that we could keep the context instruction but only look at opcodes. I don't think even that will remain truly correct for very long though. I believe that adding this would at least cause inaccuracies in the costmodelling over what we have right now, possibly regressions in some cases. Not passing I through from the vectorizer (and any other place where we couldn't trust the context) would seem to need a different change to me though.
llvm/test/Analysis/CostModel/AArch64/cast.ll
30 ↗	(On Diff #263402)	Also, and I may be forgetting something because I thought these looked fine the last time I looked, but why would a sext i32->i64 be free? Would it not need a sxtw?

samparker marked an inline comment as done.May 20 2020, 10:54 PM

samparker added inline comments.

llvm/test/Analysis/CostModel/AArch64/cast.ll
30 ↗	(On Diff #263402)	We would, this confused me too. AArch64 looks at all the users of the cast and in the case of no users in returns 0! So it completely breaks these tests. I've haven't given enough context to this patch, because I've also added another cast test with users and that hasn't changed.

I believe that adding this would at least cause inaccuracies in the costmodelling over what we have right now, possibly regressions in some cases.

Don't forget that this is just doing what it did before though and hopefully anyone who is really concerned about performance won't be relying on BasicTTI to get it. There's also D78937 which may help too.

Closed by commit rGfb3ba3802188: [CostModel] Remove getExtCost (authored by samparker). · Explain WhyMay 20 2020, 11:25 PM

This revision was automatically updated to reflect the committed changes.

Diff 262597

llvm/include/llvm/Analysis/TargetTransformInfo.h

Show First 20 Lines • Show All 205 Lines • ▼ Show 20 Lines	enum TargetCostConstants {
TCC_Expensive = 4 ///< The cost of a 'div' instruction on x86.		TCC_Expensive = 4 ///< The cost of a 'div' instruction on x86.
};		};

/// Estimate the cost of a GEP operation when lowered.		/// Estimate the cost of a GEP operation when lowered.
int getGEPCost(Type PointeeType, const Value Ptr,		int getGEPCost(Type PointeeType, const Value Ptr,
ArrayRef<const Value *> Operands,		ArrayRef<const Value *> Operands,
TargetCostKind CostKind = TCK_SizeAndLatency) const;		TargetCostKind CostKind = TCK_SizeAndLatency) const;

/// Estimate the cost of a EXT operation when lowered.
int getExtCost(const Instruction I, const Value Src) const;

/// \returns A value by which our inlining threshold should be multiplied.		/// \returns A value by which our inlining threshold should be multiplied.
/// This is primarily used to bump up the inlining threshold wholesale on		/// This is primarily used to bump up the inlining threshold wholesale on
/// targets where calls are unusually expensive.		/// targets where calls are unusually expensive.
///		///
/// TODO: This is a rather blunt instrument. Perhaps altering the costs of		/// TODO: This is a rather blunt instrument. Perhaps altering the costs of
/// individual classes of instructions would be better.		/// individual classes of instructions would be better.
unsigned getInliningThresholdMultiplier() const;		unsigned getInliningThresholdMultiplier() const;

▲ Show 20 Lines • Show All 958 Lines • ▼ Show 20 Lines

class TargetTransformInfo::Concept {		class TargetTransformInfo::Concept {
public:		public:
virtual ~Concept() = 0;		virtual ~Concept() = 0;
virtual const DataLayout &getDataLayout() const = 0;		virtual const DataLayout &getDataLayout() const = 0;
virtual int getGEPCost(Type PointeeType, const Value Ptr,		virtual int getGEPCost(Type PointeeType, const Value Ptr,
ArrayRef<const Value *> Operands,		ArrayRef<const Value *> Operands,
TTI::TargetCostKind CostKind) = 0;		TTI::TargetCostKind CostKind) = 0;
virtual int getExtCost(const Instruction I, const Value Src) = 0;
virtual unsigned getInliningThresholdMultiplier() = 0;		virtual unsigned getInliningThresholdMultiplier() = 0;
virtual int getInlinerVectorBonusPercent() = 0;		virtual int getInlinerVectorBonusPercent() = 0;
virtual int getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,		virtual int getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,
ArrayRef<Type > ParamTys, const User U,		ArrayRef<Type > ParamTys, const User U,
enum TargetCostKind CostKind) = 0;		enum TargetCostKind CostKind) = 0;
virtual int getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,		virtual int getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,
ArrayRef<const Value *> Arguments,		ArrayRef<const Value *> Arguments,
const User *U,		const User *U,
▲ Show 20 Lines • Show All 251 Lines • ▼ Show 20 Lines	const DataLayout &getDataLayout() const override {
return Impl.getDataLayout();		return Impl.getDataLayout();
}		}

int getGEPCost(Type PointeeType, const Value Ptr,		int getGEPCost(Type PointeeType, const Value Ptr,
ArrayRef<const Value *> Operands,		ArrayRef<const Value *> Operands,
enum TargetTransformInfo::TargetCostKind CostKind) override {		enum TargetTransformInfo::TargetCostKind CostKind) override {
return Impl.getGEPCost(PointeeType, Ptr, Operands);		return Impl.getGEPCost(PointeeType, Ptr, Operands);
}		}
int getExtCost(const Instruction I, const Value Src) override {
return Impl.getExtCost(I, Src);
}
unsigned getInliningThresholdMultiplier() override {		unsigned getInliningThresholdMultiplier() override {
return Impl.getInliningThresholdMultiplier();		return Impl.getInliningThresholdMultiplier();
}		}
int getInlinerVectorBonusPercent() override {		int getInlinerVectorBonusPercent() override {
return Impl.getInlinerVectorBonusPercent();		return Impl.getInlinerVectorBonusPercent();
}		}
int getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,		int getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,
ArrayRef<Type *> ParamTys,		ArrayRef<Type *> ParamTys,
▲ Show 20 Lines • Show All 492 Lines • Show Last 20 Lines

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

Show First 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	unsigned getEstimatedNumberOfCaseClusters(const SwitchInst &SI,
ProfileSummaryInfo *PSI,		ProfileSummaryInfo *PSI,
BlockFrequencyInfo *BFI) {		BlockFrequencyInfo *BFI) {
(void)PSI;		(void)PSI;
(void)BFI;		(void)BFI;
JTSize = 0;		JTSize = 0;
return SI.getNumCases();		return SI.getNumCases();
}		}

int getExtCost(const Instruction I, const Value Src) {
return TTI::TCC_Basic;
}

unsigned getInliningThresholdMultiplier() { return 1; }		unsigned getInliningThresholdMultiplier() { return 1; }

int getInlinerVectorBonusPercent() { return 150; }		int getInlinerVectorBonusPercent() { return 150; }

unsigned getMemcpyCost(const Instruction *I) { return TTI::TCC_Expensive; }		unsigned getMemcpyCost(const Instruction *I) { return TTI::TCC_Expensive; }

bool hasBranchDivergence() { return false; }		bool hasBranchDivergence() { return false; }

▲ Show 20 Lines • Show All 797 Lines • ▼ Show 20 Lines	case Instruction::Trunc:
break;		break;
case Instruction::BitCast:		case Instruction::BitCast:
if (getCastInstrCost(Opcode, Ty, OpTy, CostKind, I) == TTI::TCC_Free)		if (getCastInstrCost(Opcode, Ty, OpTy, CostKind, I) == TTI::TCC_Free)
return TTI::TCC_Free;		return TTI::TCC_Free;
break;		break;
case Instruction::FPExt:		case Instruction::FPExt:
case Instruction::SExt:		case Instruction::SExt:
case Instruction::ZExt:		case Instruction::ZExt:
if (I && TargetTTI->getExtCost(I, Operands.back()) == TTI::TCC_Free)		if (TargetTTI->getCastInstrCost(Opcode, Ty, OpTy, CostKind, I) == TTI::TCC_Free)
return TTI::TCC_Free;		return TTI::TCC_Free;
break;		break;
}		}
// By default, just classify everything as 'basic'.		// By default, just classify everything as 'basic'.
return TTI::TCC_Basic;		return TTI::TCC_Basic;
}		}

int getInstructionLatency(const Instruction *I) {		int getInstructionLatency(const Instruction *I) {
Show All 34 Lines

llvm/include/llvm/CodeGen/BasicTTIImpl.h

Show First 20 Lines • Show All 286 Lines • ▼ Show 20 Lines	bool isTypeLegal(Type *Ty) {
return getTLI()->isTypeLegal(VT);		return getTLI()->isTypeLegal(VT);
}		}

int getGEPCost(Type PointeeType, const Value Ptr,		int getGEPCost(Type PointeeType, const Value Ptr,
ArrayRef<const Value *> Operands) {		ArrayRef<const Value *> Operands) {
return BaseT::getGEPCost(PointeeType, Ptr, Operands);		return BaseT::getGEPCost(PointeeType, Ptr, Operands);
}		}

int getExtCost(const Instruction I, const Value Src) {
if (getTLI()->isExtFree(I))
return TargetTransformInfo::TCC_Free;

if (isa<ZExtInst>(I) \|\| isa<SExtInst>(I))
if (const LoadInst *LI = dyn_cast<LoadInst>(Src))
if (getTLI()->isExtLoad(LI, I, DL))
return TargetTransformInfo::TCC_Free;

return TargetTransformInfo::TCC_Basic;
}

unsigned getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,		unsigned getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,
ArrayRef<const Value > Arguments, const User U,		ArrayRef<const Value > Arguments, const User U,
TTI::TargetCostKind CostKind) {		TTI::TargetCostKind CostKind) {
return BaseT::getIntrinsicCost(IID, RetTy, Arguments, U, CostKind);		return BaseT::getIntrinsicCost(IID, RetTy, Arguments, U, CostKind);
}		}

unsigned getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,		unsigned getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,
ArrayRef<Type > ParamTys, const User U,		ArrayRef<Type > ParamTys, const User U,
▲ Show 20 Lines • Show All 394 Lines • ▼ Show 20 Lines	case Instruction::Trunc:
if (TLI->isTruncateFree(SrcLT.second, DstLT.second))		if (TLI->isTruncateFree(SrcLT.second, DstLT.second))
return 0;		return 0;
LLVM_FALLTHROUGH;		LLVM_FALLTHROUGH;
case Instruction::BitCast:		case Instruction::BitCast:
// Bitcast between types that are legalized to the same type are free.		// Bitcast between types that are legalized to the same type are free.
if (SrcLT.first == DstLT.first && SrcSize == DstSize)		if (SrcLT.first == DstLT.first && SrcSize == DstSize)
return 0;		return 0;
break;		break;
		case Instruction::FPExt:
		if (I && getTLI()->isExtFree(I))
		return TargetTransformInfo::TCC_Free;
		break;
case Instruction::ZExt:		case Instruction::ZExt:
if (TLI->isZExtFree(SrcLT.second, DstLT.second))		if (TLI->isZExtFree(SrcLT.second, DstLT.second))
return 0;		return 0;
LLVM_FALLTHROUGH;		LLVM_FALLTHROUGH;
case Instruction::SExt: {		case Instruction::SExt: {
		if (I && getTLI()->isExtFree(I))
		return TargetTransformInfo::TCC_Free;

// If this is a zext/sext of a load, return 0 if the corresponding		// If this is a zext/sext of a load, return 0 if the corresponding
		dmgreenUnsubmitted Not Done Reply Inline Actions I don't believe this will be correct from the vectorizer. It can pass a context instruction that has different types to the final IR due to it truncating at the same time it vectorizes. It is generally unsound to rely on I being accurate. dmgreen: I don't believe this will be correct from the vectorizer. It can pass a context instruction…
		samparkerAuthorUnsubmitted Done Reply Inline Actions I'm sure there's going to be many cases like that, and the vectorizer can always choose to not pass the instruction. samparker: I'm sure there's going to be many cases like that, and the vectorizer can always choose to not…
		dmgreenUnsubmitted Not Done Reply Inline Actions Hmm. You may be correct there in the end. I was kind of hoping that we could keep the context instruction but only look at opcodes. I don't think even that will remain truly correct for very long though. I believe that adding this would at least cause inaccuracies in the costmodelling over what we have right now, possibly regressions in some cases. Not passing I through from the vectorizer (and any other place where we couldn't trust the context) would seem to need a different change to me though. dmgreen: Hmm. You may be correct there in the end. I was kind of hoping that we could keep the context…
// extending load exists on target.		// extending load exists on target.
if (I && isa<LoadInst>(I->getOperand(0))) {		if (I && isa<LoadInst>(I->getOperand(0))) {
EVT ExtVT = EVT::getEVT(Dst);		EVT ExtVT = EVT::getEVT(Dst);
EVT LoadVT = EVT::getEVT(Src);		EVT LoadVT = EVT::getEVT(Src);
unsigned LType =		unsigned LType =
((Opcode == Instruction::ZExt) ? ISD::ZEXTLOAD : ISD::SEXTLOAD);		((Opcode == Instruction::ZExt) ? ISD::ZEXTLOAD : ISD::SEXTLOAD);
if (TLI->isLoadExtLegal(LType, ExtVT, LoadVT))		if (TLI->isLoadExtLegal(LType, ExtVT, LoadVT))
		RKSimonUnsubmitted Not Done Reply Inline Actions This doesn't look as effective as the getTLI()->isExtLoad call in getExtCost - missing one use checks etc. RKSimon: This doesn't look as effective as the getTLI()->isExtLoad call in getExtCost - missing one use…
		samparkerAuthorUnsubmitted Done Reply Inline Actions Thanks, I hadn't done that intentionally. I've uploaded that change in D78937 because it stops this patch from looking like an NFC. I'm not sure whether the vectorization changes are good/expected. samparker: Thanks, I hadn't done that intentionally. I've uploaded that change in D78937 because it stops…
return 0;		return 0;
}		}
break;		break;
}		}
case Instruction::AddrSpaceCast:		case Instruction::AddrSpaceCast:
if (TLI->isFreeAddrSpaceCast(Src->getPointerAddressSpace(),		if (TLI->isFreeAddrSpaceCast(Src->getPointerAddressSpace(),
Dst->getPointerAddressSpace()))		Dst->getPointerAddressSpace()))
return 0;		return 0;
▲ Show 20 Lines • Show All 484 Lines • Show Last 20 Lines

llvm/lib/Analysis/TargetTransformInfo.cpp

	Show First 20 Lines • Show All 153 Lines • ▼ Show 20 Lines
	}			}

	int TargetTransformInfo::getGEPCost(Type PointeeType, const Value Ptr,			int TargetTransformInfo::getGEPCost(Type PointeeType, const Value Ptr,
	ArrayRef<const Value *> Operands,			ArrayRef<const Value *> Operands,
	TTI::TargetCostKind CostKind) const {			TTI::TargetCostKind CostKind) const {
	return TTIImpl->getGEPCost(PointeeType, Ptr, Operands, CostKind);			return TTIImpl->getGEPCost(PointeeType, Ptr, Operands, CostKind);
	}			}

	int TargetTransformInfo::getExtCost(const Instruction *I,
	const Value *Src) const {
	return TTIImpl->getExtCost(I, Src);
	}

	int TargetTransformInfo::getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,			int TargetTransformInfo::getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,
	ArrayRef<const Value *> Arguments,			ArrayRef<const Value *> Arguments,
	const User *U,			const User *U,
	TTI::TargetCostKind CostKind) const {			TTI::TargetCostKind CostKind) const {
	int Cost = TTIImpl->getIntrinsicCost(IID, RetTy, Arguments, U, CostKind);			int Cost = TTIImpl->getIntrinsicCost(IID, RetTy, Arguments, U, CostKind);
	assert(Cost >= 0 && "TTI should not produce negative costs!");			assert(Cost >= 0 && "TTI should not produce negative costs!");
	return Cost;			return Cost;
	}			}
	▲ Show 20 Lines • Show All 492 Lines • Show Last 20 Lines

llvm/lib/Target/PowerPC/PPCTargetTransformInfo.cpp

	Show First 20 Lines • Show All 492 Lines • ▼ Show 20 Lines
	nullptr);			nullptr);
	}			}

	int PPCTTIImpl::getCastInstrCost(unsigned Opcode, Type Dst, Type Src,			int PPCTTIImpl::getCastInstrCost(unsigned Opcode, Type Dst, Type Src,
	TTI::TargetCostKind CostKind,			TTI::TargetCostKind CostKind,
	const Instruction *I) {			const Instruction *I) {
	assert(TLI->InstructionOpcodeToISD(Opcode) && "Invalid opcode");			assert(TLI->InstructionOpcodeToISD(Opcode) && "Invalid opcode");

	int Cost = BaseT::getCastInstrCost(Opcode, Dst, Src, CostKind);			int Cost = BaseT::getCastInstrCost(Opcode, Dst, Src, CostKind, I);
	return vectorCostAdjustment(Cost, Opcode, Dst, Src);			return vectorCostAdjustment(Cost, Opcode, Dst, Src);
	}			}

	int PPCTTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,			int PPCTTIImpl::getCmpSelInstrCost(unsigned Opcode, Type ValTy, Type CondTy,
	TTI::TargetCostKind CostKind,			TTI::TargetCostKind CostKind,
	const Instruction *I) {			const Instruction *I) {
	int Cost = BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy, CostKind, I);			int Cost = BaseT::getCmpSelInstrCost(Opcode, ValTy, CondTy, CostKind, I);
	return vectorCostAdjustment(Cost, Opcode, ValTy, nullptr);			return vectorCostAdjustment(Cost, Opcode, ValTy, nullptr);
	▲ Show 20 Lines • Show All 239 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[CostModel] Remove getExtCost
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 262597

llvm/include/llvm/Analysis/TargetTransformInfo.h

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

llvm/include/llvm/CodeGen/BasicTTIImpl.h

llvm/lib/Analysis/TargetTransformInfo.cpp

llvm/lib/Target/PowerPC/PPCTargetTransformInfo.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[CostModel] Remove getExtCostClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 262597

llvm/include/llvm/Analysis/TargetTransformInfo.h

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

llvm/include/llvm/CodeGen/BasicTTIImpl.h

llvm/lib/Analysis/TargetTransformInfo.cpp

llvm/lib/Target/PowerPC/PPCTargetTransformInfo.cpp

[CostModel] Remove getExtCost
ClosedPublic