This is an archive of the discontinued LLVM Phabricator instance.

Changing TargetTransformInfo::getGEPCost to take GetElementPtrInst as parameter
Needs ReviewPublic

Authored by eastig on Mar 21 2017, 5:48 AM.

Download Raw Diff

Details

Reviewers

chandlerc
efriedma

Summary

This is a patch for RFC: Change TargetTransformInfo::getGEPCost to take GetElementPtrInst as a parameter (http://lists.llvm.org/pipermail/llvm-dev/2017-March/111066.html)

The current signature of TargetTransformInfo::getGEPCost is:

/// \brief Estimate the cost of a GEP operation when lowered.
///
/// The contract for this function is the same as \c getOperationCost except
/// that it supports an interface that provides extra information specific to
/// the GEP operation.
int getGEPCost(Type *PointeeType, const Value *Ptr,
               ArrayRef<const Value *> Operands) const;

I’d like to change it to:

int getGEPCost(const GetElementPtrInst *GEP,  ArrayRef<const Value *> Operands) const;

All uses of the current getGEPCost look like: TTI.getGEPCost(GEP.getSourceElementType(), GEP.getPointerOperand(), …):

lib/Analysis/InlineCost.cpp

TTI.getGEPCost(GEP.getSourceElementType(), GEP.getPointerOperand(),

lib/Transforms/Scalar/NaryReassociate.cpp

return TTI->getGEPCost(GEP->getSourceElementType(), GEP->getPointerOperand(),

lib/Transforms/Scalar/StraightLineStrengthReduce.cpp

return TTI->getGEPCost(GEP->getSourceElementType(), GEP->getPointerOperand(),

Rationale:

In the following IR produced from the code of a simple memcopy function GEPs are not free:

while.cond:                                       ; preds = %while.body, %entry
  %dest.addr.0 = phi i8* [ %dest, %entry ], [ %incdec.ptr1, %while.body ]
  %src.addr.0 = phi i8* [ %src, %entry ], [ %incdec.ptr, %while.body ]
  %tobool = icmp eq i32 %size.addr.0, 0
  br i1 %tobool, label %while.end, label %while.body

while.body:                                       ; preds = %while.cond
  %dec = add nsw i32 %size.addr.0, -1
  %incdec.ptr = getelementptr inbounds i8, i8* %src.addr.0, i32 1
  %0 = load i8, i8* %src.addr.0, align 1, !tbaa !12
  %incdec.ptr1 = getelementptr inbounds i8, i8* %dest.addr.0, i32 1
  store i8 %0, i8* %dest.addr.0, align 1, !tbaa !12
  br label %while.cond

while.end:                                        ; preds = %while.cond

For x86 and ARM they are lowered into ADD instructions. So they are not free but the current getGEPCost returns they are free. E.g., this affects the cost of inlining. The calculated cost is lower than it should be and functions are inlined.
We can do the analysis before the call of getGEPCost but this will require to do it at all places where getGEPCost is called. So it’s better to do this in one place, in the getGEPCost function or its implementations for targets.
To detect this case and other Def-Use based cases GEPs need to be accessed in getGEPCost which is not possible with the current signature.

Diff Detail

Build Status

Buildable 4919
Build 4919: arc lint + arc unit

Event Timeline

eastig created this revision.Mar 21 2017, 5:48 AM

Herald added subscribers: rengolin, aemerson. · View Herald TranscriptMar 21 2017, 5:48 AM

In some contexts, we might want to compute the cost of a GEP which doesn't actually exist in IR at the moment. I guess there aren't any callers in the tree like that at the moment, but we use this sort of capability in other contexts, so we want to structure the code to allow it.

Actually, the inliner itself could in theory take advantage of that: if you have a function which returns a GEP, the cost depends on how the caller uses the value. That might be overkill, though, given the tiny effect in most cases.

Along those lines, it probably makes sense to structure the code more like this:

The core cost function for GEPs should keep its existing signature, and just assume the GEP has users which are not memory operations (so it's not free unless all the indices are zero).
We should have a utility function which examines the users of the GEP, and checks if the GEP can be folded into them, using isLegalAddressingMode or something like that.

include/llvm/Analysis/TargetTransformInfoImpl.h
571	Useless assertion/

In D31186#706614, @efriedma wrote:

In some contexts, we might want to compute the cost of a GEP which doesn't actually exist in IR at the moment. I guess there aren't any callers in the tree like that at the moment, but we use this sort of capability in other contexts, so we want to structure the code to allow it.

Actually, the inliner itself could in theory take advantage of that: if you have a function which returns a GEP, the cost depends on how the caller uses the value. That might be overkill, though, given the tiny effect in most cases.

Along those lines, it probably makes sense to structure the code more like this:

The core cost function for GEPs should keep its existing signature, and just assume the GEP has users which are not memory operations (so it's not free unless all the indices are zero).

We should have a utility function which examines the users of the GEP, and checks if the GEP can be folded into them, using isLegalAddressingMode or something like that.

I'll also point out that we're further walking down the path of costing instruction patterns, instead of individual instructions. We should clearly document how we expect this to work. Specifically, we want to have a uniform scheme that we can use without overcounting. We can't for example, have some logic look at an instruction along with its operands and another loop at an instruction along with its users. I think that the approach described, where we look at the users, seems potentially the easiest to make consistent, but regardless, we should make a decision.

In some contexts, we might want to compute the cost of a GEP which doesn't actually exist in IR at the moment. I guess there aren't any callers in the tree like that at the moment, but we use this sort of capability in other contexts, so we want to structure the code to allow it.

I have also been working on improving cost functions by passing the actual instruction pointer when available. With the same argument - that there might not actually be an instruction to pass in a speculative context - I did not replace the existing version, but merely adding the instruction as an optional argument. I am hoping this is the right thing to do. See https://reviews.llvm.org/D29631

Eli, Hal and Jonas, thank you for comments.

I see there are use cases I have not taken into account.

Let me summarize:

Requirements:

"get*Cost" functions should try to estimate the cost of an operation when lowered.

User stories:

An user wants to estimate the cost of an operation to make a decision whether it's worth to create it. The operation does not exist in IR.
An user has an operation in IR and wants to know the cost of the operation to estimate its contribution into execution.

Actually, the inliner itself could in theory take advantage of that: if you have a function which returns a GEP, the cost depends on how the caller uses the value. That might be overkill, though, given the tiny effect in most cases.

Eli, you wrote: "the cost depends on how the caller uses the value." A question is where the dependency should be taken into account: in get*Cost or in user's place?

Hal, you wrote:

I'll also point out that we're further walking down the path of costing instruction patterns

What do you mean "instruction patterns"?

Taking into account all of these I thin "get*Cost" functions should answer the question: What is the cost of an operation if I want to have it or I have it in DFG/CFG?

Maybe it's time to redesign API?

Jonas, an optional parameter duplicates the information passed through other parameters. It can provide all of the needed information. Also single API for all use cases might create some kind of misunderstanding.

include/llvm/Analysis/TargetTransformInfoImpl.h
571	I prefer to have controlled crashes instead of uncontrolled ones. It has saved debugging time quite often when a pointer was passed through a chain of calls and was dereferenced in somewhere in the middle of the chain. The assert triggered at the beginning of the chain so I didn't need to examine the whole call stack. Here it's also for the purpose of a contract: GEP must not be null.

User stories:

An user wants to estimate the cost of an operation to make a decision whether it's worth to create it. The operation does not exist in IR.
An user has an operation in IR and wants to know the cost of the operation to estimate its contribution into execution.

I think there is also the case in a vectorizer, where there is an existing instruction but the cost query is for the same instruction *vectorized* with VF.

Jonas, an optional parameter duplicates the information passed through other parameters. It can provide all of the needed information. Also single API for all use cases might create some kind of misunderstanding.

No, not in the case in the vectorizer. Here the scalar instruction is passed, with vector types.

In D31186#708507, @jonpa wrote:
User stories:

An user wants to estimate the cost of an operation to make a decision whether it's worth to create it. The operation does not exist in IR.
An user has an operation in IR and wants to know the cost of the operation to estimate its contribution into execution.
I think there is also the case in a vectorizer, where there is an existing instruction but the cost query is for the same instruction *vectorized* with VF.

Jonas, an optional parameter duplicates the information passed through other parameters. It can provide all of the needed information. Also single API for all use cases might create some kind of misunderstanding.

No, not in the case in the vectorizer. Here the scalar instruction is passed, with vector types.

But that is that the vectorizer normally does. It takes the scalar instruction and emits the same instruction with vector types. You need to explain how any other case is different.

In D31186#707662, @eastig wrote:

...

Hal, you wrote:

I'll also point out that we're further walking down the path of costing instruction patterns

What do you mean "instruction patterns"?

I mean taking into account folding decisions that will be made by the backend. For example, it might be the case that, for some data type, add(zext(x), zext(x)), will be given a cost of 3 because each zext costs 1 and the add costs 1. These costs are accurate in isolation, however, the target can actually lower this into a single instruction, so the overall cost should be 1 for all three operations. I think that the proposed approach, where we pass each instruction so we can look at the users, probably makes the most sense. We need to be careful, however, that we don't overcount.

Maybe it's time to redesign API?

This may be true. Also, I think that the VPlan work being done (D28975) may be highly relevant to how we do this (we might just want to let the backends cost VPlans directly in cases where the fine-grained modeling won't work).

In D31186#708662, @hfinkel wrote:
In D31186#708507, @jonpa wrote:
User stories:

An user wants to estimate the cost of an operation to make a decision whether it's worth to create it. The operation does not exist in IR.
An user has an operation in IR and wants to know the cost of the operation to estimate its contribution into execution.
I think there is also the case in a vectorizer, where there is an existing instruction but the cost query is for the same instruction *vectorized* with VF.

Jonas, an optional parameter duplicates the information passed through other parameters. It can provide all of the needed information. Also single API for all use cases might create some kind of misunderstanding.

No, not in the case in the vectorizer. Here the scalar instruction is passed, with vector types.
But that is that the vectorizer normally does. It takes the scalar instruction and emits the same instruction with vector types. You need to explain how any other case is different.

I meant that it is not quite true that it would be enough to just pass the Instruction without types, as a response to the statement that it just duplicates information. Currently LoopVectorizer passes types as arguments to TTI. Passing the instruction to TTI does not provide the Types, it only gives clues of the original instruction plus the possibility to inspect users / operands. This is part of D29632 for SystemZ, which is still under review.

In D31186#708714, @jonpa wrote:
In D31186#708662, @hfinkel wrote:
In D31186#708507, @jonpa wrote:
User stories:

An user wants to estimate the cost of an operation to make a decision whether it's worth to create it. The operation does not exist in IR.
An user has an operation in IR and wants to know the cost of the operation to estimate its contribution into execution.
I think there is also the case in a vectorizer, where there is an existing instruction but the cost query is for the same instruction *vectorized* with VF.

Jonas, an optional parameter duplicates the information passed through other parameters. It can provide all of the needed information. Also single API for all use cases might create some kind of misunderstanding.

No, not in the case in the vectorizer. Here the scalar instruction is passed, with vector types.
But that is that the vectorizer normally does. It takes the scalar instruction and emits the same instruction with vector types. You need to explain how any other case is different.
I meant that it is not quite true that it would be enough to just pass the Instruction without types, as a response to the statement that it just duplicates information. Currently LoopVectorizer passes types as arguments to TTI. Passing the instruction to TTI does not provide the Types, it only gives clues of the original instruction plus the possibility to inspect users / operands.

I agree. We can't only have the current instruction.

I found that there is TTI::getUserCost which provides needed functionality by design. So it can be used instead of these changes. I created https://reviews.llvm.org/D33685.

Revision Contents

Path

Size

include/

llvm/

Analysis/

TargetTransformInfo.h

8 lines

TargetTransformInfoImpl.h

14 lines

CodeGen/

BasicTTIImpl.h

4 lines

lib/

Analysis/

InlineCost.cpp

4 lines

TargetTransformInfo.cpp

4 lines

Transforms/

Scalar/

NaryReassociate.cpp

3 lines

StraightLineStrengthReduce.cpp

3 lines

Diff 92474

include/llvm/Analysis/TargetTransformInfo.h

Show First 20 Lines • Show All 143 Lines • ▼ Show 20 Lines	public:
/// comments for a detailed explanation of the cost values.		/// comments for a detailed explanation of the cost values.
int getOperationCost(unsigned Opcode, Type Ty, Type OpTy = nullptr) const;		int getOperationCost(unsigned Opcode, Type Ty, Type OpTy = nullptr) const;

/// \brief Estimate the cost of a GEP operation when lowered.		/// \brief Estimate the cost of a GEP operation when lowered.
///		///
/// The contract for this function is the same as \c getOperationCost except		/// The contract for this function is the same as \c getOperationCost except
/// that it supports an interface that provides extra information specific to		/// that it supports an interface that provides extra information specific to
/// the GEP operation.		/// the GEP operation.
int getGEPCost(Type PointeeType, const Value Ptr,		int getGEPCost(const GetElementPtrInst *GEP,
ArrayRef<const Value *> Operands) const;		ArrayRef<const Value *> Operands) const;

/// \brief Estimate the cost of a function call when lowered.		/// \brief Estimate the cost of a function call when lowered.
///		///
/// The contract for this is the same as \c getOperationCost except that it		/// The contract for this is the same as \c getOperationCost except that it
/// supports an interface that provides extra information specific to call		/// supports an interface that provides extra information specific to call
/// instructions.		/// instructions.
///		///
▲ Show 20 Lines • Show All 569 Lines • ▼ Show 20 Lines	private:
std::unique_ptr<Concept> TTIImpl;		std::unique_ptr<Concept> TTIImpl;
};		};

class TargetTransformInfo::Concept {		class TargetTransformInfo::Concept {
public:		public:
virtual ~Concept() = 0;		virtual ~Concept() = 0;
virtual const DataLayout &getDataLayout() const = 0;		virtual const DataLayout &getDataLayout() const = 0;
virtual int getOperationCost(unsigned Opcode, Type Ty, Type OpTy) = 0;		virtual int getOperationCost(unsigned Opcode, Type Ty, Type OpTy) = 0;
virtual int getGEPCost(Type PointeeType, const Value Ptr,		virtual int getGEPCost(const GetElementPtrInst *GEP,
ArrayRef<const Value *> Operands) = 0;		ArrayRef<const Value *> Operands) = 0;
virtual int getCallCost(FunctionType *FTy, int NumArgs) = 0;		virtual int getCallCost(FunctionType *FTy, int NumArgs) = 0;
virtual int getCallCost(const Function *F, int NumArgs) = 0;		virtual int getCallCost(const Function *F, int NumArgs) = 0;
virtual int getCallCost(const Function *F,		virtual int getCallCost(const Function *F,
ArrayRef<const Value *> Arguments) = 0;		ArrayRef<const Value *> Arguments) = 0;
virtual unsigned getInliningThresholdMultiplier() = 0;		virtual unsigned getInliningThresholdMultiplier() = 0;
virtual int getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,		virtual int getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,
ArrayRef<Type *> ParamTys) = 0;		ArrayRef<Type *> ParamTys) = 0;
▲ Show 20 Lines • Show All 130 Lines • ▼ Show 20 Lines	public:

const DataLayout &getDataLayout() const override {		const DataLayout &getDataLayout() const override {
return Impl.getDataLayout();		return Impl.getDataLayout();
}		}

int getOperationCost(unsigned Opcode, Type Ty, Type OpTy) override {		int getOperationCost(unsigned Opcode, Type Ty, Type OpTy) override {
return Impl.getOperationCost(Opcode, Ty, OpTy);		return Impl.getOperationCost(Opcode, Ty, OpTy);
}		}
int getGEPCost(Type PointeeType, const Value Ptr,		int getGEPCost(const GetElementPtrInst *GEP,
ArrayRef<const Value *> Operands) override {		ArrayRef<const Value *> Operands) override {
return Impl.getGEPCost(PointeeType, Ptr, Operands);		return Impl.getGEPCost(GEP, Operands);
}		}
int getCallCost(FunctionType *FTy, int NumArgs) override {		int getCallCost(FunctionType *FTy, int NumArgs) override {
return Impl.getCallCost(FTy, NumArgs);		return Impl.getCallCost(FTy, NumArgs);
}		}
int getCallCost(const Function *F, int NumArgs) override {		int getCallCost(const Function *F, int NumArgs) override {
return Impl.getCallCost(F, NumArgs);		return Impl.getCallCost(F, NumArgs);
}		}
int getCallCost(const Function *F,		int getCallCost(const Function *F,
▲ Show 20 Lines • Show All 365 Lines • Show Last 20 Lines

include/llvm/Analysis/TargetTransformInfoImpl.h

Show First 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	case Instruction::Trunc:
// shift-right of the same width).		// shift-right of the same width).
if (DL.isLegalInteger(DL.getTypeSizeInBits(Ty)))		if (DL.isLegalInteger(DL.getTypeSizeInBits(Ty)))
return TTI::TCC_Free;		return TTI::TCC_Free;

return TTI::TCC_Basic;		return TTI::TCC_Basic;
}		}
}		}

int getGEPCost(Type PointeeType, const Value Ptr,		int getGEPCost(const GetElementPtrInst *GEP,
ArrayRef<const Value *> Operands) {		ArrayRef<const Value *> Operands) {
// In the basic model, we just assume that all-constant GEPs will be folded		// In the basic model, we just assume that all-constant GEPs will be folded
// into their uses via addressing modes.		// into their uses via addressing modes.
for (unsigned Idx = 0, Size = Operands.size(); Idx != Size; ++Idx)		for (unsigned Idx = 0, Size = Operands.size(); Idx != Size; ++Idx)
if (!isa<Constant>(Operands[Idx]))		if (!isa<Constant>(Operands[Idx]))
return TTI::TCC_Basic;		return TTI::TCC_Basic;

return TTI::TCC_Free;		return TTI::TCC_Free;
▲ Show 20 Lines • Show All 446 Lines • ▼ Show 20 Lines	unsigned getCallCost(const Function F, ArrayRef<const Value > Arguments) {
// Simply delegate to generic handling of the call.		// Simply delegate to generic handling of the call.
// FIXME: We should use instsimplify or something else to catch calls which		// FIXME: We should use instsimplify or something else to catch calls which
// will constant fold with these arguments.		// will constant fold with these arguments.
return static_cast<T *>(this)->getCallCost(F, Arguments.size());		return static_cast<T *>(this)->getCallCost(F, Arguments.size());
}		}

using BaseT::getGEPCost;		using BaseT::getGEPCost;

int getGEPCost(Type PointeeType, const Value Ptr,		int getGEPCost(const GetElementPtrInst *GEP,
ArrayRef<const Value *> Operands) {		ArrayRef<const Value *> Operands) {
		assert(GEP);
		efriedmaUnsubmitted Not Done Reply Inline Actions Useless assertion/ efriedma: Useless assertion/
		eastigAuthorUnsubmitted Not Done Reply Inline Actions I prefer to have controlled crashes instead of uncontrolled ones. It has saved debugging time quite often when a pointer was passed through a chain of calls and was dereferenced in somewhere in the middle of the chain. The assert triggered at the beginning of the chain so I didn't need to examine the whole call stack. Here it's also for the purpose of a contract: GEP must not be null. eastig: I prefer to have controlled crashes instead of uncontrolled ones. It has saved debugging time…
		const Value *Ptr = GEP->getPointerOperand();
const GlobalValue *BaseGV = nullptr;		const GlobalValue *BaseGV = nullptr;
if (Ptr != nullptr) {		if (Ptr != nullptr) {
// TODO: will remove this when pointers have an opaque type.		// TODO: will remove this when pointers have an opaque type.
assert(Ptr->getType()->getScalarType()->getPointerElementType() ==		assert(Ptr->getType()->getScalarType()->getPointerElementType() ==
PointeeType &&		GEP->getSourceElementType() &&
"explicit pointee type doesn't match operand's pointee type");		"explicit pointee type doesn't match operand's pointee type");
BaseGV = dyn_cast<GlobalValue>(Ptr->stripPointerCasts());		BaseGV = dyn_cast<GlobalValue>(Ptr->stripPointerCasts());
}		}
bool HasBaseReg = (BaseGV == nullptr);		bool HasBaseReg = (BaseGV == nullptr);
int64_t BaseOffset = 0;		int64_t BaseOffset = 0;
int64_t Scale = 0;		int64_t Scale = 0;

auto GTI = gep_type_begin(PointeeType, Operands);		auto GTI = gep_type_begin(GEP);
Type *TargetType;		Type *TargetType;
for (auto I = Operands.begin(); I != Operands.end(); ++I, ++GTI) {		for (auto I = Operands.begin(); I != Operands.end(); ++I, ++GTI) {
TargetType = GTI.getIndexedType();		TargetType = GTI.getIndexedType();
// We assume that the cost of Scalar GEP with constant index and the		// We assume that the cost of Scalar GEP with constant index and the
// cost of Vector GEP with splat constant index are the same.		// cost of Vector GEP with splat constant index are the same.
const ConstantInt ConstIdx = dyn_cast<ConstantInt>(I);		const ConstantInt ConstIdx = dyn_cast<ConstantInt>(I);
if (!ConstIdx)		if (!ConstIdx)
if (auto Splat = getSplatValue(*I))		if (auto Splat = getSplatValue(*I))
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	public:
}		}

unsigned getUserCost(const User *U) {		unsigned getUserCost(const User *U) {
if (isa<PHINode>(U))		if (isa<PHINode>(U))
return TTI::TCC_Free; // Model all PHI nodes as free.		return TTI::TCC_Free; // Model all PHI nodes as free.

if (const GEPOperator *GEP = dyn_cast<GEPOperator>(U)) {		if (const GEPOperator *GEP = dyn_cast<GEPOperator>(U)) {
SmallVector<Value *, 4> Indices(GEP->idx_begin(), GEP->idx_end());		SmallVector<Value *, 4> Indices(GEP->idx_begin(), GEP->idx_end());
return static_cast<T *>(this)->getGEPCost(		return static_cast<T *>(this)->getGEPCost(cast<GetElementPtrInst>(GEP),
GEP->getSourceElementType(), GEP->getPointerOperand(), Indices);		Indices);
}		}

if (auto CS = ImmutableCallSite(U)) {		if (auto CS = ImmutableCallSite(U)) {
const Function *F = CS.getCalledFunction();		const Function *F = CS.getCalledFunction();
if (!F) {		if (!F) {
// Just use the called value type.		// Just use the called value type.
Type *FTy = CS.getCalledValue()->getType()->getPointerElementType();		Type *FTy = CS.getCalledValue()->getType()->getPointerElementType();
return static_cast<T *>(this)		return static_cast<T *>(this)
Show All 23 Lines

include/llvm/CodeGen/BasicTTIImpl.h

Show First 20 Lines • Show All 138 Lines • ▼ Show 20 Lines	bool isProfitableToHoist(Instruction *I) {
return getTLI()->isProfitableToHoist(I);		return getTLI()->isProfitableToHoist(I);
}		}

bool isTypeLegal(Type *Ty) {		bool isTypeLegal(Type *Ty) {
EVT VT = getTLI()->getValueType(DL, Ty);		EVT VT = getTLI()->getValueType(DL, Ty);
return getTLI()->isTypeLegal(VT);		return getTLI()->isTypeLegal(VT);
}		}

int getGEPCost(Type PointeeType, const Value Ptr,		int getGEPCost(const GetElementPtrInst *GEP,
ArrayRef<const Value *> Operands) {		ArrayRef<const Value *> Operands) {
return BaseT::getGEPCost(PointeeType, Ptr, Operands);		return BaseT::getGEPCost(GEP, Operands);
}		}

unsigned getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,		unsigned getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,
ArrayRef<const Value *> Arguments) {		ArrayRef<const Value *> Arguments) {
return BaseT::getIntrinsicCost(IID, RetTy, Arguments);		return BaseT::getIntrinsicCost(IID, RetTy, Arguments);
}		}

unsigned getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,		unsigned getIntrinsicCost(Intrinsic::ID IID, Type *RetTy,
▲ Show 20 Lines • Show All 939 Lines • Show Last 20 Lines

lib/Analysis/InlineCost.cpp

	Show First 20 Lines • Show All 337 Lines • ▼ Show 20 Lines
	/// Respects any simplified values known during the analysis of this callsite.			/// Respects any simplified values known during the analysis of this callsite.
	bool CallAnalyzer::isGEPFree(GetElementPtrInst &GEP) {			bool CallAnalyzer::isGEPFree(GetElementPtrInst &GEP) {
	SmallVector<Value *, 4> Indices;			SmallVector<Value *, 4> Indices;
	for (User::op_iterator I = GEP.idx_begin(), E = GEP.idx_end(); I != E; ++I)			for (User::op_iterator I = GEP.idx_begin(), E = GEP.idx_end(); I != E; ++I)
	if (Constant SimpleOp = SimplifiedValues.lookup(I))			if (Constant SimpleOp = SimplifiedValues.lookup(I))
	Indices.push_back(SimpleOp);			Indices.push_back(SimpleOp);
	else			else
	Indices.push_back(*I);			Indices.push_back(*I);
	return TargetTransformInfo::TCC_Free ==			return TargetTransformInfo::TCC_Free == TTI.getGEPCost(&GEP, Indices);
	TTI.getGEPCost(GEP.getSourceElementType(), GEP.getPointerOperand(),
	Indices);
	}			}

	bool CallAnalyzer::visitAlloca(AllocaInst &I) {			bool CallAnalyzer::visitAlloca(AllocaInst &I) {
	// Check whether inlining will turn a dynamic alloca into a static			// Check whether inlining will turn a dynamic alloca into a static
	// alloca and handle that case.			// alloca and handle that case.
	if (I.isArrayAllocation()) {			if (I.isArrayAllocation()) {
	Constant *Size = SimplifiedValues.lookup(I.getArraySize());			Constant *Size = SimplifiedValues.lookup(I.getArraySize());
	if (auto *AllocSize = dyn_cast_or_null<ConstantInt>(Size)) {			if (auto *AllocSize = dyn_cast_or_null<ConstantInt>(Size)) {
	▲ Show 20 Lines • Show All 1,264 Lines • Show Last 20 Lines

lib/Analysis/TargetTransformInfo.cpp

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	int TargetTransformInfo::getCallCost(const Function *F,
assert(Cost >= 0 && "TTI should not produce negative costs!");		assert(Cost >= 0 && "TTI should not produce negative costs!");
return Cost;		return Cost;
}		}

unsigned TargetTransformInfo::getInliningThresholdMultiplier() const {		unsigned TargetTransformInfo::getInliningThresholdMultiplier() const {
return TTIImpl->getInliningThresholdMultiplier();		return TTIImpl->getInliningThresholdMultiplier();
}		}

int TargetTransformInfo::getGEPCost(Type PointeeType, const Value Ptr,		int TargetTransformInfo::getGEPCost(const GetElementPtrInst *GEP,
ArrayRef<const Value *> Operands) const {		ArrayRef<const Value *> Operands) const {
return TTIImpl->getGEPCost(PointeeType, Ptr, Operands);		return TTIImpl->getGEPCost(GEP, Operands);
}		}

int TargetTransformInfo::getIntrinsicCost(		int TargetTransformInfo::getIntrinsicCost(
Intrinsic::ID IID, Type RetTy, ArrayRef<const Value > Arguments) const {		Intrinsic::ID IID, Type RetTy, ArrayRef<const Value > Arguments) const {
int Cost = TTIImpl->getIntrinsicCost(IID, RetTy, Arguments);		int Cost = TTIImpl->getIntrinsicCost(IID, RetTy, Arguments);
assert(Cost >= 0 && "TTI should not produce negative costs!");		assert(Cost >= 0 && "TTI should not produce negative costs!");
return Cost;		return Cost;
}		}
▲ Show 20 Lines • Show All 444 Lines • Show Last 20 Lines

lib/Transforms/Scalar/NaryReassociate.cpp

Show First 20 Lines • Show All 257 Lines • ▼ Show 20 Lines	Instruction NaryReassociatePass::tryReassociate(Instruction I) {
}		}
}		}

static bool isGEPFoldable(GetElementPtrInst *GEP,		static bool isGEPFoldable(GetElementPtrInst *GEP,
const TargetTransformInfo *TTI) {		const TargetTransformInfo *TTI) {
SmallVector<const Value*, 4> Indices;		SmallVector<const Value*, 4> Indices;
for (auto I = GEP->idx_begin(); I != GEP->idx_end(); ++I)		for (auto I = GEP->idx_begin(); I != GEP->idx_end(); ++I)
Indices.push_back(*I);		Indices.push_back(*I);
return TTI->getGEPCost(GEP->getSourceElementType(), GEP->getPointerOperand(),		return TTI->getGEPCost(GEP, Indices) == TargetTransformInfo::TCC_Free;
Indices) == TargetTransformInfo::TCC_Free;
}		}

Instruction NaryReassociatePass::tryReassociateGEP(GetElementPtrInst GEP) {		Instruction NaryReassociatePass::tryReassociateGEP(GetElementPtrInst GEP) {
// Not worth reassociating GEP if it is foldable.		// Not worth reassociating GEP if it is foldable.
if (isGEPFoldable(GEP, TTI))		if (isGEPFoldable(GEP, TTI))
return nullptr;		return nullptr;

gep_type_iterator GTI = gep_type_begin(*GEP);		gep_type_iterator GTI = gep_type_begin(*GEP);
▲ Show 20 Lines • Show All 232 Lines • Show Last 20 Lines

lib/Transforms/Scalar/StraightLineStrengthReduce.cpp

Show First 20 Lines • Show All 233 Lines • ▼ Show 20 Lines	return (Basis.Ins != C.Ins && // skip the same instruction
Basis.CandidateKind == C.CandidateKind);		Basis.CandidateKind == C.CandidateKind);
}		}

static bool isGEPFoldable(GetElementPtrInst *GEP,		static bool isGEPFoldable(GetElementPtrInst *GEP,
const TargetTransformInfo *TTI) {		const TargetTransformInfo *TTI) {
SmallVector<const Value*, 4> Indices;		SmallVector<const Value*, 4> Indices;
for (auto I = GEP->idx_begin(); I != GEP->idx_end(); ++I)		for (auto I = GEP->idx_begin(); I != GEP->idx_end(); ++I)
Indices.push_back(*I);		Indices.push_back(*I);
return TTI->getGEPCost(GEP->getSourceElementType(), GEP->getPointerOperand(),		return TTI->getGEPCost(GEP, Indices) == TargetTransformInfo::TCC_Free;
Indices) == TargetTransformInfo::TCC_Free;
}		}

// Returns whether (Base + Index * Stride) can be folded to an addressing mode.		// Returns whether (Base + Index * Stride) can be folded to an addressing mode.
static bool isAddFoldable(const SCEV Base, ConstantInt Index, Value *Stride,		static bool isAddFoldable(const SCEV Base, ConstantInt Index, Value *Stride,
TargetTransformInfo *TTI) {		TargetTransformInfo *TTI) {
// Index->getSExtValue() may crash if Index is wider than 64-bit.		// Index->getSExtValue() may crash if Index is wider than 64-bit.
return Index->getBitWidth() <= 64 &&		return Index->getBitWidth() <= 64 &&
TTI->isLegalAddressingMode(Base->getType(), nullptr, 0, true,		TTI->isLegalAddressingMode(Base->getType(), nullptr, 0, true,
▲ Show 20 Lines • Show All 450 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Changing TargetTransformInfo::getGEPCost to take GetElementPtrInst as parameterNeeds ReviewPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 92474

include/llvm/Analysis/TargetTransformInfo.h

include/llvm/Analysis/TargetTransformInfoImpl.h

include/llvm/CodeGen/BasicTTIImpl.h

lib/Analysis/InlineCost.cpp

lib/Analysis/TargetTransformInfo.cpp

lib/Transforms/Scalar/NaryReassociate.cpp

lib/Transforms/Scalar/StraightLineStrengthReduce.cpp

Changing TargetTransformInfo::getGEPCost to take GetElementPtrInst as parameter
Needs ReviewPublic