This is an archive of the discontinued LLVM Phabricator instance.

[SystemZ] Improve handling and cost estimates of vector integer div/rem
ClosedPublic

Authored by jonpa on Oct 12 2018, 7:16 AM.

Download Raw Diff

Details

Reviewers

uweigand
jonpa

Summary

I changed the constant cost of an sdiv/udiv to 20 in getArithmeticInstrCost for the divide with register case which uses a target divide instruction.

I also discovered that there is actually a third case (in addition to the divisor being a register or a power of 2 constant splat): a constant vector which is *not* a power of two splat. In that case we can get a sequence with a mul and shifts. So I now have three costs - see comment in the cost method.

In addition to this, I also found that only the incomplete vector ops, e.g. srem <2 x i32> would actually get that optimization into the mul sequence. A <4 x i32> would not, and I found a way to handle that by adding a target DAGCombine for vector SDIV, UDIV, SREM and UREM in order to scalarize them early. See comment in combineIntDIVREM for motivation (I suppose perhaps all of them could be scalarized early, but I added a check to just handle the case with a constant vector divisor).

Tests of costs are extended. Some tests removed from int-arith.ll that are now redundant.

Also testing that the constant divisors don't use a divide instruction by running llc in the CostModel test cases (like X86 does).

SPEC impact:
z13/z14: 5 loops in 2 files improved. 2 other files also changed, which seems in one case to relate to SLP costs, and the other file gets mul sequence instead of div for <4 x i32> div.

Diff Detail

Event Timeline

jonpa created this revision.Oct 12 2018, 7:16 AM

LGTM, thanks!

Committed as r345321.

This revision is now accepted and ready to land.Oct 25 2018, 2:57 PM

jonpa closed this revision.Oct 25 2018, 2:57 PM

Revision Contents

Path

Size

lib/

Target/

SystemZ/

SystemZISelLowering.h

1 line

SystemZISelLowering.cpp

25 lines

SystemZTargetTransformInfo.cpp

82 lines

test/

Analysis/

CostModel/

SystemZ/

291 lines

383 lines

286 lines

187 lines

memop-folding-int-arith.ll

28 lines

Diff 169395

lib/Target/SystemZ/SystemZISelLowering.h

Show First 20 Lines • Show All 599 Lines • ▼ Show 20 Lines	private:
SDValue combineSTORE(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue combineSTORE(SDNode *N, DAGCombinerInfo &DCI) const;
SDValue combineEXTRACT_VECTOR_ELT(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue combineEXTRACT_VECTOR_ELT(SDNode *N, DAGCombinerInfo &DCI) const;
SDValue combineJOIN_DWORDS(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue combineJOIN_DWORDS(SDNode *N, DAGCombinerInfo &DCI) const;
SDValue combineFP_ROUND(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue combineFP_ROUND(SDNode *N, DAGCombinerInfo &DCI) const;
SDValue combineBSWAP(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue combineBSWAP(SDNode *N, DAGCombinerInfo &DCI) const;
SDValue combineBR_CCMASK(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue combineBR_CCMASK(SDNode *N, DAGCombinerInfo &DCI) const;
SDValue combineSELECT_CCMASK(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue combineSELECT_CCMASK(SDNode *N, DAGCombinerInfo &DCI) const;
SDValue combineGET_CCMASK(SDNode *N, DAGCombinerInfo &DCI) const;		SDValue combineGET_CCMASK(SDNode *N, DAGCombinerInfo &DCI) const;
		SDValue combineIntDIVREM(SDNode *N, DAGCombinerInfo &DCI) const;

// If the last instruction before MBBI in MBB was some form of COMPARE,		// If the last instruction before MBBI in MBB was some form of COMPARE,
// try to replace it with a COMPARE AND BRANCH just before MBBI.		// try to replace it with a COMPARE AND BRANCH just before MBBI.
// CCMask and Target are the BRC-like operands for the branch.		// CCMask and Target are the BRC-like operands for the branch.
// Return true if the change was made.		// Return true if the change was made.
bool convertPrevCompareToBranch(MachineBasicBlock *MBB,		bool convertPrevCompareToBranch(MachineBasicBlock *MBB,
MachineBasicBlock::iterator MBBI,		MachineBasicBlock::iterator MBBI,
unsigned CCMask,		unsigned CCMask,
Show All 38 Lines

lib/Target/SystemZ/SystemZISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 521 Lines • ▼ Show 20 Lines	SystemZTargetLowering::SystemZTargetLowering(const TargetMachine &TM,
// Codes for which we want to perform some z-specific combinations.		// Codes for which we want to perform some z-specific combinations.
setTargetDAGCombine(ISD::ZERO_EXTEND);		setTargetDAGCombine(ISD::ZERO_EXTEND);
setTargetDAGCombine(ISD::SIGN_EXTEND);		setTargetDAGCombine(ISD::SIGN_EXTEND);
setTargetDAGCombine(ISD::SIGN_EXTEND_INREG);		setTargetDAGCombine(ISD::SIGN_EXTEND_INREG);
setTargetDAGCombine(ISD::STORE);		setTargetDAGCombine(ISD::STORE);
setTargetDAGCombine(ISD::EXTRACT_VECTOR_ELT);		setTargetDAGCombine(ISD::EXTRACT_VECTOR_ELT);
setTargetDAGCombine(ISD::FP_ROUND);		setTargetDAGCombine(ISD::FP_ROUND);
setTargetDAGCombine(ISD::BSWAP);		setTargetDAGCombine(ISD::BSWAP);
		setTargetDAGCombine(ISD::SDIV);
		setTargetDAGCombine(ISD::UDIV);
		setTargetDAGCombine(ISD::SREM);
		setTargetDAGCombine(ISD::UREM);

// Handle intrinsics.		// Handle intrinsics.
setOperationAction(ISD::INTRINSIC_W_CHAIN, MVT::Other, Custom);		setOperationAction(ISD::INTRINSIC_W_CHAIN, MVT::Other, Custom);
setOperationAction(ISD::INTRINSIC_WO_CHAIN, MVT::Other, Custom);		setOperationAction(ISD::INTRINSIC_WO_CHAIN, MVT::Other, Custom);

// We want to use MVC in preference to even a single load/store pair.		// We want to use MVC in preference to even a single load/store pair.
MaxStoresPerMemcpy = 0;		MaxStoresPerMemcpy = 0;
MaxStoresPerMemcpyOptSize = 0;		MaxStoresPerMemcpyOptSize = 0;
▲ Show 20 Lines • Show All 5,121 Lines • ▼ Show 20 Lines	SDValue SystemZTargetLowering::combineGET_CCMASK(
if (SelectCCValidVal & ~CCValidVal)		if (SelectCCValidVal & ~CCValidVal)
return SDValue();		return SDValue();
if (SelectCCMaskVal != (CCMaskVal & SelectCCValidVal))		if (SelectCCMaskVal != (CCMaskVal & SelectCCValidVal))
return SDValue();		return SDValue();

return Select->getOperand(4);		return Select->getOperand(4);
}		}

		SDValue SystemZTargetLowering::combineIntDIVREM(
		SDNode *N, DAGCombinerInfo &DCI) const {
		SelectionDAG &DAG = DCI.DAG;
		EVT VT = N->getValueType(0);
		// In the case where the divisor is a vector of constants a cheaper
		// sequence of instructions can replace the divide. BuildSDIV is called to
		// do this during DAG combining, but it only succeeds when it can build a
		// multiplication node. The only option for SystemZ is ISD::SMUL_LOHI, and
		// since it is not Legal but Custom it can only happen before
		// legalization. Therefore we must scalarize this early before Combine
		// 1. For widened vectors, this is already the result of type legalization.
		if (VT.isVector() && isTypeLegal(VT) &&
		DAG.isConstantIntBuildVectorOrConstantInt(N->getOperand(1)))
		return DAG.UnrollVectorOp(N);
		return SDValue();
		}

SDValue SystemZTargetLowering::PerformDAGCombine(SDNode *N,		SDValue SystemZTargetLowering::PerformDAGCombine(SDNode *N,
DAGCombinerInfo &DCI) const {		DAGCombinerInfo &DCI) const {
switch(N->getOpcode()) {		switch(N->getOpcode()) {
default: break;		default: break;
case ISD::ZERO_EXTEND: return combineZERO_EXTEND(N, DCI);		case ISD::ZERO_EXTEND: return combineZERO_EXTEND(N, DCI);
case ISD::SIGN_EXTEND: return combineSIGN_EXTEND(N, DCI);		case ISD::SIGN_EXTEND: return combineSIGN_EXTEND(N, DCI);
case ISD::SIGN_EXTEND_INREG: return combineSIGN_EXTEND_INREG(N, DCI);		case ISD::SIGN_EXTEND_INREG: return combineSIGN_EXTEND_INREG(N, DCI);
case SystemZISD::MERGE_HIGH:		case SystemZISD::MERGE_HIGH:
case SystemZISD::MERGE_LOW: return combineMERGE(N, DCI);		case SystemZISD::MERGE_LOW: return combineMERGE(N, DCI);
case ISD::STORE: return combineSTORE(N, DCI);		case ISD::STORE: return combineSTORE(N, DCI);
case ISD::EXTRACT_VECTOR_ELT: return combineEXTRACT_VECTOR_ELT(N, DCI);		case ISD::EXTRACT_VECTOR_ELT: return combineEXTRACT_VECTOR_ELT(N, DCI);
case SystemZISD::JOIN_DWORDS: return combineJOIN_DWORDS(N, DCI);		case SystemZISD::JOIN_DWORDS: return combineJOIN_DWORDS(N, DCI);
case ISD::FP_ROUND: return combineFP_ROUND(N, DCI);		case ISD::FP_ROUND: return combineFP_ROUND(N, DCI);
case ISD::BSWAP: return combineBSWAP(N, DCI);		case ISD::BSWAP: return combineBSWAP(N, DCI);
case SystemZISD::BR_CCMASK: return combineBR_CCMASK(N, DCI);		case SystemZISD::BR_CCMASK: return combineBR_CCMASK(N, DCI);
case SystemZISD::SELECT_CCMASK: return combineSELECT_CCMASK(N, DCI);		case SystemZISD::SELECT_CCMASK: return combineSELECT_CCMASK(N, DCI);
case SystemZISD::GET_CCMASK: return combineGET_CCMASK(N, DCI);		case SystemZISD::GET_CCMASK: return combineGET_CCMASK(N, DCI);
		case ISD::SDIV:
		case ISD::UDIV:
		case ISD::SREM:
		case ISD::UREM: return combineIntDIVREM(N, DCI);
}		}

return SDValue();		return SDValue();
}		}

// Return the demanded elements for the OpNo source operand of Op. DemandedElts		// Return the demanded elements for the OpNo source operand of Op. DemandedElts
// are for Op.		// are for Op.
static APInt getDemandedSrcElements(SDValue Op, const APInt &DemandedElts,		static APInt getDemandedSrcElements(SDValue Op, const APInt &DemandedElts,
▲ Show 20 Lines • Show All 1,598 Lines • Show Last 20 Lines

lib/Target/SystemZ/SystemZTargetTransformInfo.cpp

Show First 20 Lines • Show All 356 Lines • ▼ Show 20 Lines	int SystemZTTIImpl::getArithmeticInstrCost(
// TODO: return a good value for BB-VECTORIZER that includes the		// TODO: return a good value for BB-VECTORIZER that includes the
// immediate loads, which we do not want to count for the loop		// immediate loads, which we do not want to count for the loop
// vectorizer, since they are hopefully hoisted out of the loop. This		// vectorizer, since they are hopefully hoisted out of the loop. This
// would require a new parameter 'InLoop', but not sure if constant		// would require a new parameter 'InLoop', but not sure if constant
// args are common enough to motivate this.		// args are common enough to motivate this.

unsigned ScalarBits = Ty->getScalarSizeInBits();		unsigned ScalarBits = Ty->getScalarSizeInBits();

// Div with a constant which is a power of 2 will be converted by		// There are thre cases of division and remainder: Dividing with a register
// DAGCombiner to use shifts. With vector shift-element instructions, a		// needs a divide instruction. A divisor which is a power of two constant
// vector sdiv costs about as much as a scalar one.		// can be implemented with a sequence of shifts. Any other constant needs a
const unsigned SDivCostEstimate = 4;		// multiply and shifts.
bool SDivPow2 = false;		const unsigned DivInstrCost = 20;
bool UDivPow2 = false;		const unsigned DivMulSeqCost = 10;
if ((Opcode == Instruction::SDiv \|\| Opcode == Instruction::UDiv) &&		const unsigned SDivPow2Cost = 4;
Args.size() == 2) {
const ConstantInt *CI = nullptr;		bool SignedDivRem =
		Opcode == Instruction::SDiv \|\| Opcode == Instruction::SRem;
		bool UnsignedDivRem =
		Opcode == Instruction::UDiv \|\| Opcode == Instruction::URem;

		// Check for a constant divisor.
		bool DivRemConst = false;
		bool DivRemConstPow2 = false;
		if ((SignedDivRem \|\| UnsignedDivRem) && Args.size() == 2) {
if (const Constant *C = dyn_cast<Constant>(Args[1])) {		if (const Constant *C = dyn_cast<Constant>(Args[1])) {
if (C->getType()->isVectorTy())		const ConstantInt *CVal =
CI = dyn_cast_or_null<const ConstantInt>(C->getSplatValue());		(C->getType()->isVectorTy()
		? dyn_cast_or_null<const ConstantInt>(C->getSplatValue())
		: dyn_cast<const ConstantInt>(C));
		if (CVal != nullptr &&
		(CVal->getValue().isPowerOf2() \|\| (-CVal->getValue()).isPowerOf2()))
		DivRemConstPow2 = true;
else		else
CI = dyn_cast<const ConstantInt>(C);		DivRemConst = true;
}
if (CI != nullptr &&
(CI->getValue().isPowerOf2() \|\| (-CI->getValue()).isPowerOf2())) {
if (Opcode == Instruction::SDiv)
SDivPow2 = true;
else
UDivPow2 = true;
}		}
}		}

if (Ty->isVectorTy()) {		if (Ty->isVectorTy()) {
assert (ST->hasVector() && "getArithmeticInstrCost() called with vector type.");		assert (ST->hasVector() && "getArithmeticInstrCost() called with vector type.");
unsigned VF = Ty->getVectorNumElements();		unsigned VF = Ty->getVectorNumElements();
unsigned NumVectors = getNumVectorRegs(Ty);		unsigned NumVectors = getNumVectorRegs(Ty);

// These vector operations are custom handled, but are still supported		// These vector operations are custom handled, but are still supported
// with one instruction per vector, regardless of element size.		// with one instruction per vector, regardless of element size.
if (Opcode == Instruction::Shl \|\| Opcode == Instruction::LShr \|\|		if (Opcode == Instruction::Shl \|\| Opcode == Instruction::LShr \|\|
Opcode == Instruction::AShr \|\| UDivPow2) {		Opcode == Instruction::AShr) {
return NumVectors;		return NumVectors;
}		}

if (SDivPow2)		if (DivRemConstPow2)
return (NumVectors * SDivCostEstimate);		return (NumVectors * (SignedDivRem ? SDivPow2Cost : 1));
		if (DivRemConst)
		return VF * DivMulSeqCost + getScalarizationOverhead(Ty, Args);
		if ((SignedDivRem \|\| UnsignedDivRem) && VF > 4)
// Temporary hack: disable high vectorization factors with integer		// Temporary hack: disable high vectorization factors with integer
// division/remainder, which will get scalarized and handled with GR128		// division/remainder, which will get scalarized and handled with
// registers. The mischeduler is not clever enough to avoid spilling yet.		// GR128 registers. The mischeduler is not clever enough to avoid
if ((Opcode == Instruction::UDiv \|\| Opcode == Instruction::SDiv \|\|		// spilling yet.
Opcode == Instruction::URem \|\| Opcode == Instruction::SRem) && VF > 4)
return 1000;		return 1000;

// These FP operations are supported with a single vector instruction for		// These FP operations are supported with a single vector instruction for
// double (base implementation assumes float generally costs 2). For		// double (base implementation assumes float generally costs 2). For
// FP128, the scalar cost is 1, and there is no overhead since the values		// FP128, the scalar cost is 1, and there is no overhead since the values
// are already in scalar registers.		// are already in scalar registers.
if (Opcode == Instruction::FAdd \|\| Opcode == Instruction::FSub \|\|		if (Opcode == Instruction::FAdd \|\| Opcode == Instruction::FSub \|\|
Opcode == Instruction::FMul \|\| Opcode == Instruction::FDiv) {		Opcode == Instruction::FMul \|\| Opcode == Instruction::FDiv) {
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	if (Opcode == Instruction::Or)
return 1;		return 1;

if (Opcode == Instruction::Xor && ScalarBits == 1) {		if (Opcode == Instruction::Xor && ScalarBits == 1) {
if (ST->hasLoadStoreOnCond2())		if (ST->hasLoadStoreOnCond2())
return 5; // 2 * (li 0; loc 1); xor		return 5; // 2 * (li 0; loc 1); xor
return 7; // 2 * ipm sequences ; xor ; shift ; compare		return 7; // 2 * ipm sequences ; xor ; shift ; compare
}		}

if (UDivPow2)		if (DivRemConstPow2)
return 1;		return (SignedDivRem ? SDivPow2Cost : 1);
if (SDivPow2)		if (DivRemConst)
return SDivCostEstimate;		return DivMulSeqCost;
		if (SignedDivRem)
// An extra extension for narrow types is needed.
if ((Opcode == Instruction::SDiv \|\| Opcode == Instruction::SRem))
// sext of op(s) for narrow types		// sext of op(s) for narrow types
return (ScalarBits < 32 ? 4 : (ScalarBits == 32 ? 2 : 1));		return DivInstrCost + (ScalarBits < 32 ? 3 : (ScalarBits == 32 ? 1 : 0));
		if (UnsignedDivRem)
if (Opcode == Instruction::UDiv \|\| Opcode == Instruction::URem)
// Clearing of low 64 bit reg + sext of op(s) for narrow types + dl[g]r		// Clearing of low 64 bit reg + sext of op(s) for narrow types + dl[g]r
return (ScalarBits < 32 ? 4 : 2);		return DivInstrCost + (ScalarBits < 32 ? 3 : 1);
}		}

// Fallback to the default implementation.		// Fallback to the default implementation.
return BaseT::getArithmeticInstrCost(Opcode, Ty, Op1Info, Op2Info,		return BaseT::getArithmeticInstrCost(Opcode, Ty, Op1Info, Op2Info,
Opd1PropInfo, Opd2PropInfo, Args);		Opd1PropInfo, Opd2PropInfo, Args);
}		}

int SystemZTTIImpl::getShuffleCost(TTI::ShuffleKind Kind, Type *Tp, int Index,		int SystemZTTIImpl::getShuffleCost(TTI::ShuffleKind Kind, Type *Tp, int Index,
▲ Show 20 Lines • Show All 440 Lines • Show Last 20 Lines

test/Analysis/CostModel/SystemZ/div-pow2.ll

This file was deleted.

	; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s

	; Scalar sdiv

	define i64 @fun0(i64 %a) {
	%r = sdiv i64 %a, 2
	ret i64 %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i64 %a, 2
	}

	define i64 @fun1(i64 %a) {
	%r = sdiv i64 %a, -4
	ret i64 %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i64 %a, -4
	}

	define i32 @fun2(i32 %a) {
	%r = sdiv i32 %a, 8
	ret i32 %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i32 %a, 8
	}

	define i32 @fun3(i32 %a) {
	%r = sdiv i32 %a, -16
	ret i32 %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i32 %a, -16
	}

	define i16 @fun4(i16 %a) {
	%r = sdiv i16 %a, 32
	ret i16 %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i16 %a, 32
	}

	define i16 @fun5(i16 %a) {
	%r = sdiv i16 %a, -64
	ret i16 %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i16 %a, -64
	}

	define i8 @fun6(i8 %a) {
	%r = sdiv i8 %a, 64
	ret i8 %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i8 %a, 64
	}

	define i8 @fun7(i8 %a) {
	%r = sdiv i8 %a, -128
	ret i8 %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i8 %a, -128
	}


	; Vector sdiv

	define <2 x i64> @fun8(<2 x i64> %a) {
	%r = sdiv <2 x i64> %a, <i64 2, i64 2>
	ret <2 x i64> %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <2 x i64> %a, <i64 2, i64 2>
	}

	define <2 x i64> @fun9(<2 x i64> %a) {
	%r = sdiv <2 x i64> %a, <i64 -4, i64 -4>
	ret <2 x i64> %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <2 x i64> %a, <i64 -4, i64 -4>
	}

	define <4 x i32> @fun10(<4 x i32> %a) {
	%r = sdiv <4 x i32> %a, <i32 8, i32 8, i32 8, i32 8>
	ret <4 x i32> %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <4 x i32> %a, <i32 8, i32 8, i32 8, i32 8>
	}

	define <4 x i32> @fun11(<4 x i32> %a) {
	%r = sdiv <4 x i32> %a, <i32 -16, i32 -16, i32 -16, i32 -16>
	ret <4 x i32> %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <4 x i32> %a, <i32 -16
	}

	define <8 x i16> @fun12(<8 x i16> %a) {
	%r = sdiv <8 x i16> %a, <i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32>
	ret <8 x i16> %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <8 x i16> %a, <i16 32
	}

	define <8 x i16> @fun13(<8 x i16> %a) {
	%r = sdiv <8 x i16> %a, <i16 -64, i16 -64, i16 -64, i16 -64, i16 -64, i16 -64, i16 -64, i16 -64>
	ret <8 x i16> %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <8 x i16> %a, <i16 -64
	}

	define <16 x i8> @fun14(<16 x i8> %a) {
	%r = sdiv <16 x i8> %a, <i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64>
	ret <16 x i8> %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <16 x i8> %a, <i8 64
	}

	define <16 x i8> @fun15(<16 x i8> %a) {
	%r = sdiv <16 x i8> %a, <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>
	ret <16 x i8> %r
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <16 x i8> %a, <i8 -128
	}

	; Scalar udiv

	define i64 @fun16(i64 %a) {
	%r = udiv i64 %a, 2
	ret i64 %r
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv i64 %a, 2
	}

	define i32 @fun17(i32 %a) {
	%r = udiv i32 %a, 8
	ret i32 %r
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv i32 %a, 8
	}

	define i16 @fun18(i16 %a) {
	%r = udiv i16 %a, 32
	ret i16 %r
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv i16 %a, 32
	}

	define i8 @fun19(i8 %a) {
	%r = udiv i8 %a, 128
	ret i8 %r
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv i8 %a, -128
	}

	; Vector udiv

	define <2 x i64> @fun20(<2 x i64> %a) {
	%r = udiv <2 x i64> %a, <i64 2, i64 2>
	ret <2 x i64> %r
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <2 x i64> %a, <i64 2
	}

	define <4 x i32> @fun21(<4 x i32> %a) {
	%r = udiv <4 x i32> %a, <i32 8, i32 8, i32 8, i32 8>
	ret <4 x i32> %r
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <4 x i32> %a, <i32 8
	}

	define <8 x i16> @fun22(<8 x i16> %a) {
	%r = udiv <8 x i16> %a, <i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32>
	ret <8 x i16> %r
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <8 x i16> %a, <i16 32
	}

	define <16 x i8> @fun23(<16 x i8> %a) {
	%r = udiv <16 x i8> %a, <i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128>
	ret <16 x i8> %r
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <16 x i8> %a, <i8 -128
	}

test/Analysis/CostModel/SystemZ/divrem-const.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \
				; RUN: \| FileCheck %s -check-prefix=COST

				; Check that all divide/remainder instructions are implemented by cheaper instructions.
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 -o - \| FileCheck %s
				; CHECK-NOT: dsg
				; CHECK-NOT: dl

				; Check costs of divisions/remainders by a vector of constants that is not
				; a power of two. A sequence containing a multiply and shifts will replace
				; the divide instruction.

				; Scalar sdiv

				define i64 @fun0(i64 %a) {
				%r = sdiv i64 %a, 20
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = sdiv i64 %a, 20
				}

				define i32 @fun1(i32 %a) {
				%r = sdiv i32 %a, 20
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = sdiv i32 %a, 20
				}

				define i16 @fun2(i16 %a) {
				%r = sdiv i16 %a, 20
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = sdiv i16 %a, 20
				}

				define i8 @fun3(i8 %a) {
				%r = sdiv i8 %a, 20
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = sdiv i8 %a, 20
				}

				; Vector sdiv

				define <2 x i64> @fun4(<2 x i64> %a) {
				%r = sdiv <2 x i64> %a, <i64 20, i64 21>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 24 for instruction: %r = sdiv <2 x i64>
				}

				define <4 x i32> @fun5(<4 x i32> %a) {
				%r = sdiv <4 x i32> %a, <i32 20, i32 20, i32 20, i32 20>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 49 for instruction: %r = sdiv <4 x i32>
				}

				define <2 x i32> @fun6(<2 x i32> %a) {
				%r = sdiv <2 x i32> %a, <i32 20, i32 21>
				ret <2 x i32> %r
				; COST: Cost Model: Found an estimated cost of 25 for instruction: %r = sdiv <2 x i32>
				}

				define <8 x i16> @fun7(<8 x i16> %a) {
				%r = sdiv <8 x i16> %a, <i16 20, i16 20, i16 20, i16 20, i16 20, i16 20, i16 20, i16 20>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 97 for instruction: %r = sdiv <8 x i16>
				}

				define <4 x i16> @fun8(<4 x i16> %a) {
				%r = sdiv <4 x i16> %a, <i16 20, i16 20, i16 20, i16 21>
				ret <4 x i16> %r
				; COST: Cost Model: Found an estimated cost of 49 for instruction: %r = sdiv <4 x i16>
				}

				define <16 x i8> @fun9(<16 x i8> %a) {
				%r = sdiv <16 x i8> %a, <i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 193 for instruction: %r = sdiv <16 x i8>
				}

				define <8 x i8> @fun10(<8 x i8> %a) {
				%r = sdiv <8 x i8> %a, <i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 21>
				ret <8 x i8> %r
				; COST: Cost Model: Found an estimated cost of 97 for instruction: %r = sdiv <8 x i8>
				}

				; Scalar udiv

				define i64 @fun11(i64 %a) {
				%r = udiv i64 %a, 20
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = udiv i64 %a, 20
				}

				define i32 @fun12(i32 %a) {
				%r = udiv i32 %a, 20
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = udiv i32 %a, 20
				}

				define i16 @fun13(i16 %a) {
				%r = udiv i16 %a, 20
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = udiv i16 %a, 20
				}

				define i8 @fun14(i8 %a) {
				%r = udiv i8 %a, 20
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = udiv i8
				}

				; Vector udiv

				define <2 x i64> @fun15(<2 x i64> %a) {
				%r = udiv <2 x i64> %a, <i64 20, i64 20>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 24 for instruction: %r = udiv <2 x i64>
				}

				define <4 x i32> @fun16(<4 x i32> %a) {
				%r = udiv <4 x i32> %a, <i32 20, i32 20, i32 20, i32 21>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 49 for instruction: %r = udiv <4 x i32>
				}

				define <2 x i32> @fun17(<2 x i32> %a) {
				%r = udiv <2 x i32> %a, <i32 20, i32 20>
				ret <2 x i32> %r
				; COST: Cost Model: Found an estimated cost of 25 for instruction: %r = udiv <2 x i32>
				}

				define <8 x i16> @fun18(<8 x i16> %a) {
				%r = udiv <8 x i16> %a, <i16 20, i16 20, i16 20, i16 20, i16 20, i16 20, i16 20, i16 21>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 97 for instruction: %r = udiv <8 x i16>
				}

				define <4 x i16> @fun19(<4 x i16> %a) {
				%r = udiv <4 x i16> %a, <i16 20, i16 20, i16 20, i16 20>
				ret <4 x i16> %r
				; COST: Cost Model: Found an estimated cost of 49 for instruction: %r = udiv <4 x i16>
				}

				define <16 x i8> @fun20(<16 x i8> %a) {
				%r = udiv <16 x i8> %a, <i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 21>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 193 for instruction: %r = udiv <16 x i8>
				}

				define <8 x i8> @fun21(<8 x i8> %a) {
				%r = udiv <8 x i8> %a, <i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20>
				ret <8 x i8> %r
				; COST: Cost Model: Found an estimated cost of 97 for instruction: %r = udiv <8 x i8>
				}

				; Scalar srem

				define i64 @fun22(i64 %a) {
				%r = srem i64 %a, 20
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = srem i64
				}

				define i32 @fun23(i32 %a) {
				%r = srem i32 %a, 20
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = srem i32
				}

				define i16 @fun24(i16 %a) {
				%r = srem i16 %a, 20
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = srem i16
				}

				define i8 @fun25(i8 %a) {
				%r = srem i8 %a, 20
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = srem i8
				}

				; Vector srem

				define <2 x i64> @fun26(<2 x i64> %a) {
				%r = srem <2 x i64> %a, <i64 20, i64 21>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 24 for instruction: %r = srem <2 x i64>
				}

				define <4 x i32> @fun27(<4 x i32> %a) {
				%r = srem <4 x i32> %a, <i32 20, i32 20, i32 20, i32 20>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 49 for instruction: %r = srem <4 x i32>
				}

				define <2 x i32> @fun28(<2 x i32> %a) {
				%r = srem <2 x i32> %a, <i32 20, i32 21>
				ret <2 x i32> %r
				; COST: Cost Model: Found an estimated cost of 25 for instruction: %r = srem <2 x i32>
				}

				define <8 x i16> @fun29(<8 x i16> %a) {
				%r = srem <8 x i16> %a, <i16 20, i16 20, i16 20, i16 20, i16 20, i16 20, i16 20, i16 20>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 97 for instruction: %r = srem <8 x i16>
				}

				define <4 x i16> @fun30(<4 x i16> %a) {
				%r = srem <4 x i16> %a, <i16 20, i16 20, i16 20, i16 21>
				ret <4 x i16> %r
				; COST: Cost Model: Found an estimated cost of 49 for instruction: %r = srem <4 x i16>
				}

				define <16 x i8> @fun31(<16 x i8> %a) {
				%r = srem <16 x i8> %a, <i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 193 for instruction: %r = srem <16 x i8>
				}

				define <8 x i8> @fun32(<8 x i8> %a) {
				%r = srem <8 x i8> %a, <i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 21>
				ret <8 x i8> %r
				; COST: Cost Model: Found an estimated cost of 97 for instruction: %r = srem <8 x i8>
				}

				; Scalar urem

				define i64 @fun33(i64 %a) {
				%r = urem i64 %a, 20
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = urem i64
				}

				define i32 @fun34(i32 %a) {
				%r = urem i32 %a, 20
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = urem i32
				}

				define i16 @fun35(i16 %a) {
				%r = urem i16 %a, 20
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = urem i16
				}

				define i8 @fun36(i8 %a) {
				%r = urem i8 %a, 20
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 10 for instruction: %r = urem i8
				}

				; Vector urem

				define <2 x i64> @fun37(<2 x i64> %a) {
				%r = urem <2 x i64> %a, <i64 20, i64 20>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 24 for instruction: %r = urem <2 x i64>
				}

				define <4 x i32> @fun38(<4 x i32> %a) {
				%r = urem <4 x i32> %a, <i32 20, i32 20, i32 20, i32 21>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 49 for instruction: %r = urem <4 x i32>
				}

				define <2 x i32> @fun39(<2 x i32> %a) {
				%r = urem <2 x i32> %a, <i32 20, i32 20>
				ret <2 x i32> %r
				; COST: Cost Model: Found an estimated cost of 25 for instruction: %r = urem <2 x i32>
				}

				define <8 x i16> @fun40(<8 x i16> %a) {
				%r = urem <8 x i16> %a, <i16 20, i16 20, i16 20, i16 20, i16 20, i16 20, i16 20, i16 21>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 97 for instruction: %r = urem <8 x i16>
				}

				define <4 x i16> @fun41(<4 x i16> %a) {
				%r = urem <4 x i16> %a, <i16 20, i16 20, i16 20, i16 20>
				ret <4 x i16> %r
				; COST: Cost Model: Found an estimated cost of 49 for instruction: %r = urem <4 x i16>
				}

				define <16 x i8> @fun42(<16 x i8> %a) {
				%r = urem <16 x i8> %a, <i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 21>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 193 for instruction: %r = urem <16 x i8>
				}

				define <8 x i8> @fun43(<8 x i8> %a) {
				%r = urem <8 x i8> %a, <i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20, i8 20>
				ret <8 x i8> %r
				; COST: Cost Model: Found an estimated cost of 97 for instruction: %r = urem <8 x i8>
				}

test/Analysis/CostModel/SystemZ/divrem-pow2.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \
				; RUN: \| FileCheck %s -check-prefix=COST

				; Check that all divide/remainder instructions are implemented by cheaper instructions.
				; RUN: llc < %s -mtriple=s390x-linux-gnu -mcpu=z13 -o - \| FileCheck %s
				; CHECK-NOT: dsg
				; CHECK-NOT: dl

				; Scalar sdiv

				define i64 @fun0(i64 %a) {
				%r = sdiv i64 %a, 2
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i64 %a, 2
				}

				define i64 @fun1(i64 %a) {
				%r = sdiv i64 %a, -4
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i64 %a, -4
				}

				define i32 @fun2(i32 %a) {
				%r = sdiv i32 %a, 8
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i32 %a, 8
				}

				define i32 @fun3(i32 %a) {
				%r = sdiv i32 %a, -16
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i32 %a, -16
				}

				define i16 @fun4(i16 %a) {
				%r = sdiv i16 %a, 32
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i16 %a, 32
				}

				define i16 @fun5(i16 %a) {
				%r = sdiv i16 %a, -64
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i16 %a, -64
				}

				define i8 @fun6(i8 %a) {
				%r = sdiv i8 %a, 64
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i8 %a, 64
				}

				define i8 @fun7(i8 %a) {
				%r = sdiv i8 %a, -128
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv i8 %a, -128
				}

				; Vector sdiv

				define <2 x i64> @fun8(<2 x i64> %a) {
				%r = sdiv <2 x i64> %a, <i64 2, i64 2>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <2 x i64> %a, <i64 2, i64 2>
				}

				define <2 x i64> @fun9(<2 x i64> %a) {
				%r = sdiv <2 x i64> %a, <i64 -4, i64 -4>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <2 x i64> %a, <i64 -4, i64 -4>
				}

				define <4 x i32> @fun10(<4 x i32> %a) {
				%r = sdiv <4 x i32> %a, <i32 8, i32 8, i32 8, i32 8>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <4 x i32> %a, <i32 8, i32 8, i32 8, i32 8>
				}

				define <4 x i32> @fun11(<4 x i32> %a) {
				%r = sdiv <4 x i32> %a, <i32 -16, i32 -16, i32 -16, i32 -16>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <4 x i32> %a, <i32 -16
				}

				define <2 x i32> @fun12(<2 x i32> %a) {
				%r = sdiv <2 x i32> %a, <i32 -16, i32 -16>
				ret <2 x i32> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <2 x i32> %a, <i32 -16
				}

				define <8 x i16> @fun13(<8 x i16> %a) {
				%r = sdiv <8 x i16> %a, <i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <8 x i16> %a, <i16 32
				}

				define <8 x i16> @fun14(<8 x i16> %a) {
				%r = sdiv <8 x i16> %a, <i16 -64, i16 -64, i16 -64, i16 -64, i16 -64, i16 -64, i16 -64, i16 -64>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <8 x i16> %a, <i16 -64
				}

				define <4 x i16> @fun15(<4 x i16> %a) {
				%r = sdiv <4 x i16> %a, <i16 32, i16 32, i16 32, i16 32>
				ret <4 x i16> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <4 x i16> %a, <i16 32
				}

				define <16 x i8> @fun16(<16 x i8> %a) {
				%r = sdiv <16 x i8> %a, <i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <16 x i8> %a, <i8 64
				}

				define <16 x i8> @fun17(<16 x i8> %a) {
				%r = sdiv <16 x i8> %a, <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <16 x i8> %a, <i8 -128
				}

				define <8 x i8> @fun18(<8 x i8> %a) {
				%r = sdiv <8 x i8> %a, <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>
				ret <8 x i8> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = sdiv <8 x i8> %a, <i8 -128
				}

				; Scalar udiv

				define i64 @fun19(i64 %a) {
				%r = udiv i64 %a, 2
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv i64 %a, 2
				}

				define i32 @fun20(i32 %a) {
				%r = udiv i32 %a, 8
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv i32 %a, 8
				}

				define i16 @fun21(i16 %a) {
				%r = udiv i16 %a, 32
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv i16 %a, 32
				}

				define i8 @fun22(i8 %a) {
				%r = udiv i8 %a, 128
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv i8 %a, -128
				}

				; Vector udiv

				define <2 x i64> @fun23(<2 x i64> %a) {
				%r = udiv <2 x i64> %a, <i64 2, i64 2>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <2 x i64> %a, <i64 2
				}

				define <4 x i32> @fun24(<4 x i32> %a) {
				%r = udiv <4 x i32> %a, <i32 8, i32 8, i32 8, i32 8>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <4 x i32> %a, <i32 8
				}

				define <2 x i32> @fun25(<2 x i32> %a) {
				%r = udiv <2 x i32> %a, <i32 8, i32 8>
				ret <2 x i32> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <2 x i32> %a, <i32 8
				}

				define <8 x i16> @fun26(<8 x i16> %a) {
				%r = udiv <8 x i16> %a, <i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <8 x i16> %a, <i16 32
				}

				define <4 x i16> @fun27(<4 x i16> %a) {
				%r = udiv <4 x i16> %a, <i16 32, i16 32, i16 32, i16 32>
				ret <4 x i16> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <4 x i16> %a, <i16 32
				}

				define <16 x i8> @fun28(<16 x i8> %a) {
				%r = udiv <16 x i8> %a, <i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <16 x i8> %a, <i8 -128
				}

				define <8 x i8> @fun29(<8 x i8> %a) {
				%r = udiv <8 x i8> %a, <i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128>
				ret <8 x i8> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = udiv <8 x i8> %a, <i8 -128
				}

				; Scalar srem

				define i64 @fun30(i64 %a) {
				%r = srem i64 %a, 2
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem i64 %a, 2
				}

				define i64 @fun31(i64 %a) {
				%r = srem i64 %a, -4
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem i64 %a, -4
				}

				define i32 @fun32(i32 %a) {
				%r = srem i32 %a, 8
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem i32 %a, 8
				}

				define i32 @fun33(i32 %a) {
				%r = srem i32 %a, -16
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem i32 %a, -16
				}

				define i16 @fun34(i16 %a) {
				%r = srem i16 %a, 32
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem i16 %a, 32
				}

				define i16 @fun35(i16 %a) {
				%r = srem i16 %a, -64
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem i16 %a, -64
				}

				define i8 @fun36(i8 %a) {
				%r = srem i8 %a, 64
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem i8 %a, 64
				}

				define i8 @fun37(i8 %a) {
				%r = srem i8 %a, -128
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem i8 %a, -128
				}

				; Vector srem

				define <2 x i64> @fun38(<2 x i64> %a) {
				%r = srem <2 x i64> %a, <i64 2, i64 2>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <2 x i64> %a, <i64 2, i64 2>
				}

				define <2 x i64> @fun39(<2 x i64> %a) {
				%r = srem <2 x i64> %a, <i64 -4, i64 -4>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <2 x i64> %a, <i64 -4, i64 -4>
				}

				define <4 x i32> @fun40(<4 x i32> %a) {
				%r = srem <4 x i32> %a, <i32 8, i32 8, i32 8, i32 8>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <4 x i32> %a, <i32 8, i32 8, i32 8, i32 8>
				}

				define <4 x i32> @fun41(<4 x i32> %a) {
				%r = srem <4 x i32> %a, <i32 -16, i32 -16, i32 -16, i32 -16>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <4 x i32> %a, <i32 -16
				}

				define <2 x i32> @fun42(<2 x i32> %a) {
				%r = srem <2 x i32> %a, <i32 -16, i32 -16>
				ret <2 x i32> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <2 x i32> %a, <i32 -16
				}

				define <8 x i16> @fun43(<8 x i16> %a) {
				%r = srem <8 x i16> %a, <i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <8 x i16> %a, <i16 32
				}

				define <8 x i16> @fun44(<8 x i16> %a) {
				%r = srem <8 x i16> %a, <i16 -64, i16 -64, i16 -64, i16 -64, i16 -64, i16 -64, i16 -64, i16 -64>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <8 x i16> %a, <i16 -64
				}

				define <4 x i16> @fun45(<4 x i16> %a) {
				%r = srem <4 x i16> %a, <i16 32, i16 32, i16 32, i16 32>
				ret <4 x i16> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <4 x i16> %a, <i16 32
				}

				define <16 x i8> @fun46(<16 x i8> %a) {
				%r = srem <16 x i8> %a, <i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64, i8 64>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <16 x i8> %a, <i8 64
				}

				define <16 x i8> @fun47(<16 x i8> %a) {
				%r = srem <16 x i8> %a, <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <16 x i8> %a, <i8 -128
				}

				define <8 x i8> @fun48(<8 x i8> %a) {
				%r = srem <8 x i8> %a, <i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128, i8 -128>
				ret <8 x i8> %r
				; COST: Cost Model: Found an estimated cost of 4 for instruction: %r = srem <8 x i8> %a, <i8 -128
				}

				; Scalar urem

				define i64 @fun49(i64 %a) {
				%r = urem i64 %a, 2
				ret i64 %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem i64 %a, 2
				}

				define i32 @fun50(i32 %a) {
				%r = urem i32 %a, 8
				ret i32 %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem i32 %a, 8
				}

				define i16 @fun51(i16 %a) {
				%r = urem i16 %a, 32
				ret i16 %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem i16 %a, 32
				}

				define i8 @fun52(i8 %a) {
				%r = urem i8 %a, 128
				ret i8 %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem i8 %a, -128
				}

				; Vector urem

				define <2 x i64> @fun53(<2 x i64> %a) {
				%r = urem <2 x i64> %a, <i64 2, i64 2>
				ret <2 x i64> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem <2 x i64> %a, <i64 2
				}

				define <4 x i32> @fun54(<4 x i32> %a) {
				%r = urem <4 x i32> %a, <i32 8, i32 8, i32 8, i32 8>
				ret <4 x i32> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem <4 x i32> %a, <i32 8
				}

				define <2 x i32> @fun55(<2 x i32> %a) {
				%r = urem <2 x i32> %a, <i32 8, i32 8>
				ret <2 x i32> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem <2 x i32> %a, <i32 8
				}

				define <8 x i16> @fun56(<8 x i16> %a) {
				%r = urem <8 x i16> %a, <i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32, i16 32>
				ret <8 x i16> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem <8 x i16> %a, <i16 32
				}

				define <4 x i16> @fun57(<4 x i16> %a) {
				%r = urem <4 x i16> %a, <i16 32, i16 32, i16 32, i16 32>
				ret <4 x i16> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem <4 x i16> %a, <i16 32
				}

				define <16 x i8> @fun58(<16 x i8> %a) {
				%r = urem <16 x i8> %a, <i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128>
				ret <16 x i8> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem <16 x i8> %a, <i8 -128
				}

				define <8 x i8> @fun59(<8 x i8> %a) {
				%r = urem <8 x i8> %a, <i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128, i8 128>
				ret <8 x i8> %r
				; COST: Cost Model: Found an estimated cost of 1 for instruction: %r = urem <8 x i8> %a, <i8 -128
				}

test/Analysis/CostModel/SystemZ/divrem-reg.ll

This file was added.

				; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s

				; Check costs of divisions by register
				;
				; Note: Vectorization of division/remainder is temporarily disabled for high
				; vectorization factors by returning 1000.

				; Scalar sdiv

				define i64 @fun0(i64 %a, i64 %b) {
				%r = sdiv i64 %a, %b
				ret i64 %r
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %r = sdiv i64
				}

				define i32 @fun1(i32 %a, i32 %b) {
				%r = sdiv i32 %a, %b
				ret i32 %r
				; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %r = sdiv i32 %a, %b
				}

				define i16 @fun2(i16 %a, i16 %b) {
				%r = sdiv i16 %a, %b
				ret i16 %r
				; CHECK: Cost Model: Found an estimated cost of 23 for instruction: %r = sdiv i16 %a, %b
				}

				define i8 @fun3(i8 %a, i8 %b) {
				%r = sdiv i8 %a, %b
				ret i8 %r
				; CHECK: Cost Model: Found an estimated cost of 23 for instruction: %r = sdiv i8 %a, %b
				}

				; Vector sdiv

				define <2 x i64> @fun4(<2 x i64> %a, <2 x i64> %b) {
				%r = sdiv <2 x i64> %a, %b
				ret <2 x i64> %r
				; CHECK: Cost Model: Found an estimated cost of 47 for instruction: %r = sdiv <2 x i64>
				}

				define <4 x i32> @fun5(<4 x i32> %a, <4 x i32> %b) {
				%r = sdiv <4 x i32> %a, %b
				ret <4 x i32> %r
				; CHECK: Cost Model: Found an estimated cost of 98 for instruction: %r = sdiv <4 x i32>
				}

				define <2 x i32> @fun6(<2 x i32> %a, <2 x i32> %b) {
				%r = sdiv <2 x i32> %a, %b
				ret <2 x i32> %r
				; CHECK: Cost Model: Found an estimated cost of 50 for instruction: %r = sdiv <2 x i32>
				}

				define <8 x i16> @fun7(<8 x i16> %a, <8 x i16> %b) {
				%r = sdiv <8 x i16> %a, %b
				ret <8 x i16> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = sdiv <8 x i16>
				}

				define <4 x i16> @fun8(<4 x i16> %a, <4 x i16> %b) {
				%r = sdiv <4 x i16> %a, %b
				ret <4 x i16> %r
				; CHECK: Cost Model: Found an estimated cost of 106 for instruction: %r = sdiv <4 x i16>
				}

				define <16 x i8> @fun9(<16 x i8> %a, <16 x i8> %b) {
				%r = sdiv <16 x i8> %a, %b
				ret <16 x i8> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = sdiv <16 x i8>
				}

				define <8 x i8> @fun10(<8 x i8> %a, <8 x i8> %b) {
				%r = sdiv <8 x i8> %a, %b
				ret <8 x i8> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = sdiv <8 x i8>
				}

				; Scalar udiv

				define i64 @fun11(i64 %a, i64 %b) {
				%r = udiv i64 %a, %b
				ret i64 %r
				; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %r = udiv i64 %a, %b
				}

				define i32 @fun12(i32 %a, i32 %b) {
				%r = udiv i32 %a, %b
				ret i32 %r
				; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %r = udiv i32
				}

				define i16 @fun13(i16 %a, i16 %b) {
				%r = udiv i16 %a, %b
				ret i16 %r
				; CHECK: Cost Model: Found an estimated cost of 23 for instruction: %r = udiv i16
				}

				define i8 @fun14(i8 %a, i8 %b) {
				%r = udiv i8 %a, %b
				ret i8 %r
				; CHECK: Cost Model: Found an estimated cost of 23 for instruction: %r = udiv i8
				}

				; Vector udiv

				define <2 x i64> @fun15(<2 x i64> %a, <2 x i64> %b) {
				%r = udiv <2 x i64> %a, %b
				ret <2 x i64> %r
				; CHECK: Cost Model: Found an estimated cost of 49 for instruction: %r = udiv <2 x i64>
				}

				define <4 x i32> @fun16(<4 x i32> %a, <4 x i32> %b) {
				%r = udiv <4 x i32> %a, %b
				ret <4 x i32> %r
				; CHECK: Cost Model: Found an estimated cost of 98 for instruction: %r = udiv <4 x i32>
				}

				define <2 x i32> @fun17(<2 x i32> %a, <2 x i32> %b) {
				%r = udiv <2 x i32> %a, %b
				ret <2 x i32> %r
				; CHECK: Cost Model: Found an estimated cost of 50 for instruction: %r = udiv <2 x i32>
				}

				define <8 x i16> @fun18(<8 x i16> %a, <8 x i16> %b) {
				%r = udiv <8 x i16> %a, %b
				ret <8 x i16> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = udiv <8 x i16>
				}

				define <4 x i16> @fun19(<4 x i16> %a, <4 x i16> %b) {
				%r = udiv <4 x i16> %a, %b
				ret <4 x i16> %r
				; CHECK: Cost Model: Found an estimated cost of 106 for instruction: %r = udiv <4 x i16>
				}

				define <16 x i8> @fun20(<16 x i8> %a, <16 x i8> %b) {
				%r = udiv <16 x i8> %a, %b
				ret <16 x i8> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = udiv <16 x i8>
				}

				define <8 x i8> @fun21(<8 x i8> %a, <8 x i8> %b) {
				%r = udiv <8 x i8> %a, %b
				ret <8 x i8> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = udiv <8 x i8>
				}

				; Scalar srem

				define i64 @fun22(i64 %a, i64 %b) {
				%r = srem i64 %a, %b
				ret i64 %r
				; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %r = srem i64
				}

				define i32 @fun23(i32 %a, i32 %b) {
				%r = srem i32 %a, %b
				ret i32 %r
				; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %r = srem i32
				}

				define i16 @fun24(i16 %a, i16 %b) {
				%r = srem i16 %a, %b
				ret i16 %r
				; CHECK: Cost Model: Found an estimated cost of 23 for instruction: %r = srem i16
				}

				define i8 @fun25(i8 %a, i8 %b) {
				%r = srem i8 %a, %b
				ret i8 %r
				; CHECK: Cost Model: Found an estimated cost of 23 for instruction: %r = srem i8
				}

				; Vector srem

				define <2 x i64> @fun26(<2 x i64> %a, <2 x i64> %b) {
				%r = srem <2 x i64> %a, %b
				ret <2 x i64> %r
				; CHECK: Cost Model: Found an estimated cost of 47 for instruction: %r = srem <2 x i64>
				}

				define <4 x i32> @fun27(<4 x i32> %a, <4 x i32> %b) {
				%r = srem <4 x i32> %a, %b
				ret <4 x i32> %r
				; CHECK: Cost Model: Found an estimated cost of 98 for instruction: %r = srem <4 x i32>
				}

				define <2 x i32> @fun28(<2 x i32> %a, <2 x i32> %b) {
				%r = srem <2 x i32> %a, %b
				ret <2 x i32> %r
				; CHECK: Cost Model: Found an estimated cost of 50 for instruction: %r = srem <2 x i32>
				}

				define <8 x i16> @fun29(<8 x i16> %a, <8 x i16> %b) {
				%r = srem <8 x i16> %a, %b
				ret <8 x i16> %r
				; CHECK: ost Model: Found an estimated cost of 1000 for instruction: %r = srem <8 x i16>
				}

				define <4 x i16> @fun30(<4 x i16> %a, <4 x i16> %b) {
				%r = srem <4 x i16> %a, %b
				ret <4 x i16> %r
				; CHECK: Cost Model: Found an estimated cost of 106 for instruction: %r = srem <4 x i16>
				}

				define <16 x i8> @fun31(<16 x i8> %a, <16 x i8> %b) {
				%r = srem <16 x i8> %a, %b
				ret <16 x i8> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = srem <16 x i8>
				}

				define <8 x i8> @fun32(<8 x i8> %a, <8 x i8> %b) {
				%r = srem <8 x i8> %a, %b
				ret <8 x i8> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = srem <8 x i8>
				}

				; Scalar urem

				define i64 @fun33(i64 %a, i64 %b) {
				%r = urem i64 %a, %b
				ret i64 %r
				; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %r = urem i64
				}

				define i32 @fun34(i32 %a, i32 %b) {
				%r = urem i32 %a, %b
				ret i32 %r
				; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %r = urem i32
				}

				define i16 @fun35(i16 %a, i16 %b) {
				%r = urem i16 %a, %b
				ret i16 %r
				; CHECK: Cost Model: Found an estimated cost of 23 for instruction: %r = urem i16
				}

				define i8 @fun36(i8 %a, i8 %b) {
				%r = urem i8 %a, %b
				ret i8 %r
				; CHECK: Cost Model: Found an estimated cost of 23 for instruction: %r = urem i8
				}

				; Vector urem

				define <2 x i64> @fun37(<2 x i64> %a, <2 x i64> %b) {
				%r = urem <2 x i64> %a, %b
				ret <2 x i64> %r
				; CHECK: Cost Model: Found an estimated cost of 49 for instruction: %r = urem <2 x i64>
				}

				define <4 x i32> @fun38(<4 x i32> %a, <4 x i32> %b) {
				%r = urem <4 x i32> %a, %b
				ret <4 x i32> %r
				; CHECK: Cost Model: Found an estimated cost of 98 for instruction: %r = urem <4 x i32>
				}

				define <2 x i32> @fun39(<2 x i32> %a, <2 x i32> %b) {
				%r = urem <2 x i32> %a, %b
				ret <2 x i32> %r
				; CHECK: Cost Model: Found an estimated cost of 50 for instruction: %r = urem <2 x i32>
				}

				define <8 x i16> @fun40(<8 x i16> %a, <8 x i16> %b) {
				%r = urem <8 x i16> %a, %b
				ret <8 x i16> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = urem <8 x i16>
				}

				define <4 x i16> @fun41(<4 x i16> %a, <4 x i16> %b) {
				%r = urem <4 x i16> %a, %b
				ret <4 x i16> %r
				; CHECK: Cost Model: Found an estimated cost of 106 for instruction: %r = urem <4 x i16>
				}

				define <16 x i8> @fun42(<16 x i8> %a, <16 x i8> %b) {
				%r = urem <16 x i8> %a, %b
				ret <16 x i8> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = urem <16 x i8>
				}

				define <8 x i8> @fun43(<8 x i8> %a, <8 x i8> %b) {
				%r = urem <8 x i8> %a, %b
				ret <8 x i8> %r
				; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %r = urem <8 x i8>
				}

test/Analysis/CostModel/SystemZ/int-arith.ll

	; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s			; RUN: opt < %s -cost-model -analyze -mtriple=systemz-unknown -mcpu=z13 \| FileCheck %s
	;			;
	; Note: The scalarized vector instructions costs are not including any			; Note: The scalarized vector instructions costs are not including any
	; extracts, due to the undef operands.			; extracts, due to the undef operands.
	;
	; Note: Vectorization of division/remainder is temporarily disabled for high
	; vectorization factors by returning 1000.

	define void @add() {			define void @add() {
	%res0 = add i8 undef, undef			%res0 = add i8 undef, undef
	%res1 = add i16 undef, undef			%res1 = add i16 undef, undef
	%res2 = add i32 undef, undef			%res2 = add i32 undef, undef
	%res3 = add i64 undef, undef			%res3 = add i64 undef, undef
	%res4 = add <2 x i8> undef, undef			%res4 = add <2 x i8> undef, undef
	%res5 = add <2 x i16> undef, undef			%res5 = add <2 x i16> undef, undef
	▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines
	; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res15 = mul <8 x i64> undef, undef			; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res15 = mul <8 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = mul <16 x i8> undef, undef			; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res16 = mul <16 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = mul <16 x i16> undef, undef			; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res17 = mul <16 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = mul <16 x i32> undef, undef			; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res18 = mul <16 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %res19 = mul <16 x i64> undef, undef			; CHECK: Cost Model: Found an estimated cost of 24 for instruction: %res19 = mul <16 x i64> undef, undef

	ret void;			ret void;
	}			}

	define void @sdiv() {
	%res0 = sdiv i8 undef, undef
	%res1 = sdiv i16 undef, undef
	%res2 = sdiv i32 undef, undef
	%res3 = sdiv i64 undef, undef
	%res4 = sdiv <2 x i8> undef, undef
	%res5 = sdiv <2 x i16> undef, undef
	%res6 = sdiv <2 x i32> undef, undef
	%res7 = sdiv <2 x i64> undef, undef
	%res8 = sdiv <4 x i8> undef, undef
	%res9 = sdiv <4 x i16> undef, undef
	%res10 = sdiv <4 x i32> undef, undef
	%res11 = sdiv <4 x i64> undef, undef
	%res12 = sdiv <8 x i8> undef, undef
	%res13 = sdiv <8 x i16> undef, undef
	%res14 = sdiv <8 x i32> undef, undef
	%res15 = sdiv <8 x i64> undef, undef
	%res16 = sdiv <16 x i8> undef, undef
	%res17 = sdiv <16 x i16> undef, undef
	%res18 = sdiv <16 x i32> undef, undef
	%res19 = sdiv <16 x i64> undef, undef

	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res0 = sdiv i8 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res1 = sdiv i16 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res2 = sdiv i32 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = sdiv i64 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res4 = sdiv <2 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res5 = sdiv <2 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res6 = sdiv <2 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %res7 = sdiv <2 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res8 = sdiv <4 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res9 = sdiv <4 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res10 = sdiv <4 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res11 = sdiv <4 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res12 = sdiv <8 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res13 = sdiv <8 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res14 = sdiv <8 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res15 = sdiv <8 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res16 = sdiv <16 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res17 = sdiv <16 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res18 = sdiv <16 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res19 = sdiv <16 x i64> undef, undef

	ret void;
	}

	define void @srem() {
	%res0 = srem i8 undef, undef
	%res1 = srem i16 undef, undef
	%res2 = srem i32 undef, undef
	%res3 = srem i64 undef, undef
	%res4 = srem <2 x i8> undef, undef
	%res5 = srem <2 x i16> undef, undef
	%res6 = srem <2 x i32> undef, undef
	%res7 = srem <2 x i64> undef, undef
	%res8 = srem <4 x i8> undef, undef
	%res9 = srem <4 x i16> undef, undef
	%res10 = srem <4 x i32> undef, undef
	%res11 = srem <4 x i64> undef, undef
	%res12 = srem <8 x i8> undef, undef
	%res13 = srem <8 x i16> undef, undef
	%res14 = srem <8 x i32> undef, undef
	%res15 = srem <8 x i64> undef, undef
	%res16 = srem <16 x i8> undef, undef
	%res17 = srem <16 x i16> undef, undef
	%res18 = srem <16 x i32> undef, undef
	%res19 = srem <16 x i64> undef, undef

	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res0 = srem i8 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res1 = srem i16 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res2 = srem i32 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %res3 = srem i64 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res4 = srem <2 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res5 = srem <2 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res6 = srem <2 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 3 for instruction: %res7 = srem <2 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res8 = srem <4 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res9 = srem <4 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res10 = srem <4 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res11 = srem <4 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res12 = srem <8 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res13 = srem <8 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res14 = srem <8 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res15 = srem <8 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res16 = srem <16 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res17 = srem <16 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res18 = srem <16 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res19 = srem <16 x i64> undef, undef

	ret void;
	}

	define void @udiv() {
	%res0 = udiv i8 undef, undef
	%res1 = udiv i16 undef, undef
	%res2 = udiv i32 undef, undef
	%res3 = udiv i64 undef, undef
	%res4 = udiv <2 x i8> undef, undef
	%res5 = udiv <2 x i16> undef, undef
	%res6 = udiv <2 x i32> undef, undef
	%res7 = udiv <2 x i64> undef, undef
	%res8 = udiv <4 x i8> undef, undef
	%res9 = udiv <4 x i16> undef, undef
	%res10 = udiv <4 x i32> undef, undef
	%res11 = udiv <4 x i64> undef, undef
	%res12 = udiv <8 x i8> undef, undef
	%res13 = udiv <8 x i16> undef, undef
	%res14 = udiv <8 x i32> undef, undef
	%res15 = udiv <8 x i64> undef, undef
	%res16 = udiv <16 x i8> undef, undef
	%res17 = udiv <16 x i16> undef, undef
	%res18 = udiv <16 x i32> undef, undef
	%res19 = udiv <16 x i64> undef, undef

	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res0 = udiv i8 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res1 = udiv i16 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res2 = udiv i32 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res3 = udiv i64 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res4 = udiv <2 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res5 = udiv <2 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res6 = udiv <2 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 5 for instruction: %res7 = udiv <2 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res8 = udiv <4 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res9 = udiv <4 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res10 = udiv <4 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res11 = udiv <4 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res12 = udiv <8 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res13 = udiv <8 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res14 = udiv <8 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res15 = udiv <8 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res16 = udiv <16 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res17 = udiv <16 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res18 = udiv <16 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res19 = udiv <16 x i64> undef, undef

	ret void;
	}

	define void @urem() {
	%res0 = urem i8 undef, undef
	%res1 = urem i16 undef, undef
	%res2 = urem i32 undef, undef
	%res3 = urem i64 undef, undef
	%res4 = urem <2 x i8> undef, undef
	%res5 = urem <2 x i16> undef, undef
	%res6 = urem <2 x i32> undef, undef
	%res7 = urem <2 x i64> undef, undef
	%res8 = urem <4 x i8> undef, undef
	%res9 = urem <4 x i16> undef, undef
	%res10 = urem <4 x i32> undef, undef
	%res11 = urem <4 x i64> undef, undef
	%res12 = urem <8 x i8> undef, undef
	%res13 = urem <8 x i16> undef, undef
	%res14 = urem <8 x i32> undef, undef
	%res15 = urem <8 x i64> undef, undef
	%res16 = urem <16 x i8> undef, undef
	%res17 = urem <16 x i16> undef, undef
	%res18 = urem <16 x i32> undef, undef
	%res19 = urem <16 x i64> undef, undef

	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res0 = urem i8 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 4 for instruction: %res1 = urem i16 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res2 = urem i32 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %res3 = urem i64 undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res4 = urem <2 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res5 = urem <2 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 6 for instruction: %res6 = urem <2 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 5 for instruction: %res7 = urem <2 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res8 = urem <4 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %res9 = urem <4 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 12 for instruction: %res10 = urem <4 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 10 for instruction: %res11 = urem <4 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res12 = urem <8 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res13 = urem <8 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res14 = urem <8 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res15 = urem <8 x i64> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res16 = urem <16 x i8> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res17 = urem <16 x i16> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res18 = urem <16 x i32> undef, undef
	; CHECK: Cost Model: Found an estimated cost of 1000 for instruction: %res19 = urem <16 x i64> undef, undef

	ret void;
	}

test/Analysis/CostModel/SystemZ/memop-folding-int-arith.ll

	Show First 20 Lines • Show All 84 Lines • ▼ Show 20 Lines
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %2 = mul i32 %li32_0, %li32_1			; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %2 = mul i32 %li32_0, %li32_1
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64 = load i64, i64* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64 = load i64, i64* undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %3 = mul i64 %li64, undef			; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %3 = mul i64 %li64, undef
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64_0 = load i64, i64* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64_0 = load i64, i64* undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li64_1 = load i64, i64* undef			; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li64_1 = load i64, i64* undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %4 = mul i64 %li64_0, %li64_1			; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %4 = mul i64 %li64_0, %li64_1
	}			}

	define void @sdiv() {			define void @sdiv(i32 %arg32, i64 %arg64) {
	%li32 = load i32, i32* undef			%li32 = load i32, i32* undef
	sdiv i32 %li32, undef			sdiv i32 %li32, %arg32

	%li32_0 = load i32, i32* undef			%li32_0 = load i32, i32* undef
	%li32_1 = load i32, i32* undef			%li32_1 = load i32, i32* undef
	sdiv i32 %li32_0, %li32_1			sdiv i32 %li32_0, %li32_1

	%li64 = load i64, i64* undef			%li64 = load i64, i64* undef
	sdiv i64 %li64, undef			sdiv i64 %li64, %arg64

	%li64_0 = load i64, i64* undef			%li64_0 = load i64, i64* undef
	%li64_1 = load i64, i64* undef			%li64_1 = load i64, i64* undef
	sdiv i64 %li64_0, %li64_1			sdiv i64 %li64_0, %li64_1

	ret void;			ret void;
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li32 = load i32, i32* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li32 = load i32, i32* undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %1 = sdiv i32 %li32, undef			; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %1 = sdiv i32 %li32, %arg32
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li32_0 = load i32, i32* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li32_0 = load i32, i32* undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li32_1 = load i32, i32* undef			; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li32_1 = load i32, i32* undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %2 = sdiv i32 %li32_0, %li32_1			; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %2 = sdiv i32 %li32_0, %li32_1
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64 = load i64, i64* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64 = load i64, i64* undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %3 = sdiv i64 %li64, undef			; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %3 = sdiv i64 %li64, %arg64
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64_0 = load i64, i64* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64_0 = load i64, i64* undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li64_1 = load i64, i64* undef			; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li64_1 = load i64, i64* undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %4 = sdiv i64 %li64_0, %li64_1			; CHECK: Cost Model: Found an estimated cost of 20 for instruction: %4 = sdiv i64 %li64_0, %li64_1
	}			}

	define void @udiv() {			define void @udiv(i32 %arg32, i64 %arg64) {
	%li32 = load i32, i32* undef			%li32 = load i32, i32* undef
	udiv i32 %li32, undef			udiv i32 %li32, %arg32

	%li32_0 = load i32, i32* undef			%li32_0 = load i32, i32* undef
	%li32_1 = load i32, i32* undef			%li32_1 = load i32, i32* undef
	udiv i32 %li32_0, %li32_1			udiv i32 %li32_0, %li32_1

	%li64 = load i64, i64* undef			%li64 = load i64, i64* undef
	udiv i64 %li64, undef			udiv i64 %li64, %arg64

	%li64_0 = load i64, i64* undef			%li64_0 = load i64, i64* undef
	%li64_1 = load i64, i64* undef			%li64_1 = load i64, i64* undef
	udiv i64 %li64_0, %li64_1			udiv i64 %li64_0, %li64_1

	ret void;			ret void;
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li32 = load i32, i32* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li32 = load i32, i32* undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %1 = udiv i32 %li32, undef			; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %1 = udiv i32 %li32, %arg32
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li32_0 = load i32, i32* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li32_0 = load i32, i32* undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li32_1 = load i32, i32* undef			; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li32_1 = load i32, i32* undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %2 = udiv i32 %li32_0, %li32_1			; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %2 = udiv i32 %li32_0, %li32_1
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64 = load i64, i64* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64 = load i64, i64* undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %3 = udiv i64 %li64, undef			; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %3 = udiv i64 %li64, %arg64
	; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64_0 = load i64, i64* undef			; CHECK: Cost Model: Found an estimated cost of 0 for instruction: %li64_0 = load i64, i64* undef
	; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li64_1 = load i64, i64* undef			; CHECK: Cost Model: Found an estimated cost of 1 for instruction: %li64_1 = load i64, i64* undef
	; CHECK: Cost Model: Found an estimated cost of 2 for instruction: %4 = udiv i64 %li64_0, %li64_1			; CHECK: Cost Model: Found an estimated cost of 21 for instruction: %4 = udiv i64 %li64_0, %li64_1
	}			}

	define void @and() {			define void @and() {
	%li32 = load i32, i32* undef			%li32 = load i32, i32* undef
	and i32 %li32, undef			and i32 %li32, undef

	%li32_0 = load i32, i32* undef			%li32_0 = load i32, i32* undef
	%li32_1 = load i32, i32* undef			%li32_1 = load i32, i32* undef
	▲ Show 20 Lines • Show All 105 Lines • Show Last 20 Lines