This is an archive of the discontinued LLVM Phabricator instance.

[CodeGen] Prepare for introduction of v3 and v5 MVTs
ClosedPublic

Authored by tpr on Mar 4 2019, 7:05 AM.

Details

Reviewers

craig.topper
arsenm
efriedma
echristo

Commits

rGc302b9b5fe0e: [CodeGen] Prepare for introduction of v3 and v5 MVTs
rL356350: [CodeGen] Prepare for introduction of v3 and v5 MVTs

Summary

AMDGPU would like to have MVTs for v3i32, v3f32, v5i32, v5f32. This
commit does not add them, but makes preparatory changes:

Exclude non-legal non-power-of-2 vector types from ComputeRegisterProp mechanism in TargetLoweringBase::getTypeConversion.

Cope with SETCC and VSELECT on an odd-width i1 vector when the other vectors are of a legal type.

Fixed an assumption of power-of-2 vector type in ARM.

Fixed assumptions of power-of-2 vector type in AMDGPU kernel arg handling.

Fixed AMDGPU cost analysis to behave the same.

Some of this patch is from Matt Arsenault, also of AMD.

Change-Id: Ib5f23377dbef511be3a936211a0b9f94e46331f8

Diff Detail

Repository: rL LLVM

Event Timeline

tpr created this revision. Mar 4 2019, 7:05 AM

Herald added a project: Restricted Project. Mar 4 2019, 7:05 AM

Herald added subscribers: llvm-commits, jdoerfert, kristof.beyls and 4 others.

Harbormaster completed remote builds in B28746: Diff 189142. Mar 4 2019, 7:05 AM

tpr added a child revision: D58901: [CodeGen] Defined MVTs v3i32, v3f32, v5i32, v5f32. Mar 4 2019, 7:25 AM

The ARM patch, at least, should be split into a separate patch.

lib/Target/AMDGPU/AMDGPUISelLowering.cpp
1034 ↗	(On Diff #189142)	Braces
1036 ↗	(On Diff #189142)	Should also make sure to add some 5x vector argument tests to the kernarg tests since you are adding those
lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
343 ↗	(On Diff #189142)	This could use a test in test/Analysis/CostModel/AMDGPU
lib/Target/ARM/ARMISelLowering.cpp
12153 ↗	(On Diff #189142)	Combine the last 2 checks into NumLanes >= 3
12211 ↗	(On Diff #189142)	Ditto

tpr added reviewers: craig.topper, arsenm, efriedma, echristo. Mar 4 2019, 7:39 AM

Herald added a subscriber: wdng. Mar 4 2019, 7:39 AM

tpr marked 3 inline comments as done. Mar 4 2019, 11:59 AM

tpr added inline comments.

lib/Target/ARM/ARMISelLowering.cpp
12153 ↗	(On Diff #189142)	That won't work -- it needs to be true for ==3 and >4, but not for ==4.
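
The disagreement between the two predicates can be checked directly. The following is a minimal sketch, assuming the condition under discussion has the shape "3 lanes, or more than 4 lanes" as described in the reply above; the function names are hypothetical and are not taken from ARMISelLowering.cpp.

```cpp
// Hypothetical names; the predicate shape is inferred from the review reply,
// not copied from the ARM backend.

// Condition as described in the reply: true for 3 lanes and for more than 4
// lanes, false for exactly 4 lanes.
constexpr bool originalCheck(unsigned NumLanes) {
  return NumLanes == 3 || NumLanes > 4;
}

// The simplification suggested in the review.
constexpr bool combinedCheck(unsigned NumLanes) { return NumLanes >= 3; }

// The two predicates differ at exactly one relevant lane count.
static_assert(originalCheck(4) != combinedCheck(4),
              "predicates disagree for 4 lanes");
static_assert(originalCheck(3) == combinedCheck(3),
              "predicates agree for 3 lanes");
```

Under this reading, a 4-lane vector is the one case that the combined check would wrongly accept.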

The general idea makes sense, I think.

lib/CodeGen/TargetLoweringBase.cpp
746 ↗	(On Diff #189142)	The reason we're checking for isSimple() here is precisely so that we can perform a lookup in the TransformToType array. If we need a couple extra lines of code to handle TypeWidenVector, we should just add them (and maybe refactor the code so it doesn't repeat itself so much).

V2: Moved ARM and AMDGPU changes out to their own commits.

Harbormaster completed remote builds in B28773: Diff 189217. Mar 4 2019, 3:17 PM

tpr marked 4 inline comments as done. Mar 4 2019, 3:24 PM

tpr added inline comments.

lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
343 ↗	(On Diff #189142)	I have separated out the amdgpu changes into another commit, but: In adding v5 cost model tests, I found that (a) this change here is unnecessary, and (b) I had a bug in one of my later changes (the one that makes v5 legal on amdgpu) that made the test fail. So I have added the tests.

V3: Addressed review comment by widening illegal odd vectors in a
different way.

Harbormaster completed remote builds in B28852: Diff 189577. Mar 6 2019, 1:33 PM

tpr marked an inline comment as done. Mar 6 2019, 1:35 PM

LGTM

This revision is now accepted and ready to land. Mar 6 2019, 3:49 PM

Closed by commit rL356350: [CodeGen] Prepare for introduction of v3 and v5 MVTs (authored by tpr). Mar 17 2019, 2:42 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

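Path	Size
llvm/trunk/include/llvm/CodeGen/SelectionDAG.h	3 lines
llvm/trunk/include/llvm/CodeGen/TargetLowering.h	3 lines
llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeTypes.h	1 line
llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp	30 lines
llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAG.cpp	9 lines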

Diff 191049

llvm/trunk/include/llvm/CodeGen/SelectionDAG.h

@@ ... @@ public:

   /// Split the node's operand with EXTRACT_SUBVECTOR and
   /// return the low/high part.
   std::pair<SDValue, SDValue> SplitVectorOperand(const SDNode *N, unsigned OpNo)
   {
     return SplitVector(N->getOperand(OpNo), SDLoc(N));
   }

+  /// Widen the vector up to the next power of two using INSERT_SUBVECTOR.
+  SDValue WidenVector(const SDValue &N, const SDLoc &DL);
+
   /// Append the extracted elements from Start to Count out of the vector Op
   /// in Args. If Count is 0, all of the elements will be extracted.
   void ExtractVectorElements(SDValue Op, SmallVectorImpl<SDValue> &Args,
                              unsigned Start = 0, unsigned Count = 0);

   /// Compute the default alignment value for the given type.
   unsigned getEVTAlignment(EVT MemoryVT) const;

llvm/trunk/include/llvm/CodeGen/TargetLowering.h

@@ ... @@ public:
   bool hasExtractBitsInsn() const { return HasExtractBitsInsn; }

   /// Return the preferred vector type legalization action.
   virtual TargetLoweringBase::LegalizeTypeAction
   getPreferredVectorAction(MVT VT) const {
     // The default action for one element vectors is to scalarize
     if (VT.getVectorNumElements() == 1)
       return TypeScalarizeVector;
+    // The default action for an odd-width vector is to widen.
+    if (!VT.isPow2VectorType())
+      return TypeWidenVector;
     // The default action for other vectors is to promote
     return TypePromoteInteger;
   }

   // There are two general methods for expanding a BUILD_VECTOR node:
   //  1. Use SCALAR_TO_VECTOR on the defined scalar values and then shuffle
   //     them together.
   //  2. Build the vector on the stack and then load it.
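
The order of the checks in the patched default can be modeled on plain element counts. This is a minimal sketch, not the LLVM implementation: the `Action` enum and `preferredVectorAction` are hypothetical stand-ins that take an element count instead of an `MVT`.

```cpp
#include <cassert>

// Simplified stand-in for the legalization actions named in the hunk above.
enum class Action { ScalarizeVector, WidenVector, PromoteInteger };

// True when N is a non-zero power of two.
constexpr bool isPow2(unsigned N) { return N != 0 && (N & (N - 1)) == 0; }

// Same order of checks as the patched default: one-element vectors are
// scalarized, non-power-of-2 vectors are widened, everything else is promoted.
inline Action preferredVectorAction(unsigned NumElts) {
  assert(NumElts > 0 && "a vector needs at least one element");
  if (NumElts == 1)
    return Action::ScalarizeVector;
  if (!isPow2(NumElts))
    return Action::WidenVector;
  return Action::PromoteInteger;
}
```

With this model, 3- and 5-element vectors pick the widen action, while 2-, 4- and 8-element vectors still fall through to promotion as before the patch.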

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeTypes.h

@@ ... @@ private:
   SDValue WidenVecOp_EXTEND(SDNode *N);
   SDValue WidenVecOp_EXTRACT_VECTOR_ELT(SDNode *N);
   SDValue WidenVecOp_EXTRACT_SUBVECTOR(SDNode *N);
   SDValue WidenVecOp_STORE(SDNode* N);
   SDValue WidenVecOp_MSTORE(SDNode* N, unsigned OpNo);
   SDValue WidenVecOp_MGATHER(SDNode* N, unsigned OpNo);
   SDValue WidenVecOp_MSCATTER(SDNode* N, unsigned OpNo);
   SDValue WidenVecOp_SETCC(SDNode* N);
+  SDValue WidenVecOp_VSELECT(SDNode *N);

   SDValue WidenVecOp_Convert(SDNode *N);
   SDValue WidenVecOp_FCOPYSIGN(SDNode *N);
   SDValue WidenVecOp_VECREDUCE(SDNode *N);

   //===--------------------------------------------------------------------===//
   // Vector Widening Utilities Support: LegalizeVectorTypes.cpp
   //===--------------------------------------------------------------------===//

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

@@ ... @@ SDValue DAGTypeLegalizer::WidenVecRes_SETCC(SDNode *N) {
   // we'd prefer to widen the result type, the input operands have been split.
   // In this case, we also need to split the result of this node as well.
   if (getTypeAction(InVT) == TargetLowering::TypeSplitVector) {
     SDValue SplitVSetCC = SplitVecOp_VSETCC(N);
     SDValue Res = ModifyToType(SplitVSetCC, WidenVT);
     return Res;
   }

+  // If the inputs also widen, handle them directly. Otherwise widen by hand.
+  SDValue InOp2 = N->getOperand(1);
+  if (getTypeAction(InVT) == TargetLowering::TypeWidenVector) {
     InOp1 = GetWidenedVector(InOp1);
-  SDValue InOp2 = GetWidenedVector(N->getOperand(1));
+    InOp2 = GetWidenedVector(InOp2);
+  } else {
+    InOp1 = DAG.WidenVector(InOp1, SDLoc(N));
+    InOp2 = DAG.WidenVector(InOp2, SDLoc(N));
+  }

   // Assume that the input and output will be widen appropriately. If not,
   // we will have to unroll it at some point.
   assert(InOp1.getValueType() == WidenInVT &&
          InOp2.getValueType() == WidenInVT &&
          "Input not widened to expected type!");
   (void)WidenInVT;
   return DAG.getNode(ISD::SETCC, SDLoc(N),
@@ ... @@ #endif
   case ISD::CONCAT_VECTORS: Res = WidenVecOp_CONCAT_VECTORS(N); break;
   case ISD::EXTRACT_SUBVECTOR: Res = WidenVecOp_EXTRACT_SUBVECTOR(N); break;
   case ISD::EXTRACT_VECTOR_ELT: Res = WidenVecOp_EXTRACT_VECTOR_ELT(N); break;
   case ISD::STORE: Res = WidenVecOp_STORE(N); break;
   case ISD::MSTORE: Res = WidenVecOp_MSTORE(N, OpNo); break;
   case ISD::MGATHER: Res = WidenVecOp_MGATHER(N, OpNo); break;
   case ISD::MSCATTER: Res = WidenVecOp_MSCATTER(N, OpNo); break;
   case ISD::SETCC: Res = WidenVecOp_SETCC(N); break;
+  case ISD::VSELECT: Res = WidenVecOp_VSELECT(N); break;
   case ISD::FCOPYSIGN: Res = WidenVecOp_FCOPYSIGN(N); break;

   case ISD::ANY_EXTEND:
   case ISD::SIGN_EXTEND:
   case ISD::ZERO_EXTEND:
     Res = WidenVecOp_EXTEND(N);
     break;

@@ ... @@ SDValue DAGTypeLegalizer::WidenVecOp_VECREDUCE(SDNode *N) {
   unsigned WideElts = WideVT.getVectorNumElements();
   for (unsigned Idx = OrigElts; Idx < WideElts; Idx++)
     Op = DAG.getNode(ISD::INSERT_VECTOR_ELT, dl, WideVT, Op, NeutralElem,
         DAG.getConstant(Idx, dl, TLI.getVectorIdxTy(DAG.getDataLayout())));

   return DAG.getNode(N->getOpcode(), dl, N->getValueType(0), Op, N->getFlags());
 }

+SDValue DAGTypeLegalizer::WidenVecOp_VSELECT(SDNode *N) {
+  // This only gets called in the case that the left and right inputs and
+  // result are of a legal odd vector type, and the condition is illegal i1 of
+  // the same odd width that needs widening.
+  EVT VT = N->getValueType(0);
+  assert(VT.isVector() && !VT.isPow2VectorType() && isTypeLegal(VT));
+
+  SDValue Cond = GetWidenedVector(N->getOperand(0));
+  SDValue LeftIn = DAG.WidenVector(N->getOperand(1), SDLoc(N));
+  SDValue RightIn = DAG.WidenVector(N->getOperand(2), SDLoc(N));
+  SDLoc DL(N);
+
+  SDValue Select = DAG.getNode(N->getOpcode(), DL, LeftIn.getValueType(), Cond,
+                               LeftIn, RightIn);
+  return DAG.getNode(
+      ISD::EXTRACT_SUBVECTOR, DL, VT, Select,
+      DAG.getConstant(0, DL, TLI.getVectorIdxTy(DAG.getDataLayout())));
+}
+
 //===----------------------------------------------------------------------===//
 // Vector Widening Utilities
 //===----------------------------------------------------------------------===//

 // Utility function to find the type to chop up a widen vector for load/store
 //  TLI:       Target lowering used to determine legal types.
 //  Width:     Width left need to load/store.
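
The widen, select, extract sequence in WidenVecOp_VSELECT can be illustrated on ordinary containers. This is a rough model under stated assumptions, not SelectionDAG code: `padTo` and `widenedSelect` are hypothetical helpers, and the padding lanes (undef in the real DAG) are filled with arbitrary values because they are discarded at the end.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Pad V up to WideElts lanes. The fill value stands in for undef lanes; it
// never reaches the result.
template <typename T>
std::vector<T> padTo(std::vector<T> V, std::size_t WideElts, T Fill) {
  assert(WideElts >= V.size() && "cannot widen to a narrower vector");
  V.resize(WideElts, Fill);
  return V;
}

// Widen all three operands, select lane by lane at the wide width, then keep
// only the first OrigElts lanes (the role EXTRACT_SUBVECTOR at index 0 plays
// in the patch).
std::vector<int> widenedSelect(const std::vector<bool> &Cond,
                               const std::vector<int> &Left,
                               const std::vector<int> &Right,
                               std::size_t WideElts) {
  const std::size_t OrigElts = Left.size();
  assert(Cond.size() == OrigElts && Right.size() == OrigElts);

  std::vector<bool> WideCond = padTo(Cond, WideElts, false);
  std::vector<int> WideLeft = padTo(Left, WideElts, 0);
  std::vector<int> WideRight = padTo(Right, WideElts, 0);

  std::vector<int> Wide(WideElts);
  for (std::size_t I = 0; I < WideElts; ++I)
    Wide[I] = WideCond[I] ? WideLeft[I] : WideRight[I];

  Wide.resize(OrigElts); // drop the padding lanes again
  return Wide;
}
```

For a 3-lane select widened to 4 lanes, the fourth lane is computed and then thrown away, so the result matches a direct 3-lane select.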

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAG.cpp

@@ ... @@ SelectionDAG::SplitVector(const SDValue &N, const SDLoc &DL, const EVT &LoVT,
   Lo = getNode(ISD::EXTRACT_SUBVECTOR, DL, LoVT, N,
                getConstant(0, DL, TLI->getVectorIdxTy(getDataLayout())));
   Hi = getNode(ISD::EXTRACT_SUBVECTOR, DL, HiVT, N,
                getConstant(LoVT.getVectorNumElements(), DL,
                            TLI->getVectorIdxTy(getDataLayout())));
   return std::make_pair(Lo, Hi);
 }

+/// Widen the vector up to the next power of two using INSERT_SUBVECTOR.
+SDValue SelectionDAG::WidenVector(const SDValue &N, const SDLoc &DL) {
+  EVT VT = N.getValueType();
+  EVT WideVT = EVT::getVectorVT(*getContext(), VT.getVectorElementType(),
+                                NextPowerOf2(VT.getVectorNumElements()));
+  return getNode(ISD::INSERT_SUBVECTOR, DL, WideVT, getUNDEF(WideVT), N,
+                 getConstant(0, DL, TLI->getVectorIdxTy(getDataLayout())));
+}
+
 void SelectionDAG::ExtractVectorElements(SDValue Op,
                                          SmallVectorImpl<SDValue> &Args,
                                          unsigned Start, unsigned Count) {
   EVT VT = Op.getValueType();
   if (Count == 0)
     Count = VT.getVectorNumElements();

   EVT EltVT = VT.getVectorElementType();
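
The effect of WidenVector on element counts can be sketched outside the DAG. The code below is an illustration only: `nextPowerOf2`, `Lane` and `widenVector` are hypothetical names, and the helper assumes the "next power of two" is strictly greater than its argument (so 3 becomes 4 and 5 becomes 8).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Smallest power of two strictly greater than A, computed by smearing the
// highest set bit into all lower positions and adding one.
inline std::uint64_t nextPowerOf2(std::uint64_t A) {
  A |= A >> 1;
  A |= A >> 2;
  A |= A >> 4;
  A |= A >> 8;
  A |= A >> 16;
  A |= A >> 32;
  return A + 1;
}

// One vector lane; Defined == false models an undef lane.
struct Lane {
  bool Defined = false;
  int Value = 0;
};

// Model of inserting V at index 0 of an all-undef vector of the widened size.
std::vector<Lane> widenVector(const std::vector<int> &V) {
  assert(!V.empty() && "expected a non-empty vector");
  std::vector<Lane> Wide(static_cast<std::size_t>(nextPowerOf2(V.size())));
  for (std::size_t I = 0; I < V.size(); ++I) {
    Wide[I].Defined = true;
    Wide[I].Value = V[I];
  }
  return Wide;
}
```

In this model a 3-element input yields 4 lanes with the last one undefined, and a 5-element input yields 8 lanes with the last three undefined.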