This is an archive of the discontinued LLVM Phabricator instance.

Is this intrinsic really needed for the fixed point support?

Maybe I've missed something in the discussions about support for unsigned fixed point types, but in our implementation the unsigned types are using the positive range of the signed type, such as [0, SIGNED_MAX]. So we keep the sign bit zero for the unsigned fixed point types. Are you planning for something different here?

In our implementation we lower addition of two saturated unsigned fixed point types to sadd.sat (if the input values are in the range [0, SIGNED_MAX] the result will be in the range [0, SIGNED_MAX] as well). So we haven't really found a need for implementing uadd.sat.
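The range argument above can be checked with a small model. This is a minimal sketch, not code from the patch: `sadd_sat8` and `padded_unsigned_add_stays_in_range` are hypothetical names, and an 8-bit width is assumed only so the check can be exhaustive.

```cpp
#include <algorithm>
#include <cstdint>

// Signed saturating add on 8 bits, computed in a wider type so the true sum
// is available before clamping to [-128, 127].
inline std::int8_t sadd_sat8(std::int8_t a, std::int8_t b) {
    int wide = static_cast<int>(a) + static_cast<int>(b);
    return static_cast<std::int8_t>(std::max(-128, std::min(wide, 127)));
}

// Exhaustively checks that for operands with the sign bit clear, i.e. in
// [0, SIGNED_MAX], the signed saturating add equals min(a + b, SIGNED_MAX).
inline bool padded_unsigned_add_stays_in_range() {
    for (int a = 0; a <= 127; ++a) {
        for (int b = 0; b <= 127; ++b) {
            int r = sadd_sat8(static_cast<std::int8_t>(a),
                              static_cast<std::int8_t>(b));
            if (r != std::min(a + b, 127)) return false;
        }
    }
    return true;
}
```

Under that assumption the result never leaves [0, SIGNED_MAX], which is why a layout that keeps the sign bit zero can reuse the signed operation.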

LGTM!

In D53340#1267393, @bjope wrote:

In our implementation we lower addition of two saturated unsigned fixed point types to sadd.sat (if the input values are in the range [0, SIGNED_MAX] the result will be in the range [0, SIGNED_MAX] as well). So we haven't really found a need for implementing uadd.sat.

Our implementation is one with the padding bit on unsigned types, so we don't need a uadd.sat. The same applies to fixumul (which we also don't have). As you say, unsigned operations can be implemented in terms of the signed ones in that kind of implementation.

bjope added inline comments. Oct 17 2018, 1:56 AM

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
Line 4685: Sum >= 0. Also, these comments should probably be moved inside the if clause below (as they apply for SADDSAT). For UADDSAT it is enough to check if we got overflow.
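The distinction in this inline comment can be sketched with a scalar model of the expansion. This is an illustration under stated assumptions, not the DAG code from the patch: `uaddo` and `saddo` are hypothetical stand-ins for the wrapped-sum-plus-flag results of ISD::UADDO and ISD::SADDO, and a 32-bit width is assumed.

```cpp
#include <cstdint>
#include <limits>

struct UAddO { std::uint32_t sum; bool overflow; };
struct SAddO { std::int32_t sum; bool overflow; };

// Wrapped unsigned sum plus carry-out flag.
inline UAddO uaddo(std::uint32_t a, std::uint32_t b) {
    std::uint32_t s = static_cast<std::uint32_t>(a + b);  // wraps modulo 2^32
    return {s, s < a};
}

// Wrapped signed sum plus overflow flag, computed in 64 bits to stay defined.
inline SAddO saddo(std::int32_t a, std::int32_t b) {
    constexpr std::int64_t Min = std::numeric_limits<std::int32_t>::min();
    constexpr std::int64_t Max = std::numeric_limits<std::int32_t>::max();
    constexpr std::int64_t Modulus = std::int64_t{1} << 32;
    std::int64_t wide = static_cast<std::int64_t>(a) + b;
    bool overflow = wide < Min || wide > Max;
    if (wide > Max) wide -= Modulus;        // wrap back into range
    else if (wide < Min) wide += Modulus;
    return {static_cast<std::int32_t>(wide), overflow};
}

// Unsigned: the overflow flag alone decides, and the only bound is the maximum.
inline std::uint32_t uadd_sat_expanded(std::uint32_t a, std::uint32_t b) {
    UAddO r = uaddo(a, b);
    return r.overflow ? std::numeric_limits<std::uint32_t>::max() : r.sum;
}

// Signed: SatMax when Overflow && Sum < 0, SatMin when Overflow && Sum >= 0.
inline std::int32_t sadd_sat_expanded(std::int32_t a, std::int32_t b) {
    SAddO r = saddo(a, b);
    if (!r.overflow) return r.sum;
    return r.sum < 0 ? std::numeric_limits<std::int32_t>::max()
                     : std::numeric_limits<std::int32_t>::min();
}
```

The `>= 0` matters because adding the minimum value to itself wraps to exactly 0 while still overflowing, so a strict `> 0` test would miss that case.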

nhaehnle removed a subscriber: nhaehnle. Oct 17 2018, 3:13 AM

In D53340#1267393, @bjope wrote:

Is this intrinsic really needed for the fixed point support?

Maybe I've missed something in the discussions about support for unsigned fixed point types, but in our implementation the unsigned types are using the positive range of the signed type, such as [0, SIGNED_MAX]. So we keep the sign bit zero for the unsigned fixed point types. Are you planning for something different here?

In our implementation we lower addition of two saturated unsigned fixed point types to sadd.sat (if the input values are in the range [0, SIGNED_MAX] the result will be in the range [0, SIGNED_MAX] as well). So we haven't really found a need for implementing uadd.sat.

By default, unsigned fixed point types in clang use the entire width of the int, so they range over [0, UNSIGNED_MAX]. In this case I think it would be useful to have an unsigned version as well. This could also be used for regular unsigned integers, which do not have a padding bit, and I believe @craig.topper was interested in this so they could be replaced with target specific calls to x86 intrinsics like PADDUSB and PADDUSW.
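A small sketch of why the full-range layout needs its own operation, assuming 8-bit values; `uadd_sat8` and `sadd_sat8_on_bits` are hypothetical helper names, not part of the patch. The second helper shows what a signed saturating add would produce if applied to the same bit patterns.

```cpp
#include <algorithm>
#include <cstdint>

// Unsigned saturating add over the full range [0, 255].
inline std::uint8_t uadd_sat8(std::uint8_t a, std::uint8_t b) {
    int wide = static_cast<int>(a) + static_cast<int>(b);
    return static_cast<std::uint8_t>(std::min(wide, 255));
}

// Signed saturating add applied to the same 8-bit patterns, with the
// two's-complement reinterpretation done arithmetically.
inline std::uint8_t sadd_sat8_on_bits(std::uint8_t a, std::uint8_t b) {
    int sa = a < 128 ? a : a - 256;
    int sb = b < 128 ? b : b - 256;
    int clamped = std::max(-128, std::min(sa + sb, 127));
    return static_cast<std::uint8_t>(clamped < 0 ? clamped + 256 : clamped);
}
```

For 200 + 100 the unsigned version saturates to 255, while the signed version sees -56 + 100 and returns 44, so the two only agree while both operands keep the top bit clear.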

In D53340#1267851, @leonardchan wrote:

By default, unsigned fixed point types in clang use the entire width of the int, so they range over [0, UNSIGNED_MAX]. In this case I think it would be useful to have an unsigned version as well. This could also be used for regular unsigned integers, which do not have a padding bit, and I believe @craig.topper was interested in this so they could be replaced with target specific calls to x86 intrinsics like PADDUSB and PADDUSW.

Ok, I see. That explains the need for it.

I haven't really reviewed the test case in detail (I'm not that familiar with the X86 vector instructions), but this LGTM.

@craig.topper Any more comments on this patch?

LGTM.

This revision is now accepted and ready to land. Oct 22 2018, 3:36 PM

Closed by commit rL344971: [Intrinsic] Unigned Saturation Addition Intrinsic (authored by leonardchan). Oct 22 2018, 4:11 PM

This revision was automatically updated to reflect the committed changes.

spatel mentioned this in D96904: [IR] restrict vector reduction intrinsic types. Feb 19 2021, 9:23 AM

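Revision Contents

Path	Size
llvm/include/llvm/CodeGen/ISDOpcodes.h	8 lines
llvm/include/llvm/CodeGen/TargetLowering.h	7 lines
llvm/include/llvm/IR/Intrinsics.td	3 lines
llvm/include/llvm/Target/TargetSelectionDAG.td	1 line
llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp	8 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp	27 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h	4 lines
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp	1 line
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp	2 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp	6 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp	1 line
llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp	39 lines
llvm/lib/CodeGen/SelectionDAG/TargetLoweringBase.cpp	1 line
llvm/lib/IR/Verifier.cpp	13 lines
llvm/test/CodeGen/X86/uadd_sat.ll	157 lines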

Diff 169891

llvm/include/llvm/CodeGen/ISDOpcodes.h

[...] enum NodeType {
SADDO, UADDO,		SADDO, UADDO,

/// Same for subtraction.		/// Same for subtraction.
SSUBO, USUBO,		SSUBO, USUBO,

/// Same for multiplication.		/// Same for multiplication.
SMULO, UMULO,		SMULO, UMULO,

/// RESULT = SADDSAT(LHS, RHS) - Perform signed saturation addition on 2		/// RESULT = [US]ADDSAT(LHS, RHS) - Perform saturation addition on 2
/// integers with the same bit width (W). If the true value of LHS + RHS		/// integers with the same bit width (W). If the true value of LHS + RHS
/// exceeds the largest signed value that can be represented by W bits, the		/// exceeds the largest value that can be represented by W bits, the
/// resulting value is this maximum value. Otherwise, if this value is less		/// resulting value is this maximum value. Otherwise, if this value is less
/// than the smallest signed value that can be represented by W bits, the		/// than the smallest value that can be represented by W bits, the
/// resulting value is this minimum value.		/// resulting value is this minimum value.
SADDSAT,		SADDSAT, UADDSAT,

/// Simple binary floating point operators.		/// Simple binary floating point operators.
FADD, FSUB, FMUL, FDIV, FREM,		FADD, FSUB, FMUL, FDIV, FREM,

/// Constrained versions of the binary floating point operators.		/// Constrained versions of the binary floating point operators.
/// These will be lowered to the simple operators before final selection.		/// These will be lowered to the simple operators before final selection.
/// They are used to limit optimizations while the DAG is being		/// They are used to limit optimizations while the DAG is being
/// optimized.		/// optimized.
[...]

llvm/include/llvm/CodeGen/TargetLowering.h

[...] public:

/// Get a pointer to vector element \p Idx located in memory for a vector of		/// Get a pointer to vector element \p Idx located in memory for a vector of
/// type \p VecVT starting at a base address of \p VecPtr. If \p Idx is out of		/// type \p VecVT starting at a base address of \p VecPtr. If \p Idx is out of
/// bounds the returned pointer is unspecified, but will be within the vector		/// bounds the returned pointer is unspecified, but will be within the vector
/// bounds.		/// bounds.
SDValue getVectorElementPointer(SelectionDAG &DAG, SDValue VecPtr, EVT VecVT,		SDValue getVectorElementPointer(SelectionDAG &DAG, SDValue VecPtr, EVT VecVT,
SDValue Index) const;		SDValue Index) const;

/// Method for building the DAG expansion of ISD::SADDSAT. This method accepts		/// Method for building the DAG expansion of ISD::[US]ADDSAT. This method
/// integers or vectors of integers as its arguments.		/// accepts integers or vectors of integers as its arguments.
SDValue getExpandedSignedSaturationAddition(SDNode *Node,		SDValue getExpandedSaturationAddition(SDNode *Node, SelectionDAG &DAG) const;
SelectionDAG &DAG) const;

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Instruction Emitting Hooks		// Instruction Emitting Hooks
//		//

/// This method should be implemented by targets that mark instructions with		/// This method should be implemented by targets that mark instructions with
/// the 'usesCustomInserter' flag. These instructions are special in various		/// the 'usesCustomInserter' flag. These instructions are special in various
/// ways, which require special support to insert. The specified MachineInstr		/// ways, which require special support to insert. The specified MachineInstr
[...]

llvm/include/llvm/IR/Intrinsics.td

[...] def int_umul_with_overflow : Intrinsic<[llvm_anyint_ty, llvm_i1_ty],
[LLVMMatchType<0>, LLVMMatchType<0>],		[LLVMMatchType<0>, LLVMMatchType<0>],
[IntrNoMem, IntrSpeculatable]>;		[IntrNoMem, IntrSpeculatable]>;

//===------------------------- Fixed Point Intrinsics ---------------------===//		//===------------------------- Fixed Point Intrinsics ---------------------===//
//		//
def int_sadd_sat : Intrinsic<[llvm_anyint_ty],		def int_sadd_sat : Intrinsic<[llvm_anyint_ty],
[LLVMMatchType<0>, LLVMMatchType<0>],		[LLVMMatchType<0>, LLVMMatchType<0>],
[IntrNoMem, IntrSpeculatable, Commutative]>;		[IntrNoMem, IntrSpeculatable, Commutative]>;
		def int_uadd_sat : Intrinsic<[llvm_anyint_ty],
		[LLVMMatchType<0>, LLVMMatchType<0>],
		[IntrNoMem, IntrSpeculatable, Commutative]>;

//===------------------------- Memory Use Markers -------------------------===//		//===------------------------- Memory Use Markers -------------------------===//
//		//
def int_lifetime_start : Intrinsic<[],		def int_lifetime_start : Intrinsic<[],
[llvm_i64_ty, llvm_anyptr_ty],		[llvm_i64_ty, llvm_anyptr_ty],
[IntrArgMemOnly, NoCapture<1>]>;		[IntrArgMemOnly, NoCapture<1>]>;
def int_lifetime_end : Intrinsic<[],		def int_lifetime_end : Intrinsic<[],
[llvm_i64_ty, llvm_anyptr_ty],		[llvm_i64_ty, llvm_anyptr_ty],
[...]

llvm/include/llvm/Target/TargetSelectionDAG.td

[...]
	def smax : SDNode<"ISD::SMAX" , SDTIntBinOp,			def smax : SDNode<"ISD::SMAX" , SDTIntBinOp,
	[SDNPCommutative, SDNPAssociative]>;			[SDNPCommutative, SDNPAssociative]>;
	def umin : SDNode<"ISD::UMIN" , SDTIntBinOp,			def umin : SDNode<"ISD::UMIN" , SDTIntBinOp,
	[SDNPCommutative, SDNPAssociative]>;			[SDNPCommutative, SDNPAssociative]>;
	def umax : SDNode<"ISD::UMAX" , SDTIntBinOp,			def umax : SDNode<"ISD::UMAX" , SDTIntBinOp,
	[SDNPCommutative, SDNPAssociative]>;			[SDNPCommutative, SDNPAssociative]>;

	def saddsat : SDNode<"ISD::SADDSAT" , SDTIntBinOp, [SDNPCommutative]>;			def saddsat : SDNode<"ISD::SADDSAT" , SDTIntBinOp, [SDNPCommutative]>;
				def uaddsat : SDNode<"ISD::UADDSAT" , SDTIntBinOp, [SDNPCommutative]>;

	def sext_inreg : SDNode<"ISD::SIGN_EXTEND_INREG", SDTExtInreg>;			def sext_inreg : SDNode<"ISD::SIGN_EXTEND_INREG", SDTExtInreg>;
	def sext_invec : SDNode<"ISD::SIGN_EXTEND_VECTOR_INREG", SDTExtInvec>;			def sext_invec : SDNode<"ISD::SIGN_EXTEND_VECTOR_INREG", SDTExtInvec>;
	def zext_invec : SDNode<"ISD::ZERO_EXTEND_VECTOR_INREG", SDTExtInvec>;			def zext_invec : SDNode<"ISD::ZERO_EXTEND_VECTOR_INREG", SDTExtInvec>;

	def abs : SDNode<"ISD::ABS" , SDTIntUnaryOp>;			def abs : SDNode<"ISD::ABS" , SDTIntUnaryOp>;
	def bitreverse : SDNode<"ISD::BITREVERSE" , SDTIntUnaryOp>;			def bitreverse : SDNode<"ISD::BITREVERSE" , SDTIntUnaryOp>;
	def bswap : SDNode<"ISD::BSWAP" , SDTIntUnaryOp>;			def bswap : SDNode<"ISD::BSWAP" , SDTIntUnaryOp>;
[...]

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

[...]
case ISD::STRICT_FNEARBYINT:		case ISD::STRICT_FNEARBYINT:
// These pseudo-ops get legalized as if they were their non-strict		// These pseudo-ops get legalized as if they were their non-strict
// equivalent. For instance, if ISD::FSQRT is legal then ISD::STRICT_FSQRT		// equivalent. For instance, if ISD::FSQRT is legal then ISD::STRICT_FSQRT
// is also legal, but if ISD::FSQRT requires expansion then so does		// is also legal, but if ISD::FSQRT requires expansion then so does
// ISD::STRICT_FSQRT.		// ISD::STRICT_FSQRT.
Action = TLI.getStrictFPOperationAction(Node->getOpcode(),		Action = TLI.getStrictFPOperationAction(Node->getOpcode(),
Node->getValueType(0));		Node->getValueType(0));
break;		break;
case ISD::SADDSAT: {		case ISD::SADDSAT:
		case ISD::UADDSAT: {
Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));		Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));
break;		break;
}		}
case ISD::MSCATTER:		case ISD::MSCATTER:
Action = TLI.getOperationAction(Node->getOpcode(),		Action = TLI.getOperationAction(Node->getOpcode(),
cast<MaskedScatterSDNode>(Node)->getValue().getValueType());		cast<MaskedScatterSDNode>(Node)->getValue().getValueType());
break;		break;
case ISD::MSTORE:		case ISD::MSTORE:
[...] if (TLI.isOperationLegalOrCustom(ISD::ZERO_EXTEND, VT) &&
SDValue Shift =		SDValue Shift =
DAG.getConstant(HalfType.getSizeInBits(), dl,		DAG.getConstant(HalfType.getSizeInBits(), dl,
TLI.getShiftAmountTy(HalfType, DAG.getDataLayout()));		TLI.getShiftAmountTy(HalfType, DAG.getDataLayout()));
Hi = DAG.getNode(ISD::SHL, dl, VT, Hi, Shift);		Hi = DAG.getNode(ISD::SHL, dl, VT, Hi, Shift);
Results.push_back(DAG.getNode(ISD::OR, dl, VT, Lo, Hi));		Results.push_back(DAG.getNode(ISD::OR, dl, VT, Lo, Hi));
}		}
break;		break;
}		}
case ISD::SADDSAT: {		case ISD::SADDSAT:
Results.push_back(TLI.getExpandedSignedSaturationAddition(Node, DAG));		case ISD::UADDSAT: {
		Results.push_back(TLI.getExpandedSaturationAddition(Node, DAG));
break;		break;
}		}
case ISD::SADDO:		case ISD::SADDO:
case ISD::SSUBO: {		case ISD::SSUBO: {
SDValue LHS = Node->getOperand(0);		SDValue LHS = Node->getOperand(0);
SDValue RHS = Node->getOperand(1);		SDValue RHS = Node->getOperand(1);
SDValue Sum = DAG.getNode(Node->getOpcode() == ISD::SADDO ?		SDValue Sum = DAG.getNode(Node->getOpcode() == ISD::SADDO ?
ISD::ADD : ISD::SUB, dl, LHS.getValueType(),		ISD::ADD : ISD::SUB, dl, LHS.getValueType(),
[...]

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

[...]
case ISD::UADDO:		case ISD::UADDO:
case ISD::USUBO: Res = PromoteIntRes_UADDSUBO(N, ResNo); break;		case ISD::USUBO: Res = PromoteIntRes_UADDSUBO(N, ResNo); break;
case ISD::SMULO:		case ISD::SMULO:
case ISD::UMULO: Res = PromoteIntRes_XMULO(N, ResNo); break;		case ISD::UMULO: Res = PromoteIntRes_XMULO(N, ResNo); break;

case ISD::ADDCARRY:		case ISD::ADDCARRY:
case ISD::SUBCARRY: Res = PromoteIntRes_ADDSUBCARRY(N, ResNo); break;		case ISD::SUBCARRY: Res = PromoteIntRes_ADDSUBCARRY(N, ResNo); break;

case ISD::SADDSAT: Res = PromoteIntRes_SADDSAT(N); break;		case ISD::SADDSAT:
		case ISD::UADDSAT: Res = PromoteIntRes_ADDSAT(N); break;

case ISD::ATOMIC_LOAD:		case ISD::ATOMIC_LOAD:
Res = PromoteIntRes_Atomic0(cast<AtomicSDNode>(N)); break;		Res = PromoteIntRes_Atomic0(cast<AtomicSDNode>(N)); break;

case ISD::ATOMIC_LOAD_ADD:		case ISD::ATOMIC_LOAD_ADD:
case ISD::ATOMIC_LOAD_SUB:		case ISD::ATOMIC_LOAD_SUB:
case ISD::ATOMIC_LOAD_AND:		case ISD::ATOMIC_LOAD_AND:
case ISD::ATOMIC_LOAD_CLR:		case ISD::ATOMIC_LOAD_CLR:
[...] SDValue DAGTypeLegalizer::PromoteIntRes_Overflow(SDNode *N) {

// Modified the sum result - switch anything that used the old sum to use		// Modified the sum result - switch anything that used the old sum to use
// the new one.		// the new one.
ReplaceValueWith(SDValue(N, 0), Res);		ReplaceValueWith(SDValue(N, 0), Res);

return SDValue(Res.getNode(), 1);		return SDValue(Res.getNode(), 1);
}		}

SDValue DAGTypeLegalizer::PromoteIntRes_SADDSAT(SDNode *N) {		SDValue DAGTypeLegalizer::PromoteIntRes_ADDSAT(SDNode *N) {
// For promoting iN -> iM, this can be expanded by		// For promoting iN -> iM, this can be expanded by
// 1. ANY_EXTEND iN to iM		// 1. ANY_EXTEND iN to iM
// 2. SHL by M-N		// 2. SHL by M-N
// 3. SADDSAT		// 3. U/SADDSAT
// 4. ASHR by M-N		// 4. L/ASHR by M-N
SDLoc dl(N);		SDLoc dl(N);
SDValue Op1 = N->getOperand(0);		SDValue Op1 = N->getOperand(0);
SDValue Op2 = N->getOperand(1);		SDValue Op2 = N->getOperand(1);
unsigned OldBits = Op1.getValueSizeInBits();		unsigned OldBits = Op1.getValueSizeInBits();

		unsigned Opcode = N->getOpcode();
		assert((Opcode == ISD::SADDSAT || Opcode == ISD::UADDSAT) &&
		"Expected opcode to be SADDSAT or UADDSAT");
		unsigned ShiftOp = Opcode == ISD::SADDSAT ? ISD::SRA : ISD::SRL;

SDValue Op1Promoted = GetPromotedInteger(Op1);		SDValue Op1Promoted = GetPromotedInteger(Op1);
SDValue Op2Promoted = GetPromotedInteger(Op2);		SDValue Op2Promoted = GetPromotedInteger(Op2);

EVT PromotedType = Op1Promoted.getValueType();		EVT PromotedType = Op1Promoted.getValueType();
unsigned NewBits = Op1Promoted.getValueSizeInBits();		unsigned NewBits = Op1Promoted.getValueSizeInBits();
unsigned SHLAmount = NewBits - OldBits;		unsigned SHLAmount = NewBits - OldBits;
EVT SHVT = TLI.getShiftAmountTy(PromotedType, DAG.getDataLayout());		EVT SHVT = TLI.getShiftAmountTy(PromotedType, DAG.getDataLayout());
SDValue ShiftAmount = DAG.getConstant(SHLAmount, dl, SHVT);		SDValue ShiftAmount = DAG.getConstant(SHLAmount, dl, SHVT);
Op1Promoted =		Op1Promoted =
DAG.getNode(ISD::SHL, dl, PromotedType, Op1Promoted, ShiftAmount);		DAG.getNode(ISD::SHL, dl, PromotedType, Op1Promoted, ShiftAmount);
Op2Promoted =		Op2Promoted =
DAG.getNode(ISD::SHL, dl, PromotedType, Op2Promoted, ShiftAmount);		DAG.getNode(ISD::SHL, dl, PromotedType, Op2Promoted, ShiftAmount);

SDValue Result =		SDValue Result =
DAG.getNode(ISD::SADDSAT, dl, PromotedType, Op1Promoted, Op2Promoted);		DAG.getNode(Opcode, dl, PromotedType, Op1Promoted, Op2Promoted);
return DAG.getNode(ISD::SRA, dl, PromotedType, Result, ShiftAmount);		return DAG.getNode(ShiftOp, dl, PromotedType, Result, ShiftAmount);
}		}
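
The four promotion steps listed in the comment of this function can be modelled on plain integers. This is a hedged sketch for the unsigned case only, assuming an i8 operation promoted to i32; `uadd_sat32` and `uadd_sat8_via_promotion` are hypothetical names. Widening here zero-extends, whereas ANY_EXTEND leaves the upper bits unspecified; the left shift discards them either way.

```cpp
#include <cstdint>
#include <limits>

// 32-bit unsigned saturating add: a wrapped sum smaller than an operand
// means the true sum exceeded the maximum.
inline std::uint32_t uadd_sat32(std::uint32_t a, std::uint32_t b) {
    std::uint32_t s = static_cast<std::uint32_t>(a + b);
    return s < a ? std::numeric_limits<std::uint32_t>::max() : s;
}

// An 8-bit unsigned saturating add carried out in a 32-bit value.
inline std::uint8_t uadd_sat8_via_promotion(std::uint8_t a, std::uint8_t b) {
    constexpr unsigned Shift = 32 - 8;                // M - N
    std::uint32_t pa = std::uint32_t{a} << Shift;     // steps 1 and 2: extend, SHL
    std::uint32_t pb = std::uint32_t{b} << Shift;
    std::uint32_t r = uadd_sat32(pa, pb);             // step 3: wide saturating add
    return static_cast<std::uint8_t>(r >> Shift);     // step 4: logical shift right
}
```

Shifting the operands into the top bits makes the wide operation overflow exactly when the narrow one would. The signed variant follows the same shape with an arithmetic shift in the last step, matching the SRA/SRL selection in the function.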

SDValue DAGTypeLegalizer::PromoteIntRes_SADDSUBO(SDNode *N, unsigned ResNo) {		SDValue DAGTypeLegalizer::PromoteIntRes_SADDSUBO(SDNode *N, unsigned ResNo) {
if (ResNo == 1)		if (ResNo == 1)
return PromoteIntRes_Overflow(N);		return PromoteIntRes_Overflow(N);

// The operation overflowed iff the result in the larger type is not the		// The operation overflowed iff the result in the larger type is not the
// sign extension of its truncation to the original type.		// sign extension of its truncation to the original type.
[...]

case ISD::SADDO:		case ISD::SADDO:
case ISD::SSUBO: ExpandIntRes_SADDSUBO(N, Lo, Hi); break;		case ISD::SSUBO: ExpandIntRes_SADDSUBO(N, Lo, Hi); break;
case ISD::UADDO:		case ISD::UADDO:
case ISD::USUBO: ExpandIntRes_UADDSUBO(N, Lo, Hi); break;		case ISD::USUBO: ExpandIntRes_UADDSUBO(N, Lo, Hi); break;
case ISD::UMULO:		case ISD::UMULO:
case ISD::SMULO: ExpandIntRes_XMULO(N, Lo, Hi); break;		case ISD::SMULO: ExpandIntRes_XMULO(N, Lo, Hi); break;

case ISD::SADDSAT: ExpandIntRes_SADDSAT(N, Lo, Hi); break;		case ISD::SADDSAT:
		case ISD::UADDSAT: ExpandIntRes_ADDSAT(N, Lo, Hi); break;
}		}

// If Lo/Hi is null, the sub-method took care of registering results etc.		// If Lo/Hi is null, the sub-method took care of registering results etc.
if (Lo.getNode())		if (Lo.getNode())
SetExpandedInteger(SDValue(N, ResNo), Lo, Hi);		SetExpandedInteger(SDValue(N, ResNo), Lo, Hi);
}		}

/// Lower an atomic node to the appropriate builtin call.		/// Lower an atomic node to the appropriate builtin call.
[...] void DAGTypeLegalizer::ExpandIntRes_READCYCLECOUNTER(SDNode *N, SDValue &Lo,
EVT NVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));		EVT NVT = TLI.getTypeToTransformTo(*DAG.getContext(), N->getValueType(0));
SDVTList VTs = DAG.getVTList(NVT, NVT, MVT::Other);		SDVTList VTs = DAG.getVTList(NVT, NVT, MVT::Other);
SDValue R = DAG.getNode(N->getOpcode(), DL, VTs, N->getOperand(0));		SDValue R = DAG.getNode(N->getOpcode(), DL, VTs, N->getOperand(0));
Lo = R.getValue(0);		Lo = R.getValue(0);
Hi = R.getValue(1);		Hi = R.getValue(1);
ReplaceValueWith(SDValue(N, 1), R.getValue(2));		ReplaceValueWith(SDValue(N, 1), R.getValue(2));
}		}

void DAGTypeLegalizer::ExpandIntRes_SADDSAT(SDNode *N, SDValue &Lo,		void DAGTypeLegalizer::ExpandIntRes_ADDSAT(SDNode *N, SDValue &Lo,
SDValue &Hi) {		SDValue &Hi) {
SDValue Result = TLI.getExpandedSignedSaturationAddition(N, DAG);		SDValue Result = TLI.getExpandedSaturationAddition(N, DAG);
SplitInteger(Result, Lo, Hi);		SplitInteger(Result, Lo, Hi);
}		}

void DAGTypeLegalizer::ExpandIntRes_SADDSUBO(SDNode *Node,		void DAGTypeLegalizer::ExpandIntRes_SADDSUBO(SDNode *Node,
SDValue &Lo, SDValue &Hi) {		SDValue &Lo, SDValue &Hi) {
SDValue LHS = Node->getOperand(0);		SDValue LHS = Node->getOperand(0);
SDValue RHS = Node->getOperand(1);		SDValue RHS = Node->getOperand(1);
SDLoc dl(Node);		SDLoc dl(Node);
[...]

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

[...] private:
SDValue PromoteIntRes_SRA(SDNode *N);		SDValue PromoteIntRes_SRA(SDNode *N);
SDValue PromoteIntRes_SRL(SDNode *N);		SDValue PromoteIntRes_SRL(SDNode *N);
SDValue PromoteIntRes_TRUNCATE(SDNode *N);		SDValue PromoteIntRes_TRUNCATE(SDNode *N);
SDValue PromoteIntRes_UADDSUBO(SDNode *N, unsigned ResNo);		SDValue PromoteIntRes_UADDSUBO(SDNode *N, unsigned ResNo);
SDValue PromoteIntRes_ADDSUBCARRY(SDNode *N, unsigned ResNo);		SDValue PromoteIntRes_ADDSUBCARRY(SDNode *N, unsigned ResNo);
SDValue PromoteIntRes_UNDEF(SDNode *N);		SDValue PromoteIntRes_UNDEF(SDNode *N);
SDValue PromoteIntRes_VAARG(SDNode *N);		SDValue PromoteIntRes_VAARG(SDNode *N);
SDValue PromoteIntRes_XMULO(SDNode *N, unsigned ResNo);		SDValue PromoteIntRes_XMULO(SDNode *N, unsigned ResNo);
SDValue PromoteIntRes_SADDSAT(SDNode *N);		SDValue PromoteIntRes_ADDSAT(SDNode *N);

// Integer Operand Promotion.		// Integer Operand Promotion.
bool PromoteIntegerOperand(SDNode *N, unsigned OpNo);		bool PromoteIntegerOperand(SDNode *N, unsigned OpNo);
SDValue PromoteIntOp_ANY_EXTEND(SDNode *N);		SDValue PromoteIntOp_ANY_EXTEND(SDNode *N);
SDValue PromoteIntOp_ATOMIC_STORE(AtomicSDNode *N);		SDValue PromoteIntOp_ATOMIC_STORE(AtomicSDNode *N);
SDValue PromoteIntOp_BITCAST(SDNode *N);		SDValue PromoteIntOp_BITCAST(SDNode *N);
SDValue PromoteIntOp_BUILD_PAIR(SDNode *N);		SDValue PromoteIntOp_BUILD_PAIR(SDNode *N);
SDValue PromoteIntOp_BR_CC(SDNode *N, unsigned OpNo);		SDValue PromoteIntOp_BR_CC(SDNode *N, unsigned OpNo);
[...] private:
void ExpandIntRes_UREM (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_UREM (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_Shift (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_Shift (SDNode *N, SDValue &Lo, SDValue &Hi);

void ExpandIntRes_MINMAX (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_MINMAX (SDNode *N, SDValue &Lo, SDValue &Hi);

void ExpandIntRes_SADDSUBO (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_SADDSUBO (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_UADDSUBO (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_UADDSUBO (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_XMULO (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_XMULO (SDNode *N, SDValue &Lo, SDValue &Hi);
void ExpandIntRes_SADDSAT (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_ADDSAT (SDNode *N, SDValue &Lo, SDValue &Hi);

void ExpandIntRes_ATOMIC_LOAD (SDNode *N, SDValue &Lo, SDValue &Hi);		void ExpandIntRes_ATOMIC_LOAD (SDNode *N, SDValue &Lo, SDValue &Hi);

void ExpandShiftByConstant(SDNode *N, const APInt &Amt,		void ExpandShiftByConstant(SDNode *N, const APInt &Amt,
SDValue &Lo, SDValue &Hi);		SDValue &Lo, SDValue &Hi);
bool ExpandShiftWithKnownAmountBit(SDNode *N, SDValue &Lo, SDValue &Hi);		bool ExpandShiftWithKnownAmountBit(SDNode *N, SDValue &Lo, SDValue &Hi);
bool ExpandShiftWithUnknownAmountBit(SDNode *N, SDValue &Lo, SDValue &Hi);		bool ExpandShiftWithUnknownAmountBit(SDNode *N, SDValue &Lo, SDValue &Hi);

[...]

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

[...] SDValue VectorLegalizer::LegalizeOp(SDValue Op) {
case ISD::SMIN:		case ISD::SMIN:
case ISD::SMAX:		case ISD::SMAX:
case ISD::UMIN:		case ISD::UMIN:
case ISD::UMAX:		case ISD::UMAX:
case ISD::SMUL_LOHI:		case ISD::SMUL_LOHI:
case ISD::UMUL_LOHI:		case ISD::UMUL_LOHI:
case ISD::FCANONICALIZE:		case ISD::FCANONICALIZE:
case ISD::SADDSAT:		case ISD::SADDSAT:
		case ISD::UADDSAT:
Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));		Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));
break;		break;
case ISD::FP_ROUND_INREG:		case ISD::FP_ROUND_INREG:
Action = TLI.getOperationAction(Node->getOpcode(),		Action = TLI.getOperationAction(Node->getOpcode(),
cast<VTSDNode>(Node->getOperand(1))->getVT());		cast<VTSDNode>(Node->getOperand(1))->getVT());
break;		break;
case ISD::SINT_TO_FP:		case ISD::SINT_TO_FP:
case ISD::UINT_TO_FP:		case ISD::UINT_TO_FP:
[...]

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

[...]
case ISD::FMINNAN:		case ISD::FMINNAN:
case ISD::FMAXNAN:		case ISD::FMAXNAN:
case ISD::SMIN:		case ISD::SMIN:
case ISD::SMAX:		case ISD::SMAX:
case ISD::UMIN:		case ISD::UMIN:
case ISD::UMAX:		case ISD::UMAX:

case ISD::SADDSAT:		case ISD::SADDSAT:
		case ISD::UADDSAT:

case ISD::FPOW:		case ISD::FPOW:
case ISD::FREM:		case ISD::FREM:
case ISD::FSUB:		case ISD::FSUB:
case ISD::MUL:		case ISD::MUL:
case ISD::OR:		case ISD::OR:
case ISD::SDIV:		case ISD::SDIV:
case ISD::SREM:		case ISD::SREM:
[...]
case ISD::UREM:		case ISD::UREM:
case ISD::SREM:		case ISD::SREM:
case ISD::FREM:		case ISD::FREM:
case ISD::SMIN:		case ISD::SMIN:
case ISD::SMAX:		case ISD::SMAX:
case ISD::UMIN:		case ISD::UMIN:
case ISD::UMAX:		case ISD::UMAX:
case ISD::SADDSAT:		case ISD::SADDSAT:
		case ISD::UADDSAT:
SplitVecRes_BinOp(N, Lo, Hi);		SplitVecRes_BinOp(N, Lo, Hi);
break;		break;
case ISD::FMA:		case ISD::FMA:
SplitVecRes_TernaryOp(N, Lo, Hi);		SplitVecRes_TernaryOp(N, Lo, Hi);
break;		break;
case ISD::STRICT_FADD:		case ISD::STRICT_FADD:
case ISD::STRICT_FSUB:		case ISD::STRICT_FSUB:
case ISD::STRICT_FMUL:		case ISD::STRICT_FMUL:
[...]

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

[...] case Intrinsic::fshr: {
return nullptr;		return nullptr;
}		}
case Intrinsic::sadd_sat: {		case Intrinsic::sadd_sat: {
SDValue Op1 = getValue(I.getArgOperand(0));		SDValue Op1 = getValue(I.getArgOperand(0));
SDValue Op2 = getValue(I.getArgOperand(1));		SDValue Op2 = getValue(I.getArgOperand(1));
setValue(&I, DAG.getNode(ISD::SADDSAT, sdl, Op1.getValueType(), Op1, Op2));		setValue(&I, DAG.getNode(ISD::SADDSAT, sdl, Op1.getValueType(), Op1, Op2));
return nullptr;		return nullptr;
}		}
		case Intrinsic::uadd_sat: {
		SDValue Op1 = getValue(I.getArgOperand(0));
		SDValue Op2 = getValue(I.getArgOperand(1));
		setValue(&I, DAG.getNode(ISD::UADDSAT, sdl, Op1.getValueType(), Op1, Op2));
		return nullptr;
		}
case Intrinsic::stacksave: {		case Intrinsic::stacksave: {
SDValue Op = getRoot();		SDValue Op = getRoot();
Res = DAG.getNode(		Res = DAG.getNode(
ISD::STACKSAVE, sdl,		ISD::STACKSAVE, sdl,
DAG.getVTList(TLI.getPointerTy(DAG.getDataLayout()), MVT::Other), Op);		DAG.getVTList(TLI.getPointerTy(DAG.getDataLayout()), MVT::Other), Op);
setValue(&I, Res);		setValue(&I, Res);
DAG.setRoot(Res.getValue(1));		DAG.setRoot(Res.getValue(1));
return nullptr;		return nullptr;
[...]

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

[...]
case ISD::SUBC: return "subc";		case ISD::SUBC: return "subc";
case ISD::SUBE: return "sube";		case ISD::SUBE: return "sube";
case ISD::SUBCARRY: return "subcarry";		case ISD::SUBCARRY: return "subcarry";
case ISD::SHL_PARTS: return "shl_parts";		case ISD::SHL_PARTS: return "shl_parts";
case ISD::SRA_PARTS: return "sra_parts";		case ISD::SRA_PARTS: return "sra_parts";
case ISD::SRL_PARTS: return "srl_parts";		case ISD::SRL_PARTS: return "srl_parts";

case ISD::SADDSAT: return "saddsat";		case ISD::SADDSAT: return "saddsat";
		case ISD::UADDSAT: return "uaddsat";

// Conversion operators.		// Conversion operators.
case ISD::SIGN_EXTEND: return "sign_extend";		case ISD::SIGN_EXTEND: return "sign_extend";
case ISD::ZERO_EXTEND: return "zero_extend";		case ISD::ZERO_EXTEND: return "zero_extend";
case ISD::ANY_EXTEND: return "any_extend";		case ISD::ANY_EXTEND: return "any_extend";
case ISD::SIGN_EXTEND_INREG: return "sign_extend_inreg";		case ISD::SIGN_EXTEND_INREG: return "sign_extend_inreg";
case ISD::ANY_EXTEND_VECTOR_INREG: return "any_extend_vector_inreg";		case ISD::ANY_EXTEND_VECTOR_INREG: return "any_extend_vector_inreg";
case ISD::SIGN_EXTEND_VECTOR_INREG: return "sign_extend_vector_inreg";		case ISD::SIGN_EXTEND_VECTOR_INREG: return "sign_extend_vector_inreg";
[...]

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp

[...] if (C->isNullValue() && CC == ISD::SETEQ) {
SDValue Scc = DAG.getNode(ISD::SRL, dl, VT, Clz,		SDValue Scc = DAG.getNode(ISD::SRL, dl, VT, Clz,
DAG.getConstant(Log2b, dl, MVT::i32));		DAG.getConstant(Log2b, dl, MVT::i32));
return DAG.getNode(ISD::TRUNCATE, dl, MVT::i32, Scc);		return DAG.getNode(ISD::TRUNCATE, dl, MVT::i32, Scc);
}		}
}		}
return SDValue();		return SDValue();
}		}

SDValue		SDValue TargetLowering::getExpandedSaturationAddition(SDNode *Node,
TargetLowering::getExpandedSignedSaturationAddition(SDNode *Node,
SelectionDAG &DAG) const {		SelectionDAG &DAG) const {
assert(Node->getOpcode() == ISD::SADDSAT &&		unsigned Opcode = Node->getOpcode();
"Expected method to receive SADDSAT node.");		assert((Opcode == ISD::SADDSAT || Opcode == ISD::UADDSAT) &&
assert(Node->getNumOperands() == 2 &&		"Expected method to receive SADDSAT or UADDSAT node.");
"Expected SADDSAT node to have 2 operands.");		assert(Node->getNumOperands() == 2 && "Expected node to have 2 operands.");

SDLoc dl(Node);		SDLoc dl(Node);
SDValue LHS = Node->getOperand(0);		SDValue LHS = Node->getOperand(0);
SDValue RHS = Node->getOperand(1);		SDValue RHS = Node->getOperand(1);
assert(LHS.getValueType().isScalarInteger() &&		assert(LHS.getValueType().isScalarInteger() &&
"Expected operands to be integers. Vector of int arguments should "		"Expected operands to be integers. Vector of int arguments should "
"already be unrolled.");		"already be unrolled.");
assert(RHS.getValueType().isScalarInteger() &&		assert(RHS.getValueType().isScalarInteger() &&
"Expected operands to be integers. Vector of int arguments should "		"Expected operands to be integers. Vector of int arguments should "
"already be unrolled.");		"already be unrolled.");
assert(LHS.getValueType() == RHS.getValueType() &&		assert(LHS.getValueType() == RHS.getValueType() &&
"Expected both operands of SADDSAT to be the same type");		"Expected both operands to be the same type");

		unsigned OverflowOp = Opcode == ISD::SADDSAT ? ISD::SADDO : ISD::UADDO;
unsigned BitWidth = LHS.getValueSizeInBits();		unsigned BitWidth = LHS.getValueSizeInBits();
EVT ResultType = LHS.getValueType();		EVT ResultType = LHS.getValueType();
EVT BoolVT =		EVT BoolVT =
getSetCCResultType(DAG.getDataLayout(), *DAG.getContext(), ResultType);		getSetCCResultType(DAG.getDataLayout(), *DAG.getContext(), ResultType);
SDValue Result =		SDValue Result =
DAG.getNode(ISD::SADDO, dl, DAG.getVTList(ResultType, BoolVT), LHS, RHS);		DAG.getNode(OverflowOp, dl, DAG.getVTList(ResultType, BoolVT), LHS, RHS);
SDValue Sum = Result.getValue(0);		SDValue Sum = Result.getValue(0);
SDValue Overflow = Result.getValue(1);		SDValue Overflow = Result.getValue(1);

// SatMax -> Overflow && Sum < 0		// SatMax -> Overflow && Sum < 0
// SatMin -> Overflow && Sum > 0		// SatMin -> Overflow && Sum > 0
		bjopeUnsubmitted Done Reply Inline Actions Sum >= 0 Also, these comments should probably be moved inside the if clause below (as they apply for SADDSAT). For UADDSAT it is enough to check if we got overflow. bjope: Sum >= 0 Also, these comments should probably be moved inside the if clause below (as they…
SDValue Zero = DAG.getConstant(0, dl, LHS.getValueType());		SDValue Zero = DAG.getConstant(0, dl, LHS.getValueType());

SDValue SumNeg = DAG.getSetCC(dl, BoolVT, Sum, Zero, ISD::SETLT);		if (Opcode == ISD::SADDSAT) {
APInt MinVal = APInt::getSignedMinValue(BitWidth);		APInt MinVal = APInt::getSignedMinValue(BitWidth);
APInt MaxVal = APInt::getSignedMaxValue(BitWidth);		APInt MaxVal = APInt::getSignedMaxValue(BitWidth);
SDValue SatMin = DAG.getConstant(MinVal, dl, ResultType);		SDValue SatMin = DAG.getConstant(MinVal, dl, ResultType);
SDValue SatMax = DAG.getConstant(MaxVal, dl, ResultType);		SDValue SatMax = DAG.getConstant(MaxVal, dl, ResultType);
		SDValue SumNeg = DAG.getSetCC(dl, BoolVT, Sum, Zero, ISD::SETLT);
Result = DAG.getSelect(dl, ResultType, SumNeg, SatMax, SatMin);		Result = DAG.getSelect(dl, ResultType, SumNeg, SatMax, SatMin);
return DAG.getSelect(dl, ResultType, Overflow, Result, Sum);		return DAG.getSelect(dl, ResultType, Overflow, Result, Sum);
		} else {
		APInt MaxVal = APInt::getMaxValue(BitWidth);
		SDValue SatMax = DAG.getConstant(MaxVal, dl, ResultType);
		return DAG.getSelect(dl, ResultType, Overflow, SatMax, Sum);
		}
}		}
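
The two branches of this expansion can be modelled on plain 32-bit integers. The following is a minimal sketch rather than LLVM code: `uadd_sat32` and `sadd_sat32` are hypothetical helper names, and the wrapped sum plus overflow flag stand in for the two results of UADDO/SADDO. It assumes two's-complement conversion from `uint32_t` to `int32_t` (guaranteed since C++20).

```cpp
#include <cstdint>
#include <limits>

// Hypothetical scalar model of the UADDSAT branch: checking the overflow
// flag alone is enough, since the only saturation value is the unsigned max.
inline std::uint32_t uadd_sat32(std::uint32_t lhs, std::uint32_t rhs) {
  std::uint32_t sum = lhs + rhs;  // wraps modulo 2^32, like the UADDO sum
  bool overflow = sum < lhs;      // carry out, like the UADDO flag
  return overflow ? std::numeric_limits<std::uint32_t>::max() : sum;
}

// Hypothetical scalar model of the SADDSAT branch: on overflow, the sign of
// the wrapped sum tells which bound was crossed.
inline std::int32_t sadd_sat32(std::int32_t lhs, std::int32_t rhs) {
  // Add in unsigned arithmetic to get the wrapped sum without signed overflow.
  std::uint32_t wrapped =
      static_cast<std::uint32_t>(lhs) + static_cast<std::uint32_t>(rhs);
  std::int32_t sum = static_cast<std::int32_t>(wrapped);
  // Signed overflow happened iff both operands differ in sign from the sum.
  bool overflow = ((lhs ^ sum) & (rhs ^ sum)) < 0;
  // Overflow && Sum <  0 -> SatMax; Overflow && Sum >= 0 -> SatMin.
  std::int32_t sat = sum < 0 ? std::numeric_limits<std::int32_t>::max()
                             : std::numeric_limits<std::int32_t>::min();
  return overflow ? sat : sum;
}
```

In the signed case a wrapped sum of exactly zero can only arise from negative overflow, which is why the SatMin condition is `Sum >= 0` rather than `Sum > 0`.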

llvm/lib/CodeGen/TargetLoweringBase.cpp

   for (MVT VT : MVT::all_valuetypes()) {
     setOperationAction(ISD::FMAXNAN, VT, Expand);
     setOperationAction(ISD::FMAD, VT, Expand);
     setOperationAction(ISD::SMIN, VT, Expand);
     setOperationAction(ISD::SMAX, VT, Expand);
     setOperationAction(ISD::UMIN, VT, Expand);
     setOperationAction(ISD::UMAX, VT, Expand);
     setOperationAction(ISD::ABS, VT, Expand);
     setOperationAction(ISD::SADDSAT, VT, Expand);
+    setOperationAction(ISD::UADDSAT, VT, Expand);

     // Overflow operations default to expand
     setOperationAction(ISD::SADDO, VT, Expand);
     setOperationAction(ISD::SSUBO, VT, Expand);
     setOperationAction(ISD::UADDO, VT, Expand);
     setOperationAction(ISD::USUBO, VT, Expand);
     setOperationAction(ISD::SMULO, VT, Expand);
     setOperationAction(ISD::UMULO, VT, Expand);

llvm/lib/IR/Verifier.cpp

     if (CS.isCall()) {
       if (!CS.getType()->isVoidTy() && RI)
         Assert(RI->getReturnValue() == DeoptCI,
                "calls to experimental_deoptimize must be followed by a return "
                "of the value computed by experimental_deoptimize");
     }

     break;
   }
-  case Intrinsic::sadd_sat: {
+  case Intrinsic::sadd_sat:
+  case Intrinsic::uadd_sat: {
     Value *Op1 = CS.getArgOperand(0);
     Value *Op2 = CS.getArgOperand(1);
-    Assert(Op1->getType()->isIntOrIntVectorTy(),
-           "first operand of sadd_sat must be an int type or vector of ints");
-    Assert(Op2->getType()->isIntOrIntVectorTy(),
-           "second operand of sadd_sat must be an int type or vector of ints");
+    Assert(
+        Op1->getType()->isIntOrIntVectorTy(),
+        "first operand of [us]add_sat must be an int type or vector of ints");
+    Assert(
+        Op2->getType()->isIntOrIntVectorTy(),
+        "second operand of [us]add_sat must be an int type or vector of ints");
     break;
   }
   };
 }

 /// Carefully grab the subprogram from a local scope.
 ///
 /// This carefully grabs the subprogram from a local scope, avoiding the

llvm/test/CodeGen/X86/uadd_sat.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mcpu=generic -mtriple=x86_64-linux | FileCheck %s
				; RUN: llc < %s -mcpu=generic -mtriple=i686 -mattr=cmov | FileCheck %s --check-prefix=CHECK32

				declare i4 @llvm.uadd.sat.i4 (i4, i4)
				declare i32 @llvm.uadd.sat.i32 (i32, i32)
				declare i64 @llvm.uadd.sat.i64 (i64, i64)
				declare <4 x i32> @llvm.uadd.sat.v4i32(<4 x i32>, <4 x i32>)

				define i32 @func(i32 %x, i32 %y) {
				; CHECK-LABEL: func:
				; CHECK: # %bb.0:
				; CHECK-NEXT: addl %esi, %edi
				; CHECK-NEXT: movl $-1, %eax
				; CHECK-NEXT: cmovael %edi, %eax
				; CHECK-NEXT: retq
				;
				; CHECK32-LABEL: func:
				; CHECK32: # %bb.0:
				; CHECK32-NEXT: movl {{[0-9]+}}(%esp), %ecx
				; CHECK32-NEXT: addl {{[0-9]+}}(%esp), %ecx
				; CHECK32-NEXT: movl $-1, %eax
				; CHECK32-NEXT: cmovael %ecx, %eax
				; CHECK32-NEXT: retl
				%tmp = call i32 @llvm.uadd.sat.i32(i32 %x, i32 %y);
				ret i32 %tmp;
				}

				define i64 @func2(i64 %x, i64 %y) {
				; CHECK-LABEL: func2:
				; CHECK: # %bb.0:
				; CHECK-NEXT: addq %rsi, %rdi
				; CHECK-NEXT: movq $-1, %rax
				; CHECK-NEXT: cmovaeq %rdi, %rax
				; CHECK-NEXT: retq
				;
				; CHECK32-LABEL: func2:
				; CHECK32: # %bb.0:
				; CHECK32-NEXT: movl {{[0-9]+}}(%esp), %eax
				; CHECK32-NEXT: movl {{[0-9]+}}(%esp), %edx
				; CHECK32-NEXT: addl {{[0-9]+}}(%esp), %eax
				; CHECK32-NEXT: adcl {{[0-9]+}}(%esp), %edx
				; CHECK32-NEXT: movl $-1, %ecx
				; CHECK32-NEXT: cmovbl %ecx, %edx
				; CHECK32-NEXT: cmovbl %ecx, %eax
				; CHECK32-NEXT: retl
				%tmp = call i64 @llvm.uadd.sat.i64(i64 %x, i64 %y);
				ret i64 %tmp;
				}

				define i4 @func3(i4 %x, i4 %y) {
				; CHECK-LABEL: func3:
				; CHECK: # %bb.0:
				; CHECK-NEXT: shlb $4, %sil
				; CHECK-NEXT: shlb $4, %dil
				; CHECK-NEXT: addb %sil, %dil
				; CHECK-NEXT: movb $-1, %al
				; CHECK-NEXT: jb .LBB2_2
				; CHECK-NEXT: # %bb.1:
				; CHECK-NEXT: movl %edi, %eax
				; CHECK-NEXT: .LBB2_2:
				; CHECK-NEXT: shrb $4, %al
				; CHECK-NEXT: retq
				;
				; CHECK32-LABEL: func3:
				; CHECK32: # %bb.0:
				; CHECK32-NEXT: movb {{[0-9]+}}(%esp), %cl
				; CHECK32-NEXT: movb {{[0-9]+}}(%esp), %al
				; CHECK32-NEXT: shlb $4, %al
				; CHECK32-NEXT: shlb $4, %cl
				; CHECK32-NEXT: addb %al, %cl
				; CHECK32-NEXT: movb $-1, %al
				; CHECK32-NEXT: jb .LBB2_2
				; CHECK32-NEXT: # %bb.1:
				; CHECK32-NEXT: movl %ecx, %eax
				; CHECK32-NEXT: .LBB2_2:
				; CHECK32-NEXT: shrb $4, %al
				; CHECK32-NEXT: retl
				%tmp = call i4 @llvm.uadd.sat.i4(i4 %x, i4 %y);
				ret i4 %tmp;
				}

				define <4 x i32> @vec(<4 x i32> %x, <4 x i32> %y) {
				; CHECK-LABEL: vec:
				; CHECK: # %bb.0:
				; CHECK-NEXT: pshufd {{.*#+}} xmm2 = xmm1[3,1,2,3]
				; CHECK-NEXT: movd %xmm2, %eax
				; CHECK-NEXT: pshufd {{.*#+}} xmm2 = xmm0[3,1,2,3]
				; CHECK-NEXT: movd %xmm2, %ecx
				; CHECK-NEXT: addl %eax, %ecx
				; CHECK-NEXT: movl $-1, %eax
				; CHECK-NEXT: cmovbl %eax, %ecx
				; CHECK-NEXT: movd %ecx, %xmm2
				; CHECK-NEXT: pshufd {{.*#+}} xmm3 = xmm1[2,3,0,1]
				; CHECK-NEXT: movd %xmm3, %ecx
				; CHECK-NEXT: pshufd {{.*#+}} xmm3 = xmm0[2,3,0,1]
				; CHECK-NEXT: movd %xmm3, %edx
				; CHECK-NEXT: addl %ecx, %edx
				; CHECK-NEXT: cmovbl %eax, %edx
				; CHECK-NEXT: movd %edx, %xmm3
				; CHECK-NEXT: punpckldq {{.*#+}} xmm3 = xmm3[0],xmm2[0],xmm3[1],xmm2[1]
				; CHECK-NEXT: movd %xmm1, %ecx
				; CHECK-NEXT: movd %xmm0, %edx
				; CHECK-NEXT: addl %ecx, %edx
				; CHECK-NEXT: cmovbl %eax, %edx
				; CHECK-NEXT: movd %edx, %xmm2
				; CHECK-NEXT: pshufd {{.*#+}} xmm1 = xmm1[1,1,2,3]
				; CHECK-NEXT: movd %xmm1, %ecx
				; CHECK-NEXT: pshufd {{.*#+}} xmm0 = xmm0[1,1,2,3]
				; CHECK-NEXT: movd %xmm0, %edx
				; CHECK-NEXT: addl %ecx, %edx
				; CHECK-NEXT: cmovbl %eax, %edx
				; CHECK-NEXT: movd %edx, %xmm0
				; CHECK-NEXT: punpckldq {{.*#+}} xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1]
				; CHECK-NEXT: punpcklqdq {{.*#+}} xmm2 = xmm2[0],xmm3[0]
				; CHECK-NEXT: movdqa %xmm2, %xmm0
				; CHECK-NEXT: retq
				;
				; CHECK32-LABEL: vec:
				; CHECK32: # %bb.0:
				; CHECK32-NEXT: pushl %ebx
				; CHECK32-NEXT: .cfi_def_cfa_offset 8
				; CHECK32-NEXT: pushl %edi
				; CHECK32-NEXT: .cfi_def_cfa_offset 12
				; CHECK32-NEXT: pushl %esi
				; CHECK32-NEXT: .cfi_def_cfa_offset 16
				; CHECK32-NEXT: .cfi_offset %esi, -16
				; CHECK32-NEXT: .cfi_offset %edi, -12
				; CHECK32-NEXT: .cfi_offset %ebx, -8
				; CHECK32-NEXT: movl {{[0-9]+}}(%esp), %eax
				; CHECK32-NEXT: movl {{[0-9]+}}(%esp), %ecx
				; CHECK32-NEXT: movl {{[0-9]+}}(%esp), %edx
				; CHECK32-NEXT: movl {{[0-9]+}}(%esp), %esi
				; CHECK32-NEXT: movl {{[0-9]+}}(%esp), %edi
				; CHECK32-NEXT: addl {{[0-9]+}}(%esp), %edi
				; CHECK32-NEXT: movl $-1, %ebx
				; CHECK32-NEXT: cmovbl %ebx, %edi
				; CHECK32-NEXT: addl {{[0-9]+}}(%esp), %esi
				; CHECK32-NEXT: cmovbl %ebx, %esi
				; CHECK32-NEXT: addl {{[0-9]+}}(%esp), %edx
				; CHECK32-NEXT: cmovbl %ebx, %edx
				; CHECK32-NEXT: addl {{[0-9]+}}(%esp), %ecx
				; CHECK32-NEXT: cmovbl %ebx, %ecx
				; CHECK32-NEXT: movl %ecx, 12(%eax)
				; CHECK32-NEXT: movl %edx, 8(%eax)
				; CHECK32-NEXT: movl %esi, 4(%eax)
				; CHECK32-NEXT: movl %edi, (%eax)
				; CHECK32-NEXT: popl %esi
				; CHECK32-NEXT: .cfi_def_cfa_offset 12
				; CHECK32-NEXT: popl %edi
				; CHECK32-NEXT: .cfi_def_cfa_offset 8
				; CHECK32-NEXT: popl %ebx
				; CHECK32-NEXT: .cfi_def_cfa_offset 4
				; CHECK32-NEXT: retl $4
				%tmp = call <4 x i32> @llvm.uadd.sat.v4i32(<4 x i32> %x, <4 x i32> %y);
				ret <4 x i32> %tmp;
				}
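
The `func3` checks above show how the `i4` case is handled once it is widened to 8 bits: both operands are shifted left by 4, added as bytes, replaced with all-ones if the add carries, and shifted back right by 4. A minimal sketch of that sequence on plain integers follows; `uadd_sat4` is a hypothetical name used only for illustration, and the inputs are assumed to hold 4-bit values in their low nibble.

```cpp
#include <cstdint>

// Hypothetical model of the promoted i4 unsigned saturating add: placing the
// nibbles in the top half of a byte makes the 8-bit carry coincide with
// 4-bit overflow.
inline std::uint8_t uadd_sat4(std::uint8_t lhs, std::uint8_t rhs) {
  std::uint8_t x = static_cast<std::uint8_t>((lhs & 0xF) << 4);  // shlb $4
  std::uint8_t y = static_cast<std::uint8_t>((rhs & 0xF) << 4);  // shlb $4
  std::uint8_t sum = static_cast<std::uint8_t>(x + y);           // addb
  bool carry = sum < x;                                          // jb
  std::uint8_t sat = carry ? std::uint8_t{0xFF} : sum;           // movb $-1
  return static_cast<std::uint8_t>(sat >> 4);                    // shrb $4
}
```

For example, `uadd_sat4(9, 9)` carries out of the byte and therefore yields 15, the largest 4-bit value.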


[Intrinsic] Unsigned Saturation Addition Intrinsic
Closed, Public

Diff 169891

llvm/include/llvm/CodeGen/ISDOpcodes.h

llvm/include/llvm/CodeGen/TargetLowering.h

llvm/include/llvm/IR/Intrinsics.td

llvm/include/llvm/Target/TargetSelectionDAG.td

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp

llvm/lib/CodeGen/TargetLoweringBase.cpp

llvm/lib/IR/Verifier.cpp

llvm/test/CodeGen/X86/uadd_sat.ll
