This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
docs/
-
LangRef.rst
-
include/llvm/
-
llvm/
-
CodeGen/
-
ISDOpcodes.h
-
TargetLowering.h
-
IR/
-
Intrinsics.td
-
Target/
-
TargetSelectionDAG.td
-
lib/
-
Analysis/
-
VectorUtils.cpp
-
CodeGen/
-
SelectionDAG/
-
DAGCombiner.cpp
-
LegalizeDAG.cpp
-
LegalizeIntegerTypes.cpp
-
LegalizeVectorOps.cpp
-
LegalizeVectorTypes.cpp
-
SelectionDAGBuilder.cpp
-
SelectionDAGDumper.cpp
-
TargetLowering.cpp
-
TargetLoweringBase.cpp
-
IR/
-
Verifier.cpp
-
test/
-
CodeGen/
-
PowerPC/
-
umulfixsat.ll
-
X86/
-
mulfix_combine.ll
-
umul_fix_sat.ll
-
vector-mulfix-legalize.ll
-
Transforms/Scalarizer/
-
Scalarizer/
-
intrinsics.ll
-
Verifier/
-
intrinsic-immarg.ll

Differential D57836

[Intrinsic] Unsigned Fixed Point Saturation Multiplication Intrinsic
ClosedPublic

Authored by bjope on Feb 6 2019, 12:26 PM.

Download Raw Diff

Details

Reviewers

RKSimon
craig.topper
bevinh
leonardchan
lebedev.ri
spatel

Commits

rG5e331e4ce85a: [Intrinsic] Add the llvm.umul.fix.sat intrinsic
rL371308: [Intrinsic] Add the llvm.umul.fix.sat intrinsic

Summary

Add an intrinsic that takes 2 unsigned integers with the scale of them provided as the third argument and performs fixed point multiplication on them. The result is saturated and clamped between the largest and smallest representable values of the first 2 operands.

This is a part of implementing fixed point arithmetic in clang where some of the more complex operations will be implemented as intrinsics.

Diff Detail

Repository: rL LLVM

Event Timeline

leonardchan created this revision.Feb 6 2019, 12:26 PM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 6 2019, 12:26 PM

Herald added a subscriber: hiraditya. · View Herald Transcript

leonardchan added a child revision: D55720: [Intrinsic] Signed Fixed Point Saturation Multiplication Intrinsic.Feb 6 2019, 12:26 PM

Ka-Ka added a subscriber: Ka-Ka.May 20 2019, 2:02 PM

bjope mentioned this in D66052: [DAGCombiner] Add simple folds for SMULFIX/UMULFIX/SMULFIXSAT.Aug 16 2019, 4:41 AM

Rebase and update now that D66052 has landed?

Herald added a subscriber: jdoerfert. · View Herald TranscriptAug 22 2019, 8:27 AM

I'll try to finish the work on this patch (hence the commandeer action). I hope @leonardchan won't mind that.

Rebase turned out not to be trivial, so I'll look into that.

Working on a rebase to trunk.

In D57836#1641679, @bjope wrote:

I'll try to finish the work on this patch (hence the commandeer action). I hope @leonardchan won't mind that.

Woops. Yeah sorry I've gotten sidetracked other projects but I don't mind being the reviewer for this. Thanks for commandeering. Didn't even know you could do that on phabricator.

bjope added inline comments.Aug 31 2019, 2:01 AM

llvm/docs/LangRef.rst
13321 ↗	(On Diff #185613)	Should say: "; %res = 4 (or 5)"
13323 ↗	(On Diff #185613)	Neither of these examples will overflow as currently written so they do not really describe the effect of saturation (and the results shown are incorrect afaict). I'm planning to change this into: ; Saturation %res = call i4 @llvm.umul.fix.sat.i4(i4 8, i4 2, i32 0) ; %res = 15 (8 x 2 -> clamped to 15) %res = call i4 @llvm.umul.fix.sat.i4(i4 8, i4 8, i32 2) ; %res = 15 (2 x 2 -> clamped to 3.75)
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
2750 ↗	(On Diff #185613)	I don't get this. Afaict SatMin isn't used for unsigned. Anyway, I'm currently thinking about refactoring this part of the code, in a separate patch, before adding the umul.fix.sat support.
2780 ↗	(On Diff #185613)	Afaict this should be the inverse. We overflow if `HH != 0`, and then we need to clamp. Now we clamp to maximum value when `HH == 0`.

Commandeered (from leonardchan).
Rebased.
Now being based on monorepo.
Fixed bugs in LangRef.
Fixed bugs in ExpandIntRes_MULFIX (basically reimplemented saturation for UMULFIXSAT after breaking it out into a separate if-clause (no longer implementing shift by scale, signed saturation and unsigned saturation in the same piece of code).

Harbormaster completed remote builds in B37594: Diff 218208.Aug 31 2019, 2:57 AM

bjope marked 4 inline comments as done.Aug 31 2019, 2:59 AM

RKSimon added reviewers: lebedev.ri, spatel.Sep 1 2019, 5:50 AM

RKSimon added inline comments.

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
6724 ↗	(On Diff #218208)	Can't we now get here if !Saturating && !isOperationLegalOrCustom(ISD::MUL, VT) ?

bjope added inline comments.Sep 1 2019, 8:55 AM

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
6724 ↗	(On Diff #218208)	Yes, that would be wrong! I will restore this to fall through to check for SMUL_LOHI, MULHS etc. Another idea is to disallow scale==0 for SMULFIX and UMULFIX. SelectionDAGBuilder could just select an ordinary MUL instead for smul.fix when scale is zero. Similarly umul.fix/umul.fix.sat with scale==VTSize could be selected as MULHU instead of UMULFIX and UMULFIXSAT (getting rid of some more special cases).
6810 ↗	(On Diff #218208)	This does not work for Scale==0 (and we might pass through the special handling for Scale==0 above. The same problem exists in the old code. I'll need to look into this a bit more.

Some fixes are planned:

Fix problem with scale==0 spotted in TargetLowering::expandFixedPointMul
Add fixes that has been added for smul.fix.sat since this patch originally was put up for review (isTriviallyVectorizable, VectorLegalizer::Expand, DAGTypeLegalizer::WidenVectorResult, DAGCompielr::visitMULFIX).

Fixes to get umul.fix.sat on par with smul.fix.sat. Basically we now handle umul.fix.sat wherever we handled smul.fix.sat (and UMULFIXSAT wherever we handle SMULFIXSAT), except for ConstantFolding.

Herald added subscribers: jsji, MaskRay, nemanjai. · View Herald TranscriptSep 2 2019, 4:41 AM

Harbormaster completed remote builds in B37624: Diff 218331.Sep 2 2019, 4:44 AM

Herald added a subscriber: • wuzish. · View Herald TranscriptSep 2 2019, 4:44 AM

bjope added a child revision: D67071: [CodeGen] Handle SMULFIXSAT with scale zero in TargetLowering::expandFixedPointMul.Sep 2 2019, 4:45 AM

bjope marked 2 inline comments as done.Sep 4 2019, 10:51 PM

bjope added inline comments.

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
6810 ↗	(On Diff #218208)	Fix for that is in https://reviews.llvm.org/D67071

Friendly ping.
(totally understand if tihs hasn't been on top of prio list for reviewers, and that it might be a lot to digest)

As I wrote earlier, right now this should be "on par" with smul.fix.sat when it comes to promotion/widening/legalization/scalarization etc. Also matching the amount of in-tree testing that we got for smul.fix.sat.
No outstanding comments that has not been resolved (afaict after having commandeered this revision).

Herald added a subscriber: ychen. · View Herald TranscriptSep 5 2019, 11:49 PM

rebase after D67071 has landed? (and add equivalent scale == 0 tests)

In D57836#1660809, @RKSimon wrote:

rebase after D67071 has landed? (and add equivalent scale == 0 tests)

Right now D67071 is based on this one. So it would be simpler to land this first.
And this patch also already includes the equivalent scale==0 zero tests: llvm/test/CodeGen/PowerPC/umulfixsat.ll

LGTM. Thanks for taking over this!

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
2984 ↗	(On Diff #218331)	Nit: Add an else case with an `llvm_unreachable` mentioning that we returned earlier when `Scale == VTSize` and that the scale must be less than or equal to the width of the operands.

This revision is now accepted and ready to land.Sep 6 2019, 11:05 AM

Closed by commit rL371308: [Intrinsic] Add the llvm.umul.fix.sat intrinsic (authored by bjope). · Explain WhySep 7 2019, 5:15 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

trunk/

docs/

LangRef.rst

67 lines

include/

llvm/

CodeGen/

ISDOpcodes.h

2 lines

TargetLowering.h

5 lines

IR/

Intrinsics.td

3 lines

Target/

TargetSelectionDAG.td

1 line

lib/

Analysis/

VectorUtils.cpp

2 lines

CodeGen/

SelectionDAG/

DAGCombiner.cpp

6 lines

LegalizeDAG.cpp

4 lines

LegalizeIntegerTypes.cpp

84 lines

LegalizeVectorOps.cpp

13 lines

LegalizeVectorTypes.cpp

3 lines

SelectionDAGBuilder.cpp

8 lines

SelectionDAGDumper.cpp

1 line

TargetLowering.cpp

75 lines

TargetLoweringBase.cpp

1 line

IR/

Verifier.cpp

3 lines

test/

CodeGen/

PowerPC/

umulfixsat.ll

36 lines

X86/

mulfix_combine.ll

38 lines

umul_fix_sat.ll

542 lines

vector-mulfix-legalize.ll

45 lines

Transforms/

Scalarizer/

intrinsics.ll

12 lines

Verifier/

intrinsic-immarg.ll

18 lines

Diff 219237

llvm/trunk/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 13,758 Lines • ▼ Show 20 Lines	.. code-block:: llvm
%res = call i4 @llvm.smul.fix.sat.i4(i4 -8, i4 5, i32 2) ; %res = -8		%res = call i4 @llvm.smul.fix.sat.i4(i4 -8, i4 5, i32 2) ; %res = -8
%res = call i4 @llvm.smul.fix.sat.i4(i4 -8, i4 -2, i32 1) ; %res = 7		%res = call i4 @llvm.smul.fix.sat.i4(i4 -8, i4 -2, i32 1) ; %res = 7

; Scale can affect the saturation result		; Scale can affect the saturation result
%res = call i4 @llvm.smul.fix.sat.i4(i4 2, i4 4, i32 0) ; %res = 7 (2 x 4 -> clamped to 7)		%res = call i4 @llvm.smul.fix.sat.i4(i4 2, i4 4, i32 0) ; %res = 7 (2 x 4 -> clamped to 7)
%res = call i4 @llvm.smul.fix.sat.i4(i4 2, i4 4, i32 1) ; %res = 4 (1 x 2 = 2)		%res = call i4 @llvm.smul.fix.sat.i4(i4 2, i4 4, i32 1) ; %res = 4 (1 x 2 = 2)


		'``llvm.umul.fix.sat.*``' Intrinsics
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Syntax
		"""""""

		This is an overloaded intrinsic. You can use ``llvm.umul.fix.sat``
		on any integer bit width or vectors of integers.

		::

		declare i16 @llvm.umul.fix.sat.i16(i16 %a, i16 %b, i32 %scale)
		declare i32 @llvm.umul.fix.sat.i32(i32 %a, i32 %b, i32 %scale)
		declare i64 @llvm.umul.fix.sat.i64(i64 %a, i64 %b, i32 %scale)
		declare <4 x i32> @llvm.umul.fix.sat.v4i32(<4 x i32> %a, <4 x i32> %b, i32 %scale)

		Overview
		"""""""""

		The '``llvm.umul.fix.sat``' family of intrinsic functions perform unsigned
		fixed point saturation multiplication on 2 arguments of the same scale.

		Arguments
		""""""""""

		The arguments (%a and %b) and the result may be of integer types of any bit
		width, but they must have the same bit width. ``%a`` and ``%b`` are the two
		values that will undergo unsigned fixed point multiplication. The argument
		``%scale`` represents the scale of both operands, and must be a constant
		integer.

		Semantics:
		""""""""""

		This operation performs fixed point multiplication on the 2 arguments of a
		specified scale. The result will also be returned in the same scale specified
		in the third argument.

		If the result value cannot be precisely represented in the given scale, the
		value is rounded up or down to the closest representable value. The rounding
		direction is unspecified.

		The maximum value this operation can clamp to is the largest unsigned value
		representable by the bit width of the first 2 arguments. The minimum value is the
		smallest unsigned value representable by this bit width (zero).


		Examples
		"""""""""

		.. code-block:: llvm

		%res = call i4 @llvm.umul.fix.sat.i4(i4 3, i4 2, i32 0) ; %res = 6 (2 x 3 = 6)
		%res = call i4 @llvm.umul.fix.sat.i4(i4 3, i4 2, i32 1) ; %res = 3 (1.5 x 1 = 1.5)

		; The result in the following could be rounded down to 2 or up to 2.5
		%res = call i4 @llvm.umul.fix.sat.i4(i4 3, i4 3, i32 1) ; %res = 4 (or 5) (1.5 x 1.5 = 2.25)

		; Saturation
		%res = call i4 @llvm.umul.fix.sat.i4(i4 8, i4 2, i32 0) ; %res = 15 (8 x 2 -> clamped to 15)
		%res = call i4 @llvm.umul.fix.sat.i4(i4 8, i4 8, i32 2) ; %res = 15 (2 x 2 -> clamped to 3.75)

		; Scale can affect the saturation result
		%res = call i4 @llvm.umul.fix.sat.i4(i4 2, i4 4, i32 0) ; %res = 7 (2 x 4 -> clamped to 7)
		%res = call i4 @llvm.umul.fix.sat.i4(i4 2, i4 4, i32 1) ; %res = 4 (1 x 2 = 2)


Specialised Arithmetic Intrinsics		Specialised Arithmetic Intrinsics
---------------------------------		---------------------------------

'``llvm.canonicalize.*``' Intrinsic		'``llvm.canonicalize.*``' Intrinsic
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Syntax:		Syntax:
"""""""		"""""""
▲ Show 20 Lines • Show All 3,839 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/CodeGen/ISDOpcodes.h

Show First 20 Lines • Show All 275 Lines • ▼ Show 20 Lines	enum NodeType {
/// both operands as fixed point numbers. This SCALE parameter must be a		/// both operands as fixed point numbers. This SCALE parameter must be a
/// constant integer. A scale of zero is effectively performing		/// constant integer. A scale of zero is effectively performing
/// multiplication on 2 integers.		/// multiplication on 2 integers.
SMULFIX, UMULFIX,		SMULFIX, UMULFIX,

/// Same as the corresponding unsaturated fixed point instructions, but the		/// Same as the corresponding unsaturated fixed point instructions, but the
/// result is clamped between the min and max values representable by the		/// result is clamped between the min and max values representable by the
/// bits of the first 2 operands.		/// bits of the first 2 operands.
SMULFIXSAT,		SMULFIXSAT, UMULFIXSAT,

/// Simple binary floating point operators.		/// Simple binary floating point operators.
FADD, FSUB, FMUL, FDIV, FREM,		FADD, FSUB, FMUL, FDIV, FREM,

/// Constrained versions of the binary floating point operators.		/// Constrained versions of the binary floating point operators.
/// These will be lowered to the simple operators before final selection.		/// These will be lowered to the simple operators before final selection.
/// They are used to limit optimizations while the DAG is being		/// They are used to limit optimizations while the DAG is being
/// optimized.		/// optimized.
▲ Show 20 Lines • Show All 806 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/CodeGen/TargetLowering.h

Show First 20 Lines • Show All 917 Lines • ▼ Show 20 Lines	LegalizeAction getFixedPointOperationAction(unsigned Op, EVT VT,
// scales.		// scales.
bool Supported;		bool Supported;
switch (Op) {		switch (Op) {
default:		default:
llvm_unreachable("Unexpected fixed point operation.");		llvm_unreachable("Unexpected fixed point operation.");
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
		case ISD::UMULFIXSAT:
Supported = isSupportedFixedPointOperation(Op, VT, Scale);		Supported = isSupportedFixedPointOperation(Op, VT, Scale);
break;		break;
}		}

return Supported ? Action : Expand;		return Supported ? Action : Expand;
}		}

// If Op is a strict floating-point operation, return the result		// If Op is a strict floating-point operation, return the result
▲ Show 20 Lines • Show All 3,158 Lines • ▼ Show 20 Lines
/// bounds.		/// bounds.
SDValue getVectorElementPointer(SelectionDAG &DAG, SDValue VecPtr, EVT VecVT,		SDValue getVectorElementPointer(SelectionDAG &DAG, SDValue VecPtr, EVT VecVT,
SDValue Index) const;		SDValue Index) const;

/// Method for building the DAG expansion of ISD::[US][ADD\|SUB]SAT. This		/// Method for building the DAG expansion of ISD::[US][ADD\|SUB]SAT. This
/// method accepts integers as its arguments.		/// method accepts integers as its arguments.
SDValue expandAddSubSat(SDNode *Node, SelectionDAG &DAG) const;		SDValue expandAddSubSat(SDNode *Node, SelectionDAG &DAG) const;

/// Method for building the DAG expansion of ISD::SMULFIX. This method accepts		/// Method for building the DAG expansion of ISD::[U\|S]MULFIX[SAT]. This
/// integers as its arguments.		/// method accepts integers as its arguments.
SDValue expandFixedPointMul(SDNode *Node, SelectionDAG &DAG) const;		SDValue expandFixedPointMul(SDNode *Node, SelectionDAG &DAG) const;

/// Method for building the DAG expansion of ISD::U(ADD\|SUB)O. Expansion		/// Method for building the DAG expansion of ISD::U(ADD\|SUB)O. Expansion
/// always suceeds and populates the Result and Overflow arguments.		/// always suceeds and populates the Result and Overflow arguments.
void expandUADDSUBO(SDNode *Node, SDValue &Result, SDValue &Overflow,		void expandUADDSUBO(SDNode *Node, SDValue &Result, SDValue &Overflow,
SelectionDAG &DAG) const;		SelectionDAG &DAG) const;

/// Method for building the DAG expansion of ISD::S(ADD\|SUB)O. Expansion		/// Method for building the DAG expansion of ISD::S(ADD\|SUB)O. Expansion
▲ Show 20 Lines • Show All 106 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/IR/Intrinsics.td

Show First 20 Lines • Show All 889 Lines • ▼ Show 20 Lines	def int_umul_fix : Intrinsic<[llvm_anyint_ty],
[LLVMMatchType<0>, LLVMMatchType<0>, llvm_i32_ty],		[LLVMMatchType<0>, LLVMMatchType<0>, llvm_i32_ty],
[IntrNoMem, IntrSpeculatable, IntrWillReturn, Commutative, ImmArg<2>]>;		[IntrNoMem, IntrSpeculatable, IntrWillReturn, Commutative, ImmArg<2>]>;

//===------------------- Fixed Point Saturation Arithmetic Intrinsics ----------------===//		//===------------------- Fixed Point Saturation Arithmetic Intrinsics ----------------===//
//		//
def int_smul_fix_sat : Intrinsic<[llvm_anyint_ty],		def int_smul_fix_sat : Intrinsic<[llvm_anyint_ty],
[LLVMMatchType<0>, LLVMMatchType<0>, llvm_i32_ty],		[LLVMMatchType<0>, LLVMMatchType<0>, llvm_i32_ty],
[IntrNoMem, IntrSpeculatable, IntrWillReturn, Commutative, ImmArg<2>]>;		[IntrNoMem, IntrSpeculatable, IntrWillReturn, Commutative, ImmArg<2>]>;
		def int_umul_fix_sat : Intrinsic<[llvm_anyint_ty],
		[LLVMMatchType<0>, LLVMMatchType<0>, llvm_i32_ty],
		[IntrNoMem, IntrSpeculatable, IntrWillReturn, Commutative, ImmArg<2>]>;

//===------------------------- Memory Use Markers -------------------------===//		//===------------------------- Memory Use Markers -------------------------===//
//		//
def int_lifetime_start : Intrinsic<[],		def int_lifetime_start : Intrinsic<[],
[llvm_i64_ty, llvm_anyptr_ty],		[llvm_i64_ty, llvm_anyptr_ty],
[IntrArgMemOnly, IntrWillReturn, NoCapture<1>, ImmArg<0>]>;		[IntrArgMemOnly, IntrWillReturn, NoCapture<1>, ImmArg<0>]>;
def int_lifetime_end : Intrinsic<[],		def int_lifetime_end : Intrinsic<[],
[llvm_i64_ty, llvm_anyptr_ty],		[llvm_i64_ty, llvm_anyptr_ty],
▲ Show 20 Lines • Show All 364 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/Target/TargetSelectionDAG.td

	Show First 20 Lines • Show All 390 Lines • ▼ Show 20 Lines
	def saddsat : SDNode<"ISD::SADDSAT" , SDTIntBinOp, [SDNPCommutative]>;			def saddsat : SDNode<"ISD::SADDSAT" , SDTIntBinOp, [SDNPCommutative]>;
	def uaddsat : SDNode<"ISD::UADDSAT" , SDTIntBinOp, [SDNPCommutative]>;			def uaddsat : SDNode<"ISD::UADDSAT" , SDTIntBinOp, [SDNPCommutative]>;
	def ssubsat : SDNode<"ISD::SSUBSAT" , SDTIntBinOp>;			def ssubsat : SDNode<"ISD::SSUBSAT" , SDTIntBinOp>;
	def usubsat : SDNode<"ISD::USUBSAT" , SDTIntBinOp>;			def usubsat : SDNode<"ISD::USUBSAT" , SDTIntBinOp>;

	def smulfix : SDNode<"ISD::SMULFIX" , SDTIntScaledBinOp, [SDNPCommutative]>;			def smulfix : SDNode<"ISD::SMULFIX" , SDTIntScaledBinOp, [SDNPCommutative]>;
	def smulfixsat : SDNode<"ISD::SMULFIXSAT", SDTIntScaledBinOp, [SDNPCommutative]>;			def smulfixsat : SDNode<"ISD::SMULFIXSAT", SDTIntScaledBinOp, [SDNPCommutative]>;
	def umulfix : SDNode<"ISD::UMULFIX" , SDTIntScaledBinOp, [SDNPCommutative]>;			def umulfix : SDNode<"ISD::UMULFIX" , SDTIntScaledBinOp, [SDNPCommutative]>;
				def umulfixsat : SDNode<"ISD::UMULFIXSAT", SDTIntScaledBinOp, [SDNPCommutative]>;

	def sext_inreg : SDNode<"ISD::SIGN_EXTEND_INREG", SDTExtInreg>;			def sext_inreg : SDNode<"ISD::SIGN_EXTEND_INREG", SDTExtInreg>;
	def sext_invec : SDNode<"ISD::SIGN_EXTEND_VECTOR_INREG", SDTExtInvec>;			def sext_invec : SDNode<"ISD::SIGN_EXTEND_VECTOR_INREG", SDTExtInvec>;
	def zext_invec : SDNode<"ISD::ZERO_EXTEND_VECTOR_INREG", SDTExtInvec>;			def zext_invec : SDNode<"ISD::ZERO_EXTEND_VECTOR_INREG", SDTExtInvec>;

	def abs : SDNode<"ISD::ABS" , SDTIntUnaryOp>;			def abs : SDNode<"ISD::ABS" , SDTIntUnaryOp>;
	def bitreverse : SDNode<"ISD::BITREVERSE" , SDTIntUnaryOp>;			def bitreverse : SDNode<"ISD::BITREVERSE" , SDTIntUnaryOp>;
	def bswap : SDNode<"ISD::BSWAP" , SDTIntUnaryOp>;			def bswap : SDNode<"ISD::BSWAP" , SDTIntUnaryOp>;
	▲ Show 20 Lines • Show All 1,154 Lines • Show Last 20 Lines

llvm/trunk/lib/Analysis/VectorUtils.cpp

Show First 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	bool llvm::isTriviallyVectorizable(Intrinsic::ID ID) {
case Intrinsic::fshr:		case Intrinsic::fshr:
case Intrinsic::sadd_sat:		case Intrinsic::sadd_sat:
case Intrinsic::ssub_sat:		case Intrinsic::ssub_sat:
case Intrinsic::uadd_sat:		case Intrinsic::uadd_sat:
case Intrinsic::usub_sat:		case Intrinsic::usub_sat:
case Intrinsic::smul_fix:		case Intrinsic::smul_fix:
case Intrinsic::smul_fix_sat:		case Intrinsic::smul_fix_sat:
case Intrinsic::umul_fix:		case Intrinsic::umul_fix:
		case Intrinsic::umul_fix_sat:
case Intrinsic::sqrt: // Begin floating-point.		case Intrinsic::sqrt: // Begin floating-point.
case Intrinsic::sin:		case Intrinsic::sin:
case Intrinsic::cos:		case Intrinsic::cos:
case Intrinsic::exp:		case Intrinsic::exp:
case Intrinsic::exp2:		case Intrinsic::exp2:
case Intrinsic::log:		case Intrinsic::log:
case Intrinsic::log10:		case Intrinsic::log10:
case Intrinsic::log2:		case Intrinsic::log2:
Show All 26 Lines	bool llvm::hasVectorInstrinsicScalarOpd(Intrinsic::ID ID,
switch (ID) {		switch (ID) {
case Intrinsic::ctlz:		case Intrinsic::ctlz:
case Intrinsic::cttz:		case Intrinsic::cttz:
case Intrinsic::powi:		case Intrinsic::powi:
return (ScalarOpdIdx == 1);		return (ScalarOpdIdx == 1);
case Intrinsic::smul_fix:		case Intrinsic::smul_fix:
case Intrinsic::smul_fix_sat:		case Intrinsic::smul_fix_sat:
case Intrinsic::umul_fix:		case Intrinsic::umul_fix:
		case Intrinsic::umul_fix_sat:
return (ScalarOpdIdx == 2);		return (ScalarOpdIdx == 2);
default:		default:
return false;		return false;
}		}
}		}

/// Returns intrinsic ID for call.		/// Returns intrinsic ID for call.
/// For the input call instruction it finds mapping intrinsic and returns		/// For the input call instruction it finds mapping intrinsic and returns
▲ Show 20 Lines • Show All 1,051 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,743 Lines • ▼ Show 20 Lines	SDValue DAGCombiner::visit(SDNode *N) {
case ISD::SSUBO:		case ISD::SSUBO:
case ISD::USUBO: return visitSUBO(N);		case ISD::USUBO: return visitSUBO(N);
case ISD::ADDE: return visitADDE(N);		case ISD::ADDE: return visitADDE(N);
case ISD::ADDCARRY: return visitADDCARRY(N);		case ISD::ADDCARRY: return visitADDCARRY(N);
case ISD::SUBE: return visitSUBE(N);		case ISD::SUBE: return visitSUBE(N);
case ISD::SUBCARRY: return visitSUBCARRY(N);		case ISD::SUBCARRY: return visitSUBCARRY(N);
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX: return visitMULFIX(N);		case ISD::UMULFIX:
		case ISD::UMULFIXSAT: return visitMULFIX(N);
case ISD::MUL: return visitMUL(N);		case ISD::MUL: return visitMUL(N);
case ISD::SDIV: return visitSDIV(N);		case ISD::SDIV: return visitSDIV(N);
case ISD::UDIV: return visitUDIV(N);		case ISD::UDIV: return visitUDIV(N);
case ISD::SREM:		case ISD::SREM:
case ISD::UREM: return visitREM(N);		case ISD::UREM: return visitREM(N);
case ISD::MULHU: return visitMULHU(N);		case ISD::MULHU: return visitMULHU(N);
case ISD::MULHS: return visitMULHS(N);		case ISD::MULHS: return visitMULHS(N);
case ISD::SMUL_LOHI: return visitSMUL_LOHI(N);		case ISD::SMUL_LOHI: return visitSMUL_LOHI(N);
▲ Show 20 Lines • Show All 1,753 Lines • ▼ Show 20 Lines	if (isNullConstant(CarryIn)) {
if (!LegalOperations \|\|		if (!LegalOperations \|\|
TLI.isOperationLegalOrCustom(ISD::USUBO, N->getValueType(0)))		TLI.isOperationLegalOrCustom(ISD::USUBO, N->getValueType(0)))
return DAG.getNode(ISD::USUBO, SDLoc(N), N->getVTList(), N0, N1);		return DAG.getNode(ISD::USUBO, SDLoc(N), N->getVTList(), N0, N1);
}		}

return SDValue();		return SDValue();
}		}

// Notice that "mulfix" can be any of SMULFIX, SMULFIXSAT and UMULFIX here.		// Notice that "mulfix" can be any of SMULFIX, SMULFIXSAT, UMULFIX and
		// UMULFIXSAT here.
SDValue DAGCombiner::visitMULFIX(SDNode *N) {		SDValue DAGCombiner::visitMULFIX(SDNode *N) {
SDValue N0 = N->getOperand(0);		SDValue N0 = N->getOperand(0);
SDValue N1 = N->getOperand(1);		SDValue N1 = N->getOperand(1);
SDValue Scale = N->getOperand(2);		SDValue Scale = N->getOperand(2);
EVT VT = N0.getValueType();		EVT VT = N0.getValueType();

// fold (mulfix x, undef, scale) -> 0		// fold (mulfix x, undef, scale) -> 0
if (N0.isUndef() \|\| N1.isUndef())		if (N0.isUndef() \|\| N1.isUndef())
▲ Show 20 Lines • Show All 17,397 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

Show First 20 Lines • Show All 1,109 Lines • ▼ Show 20 Lines	#endif
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT: {		case ISD::USUBSAT: {
Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));		Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));
break;		break;
}		}
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX: {		case ISD::UMULFIX:
		case ISD::UMULFIXSAT: {
unsigned Scale = Node->getConstantOperandVal(2);		unsigned Scale = Node->getConstantOperandVal(2);
Action = TLI.getFixedPointOperationAction(Node->getOpcode(),		Action = TLI.getFixedPointOperationAction(Node->getOpcode(),
Node->getValueType(0), Scale);		Node->getValueType(0), Scale);
break;		break;
}		}
case ISD::MSCATTER:		case ISD::MSCATTER:
Action = TLI.getOperationAction(Node->getOpcode(),		Action = TLI.getOperationAction(Node->getOpcode(),
cast<MaskedScatterSDNode>(Node)->getValue().getValueType());		cast<MaskedScatterSDNode>(Node)->getValue().getValueType());
▲ Show 20 Lines • Show All 2,221 Lines • ▼ Show 20 Lines	bool SelectionDAGLegalize::ExpandNode(SDNode *Node) {
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT:		case ISD::USUBSAT:
Results.push_back(TLI.expandAddSubSat(Node, DAG));		Results.push_back(TLI.expandAddSubSat(Node, DAG));
break;		break;
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
		case ISD::UMULFIXSAT:
Results.push_back(TLI.expandFixedPointMul(Node, DAG));		Results.push_back(TLI.expandFixedPointMul(Node, DAG));
break;		break;
case ISD::ADDCARRY:		case ISD::ADDCARRY:
case ISD::SUBCARRY: {		case ISD::SUBCARRY: {
SDValue LHS = Node->getOperand(0);		SDValue LHS = Node->getOperand(0);
SDValue RHS = Node->getOperand(1);		SDValue RHS = Node->getOperand(1);
SDValue Carry = Node->getOperand(2);		SDValue Carry = Node->getOperand(2);

▲ Show 20 Lines • Show All 1,252 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

Show First 20 Lines • Show All 144 Lines • ▼ Show 20 Lines	#endif
case ISD::SUBE:		case ISD::SUBE:
case ISD::ADDCARRY:		case ISD::ADDCARRY:
case ISD::SUBCARRY: Res = PromoteIntRes_ADDSUBCARRY(N, ResNo); break;		case ISD::SUBCARRY: Res = PromoteIntRes_ADDSUBCARRY(N, ResNo); break;

case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT: Res = PromoteIntRes_ADDSUBSAT(N); break;		case ISD::USUBSAT: Res = PromoteIntRes_ADDSUBSAT(N); break;

case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX: Res = PromoteIntRes_MULFIX(N); break;		case ISD::UMULFIX:
		case ISD::UMULFIXSAT: Res = PromoteIntRes_MULFIX(N); break;

case ISD::ABS: Res = PromoteIntRes_ABS(N); break;		case ISD::ABS: Res = PromoteIntRes_ABS(N); break;

case ISD::ATOMIC_LOAD:		case ISD::ATOMIC_LOAD:
Res = PromoteIntRes_Atomic0(cast<AtomicSDNode>(N)); break;		Res = PromoteIntRes_Atomic0(cast<AtomicSDNode>(N)); break;

case ISD::ATOMIC_LOAD_ADD:		case ISD::ATOMIC_LOAD_ADD:
case ISD::ATOMIC_LOAD_SUB:		case ISD::ATOMIC_LOAD_SUB:
case ISD::ATOMIC_LOAD_AND:		case ISD::ATOMIC_LOAD_AND:
▲ Show 20 Lines • Show All 520 Lines • ▼ Show 20 Lines
}		}

SDValue DAGTypeLegalizer::PromoteIntRes_MULFIX(SDNode *N) {		SDValue DAGTypeLegalizer::PromoteIntRes_MULFIX(SDNode *N) {
// Can just promote the operands then continue with operation.		// Can just promote the operands then continue with operation.
SDLoc dl(N);		SDLoc dl(N);
SDValue Op1Promoted, Op2Promoted;		SDValue Op1Promoted, Op2Promoted;
bool Signed =		bool Signed =
N->getOpcode() == ISD::SMULFIX \|\| N->getOpcode() == ISD::SMULFIXSAT;		N->getOpcode() == ISD::SMULFIX \|\| N->getOpcode() == ISD::SMULFIXSAT;
		bool Saturating =
		N->getOpcode() == ISD::SMULFIXSAT \|\| N->getOpcode() == ISD::UMULFIXSAT;
if (Signed) {		if (Signed) {
Op1Promoted = SExtPromotedInteger(N->getOperand(0));		Op1Promoted = SExtPromotedInteger(N->getOperand(0));
Op2Promoted = SExtPromotedInteger(N->getOperand(1));		Op2Promoted = SExtPromotedInteger(N->getOperand(1));
} else {		} else {
Op1Promoted = ZExtPromotedInteger(N->getOperand(0));		Op1Promoted = ZExtPromotedInteger(N->getOperand(0));
Op2Promoted = ZExtPromotedInteger(N->getOperand(1));		Op2Promoted = ZExtPromotedInteger(N->getOperand(1));
}		}
EVT OldType = N->getOperand(0).getValueType();		EVT OldType = N->getOperand(0).getValueType();
EVT PromotedType = Op1Promoted.getValueType();		EVT PromotedType = Op1Promoted.getValueType();
unsigned DiffSize =		unsigned DiffSize =
PromotedType.getScalarSizeInBits() - OldType.getScalarSizeInBits();		PromotedType.getScalarSizeInBits() - OldType.getScalarSizeInBits();

bool Saturating = N->getOpcode() == ISD::SMULFIXSAT;
if (Saturating) {		if (Saturating) {
// Promoting the operand and result values changes the saturation width,		// Promoting the operand and result values changes the saturation width,
// which is extends the values that we clamp to on saturation. This could be		// which is extends the values that we clamp to on saturation. This could be
// resolved by shifting one of the operands the same amount, which would		// resolved by shifting one of the operands the same amount, which would
// also shift the result we compare against, then shifting back.		// also shift the result we compare against, then shifting back.
EVT ShiftTy = TLI.getShiftAmountTy(PromotedType, DAG.getDataLayout());		EVT ShiftTy = TLI.getShiftAmountTy(PromotedType, DAG.getDataLayout());
Op1Promoted = DAG.getNode(ISD::SHL, dl, PromotedType, Op1Promoted,		Op1Promoted = DAG.getNode(ISD::SHL, dl, PromotedType, Op1Promoted,
DAG.getConstant(DiffSize, dl, ShiftTy));		DAG.getConstant(DiffSize, dl, ShiftTy));
▲ Show 20 Lines • Show All 446 Lines • ▼ Show 20 Lines	bool DAGTypeLegalizer::PromoteIntegerOperand(SDNode *N, unsigned OpNo) {

case ISD::FRAMEADDR:		case ISD::FRAMEADDR:
case ISD::RETURNADDR: Res = PromoteIntOp_FRAMERETURNADDR(N); break;		case ISD::RETURNADDR: Res = PromoteIntOp_FRAMERETURNADDR(N); break;

case ISD::PREFETCH: Res = PromoteIntOp_PREFETCH(N, OpNo); break;		case ISD::PREFETCH: Res = PromoteIntOp_PREFETCH(N, OpNo); break;

case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX: Res = PromoteIntOp_MULFIX(N); break;		case ISD::UMULFIX:
		case ISD::UMULFIXSAT: Res = PromoteIntOp_MULFIX(N); break;

case ISD::FPOWI: Res = PromoteIntOp_FPOWI(N); break;		case ISD::FPOWI: Res = PromoteIntOp_FPOWI(N); break;

case ISD::VECREDUCE_ADD:		case ISD::VECREDUCE_ADD:
case ISD::VECREDUCE_MUL:		case ISD::VECREDUCE_MUL:
case ISD::VECREDUCE_AND:		case ISD::VECREDUCE_AND:
case ISD::VECREDUCE_OR:		case ISD::VECREDUCE_OR:
case ISD::VECREDUCE_XOR:		case ISD::VECREDUCE_XOR:
▲ Show 20 Lines • Show All 558 Lines • ▼ Show 20 Lines	#endif

case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT: ExpandIntRes_ADDSUBSAT(N, Lo, Hi); break;		case ISD::USUBSAT: ExpandIntRes_ADDSUBSAT(N, Lo, Hi); break;

case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX: ExpandIntRes_MULFIX(N, Lo, Hi); break;		case ISD::UMULFIX:
		case ISD::UMULFIXSAT: ExpandIntRes_MULFIX(N, Lo, Hi); break;

case ISD::VECREDUCE_ADD:		case ISD::VECREDUCE_ADD:
case ISD::VECREDUCE_MUL:		case ISD::VECREDUCE_MUL:
case ISD::VECREDUCE_AND:		case ISD::VECREDUCE_AND:
case ISD::VECREDUCE_OR:		case ISD::VECREDUCE_OR:
case ISD::VECREDUCE_XOR:		case ISD::VECREDUCE_XOR:
case ISD::VECREDUCE_SMAX:		case ISD::VECREDUCE_SMAX:
case ISD::VECREDUCE_SMIN:		case ISD::VECREDUCE_SMIN:
▲ Show 20 Lines • Show All 1,054 Lines • ▼ Show 20 Lines
void DAGTypeLegalizer::ExpandIntRes_MULFIX(SDNode *N, SDValue &Lo,		void DAGTypeLegalizer::ExpandIntRes_MULFIX(SDNode *N, SDValue &Lo,
SDValue &Hi) {		SDValue &Hi) {
SDLoc dl(N);		SDLoc dl(N);
EVT VT = N->getValueType(0);		EVT VT = N->getValueType(0);
unsigned VTSize = VT.getScalarSizeInBits();		unsigned VTSize = VT.getScalarSizeInBits();
SDValue LHS = N->getOperand(0);		SDValue LHS = N->getOperand(0);
SDValue RHS = N->getOperand(1);		SDValue RHS = N->getOperand(1);
uint64_t Scale = N->getConstantOperandVal(2);		uint64_t Scale = N->getConstantOperandVal(2);
bool Saturating = N->getOpcode() == ISD::SMULFIXSAT;		bool Saturating = (N->getOpcode() == ISD::SMULFIXSAT \|\|
		N->getOpcode() == ISD::UMULFIXSAT);
bool Signed = (N->getOpcode() == ISD::SMULFIX \|\|		bool Signed = (N->getOpcode() == ISD::SMULFIX \|\|
N->getOpcode() == ISD::SMULFIXSAT);		N->getOpcode() == ISD::SMULFIXSAT);

// Handle special case when scale is equal to zero.		// Handle special case when scale is equal to zero.
if (!Scale) {		if (!Scale) {
SDValue Result;		SDValue Result;
if (!Saturating) {		if (!Saturating) {
Result = DAG.getNode(ISD::MUL, dl, VT, LHS, RHS);		Result = DAG.getNode(ISD::MUL, dl, VT, LHS, RHS);
} else {		} else {
EVT BoolVT = getSetCCResultType(VT);		EVT BoolVT = getSetCCResultType(VT);
Result = DAG.getNode(ISD::SMULO, dl, DAG.getVTList(VT, BoolVT), LHS, RHS);		unsigned MulOp = Signed ? ISD::SMULO : ISD::UMULO;
		Result = DAG.getNode(MulOp, dl, DAG.getVTList(VT, BoolVT), LHS, RHS);
SDValue Product = Result.getValue(0);		SDValue Product = Result.getValue(0);
SDValue Overflow = Result.getValue(1);		SDValue Overflow = Result.getValue(1);
assert(Signed && "Unsigned saturation not supported (yet).");		if (Signed) {
APInt MinVal = APInt::getSignedMinValue(VTSize);		APInt MinVal = APInt::getSignedMinValue(VTSize);
APInt MaxVal = APInt::getSignedMaxValue(VTSize);		APInt MaxVal = APInt::getSignedMaxValue(VTSize);
SDValue SatMin = DAG.getConstant(MinVal, dl, VT);		SDValue SatMin = DAG.getConstant(MinVal, dl, VT);
SDValue SatMax = DAG.getConstant(MaxVal, dl, VT);		SDValue SatMax = DAG.getConstant(MaxVal, dl, VT);
SDValue Zero = DAG.getConstant(0, dl, VT);		SDValue Zero = DAG.getConstant(0, dl, VT);
SDValue ProdNeg = DAG.getSetCC(dl, BoolVT, Product, Zero, ISD::SETLT);		SDValue ProdNeg = DAG.getSetCC(dl, BoolVT, Product, Zero, ISD::SETLT);
Result = DAG.getSelect(dl, VT, ProdNeg, SatMax, SatMin);		Result = DAG.getSelect(dl, VT, ProdNeg, SatMax, SatMin);
Result = DAG.getSelect(dl, VT, Overflow, Result, Product);		Result = DAG.getSelect(dl, VT, Overflow, Result, Product);
		} else {
		// For unsigned multiplication, we only need to check the max since we
		// can't really overflow towards zero.
		APInt MaxVal = APInt::getMaxValue(VTSize);
		SDValue SatMax = DAG.getConstant(MaxVal, dl, VT);
		Result = DAG.getSelect(dl, VT, Overflow, SatMax, Product);
		}
}		}
SplitInteger(Result, Lo, Hi);		SplitInteger(Result, Lo, Hi);
return;		return;
}		}

		// For SMULFIX[SAT] we only expect to find Scale<VTSize, but this assert will
		// cover for unhandled cases below, while still being valid for UMULFIX[SAT].
		assert(Scale <= VTSize && "Scale can't be larger than the value type size.");

EVT NVT = TLI.getTypeToTransformTo(*DAG.getContext(), VT);		EVT NVT = TLI.getTypeToTransformTo(*DAG.getContext(), VT);
SDValue LL, LH, RL, RH;		SDValue LL, LH, RL, RH;
GetExpandedInteger(LHS, LL, LH);		GetExpandedInteger(LHS, LL, LH);
GetExpandedInteger(RHS, RL, RH);		GetExpandedInteger(RHS, RL, RH);
SmallVector<SDValue, 4> Result;		SmallVector<SDValue, 4> Result;

unsigned LoHiOp = Signed ? ISD::SMUL_LOHI : ISD::UMUL_LOHI;		unsigned LoHiOp = Signed ? ISD::SMUL_LOHI : ISD::UMUL_LOHI;
if (!TLI.expandMUL_LOHI(LoHiOp, VT, dl, LHS, RHS, Result, NVT, DAG,		if (!TLI.expandMUL_LOHI(LoHiOp, VT, dl, LHS, RHS, Result, NVT, DAG,
Show All 38 Lines	if (Scale % NVTSize) {
Lo = Result[Part0];		Lo = Result[Part0];
Hi = Result[Part0 + 1];		Hi = Result[Part0 + 1];
}		}

// Unless saturation is requested we are done. The result is in <Hi,Lo>.		// Unless saturation is requested we are done. The result is in <Hi,Lo>.
if (!Saturating)		if (!Saturating)
return;		return;

		// Can not overflow when there is no integer part.
		if (Scale == VTSize)
		return;

// To handle saturation we must check for overflow in the multiplication.		// To handle saturation we must check for overflow in the multiplication.
//		//
		// Unsigned overflow happened if the upper (VTSize - Scale) bits (of Result)
		// aren't all zeroes.
		//
// Signed overflow happened if the upper (VTSize - Scale + 1) bits (of Result)		// Signed overflow happened if the upper (VTSize - Scale + 1) bits (of Result)
// aren't all ones or all zeroes.		// aren't all ones or all zeroes.
//		//
// We cannot overflow past HH when multiplying 2 ints of size VTSize, so the		// We cannot overflow past HH when multiplying 2 ints of size VTSize, so the
// highest bit of HH determines saturation direction in the event of		// highest bit of HH determines saturation direction in the event of signed
// saturation.		// saturation.

SDValue ResultHL = Result[2];		SDValue ResultHL = Result[2];
SDValue ResultHH = Result[3];		SDValue ResultHH = Result[3];

SDValue SatMax, SatMin;		SDValue SatMax, SatMin;
SDValue NVTZero = DAG.getConstant(0, dl, NVT);		SDValue NVTZero = DAG.getConstant(0, dl, NVT);
SDValue NVTNeg1 = DAG.getConstant(-1, dl, NVT);		SDValue NVTNeg1 = DAG.getConstant(-1, dl, NVT);
EVT BoolNVT = getSetCCResultType(NVT);		EVT BoolNVT = getSetCCResultType(NVT);

if (!Signed)		if (!Signed) {
llvm_unreachable("Unsigned saturation not supported (yet).");		if (Scale < NVTSize) {
		// Overflow happened if ((HH \| (HL >> Scale)) != 0).
		SDValue HLAdjusted = DAG.getNode(ISD::SRL, dl, NVT, ResultHL,
		DAG.getConstant(Scale, dl, ShiftTy));
		SDValue Tmp = DAG.getNode(ISD::OR, dl, NVT, HLAdjusted, ResultHH);
		SatMax = DAG.getSetCC(dl, BoolNVT, Tmp, NVTZero, ISD::SETNE);
		} else if (Scale == NVTSize) {
		// Overflow happened if (HH != 0).
		SatMax = DAG.getSetCC(dl, BoolNVT, ResultHH, NVTZero, ISD::SETNE);
		} else if (Scale < VTSize) {
		// Overflow happened if ((HH >> (Scale - NVTSize)) != 0).
		SDValue HLAdjusted = DAG.getNode(ISD::SRL, dl, NVT, ResultHL,
		DAG.getConstant(Scale - NVTSize, dl,
		ShiftTy));
		SatMax = DAG.getSetCC(dl, BoolNVT, HLAdjusted, NVTZero, ISD::SETNE);
		} else
		llvm_unreachable("Scale must be less or equal to VTSize for UMULFIXSAT"
		"(and saturation can't happen with Scale==VTSize).");

		Hi = DAG.getSelect(dl, NVT, SatMax, NVTNeg1, Hi);
		Lo = DAG.getSelect(dl, NVT, SatMax, NVTNeg1, Lo);
		return;
		}

if (Scale < NVTSize) {		if (Scale < NVTSize) {
// The number of overflow bits we can check are VTSize - Scale + 1 (we		// The number of overflow bits we can check are VTSize - Scale + 1 (we
// include the sign bit). If these top bits are > 0, then we overflowed past		// include the sign bit). If these top bits are > 0, then we overflowed past
// the max value. If these top bits are < -1, then we overflowed past the		// the max value. If these top bits are < -1, then we overflowed past the
// min value. Otherwise, we did not overflow.		// min value. Otherwise, we did not overflow.
unsigned OverflowBits = VTSize - Scale + 1;		unsigned OverflowBits = VTSize - Scale + 1;
assert(OverflowBits <= VTSize && OverflowBits > NVTSize &&		assert(OverflowBits <= VTSize && OverflowBits > NVTSize &&
▲ Show 20 Lines • Show All 1,280 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

Show First 20 Lines • Show All 446 Lines • ▼ Show 20 Lines	SDValue VectorLegalizer::LegalizeOp(SDValue Op) {
case ISD::SADDSAT:		case ISD::SADDSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::USUBSAT:		case ISD::USUBSAT:
Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));		Action = TLI.getOperationAction(Node->getOpcode(), Node->getValueType(0));
break;		break;
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX: {		case ISD::UMULFIX:
		case ISD::UMULFIXSAT: {
unsigned Scale = Node->getConstantOperandVal(2);		unsigned Scale = Node->getConstantOperandVal(2);
Action = TLI.getFixedPointOperationAction(Node->getOpcode(),		Action = TLI.getFixedPointOperationAction(Node->getOpcode(),
Node->getValueType(0), Scale);		Node->getValueType(0), Scale);
break;		break;
}		}
case ISD::FP_ROUND_INREG:		case ISD::FP_ROUND_INREG:
Action = TLI.getOperationAction(Node->getOpcode(),		Action = TLI.getOperationAction(Node->getOpcode(),
cast<VTSDNode>(Node->getOperand(1))->getVT());		cast<VTSDNode>(Node->getOperand(1))->getVT());
▲ Show 20 Lines • Show All 365 Lines • ▼ Show 20 Lines	SDValue VectorLegalizer::Expand(SDValue Op) {
case ISD::SSUBSAT:		case ISD::SSUBSAT:
case ISD::UADDSAT:		case ISD::UADDSAT:
case ISD::SADDSAT:		case ISD::SADDSAT:
return ExpandAddSubSat(Op);		return ExpandAddSubSat(Op);
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::UMULFIX:		case ISD::UMULFIX:
return ExpandFixedPointMul(Op);		return ExpandFixedPointMul(Op);
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
// FIXME: We do not expand SMULFIXSAT here yet, not sure why. Maybe it		case ISD::UMULFIXSAT:
// results in worse codegen compared to the default unroll? This should		// FIXME: We do not expand SMULFIXSAT/UMULFIXSAT here yet, not sure exactly
// probably be investigated. And if we still prefer to unroll an explanation		// why. Maybe it results in worse codegen compared to the unroll for some
// could be helpful, otherwise it just looks like something that hasn't been		// targets? This should probably be investigated. And if we still prefer to
// "implemented" yet.		// unroll an explanation could be helpful.
return DAG.UnrollVectorOp(Op.getNode());		return DAG.UnrollVectorOp(Op.getNode());
case ISD::STRICT_FADD:		case ISD::STRICT_FADD:
case ISD::STRICT_FSUB:		case ISD::STRICT_FSUB:
case ISD::STRICT_FMUL:		case ISD::STRICT_FMUL:
case ISD::STRICT_FDIV:		case ISD::STRICT_FDIV:
case ISD::STRICT_FREM:		case ISD::STRICT_FREM:
case ISD::STRICT_FSQRT:		case ISD::STRICT_FSQRT:
case ISD::STRICT_FMA:		case ISD::STRICT_FMA:
▲ Show 20 Lines • Show All 597 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

Show First 20 Lines • Show All 181 Lines • ▼ Show 20 Lines	#endif
case ISD::SSUBO:		case ISD::SSUBO:
case ISD::UMULO:		case ISD::UMULO:
case ISD::SMULO:		case ISD::SMULO:
R = ScalarizeVecRes_OverflowOp(N, ResNo);		R = ScalarizeVecRes_OverflowOp(N, ResNo);
break;		break;
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
		case ISD::UMULFIXSAT:
R = ScalarizeVecRes_MULFIX(N);		R = ScalarizeVecRes_MULFIX(N);
break;		break;
}		}

// If R is null, the sub-method took care of registering the result.		// If R is null, the sub-method took care of registering the result.
if (R.getNode())		if (R.getNode())
SetScalarizedVector(SDValue(N, ResNo), R);		SetScalarizedVector(SDValue(N, ResNo), R);
}		}
▲ Show 20 Lines • Show All 799 Lines • ▼ Show 20 Lines	#endif
case ISD::SSUBO:		case ISD::SSUBO:
case ISD::UMULO:		case ISD::UMULO:
case ISD::SMULO:		case ISD::SMULO:
SplitVecRes_OverflowOp(N, ResNo, Lo, Hi);		SplitVecRes_OverflowOp(N, ResNo, Lo, Hi);
break;		break;
case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
		case ISD::UMULFIXSAT:
SplitVecRes_MULFIX(N, Lo, Hi);		SplitVecRes_MULFIX(N, Lo, Hi);
break;		break;
}		}

// If Lo/Hi is null, the sub-method took care of registering results etc.		// If Lo/Hi is null, the sub-method took care of registering results etc.
if (Lo.getNode())		if (Lo.getNode())
SetSplitVector(SDValue(N, ResNo), Lo, Hi);		SetSplitVector(SDValue(N, ResNo), Lo, Hi);
}		}
▲ Show 20 Lines • Show All 1,747 Lines • ▼ Show 20 Lines	#endif
case ISD::SREM:		case ISD::SREM:
case ISD::UREM:		case ISD::UREM:
Res = WidenVecRes_BinaryCanTrap(N);		Res = WidenVecRes_BinaryCanTrap(N);
break;		break;

case ISD::SMULFIX:		case ISD::SMULFIX:
case ISD::SMULFIXSAT:		case ISD::SMULFIXSAT:
case ISD::UMULFIX:		case ISD::UMULFIX:
		case ISD::UMULFIXSAT:
// These are binary operations, but with an extra operand that shouldn't		// These are binary operations, but with an extra operand that shouldn't
// be widened (the scale).		// be widened (the scale).
Res = WidenVecRes_BinaryWithExtraScalarOp(N);		Res = WidenVecRes_BinaryWithExtraScalarOp(N);
break;		break;

case ISD::STRICT_FADD:		case ISD::STRICT_FADD:
case ISD::STRICT_FSUB:		case ISD::STRICT_FSUB:
case ISD::STRICT_FMUL:		case ISD::STRICT_FMUL:
▲ Show 20 Lines • Show All 2,340 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 6,311 Lines • ▼ Show 20 Lines	void SelectionDAGBuilder::visitIntrinsicCall(const CallInst &I,
case Intrinsic::smul_fix_sat: {		case Intrinsic::smul_fix_sat: {
SDValue Op1 = getValue(I.getArgOperand(0));		SDValue Op1 = getValue(I.getArgOperand(0));
SDValue Op2 = getValue(I.getArgOperand(1));		SDValue Op2 = getValue(I.getArgOperand(1));
SDValue Op3 = getValue(I.getArgOperand(2));		SDValue Op3 = getValue(I.getArgOperand(2));
setValue(&I, DAG.getNode(ISD::SMULFIXSAT, sdl, Op1.getValueType(), Op1, Op2,		setValue(&I, DAG.getNode(ISD::SMULFIXSAT, sdl, Op1.getValueType(), Op1, Op2,
Op3));		Op3));
return;		return;
}		}
		case Intrinsic::umul_fix_sat: {
		SDValue Op1 = getValue(I.getArgOperand(0));
		SDValue Op2 = getValue(I.getArgOperand(1));
		SDValue Op3 = getValue(I.getArgOperand(2));
		setValue(&I, DAG.getNode(ISD::UMULFIXSAT, sdl, Op1.getValueType(), Op1, Op2,
		Op3));
		return;
		}
case Intrinsic::stacksave: {		case Intrinsic::stacksave: {
SDValue Op = getRoot();		SDValue Op = getRoot();
Res = DAG.getNode(		Res = DAG.getNode(
ISD::STACKSAVE, sdl,		ISD::STACKSAVE, sdl,
DAG.getVTList(TLI.getPointerTy(DAG.getDataLayout()), MVT::Other), Op);		DAG.getVTList(TLI.getPointerTy(DAG.getDataLayout()), MVT::Other), Op);
setValue(&I, Res);		setValue(&I, Res);
DAG.setRoot(Res.getValue(1));		DAG.setRoot(Res.getValue(1));
return;		return;
▲ Show 20 Lines • Show All 4,217 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

Show First 20 Lines • Show All 299 Lines • ▼ Show 20 Lines	#endif
case ISD::SADDSAT: return "saddsat";		case ISD::SADDSAT: return "saddsat";
case ISD::UADDSAT: return "uaddsat";		case ISD::UADDSAT: return "uaddsat";
case ISD::SSUBSAT: return "ssubsat";		case ISD::SSUBSAT: return "ssubsat";
case ISD::USUBSAT: return "usubsat";		case ISD::USUBSAT: return "usubsat";

case ISD::SMULFIX: return "smulfix";		case ISD::SMULFIX: return "smulfix";
case ISD::SMULFIXSAT: return "smulfixsat";		case ISD::SMULFIXSAT: return "smulfixsat";
case ISD::UMULFIX: return "umulfix";		case ISD::UMULFIX: return "umulfix";
		case ISD::UMULFIXSAT: return "umulfixsat";

// Conversion operators.		// Conversion operators.
case ISD::SIGN_EXTEND: return "sign_extend";		case ISD::SIGN_EXTEND: return "sign_extend";
case ISD::ZERO_EXTEND: return "zero_extend";		case ISD::ZERO_EXTEND: return "zero_extend";
case ISD::ANY_EXTEND: return "any_extend";		case ISD::ANY_EXTEND: return "any_extend";
case ISD::SIGN_EXTEND_INREG: return "sign_extend_inreg";		case ISD::SIGN_EXTEND_INREG: return "sign_extend_inreg";
case ISD::ANY_EXTEND_VECTOR_INREG: return "any_extend_vector_inreg";		case ISD::ANY_EXTEND_VECTOR_INREG: return "any_extend_vector_inreg";
case ISD::SIGN_EXTEND_VECTOR_INREG: return "sign_extend_vector_inreg";		case ISD::SIGN_EXTEND_VECTOR_INREG: return "sign_extend_vector_inreg";
▲ Show 20 Lines • Show All 643 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/TargetLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 6,692 Lines • ▼ Show 20 Lines	if (Opcode == ISD::UADDSAT) {
return DAG.getSelect(dl, VT, Overflow, Result, SumDiff);		return DAG.getSelect(dl, VT, Overflow, Result, SumDiff);
}		}
}		}

SDValue		SDValue
TargetLowering::expandFixedPointMul(SDNode *Node, SelectionDAG &DAG) const {		TargetLowering::expandFixedPointMul(SDNode *Node, SelectionDAG &DAG) const {
assert((Node->getOpcode() == ISD::SMULFIX \|\|		assert((Node->getOpcode() == ISD::SMULFIX \|\|
Node->getOpcode() == ISD::UMULFIX \|\|		Node->getOpcode() == ISD::UMULFIX \|\|
Node->getOpcode() == ISD::SMULFIXSAT) &&		Node->getOpcode() == ISD::SMULFIXSAT \|\|
		Node->getOpcode() == ISD::UMULFIXSAT) &&
"Expected a fixed point multiplication opcode");		"Expected a fixed point multiplication opcode");

SDLoc dl(Node);		SDLoc dl(Node);
SDValue LHS = Node->getOperand(0);		SDValue LHS = Node->getOperand(0);
SDValue RHS = Node->getOperand(1);		SDValue RHS = Node->getOperand(1);
EVT VT = LHS.getValueType();		EVT VT = LHS.getValueType();
unsigned Scale = Node->getConstantOperandVal(2);		unsigned Scale = Node->getConstantOperandVal(2);
bool Saturating = Node->getOpcode() == ISD::SMULFIXSAT;		bool Saturating = (Node->getOpcode() == ISD::SMULFIXSAT \|\|
		Node->getOpcode() == ISD::UMULFIXSAT);
		bool Signed = (Node->getOpcode() == ISD::SMULFIX \|\|
		Node->getOpcode() == ISD::SMULFIXSAT);
EVT BoolVT = getSetCCResultType(DAG.getDataLayout(), *DAG.getContext(), VT);		EVT BoolVT = getSetCCResultType(DAG.getDataLayout(), *DAG.getContext(), VT);
unsigned VTSize = VT.getScalarSizeInBits();		unsigned VTSize = VT.getScalarSizeInBits();

if (!Scale) {		if (!Scale) {
// [us]mul.fix(a, b, 0) -> mul(a, b)		// [us]mul.fix(a, b, 0) -> mul(a, b)
if (!Saturating && isOperationLegalOrCustom(ISD::MUL, VT)) {		if (!Saturating) {
		if (isOperationLegalOrCustom(ISD::MUL, VT))
return DAG.getNode(ISD::MUL, dl, VT, LHS, RHS);		return DAG.getNode(ISD::MUL, dl, VT, LHS, RHS);
} else if (Saturating && isOperationLegalOrCustom(ISD::SMULO, VT)) {		} else if (Signed && isOperationLegalOrCustom(ISD::SMULO, VT)) {
SDValue Result =		SDValue Result =
DAG.getNode(ISD::SMULO, dl, DAG.getVTList(VT, BoolVT), LHS, RHS);		DAG.getNode(ISD::SMULO, dl, DAG.getVTList(VT, BoolVT), LHS, RHS);
SDValue Product = Result.getValue(0);		SDValue Product = Result.getValue(0);
SDValue Overflow = Result.getValue(1);		SDValue Overflow = Result.getValue(1);
SDValue Zero = DAG.getConstant(0, dl, VT);		SDValue Zero = DAG.getConstant(0, dl, VT);

APInt MinVal = APInt::getSignedMinValue(VTSize);		APInt MinVal = APInt::getSignedMinValue(VTSize);
APInt MaxVal = APInt::getSignedMaxValue(VTSize);		APInt MaxVal = APInt::getSignedMaxValue(VTSize);
SDValue SatMin = DAG.getConstant(MinVal, dl, VT);		SDValue SatMin = DAG.getConstant(MinVal, dl, VT);
SDValue SatMax = DAG.getConstant(MaxVal, dl, VT);		SDValue SatMax = DAG.getConstant(MaxVal, dl, VT);
SDValue ProdNeg = DAG.getSetCC(dl, BoolVT, Product, Zero, ISD::SETLT);		SDValue ProdNeg = DAG.getSetCC(dl, BoolVT, Product, Zero, ISD::SETLT);
Result = DAG.getSelect(dl, VT, ProdNeg, SatMax, SatMin);		Result = DAG.getSelect(dl, VT, ProdNeg, SatMax, SatMin);
return DAG.getSelect(dl, VT, Overflow, Result, Product);		return DAG.getSelect(dl, VT, Overflow, Result, Product);
		} else if (!Signed && isOperationLegalOrCustom(ISD::UMULO, VT)) {
		SDValue Result =
		DAG.getNode(ISD::UMULO, dl, DAG.getVTList(VT, BoolVT), LHS, RHS);
		SDValue Product = Result.getValue(0);
		SDValue Overflow = Result.getValue(1);

		APInt MaxVal = APInt::getMaxValue(VTSize);
		SDValue SatMax = DAG.getConstant(MaxVal, dl, VT);
		return DAG.getSelect(dl, VT, Overflow, SatMax, Product);
}		}
}		}

bool Signed =
Node->getOpcode() == ISD::SMULFIX \|\| Node->getOpcode() == ISD::SMULFIXSAT;
assert(((Signed && Scale < VTSize) \|\| (!Signed && Scale <= VTSize)) &&		assert(((Signed && Scale < VTSize) \|\| (!Signed && Scale <= VTSize)) &&
"Expected scale to be less than the number of bits if signed or at "		"Expected scale to be less than the number of bits if signed or at "
"most the number of bits if unsigned.");		"most the number of bits if unsigned.");
assert(LHS.getValueType() == RHS.getValueType() &&		assert(LHS.getValueType() == RHS.getValueType() &&
"Expected both operands to be the same type");		"Expected both operands to be the same type");

// Get the upper and lower bits of the result.		// Get the upper and lower bits of the result.
SDValue Lo, Hi;		SDValue Lo, Hi;
Show All 9 Lines	TargetLowering::expandFixedPointMul(SDNode *Node, SelectionDAG &DAG) const {
} else if (VT.isVector()) {		} else if (VT.isVector()) {
return SDValue();		return SDValue();
} else {		} else {
report_fatal_error("Unable to expand fixed point multiplication.");		report_fatal_error("Unable to expand fixed point multiplication.");
}		}

if (Scale == VTSize)		if (Scale == VTSize)
// Result is just the top half since we'd be shifting by the width of the		// Result is just the top half since we'd be shifting by the width of the
// operand.		// operand. Overflow impossible so this works for both UMULFIX and
		// UMULFIXSAT.
return Hi;		return Hi;

// The result will need to be shifted right by the scale since both operands		// The result will need to be shifted right by the scale since both operands
// are scaled. The result is given to us in 2 halves, so we only want part of		// are scaled. The result is given to us in 2 halves, so we only want part of
// both in the result.		// both in the result.
EVT ShiftTy = getShiftAmountTy(VT, DAG.getDataLayout());		EVT ShiftTy = getShiftAmountTy(VT, DAG.getDataLayout());
SDValue Result = DAG.getNode(ISD::FSHR, dl, VT, Hi, Lo,		SDValue Result = DAG.getNode(ISD::FSHR, dl, VT, Hi, Lo,
DAG.getConstant(Scale, dl, ShiftTy));		DAG.getConstant(Scale, dl, ShiftTy));
if (!Saturating)		if (!Saturating)
return Result;		return Result;

unsigned OverflowBits = VTSize - Scale + 1; // +1 for the sign		if (!Signed) {
SDValue HiMask =		// Unsigned overflow happened if the upper (VTSize - Scale) bits (of the
DAG.getConstant(APInt::getHighBitsSet(VTSize, OverflowBits), dl, VT);		// widened multiplication) aren't all zeroes.
SDValue LoMask = DAG.getConstant(
APInt::getLowBitsSet(VTSize, VTSize - OverflowBits), dl, VT);		// Saturate to max if ((Hi >> Scale) != 0),
APInt MaxVal = APInt::getSignedMaxValue(VTSize);		// which is the same as if (Hi > ((1 << Scale) - 1))
APInt MinVal = APInt::getSignedMinValue(VTSize);		APInt MaxVal = APInt::getMaxValue(VTSize);
		SDValue LowMask = DAG.getConstant(APInt::getLowBitsSet(VTSize, Scale),
		dl, VT);
		Result = DAG.getSelectCC(dl, Hi, LowMask,
		DAG.getConstant(MaxVal, dl, VT), Result,
		ISD::SETUGT);

Result = DAG.getSelectCC(dl, Hi, LoMask,		return Result;
		}

		// Signed overflow happened if the upper (VTSize - Scale + 1) bits (of the
		// widened multiplication) aren't all ones or all zeroes. We handled Scale==0
		// above so all the bits to examine is in Hi.

		// Saturate to max if ((Hi >> (Scale - 1)) > 0),
		// which is the same as if (Hi > (1 << (Scale - 1)) - 1)
		APInt MaxVal = APInt::getSignedMaxValue(VTSize);
		SDValue LowMask = DAG.getConstant(APInt::getLowBitsSet(VTSize, Scale - 1),
		dl, VT);
		Result = DAG.getSelectCC(dl, Hi, LowMask,
DAG.getConstant(MaxVal, dl, VT), Result,		DAG.getConstant(MaxVal, dl, VT), Result,
ISD::SETGT);		ISD::SETGT);
return DAG.getSelectCC(dl, Hi, HiMask,		// Saturate to min if (Hi >> (Scale - 1)) < -1),
		// which is the same as if (HI < (-1 << (Scale - 1))
		APInt MinVal = APInt::getSignedMinValue(VTSize);
		SDValue HighMask =
		DAG.getConstant(APInt::getHighBitsSet(VTSize, VTSize - Scale + 1),
		dl, VT);
		Result = DAG.getSelectCC(dl, Hi, HighMask,
DAG.getConstant(MinVal, dl, VT), Result,		DAG.getConstant(MinVal, dl, VT), Result,
ISD::SETLT);		ISD::SETLT);
		return Result;
}		}

void TargetLowering::expandUADDSUBO(		void TargetLowering::expandUADDSUBO(
SDNode *Node, SDValue &Result, SDValue &Overflow, SelectionDAG &DAG) const {		SDNode *Node, SDValue &Result, SDValue &Overflow, SelectionDAG &DAG) const {
SDLoc dl(Node);		SDLoc dl(Node);
SDValue LHS = Node->getOperand(0);		SDValue LHS = Node->getOperand(0);
SDValue RHS = Node->getOperand(1);		SDValue RHS = Node->getOperand(1);
bool IsAdd = Node->getOpcode() == ISD::UADDO;		bool IsAdd = Node->getOpcode() == ISD::UADDO;
▲ Show 20 Lines • Show All 269 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/TargetLoweringBase.cpp

Show First 20 Lines • Show All 646 Lines • ▼ Show 20 Lines	for (MVT VT : MVT::all_valuetypes()) {
setOperationAction(ISD::FSHR, VT, Expand);		setOperationAction(ISD::FSHR, VT, Expand);
setOperationAction(ISD::SADDSAT, VT, Expand);		setOperationAction(ISD::SADDSAT, VT, Expand);
setOperationAction(ISD::UADDSAT, VT, Expand);		setOperationAction(ISD::UADDSAT, VT, Expand);
setOperationAction(ISD::SSUBSAT, VT, Expand);		setOperationAction(ISD::SSUBSAT, VT, Expand);
setOperationAction(ISD::USUBSAT, VT, Expand);		setOperationAction(ISD::USUBSAT, VT, Expand);
setOperationAction(ISD::SMULFIX, VT, Expand);		setOperationAction(ISD::SMULFIX, VT, Expand);
setOperationAction(ISD::SMULFIXSAT, VT, Expand);		setOperationAction(ISD::SMULFIXSAT, VT, Expand);
setOperationAction(ISD::UMULFIX, VT, Expand);		setOperationAction(ISD::UMULFIX, VT, Expand);
		setOperationAction(ISD::UMULFIXSAT, VT, Expand);

// Overflow operations default to expand		// Overflow operations default to expand
setOperationAction(ISD::SADDO, VT, Expand);		setOperationAction(ISD::SADDO, VT, Expand);
setOperationAction(ISD::SSUBO, VT, Expand);		setOperationAction(ISD::SSUBO, VT, Expand);
setOperationAction(ISD::UADDO, VT, Expand);		setOperationAction(ISD::UADDO, VT, Expand);
setOperationAction(ISD::USUBO, VT, Expand);		setOperationAction(ISD::USUBO, VT, Expand);
setOperationAction(ISD::SMULO, VT, Expand);		setOperationAction(ISD::SMULO, VT, Expand);
setOperationAction(ISD::UMULO, VT, Expand);		setOperationAction(ISD::UMULO, VT, Expand);
▲ Show 20 Lines • Show All 1,314 Lines • Show Last 20 Lines

llvm/trunk/lib/IR/Verifier.cpp

Show First 20 Lines • Show All 4,665 Lines • ▼ Show 20 Lines	Assert(Op1->getType()->isIntOrIntVectorTy(),
"of ints");		"of ints");
Assert(Op2->getType()->isIntOrIntVectorTy(),		Assert(Op2->getType()->isIntOrIntVectorTy(),
"second operand of [us][add\|sub]_sat must be an int type or vector "		"second operand of [us][add\|sub]_sat must be an int type or vector "
"of ints");		"of ints");
break;		break;
}		}
case Intrinsic::smul_fix:		case Intrinsic::smul_fix:
case Intrinsic::smul_fix_sat:		case Intrinsic::smul_fix_sat:
case Intrinsic::umul_fix: {		case Intrinsic::umul_fix:
		case Intrinsic::umul_fix_sat: {
Value *Op1 = Call.getArgOperand(0);		Value *Op1 = Call.getArgOperand(0);
Value *Op2 = Call.getArgOperand(1);		Value *Op2 = Call.getArgOperand(1);
Assert(Op1->getType()->isIntOrIntVectorTy(),		Assert(Op1->getType()->isIntOrIntVectorTy(),
"first operand of [us]mul_fix[_sat] must be an int type or vector "		"first operand of [us]mul_fix[_sat] must be an int type or vector "
"of ints");		"of ints");
Assert(Op2->getType()->isIntOrIntVectorTy(),		Assert(Op2->getType()->isIntOrIntVectorTy(),
"second operand of [us]mul_fix_[sat] must be an int type or vector "		"second operand of [us]mul_fix_[sat] must be an int type or vector "
"of ints");		"of ints");
▲ Show 20 Lines • Show All 829 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/PowerPC/umulfixsat.ll

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=ppc32 \| FileCheck %s

				declare i32 @llvm.umul.fix.sat.i32(i32, i32, i32)

				define i32 @func1(i32 %x, i32 %y) nounwind {
				; CHECK-LABEL: func1:
				; CHECK: # %bb.0:
				; CHECK-NEXT: li 5, -1
				; CHECK-NEXT: mulhwu. 6, 3, 4
				; CHECK-NEXT: mullw 3, 3, 4
				; CHECK-NEXT: bclr 12, 2, 0
				; CHECK-NEXT: # %bb.1:
				; CHECK-NEXT: ori 3, 5, 0
				; CHECK-NEXT: blr
				%tmp = call i32 @llvm.umul.fix.sat.i32(i32 %x, i32 %y, i32 0)
				ret i32 %tmp
				}

				define i32 @func2(i32 %x, i32 %y) nounwind {
				; CHECK-LABEL: func2:
				; CHECK: # %bb.0:
				; CHECK-NEXT: mulhwu 6, 3, 4
				; CHECK-NEXT: li 5, -1
				; CHECK-NEXT: cmplwi 6, 1
				; CHECK-NEXT: mullw 3, 3, 4
				; CHECK-NEXT: rotlwi 3, 3, 31
				; CHECK-NEXT: rlwimi 3, 6, 31, 0, 0
				; CHECK-NEXT: bc 12, 1, .LBB1_1
				; CHECK-NEXT: blr
				; CHECK-NEXT: .LBB1_1:
				; CHECK-NEXT: addi 3, 5, 0
				; CHECK-NEXT: blr
				%tmp = call i32 @llvm.umul.fix.sat.i32(i32 %x, i32 %y, i32 1)
				ret i32 %tmp
				}

llvm/trunk/test/CodeGen/X86/mulfix_combine.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=x86_64-linux -o - \| FileCheck %s			; RUN: llc < %s -mtriple=x86_64-linux -o - \| FileCheck %s

	declare i32 @llvm.smul.fix.i32(i32, i32, i32 immarg)			declare i32 @llvm.smul.fix.i32(i32, i32, i32 immarg)
	declare i32 @llvm.umul.fix.i32(i32, i32, i32 immarg)			declare i32 @llvm.umul.fix.i32(i32, i32, i32 immarg)
	declare i32 @llvm.smul.fix.sat.i32(i32, i32, i32 immarg)			declare i32 @llvm.smul.fix.sat.i32(i32, i32, i32 immarg)
				declare i32 @llvm.umul.fix.sat.i32(i32, i32, i32 immarg)

	declare <4 x i32> @llvm.smul.fix.v4i32(<4 x i32>, <4 x i32>, i32 immarg)			declare <4 x i32> @llvm.smul.fix.v4i32(<4 x i32>, <4 x i32>, i32 immarg)
	declare <4 x i32> @llvm.umul.fix.v4i32(<4 x i32>, <4 x i32>, i32 immarg)			declare <4 x i32> @llvm.umul.fix.v4i32(<4 x i32>, <4 x i32>, i32 immarg)
	declare <4 x i32> @llvm.smul.fix.sat.v4i32(<4 x i32>, <4 x i32>, i32 immarg)			declare <4 x i32> @llvm.smul.fix.sat.v4i32(<4 x i32>, <4 x i32>, i32 immarg)
				declare <4 x i32> @llvm.umul.fix.sat.v4i32(<4 x i32>, <4 x i32>, i32 immarg)

	define i32 @smulfix_undef(i32 %y) nounwind {			define i32 @smulfix_undef(i32 %y) nounwind {
	; CHECK-LABEL: smulfix_undef:			; CHECK-LABEL: smulfix_undef:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: xorl %eax, %eax			; CHECK-NEXT: xorl %eax, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%tmp = call i32 @llvm.smul.fix.i32(i32 undef, i32 %y, i32 2)			%tmp = call i32 @llvm.smul.fix.i32(i32 undef, i32 %y, i32 2)
	ret i32 %tmp			ret i32 %tmp
	Show All 39 Lines
	; CHECK-LABEL: smulfixsat_zero:			; CHECK-LABEL: smulfixsat_zero:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: xorl %eax, %eax			; CHECK-NEXT: xorl %eax, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%tmp = call i32 @llvm.smul.fix.sat.i32(i32 0, i32 %y, i32 2)			%tmp = call i32 @llvm.smul.fix.sat.i32(i32 0, i32 %y, i32 2)
	ret i32 %tmp			ret i32 %tmp
	}			}

				define i32 @umulfixsat_undef(i32 %y) nounwind {
				; CHECK-LABEL: umulfixsat_undef:
				; CHECK: # %bb.0:
				; CHECK-NEXT: xorl %eax, %eax
				; CHECK-NEXT: retq
				%tmp = call i32 @llvm.umul.fix.sat.i32(i32 undef, i32 %y, i32 2)
				ret i32 %tmp
				}

				define i32 @umulfixsat_zero(i32 %y) nounwind {
				; CHECK-LABEL: umulfixsat_zero:
				; CHECK: # %bb.0:
				; CHECK-NEXT: xorl %eax, %eax
				; CHECK-NEXT: retq
				%tmp = call i32 @llvm.umul.fix.sat.i32(i32 0, i32 %y, i32 2)
				ret i32 %tmp
				}

	define <4 x i32> @vec_smulfix_undef(<4 x i32> %y) nounwind {			define <4 x i32> @vec_smulfix_undef(<4 x i32> %y) nounwind {
	; CHECK-LABEL: vec_smulfix_undef:			; CHECK-LABEL: vec_smulfix_undef:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: xorps %xmm0, %xmm0			; CHECK-NEXT: xorps %xmm0, %xmm0
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%tmp = call <4 x i32> @llvm.smul.fix.v4i32(<4 x i32> undef, <4 x i32> %y, i32 2)			%tmp = call <4 x i32> @llvm.smul.fix.v4i32(<4 x i32> undef, <4 x i32> %y, i32 2)
	ret <4 x i32> %tmp			ret <4 x i32> %tmp
	}			}
	Show All 37 Lines
	define <4 x i32> @vec_smulfixsat_zero(<4 x i32> %y) nounwind {			define <4 x i32> @vec_smulfixsat_zero(<4 x i32> %y) nounwind {
	; CHECK-LABEL: vec_smulfixsat_zero:			; CHECK-LABEL: vec_smulfixsat_zero:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: xorps %xmm0, %xmm0			; CHECK-NEXT: xorps %xmm0, %xmm0
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%tmp = call <4 x i32> @llvm.smul.fix.sat.v4i32(<4 x i32> <i32 0, i32 0, i32 0, i32 0>, <4 x i32> %y, i32 2)			%tmp = call <4 x i32> @llvm.smul.fix.sat.v4i32(<4 x i32> <i32 0, i32 0, i32 0, i32 0>, <4 x i32> %y, i32 2)
	ret <4 x i32> %tmp			ret <4 x i32> %tmp
	}			}

				define <4 x i32> @vec_umulfixsat_undef(<4 x i32> %y) nounwind {
				; CHECK-LABEL: vec_umulfixsat_undef:
				; CHECK: # %bb.0:
				; CHECK-NEXT: xorps %xmm0, %xmm0
				; CHECK-NEXT: retq
				%tmp = call <4 x i32> @llvm.umul.fix.sat.v4i32(<4 x i32> undef, <4 x i32> %y, i32 2)
				ret <4 x i32> %tmp
				}

				define <4 x i32> @vec_umulfixsat_zero(<4 x i32> %y) nounwind {
				; CHECK-LABEL: vec_umulfixsat_zero:
				; CHECK: # %bb.0:
				; CHECK-NEXT: xorps %xmm0, %xmm0
				; CHECK-NEXT: retq
				%tmp = call <4 x i32> @llvm.umul.fix.sat.v4i32(<4 x i32> <i32 0, i32 0, i32 0, i32 0>, <4 x i32> %y, i32 2)
				ret <4 x i32> %tmp
				}

llvm/trunk/test/CodeGen/X86/umul_fix_sat.ll

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=x86_64-linux \| FileCheck %s --check-prefix=X64
				; RUN: llc < %s -mtriple=i686 -mattr=cmov \| FileCheck %s --check-prefix=X86

				declare i4 @llvm.umul.fix.sat.i4 (i4, i4, i32)
				declare i32 @llvm.umul.fix.sat.i32 (i32, i32, i32)
				declare i64 @llvm.umul.fix.sat.i64 (i64, i64, i32)
				declare <4 x i32> @llvm.umul.fix.sat.v4i32(<4 x i32>, <4 x i32>, i32)

				define i32 @func(i32 %x, i32 %y) nounwind {
				; X64-LABEL: func:
				; X64: # %bb.0:
				; X64-NEXT: movl %esi, %eax
				; X64-NEXT: movl %edi, %ecx
				; X64-NEXT: imulq %rax, %rcx
				; X64-NEXT: movq %rcx, %rax
				; X64-NEXT: shrq $32, %rax
				; X64-NEXT: shrdl $2, %eax, %ecx
				; X64-NEXT: cmpl $3, %eax
				; X64-NEXT: movl $-1, %eax
				; X64-NEXT: cmovbel %ecx, %eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func:
				; X86: # %bb.0:
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: shrdl $2, %edx, %eax
				; X86-NEXT: cmpl $3, %edx
				; X86-NEXT: movl $-1, %ecx
				; X86-NEXT: cmoval %ecx, %eax
				; X86-NEXT: retl
				%tmp = call i32 @llvm.umul.fix.sat.i32(i32 %x, i32 %y, i32 2)
				ret i32 %tmp
				}

				define i64 @func2(i64 %x, i64 %y) nounwind {
				; X64-LABEL: func2:
				; X64: # %bb.0:
				; X64-NEXT: movq %rdi, %rax
				; X64-NEXT: mulq %rsi
				; X64-NEXT: shrdq $2, %rdx, %rax
				; X64-NEXT: cmpq $3, %rdx
				; X64-NEXT: movq $-1, %rcx
				; X64-NEXT: cmovaq %rcx, %rax
				; X64-NEXT: retq
				;
				; X86-LABEL: func2:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: pushl %ebx
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %esi
				; X86-NEXT: movl %ecx, %eax
				; X86-NEXT: mull %esi
				; X86-NEXT: movl %edx, %edi
				; X86-NEXT: movl %eax, %ebx
				; X86-NEXT: movl %ecx, %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl %eax, %ecx
				; X86-NEXT: movl %edx, %ebp
				; X86-NEXT: addl %ebx, %ebp
				; X86-NEXT: adcl $0, %edi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull %esi
				; X86-NEXT: movl %edx, %ebx
				; X86-NEXT: movl %eax, %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: addl %ebp, %eax
				; X86-NEXT: adcl %edi, %edx
				; X86-NEXT: adcl $0, %ebx
				; X86-NEXT: addl %esi, %edx
				; X86-NEXT: adcl $0, %ebx
				; X86-NEXT: shrdl $2, %eax, %ecx
				; X86-NEXT: shrdl $2, %edx, %eax
				; X86-NEXT: shrl $2, %edx
				; X86-NEXT: orl %ebx, %edx
				; X86-NEXT: movl $-1, %edx
				; X86-NEXT: cmovnel %edx, %ecx
				; X86-NEXT: cmovel %eax, %edx
				; X86-NEXT: movl %ecx, %eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: popl %ebx
				; X86-NEXT: popl %ebp
				; X86-NEXT: retl
				%tmp = call i64 @llvm.umul.fix.sat.i64(i64 %x, i64 %y, i32 2)
				ret i64 %tmp
				}

				define i4 @func3(i4 %x, i4 %y) nounwind {
				; X64-LABEL: func3:
				; X64: # %bb.0:
				; X64-NEXT: andl $15, %esi
				; X64-NEXT: shlb $4, %dil
				; X64-NEXT: movzbl %dil, %eax
				; X64-NEXT: imull %esi, %eax
				; X64-NEXT: movl %eax, %ecx
				; X64-NEXT: shrb $2, %cl
				; X64-NEXT: shrl $8, %eax
				; X64-NEXT: movl %eax, %edx
				; X64-NEXT: shlb $6, %dl
				; X64-NEXT: orb %cl, %dl
				; X64-NEXT: movzbl %dl, %ecx
				; X64-NEXT: cmpb $3, %al
				; X64-NEXT: movl $255, %eax
				; X64-NEXT: cmovbel %ecx, %eax
				; X64-NEXT: shrb $4, %al
				; X64-NEXT: # kill: def $al killed $al killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func3:
				; X86: # %bb.0:
				; X86-NEXT: movb {{[0-9]+}}(%esp), %al
				; X86-NEXT: andb $15, %al
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: movzbl %al, %edx
				; X86-NEXT: shlb $4, %cl
				; X86-NEXT: movzbl %cl, %eax
				; X86-NEXT: imull %edx, %eax
				; X86-NEXT: movb %ah, %cl
				; X86-NEXT: shlb $6, %cl
				; X86-NEXT: shrb $2, %al
				; X86-NEXT: orb %cl, %al
				; X86-NEXT: movzbl %al, %ecx
				; X86-NEXT: cmpb $3, %ah
				; X86-NEXT: movl $255, %eax
				; X86-NEXT: cmovbel %ecx, %eax
				; X86-NEXT: shrb $4, %al
				; X86-NEXT: # kill: def $al killed $al killed $eax
				; X86-NEXT: retl
				%tmp = call i4 @llvm.umul.fix.sat.i4(i4 %x, i4 %y, i32 2)
				ret i4 %tmp
				}

				define <4 x i32> @vec(<4 x i32> %x, <4 x i32> %y) nounwind {
				; X64-LABEL: vec:
				; X64: # %bb.0:
				; X64-NEXT: pshufd {{.*#+}} xmm2 = xmm1[3,1,2,3]
				; X64-NEXT: movd %xmm2, %eax
				; X64-NEXT: pshufd {{.*#+}} xmm2 = xmm0[3,1,2,3]
				; X64-NEXT: movd %xmm2, %ecx
				; X64-NEXT: imulq %rax, %rcx
				; X64-NEXT: movq %rcx, %rax
				; X64-NEXT: shrq $32, %rax
				; X64-NEXT: shrdl $2, %eax, %ecx
				; X64-NEXT: cmpl $3, %eax
				; X64-NEXT: movl $-1, %eax
				; X64-NEXT: cmoval %eax, %ecx
				; X64-NEXT: movd %ecx, %xmm2
				; X64-NEXT: pshufd {{.*#+}} xmm3 = xmm1[2,3,0,1]
				; X64-NEXT: movd %xmm3, %ecx
				; X64-NEXT: pshufd {{.*#+}} xmm3 = xmm0[2,3,0,1]
				; X64-NEXT: movd %xmm3, %edx
				; X64-NEXT: imulq %rcx, %rdx
				; X64-NEXT: movq %rdx, %rcx
				; X64-NEXT: shrq $32, %rcx
				; X64-NEXT: shrdl $2, %ecx, %edx
				; X64-NEXT: cmpl $3, %ecx
				; X64-NEXT: cmoval %eax, %edx
				; X64-NEXT: movd %edx, %xmm3
				; X64-NEXT: punpckldq {{.*#+}} xmm3 = xmm3[0],xmm2[0],xmm3[1],xmm2[1]
				; X64-NEXT: movd %xmm1, %ecx
				; X64-NEXT: movd %xmm0, %edx
				; X64-NEXT: imulq %rcx, %rdx
				; X64-NEXT: movq %rdx, %rcx
				; X64-NEXT: shrq $32, %rcx
				; X64-NEXT: shrdl $2, %ecx, %edx
				; X64-NEXT: cmpl $3, %ecx
				; X64-NEXT: cmoval %eax, %edx
				; X64-NEXT: movd %edx, %xmm2
				; X64-NEXT: pshufd {{.*#+}} xmm1 = xmm1[1,1,2,3]
				; X64-NEXT: movd %xmm1, %ecx
				; X64-NEXT: pshufd {{.*#+}} xmm0 = xmm0[1,1,2,3]
				; X64-NEXT: movd %xmm0, %edx
				; X64-NEXT: imulq %rcx, %rdx
				; X64-NEXT: movq %rdx, %rcx
				; X64-NEXT: shrq $32, %rcx
				; X64-NEXT: shrdl $2, %ecx, %edx
				; X64-NEXT: cmpl $3, %ecx
				; X64-NEXT: cmoval %eax, %edx
				; X64-NEXT: movd %edx, %xmm0
				; X64-NEXT: punpckldq {{.*#+}} xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1]
				; X64-NEXT: punpcklqdq {{.*#+}} xmm2 = xmm2[0],xmm3[0]
				; X64-NEXT: movdqa %xmm2, %xmm0
				; X64-NEXT: retq
				;
				; X86-LABEL: vec:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: pushl %ebx
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ebx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ebp
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl %eax, %esi
				; X86-NEXT: shrdl $2, %edx, %esi
				; X86-NEXT: cmpl $3, %edx
				; X86-NEXT: movl $-1, %ecx
				; X86-NEXT: cmoval %ecx, %esi
				; X86-NEXT: movl %ebp, %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl %eax, %ebp
				; X86-NEXT: shrdl $2, %edx, %ebp
				; X86-NEXT: cmpl $3, %edx
				; X86-NEXT: cmoval %ecx, %ebp
				; X86-NEXT: movl %ebx, %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl %eax, %ebx
				; X86-NEXT: shrdl $2, %edx, %ebx
				; X86-NEXT: cmpl $3, %edx
				; X86-NEXT: cmoval %ecx, %ebx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: shrdl $2, %edx, %eax
				; X86-NEXT: cmpl $3, %edx
				; X86-NEXT: cmoval %ecx, %eax
				; X86-NEXT: movl %eax, 12(%edi)
				; X86-NEXT: movl %ebx, 8(%edi)
				; X86-NEXT: movl %ebp, 4(%edi)
				; X86-NEXT: movl %esi, (%edi)
				; X86-NEXT: movl %edi, %eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: popl %ebx
				; X86-NEXT: popl %ebp
				; X86-NEXT: retl $4
				%tmp = call <4 x i32> @llvm.umul.fix.sat.v4i32(<4 x i32> %x, <4 x i32> %y, i32 2)
				ret <4 x i32> %tmp
				}

				; These result in regular integer multiplication
				define i32 @func4(i32 %x, i32 %y) nounwind {
				; X64-LABEL: func4:
				; X64: # %bb.0:
				; X64-NEXT: movl %edi, %eax
				; X64-NEXT: mull %esi
				; X64-NEXT: movl $-1, %ecx
				; X64-NEXT: cmovol %ecx, %eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func4:
				; X86: # %bb.0:
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl $-1, %ecx
				; X86-NEXT: cmovol %ecx, %eax
				; X86-NEXT: retl
				%tmp = call i32 @llvm.umul.fix.sat.i32(i32 %x, i32 %y, i32 0)
				ret i32 %tmp
				}

				define i64 @func5(i64 %x, i64 %y) {
				; X64-LABEL: func5:
				; X64: # %bb.0:
				; X64-NEXT: movq %rdi, %rax
				; X64-NEXT: mulq %rsi
				; X64-NEXT: movq $-1, %rcx
				; X64-NEXT: cmovoq %rcx, %rax
				; X64-NEXT: retq
				;
				; X86-LABEL: func5:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: .cfi_def_cfa_offset 8
				; X86-NEXT: pushl %ebx
				; X86-NEXT: .cfi_def_cfa_offset 12
				; X86-NEXT: pushl %edi
				; X86-NEXT: .cfi_def_cfa_offset 16
				; X86-NEXT: pushl %esi
				; X86-NEXT: .cfi_def_cfa_offset 20
				; X86-NEXT: .cfi_offset %esi, -20
				; X86-NEXT: .cfi_offset %edi, -16
				; X86-NEXT: .cfi_offset %ebx, -12
				; X86-NEXT: .cfi_offset %ebp, -8
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ebp
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: movl {{[0-9]+}}(%esp), %esi
				; X86-NEXT: testl %esi, %esi
				; X86-NEXT: setne %dl
				; X86-NEXT: testl %eax, %eax
				; X86-NEXT: setne %cl
				; X86-NEXT: andb %dl, %cl
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl %eax, %edi
				; X86-NEXT: seto %bl
				; X86-NEXT: movl %esi, %eax
				; X86-NEXT: mull %ebp
				; X86-NEXT: movl %eax, %esi
				; X86-NEXT: seto %ch
				; X86-NEXT: orb %bl, %ch
				; X86-NEXT: addl %edi, %esi
				; X86-NEXT: movl %ebp, %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: addl %esi, %edx
				; X86-NEXT: setb %bl
				; X86-NEXT: orb %ch, %bl
				; X86-NEXT: orb %cl, %bl
				; X86-NEXT: movl $-1, %ecx
				; X86-NEXT: cmovnel %ecx, %eax
				; X86-NEXT: cmovnel %ecx, %edx
				; X86-NEXT: popl %esi
				; X86-NEXT: .cfi_def_cfa_offset 16
				; X86-NEXT: popl %edi
				; X86-NEXT: .cfi_def_cfa_offset 12
				; X86-NEXT: popl %ebx
				; X86-NEXT: .cfi_def_cfa_offset 8
				; X86-NEXT: popl %ebp
				; X86-NEXT: .cfi_def_cfa_offset 4
				; X86-NEXT: retl
				%tmp = call i64 @llvm.umul.fix.sat.i64(i64 %x, i64 %y, i32 0)
				ret i64 %tmp
				}

				define i4 @func6(i4 %x, i4 %y) nounwind {
				; X64-LABEL: func6:
				; X64: # %bb.0:
				; X64-NEXT: movl %edi, %eax
				; X64-NEXT: andb $15, %sil
				; X64-NEXT: shlb $4, %al
				; X64-NEXT: # kill: def $al killed $al killed $eax
				; X64-NEXT: mulb %sil
				; X64-NEXT: movzbl %al, %ecx
				; X64-NEXT: movl $255, %eax
				; X64-NEXT: cmovnol %ecx, %eax
				; X64-NEXT: shrb $4, %al
				; X64-NEXT: # kill: def $al killed $al killed $eax
				; X64-NEXT: retq
				;
				; X86-LABEL: func6:
				; X86: # %bb.0:
				; X86-NEXT: movb {{[0-9]+}}(%esp), %cl
				; X86-NEXT: andb $15, %cl
				; X86-NEXT: movb {{[0-9]+}}(%esp), %al
				; X86-NEXT: shlb $4, %al
				; X86-NEXT: mulb %cl
				; X86-NEXT: movzbl %al, %ecx
				; X86-NEXT: movl $255, %eax
				; X86-NEXT: cmovnol %ecx, %eax
				; X86-NEXT: shrb $4, %al
				; X86-NEXT: # kill: def $al killed $al killed $eax
				; X86-NEXT: retl
				%tmp = call i4 @llvm.umul.fix.sat.i4(i4 %x, i4 %y, i32 0)
				ret i4 %tmp
				}

				define <4 x i32> @vec2(<4 x i32> %x, <4 x i32> %y) nounwind {
				; X64-LABEL: vec2:
				; X64: # %bb.0:
				; X64-NEXT: pshufd {{.*#+}} xmm2 = xmm0[3,1,2,3]
				; X64-NEXT: movd %xmm2, %eax
				; X64-NEXT: pshufd {{.*#+}} xmm2 = xmm1[3,1,2,3]
				; X64-NEXT: movd %xmm2, %ecx
				; X64-NEXT: mull %ecx
				; X64-NEXT: movl $-1, %ecx
				; X64-NEXT: cmovol %ecx, %eax
				; X64-NEXT: movd %eax, %xmm2
				; X64-NEXT: pshufd {{.*#+}} xmm3 = xmm0[2,3,0,1]
				; X64-NEXT: movd %xmm3, %eax
				; X64-NEXT: pshufd {{.*#+}} xmm3 = xmm1[2,3,0,1]
				; X64-NEXT: movd %xmm3, %edx
				; X64-NEXT: mull %edx
				; X64-NEXT: cmovol %ecx, %eax
				; X64-NEXT: movd %eax, %xmm3
				; X64-NEXT: punpckldq {{.*#+}} xmm3 = xmm3[0],xmm2[0],xmm3[1],xmm2[1]
				; X64-NEXT: movd %xmm0, %eax
				; X64-NEXT: movd %xmm1, %edx
				; X64-NEXT: mull %edx
				; X64-NEXT: cmovol %ecx, %eax
				; X64-NEXT: movd %eax, %xmm2
				; X64-NEXT: pshufd {{.*#+}} xmm0 = xmm0[1,1,2,3]
				; X64-NEXT: movd %xmm0, %eax
				; X64-NEXT: pshufd {{.*#+}} xmm0 = xmm1[1,1,2,3]
				; X64-NEXT: movd %xmm0, %edx
				; X64-NEXT: mull %edx
				; X64-NEXT: cmovol %ecx, %eax
				; X64-NEXT: movd %eax, %xmm0
				; X64-NEXT: punpckldq {{.*#+}} xmm2 = xmm2[0],xmm0[0],xmm2[1],xmm0[1]
				; X64-NEXT: punpcklqdq {{.*#+}} xmm2 = xmm2[0],xmm3[0]
				; X64-NEXT: movdqa %xmm2, %xmm0
				; X64-NEXT: retq
				;
				; X86-LABEL: vec2:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: pushl %ebx
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %edi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ebx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl %eax, %ebp
				; X86-NEXT: movl $-1, %esi
				; X86-NEXT: cmovol %esi, %ebp
				; X86-NEXT: movl %ebx, %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl %eax, %ebx
				; X86-NEXT: cmovol %esi, %ebx
				; X86-NEXT: movl %edi, %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl %eax, %edi
				; X86-NEXT: cmovol %esi, %edi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: cmovol %esi, %eax
				; X86-NEXT: movl %eax, 12(%ecx)
				; X86-NEXT: movl %edi, 8(%ecx)
				; X86-NEXT: movl %ebx, 4(%ecx)
				; X86-NEXT: movl %ebp, (%ecx)
				; X86-NEXT: movl %ecx, %eax
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: popl %ebx
				; X86-NEXT: popl %ebp
				; X86-NEXT: retl $4
				%tmp = call <4 x i32> @llvm.umul.fix.sat.v4i32(<4 x i32> %x, <4 x i32> %y, i32 0)
				ret <4 x i32> %tmp
				}

				define i64 @func7(i64 %x, i64 %y) nounwind {
				; X64-LABEL: func7:
				; X64: # %bb.0:
				; X64-NEXT: movq %rdi, %rax
				; X64-NEXT: mulq %rsi
				; X64-NEXT: shrdq $32, %rdx, %rax
				; X64-NEXT: movl $4294967295, %ecx # imm = 0xFFFFFFFF
				; X64-NEXT: cmpq %rcx, %rdx
				; X64-NEXT: movq $-1, %rcx
				; X64-NEXT: cmovaq %rcx, %rax
				; X64-NEXT: retq
				;
				; X86-LABEL: func7:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: pushl %ebx
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ebp
				; X86-NEXT: movl %ecx, %eax
				; X86-NEXT: mull %ebp
				; X86-NEXT: movl %edx, %edi
				; X86-NEXT: movl %eax, %ebx
				; X86-NEXT: movl %ecx, %eax
				; X86-NEXT: mull %esi
				; X86-NEXT: movl %edx, %ecx
				; X86-NEXT: addl %ebx, %ecx
				; X86-NEXT: adcl $0, %edi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull %ebp
				; X86-NEXT: movl %edx, %ebx
				; X86-NEXT: movl %eax, %ebp
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull %esi
				; X86-NEXT: addl %ecx, %eax
				; X86-NEXT: adcl %edi, %edx
				; X86-NEXT: adcl $0, %ebx
				; X86-NEXT: addl %ebp, %edx
				; X86-NEXT: adcl $0, %ebx
				; X86-NEXT: xorl %ecx, %ecx
				; X86-NEXT: cmpl $1, %ebx
				; X86-NEXT: sbbl %ecx, %ecx
				; X86-NEXT: notl %ecx
				; X86-NEXT: orl %ecx, %eax
				; X86-NEXT: orl %ecx, %edx
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: popl %ebx
				; X86-NEXT: popl %ebp
				; X86-NEXT: retl
				%tmp = call i64 @llvm.umul.fix.sat.i64(i64 %x, i64 %y, i32 32)
				ret i64 %tmp
				}

				define i64 @func8(i64 %x, i64 %y) nounwind {
				; X64-LABEL: func8:
				; X64: # %bb.0:
				; X64-NEXT: movq %rdi, %rax
				; X64-NEXT: mulq %rsi
				; X64-NEXT: shrdq $63, %rdx, %rax
				; X64-NEXT: movabsq $9223372036854775807, %rcx # imm = 0x7FFFFFFFFFFFFFFF
				; X64-NEXT: cmpq %rcx, %rdx
				; X64-NEXT: movq $-1, %rcx
				; X64-NEXT: cmovaq %rcx, %rax
				; X64-NEXT: retq
				;
				; X86-LABEL: func8:
				; X86: # %bb.0:
				; X86-NEXT: pushl %ebp
				; X86-NEXT: pushl %ebx
				; X86-NEXT: pushl %edi
				; X86-NEXT: pushl %esi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %esi
				; X86-NEXT: movl %ecx, %eax
				; X86-NEXT: mull %esi
				; X86-NEXT: movl %edx, %edi
				; X86-NEXT: movl %eax, %ebx
				; X86-NEXT: movl %ecx, %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: movl %edx, %ebp
				; X86-NEXT: addl %ebx, %ebp
				; X86-NEXT: adcl $0, %edi
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull %esi
				; X86-NEXT: movl %edx, %ecx
				; X86-NEXT: movl %eax, %ebx
				; X86-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NEXT: mull {{[0-9]+}}(%esp)
				; X86-NEXT: addl %ebp, %eax
				; X86-NEXT: adcl %edi, %edx
				; X86-NEXT: adcl $0, %ecx
				; X86-NEXT: addl %ebx, %edx
				; X86-NEXT: adcl $0, %ecx
				; X86-NEXT: shrdl $31, %edx, %eax
				; X86-NEXT: movl %edx, %esi
				; X86-NEXT: shrl $31, %esi
				; X86-NEXT: xorl %edi, %edi
				; X86-NEXT: cmpl $1, %esi
				; X86-NEXT: sbbl %edi, %edi
				; X86-NEXT: notl %edi
				; X86-NEXT: orl %edi, %eax
				; X86-NEXT: shldl $1, %edx, %ecx
				; X86-NEXT: orl %edi, %ecx
				; X86-NEXT: movl %ecx, %edx
				; X86-NEXT: popl %esi
				; X86-NEXT: popl %edi
				; X86-NEXT: popl %ebx
				; X86-NEXT: popl %ebp
				; X86-NEXT: retl
				%tmp = call i64 @llvm.umul.fix.sat.i64(i64 %x, i64 %y, i32 63)
				ret i64 %tmp
				}

llvm/trunk/test/CodeGen/X86/vector-mulfix-legalize.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -O1 -mtriple=x86_64-unknown-unknown -o - \| FileCheck %s			; RUN: llc < %s -O1 -mtriple=x86_64-unknown-unknown -o - \| FileCheck %s

	; We used to assert on widening the SMULFIX/UMULFIX/SMULFIXSAT node result,			; We used to assert on widening the SMULFIX/UMULFIX/SMULFIXSAT node result,
	; so primiary goal with the test is to see that we support legalization for			; so primiary goal with the test is to see that we support legalization for
	; such vectors.			; such vectors.

	declare <4 x i16> @llvm.smul.fix.v4i16(<4 x i16>, <4 x i16>, i32 immarg)			declare <4 x i16> @llvm.smul.fix.v4i16(<4 x i16>, <4 x i16>, i32 immarg)
	declare <4 x i16> @llvm.umul.fix.v4i16(<4 x i16>, <4 x i16>, i32 immarg)			declare <4 x i16> @llvm.umul.fix.v4i16(<4 x i16>, <4 x i16>, i32 immarg)
	declare <4 x i16> @llvm.smul.fix.sat.v4i16(<4 x i16>, <4 x i16>, i32 immarg)			declare <4 x i16> @llvm.smul.fix.sat.v4i16(<4 x i16>, <4 x i16>, i32 immarg)
				declare <4 x i16> @llvm.umul.fix.sat.v4i16(<4 x i16>, <4 x i16>, i32 immarg)

	define <4 x i16> @smulfix(<4 x i16> %a) {			define <4 x i16> @smulfix(<4 x i16> %a) {
	; CHECK-LABEL: smulfix:			; CHECK-LABEL: smulfix:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: movdqa {{.*#+}} xmm1 = <1,2,3,4,u,u,u,u>			; CHECK-NEXT: movdqa {{.*#+}} xmm1 = <1,2,3,4,u,u,u,u>
	; CHECK-NEXT: movdqa %xmm0, %xmm2			; CHECK-NEXT: movdqa %xmm0, %xmm2
	; CHECK-NEXT: pmullw %xmm1, %xmm2			; CHECK-NEXT: pmullw %xmm1, %xmm2
	; CHECK-NEXT: psrlw $15, %xmm2			; CHECK-NEXT: psrlw $15, %xmm2
	▲ Show 20 Lines • Show All 75 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: pinsrw $3, %esi, %xmm1			; CHECK-NEXT: pinsrw $3, %esi, %xmm1
	; CHECK-NEXT: movdqa %xmm1, %xmm0			; CHECK-NEXT: movdqa %xmm1, %xmm0
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%t = call <4 x i16> @llvm.smul.fix.sat.v4i16(<4 x i16> <i16 1, i16 2, i16 3, i16 4>, <4 x i16> %a, i32 15)			%t = call <4 x i16> @llvm.smul.fix.sat.v4i16(<4 x i16> <i16 1, i16 2, i16 3, i16 4>, <4 x i16> %a, i32 15)
	ret <4 x i16> %t			ret <4 x i16> %t
	}			}


				define <4 x i16> @umulfixsat(<4 x i16> %a) {
				; CHECK-LABEL: umulfixsat:
				; CHECK: # %bb.0:
				; CHECK-NEXT: pextrw $2, %xmm0, %eax
				; CHECK-NEXT: leal (%rax,%rax,2), %eax
				; CHECK-NEXT: movl %eax, %edx
				; CHECK-NEXT: shrl $16, %edx
				; CHECK-NEXT: movl %edx, %ecx
				; CHECK-NEXT: shldw $1, %ax, %cx
				; CHECK-NEXT: cmpl $32767, %edx # imm = 0x7FFF
				; CHECK-NEXT: movl $65535, %eax # imm = 0xFFFF
				; CHECK-NEXT: cmoval %eax, %ecx
				; CHECK-NEXT: pextrw $1, %xmm0, %edx
				; CHECK-NEXT: addl %edx, %edx
				; CHECK-NEXT: movl %edx, %esi
				; CHECK-NEXT: shrl $16, %esi
				; CHECK-NEXT: movl %esi, %edi
				; CHECK-NEXT: shldw $1, %dx, %di
				; CHECK-NEXT: cmpl $32767, %esi # imm = 0x7FFF
				; CHECK-NEXT: cmoval %eax, %edi
				; CHECK-NEXT: movd %xmm0, %edx
				; CHECK-NEXT: xorl %esi, %esi
				; CHECK-NEXT: shldw $1, %dx, %si
				; CHECK-NEXT: movl $32767, %edx # imm = 0x7FFF
				; CHECK-NEXT: negl %edx
				; CHECK-NEXT: cmoval %eax, %esi
				; CHECK-NEXT: pxor %xmm1, %xmm1
				; CHECK-NEXT: pinsrw $0, %esi, %xmm1
				; CHECK-NEXT: pinsrw $1, %edi, %xmm1
				; CHECK-NEXT: pinsrw $2, %ecx, %xmm1
				; CHECK-NEXT: pextrw $3, %xmm0, %ecx
				; CHECK-NEXT: shll $2, %ecx
				; CHECK-NEXT: movl %ecx, %edx
				; CHECK-NEXT: shrl $16, %edx
				; CHECK-NEXT: movl %edx, %esi
				; CHECK-NEXT: shldw $1, %cx, %si
				; CHECK-NEXT: cmpl $32767, %edx # imm = 0x7FFF
				; CHECK-NEXT: cmoval %eax, %esi
				; CHECK-NEXT: pinsrw $3, %esi, %xmm1
				; CHECK-NEXT: movdqa %xmm1, %xmm0
				; CHECK-NEXT: retq
				%t = call <4 x i16> @llvm.umul.fix.sat.v4i16(<4 x i16> <i16 1, i16 2, i16 3, i16 4>, <4 x i16> %a, i32 15)
				ret <4 x i16> %t
				}

llvm/trunk/test/Transforms/Scalarizer/intrinsics.ll

	Show All 17 Lines
	; Unary int plus constant scalar operand			; Unary int plus constant scalar operand
	declare <2 x i32> @llvm.ctlz.v2i32(<2 x i32>, i1)			declare <2 x i32> @llvm.ctlz.v2i32(<2 x i32>, i1)

	; Unary fp plus any scalar operand			; Unary fp plus any scalar operand
	declare <2 x float> @llvm.powi.v2f32(<2 x float>, i32)			declare <2 x float> @llvm.powi.v2f32(<2 x float>, i32)

	; Binary int plus constant scalar operand			; Binary int plus constant scalar operand
	declare <2 x i32> @llvm.smul.fix.sat.v2i32(<2 x i32>, <2 x i32>, i32)			declare <2 x i32> @llvm.smul.fix.sat.v2i32(<2 x i32>, <2 x i32>, i32)
				declare <2 x i32> @llvm.umul.fix.sat.v2i32(<2 x i32>, <2 x i32>, i32)


	; CHECK-LABEL: @scalarize_sqrt_v2f32(			; CHECK-LABEL: @scalarize_sqrt_v2f32(
	; CHECK: %sqrt.i0 = call float @llvm.sqrt.f32(float %x.i0)			; CHECK: %sqrt.i0 = call float @llvm.sqrt.f32(float %x.i0)
	; CHECK: %sqrt.i1 = call float @llvm.sqrt.f32(float %x.i1)			; CHECK: %sqrt.i1 = call float @llvm.sqrt.f32(float %x.i1)
	; CHECK: %sqrt.upto0 = insertelement <2 x float> undef, float %sqrt.i0, i32 0			; CHECK: %sqrt.upto0 = insertelement <2 x float> undef, float %sqrt.i0, i32 0
	; CHECK: %sqrt = insertelement <2 x float> %sqrt.upto0, float %sqrt.i1, i32 1			; CHECK: %sqrt = insertelement <2 x float> %sqrt.upto0, float %sqrt.i1, i32 1
	; CHECK: ret <2 x float> %sqrt			; CHECK: ret <2 x float> %sqrt
	▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines
	; CHECK: %smulfixsat.i1 = call i32 @llvm.smul.fix.sat.i32(i32 %x.i1, i32 19, i32 31)			; CHECK: %smulfixsat.i1 = call i32 @llvm.smul.fix.sat.i32(i32 %x.i1, i32 19, i32 31)
	; CHECK: %smulfixsat.upto0 = insertelement <2 x i32> undef, i32 %smulfixsat.i0, i32 0			; CHECK: %smulfixsat.upto0 = insertelement <2 x i32> undef, i32 %smulfixsat.i0, i32 0
	; CHECK: %smulfixsat = insertelement <2 x i32> %smulfixsat.upto0, i32 %smulfixsat.i1, i32 1			; CHECK: %smulfixsat = insertelement <2 x i32> %smulfixsat.upto0, i32 %smulfixsat.i1, i32 1
	; CHECK: ret <2 x i32> %smulfixsat			; CHECK: ret <2 x i32> %smulfixsat
	define <2 x i32> @scalarize_smul_fix_sat_v2i32(<2 x i32> %x) #0 {			define <2 x i32> @scalarize_smul_fix_sat_v2i32(<2 x i32> %x) #0 {
	%smulfixsat = call <2 x i32> @llvm.smul.fix.sat.v2i32(<2 x i32> %x, <2 x i32> <i32 5, i32 19>, i32 31)			%smulfixsat = call <2 x i32> @llvm.smul.fix.sat.v2i32(<2 x i32> %x, <2 x i32> <i32 5, i32 19>, i32 31)
	ret <2 x i32> %smulfixsat			ret <2 x i32> %smulfixsat
	}			}

				; CHECK-LABEL: @scalarize_umul_fix_sat_v2i32(
				; CHECK: %umulfixsat.i0 = call i32 @llvm.umul.fix.sat.i32(i32 %x.i0, i32 5, i32 31)
				; CHECK: %umulfixsat.i1 = call i32 @llvm.umul.fix.sat.i32(i32 %x.i1, i32 19, i32 31)
				; CHECK: %umulfixsat.upto0 = insertelement <2 x i32> undef, i32 %umulfixsat.i0, i32 0
				; CHECK: %umulfixsat = insertelement <2 x i32> %umulfixsat.upto0, i32 %umulfixsat.i1, i32 1
				; CHECK: ret <2 x i32> %umulfixsat
				define <2 x i32> @scalarize_umul_fix_sat_v2i32(<2 x i32> %x) #0 {
				%umulfixsat = call <2 x i32> @llvm.umul.fix.sat.v2i32(<2 x i32> %x, <2 x i32> <i32 5, i32 19>, i32 31)
				ret <2 x i32> %umulfixsat
				}

llvm/trunk/test/Verifier/intrinsic-immarg.ll

	Show First 20 Lines • Show All 68 Lines • ▼ Show 20 Lines
	define i64 @smul_fix(i64 %arg0, i64 %arg1, i32 %arg2) {			define i64 @smul_fix(i64 %arg0, i64 %arg1, i32 %arg2) {
	; CHECK: immarg operand has non-immediate parameter			; CHECK: immarg operand has non-immediate parameter
	; CHECK-NEXT: i32 %arg2			; CHECK-NEXT: i32 %arg2
	; CHECK-NEXT: %ret = call i64 @llvm.smul.fix.i64(i64 %arg0, i64 %arg1, i32 %arg2)			; CHECK-NEXT: %ret = call i64 @llvm.smul.fix.i64(i64 %arg0, i64 %arg1, i32 %arg2)
	%ret = call i64 @llvm.smul.fix.i64(i64 %arg0, i64 %arg1, i32 %arg2)			%ret = call i64 @llvm.smul.fix.i64(i64 %arg0, i64 %arg1, i32 %arg2)
	ret i64 %ret			ret i64 %ret
	}			}

				declare i64 @llvm.smul.fix.sat.i64(i64, i64, i32)
				define i64 @smul_fix_sat(i64 %arg0, i64 %arg1, i32 %arg2) {
				; CHECK: immarg operand has non-immediate parameter
				; CHECK-NEXT: i32 %arg2
				; CHECK-NEXT: %ret = call i64 @llvm.smul.fix.sat.i64(i64 %arg0, i64 %arg1, i32 %arg2)
				%ret = call i64 @llvm.smul.fix.sat.i64(i64 %arg0, i64 %arg1, i32 %arg2)
				ret i64 %ret
				}

	declare i64 @llvm.umul.fix.i64(i64, i64, i32)			declare i64 @llvm.umul.fix.i64(i64, i64, i32)
	define i64 @umul_fix(i64 %arg0, i64 %arg1, i32 %arg2) {			define i64 @umul_fix(i64 %arg0, i64 %arg1, i32 %arg2) {
	; CHECK: immarg operand has non-immediate parameter			; CHECK: immarg operand has non-immediate parameter
	; CHECK-NEXT: i32 %arg2			; CHECK-NEXT: i32 %arg2
	; CHECK-NEXT: %ret = call i64 @llvm.umul.fix.i64(i64 %arg0, i64 %arg1, i32 %arg2)			; CHECK-NEXT: %ret = call i64 @llvm.umul.fix.i64(i64 %arg0, i64 %arg1, i32 %arg2)
	%ret = call i64 @llvm.umul.fix.i64(i64 %arg0, i64 %arg1, i32 %arg2)			%ret = call i64 @llvm.umul.fix.i64(i64 %arg0, i64 %arg1, i32 %arg2)
	ret i64 %ret			ret i64 %ret
	}			}

				declare i64 @llvm.umul.fix.sat.i64(i64, i64, i32)
				define i64 @umul_fix_sat(i64 %arg0, i64 %arg1, i32 %arg2) {
				; CHECK: immarg operand has non-immediate parameter
				; CHECK-NEXT: i32 %arg2
				; CHECK-NEXT: %ret = call i64 @llvm.umul.fix.sat.i64(i64 %arg0, i64 %arg1, i32 %arg2)
				%ret = call i64 @llvm.umul.fix.sat.i64(i64 %arg0, i64 %arg1, i32 %arg2)
				ret i64 %ret
				}

	declare <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>*, i32, <2 x i1>, <2 x double>)			declare <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>*, i32, <2 x i1>, <2 x double>)
	define <2 x double> @masked_load(<2 x i1> %mask, <2 x double>* %addr, <2 x double> %dst, i32 %align) {			define <2 x double> @masked_load(<2 x i1> %mask, <2 x double>* %addr, <2 x double> %dst, i32 %align) {
	; CHECK: immarg operand has non-immediate parameter			; CHECK: immarg operand has non-immediate parameter
	; CHECK-NEXT: i32 %align			; CHECK-NEXT: i32 %align
	; CHECK-NEXT: %res = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>* %addr, i32 %align, <2 x i1> %mask, <2 x double> %dst)			; CHECK-NEXT: %res = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>* %addr, i32 %align, <2 x i1> %mask, <2 x double> %dst)
	%res = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>* %addr, i32 %align, <2 x i1> %mask, <2 x double> %dst)			%res = call <2 x double> @llvm.masked.load.v2f64.p0v2f64(<2 x double>* %addr, i32 %align, <2 x i1> %mask, <2 x double> %dst)
	ret <2 x double> %res			ret <2 x double> %res
	}			}
	▲ Show 20 Lines • Show All 121 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[Intrinsic] Unsigned Fixed Point Saturation Multiplication IntrinsicClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 219237

llvm/trunk/docs/LangRef.rst

llvm/trunk/include/llvm/CodeGen/ISDOpcodes.h

llvm/trunk/include/llvm/CodeGen/TargetLowering.h

llvm/trunk/include/llvm/IR/Intrinsics.td

llvm/trunk/include/llvm/Target/TargetSelectionDAG.td

llvm/trunk/lib/Analysis/VectorUtils.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/TargetLowering.cpp

llvm/trunk/lib/CodeGen/TargetLoweringBase.cpp

llvm/trunk/lib/IR/Verifier.cpp

llvm/trunk/test/CodeGen/PowerPC/umulfixsat.ll

llvm/trunk/test/CodeGen/X86/mulfix_combine.ll

llvm/trunk/test/CodeGen/X86/umul_fix_sat.ll

llvm/trunk/test/CodeGen/X86/vector-mulfix-legalize.ll

llvm/trunk/test/Transforms/Scalarizer/intrinsics.ll

llvm/trunk/test/Verifier/intrinsic-immarg.ll

[Intrinsic] Unsigned Fixed Point Saturation Multiplication Intrinsic
ClosedPublic