Diff 351135

llvm/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 18,220 Lines • ▼ Show 20 Lines	.. code-block:: llvm

%r = call <4 x i32> @llvm.vp.xor.v4i32(<4 x i32> %a, <4 x i32> %b, <4 x i1> %mask, i32 %evl)		%r = call <4 x i32> @llvm.vp.xor.v4i32(<4 x i32> %a, <4 x i32> %b, <4 x i1> %mask, i32 %evl)
;; For all lanes below %evl, %r is lane-wise equivalent to %also.r		;; For all lanes below %evl, %r is lane-wise equivalent to %also.r

%t = xor <4 x i32> %a, %b		%t = xor <4 x i32> %a, %b
%also.r = select <4 x i1> %mask, <4 x i32> %t, <4 x i32> undef		%also.r = select <4 x i1> %mask, <4 x i32> %t, <4 x i32> undef


		.. _int_vp_fadd:

		'``llvm.vp.fadd.*``' Intrinsics
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Syntax:
		"""""""
		This is an overloaded intrinsic.

		::

		declare <16 x float> @llvm.vp.fadd.v16f32 (<16 x float> <left_op>, <16 x float> <right_op>, <16 x i1> <mask>, i32 <vector_length>)
		declare <vscale x 4 x float> @llvm.vp.fadd.nxv4f32 (<vscale x 4 x float> <left_op>, <vscale x 4 x float> <right_op>, <vscale x 4 x i1> <mask>, i32 <vector_length>)
		declare <256 x double> @llvm.vp.fadd.v256f64 (<256 x double> <left_op>, <256 x double> <right_op>, <256 x i1> <mask>, i32 <vector_length>)

		Overview:
		"""""""""

		Predicated floating-point addition of two vectors of floating-point values.


		Arguments:
		""""""""""

		The first two operands and the result have the same vector of floating-point type. The
		third operand is the vector mask and has the same number of elements as the
		result vector type. The fourth operand is the explicit vector length of the
		operation.

		Semantics:
		""""""""""

		The '``llvm.vp.fadd``' intrinsic performs floating-point addition (:ref:`add <i_fadd>`)
		of the first and second vector operand on each enabled lane. The result on
		disabled lanes is undefined. The operation is performed in the default
		floating-point environment.

		Examples:
		"""""""""

		.. code-block:: llvm

		%r = call <4 x float> @llvm.vp.fadd.v4f32(<4 x float> %a, <4 x float> %b, <4 x i1> %mask, i32 %evl)
		;; For all lanes below %evl, %r is lane-wise equivalent to %also.r
		frasercrmckUnsubmitted Done Reply Inline Actions I realise this inherits from the documentation of the integer intrinsics, but I was wondering if this can be expressed as being equivalent to `<%evl x float>` in the following example. Would that be any clearer that the intrinsic is conceptually working on vectors of length `%evl` isn't actually executing the lanes above `%evl` (as in `%t` below)? frasercrmck: I realise this inherits from the documentation of the integer intrinsics, but I was wondering…
		simollAuthorUnsubmitted Done Reply Inline Actions I see what are you getting at with this. I am not sure that introducing novel syntax only for explaining things is really helpful. My take here is that most aren't used to the `%evl` concept - but it is not a complicated thing per se. simoll: I see what are you getting at with this. I am not sure that introducing novel syntax only for…

		%t = fadd <4 x float> %a, %b
		%also.r = select <4 x i1> %mask, <4 x float> %t, <4 x float> undef
		majnemerUnsubmitted Done Reply Inline Actions Would it be more general/useful to have the intrinsics take an alternative value just like llvm.masked.load and its passthru operand? You would get identical behavior to your proposal by setting this passthru to undef. majnemer: Would it be more general/useful to have the intrinsics take an alternative value just like llvm.
		simollAuthorUnsubmitted Done Reply Inline Actions My take is this: if there was a passthru parameter we'd still have to optimize/match the `op+select` into `op_with_passthru`. So, if the simple op+select idiom (that people are already using, btw) suffices to model passthru why add a redundant parameter/code path? We had this discussion a while back: https://reviews.llvm.org/D57504#1851456 simoll: My take is this: if there was a passthru parameter we'd still have to optimize/match the…
		majnemerUnsubmitted Done Reply Inline Actions Ah, I see. Fair enough :) majnemer: Ah, I see. Fair enough :)
		simollAuthorUnsubmitted Done Reply Inline Actions Thanks for following up on your drive-by comment ;-) simoll: Thanks for following up on your drive-by comment ;-)


		.. _int_vp_fsub:

		'``llvm.vp.fsub.*``' Intrinsics
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Syntax:
		"""""""
		This is an overloaded intrinsic.

		::

		declare <16 x float> @llvm.vp.fsub.v16f32 (<16 x float> <left_op>, <16 x float> <right_op>, <16 x i1> <mask>, i32 <vector_length>)
		declare <vscale x 4 x float> @llvm.vp.fsub.nxv4f32 (<vscale x 4 x float> <left_op>, <vscale x 4 x float> <right_op>, <vscale x 4 x i1> <mask>, i32 <vector_length>)
		declare <256 x double> @llvm.vp.fsub.v256f64 (<256 x double> <left_op>, <256 x double> <right_op>, <256 x i1> <mask>, i32 <vector_length>)

		Overview:
		"""""""""

		Predicated floating-point subtraction of two vectors of floating-point values.
		frasercrmckUnsubmitted Done Reply Inline Actions `addition` -> `subtraction` frasercrmck: `addition` -> `subtraction`


		Arguments:
		""""""""""

		The first two operands and the result have the same vector of floating-point type. The
		third operand is the vector mask and has the same number of elements as the
		result vector type. The fourth operand is the explicit vector length of the
		operation.

		Semantics:
		""""""""""

		The '``llvm.vp.fsub``' intrinsic performs floating-point subtraction (:ref:`add <i_fsub>`)
		frasercrmckUnsubmitted Done Reply Inline Actions same here: `addition` and maybe `add <i_fsub>`? frasercrmck: same here: `addition` and maybe `add <i_fsub>`?
		of the first and second vector operand on each enabled lane. The result on
		disabled lanes is undefined. The operation is performed in the default
		floating-point environment.

		Examples:
		"""""""""

		.. code-block:: llvm

		%r = call <4 x float> @llvm.vp.fsub.v4f32(<4 x float> %a, <4 x float> %b, <4 x i1> %mask, i32 %evl)
		;; For all lanes below %evl, %r is lane-wise equivalent to %also.r

		%t = fsub <4 x float> %a, %b
		%also.r = select <4 x i1> %mask, <4 x float> %t, <4 x float> undef


		.. _int_vp_fmul:

		'``llvm.vp.fmul.*``' Intrinsics
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Syntax:
		"""""""
		This is an overloaded intrinsic.

		::

		declare <16 x float> @llvm.vp.fmul.v16f32 (<16 x float> <left_op>, <16 x float> <right_op>, <16 x i1> <mask>, i32 <vector_length>)
		declare <vscale x 4 x float> @llvm.vp.fmul.nxv4f32 (<vscale x 4 x float> <left_op>, <vscale x 4 x float> <right_op>, <vscale x 4 x i1> <mask>, i32 <vector_length>)
		declare <256 x double> @llvm.vp.fmul.v256f64 (<256 x double> <left_op>, <256 x double> <right_op>, <256 x i1> <mask>, i32 <vector_length>)

		Overview:
		"""""""""

		Predicated floating-point multiplication of two vectors of floating-point values.
		frasercrmckUnsubmitted Done Reply Inline Actions `addition` -> `multiplication` frasercrmck: `addition` -> `multiplication`


		Arguments:
		""""""""""

		The first two operands and the result have the same vector of floating-point type. The
		third operand is the vector mask and has the same number of elements as the
		result vector type. The fourth operand is the explicit vector length of the
		operation.

		Semantics:
		""""""""""

		The '``llvm.vp.fmul``' intrinsic performs floating-point multiplication (:ref:`add <i_fmul>`)
		frasercrmckUnsubmitted Done Reply Inline Actions `addition` -> `multiplication` frasercrmck: `addition` -> `multiplication`
		of the first and second vector operand on each enabled lane. The result on
		disabled lanes is undefined. The operation is performed in the default
		floating-point environment.

		Examples:
		"""""""""

		.. code-block:: llvm

		%r = call <4 x float> @llvm.vp.fmul.v4f32(<4 x float> %a, <4 x float> %b, <4 x i1> %mask, i32 %evl)
		;; For all lanes below %evl, %r is lane-wise equivalent to %also.r

		%t = fmul <4 x float> %a, %b
		%also.r = select <4 x i1> %mask, <4 x float> %t, <4 x float> undef


		.. _int_vp_fdiv:

		'``llvm.vp.fdiv.*``' Intrinsics
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Syntax:
		"""""""
		This is an overloaded intrinsic.

		::

		declare <16 x float> @llvm.vp.fdiv.v16f32 (<16 x float> <left_op>, <16 x float> <right_op>, <16 x i1> <mask>, i32 <vector_length>)
		declare <vscale x 4 x float> @llvm.vp.fdiv.nxv4f32 (<vscale x 4 x float> <left_op>, <vscale x 4 x float> <right_op>, <vscale x 4 x i1> <mask>, i32 <vector_length>)
		declare <256 x double> @llvm.vp.fdiv.v256f64 (<256 x double> <left_op>, <256 x double> <right_op>, <256 x i1> <mask>, i32 <vector_length>)

		Overview:
		"""""""""

		Predicated floating-point division of two vectors of floating-point values.
		frasercrmckUnsubmitted Done Reply Inline Actions `addition` -> `division` frasercrmck: `addition` -> `division`


		Arguments:
		""""""""""

		The first two operands and the result have the same vector of floating-point type. The
		third operand is the vector mask and has the same number of elements as the
		result vector type. The fourth operand is the explicit vector length of the
		operation.

		Semantics:
		""""""""""

		The '``llvm.vp.fdiv``' intrinsic performs floating-point division (:ref:`add <i_fdiv>`)
		frasercrmckUnsubmitted Done Reply Inline Actions same here frasercrmck: same here
		of the first and second vector operand on each enabled lane. The result on
		disabled lanes is undefined. The operation is performed in the default
		floating-point environment.

		Examples:
		"""""""""

		.. code-block:: llvm

		%r = call <4 x float> @llvm.vp.fdiv.v4f32(<4 x float> %a, <4 x float> %b, <4 x i1> %mask, i32 %evl)
		;; For all lanes below %evl, %r is lane-wise equivalent to %also.r

		%t = fdiv <4 x float> %a, %b
		%also.r = select <4 x i1> %mask, <4 x float> %t, <4 x float> undef


		.. _int_vp_frem:

		'``llvm.vp.frem.*``' Intrinsics
		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

		Syntax:
		"""""""
		This is an overloaded intrinsic.

		::

		declare <16 x float> @llvm.vp.frem.v16f32 (<16 x float> <left_op>, <16 x float> <right_op>, <16 x i1> <mask>, i32 <vector_length>)
		declare <vscale x 4 x float> @llvm.vp.frem.nxv4f32 (<vscale x 4 x float> <left_op>, <vscale x 4 x float> <right_op>, <vscale x 4 x i1> <mask>, i32 <vector_length>)
		declare <256 x double> @llvm.vp.frem.v256f64 (<256 x double> <left_op>, <256 x double> <right_op>, <256 x i1> <mask>, i32 <vector_length>)

		Overview:
		"""""""""

		Predicated floating-point remainder of two vectors of floating-point values.
		frasercrmckUnsubmitted Done Reply Inline Actions `addition` -> `remainder` frasercrmck: `addition` -> `remainder`


		Arguments:
		""""""""""

		The first two operands and the result have the same vector of floating-point type. The
		third operand is the vector mask and has the same number of elements as the
		result vector type. The fourth operand is the explicit vector length of the
		operation.

		Semantics:
		""""""""""

		The '``llvm.vp.frem``' intrinsic performs floating-point remainder (:ref:`add <i_frem>`)
		frasercrmckUnsubmitted Done Reply Inline Actions `addition` -> `remainder` frasercrmck: `addition` -> `remainder`
		of the first and second vector operand on each enabled lane. The result on
		disabled lanes is undefined. The operation is performed in the default
		floating-point environment.

		Examples:
		"""""""""

		.. code-block:: llvm

		%r = call <4 x float> @llvm.vp.frem.v4f32(<4 x float> %a, <4 x float> %b, <4 x i1> %mask, i32 %evl)
		;; For all lanes below %evl, %r is lane-wise equivalent to %also.r

		%t = frem <4 x float> %a, %b
		%also.r = select <4 x i1> %mask, <4 x float> %t, <4 x float> undef



.. _int_get_active_lane_mask:		.. _int_get_active_lane_mask:

'``llvm.get.active.lane.mask.*``' Intrinsics		'``llvm.get.active.lane.mask.*``' Intrinsics
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^		^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Syntax:		Syntax:
"""""""		"""""""
This is an overloaded intrinsic.		This is an overloaded intrinsic.
▲ Show 20 Lines • Show All 4,033 Lines • Show Last 20 Lines

llvm/include/llvm/IR/Intrinsics.td

Show First 20 Lines • Show All 1,433 Lines • ▼ Show 20 Lines	def int_vp_srem : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
llvm_i32_ty]>;		llvm_i32_ty]>;
def int_vp_urem : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],		def int_vp_urem : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
[ LLVMMatchType<0>,		[ LLVMMatchType<0>,
LLVMMatchType<0>,		LLVMMatchType<0>,
LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,		LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
llvm_i32_ty]>;		llvm_i32_ty]>;
}		}

		// Floating-point arithmetic.
		let IntrProperties =
		[IntrSpeculatable, IntrNoMem, IntrNoSync, IntrWillReturn] in {
		def int_vp_fadd : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
		[ LLVMMatchType<0>,
		LLVMMatchType<0>,
		LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
		llvm_i32_ty]>;
		def int_vp_fsub : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
		[ LLVMMatchType<0>,
		LLVMMatchType<0>,
		LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
		llvm_i32_ty]>;
		def int_vp_fmul : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
		[ LLVMMatchType<0>,
		LLVMMatchType<0>,
		LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
		llvm_i32_ty]>;
		def int_vp_fdiv : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
		[ LLVMMatchType<0>,
		LLVMMatchType<0>,
		LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
		llvm_i32_ty]>;
		def int_vp_frem : DefaultAttrsIntrinsic<[ llvm_anyvector_ty ],
		[ LLVMMatchType<0>,
		LLVMMatchType<0>,
		LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
		llvm_i32_ty]>;
		}

def int_get_active_lane_mask:		def int_get_active_lane_mask:
DefaultAttrsIntrinsic<[llvm_anyvector_ty],		DefaultAttrsIntrinsic<[llvm_anyvector_ty],
[llvm_anyint_ty, LLVMMatchType<1>],		[llvm_anyint_ty, LLVMMatchType<1>],
[IntrNoMem, IntrNoSync, IntrWillReturn]>;		[IntrNoMem, IntrNoSync, IntrWillReturn]>;

//===-------------------------- Masked Intrinsics -------------------------===//		//===-------------------------- Masked Intrinsics -------------------------===//
//		//
def int_masked_load:		def int_masked_load:
▲ Show 20 Lines • Show All 275 Lines • Show Last 20 Lines

llvm/include/llvm/IR/VPIntrinsics.def

	Show First 20 Lines • Show All 82 Lines • ▼ Show 20 Lines
	// Property Macros {			// Property Macros {

	// The intrinsic and/or SDNode has the same function as this LLVM IR Opcode.			// The intrinsic and/or SDNode has the same function as this LLVM IR Opcode.
	// \p OPC The standard IR opcode.			// \p OPC The standard IR opcode.
	#ifndef HANDLE_VP_TO_OPC			#ifndef HANDLE_VP_TO_OPC
	#define HANDLE_VP_TO_OPC(OPC)			#define HANDLE_VP_TO_OPC(OPC)
	#endif			#endif

				// Whether the intrinsic may have a rounding mode or exception behavior operand
				// bundle.
				// \p HASROUND '1' if the intrinsic can have a rounding mode operand bundle,
				// '0' otherwise.
				// \p HASEXCEPT '1' if the intrinsic can have an exception behavior operand
				// bundle, '0' otherwise.
				// \p INTRINID The constrained fp intrinsic this VP intrinsic corresponds to.
				#ifndef HANDLE_VP_TO_CONSTRAINEDFP
				#define HANDLE_VP_TO_CONSTRAINEDFP(HASROUND, HASEXCEPT, INTRINID)
				#endif

	/// } Property Macros			/// } Property Macros

	///// Integer Arithmetic {			///// Integer Arithmetic {

	// Specialized helper macro for integer binary operators (%x, %y, %mask, %evl).			// Specialized helper macro for integer binary operators (%x, %y, %mask, %evl).
	#ifdef HELPER_REGISTER_BINARY_INT_VP			#ifdef HELPER_REGISTER_BINARY_INT_VP
	#error "The internal helper macro HELPER_REGISTER_BINARY_INT_VP is already defined!"			#error "The internal helper macro HELPER_REGISTER_BINARY_INT_VP is already defined!"
	#endif			#endif
	▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines

	// llvm.vp.xor(x,y,mask,vlen)			// llvm.vp.xor(x,y,mask,vlen)
	HELPER_REGISTER_BINARY_INT_VP(vp_xor, VP_XOR, Xor)			HELPER_REGISTER_BINARY_INT_VP(vp_xor, VP_XOR, Xor)

	#undef HELPER_REGISTER_BINARY_INT_VP			#undef HELPER_REGISTER_BINARY_INT_VP

	///// } Integer Arithmetic			///// } Integer Arithmetic

				///// Floating-Point Arithmetic {

				// Specialized helper macro for floating-point binary operators
				frasercrmckUnsubmitted Done Reply Inline Actions `integer` -> `floating-point`? frasercrmck: `integer` -> `floating-point`?
				// <operation>(%x, %y, %mask, %evl).
				#ifdef HELPER_REGISTER_BINARY_FP_VP
				#error \
				"The internal helper macro HELPER_REGISTER_BINARY_FP_VP is already defined!"
				#endif
				#define HELPER_REGISTER_BINARY_FP_VP(OPSUFFIX, SDOPC, OPC) \
				BEGIN_REGISTER_VP(vp_##OPSUFFIX, 2, 3, SDOPC, -1) \
				HANDLE_VP_TO_OPC(OPC) \
				HANDLE_VP_TO_CONSTRAINEDFP(1, 1, experimental_constrained_##OPSUFFIX) \
				END_REGISTER_VP(vp_##OPSUFFIX, SDOPC)

				// llvm.vp.fadd(x,y,mask,vlen)
				simollAuthorUnsubmitted Done Reply Inline Actions .. `fsub` is missing here.. i will supplement it and add a unit test to check the integrity of the VPIntrinsics.def file. simoll: .. `fsub` is missing here.. i will supplement it and add a unit test to check the integrity of…
				HELPER_REGISTER_BINARY_FP_VP(fadd, VP_FADD, FAdd)

				// llvm.vp.fsub(x,y,mask,vlen)
				HELPER_REGISTER_BINARY_FP_VP(fsub, VP_FSUB, FSub)

				// llvm.vp.fmul(x,y,mask,vlen)
				HELPER_REGISTER_BINARY_FP_VP(fmul, VP_FMUL, FMul)

				// llvm.vp.fdiv(x,y,mask,vlen)
				HELPER_REGISTER_BINARY_FP_VP(fdiv, VP_FDIV, FDiv)

				// llvm.vp.frem(x,y,mask,vlen)
				HELPER_REGISTER_BINARY_FP_VP(frem, VP_FREM, FRem)

				#undef HELPER_REGISTER_BINARY_FP_VP

				///// } Floating-Point Arithmetic

	#undef BEGIN_REGISTER_VP			#undef BEGIN_REGISTER_VP
	#undef BEGIN_REGISTER_VP_INTRINSIC			#undef BEGIN_REGISTER_VP_INTRINSIC
	#undef BEGIN_REGISTER_VP_SDNODE			#undef BEGIN_REGISTER_VP_SDNODE
	#undef END_REGISTER_VP			#undef END_REGISTER_VP
	#undef END_REGISTER_VP_INTRINSIC			#undef END_REGISTER_VP_INTRINSIC
	#undef END_REGISTER_VP_SDNODE			#undef END_REGISTER_VP_SDNODE
	#undef HANDLE_VP_TO_OPC			#undef HANDLE_VP_TO_OPC
				#undef HANDLE_VP_TO_CONSTRAINEDFP

llvm/test/Verifier/vp-intrinsics.ll

Show All 11 Lines	define void @test_vp_int(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n) {
%r8 = call <8 x i32> @llvm.vp.or.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)		%r8 = call <8 x i32> @llvm.vp.or.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)
%r9 = call <8 x i32> @llvm.vp.xor.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)		%r9 = call <8 x i32> @llvm.vp.xor.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)
%rA = call <8 x i32> @llvm.vp.ashr.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)		%rA = call <8 x i32> @llvm.vp.ashr.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)
%rB = call <8 x i32> @llvm.vp.lshr.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)		%rB = call <8 x i32> @llvm.vp.lshr.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)
%rC = call <8 x i32> @llvm.vp.shl.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)		%rC = call <8 x i32> @llvm.vp.shl.v8i32(<8 x i32> %i0, <8 x i32> %i1, <8 x i1> %m, i32 %n)
ret void		ret void
}		}


		define void @test_vp_fp(<8 x double> %f0, <8 x double> %f1, <8 x i1> %m, i32 %n) {
		%r0 = call <8 x double> @llvm.vp.fadd.v8f64(<8 x double> %f0, <8 x double> %f1, <8 x i1> %m, i32 %n)
		%r1 = call <8 x double> @llvm.vp.fsub.v8f64(<8 x double> %f0, <8 x double> %f1, <8 x i1> %m, i32 %n)
		%r2 = call <8 x double> @llvm.vp.fmul.v8f64(<8 x double> %f0, <8 x double> %f1, <8 x i1> %m, i32 %n)
		%r3 = call <8 x double> @llvm.vp.fdiv.v8f64(<8 x double> %f0, <8 x double> %f1, <8 x i1> %m, i32 %n)
		%r4 = call <8 x double> @llvm.vp.frem.v8f64(<8 x double> %f0, <8 x double> %f1, <8 x i1> %m, i32 %n)
		ret void
		}

		; TODO: test_vp_constrained_fp

; integer arith		; integer arith
declare <8 x i32> @llvm.vp.add.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.add.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.sub.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.sub.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.mul.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.mul.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.sdiv.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.sdiv.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.srem.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.srem.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.udiv.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.udiv.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.urem.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.urem.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
; bit arith		; bit arith
declare <8 x i32> @llvm.vp.and.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.and.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.or.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.or.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.xor.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.xor.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.ashr.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.ashr.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.lshr.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.lshr.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
declare <8 x i32> @llvm.vp.shl.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)		declare <8 x i32> @llvm.vp.shl.v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32)
		; fp arith
		declare <8 x double> @llvm.vp.fadd.v8f64(<8 x double>, <8 x double>, <8 x i1>, i32)
		declare <8 x double> @llvm.vp.fsub.v8f64(<8 x double>, <8 x double>, <8 x i1>, i32)
		declare <8 x double> @llvm.vp.fmul.v8f64(<8 x double>, <8 x double>, <8 x i1>, i32)
		declare <8 x double> @llvm.vp.fdiv.v8f64(<8 x double>, <8 x double>, <8 x i1>, i32)
		declare <8 x double> @llvm.vp.frem.v8f64(<8 x double>, <8 x double>, <8 x i1>, i32)

llvm/unittests/IR/VPIntrinsicTest.cpp

Show All 13 Lines
#include "llvm/IR/IntrinsicInst.h"		#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/IR/Verifier.h"		#include "llvm/IR/Verifier.h"
#include "llvm/Support/SourceMgr.h"		#include "llvm/Support/SourceMgr.h"
#include "gtest/gtest.h"		#include "gtest/gtest.h"
#include <sstream>		#include <sstream>

using namespace llvm;		using namespace llvm;
		frasercrmckUnsubmitted Done Reply Inline Actions Is this used? frasercrmck: Is this used?
		craig.topperUnsubmitted Done Reply Inline Actions This looks duplicated craig.topper: This looks duplicated

namespace {		namespace {

class VPIntrinsicTest : public testing::Test {		class VPIntrinsicTest : public testing::Test {
protected:		protected:
LLVMContext Context;		LLVMContext Context;

VPIntrinsicTest() : Context() {}		VPIntrinsicTest() : Context() {}

LLVMContext C;		LLVMContext C;
SMDiagnostic Err;		SMDiagnostic Err;

std::unique_ptr<Module> CreateVPDeclarationModule() {		std::unique_ptr<Module> CreateVPDeclarationModule() {
const char *BinaryIntOpcodes[] = {"add", "sub", "mul", "sdiv", "srem",		const char *BinaryIntOpcodes[] = {"add", "sub", "mul", "sdiv", "srem",
"udiv", "urem", "and", "xor", "or",		"udiv", "urem", "and", "xor", "or",
"ashr", "lshr", "shl"};		"ashr", "lshr", "shl"};
std::stringstream Str;		std::stringstream Str;
		simollAuthorUnsubmitted Done Reply Inline Actions This is why we need the `sstream` include. simoll: This is why we need the `sstream` include.
for (const char *BinaryIntOpcode : BinaryIntOpcodes)		for (const char *BinaryIntOpcode : BinaryIntOpcodes)
Str << " declare <8 x i32> @llvm.vp." << BinaryIntOpcode		Str << " declare <8 x i32> @llvm.vp." << BinaryIntOpcode
<< ".v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32) ";		<< ".v8i32(<8 x i32>, <8 x i32>, <8 x i1>, i32) ";

		const char *BinaryFPOpcodes[] = {"fadd", "fsub", "fmul", "fdiv", "frem"};
		for (const char *BinaryFPOpcode : BinaryFPOpcodes)
		Str << " declare <8 x float> @llvm.vp." << BinaryFPOpcode
		<< ".v8f32(<8 x float>, <8 x float>, <8 x i1>, i32) ";

return parseAssemblyString(Str.str(), Err, C);		return parseAssemblyString(Str.str(), Err, C);
}		}
};		};

/// Check that the property scopes include/llvm/IR/VPIntrinsics.def are closed.		/// Check that the property scopes include/llvm/IR/VPIntrinsics.def are closed.
TEST_F(VPIntrinsicTest, VPIntrinsicsDefScopes) {		TEST_F(VPIntrinsicTest, VPIntrinsicsDefScopes) {
Optional<Intrinsic::ID> ScopeVPID;		Optional<Intrinsic::ID> ScopeVPID;
#define BEGIN_REGISTER_VP_INTRINSIC(VPID, ...) \		#define BEGIN_REGISTER_VP_INTRINSIC(VPID, ...) \
Show All 23 Lines
TEST_F(VPIntrinsicTest, VPModuleComplete) {		TEST_F(VPIntrinsicTest, VPModuleComplete) {
std::unique_ptr<Module> M = CreateVPDeclarationModule();		std::unique_ptr<Module> M = CreateVPDeclarationModule();
assert(M);		assert(M);

// Check that all @llvm.vp.* functions in the module are recognized vp		// Check that all @llvm.vp.* functions in the module are recognized vp
// intrinsics.		// intrinsics.
std::set<Intrinsic::ID> SeenIDs;		std::set<Intrinsic::ID> SeenIDs;
for (const auto &VPDecl : *M) {		for (const auto &VPDecl : *M) {
ASSERT_TRUE(VPDecl.isIntrinsic());		ASSERT_TRUE(VPDecl.isIntrinsic());
simollAuthorUnsubmitted Done Reply Inline Actions This assertion is redundantly checked in the `GetParamPos` test. simoll: This assertion is redundantly checked in the `GetParamPos` test.
ASSERT_TRUE(VPIntrinsic::isVPIntrinsic(VPDecl.getIntrinsicID()));		ASSERT_TRUE(VPIntrinsic::isVPIntrinsic(VPDecl.getIntrinsicID()));
SeenIDs.insert(VPDecl.getIntrinsicID());		SeenIDs.insert(VPDecl.getIntrinsicID());
}		}

// Check that every registered VP intrinsic has an instance in the test		// Check that every registered VP intrinsic has an instance in the test
// module.		// module.
#define BEGIN_REGISTER_VP_INTRINSIC(VPID, ...) \		#define BEGIN_REGISTER_VP_INTRINSIC(VPID, ...) \
ASSERT_TRUE(SeenIDs.count(Intrinsic::VPID));		ASSERT_TRUE(SeenIDs.count(Intrinsic::VPID));
Show All 34 Lines
" ret void "		" ret void "
"}",		"}",
Err, C);		Err, C);

auto *F = M->getFunction("test_static_vlen");		auto *F = M->getFunction("test_static_vlen");
assert(F);		assert(F);

const int NumExpected = 12;		const int NumExpected = 12;
const bool Expected[] = {false, true, false, false, false, true, false, false, true, false, true, false};		const bool Expected[] = {false, true, false, false, false, true,
		false, false, true, false, true, false};
int i = 0;		int i = 0;
for (auto &I : F->getEntryBlock()) {		for (auto &I : F->getEntryBlock()) {
VPIntrinsic *VPI = dyn_cast<VPIntrinsic>(&I);		VPIntrinsic *VPI = dyn_cast<VPIntrinsic>(&I);
if (!VPI)		if (!VPI)
continue;		continue;

ASSERT_LT(i, NumExpected);		ASSERT_LT(i, NumExpected);
ASSERT_EQ(Expected[i], VPI->canIgnoreVectorLengthParam());		ASSERT_EQ(Expected[i], VPI->canIgnoreVectorLengthParam());
++i;		++i;
}		}
}		}

/// Check that the argument returned by		/// Check that the argument returned by
/// VPIntrinsic::get<X>ParamPos(Intrinsic::ID) has the expected type.		/// VPIntrinsic::get<X>ParamPos(Intrinsic::ID) has the expected type.
TEST_F(VPIntrinsicTest, GetParamPos) {		TEST_F(VPIntrinsicTest, GetParamPos) {
std::unique_ptr<Module> M = CreateVPDeclarationModule();		std::unique_ptr<Module> M = CreateVPDeclarationModule();
assert(M);		assert(M);

for (Function &F : *M) {		for (Function &F : *M) {
ASSERT_TRUE(F.isIntrinsic());		ASSERT_TRUE(F.isIntrinsic());
frasercrmckUnsubmitted Done Reply Inline Actions Does this condition no longer hold? frasercrmck: Does this condition no longer hold?
Optional<unsigned> MaskParamPos =		Optional<unsigned> MaskParamPos =
VPIntrinsic::getMaskParamPos(F.getIntrinsicID());		VPIntrinsic::getMaskParamPos(F.getIntrinsicID());
if (MaskParamPos.hasValue()) {		if (MaskParamPos.hasValue()) {
Type *MaskParamType = F.getArg(MaskParamPos.getValue())->getType();		Type *MaskParamType = F.getArg(MaskParamPos.getValue())->getType();
ASSERT_TRUE(MaskParamType->isVectorTy());		ASSERT_TRUE(MaskParamType->isVectorTy());
ASSERT_TRUE(cast<VectorType>(MaskParamType)->getElementType()->isIntegerTy(1));		ASSERT_TRUE(
		cast<VectorType>(MaskParamType)->getElementType()->isIntegerTy(1));
}		}

Optional<unsigned> VecLenParamPos =		Optional<unsigned> VecLenParamPos =
VPIntrinsic::getVectorLengthParamPos(F.getIntrinsicID());		VPIntrinsic::getVectorLengthParamPos(F.getIntrinsicID());
if (VecLenParamPos.hasValue()) {		if (VecLenParamPos.hasValue()) {
Type *VecLenParamType = F.getArg(VecLenParamPos.getValue())->getType();		Type *VecLenParamType = F.getArg(VecLenParamPos.getValue())->getType();
ASSERT_TRUE(VecLenParamType->isIntegerTy(32));		ASSERT_TRUE(VecLenParamType->isIntegerTy(32));
}		}
▲ Show 20 Lines • Show All 80 Lines • ▼ Show 20 Lines	for (auto &F : *M) {
for (auto *ParamTy : FuncTy->params()) {		for (auto *ParamTy : FuncTy->params()) {
ASSERT_NE(ItNewParams, EndItNewParams);		ASSERT_NE(ItNewParams, EndItNewParams);
ASSERT_EQ(*ItNewParams, ParamTy);		ASSERT_EQ(*ItNewParams, ParamTy);
++ItNewParams;		++ItNewParams;
}		}
}		}
}		}

		/// Check that the HANDLE_VP_TO_CONSTRAINEDFP maps to an existing intrinsic with
		/// the right amount of metadata args.
		TEST_F(VPIntrinsicTest, HandleToConstrainedFP) {
		#define HANDLE_VP_TO_CONSTRAINEDFP(HASROUND, HASEXCEPT, CFPID) \
		{ \
		SmallVector<Intrinsic::IITDescriptor, 5> T; \
		Intrinsic::getIntrinsicInfoTableEntries(Intrinsic::CFPID, T); \
		unsigned NumMetadataArgs = 0; \
		for (auto TD : T) \
		NumMetadataArgs += (TD.Kind == Intrinsic::IITDescriptor::Metadata); \
		ASSERT_EQ(NumMetadataArgs, (unsigned)(HASROUND + HASEXCEPT)); \
		}
		#include "llvm/IR/VPIntrinsics.def"
		}

} // end anonymous namespace		} // end anonymous namespace

This is an archive of the discontinued LLVM Phabricator instance.

[VP] Binary floating-point intrinsics.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 351135

llvm/docs/LangRef.rst

llvm/include/llvm/IR/Intrinsics.td

llvm/include/llvm/IR/VPIntrinsics.def

llvm/test/Verifier/vp-intrinsics.ll

llvm/unittests/IR/VPIntrinsicTest.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[VP] Binary floating-point intrinsics.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 351135

llvm/docs/LangRef.rst

llvm/include/llvm/IR/Intrinsics.td

llvm/include/llvm/IR/VPIntrinsics.def

llvm/test/Verifier/vp-intrinsics.ll

llvm/unittests/IR/VPIntrinsicTest.cpp

[VP] Binary floating-point intrinsics.
ClosedPublic