This is an archive of the discontinued LLVM Phabricator instance.

- In clang, generate fcmps when appropriate for NEON intrinsics
- Fix legalization of scalarized strict FP vector operations
- Add some missing strict FP handling to AArch64TargetLowering
- Adjust the aarch64-neon-intrinsics-constrained.c clang test to expect the right output and un-XFAIL it

The other parts of this have been split off into D118257, D118258, and D118259. I've also added some extra testing, as the clang test aarch64-neon-intrinsics-constrained wasn't actually testing all of the changes here.

Harbormaster completed remote builds in B145999: Diff 403614. · Jan 27 2022, 10:47 AM

john.brawn added a child revision: D115620: [AArch64] Lowering and legalization of strict FP16. · Jan 31 2022, 2:42 AM

dmgreen added inline comments. · Feb 1 2022, 2:17 AM

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
1492	Can you split this into a separate patch? I know I sound like a broken record, but it doesn't seem to be related to the converts below. Also pre-committing as much of the test that works as possible would cut it down from this patch quite a bit.
3431	Some formatting apparently needs fixing up.
3682	Op.getOperand(1) -> In?
3703	dl is already defined.
3706	Op.getOperand(IsStrict ? 1 : 0) -> In?
3709	This isn't used anywhere

john.brawn added inline comments. · Feb 1 2022, 9:45 AM

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
1492	Instead of a separate patch just for these two, it would probably make more sense to move them into D114946 with the rest of the setOperationAction lines. On the test, without the changes in this patch it hits an assertion failure so as a separate commit before this it wouldn't be able to test anything.

dmgreen added inline comments. · Feb 1 2022, 1:07 PM

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
1492	D114946 is already pretty big :) What about the add_v4f32 test (for example) requires the changes in this patch? That's what I meant by "precommit as much as possible". My other question about this was going to be - why can't we use the vector instructions for STRICT_FSETCCS? The FCMGE and FCMGT look like they would set exception flags to me, but I may be misunderstanding some part of it.

john.brawn planned changes to this revision. · Feb 2 2022, 3:08 AM

john.brawn added inline comments.

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
1492	> D114946 is already pretty big :)

A bit, though only in terms of number of lines. Conceptually it's fairly simple, as it's almost all "strict_xyz can be handled the same as (non-strict) xyz".

> What about the add_v4f32 test (for example) requires the changes in this patch? That's what I meant by "precommit as much as possible".

I _could_ try and disentangle the parts that compile without this patch, but it seems like more work than it's worth.

> My other question about this was going to be - why can't we use the vector instructions for STRICT_FSETCCS? The FCMGE and FCMGT look like they would set exception flags to me, but I may be misunderstanding some part of it.

Hmm, it looks like the FCMXY instructions are inconsistent in their handling of NaNs. FCMEQ performs a quiet comparison (only signalling NaNs raise an exception) like FCMP and STRICT_FSETCC, whereas FCMGT etc. perform a signalling comparison (all NaNs raise an exception) like FCMPE and STRICT_FSETCCS. I'll adjust the comments (and also move this to D114946).
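The quiet/signalling distinction in the reply above can be modelled in a few lines. This is only a sketch, not LLVM code: `Operand`, `CompareKind`, `raisesInvalid` and `checkCompareKinds` are hypothetical names, and the grouping of instructions into the two kinds simply follows the description in the comment.

```cpp
#include <cassert>

// Hypothetical model for illustration only; none of these names exist in LLVM.
enum class Operand { Number, QuietNaN, SignallingNaN };

// Quiet: the behaviour described above for FCMP, FCMEQ and STRICT_FSETCC.
// Signalling: the behaviour described for FCMPE, FCMGT etc. and STRICT_FSETCCS.
enum class CompareKind { Quiet, Signalling };

// Returns true if comparing A with B raises the Invalid Operation exception.
bool raisesInvalid(CompareKind Kind, Operand A, Operand B) {
  bool AnySNaN = A == Operand::SignallingNaN || B == Operand::SignallingNaN;
  bool AnyNaN = AnySNaN || A == Operand::QuietNaN || B == Operand::QuietNaN;
  // A quiet comparison raises only for signalling NaNs; a signalling
  // comparison raises for any NaN.
  return Kind == CompareKind::Quiet ? AnySNaN : AnyNaN;
}

// Small self-check showing where the two kinds of comparison differ.
void checkCompareKinds() {
  assert(!raisesInvalid(CompareKind::Quiet, Operand::QuietNaN, Operand::Number));
  assert(raisesInvalid(CompareKind::Signalling, Operand::QuietNaN, Operand::Number));
  assert(raisesInvalid(CompareKind::Quiet, Operand::SignallingNaN, Operand::Number));
  assert(!raisesInvalid(CompareKind::Signalling, Operand::Number, Operand::Number));
}
```

The only case where the two kinds disagree is a quiet NaN operand, which is why a mismatch between the condition code and the instruction matters only for strict lowering.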

Update based on review comments and adjust formatting.

Harbormaster completed remote builds in B147095: Diff 405213. · Feb 2 2022, 4:15 AM

Thanks for the changes. LGTM

This revision is now accepted and ready to land. · Feb 7 2022, 7:39 AM

This revision was landed with ongoing or failed builds. · Feb 17 2022, 8:11 AM

Closed by commit rG8e17c9613f36: [AArch64] Add some missing strict FP vector lowering (authored by john.brawn).

This revision was automatically updated to reflect the committed changes.

john.brawn added a commit: rG8e17c9613f36: [AArch64] Add some missing strict FP vector lowering.

Revision Contents

Path                                                Size
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp     63 lines
llvm/test/CodeGen/AArch64/fp-intrinsics-vector.ll   886 lines

Diff 409663

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

@@ void AArch64TargetLowering::addTypeForNEON(MVT VT) {
   // Strict fp extend and trunc are legal
   if (VT.isFloatingPoint() && VT.getScalarSizeInBits() != 16)
     setOperationAction(ISD::STRICT_FP_EXTEND, VT, Legal);
   if (VT.isFloatingPoint() && VT.getScalarSizeInBits() != 64)
     setOperationAction(ISD::STRICT_FP_ROUND, VT, Legal);

   // FIXME: We could potentially make use of the vector comparison instructions
   // for STRICT_FSETCC and STRICT_FSETCSS, but there's a number of
   // complications:
   //  * FCMPEQ/NE are quiet comparisons, the rest are signalling comparisons,
   //    so we would need to expand when the condition code doesn't match the
   //    kind of comparison.
   //  * Some kinds of comparison require more than one FCMXY instruction so
   //    would need to be expanded instead.
   //  * The lowering of the non-strict versions involves target-specific ISD
   //    nodes so we would likely need to add strict versions of all of them and
   //    handle them appropriately.
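As an illustration of the second complication in the FIXME comment above, a predicate such as "ordered and not equal" has no single greater-than form and can instead be composed from two greater-than tests. The scalar sketch below assumes ordinary C++ relational operators as stand-ins for the vector compare instructions; the function names are hypothetical and this is not the backend's code.

```cpp
#include <cassert>
#include <limits>

// "Ordered and not equal": true only when neither input is NaN and A != B.
// Built from two greater-than tests, each of which is false if either
// operand is NaN.
bool orderedNotEqual(double A, double B) { return (A > B) || (B > A); }

// "Unordered or equal": the logical inverse of the predicate above.
bool unorderedOrEqual(double A, double B) { return !orderedNotEqual(A, B); }

// Small self-check covering ordered and unordered inputs.
void checkTwoCompareForms() {
  const double NaN = std::numeric_limits<double>::quiet_NaN();
  assert(orderedNotEqual(1.0, 2.0));
  assert(!orderedNotEqual(2.0, 2.0));
  assert(!orderedNotEqual(NaN, 1.0));
  assert(unorderedOrEqual(NaN, 1.0));
}
```

Each predicate here costs two comparisons plus a combining step, which is the kind of expansion the comment says would be needed for the strict variants.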
@@ SDValue AArch64TargetLowering::LowerFP_ROUND(SDValue Op,
   return SDValue();
 }

 SDValue AArch64TargetLowering::LowerVectorFP_TO_INT(SDValue Op,
                                                     SelectionDAG &DAG) const {
   // Warning: We maintain cost tables in AArch64TargetTransformInfo.cpp.
   // Any additional optimization in this function should be recorded
   // in the cost tables.
-  EVT InVT = Op.getOperand(0).getValueType();
+  bool IsStrict = Op->isStrictFPOpcode();
+  EVT InVT = Op.getOperand(IsStrict ? 1 : 0).getValueType();
   EVT VT = Op.getValueType();

   if (VT.isScalableVector()) {
     unsigned Opcode = Op.getOpcode() == ISD::FP_TO_UINT
                           ? AArch64ISD::FCVTZU_MERGE_PASSTHRU
                           : AArch64ISD::FCVTZS_MERGE_PASSTHRU;
     return LowerToPredicatedOp(Op, DAG, Opcode);
   }

   if (useSVEForFixedLengthVectorVT(VT) || useSVEForFixedLengthVectorVT(InVT))
     return LowerFixedLengthFPToIntToSVE(Op, DAG);

   unsigned NumElts = InVT.getVectorNumElements();

   // f16 conversions are promoted to f32 when full fp16 is not supported.
   if (InVT.getVectorElementType() == MVT::f16 &&
       !Subtarget->hasFullFP16()) {
     MVT NewVT = MVT::getVectorVT(MVT::f32, NumElts);
     SDLoc dl(Op);
+    if (IsStrict) {
+      SDValue Ext = DAG.getNode(ISD::STRICT_FP_EXTEND, dl, {NewVT, MVT::Other},
+                                {Op.getOperand(0), Op.getOperand(1)});
+      return DAG.getNode(Op.getOpcode(), dl, {VT, MVT::Other},
+                         {Ext.getValue(1), Ext.getValue(0)});
+    }
     return DAG.getNode(
         Op.getOpcode(), dl, Op.getValueType(),
         DAG.getNode(ISD::FP_EXTEND, dl, NewVT, Op.getOperand(0)));
   }

   uint64_t VTSize = VT.getFixedSizeInBits();
   uint64_t InVTSize = InVT.getFixedSizeInBits();
   if (VTSize < InVTSize) {
     SDLoc dl(Op);
+    if (IsStrict) {
+      InVT = InVT.changeVectorElementTypeToInteger();
+      SDValue Cv = DAG.getNode(Op.getOpcode(), dl, {InVT, MVT::Other},
+                               {Op.getOperand(0), Op.getOperand(1)});
+      SDValue Trunc = DAG.getNode(ISD::TRUNCATE, dl, VT, Cv);
+      return DAG.getMergeValues({Trunc, Cv.getValue(1)}, dl);
+    }
     SDValue Cv =
         DAG.getNode(Op.getOpcode(), dl, InVT.changeVectorElementTypeToInteger(),
                     Op.getOperand(0));
     return DAG.getNode(ISD::TRUNCATE, dl, VT, Cv);
   }

   if (VTSize > InVTSize) {
     SDLoc dl(Op);
     MVT ExtVT =
         MVT::getVectorVT(MVT::getFloatingPointVT(VT.getScalarSizeInBits()),
                          VT.getVectorNumElements());
+    if (IsStrict) {
+      SDValue Ext = DAG.getNode(ISD::STRICT_FP_EXTEND, dl,
+                                {ExtVT, MVT::Other},
+                                {Op.getOperand(0), Op.getOperand(1)});
+      return DAG.getNode(Op.getOpcode(), dl, {VT, MVT::Other},
+                         {Ext.getValue(1), Ext.getValue(0)});
+    }
     SDValue Ext = DAG.getNode(ISD::FP_EXTEND, dl, ExtVT, Op.getOperand(0));
     return DAG.getNode(Op.getOpcode(), dl, VT, Ext);
   }

+  // Use a scalar operation for conversions between single-element vectors of
+  // the same size.
+  if (NumElts == 1) {
+    SDLoc dl(Op);
+    SDValue Extract = DAG.getNode(
+        ISD::EXTRACT_VECTOR_ELT, dl, InVT.getScalarType(),
+        Op.getOperand(IsStrict ? 1 : 0), DAG.getConstant(0, dl, MVT::i64));
+    EVT ScalarVT = VT.getScalarType();
+    SDValue ScalarCvt;
+    if (IsStrict)
+      return DAG.getNode(Op.getOpcode(), dl, {ScalarVT, MVT::Other},
+                         {Op.getOperand(0), Extract});
+    return DAG.getNode(Op.getOpcode(), dl, ScalarVT, Extract);
+  }
+
   // Type changing conversions are illegal.
   return Op;
 }

 SDValue AArch64TargetLowering::LowerFP_TO_INT(SDValue Op,
                                               SelectionDAG &DAG) const {
   bool IsStrict = Op->isStrictFPOpcode();
   SDValue SrcVal = Op.getOperand(IsStrict ? 1 : 0);
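The two size-mismatch paths in `LowerVectorFP_TO_INT` can be pictured with scalars. The sketch below is only an analogy, assuming the input value is within range of the result type; `narrowingConvert`, `wideningConvert` and `checkFPToInt` are hypothetical names, not functions from the patch.

```cpp
#include <cassert>
#include <cstdint>

// Result narrower than input (e.g. f64 -> i32): convert at the input width
// first, then truncate the integer, mirroring FP_TO_INT followed by TRUNCATE.
int32_t narrowingConvert(double X) {
  int64_t Wide = static_cast<int64_t>(X); // same-width conversion (f64 -> i64)
  return static_cast<int32_t>(Wide);      // integer truncate (i64 -> i32)
}

// Result wider than input (e.g. f32 -> i64): extend the float first, then
// convert, mirroring FP_EXTEND followed by FP_TO_INT.
int64_t wideningConvert(float X) {
  double Ext = static_cast<double>(X); // exact extension (f32 -> f64)
  return static_cast<int64_t>(Ext);    // same-width conversion (f64 -> i64)
}

// Small self-check; both conversions round toward zero.
void checkFPToInt() {
  assert(narrowingConvert(3.9) == 3);
  assert(narrowingConvert(-2.5) == -2);
  assert(wideningConvert(-7.75f) == -7);
}
```

In the strict paths of the diff, the same two steps are chained: the first node takes the incoming chain (operand 0) and the second node consumes the first node's chain result, or the chain is returned alongside the truncated value via `getMergeValues`.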
@@ SDValue AArch64TargetLowering::LowerFP_TO_INT_SAT(SDValue Op,
   return DAG.getNode(ISD::TRUNCATE, DL, DstVT, Sat);
 }

 SDValue AArch64TargetLowering::LowerVectorINT_TO_FP(SDValue Op,
                                                     SelectionDAG &DAG) const {
   // Warning: We maintain cost tables in AArch64TargetTransformInfo.cpp.
   // Any additional optimization in this function should be recorded
   // in the cost tables.
+  bool IsStrict = Op->isStrictFPOpcode();
   EVT VT = Op.getValueType();
   SDLoc dl(Op);
-  SDValue In = Op.getOperand(0);
+  SDValue In = Op.getOperand(IsStrict ? 1 : 0);
   EVT InVT = In.getValueType();
   unsigned Opc = Op.getOpcode();
   bool IsSigned = Opc == ISD::SINT_TO_FP || Opc == ISD::STRICT_SINT_TO_FP;

   if (VT.isScalableVector()) {
     if (InVT.getVectorElementType() == MVT::i1) {
       // We can't directly extend an SVE predicate; extend it first.
       unsigned CastOpc = IsSigned ? ISD::SIGN_EXTEND : ISD::ZERO_EXTEND;
@@ if (useSVEForFixedLengthVectorVT(VT) || useSVEForFixedLengthVectorVT(InVT))
     return LowerFixedLengthIntToFPToSVE(Op, DAG);

   uint64_t VTSize = VT.getFixedSizeInBits();
   uint64_t InVTSize = InVT.getFixedSizeInBits();
   if (VTSize < InVTSize) {
     MVT CastVT =
         MVT::getVectorVT(MVT::getFloatingPointVT(InVT.getScalarSizeInBits()),
                          InVT.getVectorNumElements());
+    if (IsStrict) {
+      In = DAG.getNode(Opc, dl, {CastVT, MVT::Other},
+                       {Op.getOperand(0), In});
+      return DAG.getNode(
+          ISD::STRICT_FP_ROUND, dl, {VT, MVT::Other},
+          {In.getValue(1), In.getValue(0), DAG.getIntPtrConstant(0, dl)});
+    }
     In = DAG.getNode(Opc, dl, CastVT, In);
     return DAG.getNode(ISD::FP_ROUND, dl, VT, In, DAG.getIntPtrConstant(0, dl));
   }

   if (VTSize > InVTSize) {
     unsigned CastOpc = IsSigned ? ISD::SIGN_EXTEND : ISD::ZERO_EXTEND;
     EVT CastVT = VT.changeVectorElementTypeToInteger();
     In = DAG.getNode(CastOpc, dl, CastVT, In);
+    if (IsStrict)
+      return DAG.getNode(Opc, dl, {VT, MVT::Other}, {Op.getOperand(0), In});
     return DAG.getNode(Opc, dl, VT, In);
   }

+  // Use a scalar operation for conversions between single-element vectors of
+  // the same size.
+  if (VT.getVectorNumElements() == 1) {
+    SDValue Extract = DAG.getNode(
+        ISD::EXTRACT_VECTOR_ELT, dl, InVT.getScalarType(),
+        In, DAG.getConstant(0, dl, MVT::i64));
+    EVT ScalarVT = VT.getScalarType();
+    if (IsStrict)
+      return DAG.getNode(Op.getOpcode(), dl, {ScalarVT, MVT::Other},
+                         {Op.getOperand(0), Extract});
+    return DAG.getNode(Op.getOpcode(), dl, ScalarVT, Extract);
+  }
+
   return Op;
 }

 SDValue AArch64TargetLowering::LowerINT_TO_FP(SDValue Op,
                                               SelectionDAG &DAG) const {
   if (Op.getValueType().isVector())
     return LowerVectorINT_TO_FP(Op, DAG);
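The size-mismatch paths in `LowerVectorINT_TO_FP` follow the same pattern in the opposite direction. Again this is a scalar analogy rather than code from the patch, and `narrowingIntToFP`, `wideningIntToFP`, `wideningUIntToFP` and `checkIntToFP` are hypothetical names.

```cpp
#include <cassert>
#include <cstdint>

// Result narrower than input (e.g. i64 -> f32): convert at the input width,
// then round, mirroring INT_TO_FP followed by FP_ROUND.
float narrowingIntToFP(int64_t X) {
  double Wide = static_cast<double>(X); // same-width conversion (i64 -> f64)
  return static_cast<float>(Wide);      // round to the result type (f64 -> f32)
}

// Result wider than input, signed (e.g. i32 -> f64): sign-extend the integer,
// then convert, mirroring SIGN_EXTEND followed by SINT_TO_FP.
double wideningIntToFP(int32_t X) {
  int64_t Ext = static_cast<int64_t>(X); // sign extension (i32 -> i64)
  return static_cast<double>(Ext);       // same-width conversion (i64 -> f64)
}

// Result wider than input, unsigned: zero-extend instead, mirroring
// ZERO_EXTEND followed by UINT_TO_FP.
double wideningUIntToFP(uint32_t X) {
  uint64_t Ext = static_cast<uint64_t>(X); // zero extension (u32 -> u64)
  return static_cast<double>(Ext);         // same-width conversion (u64 -> f64)
}

// Small self-check using values that are exactly representable.
void checkIntToFP() {
  assert(narrowingIntToFP(16) == 16.0f);
  assert(wideningIntToFP(-5) == -5.0);
  assert(wideningUIntToFP(4294967295u) == 4294967295.0);
}
```

The choice between sign and zero extension corresponds to the `IsSigned` flag in the diff, which is why the strict opcodes are included when it is computed.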

llvm/test/CodeGen/AArch64/fp-intrinsics-vector.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc -mtriple=aarch64-none-eabi %s -disable-strictnode-mutation -o - | FileCheck %s
				; RUN: llc -mtriple=aarch64-none-eabi -global-isel=true -global-isel-abort=2 -disable-strictnode-mutation %s -o - | FileCheck %s

				; Check that constrained fp vector intrinsics are correctly lowered.


				; Single-precision intrinsics

				define <4 x float> @add_v4f32(<4 x float> %x, <4 x float> %y) #0 {
				; CHECK-LABEL: add_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fadd v0.4s, v0.4s, v1.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.fadd.v4f32(<4 x float> %x, <4 x float> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @sub_v4f32(<4 x float> %x, <4 x float> %y) #0 {
				; CHECK-LABEL: sub_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fsub v0.4s, v0.4s, v1.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.fsub.v4f32(<4 x float> %x, <4 x float> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @mul_v4f32(<4 x float> %x, <4 x float> %y) #0 {
				; CHECK-LABEL: mul_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fmul v0.4s, v0.4s, v1.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.fmul.v4f32(<4 x float> %x, <4 x float> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @div_v4f32(<4 x float> %x, <4 x float> %y) #0 {
				; CHECK-LABEL: div_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fdiv v0.4s, v0.4s, v1.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.fdiv.v4f32(<4 x float> %x, <4 x float> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @fma_v4f32(<4 x float> %x, <4 x float> %y, <4 x float> %z) #0 {
				; CHECK-LABEL: fma_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fmla v2.4s, v1.4s, v0.4s
				; CHECK-NEXT: mov v0.16b, v2.16b
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.fma.v4f32(<4 x float> %x, <4 x float> %y, <4 x float> %z, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x i32> @fptosi_v4i32_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: fptosi_v4i32_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzs v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x i32> @llvm.experimental.constrained.fptosi.v4i32.v4f32(<4 x float> %x, metadata !"fpexcept.strict") #0
				ret <4 x i32> %val
				}

				define <4 x i32> @fptoui_v4i32_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: fptoui_v4i32_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzu v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x i32> @llvm.experimental.constrained.fptoui.v4i32.v4f32(<4 x float> %x, metadata !"fpexcept.strict") #0
				ret <4 x i32> %val
				}

				define <4 x i64> @fptosi_v4i64_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: fptosi_v4i64_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: ext v1.16b, v0.16b, v0.16b, #8
				; CHECK-NEXT: fcvtl v0.2d, v0.2s
				; CHECK-NEXT: fcvtl v1.2d, v1.2s
				; CHECK-NEXT: fcvtzs v0.2d, v0.2d
				; CHECK-NEXT: fcvtzs v1.2d, v1.2d
				; CHECK-NEXT: ret
				%val = call <4 x i64> @llvm.experimental.constrained.fptosi.v4i64.v4f32(<4 x float> %x, metadata !"fpexcept.strict") #0
				ret <4 x i64> %val
				}

				define <4 x i64> @fptoui_v4i64_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: fptoui_v4i64_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: ext v1.16b, v0.16b, v0.16b, #8
				; CHECK-NEXT: fcvtl v0.2d, v0.2s
				; CHECK-NEXT: fcvtl v1.2d, v1.2s
				; CHECK-NEXT: fcvtzu v0.2d, v0.2d
				; CHECK-NEXT: fcvtzu v1.2d, v1.2d
				; CHECK-NEXT: ret
				%val = call <4 x i64> @llvm.experimental.constrained.fptoui.v4i64.v4f32(<4 x float> %x, metadata !"fpexcept.strict") #0
				ret <4 x i64> %val
				}

				define <4 x float> @sitofp_v4f32_v4i32(<4 x i32> %x) #0 {
				; CHECK-LABEL: sitofp_v4f32_v4i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: scvtf v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.sitofp.v4f32.v4i32(<4 x i32> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @uitofp_v4f32_v4i32(<4 x i32> %x) #0 {
				; CHECK-LABEL: uitofp_v4f32_v4i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: ucvtf v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.uitofp.v4f32.v4i32(<4 x i32> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @sitofp_v4f32_v4i64(<4 x i64> %x) #0 {
				; CHECK-LABEL: sitofp_v4f32_v4i64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: scvtf v0.2d, v0.2d
				; CHECK-NEXT: scvtf v1.2d, v1.2d
				; CHECK-NEXT: fcvtn v0.2s, v0.2d
				; CHECK-NEXT: fcvtn2 v0.4s, v1.2d
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.sitofp.v4f32.v4i64(<4 x i64> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @uitofp_v4f32_v4i64(<4 x i64> %x) #0 {
				; CHECK-LABEL: uitofp_v4f32_v4i64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: ucvtf v0.2d, v0.2d
				; CHECK-NEXT: ucvtf v1.2d, v1.2d
				; CHECK-NEXT: fcvtn v0.2s, v0.2d
				; CHECK-NEXT: fcvtn2 v0.4s, v1.2d
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.uitofp.v4f32.v4i64(<4 x i64> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @sqrt_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: sqrt_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fsqrt v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.sqrt.v4f32(<4 x float> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @rint_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: rint_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintx v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.rint.v4f32(<4 x float> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @nearbyint_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: nearbyint_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frinti v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.nearbyint.v4f32(<4 x float> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @maxnum_v4f32(<4 x float> %x, <4 x float> %y) #0 {
				; CHECK-LABEL: maxnum_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fmaxnm v0.4s, v0.4s, v1.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.maxnum.v4f32(<4 x float> %x, <4 x float> %y, metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @minnum_v4f32(<4 x float> %x, <4 x float> %y) #0 {
				; CHECK-LABEL: minnum_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fminnm v0.4s, v0.4s, v1.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.minnum.v4f32(<4 x float> %x, <4 x float> %y, metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @ceil_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: ceil_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintp v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.ceil.v4f32(<4 x float> %x, metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @floor_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: floor_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintm v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.floor.v4f32(<4 x float> %x, metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @round_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: round_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frinta v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.round.v4f32(<4 x float> %x, metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @roundeven_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: roundeven_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintn v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.roundeven.v4f32(<4 x float> %x, metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x float> @trunc_v4f32(<4 x float> %x) #0 {
				; CHECK-LABEL: trunc_v4f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintz v0.4s, v0.4s
				; CHECK-NEXT: ret
				%val = call <4 x float> @llvm.experimental.constrained.trunc.v4f32(<4 x float> %x, metadata !"fpexcept.strict") #0
				ret <4 x float> %val
				}

				define <4 x i1> @fcmp_v4f32(<4 x float> %x, <4 x float> %y) #0 {
				; CHECK-LABEL: fcmp_v4f32:
				; CHECK: // %bb.0: // %entry
				; CHECK-NEXT: mov s2, v1.s[1]
				; CHECK-NEXT: mov s3, v0.s[1]
				; CHECK-NEXT: fcmp s0, s1
				; CHECK-NEXT: mov s4, v1.s[2]
				; CHECK-NEXT: mov s5, v0.s[2]
				; CHECK-NEXT: mov s1, v1.s[3]
				; CHECK-NEXT: mov s0, v0.s[3]
				; CHECK-NEXT: csetm w8, eq
				; CHECK-NEXT: fcmp s3, s2
				; CHECK-NEXT: fmov s2, w8
				; CHECK-NEXT: csetm w8, eq
				; CHECK-NEXT: fcmp s5, s4
				; CHECK-NEXT: mov v2.s[1], w8
				; CHECK-NEXT: csetm w8, eq
				; CHECK-NEXT: fcmp s0, s1
				; CHECK-NEXT: mov v2.s[2], w8
				; CHECK-NEXT: csetm w8, eq
				; CHECK-NEXT: mov v2.s[3], w8
				; CHECK-NEXT: xtn v0.4h, v2.4s
				; CHECK-NEXT: ret
				entry:
				%val = call <4 x i1> @llvm.experimental.constrained.fcmp.v4f64(<4 x float> %x, <4 x float> %y, metadata !"oeq", metadata !"fpexcept.strict")
				ret <4 x i1> %val
				}

				define <4 x i1> @fcmps_v4f32(<4 x float> %x, <4 x float> %y) #0 {
				; CHECK-LABEL: fcmps_v4f32:
				; CHECK: // %bb.0: // %entry
				; CHECK-NEXT: mov s2, v1.s[1]
				; CHECK-NEXT: mov s3, v0.s[1]
				; CHECK-NEXT: fcmpe s0, s1
				; CHECK-NEXT: mov s4, v1.s[2]
				; CHECK-NEXT: mov s5, v0.s[2]
				; CHECK-NEXT: mov s1, v1.s[3]
				; CHECK-NEXT: mov s0, v0.s[3]
				; CHECK-NEXT: csetm w8, eq
				; CHECK-NEXT: fcmpe s3, s2
				; CHECK-NEXT: fmov s2, w8
				; CHECK-NEXT: csetm w8, eq
				; CHECK-NEXT: fcmpe s5, s4
				; CHECK-NEXT: mov v2.s[1], w8
				; CHECK-NEXT: csetm w8, eq
				; CHECK-NEXT: fcmpe s0, s1
				; CHECK-NEXT: mov v2.s[2], w8
				; CHECK-NEXT: csetm w8, eq
				; CHECK-NEXT: mov v2.s[3], w8
				; CHECK-NEXT: xtn v0.4h, v2.4s
				; CHECK-NEXT: ret
				entry:
				%val = call <4 x i1> @llvm.experimental.constrained.fcmps.v4f64(<4 x float> %x, <4 x float> %y, metadata !"oeq", metadata !"fpexcept.strict")
				ret <4 x i1> %val
				}


				; Double-precision intrinsics

				define <2 x double> @add_v2f64(<2 x double> %x, <2 x double> %y) #0 {
				; CHECK-LABEL: add_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fadd v0.2d, v0.2d, v1.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.fadd.v2f64(<2 x double> %x, <2 x double> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @sub_v2f64(<2 x double> %x, <2 x double> %y) #0 {
				; CHECK-LABEL: sub_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fsub v0.2d, v0.2d, v1.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.fsub.v2f64(<2 x double> %x, <2 x double> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @mul_v2f64(<2 x double> %x, <2 x double> %y) #0 {
				; CHECK-LABEL: mul_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fmul v0.2d, v0.2d, v1.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.fmul.v2f64(<2 x double> %x, <2 x double> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @div_v2f64(<2 x double> %x, <2 x double> %y) #0 {
				; CHECK-LABEL: div_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fdiv v0.2d, v0.2d, v1.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.fdiv.v2f64(<2 x double> %x, <2 x double> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @fma_v2f64(<2 x double> %x, <2 x double> %y, <2 x double> %z) #0 {
				; CHECK-LABEL: fma_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fmla v2.2d, v1.2d, v0.2d
				; CHECK-NEXT: mov v0.16b, v2.16b
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.fma.v2f64(<2 x double> %x, <2 x double> %y, <2 x double> %z, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x i32> @fptosi_v2i32_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: fptosi_v2i32_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzs v0.2d, v0.2d
				; CHECK-NEXT: xtn v0.2s, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x i32> @llvm.experimental.constrained.fptosi.v2i32.v2f64(<2 x double> %x, metadata !"fpexcept.strict") #0
				ret <2 x i32> %val
				}

				define <2 x i32> @fptoui_v2i32_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: fptoui_v2i32_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzu v0.2d, v0.2d
				; CHECK-NEXT: xtn v0.2s, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x i32> @llvm.experimental.constrained.fptoui.v2i32.v2f64(<2 x double> %x, metadata !"fpexcept.strict") #0
				ret <2 x i32> %val
				}

				define <2 x i64> @fptosi_v2i64_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: fptosi_v2i64_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzs v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x i64> @llvm.experimental.constrained.fptosi.v2i64.v2f64(<2 x double> %x, metadata !"fpexcept.strict") #0
				ret <2 x i64> %val
				}

				define <2 x i64> @fptoui_v2i64_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: fptoui_v2i64_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzu v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x i64> @llvm.experimental.constrained.fptoui.v2i64.v2f64(<2 x double> %x, metadata !"fpexcept.strict") #0
				ret <2 x i64> %val
				}

				define <2 x double> @sitofp_v2f64_v2i32(<2 x i32> %x) #0 {
				; CHECK-LABEL: sitofp_v2f64_v2i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: sshll v0.2d, v0.2s, #0
				; CHECK-NEXT: scvtf v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.sitofp.v2f64.v2i32(<2 x i32> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @uitofp_v2f64_v2i32(<2 x i32> %x) #0 {
				; CHECK-LABEL: uitofp_v2f64_v2i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: ushll v0.2d, v0.2s, #0
				; CHECK-NEXT: ucvtf v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.uitofp.v2f64.v2i32(<2 x i32> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @sitofp_v2f64_v2i64(<2 x i64> %x) #0 {
				; CHECK-LABEL: sitofp_v2f64_v2i64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: scvtf v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.sitofp.v2f64.v2i64(<2 x i64> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @uitofp_v2f64_v2i64(<2 x i64> %x) #0 {
				; CHECK-LABEL: uitofp_v2f64_v2i64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: ucvtf v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.uitofp.v2f64.v2i64(<2 x i64> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @sqrt_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: sqrt_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fsqrt v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.sqrt.v2f64(<2 x double> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @rint_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: rint_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintx v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.rint.v2f64(<2 x double> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @nearbyint_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: nearbyint_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frinti v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.nearbyint.v2f64(<2 x double> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @maxnum_v2f64(<2 x double> %x, <2 x double> %y) #0 {
				; CHECK-LABEL: maxnum_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fmaxnm v0.2d, v0.2d, v1.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.maxnum.v2f64(<2 x double> %x, <2 x double> %y, metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @minnum_v2f64(<2 x double> %x, <2 x double> %y) #0 {
				; CHECK-LABEL: minnum_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fminnm v0.2d, v0.2d, v1.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.minnum.v2f64(<2 x double> %x, <2 x double> %y, metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @ceil_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: ceil_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintp v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.ceil.v2f64(<2 x double> %x, metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @floor_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: floor_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintm v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.floor.v2f64(<2 x double> %x, metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @round_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: round_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frinta v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.round.v2f64(<2 x double> %x, metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @roundeven_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: roundeven_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintn v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.roundeven.v2f64(<2 x double> %x, metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

				define <2 x double> @trunc_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: trunc_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintz v0.2d, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.trunc.v2f64(<2 x double> %x, metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}

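				; Quiet (fcmp) and signaling (fcmps) comparisons. The expected output
				; scalarizes the vector compare, using FCMP for the quiet form and FCMPE
				; for the signaling form.
				define <2 x i1> @fcmp_v2f64(<2 x double> %x, <2 x double> %y) #0 {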
				; CHECK-LABEL: fcmp_v2f64:
				; CHECK: // %bb.0: // %entry
				; CHECK-NEXT: mov d2, v1.d[1]
				; CHECK-NEXT: mov d3, v0.d[1]
				; CHECK-NEXT: fcmp d0, d1
				; CHECK-NEXT: csetm x8, eq
				; CHECK-NEXT: fcmp d3, d2
				; CHECK-NEXT: fmov d0, x8
				; CHECK-NEXT: csetm x8, eq
				; CHECK-NEXT: mov v0.d[1], x8
				; CHECK-NEXT: xtn v0.2s, v0.2d
				; CHECK-NEXT: ret
				entry:
				%val = call <2 x i1> @llvm.experimental.constrained.fcmp.v2f64(<2 x double> %x, <2 x double> %y, metadata !"oeq", metadata !"fpexcept.strict")
				ret <2 x i1> %val
				}

				define <2 x i1> @fcmps_v2f64(<2 x double> %x, <2 x double> %y) #0 {
				; CHECK-LABEL: fcmps_v2f64:
				; CHECK: // %bb.0: // %entry
				; CHECK-NEXT: mov d2, v1.d[1]
				; CHECK-NEXT: mov d3, v0.d[1]
				; CHECK-NEXT: fcmpe d0, d1
				; CHECK-NEXT: csetm x8, eq
				; CHECK-NEXT: fcmpe d3, d2
				; CHECK-NEXT: fmov d0, x8
				; CHECK-NEXT: csetm x8, eq
				; CHECK-NEXT: mov v0.d[1], x8
				; CHECK-NEXT: xtn v0.2s, v0.2d
				; CHECK-NEXT: ret
				entry:
				%val = call <2 x i1> @llvm.experimental.constrained.fcmps.v2f64(<2 x double> %x, <2 x double> %y, metadata !"oeq", metadata !"fpexcept.strict")
				ret <2 x i1> %val
				}


				; Double-precision single element intrinsics

				define <1 x double> @add_v1f64(<1 x double> %x, <1 x double> %y) #0 {
				; CHECK-LABEL: add_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fadd d0, d0, d1
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.fadd.v1f64(<1 x double> %x, <1 x double> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @sub_v1f64(<1 x double> %x, <1 x double> %y) #0 {
				; CHECK-LABEL: sub_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fsub d0, d0, d1
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.fsub.v1f64(<1 x double> %x, <1 x double> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @mul_v1f64(<1 x double> %x, <1 x double> %y) #0 {
				; CHECK-LABEL: mul_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fmul d0, d0, d1
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.fmul.v1f64(<1 x double> %x, <1 x double> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @div_v1f64(<1 x double> %x, <1 x double> %y) #0 {
				; CHECK-LABEL: div_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fdiv d0, d0, d1
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.fdiv.v1f64(<1 x double> %x, <1 x double> %y, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @fma_v1f64(<1 x double> %x, <1 x double> %y, <1 x double> %z) #0 {
				; CHECK-LABEL: fma_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fmadd d0, d0, d1, d2
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.fma.v1f64(<1 x double> %x, <1 x double> %y, <1 x double> %z, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x i32> @fptosi_v1i32_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: fptosi_v1i32_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzs w8, d0
				; CHECK-NEXT: fmov s0, w8
				; CHECK-NEXT: ret
				%val = call <1 x i32> @llvm.experimental.constrained.fptosi.v1i32.v1f64(<1 x double> %x, metadata !"fpexcept.strict") #0
				ret <1 x i32> %val
				}

				define <1 x i32> @fptoui_v1i32_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: fptoui_v1i32_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzu w8, d0
				; CHECK-NEXT: fmov s0, w8
				; CHECK-NEXT: ret
				%val = call <1 x i32> @llvm.experimental.constrained.fptoui.v1i32.v1f64(<1 x double> %x, metadata !"fpexcept.strict") #0
				ret <1 x i32> %val
				}

				define <1 x i64> @fptosi_v1i64_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: fptosi_v1i64_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzs x8, d0
				; CHECK-NEXT: fmov d0, x8
				; CHECK-NEXT: ret
				%val = call <1 x i64> @llvm.experimental.constrained.fptosi.v1i64.v1f64(<1 x double> %x, metadata !"fpexcept.strict") #0
				ret <1 x i64> %val
				}

				define <1 x i64> @fptoui_v1i64_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: fptoui_v1i64_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtzu x8, d0
				; CHECK-NEXT: fmov d0, x8
				; CHECK-NEXT: ret
				%val = call <1 x i64> @llvm.experimental.constrained.fptoui.v1i64.v1f64(<1 x double> %x, metadata !"fpexcept.strict") #0
				ret <1 x i64> %val
				}

				define <1 x double> @sitofp_v1f64_v1i32(<1 x i32> %x) #0 {
				; CHECK-LABEL: sitofp_v1f64_v1i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: // kill: def $d0 killed $d0 def $q0
				; CHECK-NEXT: fmov w8, s0
				; CHECK-NEXT: scvtf d0, w8
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.sitofp.v1f64.v1i32(<1 x i32> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @uitofp_v1f64_v1i32(<1 x i32> %x) #0 {
				; CHECK-LABEL: uitofp_v1f64_v1i32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: // kill: def $d0 killed $d0 def $q0
				; CHECK-NEXT: fmov w8, s0
				; CHECK-NEXT: ucvtf d0, w8
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.uitofp.v1f64.v1i32(<1 x i32> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @sitofp_v1f64_v1i64(<1 x i64> %x) #0 {
				; CHECK-LABEL: sitofp_v1f64_v1i64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: // kill: def $d0 killed $d0 def $q0
				; CHECK-NEXT: fmov x8, d0
				; CHECK-NEXT: scvtf d0, x8
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.sitofp.v1f64.v1i64(<1 x i64> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @uitofp_v1f64_v1i64(<1 x i64> %x) #0 {
				; CHECK-LABEL: uitofp_v1f64_v1i64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: // kill: def $d0 killed $d0 def $q0
				; CHECK-NEXT: fmov x8, d0
				; CHECK-NEXT: ucvtf d0, x8
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.uitofp.v1f64.v1i64(<1 x i64> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @sqrt_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: sqrt_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fsqrt d0, d0
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.sqrt.v1f64(<1 x double> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @rint_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: rint_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintx d0, d0
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.rint.v1f64(<1 x double> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @nearbyint_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: nearbyint_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frinti d0, d0
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.nearbyint.v1f64(<1 x double> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @maxnum_v1f64(<1 x double> %x, <1 x double> %y) #0 {
				; CHECK-LABEL: maxnum_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fmaxnm d0, d0, d1
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.maxnum.v1f64(<1 x double> %x, <1 x double> %y, metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @minnum_v1f64(<1 x double> %x, <1 x double> %y) #0 {
				; CHECK-LABEL: minnum_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fminnm d0, d0, d1
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.minnum.v1f64(<1 x double> %x, <1 x double> %y, metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @ceil_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: ceil_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintp d0, d0
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.ceil.v1f64(<1 x double> %x, metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @floor_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: floor_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintm d0, d0
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.floor.v1f64(<1 x double> %x, metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @round_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: round_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frinta d0, d0
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.round.v1f64(<1 x double> %x, metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @roundeven_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: roundeven_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintn d0, d0
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.roundeven.v1f64(<1 x double> %x, metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x double> @trunc_v1f64(<1 x double> %x) #0 {
				; CHECK-LABEL: trunc_v1f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintz d0, d0
				; CHECK-NEXT: ret
				%val = call <1 x double> @llvm.experimental.constrained.trunc.v1f64(<1 x double> %x, metadata !"fpexcept.strict") #0
				ret <1 x double> %val
				}

				define <1 x i1> @fcmp_v1f64(<1 x double> %x, <1 x double> %y) #0 {
				; CHECK-LABEL: fcmp_v1f64:
				; CHECK: // %bb.0: // %entry
				; CHECK-NEXT: fcmp d0, d1
				; CHECK-NEXT: cset w0, eq
				; CHECK-NEXT: ret
				entry:
				%val = call <1 x i1> @llvm.experimental.constrained.fcmp.v1f64(<1 x double> %x, <1 x double> %y, metadata !"oeq", metadata !"fpexcept.strict")
				ret <1 x i1> %val
				}

				define <1 x i1> @fcmps_v1f64(<1 x double> %x, <1 x double> %y) #0 {
				; CHECK-LABEL: fcmps_v1f64:
				; CHECK: // %bb.0: // %entry
				; CHECK-NEXT: fcmpe d0, d1
				; CHECK-NEXT: cset w0, eq
				; CHECK-NEXT: ret
				entry:
				%val = call <1 x i1> @llvm.experimental.constrained.fcmps.v1f64(<1 x double> %x, <1 x double> %y, metadata !"oeq", metadata !"fpexcept.strict")
				ret <1 x i1> %val
				}


				; Intrinsics to convert between floating-point types

				define <2 x float> @fptrunc_v2f32_v2f64(<2 x double> %x) #0 {
				; CHECK-LABEL: fptrunc_v2f32_v2f64:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtn v0.2s, v0.2d
				; CHECK-NEXT: ret
				%val = call <2 x float> @llvm.experimental.constrained.fptrunc.v2f32.v2f64(<2 x double> %x, metadata !"round.tonearest", metadata !"fpexcept.strict") #0
				ret <2 x float> %val
				}

				define <2 x double> @fpext_v2f64_v2f32(<2 x float> %x) #0 {
				; CHECK-LABEL: fpext_v2f64_v2f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: fcvtl v0.2d, v0.2s
				; CHECK-NEXT: ret
				%val = call <2 x double> @llvm.experimental.constrained.fpext.v2f64.v2f32(<2 x float> %x, metadata !"fpexcept.strict") #0
				ret <2 x double> %val
				}


				attributes #0 = { strictfp }

				declare <4 x float> @llvm.experimental.constrained.fadd.v4f32(<4 x float>, <4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.fsub.v4f32(<4 x float>, <4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.fmul.v4f32(<4 x float>, <4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.fdiv.v4f32(<4 x float>, <4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.fma.v4f32(<4 x float>, <4 x float>, <4 x float>, metadata, metadata)
				declare <4 x i32> @llvm.experimental.constrained.fptosi.v4i32.v4f32(<4 x float>, metadata)
				declare <4 x i32> @llvm.experimental.constrained.fptoui.v4i32.v4f32(<4 x float>, metadata)
				declare <4 x i64> @llvm.experimental.constrained.fptosi.v4i64.v4f32(<4 x float>, metadata)
				declare <4 x i64> @llvm.experimental.constrained.fptoui.v4i64.v4f32(<4 x float>, metadata)
				declare <4 x float> @llvm.experimental.constrained.sitofp.v4f32.v4i32(<4 x i32>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.uitofp.v4f32.v4i32(<4 x i32>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.sitofp.v4f32.v4i64(<4 x i64>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.uitofp.v4f32.v4i64(<4 x i64>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.sqrt.v4f32(<4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.rint.v4f32(<4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.nearbyint.v4f32(<4 x float>, metadata, metadata)
				declare <4 x float> @llvm.experimental.constrained.maxnum.v4f32(<4 x float>, <4 x float>, metadata)
				declare <4 x float> @llvm.experimental.constrained.minnum.v4f32(<4 x float>, <4 x float>, metadata)
				declare <4 x float> @llvm.experimental.constrained.ceil.v4f32(<4 x float>, metadata)
				declare <4 x float> @llvm.experimental.constrained.floor.v4f32(<4 x float>, metadata)
				declare <4 x float> @llvm.experimental.constrained.round.v4f32(<4 x float>, metadata)
				declare <4 x float> @llvm.experimental.constrained.roundeven.v4f32(<4 x float>, metadata)
				declare <4 x float> @llvm.experimental.constrained.trunc.v4f32(<4 x float>, metadata)
				declare <4 x i1> @llvm.experimental.constrained.fcmp.v4f64(<4 x float>, <4 x float>, metadata, metadata)
				declare <4 x i1> @llvm.experimental.constrained.fcmps.v4f64(<4 x float>, <4 x float>, metadata, metadata)

				declare <2 x double> @llvm.experimental.constrained.fadd.v2f64(<2 x double>, <2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.fsub.v2f64(<2 x double>, <2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.fmul.v2f64(<2 x double>, <2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.fdiv.v2f64(<2 x double>, <2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.fma.v2f64(<2 x double>, <2 x double>, <2 x double>, metadata, metadata)
				declare <2 x i32> @llvm.experimental.constrained.fptosi.v2i32.v2f64(<2 x double>, metadata)
				declare <2 x i32> @llvm.experimental.constrained.fptoui.v2i32.v2f64(<2 x double>, metadata)
				declare <2 x i64> @llvm.experimental.constrained.fptosi.v2i64.v2f64(<2 x double>, metadata)
				declare <2 x i64> @llvm.experimental.constrained.fptoui.v2i64.v2f64(<2 x double>, metadata)
				declare <2 x double> @llvm.experimental.constrained.sitofp.v2f64.v2i32(<2 x i32>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.uitofp.v2f64.v2i32(<2 x i32>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.sitofp.v2f64.v2i64(<2 x i64>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.uitofp.v2f64.v2i64(<2 x i64>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.sqrt.v2f64(<2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.rint.v2f64(<2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.nearbyint.v2f64(<2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.maxnum.v2f64(<2 x double>, <2 x double>, metadata)
				declare <2 x double> @llvm.experimental.constrained.minnum.v2f64(<2 x double>, <2 x double>, metadata)
				declare <2 x double> @llvm.experimental.constrained.ceil.v2f64(<2 x double>, metadata)
				declare <2 x double> @llvm.experimental.constrained.floor.v2f64(<2 x double>, metadata)
				declare <2 x double> @llvm.experimental.constrained.round.v2f64(<2 x double>, metadata)
				declare <2 x double> @llvm.experimental.constrained.roundeven.v2f64(<2 x double>, metadata)
				declare <2 x double> @llvm.experimental.constrained.trunc.v2f64(<2 x double>, metadata)
				declare <2 x i1> @llvm.experimental.constrained.fcmp.v2f64(<2 x double>, <2 x double>, metadata, metadata)
				declare <2 x i1> @llvm.experimental.constrained.fcmps.v2f64(<2 x double>, <2 x double>, metadata, metadata)

				declare <1 x double> @llvm.experimental.constrained.fadd.v1f64(<1 x double>, <1 x double>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.fsub.v1f64(<1 x double>, <1 x double>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.fmul.v1f64(<1 x double>, <1 x double>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.fdiv.v1f64(<1 x double>, <1 x double>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.fma.v1f64(<1 x double>, <1 x double>, <1 x double>, metadata, metadata)
				declare <1 x i32> @llvm.experimental.constrained.fptosi.v1i32.v1f64(<1 x double>, metadata)
				declare <1 x i32> @llvm.experimental.constrained.fptoui.v1i32.v1f64(<1 x double>, metadata)
				declare <1 x i64> @llvm.experimental.constrained.fptosi.v1i64.v1f64(<1 x double>, metadata)
				declare <1 x i64> @llvm.experimental.constrained.fptoui.v1i64.v1f64(<1 x double>, metadata)
				declare <1 x double> @llvm.experimental.constrained.sitofp.v1f64.v1i32(<1 x i32>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.uitofp.v1f64.v1i32(<1 x i32>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.sitofp.v1f64.v1i64(<1 x i64>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.uitofp.v1f64.v1i64(<1 x i64>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.sqrt.v1f64(<1 x double>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.rint.v1f64(<1 x double>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.nearbyint.v1f64(<1 x double>, metadata, metadata)
				declare <1 x double> @llvm.experimental.constrained.maxnum.v1f64(<1 x double>, <1 x double>, metadata)
				declare <1 x double> @llvm.experimental.constrained.minnum.v1f64(<1 x double>, <1 x double>, metadata)
				declare <1 x double> @llvm.experimental.constrained.ceil.v1f64(<1 x double>, metadata)
				declare <1 x double> @llvm.experimental.constrained.floor.v1f64(<1 x double>, metadata)
				declare <1 x double> @llvm.experimental.constrained.round.v1f64(<1 x double>, metadata)
				declare <1 x double> @llvm.experimental.constrained.roundeven.v1f64(<1 x double>, metadata)
				declare <1 x double> @llvm.experimental.constrained.trunc.v1f64(<1 x double>, metadata)
				declare <1 x i1> @llvm.experimental.constrained.fcmp.v1f64(<1 x double>, <1 x double>, metadata, metadata)
				declare <1 x i1> @llvm.experimental.constrained.fcmps.v1f64(<1 x double>, <1 x double>, metadata, metadata)

				declare <2 x float> @llvm.experimental.constrained.fptrunc.v2f32.v2f64(<2 x double>, metadata, metadata)
				declare <2 x double> @llvm.experimental.constrained.fpext.v2f64.v2f32(<2 x float>, metadata)


[AArch64] Add some missing strict FP vector lowering
Closed, Public

Diff 409663

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

llvm/test/CodeGen/AArch64/fp-intrinsics-vector.ll
