This is an archive of the discontinued LLVM Phabricator instance.

Differential D98487

[AArch64][SVE/NEON] Add support for FROUNDEVEN for both NEON and fixed length SVE
ClosedPublic

Authored by bsmith on Mar 12 2021, 3:58 AM.

Details

Reviewers

paulwalker-arm
peterwaller-arm
joechrisellis
CarolineConcatto
dmgreen

Commits

rGcf0da91ba5e1: [AArch64][SVE/NEON] Add support for FROUNDEVEN for both NEON and fixed length…

Summary

Previously NEON used a target specific intrinsic for frintn, given that
the FROUNDEVEN ISD node now exists, move over to that instead and add
codegen support for that node for both NEON and fixed length SVE.
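For readers unfamiliar with the rounding mode involved: frintn and FROUNDEVEN both round to the nearest integral value, with exact ties going to the even neighbour. The following is a minimal scalar sketch of that behaviour, not code from the patch; `round_half_even` is a hypothetical helper name, and `std::round` is shown only for contrast because it breaks ties away from zero.

```cpp
#include <cmath>

// Hypothetical helper (not part of LLVM): round to the nearest integral
// value, breaking exact ties towards the even neighbour.
double round_half_even(double x) {
  // std::round rounds to nearest but breaks ties away from zero.
  double r = std::round(x);
  // An exact tie has a fractional part of exactly 0.5.
  if (std::fabs(x - std::trunc(x)) == 0.5) {
    // Halve, round, and double again so the result is always even.
    r = 2.0 * std::round(x / 2.0);
  }
  return r;
}
```

Under this sketch `round_half_even(2.5)` gives 2.0 and `round_half_even(3.5)` gives 4.0, whereas `std::round(2.5)` gives 3.0.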

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

bsmith created this revision. Mar 12 2021, 3:58 AM

Herald added subscribers: hiraditya, kristof.beyls, tschuett. Mar 12 2021, 3:58 AM

bsmith requested review of this revision. Mar 12 2021, 3:58 AM

Herald added projects: Restricted Project, Restricted Project. Mar 12 2021, 3:58 AM

Herald added subscribers: llvm-commits, cfe-commits.

Harbormaster completed remote builds in B93459: Diff 330201. Mar 12 2021, 5:46 AM

Hi @bsmith,

Thank you for adding me as a reviewer, although I don't think I am the most qualified person to approve or reject this patch.
But I have a question:
Why is this patch only changing int_aarch64_neon_frintn and not int_aarch64_sve_frintn?
Is there a particular reason to do so?
As you said in the commit message, the ISD node for FROUNDEVEN now exists.
If there is a reason, would it be too much to explain it in the commit message?
Thank you,
Carol

Why is this patch only changing int_aarch64_neon_frintn and not int_aarch64_sve_frintn?
Is there a particular reason to do so?

Things are done slightly differently for SVE in this regard. In principle, yes, we could emit roundeven instead of frintn from the ACLE intrinsic; however, all of the other ACLE intrinsics also emit SVE-specific LLVM intrinsics rather than the arch-independent nodes. This patch doesn't change that, in order to stay consistent. If we did want to change that, it should be done as a separate patch that changes all of them.

In D98487#2625673, @bsmith wrote:

Why is this patch only changing int_aarch64_neon_frintn and not int_aarch64_sve_frintn?
Is there a particular reason to do so?

Things are done slightly differently for SVE in this regard. In principle, yes, we could emit roundeven instead of frintn from the ACLE intrinsic; however, all of the other ACLE intrinsics also emit SVE-specific LLVM intrinsics rather than the arch-independent nodes. This patch doesn't change that, in order to stay consistent. If we did want to change that, it should be done as a separate patch that changes all of them.

@CarolineConcatto There are two levels at play here. At the top level (C->LLVM) the SVE ACLE cannot use the roundeven intrinsic because that operation takes a single data operand whereas for SVE the operation is predicated and thus also requires predicate and passthru operands (i.e. the two intrinsics are doing different things). At the bottom level (CodeGen) we already lower scalable vector variants of both intrinsics to ISD::FROUNDEVEN_MERGE_PASSTHRU which is the "masked" version of ISD::FROUNDEVEN.
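The difference between the two operations described above can be sketched with a scalar model. This is only an illustration under stated assumptions: the function names, the `std::array` lane model, and the argument order are hypothetical and are not LLVM or ACLE APIs, and `std::nearbyint` is used for per-lane rounding on the assumption that the default rounding mode (FE_TONEAREST) is in effect.

```cpp
#include <array>
#include <cmath>
#include <cstddef>

// Per-lane rounding to nearest, ties to even. std::nearbyint honours the
// current rounding mode; this sketch assumes the default (FE_TONEAREST).
inline double lane_roundeven(double x) { return std::nearbyint(x); }

// Unpredicated form: a single data operand, every lane is rounded.
template <std::size_t N>
std::array<double, N> roundeven_unpredicated(const std::array<double, N>& src) {
  std::array<double, N> out{};
  for (std::size_t i = 0; i < N; ++i)
    out[i] = lane_roundeven(src[i]);
  return out;
}

// Predicated, merging form: active lanes are rounded, inactive lanes take
// their value from the passthru operand.
template <std::size_t N>
std::array<double, N> roundeven_merge_passthru(
    const std::array<bool, N>& pred, const std::array<double, N>& src,
    const std::array<double, N>& passthru) {
  std::array<double, N> out{};
  for (std::size_t i = 0; i < N; ++i)
    out[i] = pred[i] ? lane_roundeven(src[i]) : passthru[i];
  return out;
}
```

With an all-true predicate the merging form produces the same lanes as the unpredicated one, which is roughly why a masked node can serve both intrinsics at the CodeGen level while the C-level signatures stay different.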

dmgreen added inline comments. Mar 15 2021, 4:49 AM

llvm/include/llvm/IR/IntrinsicsAArch64.td
476	If you are removing the old intrinsic (which is great), then it will need some AutoUpgrade code from the old to the new. Hopefully in this case that's pretty simple. Look for how aarch64.rbit is done.

Add AutoUpgrade code to convert aarch64.neon.frintn to roundeven
Add test for above AutoUpgrade
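The renaming that such an upgrade performs can be sketched on plain strings. This is a simplified stand-in, not the AutoUpgrade code: `upgrade_frintn_name` is a hypothetical function, it works on full intrinsic names including the `llvm.` prefix, and it assumes the overload suffix (for example `.v2f64`) carries over unchanged, which matches the CHECK lines in the updated tests.

```cpp
#include <string>

// Hypothetical string-level model of the upgrade: if `name` is an old
// llvm.aarch64.neon.frintn.* intrinsic, write the llvm.roundeven.* name into
// `new_name` and return true. Otherwise leave `new_name` untouched.
bool upgrade_frintn_name(const std::string& name, std::string& new_name) {
  const std::string old_prefix = "llvm.aarch64.neon.frintn";
  if (name.compare(0, old_prefix.size(), old_prefix) != 0)
    return false;
  // Keep the type suffix (e.g. ".v2f64") and swap only the base name.
  new_name = "llvm.roundeven" + name.substr(old_prefix.size());
  return true;
}
```

For example, `llvm.aarch64.neon.frintn.v2f64` maps to `llvm.roundeven.v2f64`, while an unrelated name such as `llvm.aarch64.neon.frint32x.v2f32` is left alone.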

Herald added a subscriber: dexonsmith. Mar 15 2021, 6:02 AM

bsmith marked an inline comment as done. Mar 15 2021, 6:02 AM

Harbormaster completed remote builds in B93795: Diff 330629. Mar 15 2021, 6:50 AM

Thanks. This looks sensible, from what I can tell.

llvm/include/llvm/Target/TargetSelectionDAG.td
158	Is this used? The one above should maybe say `// fpround`?

bsmith added inline comments. Mar 15 2021, 8:37 AM

llvm/include/llvm/Target/TargetSelectionDAG.td
158	No it's not, I added it for consistency, but perhaps I shouldn't? I think fround is correct for the one above, or at least is consistent with the others in this file, for example fextend below.

dmgreen added inline comments. Mar 16 2021, 2:08 AM

llvm/include/llvm/Target/TargetSelectionDAG.td
158	It's used below in `def fpround : SDNode<"ISD::FP_ROUND" , SDTFPRoundOp>;`, so it looks like it's used with the fptrunc instruction, not the fround intrinsic. I see your point about fextend... I would say they should both be changed to fpextend/fpround, for consistency with the nodes they act upon.

Remove SDTFPRoundEvenOp as it's not a correct mirror of SDTFPRoundOp since that is not for ISD::FROUND.
Fix comments in include/llvm/Target/TargetSelectionDAG.td for SDTFPRoundOp and SDTFPExtendOp.
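The naming confusion discussed above comes from two unrelated operations sharing the word "round". A rough scalar analogy, using hypothetical function names that are not LLVM APIs: FP_ROUND narrows the floating-point type (as fptrunc does in IR), while FROUND keeps the type and rounds the value to an integral one.

```cpp
#include <cmath>

// Analogy for ISD::FP_ROUND: the type gets narrower and the value may lose
// precision, but it is not rounded to an integer.
float fp_round_model(double x) { return static_cast<float>(x); }

// Analogy for ISD::FROUND: the type is unchanged and the value is rounded to
// the nearest integral value, with ties going away from zero.
double fround_model(double x) { return std::round(x); }
```

Here `fp_round_model(2.5)` is still 2.5 (as a float), whereas `fround_model(2.5)` is 3.0, which is why a type profile requiring a smaller result type fits the former but not the latter.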

Harbormaster completed remote builds in B94026: Diff 330951. Mar 16 2021, 6:15 AM

Thanks. LGTM, if no one else has comments.

This revision is now accepted and ready to land. Mar 16 2021, 8:01 AM

Mainly focused on the SVE side of things, which looks good to me.

This revision was landed with ongoing or failed builds. Mar 17 2021, 4:41 AM

Closed by commit rGcf0da91ba5e1: [AArch64][SVE/NEON] Add support for FROUNDEVEN for both NEON and fixed length… (authored by bsmith).

This revision was automatically updated to reflect the committed changes.

bsmith added a commit: rGcf0da91ba5e1: [AArch64][SVE/NEON] Add support for FROUNDEVEN for both NEON and fixed length….

dmgreen mentioned this in D131547: [Clang][AArch64] Use generic extract/insert vector for svget/svset/svcreate tuples. Aug 15 2022, 7:16 AM

Revision Contents

Path                                                         Size
clang/lib/CodeGen/CGBuiltin.cpp                              12 lines
clang/test/CodeGen/aarch64-neon-intrinsics.c                 2 lines
clang/test/CodeGen/aarch64-neon-misc.c                       2 lines
clang/test/CodeGen/aarch64-v8.2a-fp16-intrinsics.c           2 lines
clang/test/CodeGen/aarch64-v8.2a-neon-intrinsics.c           4 lines
clang/test/CodeGen/arm-neon-directed-rounding.c              6 lines
clang/test/CodeGen/arm64-vrnd.c                              2 lines
llvm/include/llvm/IR/IntrinsicsAArch64.td                    4 lines
llvm/include/llvm/Target/TargetSelectionDAG.td               10 lines
llvm/lib/IR/AutoUpgrade.cpp                                  5 lines
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp              9 lines
llvm/lib/Target/AArch64/AArch64InstrInfo.td                  7 lines
llvm/test/CodeGen/AArch64/arm64-vcvt.ll                      12 lines
llvm/test/CodeGen/AArch64/arm64-vfloatintrinsics.ll          30 lines
llvm/test/CodeGen/AArch64/f16-instructions.ll                27 lines
llvm/test/CodeGen/AArch64/fp-intrinsics.ll                   16 lines
llvm/test/CodeGen/AArch64/frintn.ll                          41 lines
llvm/test/CodeGen/AArch64/sve-fixed-length-fp-rounding.ll    266 lines
llvm/test/CodeGen/AArch64/vec-libcalls.ll                    10 lines

Diff 331224

clang/lib/CodeGen/CGBuiltin.cpp

   case NEON::BI__builtin_neon_vrndmq_v: {
     Int = Builder.getIsFPConstrained()
               ? Intrinsic::experimental_constrained_floor
               : Intrinsic::floor;
     return EmitNeonCall(CGM.getIntrinsic(Int, Ty), Ops, "vrndm");
   }
   case NEON::BI__builtin_neon_vrndnh_f16: {
     Ops.push_back(EmitScalarExpr(E->getArg(0)));
-    Int = Intrinsic::aarch64_neon_frintn;
+    Int = Builder.getIsFPConstrained()
+              ? Intrinsic::experimental_constrained_roundeven
+              : Intrinsic::roundeven;
     return EmitNeonCall(CGM.getIntrinsic(Int, HalfTy), Ops, "vrndn");
   }
   case NEON::BI__builtin_neon_vrndn_v:
   case NEON::BI__builtin_neon_vrndnq_v: {
-    Int = Intrinsic::aarch64_neon_frintn;
+    Int = Builder.getIsFPConstrained()
+              ? Intrinsic::experimental_constrained_roundeven
+              : Intrinsic::roundeven;
     return EmitNeonCall(CGM.getIntrinsic(Int, Ty), Ops, "vrndn");
   }
   case NEON::BI__builtin_neon_vrndns_f32: {
     Ops.push_back(EmitScalarExpr(E->getArg(0)));
-    Int = Intrinsic::aarch64_neon_frintn;
+    Int = Builder.getIsFPConstrained()
+              ? Intrinsic::experimental_constrained_roundeven
+              : Intrinsic::roundeven;
     return EmitNeonCall(CGM.getIntrinsic(Int, FloatTy), Ops, "vrndn");
   }
   case NEON::BI__builtin_neon_vrndph_f16: {
     Ops.push_back(EmitScalarExpr(E->getArg(0)));
     Int = Builder.getIsFPConstrained()
               ? Intrinsic::experimental_constrained_ceil
               : Intrinsic::ceil;
     return EmitNeonCall(CGM.getIntrinsic(Int, HalfTy), Ops, "vrndp");

clang/test/CodeGen/aarch64-neon-intrinsics.c

 // CHECK: [[VCVT_N1:%.*]] = call <1 x double> @llvm.aarch64.neon.vcvtfxu2fp.v1f64.v1i64(<1 x i64> [[VCVT_N]], i32 64)
 // CHECK: ret <1 x double> [[VCVT_N1]]
 float64x1_t test_vcvt_n_f64_u64(uint64x1_t a) {
   return vcvt_n_f64_u64(a, 64);
 }

 // CHECK-LABEL: @test_vrndn_f64(
 // CHECK: [[TMP0:%.*]] = bitcast <1 x double> %a to <8 x i8>
-// CHECK: [[VRNDN1_I:%.*]] = call <1 x double> @llvm.aarch64.neon.frintn.v1f64(<1 x double> %a)
+// CHECK: [[VRNDN1_I:%.*]] = call <1 x double> @llvm.roundeven.v1f64(<1 x double> %a)
 // CHECK: ret <1 x double> [[VRNDN1_I]]
 float64x1_t test_vrndn_f64(float64x1_t a) {
   return vrndn_f64(a);
 }

 // CHECK-LABEL: @test_vrnda_f64(
 // CHECK: [[TMP0:%.*]] = bitcast <1 x double> %a to <8 x i8>
 // CHECK: [[VRNDA1_I:%.*]] = call <1 x double> @llvm.round.v1f64(<1 x double> %a)

clang/test/CodeGen/aarch64-neon-misc.c

 // CHECK: [[VCVT_I_I:%.*]] = fpext <2 x float> [[SHUFFLE_I_I]] to <2 x double>
 // CHECK: ret <2 x double> [[VCVT_I_I]]
 float64x2_t test_vcvt_high_f64_f32(float32x4_t a) {
   return vcvt_high_f64_f32(a);
 }

 // CHECK-LABEL: @test_vrndnq_f64(
 // CHECK: [[TMP0:%.*]] = bitcast <2 x double> %a to <16 x i8>
-// CHECK: [[VRNDN1_I:%.*]] = call <2 x double> @llvm.aarch64.neon.frintn.v2f64(<2 x double> %a)
+// CHECK: [[VRNDN1_I:%.*]] = call <2 x double> @llvm.roundeven.v2f64(<2 x double> %a)
 // CHECK: ret <2 x double> [[VRNDN1_I]]
 float64x2_t test_vrndnq_f64(float64x2_t a) {
   return vrndnq_f64(a);
 }

 // CHECK-LABEL: @test_vrndaq_f64(
 // CHECK: [[TMP0:%.*]] = bitcast <2 x double> %a to <16 x i8>
 // CHECK: [[VRNDA1_I:%.*]] = call <2 x double> @llvm.round.v2f64(<2 x double> %a)

clang/test/CodeGen/aarch64-v8.2a-fp16-intrinsics.c

 // CHECK-LABEL: test_vrndmh_f16
 // CHECK: [[RND:%.*]] = call half @llvm.floor.f16(half %a)
 // CHECK: ret half [[RND]]
 float16_t test_vrndmh_f16(float16_t a) {
   return vrndmh_f16(a);
 }

 // CHECK-LABEL: test_vrndnh_f16
-// CHECK: [[RND:%.*]] = call half @llvm.aarch64.neon.frintn.f16(half %a)
+// CHECK: [[RND:%.*]] = call half @llvm.roundeven.f16(half %a)
 // CHECK: ret half [[RND]]
 float16_t test_vrndnh_f16(float16_t a) {
   return vrndnh_f16(a);
 }

 // CHECK-LABEL: test_vrndph_f16
 // CHECK: [[RND:%.*]] = call half @llvm.ceil.f16(half %a)
 // CHECK: ret half [[RND]]

clang/test/CodeGen/aarch64-v8.2a-neon-intrinsics.c

 // CHECK-LABEL: test_vrndmq_f16
 // CHECK: [[RND:%.*]] = call <8 x half> @llvm.floor.v8f16(<8 x half> %a)
 // CHECK: ret <8 x half> [[RND]]
 float16x8_t test_vrndmq_f16(float16x8_t a) {
   return vrndmq_f16(a);
 }

 // CHECK-LABEL: test_vrndn_f16
-// CHECK: [[RND:%.*]] = call <4 x half> @llvm.aarch64.neon.frintn.v4f16(<4 x half> %a)
+// CHECK: [[RND:%.*]] = call <4 x half> @llvm.roundeven.v4f16(<4 x half> %a)
 // CHECK: ret <4 x half> [[RND]]
 float16x4_t test_vrndn_f16(float16x4_t a) {
   return vrndn_f16(a);
 }

 // CHECK-LABEL: test_vrndnq_f16
-// CHECK: [[RND:%.*]] = call <8 x half> @llvm.aarch64.neon.frintn.v8f16(<8 x half> %a)
+// CHECK: [[RND:%.*]] = call <8 x half> @llvm.roundeven.v8f16(<8 x half> %a)
 // CHECK: ret <8 x half> [[RND]]
 float16x8_t test_vrndnq_f16(float16x8_t a) {
   return vrndnq_f16(a);
 }

 // CHECK-LABEL: test_vrndp_f16
 // CHECK: [[RND:%.*]] = call <4 x half> @llvm.ceil.v4f16(<4 x half> %a)
 // CHECK: ret <4 x half> [[RND]]

clang/test/CodeGen/arm-neon-directed-rounding.c

 // CHECK-A64: [[VRNDMQ_V1_I:%.*]] = call <4 x float> @llvm.floor.v4f32(<4 x float> %a)
 // CHECK: ret <4 x float> [[VRNDMQ_V1_I]]
 float32x4_t test_vrndmq_f32(float32x4_t a) {
   return vrndmq_f32(a);
 }

 // CHECK-LABEL: define{{.*}} <2 x float> @test_vrndn_f32(<2 x float> %a)
 // CHECK-A32: [[VRNDN_V1_I:%.*]] = call <2 x float> @llvm.arm.neon.vrintn.v2f32(<2 x float> %a)
-// CHECK-A64: [[VRNDN_V1_I:%.*]] = call <2 x float> @llvm.aarch64.neon.frintn.v2f32(<2 x float> %a)
+// CHECK-A64: [[VRNDN_V1_I:%.*]] = call <2 x float> @llvm.roundeven.v2f32(<2 x float> %a)
 // CHECK: ret <2 x float> [[VRNDN_V1_I]]
 float32x2_t test_vrndn_f32(float32x2_t a) {
   return vrndn_f32(a);
 }

 // CHECK-LABEL: define{{.*}} <4 x float> @test_vrndnq_f32(<4 x float> %a)
 // CHECK-A32: [[VRNDNQ_V1_I:%.*]] = call <4 x float> @llvm.arm.neon.vrintn.v4f32(<4 x float> %a)
-// CHECK-A64: [[VRNDNQ_V1_I:%.*]] = call <4 x float> @llvm.aarch64.neon.frintn.v4f32(<4 x float> %a)
+// CHECK-A64: [[VRNDNQ_V1_I:%.*]] = call <4 x float> @llvm.roundeven.v4f32(<4 x float> %a)
 // CHECK: ret <4 x float> [[VRNDNQ_V1_I]]
 float32x4_t test_vrndnq_f32(float32x4_t a) {
   return vrndnq_f32(a);
 }

 // CHECK-LABEL: define{{.*}} <2 x float> @test_vrndp_f32(<2 x float> %a)
 // CHECK-A32: [[VRNDP_V1_I:%.*]] = call <2 x float> @llvm.arm.neon.vrintp.v2f32(<2 x float> %a)
 // CHECK-A64: [[VRNDP_V1_I:%.*]] = call <2 x float> @llvm.ceil.v2f32(<2 x float> %a)
[...]
 // CHECK-A64: [[VRNDQ_V1_I:%.*]] = call <4 x float> @llvm.trunc.v4f32(<4 x float> %a)
 // CHECK: ret <4 x float> [[VRNDQ_V1_I]]
 float32x4_t test_vrndq_f32(float32x4_t a) {
   return vrndq_f32(a);
 }

 // CHECK-LABEL: define{{.*}} float @test_vrndns_f32(float %a)
 // CHECK-A32: [[VRNDN_I:%.*]] = call float @llvm.arm.neon.vrintn.f32(float %a)
-// CHECK-A64: [[VRNDN_I:%.*]] = call float @llvm.aarch64.neon.frintn.f32(float %a)
+// CHECK-A64: [[VRNDN_I:%.*]] = call float @llvm.roundeven.f32(float %a)
 // CHECK: ret float [[VRNDN_I]]
 float32_t test_vrndns_f32(float32_t a) {
   return vrndns_f32(a);
 }

 // CHECK-LABEL: define{{.*}} <2 x float> @test_vrndi_f32(<2 x float> %a)
 // CHECK: [[TMP0:%.*]] = bitcast <2 x float> %a to <8 x i8>
 // CHECK: [[VRNDI1_I:%.*]] = call <2 x float> @llvm.nearbyint.v2f32(<2 x float> %a)

clang/test/CodeGen/arm64-vrnd.c

 // RUN: %clang_cc1 -triple arm64-apple-ios7 -target-feature +neon -ffreestanding -flax-vector-conversions=none -emit-llvm -o - %s | FileCheck %s

 #include <arm_neon.h>

 float64x2_t rnd5(float64x2_t a) { return vrndq_f64(a); }
 // CHECK: call <2 x double> @llvm.trunc.v2f64(<2 x double>

 float64x2_t rnd9(float64x2_t a) { return vrndnq_f64(a); }
-// CHECK: call <2 x double> @llvm.aarch64.neon.frintn.v2f64(<2 x double>
+// CHECK: call <2 x double> @llvm.roundeven.v2f64(<2 x double>

 float64x2_t rnd13(float64x2_t a) { return vrndmq_f64(a); }
 // CHECK: call <2 x double> @llvm.floor.v2f64(<2 x double>

 float64x2_t rnd18(float64x2_t a) { return vrndpq_f64(a); }
 // CHECK: call <2 x double> @llvm.ceil.v2f64(<2 x double>

 float64x2_t rnd22(float64x2_t a) { return vrndaq_f64(a); }
 // CHECK: call <2 x double> @llvm.round.v2f64(<2 x double>

 float64x2_t rnd25(float64x2_t a) { return vrndxq_f64(a); }
 // CHECK: call <2 x double> @llvm.rint.v2f64(<2 x double>

llvm/include/llvm/IR/IntrinsicsAArch64.td

 let TargetPrefix = "aarch64", IntrProperties = [IntrNoMem] in {
   def int_aarch64_neon_fcvtmu : AdvSIMD_FPToIntRounding_Intrinsic;
   def int_aarch64_neon_fcvtns : AdvSIMD_FPToIntRounding_Intrinsic;
   def int_aarch64_neon_fcvtnu : AdvSIMD_FPToIntRounding_Intrinsic;
   def int_aarch64_neon_fcvtps : AdvSIMD_FPToIntRounding_Intrinsic;
   def int_aarch64_neon_fcvtpu : AdvSIMD_FPToIntRounding_Intrinsic;
   def int_aarch64_neon_fcvtzs : AdvSIMD_FPToIntRounding_Intrinsic;
   def int_aarch64_neon_fcvtzu : AdvSIMD_FPToIntRounding_Intrinsic;

-  // Vector FP Rounding: only ties to even is unrepresented by a normal
-  // intrinsic.
-  def int_aarch64_neon_frintn : AdvSIMD_1FloatArg_Intrinsic;

   // v8.5-A Vector FP Rounding
   def int_aarch64_neon_frint32x : AdvSIMD_1FloatArg_Intrinsic;
   def int_aarch64_neon_frint32z : AdvSIMD_1FloatArg_Intrinsic;
   def int_aarch64_neon_frint64x : AdvSIMD_1FloatArg_Intrinsic;
   def int_aarch64_neon_frint64z : AdvSIMD_1FloatArg_Intrinsic;

   // Scalar FP->Int conversions

llvm/include/llvm/Target/TargetSelectionDAG.td

 def SDTIntExtendOp : SDTypeProfile<1, 1, [ // sext, zext, anyext
   SDTCisInt<0>, SDTCisInt<1>, SDTCisOpSmallerThanOp<1, 0>, SDTCisSameNumEltsAs<0, 1>
 ]>;
 def SDTIntTruncOp : SDTypeProfile<1, 1, [ // trunc
   SDTCisInt<0>, SDTCisInt<1>, SDTCisOpSmallerThanOp<0, 1>, SDTCisSameNumEltsAs<0, 1>
 ]>;
 def SDTFPUnaryOp : SDTypeProfile<1, 1, [ // fneg, fsqrt, etc
   SDTCisSameAs<0, 1>, SDTCisFP<0>
 ]>;
-def SDTFPRoundOp : SDTypeProfile<1, 1, [ // fround
+def SDTFPRoundOp : SDTypeProfile<1, 1, [ // fpround
   SDTCisFP<0>, SDTCisFP<1>, SDTCisOpSmallerThanOp<0, 1>, SDTCisSameNumEltsAs<0, 1>
 ]>;
-def SDTFPExtendOp : SDTypeProfile<1, 1, [ // fextend
+def SDTFPExtendOp : SDTypeProfile<1, 1, [ // fpextend
   SDTCisFP<0>, SDTCisFP<1>, SDTCisOpSmallerThanOp<1, 0>, SDTCisSameNumEltsAs<0, 1>
 ]>;
 def SDTIntToFPOp : SDTypeProfile<1, 1, [ // [su]int_to_fp
   SDTCisFP<0>, SDTCisInt<1>, SDTCisSameNumEltsAs<0, 1>
 ]>;
 def SDTFPToIntOp : SDTypeProfile<1, 1, [ // fp_to_[su]int
   SDTCisInt<0>, SDTCisFP<1>, SDTCisSameNumEltsAs<0, 1>
 ]>;
[...]
 def fpow : SDNode<"ISD::FPOW" , SDTFPBinOp>;
 def flog2 : SDNode<"ISD::FLOG2" , SDTFPUnaryOp>;
 def frint : SDNode<"ISD::FRINT" , SDTFPUnaryOp>;
 def ftrunc : SDNode<"ISD::FTRUNC" , SDTFPUnaryOp>;
 def fceil : SDNode<"ISD::FCEIL" , SDTFPUnaryOp>;
 def ffloor : SDNode<"ISD::FFLOOR" , SDTFPUnaryOp>;
 def fnearbyint : SDNode<"ISD::FNEARBYINT" , SDTFPUnaryOp>;
 def fround : SDNode<"ISD::FROUND" , SDTFPUnaryOp>;
+def froundeven : SDNode<"ISD::FROUNDEVEN" , SDTFPUnaryOp>;

 def lround : SDNode<"ISD::LROUND" , SDTFPToIntOp>;
 def llround : SDNode<"ISD::LLROUND" , SDTFPToIntOp>;
 def lrint : SDNode<"ISD::LRINT" , SDTFPToIntOp>;
 def llrint : SDNode<"ISD::LLRINT" , SDTFPToIntOp>;

 def fpround : SDNode<"ISD::FP_ROUND" , SDTFPRoundOp>;
 def fpextend : SDNode<"ISD::FP_EXTEND" , SDTFPExtendOp>;
[...]
 def strict_ffloor : SDNode<"ISD::STRICT_FFLOOR",
                            SDTFPUnaryOp, [SDNPHasChain]>;
 def strict_lround : SDNode<"ISD::STRICT_LROUND",
                            SDTFPToIntOp, [SDNPHasChain]>;
 def strict_llround : SDNode<"ISD::STRICT_LLROUND",
                             SDTFPToIntOp, [SDNPHasChain]>;
 def strict_fround : SDNode<"ISD::STRICT_FROUND",
                            SDTFPUnaryOp, [SDNPHasChain]>;
+def strict_froundeven : SDNode<"ISD::STRICT_FROUNDEVEN",
+                               SDTFPUnaryOp, [SDNPHasChain]>;
 def strict_ftrunc : SDNode<"ISD::STRICT_FTRUNC",
                            SDTFPUnaryOp, [SDNPHasChain]>;
 def strict_fminnum : SDNode<"ISD::STRICT_FMINNUM",
                             SDTFPBinOp, [SDNPHasChain,
                                          SDNPCommutative, SDNPAssociative]>;
 def strict_fmaxnum : SDNode<"ISD::STRICT_FMAXNUM",
                             SDTFPBinOp, [SDNPHasChain,
                                          SDNPCommutative, SDNPAssociative]>;
[...]
 def any_lround : PatFrags<(ops node:$src),
                           [(strict_lround node:$src),
                            (lround node:$src)]>;
 def any_llround : PatFrags<(ops node:$src),
                            [(strict_llround node:$src),
                             (llround node:$src)]>;
 def any_fround : PatFrags<(ops node:$src),
                           [(strict_fround node:$src),
                            (fround node:$src)]>;
+def any_froundeven : PatFrags<(ops node:$src),
+                              [(strict_froundeven node:$src),
+                               (froundeven node:$src)]>;
 def any_ftrunc : PatFrags<(ops node:$src),
                           [(strict_ftrunc node:$src),
                            (ftrunc node:$src)]>;
 def any_fmaxnum : PatFrags<(ops node:$lhs, node:$rhs),
                            [(strict_fmaxnum node:$lhs, node:$rhs),
                             (fmaxnum node:$lhs, node:$rhs)]>;
 def any_fminnum : PatFrags<(ops node:$lhs, node:$rhs),
                            [(strict_fminnum node:$lhs, node:$rhs),

llvm/lib/IR/AutoUpgrade.cpp

Show First 20 Lines • Show All 542 Lines • ▼ Show 20 Lines	static bool UpgradeIntrinsicFunction1(Function F, Function &NewFn) {
switch (Name[0]) {		switch (Name[0]) {
default: break;		default: break;
case 'a': {		case 'a': {
if (Name.startswith("arm.rbit") \|\| Name.startswith("aarch64.rbit")) {		if (Name.startswith("arm.rbit") \|\| Name.startswith("aarch64.rbit")) {
NewFn = Intrinsic::getDeclaration(F->getParent(), Intrinsic::bitreverse,		NewFn = Intrinsic::getDeclaration(F->getParent(), Intrinsic::bitreverse,
F->arg_begin()->getType());		F->arg_begin()->getType());
return true;		return true;
}		}
		if (Name.startswith("aarch64.neon.frintn")) {
		NewFn = Intrinsic::getDeclaration(F->getParent(), Intrinsic::roundeven,
		F->arg_begin()->getType());
		return true;
		}
if (Name.startswith("arm.neon.vclz")) {		if (Name.startswith("arm.neon.vclz")) {
Type* args[2] = {		Type* args[2] = {
F->arg_begin()->getType(),		F->arg_begin()->getType(),
Type::getInt1Ty(F->getContext())		Type::getInt1Ty(F->getContext())
};		};
// Can't use Intrinsic::getDeclaration here as it adds a ".i1" to		// Can't use Intrinsic::getDeclaration here as it adds a ".i1" to
// the end of the name. Change name from llvm.arm.neon.vclz.* to		// the end of the name. Change name from llvm.arm.neon.vclz.* to
// llvm.ctlz.*		// llvm.ctlz.*
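The auto-upgrade hunk above redirects any function whose name starts with `aarch64.neon.frintn` to the generic `roundeven` intrinsic, overloaded on the first argument's type. As a rough illustration only, the name mapping can be sketched without any LLVM dependency. `upgradedIntrinsicName` and `checkUpgradeExamples` are hypothetical helpers, not LLVM API, and the sketch assumes the name arrives with its `llvm.` prefix already stripped, as the `startswith` checks in the diff suggest. The real code rebuilds the declaration from the argument type through `Intrinsic::getDeclaration` rather than copying the suffix.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for the prefix check added in AutoUpgrade.cpp.
// Returns the upgraded name, or an empty string when no upgrade applies.
inline std::string upgradedIntrinsicName(const std::string &Name) {
  static const std::string OldPrefix = "aarch64.neon.frintn";
  // compare() yields 0 only when Name begins with OldPrefix.
  if (Name.compare(0, OldPrefix.size(), OldPrefix) != 0)
    return std::string();
  // Simplification: reuse the type suffix (e.g. ".v2f32") verbatim.
  return "roundeven" + Name.substr(OldPrefix.size());
}

// Spot checks mirroring the renames visible in arm64-vcvt.ll.
inline void checkUpgradeExamples() {
  assert(upgradedIntrinsicName("aarch64.neon.frintn.v2f32") == "roundeven.v2f32");
  assert(upgradedIntrinsicName("aarch64.neon.frintn.v2f64") == "roundeven.v2f64");
  // Other NEON intrinsics are left alone.
  assert(upgradedIntrinsicName("aarch64.neon.frecpe.v2f32").empty());
}
```

Calling `checkUpgradeExamples()` from a `main` runs the checks. Old IR that still declares `@llvm.aarch64.neon.frintn.*` is expected to go through this kind of remapping when loaded.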

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

[...]	if (!Subtarget->hasFullFP16()) {
setOperationAction(ISD::FNEG, MVT::f16, Promote);		setOperationAction(ISD::FNEG, MVT::f16, Promote);
setOperationAction(ISD::FABS, MVT::f16, Promote);		setOperationAction(ISD::FABS, MVT::f16, Promote);
setOperationAction(ISD::FCEIL, MVT::f16, Promote);		setOperationAction(ISD::FCEIL, MVT::f16, Promote);
setOperationAction(ISD::FSQRT, MVT::f16, Promote);		setOperationAction(ISD::FSQRT, MVT::f16, Promote);
setOperationAction(ISD::FFLOOR, MVT::f16, Promote);		setOperationAction(ISD::FFLOOR, MVT::f16, Promote);
setOperationAction(ISD::FNEARBYINT, MVT::f16, Promote);		setOperationAction(ISD::FNEARBYINT, MVT::f16, Promote);
setOperationAction(ISD::FRINT, MVT::f16, Promote);		setOperationAction(ISD::FRINT, MVT::f16, Promote);
setOperationAction(ISD::FROUND, MVT::f16, Promote);		setOperationAction(ISD::FROUND, MVT::f16, Promote);
		setOperationAction(ISD::FROUNDEVEN, MVT::f16, Promote);
setOperationAction(ISD::FTRUNC, MVT::f16, Promote);		setOperationAction(ISD::FTRUNC, MVT::f16, Promote);
setOperationAction(ISD::FMINNUM, MVT::f16, Promote);		setOperationAction(ISD::FMINNUM, MVT::f16, Promote);
setOperationAction(ISD::FMAXNUM, MVT::f16, Promote);		setOperationAction(ISD::FMAXNUM, MVT::f16, Promote);
setOperationAction(ISD::FMINIMUM, MVT::f16, Promote);		setOperationAction(ISD::FMINIMUM, MVT::f16, Promote);
setOperationAction(ISD::FMAXIMUM, MVT::f16, Promote);		setOperationAction(ISD::FMAXIMUM, MVT::f16, Promote);

// promote v4f16 to v4f32 when that is known to be safe.		// promote v4f16 to v4f32 when that is known to be safe.
setOperationAction(ISD::FADD, MVT::v4f16, Promote);		setOperationAction(ISD::FADD, MVT::v4f16, Promote);
setOperationAction(ISD::FSUB, MVT::v4f16, Promote);		setOperationAction(ISD::FSUB, MVT::v4f16, Promote);
setOperationAction(ISD::FMUL, MVT::v4f16, Promote);		setOperationAction(ISD::FMUL, MVT::v4f16, Promote);
setOperationAction(ISD::FDIV, MVT::v4f16, Promote);		setOperationAction(ISD::FDIV, MVT::v4f16, Promote);
AddPromotedToType(ISD::FADD, MVT::v4f16, MVT::v4f32);		AddPromotedToType(ISD::FADD, MVT::v4f16, MVT::v4f32);
AddPromotedToType(ISD::FSUB, MVT::v4f16, MVT::v4f32);		AddPromotedToType(ISD::FSUB, MVT::v4f16, MVT::v4f32);
AddPromotedToType(ISD::FMUL, MVT::v4f16, MVT::v4f32);		AddPromotedToType(ISD::FMUL, MVT::v4f16, MVT::v4f32);
AddPromotedToType(ISD::FDIV, MVT::v4f16, MVT::v4f32);		AddPromotedToType(ISD::FDIV, MVT::v4f16, MVT::v4f32);

setOperationAction(ISD::FABS, MVT::v4f16, Expand);		setOperationAction(ISD::FABS, MVT::v4f16, Expand);
setOperationAction(ISD::FNEG, MVT::v4f16, Expand);		setOperationAction(ISD::FNEG, MVT::v4f16, Expand);
setOperationAction(ISD::FROUND, MVT::v4f16, Expand);		setOperationAction(ISD::FROUND, MVT::v4f16, Expand);
		setOperationAction(ISD::FROUNDEVEN, MVT::v4f16, Expand);
setOperationAction(ISD::FMA, MVT::v4f16, Expand);		setOperationAction(ISD::FMA, MVT::v4f16, Expand);
setOperationAction(ISD::SETCC, MVT::v4f16, Expand);		setOperationAction(ISD::SETCC, MVT::v4f16, Expand);
setOperationAction(ISD::BR_CC, MVT::v4f16, Expand);		setOperationAction(ISD::BR_CC, MVT::v4f16, Expand);
setOperationAction(ISD::SELECT, MVT::v4f16, Expand);		setOperationAction(ISD::SELECT, MVT::v4f16, Expand);
setOperationAction(ISD::SELECT_CC, MVT::v4f16, Expand);		setOperationAction(ISD::SELECT_CC, MVT::v4f16, Expand);
setOperationAction(ISD::FTRUNC, MVT::v4f16, Expand);		setOperationAction(ISD::FTRUNC, MVT::v4f16, Expand);
setOperationAction(ISD::FCOPYSIGN, MVT::v4f16, Expand);		setOperationAction(ISD::FCOPYSIGN, MVT::v4f16, Expand);
setOperationAction(ISD::FFLOOR, MVT::v4f16, Expand);		setOperationAction(ISD::FFLOOR, MVT::v4f16, Expand);
setOperationAction(ISD::FCEIL, MVT::v4f16, Expand);		setOperationAction(ISD::FCEIL, MVT::v4f16, Expand);
setOperationAction(ISD::FRINT, MVT::v4f16, Expand);		setOperationAction(ISD::FRINT, MVT::v4f16, Expand);
setOperationAction(ISD::FNEARBYINT, MVT::v4f16, Expand);		setOperationAction(ISD::FNEARBYINT, MVT::v4f16, Expand);
setOperationAction(ISD::FSQRT, MVT::v4f16, Expand);		setOperationAction(ISD::FSQRT, MVT::v4f16, Expand);

setOperationAction(ISD::FABS, MVT::v8f16, Expand);		setOperationAction(ISD::FABS, MVT::v8f16, Expand);
setOperationAction(ISD::FADD, MVT::v8f16, Expand);		setOperationAction(ISD::FADD, MVT::v8f16, Expand);
setOperationAction(ISD::FCEIL, MVT::v8f16, Expand);		setOperationAction(ISD::FCEIL, MVT::v8f16, Expand);
setOperationAction(ISD::FCOPYSIGN, MVT::v8f16, Expand);		setOperationAction(ISD::FCOPYSIGN, MVT::v8f16, Expand);
setOperationAction(ISD::FDIV, MVT::v8f16, Expand);		setOperationAction(ISD::FDIV, MVT::v8f16, Expand);
setOperationAction(ISD::FFLOOR, MVT::v8f16, Expand);		setOperationAction(ISD::FFLOOR, MVT::v8f16, Expand);
setOperationAction(ISD::FMA, MVT::v8f16, Expand);		setOperationAction(ISD::FMA, MVT::v8f16, Expand);
setOperationAction(ISD::FMUL, MVT::v8f16, Expand);		setOperationAction(ISD::FMUL, MVT::v8f16, Expand);
setOperationAction(ISD::FNEARBYINT, MVT::v8f16, Expand);		setOperationAction(ISD::FNEARBYINT, MVT::v8f16, Expand);
setOperationAction(ISD::FNEG, MVT::v8f16, Expand);		setOperationAction(ISD::FNEG, MVT::v8f16, Expand);
setOperationAction(ISD::FROUND, MVT::v8f16, Expand);		setOperationAction(ISD::FROUND, MVT::v8f16, Expand);
		setOperationAction(ISD::FROUNDEVEN, MVT::v8f16, Expand);
setOperationAction(ISD::FRINT, MVT::v8f16, Expand);		setOperationAction(ISD::FRINT, MVT::v8f16, Expand);
setOperationAction(ISD::FSQRT, MVT::v8f16, Expand);		setOperationAction(ISD::FSQRT, MVT::v8f16, Expand);
setOperationAction(ISD::FSUB, MVT::v8f16, Expand);		setOperationAction(ISD::FSUB, MVT::v8f16, Expand);
setOperationAction(ISD::FTRUNC, MVT::v8f16, Expand);		setOperationAction(ISD::FTRUNC, MVT::v8f16, Expand);
setOperationAction(ISD::SETCC, MVT::v8f16, Expand);		setOperationAction(ISD::SETCC, MVT::v8f16, Expand);
setOperationAction(ISD::BR_CC, MVT::v8f16, Expand);		setOperationAction(ISD::BR_CC, MVT::v8f16, Expand);
setOperationAction(ISD::SELECT, MVT::v8f16, Expand);		setOperationAction(ISD::SELECT, MVT::v8f16, Expand);
setOperationAction(ISD::SELECT_CC, MVT::v8f16, Expand);		setOperationAction(ISD::SELECT_CC, MVT::v8f16, Expand);
setOperationAction(ISD::FP_EXTEND, MVT::v8f16, Expand);		setOperationAction(ISD::FP_EXTEND, MVT::v8f16, Expand);
}		}

// AArch64 has implementations of a lot of rounding-like FP operations.		// AArch64 has implementations of a lot of rounding-like FP operations.
for (MVT Ty : {MVT::f32, MVT::f64}) {		for (MVT Ty : {MVT::f32, MVT::f64}) {
setOperationAction(ISD::FFLOOR, Ty, Legal);		setOperationAction(ISD::FFLOOR, Ty, Legal);
setOperationAction(ISD::FNEARBYINT, Ty, Legal);		setOperationAction(ISD::FNEARBYINT, Ty, Legal);
setOperationAction(ISD::FCEIL, Ty, Legal);		setOperationAction(ISD::FCEIL, Ty, Legal);
setOperationAction(ISD::FRINT, Ty, Legal);		setOperationAction(ISD::FRINT, Ty, Legal);
setOperationAction(ISD::FTRUNC, Ty, Legal);		setOperationAction(ISD::FTRUNC, Ty, Legal);
setOperationAction(ISD::FROUND, Ty, Legal);		setOperationAction(ISD::FROUND, Ty, Legal);
		setOperationAction(ISD::FROUNDEVEN, Ty, Legal);
setOperationAction(ISD::FMINNUM, Ty, Legal);		setOperationAction(ISD::FMINNUM, Ty, Legal);
setOperationAction(ISD::FMAXNUM, Ty, Legal);		setOperationAction(ISD::FMAXNUM, Ty, Legal);
setOperationAction(ISD::FMINIMUM, Ty, Legal);		setOperationAction(ISD::FMINIMUM, Ty, Legal);
setOperationAction(ISD::FMAXIMUM, Ty, Legal);		setOperationAction(ISD::FMAXIMUM, Ty, Legal);
setOperationAction(ISD::LROUND, Ty, Legal);		setOperationAction(ISD::LROUND, Ty, Legal);
setOperationAction(ISD::LLROUND, Ty, Legal);		setOperationAction(ISD::LLROUND, Ty, Legal);
setOperationAction(ISD::LRINT, Ty, Legal);		setOperationAction(ISD::LRINT, Ty, Legal);
setOperationAction(ISD::LLRINT, Ty, Legal);		setOperationAction(ISD::LLRINT, Ty, Legal);
}		}

if (Subtarget->hasFullFP16()) {		if (Subtarget->hasFullFP16()) {
setOperationAction(ISD::FNEARBYINT, MVT::f16, Legal);		setOperationAction(ISD::FNEARBYINT, MVT::f16, Legal);
setOperationAction(ISD::FFLOOR, MVT::f16, Legal);		setOperationAction(ISD::FFLOOR, MVT::f16, Legal);
setOperationAction(ISD::FCEIL, MVT::f16, Legal);		setOperationAction(ISD::FCEIL, MVT::f16, Legal);
setOperationAction(ISD::FRINT, MVT::f16, Legal);		setOperationAction(ISD::FRINT, MVT::f16, Legal);
setOperationAction(ISD::FTRUNC, MVT::f16, Legal);		setOperationAction(ISD::FTRUNC, MVT::f16, Legal);
setOperationAction(ISD::FROUND, MVT::f16, Legal);		setOperationAction(ISD::FROUND, MVT::f16, Legal);
		setOperationAction(ISD::FROUNDEVEN, MVT::f16, Legal);
setOperationAction(ISD::FMINNUM, MVT::f16, Legal);		setOperationAction(ISD::FMINNUM, MVT::f16, Legal);
setOperationAction(ISD::FMAXNUM, MVT::f16, Legal);		setOperationAction(ISD::FMAXNUM, MVT::f16, Legal);
setOperationAction(ISD::FMINIMUM, MVT::f16, Legal);		setOperationAction(ISD::FMINIMUM, MVT::f16, Legal);
setOperationAction(ISD::FMAXIMUM, MVT::f16, Legal);		setOperationAction(ISD::FMAXIMUM, MVT::f16, Legal);
}		}

setOperationAction(ISD::PREFETCH, MVT::Other, Custom);		setOperationAction(ISD::PREFETCH, MVT::Other, Custom);

[...]	if (Subtarget->hasNEON()) {
setOperationAction(ISD::FFLOOR, MVT::v1f64, Expand);		setOperationAction(ISD::FFLOOR, MVT::v1f64, Expand);
setOperationAction(ISD::FMA, MVT::v1f64, Expand);		setOperationAction(ISD::FMA, MVT::v1f64, Expand);
setOperationAction(ISD::FMUL, MVT::v1f64, Expand);		setOperationAction(ISD::FMUL, MVT::v1f64, Expand);
setOperationAction(ISD::FNEARBYINT, MVT::v1f64, Expand);		setOperationAction(ISD::FNEARBYINT, MVT::v1f64, Expand);
setOperationAction(ISD::FNEG, MVT::v1f64, Expand);		setOperationAction(ISD::FNEG, MVT::v1f64, Expand);
setOperationAction(ISD::FPOW, MVT::v1f64, Expand);		setOperationAction(ISD::FPOW, MVT::v1f64, Expand);
setOperationAction(ISD::FREM, MVT::v1f64, Expand);		setOperationAction(ISD::FREM, MVT::v1f64, Expand);
setOperationAction(ISD::FROUND, MVT::v1f64, Expand);		setOperationAction(ISD::FROUND, MVT::v1f64, Expand);
		setOperationAction(ISD::FROUNDEVEN, MVT::v1f64, Expand);
setOperationAction(ISD::FRINT, MVT::v1f64, Expand);		setOperationAction(ISD::FRINT, MVT::v1f64, Expand);
setOperationAction(ISD::FSIN, MVT::v1f64, Expand);		setOperationAction(ISD::FSIN, MVT::v1f64, Expand);
setOperationAction(ISD::FSINCOS, MVT::v1f64, Expand);		setOperationAction(ISD::FSINCOS, MVT::v1f64, Expand);
setOperationAction(ISD::FSQRT, MVT::v1f64, Expand);		setOperationAction(ISD::FSQRT, MVT::v1f64, Expand);
setOperationAction(ISD::FSUB, MVT::v1f64, Expand);		setOperationAction(ISD::FSUB, MVT::v1f64, Expand);
setOperationAction(ISD::FTRUNC, MVT::v1f64, Expand);		setOperationAction(ISD::FTRUNC, MVT::v1f64, Expand);
setOperationAction(ISD::SETCC, MVT::v1f64, Expand);		setOperationAction(ISD::SETCC, MVT::v1f64, Expand);
setOperationAction(ISD::BR_CC, MVT::v1f64, Expand);		setOperationAction(ISD::BR_CC, MVT::v1f64, Expand);
[...]	if (Subtarget->hasNEON()) {
// AArch64 has implementations of a lot of rounding-like FP operations.		// AArch64 has implementations of a lot of rounding-like FP operations.
for (MVT Ty : {MVT::v2f32, MVT::v4f32, MVT::v2f64}) {		for (MVT Ty : {MVT::v2f32, MVT::v4f32, MVT::v2f64}) {
setOperationAction(ISD::FFLOOR, Ty, Legal);		setOperationAction(ISD::FFLOOR, Ty, Legal);
setOperationAction(ISD::FNEARBYINT, Ty, Legal);		setOperationAction(ISD::FNEARBYINT, Ty, Legal);
setOperationAction(ISD::FCEIL, Ty, Legal);		setOperationAction(ISD::FCEIL, Ty, Legal);
setOperationAction(ISD::FRINT, Ty, Legal);		setOperationAction(ISD::FRINT, Ty, Legal);
setOperationAction(ISD::FTRUNC, Ty, Legal);		setOperationAction(ISD::FTRUNC, Ty, Legal);
setOperationAction(ISD::FROUND, Ty, Legal);		setOperationAction(ISD::FROUND, Ty, Legal);
		setOperationAction(ISD::FROUNDEVEN, Ty, Legal);
}		}

if (Subtarget->hasFullFP16()) {		if (Subtarget->hasFullFP16()) {
for (MVT Ty : {MVT::v4f16, MVT::v8f16}) {		for (MVT Ty : {MVT::v4f16, MVT::v8f16}) {
setOperationAction(ISD::FFLOOR, Ty, Legal);		setOperationAction(ISD::FFLOOR, Ty, Legal);
setOperationAction(ISD::FNEARBYINT, Ty, Legal);		setOperationAction(ISD::FNEARBYINT, Ty, Legal);
setOperationAction(ISD::FCEIL, Ty, Legal);		setOperationAction(ISD::FCEIL, Ty, Legal);
setOperationAction(ISD::FRINT, Ty, Legal);		setOperationAction(ISD::FRINT, Ty, Legal);
setOperationAction(ISD::FTRUNC, Ty, Legal);		setOperationAction(ISD::FTRUNC, Ty, Legal);
setOperationAction(ISD::FROUND, Ty, Legal);		setOperationAction(ISD::FROUND, Ty, Legal);
		setOperationAction(ISD::FROUNDEVEN, Ty, Legal);
}		}
}		}

if (Subtarget->hasSVE())		if (Subtarget->hasSVE())
setOperationAction(ISD::VSCALE, MVT::i32, Custom);		setOperationAction(ISD::VSCALE, MVT::i32, Custom);

setTruncStoreAction(MVT::v4i16, MVT::v4i8, Custom);		setTruncStoreAction(MVT::v4i16, MVT::v4i8, Custom);
}		}
[...]	void AArch64TargetLowering::addTypeForFixedLengthSVE(MVT VT) {
setOperationAction(ISD::FMAXNUM, VT, Custom);		setOperationAction(ISD::FMAXNUM, VT, Custom);
setOperationAction(ISD::FMINIMUM, VT, Custom);		setOperationAction(ISD::FMINIMUM, VT, Custom);
setOperationAction(ISD::FMINNUM, VT, Custom);		setOperationAction(ISD::FMINNUM, VT, Custom);
setOperationAction(ISD::FMUL, VT, Custom);		setOperationAction(ISD::FMUL, VT, Custom);
setOperationAction(ISD::FNEARBYINT, VT, Custom);		setOperationAction(ISD::FNEARBYINT, VT, Custom);
setOperationAction(ISD::FNEG, VT, Custom);		setOperationAction(ISD::FNEG, VT, Custom);
setOperationAction(ISD::FRINT, VT, Custom);		setOperationAction(ISD::FRINT, VT, Custom);
setOperationAction(ISD::FROUND, VT, Custom);		setOperationAction(ISD::FROUND, VT, Custom);
		setOperationAction(ISD::FROUNDEVEN, VT, Custom);
setOperationAction(ISD::FSQRT, VT, Custom);		setOperationAction(ISD::FSQRT, VT, Custom);
setOperationAction(ISD::FSUB, VT, Custom);		setOperationAction(ISD::FSUB, VT, Custom);
setOperationAction(ISD::FTRUNC, VT, Custom);		setOperationAction(ISD::FTRUNC, VT, Custom);
setOperationAction(ISD::LOAD, VT, Custom);		setOperationAction(ISD::LOAD, VT, Custom);
setOperationAction(ISD::MUL, VT, Custom);		setOperationAction(ISD::MUL, VT, Custom);
setOperationAction(ISD::OR, VT, Custom);		setOperationAction(ISD::OR, VT, Custom);
setOperationAction(ISD::SDIV, VT, Custom);		setOperationAction(ISD::SDIV, VT, Custom);
setOperationAction(ISD::SETCC, VT, Custom);		setOperationAction(ISD::SETCC, VT, Custom);
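The lowering changes above place `ISD::FROUNDEVEN` next to `ISD::FROUND` everywhere, and the two differ only in how halfway cases are rounded: `frinta` (`fround`) rounds ties away from zero, while `frintn` (`froundeven`) rounds ties to the nearest even integer. A minimal host-side sketch of that difference follows. The helper names are hypothetical, and `std::nearbyint` under `FE_TONEAREST` is used here as a portable stand-in for round-half-to-even, not as what the backend emits.

```cpp
#include <cassert>
#include <cfenv>
#include <cmath>

// Hypothetical helper: round to an integral value, ties to even
// (the semantics of llvm.roundeven, selected as FRINTN in this patch).
inline double round_half_even(double x) {
  const int saved = std::fegetround();
  std::fesetround(FE_TONEAREST);      // nearbyint honours the current mode
  const double r = std::nearbyint(x);
  std::fesetround(saved);             // restore the caller's rounding mode
  return r;
}

// Hypothetical helper: ties away from zero
// (the semantics of llvm.round, selected as FRINTA).
inline double round_half_away(double x) { return std::round(x); }

// Spot checks on halfway cases, where the two modes disagree.
inline void check_rounding_examples() {
  assert(round_half_even(2.5) == 2.0);
  assert(round_half_away(2.5) == 3.0);
  assert(round_half_even(3.5) == 4.0);
  assert(round_half_away(3.5) == 4.0);
  assert(round_half_even(-2.5) == -2.0);
  assert(round_half_away(-2.5) == -3.0);
}
```

Away from exact halves, both helpers return the same value, so the checks above only exercise ties.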

llvm/lib/Target/AArch64/AArch64InstrInfo.td

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	defm FABS : SingleOperandFPData<0b0001, "fabs", fabs>;			defm FABS : SingleOperandFPData<0b0001, "fabs", fabs>;
	defm FMOV : SingleOperandFPData<0b0000, "fmov">;			defm FMOV : SingleOperandFPData<0b0000, "fmov">;
	defm FNEG : SingleOperandFPData<0b0010, "fneg", fneg>;			defm FNEG : SingleOperandFPData<0b0010, "fneg", fneg>;
	defm FRINTA : SingleOperandFPData<0b1100, "frinta", fround>;			defm FRINTA : SingleOperandFPData<0b1100, "frinta", fround>;
	defm FRINTI : SingleOperandFPData<0b1111, "frinti", fnearbyint>;			defm FRINTI : SingleOperandFPData<0b1111, "frinti", fnearbyint>;
	defm FRINTM : SingleOperandFPData<0b1010, "frintm", ffloor>;			defm FRINTM : SingleOperandFPData<0b1010, "frintm", ffloor>;
	defm FRINTN : SingleOperandFPData<0b1000, "frintn", int_aarch64_neon_frintn>;			defm FRINTN : SingleOperandFPData<0b1000, "frintn", froundeven>;
	defm FRINTP : SingleOperandFPData<0b1001, "frintp", fceil>;			defm FRINTP : SingleOperandFPData<0b1001, "frintp", fceil>;

	def : Pat<(v1f64 (int_aarch64_neon_frintn (v1f64 FPR64:$Rn))),
	(FRINTNDr FPR64:$Rn)>;

	defm FRINTX : SingleOperandFPData<0b1110, "frintx", frint>;			defm FRINTX : SingleOperandFPData<0b1110, "frintx", frint>;
	defm FRINTZ : SingleOperandFPData<0b1011, "frintz", ftrunc>;			defm FRINTZ : SingleOperandFPData<0b1011, "frintz", ftrunc>;

	let SchedRW = [WriteFDiv] in {			let SchedRW = [WriteFDiv] in {
	defm FSQRT : SingleOperandFPData<0b0011, "fsqrt", fsqrt>;			defm FSQRT : SingleOperandFPData<0b0011, "fsqrt", fsqrt>;
	}			}

	let Predicates = [HasFRInt3264] in {			let Predicates = [HasFRInt3264] in {
	[...]
	def : Pat<(v4i32 (int_aarch64_neon_fcvtzu v4f32:$Rn)), (FCVTZUv4f32 $Rn)>;			def : Pat<(v4i32 (int_aarch64_neon_fcvtzu v4f32:$Rn)), (FCVTZUv4f32 $Rn)>;
	def : Pat<(v2i64 (int_aarch64_neon_fcvtzu v2f64:$Rn)), (FCVTZUv2f64 $Rn)>;			def : Pat<(v2i64 (int_aarch64_neon_fcvtzu v2f64:$Rn)), (FCVTZUv2f64 $Rn)>;

	defm FNEG : SIMDTwoVectorFP<1, 1, 0b01111, "fneg", fneg>;			defm FNEG : SIMDTwoVectorFP<1, 1, 0b01111, "fneg", fneg>;
	defm FRECPE : SIMDTwoVectorFP<0, 1, 0b11101, "frecpe", int_aarch64_neon_frecpe>;			defm FRECPE : SIMDTwoVectorFP<0, 1, 0b11101, "frecpe", int_aarch64_neon_frecpe>;
	defm FRINTA : SIMDTwoVectorFP<1, 0, 0b11000, "frinta", fround>;			defm FRINTA : SIMDTwoVectorFP<1, 0, 0b11000, "frinta", fround>;
	defm FRINTI : SIMDTwoVectorFP<1, 1, 0b11001, "frinti", fnearbyint>;			defm FRINTI : SIMDTwoVectorFP<1, 1, 0b11001, "frinti", fnearbyint>;
	defm FRINTM : SIMDTwoVectorFP<0, 0, 0b11001, "frintm", ffloor>;			defm FRINTM : SIMDTwoVectorFP<0, 0, 0b11001, "frintm", ffloor>;
	defm FRINTN : SIMDTwoVectorFP<0, 0, 0b11000, "frintn", int_aarch64_neon_frintn>;			defm FRINTN : SIMDTwoVectorFP<0, 0, 0b11000, "frintn", froundeven>;
	defm FRINTP : SIMDTwoVectorFP<0, 1, 0b11000, "frintp", fceil>;			defm FRINTP : SIMDTwoVectorFP<0, 1, 0b11000, "frintp", fceil>;
	defm FRINTX : SIMDTwoVectorFP<1, 0, 0b11001, "frintx", frint>;			defm FRINTX : SIMDTwoVectorFP<1, 0, 0b11001, "frintx", frint>;
	defm FRINTZ : SIMDTwoVectorFP<0, 1, 0b11001, "frintz", ftrunc>;			defm FRINTZ : SIMDTwoVectorFP<0, 1, 0b11001, "frintz", ftrunc>;

	let Predicates = [HasFRInt3264] in {			let Predicates = [HasFRInt3264] in {
	defm FRINT32Z : FRIntNNTVector<0, 0, "frint32z", int_aarch64_neon_frint32z>;			defm FRINT32Z : FRIntNNTVector<0, 0, "frint32z", int_aarch64_neon_frint32z>;
	defm FRINT64Z : FRIntNNTVector<0, 1, "frint64z", int_aarch64_neon_frint64z>;			defm FRINT64Z : FRIntNNTVector<0, 1, "frint64z", int_aarch64_neon_frint64z>;
	defm FRINT32X : FRIntNNTVector<1, 0, "frint32x", int_aarch64_neon_frint32x>;			defm FRINT32X : FRIntNNTVector<1, 0, "frint32x", int_aarch64_neon_frint32x>;

llvm/test/CodeGen/AArch64/arm64-vcvt.ll

	declare <4 x float> @llvm.floor.v4f32(<4 x float>) nounwind readnone			declare <4 x float> @llvm.floor.v4f32(<4 x float>) nounwind readnone
	declare <2 x double> @llvm.floor.v2f64(<2 x double>) nounwind readnone			declare <2 x double> @llvm.floor.v2f64(<2 x double>) nounwind readnone

	define <2 x float> @frintn_2s(<2 x float> %A) nounwind {			define <2 x float> @frintn_2s(<2 x float> %A) nounwind {
	;CHECK-LABEL: frintn_2s:			;CHECK-LABEL: frintn_2s:
	;CHECK-NOT: ld1			;CHECK-NOT: ld1
	;CHECK: frintn.2s v0, v0			;CHECK: frintn.2s v0, v0
	;CHECK-NEXT: ret			;CHECK-NEXT: ret
	%tmp3 = call <2 x float> @llvm.aarch64.neon.frintn.v2f32(<2 x float> %A)			%tmp3 = call <2 x float> @llvm.roundeven.v2f32(<2 x float> %A)
	ret <2 x float> %tmp3			ret <2 x float> %tmp3
	}			}

	define <4 x float> @frintn_4s(<4 x float> %A) nounwind {			define <4 x float> @frintn_4s(<4 x float> %A) nounwind {
	;CHECK-LABEL: frintn_4s:			;CHECK-LABEL: frintn_4s:
	;CHECK-NOT: ld1			;CHECK-NOT: ld1
	;CHECK: frintn.4s v0, v0			;CHECK: frintn.4s v0, v0
	;CHECK-NEXT: ret			;CHECK-NEXT: ret
	%tmp3 = call <4 x float> @llvm.aarch64.neon.frintn.v4f32(<4 x float> %A)			%tmp3 = call <4 x float> @llvm.roundeven.v4f32(<4 x float> %A)
	ret <4 x float> %tmp3			ret <4 x float> %tmp3
	}			}

	define <2 x double> @frintn_2d(<2 x double> %A) nounwind {			define <2 x double> @frintn_2d(<2 x double> %A) nounwind {
	;CHECK-LABEL: frintn_2d:			;CHECK-LABEL: frintn_2d:
	;CHECK-NOT: ld1			;CHECK-NOT: ld1
	;CHECK: frintn.2d v0, v0			;CHECK: frintn.2d v0, v0
	;CHECK-NEXT: ret			;CHECK-NEXT: ret
	%tmp3 = call <2 x double> @llvm.aarch64.neon.frintn.v2f64(<2 x double> %A)			%tmp3 = call <2 x double> @llvm.roundeven.v2f64(<2 x double> %A)
	ret <2 x double> %tmp3			ret <2 x double> %tmp3
	}			}

	declare <2 x float> @llvm.aarch64.neon.frintn.v2f32(<2 x float>) nounwind readnone			declare <2 x float> @llvm.roundeven.v2f32(<2 x float>) nounwind readnone
	declare <4 x float> @llvm.aarch64.neon.frintn.v4f32(<4 x float>) nounwind readnone			declare <4 x float> @llvm.roundeven.v4f32(<4 x float>) nounwind readnone
	declare <2 x double> @llvm.aarch64.neon.frintn.v2f64(<2 x double>) nounwind readnone			declare <2 x double> @llvm.roundeven.v2f64(<2 x double>) nounwind readnone

	; FALLBACK-NOT: remark{{.*}}frintp_2s			; FALLBACK-NOT: remark{{.*}}frintp_2s
	define <2 x float> @frintp_2s(<2 x float> %A) nounwind {			define <2 x float> @frintp_2s(<2 x float> %A) nounwind {
	;CHECK-LABEL: frintp_2s:			;CHECK-LABEL: frintp_2s:
	;CHECK-NOT: ld1			;CHECK-NOT: ld1
	;CHECK: frintp.2s v0, v0			;CHECK: frintp.2s v0, v0
	;CHECK-NEXT: ret			;CHECK-NEXT: ret
	%tmp3 = call <2 x float> @llvm.ceil.v2f32(<2 x float> %A)			%tmp3 = call <2 x float> @llvm.ceil.v2f32(<2 x float> %A)

llvm/test/CodeGen/AArch64/arm64-vfloatintrinsics.ll

[...]	define %v4f16 @test_v4f16.round(%v4f16 %a) {
; GISEL-LABEL: test_v4f16.round:		; GISEL-LABEL: test_v4f16.round:
; GISEL-NOFP16-COUNT-4: frinta s{{[0-9]+}}, s{{[0-9]+}}		; GISEL-NOFP16-COUNT-4: frinta s{{[0-9]+}}, s{{[0-9]+}}
; GISEL-FP16-NOT: fcvt		; GISEL-FP16-NOT: fcvt
; GISEL-FP16: frinta.4h		; GISEL-FP16: frinta.4h
; GISEL-FP16-NEXT: ret		; GISEL-FP16-NEXT: ret
%1 = call %v4f16 @llvm.round.v4f16(%v4f16 %a)		%1 = call %v4f16 @llvm.round.v4f16(%v4f16 %a)
ret %v4f16 %1		ret %v4f16 %1
}		}
		define %v4f16 @test_v4f16.roundeven(%v4f16 %a) {
		; CHECK-LABEL: test_v4f16.roundeven:
		; CHECK-NOFP16-COUNT-4: frintn s{{[0-9]+}}, s{{[0-9]+}}
		; CHECK-FP16-NOT: fcvt
		; CHECK-FP16: frintn.4h
		; CHECK-FP16-NEXT: ret
		; GISEL-LABEL: test_v4f16.roundeven:
		; GISEL-NOFP16-COUNT-4: frintn s{{[0-9]+}}, s{{[0-9]+}}
		; GISEL-FP16-NOT: fcvt
		; GISEL-FP16: frintn.4h
		; GISEL-FP16-NEXT: ret
		%1 = call %v4f16 @llvm.roundeven.v4f16(%v4f16 %a)
		ret %v4f16 %1
		}

declare %v4f16 @llvm.sqrt.v4f16(%v4f16) #0		declare %v4f16 @llvm.sqrt.v4f16(%v4f16) #0
declare %v4f16 @llvm.powi.v4f16(%v4f16, i32) #0		declare %v4f16 @llvm.powi.v4f16(%v4f16, i32) #0
declare %v4f16 @llvm.sin.v4f16(%v4f16) #0		declare %v4f16 @llvm.sin.v4f16(%v4f16) #0
declare %v4f16 @llvm.cos.v4f16(%v4f16) #0		declare %v4f16 @llvm.cos.v4f16(%v4f16) #0
declare %v4f16 @llvm.pow.v4f16(%v4f16, %v4f16) #0		declare %v4f16 @llvm.pow.v4f16(%v4f16, %v4f16) #0
declare %v4f16 @llvm.exp.v4f16(%v4f16) #0		declare %v4f16 @llvm.exp.v4f16(%v4f16) #0
declare %v4f16 @llvm.exp2.v4f16(%v4f16) #0		declare %v4f16 @llvm.exp2.v4f16(%v4f16) #0
declare %v4f16 @llvm.log.v4f16(%v4f16) #0		declare %v4f16 @llvm.log.v4f16(%v4f16) #0
declare %v4f16 @llvm.log10.v4f16(%v4f16) #0		declare %v4f16 @llvm.log10.v4f16(%v4f16) #0
declare %v4f16 @llvm.log2.v4f16(%v4f16) #0		declare %v4f16 @llvm.log2.v4f16(%v4f16) #0
declare %v4f16 @llvm.fma.v4f16(%v4f16, %v4f16, %v4f16) #0		declare %v4f16 @llvm.fma.v4f16(%v4f16, %v4f16, %v4f16) #0
declare %v4f16 @llvm.fabs.v4f16(%v4f16) #0		declare %v4f16 @llvm.fabs.v4f16(%v4f16) #0
declare %v4f16 @llvm.floor.v4f16(%v4f16) #0		declare %v4f16 @llvm.floor.v4f16(%v4f16) #0
declare %v4f16 @llvm.ceil.v4f16(%v4f16) #0		declare %v4f16 @llvm.ceil.v4f16(%v4f16) #0
declare %v4f16 @llvm.trunc.v4f16(%v4f16) #0		declare %v4f16 @llvm.trunc.v4f16(%v4f16) #0
declare %v4f16 @llvm.rint.v4f16(%v4f16) #0		declare %v4f16 @llvm.rint.v4f16(%v4f16) #0
declare %v4f16 @llvm.nearbyint.v4f16(%v4f16) #0		declare %v4f16 @llvm.nearbyint.v4f16(%v4f16) #0
declare %v4f16 @llvm.round.v4f16(%v4f16) #0		declare %v4f16 @llvm.round.v4f16(%v4f16) #0
		declare %v4f16 @llvm.roundeven.v4f16(%v4f16) #0

;;;		;;;

%v8f16 = type <8 x half>		%v8f16 = type <8 x half>

; FALLBACK-NOT: remark{{.*}}test_v8f16.sqrt		; FALLBACK-NOT: remark{{.*}}test_v8f16.sqrt
define %v8f16 @test_v8f16.sqrt(%v8f16 %a) {		define %v8f16 @test_v8f16.sqrt(%v8f16 %a) {
; CHECK-LABEL: test_v8f16.sqrt:		; CHECK-LABEL: test_v8f16.sqrt:
[...]	define %v8f16 @test_v8f16.round(%v8f16 %a) {
; GISEL-LABEL: test_v8f16.round:		; GISEL-LABEL: test_v8f16.round:
; GISEL-NOFP16-COUNT-8: frinta s{{[0-9]+}}, s{{[0-9]+}}		; GISEL-NOFP16-COUNT-8: frinta s{{[0-9]+}}, s{{[0-9]+}}
; GISEL-FP16-NOT: fcvt		; GISEL-FP16-NOT: fcvt
; GISEL-FP16: frinta.8h		; GISEL-FP16: frinta.8h
; GISEL-FP16-NEXT: ret		; GISEL-FP16-NEXT: ret
%1 = call %v8f16 @llvm.round.v8f16(%v8f16 %a)		%1 = call %v8f16 @llvm.round.v8f16(%v8f16 %a)
ret %v8f16 %1		ret %v8f16 %1
}		}
		define %v8f16 @test_v8f16.roundeven(%v8f16 %a) {
		; CHECK-LABEL: test_v8f16.roundeven:
		; CHECK-NOFP16-COUNT-8: frintn s{{[0-9]+}}, s{{[0-9]+}}
		; CHECK-FP16-NOT: fcvt
		; CHECK-FP16: frintn.8h
		; CHECK-FP16-NEXT: ret
		; GISEL-LABEL: test_v8f16.roundeven:
		; GISEL-NOFP16-COUNT-8: frintn s{{[0-9]+}}, s{{[0-9]+}}
		; GISEL-FP16-NOT: fcvt
		; GISEL-FP16: frintn.8h
		; GISEL-FP16-NEXT: ret
		%1 = call %v8f16 @llvm.roundeven.v8f16(%v8f16 %a)
		ret %v8f16 %1
		}

declare %v8f16 @llvm.sqrt.v8f16(%v8f16) #0		declare %v8f16 @llvm.sqrt.v8f16(%v8f16) #0
declare %v8f16 @llvm.powi.v8f16(%v8f16, i32) #0		declare %v8f16 @llvm.powi.v8f16(%v8f16, i32) #0
declare %v8f16 @llvm.sin.v8f16(%v8f16) #0		declare %v8f16 @llvm.sin.v8f16(%v8f16) #0
declare %v8f16 @llvm.cos.v8f16(%v8f16) #0		declare %v8f16 @llvm.cos.v8f16(%v8f16) #0
declare %v8f16 @llvm.pow.v8f16(%v8f16, %v8f16) #0		declare %v8f16 @llvm.pow.v8f16(%v8f16, %v8f16) #0
declare %v8f16 @llvm.exp.v8f16(%v8f16) #0		declare %v8f16 @llvm.exp.v8f16(%v8f16) #0
declare %v8f16 @llvm.exp2.v8f16(%v8f16) #0		declare %v8f16 @llvm.exp2.v8f16(%v8f16) #0
declare %v8f16 @llvm.log.v8f16(%v8f16) #0		declare %v8f16 @llvm.log.v8f16(%v8f16) #0
declare %v8f16 @llvm.log10.v8f16(%v8f16) #0		declare %v8f16 @llvm.log10.v8f16(%v8f16) #0
declare %v8f16 @llvm.log2.v8f16(%v8f16) #0		declare %v8f16 @llvm.log2.v8f16(%v8f16) #0
declare %v8f16 @llvm.fma.v8f16(%v8f16, %v8f16, %v8f16) #0		declare %v8f16 @llvm.fma.v8f16(%v8f16, %v8f16, %v8f16) #0
declare %v8f16 @llvm.fabs.v8f16(%v8f16) #0		declare %v8f16 @llvm.fabs.v8f16(%v8f16) #0
declare %v8f16 @llvm.floor.v8f16(%v8f16) #0		declare %v8f16 @llvm.floor.v8f16(%v8f16) #0
declare %v8f16 @llvm.ceil.v8f16(%v8f16) #0		declare %v8f16 @llvm.ceil.v8f16(%v8f16) #0
declare %v8f16 @llvm.trunc.v8f16(%v8f16) #0		declare %v8f16 @llvm.trunc.v8f16(%v8f16) #0
declare %v8f16 @llvm.rint.v8f16(%v8f16) #0		declare %v8f16 @llvm.rint.v8f16(%v8f16) #0
declare %v8f16 @llvm.nearbyint.v8f16(%v8f16) #0		declare %v8f16 @llvm.nearbyint.v8f16(%v8f16) #0
declare %v8f16 @llvm.round.v8f16(%v8f16) #0		declare %v8f16 @llvm.round.v8f16(%v8f16) #0
		declare %v8f16 @llvm.roundeven.v8f16(%v8f16) #0

;;; Float vectors		;;; Float vectors

%v2f32 = type <2 x float>		%v2f32 = type <2 x float>

; FALLBACK-NOT: remark{{.*}}test_v2f32.sqrt		; FALLBACK-NOT: remark{{.*}}test_v2f32.sqrt
; CHECK-LABEL: test_v2f32.sqrt:		; CHECK-LABEL: test_v2f32.sqrt:
; GISEL-LABEL: test_v2f32.sqrt:		; GISEL-LABEL: test_v2f32.sqrt:

llvm/test/CodeGen/AArch64/f16-instructions.ll

	declare half @llvm.maxnum.f16(half %a, half %b) #0			declare half @llvm.maxnum.f16(half %a, half %b) #0
	declare half @llvm.copysign.f16(half %a, half %b) #0			declare half @llvm.copysign.f16(half %a, half %b) #0
	declare half @llvm.floor.f16(half %a) #0			declare half @llvm.floor.f16(half %a) #0
	declare half @llvm.ceil.f16(half %a) #0			declare half @llvm.ceil.f16(half %a) #0
	declare half @llvm.trunc.f16(half %a) #0			declare half @llvm.trunc.f16(half %a) #0
	declare half @llvm.rint.f16(half %a) #0			declare half @llvm.rint.f16(half %a) #0
	declare half @llvm.nearbyint.f16(half %a) #0			declare half @llvm.nearbyint.f16(half %a) #0
	declare half @llvm.round.f16(half %a) #0			declare half @llvm.round.f16(half %a) #0
				declare half @llvm.roundeven.f16(half %a) #0
	declare half @llvm.fmuladd.f16(half %a, half %b, half %c) #0			declare half @llvm.fmuladd.f16(half %a, half %b, half %c) #0
	declare half @llvm.aarch64.neon.frecpe.f16(half %a) #0			declare half @llvm.aarch64.neon.frecpe.f16(half %a) #0
	declare half @llvm.aarch64.neon.frecpx.f16(half %a) #0			declare half @llvm.aarch64.neon.frecpx.f16(half %a) #0
	declare half @llvm.aarch64.neon.frsqrte.f16(half %a) #0			declare half @llvm.aarch64.neon.frsqrte.f16(half %a) #0

	; FALLBACK-NOT: remark:{{.*}}test_sqrt			; FALLBACK-NOT: remark:{{.*}}test_sqrt
	; FALLBACK-FP16-NOT: remark:{{.*}}test_sqrt			; FALLBACK-FP16-NOT: remark:{{.*}}test_sqrt

	[...]
	; GISEL-FP16-NEXT: frinta h0, h0			; GISEL-FP16-NEXT: frinta h0, h0
	; GISEL-FP16-NEXT: ret			; GISEL-FP16-NEXT: ret

	define half @test_round(half %a) #0 {			define half @test_round(half %a) #0 {
	%r = call half @llvm.round.f16(half %a)			%r = call half @llvm.round.f16(half %a)
	ret half %r			ret half %r
	}			}

				; CHECK-CVT-LABEL: test_roundeven:
				; CHECK-CVT-NEXT: fcvt [[FLOAT32:s[0-9]+]], h0
				; CHECK-CVT-NEXT: frintn [[INT32:s[0-9]+]], [[FLOAT32]]
				; CHECK-CVT-NEXT: fcvt h0, [[INT32]]
				; CHECK-CVT-NEXT: ret

				; GISEL-CVT-LABEL: test_roundeven:
				; GISEL-CVT-NEXT: fcvt [[FLOAT32:s[0-9]+]], h0
				; GISEL-CVT-NEXT: frintn [[INT32:s[0-9]+]], [[FLOAT32]]
				; GISEL-CVT-NEXT: fcvt h0, [[INT32]]
				; GISEL-CVT-NEXT: ret


				; CHECK-FP16-LABEL: test_roundeven:
				; CHECK-FP16-NEXT: frintn h0, h0
				; CHECK-FP16-NEXT: ret

				; GISEL-FP16-LABEL: test_roundeven:
				; GISEL-FP16-NEXT: frintn h0, h0
				; GISEL-FP16-NEXT: ret

				define half @test_roundeven(half %a) #0 {
				%r = call half @llvm.roundeven.f16(half %a)
				ret half %r
				}

	; CHECK-CVT-LABEL: test_fmuladd:			; CHECK-CVT-LABEL: test_fmuladd:
	; CHECK-CVT-NEXT: fcvt s1, h1			; CHECK-CVT-NEXT: fcvt s1, h1
	; CHECK-CVT-NEXT: fcvt s0, h0			; CHECK-CVT-NEXT: fcvt s0, h0
	; CHECK-CVT-NEXT: fmul s0, s0, s1			; CHECK-CVT-NEXT: fmul s0, s0, s1
	; CHECK-CVT-NEXT: fcvt h0, s0			; CHECK-CVT-NEXT: fcvt h0, s0
	; CHECK-CVT-NEXT: fcvt s0, h0			; CHECK-CVT-NEXT: fcvt s0, h0
	; CHECK-CVT-NEXT: fcvt s1, h2			; CHECK-CVT-NEXT: fcvt s1, h2
	; CHECK-CVT-NEXT: fadd s0, s0, s1			; CHECK-CVT-NEXT: fadd s0, s0, s1

llvm/test/CodeGen/AArch64/fp-intrinsics.ll

	[... 260 lines not shown ...]

	; CHECK-LABEL: round_f32:			; CHECK-LABEL: round_f32:
	; CHECK: frinta s0, s0			; CHECK: frinta s0, s0
	define float @round_f32(float %x) #0 {			define float @round_f32(float %x) #0 {
	%val = call float @llvm.experimental.constrained.round.f32(float %x, metadata !"fpexcept.strict") #0			%val = call float @llvm.experimental.constrained.round.f32(float %x, metadata !"fpexcept.strict") #0
	ret float %val			ret float %val
	}			}

				; CHECK-LABEL: roundeven_f32:
				; CHECK: frintn s0, s0
				define float @roundeven_f32(float %x) #0 {
				%val = call float @llvm.experimental.constrained.roundeven.f32(float %x, metadata !"fpexcept.strict") #0
				ret float %val
				}

	; CHECK-LABEL: trunc_f32:			; CHECK-LABEL: trunc_f32:
	; CHECK: frintz s0, s0			; CHECK: frintz s0, s0
	define float @trunc_f32(float %x) #0 {			define float @trunc_f32(float %x) #0 {
	%val = call float @llvm.experimental.constrained.trunc.f32(float %x, metadata !"fpexcept.strict") #0			%val = call float @llvm.experimental.constrained.trunc.f32(float %x, metadata !"fpexcept.strict") #0
	ret float %val			ret float %val
	}			}

	; CHECK-LABEL: fcmp_olt_f32:			; CHECK-LABEL: fcmp_olt_f32:
	[... 447 lines not shown ...]

	; CHECK-LABEL: round_f64:			; CHECK-LABEL: round_f64:
	; CHECK: frinta d0, d0			; CHECK: frinta d0, d0
	define double @round_f64(double %x) #0 {			define double @round_f64(double %x) #0 {
	%val = call double @llvm.experimental.constrained.round.f64(double %x, metadata !"fpexcept.strict") #0			%val = call double @llvm.experimental.constrained.round.f64(double %x, metadata !"fpexcept.strict") #0
	ret double %val			ret double %val
	}			}

				; CHECK-LABEL: roundeven_f64:
				; CHECK: frintn d0, d0
				define double @roundeven_f64(double %x) #0 {
				%val = call double @llvm.experimental.constrained.roundeven.f64(double %x, metadata !"fpexcept.strict") #0
				ret double %val
				}

	; CHECK-LABEL: trunc_f64:			; CHECK-LABEL: trunc_f64:
	; CHECK: frintz d0, d0			; CHECK: frintz d0, d0
	define double @trunc_f64(double %x) #0 {			define double @trunc_f64(double %x) #0 {
	%val = call double @llvm.experimental.constrained.trunc.f64(double %x, metadata !"fpexcept.strict") #0			%val = call double @llvm.experimental.constrained.trunc.f64(double %x, metadata !"fpexcept.strict") #0
	ret double %val			ret double %val
	}			}

	; CHECK-LABEL: fcmp_olt_f64:			; CHECK-LABEL: fcmp_olt_f64:
	[... 729 lines not shown ...]
	declare i64 @llvm.experimental.constrained.llrint.f32(float, metadata, metadata)			declare i64 @llvm.experimental.constrained.llrint.f32(float, metadata, metadata)
	declare float @llvm.experimental.constrained.maxnum.f32(float, float, metadata)			declare float @llvm.experimental.constrained.maxnum.f32(float, float, metadata)
	declare float @llvm.experimental.constrained.minnum.f32(float, float, metadata)			declare float @llvm.experimental.constrained.minnum.f32(float, float, metadata)
	declare float @llvm.experimental.constrained.ceil.f32(float, metadata)			declare float @llvm.experimental.constrained.ceil.f32(float, metadata)
	declare float @llvm.experimental.constrained.floor.f32(float, metadata)			declare float @llvm.experimental.constrained.floor.f32(float, metadata)
	declare i32 @llvm.experimental.constrained.lround.f32(float, metadata)			declare i32 @llvm.experimental.constrained.lround.f32(float, metadata)
	declare i64 @llvm.experimental.constrained.llround.f32(float, metadata)			declare i64 @llvm.experimental.constrained.llround.f32(float, metadata)
	declare float @llvm.experimental.constrained.round.f32(float, metadata)			declare float @llvm.experimental.constrained.round.f32(float, metadata)
				declare float @llvm.experimental.constrained.roundeven.f32(float, metadata)
	declare float @llvm.experimental.constrained.trunc.f32(float, metadata)			declare float @llvm.experimental.constrained.trunc.f32(float, metadata)
	declare i1 @llvm.experimental.constrained.fcmps.f32(float, float, metadata, metadata)			declare i1 @llvm.experimental.constrained.fcmps.f32(float, float, metadata, metadata)
	declare i1 @llvm.experimental.constrained.fcmp.f32(float, float, metadata, metadata)			declare i1 @llvm.experimental.constrained.fcmp.f32(float, float, metadata, metadata)

	declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)			declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)
	declare double @llvm.experimental.constrained.fsub.f64(double, double, metadata, metadata)			declare double @llvm.experimental.constrained.fsub.f64(double, double, metadata, metadata)
	declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)			declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)
	declare double @llvm.experimental.constrained.fdiv.f64(double, double, metadata, metadata)			declare double @llvm.experimental.constrained.fdiv.f64(double, double, metadata, metadata)
	[... 25 lines not shown ...]
	declare i64 @llvm.experimental.constrained.llrint.f64(double, metadata, metadata)			declare i64 @llvm.experimental.constrained.llrint.f64(double, metadata, metadata)
	declare double @llvm.experimental.constrained.maxnum.f64(double, double, metadata)			declare double @llvm.experimental.constrained.maxnum.f64(double, double, metadata)
	declare double @llvm.experimental.constrained.minnum.f64(double, double, metadata)			declare double @llvm.experimental.constrained.minnum.f64(double, double, metadata)
	declare double @llvm.experimental.constrained.ceil.f64(double, metadata)			declare double @llvm.experimental.constrained.ceil.f64(double, metadata)
	declare double @llvm.experimental.constrained.floor.f64(double, metadata)			declare double @llvm.experimental.constrained.floor.f64(double, metadata)
	declare i32 @llvm.experimental.constrained.lround.f64(double, metadata)			declare i32 @llvm.experimental.constrained.lround.f64(double, metadata)
	declare i64 @llvm.experimental.constrained.llround.f64(double, metadata)			declare i64 @llvm.experimental.constrained.llround.f64(double, metadata)
	declare double @llvm.experimental.constrained.round.f64(double, metadata)			declare double @llvm.experimental.constrained.round.f64(double, metadata)
				declare double @llvm.experimental.constrained.roundeven.f64(double, metadata)
	declare double @llvm.experimental.constrained.trunc.f64(double, metadata)			declare double @llvm.experimental.constrained.trunc.f64(double, metadata)
	declare i1 @llvm.experimental.constrained.fcmps.f64(double, double, metadata, metadata)			declare i1 @llvm.experimental.constrained.fcmps.f64(double, double, metadata, metadata)
	declare i1 @llvm.experimental.constrained.fcmp.f64(double, double, metadata, metadata)			declare i1 @llvm.experimental.constrained.fcmp.f64(double, double, metadata, metadata)

	declare fp128 @llvm.experimental.constrained.fadd.f128(fp128, fp128, metadata, metadata)			declare fp128 @llvm.experimental.constrained.fadd.f128(fp128, fp128, metadata, metadata)
	declare fp128 @llvm.experimental.constrained.fsub.f128(fp128, fp128, metadata, metadata)			declare fp128 @llvm.experimental.constrained.fsub.f128(fp128, fp128, metadata, metadata)
	declare fp128 @llvm.experimental.constrained.fmul.f128(fp128, fp128, metadata, metadata)			declare fp128 @llvm.experimental.constrained.fmul.f128(fp128, fp128, metadata, metadata)
	declare fp128 @llvm.experimental.constrained.fdiv.f128(fp128, fp128, metadata, metadata)			declare fp128 @llvm.experimental.constrained.fdiv.f128(fp128, fp128, metadata, metadata)
	[... 43 lines not shown ...]

llvm/test/CodeGen/AArch64/frintn.ll

This file was added.

				; RUN: llc -mtriple=aarch64-eabi -mattr=+fullfp16 %s -o - | FileCheck %s

				; The llvm.aarch64.neon.frintn intrinsic should be auto-upgraded to the
				; target-independent roundeven intrinsic.

				define <4 x half> @frintn_4h(<4 x half> %A) nounwind {
				;CHECK-LABEL: frintn_4h:
				;CHECK: frintn v0.4h, v0.4h
				;CHECK-NEXT: ret
				%tmp3 = call <4 x half> @llvm.aarch64.neon.frintn.v4f16(<4 x half> %A)
				ret <4 x half> %tmp3
				}

				define <2 x float> @frintn_2s(<2 x float> %A) nounwind {
				;CHECK-LABEL: frintn_2s:
				;CHECK: frintn v0.2s, v0.2s
				;CHECK-NEXT: ret
				%tmp3 = call <2 x float> @llvm.aarch64.neon.frintn.v2f32(<2 x float> %A)
				ret <2 x float> %tmp3
				}

				define <4 x float> @frintn_4s(<4 x float> %A) nounwind {
				;CHECK-LABEL: frintn_4s:
				;CHECK: frintn v0.4s, v0.4s
				;CHECK-NEXT: ret
				%tmp3 = call <4 x float> @llvm.aarch64.neon.frintn.v4f32(<4 x float> %A)
				ret <4 x float> %tmp3
				}

				define <2 x double> @frintn_2d(<2 x double> %A) nounwind {
				;CHECK-LABEL: frintn_2d:
				;CHECK: frintn v0.2d, v0.2d
				;CHECK-NEXT: ret
				%tmp3 = call <2 x double> @llvm.aarch64.neon.frintn.v2f64(<2 x double> %A)
				ret <2 x double> %tmp3
				}

				declare <4 x half> @llvm.aarch64.neon.frintn.v4f16(<4 x half>) nounwind readnone
				declare <2 x float> @llvm.aarch64.neon.frintn.v2f32(<2 x float>) nounwind readnone
				declare <4 x float> @llvm.aarch64.neon.frintn.v4f32(<4 x float>) nounwind readnone
				declare <2 x double> @llvm.aarch64.neon.frintn.v2f64(<2 x double>) nounwind readnone
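
For readers unfamiliar with the rounding behaviour involved: frintn, and the llvm.roundeven intrinsic it is now selected from, rounds to the nearest integer with ties going to the even neighbour, whereas frinta / llvm.round sends ties away from zero. The C++ sketch below only illustrates that difference on the host. It is not part of the patch, the function names are made up for illustration, and it assumes the floating-point environment is in its default round-to-nearest mode so that std::nearbyint resolves ties to even.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper: ties-to-even rounding, the behaviour of llvm.roundeven
// (FRINTN). Assumes the current rounding mode is the default FE_TONEAREST,
// because std::nearbyint rounds according to the current mode.
double round_ties_even(double x) { return std::nearbyint(x); }

// Hypothetical helper: ties-away-from-zero rounding, the behaviour of
// llvm.round (FRINTA).
double round_ties_away(double x) { return std::round(x); }

// The two helpers can only disagree on exact halfway cases.
void check_examples() {
  assert(round_ties_even(2.5) == 2.0);   // tie goes to the even neighbour
  assert(round_ties_away(2.5) == 3.0);   // tie goes away from zero
  assert(round_ties_even(3.5) == 4.0);   // 4 is already the even neighbour
  assert(round_ties_away(3.5) == 4.0);
  assert(round_ties_even(-2.5) == -2.0);
  assert(round_ties_away(-2.5) == -3.0);
  assert(round_ties_even(2.4) == round_ties_away(2.4));  // not a tie
}
```

Calling check_examples() passes silently when the assumptions hold. This is why the old target-specific frintn intrinsic can be expressed with the generic roundeven node without changing results.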

llvm/test/CodeGen/AArch64/sve-fixed-length-fp-rounding.ll

	[... 1,250 lines not shown ...]
	; VBITS_GE_2048-NEXT: ret			; VBITS_GE_2048-NEXT: ret
	%op = load <32 x double>, <32 x double>* %a			%op = load <32 x double>, <32 x double>* %a
	%res = call <32 x double> @llvm.round.v32f64(<32 x double> %op)			%res = call <32 x double> @llvm.round.v32f64(<32 x double> %op)
	store <32 x double> %res, <32 x double>* %a			store <32 x double> %res, <32 x double>* %a
	ret void			ret void
	}			}

	;			;
				; ROUNDEVEN -> FRINTN
				;

				; Don't use SVE for 64-bit vectors.
				define <4 x half> @frintn_v4f16(<4 x half> %op) #0 {
				; CHECK-LABEL: frintn_v4f16:
				; CHECK: frintn v0.4h, v0.4h
				; CHECK-NEXT: ret
				%res = call <4 x half> @llvm.roundeven.v4f16(<4 x half> %op)
				ret <4 x half> %res
				}

				; Don't use SVE for 128-bit vectors.
				define <8 x half> @frintn_v8f16(<8 x half> %op) #0 {
				; CHECK-LABEL: frintn_v8f16:
				; CHECK: frintn v0.8h, v0.8h
				; CHECK-NEXT: ret
				%res = call <8 x half> @llvm.roundeven.v8f16(<8 x half> %op)
				ret <8 x half> %res
				}

				define void @frintn_v16f16(<16 x half>* %a) #0 {
				; CHECK-LABEL: frintn_v16f16:
				; CHECK: ptrue [[PG:p[0-9]+]].h, vl16
				; CHECK-DAG: ld1h { [[OP:z[0-9]+]].h }, [[PG]]/z, [x0]
				; CHECK-NEXT: frintn [[RES:z[0-9]+]].h, [[PG]]/m, [[OP]].h
				; CHECK-NEXT: st1h { [[RES]].h }, [[PG]], [x0]
				; CHECK-NEXT: ret
				%op = load <16 x half>, <16 x half>* %a
				%res = call <16 x half> @llvm.roundeven.v16f16(<16 x half> %op)
				store <16 x half> %res, <16 x half>* %a
				ret void
				}

				define void @frintn_v32f16(<32 x half>* %a) #0 {
				; CHECK-LABEL: frintn_v32f16:
				; VBITS_GE_512: ptrue [[PG:p[0-9]+]].h, vl32
				; VBITS_GE_512-DAG: ld1h { [[OP:z[0-9]+]].h }, [[PG]]/z, [x0]
				; VBITS_GE_512-NEXT: frintn [[RES:z[0-9]+]].h, [[PG]]/m, [[OP]].h
				; VBITS_GE_512-NEXT: st1h { [[RES]].h }, [[PG]], [x0]
				; VBITS_GE_512-NEXT: ret

				; Ensure sensible type legalisation.
				; VBITS_EQ_256-DAG: ptrue [[PG:p[0-9]+]].h, vl16
				; VBITS_EQ_256-DAG: add x[[A_HI:[0-9]+]], x0, #32
				; VBITS_EQ_256-DAG: ld1h { [[OP_LO:z[0-9]+]].h }, [[PG]]/z, [x0]
				; VBITS_EQ_256-DAG: ld1h { [[OP_HI:z[0-9]+]].h }, [[PG]]/z, [x[[A_HI]]]
				; VBITS_EQ_256-DAG: frintn [[RES_LO:z[0-9]+]].h, [[PG]]/m, [[OP_LO]].h
				; VBITS_EQ_256-DAG: frintn [[RES_HI:z[0-9]+]].h, [[PG]]/m, [[OP_HI]].h
				; VBITS_EQ_256-DAG: st1h { [[RES_LO]].h }, [[PG]], [x0]
				; VBITS_EQ_256-DAG: st1h { [[RES_HI]].h }, [[PG]], [x[[A_HI]]]
				; VBITS_EQ_256-NEXT: ret
				%op = load <32 x half>, <32 x half>* %a
				%res = call <32 x half> @llvm.roundeven.v32f16(<32 x half> %op)
				store <32 x half> %res, <32 x half>* %a
				ret void
				}

				define void @frintn_v64f16(<64 x half>* %a) #0 {
				; CHECK-LABEL: frintn_v64f16:
				; VBITS_GE_1024: ptrue [[PG:p[0-9]+]].h, vl64
				; VBITS_GE_1024-DAG: ld1h { [[OP:z[0-9]+]].h }, [[PG]]/z, [x0]
				; VBITS_GE_1024-NEXT: frintn [[RES:z[0-9]+]].h, [[PG]]/m, [[OP]].h
				; VBITS_GE_1024-NEXT: st1h { [[RES]].h }, [[PG]], [x0]
				; VBITS_GE_1024-NEXT: ret
				%op = load <64 x half>, <64 x half>* %a
				%res = call <64 x half> @llvm.roundeven.v64f16(<64 x half> %op)
				store <64 x half> %res, <64 x half>* %a
				ret void
				}

				define void @frintn_v128f16(<128 x half>* %a) #0 {
				; CHECK-LABEL: frintn_v128f16:
				; VBITS_GE_2048: ptrue [[PG:p[0-9]+]].h, vl128
				; VBITS_GE_2048-DAG: ld1h { [[OP:z[0-9]+]].h }, [[PG]]/z, [x0]
				; VBITS_GE_2048-NEXT: frintn [[RES:z[0-9]+]].h, [[PG]]/m, [[OP]].h
				; VBITS_GE_2048-NEXT: st1h { [[RES]].h }, [[PG]], [x0]
				; VBITS_GE_2048-NEXT: ret
				%op = load <128 x half>, <128 x half>* %a
				%res = call <128 x half> @llvm.roundeven.v128f16(<128 x half> %op)
				store <128 x half> %res, <128 x half>* %a
				ret void
				}

				; Don't use SVE for 64-bit vectors.
				define <2 x float> @frintn_v2f32(<2 x float> %op) #0 {
				; CHECK-LABEL: frintn_v2f32:
				; CHECK: frintn v0.2s, v0.2s
				; CHECK-NEXT: ret
				%res = call <2 x float> @llvm.roundeven.v2f32(<2 x float> %op)
				ret <2 x float> %res
				}

				; Don't use SVE for 128-bit vectors.
				define <4 x float> @frintn_v4f32(<4 x float> %op) #0 {
				; CHECK-LABEL: frintn_v4f32:
				; CHECK: frintn v0.4s, v0.4s
				; CHECK-NEXT: ret
				%res = call <4 x float> @llvm.roundeven.v4f32(<4 x float> %op)
				ret <4 x float> %res
				}

				define void @frintn_v8f32(<8 x float>* %a) #0 {
				; CHECK-LABEL: frintn_v8f32:
				; CHECK: ptrue [[PG:p[0-9]+]].s, vl8
				; CHECK-DAG: ld1w { [[OP:z[0-9]+]].s }, [[PG]]/z, [x0]
				; CHECK-NEXT: frintn [[RES:z[0-9]+]].s, [[PG]]/m, [[OP]].s
				; CHECK-NEXT: st1w { [[RES]].s }, [[PG]], [x0]
				; CHECK-NEXT: ret
				%op = load <8 x float>, <8 x float>* %a
				%res = call <8 x float> @llvm.roundeven.v8f32(<8 x float> %op)
				store <8 x float> %res, <8 x float>* %a
				ret void
				}

				define void @frintn_v16f32(<16 x float>* %a) #0 {
				; CHECK-LABEL: frintn_v16f32:
				; VBITS_GE_512: ptrue [[PG:p[0-9]+]].s, vl16
				; VBITS_GE_512-DAG: ld1w { [[OP:z[0-9]+]].s }, [[PG]]/z, [x0]
				; VBITS_GE_512-NEXT: frintn [[RES:z[0-9]+]].s, [[PG]]/m, [[OP]].s
				; VBITS_GE_512-NEXT: st1w { [[RES]].s }, [[PG]], [x0]
				; VBITS_GE_512-NEXT: ret

				; Ensure sensible type legalisation.
				; VBITS_EQ_256-DAG: ptrue [[PG:p[0-9]+]].s, vl8
				; VBITS_EQ_256-DAG: add x[[A_HI:[0-9]+]], x0, #32
				; VBITS_EQ_256-DAG: ld1w { [[OP_LO:z[0-9]+]].s }, [[PG]]/z, [x0]
				; VBITS_EQ_256-DAG: ld1w { [[OP_HI:z[0-9]+]].s }, [[PG]]/z, [x[[A_HI]]]
				; VBITS_EQ_256-DAG: frintn [[RES_LO:z[0-9]+]].s, [[PG]]/m, [[OP_LO]].s
				; VBITS_EQ_256-DAG: frintn [[RES_HI:z[0-9]+]].s, [[PG]]/m, [[OP_HI]].s
				; VBITS_EQ_256-DAG: st1w { [[RES_LO]].s }, [[PG]], [x0]
				; VBITS_EQ_256-DAG: st1w { [[RES_HI]].s }, [[PG]], [x[[A_HI]]]
				; VBITS_EQ_256-NEXT: ret
				%op = load <16 x float>, <16 x float>* %a
				%res = call <16 x float> @llvm.roundeven.v16f32(<16 x float> %op)
				store <16 x float> %res, <16 x float>* %a
				ret void
				}

				define void @frintn_v32f32(<32 x float>* %a) #0 {
				; CHECK-LABEL: frintn_v32f32:
				; VBITS_GE_1024: ptrue [[PG:p[0-9]+]].s, vl32
				; VBITS_GE_1024-DAG: ld1w { [[OP:z[0-9]+]].s }, [[PG]]/z, [x0]
				; VBITS_GE_1024-NEXT: frintn [[RES:z[0-9]+]].s, [[PG]]/m, [[OP]].s
				; VBITS_GE_1024-NEXT: st1w { [[RES]].s }, [[PG]], [x0]
				; VBITS_GE_1024-NEXT: ret
				%op = load <32 x float>, <32 x float>* %a
				%res = call <32 x float> @llvm.roundeven.v32f32(<32 x float> %op)
				store <32 x float> %res, <32 x float>* %a
				ret void
				}

				define void @frintn_v64f32(<64 x float>* %a) #0 {
				; CHECK-LABEL: frintn_v64f32:
				; VBITS_GE_2048: ptrue [[PG:p[0-9]+]].s, vl64
				; VBITS_GE_2048-DAG: ld1w { [[OP:z[0-9]+]].s }, [[PG]]/z, [x0]
				; VBITS_GE_2048-NEXT: frintn [[RES:z[0-9]+]].s, [[PG]]/m, [[OP]].s
				; VBITS_GE_2048-NEXT: st1w { [[RES]].s }, [[PG]], [x0]
				; VBITS_GE_2048-NEXT: ret
				%op = load <64 x float>, <64 x float>* %a
				%res = call <64 x float> @llvm.roundeven.v64f32(<64 x float> %op)
				store <64 x float> %res, <64 x float>* %a
				ret void
				}

				; Don't use SVE for 64-bit vectors.
				define <1 x double> @frintn_v1f64(<1 x double> %op) #0 {
				; CHECK-LABEL: frintn_v1f64:
				; CHECK: frintn d0, d0
				; CHECK-NEXT: ret
				%res = call <1 x double> @llvm.roundeven.v1f64(<1 x double> %op)
				ret <1 x double> %res
				}

				; Don't use SVE for 128-bit vectors.
				define <2 x double> @frintn_v2f64(<2 x double> %op) #0 {
				; CHECK-LABEL: frintn_v2f64:
				; CHECK: frintn v0.2d, v0.2d
				; CHECK-NEXT: ret
				%res = call <2 x double> @llvm.roundeven.v2f64(<2 x double> %op)
				ret <2 x double> %res
				}

				define void @frintn_v4f64(<4 x double>* %a) #0 {
				; CHECK-LABEL: frintn_v4f64:
				; CHECK: ptrue [[PG:p[0-9]+]].d, vl4
				; CHECK-DAG: ld1d { [[OP:z[0-9]+]].d }, [[PG]]/z, [x0]
				; CHECK-NEXT: frintn [[RES:z[0-9]+]].d, [[PG]]/m, [[OP]].d
				; CHECK-NEXT: st1d { [[RES]].d }, [[PG]], [x0]
				; CHECK-NEXT: ret
				%op = load <4 x double>, <4 x double>* %a
				%res = call <4 x double> @llvm.roundeven.v4f64(<4 x double> %op)
				store <4 x double> %res, <4 x double>* %a
				ret void
				}

				define void @frintn_v8f64(<8 x double>* %a) #0 {
				; CHECK-LABEL: frintn_v8f64:
				; VBITS_GE_512: ptrue [[PG:p[0-9]+]].d, vl8
				; VBITS_GE_512-DAG: ld1d { [[OP:z[0-9]+]].d }, [[PG]]/z, [x0]
				; VBITS_GE_512-NEXT: frintn [[RES:z[0-9]+]].d, [[PG]]/m, [[OP]].d
				; VBITS_GE_512-NEXT: st1d { [[RES]].d }, [[PG]], [x0]
				; VBITS_GE_512-NEXT: ret

				; Ensure sensible type legalisation.
				; VBITS_EQ_256-DAG: ptrue [[PG:p[0-9]+]].d, vl4
				; VBITS_EQ_256-DAG: add x[[A_HI:[0-9]+]], x0, #32
				; VBITS_EQ_256-DAG: ld1d { [[OP_LO:z[0-9]+]].d }, [[PG]]/z, [x0]
				; VBITS_EQ_256-DAG: ld1d { [[OP_HI:z[0-9]+]].d }, [[PG]]/z, [x[[A_HI]]]
				; VBITS_EQ_256-DAG: frintn [[RES_LO:z[0-9]+]].d, [[PG]]/m, [[OP_LO]].d
				; VBITS_EQ_256-DAG: frintn [[RES_HI:z[0-9]+]].d, [[PG]]/m, [[OP_HI]].d
				; VBITS_EQ_256-DAG: st1d { [[RES_LO]].d }, [[PG]], [x0]
				; VBITS_EQ_256-DAG: st1d { [[RES_HI]].d }, [[PG]], [x[[A_HI]]]
				; VBITS_EQ_256-NEXT: ret
				%op = load <8 x double>, <8 x double>* %a
				%res = call <8 x double> @llvm.roundeven.v8f64(<8 x double> %op)
				store <8 x double> %res, <8 x double>* %a
				ret void
				}

				define void @frintn_v16f64(<16 x double>* %a) #0 {
				; CHECK-LABEL: frintn_v16f64:
				; VBITS_GE_1024: ptrue [[PG:p[0-9]+]].d, vl16
				; VBITS_GE_1024-DAG: ld1d { [[OP:z[0-9]+]].d }, [[PG]]/z, [x0]
				; VBITS_GE_1024-NEXT: frintn [[RES:z[0-9]+]].d, [[PG]]/m, [[OP]].d
				; VBITS_GE_1024-NEXT: st1d { [[RES]].d }, [[PG]], [x0]
				; VBITS_GE_1024-NEXT: ret
				%op = load <16 x double>, <16 x double>* %a
				%res = call <16 x double> @llvm.roundeven.v16f64(<16 x double> %op)
				store <16 x double> %res, <16 x double>* %a
				ret void
				}

				define void @frintn_v32f64(<32 x double>* %a) #0 {
				; CHECK-LABEL: frintn_v32f64:
				; VBITS_GE_2048: ptrue [[PG:p[0-9]+]].d, vl32
				; VBITS_GE_2048-DAG: ld1d { [[OP:z[0-9]+]].d }, [[PG]]/z, [x0]
				; VBITS_GE_2048-NEXT: frintn [[RES:z[0-9]+]].d, [[PG]]/m, [[OP]].d
				; VBITS_GE_2048-NEXT: st1d { [[RES]].d }, [[PG]], [x0]
				; VBITS_GE_2048-NEXT: ret
				%op = load <32 x double>, <32 x double>* %a
				%res = call <32 x double> @llvm.roundeven.v32f64(<32 x double> %op)
				store <32 x double> %res, <32 x double>* %a
				ret void
				}

				;
	; TRUNC -> FRINTZ			; TRUNC -> FRINTZ
	;			;

	; Don't use SVE for 64-bit vectors.			; Don't use SVE for 64-bit vectors.
	define <4 x half> @frintz_v4f16(<4 x half> %op) #0 {			define <4 x half> @frintz_v4f16(<4 x half> %op) #0 {
	; CHECK-LABEL: frintz_v4f16:			; CHECK-LABEL: frintz_v4f16:
	; CHECK: frintz v0.4h, v0.4h			; CHECK: frintz v0.4h, v0.4h
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	[... 327 lines not shown ...]
	declare <64 x float> @llvm.round.v64f32(<64 x float>)			declare <64 x float> @llvm.round.v64f32(<64 x float>)
	declare <1 x double> @llvm.round.v1f64(<1 x double>)			declare <1 x double> @llvm.round.v1f64(<1 x double>)
	declare <2 x double> @llvm.round.v2f64(<2 x double>)			declare <2 x double> @llvm.round.v2f64(<2 x double>)
	declare <4 x double> @llvm.round.v4f64(<4 x double>)			declare <4 x double> @llvm.round.v4f64(<4 x double>)
	declare <8 x double> @llvm.round.v8f64(<8 x double>)			declare <8 x double> @llvm.round.v8f64(<8 x double>)
	declare <16 x double> @llvm.round.v16f64(<16 x double>)			declare <16 x double> @llvm.round.v16f64(<16 x double>)
	declare <32 x double> @llvm.round.v32f64(<32 x double>)			declare <32 x double> @llvm.round.v32f64(<32 x double>)

				declare <4 x half> @llvm.roundeven.v4f16(<4 x half>)
				declare <8 x half> @llvm.roundeven.v8f16(<8 x half>)
				declare <16 x half> @llvm.roundeven.v16f16(<16 x half>)
				declare <32 x half> @llvm.roundeven.v32f16(<32 x half>)
				declare <64 x half> @llvm.roundeven.v64f16(<64 x half>)
				declare <128 x half> @llvm.roundeven.v128f16(<128 x half>)
				declare <2 x float> @llvm.roundeven.v2f32(<2 x float>)
				declare <4 x float> @llvm.roundeven.v4f32(<4 x float>)
				declare <8 x float> @llvm.roundeven.v8f32(<8 x float>)
				declare <16 x float> @llvm.roundeven.v16f32(<16 x float>)
				declare <32 x float> @llvm.roundeven.v32f32(<32 x float>)
				declare <64 x float> @llvm.roundeven.v64f32(<64 x float>)
				declare <1 x double> @llvm.roundeven.v1f64(<1 x double>)
				declare <2 x double> @llvm.roundeven.v2f64(<2 x double>)
				declare <4 x double> @llvm.roundeven.v4f64(<4 x double>)
				declare <8 x double> @llvm.roundeven.v8f64(<8 x double>)
				declare <16 x double> @llvm.roundeven.v16f64(<16 x double>)
				declare <32 x double> @llvm.roundeven.v32f64(<32 x double>)

	declare <4 x half> @llvm.trunc.v4f16(<4 x half>)			declare <4 x half> @llvm.trunc.v4f16(<4 x half>)
	declare <8 x half> @llvm.trunc.v8f16(<8 x half>)			declare <8 x half> @llvm.trunc.v8f16(<8 x half>)
	declare <16 x half> @llvm.trunc.v16f16(<16 x half>)			declare <16 x half> @llvm.trunc.v16f16(<16 x half>)
	declare <32 x half> @llvm.trunc.v32f16(<32 x half>)			declare <32 x half> @llvm.trunc.v32f16(<32 x half>)
	declare <64 x half> @llvm.trunc.v64f16(<64 x half>)			declare <64 x half> @llvm.trunc.v64f16(<64 x half>)
	declare <128 x half> @llvm.trunc.v128f16(<128 x half>)			declare <128 x half> @llvm.trunc.v128f16(<128 x half>)
	declare <2 x float> @llvm.trunc.v2f32(<2 x float>)			declare <2 x float> @llvm.trunc.v2f32(<2 x float>)
	declare <4 x float> @llvm.trunc.v4f32(<4 x float>)			declare <4 x float> @llvm.trunc.v4f32(<4 x float>)
	[... 10 lines not shown ...]
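
The VBITS_EQ_256 checks in the frintn tests above expect an operation on a vector wider than the SVE register to be split into register-sized parts: two predicated frintn instructions, with the high half addressed at x0 + 32. The sketch below covers that arithmetic only. It is not how the legaliser is implemented, SplitPlan and plan_split are hypothetical names, and it assumes the vector's total bit width either fits in one register or is an exact multiple of the register width.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical description of how a fixed-length vector maps onto registers.
struct SplitPlan {
  std::size_t parts;           // number of register-sized pieces
  std::size_t elts_per_part;   // the N in the "ptrue ..., vlN" pattern
  std::size_t bytes_per_part;  // address step between consecutive pieces
};

// Assumes num_elts * elt_bits is either <= vbits or an exact multiple of it.
SplitPlan plan_split(std::size_t num_elts, std::size_t elt_bits,
                     std::size_t vbits) {
  const std::size_t total_bits = num_elts * elt_bits;
  const std::size_t parts = total_bits <= vbits ? 1 : total_bits / vbits;
  return {parts, num_elts / parts, total_bits / parts / 8};
}

// Mirrors the frintn_v32f16 test: <32 x half> is 512 bits wide.
void check_plan() {
  const SplitPlan at256 = plan_split(32, 16, 256);
  assert(at256.parts == 2 && at256.elts_per_part == 16 &&
         at256.bytes_per_part == 32);
  const SplitPlan at512 = plan_split(32, 16, 512);
  assert(at512.parts == 1 && at512.elts_per_part == 32);
}
```

For <32 x half> with 256-bit registers this gives two parts of 16 elements, 32 bytes apart, which lines up with the vl16 predicate and the #32 offset in the VBITS_EQ_256 checks. With 512-bit registers a single part of 32 elements remains, matching the vl32 predicate in the VBITS_GE_512 checks.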

llvm/test/CodeGen/AArch64/vec-libcalls.ll

	[... 23 lines not shown ...]
	declare <3 x float> @llvm.exp2.v3f32(<3 x float>)			declare <3 x float> @llvm.exp2.v3f32(<3 x float>)
	declare <3 x float> @llvm.floor.v3f32(<3 x float>)			declare <3 x float> @llvm.floor.v3f32(<3 x float>)
	declare <3 x float> @llvm.log.v3f32(<3 x float>)			declare <3 x float> @llvm.log.v3f32(<3 x float>)
	declare <3 x float> @llvm.log10.v3f32(<3 x float>)			declare <3 x float> @llvm.log10.v3f32(<3 x float>)
	declare <3 x float> @llvm.log2.v3f32(<3 x float>)			declare <3 x float> @llvm.log2.v3f32(<3 x float>)
	declare <3 x float> @llvm.nearbyint.v3f32(<3 x float>)			declare <3 x float> @llvm.nearbyint.v3f32(<3 x float>)
	declare <3 x float> @llvm.rint.v3f32(<3 x float>)			declare <3 x float> @llvm.rint.v3f32(<3 x float>)
	declare <3 x float> @llvm.round.v3f32(<3 x float>)			declare <3 x float> @llvm.round.v3f32(<3 x float>)
				declare <3 x float> @llvm.roundeven.v3f32(<3 x float>)
	declare <3 x float> @llvm.sqrt.v3f32(<3 x float>)			declare <3 x float> @llvm.sqrt.v3f32(<3 x float>)
	declare <3 x float> @llvm.trunc.v3f32(<3 x float>)			declare <3 x float> @llvm.trunc.v3f32(<3 x float>)

	define <1 x float> @sin_v1f32(<1 x float> %x) nounwind {			define <1 x float> @sin_v1f32(<1 x float> %x) nounwind {
	; CHECK-LABEL: sin_v1f32:			; CHECK-LABEL: sin_v1f32:
	; CHECK: // %bb.0:			; CHECK: // %bb.0:
	; CHECK-NEXT: str x30, [sp, #-16]! // 8-byte Folded Spill			; CHECK-NEXT: str x30, [sp, #-16]! // 8-byte Folded Spill
	; CHECK-NEXT: // kill: def $d0 killed $d0 def $q0			; CHECK-NEXT: // kill: def $d0 killed $d0 def $q0
	[... 433 lines not shown ...]
	; CHECK-LABEL: round_v3f32:			; CHECK-LABEL: round_v3f32:
	; CHECK: // %bb.0:			; CHECK: // %bb.0:
	; CHECK-NEXT: frinta v0.4s, v0.4s			; CHECK-NEXT: frinta v0.4s, v0.4s
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	%r = call <3 x float> @llvm.round.v3f32(<3 x float> %x)			%r = call <3 x float> @llvm.round.v3f32(<3 x float> %x)
	ret <3 x float> %r			ret <3 x float> %r
	}			}

				define <3 x float> @roundeven_v3f32(<3 x float> %x) nounwind {
				; CHECK-LABEL: roundeven_v3f32:
				; CHECK: // %bb.0:
				; CHECK-NEXT: frintn v0.4s, v0.4s
				; CHECK-NEXT: ret
				%r = call <3 x float> @llvm.roundeven.v3f32(<3 x float> %x)
				ret <3 x float> %r
				}

	define <3 x float> @sqrt_v3f32(<3 x float> %x) nounwind {			define <3 x float> @sqrt_v3f32(<3 x float> %x) nounwind {
	; CHECK-LABEL: sqrt_v3f32:			; CHECK-LABEL: sqrt_v3f32:
	; CHECK: // %bb.0:			; CHECK: // %bb.0:
	; CHECK-NEXT: fsqrt v0.4s, v0.4s			; CHECK-NEXT: fsqrt v0.4s, v0.4s
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	%r = call <3 x float> @llvm.sqrt.v3f32(<3 x float> %x)			%r = call <3 x float> @llvm.sqrt.v3f32(<3 x float> %x)
	ret <3 x float> %r			ret <3 x float> %r
	}			}
	[... 10 lines not shown ...]

Revision Contents

Diff 331224

clang/lib/CodeGen/CGBuiltin.cpp

clang/test/CodeGen/aarch64-neon-intrinsics.c

clang/test/CodeGen/aarch64-neon-misc.c

clang/test/CodeGen/aarch64-v8.2a-fp16-intrinsics.c

clang/test/CodeGen/aarch64-v8.2a-neon-intrinsics.c

clang/test/CodeGen/arm-neon-directed-rounding.c

clang/test/CodeGen/arm64-vrnd.c

llvm/include/llvm/IR/IntrinsicsAArch64.td

llvm/include/llvm/Target/TargetSelectionDAG.td

llvm/lib/IR/AutoUpgrade.cpp

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

llvm/lib/Target/AArch64/AArch64InstrInfo.td

llvm/test/CodeGen/AArch64/arm64-vcvt.ll

llvm/test/CodeGen/AArch64/arm64-vfloatintrinsics.ll

llvm/test/CodeGen/AArch64/f16-instructions.ll

llvm/test/CodeGen/AArch64/fp-intrinsics.ll

llvm/test/CodeGen/AArch64/frintn.ll

llvm/test/CodeGen/AArch64/sve-fixed-length-fp-rounding.ll

llvm/test/CodeGen/AArch64/vec-libcalls.ll