This is an archive of the discontinued LLVM Phabricator instance.

llvm/include/llvm/IR/IntrinsicsAArch64.td
853	Missing ImmArg markings?
llvm/lib/Target/AArch64/SVEInstrFormats.td
748	SQDECB always returns a value in a 64-bit register; why are you treating the return value as 32 bits? Even if there's some reason to prefer that form at the IR level, it doesn't seem like a good idea in isel; if you need a sign-extended value, you'll be forced to emit a redundant sign extension.

Cheers for taking a look @eli.friedman !

llvm/include/llvm/IR/IntrinsicsAArch64.td
853	Yes, good point. This means that I will have to duplicate `sve_incdec_imm` and `see_pred_enum` with `TImmLeaf` based equivalents.
llvm/lib/Target/AArch64/SVEInstrFormats.td
748	There's a 64-bit and 32-bit variant of `SQDECB`. This pattern is for the 32-bit variant, which returns 32-bit as well 64-bit result. Here we only care about the 32-bit result (because that's what the ACLE intrinsic returns). More specifically, this is meant to allow 1:1 mapping between: `int32_t svqdecb_n_s32(int32_t op, uint64_t imm_factor)` from ACLE `declare i32 @llvm.aarch64.sve.sqdecb.n32(i32, i32, i32)` IR intrinsic `sqdecb x0, w0, vl3, mul #4` SVE instruction For the 64-bit variant there's a different intrinsic: `int64_t svqdecb_n_s64(int64_t op, uint64_t imm_factor)` from ACLE `declare i64 @llvm.aarch64.sve.sqdecb.n64(i64, i32, i32)` IR intrinsic `sqdecb x0, vl4, mul #5` SVE instruction Also, this multiclass is only used for the intrnisics.

efriedma added inline comments.Dec 13 2019, 10:37 AM

llvm/lib/Target/AArch64/SVEInstrFormats.td
748	Consider something like the following: long x(int z) { return svqdecb_n_s32(z, 1); This function should lower to just a single sqdecb. The way this is written you end up with an unnecessary sxtw.

andwar marked an inline comment as done.Dec 19 2019, 10:36 AM

andwar added inline comments.

llvm/lib/Target/AArch64/SVEInstrFormats.td
748	I will add extra patterns to cater for this scenario (please check the next patch). The other option would be to rewrite this pattern so that the return value is always `i64` and then add some new ISD nodes and truncate the user requests `i32`. But the overall effect would be similar.

Add patterns for scenarios when a 64bit value is requested from an intrinsic returning a 32 bit value (so that unecessary sxtw is avoided)
Add test cases for the above
Split tests into 4 seperate files (one per instruction)
Add missing ImmArg
Rebase on top of master

Harbormaster completed remote builds in B42789: Diff 234751.Dec 19 2019, 10:42 AM

LGTM

llvm/lib/Target/AArch64/SVEInstrFormats.td
748	Okay, that works.

This revision is now accepted and ready to land.Dec 19 2019, 2:25 PM

sdesmalen accepted this revision.Dec 20 2019, 1:29 AM

Closed by commit rGbe2b7ea89ab4: [AArch64][SVE] Add intrnisics for saturating scalar arithmetic (authored by awarzynski). · Explain WhyDec 20 2019, 3:14 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

include/

llvm/

IR/

IntrinsicsAArch64.td

91 lines

lib/

Target/

AArch64/

AArch64SVEInstrInfo.td

120 lines

SVEInstrFormats.td

86 lines

test/

CodeGen/

AArch64/

sve-intrinsics-sqdec.ll

337 lines

sve-intrinsics-sqinc.ll

337 lines

sve-intrinsics-uqdec.ll

257 lines

sve-intrinsics-uqinc.ll

257 lines

Diff 234854

llvm/include/llvm/IR/IntrinsicsAArch64.td

Show First 20 Lines • Show All 833 Lines • ▼ Show 20 Lines	let TargetPrefix = "aarch64" in { // All intrinsics start with "llvm.aarch64.".

class AdvSIMD_SVE_CompareWide_Intrinsic		class AdvSIMD_SVE_CompareWide_Intrinsic
: Intrinsic<[LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],		: Intrinsic<[LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
[LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,		[LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
llvm_anyvector_ty,		llvm_anyvector_ty,
llvm_nxv2i64_ty],		llvm_nxv2i64_ty],
[IntrNoMem]>;		[IntrNoMem]>;

		class AdvSIMD_SVE_Saturating_Intrinsic
		: Intrinsic<[llvm_anyvector_ty],
		[LLVMMatchType<0>,
		LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
		[IntrNoMem]>;

		class AdvSIMD_SVE_SaturatingWithPattern_Intrinsic
		: Intrinsic<[llvm_anyvector_ty],
		[LLVMMatchType<0>,
		llvm_i32_ty,
		llvm_i32_ty],
		[IntrNoMem, ImmArg<1>, ImmArg<2>]>;
		efriedmaUnsubmitted Not Done Reply Inline Actions Missing ImmArg markings? efriedma: Missing ImmArg markings?
		andwarAuthorUnsubmitted Done Reply Inline Actions Yes, good point. This means that I will have to duplicate `sve_incdec_imm` and `see_pred_enum` with `TImmLeaf` based equivalents. andwar: Yes, good point. This means that I will have to duplicate `sve_incdec_imm` and `see_pred_enum`…

		class AdvSIMD_SVE_Saturating_N_Intrinsic<LLVMType T>
		: Intrinsic<[T],
		[T, llvm_anyvector_ty],
		[IntrNoMem]>;

		class AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<LLVMType T>
		: Intrinsic<[T],
		[T, llvm_i32_ty, llvm_i32_ty],
		[IntrNoMem, ImmArg<1>, ImmArg<2>]>;

class AdvSIMD_SVE_CNT_Intrinsic		class AdvSIMD_SVE_CNT_Intrinsic
: Intrinsic<[LLVMVectorOfBitcastsToInt<0>],		: Intrinsic<[LLVMVectorOfBitcastsToInt<0>],
[LLVMVectorOfBitcastsToInt<0>,		[LLVMVectorOfBitcastsToInt<0>,
LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,		LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>,
llvm_anyvector_ty],		llvm_anyvector_ty],
[IntrNoMem]>;		[IntrNoMem]>;

class AdvSIMD_SVE_FP_Reduce_Intrinsic		class AdvSIMD_SVE_FP_Reduce_Intrinsic
▲ Show 20 Lines • Show All 413 Lines • ▼ Show 20 Lines
def int_aarch64_sve_cntb : AdvSIMD_SVE_CNTB_Intrinsic;		def int_aarch64_sve_cntb : AdvSIMD_SVE_CNTB_Intrinsic;
def int_aarch64_sve_cnth : AdvSIMD_SVE_CNTB_Intrinsic;		def int_aarch64_sve_cnth : AdvSIMD_SVE_CNTB_Intrinsic;
def int_aarch64_sve_cntw : AdvSIMD_SVE_CNTB_Intrinsic;		def int_aarch64_sve_cntw : AdvSIMD_SVE_CNTB_Intrinsic;
def int_aarch64_sve_cntd : AdvSIMD_SVE_CNTB_Intrinsic;		def int_aarch64_sve_cntd : AdvSIMD_SVE_CNTB_Intrinsic;

def int_aarch64_sve_cntp : AdvSIMD_SVE_CNTP_Intrinsic;		def int_aarch64_sve_cntp : AdvSIMD_SVE_CNTP_Intrinsic;

//		//
		// Saturating scalar arithmetic
		//

		def int_aarch64_sve_sqdech : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_sqdecw : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_sqdecd : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_sqdecp : AdvSIMD_SVE_Saturating_Intrinsic;

		def int_aarch64_sve_sqdecb_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqdecb_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_sqdech_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqdech_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_sqdecw_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqdecw_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_sqdecd_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqdecd_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_sqdecp_n32 : AdvSIMD_SVE_Saturating_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqdecp_n64 : AdvSIMD_SVE_Saturating_N_Intrinsic<llvm_i64_ty>;

		def int_aarch64_sve_sqinch : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_sqincw : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_sqincd : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_sqincp : AdvSIMD_SVE_Saturating_Intrinsic;

		def int_aarch64_sve_sqincb_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqincb_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_sqinch_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqinch_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_sqincw_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqincw_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_sqincd_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqincd_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_sqincp_n32 : AdvSIMD_SVE_Saturating_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_sqincp_n64 : AdvSIMD_SVE_Saturating_N_Intrinsic<llvm_i64_ty>;

		def int_aarch64_sve_uqdech : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_uqdecw : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_uqdecd : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_uqdecp : AdvSIMD_SVE_Saturating_Intrinsic;

		def int_aarch64_sve_uqdecb_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqdecb_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_uqdech_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqdech_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_uqdecw_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqdecw_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_uqdecd_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqdecd_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_uqdecp_n32 : AdvSIMD_SVE_Saturating_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqdecp_n64 : AdvSIMD_SVE_Saturating_N_Intrinsic<llvm_i64_ty>;

		def int_aarch64_sve_uqinch : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_uqincw : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_uqincd : AdvSIMD_SVE_SaturatingWithPattern_Intrinsic;
		def int_aarch64_sve_uqincp : AdvSIMD_SVE_Saturating_Intrinsic;

		def int_aarch64_sve_uqincb_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqincb_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_uqinch_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqinch_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_uqincw_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqincw_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_uqincd_n32 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqincd_n64 : AdvSIMD_SVE_SaturatingWithPattern_N_Intrinsic<llvm_i64_ty>;
		def int_aarch64_sve_uqincp_n32 : AdvSIMD_SVE_Saturating_N_Intrinsic<llvm_i32_ty>;
		def int_aarch64_sve_uqincp_n64 : AdvSIMD_SVE_Saturating_N_Intrinsic<llvm_i64_ty>;

		//
// Reversal		// Reversal
//		//

def int_aarch64_sve_rbit : AdvSIMD_Merged1VectorArg_Intrinsic;		def int_aarch64_sve_rbit : AdvSIMD_Merged1VectorArg_Intrinsic;
def int_aarch64_sve_revb : AdvSIMD_Merged1VectorArg_Intrinsic;		def int_aarch64_sve_revb : AdvSIMD_Merged1VectorArg_Intrinsic;
def int_aarch64_sve_revh : AdvSIMD_Merged1VectorArg_Intrinsic;		def int_aarch64_sve_revh : AdvSIMD_Merged1VectorArg_Intrinsic;
def int_aarch64_sve_revw : AdvSIMD_Merged1VectorArg_Intrinsic;		def int_aarch64_sve_revw : AdvSIMD_Merged1VectorArg_Intrinsic;

▲ Show 20 Lines • Show All 352 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td

Show First 20 Lines • Show All 846 Lines • ▼ Show 20 Lines	let Predicates = [HasSVE] in {
defm DECB_XPiI : sve_int_pred_pattern_a<0b001, "decb">;		defm DECB_XPiI : sve_int_pred_pattern_a<0b001, "decb">;
defm INCH_XPiI : sve_int_pred_pattern_a<0b010, "inch">;		defm INCH_XPiI : sve_int_pred_pattern_a<0b010, "inch">;
defm DECH_XPiI : sve_int_pred_pattern_a<0b011, "dech">;		defm DECH_XPiI : sve_int_pred_pattern_a<0b011, "dech">;
defm INCW_XPiI : sve_int_pred_pattern_a<0b100, "incw">;		defm INCW_XPiI : sve_int_pred_pattern_a<0b100, "incw">;
defm DECW_XPiI : sve_int_pred_pattern_a<0b101, "decw">;		defm DECW_XPiI : sve_int_pred_pattern_a<0b101, "decw">;
defm INCD_XPiI : sve_int_pred_pattern_a<0b110, "incd">;		defm INCD_XPiI : sve_int_pred_pattern_a<0b110, "incd">;
defm DECD_XPiI : sve_int_pred_pattern_a<0b111, "decd">;		defm DECD_XPiI : sve_int_pred_pattern_a<0b111, "decd">;

defm SQINCB_XPiWdI : sve_int_pred_pattern_b_s32<0b00000, "sqincb">;		defm SQINCB_XPiWdI : sve_int_pred_pattern_b_s32<0b00000, "sqincb", int_aarch64_sve_sqincb_n32>;
defm UQINCB_WPiI : sve_int_pred_pattern_b_u32<0b00001, "uqincb">;		defm UQINCB_WPiI : sve_int_pred_pattern_b_u32<0b00001, "uqincb", int_aarch64_sve_uqincb_n32>;
defm SQDECB_XPiWdI : sve_int_pred_pattern_b_s32<0b00010, "sqdecb">;		defm SQDECB_XPiWdI : sve_int_pred_pattern_b_s32<0b00010, "sqdecb", int_aarch64_sve_sqdecb_n32>;
defm UQDECB_WPiI : sve_int_pred_pattern_b_u32<0b00011, "uqdecb">;		defm UQDECB_WPiI : sve_int_pred_pattern_b_u32<0b00011, "uqdecb", int_aarch64_sve_uqdecb_n32>;
defm SQINCB_XPiI : sve_int_pred_pattern_b_x64<0b00100, "sqincb">;		defm SQINCB_XPiI : sve_int_pred_pattern_b_x64<0b00100, "sqincb", int_aarch64_sve_sqincb_n64>;
defm UQINCB_XPiI : sve_int_pred_pattern_b_x64<0b00101, "uqincb">;		defm UQINCB_XPiI : sve_int_pred_pattern_b_x64<0b00101, "uqincb", int_aarch64_sve_uqincb_n64>;
defm SQDECB_XPiI : sve_int_pred_pattern_b_x64<0b00110, "sqdecb">;		defm SQDECB_XPiI : sve_int_pred_pattern_b_x64<0b00110, "sqdecb", int_aarch64_sve_sqdecb_n64>;
defm UQDECB_XPiI : sve_int_pred_pattern_b_x64<0b00111, "uqdecb">;		defm UQDECB_XPiI : sve_int_pred_pattern_b_x64<0b00111, "uqdecb", int_aarch64_sve_uqdecb_n64>;

defm SQINCH_XPiWdI : sve_int_pred_pattern_b_s32<0b01000, "sqinch">;		defm SQINCH_XPiWdI : sve_int_pred_pattern_b_s32<0b01000, "sqinch", int_aarch64_sve_sqinch_n32>;
defm UQINCH_WPiI : sve_int_pred_pattern_b_u32<0b01001, "uqinch">;		defm UQINCH_WPiI : sve_int_pred_pattern_b_u32<0b01001, "uqinch", int_aarch64_sve_uqinch_n32>;
defm SQDECH_XPiWdI : sve_int_pred_pattern_b_s32<0b01010, "sqdech">;		defm SQDECH_XPiWdI : sve_int_pred_pattern_b_s32<0b01010, "sqdech", int_aarch64_sve_sqdech_n32>;
defm UQDECH_WPiI : sve_int_pred_pattern_b_u32<0b01011, "uqdech">;		defm UQDECH_WPiI : sve_int_pred_pattern_b_u32<0b01011, "uqdech", int_aarch64_sve_uqdech_n32>;
defm SQINCH_XPiI : sve_int_pred_pattern_b_x64<0b01100, "sqinch">;		defm SQINCH_XPiI : sve_int_pred_pattern_b_x64<0b01100, "sqinch", int_aarch64_sve_sqinch_n64>;
defm UQINCH_XPiI : sve_int_pred_pattern_b_x64<0b01101, "uqinch">;		defm UQINCH_XPiI : sve_int_pred_pattern_b_x64<0b01101, "uqinch", int_aarch64_sve_uqinch_n64>;
defm SQDECH_XPiI : sve_int_pred_pattern_b_x64<0b01110, "sqdech">;		defm SQDECH_XPiI : sve_int_pred_pattern_b_x64<0b01110, "sqdech", int_aarch64_sve_sqdech_n64>;
defm UQDECH_XPiI : sve_int_pred_pattern_b_x64<0b01111, "uqdech">;		defm UQDECH_XPiI : sve_int_pred_pattern_b_x64<0b01111, "uqdech", int_aarch64_sve_uqdech_n64>;

defm SQINCW_XPiWdI : sve_int_pred_pattern_b_s32<0b10000, "sqincw">;		defm SQINCW_XPiWdI : sve_int_pred_pattern_b_s32<0b10000, "sqincw", int_aarch64_sve_sqincw_n32>;
defm UQINCW_WPiI : sve_int_pred_pattern_b_u32<0b10001, "uqincw">;		defm UQINCW_WPiI : sve_int_pred_pattern_b_u32<0b10001, "uqincw", int_aarch64_sve_uqincw_n32>;
defm SQDECW_XPiWdI : sve_int_pred_pattern_b_s32<0b10010, "sqdecw">;		defm SQDECW_XPiWdI : sve_int_pred_pattern_b_s32<0b10010, "sqdecw", int_aarch64_sve_sqdecw_n32>;
defm UQDECW_WPiI : sve_int_pred_pattern_b_u32<0b10011, "uqdecw">;		defm UQDECW_WPiI : sve_int_pred_pattern_b_u32<0b10011, "uqdecw", int_aarch64_sve_uqdecw_n32>;
defm SQINCW_XPiI : sve_int_pred_pattern_b_x64<0b10100, "sqincw">;		defm SQINCW_XPiI : sve_int_pred_pattern_b_x64<0b10100, "sqincw", int_aarch64_sve_sqincw_n64>;
defm UQINCW_XPiI : sve_int_pred_pattern_b_x64<0b10101, "uqincw">;		defm UQINCW_XPiI : sve_int_pred_pattern_b_x64<0b10101, "uqincw", int_aarch64_sve_uqincw_n64>;
defm SQDECW_XPiI : sve_int_pred_pattern_b_x64<0b10110, "sqdecw">;		defm SQDECW_XPiI : sve_int_pred_pattern_b_x64<0b10110, "sqdecw", int_aarch64_sve_sqdecw_n64>;
defm UQDECW_XPiI : sve_int_pred_pattern_b_x64<0b10111, "uqdecw">;		defm UQDECW_XPiI : sve_int_pred_pattern_b_x64<0b10111, "uqdecw", int_aarch64_sve_uqdecw_n64>;

defm SQINCD_XPiWdI : sve_int_pred_pattern_b_s32<0b11000, "sqincd">;		defm SQINCD_XPiWdI : sve_int_pred_pattern_b_s32<0b11000, "sqincd", int_aarch64_sve_sqincd_n32>;
defm UQINCD_WPiI : sve_int_pred_pattern_b_u32<0b11001, "uqincd">;		defm UQINCD_WPiI : sve_int_pred_pattern_b_u32<0b11001, "uqincd", int_aarch64_sve_uqincd_n32>;
defm SQDECD_XPiWdI : sve_int_pred_pattern_b_s32<0b11010, "sqdecd">;		defm SQDECD_XPiWdI : sve_int_pred_pattern_b_s32<0b11010, "sqdecd", int_aarch64_sve_sqdecd_n32>;
defm UQDECD_WPiI : sve_int_pred_pattern_b_u32<0b11011, "uqdecd">;		defm UQDECD_WPiI : sve_int_pred_pattern_b_u32<0b11011, "uqdecd", int_aarch64_sve_uqdecd_n32>;
defm SQINCD_XPiI : sve_int_pred_pattern_b_x64<0b11100, "sqincd">;		defm SQINCD_XPiI : sve_int_pred_pattern_b_x64<0b11100, "sqincd", int_aarch64_sve_sqincd_n64>;
defm UQINCD_XPiI : sve_int_pred_pattern_b_x64<0b11101, "uqincd">;		defm UQINCD_XPiI : sve_int_pred_pattern_b_x64<0b11101, "uqincd", int_aarch64_sve_uqincd_n64>;
defm SQDECD_XPiI : sve_int_pred_pattern_b_x64<0b11110, "sqdecd">;		defm SQDECD_XPiI : sve_int_pred_pattern_b_x64<0b11110, "sqdecd", int_aarch64_sve_sqdecd_n64>;
defm UQDECD_XPiI : sve_int_pred_pattern_b_x64<0b11111, "uqdecd">;		defm UQDECD_XPiI : sve_int_pred_pattern_b_x64<0b11111, "uqdecd", int_aarch64_sve_uqdecd_n64>;

defm SQINCH_ZPiI : sve_int_countvlv<0b01000, "sqinch", ZPR16>;		defm SQINCH_ZPiI : sve_int_countvlv<0b01000, "sqinch", ZPR16, int_aarch64_sve_sqinch, nxv8i16>;
defm UQINCH_ZPiI : sve_int_countvlv<0b01001, "uqinch", ZPR16>;		defm UQINCH_ZPiI : sve_int_countvlv<0b01001, "uqinch", ZPR16, int_aarch64_sve_uqinch, nxv8i16>;
defm SQDECH_ZPiI : sve_int_countvlv<0b01010, "sqdech", ZPR16>;		defm SQDECH_ZPiI : sve_int_countvlv<0b01010, "sqdech", ZPR16, int_aarch64_sve_sqdech, nxv8i16>;
defm UQDECH_ZPiI : sve_int_countvlv<0b01011, "uqdech", ZPR16>;		defm UQDECH_ZPiI : sve_int_countvlv<0b01011, "uqdech", ZPR16, int_aarch64_sve_uqdech, nxv8i16>;
defm INCH_ZPiI : sve_int_countvlv<0b01100, "inch", ZPR16>;		defm INCH_ZPiI : sve_int_countvlv<0b01100, "inch", ZPR16>;
defm DECH_ZPiI : sve_int_countvlv<0b01101, "dech", ZPR16>;		defm DECH_ZPiI : sve_int_countvlv<0b01101, "dech", ZPR16>;
defm SQINCW_ZPiI : sve_int_countvlv<0b10000, "sqincw", ZPR32>;		defm SQINCW_ZPiI : sve_int_countvlv<0b10000, "sqincw", ZPR32, int_aarch64_sve_sqincw, nxv4i32>;
defm UQINCW_ZPiI : sve_int_countvlv<0b10001, "uqincw", ZPR32>;		defm UQINCW_ZPiI : sve_int_countvlv<0b10001, "uqincw", ZPR32, int_aarch64_sve_uqincw, nxv4i32>;
defm SQDECW_ZPiI : sve_int_countvlv<0b10010, "sqdecw", ZPR32>;		defm SQDECW_ZPiI : sve_int_countvlv<0b10010, "sqdecw", ZPR32, int_aarch64_sve_sqdecw, nxv4i32>;
defm UQDECW_ZPiI : sve_int_countvlv<0b10011, "uqdecw", ZPR32>;		defm UQDECW_ZPiI : sve_int_countvlv<0b10011, "uqdecw", ZPR32, int_aarch64_sve_uqdecw, nxv4i32>;
defm INCW_ZPiI : sve_int_countvlv<0b10100, "incw", ZPR32>;		defm INCW_ZPiI : sve_int_countvlv<0b10100, "incw", ZPR32>;
defm DECW_ZPiI : sve_int_countvlv<0b10101, "decw", ZPR32>;		defm DECW_ZPiI : sve_int_countvlv<0b10101, "decw", ZPR32>;
defm SQINCD_ZPiI : sve_int_countvlv<0b11000, "sqincd", ZPR64>;		defm SQINCD_ZPiI : sve_int_countvlv<0b11000, "sqincd", ZPR64, int_aarch64_sve_sqincd, nxv2i64>;
defm UQINCD_ZPiI : sve_int_countvlv<0b11001, "uqincd", ZPR64>;		defm UQINCD_ZPiI : sve_int_countvlv<0b11001, "uqincd", ZPR64, int_aarch64_sve_uqincd, nxv2i64>;
defm SQDECD_ZPiI : sve_int_countvlv<0b11010, "sqdecd", ZPR64>;		defm SQDECD_ZPiI : sve_int_countvlv<0b11010, "sqdecd", ZPR64, int_aarch64_sve_sqdecd, nxv2i64>;
defm UQDECD_ZPiI : sve_int_countvlv<0b11011, "uqdecd", ZPR64>;		defm UQDECD_ZPiI : sve_int_countvlv<0b11011, "uqdecd", ZPR64, int_aarch64_sve_uqdecd, nxv2i64>;
defm INCD_ZPiI : sve_int_countvlv<0b11100, "incd", ZPR64>;		defm INCD_ZPiI : sve_int_countvlv<0b11100, "incd", ZPR64>;
defm DECD_ZPiI : sve_int_countvlv<0b11101, "decd", ZPR64>;		defm DECD_ZPiI : sve_int_countvlv<0b11101, "decd", ZPR64>;

defm SQINCP_XPWd : sve_int_count_r_s32<0b00000, "sqincp">;		defm SQINCP_XPWd : sve_int_count_r_s32<0b00000, "sqincp", int_aarch64_sve_sqincp_n32>;
defm SQINCP_XP : sve_int_count_r_x64<0b00010, "sqincp">;		defm SQINCP_XP : sve_int_count_r_x64<0b00010, "sqincp", int_aarch64_sve_sqincp_n64>;
defm UQINCP_WP : sve_int_count_r_u32<0b00100, "uqincp">;		defm UQINCP_WP : sve_int_count_r_u32<0b00100, "uqincp", int_aarch64_sve_uqincp_n32>;
defm UQINCP_XP : sve_int_count_r_x64<0b00110, "uqincp">;		defm UQINCP_XP : sve_int_count_r_x64<0b00110, "uqincp", int_aarch64_sve_uqincp_n64>;
defm SQDECP_XPWd : sve_int_count_r_s32<0b01000, "sqdecp">;		defm SQDECP_XPWd : sve_int_count_r_s32<0b01000, "sqdecp", int_aarch64_sve_sqdecp_n32>;
defm SQDECP_XP : sve_int_count_r_x64<0b01010, "sqdecp">;		defm SQDECP_XP : sve_int_count_r_x64<0b01010, "sqdecp", int_aarch64_sve_sqdecp_n64>;
defm UQDECP_WP : sve_int_count_r_u32<0b01100, "uqdecp">;		defm UQDECP_WP : sve_int_count_r_u32<0b01100, "uqdecp", int_aarch64_sve_uqdecp_n32>;
defm UQDECP_XP : sve_int_count_r_x64<0b01110, "uqdecp">;		defm UQDECP_XP : sve_int_count_r_x64<0b01110, "uqdecp", int_aarch64_sve_uqdecp_n64>;
defm INCP_XP : sve_int_count_r_x64<0b10000, "incp">;		defm INCP_XP : sve_int_count_r_x64<0b10000, "incp">;
defm DECP_XP : sve_int_count_r_x64<0b10100, "decp">;		defm DECP_XP : sve_int_count_r_x64<0b10100, "decp">;

defm SQINCP_ZP : sve_int_count_v<0b00000, "sqincp">;		defm SQINCP_ZP : sve_int_count_v<0b00000, "sqincp", int_aarch64_sve_sqincp>;
defm UQINCP_ZP : sve_int_count_v<0b00100, "uqincp">;		defm UQINCP_ZP : sve_int_count_v<0b00100, "uqincp", int_aarch64_sve_uqincp>;
defm SQDECP_ZP : sve_int_count_v<0b01000, "sqdecp">;		defm SQDECP_ZP : sve_int_count_v<0b01000, "sqdecp", int_aarch64_sve_sqdecp>;
defm UQDECP_ZP : sve_int_count_v<0b01100, "uqdecp">;		defm UQDECP_ZP : sve_int_count_v<0b01100, "uqdecp", int_aarch64_sve_uqdecp>;
defm INCP_ZP : sve_int_count_v<0b10000, "incp">;		defm INCP_ZP : sve_int_count_v<0b10000, "incp">;
defm DECP_ZP : sve_int_count_v<0b10100, "decp">;		defm DECP_ZP : sve_int_count_v<0b10100, "decp">;

defm INDEX_RR : sve_int_index_rr<"index">;		defm INDEX_RR : sve_int_index_rr<"index">;
defm INDEX_IR : sve_int_index_ir<"index">;		defm INDEX_IR : sve_int_index_ir<"index">;
defm INDEX_RI : sve_int_index_ri<"index">;		defm INDEX_RI : sve_int_index_ri<"index">;
defm INDEX_II : sve_int_index_ii<"index">;		defm INDEX_II : sve_int_index_ii<"index">;

▲ Show 20 Lines • Show All 697 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/SVEInstrFormats.td

Show First 20 Lines • Show All 230 Lines • ▼ Show 20 Lines	: SVEExactFPImmOperand<"HalfOne", "AArch64ExactFPImm::half",
"AArch64ExactFPImm::one">;		"AArch64ExactFPImm::one">;
def sve_fpimm_half_two		def sve_fpimm_half_two
: SVEExactFPImmOperand<"HalfTwo", "AArch64ExactFPImm::half",		: SVEExactFPImmOperand<"HalfTwo", "AArch64ExactFPImm::half",
"AArch64ExactFPImm::two">;		"AArch64ExactFPImm::two">;
def sve_fpimm_zero_one		def sve_fpimm_zero_one
: SVEExactFPImmOperand<"ZeroOne", "AArch64ExactFPImm::zero",		: SVEExactFPImmOperand<"ZeroOne", "AArch64ExactFPImm::zero",
"AArch64ExactFPImm::one">;		"AArch64ExactFPImm::one">;

def sve_incdec_imm : Operand<i32>, ImmLeaf<i32, [{		def sve_incdec_imm : Operand<i32>, TImmLeaf<i32, [{
return (((uint32_t)Imm) > 0) && (((uint32_t)Imm) < 17);		return (((uint32_t)Imm) > 0) && (((uint32_t)Imm) < 17);
}]> {		}]> {
let ParserMatchClass = Imm1_16Operand;		let ParserMatchClass = Imm1_16Operand;
let EncoderMethod = "getSVEIncDecImm";		let EncoderMethod = "getSVEIncDecImm";
let DecoderMethod = "DecodeSVEIncDecImm";		let DecoderMethod = "DecodeSVEIncDecImm";
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 217 Lines • ▼ Show 20 Lines	: I<(outs dty:$Rdn), (ins pprty:$Pg, sty:$_Rdn),

// Signed 32bit forms require their GPR operand printed.		// Signed 32bit forms require their GPR operand printed.
let AsmString = !if(!eq(opc{4,2-0}, 0b0000),		let AsmString = !if(!eq(opc{4,2-0}, 0b0000),
!strconcat(asm, "\t$Rdn, $Pg, $_Rdn"),		!strconcat(asm, "\t$Rdn, $Pg, $_Rdn"),
!strconcat(asm, "\t$Rdn, $Pg"));		!strconcat(asm, "\t$Rdn, $Pg"));
let Constraints = "$Rdn = $_Rdn";		let Constraints = "$Rdn = $_Rdn";
}		}

multiclass sve_int_count_r_s32<bits<5> opc, string asm> {		multiclass sve_int_count_r_s32<bits<5> opc, string asm,
		SDPatternOperator op> {
def _B : sve_int_count_r<0b00, opc, asm, GPR64z, PPR8, GPR64as32>;		def _B : sve_int_count_r<0b00, opc, asm, GPR64z, PPR8, GPR64as32>;
def _H : sve_int_count_r<0b01, opc, asm, GPR64z, PPR16, GPR64as32>;		def _H : sve_int_count_r<0b01, opc, asm, GPR64z, PPR16, GPR64as32>;
def _S : sve_int_count_r<0b10, opc, asm, GPR64z, PPR32, GPR64as32>;		def _S : sve_int_count_r<0b10, opc, asm, GPR64z, PPR32, GPR64as32>;
def _D : sve_int_count_r<0b11, opc, asm, GPR64z, PPR64, GPR64as32>;		def _D : sve_int_count_r<0b11, opc, asm, GPR64z, PPR64, GPR64as32>;

		def : Pat<(i32 (op GPR32:$Rn, (nxv16i1 PPRAny:$Pg))),
		(EXTRACT_SUBREG (!cast<Instruction>(NAME # _B) PPRAny:$Pg, (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32)), sub_32)>;
		def : Pat<(i64 (sext (i32 (op GPR32:$Rn, (nxv16i1 PPRAny:$Pg))))),
		(!cast<Instruction>(NAME # _B) PPRAny:$Pg, (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32))>;

		def : Pat<(i32 (op GPR32:$Rn, (nxv8i1 PPRAny:$Pg))),
		(EXTRACT_SUBREG (!cast<Instruction>(NAME # _H) PPRAny:$Pg, (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32)), sub_32)>;
		def : Pat<(i64 (sext (i32 (op GPR32:$Rn, (nxv8i1 PPRAny:$Pg))))),
		(!cast<Instruction>(NAME # _H) PPRAny:$Pg, (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32))>;

		def : Pat<(i32 (op GPR32:$Rn, (nxv4i1 PPRAny:$Pg))),
		(EXTRACT_SUBREG (!cast<Instruction>(NAME # _S) PPRAny:$Pg, (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32)), sub_32)>;
		def : Pat<(i64 (sext (i32 (op GPR32:$Rn, (nxv4i1 PPRAny:$Pg))))),
		(!cast<Instruction>(NAME # _S) PPRAny:$Pg, (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32))>;

		def : Pat<(i32 (op GPR32:$Rn, (nxv2i1 PPRAny:$Pg))),
		(EXTRACT_SUBREG (!cast<Instruction>(NAME # _D) PPRAny:$Pg, (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32)), sub_32)>;
		def : Pat<(i64 (sext (i32 (op GPR32:$Rn, (nxv2i1 PPRAny:$Pg))))),
		(!cast<Instruction>(NAME # _D) PPRAny:$Pg, (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32))>;
}		}

multiclass sve_int_count_r_u32<bits<5> opc, string asm> {		multiclass sve_int_count_r_u32<bits<5> opc, string asm,
		SDPatternOperator op> {
def _B : sve_int_count_r<0b00, opc, asm, GPR32z, PPR8, GPR32z>;		def _B : sve_int_count_r<0b00, opc, asm, GPR32z, PPR8, GPR32z>;
def _H : sve_int_count_r<0b01, opc, asm, GPR32z, PPR16, GPR32z>;		def _H : sve_int_count_r<0b01, opc, asm, GPR32z, PPR16, GPR32z>;
def _S : sve_int_count_r<0b10, opc, asm, GPR32z, PPR32, GPR32z>;		def _S : sve_int_count_r<0b10, opc, asm, GPR32z, PPR32, GPR32z>;
def _D : sve_int_count_r<0b11, opc, asm, GPR32z, PPR64, GPR32z>;		def _D : sve_int_count_r<0b11, opc, asm, GPR32z, PPR64, GPR32z>;

		def : Pat<(i32 (op GPR32:$Rn, (nxv16i1 PPRAny:$Pg))),
		(!cast<Instruction>(NAME # _B) PPRAny:$Pg, $Rn)>;
		def : Pat<(i32 (op GPR32:$Rn, (nxv8i1 PPRAny:$Pg))),
		(!cast<Instruction>(NAME # _H) PPRAny:$Pg, $Rn)>;
		def : Pat<(i32 (op GPR32:$Rn, (nxv4i1 PPRAny:$Pg))),
		(!cast<Instruction>(NAME # _S) PPRAny:$Pg, $Rn)>;
		def : Pat<(i32 (op GPR32:$Rn, (nxv2i1 PPRAny:$Pg))),
		(!cast<Instruction>(NAME # _D) PPRAny:$Pg, $Rn)>;
}		}

multiclass sve_int_count_r_x64<bits<5> opc, string asm> {		multiclass sve_int_count_r_x64<bits<5> opc, string asm,
		SDPatternOperator op = null_frag> {
def _B : sve_int_count_r<0b00, opc, asm, GPR64z, PPR8, GPR64z>;		def _B : sve_int_count_r<0b00, opc, asm, GPR64z, PPR8, GPR64z>;
def _H : sve_int_count_r<0b01, opc, asm, GPR64z, PPR16, GPR64z>;		def _H : sve_int_count_r<0b01, opc, asm, GPR64z, PPR16, GPR64z>;
def _S : sve_int_count_r<0b10, opc, asm, GPR64z, PPR32, GPR64z>;		def _S : sve_int_count_r<0b10, opc, asm, GPR64z, PPR32, GPR64z>;
def _D : sve_int_count_r<0b11, opc, asm, GPR64z, PPR64, GPR64z>;		def _D : sve_int_count_r<0b11, opc, asm, GPR64z, PPR64, GPR64z>;

		def : Pat<(i64 (op GPR64:$Rn, (nxv16i1 PPRAny:$Pg))),
		(!cast<Instruction>(NAME # _B) PPRAny:$Pg, $Rn)>;
		def : Pat<(i64 (op GPR64:$Rn, (nxv8i1 PPRAny:$Pg))),
		(!cast<Instruction>(NAME # _H) PPRAny:$Pg, $Rn)>;
		def : Pat<(i64 (op GPR64:$Rn, (nxv4i1 PPRAny:$Pg))),
		(!cast<Instruction>(NAME # _S) PPRAny:$Pg, $Rn)>;
		def : Pat<(i64 (op GPR64:$Rn, (nxv2i1 PPRAny:$Pg))),
		(!cast<Instruction>(NAME # _D) PPRAny:$Pg, $Rn)>;
}		}

class sve_int_count_v<bits<2> sz8_64, bits<5> opc, string asm,		class sve_int_count_v<bits<2> sz8_64, bits<5> opc, string asm,
ZPRRegOp zprty, PPRRegOp pprty>		ZPRRegOp zprty, PPRRegOp pprty>
: I<(outs zprty:$Zdn), (ins zprty:$_Zdn, pprty:$Pm),		: I<(outs zprty:$Zdn), (ins zprty:$_Zdn, pprty:$Pm),
asm, "\t$Zdn, $Pm",		asm, "\t$Zdn, $Pm",
"",		"",
[]>, Sched<[]> {		[]>, Sched<[]> {
bits<4> Pm;		bits<4> Pm;
bits<5> Zdn;		bits<5> Zdn;
let Inst{31-24} = 0b00100101;		let Inst{31-24} = 0b00100101;
let Inst{23-22} = sz8_64;		let Inst{23-22} = sz8_64;
let Inst{21-19} = 0b101;		let Inst{21-19} = 0b101;
let Inst{18-16} = opc{4-2};		let Inst{18-16} = opc{4-2};
let Inst{15-11} = 0b10000;		let Inst{15-11} = 0b10000;
let Inst{10-9} = opc{1-0};		let Inst{10-9} = opc{1-0};
let Inst{8-5} = Pm;		let Inst{8-5} = Pm;
let Inst{4-0} = Zdn;		let Inst{4-0} = Zdn;

let Constraints = "$Zdn = $_Zdn";		let Constraints = "$Zdn = $_Zdn";
let DestructiveInstType = Destructive;		let DestructiveInstType = Destructive;
let ElementSize = ElementSizeNone;		let ElementSize = ElementSizeNone;
}		}

multiclass sve_int_count_v<bits<5> opc, string asm> {		multiclass sve_int_count_v<bits<5> opc, string asm,
		SDPatternOperator op = null_frag> {
def _H : sve_int_count_v<0b01, opc, asm, ZPR16, PPR16>;		def _H : sve_int_count_v<0b01, opc, asm, ZPR16, PPR16>;
def _S : sve_int_count_v<0b10, opc, asm, ZPR32, PPR32>;		def _S : sve_int_count_v<0b10, opc, asm, ZPR32, PPR32>;
def _D : sve_int_count_v<0b11, opc, asm, ZPR64, PPR64>;		def _D : sve_int_count_v<0b11, opc, asm, ZPR64, PPR64>;

		def : SVE_2_Op_Pat<nxv8i16, op, nxv8i16, nxv8i1, !cast<Instruction>(NAME # _H)>;
		def : SVE_2_Op_Pat<nxv4i32, op, nxv4i32, nxv4i1, !cast<Instruction>(NAME # _S)>;
		def : SVE_2_Op_Pat<nxv2i64, op, nxv2i64, nxv2i1, !cast<Instruction>(NAME # _D)>;

def : InstAlias<asm # "\t$Zdn, $Pm",		def : InstAlias<asm # "\t$Zdn, $Pm",
(!cast<Instruction>(NAME # "_H") ZPR16:$Zdn, PPRAny:$Pm), 0>;		(!cast<Instruction>(NAME # "_H") ZPR16:$Zdn, PPRAny:$Pm), 0>;
def : InstAlias<asm # "\t$Zdn, $Pm",		def : InstAlias<asm # "\t$Zdn, $Pm",
(!cast<Instruction>(NAME # "_S") ZPR32:$Zdn, PPRAny:$Pm), 0>;		(!cast<Instruction>(NAME # "_S") ZPR32:$Zdn, PPRAny:$Pm), 0>;
def : InstAlias<asm # "\t$Zdn, $Pm",		def : InstAlias<asm # "\t$Zdn, $Pm",
(!cast<Instruction>(NAME # "_D") ZPR64:$Zdn, PPRAny:$Pm), 0>;		(!cast<Instruction>(NAME # "_D") ZPR64:$Zdn, PPRAny:$Pm), 0>;
}		}

▲ Show 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	: I<(outs zprty:$Zdn), (ins zprty:$_Zdn, sve_pred_enum:$pattern, sve_incdec_imm:$imm4),
let Inst{9-5} = pattern;		let Inst{9-5} = pattern;
let Inst{4-0} = Zdn;		let Inst{4-0} = Zdn;

let Constraints = "$Zdn = $_Zdn";		let Constraints = "$Zdn = $_Zdn";
let DestructiveInstType = Destructive;		let DestructiveInstType = Destructive;
let ElementSize = ElementSizeNone;		let ElementSize = ElementSizeNone;
}		}

multiclass sve_int_countvlv<bits<5> opc, string asm, ZPRRegOp zprty> {		multiclass sve_int_countvlv<bits<5> opc, string asm, ZPRRegOp zprty,
		SDPatternOperator op = null_frag,
		ValueType vt = OtherVT> {
def NAME : sve_int_countvlv<opc, asm, zprty>;		def NAME : sve_int_countvlv<opc, asm, zprty>;

def : InstAlias<asm # "\t$Zdn, $pattern",		def : InstAlias<asm # "\t$Zdn, $pattern",
(!cast<Instruction>(NAME) zprty:$Zdn, sve_pred_enum:$pattern, 1), 1>;		(!cast<Instruction>(NAME) zprty:$Zdn, sve_pred_enum:$pattern, 1), 1>;
def : InstAlias<asm # "\t$Zdn",		def : InstAlias<asm # "\t$Zdn",
(!cast<Instruction>(NAME) zprty:$Zdn, 0b11111, 1), 2>;		(!cast<Instruction>(NAME) zprty:$Zdn, 0b11111, 1), 2>;

		def : Pat<(vt (op (vt zprty:$Zn), (sve_pred_enum:$pattern), (sve_incdec_imm:$imm4))),
		(!cast<Instruction>(NAME) $Zn, sve_pred_enum:$pattern, sve_incdec_imm:$imm4)>;
}		}

class sve_int_pred_pattern_a<bits<3> opc, string asm>		class sve_int_pred_pattern_a<bits<3> opc, string asm>
: I<(outs GPR64:$Rdn), (ins GPR64:$_Rdn, sve_pred_enum:$pattern, sve_incdec_imm:$imm4),		: I<(outs GPR64:$Rdn), (ins GPR64:$_Rdn, sve_pred_enum:$pattern, sve_incdec_imm:$imm4),
asm, "\t$Rdn, $pattern, mul $imm4",		asm, "\t$Rdn, $pattern, mul $imm4",
"",		"",
[]>, Sched<[]> {		[]>, Sched<[]> {
bits<5> Rdn;		bits<5> Rdn;
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	: I<(outs dt:$Rdn), (ins st:$_Rdn, sve_pred_enum:$pattern, sve_incdec_imm:$imm4),
// Signed 32bit forms require their GPR operand printed.		// Signed 32bit forms require their GPR operand printed.
let AsmString = !if(!eq(opc{2,0}, 0b00),		let AsmString = !if(!eq(opc{2,0}, 0b00),
!strconcat(asm, "\t$Rdn, $_Rdn, $pattern, mul $imm4"),		!strconcat(asm, "\t$Rdn, $_Rdn, $pattern, mul $imm4"),
!strconcat(asm, "\t$Rdn, $pattern, mul $imm4"));		!strconcat(asm, "\t$Rdn, $pattern, mul $imm4"));

let Constraints = "$Rdn = $_Rdn";		let Constraints = "$Rdn = $_Rdn";
}		}

multiclass sve_int_pred_pattern_b_s32<bits<5> opc, string asm> {		multiclass sve_int_pred_pattern_b_s32<bits<5> opc, string asm,
		SDPatternOperator op> {
def NAME : sve_int_pred_pattern_b<opc, asm, GPR64z, GPR64as32>;		def NAME : sve_int_pred_pattern_b<opc, asm, GPR64z, GPR64as32>;

def : InstAlias<asm # "\t$Rd, $Rn, $pattern",		def : InstAlias<asm # "\t$Rd, $Rn, $pattern",
(!cast<Instruction>(NAME) GPR64z:$Rd, GPR64as32:$Rn, sve_pred_enum:$pattern, 1), 1>;		(!cast<Instruction>(NAME) GPR64z:$Rd, GPR64as32:$Rn, sve_pred_enum:$pattern, 1), 1>;
def : InstAlias<asm # "\t$Rd, $Rn",		def : InstAlias<asm # "\t$Rd, $Rn",
(!cast<Instruction>(NAME) GPR64z:$Rd, GPR64as32:$Rn, 0b11111, 1), 2>;		(!cast<Instruction>(NAME) GPR64z:$Rd, GPR64as32:$Rn, 0b11111, 1), 2>;

		// NOTE: Register allocation doesn't like tied operands of differing register
		// class, hence the extra INSERT_SUBREG complication.

		def : Pat<(i32 (op GPR32:$Rn, (sve_pred_enum:$pattern), (sve_incdec_imm:$imm4))),
		(EXTRACT_SUBREG (!cast<Instruction>(NAME) (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32), sve_pred_enum:$pattern, sve_incdec_imm:$imm4), sub_32)>;
		efriedmaUnsubmitted Not Done Reply Inline Actions SQDECB always returns a value in a 64-bit register; why are you treating the return value as 32 bits? Even if there's some reason to prefer that form at the IR level, it doesn't seem like a good idea in isel; if you need a sign-extended value, you'll be forced to emit a redundant sign extension. efriedma: SQDECB always returns a value in a 64-bit register; why are you treating the return value as 32…
		andwarAuthorUnsubmitted Done Reply Inline Actions There's a 64-bit and 32-bit variant of `SQDECB`. This pattern is for the 32-bit variant, which returns 32-bit as well 64-bit result. Here we only care about the 32-bit result (because that's what the ACLE intrinsic returns). More specifically, this is meant to allow 1:1 mapping between: `int32_t svqdecb_n_s32(int32_t op, uint64_t imm_factor)` from ACLE `declare i32 @llvm.aarch64.sve.sqdecb.n32(i32, i32, i32)` IR intrinsic `sqdecb x0, w0, vl3, mul #4` SVE instruction For the 64-bit variant there's a different intrinsic: `int64_t svqdecb_n_s64(int64_t op, uint64_t imm_factor)` from ACLE `declare i64 @llvm.aarch64.sve.sqdecb.n64(i64, i32, i32)` IR intrinsic `sqdecb x0, vl4, mul #5` SVE instruction Also, this multiclass is only used for the intrnisics. andwar: There's a 64-bit and 32-bit variant of `SQDECB`. This pattern is for the 32-bit variant, which…
		efriedmaUnsubmitted Not Done Reply Inline Actions Consider something like the following: long x(int z) { return svqdecb_n_s32(z, 1); This function should lower to just a single sqdecb. The way this is written you end up with an unnecessary sxtw. efriedma: Consider something like the following: ``` long x(int z) { return svqdecb_n_s32(z, 1); ```…
		andwarAuthorUnsubmitted Done Reply Inline Actions I will add extra patterns to cater for this scenario (please check the next patch). The other option would be to rewrite this pattern so that the return value is always `i64` and then add some new ISD nodes and truncate the user requests `i32`. But the overall effect would be similar. andwar: I will add extra patterns to cater for this scenario (please check the next patch). The other…
		efriedmaUnsubmitted Not Done Reply Inline Actions Okay, that works. efriedma: Okay, that works.
		def : Pat<(i64 (sext (i32 (op GPR32:$Rn, (sve_pred_enum:$pattern), (sve_incdec_imm:$imm4))))),
		(!cast<Instruction>(NAME) (INSERT_SUBREG (IMPLICIT_DEF), $Rn, sub_32), sve_pred_enum:$pattern, sve_incdec_imm:$imm4)>;
}		}

multiclass sve_int_pred_pattern_b_u32<bits<5> opc, string asm> {		multiclass sve_int_pred_pattern_b_u32<bits<5> opc, string asm,
		SDPatternOperator op> {
def NAME : sve_int_pred_pattern_b<opc, asm, GPR32z, GPR32z>;		def NAME : sve_int_pred_pattern_b<opc, asm, GPR32z, GPR32z>;

def : InstAlias<asm # "\t$Rdn, $pattern",		def : InstAlias<asm # "\t$Rdn, $pattern",
(!cast<Instruction>(NAME) GPR32z:$Rdn, sve_pred_enum:$pattern, 1), 1>;		(!cast<Instruction>(NAME) GPR32z:$Rdn, sve_pred_enum:$pattern, 1), 1>;
def : InstAlias<asm # "\t$Rdn",		def : InstAlias<asm # "\t$Rdn",
(!cast<Instruction>(NAME) GPR32z:$Rdn, 0b11111, 1), 2>;		(!cast<Instruction>(NAME) GPR32z:$Rdn, 0b11111, 1), 2>;

		def : Pat<(i32 (op GPR32:$Rn, (sve_pred_enum:$pattern), (sve_incdec_imm:$imm4))),
		(!cast<Instruction>(NAME) $Rn, sve_pred_enum:$pattern, sve_incdec_imm:$imm4)>;
}		}

multiclass sve_int_pred_pattern_b_x64<bits<5> opc, string asm> {		multiclass sve_int_pred_pattern_b_x64<bits<5> opc, string asm,
		SDPatternOperator op> {
def NAME : sve_int_pred_pattern_b<opc, asm, GPR64z, GPR64z>;		def NAME : sve_int_pred_pattern_b<opc, asm, GPR64z, GPR64z>;

def : InstAlias<asm # "\t$Rdn, $pattern",		def : InstAlias<asm # "\t$Rdn, $pattern",
(!cast<Instruction>(NAME) GPR64z:$Rdn, sve_pred_enum:$pattern, 1), 1>;		(!cast<Instruction>(NAME) GPR64z:$Rdn, sve_pred_enum:$pattern, 1), 1>;
def : InstAlias<asm # "\t$Rdn",		def : InstAlias<asm # "\t$Rdn",
(!cast<Instruction>(NAME) GPR64z:$Rdn, 0b11111, 1), 2>;		(!cast<Instruction>(NAME) GPR64z:$Rdn, 0b11111, 1), 2>;

		def : Pat<(i64 (op GPR64:$Rn, (sve_pred_enum:$pattern), (sve_incdec_imm:$imm4))),
		(!cast<Instruction>(NAME) $Rn, sve_pred_enum:$pattern, sve_incdec_imm:$imm4)>;
}		}


//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// SVE Permute - Cross Lane Group		// SVE Permute - Cross Lane Group
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

class sve_int_perm_dup_r<bits<2> sz8_64, string asm, ZPRRegOp zprty,		class sve_int_perm_dup_r<bits<2> sz8_64, string asm, ZPRRegOp zprty,
▲ Show 20 Lines • Show All 5,947 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/sve-intrinsics-sqdec.ll

This file was added.

				; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve -asm-verbose=0 < %s \| FileCheck %s

				; Since SQDEC{B\|H\|W\|D\|P} and SQINC{B\|H\|W\|D\|P} have identical semantics, the tests for
				; * @llvm.aarch64.sve.sqinc{b\|h\|w\|d\|p}, and
				; * @llvm.aarch64.sve.sqdec{b\|h\|w\|d\|p}
				; should also be identical (with the instruction name being adjusted). When
				; updating this file remember to make similar changes in the file testing the
				; other intrinsic.

				;
				; SQDECH (vector)
				;

				define <vscale x 8 x i16> @sqdech(<vscale x 8 x i16> %a) {
				; CHECK-LABEL: sqdech:
				; CHECK: sqdech z0.h, pow2
				; CHECK-NEXT: ret
				%out = call <vscale x 8 x i16> @llvm.aarch64.sve.sqdech.nxv8i16(<vscale x 8 x i16> %a,
				i32 0, i32 1)
				ret <vscale x 8 x i16> %out
				}

				;
				; SQDECW (vector)
				;

				define <vscale x 4 x i32> @sqdecw(<vscale x 4 x i32> %a) {
				; CHECK-LABEL: sqdecw:
				; CHECK: sqdecw z0.s, vl1, mul #2
				; CHECK-NEXT: ret
				%out = call <vscale x 4 x i32> @llvm.aarch64.sve.sqdecw.nxv4i32(<vscale x 4 x i32> %a,
				i32 1, i32 2)
				ret <vscale x 4 x i32> %out
				}

				;
				; SQDECD (vector)
				;

				define <vscale x 2 x i64> @sqdecd(<vscale x 2 x i64> %a) {
				; CHECK-LABEL: sqdecd:
				; CHECK: sqdecd z0.d, vl2, mul #3
				; CHECK-NEXT: ret
				%out = call <vscale x 2 x i64> @llvm.aarch64.sve.sqdecd.nxv2i64(<vscale x 2 x i64> %a,
				i32 2, i32 3)
				ret <vscale x 2 x i64> %out
				}

				;
				; SQDECP (vector)
				;

				define <vscale x 8 x i16> @sqdecp_b16(<vscale x 8 x i16> %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: sqdecp_b16:
				; CHECK: sqdecp z0.h, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 8 x i16> @llvm.aarch64.sve.sqdecp.nxv8i16(<vscale x 8 x i16> %a,
				<vscale x 8 x i1> %b)
				ret <vscale x 8 x i16> %out
				}

				define <vscale x 4 x i32> @sqdecp_b32(<vscale x 4 x i32> %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: sqdecp_b32:
				; CHECK: sqdecp z0.s, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 4 x i32> @llvm.aarch64.sve.sqdecp.nxv4i32(<vscale x 4 x i32> %a,
				<vscale x 4 x i1> %b)
				ret <vscale x 4 x i32> %out
				}

				define <vscale x 2 x i64> @sqdecp_b64(<vscale x 2 x i64> %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: sqdecp_b64:
				; CHECK: sqdecp z0.d, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 2 x i64> @llvm.aarch64.sve.sqdecp.nxv2i64(<vscale x 2 x i64> %a,
				<vscale x 2 x i1> %b)
				ret <vscale x 2 x i64> %out
				}

				;
				; SQDECB (scalar)
				;

				define i32 @sqdecb_n32_i32(i32 %a) {
				; CHECK-LABEL: sqdecb_n32_i32:
				; CHECK: sqdecb x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecb.n32(i32 %a, i32 3, i32 4)
				ret i32 %out
				}

				define i64 @sqdecb_n32_i64(i32 %a) {
				; CHECK-LABEL: sqdecb_n32_i64:
				; CHECK: sqdecb x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecb.n32(i32 %a, i32 3, i32 4)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqdecb_n64(i64 %a) {
				; CHECK-LABEL: sqdecb_n64:
				; CHECK: sqdecb x0, vl4, mul #5
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqdecb.n64(i64 %a, i32 4, i32 5)
				ret i64 %out
				}

				;
				; SQDECH (scalar)
				;

				define i32 @sqdech_n32_i32(i32 %a) {
				; CHECK-LABEL: sqdech_n32_i32:
				; CHECK: sqdech x0, w0, vl5, mul #6
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdech.n32(i32 %a, i32 5, i32 6)
				ret i32 %out
				}

				define i64 @sqdech_n32_i64(i32 %a) {
				; CHECK-LABEL: sqdech_n32_i64:
				; CHECK: sqdech x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdech.n32(i32 %a, i32 3, i32 4)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqdech_n64(i64 %a) {
				; CHECK-LABEL: sqdech_n64:
				; CHECK: sqdech x0, vl6, mul #7
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqdech.n64(i64 %a, i32 6, i32 7)
				ret i64 %out
				}

				;
				; SQDECW (scalar)
				;

				define i32 @sqdecw_n32_i32(i32 %a) {
				; CHECK-LABEL: sqdecw_n32_i32:
				; CHECK: sqdecw x0, w0, vl7, mul #8
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecw.n32(i32 %a, i32 7, i32 8)
				ret i32 %out
				}

				define i64 @sqdecw_n32_i64(i32 %a) {
				; CHECK-LABEL: sqdecw_n32_i64:
				; CHECK: sqdecw x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecw.n32(i32 %a, i32 3, i32 4)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqdecw_n64(i64 %a) {
				; CHECK-LABEL: sqdecw_n64:
				; CHECK: sqdecw x0, vl8, mul #9
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqdecw.n64(i64 %a, i32 8, i32 9)
				ret i64 %out
				}

				;
				; SQDECD (scalar)
				;

				define i32 @sqdecd_n32_i32(i32 %a) {
				; CHECK-LABEL: sqdecd_n32_i32:
				; CHECK: sqdecd x0, w0, vl16, mul #10
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecd.n32(i32 %a, i32 9, i32 10)
				ret i32 %out
				}

				define i64 @sqdecd_n32_i64(i32 %a) {
				; CHECK-LABEL: sqdecd_n32_i64:
				; CHECK: sqdecd x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecd.n32(i32 %a, i32 3, i32 4)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqdecd_n64(i64 %a) {
				; CHECK-LABEL: sqdecd_n64:
				; CHECK: sqdecd x0, vl32, mul #11
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqdecd.n64(i64 %a, i32 10, i32 11)
				ret i64 %out
				}

				;
				; SQDECP (scalar)
				;

				define i32 @sqdecp_n32_b8_i32(i32 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: sqdecp_n32_b8_i32:
				; CHECK: sqdecp x0, p0.b, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecp.n32.nxv16i1(i32 %a, <vscale x 16 x i1> %b)
				ret i32 %out
				}

				define i64 @sqdecp_n32_b8_i64(i32 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: sqdecp_n32_b8_i64:
				; CHECK: sqdecp x0, p0.b, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecp.n32.nxv16i1(i32 %a, <vscale x 16 x i1> %b)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i32 @sqdecp_n32_b16_i32(i32 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: sqdecp_n32_b16_i32:
				; CHECK: sqdecp x0, p0.h, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecp.n32.nxv8i1(i32 %a, <vscale x 8 x i1> %b)
				ret i32 %out
				}

				define i64 @sqdecp_n32_b16_i64(i32 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: sqdecp_n32_b16_i64:
				; CHECK: sqdecp x0, p0.h, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecp.n32.nxv8i1(i32 %a, <vscale x 8 x i1> %b)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i32 @sqdecp_n32_b32_i32(i32 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: sqdecp_n32_b32_i32:
				; CHECK: sqdecp x0, p0.s, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecp.n32.nxv4i1(i32 %a, <vscale x 4 x i1> %b)
				ret i32 %out
				}

				define i64 @sqdecp_n32_b32_i64(i32 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: sqdecp_n32_b32_i64:
				; CHECK: sqdecp x0, p0.s, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecp.n32.nxv4i1(i32 %a, <vscale x 4 x i1> %b)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i32 @sqdecp_n32_b64_i32(i32 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: sqdecp_n32_b64_i32:
				; CHECK: sqdecp x0, p0.d, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecp.n32.nxv2i1(i32 %a, <vscale x 2 x i1> %b)
				ret i32 %out
				}

				define i64 @sqdecp_n32_b64_i64(i32 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: sqdecp_n32_b64_i64:
				; CHECK: sqdecp x0, p0.d, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqdecp.n32.nxv2i1(i32 %a, <vscale x 2 x i1> %b)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqdecp_n64_b8(i64 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: sqdecp_n64_b8:
				; CHECK: sqdecp x0, p0.b
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqdecp.n64.nxv16i1(i64 %a, <vscale x 16 x i1> %b)
				ret i64 %out
				}

				define i64 @sqdecp_n64_b16(i64 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: sqdecp_n64_b16:
				; CHECK: sqdecp x0, p0.h
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqdecp.n64.nxv8i1(i64 %a, <vscale x 8 x i1> %b)
				ret i64 %out
				}

				define i64 @sqdecp_n64_b32(i64 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: sqdecp_n64_b32:
				; CHECK: sqdecp x0, p0.s
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqdecp.n64.nxv4i1(i64 %a, <vscale x 4 x i1> %b)
				ret i64 %out
				}

				define i64 @sqdecp_n64_b64(i64 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: sqdecp_n64_b64:
				; CHECK: sqdecp x0, p0.d
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqdecp.n64.nxv2i1(i64 %a, <vscale x 2 x i1> %b)
				ret i64 %out
				}

				; sqdec{h\|w\|d}(vector, pattern, multiplier)
				declare <vscale x 8 x i16> @llvm.aarch64.sve.sqdech.nxv8i16(<vscale x 8 x i16>, i32, i32)
				declare <vscale x 4 x i32> @llvm.aarch64.sve.sqdecw.nxv4i32(<vscale x 4 x i32>, i32, i32)
				declare <vscale x 2 x i64> @llvm.aarch64.sve.sqdecd.nxv2i64(<vscale x 2 x i64>, i32, i32)

				; sqdec{b\|h\|w\|d}(scalar, pattern, multiplier)
				declare i32 @llvm.aarch64.sve.sqdecb.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.sqdecb.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.sqdech.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.sqdech.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.sqdecw.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.sqdecw.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.sqdecd.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.sqdecd.n64(i64, i32, i32)

				; sqdecp(scalar, predicate)
				declare i32 @llvm.aarch64.sve.sqdecp.n32.nxv16i1(i32, <vscale x 16 x i1>)
				declare i32 @llvm.aarch64.sve.sqdecp.n32.nxv8i1(i32, <vscale x 8 x i1>)
				declare i32 @llvm.aarch64.sve.sqdecp.n32.nxv4i1(i32, <vscale x 4 x i1>)
				declare i32 @llvm.aarch64.sve.sqdecp.n32.nxv2i1(i32, <vscale x 2 x i1>)

				declare i64 @llvm.aarch64.sve.sqdecp.n64.nxv16i1(i64, <vscale x 16 x i1>)
				declare i64 @llvm.aarch64.sve.sqdecp.n64.nxv8i1(i64, <vscale x 8 x i1>)
				declare i64 @llvm.aarch64.sve.sqdecp.n64.nxv4i1(i64, <vscale x 4 x i1>)
				declare i64 @llvm.aarch64.sve.sqdecp.n64.nxv2i1(i64, <vscale x 2 x i1>)

				; sqdecp(vector, predicate)
				declare <vscale x 8 x i16> @llvm.aarch64.sve.sqdecp.nxv8i16(<vscale x 8 x i16>, <vscale x 8 x i1>)
				declare <vscale x 4 x i32> @llvm.aarch64.sve.sqdecp.nxv4i32(<vscale x 4 x i32>, <vscale x 4 x i1>)
				declare <vscale x 2 x i64> @llvm.aarch64.sve.sqdecp.nxv2i64(<vscale x 2 x i64>, <vscale x 2 x i1>)

llvm/test/CodeGen/AArch64/sve-intrinsics-sqinc.ll

This file was added.

				; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve -asm-verbose=0 < %s \| FileCheck %s

				; Since SQDEC{B\|H\|W\|D\|P} and SQINC{B\|H\|W\|D\|P} have identical semantics, the tests for
				; * @llvm.aarch64.sve.sqinc{b\|h\|w\|d\|p}, and
				; * @llvm.aarch64.sve.sqdec{b\|h\|w\|d\|p}
				; should also be identical (with the instruction name being adjusted). When
				; updating this file remember to make similar changes in the file testing the
				; other intrinsic.

				;
				; SQINCH (vector)
				;

				define <vscale x 8 x i16> @sqinch(<vscale x 8 x i16> %a) {
				; CHECK-LABEL: sqinch:
				; CHECK: sqinch z0.h, pow2
				; CHECK-NEXT: ret
				%out = call <vscale x 8 x i16> @llvm.aarch64.sve.sqinch.nxv8i16(<vscale x 8 x i16> %a,
				i32 0, i32 1)
				ret <vscale x 8 x i16> %out
				}

				;
				; SQINCW (vector)
				;

				define <vscale x 4 x i32> @sqincw(<vscale x 4 x i32> %a) {
				; CHECK-LABEL: sqincw:
				; CHECK: sqincw z0.s, vl1, mul #2
				; CHECK-NEXT: ret
				%out = call <vscale x 4 x i32> @llvm.aarch64.sve.sqincw.nxv4i32(<vscale x 4 x i32> %a,
				i32 1, i32 2)
				ret <vscale x 4 x i32> %out
				}

				;
				; SQINCD (vector)
				;

				define <vscale x 2 x i64> @sqincd(<vscale x 2 x i64> %a) {
				; CHECK-LABEL: sqincd:
				; CHECK: sqincd z0.d, vl2, mul #3
				; CHECK-NEXT: ret
				%out = call <vscale x 2 x i64> @llvm.aarch64.sve.sqincd.nxv2i64(<vscale x 2 x i64> %a,
				i32 2, i32 3)
				ret <vscale x 2 x i64> %out
				}

				;
				; SQINCP (vector)
				;

				define <vscale x 8 x i16> @sqincp_b16(<vscale x 8 x i16> %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: sqincp_b16:
				; CHECK: sqincp z0.h, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 8 x i16> @llvm.aarch64.sve.sqincp.nxv8i16(<vscale x 8 x i16> %a,
				<vscale x 8 x i1> %b)
				ret <vscale x 8 x i16> %out
				}

				define <vscale x 4 x i32> @sqincp_b32(<vscale x 4 x i32> %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: sqincp_b32:
				; CHECK: sqincp z0.s, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 4 x i32> @llvm.aarch64.sve.sqincp.nxv4i32(<vscale x 4 x i32> %a,
				<vscale x 4 x i1> %b)
				ret <vscale x 4 x i32> %out
				}

				define <vscale x 2 x i64> @sqincp_b64(<vscale x 2 x i64> %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: sqincp_b64:
				; CHECK: sqincp z0.d, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 2 x i64> @llvm.aarch64.sve.sqincp.nxv2i64(<vscale x 2 x i64> %a,
				<vscale x 2 x i1> %b)
				ret <vscale x 2 x i64> %out
				}

				;
				; SQINCB (scalar)
				;

				define i32 @sqincb_n32_i32(i32 %a) {
				; CHECK-LABEL: sqincb_n32_i32:
				; CHECK: sqincb x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincb.n32(i32 %a, i32 3, i32 4)
				ret i32 %out
				}

				define i64 @sqincb_n32_i64(i32 %a) {
				; CHECK-LABEL: sqincb_n32_i64:
				; CHECK: sqincb x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincb.n32(i32 %a, i32 3, i32 4)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqincb_n64(i64 %a) {
				; CHECK-LABEL: sqincb_n64:
				; CHECK: sqincb x0, vl4, mul #5
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqincb.n64(i64 %a, i32 4, i32 5)
				ret i64 %out
				}

				;
				; SQINCH (scalar)
				;

				define i32 @sqinch_n32_i32(i32 %a) {
				; CHECK-LABEL: sqinch_n32_i32:
				; CHECK: sqinch x0, w0, vl5, mul #6
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqinch.n32(i32 %a, i32 5, i32 6)
				ret i32 %out
				}

				define i64 @sqinch_n32_i64(i32 %a) {
				; CHECK-LABEL: sqinch_n32_i64:
				; CHECK: sqinch x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqinch.n32(i32 %a, i32 3, i32 4)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqinch_n64(i64 %a) {
				; CHECK-LABEL: sqinch_n64:
				; CHECK: sqinch x0, vl6, mul #7
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqinch.n64(i64 %a, i32 6, i32 7)
				ret i64 %out
				}

				;
				; SQINCW (scalar)
				;

				define i32 @sqincw_n32_i32(i32 %a) {
				; CHECK-LABEL: sqincw_n32_i32:
				; CHECK: sqincw x0, w0, vl7, mul #8
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincw.n32(i32 %a, i32 7, i32 8)
				ret i32 %out
				}

				define i64 @sqincw_n32_i64(i32 %a) {
				; CHECK-LABEL: sqincw_n32_i64:
				; CHECK: sqincw x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincw.n32(i32 %a, i32 3, i32 4)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqincw_n64(i64 %a) {
				; CHECK-LABEL: sqincw_n64:
				; CHECK: sqincw x0, vl8, mul #9
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqincw.n64(i64 %a, i32 8, i32 9)
				ret i64 %out
				}

				;
				; SQINCD (scalar)
				;

				define i32 @sqincd_n32_i32(i32 %a) {
				; CHECK-LABEL: sqincd_n32_i32:
				; CHECK: sqincd x0, w0, vl16, mul #10
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincd.n32(i32 %a, i32 9, i32 10)
				ret i32 %out
				}

				define i64 @sqincd_n32_i64(i32 %a) {
				; CHECK-LABEL: sqincd_n32_i64:
				; CHECK: sqincd x0, w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincd.n32(i32 %a, i32 3, i32 4)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqincd_n64(i64 %a) {
				; CHECK-LABEL: sqincd_n64:
				; CHECK: sqincd x0, vl32, mul #11
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqincd.n64(i64 %a, i32 10, i32 11)
				ret i64 %out
				}

				;
				; SQINCP (scalar)
				;

				define i32 @sqincp_n32_b8_i32(i32 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: sqincp_n32_b8_i32:
				; CHECK: sqincp x0, p0.b, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincp.n32.nxv16i1(i32 %a, <vscale x 16 x i1> %b)
				ret i32 %out
				}

				define i64 @sqincp_n32_b8_i64(i32 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: sqincp_n32_b8_i64:
				; CHECK: sqincp x0, p0.b, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincp.n32.nxv16i1(i32 %a, <vscale x 16 x i1> %b)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i32 @sqincp_n32_b16_i32(i32 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: sqincp_n32_b16_i32:
				; CHECK: sqincp x0, p0.h, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincp.n32.nxv8i1(i32 %a, <vscale x 8 x i1> %b)
				ret i32 %out
				}

				define i64 @sqincp_n32_b16_i64(i32 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: sqincp_n32_b16_i64:
				; CHECK: sqincp x0, p0.h, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincp.n32.nxv8i1(i32 %a, <vscale x 8 x i1> %b)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i32 @sqincp_n32_b32_i32(i32 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: sqincp_n32_b32_i32:
				; CHECK: sqincp x0, p0.s, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincp.n32.nxv4i1(i32 %a, <vscale x 4 x i1> %b)
				ret i32 %out
				}

				define i64 @sqincp_n32_b32_i64(i32 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: sqincp_n32_b32_i64:
				; CHECK: sqincp x0, p0.s, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincp.n32.nxv4i1(i32 %a, <vscale x 4 x i1> %b)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i32 @sqincp_n32_b64_i32(i32 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: sqincp_n32_b64_i32:
				; CHECK: sqincp x0, p0.d, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincp.n32.nxv2i1(i32 %a, <vscale x 2 x i1> %b)
				ret i32 %out
				}

				define i64 @sqincp_n32_b64_i64(i32 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: sqincp_n32_b64_i64:
				; CHECK: sqincp x0, p0.d, w0
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.sqincp.n32.nxv2i1(i32 %a, <vscale x 2 x i1> %b)
				%out_sext = sext i32 %out to i64

				ret i64 %out_sext
				}

				define i64 @sqincp_n64_b8(i64 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: sqincp_n64_b8:
				; CHECK: sqincp x0, p0.b
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqincp.n64.nxv16i1(i64 %a, <vscale x 16 x i1> %b)
				ret i64 %out
				}

				define i64 @sqincp_n64_b16(i64 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: sqincp_n64_b16:
				; CHECK: sqincp x0, p0.h
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqincp.n64.nxv8i1(i64 %a, <vscale x 8 x i1> %b)
				ret i64 %out
				}

				define i64 @sqincp_n64_b32(i64 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: sqincp_n64_b32:
				; CHECK: sqincp x0, p0.s
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqincp.n64.nxv4i1(i64 %a, <vscale x 4 x i1> %b)
				ret i64 %out
				}

				define i64 @sqincp_n64_b64(i64 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: sqincp_n64_b64:
				; CHECK: sqincp x0, p0.d
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.sqincp.n64.nxv2i1(i64 %a, <vscale x 2 x i1> %b)
				ret i64 %out
				}

				; sqinc{h\|w\|d}(vector, pattern, multiplier)
				declare <vscale x 8 x i16> @llvm.aarch64.sve.sqinch.nxv8i16(<vscale x 8 x i16>, i32, i32)
				declare <vscale x 4 x i32> @llvm.aarch64.sve.sqincw.nxv4i32(<vscale x 4 x i32>, i32, i32)
				declare <vscale x 2 x i64> @llvm.aarch64.sve.sqincd.nxv2i64(<vscale x 2 x i64>, i32, i32)

				; sqinc{b\|h\|w\|d}(scalar, pattern, multiplier)
				declare i32 @llvm.aarch64.sve.sqincb.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.sqincb.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.sqinch.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.sqinch.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.sqincw.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.sqincw.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.sqincd.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.sqincd.n64(i64, i32, i32)

				; sqincp(scalar, predicate)
				declare i32 @llvm.aarch64.sve.sqincp.n32.nxv16i1(i32, <vscale x 16 x i1>)
				declare i32 @llvm.aarch64.sve.sqincp.n32.nxv8i1(i32, <vscale x 8 x i1>)
				declare i32 @llvm.aarch64.sve.sqincp.n32.nxv4i1(i32, <vscale x 4 x i1>)
				declare i32 @llvm.aarch64.sve.sqincp.n32.nxv2i1(i32, <vscale x 2 x i1>)

				declare i64 @llvm.aarch64.sve.sqincp.n64.nxv16i1(i64, <vscale x 16 x i1>)
				declare i64 @llvm.aarch64.sve.sqincp.n64.nxv8i1(i64, <vscale x 8 x i1>)
				declare i64 @llvm.aarch64.sve.sqincp.n64.nxv4i1(i64, <vscale x 4 x i1>)
				declare i64 @llvm.aarch64.sve.sqincp.n64.nxv2i1(i64, <vscale x 2 x i1>)

				; sqincp(vector, predicate)
				declare <vscale x 8 x i16> @llvm.aarch64.sve.sqincp.nxv8i16(<vscale x 8 x i16>, <vscale x 8 x i1>)
				declare <vscale x 4 x i32> @llvm.aarch64.sve.sqincp.nxv4i32(<vscale x 4 x i32>, <vscale x 4 x i1>)
				declare <vscale x 2 x i64> @llvm.aarch64.sve.sqincp.nxv2i64(<vscale x 2 x i64>, <vscale x 2 x i1>)

llvm/test/CodeGen/AArch64/sve-intrinsics-uqdec.ll

This file was added.

				; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve -asm-verbose=0 < %s \| FileCheck %s

				; Since UQDEC{B\|H\|W\|D\|P} and UQINC{B\|H\|W\|D\|P} have identical semantics, the tests for
				; * @llvm.aarch64.sve.uqinc{b\|h\|w\|d\|p}, and
				; * @llvm.aarch64.sve.uqdec{b\|h\|w\|d\|p}
				; should also be identical (with the instruction name being adjusted). When
				; updating this file remember to make similar changes in the file testing the
				; other intrinsic.

				;
				; UQDECH (vector)
				;

				define <vscale x 8 x i16> @uqdech(<vscale x 8 x i16> %a) {
				; CHECK-LABEL: uqdech:
				; CHECK: uqdech z0.h, pow2
				; CHECK-NEXT: ret
				%out = call <vscale x 8 x i16> @llvm.aarch64.sve.uqdech.nxv8i16(<vscale x 8 x i16> %a,
				i32 0, i32 1)
				ret <vscale x 8 x i16> %out
				}

				;
				; UQDECW (vector)
				;

				define <vscale x 4 x i32> @uqdecw(<vscale x 4 x i32> %a) {
				; CHECK-LABEL: uqdecw:
				; CHECK: uqdecw z0.s, vl1, mul #2
				; CHECK-NEXT: ret
				%out = call <vscale x 4 x i32> @llvm.aarch64.sve.uqdecw.nxv4i32(<vscale x 4 x i32> %a,
				i32 1, i32 2)
				ret <vscale x 4 x i32> %out
				}

				;
				; UQDECD (vector)
				;

				define <vscale x 2 x i64> @uqdecd(<vscale x 2 x i64> %a) {
				; CHECK-LABEL: uqdecd:
				; CHECK: uqdecd z0.d, vl2, mul #3
				; CHECK-NEXT: ret
				%out = call <vscale x 2 x i64> @llvm.aarch64.sve.uqdecd.nxv2i64(<vscale x 2 x i64> %a,
				i32 2, i32 3)
				ret <vscale x 2 x i64> %out
				}

				;
				; UQDECP (vector)
				;

				define <vscale x 8 x i16> @uqdecp_b16(<vscale x 8 x i16> %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: uqdecp_b16:
				; CHECK: uqdecp z0.h, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 8 x i16> @llvm.aarch64.sve.uqdecp.nxv8i16(<vscale x 8 x i16> %a,
				<vscale x 8 x i1> %b)
				ret <vscale x 8 x i16> %out
				}

				define <vscale x 4 x i32> @uqdecp_b32(<vscale x 4 x i32> %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: uqdecp_b32:
				; CHECK: uqdecp z0.s, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 4 x i32> @llvm.aarch64.sve.uqdecp.nxv4i32(<vscale x 4 x i32> %a,
				<vscale x 4 x i1> %b)
				ret <vscale x 4 x i32> %out
				}

				define <vscale x 2 x i64> @uqdecp_b64(<vscale x 2 x i64> %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: uqdecp_b64:
				; CHECK: uqdecp z0.d, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 2 x i64> @llvm.aarch64.sve.uqdecp.nxv2i64(<vscale x 2 x i64> %a,
				<vscale x 2 x i1> %b)
				ret <vscale x 2 x i64> %out
				}

				;
				; UQDECB (scalar)
				;

				define i32 @uqdecb_n32(i32 %a) {
				; CHECK-LABEL: uqdecb_n32:
				; CHECK: uqdecb w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqdecb.n32(i32 %a, i32 3, i32 4)
				ret i32 %out
				}

				define i64 @uqdecb_n64(i64 %a) {
				; CHECK-LABEL: uqdecb_n64:
				; CHECK: uqdecb x0, vl4, mul #5
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqdecb.n64(i64 %a, i32 4, i32 5)
				ret i64 %out
				}

				;
				; UQDECH (scalar)
				;

				define i32 @uqdech_n32(i32 %a) {
				; CHECK-LABEL: uqdech_n32:
				; CHECK: uqdech w0, vl5, mul #6
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqdech.n32(i32 %a, i32 5, i32 6)
				ret i32 %out
				}

				define i64 @uqdech_n64(i64 %a) {
				; CHECK-LABEL: uqdech_n64:
				; CHECK: uqdech x0, vl6, mul #7
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqdech.n64(i64 %a, i32 6, i32 7)
				ret i64 %out
				}

				;
				; UQDECW (scalar)
				;

				define i32 @uqdecw_n32(i32 %a) {
				; CHECK-LABEL: uqdecw_n32:
				; CHECK: uqdecw w0, vl7, mul #8
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqdecw.n32(i32 %a, i32 7, i32 8)
				ret i32 %out
				}

				define i64 @uqdecw_n64(i64 %a) {
				; CHECK-LABEL: uqdecw_n64:
				; CHECK: uqdecw x0, vl8, mul #9
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqdecw.n64(i64 %a, i32 8, i32 9)
				ret i64 %out
				}

				;
				; UQDECD (scalar)
				;

				define i32 @uqdecd_n32(i32 %a) {
				; CHECK-LABEL: uqdecd_n32:
				; CHECK: uqdecd w0, vl16, mul #10
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqdecd.n32(i32 %a, i32 9, i32 10)
				ret i32 %out
				}

				define i64 @uqdecd_n64(i64 %a) {
				; CHECK-LABEL: uqdecd_n64:
				; CHECK: uqdecd x0, vl32, mul #11
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqdecd.n64(i64 %a, i32 10, i32 11)
				ret i64 %out
				}

				;
				; UQDECP (scalar)
				;

				define i32 @uqdecp_n32_b8(i32 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: uqdecp_n32_b8:
				; CHECK: uqdecp w0, p0.b
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqdecp.n32.nxv16i1(i32 %a, <vscale x 16 x i1> %b)
				ret i32 %out
				}

				define i32 @uqdecp_n32_b16(i32 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: uqdecp_n32_b16:
				; CHECK: uqdecp w0, p0.h
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqdecp.n32.nxv8i1(i32 %a, <vscale x 8 x i1> %b)
				ret i32 %out
				}

				define i32 @uqdecp_n32_b32(i32 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: uqdecp_n32_b32:
				; CHECK: uqdecp w0, p0.s
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqdecp.n32.nxv4i1(i32 %a, <vscale x 4 x i1> %b)
				ret i32 %out
				}

				define i32 @uqdecp_n32_b64(i32 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: uqdecp_n32_b64:
				; CHECK: uqdecp w0, p0.d
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqdecp.n32.nxv2i1(i32 %a, <vscale x 2 x i1> %b)
				ret i32 %out
				}

				define i64 @uqdecp_n64_b8(i64 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: uqdecp_n64_b8:
				; CHECK: uqdecp x0, p0.b
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqdecp.n64.nxv16i1(i64 %a, <vscale x 16 x i1> %b)
				ret i64 %out
				}

				define i64 @uqdecp_n64_b16(i64 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: uqdecp_n64_b16:
				; CHECK: uqdecp x0, p0.h
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqdecp.n64.nxv8i1(i64 %a, <vscale x 8 x i1> %b)
				ret i64 %out
				}

				define i64 @uqdecp_n64_b32(i64 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: uqdecp_n64_b32:
				; CHECK: uqdecp x0, p0.s
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqdecp.n64.nxv4i1(i64 %a, <vscale x 4 x i1> %b)
				ret i64 %out
				}

				define i64 @uqdecp_n64_b64(i64 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: uqdecp_n64_b64:
				; CHECK: uqdecp x0, p0.d
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqdecp.n64.nxv2i1(i64 %a, <vscale x 2 x i1> %b)
				ret i64 %out
				}

				; uqdec{h\|w\|d}(vector, pattern, multiplier)
				declare <vscale x 8 x i16> @llvm.aarch64.sve.uqdech.nxv8i16(<vscale x 8 x i16>, i32, i32)
				declare <vscale x 4 x i32> @llvm.aarch64.sve.uqdecw.nxv4i32(<vscale x 4 x i32>, i32, i32)
				declare <vscale x 2 x i64> @llvm.aarch64.sve.uqdecd.nxv2i64(<vscale x 2 x i64>, i32, i32)

				; uqdec{b\|h\|w\|d}(scalar, pattern, multiplier)
				declare i32 @llvm.aarch64.sve.uqdecb.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.uqdecb.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.uqdech.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.uqdech.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.uqdecw.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.uqdecw.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.uqdecd.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.uqdecd.n64(i64, i32, i32)

				; uqdecp(scalar, predicate)
				declare i32 @llvm.aarch64.sve.uqdecp.n32.nxv16i1(i32, <vscale x 16 x i1>)
				declare i32 @llvm.aarch64.sve.uqdecp.n32.nxv8i1(i32, <vscale x 8 x i1>)
				declare i32 @llvm.aarch64.sve.uqdecp.n32.nxv4i1(i32, <vscale x 4 x i1>)
				declare i32 @llvm.aarch64.sve.uqdecp.n32.nxv2i1(i32, <vscale x 2 x i1>)

				declare i64 @llvm.aarch64.sve.uqdecp.n64.nxv16i1(i64, <vscale x 16 x i1>)
				declare i64 @llvm.aarch64.sve.uqdecp.n64.nxv8i1(i64, <vscale x 8 x i1>)
				declare i64 @llvm.aarch64.sve.uqdecp.n64.nxv4i1(i64, <vscale x 4 x i1>)
				declare i64 @llvm.aarch64.sve.uqdecp.n64.nxv2i1(i64, <vscale x 2 x i1>)

				; uqdecp(vector, predicate)
				declare <vscale x 8 x i16> @llvm.aarch64.sve.uqdecp.nxv8i16(<vscale x 8 x i16>, <vscale x 8 x i1>)
				declare <vscale x 4 x i32> @llvm.aarch64.sve.uqdecp.nxv4i32(<vscale x 4 x i32>, <vscale x 4 x i1>)
				declare <vscale x 2 x i64> @llvm.aarch64.sve.uqdecp.nxv2i64(<vscale x 2 x i64>, <vscale x 2 x i1>)

llvm/test/CodeGen/AArch64/sve-intrinsics-uqinc.ll

This file was added.

				; RUN: llc -mtriple=aarch64-linux-gnu -mattr=+sve -asm-verbose=0 < %s \| FileCheck %s

				; Since UQDEC{B\|H\|W\|D\|P} and UQINC{B\|H\|W\|D\|P} have identical semantics, the tests for
				; * @llvm.aarch64.sve.uqinc{b\|h\|w\|d\|p}, and
				; * @llvm.aarch64.sve.uqdec{b\|h\|w\|d\|p}
				; should also be identical (with the instruction name being adjusted). When
				; updating this file remember to make similar changes in the file testing the
				; other intrinsic.

				;
				; UQINCH (vector)
				;

				define <vscale x 8 x i16> @uqinch(<vscale x 8 x i16> %a) {
				; CHECK-LABEL: uqinch:
				; CHECK: uqinch z0.h, pow2
				; CHECK-NEXT: ret
				%out = call <vscale x 8 x i16> @llvm.aarch64.sve.uqinch.nxv8i16(<vscale x 8 x i16> %a,
				i32 0, i32 1)
				ret <vscale x 8 x i16> %out
				}

				;
				; UQINCW (vector)
				;

				define <vscale x 4 x i32> @uqincw(<vscale x 4 x i32> %a) {
				; CHECK-LABEL: uqincw:
				; CHECK: uqincw z0.s, vl1, mul #2
				; CHECK-NEXT: ret
				%out = call <vscale x 4 x i32> @llvm.aarch64.sve.uqincw.nxv4i32(<vscale x 4 x i32> %a,
				i32 1, i32 2)
				ret <vscale x 4 x i32> %out
				}

				;
				; UQINCD (vector)
				;

				define <vscale x 2 x i64> @uqincd(<vscale x 2 x i64> %a) {
				; CHECK-LABEL: uqincd:
				; CHECK: uqincd z0.d, vl2, mul #3
				; CHECK-NEXT: ret
				%out = call <vscale x 2 x i64> @llvm.aarch64.sve.uqincd.nxv2i64(<vscale x 2 x i64> %a,
				i32 2, i32 3)
				ret <vscale x 2 x i64> %out
				}

				;
				; UQINCP (vector)
				;

				define <vscale x 8 x i16> @uqincp_b16(<vscale x 8 x i16> %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: uqincp_b16:
				; CHECK: uqincp z0.h, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 8 x i16> @llvm.aarch64.sve.uqincp.nxv8i16(<vscale x 8 x i16> %a,
				<vscale x 8 x i1> %b)
				ret <vscale x 8 x i16> %out
				}

				define <vscale x 4 x i32> @uqincp_b32(<vscale x 4 x i32> %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: uqincp_b32:
				; CHECK: uqincp z0.s, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 4 x i32> @llvm.aarch64.sve.uqincp.nxv4i32(<vscale x 4 x i32> %a,
				<vscale x 4 x i1> %b)
				ret <vscale x 4 x i32> %out
				}

				define <vscale x 2 x i64> @uqincp_b64(<vscale x 2 x i64> %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: uqincp_b64:
				; CHECK: uqincp z0.d, p0
				; CHECK-NEXT: ret
				%out = call <vscale x 2 x i64> @llvm.aarch64.sve.uqincp.nxv2i64(<vscale x 2 x i64> %a,
				<vscale x 2 x i1> %b)
				ret <vscale x 2 x i64> %out
				}

				;
				; UQINCB (scalar)
				;

				define i32 @uqincb_n32(i32 %a) {
				; CHECK-LABEL: uqincb_n32:
				; CHECK: uqincb w0, vl3, mul #4
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqincb.n32(i32 %a, i32 3, i32 4)
				ret i32 %out
				}

				define i64 @uqincb_n64(i64 %a) {
				; CHECK-LABEL: uqincb_n64:
				; CHECK: uqincb x0, vl4, mul #5
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqincb.n64(i64 %a, i32 4, i32 5)
				ret i64 %out
				}

				;
				; UQINCH (scalar)
				;

				define i32 @uqinch_n32(i32 %a) {
				; CHECK-LABEL: uqinch_n32:
				; CHECK: uqinch w0, vl5, mul #6
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqinch.n32(i32 %a, i32 5, i32 6)
				ret i32 %out
				}

				define i64 @uqinch_n64(i64 %a) {
				; CHECK-LABEL: uqinch_n64:
				; CHECK: uqinch x0, vl6, mul #7
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqinch.n64(i64 %a, i32 6, i32 7)
				ret i64 %out
				}

				;
				; UQINCW (scalar)
				;

				define i32 @uqincw_n32(i32 %a) {
				; CHECK-LABEL: uqincw_n32:
				; CHECK: uqincw w0, vl7, mul #8
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqincw.n32(i32 %a, i32 7, i32 8)
				ret i32 %out
				}

				define i64 @uqincw_n64(i64 %a) {
				; CHECK-LABEL: uqincw_n64:
				; CHECK: uqincw x0, vl8, mul #9
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqincw.n64(i64 %a, i32 8, i32 9)
				ret i64 %out
				}

				;
				; UQINCD (scalar)
				;

				define i32 @uqincd_n32(i32 %a) {
				; CHECK-LABEL: uqincd_n32:
				; CHECK: uqincd w0, vl16, mul #10
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqincd.n32(i32 %a, i32 9, i32 10)
				ret i32 %out
				}

				define i64 @uqincd_n64(i64 %a) {
				; CHECK-LABEL: uqincd_n64:
				; CHECK: uqincd x0, vl32, mul #11
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqincd.n64(i64 %a, i32 10, i32 11)
				ret i64 %out
				}

				;
				; UQINCP (scalar)
				;

				define i32 @uqincp_n32_b8(i32 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: uqincp_n32_b8:
				; CHECK: uqincp w0, p0.b
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqincp.n32.nxv16i1(i32 %a, <vscale x 16 x i1> %b)
				ret i32 %out
				}

				define i32 @uqincp_n32_b16(i32 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: uqincp_n32_b16:
				; CHECK: uqincp w0, p0.h
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqincp.n32.nxv8i1(i32 %a, <vscale x 8 x i1> %b)
				ret i32 %out
				}

				define i32 @uqincp_n32_b32(i32 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: uqincp_n32_b32:
				; CHECK: uqincp w0, p0.s
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqincp.n32.nxv4i1(i32 %a, <vscale x 4 x i1> %b)
				ret i32 %out
				}

				define i32 @uqincp_n32_b64(i32 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: uqincp_n32_b64:
				; CHECK: uqincp w0, p0.d
				; CHECK-NEXT: ret
				%out = call i32 @llvm.aarch64.sve.uqincp.n32.nxv2i1(i32 %a, <vscale x 2 x i1> %b)
				ret i32 %out
				}

				define i64 @uqincp_n64_b8(i64 %a, <vscale x 16 x i1> %b) {
				; CHECK-LABEL: uqincp_n64_b8:
				; CHECK: uqincp x0, p0.b
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqincp.n64.nxv16i1(i64 %a, <vscale x 16 x i1> %b)
				ret i64 %out
				}

				define i64 @uqincp_n64_b16(i64 %a, <vscale x 8 x i1> %b) {
				; CHECK-LABEL: uqincp_n64_b16:
				; CHECK: uqincp x0, p0.h
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqincp.n64.nxv8i1(i64 %a, <vscale x 8 x i1> %b)
				ret i64 %out
				}

				define i64 @uqincp_n64_b32(i64 %a, <vscale x 4 x i1> %b) {
				; CHECK-LABEL: uqincp_n64_b32:
				; CHECK: uqincp x0, p0.s
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqincp.n64.nxv4i1(i64 %a, <vscale x 4 x i1> %b)
				ret i64 %out
				}

				define i64 @uqincp_n64_b64(i64 %a, <vscale x 2 x i1> %b) {
				; CHECK-LABEL: uqincp_n64_b64:
				; CHECK: uqincp x0, p0.d
				; CHECK-NEXT: ret
				%out = call i64 @llvm.aarch64.sve.uqincp.n64.nxv2i1(i64 %a, <vscale x 2 x i1> %b)
				ret i64 %out
				}

				; uqinc{h\|w\|d}(vector, pattern, multiplier)
				declare <vscale x 8 x i16> @llvm.aarch64.sve.uqinch.nxv8i16(<vscale x 8 x i16>, i32, i32)
				declare <vscale x 4 x i32> @llvm.aarch64.sve.uqincw.nxv4i32(<vscale x 4 x i32>, i32, i32)
				declare <vscale x 2 x i64> @llvm.aarch64.sve.uqincd.nxv2i64(<vscale x 2 x i64>, i32, i32)

				; uqinc{b\|h\|w\|d}(scalar, pattern, multiplier)
				declare i32 @llvm.aarch64.sve.uqincb.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.uqincb.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.uqinch.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.uqinch.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.uqincw.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.uqincw.n64(i64, i32, i32)
				declare i32 @llvm.aarch64.sve.uqincd.n32(i32, i32, i32)
				declare i64 @llvm.aarch64.sve.uqincd.n64(i64, i32, i32)

				; uqincp(scalar, predicate)
				declare i32 @llvm.aarch64.sve.uqincp.n32.nxv16i1(i32, <vscale x 16 x i1>)
				declare i32 @llvm.aarch64.sve.uqincp.n32.nxv8i1(i32, <vscale x 8 x i1>)
				declare i32 @llvm.aarch64.sve.uqincp.n32.nxv4i1(i32, <vscale x 4 x i1>)
				declare i32 @llvm.aarch64.sve.uqincp.n32.nxv2i1(i32, <vscale x 2 x i1>)

				declare i64 @llvm.aarch64.sve.uqincp.n64.nxv16i1(i64, <vscale x 16 x i1>)
				declare i64 @llvm.aarch64.sve.uqincp.n64.nxv8i1(i64, <vscale x 8 x i1>)
				declare i64 @llvm.aarch64.sve.uqincp.n64.nxv4i1(i64, <vscale x 4 x i1>)
				declare i64 @llvm.aarch64.sve.uqincp.n64.nxv2i1(i64, <vscale x 2 x i1>)

				; uqincp(vector, predicate)
				declare <vscale x 8 x i16> @llvm.aarch64.sve.uqincp.nxv8i16(<vscale x 8 x i16>, <vscale x 8 x i1>)
				declare <vscale x 4 x i32> @llvm.aarch64.sve.uqincp.nxv4i32(<vscale x 4 x i32>, <vscale x 4 x i1>)
				declare <vscale x 2 x i64> @llvm.aarch64.sve.uqincp.nxv2i64(<vscale x 2 x i64>, <vscale x 2 x i1>)

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64][SVE] Add intrnisics for saturating scalar arithmeticClosedPublic

Details

Diff Detail

Event Timeline