This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/
-
clang/
-
Basic/
-
arm_mve.td
-
arm_mve_defs.td
-
Sema/
-
Sema.h
-
lib/Sema/
-
Sema/
-
SemaChecking.cpp
-
test/
-
CodeGen/arm-mve-intrinsics/
-
arm-mve-intrinsics/
-
bitwise-imm.c
-
Sema/
-
arm-mve-immediates.c
-
utils/TableGen/
-
TableGen/
-
MveEmitter.cpp
-
llvm/
-
lib/Target/ARM/
-
Target/
-
ARM/
2/2
ARMISelLowering.cpp
-
ARMInstrInfo.td
-
ARMInstrMVE.td
-
ARMInstrNEON.td
-
test/CodeGen/Thumb2/mve-intrinsics/
-
CodeGen/
-
Thumb2/
-
mve-intrinsics/
-
bitwise-imm.ll

Differential D72934

[ARM,MVE] Support immediate vbicq,vorrq,vmvnq intrinsics.
ClosedPublic

Authored by simon_tatham on Jan 17 2020, 9:17 AM.

Download Raw Diff

Details

Reviewers

dmgreen
MarkMurrayARM
miyuki
ostannard

Commits

rG4321c6af28e9: [ARM,MVE] Support immediate vbicq,vorrq,vmvnq intrinsics.

Summary

Immediate vmvnq is code-generated as a simple vector constant in IR,
and left to the backend to recognize that it can be created with an
MVE VMVN instruction. The predicated version is represented as a
select between the input and the same constant, and I've added a
Tablegen isel rule to turn that into a predicated VMVN. (That should
be better than the previous VMVN + VPSEL: it's the same number of
instructions but now it can fold into an adjacent VPT block.)

The unpredicated forms of VBIC and VORR are done by enabling the same
isel lowering as for NEON, recognizing appropriate immediates and
rewriting them as ARMISD::VBICIMM / ARMISD::VORRIMM SDNodes, which I
then instruction-select into the right MVE instructions (now that I've
also reworked those instructions to use the same MC operand encoding).
In order to do that, I had to promote the Tablegen SDNode instance
NEONvorrImm to a general ARMvorrImm available in MVE as well, and
similarly for NEONvbicImm.

The predicated forms of VBIC and VORR are represented as a vector
select between the original input vector and the output of the
unpredicated operation. The main convenience of this is that it still
lets me use the existing isel lowering for VBICIMM/VORRIMM, and not
have to write another copy of the operand encoding translation code.

This intrinsic family is the first to use the imm_simd system I put
into the MveEmitter tablegen backend. So, naturally, it showed up a
bug or two (emitting bogus range checks and the like). Fixed those,
and added a full set of tests for the permissible immediates in the
existing Sema test.

Also adjusted the isel pattern for vmovlb.u8, which stopped matching
because lowering started turning its input into a VBICIMM. Now it
recognizes the VBICIMM instead.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

simon_tatham created this revision.Jan 17 2020, 9:17 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptJan 17 2020, 9:17 AM

Herald added subscribers: llvm-commits, cfe-commits, hiraditya, kristof.beyls. · View Herald Transcript

Harbormaster completed remote builds in B44287: Diff 238803.Jan 17 2020, 9:17 AM

What is the reason that this can't be lowered in tablegen, in the same way as the VMOVimm's are?

For vbic vs vmovlb, the vmovlb does include a free register move, so may under some circumstances be slightly better. Like you say, it's mostly benign, but may be worth updating the MVE_VMOVL patterns.

Do you have any tests for what would be invalid bic values under MVE?

llvm/lib/Target/ARM/ARMISelLowering.cpp
12181	This is OK because we are passing OtherModImm to isVMOVModifiedImm, and MVE supports the same patterns as NEON?

In D72934#1829331, @dmgreen wrote:

What is the reason that this can't be lowered in tablegen, in the same way as the VMOVimm's are?

In NEON, immediate VBIC is represented as a single MC instruction, which takes its immediate operand already encoded into the NEON format (8 data bits, op and cmode). That's the same format that ARMISD::VBICIMM has encoded the operand in after lowering. So you only need one tablegen pattern, which passes the immediate through unchanged between the input and output SDNode types.

In MVE, immediate VBIC is represented as four separate MC instructions, for an 8-bit immediate shifted left by 0, 8, 16 or 24 bits. Each one takes the immediate operand in the 'natural' form, i.e. the numerical value that would be combined into the vector lane and shown in assembly. For example, MVE_VBICIZ16v4i32 takes an operand such as 0xab0000 which NEON VBIC would represent as 0xab | (control bits << 8). So the C++ isel code I've written has to undo the NEON encoding and turn it back into the 'natural' immediate value plus a choice of which MVE opcode to use.

I suppose an alternative would be to rework the MC representation of MVE VBIC/VORR so that they look more like the NEON versions. I don't exactly know why MVE was done differently in the first place (the commit here has my name on it, but it was a team effort). One possibility is that the pseudo-instruction reversed forms vand and vorn might be hard to represent that way, but I don't know.

Do you have any tests for what would be invalid bic values under MVE?

True, I suppose I could provide some immediates that are valid for other VMOVModImmTypes, like 0xabff, and make sure nothing goes wrong.

llvm/lib/Target/ARM/ARMISelLowering.cpp
12181	Yes: `OtherModImm` only matches values of the form '8-bit number shifted left by a multiple of 8 bits', which is just what MVE VBIC and VORR take as well.

In D72934#1829387, @simon_tatham wrote:

In D72934#1829331, @dmgreen wrote:

What is the reason that this can't be lowered in tablegen, in the same way as the VMOVimm's are?

In NEON, immediate VBIC is represented as a single MC instruction, which takes its immediate operand already encoded into the NEON format (8 data bits, op and cmode). That's the same format that ARMISD::VBICIMM has encoded the operand in after lowering. So you only need one tablegen pattern, which passes the immediate through unchanged between the input and output SDNode types.

In MVE, immediate VBIC is represented as four separate MC instructions, for an 8-bit immediate shifted left by 0, 8, 16 or 24 bits. Each one takes the immediate operand in the 'natural' form, i.e. the numerical value that would be combined into the vector lane and shown in assembly. For example, MVE_VBICIZ16v4i32 takes an operand such as 0xab0000 which NEON VBIC would represent as 0xab | (control bits << 8). So the C++ isel code I've written has to undo the NEON encoding and turn it back into the 'natural' immediate value plus a choice of which MVE opcode to use.

I suppose an alternative would be to rework the MC representation of MVE VBIC/VORR so that they look more like the NEON versions. I don't exactly know why MVE was done differently in the first place (the commit here has my name on it, but it was a team effort). One possibility is that the pseudo-instruction reversed forms vand and vorn might be hard to represent that way, but I don't know.

I believe that the downstream VMOVimm's were rewritten like this when the other BUILDVECTOR handling was added by DavidS. If it is possible to structure this way for BIC's too, it sounds like it might be a little cleaner.

I've revised the MC representations of VBIC and VORR as suggested, but that was a big enough patch that I've done it separately as D73205. This patch now sits on top of that one.

Changing VBIC and VORR meant I could do the isel for the unpredicated forms in pure Tablegen. But the predicated ones would still have needed C++, because the IR intrinsics would have wanted the immediate in its natural form, but by the time you generate an instruction, it has to be re-encoded as NEON. The simplest way was to stop adding new IR intrinsics, and instead encode the predicated instructions as a select. Then I still get to use isel lowering's conversion into VBICIMM/VORRIMM which does the immediate translation for me.

Adjusting the VMOVL pattern to expect the result of my modified lowering has made all those unrelated MVE codegen tests go back to the way they were before, so the new version of this patch doesn't have to change anything there.

Also added a negative llc test with an immediate that doesn't fit into VBICIMM, to prove that it gets sensibly selected as a different instruction sequence and nothing crashes.

Harbormaster completed remote builds in B44599: Diff 239609.Jan 22 2020, 8:31 AM

Looks good, from what I can tell.

I especially like the selects. We know that we have to do more work there, but adding this for more instructions would go a long way towards creating more predicated instructions (before the ability to do this in IR comes along).

This revision is now accepted and ready to land.Jan 23 2020, 1:13 AM

Closed by commit rG4321c6af28e9: [ARM,MVE] Support immediate vbicq,vorrq,vmvnq intrinsics. (authored by simon_tatham). · Explain WhyJan 23 2020, 3:55 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

clang/

include/

clang/

Basic/

arm_mve.td

22 lines

arm_mve_defs.td

24 lines

Sema/

Sema.h

6 lines

lib/

Sema/

SemaChecking.cpp

14 lines

test/

CodeGen/

arm-mve-intrinsics/

bitwise-imm.c

402 lines

Sema/

arm-mve-immediates.c

70 lines

utils/

TableGen/

MveEmitter.cpp

35 lines

llvm/

lib/

Target/

ARM/

4 lines

4 lines

61 lines

21 lines

test/

CodeGen/

Thumb2/

mve-intrinsics/

bitwise-imm.ll

365 lines

Diff 239845

clang/include/clang/Basic/arm_mve.td

	Show First 20 Lines • Show All 110 Lines • ▼ Show 20 Lines
	defm vornqf: bit_op_fp_with_inv<or>, NameOverride<"vornq">;			defm vornqf: bit_op_fp_with_inv<or>, NameOverride<"vornq">;
	defm vorrqf: bit_op_fp<or>, NameOverride<"vorrq">;			defm vorrqf: bit_op_fp<or>, NameOverride<"vorrq">;
	def vsubqf: Intrinsic<Vector, (args Vector:$a, Vector:$b), (fsub $a, $b)>,			def vsubqf: Intrinsic<Vector, (args Vector:$a, Vector:$b), (fsub $a, $b)>,
	NameOverride<"vsubq">;			NameOverride<"vsubq">;
	def vmulqf: Intrinsic<Vector, (args Vector:$a, Vector:$b), (fmul $a, $b)>,			def vmulqf: Intrinsic<Vector, (args Vector:$a, Vector:$b), (fmul $a, $b)>,
	NameOverride<"vmulq">;			NameOverride<"vmulq">;
	}			}

				let params = !listconcat(T.Int16, T.Int32) in {
				let pnt = PNT_None in {
				def vmvnq_n: Intrinsic<Vector, (args imm_simd_vmvn:$imm),
				(not (splat (Scalar $imm)))>;
				}
				defm vmvnq: IntrinsicMX<Vector, (args imm_simd_vmvn:$imm, Predicate:$pred),
				(select $pred, (not (splat (Scalar $imm))), $inactive),
				1, "_n", PNT_NType, PNT_None>;
				let pnt = PNT_NType in {
				def vbicq_n: Intrinsic<Vector, (args Vector:$v, imm_simd_restrictive:$imm),
				(and $v, (not (splat (Scalar $imm))))>;
				def vorrq_n: Intrinsic<Vector, (args Vector:$v, imm_simd_restrictive:$imm),
				(or $v, (splat (Scalar $imm)))>;
				}
				def vbicq_m_n: Intrinsic<
				Vector, (args Vector:$v, imm_simd_restrictive:$imm, Predicate:$pred),
				(select $pred, (and $v, (not (splat (Scalar $imm)))), $v)>;
				def vorrq_m_n: Intrinsic<
				Vector, (args Vector:$v, imm_simd_restrictive:$imm, Predicate:$pred),
				(select $pred, (or $v, (splat (Scalar $imm))), $v)>;
				}

	// The bitcasting below is not overcomplicating the IR because while			// The bitcasting below is not overcomplicating the IR because while
	// Vector and UVector may be different vector types at the C level i.e.			// Vector and UVector may be different vector types at the C level i.e.
	// vectors of same size signed/unsigned ints. Once they're lowered			// vectors of same size signed/unsigned ints. Once they're lowered
	// to IR, they are just bit vectors with no sign at all, so the			// to IR, they are just bit vectors with no sign at all, so the
	// bitcasts will be automatically elided by IRBuilder.			// bitcasts will be automatically elided by IRBuilder.
	multiclass predicated_bit_op_fp<string int_op> {			multiclass predicated_bit_op_fp<string int_op> {
	def "": Intrinsic<Vector, (args Vector:$inactive, Vector:$a, Vector:$b,			def "": Intrinsic<Vector, (args Vector:$inactive, Vector:$a, Vector:$b,
	Predicate:$pred),			Predicate:$pred),
	▲ Show 20 Lines • Show All 902 Lines • Show Last 20 Lines

clang/include/clang/Basic/arm_mve_defs.td

Show First 20 Lines • Show All 313 Lines • ▼ Show 20 Lines	class IB_ConstRange<int lo_, int hi_> : ImmediateBounds {
int hi = hi_;		int hi = hi_;
}		}
def IB_UEltValue : ImmediateBounds;		def IB_UEltValue : ImmediateBounds;
def IB_LaneIndex : ImmediateBounds;		def IB_LaneIndex : ImmediateBounds;
class IB_EltBit<int base_, Type type_ = Scalar> : ImmediateBounds {		class IB_EltBit<int base_, Type type_ = Scalar> : ImmediateBounds {
int base = base_;		int base = base_;
Type type = type_;		Type type = type_;
}		}
		def IB_ExtraArg_LaneSize;

// -----------------------------------------------------------------------------		// -----------------------------------------------------------------------------
// End-user definitions for immediate arguments.		// End-user definitions for immediate arguments.

// imm_simd and imm_simd_restrictive are used for the immediate operands to		// imm_simd and imm_simd_restrictive are used for the immediate operands to
// intrinsics like vmvnq or vorrq. imm_simd_restrictive has to be an 8-bit		// intrinsics like vmvnq or vorrq. imm_simd_restrictive has to be an 8-bit
// value shifted left by a whole number of bytes; imm_simd_vmvn can also be of		// value shifted left by a whole number of bytes; imm_simd_vmvn can also be of
// the form 0xXXFF for some byte value XX.		// the form 0xXXFF for some byte value XX.
def imm_simd_restrictive : Immediate<u32, IB_UEltValue> {		def imm_simd_restrictive : Immediate<Scalar, IB_UEltValue> {
let extra = "ShiftedByte";		let extra = "ShiftedByte";
		let extraarg = "!lanesize";
}		}
def imm_simd_vmvn : Immediate<u32, IB_UEltValue> {		def imm_simd_vmvn : Immediate<Scalar, IB_UEltValue> {
let extra = "ShiftedByteOrXXFF";		let extra = "ShiftedByteOrXXFF";
		let extraarg = "!lanesize";
}		}

// imm_1toN can take any value from 1 to N inclusive, where N is the number of		// imm_1toN can take any value from 1 to N inclusive, where N is the number of
// bits in the main parameter type. (E.g. an immediate shift count, in an		// bits in the main parameter type. (E.g. an immediate shift count, in an
// intrinsic that shifts every lane of a vector by the same amount.)		// intrinsic that shifts every lane of a vector by the same amount.)
//		//
// imm_0toNm1 is the same but with the range offset by 1, i.e. 0 to N-1		// imm_0toNm1 is the same but with the range offset by 1, i.e. 0 to N-1
// inclusive.		// inclusive.
▲ Show 20 Lines • Show All 109 Lines • ▼ Show 20 Lines
// record name.		// record name.

class NameOverride<string basename_> {		class NameOverride<string basename_> {
string basename = basename_;		string basename = basename_;
}		}

// A wrapper to define both _m and _x versions of a predicated		// A wrapper to define both _m and _x versions of a predicated
// intrinsic.		// intrinsic.
		//
		// We provide optional parameters to override the polymorphic name
		// types separately for the _m and _x variants, because sometimes they
		// polymorph differently (typically because the type of the inactive
		// parameter can be used as a disambiguator if it's present).
multiclass IntrinsicMX<Type rettype, dag arguments, dag cg,		multiclass IntrinsicMX<Type rettype, dag arguments, dag cg,
int wantXVariant = 1,		int wantXVariant = 1,
string nameSuffix = "",		string nameSuffix = "",
		PolymorphicNameType pnt_m = PNT_Type,
PolymorphicNameType pnt_x = PNT_Type> {		PolymorphicNameType pnt_x = PNT_Type> {
// The _m variant takes an initial parameter called $inactive, which		// The _m variant takes an initial parameter called $inactive, which
// provides the input value of the output register, i.e. all the		// provides the input value of the output register, i.e. all the
// inactive lanes in the predicated operation take their values from		// inactive lanes in the predicated operation take their values from
// this.		// this.
def "_m" # nameSuffix:		def "_m" # nameSuffix:
Intrinsic<rettype, !con((args rettype:$inactive), arguments), cg>;		Intrinsic<rettype, !con((args rettype:$inactive), arguments), cg> {
		let pnt = pnt_m;
		}

foreach unusedVar = !if(!eq(wantXVariant, 1), [1], []<int>) in {		foreach unusedVar = !if(!eq(wantXVariant, 1), [1], []<int>) in {
// The _x variant leaves off that parameter, and simply uses an		// The _x variant leaves off that parameter, and simply uses an
// undef value of the same type.		// undef value of the same type.

def "_x" # nameSuffix:		def "_x" # nameSuffix:
Intrinsic<rettype, arguments, (seq (undef rettype):$inactive, cg)> {		Intrinsic<rettype, arguments, (seq (undef rettype):$inactive, cg)> {
// Allow overriding of the polymorphic name type, because
// sometimes the _m and _x variants polymorph differently
// (typically because the type of the inactive parameter can be
// used as a disambiguator if it's present).
let pnt = pnt_x;		let pnt = pnt_x;
}		}
}		}
}		}

// -----------------------------------------------------------------------------		// -----------------------------------------------------------------------------
// Convenience lists of parameter types. 'T' is just a container record, so you		// Convenience lists of parameter types. 'T' is just a container record, so you
// can define a typical intrinsic with 'let Params = T.Usual', or similar,		// can define a typical intrinsic with 'let Params = T.Usual', or similar,
Show All 29 Lines

clang/include/clang/Sema/Sema.h

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 11,664 Lines • ▼ Show 20 Lines	ExprResult SemaBuiltinOperatorNewDeleteOverloaded(ExprResult TheCallResult,
bool IsDelete);		bool IsDelete);
bool SemaBuiltinConstantArg(CallExpr *TheCall, int ArgNum,		bool SemaBuiltinConstantArg(CallExpr *TheCall, int ArgNum,
llvm::APSInt &Result);		llvm::APSInt &Result);
bool SemaBuiltinConstantArgRange(CallExpr *TheCall, int ArgNum, int Low,		bool SemaBuiltinConstantArgRange(CallExpr *TheCall, int ArgNum, int Low,
int High, bool RangeIsError = true);		int High, bool RangeIsError = true);
bool SemaBuiltinConstantArgMultiple(CallExpr *TheCall, int ArgNum,		bool SemaBuiltinConstantArgMultiple(CallExpr *TheCall, int ArgNum,
unsigned Multiple);		unsigned Multiple);
bool SemaBuiltinConstantArgPower2(CallExpr *TheCall, int ArgNum);		bool SemaBuiltinConstantArgPower2(CallExpr *TheCall, int ArgNum);
bool SemaBuiltinConstantArgShiftedByte(CallExpr *TheCall, int ArgNum);		bool SemaBuiltinConstantArgShiftedByte(CallExpr *TheCall, int ArgNum,
bool SemaBuiltinConstantArgShiftedByteOrXXFF(CallExpr *TheCall, int ArgNum);		unsigned ArgBits);
		bool SemaBuiltinConstantArgShiftedByteOrXXFF(CallExpr *TheCall, int ArgNum,
		unsigned ArgBits);
bool SemaBuiltinARMSpecialReg(unsigned BuiltinID, CallExpr *TheCall,		bool SemaBuiltinARMSpecialReg(unsigned BuiltinID, CallExpr *TheCall,
int ArgNum, unsigned ExpectedFieldNum,		int ArgNum, unsigned ExpectedFieldNum,
bool AllowName);		bool AllowName);
bool SemaBuiltinARMMemoryTaggingCall(unsigned BuiltinID, CallExpr *TheCall);		bool SemaBuiltinARMMemoryTaggingCall(unsigned BuiltinID, CallExpr *TheCall);
public:		public:
enum FormatStringType {		enum FormatStringType {
FST_Scanf,		FST_Scanf,
FST_Printf,		FST_Printf,
▲ Show 20 Lines • Show All 385 Lines • Show Last 20 Lines

clang/lib/Sema/SemaChecking.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 5,586 Lines • ▼ Show 20 Lines	while (true) {
// shifted byte or not. So do that, and go round again.		// shifted byte or not. So do that, and go round again.
Value >>= 8;		Value >>= 8;
}		}
}		}

/// SemaBuiltinConstantArgShiftedByte - Check if argument ArgNum of TheCall is		/// SemaBuiltinConstantArgShiftedByte - Check if argument ArgNum of TheCall is
/// a constant expression representing an arbitrary byte value shifted left by		/// a constant expression representing an arbitrary byte value shifted left by
/// a multiple of 8 bits.		/// a multiple of 8 bits.
bool Sema::SemaBuiltinConstantArgShiftedByte(CallExpr *TheCall, int ArgNum) {		bool Sema::SemaBuiltinConstantArgShiftedByte(CallExpr *TheCall, int ArgNum,
		unsigned ArgBits) {
llvm::APSInt Result;		llvm::APSInt Result;

// We can't check the value of a dependent argument.		// We can't check the value of a dependent argument.
Expr *Arg = TheCall->getArg(ArgNum);		Expr *Arg = TheCall->getArg(ArgNum);
if (Arg->isTypeDependent() \|\| Arg->isValueDependent())		if (Arg->isTypeDependent() \|\| Arg->isValueDependent())
return false;		return false;

// Check constant-ness first.		// Check constant-ness first.
if (SemaBuiltinConstantArg(TheCall, ArgNum, Result))		if (SemaBuiltinConstantArg(TheCall, ArgNum, Result))
return true;		return true;

		// Truncate to the given size.
		Result = Result.getLoBits(ArgBits);
		Result.setIsUnsigned(true);

if (IsShiftedByte(Result))		if (IsShiftedByte(Result))
return false;		return false;

return Diag(TheCall->getBeginLoc(), diag::err_argument_not_shifted_byte)		return Diag(TheCall->getBeginLoc(), diag::err_argument_not_shifted_byte)
<< Arg->getSourceRange();		<< Arg->getSourceRange();
}		}

/// SemaBuiltinConstantArgShiftedByteOr0xFF - Check if argument ArgNum of		/// SemaBuiltinConstantArgShiftedByteOr0xFF - Check if argument ArgNum of
/// TheCall is a constant expression representing either a shifted byte value,		/// TheCall is a constant expression representing either a shifted byte value,
/// or a value of the form 0x??FF (i.e. a member of the arithmetic progression		/// or a value of the form 0x??FF (i.e. a member of the arithmetic progression
/// 0x00FF, 0x01FF, ..., 0xFFFF). This strange range check is needed for some		/// 0x00FF, 0x01FF, ..., 0xFFFF). This strange range check is needed for some
/// Arm MVE intrinsics.		/// Arm MVE intrinsics.
bool Sema::SemaBuiltinConstantArgShiftedByteOrXXFF(CallExpr *TheCall,		bool Sema::SemaBuiltinConstantArgShiftedByteOrXXFF(CallExpr *TheCall,
int ArgNum) {		int ArgNum,
		unsigned ArgBits) {
llvm::APSInt Result;		llvm::APSInt Result;

// We can't check the value of a dependent argument.		// We can't check the value of a dependent argument.
Expr *Arg = TheCall->getArg(ArgNum);		Expr *Arg = TheCall->getArg(ArgNum);
if (Arg->isTypeDependent() \|\| Arg->isValueDependent())		if (Arg->isTypeDependent() \|\| Arg->isValueDependent())
return false;		return false;

// Check constant-ness first.		// Check constant-ness first.
if (SemaBuiltinConstantArg(TheCall, ArgNum, Result))		if (SemaBuiltinConstantArg(TheCall, ArgNum, Result))
return true;		return true;

		// Truncate to the given size.
		Result = Result.getLoBits(ArgBits);
		Result.setIsUnsigned(true);

// Check to see if it's in either of the required forms.		// Check to see if it's in either of the required forms.
if (IsShiftedByte(Result) \|\|		if (IsShiftedByte(Result) \|\|
(Result > 0 && Result < 0x10000 && (Result & 0xFF) == 0xFF))		(Result > 0 && Result < 0x10000 && (Result & 0xFF) == 0xFF))
return false;		return false;

return Diag(TheCall->getBeginLoc(),		return Diag(TheCall->getBeginLoc(),
diag::err_argument_not_shifted_byte_or_xxff)		diag::err_argument_not_shifted_byte_or_xxff)
<< Arg->getSourceRange();		<< Arg->getSourceRange();
▲ Show 20 Lines • Show All 8,588 Lines • Show Last 20 Lines

clang/test/CodeGen/arm-mve-intrinsics/bitwise-imm.c

This file was added.

				// NOTE: Assertions have been autogenerated by utils/update_cc_test_checks.py
				// RUN: %clang_cc1 -triple thumbv8.1m.main-arm-none-eabi -target-feature +mve.fp -mfloat-abi hard -fallow-half-arguments-and-returns -O0 -disable-O0-optnone -S -emit-llvm -o - %s \| opt -S -mem2reg \| FileCheck %s
				// RUN: %clang_cc1 -triple thumbv8.1m.main-arm-none-eabi -target-feature +mve.fp -mfloat-abi hard -fallow-half-arguments-and-returns -O0 -disable-O0-optnone -DPOLYMORPHIC -S -emit-llvm -o - %s \| opt -S -mem2reg \| FileCheck %s

				#include <arm_mve.h>

				// CHECK-LABEL: @test_vbicq_n_s16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = and <8 x i16> [[A:%.]], <i16 11007, i16 11007, i16 11007, i16 11007, i16 11007, i16 11007, i16 11007, i16 11007>
				// CHECK-NEXT: ret <8 x i16> [[TMP0]]
				//
				int16x8_t test_vbicq_n_s16(int16x8_t a)
				{
				#ifdef POLYMORPHIC
				return vbicq(a, 0xd500);
				#else /* POLYMORPHIC */
				return vbicq_n_s16(a, 0xd500);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vbicq_n_s32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = and <4 x i32> [[A:%.]], <i32 -252, i32 -252, i32 -252, i32 -252>
				// CHECK-NEXT: ret <4 x i32> [[TMP0]]
				//
				int32x4_t test_vbicq_n_s32(int32x4_t a)
				{
				#ifdef POLYMORPHIC
				return vbicq(a, 0xfb);
				#else /* POLYMORPHIC */
				return vbicq_n_s32(a, 0xfb);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vbicq_n_u16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = and <8 x i16> [[A:%.]], <i16 -243, i16 -243, i16 -243, i16 -243, i16 -243, i16 -243, i16 -243, i16 -243>
				// CHECK-NEXT: ret <8 x i16> [[TMP0]]
				//
				uint16x8_t test_vbicq_n_u16(uint16x8_t a)
				{
				#ifdef POLYMORPHIC
				return vbicq(a, 0xf2);
				#else /* POLYMORPHIC */
				return vbicq_n_u16(a, 0xf2);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vbicq_n_u32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = and <4 x i32> [[A:%.]], <i32 -8193, i32 -8193, i32 -8193, i32 -8193>
				// CHECK-NEXT: ret <4 x i32> [[TMP0]]
				//
				uint32x4_t test_vbicq_n_u32(uint32x4_t a)
				{
				#ifdef POLYMORPHIC
				return vbicq(a, 0x2000);
				#else /* POLYMORPHIC */
				return vbicq_n_u32(a, 0x2000);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vorrq_n_s16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = or <8 x i16> [[A:%.]], <i16 195, i16 195, i16 195, i16 195, i16 195, i16 195, i16 195, i16 195>
				// CHECK-NEXT: ret <8 x i16> [[TMP0]]
				//
				int16x8_t test_vorrq_n_s16(int16x8_t a)
				{
				#ifdef POLYMORPHIC
				return vorrq(a, 0xc3);
				#else /* POLYMORPHIC */
				return vorrq_n_s16(a, 0xc3);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vorrq_n_s32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = or <4 x i32> [[A:%.]], <i32 65536, i32 65536, i32 65536, i32 65536>
				// CHECK-NEXT: ret <4 x i32> [[TMP0]]
				//
				int32x4_t test_vorrq_n_s32(int32x4_t a)
				{
				#ifdef POLYMORPHIC
				return vorrq(a, 0x10000);
				#else /* POLYMORPHIC */
				return vorrq_n_s32(a, 0x10000);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vorrq_n_u16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = or <8 x i16> [[A:%.]], <i16 -4096, i16 -4096, i16 -4096, i16 -4096, i16 -4096, i16 -4096, i16 -4096, i16 -4096>
				// CHECK-NEXT: ret <8 x i16> [[TMP0]]
				//
				uint16x8_t test_vorrq_n_u16(uint16x8_t a)
				{
				#ifdef POLYMORPHIC
				return vorrq(a, 0xf000);
				#else /* POLYMORPHIC */
				return vorrq_n_u16(a, 0xf000);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vorrq_n_u32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = or <4 x i32> [[A:%.]], <i32 8978432, i32 8978432, i32 8978432, i32 8978432>
				// CHECK-NEXT: ret <4 x i32> [[TMP0]]
				//
				uint32x4_t test_vorrq_n_u32(uint32x4_t a)
				{
				#ifdef POLYMORPHIC
				return vorrq(a, 0x890000);
				#else /* POLYMORPHIC */
				return vorrq_n_u32(a, 0x890000);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vmvnq_n_s16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: ret <8 x i16> <i16 27391, i16 27391, i16 27391, i16 27391, i16 27391, i16 27391, i16 27391, i16 27391>
				//
				int16x8_t test_vmvnq_n_s16()
				{
				return vmvnq_n_s16(0x9500);
				}

				// CHECK-LABEL: @test_vmvnq_n_s32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: ret <4 x i32> <i32 -5570561, i32 -5570561, i32 -5570561, i32 -5570561>
				//
				int32x4_t test_vmvnq_n_s32()
				{
				return vmvnq_n_s32(0x550000);
				}

				// CHECK-LABEL: @test_vmvnq_n_u16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: ret <8 x i16> <i16 -18689, i16 -18689, i16 -18689, i16 -18689, i16 -18689, i16 -18689, i16 -18689, i16 -18689>
				//
				uint16x8_t test_vmvnq_n_u16()
				{
				return vmvnq_n_u16(0x4900);
				}

				// CHECK-LABEL: @test_vmvnq_n_u32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: ret <4 x i32> <i32 1023410175, i32 1023410175, i32 1023410175, i32 1023410175>
				//
				uint32x4_t test_vmvnq_n_u32()
				{
				return vmvnq_n_u32(0xc3000000);
				}

				// CHECK-LABEL: @test_vbicq_m_n_s16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = and <8 x i16> [[A:%.]], <i16 -11265, i16 -11265, i16 -11265, i16 -11265, i16 -11265, i16 -11265, i16 -11265, i16 -11265>
				// CHECK-NEXT: [[TMP3:%.*]] = select <8 x i1> [[TMP1]], <8 x i16> [[TMP2]], <8 x i16> [[A]]
				// CHECK-NEXT: ret <8 x i16> [[TMP3]]
				//
				int16x8_t test_vbicq_m_n_s16(int16x8_t a, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vbicq_m_n(a, 0x2c00, p);
				#else /* POLYMORPHIC */
				return vbicq_m_n_s16(a, 0x2c00, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vbicq_m_n_s32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = and <4 x i32> [[A:%.]], <i32 -13893633, i32 -13893633, i32 -13893633, i32 -13893633>
				// CHECK-NEXT: [[TMP3:%.*]] = select <4 x i1> [[TMP1]], <4 x i32> [[TMP2]], <4 x i32> [[A]]
				// CHECK-NEXT: ret <4 x i32> [[TMP3]]
				//
				int32x4_t test_vbicq_m_n_s32(int32x4_t a, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vbicq_m_n(a, 0xd40000, p);
				#else /* POLYMORPHIC */
				return vbicq_m_n_s32(a, 0xd40000, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vbicq_m_n_u16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = and <8 x i16> [[A:%.]], <i16 -37, i16 -37, i16 -37, i16 -37, i16 -37, i16 -37, i16 -37, i16 -37>
				// CHECK-NEXT: [[TMP3:%.*]] = select <8 x i1> [[TMP1]], <8 x i16> [[TMP2]], <8 x i16> [[A]]
				// CHECK-NEXT: ret <8 x i16> [[TMP3]]
				//
				uint16x8_t test_vbicq_m_n_u16(uint16x8_t a, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vbicq_m_n(a, 0x24, p);
				#else /* POLYMORPHIC */
				return vbicq_m_n_u16(a, 0x24, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vbicq_m_n_u32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = and <4 x i32> [[A:%.]], <i32 -1644167169, i32 -1644167169, i32 -1644167169, i32 -1644167169>
				// CHECK-NEXT: [[TMP3:%.*]] = select <4 x i1> [[TMP1]], <4 x i32> [[TMP2]], <4 x i32> [[A]]
				// CHECK-NEXT: ret <4 x i32> [[TMP3]]
				//
				uint32x4_t test_vbicq_m_n_u32(uint32x4_t a, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vbicq_m_n(a, 0x62000000, p);
				#else /* POLYMORPHIC */
				return vbicq_m_n_u32(a, 0x62000000, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vorrq_m_n_s16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = or <8 x i16> [[A:%.]], <i16 13568, i16 13568, i16 13568, i16 13568, i16 13568, i16 13568, i16 13568, i16 13568>
				// CHECK-NEXT: [[TMP3:%.*]] = select <8 x i1> [[TMP1]], <8 x i16> [[TMP2]], <8 x i16> [[A]]
				// CHECK-NEXT: ret <8 x i16> [[TMP3]]
				//
				int16x8_t test_vorrq_m_n_s16(int16x8_t a, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vorrq_m_n(a, 0x3500, p);
				#else /* POLYMORPHIC */
				return vorrq_m_n_s16(a, 0x3500, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vorrq_m_n_s32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = or <4 x i32> [[A:%.]], <i32 654311424, i32 654311424, i32 654311424, i32 654311424>
				// CHECK-NEXT: [[TMP3:%.*]] = select <4 x i1> [[TMP1]], <4 x i32> [[TMP2]], <4 x i32> [[A]]
				// CHECK-NEXT: ret <4 x i32> [[TMP3]]
				//
				int32x4_t test_vorrq_m_n_s32(int32x4_t a, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vorrq_m_n(a, 0x27000000, p);
				#else /* POLYMORPHIC */
				return vorrq_m_n_s32(a, 0x27000000, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vorrq_m_n_u16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = or <8 x i16> [[A:%.]], <i16 175, i16 175, i16 175, i16 175, i16 175, i16 175, i16 175, i16 175>
				// CHECK-NEXT: [[TMP3:%.*]] = select <8 x i1> [[TMP1]], <8 x i16> [[TMP2]], <8 x i16> [[A]]
				// CHECK-NEXT: ret <8 x i16> [[TMP3]]
				//
				uint16x8_t test_vorrq_m_n_u16(uint16x8_t a, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vorrq_m_n(a, 0xaf, p);
				#else /* POLYMORPHIC */
				return vorrq_m_n_u16(a, 0xaf, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vorrq_m_n_u32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = or <4 x i32> [[A:%.]], <i32 89, i32 89, i32 89, i32 89>
				// CHECK-NEXT: [[TMP3:%.*]] = select <4 x i1> [[TMP1]], <4 x i32> [[TMP2]], <4 x i32> [[A]]
				// CHECK-NEXT: ret <4 x i32> [[TMP3]]
				//
				uint32x4_t test_vorrq_m_n_u32(uint32x4_t a, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vorrq_m_n(a, 0x59, p);
				#else /* POLYMORPHIC */
				return vorrq_m_n_u32(a, 0x59, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vmvnq_m_n_s16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = select <8 x i1> [[TMP1]], <8 x i16> <i16 -3841, i16 -3841, i16 -3841, i16 -3841, i16 -3841, i16 -3841, i16 -3841, i16 -3841>, <8 x i16> [[INACTIVE:%.]]
				// CHECK-NEXT: ret <8 x i16> [[TMP2]]
				//
				int16x8_t test_vmvnq_m_n_s16(int16x8_t inactive, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vmvnq_m(inactive, 0xf00, p);
				#else /* POLYMORPHIC */
				return vmvnq_m_n_s16(inactive, 0xf00, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vmvnq_m_n_s32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = select <4 x i1> [[TMP1]], <4 x i32> <i32 -18945, i32 -18945, i32 -18945, i32 -18945>, <4 x i32> [[INACTIVE:%.]]
				// CHECK-NEXT: ret <4 x i32> [[TMP2]]
				//
				int32x4_t test_vmvnq_m_n_s32(int32x4_t inactive, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vmvnq_m(inactive, 0x4a00, p);
				#else /* POLYMORPHIC */
				return vmvnq_m_n_s32(inactive, 0x4a00, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vmvnq_m_n_u16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = select <8 x i1> [[TMP1]], <8 x i16> <i16 23295, i16 23295, i16 23295, i16 23295, i16 23295, i16 23295, i16 23295, i16 23295>, <8 x i16> [[INACTIVE:%.]]
				// CHECK-NEXT: ret <8 x i16> [[TMP2]]
				//
				uint16x8_t test_vmvnq_m_n_u16(uint16x8_t inactive, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vmvnq_m(inactive, 0xa500, p);
				#else /* POLYMORPHIC */
				return vmvnq_m_n_u16(inactive, 0xa500, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vmvnq_m_n_u32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.]] = select <4 x i1> [[TMP1]], <4 x i32> <i32 -63489, i32 -63489, i32 -63489, i32 -63489>, <4 x i32> [[INACTIVE:%.]]
				// CHECK-NEXT: ret <4 x i32> [[TMP2]]
				//
				uint32x4_t test_vmvnq_m_n_u32(uint32x4_t inactive, mve_pred16_t p)
				{
				#ifdef POLYMORPHIC
				return vmvnq_m(inactive, 0xf800, p);
				#else /* POLYMORPHIC */
				return vmvnq_m_n_u32(inactive, 0xf800, p);
				#endif /* POLYMORPHIC */
				}

				// CHECK-LABEL: @test_vmvnq_x_n_s16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.*]] = select <8 x i1> [[TMP1]], <8 x i16> <i16 767, i16 767, i16 767, i16 767, i16 767, i16 767, i16 767, i16 767>, <8 x i16> undef
				// CHECK-NEXT: ret <8 x i16> [[TMP2]]
				//
				int16x8_t test_vmvnq_x_n_s16(mve_pred16_t p)
				{
				return vmvnq_x_n_s16(0xfd00, p);
				}

				// CHECK-LABEL: @test_vmvnq_x_n_s32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.*]] = select <4 x i1> [[TMP1]], <4 x i32> <i32 -12189697, i32 -12189697, i32 -12189697, i32 -12189697>, <4 x i32> undef
				// CHECK-NEXT: ret <4 x i32> [[TMP2]]
				//
				int32x4_t test_vmvnq_x_n_s32(mve_pred16_t p)
				{
				return vmvnq_x_n_s32(0xba0000, p);
				}

				// CHECK-LABEL: @test_vmvnq_x_n_u16(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.*]] = select <8 x i1> [[TMP1]], <8 x i16> <i16 -21505, i16 -21505, i16 -21505, i16 -21505, i16 -21505, i16 -21505, i16 -21505, i16 -21505>, <8 x i16> undef
				// CHECK-NEXT: ret <8 x i16> [[TMP2]]
				//
				uint16x8_t test_vmvnq_x_n_u16(mve_pred16_t p)
				{
				return vmvnq_x_n_u16(0x5400, p);
				}

				// CHECK-LABEL: @test_vmvnq_x_n_u32(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TMP0:%.]] = zext i16 [[P:%.]] to i32
				// CHECK-NEXT: [[TMP1:%.*]] = call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 [[TMP0]])
				// CHECK-NEXT: [[TMP2:%.*]] = select <4 x i1> [[TMP1]], <4 x i32> <i32 -4865, i32 -4865, i32 -4865, i32 -4865>, <4 x i32> undef
				// CHECK-NEXT: ret <4 x i32> [[TMP2]]
				//
				uint32x4_t test_vmvnq_x_n_u32(mve_pred16_t p)
				{
				return vmvnq_x_n_u32(0x1300, p);
				}

clang/test/Sema/arm-mve-immediates.c

Show First 20 Lines • Show All 197 Lines • ▼ Show 20 Lines	void test_immediate_shifts(uint8x16_t vb, uint16x8_t vh, uint32x4_t vw)

vsriq(vb, vb, 0); // expected-error {{argument value 0 is outside the valid range [1, 8]}}		vsriq(vb, vb, 0); // expected-error {{argument value 0 is outside the valid range [1, 8]}}
vsriq(vb, vb, 9); // expected-error {{argument value 9 is outside the valid range [1, 8]}}		vsriq(vb, vb, 9); // expected-error {{argument value 9 is outside the valid range [1, 8]}}
vsriq(vh, vh, 0); // expected-error {{argument value 0 is outside the valid range [1, 16]}}		vsriq(vh, vh, 0); // expected-error {{argument value 0 is outside the valid range [1, 16]}}
vsriq(vh, vh, 17); // expected-error {{argument value 17 is outside the valid range [1, 16]}}		vsriq(vh, vh, 17); // expected-error {{argument value 17 is outside the valid range [1, 16]}}
vsriq(vw, vw, 0); // expected-error {{argument value 0 is outside the valid range [1, 32]}}		vsriq(vw, vw, 0); // expected-error {{argument value 0 is outside the valid range [1, 32]}}
vsriq(vw, vw, 33); // expected-error {{argument value 33 is outside the valid range [1, 32]}}		vsriq(vw, vw, 33); // expected-error {{argument value 33 is outside the valid range [1, 32]}}
}		}

		void test_simd_bic_orr(int16x8_t h, int32x4_t w)
		{
		h = vbicq(h, 0x0000);
		h = vbicq(h, 0x0001);
		h = vbicq(h, 0x00FF);
		h = vbicq(h, 0x0100);
		h = vbicq(h, 0x0101); // expected-error-re {{argument should be an 8-bit value shifted by a multiple of 8 bits{{$}}}}
		h = vbicq(h, 0x01FF); // expected-error-re {{argument should be an 8-bit value shifted by a multiple of 8 bits{{$}}}}
		h = vbicq(h, 0xFF00);

		w = vbicq(w, 0x00000000);
		w = vbicq(w, 0x00000001);
		w = vbicq(w, 0x000000FF);
		w = vbicq(w, 0x00000100);
		w = vbicq(w, 0x0000FF00);
		w = vbicq(w, 0x00010000);
		w = vbicq(w, 0x00FF0000);
		w = vbicq(w, 0x01000000);
		w = vbicq(w, 0xFF000000);
		w = vbicq(w, 0x01000001); // expected-error-re {{argument should be an 8-bit value shifted by a multiple of 8 bits{{$}}}}
		w = vbicq(w, 0x01FFFFFF); // expected-error-re {{argument should be an 8-bit value shifted by a multiple of 8 bits{{$}}}}

		h = vorrq(h, 0x0000);
		h = vorrq(h, 0x0001);
		h = vorrq(h, 0x00FF);
		h = vorrq(h, 0x0100);
		h = vorrq(h, 0x0101); // expected-error-re {{argument should be an 8-bit value shifted by a multiple of 8 bits{{$}}}}
		h = vorrq(h, 0x01FF); // expected-error-re {{argument should be an 8-bit value shifted by a multiple of 8 bits{{$}}}}
		h = vorrq(h, 0xFF00);

		w = vorrq(w, 0x00000000);
		w = vorrq(w, 0x00000001);
		w = vorrq(w, 0x000000FF);
		w = vorrq(w, 0x00000100);
		w = vorrq(w, 0x0000FF00);
		w = vorrq(w, 0x00010000);
		w = vorrq(w, 0x00FF0000);
		w = vorrq(w, 0x01000000);
		w = vorrq(w, 0xFF000000);
		w = vorrq(w, 0x01000001); // expected-error-re {{argument should be an 8-bit value shifted by a multiple of 8 bits{{$}}}}
		w = vorrq(w, 0x01FFFFFF); // expected-error-re {{argument should be an 8-bit value shifted by a multiple of 8 bits{{$}}}}
		}

		void test_simd_vmvn(void)
		{
		uint16x8_t h;
		h = vmvnq_n_u16(0x0000);
		h = vmvnq_n_u16(0x0001);
		h = vmvnq_n_u16(0x00FF);
		h = vmvnq_n_u16(0x0100);
		h = vmvnq_n_u16(0x0101); // expected-error {{argument should be an 8-bit value shifted by a multiple of 8 bits, or in the form 0x??FF}}
		h = vmvnq_n_u16(0x01FF);
		h = vmvnq_n_u16(0xFF00);

		uint32x4_t w;
		w = vmvnq_n_u32(0x00000000);
		w = vmvnq_n_u32(0x00000001);
		w = vmvnq_n_u32(0x000000FF);
		w = vmvnq_n_u32(0x00000100);
		w = vmvnq_n_u32(0x0000FF00);
		w = vmvnq_n_u32(0x00010000);
		w = vmvnq_n_u32(0x00FF0000);
		w = vmvnq_n_u32(0x01000000);
		w = vmvnq_n_u32(0xFF000000);
		w = vmvnq_n_u32(0x01000001); // expected-error {{argument should be an 8-bit value shifted by a multiple of 8 bits, or in the form 0x??FF}}
		w = vmvnq_n_u32(0x01FFFFFF); // expected-error {{argument should be an 8-bit value shifted by a multiple of 8 bits, or in the form 0x??FF}}
		w = vmvnq_n_u32(0x0001FFFF); // expected-error {{argument should be an 8-bit value shifted by a multiple of 8 bits, or in the form 0x??FF}}
		w = vmvnq_n_u32(0x000001FF);
		}

clang/utils/TableGen/MveEmitter.cpp

Show First 20 Lines • Show All 877 Lines • ▼ Show 20 Lines	for (const auto &kv : ImmediateArgs) {
llvm::APInt lo(128, 0), hi(128, 0);		llvm::APInt lo(128, 0), hi(128, 0);
switch (IA.boundsType) {		switch (IA.boundsType) {
case ImmediateArg::BoundsType::ExplicitRange:		case ImmediateArg::BoundsType::ExplicitRange:
lo = IA.i1;		lo = IA.i1;
hi = IA.i2;		hi = IA.i2;
break;		break;
case ImmediateArg::BoundsType::UInt:		case ImmediateArg::BoundsType::UInt:
lo = 0;		lo = 0;
hi = IA.i1;		hi = llvm::APInt::getMaxValue(IA.i1).zext(128);
break;		break;
}		}

llvm::APInt typelo, typehi;
unsigned Bits = IA.ArgType->sizeInBits();
if (cast<ScalarType>(IA.ArgType)->kind() == ScalarTypeKind::SignedInt) {
typelo = llvm::APInt::getSignedMinValue(Bits).sext(128);
typehi = llvm::APInt::getSignedMaxValue(Bits).sext(128);
} else {
typelo = llvm::APInt::getMinValue(Bits).zext(128);
typehi = llvm::APInt::getMaxValue(Bits).zext(128);
}

std::string Index = utostr(kv.first);		std::string Index = utostr(kv.first);

if (lo.sle(typelo) && hi.sge(typehi))		// Emit a range check if the legal range of values for the
SemaChecks.push_back("SemaBuiltinConstantArg(TheCall, " + Index + ")");		// immediate is smaller than the _possible_ range of values for
else		// its type.
		unsigned ArgTypeBits = IA.ArgType->sizeInBits();
		llvm::APInt ArgTypeRange = llvm::APInt::getMaxValue(ArgTypeBits).zext(128);
		llvm::APInt ActualRange = (hi-lo).trunc(64).sext(128);
		if (ActualRange.ult(ArgTypeRange))
SemaChecks.push_back("SemaBuiltinConstantArgRange(TheCall, " + Index +		SemaChecks.push_back("SemaBuiltinConstantArgRange(TheCall, " + Index +
", " + signedHexLiteral(lo) + ", " +		", " + signedHexLiteral(lo) + ", " +
signedHexLiteral(hi) + ")");		signedHexLiteral(hi) + ")");

if (!IA.ExtraCheckType.empty()) {		if (!IA.ExtraCheckType.empty()) {
std::string Suffix;		std::string Suffix;
if (!IA.ExtraCheckArgs.empty())		if (!IA.ExtraCheckArgs.empty()) {
Suffix = (Twine(", ") + IA.ExtraCheckArgs).str();		std::string tmp;
		StringRef Arg = IA.ExtraCheckArgs;
		if (Arg == "!lanesize") {
		tmp = utostr(IA.ArgType->sizeInBits());
		Arg = tmp;
		}
		Suffix = (Twine(", ") + Arg).str();
		}
SemaChecks.push_back((Twine("SemaBuiltinConstantArg") +		SemaChecks.push_back((Twine("SemaBuiltinConstantArg") +
IA.ExtraCheckType + "(TheCall, " + Index +		IA.ExtraCheckType + "(TheCall, " + Index +
Suffix + ")")		Suffix + ")")
.str());		.str());
}		}

		assert(!SemaChecks.empty());
}		}
if (SemaChecks.empty())		if (SemaChecks.empty())
return "";		return "";
return (Twine(" return ") +		return (Twine(" return ") +
join(std::begin(SemaChecks), std::end(SemaChecks),		join(std::begin(SemaChecks), std::end(SemaChecks),
" \|\|\n ") +		" \|\|\n ") +
";\n")		";\n")
.str();		.str();
▲ Show 20 Lines • Show All 991 Lines • Show Last 20 Lines

llvm/lib/Target/ARM/ARMISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 12,170 Lines • ▼ Show 20 Lines	static SDValue PerformANDCombine(SDNode *N,
SelectionDAG &DAG = DCI.DAG;		SelectionDAG &DAG = DCI.DAG;

if(!DAG.getTargetLoweringInfo().isTypeLegal(VT))		if(!DAG.getTargetLoweringInfo().isTypeLegal(VT))
return SDValue();		return SDValue();

APInt SplatBits, SplatUndef;		APInt SplatBits, SplatUndef;
unsigned SplatBitSize;		unsigned SplatBitSize;
bool HasAnyUndefs;		bool HasAnyUndefs;
if (BVN && Subtarget->hasNEON() &&		if (BVN && (Subtarget->hasNEON() \|\| Subtarget->hasMVEIntegerOps()) &&
BVN->isConstantSplat(SplatBits, SplatUndef, SplatBitSize, HasAnyUndefs)) {		BVN->isConstantSplat(SplatBits, SplatUndef, SplatBitSize, HasAnyUndefs)) {
if (SplatBitSize <= 64) {		if (SplatBitSize <= 64) {
		dmgreenUnsubmitted Done Reply Inline Actions This is OK because we are passing OtherModImm to isVMOVModifiedImm, and MVE supports the same patterns as NEON? dmgreen: This is OK because we are passing OtherModImm to isVMOVModifiedImm, and MVE supports the same…
		simon_tathamAuthorUnsubmitted Done Reply Inline Actions Yes: `OtherModImm` only matches values of the form '8-bit number shifted left by a multiple of 8 bits', which is just what MVE VBIC and VORR take as well. simon_tatham: Yes: `OtherModImm` only matches values of the form '8-bit number shifted left by a multiple of…
EVT VbicVT;		EVT VbicVT;
SDValue Val = isVMOVModifiedImm((~SplatBits).getZExtValue(),		SDValue Val = isVMOVModifiedImm((~SplatBits).getZExtValue(),
SplatUndef.getZExtValue(), SplatBitSize,		SplatUndef.getZExtValue(), SplatBitSize,
DAG, dl, VbicVT, VT.is128BitVector(),		DAG, dl, VbicVT, VT.is128BitVector(),
OtherModImm);		OtherModImm);
if (Val.getNode()) {		if (Val.getNode()) {
SDValue Input =		SDValue Input =
DAG.getNode(ISD::BITCAST, dl, VbicVT, N->getOperand(0));		DAG.getNode(ISD::BITCAST, dl, VbicVT, N->getOperand(0));
▲ Show 20 Lines • Show All 288 Lines • ▼ Show 20 Lines	static SDValue PerformORCombine(SDNode *N,
SelectionDAG &DAG = DCI.DAG;		SelectionDAG &DAG = DCI.DAG;

if(!DAG.getTargetLoweringInfo().isTypeLegal(VT))		if(!DAG.getTargetLoweringInfo().isTypeLegal(VT))
return SDValue();		return SDValue();

APInt SplatBits, SplatUndef;		APInt SplatBits, SplatUndef;
unsigned SplatBitSize;		unsigned SplatBitSize;
bool HasAnyUndefs;		bool HasAnyUndefs;
if (BVN && Subtarget->hasNEON() &&		if (BVN && (Subtarget->hasNEON() \|\| Subtarget->hasMVEIntegerOps()) &&
BVN->isConstantSplat(SplatBits, SplatUndef, SplatBitSize, HasAnyUndefs)) {		BVN->isConstantSplat(SplatBits, SplatUndef, SplatBitSize, HasAnyUndefs)) {
if (SplatBitSize <= 64) {		if (SplatBitSize <= 64) {
EVT VorrVT;		EVT VorrVT;
SDValue Val = isVMOVModifiedImm(SplatBits.getZExtValue(),		SDValue Val = isVMOVModifiedImm(SplatBits.getZExtValue(),
SplatUndef.getZExtValue(), SplatBitSize,		SplatUndef.getZExtValue(), SplatBitSize,
DAG, dl, VorrVT, VT.is128BitVector(),		DAG, dl, VorrVT, VT.is128BitVector(),
OtherModImm);		OtherModImm);
if (Val.getNode()) {		if (Val.getNode()) {
▲ Show 20 Lines • Show All 5,127 Lines • Show Last 20 Lines

llvm/lib/Target/ARM/ARMInstrInfo.td

	Show First 20 Lines • Show All 268 Lines • ▼ Show 20 Lines
	def ARMvgetlaneu : SDNode<"ARMISD::VGETLANEu", SDTARMVGETLN>;			def ARMvgetlaneu : SDNode<"ARMISD::VGETLANEu", SDTARMVGETLN>;
	def ARMvgetlanes : SDNode<"ARMISD::VGETLANEs", SDTARMVGETLN>;			def ARMvgetlanes : SDNode<"ARMISD::VGETLANEs", SDTARMVGETLN>;

	def SDTARMVMOVIMM : SDTypeProfile<1, 1, [SDTCisVec<0>, SDTCisVT<1, i32>]>;			def SDTARMVMOVIMM : SDTypeProfile<1, 1, [SDTCisVec<0>, SDTCisVT<1, i32>]>;
	def ARMvmovImm : SDNode<"ARMISD::VMOVIMM", SDTARMVMOVIMM>;			def ARMvmovImm : SDNode<"ARMISD::VMOVIMM", SDTARMVMOVIMM>;
	def ARMvmvnImm : SDNode<"ARMISD::VMVNIMM", SDTARMVMOVIMM>;			def ARMvmvnImm : SDNode<"ARMISD::VMVNIMM", SDTARMVMOVIMM>;
	def ARMvmovFPImm : SDNode<"ARMISD::VMOVFPIMM", SDTARMVMOVIMM>;			def ARMvmovFPImm : SDNode<"ARMISD::VMOVFPIMM", SDTARMVMOVIMM>;

				def SDTARMVORRIMM : SDTypeProfile<1, 2, [SDTCisVec<0>, SDTCisSameAs<0, 1>,
				SDTCisVT<2, i32>]>;
				def ARMvorrImm : SDNode<"ARMISD::VORRIMM", SDTARMVORRIMM>;
				def ARMvbicImm : SDNode<"ARMISD::VBICIMM", SDTARMVORRIMM>;

	def SDTARMVSHIMM : SDTypeProfile<1, 2, [SDTCisInt<0>, SDTCisSameAs<0, 1>,			def SDTARMVSHIMM : SDTypeProfile<1, 2, [SDTCisInt<0>, SDTCisSameAs<0, 1>,
	SDTCisVT<2, i32>]>;			SDTCisVT<2, i32>]>;
	def SDTARMVSH : SDTypeProfile<1, 2, [SDTCisInt<0>, SDTCisSameAs<0, 1>,			def SDTARMVSH : SDTypeProfile<1, 2, [SDTCisInt<0>, SDTCisSameAs<0, 1>,
	SDTCisSameAs<0, 2>,]>;			SDTCisSameAs<0, 2>,]>;
	def ARMvshlImm : SDNode<"ARMISD::VSHLIMM", SDTARMVSHIMM>;			def ARMvshlImm : SDNode<"ARMISD::VSHLIMM", SDTARMVSHIMM>;
	def ARMvshrsImm : SDNode<"ARMISD::VSHRsIMM", SDTARMVSHIMM>;			def ARMvshrsImm : SDNode<"ARMISD::VSHRsIMM", SDTARMVSHIMM>;
	def ARMvshruImm : SDNode<"ARMISD::VSHRuIMM", SDTARMVSHIMM>;			def ARMvshruImm : SDNode<"ARMISD::VSHRuIMM", SDTARMVSHIMM>;
	▲ Show 20 Lines • Show All 5,993 Lines • Show Last 20 Lines

llvm/lib/Target/ARM/ARMInstrMVE.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,361 Lines • ▼ Show 20 Lines	class MVE_bit_cmode<string iname, string suffix, bit halfword, dag inOps>
let Inst{10} = !if(halfword, 0, imm{10});		let Inst{10} = !if(halfword, 0, imm{10});
let Inst{9} = imm{9};		let Inst{9} = imm{9};
let Inst{8} = 0b1;		let Inst{8} = 0b1;
let Inst{7-6} = 0b01;		let Inst{7-6} = 0b01;
let Inst{4} = 0b1;		let Inst{4} = 0b1;
let Inst{3-0} = imm{3-0};		let Inst{3-0} = imm{3-0};
}		}

class MVE_VORR<string suffix, bit hw, Operand imm_type>		multiclass MVE_bit_cmode_p<string iname, bit opcode,
: MVE_bit_cmode<"vorr", suffix, hw, (ins MQPR:$Qd_src, imm_type:$imm)> {		MVEVectorVTInfo VTI, Operand imm_type, SDNode op> {
let Inst{5} = 0b0;		def "" : MVE_bit_cmode<iname, VTI.Suffix, VTI.Size{0},
		(ins MQPR:$Qd_src, imm_type:$imm)> {
		let Inst{5} = opcode;
let validForTailPredication = 1;		let validForTailPredication = 1;
}		}

def MVE_VORRimmi16 : MVE_VORR<"i16", 1, nImmSplatI16>;		defvar Inst = !cast<Instruction>(NAME);
def MVE_VORRimmi32 : MVE_VORR<"i32", 0, nImmSplatI32>;		defvar UnpredPat = (VTI.Vec (op (VTI.Vec MQPR:$src), timm:$simm));

		let Predicates = [HasMVEInt] in {
		def : Pat<UnpredPat, (VTI.Vec (Inst (VTI.Vec MQPR:$src), imm_type:$simm))>;
		def : Pat<(VTI.Vec (vselect (VTI.Pred VCCR:$pred),
		UnpredPat, (VTI.Vec MQPR:$src))),
		(VTI.Vec (Inst (VTI.Vec MQPR:$src), imm_type:$simm,
		ARMVCCThen, (VTI.Pred VCCR:$pred)))>;
		}
		}

		multiclass MVE_VORRimm<MVEVectorVTInfo VTI, Operand imm_type> {
		defm "": MVE_bit_cmode_p<"vorr", 0, VTI, imm_type, ARMvorrImm>;
		}
		multiclass MVE_VBICimm<MVEVectorVTInfo VTI, Operand imm_type> {
		defm "": MVE_bit_cmode_p<"vbic", 1, VTI, imm_type, ARMvbicImm>;
		}

		defm MVE_VORRimmi16 : MVE_VORRimm<MVE_v8i16, nImmSplatI16>;
		defm MVE_VORRimmi32 : MVE_VORRimm<MVE_v4i32, nImmSplatI32>;
		defm MVE_VBICimmi16 : MVE_VBICimm<MVE_v8i16, nImmSplatI16>;
		defm MVE_VBICimmi32 : MVE_VBICimm<MVE_v4i32, nImmSplatI32>;

def MVE_VORNimmi16 : MVEInstAlias<"vorn${vp}.i16\t$Qd, $imm",		def MVE_VORNimmi16 : MVEInstAlias<"vorn${vp}.i16\t$Qd, $imm",
(MVE_VORRimmi16 MQPR:$Qd, nImmSplatNotI16:$imm, vpred_n:$vp), 0>;		(MVE_VORRimmi16 MQPR:$Qd, nImmSplatNotI16:$imm, vpred_n:$vp), 0>;
def MVE_VORNimmi32 : MVEInstAlias<"vorn${vp}.i32\t$Qd, $imm",		def MVE_VORNimmi32 : MVEInstAlias<"vorn${vp}.i32\t$Qd, $imm",
(MVE_VORRimmi32 MQPR:$Qd, nImmSplatNotI32:$imm, vpred_n:$vp), 0>;		(MVE_VORRimmi32 MQPR:$Qd, nImmSplatNotI32:$imm, vpred_n:$vp), 0>;

def MVE_VMOV : MVEInstAlias<"vmov${vp}\t$Qd, $Qm",
(MVE_VORR MQPR:$Qd, MQPR:$Qm, MQPR:$Qm, vpred_r:$vp)>;

class MVE_VBIC<string suffix, bit hw, Operand imm_type>
: MVE_bit_cmode<"vbic", suffix, hw, (ins MQPR:$Qd_src, imm_type:$imm)> {
let Inst{5} = 0b1;
let validForTailPredication = 1;
}

def MVE_VBICimmi16 : MVE_VBIC<"i16", 1, nImmSplatI16>;
def MVE_VBICimmi32 : MVE_VBIC<"i32", 0, nImmSplatI32>;

def MVE_VANDimmi16 : MVEInstAlias<"vand${vp}.i16\t$Qd, $imm",		def MVE_VANDimmi16 : MVEInstAlias<"vand${vp}.i16\t$Qd, $imm",
(MVE_VBICimmi16 MQPR:$Qd, nImmSplatNotI16:$imm, vpred_n:$vp), 0>;		(MVE_VBICimmi16 MQPR:$Qd, nImmSplatNotI16:$imm, vpred_n:$vp), 0>;
def MVE_VANDimmi32 : MVEInstAlias<"vand${vp}.i32\t$Qd, $imm",		def MVE_VANDimmi32 : MVEInstAlias<"vand${vp}.i32\t$Qd, $imm",
(MVE_VBICimmi32 MQPR:$Qd, nImmSplatNotI32:$imm, vpred_n:$vp), 0>;		(MVE_VBICimmi32 MQPR:$Qd, nImmSplatNotI32:$imm, vpred_n:$vp), 0>;

		def MVE_VMOV : MVEInstAlias<"vmov${vp}\t$Qd, $Qm",
		(MVE_VORR MQPR:$Qd, MQPR:$Qm, MQPR:$Qm, vpred_r:$vp)>;

class MVE_VMOV_lane_direction {		class MVE_VMOV_lane_direction {
bit bit_20;		bit bit_20;
dag oops;		dag oops;
dag iops;		dag iops;
string ops;		string ops;
string cstr;		string cstr;
}		}
def MVE_VMOV_from_lane : MVE_VMOV_lane_direction {		def MVE_VMOV_from_lane : MVE_VMOV_lane_direction {
▲ Show 20 Lines • Show All 792 Lines • ▼ Show 20 Lines	let Predicates = [HasMVEInt] in {

def : Pat<(v8i16 (ARMvmvnImm timm:$simm)),		def : Pat<(v8i16 (ARMvmvnImm timm:$simm)),
(v8i16 (MVE_VMVNimmi16 nImmSplatI16:$simm))>;		(v8i16 (MVE_VMVNimmi16 nImmSplatI16:$simm))>;
def : Pat<(v4i32 (ARMvmvnImm timm:$simm)),		def : Pat<(v4i32 (ARMvmvnImm timm:$simm)),
(v4i32 (MVE_VMVNimmi32 nImmVMOVI32:$simm))>;		(v4i32 (MVE_VMVNimmi32 nImmVMOVI32:$simm))>;

def : Pat<(v4f32 (ARMvmovFPImm timm:$simm)),		def : Pat<(v4f32 (ARMvmovFPImm timm:$simm)),
(v4f32 (MVE_VMOVimmf32 nImmVMOVF32:$simm))>;		(v4f32 (MVE_VMOVimmf32 nImmVMOVF32:$simm))>;

		def : Pat<(v8i16 (vselect (v8i1 VCCR:$pred), (ARMvmvnImm timm:$simm),
		MQPR:$inactive)),
		(v8i16 (MVE_VMVNimmi16 nImmSplatI16:$simm,
		ARMVCCThen, VCCR:$pred, MQPR:$inactive))>;
		def : Pat<(v4i32 (vselect (v4i1 VCCR:$pred), (ARMvmvnImm timm:$simm),
		MQPR:$inactive)),
		(v4i32 (MVE_VMVNimmi32 nImmSplatI32:$simm,
		ARMVCCThen, VCCR:$pred, MQPR:$inactive))>;
}		}

class MVE_VMINMAXA<string iname, string suffix, bits<2> size,		class MVE_VMINMAXA<string iname, string suffix, bits<2> size,
bit bit_12, list<dag> pattern=[]>		bit bit_12, list<dag> pattern=[]>
: MVE_p<(outs MQPR:$Qd), (ins MQPR:$Qd_src, MQPR:$Qm),		: MVE_p<(outs MQPR:$Qd), (ins MQPR:$Qd_src, MQPR:$Qm),
NoItinerary, iname, suffix, "$Qd, $Qm", vpred_n, "$Qd = $Qd_src",		NoItinerary, iname, suffix, "$Qd, $Qm", vpred_n, "$Qd = $Qd_src",
pattern> {		pattern> {
bits<4> Qd;		bits<4> Qd;
▲ Show 20 Lines • Show All 119 Lines • ▼ Show 20 Lines	def : Pat<(sext_inreg (v8i16 MQPR:$src), v8i8),
(MVE_VMOVLs8bh MQPR:$src)>;		(MVE_VMOVLs8bh MQPR:$src)>;
def : Pat<(sext_inreg (v4i32 MQPR:$src), v4i8),		def : Pat<(sext_inreg (v4i32 MQPR:$src), v4i8),
(MVE_VMOVLs16bh (MVE_VMOVLs8bh MQPR:$src))>;		(MVE_VMOVLs16bh (MVE_VMOVLs8bh MQPR:$src))>;

// zext_inreg 16 -> 32		// zext_inreg 16 -> 32
def : Pat<(and (v4i32 MQPR:$src), (v4i32 (ARMvmovImm (i32 0xCFF)))),		def : Pat<(and (v4i32 MQPR:$src), (v4i32 (ARMvmovImm (i32 0xCFF)))),
(MVE_VMOVLu16bh MQPR:$src)>;		(MVE_VMOVLu16bh MQPR:$src)>;
// zext_inreg 8 -> 16		// zext_inreg 8 -> 16
def : Pat<(and (v8i16 MQPR:$src), (v8i16 (ARMvmovImm (i32 0x8FF)))),		def : Pat<(ARMvbicImm (v8i16 MQPR:$src), (i32 0xAFF)),
(MVE_VMOVLu8bh MQPR:$src)>;		(MVE_VMOVLu8bh MQPR:$src)>;
}		}


class MVE_VSHLL_imm<string iname, string suffix, bit U, bit th,		class MVE_VSHLL_imm<string iname, string suffix, bit U, bit th,
Operand immtype, list<dag> pattern=[]>		Operand immtype, list<dag> pattern=[]>
: MVE_shift_imm<(outs MQPR:$Qd), (ins MQPR:$Qm, immtype:$imm),		: MVE_shift_imm<(outs MQPR:$Qd), (ins MQPR:$Qm, immtype:$imm),
iname, suffix, "$Qd, $Qm, $imm", vpred_r, "", pattern> {		iname, suffix, "$Qd, $Qm, $imm", vpred_r, "", pattern> {
▲ Show 20 Lines • Show All 4,053 Lines • Show Last 20 Lines

llvm/lib/Target/ARM/ARMInstrNEON.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 503 Lines • ▼ Show 20 Lines

	def NEONvqrshrnsImm : SDNode<"ARMISD::VQRSHRNsIMM", SDTARMVSHXIMM>;			def NEONvqrshrnsImm : SDNode<"ARMISD::VQRSHRNsIMM", SDTARMVSHXIMM>;
	def NEONvqrshrnuImm : SDNode<"ARMISD::VQRSHRNuIMM", SDTARMVSHXIMM>;			def NEONvqrshrnuImm : SDNode<"ARMISD::VQRSHRNuIMM", SDTARMVSHXIMM>;
	def NEONvqrshrnsuImm : SDNode<"ARMISD::VQRSHRNsuIMM", SDTARMVSHXIMM>;			def NEONvqrshrnsuImm : SDNode<"ARMISD::VQRSHRNsuIMM", SDTARMVSHXIMM>;

	def NEONvsliImm : SDNode<"ARMISD::VSLIIMM", SDTARMVSHINSIMM>;			def NEONvsliImm : SDNode<"ARMISD::VSLIIMM", SDTARMVSHINSIMM>;
	def NEONvsriImm : SDNode<"ARMISD::VSRIIMM", SDTARMVSHINSIMM>;			def NEONvsriImm : SDNode<"ARMISD::VSRIIMM", SDTARMVSHINSIMM>;

	def SDTARMVORRIMM : SDTypeProfile<1, 2, [SDTCisVec<0>, SDTCisSameAs<0, 1>,
	SDTCisVT<2, i32>]>;
	def NEONvorrImm : SDNode<"ARMISD::VORRIMM", SDTARMVORRIMM>;
	def NEONvbicImm : SDNode<"ARMISD::VBICIMM", SDTARMVORRIMM>;

	def NEONvbsl : SDNode<"ARMISD::VBSL",			def NEONvbsl : SDNode<"ARMISD::VBSL",
	SDTypeProfile<1, 3, [SDTCisVec<0>,			SDTypeProfile<1, 3, [SDTCisVec<0>,
	SDTCisSameAs<0, 1>,			SDTCisSameAs<0, 1>,
	SDTCisSameAs<0, 2>,			SDTCisSameAs<0, 2>,
	SDTCisSameAs<0, 3>]>>;			SDTCisSameAs<0, 3>]>>;

	def SDTARMVEXT : SDTypeProfile<1, 3, [SDTCisVec<0>, SDTCisSameAs<0, 1>,			def SDTARMVEXT : SDTypeProfile<1, 3, [SDTCisVec<0>, SDTCisSameAs<0, 1>,
	SDTCisSameAs<0, 2>, SDTCisVT<3, i32>]>;			SDTCisSameAs<0, 2>, SDTCisVT<3, i32>]>;
	▲ Show 20 Lines • Show All 4,766 Lines • ▼ Show 20 Lines
	def VORRq : N3VQX<0, 0, 0b10, 0b0001, 1, IIC_VBINiQ, "vorr",			def VORRq : N3VQX<0, 0, 0b10, 0b0001, 1, IIC_VBINiQ, "vorr",
	v4i32, v4i32, or, 1>;			v4i32, v4i32, or, 1>;

	def VORRiv4i16 : N1ModImm<1, 0b000, {1,0,?,1}, 0, 0, 0, 1,			def VORRiv4i16 : N1ModImm<1, 0b000, {1,0,?,1}, 0, 0, 0, 1,
	(outs DPR:$Vd), (ins nImmSplatI16:$SIMM, DPR:$src),			(outs DPR:$Vd), (ins nImmSplatI16:$SIMM, DPR:$src),
	IIC_VMOVImm,			IIC_VMOVImm,
	"vorr", "i16", "$Vd, $SIMM", "$src = $Vd",			"vorr", "i16", "$Vd, $SIMM", "$src = $Vd",
	[(set DPR:$Vd,			[(set DPR:$Vd,
	(v4i16 (NEONvorrImm DPR:$src, timm:$SIMM)))]> {			(v4i16 (ARMvorrImm DPR:$src, timm:$SIMM)))]> {
	let Inst{9} = SIMM{9};			let Inst{9} = SIMM{9};
	}			}

	def VORRiv2i32 : N1ModImm<1, 0b000, {0,?,?,1}, 0, 0, 0, 1,			def VORRiv2i32 : N1ModImm<1, 0b000, {0,?,?,1}, 0, 0, 0, 1,
	(outs DPR:$Vd), (ins nImmSplatI32:$SIMM, DPR:$src),			(outs DPR:$Vd), (ins nImmSplatI32:$SIMM, DPR:$src),
	IIC_VMOVImm,			IIC_VMOVImm,
	"vorr", "i32", "$Vd, $SIMM", "$src = $Vd",			"vorr", "i32", "$Vd, $SIMM", "$src = $Vd",
	[(set DPR:$Vd,			[(set DPR:$Vd,
	(v2i32 (NEONvorrImm DPR:$src, timm:$SIMM)))]> {			(v2i32 (ARMvorrImm DPR:$src, timm:$SIMM)))]> {
	let Inst{10-9} = SIMM{10-9};			let Inst{10-9} = SIMM{10-9};
	}			}

	def VORRiv8i16 : N1ModImm<1, 0b000, {1,0,?,1}, 0, 1, 0, 1,			def VORRiv8i16 : N1ModImm<1, 0b000, {1,0,?,1}, 0, 1, 0, 1,
	(outs QPR:$Vd), (ins nImmSplatI16:$SIMM, QPR:$src),			(outs QPR:$Vd), (ins nImmSplatI16:$SIMM, QPR:$src),
	IIC_VMOVImm,			IIC_VMOVImm,
	"vorr", "i16", "$Vd, $SIMM", "$src = $Vd",			"vorr", "i16", "$Vd, $SIMM", "$src = $Vd",
	[(set QPR:$Vd,			[(set QPR:$Vd,
	(v8i16 (NEONvorrImm QPR:$src, timm:$SIMM)))]> {			(v8i16 (ARMvorrImm QPR:$src, timm:$SIMM)))]> {
	let Inst{9} = SIMM{9};			let Inst{9} = SIMM{9};
	}			}

	def VORRiv4i32 : N1ModImm<1, 0b000, {0,?,?,1}, 0, 1, 0, 1,			def VORRiv4i32 : N1ModImm<1, 0b000, {0,?,?,1}, 0, 1, 0, 1,
	(outs QPR:$Vd), (ins nImmSplatI32:$SIMM, QPR:$src),			(outs QPR:$Vd), (ins nImmSplatI32:$SIMM, QPR:$src),
	IIC_VMOVImm,			IIC_VMOVImm,
	"vorr", "i32", "$Vd, $SIMM", "$src = $Vd",			"vorr", "i32", "$Vd, $SIMM", "$src = $Vd",
	[(set QPR:$Vd,			[(set QPR:$Vd,
	(v4i32 (NEONvorrImm QPR:$src, timm:$SIMM)))]> {			(v4i32 (ARMvorrImm QPR:$src, timm:$SIMM)))]> {
	let Inst{10-9} = SIMM{10-9};			let Inst{10-9} = SIMM{10-9};
	}			}


	// VBIC : Vector Bitwise Bit Clear (AND NOT)			// VBIC : Vector Bitwise Bit Clear (AND NOT)
	let TwoOperandAliasConstraint = "$Vn = $Vd" in {			let TwoOperandAliasConstraint = "$Vn = $Vd" in {
	def VBICd : N3VX<0, 0, 0b01, 0b0001, 0, 1, (outs DPR:$Vd),			def VBICd : N3VX<0, 0, 0b01, 0b0001, 0, 1, (outs DPR:$Vd),
	(ins DPR:$Vn, DPR:$Vm), N3RegFrm, IIC_VBINiD,			(ins DPR:$Vn, DPR:$Vm), N3RegFrm, IIC_VBINiD,
	"vbic", "$Vd, $Vn, $Vm", "",			"vbic", "$Vd, $Vn, $Vm", "",
	[(set DPR:$Vd, (v2i32 (and DPR:$Vn,			[(set DPR:$Vd, (v2i32 (and DPR:$Vn,
	(vnotd DPR:$Vm))))]>;			(vnotd DPR:$Vm))))]>;
	def VBICq : N3VX<0, 0, 0b01, 0b0001, 1, 1, (outs QPR:$Vd),			def VBICq : N3VX<0, 0, 0b01, 0b0001, 1, 1, (outs QPR:$Vd),
	(ins QPR:$Vn, QPR:$Vm), N3RegFrm, IIC_VBINiQ,			(ins QPR:$Vn, QPR:$Vm), N3RegFrm, IIC_VBINiQ,
	"vbic", "$Vd, $Vn, $Vm", "",			"vbic", "$Vd, $Vn, $Vm", "",
	[(set QPR:$Vd, (v4i32 (and QPR:$Vn,			[(set QPR:$Vd, (v4i32 (and QPR:$Vn,
	(vnotq QPR:$Vm))))]>;			(vnotq QPR:$Vm))))]>;
	}			}

	def VBICiv4i16 : N1ModImm<1, 0b000, {1,0,?,1}, 0, 0, 1, 1,			def VBICiv4i16 : N1ModImm<1, 0b000, {1,0,?,1}, 0, 0, 1, 1,
	(outs DPR:$Vd), (ins nImmSplatI16:$SIMM, DPR:$src),			(outs DPR:$Vd), (ins nImmSplatI16:$SIMM, DPR:$src),
	IIC_VMOVImm,			IIC_VMOVImm,
	"vbic", "i16", "$Vd, $SIMM", "$src = $Vd",			"vbic", "i16", "$Vd, $SIMM", "$src = $Vd",
	[(set DPR:$Vd,			[(set DPR:$Vd,
	(v4i16 (NEONvbicImm DPR:$src, timm:$SIMM)))]> {			(v4i16 (ARMvbicImm DPR:$src, timm:$SIMM)))]> {
	let Inst{9} = SIMM{9};			let Inst{9} = SIMM{9};
	}			}

	def VBICiv2i32 : N1ModImm<1, 0b000, {0,?,?,1}, 0, 0, 1, 1,			def VBICiv2i32 : N1ModImm<1, 0b000, {0,?,?,1}, 0, 0, 1, 1,
	(outs DPR:$Vd), (ins nImmSplatI32:$SIMM, DPR:$src),			(outs DPR:$Vd), (ins nImmSplatI32:$SIMM, DPR:$src),
	IIC_VMOVImm,			IIC_VMOVImm,
	"vbic", "i32", "$Vd, $SIMM", "$src = $Vd",			"vbic", "i32", "$Vd, $SIMM", "$src = $Vd",
	[(set DPR:$Vd,			[(set DPR:$Vd,
	(v2i32 (NEONvbicImm DPR:$src, timm:$SIMM)))]> {			(v2i32 (ARMvbicImm DPR:$src, timm:$SIMM)))]> {
	let Inst{10-9} = SIMM{10-9};			let Inst{10-9} = SIMM{10-9};
	}			}

	def VBICiv8i16 : N1ModImm<1, 0b000, {1,0,?,1}, 0, 1, 1, 1,			def VBICiv8i16 : N1ModImm<1, 0b000, {1,0,?,1}, 0, 1, 1, 1,
	(outs QPR:$Vd), (ins nImmSplatI16:$SIMM, QPR:$src),			(outs QPR:$Vd), (ins nImmSplatI16:$SIMM, QPR:$src),
	IIC_VMOVImm,			IIC_VMOVImm,
	"vbic", "i16", "$Vd, $SIMM", "$src = $Vd",			"vbic", "i16", "$Vd, $SIMM", "$src = $Vd",
	[(set QPR:$Vd,			[(set QPR:$Vd,
	(v8i16 (NEONvbicImm QPR:$src, timm:$SIMM)))]> {			(v8i16 (ARMvbicImm QPR:$src, timm:$SIMM)))]> {
	let Inst{9} = SIMM{9};			let Inst{9} = SIMM{9};
	}			}

	def VBICiv4i32 : N1ModImm<1, 0b000, {0,?,?,1}, 0, 1, 1, 1,			def VBICiv4i32 : N1ModImm<1, 0b000, {0,?,?,1}, 0, 1, 1, 1,
	(outs QPR:$Vd), (ins nImmSplatI32:$SIMM, QPR:$src),			(outs QPR:$Vd), (ins nImmSplatI32:$SIMM, QPR:$src),
	IIC_VMOVImm,			IIC_VMOVImm,
	"vbic", "i32", "$Vd, $SIMM", "$src = $Vd",			"vbic", "i32", "$Vd, $SIMM", "$src = $Vd",
	[(set QPR:$Vd,			[(set QPR:$Vd,
	(v4i32 (NEONvbicImm QPR:$src, timm:$SIMM)))]> {			(v4i32 (ARMvbicImm QPR:$src, timm:$SIMM)))]> {
	let Inst{10-9} = SIMM{10-9};			let Inst{10-9} = SIMM{10-9};
	}			}

	// VORN : Vector Bitwise OR NOT			// VORN : Vector Bitwise OR NOT
	def VORNd : N3VX<0, 0, 0b11, 0b0001, 0, 1, (outs DPR:$Vd),			def VORNd : N3VX<0, 0, 0b11, 0b0001, 0, 1, (outs DPR:$Vd),
	(ins DPR:$Vn, DPR:$Vm), N3RegFrm, IIC_VBINiD,			(ins DPR:$Vn, DPR:$Vm), N3RegFrm, IIC_VBINiD,
	"vorn", "$Vd, $Vn, $Vm", "",			"vorn", "$Vd, $Vn, $Vm", "",
	[(set DPR:$Vd, (v2i32 (or DPR:$Vn,			[(set DPR:$Vd, (v2i32 (or DPR:$Vn,
	▲ Show 20 Lines • Show All 3,532 Lines • Show Last 20 Lines

llvm/test/CodeGen/Thumb2/mve-intrinsics/bitwise-imm.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc -mtriple=thumbv8.1m.main -mattr=+mve -verify-machineinstrs -o - %s \| FileCheck %s

				define arm_aapcs_vfpcc <8 x i16> @test_vbicq_n_u16_sh0(<8 x i16> %a) {
				; CHECK-LABEL: test_vbicq_n_u16_sh0:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vbic.i16 q0, #0x64
				; CHECK-NEXT: bx lr
				entry:
				%0 = and <8 x i16> %a, <i16 -101, i16 -101, i16 -101, i16 -101, i16 -101, i16 -101, i16 -101, i16 -101>
				ret <8 x i16> %0
				}

				define arm_aapcs_vfpcc <8 x i16> @test_vbicq_n_u16_sh8(<8 x i16> %a) {
				; CHECK-LABEL: test_vbicq_n_u16_sh8:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vbic.i16 q0, #0x6400
				; CHECK-NEXT: bx lr
				entry:
				%0 = and <8 x i16> %a, <i16 -25601, i16 -25601, i16 -25601, i16 -25601, i16 -25601, i16 -25601, i16 -25601, i16 -25601>
				ret <8 x i16> %0
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vbicq_n_u32_sh0(<4 x i32> %a) {
				; CHECK-LABEL: test_vbicq_n_u32_sh0:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vbic.i32 q0, #0x64
				; CHECK-NEXT: bx lr
				entry:
				%0 = and <4 x i32> %a, <i32 -101, i32 -101, i32 -101, i32 -101>
				ret <4 x i32> %0
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vbicq_n_u32_sh8(<4 x i32> %a) {
				; CHECK-LABEL: test_vbicq_n_u32_sh8:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vbic.i32 q0, #0x6400
				; CHECK-NEXT: bx lr
				entry:
				%0 = and <4 x i32> %a, <i32 -25601, i32 -25601, i32 -25601, i32 -25601>
				ret <4 x i32> %0
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vbicq_n_u32_sh16(<4 x i32> %a) {
				; CHECK-LABEL: test_vbicq_n_u32_sh16:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vbic.i32 q0, #0x640000
				; CHECK-NEXT: bx lr
				entry:
				%0 = and <4 x i32> %a, <i32 -6553601, i32 -6553601, i32 -6553601, i32 -6553601>
				ret <4 x i32> %0
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vbicq_n_u32_sh24(<4 x i32> %a) {
				; CHECK-LABEL: test_vbicq_n_u32_sh24:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vbic.i32 q0, #0x64000000
				; CHECK-NEXT: bx lr
				entry:
				%0 = and <4 x i32> %a, <i32 -1677721601, i32 -1677721601, i32 -1677721601, i32 -1677721601>
				ret <4 x i32> %0
				}

				; The immediate in this case is legal for a VMVN but not for a VBIC,
				; so in this case we expect to see the constant being prepared in
				; another register.
				define arm_aapcs_vfpcc <4 x i32> @test_vbicq_n_u32_illegal(<4 x i32> %a) {
				; CHECK-LABEL: test_vbicq_n_u32_illegal:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmvn.i32 q1, #0x54ff
				; CHECK-NEXT: vand q0, q0, q1
				; CHECK-NEXT: bx lr
				entry:
				%0 = and <4 x i32> %a, <i32 -21760, i32 -21760, i32 -21760, i32 -21760>
				ret <4 x i32> %0
				}

				define arm_aapcs_vfpcc <8 x i16> @test_vorrq_n_u16_sh0(<8 x i16> %a) {
				; CHECK-LABEL: test_vorrq_n_u16_sh0:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vorr.i16 q0, #0x64
				; CHECK-NEXT: bx lr
				entry:
				%0 = or <8 x i16> %a, <i16 100, i16 100, i16 100, i16 100, i16 100, i16 100, i16 100, i16 100>
				ret <8 x i16> %0
				}

				define arm_aapcs_vfpcc <8 x i16> @test_vorrq_n_u16_sh8(<8 x i16> %a) {
				; CHECK-LABEL: test_vorrq_n_u16_sh8:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vorr.i16 q0, #0x6400
				; CHECK-NEXT: bx lr
				entry:
				%0 = or <8 x i16> %a, <i16 25600, i16 25600, i16 25600, i16 25600, i16 25600, i16 25600, i16 25600, i16 25600>
				ret <8 x i16> %0
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vorrq_n_u32_sh0(<4 x i32> %a) {
				; CHECK-LABEL: test_vorrq_n_u32_sh0:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vorr.i32 q0, #0x64
				; CHECK-NEXT: bx lr
				entry:
				%0 = or <4 x i32> %a, <i32 100, i32 100, i32 100, i32 100>
				ret <4 x i32> %0
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vorrq_n_u32_sh8(<4 x i32> %a) {
				; CHECK-LABEL: test_vorrq_n_u32_sh8:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vorr.i32 q0, #0x6400
				; CHECK-NEXT: bx lr
				entry:
				%0 = or <4 x i32> %a, <i32 25600, i32 25600, i32 25600, i32 25600>
				ret <4 x i32> %0
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vorrq_n_u32_sh16(<4 x i32> %a) {
				; CHECK-LABEL: test_vorrq_n_u32_sh16:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vorr.i32 q0, #0x640000
				; CHECK-NEXT: bx lr
				entry:
				%0 = or <4 x i32> %a, <i32 6553600, i32 6553600, i32 6553600, i32 6553600>
				ret <4 x i32> %0
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vorrq_n_u32_sh24(<4 x i32> %a) {
				; CHECK-LABEL: test_vorrq_n_u32_sh24:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vorr.i32 q0, #0x64000000
				; CHECK-NEXT: bx lr
				entry:
				%0 = or <4 x i32> %a, <i32 1677721600, i32 1677721600, i32 1677721600, i32 1677721600>
				ret <4 x i32> %0
				}

				define arm_aapcs_vfpcc <8 x i16> @test_vbicq_m_n_u16_sh0(<8 x i16> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vbicq_m_n_u16_sh0:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vbict.i16 q0, #0x64
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 %0)
				%2 = and <8 x i16> %a, <i16 -101, i16 -101, i16 -101, i16 -101, i16 -101, i16 -101, i16 -101, i16 -101>
				%3 = select <8 x i1> %1, <8 x i16> %2, <8 x i16> %a
				ret <8 x i16> %3
				}

				define arm_aapcs_vfpcc <8 x i16> @test_vbicq_m_n_u16_sh8(<8 x i16> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vbicq_m_n_u16_sh8:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vbict.i16 q0, #0x6400
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 %0)
				%2 = and <8 x i16> %a, <i16 -25601, i16 -25601, i16 -25601, i16 -25601, i16 -25601, i16 -25601, i16 -25601, i16 -25601>
				%3 = select <8 x i1> %1, <8 x i16> %2, <8 x i16> %a
				ret <8 x i16> %3
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vbicq_m_n_u32_sh0(<4 x i32> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vbicq_m_n_u32_sh0:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vbict.i32 q0, #0x64
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 %0)
				%2 = and <4 x i32> %a, <i32 -101, i32 -101, i32 -101, i32 -101>
				%3 = select <4 x i1> %1, <4 x i32> %2, <4 x i32> %a
				ret <4 x i32> %3
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vbicq_m_n_u32_sh8(<4 x i32> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vbicq_m_n_u32_sh8:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vbict.i32 q0, #0x6400
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 %0)
				%2 = and <4 x i32> %a, <i32 -25601, i32 -25601, i32 -25601, i32 -25601>
				%3 = select <4 x i1> %1, <4 x i32> %2, <4 x i32> %a
				ret <4 x i32> %3
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vbicq_m_n_u32_sh16(<4 x i32> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vbicq_m_n_u32_sh16:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vbict.i32 q0, #0x640000
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 %0)
				%2 = and <4 x i32> %a, <i32 -6553601, i32 -6553601, i32 -6553601, i32 -6553601>
				%3 = select <4 x i1> %1, <4 x i32> %2, <4 x i32> %a
				ret <4 x i32> %3
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vbicq_m_n_u32_sh24(<4 x i32> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vbicq_m_n_u32_sh24:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vbict.i32 q0, #0x64000000
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 %0)
				%2 = and <4 x i32> %a, <i32 -1677721601, i32 -1677721601, i32 -1677721601, i32 -1677721601>
				%3 = select <4 x i1> %1, <4 x i32> %2, <4 x i32> %a
				ret <4 x i32> %3
				}

				define arm_aapcs_vfpcc <8 x i16> @test_vorrq_m_n_u16_sh0(<8 x i16> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vorrq_m_n_u16_sh0:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vorrt.i16 q0, #0x64
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 %0)
				%2 = or <8 x i16> %a, <i16 100, i16 100, i16 100, i16 100, i16 100, i16 100, i16 100, i16 100>
				%3 = select <8 x i1> %1, <8 x i16> %2, <8 x i16> %a
				ret <8 x i16> %3
				}

				define arm_aapcs_vfpcc <8 x i16> @test_vorrq_m_n_u16_sh8(<8 x i16> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vorrq_m_n_u16_sh8:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vorrt.i16 q0, #0x6400
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 %0)
				%2 = or <8 x i16> %a, <i16 25600, i16 25600, i16 25600, i16 25600, i16 25600, i16 25600, i16 25600, i16 25600>
				%3 = select <8 x i1> %1, <8 x i16> %2, <8 x i16> %a
				ret <8 x i16> %3
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vorrq_m_n_u32_sh0(<4 x i32> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vorrq_m_n_u32_sh0:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vorrt.i32 q0, #0x64
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 %0)
				%2 = or <4 x i32> %a, <i32 100, i32 100, i32 100, i32 100>
				%3 = select <4 x i1> %1, <4 x i32> %2, <4 x i32> %a
				ret <4 x i32> %3
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vorrq_m_n_u32_sh8(<4 x i32> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vorrq_m_n_u32_sh8:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vorrt.i32 q0, #0x6400
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 %0)
				%2 = or <4 x i32> %a, <i32 25600, i32 25600, i32 25600, i32 25600>
				%3 = select <4 x i1> %1, <4 x i32> %2, <4 x i32> %a
				ret <4 x i32> %3
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vorrq_m_n_u32_sh16(<4 x i32> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vorrq_m_n_u32_sh16:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vorrt.i32 q0, #0x640000
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 %0)
				%2 = or <4 x i32> %a, <i32 6553600, i32 6553600, i32 6553600, i32 6553600>
				%3 = select <4 x i1> %1, <4 x i32> %2, <4 x i32> %a
				ret <4 x i32> %3
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vorrq_m_n_u32_sh24(<4 x i32> %a, i16 zeroext %p) {
				; CHECK-LABEL: test_vorrq_m_n_u32_sh24:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vorrt.i32 q0, #0x64000000
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 %0)
				%2 = or <4 x i32> %a, <i32 1677721600, i32 1677721600, i32 1677721600, i32 1677721600>
				%3 = select <4 x i1> %1, <4 x i32> %2, <4 x i32> %a
				ret <4 x i32> %3
				}

				define arm_aapcs_vfpcc <8 x i16> @test_vmvnq_n_u16() {
				; CHECK-LABEL: test_vmvnq_n_u16:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmvn.i16 q0, #0xaa00
				; CHECK-NEXT: bx lr
				entry:
				ret <8 x i16> <i16 -43521, i16 -43521, i16 -43521, i16 -43521, i16 -43521, i16 -43521, i16 -43521, i16 -43521>
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vmvnq_n_u32() {
				; CHECK-LABEL: test_vmvnq_n_u32:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmvn.i32 q0, #0xaa00
				; CHECK-NEXT: bx lr
				entry:
				ret <4 x i32> <i32 -43521, i32 -43521, i32 -43521, i32 -43521>
				}

				define arm_aapcs_vfpcc <8 x i16> @test_vmvnq_m_n_u16(<8 x i16> %inactive, i16 zeroext %p) {
				; CHECK-LABEL: test_vmvnq_m_n_u16:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vmvnt.i16 q0, #0xaa00
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32 %0)
				%2 = select <8 x i1> %1, <8 x i16> <i16 -43521, i16 -43521, i16 -43521, i16 -43521, i16 -43521, i16 -43521, i16 -43521, i16 -43521>, <8 x i16> %inactive
				ret <8 x i16> %2
				}

				define arm_aapcs_vfpcc <4 x i32> @test_vmvnq_m_n_u32(<4 x i32> %inactive, i16 zeroext %p) {
				; CHECK-LABEL: test_vmvnq_m_n_u32:
				; CHECK: @ %bb.0: @ %entry
				; CHECK-NEXT: vmsr p0, r0
				; CHECK-NEXT: vpst
				; CHECK-NEXT: vmvnt.i32 q0, #0xaa00
				; CHECK-NEXT: bx lr
				entry:
				%0 = zext i16 %p to i32
				%1 = tail call <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32 %0)
				%2 = select <4 x i1> %1, <4 x i32> <i32 -43521, i32 -43521, i32 -43521, i32 -43521>, <4 x i32> %inactive
				ret <4 x i32> %2
				}

				declare <8 x i1> @llvm.arm.mve.pred.i2v.v8i1(i32)
				declare <4 x i1> @llvm.arm.mve.pred.i2v.v4i1(i32)

This is an archive of the discontinued LLVM Phabricator instance.

[ARM,MVE] Support immediate vbicq,vorrq,vmvnq intrinsics.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 239845

clang/include/clang/Basic/arm_mve.td

clang/include/clang/Basic/arm_mve_defs.td

clang/include/clang/Sema/Sema.h

clang/lib/Sema/SemaChecking.cpp

clang/test/CodeGen/arm-mve-intrinsics/bitwise-imm.c

clang/test/Sema/arm-mve-immediates.c

clang/utils/TableGen/MveEmitter.cpp

llvm/lib/Target/ARM/ARMISelLowering.cpp

llvm/lib/Target/ARM/ARMInstrInfo.td

llvm/lib/Target/ARM/ARMInstrMVE.td

llvm/lib/Target/ARM/ARMInstrNEON.td

llvm/test/CodeGen/Thumb2/mve-intrinsics/bitwise-imm.ll

[ARM,MVE] Support immediate vbicq,vorrq,vmvnq intrinsics.
ClosedPublic