This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] v8.3-a complex number support
ClosedPublic

Authored by samparker on Aug 16 2017, 7:10 AM.

Download Raw Diff

Details

Reviewers

olista01
john.brawn
eastig
dmgreen
SjoerdMeijer

Commits

rG5f9346471c97: [AArch64] v8.3-a complex number support
rL312228: [AArch64] v8.3-a complex number support

Summary

New instructions are added to AArch32 and AArch64 to aid floating-point multiplication and addition of complex numbers, where the complex numbers are packed in a vector register as a pair of elements. The Imaginary part of the number is placed in the more significant element, and the Real part of the number is placed in the less significant element.

Diff Detail

Event Timeline

samparker created this revision.Aug 16 2017, 7:10 AM

Herald added subscribers: kristof.beyls, javed.absar, rengolin, aemerson. · View Herald TranscriptAug 16 2017, 7:10 AM

samparker added reviewers: olista01, john.brawn, eastig, dmgreen.Aug 22 2017, 4:13 AM

SjoerdMeijer added a subscriber: SjoerdMeijer.Aug 25 2017, 7:34 AM

SjoerdMeijer added inline comments.

lib/Target/AArch64/AArch64InstrFormats.td
9411	I don't think we need 2 operand classes. They are essentially the same things, just some constants are different. An example from the ARM backend is: ImmAsmOperand<int Low, int High>, where we pass the range of the imm value. I think we can do something similar here. We can then also avoid some duplication in the print and predicate functions, see also comments below.
lib/Target/AArch64/AsmParser/AArch64AsmParser.cpp
838	We can refactor this and the next function (and create only 1), and use "PredicateMethod" in the AsmOperand class.
1541	We can refactor this function and the next one and create one (template/parametric) function to avoid code duplication (see earlier comment about the operand classes).
lib/Target/AArch64/InstPrinter/AArch64InstPrinter.cpp
1335	Same here.

Hi Sjoerd,

I've refactored some functionality so now there is a single operand class, separate functions are still required for the asm parser to 'addXXXOperands' though.

cheers,
sam

SjoerdMeijer added inline comments.Aug 30 2017, 5:15 AM

lib/Target/AArch64/AArch64InstrFormats.td
9488	Do you think it's worth making a base class for BaseSIMDThreeSameVectorTiedComplex and BaseSIMDThreeSameVectorComplex?
lib/Target/AArch64/AArch64InstrInfo.td
462	Nit: F16 => FP16?

Hi Sjoerd,

I've refactored the SIMDThreeSameVector class that is subclassed by these instructions, eliminating the need to duplicate the encodings. I've left the indexed version of the class as it is, because it is quite a bit different from the base class.

cheers,
sam

Hi Sam, many thanks for refactoring this. Looks really good now! Just 2 nits inlined, but no need for another review.
Cheers, Sjoerd.

lib/Target/AArch64/AArch64InstrFormats.td
9426	Nit: missing space after commas (also in the other predicates below)
lib/Target/AArch64/AArch64InstrInfo.td
462	Nit: F16 => FP16?

This revision is now accepted and ready to land.Aug 30 2017, 7:44 AM

Closed by commit rL312228: [AArch64] v8.3-a complex number support (authored by sam_parker). · Explain WhyAug 31 2017, 2:28 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

lib/

Target/

AArch64/

AArch64InstrFormats.td

352 lines

AArch64InstrInfo.td

9 lines

AsmParser/

AArch64AsmParser.cpp

29 lines

InstPrinter/

AArch64InstPrinter.h

3 lines

AArch64InstPrinter.cpp

9 lines

test/

MC/

AArch64/

armv8.3a-complex.s

148 lines

Disassembler/

AArch64/

armv8.3a-complex.txt

101 lines

Diff 113260

lib/Target/AArch64/AArch64InstrFormats.td

	Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
	let Predicates = [HasNEON] in {			let Predicates = [HasNEON] in {

	//----------------------------------------------------------------------------			//----------------------------------------------------------------------------
	// AdvSIMD three register vector instructions			// AdvSIMD three register vector instructions
	//----------------------------------------------------------------------------			//----------------------------------------------------------------------------

	let mayLoad = 0, mayStore = 0, hasSideEffects = 0 in			let mayLoad = 0, mayStore = 0, hasSideEffects = 0 in
	class BaseSIMDThreeSameVector<bit Q, bit U, bits<3> size, bits<5> opcode,			class BaseSIMDThreeSameVector<bit Q, bit U, bits<3> size, bits<5> opcode,
	RegisterOperand regtype, string asm, string kind,			RegisterOperand regtype, dag oops, dag iops,
	list<dag> pattern>			string asm, string kind, list<dag> pattern>
	: I<(outs regtype:$Rd), (ins regtype:$Rn, regtype:$Rm), asm,			: I<oops, iops, asm, "{\t$Rd" # kind # ", $Rn" # kind # ", $Rm" # kind #
	"{\t$Rd" # kind # ", $Rn" # kind # ", $Rm" # kind #			"\|" # kind # "\t$Rd, $Rn, $Rm\|}", "", pattern>, Sched<[WriteV]> {
	"\|" # kind # "\t$Rd, $Rn, $Rm\|}", "", pattern>,
	Sched<[WriteV]> {
	bits<5> Rd;			bits<5> Rd;
	bits<5> Rn;			bits<5> Rn;
	bits<5> Rm;			bits<5> Rm;
	let Inst{31} = 0;			let Inst{31} = 0;
	let Inst{30} = Q;			let Inst{30} = Q;
	let Inst{29} = U;			let Inst{29} = U;
	let Inst{28-24} = 0b01110;			let Inst{28-24} = 0b01110;
	let Inst{23-21} = size;			let Inst{23-21} = size;
	let Inst{20-16} = Rm;			let Inst{20-16} = Rm;
	let Inst{15-11} = opcode;			let Inst{15-11} = opcode;
	let Inst{10} = 1;			let Inst{10} = 1;
	let Inst{9-5} = Rn;			let Inst{9-5} = Rn;
	let Inst{4-0} = Rd;			let Inst{4-0} = Rd;
	}			}

	let mayLoad = 0, mayStore = 0, hasSideEffects = 0 in			class SIMDThreeSameVector<bit Q, bit U, bits<3> size, bits<5> opcode,
	class BaseSIMDThreeSameVectorTied<bit Q, bit U, bits<3> size, bits<5> opcode,
	RegisterOperand regtype, string asm, string kind,			RegisterOperand regtype, string asm, string kind,
	list<dag> pattern>			list<dag> pattern>
	: I<(outs regtype:$dst), (ins regtype:$Rd, regtype:$Rn, regtype:$Rm), asm,			: BaseSIMDThreeSameVector<Q, U, size, opcode, regtype,
	"{\t$Rd" # kind # ", $Rn" # kind # ", $Rm" # kind #			(outs regtype:$Rd), (ins regtype:$Rn, regtype:$Rm),
	"\|" # kind # "\t$Rd, $Rn, $Rm}", "$Rd = $dst", pattern>,			asm, kind, pattern>;
	Sched<[WriteV]> {
	bits<5> Rd;			class SIMDThreeSameVectorTied<bit Q, bit U, bits<3> size, bits<5> opcode,
	bits<5> Rn;			RegisterOperand regtype, string asm, string kind,
	bits<5> Rm;			list<dag> pattern>
	let Inst{31} = 0;			: BaseSIMDThreeSameVector<Q, U, size, opcode, regtype,
	let Inst{30} = Q;			(outs regtype:$dst),
	let Inst{29} = U;			(ins regtype:$Rd, regtype:$Rn, regtype:$Rm), asm,
	let Inst{28-24} = 0b01110;			kind, pattern> {
	let Inst{23-21} = size;			let Constraints = "$Rd = $dst";
	let Inst{20-16} = Rm;			}
	let Inst{15-11} = opcode;
	let Inst{10} = 1;
	let Inst{9-5} = Rn;
	let Inst{4-0} = Rd;
	}

	class BaseSIMDThreeSameVectorDot<bit Q, bit U, string asm, string kind1,			class BaseSIMDThreeSameVectorDot<bit Q, bit U, string asm, string kind1,
	string kind2> :			string kind2> :
	BaseSIMDThreeSameVector<Q, U, 0b100, 0b10010, V128, asm, kind1, [] > {			SIMDThreeSameVector<Q, U, 0b100, 0b10010, V128, asm, kind1, [] > {
	let AsmString = !strconcat(asm, "{\t$Rd" # kind1 # ", $Rn" # kind2 # ", $Rm" # kind2 # "}");			let AsmString = !strconcat(asm, "{\t$Rd" # kind1 # ", $Rn" # kind2 # ", $Rm" # kind2 # "}");
	}			}

	// All operand sizes distinguished in the encoding.			// All operand sizes distinguished in the encoding.
	multiclass SIMDThreeSameVector<bit U, bits<5> opc, string asm,			multiclass SIMDThreeSameVector<bit U, bits<5> opc, string asm,
	SDPatternOperator OpNode> {			SDPatternOperator OpNode> {
	def v8i8 : BaseSIMDThreeSameVector<0, U, 0b001, opc, V64,			def v8i8 : SIMDThreeSameVector<0, U, 0b001, opc, V64,
	asm, ".8b",			asm, ".8b",
	[(set (v8i8 V64:$Rd), (OpNode (v8i8 V64:$Rn), (v8i8 V64:$Rm)))]>;			[(set (v8i8 V64:$Rd), (OpNode (v8i8 V64:$Rn), (v8i8 V64:$Rm)))]>;
	def v16i8 : BaseSIMDThreeSameVector<1, U, 0b001, opc, V128,			def v16i8 : SIMDThreeSameVector<1, U, 0b001, opc, V128,
	asm, ".16b",			asm, ".16b",
	[(set (v16i8 V128:$Rd), (OpNode (v16i8 V128:$Rn), (v16i8 V128:$Rm)))]>;			[(set (v16i8 V128:$Rd), (OpNode (v16i8 V128:$Rn), (v16i8 V128:$Rm)))]>;
	def v4i16 : BaseSIMDThreeSameVector<0, U, 0b011, opc, V64,			def v4i16 : SIMDThreeSameVector<0, U, 0b011, opc, V64,
	asm, ".4h",			asm, ".4h",
	[(set (v4i16 V64:$Rd), (OpNode (v4i16 V64:$Rn), (v4i16 V64:$Rm)))]>;			[(set (v4i16 V64:$Rd), (OpNode (v4i16 V64:$Rn), (v4i16 V64:$Rm)))]>;
	def v8i16 : BaseSIMDThreeSameVector<1, U, 0b011, opc, V128,			def v8i16 : SIMDThreeSameVector<1, U, 0b011, opc, V128,
	asm, ".8h",			asm, ".8h",
	[(set (v8i16 V128:$Rd), (OpNode (v8i16 V128:$Rn), (v8i16 V128:$Rm)))]>;			[(set (v8i16 V128:$Rd), (OpNode (v8i16 V128:$Rn), (v8i16 V128:$Rm)))]>;
	def v2i32 : BaseSIMDThreeSameVector<0, U, 0b101, opc, V64,			def v2i32 : SIMDThreeSameVector<0, U, 0b101, opc, V64,
	asm, ".2s",			asm, ".2s",
	[(set (v2i32 V64:$Rd), (OpNode (v2i32 V64:$Rn), (v2i32 V64:$Rm)))]>;			[(set (v2i32 V64:$Rd), (OpNode (v2i32 V64:$Rn), (v2i32 V64:$Rm)))]>;
	def v4i32 : BaseSIMDThreeSameVector<1, U, 0b101, opc, V128,			def v4i32 : SIMDThreeSameVector<1, U, 0b101, opc, V128,
	asm, ".4s",			asm, ".4s",
	[(set (v4i32 V128:$Rd), (OpNode (v4i32 V128:$Rn), (v4i32 V128:$Rm)))]>;			[(set (v4i32 V128:$Rd), (OpNode (v4i32 V128:$Rn), (v4i32 V128:$Rm)))]>;
	def v2i64 : BaseSIMDThreeSameVector<1, U, 0b111, opc, V128,			def v2i64 : SIMDThreeSameVector<1, U, 0b111, opc, V128,
	asm, ".2d",			asm, ".2d",
	[(set (v2i64 V128:$Rd), (OpNode (v2i64 V128:$Rn), (v2i64 V128:$Rm)))]>;			[(set (v2i64 V128:$Rd), (OpNode (v2i64 V128:$Rn), (v2i64 V128:$Rm)))]>;
	}			}

	// As above, but D sized elements unsupported.			// As above, but D sized elements unsupported.
	multiclass SIMDThreeSameVectorBHS<bit U, bits<5> opc, string asm,			multiclass SIMDThreeSameVectorBHS<bit U, bits<5> opc, string asm,
	SDPatternOperator OpNode> {			SDPatternOperator OpNode> {
	def v8i8 : BaseSIMDThreeSameVector<0, U, 0b001, opc, V64,			def v8i8 : SIMDThreeSameVector<0, U, 0b001, opc, V64,
	asm, ".8b",			asm, ".8b",
	[(set V64:$Rd, (v8i8 (OpNode (v8i8 V64:$Rn), (v8i8 V64:$Rm))))]>;			[(set V64:$Rd, (v8i8 (OpNode (v8i8 V64:$Rn), (v8i8 V64:$Rm))))]>;
	def v16i8 : BaseSIMDThreeSameVector<1, U, 0b001, opc, V128,			def v16i8 : SIMDThreeSameVector<1, U, 0b001, opc, V128,
	asm, ".16b",			asm, ".16b",
	[(set V128:$Rd, (v16i8 (OpNode (v16i8 V128:$Rn), (v16i8 V128:$Rm))))]>;			[(set V128:$Rd, (v16i8 (OpNode (v16i8 V128:$Rn), (v16i8 V128:$Rm))))]>;
	def v4i16 : BaseSIMDThreeSameVector<0, U, 0b011, opc, V64,			def v4i16 : SIMDThreeSameVector<0, U, 0b011, opc, V64,
	asm, ".4h",			asm, ".4h",
	[(set V64:$Rd, (v4i16 (OpNode (v4i16 V64:$Rn), (v4i16 V64:$Rm))))]>;			[(set V64:$Rd, (v4i16 (OpNode (v4i16 V64:$Rn), (v4i16 V64:$Rm))))]>;
	def v8i16 : BaseSIMDThreeSameVector<1, U, 0b011, opc, V128,			def v8i16 : SIMDThreeSameVector<1, U, 0b011, opc, V128,
	asm, ".8h",			asm, ".8h",
	[(set V128:$Rd, (v8i16 (OpNode (v8i16 V128:$Rn), (v8i16 V128:$Rm))))]>;			[(set V128:$Rd, (v8i16 (OpNode (v8i16 V128:$Rn), (v8i16 V128:$Rm))))]>;
	def v2i32 : BaseSIMDThreeSameVector<0, U, 0b101, opc, V64,			def v2i32 : SIMDThreeSameVector<0, U, 0b101, opc, V64,
	asm, ".2s",			asm, ".2s",
	[(set V64:$Rd, (v2i32 (OpNode (v2i32 V64:$Rn), (v2i32 V64:$Rm))))]>;			[(set V64:$Rd, (v2i32 (OpNode (v2i32 V64:$Rn), (v2i32 V64:$Rm))))]>;
	def v4i32 : BaseSIMDThreeSameVector<1, U, 0b101, opc, V128,			def v4i32 : SIMDThreeSameVector<1, U, 0b101, opc, V128,
	asm, ".4s",			asm, ".4s",
	[(set V128:$Rd, (v4i32 (OpNode (v4i32 V128:$Rn), (v4i32 V128:$Rm))))]>;			[(set V128:$Rd, (v4i32 (OpNode (v4i32 V128:$Rn), (v4i32 V128:$Rm))))]>;
	}			}

	multiclass SIMDThreeSameVectorBHSTied<bit U, bits<5> opc, string asm,			multiclass SIMDThreeSameVectorBHSTied<bit U, bits<5> opc, string asm,
	SDPatternOperator OpNode> {			SDPatternOperator OpNode> {
	def v8i8 : BaseSIMDThreeSameVectorTied<0, U, 0b001, opc, V64,			def v8i8 : SIMDThreeSameVectorTied<0, U, 0b001, opc, V64,
	asm, ".8b",			asm, ".8b",
	[(set (v8i8 V64:$dst),			[(set (v8i8 V64:$dst),
	(OpNode (v8i8 V64:$Rd), (v8i8 V64:$Rn), (v8i8 V64:$Rm)))]>;			(OpNode (v8i8 V64:$Rd), (v8i8 V64:$Rn), (v8i8 V64:$Rm)))]>;
	def v16i8 : BaseSIMDThreeSameVectorTied<1, U, 0b001, opc, V128,			def v16i8 : SIMDThreeSameVectorTied<1, U, 0b001, opc, V128,
	asm, ".16b",			asm, ".16b",
	[(set (v16i8 V128:$dst),			[(set (v16i8 V128:$dst),
	(OpNode (v16i8 V128:$Rd), (v16i8 V128:$Rn), (v16i8 V128:$Rm)))]>;			(OpNode (v16i8 V128:$Rd), (v16i8 V128:$Rn), (v16i8 V128:$Rm)))]>;
	def v4i16 : BaseSIMDThreeSameVectorTied<0, U, 0b011, opc, V64,			def v4i16 : SIMDThreeSameVectorTied<0, U, 0b011, opc, V64,
	asm, ".4h",			asm, ".4h",
	[(set (v4i16 V64:$dst),			[(set (v4i16 V64:$dst),
	(OpNode (v4i16 V64:$Rd), (v4i16 V64:$Rn), (v4i16 V64:$Rm)))]>;			(OpNode (v4i16 V64:$Rd), (v4i16 V64:$Rn), (v4i16 V64:$Rm)))]>;
	def v8i16 : BaseSIMDThreeSameVectorTied<1, U, 0b011, opc, V128,			def v8i16 : SIMDThreeSameVectorTied<1, U, 0b011, opc, V128,
	asm, ".8h",			asm, ".8h",
	[(set (v8i16 V128:$dst),			[(set (v8i16 V128:$dst),
	(OpNode (v8i16 V128:$Rd), (v8i16 V128:$Rn), (v8i16 V128:$Rm)))]>;			(OpNode (v8i16 V128:$Rd), (v8i16 V128:$Rn), (v8i16 V128:$Rm)))]>;
	def v2i32 : BaseSIMDThreeSameVectorTied<0, U, 0b101, opc, V64,			def v2i32 : SIMDThreeSameVectorTied<0, U, 0b101, opc, V64,
	asm, ".2s",			asm, ".2s",
	[(set (v2i32 V64:$dst),			[(set (v2i32 V64:$dst),
	(OpNode (v2i32 V64:$Rd), (v2i32 V64:$Rn), (v2i32 V64:$Rm)))]>;			(OpNode (v2i32 V64:$Rd), (v2i32 V64:$Rn), (v2i32 V64:$Rm)))]>;
	def v4i32 : BaseSIMDThreeSameVectorTied<1, U, 0b101, opc, V128,			def v4i32 : SIMDThreeSameVectorTied<1, U, 0b101, opc, V128,
	asm, ".4s",			asm, ".4s",
	[(set (v4i32 V128:$dst),			[(set (v4i32 V128:$dst),
	(OpNode (v4i32 V128:$Rd), (v4i32 V128:$Rn), (v4i32 V128:$Rm)))]>;			(OpNode (v4i32 V128:$Rd), (v4i32 V128:$Rn), (v4i32 V128:$Rm)))]>;
	}			}

	// As above, but only B sized elements supported.			// As above, but only B sized elements supported.
	multiclass SIMDThreeSameVectorB<bit U, bits<5> opc, string asm,			multiclass SIMDThreeSameVectorB<bit U, bits<5> opc, string asm,
	SDPatternOperator OpNode> {			SDPatternOperator OpNode> {
	def v8i8 : BaseSIMDThreeSameVector<0, U, 0b001, opc, V64,			def v8i8 : SIMDThreeSameVector<0, U, 0b001, opc, V64,
	asm, ".8b",			asm, ".8b",
	[(set (v8i8 V64:$Rd), (OpNode (v8i8 V64:$Rn), (v8i8 V64:$Rm)))]>;			[(set (v8i8 V64:$Rd), (OpNode (v8i8 V64:$Rn), (v8i8 V64:$Rm)))]>;
	def v16i8 : BaseSIMDThreeSameVector<1, U, 0b001, opc, V128,			def v16i8 : SIMDThreeSameVector<1, U, 0b001, opc, V128,
	asm, ".16b",			asm, ".16b",
	[(set (v16i8 V128:$Rd),			[(set (v16i8 V128:$Rd),
	(OpNode (v16i8 V128:$Rn), (v16i8 V128:$Rm)))]>;			(OpNode (v16i8 V128:$Rn), (v16i8 V128:$Rm)))]>;
	}			}

	// As above, but only floating point elements supported.			// As above, but only floating point elements supported.
	multiclass SIMDThreeSameVectorFP<bit U, bit S, bits<3> opc,			multiclass SIMDThreeSameVectorFP<bit U, bit S, bits<3> opc,
	string asm, SDPatternOperator OpNode> {			string asm, SDPatternOperator OpNode> {
	let Predicates = [HasNEON, HasFullFP16] in {			let Predicates = [HasNEON, HasFullFP16] in {
	def v4f16 : BaseSIMDThreeSameVector<0, U, {S,0b10}, {0b00,opc}, V64,			def v4f16 : SIMDThreeSameVector<0, U, {S,0b10}, {0b00,opc}, V64,
	asm, ".4h",			asm, ".4h",
	[(set (v4f16 V64:$Rd), (OpNode (v4f16 V64:$Rn), (v4f16 V64:$Rm)))]>;			[(set (v4f16 V64:$Rd), (OpNode (v4f16 V64:$Rn), (v4f16 V64:$Rm)))]>;
	def v8f16 : BaseSIMDThreeSameVector<1, U, {S,0b10}, {0b00,opc}, V128,			def v8f16 : SIMDThreeSameVector<1, U, {S,0b10}, {0b00,opc}, V128,
	asm, ".8h",			asm, ".8h",
	[(set (v8f16 V128:$Rd), (OpNode (v8f16 V128:$Rn), (v8f16 V128:$Rm)))]>;			[(set (v8f16 V128:$Rd), (OpNode (v8f16 V128:$Rn), (v8f16 V128:$Rm)))]>;
	} // Predicates = [HasNEON, HasFullFP16]			} // Predicates = [HasNEON, HasFullFP16]
	def v2f32 : BaseSIMDThreeSameVector<0, U, {S,0b01}, {0b11,opc}, V64,			def v2f32 : SIMDThreeSameVector<0, U, {S,0b01}, {0b11,opc}, V64,
	asm, ".2s",			asm, ".2s",
	[(set (v2f32 V64:$Rd), (OpNode (v2f32 V64:$Rn), (v2f32 V64:$Rm)))]>;			[(set (v2f32 V64:$Rd), (OpNode (v2f32 V64:$Rn), (v2f32 V64:$Rm)))]>;
	def v4f32 : BaseSIMDThreeSameVector<1, U, {S,0b01}, {0b11,opc}, V128,			def v4f32 : SIMDThreeSameVector<1, U, {S,0b01}, {0b11,opc}, V128,
	asm, ".4s",			asm, ".4s",
	[(set (v4f32 V128:$Rd), (OpNode (v4f32 V128:$Rn), (v4f32 V128:$Rm)))]>;			[(set (v4f32 V128:$Rd), (OpNode (v4f32 V128:$Rn), (v4f32 V128:$Rm)))]>;
	def v2f64 : BaseSIMDThreeSameVector<1, U, {S,0b11}, {0b11,opc}, V128,			def v2f64 : SIMDThreeSameVector<1, U, {S,0b11}, {0b11,opc}, V128,
	asm, ".2d",			asm, ".2d",
	[(set (v2f64 V128:$Rd), (OpNode (v2f64 V128:$Rn), (v2f64 V128:$Rm)))]>;			[(set (v2f64 V128:$Rd), (OpNode (v2f64 V128:$Rn), (v2f64 V128:$Rm)))]>;
	}			}

	multiclass SIMDThreeSameVectorFPCmp<bit U, bit S, bits<3> opc,			multiclass SIMDThreeSameVectorFPCmp<bit U, bit S, bits<3> opc,
	string asm,			string asm,
	SDPatternOperator OpNode> {			SDPatternOperator OpNode> {
	let Predicates = [HasNEON, HasFullFP16] in {			let Predicates = [HasNEON, HasFullFP16] in {
	def v4f16 : BaseSIMDThreeSameVector<0, U, {S,0b10}, {0b00,opc}, V64,			def v4f16 : SIMDThreeSameVector<0, U, {S,0b10}, {0b00,opc}, V64,
	asm, ".4h",			asm, ".4h",
	[(set (v4i16 V64:$Rd), (OpNode (v4f16 V64:$Rn), (v4f16 V64:$Rm)))]>;			[(set (v4i16 V64:$Rd), (OpNode (v4f16 V64:$Rn), (v4f16 V64:$Rm)))]>;
	def v8f16 : BaseSIMDThreeSameVector<1, U, {S,0b10}, {0b00,opc}, V128,			def v8f16 : SIMDThreeSameVector<1, U, {S,0b10}, {0b00,opc}, V128,
	asm, ".8h",			asm, ".8h",
	[(set (v8i16 V128:$Rd), (OpNode (v8f16 V128:$Rn), (v8f16 V128:$Rm)))]>;			[(set (v8i16 V128:$Rd), (OpNode (v8f16 V128:$Rn), (v8f16 V128:$Rm)))]>;
	} // Predicates = [HasNEON, HasFullFP16]			} // Predicates = [HasNEON, HasFullFP16]
	def v2f32 : BaseSIMDThreeSameVector<0, U, {S,0b01}, {0b11,opc}, V64,			def v2f32 : SIMDThreeSameVector<0, U, {S,0b01}, {0b11,opc}, V64,
	asm, ".2s",			asm, ".2s",
	[(set (v2i32 V64:$Rd), (OpNode (v2f32 V64:$Rn), (v2f32 V64:$Rm)))]>;			[(set (v2i32 V64:$Rd), (OpNode (v2f32 V64:$Rn), (v2f32 V64:$Rm)))]>;
	def v4f32 : BaseSIMDThreeSameVector<1, U, {S,0b01}, {0b11,opc}, V128,			def v4f32 : SIMDThreeSameVector<1, U, {S,0b01}, {0b11,opc}, V128,
	asm, ".4s",			asm, ".4s",
	[(set (v4i32 V128:$Rd), (OpNode (v4f32 V128:$Rn), (v4f32 V128:$Rm)))]>;			[(set (v4i32 V128:$Rd), (OpNode (v4f32 V128:$Rn), (v4f32 V128:$Rm)))]>;
	def v2f64 : BaseSIMDThreeSameVector<1, U, {S,0b11}, {0b11,opc}, V128,			def v2f64 : SIMDThreeSameVector<1, U, {S,0b11}, {0b11,opc}, V128,
	asm, ".2d",			asm, ".2d",
	[(set (v2i64 V128:$Rd), (OpNode (v2f64 V128:$Rn), (v2f64 V128:$Rm)))]>;			[(set (v2i64 V128:$Rd), (OpNode (v2f64 V128:$Rn), (v2f64 V128:$Rm)))]>;
	}			}

	multiclass SIMDThreeSameVectorFPTied<bit U, bit S, bits<3> opc,			multiclass SIMDThreeSameVectorFPTied<bit U, bit S, bits<3> opc,
	string asm, SDPatternOperator OpNode> {			string asm, SDPatternOperator OpNode> {
	let Predicates = [HasNEON, HasFullFP16] in {			let Predicates = [HasNEON, HasFullFP16] in {
	def v4f16 : BaseSIMDThreeSameVectorTied<0, U, {S,0b10}, {0b00,opc}, V64,			def v4f16 : SIMDThreeSameVectorTied<0, U, {S,0b10}, {0b00,opc}, V64,
	asm, ".4h",			asm, ".4h",
	[(set (v4f16 V64:$dst),			[(set (v4f16 V64:$dst),
	(OpNode (v4f16 V64:$Rd), (v4f16 V64:$Rn), (v4f16 V64:$Rm)))]>;			(OpNode (v4f16 V64:$Rd), (v4f16 V64:$Rn), (v4f16 V64:$Rm)))]>;
	def v8f16 : BaseSIMDThreeSameVectorTied<1, U, {S,0b10}, {0b00,opc}, V128,			def v8f16 : SIMDThreeSameVectorTied<1, U, {S,0b10}, {0b00,opc}, V128,
	asm, ".8h",			asm, ".8h",
	[(set (v8f16 V128:$dst),			[(set (v8f16 V128:$dst),
	(OpNode (v8f16 V128:$Rd), (v8f16 V128:$Rn), (v8f16 V128:$Rm)))]>;			(OpNode (v8f16 V128:$Rd), (v8f16 V128:$Rn), (v8f16 V128:$Rm)))]>;
	} // Predicates = [HasNEON, HasFullFP16]			} // Predicates = [HasNEON, HasFullFP16]
	def v2f32 : BaseSIMDThreeSameVectorTied<0, U, {S,0b01}, {0b11,opc}, V64,			def v2f32 : SIMDThreeSameVectorTied<0, U, {S,0b01}, {0b11,opc}, V64,
	asm, ".2s",			asm, ".2s",
	[(set (v2f32 V64:$dst),			[(set (v2f32 V64:$dst),
	(OpNode (v2f32 V64:$Rd), (v2f32 V64:$Rn), (v2f32 V64:$Rm)))]>;			(OpNode (v2f32 V64:$Rd), (v2f32 V64:$Rn), (v2f32 V64:$Rm)))]>;
	def v4f32 : BaseSIMDThreeSameVectorTied<1, U, {S,0b01}, {0b11,opc}, V128,			def v4f32 : SIMDThreeSameVectorTied<1, U, {S,0b01}, {0b11,opc}, V128,
	asm, ".4s",			asm, ".4s",
	[(set (v4f32 V128:$dst),			[(set (v4f32 V128:$dst),
	(OpNode (v4f32 V128:$Rd), (v4f32 V128:$Rn), (v4f32 V128:$Rm)))]>;			(OpNode (v4f32 V128:$Rd), (v4f32 V128:$Rn), (v4f32 V128:$Rm)))]>;
	def v2f64 : BaseSIMDThreeSameVectorTied<1, U, {S,0b11}, {0b11,opc}, V128,			def v2f64 : SIMDThreeSameVectorTied<1, U, {S,0b11}, {0b11,opc}, V128,
	asm, ".2d",			asm, ".2d",
	[(set (v2f64 V128:$dst),			[(set (v2f64 V128:$dst),
	(OpNode (v2f64 V128:$Rd), (v2f64 V128:$Rn), (v2f64 V128:$Rm)))]>;			(OpNode (v2f64 V128:$Rd), (v2f64 V128:$Rn), (v2f64 V128:$Rm)))]>;
	}			}

	// As above, but D and B sized elements unsupported.			// As above, but D and B sized elements unsupported.
	multiclass SIMDThreeSameVectorHS<bit U, bits<5> opc, string asm,			multiclass SIMDThreeSameVectorHS<bit U, bits<5> opc, string asm,
	SDPatternOperator OpNode> {			SDPatternOperator OpNode> {
	def v4i16 : BaseSIMDThreeSameVector<0, U, 0b011, opc, V64,			def v4i16 : SIMDThreeSameVector<0, U, 0b011, opc, V64,
	asm, ".4h",			asm, ".4h",
	[(set (v4i16 V64:$Rd), (OpNode (v4i16 V64:$Rn), (v4i16 V64:$Rm)))]>;			[(set (v4i16 V64:$Rd), (OpNode (v4i16 V64:$Rn), (v4i16 V64:$Rm)))]>;
	def v8i16 : BaseSIMDThreeSameVector<1, U, 0b011, opc, V128,			def v8i16 : SIMDThreeSameVector<1, U, 0b011, opc, V128,
	asm, ".8h",			asm, ".8h",
	[(set (v8i16 V128:$Rd), (OpNode (v8i16 V128:$Rn), (v8i16 V128:$Rm)))]>;			[(set (v8i16 V128:$Rd), (OpNode (v8i16 V128:$Rn), (v8i16 V128:$Rm)))]>;
	def v2i32 : BaseSIMDThreeSameVector<0, U, 0b101, opc, V64,			def v2i32 : SIMDThreeSameVector<0, U, 0b101, opc, V64,
	asm, ".2s",			asm, ".2s",
	[(set (v2i32 V64:$Rd), (OpNode (v2i32 V64:$Rn), (v2i32 V64:$Rm)))]>;			[(set (v2i32 V64:$Rd), (OpNode (v2i32 V64:$Rn), (v2i32 V64:$Rm)))]>;
	def v4i32 : BaseSIMDThreeSameVector<1, U, 0b101, opc, V128,			def v4i32 : SIMDThreeSameVector<1, U, 0b101, opc, V128,
	asm, ".4s",			asm, ".4s",
	[(set (v4i32 V128:$Rd), (OpNode (v4i32 V128:$Rn), (v4i32 V128:$Rm)))]>;			[(set (v4i32 V128:$Rd), (OpNode (v4i32 V128:$Rn), (v4i32 V128:$Rm)))]>;
	}			}

	// Logical three vector ops share opcode bits, and only use B sized elements.			// Logical three vector ops share opcode bits, and only use B sized elements.
	multiclass SIMDLogicalThreeVector<bit U, bits<2> size, string asm,			multiclass SIMDLogicalThreeVector<bit U, bits<2> size, string asm,
	SDPatternOperator OpNode = null_frag> {			SDPatternOperator OpNode = null_frag> {
	def v8i8 : BaseSIMDThreeSameVector<0, U, {size,1}, 0b00011, V64,			def v8i8 : SIMDThreeSameVector<0, U, {size,1}, 0b00011, V64,
	asm, ".8b",			asm, ".8b",
	[(set (v8i8 V64:$Rd), (OpNode V64:$Rn, V64:$Rm))]>;			[(set (v8i8 V64:$Rd), (OpNode V64:$Rn, V64:$Rm))]>;
	def v16i8 : BaseSIMDThreeSameVector<1, U, {size,1}, 0b00011, V128,			def v16i8 : SIMDThreeSameVector<1, U, {size,1}, 0b00011, V128,
	asm, ".16b",			asm, ".16b",
	[(set (v16i8 V128:$Rd), (OpNode V128:$Rn, V128:$Rm))]>;			[(set (v16i8 V128:$Rd), (OpNode V128:$Rn, V128:$Rm))]>;

	def : Pat<(v4i16 (OpNode V64:$LHS, V64:$RHS)),			def : Pat<(v4i16 (OpNode V64:$LHS, V64:$RHS)),
	(!cast<Instruction>(NAME#"v8i8") V64:$LHS, V64:$RHS)>;			(!cast<Instruction>(NAME#"v8i8") V64:$LHS, V64:$RHS)>;
	def : Pat<(v2i32 (OpNode V64:$LHS, V64:$RHS)),			def : Pat<(v2i32 (OpNode V64:$LHS, V64:$RHS)),
	(!cast<Instruction>(NAME#"v8i8") V64:$LHS, V64:$RHS)>;			(!cast<Instruction>(NAME#"v8i8") V64:$LHS, V64:$RHS)>;
	def : Pat<(v1i64 (OpNode V64:$LHS, V64:$RHS)),			def : Pat<(v1i64 (OpNode V64:$LHS, V64:$RHS)),
	(!cast<Instruction>(NAME#"v8i8") V64:$LHS, V64:$RHS)>;			(!cast<Instruction>(NAME#"v8i8") V64:$LHS, V64:$RHS)>;

	def : Pat<(v8i16 (OpNode V128:$LHS, V128:$RHS)),			def : Pat<(v8i16 (OpNode V128:$LHS, V128:$RHS)),
	(!cast<Instruction>(NAME#"v16i8") V128:$LHS, V128:$RHS)>;			(!cast<Instruction>(NAME#"v16i8") V128:$LHS, V128:$RHS)>;
	def : Pat<(v4i32 (OpNode V128:$LHS, V128:$RHS)),			def : Pat<(v4i32 (OpNode V128:$LHS, V128:$RHS)),
	(!cast<Instruction>(NAME#"v16i8") V128:$LHS, V128:$RHS)>;			(!cast<Instruction>(NAME#"v16i8") V128:$LHS, V128:$RHS)>;
	def : Pat<(v2i64 (OpNode V128:$LHS, V128:$RHS)),			def : Pat<(v2i64 (OpNode V128:$LHS, V128:$RHS)),
	(!cast<Instruction>(NAME#"v16i8") V128:$LHS, V128:$RHS)>;			(!cast<Instruction>(NAME#"v16i8") V128:$LHS, V128:$RHS)>;
	}			}

	multiclass SIMDLogicalThreeVectorTied<bit U, bits<2> size,			multiclass SIMDLogicalThreeVectorTied<bit U, bits<2> size,
	string asm, SDPatternOperator OpNode> {			string asm, SDPatternOperator OpNode> {
	def v8i8 : BaseSIMDThreeSameVectorTied<0, U, {size,1}, 0b00011, V64,			def v8i8 : SIMDThreeSameVectorTied<0, U, {size,1}, 0b00011, V64,
	asm, ".8b",			asm, ".8b",
	[(set (v8i8 V64:$dst),			[(set (v8i8 V64:$dst),
	(OpNode (v8i8 V64:$Rd), (v8i8 V64:$Rn), (v8i8 V64:$Rm)))]>;			(OpNode (v8i8 V64:$Rd), (v8i8 V64:$Rn), (v8i8 V64:$Rm)))]>;
	def v16i8 : BaseSIMDThreeSameVectorTied<1, U, {size,1}, 0b00011, V128,			def v16i8 : SIMDThreeSameVectorTied<1, U, {size,1}, 0b00011, V128,
	asm, ".16b",			asm, ".16b",
	[(set (v16i8 V128:$dst),			[(set (v16i8 V128:$dst),
	(OpNode (v16i8 V128:$Rd), (v16i8 V128:$Rn),			(OpNode (v16i8 V128:$Rd), (v16i8 V128:$Rn),
	(v16i8 V128:$Rm)))]>;			(v16i8 V128:$Rm)))]>;

	def : Pat<(v4i16 (OpNode (v4i16 V64:$LHS), (v4i16 V64:$MHS),			def : Pat<(v4i16 (OpNode (v4i16 V64:$LHS), (v4i16 V64:$MHS),
	(v4i16 V64:$RHS))),			(v4i16 V64:$RHS))),
	(!cast<Instruction>(NAME#"v8i8")			(!cast<Instruction>(NAME#"v8i8")
	▲ Show 20 Lines • Show All 184 Lines • ▼ Show 20 Lines
	} // end of 'let Predicates = [HasNEON]'			} // end of 'let Predicates = [HasNEON]'

	//----------------------------------------------------------------------------			//----------------------------------------------------------------------------
	// AdvSIMD v8.1 Rounding Double Multiply Add/Subtract			// AdvSIMD v8.1 Rounding Double Multiply Add/Subtract
	//----------------------------------------------------------------------------			//----------------------------------------------------------------------------

	let Predicates = [HasNEON, HasRDM] in {			let Predicates = [HasNEON, HasRDM] in {

	class BaseSIMDThreeSameVectorTiedR0<bit Q, bit U, bits<2> size, bits<5> opcode,			class SIMDThreeSameVectorTiedR0<bit Q, bit U, bits<2> size, bits<5> opcode,
	RegisterOperand regtype, string asm,			RegisterOperand regtype, string asm,
	string kind, list<dag> pattern>			string kind, list<dag> pattern>
	: BaseSIMDThreeSameVectorTied<Q, U, {size,0}, opcode, regtype, asm, kind,			: SIMDThreeSameVectorTied<Q, U, {size,0}, opcode, regtype, asm, kind,
	pattern> {			pattern> {
	}			}
	multiclass SIMDThreeSameVectorSQRDMLxHTiedHS<bit U, bits<5> opc, string asm,			multiclass SIMDThreeSameVectorSQRDMLxHTiedHS<bit U, bits<5> opc, string asm,
	SDPatternOperator Accum> {			SDPatternOperator Accum> {
	def v4i16 : BaseSIMDThreeSameVectorTiedR0<0, U, 0b01, opc, V64, asm, ".4h",			def v4i16 : SIMDThreeSameVectorTiedR0<0, U, 0b01, opc, V64, asm, ".4h",
	[(set (v4i16 V64:$dst),			[(set (v4i16 V64:$dst),
	(Accum (v4i16 V64:$Rd),			(Accum (v4i16 V64:$Rd),
	(v4i16 (int_aarch64_neon_sqrdmulh (v4i16 V64:$Rn),			(v4i16 (int_aarch64_neon_sqrdmulh (v4i16 V64:$Rn),
	(v4i16 V64:$Rm)))))]>;			(v4i16 V64:$Rm)))))]>;
	def v8i16 : BaseSIMDThreeSameVectorTiedR0<1, U, 0b01, opc, V128, asm, ".8h",			def v8i16 : SIMDThreeSameVectorTiedR0<1, U, 0b01, opc, V128, asm, ".8h",
	[(set (v8i16 V128:$dst),			[(set (v8i16 V128:$dst),
	(Accum (v8i16 V128:$Rd),			(Accum (v8i16 V128:$Rd),
	(v8i16 (int_aarch64_neon_sqrdmulh (v8i16 V128:$Rn),			(v8i16 (int_aarch64_neon_sqrdmulh (v8i16 V128:$Rn),
	(v8i16 V128:$Rm)))))]>;			(v8i16 V128:$Rm)))))]>;
	def v2i32 : BaseSIMDThreeSameVectorTiedR0<0, U, 0b10, opc, V64, asm, ".2s",			def v2i32 : SIMDThreeSameVectorTiedR0<0, U, 0b10, opc, V64, asm, ".2s",
	[(set (v2i32 V64:$dst),			[(set (v2i32 V64:$dst),
	(Accum (v2i32 V64:$Rd),			(Accum (v2i32 V64:$Rd),
	(v2i32 (int_aarch64_neon_sqrdmulh (v2i32 V64:$Rn),			(v2i32 (int_aarch64_neon_sqrdmulh (v2i32 V64:$Rn),
	(v2i32 V64:$Rm)))))]>;			(v2i32 V64:$Rm)))))]>;
	def v4i32 : BaseSIMDThreeSameVectorTiedR0<1, U, 0b10, opc, V128, asm, ".4s",			def v4i32 : SIMDThreeSameVectorTiedR0<1, U, 0b10, opc, V128, asm, ".4s",
	[(set (v4i32 V128:$dst),			[(set (v4i32 V128:$dst),
	(Accum (v4i32 V128:$Rd),			(Accum (v4i32 V128:$Rd),
	(v4i32 (int_aarch64_neon_sqrdmulh (v4i32 V128:$Rn),			(v4i32 (int_aarch64_neon_sqrdmulh (v4i32 V128:$Rn),
	(v4i32 V128:$Rm)))))]>;			(v4i32 V128:$Rm)))))]>;
	}			}

	multiclass SIMDIndexedSQRDMLxHSDTied<bit U, bits<4> opc, string asm,			multiclass SIMDIndexedSQRDMLxHSDTied<bit U, bits<4> opc, string asm,
	SDPatternOperator Accum> {			SDPatternOperator Accum> {
	▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines
	bits<2> idx;			bits<2> idx;
	let Inst{11} = idx{1};			let Inst{11} = idx{1};
	let Inst{21} = idx{0};			let Inst{21} = idx{0};
	}			}
	}			}
	} // let Predicates = [HasNeon, HasRDM]			} // let Predicates = [HasNeon, HasRDM]

	//----------------------------------------------------------------------------			//----------------------------------------------------------------------------
				// ARMv8.3 Complex ADD/MLA instructions
				//----------------------------------------------------------------------------

				class ComplexRotationOperand<int Angle, int Remainder, string Type>
				: AsmOperandClass {
				let PredicateMethod = "isComplexRotation<" # Angle # ", " # Remainder # ">";
				let DiagnosticType = "InvalidComplexRotation" # Type;
				let Name = "ComplexRotation" # Type;
				}
				def complexrotateop : Operand<i32> {
				let ParserMatchClass = ComplexRotationOperand<90, 0, "Even">;
				let PrintMethod = "printComplexRotationOp<90, 0>";
				}
				def complexrotateopodd : Operand<i32> {
				let ParserMatchClass = ComplexRotationOperand<180, 90, "Odd">;
				let PrintMethod = "printComplexRotationOp<180, 90>";
				}

				class BaseSIMDThreeSameVectorComplex<bit Q, bit U, bits<3> size, bits<3> opcode,
				RegisterOperand regtype, Operand rottype,
				string asm, string kind, list<dag> pattern>
				SjoerdMeijerUnsubmitted Not Done Reply Inline Actions I don't think we need 2 operand classes. They are essentially the same things, just some constants are different. An example from the ARM backend is: ImmAsmOperand<int Low, int High>, where we pass the range of the imm value. I think we can do something similar here. We can then also avoid some duplication in the print and predicate functions, see also comments below. SjoerdMeijer: I don't think we need 2 operand classes. They are essentially the same things, just some…
				: SIMDThreeSameVector<Q, U, size, 0b00000, regtype, asm, kind, pattern> {
				bits<1> rot;
				// Non-tied version (FCADD) only has one rotation bit
				let Inst{15-13} = opcode;
				let Inst{12} = rot;
				let Inst{11} = 0;
				let InOperandList = (ins regtype:$Rn, regtype:$Rm, rottype:$rot);
				let AsmString = !strconcat(asm, "{\t$Rd" # kind # ", $Rn" # kind # ", $Rm" #
				kind # ", $rot" "\|" # kind #
				"\t$Rd, $Rn, $Rm, $rot}");
				}

				multiclass SIMDThreeSameVectorComplexHSD<bit U, bits<3> opcode, Operand rottype,
				string asm, SDPatternOperator OpNode>{
				let Predicates = [HasV8_3a,HasNEON,HasFullFP16] in {
				SjoerdMeijerUnsubmitted Not Done Reply Inline Actions Nit: missing space after commas (also in the other predicates below) SjoerdMeijer: Nit: missing space after commas (also in the other predicates below)
				def v4f16 : BaseSIMDThreeSameVectorComplex<0, U, 0b010, opcode, V64, rottype,
				asm, ".4h",
				[(set (v4f16 V64:$dst), (OpNode (v4f16 V64:$Rd),
				(v4f16 V64:$Rn),
				(v4f16 V64:$Rm),
				(rottype i32:$rot)))]>;

				def v8f16 : BaseSIMDThreeSameVectorComplex<1, U, 0b010, opcode, V128, rottype,
				asm, ".8h",
				[(set (v8f16 V128:$dst), (OpNode (v8f16 V128:$Rd),
				(v8f16 V128:$Rn),
				(v8f16 V128:$Rm),
				(rottype i32:$rot)))]>;
				}

				let Predicates = [HasV8_3a,HasNEON] in {
				def v2f32 : BaseSIMDThreeSameVectorComplex<0, U, 0b100, opcode, V64, rottype,
				asm, ".2s",
				[(set (v2f32 V64:$dst), (OpNode (v2f32 V64:$Rd),
				(v2f32 V64:$Rn),
				(v2f32 V64:$Rm),
				(rottype i32:$rot)))]>;

				def v4f32 : BaseSIMDThreeSameVectorComplex<1, U, 0b100, opcode, V128, rottype,
				asm, ".4s",
				[(set (v4f32 V128:$dst), (OpNode (v4f32 V128:$Rd),
				(v4f32 V128:$Rn),
				(v4f32 V128:$Rm),
				(rottype i32:$rot)))]>;

				def v2f64 : BaseSIMDThreeSameVectorComplex<1, U, 0b110, opcode, V128, rottype,
				asm, ".2d",
				[(set (v2f64 V128:$dst), (OpNode (v2f64 V128:$Rd),
				(v2f64 V128:$Rn),
				(v2f64 V128:$Rm),
				(rottype i32:$rot)))]>;
				}
				}

				class SIMDThreeSameVectorTiedComplex<bit Q, bit U, bits<3> size,
				bits<3> opcode,
				RegisterOperand regtype,
				Operand rottype, string asm,
				string kind, list<dag> pattern>
				: SIMDThreeSameVectorTied<Q, U, size, 0b00000, regtype, asm, kind, pattern> {
				bits<2> rot;
				let Inst{15-13} = opcode;
				let Inst{12-11} = rot;
				let InOperandList = (ins regtype:$Rd, regtype:$Rn, regtype:$Rm,
				rottype:$rot);
				let AsmString = !strconcat(asm, "{\t$Rd" # kind # ", $Rn" # kind # ", $Rm" #
				kind # ", $rot" "\|" # kind #
				"\t$Rd, $Rn, $Rm, $rot}");
				}

				multiclass SIMDThreeSameVectorTiedComplexHSD<bit U, bits<3> opcode,
				Operand rottype, string asm,
				SDPatternOperator OpNode> {
				let Predicates = [HasV8_3a,HasNEON,HasFullFP16] in {
				def v4f16 : SIMDThreeSameVectorTiedComplex<0, U, 0b010, opcode, V64,
				rottype, asm, ".4h",
				[(set (v4f16 V64:$dst), (OpNode (v4f16 V64:$Rd),
				SjoerdMeijerUnsubmitted Not Done Reply Inline Actions Do you think it's worth making a base class for BaseSIMDThreeSameVectorTiedComplex and BaseSIMDThreeSameVectorComplex? SjoerdMeijer: Do you think it's worth making a base class for BaseSIMDThreeSameVectorTiedComplex and…
				(v4f16 V64:$Rn),
				(v4f16 V64:$Rm),
				(rottype i32:$rot)))]>;

				def v8f16 : SIMDThreeSameVectorTiedComplex<1, U, 0b010, opcode, V128,
				rottype, asm, ".8h",
				[(set (v8f16 V128:$dst), (OpNode (v8f16 V128:$Rd),
				(v8f16 V128:$Rn),
				(v8f16 V128:$Rm),
				(rottype i32:$rot)))]>;
				}

				let Predicates = [HasV8_3a,HasNEON] in {
				def v2f32 : SIMDThreeSameVectorTiedComplex<0, U, 0b100, opcode, V64,
				rottype, asm, ".2s",
				[(set (v2f32 V64:$dst), (OpNode (v2f32 V64:$Rd),
				(v2f32 V64:$Rn),
				(v2f32 V64:$Rm),
				(rottype i32:$rot)))]>;

				def v4f32 : SIMDThreeSameVectorTiedComplex<1, U, 0b100, opcode, V128,
				rottype, asm, ".4s",
				[(set (v4f32 V128:$dst), (OpNode (v4f32 V128:$Rd),
				(v4f32 V128:$Rn),
				(v4f32 V128:$Rm),
				(rottype i32:$rot)))]>;

				def v2f64 : SIMDThreeSameVectorTiedComplex<1, U, 0b110, opcode, V128,
				rottype, asm, ".2d",
				[(set (v2f64 V128:$dst), (OpNode (v2f64 V128:$Rd),
				(v2f64 V128:$Rn),
				(v2f64 V128:$Rm),
				(rottype i32:$rot)))]>;
				}
				}

				let mayLoad = 0, mayStore = 0, hasSideEffects = 0 in
				class BaseSIMDIndexedTiedComplex<bit Q, bit U, bit Scalar, bits<2> size,
				bit opc1, bit opc2, RegisterOperand dst_reg,
				RegisterOperand lhs_reg,
				RegisterOperand rhs_reg, Operand vec_idx,
				Operand rottype, string asm, string apple_kind,
				string dst_kind, string lhs_kind,
				string rhs_kind, list<dag> pattern>
				: I<(outs dst_reg:$dst),
				(ins dst_reg:$Rd, lhs_reg:$Rn, rhs_reg:$Rm, vec_idx:$idx, rottype:$rot),
				asm,
				"{\t$Rd" # dst_kind # ", $Rn" # lhs_kind # ", $Rm" # rhs_kind #
				"$idx, $rot" # "\|" # apple_kind #
				"\t$Rd, $Rn, $Rm$idx, $rot}", "$Rd = $dst", pattern>,
				Sched<[WriteV]> {
				bits<5> Rd;
				bits<5> Rn;
				bits<5> Rm;
				bits<2> rot;

				let Inst{31} = 0;
				let Inst{30} = Q;
				let Inst{29} = U;
				let Inst{28} = Scalar;
				let Inst{27-24} = 0b1111;
				let Inst{23-22} = size;
				// Bit 21 must be set by the derived class.
				let Inst{20-16} = Rm;
				let Inst{15} = opc1;
				let Inst{14-13} = rot;
				let Inst{12} = opc2;
				// Bit 11 must be set by the derived class.
				let Inst{10} = 0;
				let Inst{9-5} = Rn;
				let Inst{4-0} = Rd;
				}

				// The complex instructions index by pairs of elements, so the VectorIndexes
				// don't match the lane types, and the index bits are different to the other
				// classes.
				multiclass SIMDIndexedTiedComplexHSD<bit U, bit opc1, bit opc2, Operand rottype,
				string asm, SDPatternOperator OpNode> {
				let Predicates = [HasV8_3a,HasNEON,HasFullFP16] in {
				def v4f16_indexed : BaseSIMDIndexedTiedComplex<0, 1, 0, 0b01, opc1, opc2, V64,
				V64, V128, VectorIndexD, rottype, asm, ".4h", ".4h",
				".4h", ".h", []> {
				bits<1> idx;
				let Inst{11} = 0;
				let Inst{21} = idx{0};
				}

				def v8f16_indexed : BaseSIMDIndexedTiedComplex<1, 1, 0, 0b01, opc1, opc2,
				V128, V128, V128, VectorIndexS, rottype, asm, ".8h",
				".8h", ".8h", ".h", []> {
				bits<2> idx;
				let Inst{11} = idx{1};
				let Inst{21} = idx{0};
				}
				} // Predicates = [HasV8_3a,HasNEON,HasFullFP16]

				let Predicates = [HasV8_3a,HasNEON] in {
				def v4f32_indexed : BaseSIMDIndexedTiedComplex<1, 1, 0, 0b10, opc1, opc2,
				V128, V128, V128, VectorIndexD, rottype, asm, ".4s",
				".4s", ".4s", ".s", []> {
				bits<1> idx;
				let Inst{11} = idx{0};
				let Inst{21} = 0;
				}
				} // Predicates = [HasV8_3a,HasNEON]
				}

				//----------------------------------------------------------------------------
	// Crypto extensions			// Crypto extensions
	//----------------------------------------------------------------------------			//----------------------------------------------------------------------------

	let Predicates = [HasCrypto] in {			let Predicates = [HasCrypto] in {
	let mayLoad = 0, mayStore = 0, hasSideEffects = 0 in			let mayLoad = 0, mayStore = 0, hasSideEffects = 0 in
	class AESBase<bits<4> opc, string asm, dag outs, dag ins, string cstr,			class AESBase<bits<4> opc, string asm, dag outs, dag ins, string cstr,
	list<dag> pat>			list<dag> pat>
	: I<outs, ins, asm, "{\t$Rd.16b, $Rn.16b\|.16b\t$Rd, $Rn}", cstr, pat>,			: I<outs, ins, asm, "{\t$Rd.16b, $Rn.16b\|.16b\t$Rd, $Rn}", cstr, pat>,
	▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

lib/Target/AArch64/AArch64InstrInfo.td

	Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
	let Predicates = [HasRCPC] in {			let Predicates = [HasRCPC] in {
	// v8.3 Release Consistent Processor Consistent support, optional in v8.2.			// v8.3 Release Consistent Processor Consistent support, optional in v8.2.
	def LDAPRB : RCPCLoad<0b00, "ldaprb", GPR32>;			def LDAPRB : RCPCLoad<0b00, "ldaprb", GPR32>;
	def LDAPRH : RCPCLoad<0b01, "ldaprh", GPR32>;			def LDAPRH : RCPCLoad<0b01, "ldaprh", GPR32>;
	def LDAPRW : RCPCLoad<0b10, "ldapr", GPR32>;			def LDAPRW : RCPCLoad<0b10, "ldapr", GPR32>;
	def LDAPRX : RCPCLoad<0b11, "ldapr", GPR64>;			def LDAPRX : RCPCLoad<0b11, "ldapr", GPR64>;
	}			}

				// v8.3a complex add and multiply-accumulate. No predicate here, that is done
				// inside the multiclass as the F16 versions need different predicates.
				SjoerdMeijerUnsubmitted Not Done Reply Inline Actions Nit: F16 => FP16? SjoerdMeijer: Nit: F16 => FP16?
				SjoerdMeijerUnsubmitted Not Done Reply Inline Actions Nit: F16 => FP16? SjoerdMeijer: Nit: F16 => FP16?
				defm FCMLA : SIMDThreeSameVectorTiedComplexHSD<1, 0b110, complexrotateop,
				"fcmla", null_frag>;
				defm FCADD : SIMDThreeSameVectorComplexHSD<1, 0b111, complexrotateopodd,
				"fcadd", null_frag>;
				defm FCMLA : SIMDIndexedTiedComplexHSD<1, 0, 1, complexrotateop, "fcmla",
				null_frag>;

	let Predicates = [HasV8_3a] in {			let Predicates = [HasV8_3a] in {
	// v8.3a Pointer Authentication			// v8.3a Pointer Authentication
	let Uses = [LR], Defs = [LR] in {			let Uses = [LR], Defs = [LR] in {
	def PACIAZ : SystemNoOperands<0b000, "paciaz">;			def PACIAZ : SystemNoOperands<0b000, "paciaz">;
	def PACIBZ : SystemNoOperands<0b010, "pacibz">;			def PACIBZ : SystemNoOperands<0b010, "pacibz">;
	def AUTIAZ : SystemNoOperands<0b100, "autiaz">;			def AUTIAZ : SystemNoOperands<0b100, "autiaz">;
	def AUTIBZ : SystemNoOperands<0b110, "autibz">;			def AUTIBZ : SystemNoOperands<0b110, "autibz">;
	}			}
	▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

lib/Target/AArch64/AsmParser/AArch64AsmParser.cpp

	Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
	Reg.RegNum);			Reg.RegNum);
	}			}

	bool isGPR64sp0() const {			bool isGPR64sp0() const {
	return Kind == k_Register && !Reg.isVector &&			return Kind == k_Register && !Reg.isVector &&
	AArch64MCRegisterClasses[AArch64::GPR64spRegClassID].contains(Reg.RegNum);			AArch64MCRegisterClasses[AArch64::GPR64spRegClassID].contains(Reg.RegNum);
	}			}

				template<int64_t Angle, int64_t Remainder>
				SjoerdMeijerUnsubmitted Not Done Reply Inline Actions We can refactor this and the next function (and create only 1), and use "PredicateMethod" in the AsmOperand class. SjoerdMeijer: We can refactor this and the next function (and create only 1), and use "PredicateMethod" in…
				bool isComplexRotation() const {
				if (!isImm()) return false;

				const MCConstantExpr *CE = dyn_cast<MCConstantExpr>(getImm());
				if (!CE) return false;
				uint64_t Value = CE->getValue();

				return (Value % Angle == Remainder && Value <= 270);
				}

	/// Is this a vector list with the type implicit (presumably attached to the			/// Is this a vector list with the type implicit (presumably attached to the
	/// instruction itself)?			/// instruction itself)?
	template <unsigned NumRegs> bool isImplicitlyTypedVectorList() const {			template <unsigned NumRegs> bool isImplicitlyTypedVectorList() const {
	return Kind == k_VectorList && VectorList.Count == NumRegs &&			return Kind == k_VectorList && VectorList.Count == NumRegs &&
	!VectorList.ElementKind;			!VectorList.ElementKind;
	}			}

	template <unsigned NumRegs, unsigned NumElements, char ElementKind>			template <unsigned NumRegs, unsigned NumElements, char ElementKind>
	▲ Show 20 Lines • Show All 184 Lines • ▼ Show 20 Lines
	void addMOVNMovAliasOperands(MCInst &Inst, unsigned N) const {			void addMOVNMovAliasOperands(MCInst &Inst, unsigned N) const {
	assert(N == 1 && "Invalid number of operands!");			assert(N == 1 && "Invalid number of operands!");

	const MCConstantExpr *CE = cast<MCConstantExpr>(getImm());			const MCConstantExpr *CE = cast<MCConstantExpr>(getImm());
	uint64_t Value = CE->getValue();			uint64_t Value = CE->getValue();
	Inst.addOperand(MCOperand::createImm((~Value >> Shift) & 0xffff));			Inst.addOperand(MCOperand::createImm((~Value >> Shift) & 0xffff));
	}			}

				void addComplexRotationEvenOperands(MCInst &Inst, unsigned N) const {
				SjoerdMeijerUnsubmitted Not Done Reply Inline Actions We can refactor this function and the next one and create one (template/parametric) function to avoid code duplication (see earlier comment about the operand classes). SjoerdMeijer: We can refactor this function and the next one and create one (template/parametric) function to…
				assert(N == 1 && "Invalid number of operands!");
				const MCConstantExpr *MCE = cast<MCConstantExpr>(getImm());
				Inst.addOperand(MCOperand::createImm(MCE->getValue() / 90));
				}

				void addComplexRotationOddOperands(MCInst &Inst, unsigned N) const {
				assert(N == 1 && "Invalid number of operands!");
				const MCConstantExpr *MCE = cast<MCConstantExpr>(getImm());
				Inst.addOperand(MCOperand::createImm((MCE->getValue() - 90) / 180));
				}

	void print(raw_ostream &OS) const override;			void print(raw_ostream &OS) const override;

	static std::unique_ptr<AArch64Operand>			static std::unique_ptr<AArch64Operand>
	CreateToken(StringRef Str, bool IsSuffix, SMLoc S, MCContext &Ctx) {			CreateToken(StringRef Str, bool IsSuffix, SMLoc S, MCContext &Ctx) {
	auto Op = make_unique<AArch64Operand>(k_Token, Ctx);			auto Op = make_unique<AArch64Operand>(k_Token, Ctx);
	Op->Tok.Data = Str.data();			Op->Tok.Data = Str.data();
	Op->Tok.Length = Str.size();			Op->Tok.Length = Str.size();
	Op->Tok.IsSuffix = IsSuffix;			Op->Tok.IsSuffix = IsSuffix;
	▲ Show 20 Lines • Show All 184 Lines • ▼ Show 20 Lines
	case Match_InvalidIndexD:			case Match_InvalidIndexD:
	return Error(Loc, "vector lane must be an integer in range [0, 1].");			return Error(Loc, "vector lane must be an integer in range [0, 1].");
	case Match_InvalidLabel:			case Match_InvalidLabel:
	return Error(Loc, "expected label or encodable integer pc offset");			return Error(Loc, "expected label or encodable integer pc offset");
	case Match_MRS:			case Match_MRS:
	return Error(Loc, "expected readable system register");			return Error(Loc, "expected readable system register");
	case Match_MSR:			case Match_MSR:
	return Error(Loc, "expected writable system register or pstate");			return Error(Loc, "expected writable system register or pstate");
				case Match_InvalidComplexRotationEven:
				return Error(Loc, "complex rotation must be 0, 90, 180 or 270.");
				case Match_InvalidComplexRotationOdd:
				return Error(Loc, "complex rotation must be 90 or 270.");
	case Match_MnemonicFail: {			case Match_MnemonicFail: {
	std::string Suggestion = AArch64MnemonicSpellCheck(			std::string Suggestion = AArch64MnemonicSpellCheck(
	((AArch64Operand &)*Operands[0]).getToken(),			((AArch64Operand &)*Operands[0]).getToken(),
	ComputeAvailableFeatures(STI->getFeatureBits()));			ComputeAvailableFeatures(STI->getFeatureBits()));
	return Error(Loc, "unrecognized instruction mnemonic" + Suggestion);			return Error(Loc, "unrecognized instruction mnemonic" + Suggestion);
	}			}
	default:			default:
	llvm_unreachable("unexpected error code!");			llvm_unreachable("unexpected error code!");
	▲ Show 20 Lines • Show All 184 Lines • ▼ Show 20 Lines
	case Match_InvalidImm1_32:			case Match_InvalidImm1_32:
	case Match_InvalidImm1_64:			case Match_InvalidImm1_64:
	case Match_InvalidIndex1:			case Match_InvalidIndex1:
	case Match_InvalidIndexB:			case Match_InvalidIndexB:
	case Match_InvalidIndexH:			case Match_InvalidIndexH:
	case Match_InvalidIndexS:			case Match_InvalidIndexS:
	case Match_InvalidIndexD:			case Match_InvalidIndexD:
	case Match_InvalidLabel:			case Match_InvalidLabel:
				case Match_InvalidComplexRotationEven:
				case Match_InvalidComplexRotationOdd:
	case Match_MSR:			case Match_MSR:
	case Match_MRS: {			case Match_MRS: {
	if (ErrorInfo >= Operands.size())			if (ErrorInfo >= Operands.size())
	return Error(IDLoc, "too few operands for instruction", SMRange(IDLoc, (*Operands.back()).getEndLoc()));			return Error(IDLoc, "too few operands for instruction", SMRange(IDLoc, (*Operands.back()).getEndLoc()));
	// Any time we get here, there's nothing fancy to do. Just get the			// Any time we get here, there's nothing fancy to do. Just get the
	// operand SMLoc and display the diagnostic.			// operand SMLoc and display the diagnostic.
	SMLoc ErrorLoc = ((AArch64Operand &)*Operands[ErrorInfo]).getStartLoc();			SMLoc ErrorLoc = ((AArch64Operand &)*Operands[ErrorInfo]).getStartLoc();
	if (ErrorLoc == SMLoc())			if (ErrorLoc == SMLoc())
	▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

lib/Target/AArch64/InstPrinter/AArch64InstPrinter.h

	Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
	void printMSRSystemRegister(const MCInst *MI, unsigned OpNum,			void printMSRSystemRegister(const MCInst *MI, unsigned OpNum,
	const MCSubtargetInfo &STI, raw_ostream &O);			const MCSubtargetInfo &STI, raw_ostream &O);
	void printMRSSystemRegister(const MCInst *MI, unsigned OpNum,			void printMRSSystemRegister(const MCInst *MI, unsigned OpNum,
	const MCSubtargetInfo &STI, raw_ostream &O);			const MCSubtargetInfo &STI, raw_ostream &O);
	void printSystemPStateField(const MCInst *MI, unsigned OpNum,			void printSystemPStateField(const MCInst *MI, unsigned OpNum,
	const MCSubtargetInfo &STI, raw_ostream &O);			const MCSubtargetInfo &STI, raw_ostream &O);
	void printSIMDType10Operand(const MCInst *MI, unsigned OpNum,			void printSIMDType10Operand(const MCInst *MI, unsigned OpNum,
	const MCSubtargetInfo &STI, raw_ostream &O);			const MCSubtargetInfo &STI, raw_ostream &O);
				template<int64_t Angle, int64_t Remainder>
				void printComplexRotationOp(const MCInst *MI, unsigned OpNo,
				const MCSubtargetInfo &STI, raw_ostream &O);
	template<unsigned size>			template<unsigned size>
	void printGPRSeqPairsClassOperand(const MCInst *MI, unsigned OpNum,			void printGPRSeqPairsClassOperand(const MCInst *MI, unsigned OpNum,
	const MCSubtargetInfo &STI,			const MCSubtargetInfo &STI,
	raw_ostream &O);			raw_ostream &O);
	};			};

	class AArch64AppleInstPrinter : public AArch64InstPrinter {			class AArch64AppleInstPrinter : public AArch64InstPrinter {
	public:			public:
	Show All 26 Lines

lib/Target/AArch64/InstPrinter/AArch64InstPrinter.cpp

	Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines

	void AArch64InstPrinter::printSIMDType10Operand(const MCInst *MI, unsigned OpNo,	void AArch64InstPrinter::printSIMDType10Operand(const MCInst *MI, unsigned OpNo,
	const MCSubtargetInfo &STI,	const MCSubtargetInfo &STI,
	raw_ostream &O) {	raw_ostream &O) {
	unsigned RawVal = MI->getOperand(OpNo).getImm();	unsigned RawVal = MI->getOperand(OpNo).getImm();
	uint64_t Val = AArch64_AM::decodeAdvSIMDModImmType10(RawVal);	uint64_t Val = AArch64_AM::decodeAdvSIMDModImmType10(RawVal);
	O << format("#%#016llx", Val);	O << format("#%#016llx", Val);
	}	}

		template<int64_t Angle, int64_t Remainder>
		SjoerdMeijerUnsubmitted Not Done Reply Inline Actions Same here. SjoerdMeijer: Same here.
		void AArch64InstPrinter::printComplexRotationOp(const MCInst *MI, unsigned OpNo,
		const MCSubtargetInfo &STI,
		raw_ostream &O) {
		unsigned Val = MI->getOperand(OpNo).getImm();
		O << "#" << (Val * Angle) + Remainder;
		}

Context not available.

test/MC/AArch64/armv8.3a-complex.s

This file was added.

				// RUN: not llvm-mc -triple aarch64-none-linux-gnu -show-encoding -mattr=+v8.3a,-fullfp16 < %s 2>%t \| FileCheck %s --check-prefix=CHECK --check-prefix=NO-FP16
				// RUN: FileCheck --check-prefix=STDERR --check-prefix=STDERR-NO-FP16 %s < %t
				// RUN: not llvm-mc -triple aarch64-none-linux-gnu -show-encoding -mattr=+v8.3a,+fullfp16 < %s 2>%t \| FileCheck %s --check-prefix=CHECK --check-prefix=FP16
				// RUN: FileCheck --check-prefix=STDERR --check-prefix=STDERR-FP16 %s < %t
				// RUN: not llvm-mc -triple aarch64-none-linux-gnu -show-encoding -mattr=+v8.2a,-v8.3a,+fullfp16 < %s 2>&1 \| FileCheck %s --check-prefix=NO-V83A


				// ==== FCMLA vector ====
				// Types
				fcmla v0.4h, v1.4h, v2.4h, #0
				// FP16: fcmla v0.4h, v1.4h, v2.4h, #0 // encoding: [0x20,0xc4,0x42,0x2e]
				// STDERR-NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: fullfp16
				// NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.8h, v1.8h, v2.8h, #0
				// FP16: fcmla v0.8h, v1.8h, v2.8h, #0 // encoding: [0x20,0xc4,0x42,0x6e]
				// STDERR-NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: fullfp16
				// NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.2s, v1.2s, v2.2s, #0
				// CHECK: fcmla v0.2s, v1.2s, v2.2s, #0 // encoding: [0x20,0xc4,0x82,0x2e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.4s, v1.4s, v2.4s, #0
				// CHECK: fcmla v0.4s, v1.4s, v2.4s, #0 // encoding: [0x20,0xc4,0x82,0x6e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.2d, v1.2d, v2.2d, #0
				// CHECK: fcmla v0.2d, v1.2d, v2.2d, #0 // encoding: [0x20,0xc4,0xc2,0x6e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a

				// Rotations
				fcmla v0.2s, v1.2s, v2.2s, #0
				// CHECK: fcmla v0.2s, v1.2s, v2.2s, #0 // encoding: [0x20,0xc4,0x82,0x2e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.2s, v1.2s, v2.2s, #90
				// CHECK: fcmla v0.2s, v1.2s, v2.2s, #90 // encoding: [0x20,0xcc,0x82,0x2e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.2s, v1.2s, v2.2s, #180
				// CHECK: fcmla v0.2s, v1.2s, v2.2s, #180 // encoding: [0x20,0xd4,0x82,0x2e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.2s, v1.2s, v2.2s, #270
				// CHECK: fcmla v0.2s, v1.2s, v2.2s, #270 // encoding: [0x20,0xdc,0x82,0x2e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a

				// Invalid rotations
				fcmla v0.2s, v1.2s, v2.2s, #1
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 0, 90, 180 or 270.
				fcmla v0.2s, v1.2s, v2.2s, #360
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 0, 90, 180 or 270.
				fcmla v0.2s, v1.2s, v2.2s, #-90
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 0, 90, 180 or 270.

				// ==== FCADD vector ====
				// Types
				fcadd v0.4h, v1.4h, v2.4h, #90
				// FP16: fcadd v0.4h, v1.4h, v2.4h, #90 // encoding: [0x20,0xe4,0x42,0x2e]
				// STDERR-NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: fullfp16
				// NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcadd v0.8h, v1.8h, v2.8h, #90
				// FP16: fcadd v0.8h, v1.8h, v2.8h, #90 // encoding: [0x20,0xe4,0x42,0x6e]
				// STDERR-NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: fullfp16
				// NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcadd v0.2s, v1.2s, v2.2s, #90
				// CHECK: fcadd v0.2s, v1.2s, v2.2s, #90 // encoding: [0x20,0xe4,0x82,0x2e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcadd v0.4s, v1.4s, v2.4s, #90
				// CHECK: fcadd v0.4s, v1.4s, v2.4s, #90 // encoding: [0x20,0xe4,0x82,0x6e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcadd v0.2d, v1.2d, v2.2d, #90
				// CHECK: fcadd v0.2d, v1.2d, v2.2d, #90 // encoding: [0x20,0xe4,0xc2,0x6e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a

				// Rotations
				fcadd v0.2s, v1.2s, v2.2s, #90
				// CHECK: fcadd v0.2s, v1.2s, v2.2s, #90 // encoding: [0x20,0xe4,0x82,0x2e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcadd v0.2s, v1.2s, v2.2s, #270
				// CHECK: fcadd v0.2s, v1.2s, v2.2s, #270 // encoding: [0x20,0xf4,0x82,0x2e]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a

				// Invalid rotations
				fcadd v0.2s, v1.2s, v2.2s, #1
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 90 or 270.
				fcadd v0.2s, v1.2s, v2.2s, #360
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 90 or 270.
				fcadd v0.2s, v1.2s, v2.2s, #-90
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 90 or 270.
				fcadd v0.2s, v1.2s, v2.2s, #0
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 90 or 270.
				fcadd v0.2s, v1.2s, v2.2s, #180
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 90 or 270.

				// ==== FCMLA indexed ====
				// Types
				fcmla v0.4h, v1.4h, v2.h[0], #0
				// FP16: fcmla v0.4h, v1.4h, v2.h[0], #0 // encoding: [0x20,0x10,0x42,0x2f]
				// STDERR-NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: fullfp16
				// NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.8h, v1.8h, v2.h[0], #0
				// FP16: fcmla v0.8h, v1.8h, v2.h[0], #0 // encoding: [0x20,0x10,0x42,0x6f]
				// STDERR-NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: fullfp16
				// NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.2s, v1.2s, v2.s[0], #0
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: invalid operand for instruction
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: invalid operand for instruction
				fcmla v0.4s, v1.4s, v2.s[0], #0
				// CHECK: fcmla v0.4s, v1.4s, v2.s[0], #0 // encoding: [0x20,0x10,0x82,0x6f]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.2d, v1.2d, v2.d[0], #0
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: invalid operand for instruction
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: invalid operand for instruction

				// Rotations
				fcmla v0.4s, v1.4s, v2.s[0], #90
				// CHECK: fcmla v0.4s, v1.4s, v2.s[0], #90 // encoding: [0x20,0x30,0x82,0x6f]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.4s, v1.4s, v2.s[0], #180
				// CHECK: fcmla v0.4s, v1.4s, v2.s[0], #180 // encoding: [0x20,0x50,0x82,0x6f]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.4s, v1.4s, v2.s[0], #270
				// CHECK: fcmla v0.4s, v1.4s, v2.s[0], #270 // encoding: [0x20,0x70,0x82,0x6f]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a

				// Valid indices
				fcmla v0.4h, v1.4h, v2.h[1], #0
				// FP16: fcmla v0.4h, v1.4h, v2.h[1], #0 // encoding: [0x20,0x10,0x62,0x2f]
				// STDERR-NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: fullfp16
				// NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.8h, v1.8h, v2.h[3], #0
				// FP16: fcmla v0.8h, v1.8h, v2.h[3], #0 // encoding: [0x20,0x18,0x62,0x6f]
				// STDERR-NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: fullfp16
				// NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: error: instruction requires: armv8.3a
				fcmla v0.4s, v1.4s, v2.s[1], #0
				// CHECK: fcmla v0.4s, v1.4s, v2.s[1], #0 // encoding: [0x20,0x18,0x82,0x6f]
				// NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: error: instruction requires: armv8.3a

				// Invalid indices
				fcmla v0.4h, v1.4h, v2.h[2], #0
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: vector lane must be an integer in range [0, 1].
				fcmla v0.8h, v1.8h, v2.h[4], #0
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: vector lane must be an integer in range [0, 3].
				fcmla v0.4s, v1.4s, v2.s[2], #0
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: vector lane must be an integer in range [0, 1].

				// Invalid rotations
				fcmla v0.4s, v1.4s, v2.s[0], #1
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 0, 90, 180 or 270.
				fcmla v0.4s, v1.4s, v2.s[0], #360
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 0, 90, 180 or 270.
				fcmla v0.4s, v1.4s, v2.s[0], #-90
				// STDERR: :[[@LINE-1]]:{{[0-9]*}}: error: complex rotation must be 0, 90, 180 or 270.

test/MC/Disassembler/AArch64/armv8.3a-complex.txt

This file was added.

				# RUN: not llvm-mc -triple aarch64-none-linux-gnu -mattr=+v8.3a,-fullfp16 --disassemble < %s 2>%t \| FileCheck %s --check-prefix=CHECK
				# RUN: FileCheck %s < %t --check-prefix=NO-FP16
				# RUN: llvm-mc -triple aarch64-none-linux-gnu -mattr=+v8.3a,+fullfp16 --disassemble < %s 2>%t \| FileCheck %s --check-prefix=CHECK --check-prefix=FP16
				# RUN: not llvm-mc -triple aarch64-none-linux-gnu -mattr=-v8.3a,+fullfp16 --disassemble < %s 2>&1 \| FileCheck %s --check-prefix=NO-V83A

				###### FCMLA vector
				[0x20,0xc4,0x42,0x2e]
				# FP16: fcmla v0.4h, v1.4h, v2.4h, #0
				# NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				# NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xc4,0x42,0x6e]
				# FP16: fcmla v0.8h, v1.8h, v2.8h, #0
				# NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				# NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xc4,0x82,0x2e]
				# CHECK: fcmla v0.2s, v1.2s, v2.2s, #0
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xc4,0x82,0x6e]
				# CHECK: fcmla v0.4s, v1.4s, v2.4s, #0
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xc4,0xc2,0x6e]
				# CHECK: fcmla v0.2d, v1.2d, v2.2d, #0
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding


				[0x20,0xc4,0x82,0x2e]
				# CHECK: fcmla v0.2s, v1.2s, v2.2s, #0
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xcc,0x82,0x2e]
				# CHECK: fcmla v0.2s, v1.2s, v2.2s, #90
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xd4,0x82,0x2e]
				# CHECK: fcmla v0.2s, v1.2s, v2.2s, #180
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xdc,0x82,0x2e]
				# CHECK: fcmla v0.2s, v1.2s, v2.2s, #270
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding


				###### FCADD vector
				[0x20,0xe4,0x42,0x2e]
				# FP16: fcadd v0.4h, v1.4h, v2.4h, #90
				# NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				# NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xe4,0x42,0x6e]
				# FP16: fcadd v0.8h, v1.8h, v2.8h, #90
				# NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				# NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xe4,0x82,0x2e]
				# CHECK: fcadd v0.2s, v1.2s, v2.2s, #90
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xe4,0x82,0x6e]
				# CHECK: fcadd v0.4s, v1.4s, v2.4s, #90
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xe4,0xc2,0x6e]
				# CHECK: fcadd v0.2d, v1.2d, v2.2d, #90
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding


				[0x20,0xe4,0x82,0x2e]
				# CHECK: fcadd v0.2s, v1.2s, v2.2s, #90
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0xf4,0x82,0x2e]
				# CHECK: fcadd v0.2s, v1.2s, v2.2s, #270
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding

				[0x20,0x10,0x42,0x2f]
				# FP16: fcmla v0.4h, v1.4h, v2.h[0], #0
				# NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				# NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0x10,0x42,0x6f]
				# FP16: fcmla v0.8h, v1.8h, v2.h[0], #0
				# NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				# NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0x10,0x82,0x6f]
				# CHECK: fcmla v0.4s, v1.4s, v2.s[0], #0
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding


				[0x20,0x30,0x82,0x6f]
				# CHECK: fcmla v0.4s, v1.4s, v2.s[0], #90
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0x50,0x82,0x6f]
				# CHECK: fcmla v0.4s, v1.4s, v2.s[0], #180
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0x70,0x82,0x6f]
				# CHECK: fcmla v0.4s, v1.4s, v2.s[0], #270
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding


				[0x20,0x10,0x62,0x2f]
				# FP16: fcmla v0.4h, v1.4h, v2.h[1], #0
				# NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				# NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0x18,0x62,0x6f]
				# FP16: fcmla v0.8h, v1.8h, v2.h[3], #0
				# NO-FP16: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding
				# NO-V83A: :[[@LINE-3]]:{{[0-9]*}}: warning: invalid instruction encoding
				[0x20,0x18,0x82,0x6f]
				# CHECK: fcmla v0.4s, v1.4s, v2.s[1], #0
				# NO-V83A: :[[@LINE-2]]:{{[0-9]*}}: warning: invalid instruction encoding

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] v8.3-a complex number supportClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 113260

lib/Target/AArch64/AArch64InstrFormats.td

lib/Target/AArch64/AArch64InstrInfo.td

lib/Target/AArch64/AsmParser/AArch64AsmParser.cpp

lib/Target/AArch64/InstPrinter/AArch64InstPrinter.h

lib/Target/AArch64/InstPrinter/AArch64InstPrinter.cpp

test/MC/AArch64/armv8.3a-complex.s

test/MC/Disassembler/AArch64/armv8.3a-complex.txt

[AArch64] v8.3-a complex number support
ClosedPublic