This is an archive of the discontinued LLVM Phabricator instance.

[ARM] Fix for indexed dot product instruction descriptions
ClosedPublic

Authored by SjoerdMeijer on Sep 18 2017, 3:54 AM.

Details

Reviewers

t.p.northover
rengolin
samparker
john.brawn

Commits

rG4e6df159621f: [ARM] Fix for indexed dot product instruction descriptions
rL313531: [ARM] Fix for indexed dot product instruction descriptions

Summary

The indexed dot product instructions only accept the lower 16 D-registers as
the indexed register, but we were e.g. incorrectly accepting:

vudot.u8 d16,d16,d18[0]

Diff Detail

Event Timeline

SjoerdMeijer created this revision. Sep 18 2017, 3:54 AM

Herald added subscribers: kristof.beyls, javed.absar, aemerson. Sep 18 2017, 3:54 AM

The last ARMv8.2 manual I could find is from 31 March, but it says UDOT/SDOT will be documented later. Do you have an update on that?

It's really hard to review patches without official documentation out.

Hi, the full architecture specification is publicly available here:

https://developer.arm.com/products/architecture/a-profile/exploration-tools

which I also mentioned in the commit message of r310480, which introduced initial AArch64 assembler support.

Hope this helps.

Interesting, the AArch32 PDF doesn't have SDOT/UDOT, only the AArch64 one...

Ignore me, that was VSDOT... :)

Where is the restriction that only the lower 16 registers are allowed? Can you test quad regs, too? Just to make the lane issue clear.

It's in the pseudo-code:

integer m = UInt(Vm<3:0>);
integer index = UInt(M);

Normally that 'M' bit would be the high bit of Vm (as for Vd and Vn just above). Here it's used to encode the lane.

I was just replying, but yes, there are only 4 bits available to encode Vm, the other M bit is the index.
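The encoding point above can be sketched in a few lines of Python. This is an illustration only, not code from the patch: the function names are invented for this note, and `decode_plain_dreg` assumes the usual NEON convention that a D-register number is formed as M:Vm. It shows why a register above d15 has no encoding in the indexed form: the bit pattern that would ordinarily name d18 reads back as d2 with lane 1.

```python
def decode_plain_dreg(vm, m_bit):
    """Ordinary D-register operand (assumed convention): m = UInt(M:Vm)."""
    return (m_bit << 4) | (vm & 0xF)


def decode_indexed_dot(vm, m_bit):
    """Indexed dot product, per the quoted pseudo-code:
    m = UInt(Vm<3:0>); index = UInt(M)."""
    return vm & 0xF, m_bit


def encode_indexed_dot(reg, lane):
    """Hypothetical encoder: returns (Vm, M) or rejects the operand."""
    if not 0 <= reg <= 15:
        raise ValueError("only d0-d15 fit in the 4-bit Vm field")
    if lane not in (0, 1):
        raise ValueError("the single M bit can only hold lane 0 or 1")
    return reg, lane


# The bits that would name d18 as a plain operand...
vm, m_bit = 0b0010, 1
plain = decode_plain_dreg(vm, m_bit)
# ...mean "d2, lane 1" when read as an indexed dot product operand.
indexed = decode_indexed_dot(vm, m_bit)
```

With only four bits for the register and one for the lane, rejecting d16 and above in the assembler is the only consistent choice.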

In D37968#873710, @t.p.northover wrote:

It's in the pseudo-code:

D'oh! I was looking in the vector one.

LGTM. Thanks!

This revision is now accepted and ready to land. Sep 18 2017, 7:07 AM

Thanks for checking!

Closed by commit rL313531: [ARM] Fix for indexed dot product instruction descriptions (authored by SjoerdMeijer). Sep 18 2017, 7:19 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path                                    Size
lib/Target/ARM/ARMInstrNEON.td          2 lines
test/MC/ARM/armv8.2a-dotprod-error.s    22 lines

Diff 115619

lib/Target/ARM/ARMInstrNEON.td

 def VUDOTQ : N3Vnp<0b11000, 0b10, 0b1101, 0b1, 0b1,
     N3RegFrm, IIC_VDOTPROD, "vudot", "u8", []>;
 def VSDOTQ : N3Vnp<0b11000, 0b10, 0b1101, 0b1, 0b0,
     (outs QPR:$Vd), (ins QPR:$Vn, QPR:$Vm),
     N3RegFrm, IIC_VDOTPROD, "vsdot", "s8", []>;

 // Indexed dot product instructions:
 class DOTI<string opc, string dt, bit Q, bit U, RegisterClass Ty> :
     N3Vnp<0b11100, 0b10, 0b1101, Q, U,
-    (outs Ty:$Vd), (ins Ty:$Vn, DPR:$Vm, VectorIndex32:$lane),
+    (outs Ty:$Vd), (ins Ty:$Vn, DPR_VFP2:$Vm, VectorIndex32:$lane),
     N3RegFrm, IIC_VDOTPROD, opc, dt, []> {
   bit lane;
   let Inst{5} = lane;
   let AsmString = !strconcat(opc, ".", dt, "\t$Vd, $Vn, $Vm$lane");
 }

 def VUDOTDI : DOTI<"vudot", "u8", 0b0, 0b1, DPR>;
 def VSDOTDI : DOTI<"vsdot", "s8", 0b0, 0b0, DPR>;

test/MC/ARM/armv8.2a-dotprod-error.s

 // RUN: not llvm-mc -triple arm -mattr=+dotprod -show-encoding < %s 2> %t
 // RUN: FileCheck --check-prefix=CHECK-ERROR < %t %s
 // RUN: not llvm-mc -triple thumb -mattr=+dotprod -show-encoding < %s 2> %t
 // RUN: FileCheck --check-prefix=CHECK-ERROR < %t %s

+// Only indices 0 an 1 should be accepted:
+
 vudot.u8 d0, d1, d2[2]
 vsdot.s8 d0, d1, d2[2]
 vudot.u8 q0, q1, d4[2]
 vsdot.s8 q0, q1, d4[2]

 // CHECK-ERROR: error: invalid operand for instruction
+// CHECK-ERROR: vudot.u8 d0, d1, d2[2]
+// CHECK-ERROR: ^
+// CHECK-ERROR: error: invalid operand for instruction
+// CHECK-ERROR: vsdot.s8 d0, d1, d2[2]
+// CHECK-ERROR: ^
+// CHECK-ERROR: error: invalid operand for instruction
+// CHECK-ERROR: vudot.u8 q0, q1, d4[2]
+// CHECK-ERROR: ^
 // CHECK-ERROR: error: invalid operand for instruction
+// CHECK-ERROR: vsdot.s8 q0, q1, d4[2]
+// CHECK-ERROR: ^
+
+// Only the lower 16 D-registers should be accepted:
+
+vudot.u8 q0, q1, d16[0]
+vsdot.s8 q0, q1, d16[0]
+
 // CHECK-ERROR: error: invalid operand for instruction
+// CHECK-ERROR: vudot.u8 q0, q1, d16[0]
+// CHECK-ERROR: ^
 // CHECK-ERROR: error: invalid operand for instruction
+// CHECK-ERROR: vsdot.s8 q0, q1, d16[0]
+// CHECK-ERROR: ^
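
The added CHECK-ERROR lines pin each diagnostic to the instruction that triggered it, which works because FileCheck matches its directives in order. A much-simplified model of that ordering is sketched below. It is an assumption-laden illustration: the function name is invented, the sample stderr text is made up rather than captured from llvm-mc, and real FileCheck additionally canonicalizes whitespace and supports regular expressions.

```python
def checks_match_in_order(patterns, output):
    """Each pattern must occur after the end of the previous match."""
    pos = 0
    for pattern in patterns:
        found = output.find(pattern, pos)
        if found < 0:
            return False
        pos = found + len(pattern)
    return True


# Made-up stderr in the general shape of assembler diagnostics.
sample_stderr = (
    "error: invalid operand for instruction\n"
    "vudot.u8 q0, q1, d16[0]\n"
    "^\n"
    "error: invalid operand for instruction\n"
    "vsdot.s8 q0, q1, d16[0]\n"
    "^\n"
)

expected = [
    "error: invalid operand for instruction",
    "vudot.u8 q0, q1, d16[0]",
    "error: invalid operand for instruction",
    "vsdot.s8 q0, q1, d16[0]",
]

in_order = checks_match_in_order(expected, sample_stderr)
swapped = checks_match_in_order(
    [expected[0], expected[3], expected[2], expected[1]], sample_stderr
)
```

In this model the expected sequence matches, while the same patterns with the two instruction lines swapped do not, which is the property that ties each error to its source line.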