This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU][MC][GFX10][WS32] Corrected decoding of dst operand for v_cmp_*_sdwa opcodes
ClosedPublic

Authored by dp on Oct 2 2019, 11:05 AM.

Download Raw Diff

Details

Reviewers

rampitec
arsenm

Commits

rG434d59250e38: [AMDGPU][MC][GFX10][WS32] Corrected decoding of dst operand for v_cmp_*_sdwa…
rL373745: [AMDGPU][MC][GFX10][WS32] Corrected decoding of dst operand for v_cmp_*_sdwa…

Summary

v_cmp_*_sdwa instructions are decoded incorrectly in wavesize=32 mode: the size of dst is invalid. An example:

0xf9,0x04,0x1e,0x7d,0x01,0xfb,0x06,0x06

Should be decoded as follows:

v_cmp_class_f16_sdwa ttmp15, v1, v2 src0_sel:DWORD src1_sel:DWORD

Actual result:

v_cmp_class_f16_sdwa ttmp[14:15], v1, v2 src0_sel:DWORD src1_sel:DWORD

See bug 43484: https://bugs.llvm.org/show_bug.cgi?id=43484

Diff Detail

Event Timeline

dp created this revision.Oct 2 2019, 11:05 AM

Herald added subscribers: t-tye, tpr, dstuttard and 5 others. · View Herald TranscriptOct 2 2019, 11:06 AM

LGTM with a nit.

test/MC/Disassembler/AMDGPU/vcmp-gfx10.txt
7	Please add new line.

This revision is now accepted and ready to land.Oct 2 2019, 11:16 AM

Closed by commit rG434d59250e38: [AMDGPU][MC][GFX10][WS32] Corrected decoding of dst operand for v_cmp_*_sdwa… (authored by dp). · Explain WhyOct 4 2019, 6:06 AM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. · View Herald TranscriptOct 4 2019, 6:06 AM

Herald added subscribers: llvm-commits, hiraditya. · View Herald Transcript

Revision Contents

Path

Size

lib/

Target/

AMDGPU/

Disassembler/

AMDGPUDisassembler.cpp

3 lines

test/

MC/

Disassembler/

AMDGPU/

vcmp-gfx10.txt

6 lines

Diff 222874

lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp

Show First 20 Lines • Show All 1,166 Lines • ▼ Show 20 Lines	MCOperand AMDGPUDisassembler::decodeSDWAVopcDst(unsigned Val) const {

bool IsWave64 = STI.getFeatureBits()[AMDGPU::FeatureWavefrontSize64];		bool IsWave64 = STI.getFeatureBits()[AMDGPU::FeatureWavefrontSize64];

if (Val & SDWA9EncValues::VOPC_DST_VCC_MASK) {		if (Val & SDWA9EncValues::VOPC_DST_VCC_MASK) {
Val &= SDWA9EncValues::VOPC_DST_SGPR_MASK;		Val &= SDWA9EncValues::VOPC_DST_SGPR_MASK;

int TTmpIdx = getTTmpIdx(Val);		int TTmpIdx = getTTmpIdx(Val);
if (TTmpIdx >= 0) {		if (TTmpIdx >= 0) {
return createSRegOperand(getTtmpClassId(OPW64), TTmpIdx);		auto TTmpClsId = getTtmpClassId(IsWave64 ? OPW64 : OPW32);
		return createSRegOperand(TTmpClsId, TTmpIdx);
} else if (Val > SGPR_MAX) {		} else if (Val > SGPR_MAX) {
return IsWave64 ? decodeSpecialReg64(Val)		return IsWave64 ? decodeSpecialReg64(Val)
: decodeSpecialReg32(Val);		: decodeSpecialReg32(Val);
} else {		} else {
return createSRegOperand(getSgprClassId(IsWave64 ? OPW64 : OPW32), Val);		return createSRegOperand(getSgprClassId(IsWave64 ? OPW64 : OPW32), Val);
}		}
} else {		} else {
return createRegOperand(IsWave64 ? AMDGPU::VCC : AMDGPU::VCC_LO);		return createRegOperand(IsWave64 ? AMDGPU::VCC : AMDGPU::VCC_LO);
▲ Show 20 Lines • Show All 85 Lines • Show Last 20 Lines

test/MC/Disassembler/AMDGPU/vcmp-gfx10.txt

This file was added.

				# RUN: llvm-mc -arch=amdgcn -mcpu=gfx1010 -mattr=+wavefrontsize32,-wavefrontsize64 -disassemble -show-encoding < %s \| FileCheck -check-prefixes=GFX10,W32 %s
				# RUN: llvm-mc -arch=amdgcn -mcpu=gfx1010 -mattr=-wavefrontsize32,+wavefrontsize64 -disassemble -show-encoding < %s \| FileCheck -check-prefixes=GFX10,W64 %s

				# W32: v_cmp_class_f16_sdwa ttmp14, v1, v2 src0_sel:DWORD src1_sel:DWORD ; encoding: [0xf9,0x04,0x1e,0x7d,0x01,0xfa,0x06,0x06]
				# W64: v_cmp_class_f16_sdwa ttmp[14:15], v1, v2 src0_sel:DWORD src1_sel:DWORD ; encoding: [0xf9,0x04,0x1e,0x7d,0x01,0xfa,0x06,0x06]
				0xf9,0x04,0x1e,0x7d,0x01,0xfa,0x06,0x06
				No newline at end of file
				rampitecUnsubmitted Not Done Reply Inline Actions Please add new line. rampitec: Please add new line.