This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Assembler: better support for immediate literals in assembler.
ClosedPublic

Authored by SamWot on Jul 28 2016, 9:15 AM.

Download Raw Diff

Details

Reviewers

artem.tamazov
• tstellarAMD
vpykhtin

Group Reviewers

Restricted Project

Commits

rG1eeb11bfd40e: AMDGPU] Assembler: better support for immediate literals in assembler.
rL281050: AMDGPU] Assembler: better support for immediate literals in assembler.

Summary

Prevously assembler parsed all literals as either 32-bit integers or 32-bit floating-point values. Because of this we couldn't support f64 literals.
E.g. in instruction "v_fract_f64 v[0:1], 0.5", literal 0.5 was encoded as 32-bit literal 0x3f000000, which is incorrect and will be interpreted as 3.0517578125E-5 instead of 0.5. Correct encoding is inline constant 240 (optimal) or 32-bit literal 0x3FE00000 at least.

With this change the way immediate literals are parsed is changed. All literals are always parsed as 64-bit values either integer or floating-point. Then we convert parsed literals to correct form based on information about type of operand parsed (was literal floating or binary) and type of expected instruction operands (is this f32/64 or b32/64 instruction).
Here are rules how we convert literals:

We parsed fp literal:
- Instruction expects 64-bit operand:
  - If parsed literal is inlinable (e.g. v_fract_f64_e32 v[0:1], 0.5)
    - then we do nothing this literal
  - Else if literal is not-inlinable but instruction requires to inline it (e.g. this is e64 encoding, v_fract_f64_e64 v[0:1], 1.5)
    - report error
  - Else literal is not-inlinable but we can encode it as additional 32-bit literal constant
    - If instruction expect fp operand type (f64)
      - Check if low 32 bits of literal are zeroes (e.g. v_fract_f64 v[0:1], 1.5)
        
        If so then do nothing
      - Else (e.g. v_fract_f64 v[0:1], 3.1415)
        
        report warning that low 32 bits will be set to zeroes and precision will be lost
        
        set low 32 bits of literal to zeroes
    - Instruction expects integer operand type (e.g. s_mov_b64_e32 s[0:1], 1.5)
      - report error as it is unclear how to encode this literal
- Instruction expects 32-bit operand:
  - Convert parsed 64 bit fp literal to 32 bit fp. Allow lose of precision but not overflow or underflow
  - Is this literal inlinable and are we required to inline literal (e.g. v_trunc_f32_e64 v0, 0.5)
    - do nothing
    - Else report error
  - Do nothing. We can encode any other 32-bit fp literal (e.g. v_trunc_f32 v0, 10000000.0)
Parsed binary literal:
- Is this literal inlinable (e.g. v_trunc_f32_e32 v0, 35)
  - do nothing
- Else, are we required to inline this literal (e.g. v_trunc_f32_e64 v0, 35)
  - report error
- Else, literal is not-inlinable and we are not required to inline it
  - Are high 32 bit of literal zeroes or same as sign bit (32 bit)
    - do nothing (e.g. v_trunc_f32 v0, 0xdeadbeef)
  - Else
    - report error (e.g. v_trunc_f32 v0, 0x123456789abcdef0)

For this change it is required that we know operand types of instruction (are they f32/64 or b32/64). I added several new register operands (they extend previous register operands) and set operand types to corresponding types:
'''
enum OperandType {

OPERAND_REG_IMM32_INT,
OPERAND_REG_IMM32_FP,
OPERAND_REG_INLINE_C_INT,
OPERAND_REG_INLINE_C_FP,

}
'''

This is not working yet:

Several tests are failing
Problems with predicate methods for inline immediates
LLVM generated assembler parts try to select e64 encoding before e32.

More changes are required for several AsmOperands.

Diff Detail

Repository: rL LLVM

Event Timeline

SamWot updated this revision to Diff 65946.Jul 28 2016, 9:15 AM

SamWot retitled this revision from to [AMDGPU] Assembler: better support for immediate literals in assembler..

SamWot updated this object.

SamWot added a reviewer: vpykhtin.

SamWot added a subscriber: artem.tamazov.

Herald added a reviewer: • tstellarAMD. · View Herald TranscriptJul 28 2016, 9:15 AM

Herald added subscribers: kzhuravl, arsenm. · View Herald Transcript

Thanks for detailed explanation!

We parsed fp literal:

Instruction expects 64-bit operand:
  ...
  Else literal is not-inlinable but we can encode it as additional 32-bit literal constant
    If instruction expect fp operand type (f64)
      ...
      Else (e.g. v_fract_f64 v[0:1], 3.1415)
        report warning that low 32 bits will be set to zeroes and precision will be lost
        set low 32 bits of literal to zeroes

The last two sentences are questionable. I would prefer an error here. I believe we have to try to be exact wherever possible; do you agree?

We parsed fp literal:

Instruction expects 64-bit operand:
    ...
    Else literal is not-inlinable but we can encode it as additional 32-bit literal constant
        ...
        Instruction expects integer operand type (e.g. s_mov_b64_e32 s[0:1], 1.5)
            report error as it is unclear how to encode this literal

Actually it is clear provided that we know the signedness of an operand. 32-bit literals are zero-extended to 64 bits if operand type is B64/U64, or sign-extended for I64. So:

If operand type is I64
- If 33 upper bits are all ones or all zeroes, then can be encoded
- Else error
Else (we have B/U64)
- If 32 upper bits are all zeroes, then can be encoded
- Else error

We parsed fp literal:

...
Instruction expects 32-bit operand:
    Convert parsed 64 bit fp literal to 32 bit fp. Allow lose of precision but not overflow or underflow

Good place for the "loss of precision" warning here.

Parsed binary literal:
....

I am guessing that the whole "binary literal" branch needs to be reconsidered... More details later.

For this change it is required that we know operand types of instruction (are they f32/64 or b32/64)...

Hmm, I believe we need to know also if non-FP instruction operand is signed (I32/64), unsigned (U32/64) or "untyped bits" (B32/64).

Let's proceed with actual code review after we agree on basic algorithm.

Does using a 32-bit literal for a 64-bit operand actually work? I vaguely remember asking about this when working on operand folding and it wasn't clear if that really worked

In D22922#515738, @arsenm wrote:

Does using a 32-bit literal for a 64-bit operand actually work? I vaguely remember asking about this when working on operand folding and it wasn't clear if that really worked

Yes, it works. _F64 operands are zero-padded (low 32 bits set to zero, high 32 bits taken from instruction stream), _B/U64 - zero-extended, _I64 - sign-extended.

Changed way how it all works. Moved all checks to predicate methods and all mutators to render methods.

There are 5 failing tests for now. With this changes for several instruction (integer VOP instructions) assembler tries to generate 64-bit encoding in first place instead of 32-bit encoding where it is possible. This is correct behaviour (instructions are identical) but unwantable: we want assembler to generate 32-bit encoding where possible. This is caused by order in which instructions are sorted in MatchTable inside LLVM-generated assembler. We can't control how instructions are sorted and we shouldn't rely on this. It was just luck that previously they were sorted as we wanted and 32-bit instruction were generated in first place.
We should explicitly convert from 64-bit encoding to 32-bit encoding where possible. This might be done in converter method for VOP3 instructions. I will try to implement this.

In D22922#526273, @SamWot wrote:

...There are 5 failing tests for now... We should explicitly convert from 64-bit encoding to 32-bit encoding where possible.
This might be done in converter method for VOP3 instructions. I will try to implement this.

Yes, when encoding width is not specified explicitly (none of _e32 or _e64 suffix), assembler must use the narrowest width possible. Let's make this feature explicit: I recommend creating separate tests for it and fixing the 5 failing tests (by adding _e32 suffix). This way, this change would not lead to regressions. The new tests will fail until new VOP3 converter is implemented.

Fixed error with KIMM operand.
Moved all failing tests cases to new file vop3-conversion.s and marked them as XFAIL. I will fix them later.

Herald added a subscriber: wdng. · View Herald TranscriptAug 30 2016, 8:36 AM

Rebased on top of latest SOP changes.

SamWot added a reviewer: Restricted Project.Aug 31 2016, 3:56 AM

What about f16 inline immediates? I've been wondering how to handle those since currently the size of the register class is used in various places, but for f16 it will still be the same VReg_32

In D22922#515358, @artem.tamazov wrote:

Actually it is clear provided that we know the signedness of an operand. 32-bit literals are zero-extended to 64 bits if operand type is B64/U64, or sign-extended for I64. So...

In D22922#515361, @artem.tamazov wrote:

I am guessing that the whole "binary literal" branch needs to be reconsidered...

In D22922#515363, @artem.tamazov wrote:

...we need to know also if non-FP instruction operand is signed (I32/64), unsigned (U32/64) or "untyped bits" (B32/64).

Finding out signness of operands seems to be complicated task. Let's address it later, by dedicated patch.

In D22922#530587, @arsenm wrote:

What about f16 inline immediates?...

I recommend sorting out f16 immediates separately.

This revision is now accepted and ready to land.Sep 1 2016, 4:28 AM

In D22922#530587, @arsenm wrote:

What about f16 inline immediates? I've been wondering how to handle those since currently the size of the register class is used in various places, but for f16 it will still be the same VReg_32

In this change f16 are processed similarly as f32. It is not correct but implementing full support for f16 would involve massive changes. We decided that it is not necessary for now but we plan to do this later.
As you said main problem here is that f16 uses 32-bit registers so we can't use register size to understad type of operand.

ping

LGTM.

lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
692 ↗	(On Diff #69824)	spare part?

Removed XFAIL from conversion tests

Herald added a subscriber: nhaehnle. · View Herald TranscriptSep 9 2016, 7:45 AM

Closed by commit rL281050: AMDGPU] Assembler: better support for immediate literals in assembler. (authored by skolton). · Explain WhySep 9 2016, 7:52 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

trunk/

lib/

Target/

AMDGPU/

AMDGPUInstructions.td

7 lines

AsmParser/

AMDGPUAsmParser.cpp

507 lines

InstPrinter/

AMDGPUInstPrinter.cpp

2 lines

MCTargetDesc/

45 lines

19 lines

10 lines

255 lines

14 lines

6 lines

73 lines

30 lines

Utils/

AMDGPUBaseInfo.h

19 lines

AMDGPUBaseInfo.cpp

70 lines

VIInstructions.td

2 lines

test/

MC/

AMDGPU/

literals.s

477 lines

reg-syntax-extra.s

2 lines

regression/

11 lines

4 lines

11 lines

2 lines

136 lines

170 lines

411 lines

Diff 70837

llvm/trunk/lib/Target/AMDGPU/AMDGPUInstructions.td

	Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines

	def FP32Denormals : Predicate<"Subtarget.hasFP32Denormals()">;			def FP32Denormals : Predicate<"Subtarget.hasFP32Denormals()">;
	def FP64Denormals : Predicate<"Subtarget.hasFP64Denormals()">;			def FP64Denormals : Predicate<"Subtarget.hasFP64Denormals()">;
	def UnsafeFPMath : Predicate<"TM.Options.UnsafeFPMath">;			def UnsafeFPMath : Predicate<"TM.Options.UnsafeFPMath">;

	def InstFlag : OperandWithDefaultOps <i32, (ops (i32 0))>;			def InstFlag : OperandWithDefaultOps <i32, (ops (i32 0))>;
	def ADDRIndirect : ComplexPattern<iPTR, 2, "SelectADDRIndirect", [], []>;			def ADDRIndirect : ComplexPattern<iPTR, 2, "SelectADDRIndirect", [], []>;

	// 32-bit VALU immediate operand that uses the constant bus.
	def u32kimm : Operand<i32> {
	let OperandNamespace = "AMDGPU";
	let OperandType = "OPERAND_KIMM32";
	let PrintMethod = "printU32ImmOperand";
	}

	let OperandType = "OPERAND_IMMEDIATE" in {			let OperandType = "OPERAND_IMMEDIATE" in {

	def u32imm : Operand<i32> {			def u32imm : Operand<i32> {
	let PrintMethod = "printU32ImmOperand";			let PrintMethod = "printU32ImmOperand";
	}			}

	def u16imm : Operand<i16> {			def u16imm : Operand<i16> {
	let PrintMethod = "printU16ImmOperand";			let PrintMethod = "printU16ImmOperand";
	▲ Show 20 Lines • Show All 590 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp

Show All 14 Lines
#include "Utils/AMDKernelCodeTUtils.h"		#include "Utils/AMDKernelCodeTUtils.h"
#include "Utils/AMDGPUAsmUtils.h"		#include "Utils/AMDGPUAsmUtils.h"
#include "llvm/ADT/APFloat.h"		#include "llvm/ADT/APFloat.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallBitVector.h"		#include "llvm/ADT/SmallBitVector.h"
#include "llvm/ADT/SmallString.h"		#include "llvm/ADT/SmallString.h"
#include "llvm/ADT/StringSwitch.h"		#include "llvm/ADT/StringSwitch.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"
		#include "llvm/CodeGen/MachineValueType.h"
#include "llvm/MC/MCContext.h"		#include "llvm/MC/MCContext.h"
#include "llvm/MC/MCExpr.h"		#include "llvm/MC/MCExpr.h"
#include "llvm/MC/MCInst.h"		#include "llvm/MC/MCInst.h"
#include "llvm/MC/MCInstrInfo.h"		#include "llvm/MC/MCInstrInfo.h"
#include "llvm/MC/MCParser/MCAsmLexer.h"		#include "llvm/MC/MCParser/MCAsmLexer.h"
#include "llvm/MC/MCParser/MCAsmParser.h"		#include "llvm/MC/MCParser/MCAsmParser.h"
#include "llvm/MC/MCParser/MCParsedAsmOperand.h"		#include "llvm/MC/MCParser/MCParsedAsmOperand.h"
#include "llvm/MC/MCParser/MCTargetAsmParser.h"		#include "llvm/MC/MCParser/MCTargetAsmParser.h"
#include "llvm/MC/MCRegisterInfo.h"		#include "llvm/MC/MCRegisterInfo.h"
#include "llvm/MC/MCStreamer.h"		#include "llvm/MC/MCStreamer.h"
#include "llvm/MC/MCSubtargetInfo.h"		#include "llvm/MC/MCSubtargetInfo.h"
#include "llvm/MC/MCSymbolELF.h"		#include "llvm/MC/MCSymbolELF.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/ELF.h"		#include "llvm/Support/ELF.h"
#include "llvm/Support/SourceMgr.h"		#include "llvm/Support/SourceMgr.h"
#include "llvm/Support/TargetRegistry.h"		#include "llvm/Support/TargetRegistry.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"

using namespace llvm;		using namespace llvm;

namespace {		namespace {

		class AMDGPUAsmParser;
struct OptionalOperand;		struct OptionalOperand;

enum RegisterKind { IS_UNKNOWN, IS_VGPR, IS_SGPR, IS_TTMP, IS_SPECIAL };		enum RegisterKind { IS_UNKNOWN, IS_VGPR, IS_SGPR, IS_TTMP, IS_SPECIAL };

		//===----------------------------------------------------------------------===//
		// Operand
		//===----------------------------------------------------------------------===//

class AMDGPUOperand : public MCParsedAsmOperand {		class AMDGPUOperand : public MCParsedAsmOperand {
enum KindTy {		enum KindTy {
Token,		Token,
Immediate,		Immediate,
Register,		Register,
Expression		Expression
} Kind;		} Kind;

SMLoc StartLoc, EndLoc;		SMLoc StartLoc, EndLoc;
		const AMDGPUAsmParser *AsmParser;

public:		public:
AMDGPUOperand(enum KindTy K) : MCParsedAsmOperand(), Kind(K) {}		AMDGPUOperand(enum KindTy Kind_, const AMDGPUAsmParser *AsmParser_)
		: MCParsedAsmOperand(), Kind(Kind_), AsmParser(AsmParser_) {}

typedef std::unique_ptr<AMDGPUOperand> Ptr;		typedef std::unique_ptr<AMDGPUOperand> Ptr;

struct Modifiers {		struct Modifiers {
bool Abs;		bool Abs;
bool Neg;		bool Neg;
bool Sext;		bool Sext;

▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	public:
struct ImmOp {		struct ImmOp {
int64_t Val;		int64_t Val;
ImmTy Type;		ImmTy Type;
bool IsFPImm;		bool IsFPImm;
Modifiers Mods;		Modifiers Mods;
};		};

struct RegOp {		struct RegOp {
const MCRegisterInfo *TRI;
const MCSubtargetInfo *STI;
unsigned RegNo;		unsigned RegNo;
bool IsForcedVOP3;		bool IsForcedVOP3;
Modifiers Mods;		Modifiers Mods;
};		};

union {		union {
TokOp Tok;		TokOp Tok;
ImmOp Imm;		ImmOp Imm;
Show All 14 Lines	bool isToken() const override {
// interpret is a token, then we treat the symbol name as the token.		// interpret is a token, then we treat the symbol name as the token.
return isa<MCSymbolRefExpr>(Expr);		return isa<MCSymbolRefExpr>(Expr);
}		}

bool isImm() const override {		bool isImm() const override {
return Kind == Immediate;		return Kind == Immediate;
}		}

bool isInlinableImm() const {		bool isInlinableImm(MVT type) const;
if (!isImmTy(ImmTyNone)) {		bool isLiteralImm(MVT type) const;
// Only plain immediates are inlinable (e.g. "clamp" attribute is not)
return false;
}
// TODO: We should avoid using host float here. It would be better to
// check the float bit values which is what a few other places do.
// We've had bot failures before due to weird NaN support on mips hosts.
const float F = BitsToFloat(Imm.Val);
// TODO: Add 1/(2*pi) for VI
return (Imm.Val <= 64 && Imm.Val >= -16) \|\|
(F == 0.0 \|\| F == 0.5 \|\| F == -0.5 \|\| F == 1.0 \|\| F == -1.0 \|\|
F == 2.0 \|\| F == -2.0 \|\| F == 4.0 \|\| F == -4.0);
}

bool isRegKind() const {		bool isRegKind() const {
return Kind == Register;		return Kind == Register;
}		}

bool isReg() const override {		bool isReg() const override {
return isRegKind() && !Reg.Mods.hasModifiers();		return isRegKind() && !Reg.Mods.hasModifiers();
}		}

bool isRegOrImmWithInputMods() const {		bool isRegOrImmWithInputMods(MVT type) const {
return isRegKind() \|\| isInlinableImm();		return isRegKind() \|\| isInlinableImm(type);
		}

		bool isRegOrImmWithInt32InputMods() const {
		return isRegOrImmWithInputMods(MVT::i32);
		}

		bool isRegOrImmWithInt64InputMods() const {
		return isRegOrImmWithInputMods(MVT::i64);
		}

		bool isRegOrImmWithFP32InputMods() const {
		return isRegOrImmWithInputMods(MVT::f32);
		}

		bool isRegOrImmWithFP64InputMods() const {
		return isRegOrImmWithInputMods(MVT::f64);
}		}

bool isImmTy(ImmTy ImmT) const {		bool isImmTy(ImmTy ImmT) const {
return isImm() && Imm.Type == ImmT;		return isImm() && Imm.Type == ImmT;
}		}

bool isImmModifier() const {		bool isImmModifier() const {
return isImm() && Imm.Type != ImmTyNone;		return isImm() && Imm.Type != ImmTyNone;
Show All 27 Lines	public:
bool isMod() const {		bool isMod() const {
return isClampSI() \|\| isOModSI();		return isClampSI() \|\| isOModSI();
}		}

bool isRegOrImm() const {		bool isRegOrImm() const {
return isReg() \|\| isImm();		return isReg() \|\| isImm();
}		}

bool isRegClass(unsigned RCID) const {		bool isRegClass(unsigned RCID) const;
return isReg() && Reg.TRI->getRegClass(RCID).contains(getReg());
		bool isSCSrcB32() const {
		return isRegClass(AMDGPU::SReg_32RegClassID) \|\| isInlinableImm(MVT::i32);
}		}

bool isSCSrc32() const {		bool isSCSrcB64() const {
return isInlinableImm() \|\| isRegClass(AMDGPU::SReg_32RegClassID);		return isRegClass(AMDGPU::SReg_64RegClassID) \|\| isInlinableImm(MVT::i64);
}		}

bool isSCSrc64() const {		bool isSCSrcF32() const {
return isInlinableImm() \|\| isRegClass(AMDGPU::SReg_64RegClassID);		return isRegClass(AMDGPU::SReg_32RegClassID) \|\| isInlinableImm(MVT::f32);
}		}

bool isSSrc32() const {		bool isSCSrcF64() const {
return isImm() \|\| isSCSrc32() \|\| isExpr();		return isRegClass(AMDGPU::SReg_64RegClassID) \|\| isInlinableImm(MVT::f64);
}		}

bool isSSrc64() const {		bool isSSrcB32() const {
		return isSCSrcB32() \|\| isLiteralImm(MVT::i32) \|\| isExpr();
		}

		bool isSSrcB64() const {
// TODO: Find out how SALU supports extension of 32-bit literals to 64 bits.		// TODO: Find out how SALU supports extension of 32-bit literals to 64 bits.
// See isVSrc64().		// See isVSrc64().
return isImm() \|\| isSCSrc64();		return isSCSrcB64() \|\| isLiteralImm(MVT::i64);
		}

		bool isSSrcF32() const {
		return isSCSrcB32() \|\| isLiteralImm(MVT::f32) \|\| isExpr();
		}

		bool isSSrcF64() const {
		return isSCSrcB64() \|\| isLiteralImm(MVT::f64);
		}

		bool isVCSrcB32() const {
		return isRegClass(AMDGPU::VS_32RegClassID) \|\| isInlinableImm(MVT::i32);
		}

		bool isVCSrcB64() const {
		return isRegClass(AMDGPU::VS_64RegClassID) \|\| isInlinableImm(MVT::i64);
		}

		bool isVCSrcF32() const {
		return isRegClass(AMDGPU::VS_32RegClassID) \|\| isInlinableImm(MVT::f32);
		}

		bool isVCSrcF64() const {
		return isRegClass(AMDGPU::VS_64RegClassID) \|\| isInlinableImm(MVT::f64);
		}

		bool isVSrcB32() const {
		return isVCSrcF32() \|\| isLiteralImm(MVT::i32);
}		}

bool isVCSrc32() const {		bool isVSrcB64() const {
return isInlinableImm() \|\| isRegClass(AMDGPU::VS_32RegClassID);		return isVCSrcF64() \|\| isLiteralImm(MVT::i64);
}		}

bool isVCSrc64() const {		bool isVSrcF32() const {
return isInlinableImm() \|\| isRegClass(AMDGPU::VS_64RegClassID);		return isVCSrcF32() \|\| isLiteralImm(MVT::f32);
}		}

bool isVSrc32() const {		bool isVSrcF64() const {
return isImm() \|\| isVCSrc32();		return isVCSrcF64() \|\| isLiteralImm(MVT::f64);
}		}

bool isVSrc64() const {		bool isKImmFP32() const {
// TODO: Check if the 64-bit value (coming from assembly source) can be		return isLiteralImm(MVT::f32);
// narrowed to 32 bits (in the instruction stream). That require knowledge
// of instruction type (unsigned/signed, floating or "untyped"/B64),
// see [AMD GCN3 ISA 6.3.1].
// TODO: How 64-bit values are formed from 32-bit literals in _B64 insns?
return isImm() \|\| isVCSrc64();
}		}

bool isMem() const override {		bool isMem() const override {
return false;		return false;
}		}

bool isExpr() const {		bool isExpr() const {
return Kind == Expression;		return Kind == Expression;
▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	public:
bool hasFPModifiers() const {		bool hasFPModifiers() const {
return getModifiers().hasFPModifiers();		return getModifiers().hasFPModifiers();
}		}

bool hasIntModifiers() const {		bool hasIntModifiers() const {
return getModifiers().hasIntModifiers();		return getModifiers().hasIntModifiers();
}		}

void addImmOperands(MCInst &Inst, unsigned N, bool ApplyModifiers = true) const {		void addImmOperands(MCInst &Inst, unsigned N, bool ApplyModifiers = true) const;
if (isImmTy(ImmTyNone) && ApplyModifiers && Imm.Mods.hasFPModifiers()) {
// Apply modifiers to immediate value
int64_t Val = Imm.Val;
bool Negate = Imm.Mods.Neg; // Only negate can get here
if (Imm.IsFPImm) {
APFloat F(BitsToFloat(Val));
if (Negate) {
F.changeSign();
}
Val = F.bitcastToAPInt().getZExtValue();
} else {
Val = Negate ? -Val : Val;
}
Inst.addOperand(MCOperand::createImm(Val));
} else {
Inst.addOperand(MCOperand::createImm(getImm()));
}
}

void addRegOperands(MCInst &Inst, unsigned N) const {		void addLiteralImmOperand(MCInst &Inst, int64_t Val) const;
Inst.addOperand(MCOperand::createReg(AMDGPU::getMCReg(getReg(), *Reg.STI)));
}		void addKImmFP32Operands(MCInst &Inst, unsigned N) const;

		void addRegOperands(MCInst &Inst, unsigned N) const;

void addRegOrImmOperands(MCInst &Inst, unsigned N) const {		void addRegOrImmOperands(MCInst &Inst, unsigned N) const {
if (isRegKind())		if (isRegKind())
addRegOperands(Inst, N);		addRegOperands(Inst, N);
else if (isExpr())		else if (isExpr())
Inst.addOperand(MCOperand::createExpr(Expr));		Inst.addOperand(MCOperand::createExpr(Expr));
else		else
addImmOperands(Inst, N);		addImmOperands(Inst, N);
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines	case Token:
OS << '\'' << getToken() << '\'';		OS << '\'' << getToken() << '\'';
break;		break;
case Expression:		case Expression:
OS << "<expr " << *Expr << '>';		OS << "<expr " << *Expr << '>';
break;		break;
}		}
}		}

static AMDGPUOperand::Ptr CreateImm(int64_t Val, SMLoc Loc,		static AMDGPUOperand::Ptr CreateImm(const AMDGPUAsmParser *AsmParser,
		int64_t Val, SMLoc Loc,
enum ImmTy Type = ImmTyNone,		enum ImmTy Type = ImmTyNone,
bool IsFPImm = false) {		bool IsFPImm = false) {
auto Op = llvm::make_unique<AMDGPUOperand>(Immediate);		auto Op = llvm::make_unique<AMDGPUOperand>(Immediate, AsmParser);
Op->Imm.Val = Val;		Op->Imm.Val = Val;
Op->Imm.IsFPImm = IsFPImm;		Op->Imm.IsFPImm = IsFPImm;
Op->Imm.Type = Type;		Op->Imm.Type = Type;
Op->Imm.Mods = {false, false, false};		Op->Imm.Mods = {false, false, false};
Op->StartLoc = Loc;		Op->StartLoc = Loc;
Op->EndLoc = Loc;		Op->EndLoc = Loc;
return Op;		return Op;
}		}

static AMDGPUOperand::Ptr CreateToken(StringRef Str, SMLoc Loc,		static AMDGPUOperand::Ptr CreateToken(const AMDGPUAsmParser *AsmParser,
		StringRef Str, SMLoc Loc,
bool HasExplicitEncodingSize = true) {		bool HasExplicitEncodingSize = true) {
auto Res = llvm::make_unique<AMDGPUOperand>(Token);		auto Res = llvm::make_unique<AMDGPUOperand>(Token, AsmParser);
Res->Tok.Data = Str.data();		Res->Tok.Data = Str.data();
Res->Tok.Length = Str.size();		Res->Tok.Length = Str.size();
Res->StartLoc = Loc;		Res->StartLoc = Loc;
Res->EndLoc = Loc;		Res->EndLoc = Loc;
return Res;		return Res;
}		}

static AMDGPUOperand::Ptr CreateReg(unsigned RegNo, SMLoc S,		static AMDGPUOperand::Ptr CreateReg(const AMDGPUAsmParser *AsmParser,
		unsigned RegNo, SMLoc S,
SMLoc E,		SMLoc E,
const MCRegisterInfo *TRI,
const MCSubtargetInfo *STI,
bool ForceVOP3) {		bool ForceVOP3) {
auto Op = llvm::make_unique<AMDGPUOperand>(Register);		auto Op = llvm::make_unique<AMDGPUOperand>(Register, AsmParser);
Op->Reg.RegNo = RegNo;		Op->Reg.RegNo = RegNo;
Op->Reg.TRI = TRI;
Op->Reg.STI = STI;
Op->Reg.Mods = {false, false, false};		Op->Reg.Mods = {false, false, false};
Op->Reg.IsForcedVOP3 = ForceVOP3;		Op->Reg.IsForcedVOP3 = ForceVOP3;
Op->StartLoc = S;		Op->StartLoc = S;
Op->EndLoc = E;		Op->EndLoc = E;
return Op;		return Op;
}		}

static AMDGPUOperand::Ptr CreateExpr(const class MCExpr *Expr, SMLoc S) {		static AMDGPUOperand::Ptr CreateExpr(const AMDGPUAsmParser *AsmParser,
auto Op = llvm::make_unique<AMDGPUOperand>(Expression);		const class MCExpr *Expr, SMLoc S) {
		auto Op = llvm::make_unique<AMDGPUOperand>(Expression, AsmParser);
Op->Expr = Expr;		Op->Expr = Expr;
Op->StartLoc = S;		Op->StartLoc = S;
Op->EndLoc = S;		Op->EndLoc = S;
return Op;		return Op;
}		}
};		};

raw_ostream &operator <<(raw_ostream &OS, AMDGPUOperand::Modifiers Mods) {		raw_ostream &operator <<(raw_ostream &OS, AMDGPUOperand::Modifiers Mods) {
OS << "abs:" << Mods.Abs << " neg: " << Mods.Neg << " sext:" << Mods.Sext;		OS << "abs:" << Mods.Abs << " neg: " << Mods.Neg << " sext:" << Mods.Sext;
return OS;		return OS;
}		}

		//===----------------------------------------------------------------------===//
		// AsmParser
		//===----------------------------------------------------------------------===//

class AMDGPUAsmParser : public MCTargetAsmParser {		class AMDGPUAsmParser : public MCTargetAsmParser {
const MCInstrInfo &MII;		const MCInstrInfo &MII;
MCAsmParser &Parser;		MCAsmParser &Parser;

unsigned ForcedEncodingSize;		unsigned ForcedEncodingSize;
bool ForcedDPP;		bool ForcedDPP;
bool ForcedSDWA;		bool ForcedSDWA;

bool isSI() const {
return AMDGPU::isSI(getSTI());
}

bool isCI() const {
return AMDGPU::isCI(getSTI());
}

bool isVI() const {
return AMDGPU::isVI(getSTI());
}

bool hasSGPR102_SGPR103() const {
return !isVI();
}

/// @name Auto-generated Match Functions		/// @name Auto-generated Match Functions
/// {		/// {

#define GET_ASSEMBLER_HEADER		#define GET_ASSEMBLER_HEADER
#include "AMDGPUGenAsmMatcher.inc"		#include "AMDGPUGenAsmMatcher.inc"

/// }		/// }

▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	setAvailableFeatures(ComputeAvailableFeatures(getSTI().getFeatureBits()));
Sym->setVariableValue(MCConstantExpr::create(Isa.Major, Ctx));		Sym->setVariableValue(MCConstantExpr::create(Isa.Major, Ctx));
Sym = Ctx.getOrCreateSymbol(Twine(".option.machine_version_minor"));		Sym = Ctx.getOrCreateSymbol(Twine(".option.machine_version_minor"));
Sym->setVariableValue(MCConstantExpr::create(Isa.Minor, Ctx));		Sym->setVariableValue(MCConstantExpr::create(Isa.Minor, Ctx));
Sym = Ctx.getOrCreateSymbol(Twine(".option.machine_version_stepping"));		Sym = Ctx.getOrCreateSymbol(Twine(".option.machine_version_stepping"));
Sym->setVariableValue(MCConstantExpr::create(Isa.Stepping, Ctx));		Sym->setVariableValue(MCConstantExpr::create(Isa.Stepping, Ctx));
}		}
}		}

		bool isSI() const {
		return AMDGPU::isSI(getSTI());
		}

		bool isCI() const {
		return AMDGPU::isCI(getSTI());
		}

		bool isVI() const {
		return AMDGPU::isVI(getSTI());
		}

		bool hasSGPR102_SGPR103() const {
		return !isVI();
		}

AMDGPUTargetStreamer &getTargetStreamer() {		AMDGPUTargetStreamer &getTargetStreamer() {
MCTargetStreamer &TS = *getParser().getStreamer().getTargetStreamer();		MCTargetStreamer &TS = *getParser().getStreamer().getTargetStreamer();
return static_cast<AMDGPUTargetStreamer &>(TS);		return static_cast<AMDGPUTargetStreamer &>(TS);
}		}

		const MCRegisterInfo *getMRI() const {
		// We need this const_cast because for some reason getContext() is not const
		// in MCAsmParser.
		return const_cast<AMDGPUAsmParser*>(this)->getContext().getRegisterInfo();
		}

		const MCInstrInfo *getMII() const {
		return &MII;
		}

void setForcedEncodingSize(unsigned Size) { ForcedEncodingSize = Size; }		void setForcedEncodingSize(unsigned Size) { ForcedEncodingSize = Size; }
void setForcedDPP(bool ForceDPP_) { ForcedDPP = ForceDPP_; }		void setForcedDPP(bool ForceDPP_) { ForcedDPP = ForceDPP_; }
void setForcedSDWA(bool ForceSDWA_) { ForcedSDWA = ForceSDWA_; }		void setForcedSDWA(bool ForceSDWA_) { ForcedSDWA = ForceSDWA_; }

unsigned getForcedEncodingSize() const { return ForcedEncodingSize; }		unsigned getForcedEncodingSize() const { return ForcedEncodingSize; }
bool isForcedVOP3() const { return ForcedEncodingSize == 64; }		bool isForcedVOP3() const { return ForcedEncodingSize == 64; }
bool isForcedDPP() const { return ForcedDPP; }		bool isForcedDPP() const { return ForcedDPP; }
bool isForcedSDWA() const { return ForcedSDWA; }		bool isForcedSDWA() const { return ForcedSDWA; }

std::unique_ptr<AMDGPUOperand> parseRegister();		std::unique_ptr<AMDGPUOperand> parseRegister();
bool ParseRegister(unsigned &RegNo, SMLoc &StartLoc, SMLoc &EndLoc) override;		bool ParseRegister(unsigned &RegNo, SMLoc &StartLoc, SMLoc &EndLoc) override;
unsigned checkTargetMatchPredicate(MCInst &Inst) override;		unsigned checkTargetMatchPredicate(MCInst &Inst) override;
unsigned validateTargetOperandClass(MCParsedAsmOperand &Op,		unsigned validateTargetOperandClass(MCParsedAsmOperand &Op,
unsigned Kind) override;		unsigned Kind) override;
bool MatchAndEmitInstruction(SMLoc IDLoc, unsigned &Opcode,		bool MatchAndEmitInstruction(SMLoc IDLoc, unsigned &Opcode,
OperandVector &Operands, MCStreamer &Out,		OperandVector &Operands, MCStreamer &Out,
uint64_t &ErrorInfo,		uint64_t &ErrorInfo,
bool MatchingInlineAsm) override;		bool MatchingInlineAsm) override;
bool ParseDirective(AsmToken DirectiveID) override;		bool ParseDirective(AsmToken DirectiveID) override;
OperandMatchResultTy parseOperand(OperandVector &Operands, StringRef Mnemonic);		OperandMatchResultTy parseOperand(OperandVector &Operands, StringRef Mnemonic);
StringRef parseMnemonicSuffix(StringRef Name);		StringRef parseMnemonicSuffix(StringRef Name);
bool ParseInstruction(ParseInstructionInfo &Info, StringRef Name,		bool ParseInstruction(ParseInstructionInfo &Info, StringRef Name,
SMLoc NameLoc, OperandVector &Operands) override;		SMLoc NameLoc, OperandVector &Operands) override;
		//bool ProcessInstruction(MCInst &Inst);

OperandMatchResultTy parseIntWithPrefix(const char *Prefix, int64_t &Int);		OperandMatchResultTy parseIntWithPrefix(const char *Prefix, int64_t &Int);
OperandMatchResultTy parseIntWithPrefix(const char *Prefix,		OperandMatchResultTy parseIntWithPrefix(const char *Prefix,
OperandVector &Operands,		OperandVector &Operands,
enum AMDGPUOperand::ImmTy ImmTy = AMDGPUOperand::ImmTyNone,		enum AMDGPUOperand::ImmTy ImmTy = AMDGPUOperand::ImmTyNone,
bool (*ConvertResult)(int64_t&) = 0);		bool (*ConvertResult)(int64_t&) = 0);
OperandMatchResultTy parseNamedBit(const char *Name, OperandVector &Operands,		OperandMatchResultTy parseNamedBit(const char *Name, OperandVector &Operands,
enum AMDGPUOperand::ImmTy ImmTy = AMDGPUOperand::ImmTyNone);		enum AMDGPUOperand::ImmTy ImmTy = AMDGPUOperand::ImmTyNone);
▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	struct OptionalOperand {
const char *Name;		const char *Name;
AMDGPUOperand::ImmTy Type;		AMDGPUOperand::ImmTy Type;
bool IsBit;		bool IsBit;
bool (*ConvertResult)(int64_t&);		bool (*ConvertResult)(int64_t&);
};		};

}		}

		//===----------------------------------------------------------------------===//
		// Operand
		//===----------------------------------------------------------------------===//

		bool AMDGPUOperand::isInlinableImm(MVT type) const {
		if (!isImmTy(ImmTyNone)) {
		// Only plain immediates are inlinable (e.g. "clamp" attribute is not)
		return false;
		}
		// TODO: We should avoid using host float here. It would be better to
		// check the float bit values which is what a few other places do.
		// We've had bot failures before due to weird NaN support on mips hosts.

		APInt Literal(64, Imm.Val);

		if (Imm.IsFPImm) { // We got fp literal token
		if (type == MVT::f64 \|\| type == MVT::i64) { // Expected 64-bit operand
		return AMDGPU::isInlinableLiteral64(Imm.Val, AsmParser->isVI());
		} else { // Expected 32-bit operand
		bool lost;
		APFloat FPLiteral(APFloat::IEEEdouble, Literal);
		// Convert literal to single precision
		APFloat::opStatus status = FPLiteral.convert(APFloat::IEEEsingle,
		APFloat::rmNearestTiesToEven,
		&lost);
		// We allow precision lost but not overflow or underflow
		if (status != APFloat::opOK &&
		lost &&
		((status & APFloat::opOverflow) != 0 \|\|
		(status & APFloat::opUnderflow) != 0)) {
		return false;
		}
		// Check if single precision literal is inlinable
		return AMDGPU::isInlinableLiteral32(
		static_cast<int32_t>(FPLiteral.bitcastToAPInt().getZExtValue()),
		AsmParser->isVI());
		}
		} else { // We got int literal token
		if (type == MVT::f64 \|\| type == MVT::i64) { // Expected 64-bit operand
		return AMDGPU::isInlinableLiteral64(Imm.Val, AsmParser->isVI());
		} else { // Expected 32-bit operand
		return AMDGPU::isInlinableLiteral32(
		static_cast<int32_t>(Literal.getLoBits(32).getZExtValue()),
		AsmParser->isVI());
		}
		}
		return false;
		}

		bool AMDGPUOperand::isLiteralImm(MVT type) const {
		// Check that this imediate can be added as literal
		if (!isImmTy(ImmTyNone)) {
		return false;
		}

		APInt Literal(64, Imm.Val);

		if (Imm.IsFPImm) { // We got fp literal token
		if (type == MVT::f64) { // Expected 64-bit fp operand
		// We would set low 64-bits of literal to zeroes but we accept this literals
		return true;
		} else if (type == MVT::i64) { // Expected 64-bit int operand
		// We don't allow fp literals in 64-bit integer instructions. It is
		// unclear how we should encode them.
		return false;
		} else { // Expected 32-bit operand
		bool lost;
		APFloat FPLiteral(APFloat::IEEEdouble, Literal);
		// Convert literal to single precision
		APFloat::opStatus status = FPLiteral.convert(APFloat::IEEEsingle,
		APFloat::rmNearestTiesToEven,
		&lost);
		// We allow precision lost but not overflow or underflow
		if (status != APFloat::opOK &&
		lost &&
		((status & APFloat::opOverflow) != 0 \|\|
		(status & APFloat::opUnderflow) != 0)) {
		return false;
		}
		return true;
		}
		} else { // We got int literal token
		APInt HiBits = Literal.getHiBits(32);
		if (HiBits == 0xffffffff &&
		(*Literal.getLoBits(32).getRawData() & 0x80000000) != 0) {
		// If high 32 bits aren't zeroes then they all should be ones and 32nd
		// bit should be set. So that this 64-bit literal is sign-extension of
		// 32-bit value.
		return true;
		} else if (HiBits == 0) {
		return true;
		}
		}
		return false;
		}

		bool AMDGPUOperand::isRegClass(unsigned RCID) const {
		return isReg() && AsmParser->getMRI()->getRegClass(RCID).contains(getReg());
		}

		void AMDGPUOperand::addImmOperands(MCInst &Inst, unsigned N, bool ApplyModifiers) const {
		int64_t Val = Imm.Val;
		if (isImmTy(ImmTyNone) && ApplyModifiers && Imm.Mods.hasFPModifiers() && Imm.Mods.Neg) {
		// Apply modifiers to immediate value. Only negate can get here
		if (Imm.IsFPImm) {
		APFloat F(BitsToDouble(Val));
		F.changeSign();
		Val = F.bitcastToAPInt().getZExtValue();
		} else {
		Val = -Val;
		}
		}

		if (AMDGPU::isSISrcOperand(AsmParser->getMII()->get(Inst.getOpcode()), Inst.getNumOperands())) {
		addLiteralImmOperand(Inst, Val);
		} else {
		Inst.addOperand(MCOperand::createImm(Val));
		}
		}

		void AMDGPUOperand::addLiteralImmOperand(MCInst &Inst, int64_t Val) const {
		const auto& InstDesc = AsmParser->getMII()->get(Inst.getOpcode());
		auto OpNum = Inst.getNumOperands();
		// Check that this operand accepts literals
		assert(AMDGPU::isSISrcOperand(InstDesc, OpNum));

		APInt Literal(64, Val);
		auto OpSize = AMDGPU::getRegOperandSize(AsmParser->getMRI(), InstDesc, OpNum); // expected operand size

		if (Imm.IsFPImm) { // We got fp literal token
		if (OpSize == 8) { // Expected 64-bit operand
		// Check if literal is inlinable
		if (AMDGPU::isInlinableLiteral64(Literal.getZExtValue(), AsmParser->isVI())) {
		Inst.addOperand(MCOperand::createImm(Literal.getZExtValue()));
		} else if (AMDGPU::isSISrcFPOperand(InstDesc, OpNum)) { // Expected 64-bit fp operand
		// For fp operands we check if low 32 bits are zeros
		if (Literal.getLoBits(32) != 0) {
		const_cast<AMDGPUAsmParser *>(AsmParser)->Warning(Inst.getLoc(),
		"Can't encode literal as exact 64-bit"
		" floating-point operand. Low 32-bits will be"
		" set to zero");
		}
		Inst.addOperand(MCOperand::createImm(Literal.lshr(32).getZExtValue()));
		} else {
		// We don't allow fp literals in 64-bit integer instructions. It is
		// unclear how we should encode them. This case should be checked earlier
		// in predicate methods (isLiteralImm())
		llvm_unreachable("fp literal in 64-bit integer instruction.");
		}
		} else { // Expected 32-bit operand
		bool lost;
		APFloat FPLiteral(APFloat::IEEEdouble, Literal);
		// Convert literal to single precision
		FPLiteral.convert(APFloat::IEEEsingle, APFloat::rmNearestTiesToEven, &lost);
		// We allow precision lost but not overflow or underflow. This should be
		// checked earlier in isLiteralImm()
		Inst.addOperand(MCOperand::createImm(FPLiteral.bitcastToAPInt().getZExtValue()));
		}
		} else { // We got int literal token
		if (OpSize == 8) { // Expected 64-bit operand
		auto LiteralVal = Literal.getZExtValue();
		if (AMDGPU::isInlinableLiteral64(LiteralVal, AsmParser->isVI())) {
		Inst.addOperand(MCOperand::createImm(LiteralVal));
		return;
		}
		} else { // Expected 32-bit operand
		auto LiteralVal = static_cast<int32_t>(Literal.getLoBits(32).getZExtValue());
		if (AMDGPU::isInlinableLiteral32(LiteralVal, AsmParser->isVI())) {
		Inst.addOperand(MCOperand::createImm(LiteralVal));
		return;
		}
		}
		Inst.addOperand(MCOperand::createImm(Literal.getLoBits(32).getZExtValue()));
		}
		}

		void AMDGPUOperand::addKImmFP32Operands(MCInst &Inst, unsigned N) const {
		APInt Literal(64, Imm.Val);
		if (Imm.IsFPImm) { // We got fp literal
		bool lost;
		APFloat FPLiteral(APFloat::IEEEdouble, Literal);
		FPLiteral.convert(APFloat::IEEEsingle, APFloat::rmNearestTiesToEven, &lost);
		Inst.addOperand(MCOperand::createImm(FPLiteral.bitcastToAPInt().getZExtValue()));
		} else { // We got int literal token
		Inst.addOperand(MCOperand::createImm(Literal.getLoBits(32).getZExtValue()));
		}
		}

		void AMDGPUOperand::addRegOperands(MCInst &Inst, unsigned N) const {
		Inst.addOperand(MCOperand::createReg(AMDGPU::getMCReg(getReg(), AsmParser->getSTI())));
		}

		//===----------------------------------------------------------------------===//
		// AsmParser
		//===----------------------------------------------------------------------===//

static int getRegClass(RegisterKind Is, unsigned RegWidth) {		static int getRegClass(RegisterKind Is, unsigned RegWidth) {
if (Is == IS_VGPR) {		if (Is == IS_VGPR) {
switch (RegWidth) {		switch (RegWidth) {
default: return -1;		default: return -1;
case 1: return AMDGPU::VGPR_32RegClassID;		case 1: return AMDGPU::VGPR_32RegClassID;
case 2: return AMDGPU::VReg_64RegClassID;		case 2: return AMDGPU::VReg_64RegClassID;
case 3: return AMDGPU::VReg_96RegClassID;		case 3: return AMDGPU::VReg_96RegClassID;
case 4: return AMDGPU::VReg_128RegClassID;		case 4: return AMDGPU::VReg_128RegClassID;
▲ Show 20 Lines • Show All 198 Lines • ▼ Show 20 Lines	if (!subtargetHasRegister(*TRI, Reg))
return false;		return false;
return true;		return true;
}		}

std::unique_ptr<AMDGPUOperand> AMDGPUAsmParser::parseRegister() {		std::unique_ptr<AMDGPUOperand> AMDGPUAsmParser::parseRegister() {
const auto &Tok = Parser.getTok();		const auto &Tok = Parser.getTok();
SMLoc StartLoc = Tok.getLoc();		SMLoc StartLoc = Tok.getLoc();
SMLoc EndLoc = Tok.getEndLoc();		SMLoc EndLoc = Tok.getEndLoc();
const MCRegisterInfo *TRI = getContext().getRegisterInfo();

RegisterKind RegKind;		RegisterKind RegKind;
unsigned Reg, RegNum, RegWidth;		unsigned Reg, RegNum, RegWidth;

if (!ParseAMDGPURegister(RegKind, Reg, RegNum, RegWidth)) {		if (!ParseAMDGPURegister(RegKind, Reg, RegNum, RegWidth)) {
return nullptr;		return nullptr;
}		}
return AMDGPUOperand::CreateReg(Reg, StartLoc, EndLoc,		return AMDGPUOperand::CreateReg(this, Reg, StartLoc, EndLoc, false);
TRI, &getSTI(), false);
}		}

AMDGPUAsmParser::OperandMatchResultTy		AMDGPUAsmParser::OperandMatchResultTy
AMDGPUAsmParser::parseImm(OperandVector &Operands) {		AMDGPUAsmParser::parseImm(OperandVector &Operands) {
		// TODO: add syntactic sugar for 1/(2*PI)
bool Minus = false;		bool Minus = false;
if (getLexer().getKind() == AsmToken::Minus) {		if (getLexer().getKind() == AsmToken::Minus) {
Minus = true;		Minus = true;
Parser.Lex();		Parser.Lex();
}		}

SMLoc S = Parser.getTok().getLoc();		SMLoc S = Parser.getTok().getLoc();
switch(getLexer().getKind()) {		switch(getLexer().getKind()) {
case AsmToken::Integer: {		case AsmToken::Integer: {
int64_t IntVal;		int64_t IntVal;
if (getParser().parseAbsoluteExpression(IntVal))		if (getParser().parseAbsoluteExpression(IntVal))
return MatchOperand_ParseFail;		return MatchOperand_ParseFail;
if (!isInt<32>(IntVal) && !isUInt<32>(IntVal)) {
Error(S, "invalid immediate: only 32-bit values are legal");
return MatchOperand_ParseFail;
}

if (Minus)		if (Minus)
IntVal *= -1;		IntVal *= -1;
Operands.push_back(AMDGPUOperand::CreateImm(IntVal, S));		Operands.push_back(AMDGPUOperand::CreateImm(this, IntVal, S));
return MatchOperand_Success;		return MatchOperand_Success;
}		}
case AsmToken::Real: {		case AsmToken::Real: {
// FIXME: We should emit an error if a double precisions floating-point
// value is used. I'm not sure the best way to detect this.
int64_t IntVal;		int64_t IntVal;
if (getParser().parseAbsoluteExpression(IntVal))		if (getParser().parseAbsoluteExpression(IntVal))
return MatchOperand_ParseFail;		return MatchOperand_ParseFail;

APFloat F((float)BitsToDouble(IntVal));		APFloat F(BitsToDouble(IntVal));
if (Minus)		if (Minus)
F.changeSign();		F.changeSign();
Operands.push_back(		Operands.push_back(
AMDGPUOperand::CreateImm(F.bitcastToAPInt().getZExtValue(), S,		AMDGPUOperand::CreateImm(this, F.bitcastToAPInt().getZExtValue(), S,
AMDGPUOperand::ImmTyNone, true));		AMDGPUOperand::ImmTyNone, true));
return MatchOperand_Success;		return MatchOperand_Success;
}		}
default:		default:
return Minus ? MatchOperand_ParseFail : MatchOperand_NoMatch;		return Minus ? MatchOperand_ParseFail : MatchOperand_NoMatch;
}		}
}		}

▲ Show 20 Lines • Show All 489 Lines • ▼ Show 20 Lines	if (getLexer().getKind() == AsmToken::Identifier) {
// It is a little difficult to distinguish between a symbol name, and		// It is a little difficult to distinguish between a symbol name, and
// an instruction flag like 'gds'. In order to do this, we parse		// an instruction flag like 'gds'. In order to do this, we parse
// all tokens as expressions and then treate the symbol name as the token		// all tokens as expressions and then treate the symbol name as the token
// string when we want to interpret the operand as a token.		// string when we want to interpret the operand as a token.
const auto &Tok = Parser.getTok();		const auto &Tok = Parser.getTok();
SMLoc S = Tok.getLoc();		SMLoc S = Tok.getLoc();
const MCExpr *Expr = nullptr;		const MCExpr *Expr = nullptr;
if (!Parser.parseExpression(Expr)) {		if (!Parser.parseExpression(Expr)) {
Operands.push_back(AMDGPUOperand::CreateExpr(Expr, S));		Operands.push_back(AMDGPUOperand::CreateExpr(this, Expr, S));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

Operands.push_back(AMDGPUOperand::CreateToken(Tok.getString(), Tok.getLoc()));		Operands.push_back(AMDGPUOperand::CreateToken(this, Tok.getString(), Tok.getLoc()));
Parser.Lex();		Parser.Lex();
return MatchOperand_Success;		return MatchOperand_Success;
}		}
return MatchOperand_NoMatch;		return MatchOperand_NoMatch;
}		}

StringRef AMDGPUAsmParser::parseMnemonicSuffix(StringRef Name) {		StringRef AMDGPUAsmParser::parseMnemonicSuffix(StringRef Name) {
// Clear any forced encodings from the previous instruction.		// Clear any forced encodings from the previous instruction.
Show All 17 Lines	StringRef AMDGPUAsmParser::parseMnemonicSuffix(StringRef Name) {
return Name;		return Name;
}		}

bool AMDGPUAsmParser::ParseInstruction(ParseInstructionInfo &Info,		bool AMDGPUAsmParser::ParseInstruction(ParseInstructionInfo &Info,
StringRef Name,		StringRef Name,
SMLoc NameLoc, OperandVector &Operands) {		SMLoc NameLoc, OperandVector &Operands) {
// Add the instruction mnemonic		// Add the instruction mnemonic
Name = parseMnemonicSuffix(Name);		Name = parseMnemonicSuffix(Name);
Operands.push_back(AMDGPUOperand::CreateToken(Name, NameLoc));		Operands.push_back(AMDGPUOperand::CreateToken(this, Name, NameLoc));

while (!getLexer().is(AsmToken::EndOfStatement)) {		while (!getLexer().is(AsmToken::EndOfStatement)) {
AMDGPUAsmParser::OperandMatchResultTy Res = parseOperand(Operands, Name);		AMDGPUAsmParser::OperandMatchResultTy Res = parseOperand(Operands, Name);

// Eat the comma or space if there is one.		// Eat the comma or space if there is one.
if (getLexer().is(AsmToken::Comma))		if (getLexer().is(AsmToken::Comma))
Parser.Lex();		Parser.Lex();

▲ Show 20 Lines • Show All 58 Lines • ▼ Show 20 Lines	AMDGPUAsmParser::parseIntWithPrefix(const char *Prefix, OperandVector &Operands,
AMDGPUAsmParser::OperandMatchResultTy Res = parseIntWithPrefix(Prefix, Value);		AMDGPUAsmParser::OperandMatchResultTy Res = parseIntWithPrefix(Prefix, Value);
if (Res != MatchOperand_Success)		if (Res != MatchOperand_Success)
return Res;		return Res;

if (ConvertResult && !ConvertResult(Value)) {		if (ConvertResult && !ConvertResult(Value)) {
return MatchOperand_ParseFail;		return MatchOperand_ParseFail;
}		}

Operands.push_back(AMDGPUOperand::CreateImm(Value, S, ImmTy));		Operands.push_back(AMDGPUOperand::CreateImm(this, Value, S, ImmTy));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

AMDGPUAsmParser::OperandMatchResultTy		AMDGPUAsmParser::OperandMatchResultTy
AMDGPUAsmParser::parseNamedBit(const char *Name, OperandVector &Operands,		AMDGPUAsmParser::parseNamedBit(const char *Name, OperandVector &Operands,
enum AMDGPUOperand::ImmTy ImmTy) {		enum AMDGPUOperand::ImmTy ImmTy) {
int64_t Bit = 0;		int64_t Bit = 0;
SMLoc S = Parser.getTok().getLoc();		SMLoc S = Parser.getTok().getLoc();
Show All 15 Lines	switch(getLexer().getKind()) {
}		}
break;		break;
}		}
default:		default:
return MatchOperand_NoMatch;		return MatchOperand_NoMatch;
}		}
}		}

Operands.push_back(AMDGPUOperand::CreateImm(Bit, S, ImmTy));		Operands.push_back(AMDGPUOperand::CreateImm(this, Bit, S, ImmTy));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

typedef std::map<enum AMDGPUOperand::ImmTy, unsigned> OptionalImmIndexMap;		typedef std::map<enum AMDGPUOperand::ImmTy, unsigned> OptionalImmIndexMap;

void addOptionalImmOperand(MCInst& Inst, const OperandVector& Operands,		void addOptionalImmOperand(MCInst& Inst, const OperandVector& Operands,
OptionalImmIndexMap& OptionalIdx,		OptionalImmIndexMap& OptionalIdx,
enum AMDGPUOperand::ImmTy ImmT, int64_t Default = 0) {		enum AMDGPUOperand::ImmTy ImmT, int64_t Default = 0) {
▲ Show 20 Lines • Show All 158 Lines • ▼ Show 20 Lines	switch(getLexer().getKind()) {

case AsmToken::Identifier:		case AsmToken::Identifier:
do {		do {
if (parseCnt(CntVal))		if (parseCnt(CntVal))
return MatchOperand_ParseFail;		return MatchOperand_ParseFail;
} while(getLexer().isNot(AsmToken::EndOfStatement));		} while(getLexer().isNot(AsmToken::EndOfStatement));
break;		break;
}		}
Operands.push_back(AMDGPUOperand::CreateImm(CntVal, S));		Operands.push_back(AMDGPUOperand::CreateImm(this, CntVal, S));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

bool AMDGPUAsmParser::parseHwregConstruct(OperandInfoTy &HwReg, int64_t &Offset, int64_t &Width) {		bool AMDGPUAsmParser::parseHwregConstruct(OperandInfoTy &HwReg, int64_t &Offset, int64_t &Width) {
using namespace llvm::AMDGPU::Hwreg;		using namespace llvm::AMDGPU::Hwreg;

if (Parser.getTok().getString() != "hwreg")		if (Parser.getTok().getString() != "hwreg")
return true;		return true;
▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	case AsmToken::Identifier: {
if (Offset < 0 \|\| !isUInt<OFFSET_WIDTH_>(Offset))		if (Offset < 0 \|\| !isUInt<OFFSET_WIDTH_>(Offset))
Error(S, "invalid bit offset: only 5-bit values are legal");		Error(S, "invalid bit offset: only 5-bit values are legal");
if ((Width-1) < 0 \|\| !isUInt<WIDTH_M1_WIDTH_>(Width-1))		if ((Width-1) < 0 \|\| !isUInt<WIDTH_M1_WIDTH_>(Width-1))
Error(S, "invalid bitfield width: only values from 1 to 32 are legal");		Error(S, "invalid bitfield width: only values from 1 to 32 are legal");
Imm16Val = (HwReg.Id << ID_SHIFT_) \| (Offset << OFFSET_SHIFT_) \| ((Width-1) << WIDTH_M1_SHIFT_);		Imm16Val = (HwReg.Id << ID_SHIFT_) \| (Offset << OFFSET_SHIFT_) \| ((Width-1) << WIDTH_M1_SHIFT_);
}		}
break;		break;
}		}
Operands.push_back(AMDGPUOperand::CreateImm(Imm16Val, S, AMDGPUOperand::ImmTyHwreg));		Operands.push_back(AMDGPUOperand::CreateImm(this, Imm16Val, S, AMDGPUOperand::ImmTyHwreg));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

bool AMDGPUOperand::isSWaitCnt() const {		bool AMDGPUOperand::isSWaitCnt() const {
return isImm();		return isImm();
}		}

bool AMDGPUOperand::isHwreg() const {		bool AMDGPUOperand::isHwreg() const {
▲ Show 20 Lines • Show All 166 Lines • ▼ Show 20 Lines	case AsmToken::Identifier: {
break;		break;
}		}
Imm16Val \|= (StreamId << STREAM_ID_SHIFT_);		Imm16Val \|= (StreamId << STREAM_ID_SHIFT_);
}		}
} while (0);		} while (0);
}		}
break;		break;
}		}
Operands.push_back(AMDGPUOperand::CreateImm(Imm16Val, S, AMDGPUOperand::ImmTySendMsg));		Operands.push_back(AMDGPUOperand::CreateImm(this, Imm16Val, S, AMDGPUOperand::ImmTySendMsg));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

bool AMDGPUOperand::isSendMsg() const {		bool AMDGPUOperand::isSendMsg() const {
return isImmTy(ImmTySendMsg);		return isImmTy(ImmTySendMsg);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// sopp branch targets		// sopp branch targets
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

AMDGPUAsmParser::OperandMatchResultTy		AMDGPUAsmParser::OperandMatchResultTy
AMDGPUAsmParser::parseSOppBrTarget(OperandVector &Operands) {		AMDGPUAsmParser::parseSOppBrTarget(OperandVector &Operands) {
SMLoc S = Parser.getTok().getLoc();		SMLoc S = Parser.getTok().getLoc();

switch (getLexer().getKind()) {		switch (getLexer().getKind()) {
default: return MatchOperand_ParseFail;		default: return MatchOperand_ParseFail;
case AsmToken::Integer: {		case AsmToken::Integer: {
int64_t Imm;		int64_t Imm;
if (getParser().parseAbsoluteExpression(Imm))		if (getParser().parseAbsoluteExpression(Imm))
return MatchOperand_ParseFail;		return MatchOperand_ParseFail;
Operands.push_back(AMDGPUOperand::CreateImm(Imm, S));		Operands.push_back(AMDGPUOperand::CreateImm(this, Imm, S));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

case AsmToken::Identifier:		case AsmToken::Identifier:
Operands.push_back(AMDGPUOperand::CreateExpr(		Operands.push_back(AMDGPUOperand::CreateExpr(this,
MCSymbolRefExpr::create(getContext().getOrCreateSymbol(		MCSymbolRefExpr::create(getContext().getOrCreateSymbol(
Parser.getTok().getString()), getContext()), S));		Parser.getTok().getString()), getContext()), S));
Parser.Lex();		Parser.Lex();
return MatchOperand_Success;		return MatchOperand_Success;
}		}
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// mubuf		// mubuf
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultGLC() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultGLC() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyGLC);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyGLC);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultSLC() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultSLC() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTySLC);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTySLC);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultTFE() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultTFE() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyTFE);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyTFE);
}		}

void AMDGPUAsmParser::cvtMubufImpl(MCInst &Inst,		void AMDGPUAsmParser::cvtMubufImpl(MCInst &Inst,
const OperandVector &Operands,		const OperandVector &Operands,
bool IsAtomic, bool IsAtomicReturn) {		bool IsAtomic, bool IsAtomicReturn) {
OptionalImmIndexMap OptionalIdx;		OptionalImmIndexMap OptionalIdx;
assert(IsAtomicReturn ? IsAtomic : true);		assert(IsAtomicReturn ? IsAtomic : true);

▲ Show 20 Lines • Show All 106 Lines • ▼ Show 20 Lines	void AMDGPUAsmParser::cvtMIMGAtomic(MCInst &Inst, const OperandVector &Operands) {
addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTyDA);		addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTyDA);
addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTyR128);		addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTyR128);
addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTyTFE);		addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTyTFE);
addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTyLWE);		addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTyLWE);
addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTySLC);		addOptionalImmOperand(Inst, Operands, OptionalIdx, AMDGPUOperand::ImmTySLC);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultDMask() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultDMask() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyDMask);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyDMask);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultUNorm() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultUNorm() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyUNorm);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyUNorm);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultDA() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultDA() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyDA);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyDA);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultR128() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultR128() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyR128);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyR128);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultLWE() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultLWE() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyLWE);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyLWE);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// smrd		// smrd
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

bool AMDGPUOperand::isSMRDOffset() const {		bool AMDGPUOperand::isSMRDOffset() const {

// FIXME: Support 20-bit offsets on VI. We need to to pass subtarget		// FIXME: Support 20-bit offsets on VI. We need to to pass subtarget
// information here.		// information here.
return isImm() && isUInt<8>(getImm());		return isImm() && isUInt<8>(getImm());
}		}

bool AMDGPUOperand::isSMRDLiteralOffset() const {		bool AMDGPUOperand::isSMRDLiteralOffset() const {
// 32-bit literals are only supported on CI and we only want to use them		// 32-bit literals are only supported on CI and we only want to use them
// when the offset is > 8-bits.		// when the offset is > 8-bits.
return isImm() && !isUInt<8>(getImm()) && isUInt<32>(getImm());		return isImm() && !isUInt<8>(getImm()) && isUInt<32>(getImm());
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultSMRDOffset() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultSMRDOffset() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyOffset);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyOffset);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultSMRDLiteralOffset() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultSMRDLiteralOffset() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyOffset);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyOffset);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// vop3		// vop3
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static bool ConvertOmodMul(int64_t &Mul) {		static bool ConvertOmodMul(int64_t &Mul) {
if (Mul != 1 && Mul != 2 && Mul != 4)		if (Mul != 1 && Mul != 2 && Mul != 4)
▲ Show 20 Lines • Show All 116 Lines • ▼ Show 20 Lines	void AMDGPUAsmParser::cvtVOP3(MCInst &Inst, const OperandVector &Operands) {
unsigned I = 1;		unsigned I = 1;
const MCInstrDesc &Desc = MII.get(Inst.getOpcode());		const MCInstrDesc &Desc = MII.get(Inst.getOpcode());
for (unsigned J = 0; J < Desc.getNumDefs(); ++J) {		for (unsigned J = 0; J < Desc.getNumDefs(); ++J) {
((AMDGPUOperand &)*Operands[I++]).addRegOperands(Inst, 1);		((AMDGPUOperand &)*Operands[I++]).addRegOperands(Inst, 1);
}		}

for (unsigned E = Operands.size(); I != E; ++I) {		for (unsigned E = Operands.size(); I != E; ++I) {
AMDGPUOperand &Op = ((AMDGPUOperand &)*Operands[I]);		AMDGPUOperand &Op = ((AMDGPUOperand &)*Operands[I]);
if (Op.isRegOrImmWithInputMods()) {		if (Desc.OpInfo[Inst.getNumOperands()].OperandType == AMDGPU::OPERAND_INPUT_MODS) {
// only fp modifiers allowed in VOP3
Op.addRegOrImmWithFPInputModsOperands(Inst, 2);		Op.addRegOrImmWithFPInputModsOperands(Inst, 2);
} else if (Op.isImm()) {		} else if (Op.isImm()) {
OptionalIdx[Op.getImmTy()] = I;		OptionalIdx[Op.getImmTy()] = I;
} else {		} else {
assert(false);		assert(false);
}		}
}		}

▲ Show 20 Lines • Show All 129 Lines • ▼ Show 20 Lines	if (Prefix == "quad_perm") {
}		}
} else {		} else {
return MatchOperand_ParseFail;		return MatchOperand_ParseFail;
}		}
}		}
}		}
Parser.Lex(); // eat last token		Parser.Lex(); // eat last token

Operands.push_back(AMDGPUOperand::CreateImm(Int, S,		Operands.push_back(AMDGPUOperand::CreateImm(this, Int, S, AMDGPUOperand::ImmTyDppCtrl));
AMDGPUOperand::ImmTyDppCtrl));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultRowMask() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultRowMask() const {
return AMDGPUOperand::CreateImm(0xf, SMLoc(), AMDGPUOperand::ImmTyDppRowMask);		return AMDGPUOperand::CreateImm(this, 0xf, SMLoc(), AMDGPUOperand::ImmTyDppRowMask);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultBankMask() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultBankMask() const {
return AMDGPUOperand::CreateImm(0xf, SMLoc(), AMDGPUOperand::ImmTyDppBankMask);		return AMDGPUOperand::CreateImm(this, 0xf, SMLoc(), AMDGPUOperand::ImmTyDppBankMask);
}		}

AMDGPUOperand::Ptr AMDGPUAsmParser::defaultBoundCtrl() const {		AMDGPUOperand::Ptr AMDGPUAsmParser::defaultBoundCtrl() const {
return AMDGPUOperand::CreateImm(0, SMLoc(), AMDGPUOperand::ImmTyDppBoundCtrl);		return AMDGPUOperand::CreateImm(this, 0, SMLoc(), AMDGPUOperand::ImmTyDppBoundCtrl);
}		}

void AMDGPUAsmParser::cvtDPP(MCInst &Inst, const OperandVector &Operands) {		void AMDGPUAsmParser::cvtDPP(MCInst &Inst, const OperandVector &Operands) {
OptionalImmIndexMap OptionalIdx;		OptionalImmIndexMap OptionalIdx;

unsigned I = 1;		unsigned I = 1;
const MCInstrDesc &Desc = MII.get(Inst.getOpcode());		const MCInstrDesc &Desc = MII.get(Inst.getOpcode());
for (unsigned J = 0; J < Desc.getNumDefs(); ++J) {		for (unsigned J = 0; J < Desc.getNumDefs(); ++J) {
((AMDGPUOperand &)*Operands[I++]).addRegOperands(Inst, 1);		((AMDGPUOperand &)*Operands[I++]).addRegOperands(Inst, 1);
}		}

for (unsigned E = Operands.size(); I != E; ++I) {		for (unsigned E = Operands.size(); I != E; ++I) {
AMDGPUOperand &Op = ((AMDGPUOperand &)*Operands[I]);		AMDGPUOperand &Op = ((AMDGPUOperand &)*Operands[I]);
// Add the register arguments		// Add the register arguments
if (Op.isRegOrImmWithInputMods()) {		if (Desc.OpInfo[Inst.getNumOperands()].OperandType == AMDGPU::OPERAND_INPUT_MODS) {
// Only float modifiers supported in DPP
Op.addRegOrImmWithFPInputModsOperands(Inst, 2);		Op.addRegOrImmWithFPInputModsOperands(Inst, 2);
} else if (Op.isDPPCtrl()) {		} else if (Op.isDPPCtrl()) {
Op.addImmOperands(Inst, 1);		Op.addImmOperands(Inst, 1);
} else if (Op.isImm()) {		} else if (Op.isImm()) {
// Handle optional arguments		// Handle optional arguments
OptionalIdx[Op.getImmTy()] = I;		OptionalIdx[Op.getImmTy()] = I;
} else {		} else {
llvm_unreachable("Invalid operand type");		llvm_unreachable("Invalid operand type");
Show All 32 Lines	Int = StringSwitch<int64_t>(Value)
.Case("DWORD", 6)		.Case("DWORD", 6)
.Default(0xffffffff);		.Default(0xffffffff);
Parser.Lex(); // eat last token		Parser.Lex(); // eat last token

if (Int == 0xffffffff) {		if (Int == 0xffffffff) {
return MatchOperand_ParseFail;		return MatchOperand_ParseFail;
}		}

Operands.push_back(AMDGPUOperand::CreateImm(Int, S, Type));		Operands.push_back(AMDGPUOperand::CreateImm(this, Int, S, Type));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

AMDGPUAsmParser::OperandMatchResultTy		AMDGPUAsmParser::OperandMatchResultTy
AMDGPUAsmParser::parseSDWADstUnused(OperandVector &Operands) {		AMDGPUAsmParser::parseSDWADstUnused(OperandVector &Operands) {
SMLoc S = Parser.getTok().getLoc();		SMLoc S = Parser.getTok().getLoc();
StringRef Value;		StringRef Value;
AMDGPUAsmParser::OperandMatchResultTy res;		AMDGPUAsmParser::OperandMatchResultTy res;
Show All 10 Lines	Int = StringSwitch<int64_t>(Value)
.Case("UNUSED_PRESERVE", 2)		.Case("UNUSED_PRESERVE", 2)
.Default(0xffffffff);		.Default(0xffffffff);
Parser.Lex(); // eat last token		Parser.Lex(); // eat last token

if (Int == 0xffffffff) {		if (Int == 0xffffffff) {
return MatchOperand_ParseFail;		return MatchOperand_ParseFail;
}		}

Operands.push_back(AMDGPUOperand::CreateImm(Int, S,		Operands.push_back(AMDGPUOperand::CreateImm(this, Int, S, AMDGPUOperand::ImmTySdwaDstUnused));
AMDGPUOperand::ImmTySdwaDstUnused));
return MatchOperand_Success;		return MatchOperand_Success;
}		}

void AMDGPUAsmParser::cvtSdwaVOP1(MCInst &Inst, const OperandVector &Operands) {		void AMDGPUAsmParser::cvtSdwaVOP1(MCInst &Inst, const OperandVector &Operands) {
cvtSDWA(Inst, Operands, SIInstrFlags::VOP1);		cvtSDWA(Inst, Operands, SIInstrFlags::VOP1);
}		}

void AMDGPUAsmParser::cvtSdwaVOP2(MCInst &Inst, const OperandVector &Operands) {		void AMDGPUAsmParser::cvtSdwaVOP2(MCInst &Inst, const OperandVector &Operands) {
Show All 17 Lines	void AMDGPUAsmParser::cvtSDWA(MCInst &Inst, const OperandVector &Operands,
for (unsigned E = Operands.size(); I != E; ++I) {		for (unsigned E = Operands.size(); I != E; ++I) {
AMDGPUOperand &Op = ((AMDGPUOperand &)*Operands[I]);		AMDGPUOperand &Op = ((AMDGPUOperand &)*Operands[I]);
// Add the register arguments		// Add the register arguments
if (BasicInstType == SIInstrFlags::VOPC &&		if (BasicInstType == SIInstrFlags::VOPC &&
Op.isReg() &&		Op.isReg() &&
Op.Reg.RegNo == AMDGPU::VCC) {		Op.Reg.RegNo == AMDGPU::VCC) {
// VOPC sdwa use "vcc" token as dst. Skip it.		// VOPC sdwa use "vcc" token as dst. Skip it.
continue;		continue;
} else if (Op.isRegOrImmWithInputMods()) {		} else if (Desc.OpInfo[Inst.getNumOperands()].OperandType == AMDGPU::OPERAND_INPUT_MODS) {
Op.addRegOrImmWithInputModsOperands(Inst, 2);		Op.addRegOrImmWithInputModsOperands(Inst, 2);
} else if (Op.isImm()) {		} else if (Op.isImm()) {
// Handle optional arguments		// Handle optional arguments
OptionalIdx[Op.getImmTy()] = I;		OptionalIdx[Op.getImmTy()] = I;
} else {		} else {
llvm_unreachable("Invalid operand type");		llvm_unreachable("Invalid operand type");
}		}
}		}

▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines	unsigned AMDGPUAsmParser::validateTargetOperandClass(MCParsedAsmOperand &Op,
case MCK_gds:		case MCK_gds:
return Operand.isGDS() ? Match_Success : Match_InvalidOperand;		return Operand.isGDS() ? Match_Success : Match_InvalidOperand;
case MCK_glc:		case MCK_glc:
return Operand.isGLC() ? Match_Success : Match_InvalidOperand;		return Operand.isGLC() ? Match_Success : Match_InvalidOperand;
case MCK_idxen:		case MCK_idxen:
return Operand.isIdxen() ? Match_Success : Match_InvalidOperand;		return Operand.isIdxen() ? Match_Success : Match_InvalidOperand;
case MCK_offen:		case MCK_offen:
return Operand.isOffen() ? Match_Success : Match_InvalidOperand;		return Operand.isOffen() ? Match_Success : Match_InvalidOperand;
case MCK_SSrc32:		case MCK_SSrcB32:
// When operands have expression values, they will return true for isToken,		// When operands have expression values, they will return true for isToken,
// because it is not possible to distinguish between a token and an		// because it is not possible to distinguish between a token and an
// expression at parse time. MatchInstructionImpl() will always try to		// expression at parse time. MatchInstructionImpl() will always try to
// match an operand as a token, when isToken returns true, and when the		// match an operand as a token, when isToken returns true, and when the
// name of the expression is not a valid token, the match will fail,		// name of the expression is not a valid token, the match will fail,
// so we need to handle it here.		// so we need to handle it here.
return Operand.isSSrc32() ? Match_Success : Match_InvalidOperand;		return Operand.isSSrcB32() ? Match_Success : Match_InvalidOperand;
		case MCK_SSrcF32:
		return Operand.isSSrcF32() ? Match_Success : Match_InvalidOperand;
case MCK_SoppBrTarget:		case MCK_SoppBrTarget:
return Operand.isSoppBrTarget() ? Match_Success : Match_InvalidOperand;		return Operand.isSoppBrTarget() ? Match_Success : Match_InvalidOperand;
default: return Match_InvalidOperand;		default: return Match_InvalidOperand;
}		}
}		}

llvm/trunk/lib/Target/AMDGPU/InstPrinter/AMDGPUInstPrinter.cpp

Show First 20 Lines • Show All 353 Lines • ▼ Show 20 Lines	else if (Imm == DoubleToBits(2.0))
O << "2.0";		O << "2.0";
else if (Imm == DoubleToBits(-2.0))		else if (Imm == DoubleToBits(-2.0))
O << "-2.0";		O << "-2.0";
else if (Imm == DoubleToBits(4.0))		else if (Imm == DoubleToBits(4.0))
O << "4.0";		O << "4.0";
else if (Imm == DoubleToBits(-4.0))		else if (Imm == DoubleToBits(-4.0))
O << "-4.0";		O << "-4.0";
else {		else {
assert(isUInt<32>(Imm));		assert(isUInt<32>(Imm) \|\| Imm == 0x3fc45f306dc9c882);

// In rare situations, we will have a 32-bit literal in a 64-bit		// In rare situations, we will have a 32-bit literal in a 64-bit
// operand. This is technically allowed for the encoding of s_mov_b64.		// operand. This is technically allowed for the encoding of s_mov_b64.
O << formatHex(static_cast<uint64_t>(Imm));		O << formatHex(static_cast<uint64_t>(Imm));
}		}
}		}

void AMDGPUInstPrinter::printOperand(const MCInst *MI, unsigned OpNo,		void AMDGPUInstPrinter::printOperand(const MCInst *MI, unsigned OpNo,
▲ Show 20 Lines • Show All 516 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AMDGPU/MCTargetDesc/SIMCCodeEmitter.cpp

Show All 12 Lines
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "AMDGPU.h"		#include "AMDGPU.h"
#include "MCTargetDesc/AMDGPUFixupKinds.h"		#include "MCTargetDesc/AMDGPUFixupKinds.h"
#include "MCTargetDesc/AMDGPUMCCodeEmitter.h"		#include "MCTargetDesc/AMDGPUMCCodeEmitter.h"
#include "MCTargetDesc/AMDGPUMCTargetDesc.h"		#include "MCTargetDesc/AMDGPUMCTargetDesc.h"
#include "SIDefines.h"		#include "SIDefines.h"
		#include "Utils/AMDGPUBaseInfo.h"
#include "llvm/MC/MCCodeEmitter.h"		#include "llvm/MC/MCCodeEmitter.h"
#include "llvm/MC/MCContext.h"		#include "llvm/MC/MCContext.h"
#include "llvm/MC/MCFixup.h"		#include "llvm/MC/MCFixup.h"
#include "llvm/MC/MCInst.h"		#include "llvm/MC/MCInst.h"
#include "llvm/MC/MCInstrInfo.h"		#include "llvm/MC/MCInstrInfo.h"
#include "llvm/MC/MCRegisterInfo.h"		#include "llvm/MC/MCRegisterInfo.h"
#include "llvm/MC/MCSubtargetInfo.h"		#include "llvm/MC/MCSubtargetInfo.h"
#include "llvm/MC/MCSymbol.h"		#include "llvm/MC/MCSymbol.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"

using namespace llvm;		using namespace llvm;

namespace {		namespace {

class SIMCCodeEmitter : public AMDGPUMCCodeEmitter {		class SIMCCodeEmitter : public AMDGPUMCCodeEmitter {
SIMCCodeEmitter(const SIMCCodeEmitter &) = delete;		SIMCCodeEmitter(const SIMCCodeEmitter &) = delete;
void operator=(const SIMCCodeEmitter &) = delete;		void operator=(const SIMCCodeEmitter &) = delete;
const MCInstrInfo &MCII;		const MCInstrInfo &MCII;
const MCRegisterInfo &MRI;		const MCRegisterInfo &MRI;

/// \brief Can this operand also contain immediate values?
bool isSrcOperand(const MCInstrDesc &Desc, unsigned OpNo) const;

/// \brief Encode an fp or int literal		/// \brief Encode an fp or int literal
uint32_t getLitEncoding(const MCOperand &MO, unsigned OpSize) const;		uint32_t getLitEncoding(const MCOperand &MO, unsigned OpSize,
		const MCSubtargetInfo &STI) const;

public:		public:
SIMCCodeEmitter(const MCInstrInfo &mcii, const MCRegisterInfo &mri,		SIMCCodeEmitter(const MCInstrInfo &mcii, const MCRegisterInfo &mri,
MCContext &ctx)		MCContext &ctx)
: MCII(mcii), MRI(mri) { }		: MCII(mcii), MRI(mri) { }

~SIMCCodeEmitter() override {}		~SIMCCodeEmitter() override {}

Show All 17 Lines
} // End anonymous namespace		} // End anonymous namespace

MCCodeEmitter *llvm::createSIMCCodeEmitter(const MCInstrInfo &MCII,		MCCodeEmitter *llvm::createSIMCCodeEmitter(const MCInstrInfo &MCII,
const MCRegisterInfo &MRI,		const MCRegisterInfo &MRI,
MCContext &Ctx) {		MCContext &Ctx) {
return new SIMCCodeEmitter(MCII, MRI, Ctx);		return new SIMCCodeEmitter(MCII, MRI, Ctx);
}		}

bool SIMCCodeEmitter::isSrcOperand(const MCInstrDesc &Desc,
unsigned OpNo) const {
unsigned OpType = Desc.OpInfo[OpNo].OperandType;

return OpType == AMDGPU::OPERAND_REG_IMM32 \|\|
OpType == AMDGPU::OPERAND_REG_INLINE_C;
}

// Returns the encoding value to use if the given integer is an integer inline		// Returns the encoding value to use if the given integer is an integer inline
// immediate value, or 0 if it is not.		// immediate value, or 0 if it is not.
template <typename IntTy>		template <typename IntTy>
static uint32_t getIntInlineImmEncoding(IntTy Imm) {		static uint32_t getIntInlineImmEncoding(IntTy Imm) {
if (Imm >= 0 && Imm <= 64)		if (Imm >= 0 && Imm <= 64)
return 128 + Imm;		return 128 + Imm;

if (Imm >= -16 && Imm <= -1)		if (Imm >= -16 && Imm <= -1)
return 192 + std::abs(Imm);		return 192 + std::abs(Imm);

return 0;		return 0;
}		}

static uint32_t getLit32Encoding(uint32_t Val) {		static uint32_t getLit32Encoding(uint32_t Val, const MCSubtargetInfo &STI) {
uint32_t IntImm = getIntInlineImmEncoding(static_cast<int32_t>(Val));		uint32_t IntImm = getIntInlineImmEncoding(static_cast<int32_t>(Val));
if (IntImm != 0)		if (IntImm != 0)
return IntImm;		return IntImm;

if (Val == FloatToBits(0.5f))		if (Val == FloatToBits(0.5f))
return 240;		return 240;

if (Val == FloatToBits(-0.5f))		if (Val == FloatToBits(-0.5f))
Show All 12 Lines	if (Val == FloatToBits(-2.0f))
return 245;		return 245;

if (Val == FloatToBits(4.0f))		if (Val == FloatToBits(4.0f))
return 246;		return 246;

if (Val == FloatToBits(-4.0f))		if (Val == FloatToBits(-4.0f))
return 247;		return 247;

		if (AMDGPU::isVI(STI) && Val == 0x3e22f983) // 1/(2*pi)
		return 248;

return 255;		return 255;
}		}

static uint32_t getLit64Encoding(uint64_t Val) {		static uint32_t getLit64Encoding(uint64_t Val, const MCSubtargetInfo &STI) {
uint32_t IntImm = getIntInlineImmEncoding(static_cast<int64_t>(Val));		uint32_t IntImm = getIntInlineImmEncoding(static_cast<int64_t>(Val));
if (IntImm != 0)		if (IntImm != 0)
return IntImm;		return IntImm;

if (Val == DoubleToBits(0.5))		if (Val == DoubleToBits(0.5))
return 240;		return 240;

if (Val == DoubleToBits(-0.5))		if (Val == DoubleToBits(-0.5))
Show All 12 Lines	if (Val == DoubleToBits(-2.0))
return 245;		return 245;

if (Val == DoubleToBits(4.0))		if (Val == DoubleToBits(4.0))
return 246;		return 246;

if (Val == DoubleToBits(-4.0))		if (Val == DoubleToBits(-4.0))
return 247;		return 247;

		if (AMDGPU::isVI(STI) && Val == 0x3fc45f306dc9c882) // 1/(2*pi)
		return 248;

return 255;		return 255;
}		}

uint32_t SIMCCodeEmitter::getLitEncoding(const MCOperand &MO,		uint32_t SIMCCodeEmitter::getLitEncoding(const MCOperand &MO,
unsigned OpSize) const {		unsigned OpSize,
		const MCSubtargetInfo &STI) const {

int64_t Imm;		int64_t Imm;
if (MO.isExpr()) {		if (MO.isExpr()) {
const MCConstantExpr *C = dyn_cast<MCConstantExpr>(MO.getExpr());		const MCConstantExpr *C = dyn_cast<MCConstantExpr>(MO.getExpr());
if (!C)		if (!C)
return 255;		return 255;

Imm = C->getValue();		Imm = C->getValue();
} else {		} else {

assert(!MO.isFPImm());		assert(!MO.isFPImm());

if (!MO.isImm())		if (!MO.isImm())
return ~0;		return ~0;

Imm = MO.getImm();		Imm = MO.getImm();
}		}

if (OpSize == 4)		if (OpSize == 4)
return getLit32Encoding(static_cast<uint32_t>(Imm));		return getLit32Encoding(static_cast<uint32_t>(Imm), STI);

assert(OpSize == 8);		assert(OpSize == 8);

return getLit64Encoding(static_cast<uint64_t>(Imm));		return getLit64Encoding(static_cast<uint64_t>(Imm), STI);
}		}

void SIMCCodeEmitter::encodeInstruction(const MCInst &MI, raw_ostream &OS,		void SIMCCodeEmitter::encodeInstruction(const MCInst &MI, raw_ostream &OS,
SmallVectorImpl<MCFixup> &Fixups,		SmallVectorImpl<MCFixup> &Fixups,
const MCSubtargetInfo &STI) const {		const MCSubtargetInfo &STI) const {

uint64_t Encoding = getBinaryCodeForInstr(MI, Fixups, STI);		uint64_t Encoding = getBinaryCodeForInstr(MI, Fixups, STI);
const MCInstrDesc &Desc = MCII.get(MI.getOpcode());		const MCInstrDesc &Desc = MCII.get(MI.getOpcode());
unsigned bytes = Desc.getSize();		unsigned bytes = Desc.getSize();

for (unsigned i = 0; i < bytes; i++) {		for (unsigned i = 0; i < bytes; i++) {
OS.write((uint8_t) ((Encoding >> (8 * i)) & 0xff));		OS.write((uint8_t) ((Encoding >> (8 * i)) & 0xff));
}		}

if (bytes > 4)		if (bytes > 4)
return;		return;

// Check for additional literals in SRC0/1/2 (Op 1/2/3)		// Check for additional literals in SRC0/1/2 (Op 1/2/3)
for (unsigned i = 0, e = MI.getNumOperands(); i < e; ++i) {		for (unsigned i = 0, e = MI.getNumOperands(); i < e; ++i) {

// Check if this operand should be encoded as [SV]Src		// Check if this operand should be encoded as [SV]Src
if (!isSrcOperand(Desc, i))		if (!AMDGPU::isSISrcOperand(Desc, i))
continue;		continue;

int RCID = Desc.OpInfo[i].RegClass;		int RCID = Desc.OpInfo[i].RegClass;
const MCRegisterClass &RC = MRI.getRegClass(RCID);		const MCRegisterClass &RC = MRI.getRegClass(RCID);

// Is this operand a literal immediate?		// Is this operand a literal immediate?
const MCOperand &Op = MI.getOperand(i);		const MCOperand &Op = MI.getOperand(i);
if (getLitEncoding(Op, RC.getSize()) != 255)		if (getLitEncoding(Op, RC.getSize(), STI) != 255)
continue;		continue;

// Yes! Encode it		// Yes! Encode it
int64_t Imm = 0;		int64_t Imm = 0;

if (Op.isImm())		if (Op.isImm())
Imm = Op.getImm();		Imm = Op.getImm();
else if (Op.isExpr()) {		else if (Op.isExpr()) {
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	uint64_t SIMCCodeEmitter::getMachineOpValue(const MCInst &MI,
// Figure out the operand number, needed for isSrcOperand check		// Figure out the operand number, needed for isSrcOperand check
unsigned OpNo = 0;		unsigned OpNo = 0;
for (unsigned e = MI.getNumOperands(); OpNo < e; ++OpNo) {		for (unsigned e = MI.getNumOperands(); OpNo < e; ++OpNo) {
if (&MO == &MI.getOperand(OpNo))		if (&MO == &MI.getOperand(OpNo))
break;		break;
}		}

const MCInstrDesc &Desc = MCII.get(MI.getOpcode());		const MCInstrDesc &Desc = MCII.get(MI.getOpcode());
if (isSrcOperand(Desc, OpNo)) {		if (AMDGPU::isSISrcOperand(Desc, OpNo)) {
int RCID = Desc.OpInfo[OpNo].RegClass;		uint32_t Enc = getLitEncoding(MO,
const MCRegisterClass &RC = MRI.getRegClass(RCID);		AMDGPU::getRegOperandSize(&MRI, Desc, OpNo),
		STI);
uint32_t Enc = getLitEncoding(MO, RC.getSize());
if (Enc != ~0U && (Enc != 255 \|\| Desc.getSize() == 4))		if (Enc != ~0U && (Enc != 255 \|\| Desc.getSize() == 4))
return Enc;		return Enc;

} else if (MO.isImm())		} else if (MO.isImm())
return MO.getImm();		return MO.getImm();

llvm_unreachable("Encoding of this operand type is not supported yet.");		llvm_unreachable("Encoding of this operand type is not supported yet.");
return 0;		return 0;
}		}

llvm/trunk/lib/Target/AMDGPU/SIDefines.h

Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	enum {
Gather4 = 1 << 25,		Gather4 = 1 << 25,
DisableWQM = 1 << 26		DisableWQM = 1 << 26
};		};
}		}

namespace llvm {		namespace llvm {
namespace AMDGPU {		namespace AMDGPU {
enum OperandType {		enum OperandType {
/// Operand with register or 32-bit immediate		/// Operands with register or 32-bit immediate
OPERAND_REG_IMM32 = MCOI::OPERAND_FIRST_TARGET,		OPERAND_REG_IMM32_INT = MCOI::OPERAND_FIRST_TARGET,
/// Operand with register or inline constant		OPERAND_REG_IMM32_FP,
OPERAND_REG_INLINE_C,		/// Operands with register or inline constant
		OPERAND_REG_INLINE_C_INT,
/// Operand with 32-bit immediate that uses the constant bus. The standard		OPERAND_REG_INLINE_C_FP,
/// OPERAND_IMMEDIATE should be used for special immediates such as source
/// modifiers.		// Operand for source modifiers for VOP instructions
		OPERAND_INPUT_MODS,

		/// Operand with 32-bit immediate that uses the constant bus.
OPERAND_KIMM32		OPERAND_KIMM32
};		};
}		}
}		}

namespace SIInstrFlags {		namespace SIInstrFlags {
enum Flags {		enum Flags {
// First 4 bits are the instruction encoding		// First 4 bits are the instruction encoding
▲ Show 20 Lines • Show All 257 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AMDGPU/SIInstrInfo.cpp

Show First 20 Lines • Show All 1,683 Lines • ▼ Show 20 Lines	for (int i = 0, e = Desc.getNumOperands(); i != e; ++i) {

switch (Desc.OpInfo[i].OperandType) {		switch (Desc.OpInfo[i].OperandType) {
case MCOI::OPERAND_REGISTER:		case MCOI::OPERAND_REGISTER:
if (MI.getOperand(i).isImm()) {		if (MI.getOperand(i).isImm()) {
ErrInfo = "Illegal immediate value for operand.";		ErrInfo = "Illegal immediate value for operand.";
return false;		return false;
}		}
break;		break;
case AMDGPU::OPERAND_REG_IMM32:		case AMDGPU::OPERAND_REG_IMM32_INT:
		case AMDGPU::OPERAND_REG_IMM32_FP:
break;		break;
case AMDGPU::OPERAND_REG_INLINE_C:		case AMDGPU::OPERAND_REG_INLINE_C_INT:
		case AMDGPU::OPERAND_REG_INLINE_C_FP:
if (isLiteralConstant(MI.getOperand(i),		if (isLiteralConstant(MI.getOperand(i),
RI.getRegClass(RegClass)->getSize())) {		RI.getRegClass(RegClass)->getSize())) {
ErrInfo = "Illegal immediate value for operand.";		ErrInfo = "Illegal immediate value for operand.";
return false;		return false;
}		}
break;		break;
case MCOI::OPERAND_IMMEDIATE:		case MCOI::OPERAND_IMMEDIATE:
case AMDGPU::OPERAND_KIMM32:		case AMDGPU::OPERAND_KIMM32:
▲ Show 20 Lines • Show All 322 Lines • ▼ Show 20 Lines	bool SIInstrInfo::isLegalRegOperand(const MachineRegisterInfo &MRI,

const SIRegisterInfo *TRI =		const SIRegisterInfo *TRI =
static_cast<const SIRegisterInfo*>(MRI.getTargetRegisterInfo());		static_cast<const SIRegisterInfo*>(MRI.getTargetRegisterInfo());
RC = TRI->getSubRegClass(RC, MO.getSubReg());		RC = TRI->getSubRegClass(RC, MO.getSubReg());

// In order to be legal, the common sub-class must be equal to the		// In order to be legal, the common sub-class must be equal to the
// class of the current operand. For example:		// class of the current operand. For example:
//		//
// v_mov_b32 s0 ; Operand defined as vsrc_32		// v_mov_b32 s0 ; Operand defined as vsrc_b32
// ; RI.getCommonSubClass(s0,vsrc_32) = sgpr ; LEGAL		// ; RI.getCommonSubClass(s0,vsrc_b32) = sgpr ; LEGAL
//		//
// s_sendmsg 0, s0 ; Operand defined as m0reg		// s_sendmsg 0, s0 ; Operand defined as m0reg
// ; RI.getCommonSubClass(s0,m0reg) = m0reg ; NOT LEGAL		// ; RI.getCommonSubClass(s0,m0reg) = m0reg ; NOT LEGAL

return RI.getCommonSubClass(RC, RI.getRegClass(OpInfo.RegClass)) == RC;		return RI.getCommonSubClass(RC, RI.getRegClass(OpInfo.RegClass)) == RC;
}		}

bool SIInstrInfo::isLegalVSrcOperand(const MachineRegisterInfo &MRI,		bool SIInstrInfo::isLegalVSrcOperand(const MachineRegisterInfo &MRI,
▲ Show 20 Lines • Show All 1,174 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AMDGPU/SIInstrInfo.td

Show First 20 Lines • Show All 448 Lines • ▼ Show 20 Lines
def src1_sel : NamedOperandU32<"SDWASrc1Sel", NamedMatchClass<"SDWASrc1Sel">>;		def src1_sel : NamedOperandU32<"SDWASrc1Sel", NamedMatchClass<"SDWASrc1Sel">>;
def dst_unused : NamedOperandU32<"SDWADstUnused", NamedMatchClass<"SDWADstUnused">>;		def dst_unused : NamedOperandU32<"SDWADstUnused", NamedMatchClass<"SDWADstUnused">>;

def hwreg : NamedOperandU16<"Hwreg", NamedMatchClass<"Hwreg", 0>>;		def hwreg : NamedOperandU16<"Hwreg", NamedMatchClass<"Hwreg", 0>>;

} // End OperandType = "OPERAND_IMMEDIATE"		} // End OperandType = "OPERAND_IMMEDIATE"


		// 32-bit VALU immediate operand that uses the constant bus.
		def KImmFP32MatchClass : AsmOperandClass {
		let Name = "KImmFP32";
		let PredicateMethod = "isKImmFP32";
		let ParserMethod = "parseImm";
		let RenderMethod = "addKImmFP32Operands";
		}

		def f32kimm : Operand<i32> {
		let OperandNamespace = "AMDGPU";
		let OperandType = "OPERAND_KIMM32";
		let PrintMethod = "printU32ImmOperand";
		let ParserMatchClass = KImmFP32MatchClass;
		}

def VOPDstS64 : VOPDstOperand <SReg_64>;		def VOPDstS64 : VOPDstOperand <SReg_64>;

def FPInputModsMatchClass : AsmOperandClass {		class FPInputModsMatchClass <int opSize> : AsmOperandClass {
let Name = "RegOrImmWithFPInputMods";		let Name = "RegOrImmWithFP"#opSize#"InputMods";
let ParserMethod = "parseRegOrImmWithFPInputMods";		let ParserMethod = "parseRegOrImmWithFPInputMods";
let PredicateMethod = "isRegOrImmWithInputMods";		let PredicateMethod = "isRegOrImmWithFP"#opSize#"InputMods";
}		}
		def FP32InputModsMatchClass : FPInputModsMatchClass<32>;
		def FP64InputModsMatchClass : FPInputModsMatchClass<64>;

def FPInputMods : Operand <i32> {		class InputMods <AsmOperandClass matchClass> : Operand <i32> {
		let OperandNamespace = "AMDGPU";
		let OperandType = "OPERAND_INPUT_MODS";
		let ParserMatchClass = matchClass;
		}

		class FPInputMods <FPInputModsMatchClass matchClass> : InputMods <matchClass> {
let PrintMethod = "printOperandAndFPInputMods";		let PrintMethod = "printOperandAndFPInputMods";
let ParserMatchClass = FPInputModsMatchClass;
}		}
		def FP32InputMods : FPInputMods<FP32InputModsMatchClass>;
		def FP64InputMods : FPInputMods<FP64InputModsMatchClass>;

def IntInputModsMatchClass : AsmOperandClass {		class IntInputModsMatchClass <int opSize> : AsmOperandClass {
let Name = "RegOrImmWithIntInputMods";		let Name = "RegOrImmWithInt"#opSize#"InputMods";
let ParserMethod = "parseRegOrImmWithIntInputMods";		let ParserMethod = "parseRegOrImmWithIntInputMods";
let PredicateMethod = "isRegOrImmWithInputMods";		let PredicateMethod = "isRegOrImmWithInt"#opSize#"InputMods";
}		}
		def Int32InputModsMatchClass : IntInputModsMatchClass<32>;
		def Int64InputModsMatchClass : IntInputModsMatchClass<64>;

def IntInputMods: Operand <i32> {		class IntInputMods <IntInputModsMatchClass matchClass> : InputMods <matchClass> {
let PrintMethod = "printOperandAndIntInputMods";		let PrintMethod = "printOperandAndIntInputMods";
let ParserMatchClass = IntInputModsMatchClass;
}		}
		def Int32InputMods : IntInputMods<Int32InputModsMatchClass>;
		def Int64InputMods : IntInputMods<Int64InputModsMatchClass>;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Complex patterns		// Complex patterns
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def DS1Addr1Offset : ComplexPattern<i32, 2, "SelectDS1Addr1Offset">;		def DS1Addr1Offset : ComplexPattern<i32, 2, "SelectDS1Addr1Offset">;
def DS64Bit4ByteAligned : ComplexPattern<i32, 3, "SelectDS64Bit4ByteAligned">;		def DS64Bit4ByteAligned : ComplexPattern<i32, 3, "SelectDS64Bit4ByteAligned">;

▲ Show 20 Lines • Show All 112 Lines • ▼ Show 20 Lines	RegisterOperand ret = !if(!eq(VT.Size, 32), VOPDstOperand<VGPR_32>,
!if(!eq(VT.Size, 64), VOPDstOperand<VReg_64>,		!if(!eq(VT.Size, 64), VOPDstOperand<VReg_64>,
!if(!eq(VT.Size, 16), VOPDstOperand<VGPR_32>,		!if(!eq(VT.Size, 16), VOPDstOperand<VGPR_32>,
VOPDstOperand<SReg_64>))); // else VT == i1		VOPDstOperand<SReg_64>))); // else VT == i1
}		}

// Returns the register class to use for source 0 of VOP[12C]		// Returns the register class to use for source 0 of VOP[12C]
// instructions for the given VT.		// instructions for the given VT.
class getVOPSrc0ForVT<ValueType VT> {		class getVOPSrc0ForVT<ValueType VT> {
RegisterOperand ret = !if(!eq(VT.Size, 64), VSrc_64, VSrc_32);		bit isFP = !if(!eq(VT.Value, f16.Value), 1,
		!if(!eq(VT.Value, f32.Value), 1,
		!if(!eq(VT.Value, f64.Value), 1,
		0)));
		RegisterOperand ret = !if(isFP,
		!if(!eq(VT.Size, 64), VSrc_f64, VSrc_f32),
		!if(!eq(VT.Size, 64), VSrc_b64, VSrc_b32));
}		}

// Returns the vreg register class to use for source operand given VT		// Returns the vreg register class to use for source operand given VT
class getVregSrcForVT<ValueType VT> {		class getVregSrcForVT<ValueType VT> {
RegisterClass ret = !if(!eq(VT.Size, 64), VReg_64, VGPR_32);		RegisterClass ret = !if(!eq(VT.Size, 64), VReg_64, VGPR_32);
}		}


// Returns the register class to use for sources of VOP3 instructions for the		// Returns the register class to use for sources of VOP3 instructions for the
// given VT.		// given VT.
class getVOP3SrcForVT<ValueType VT> {		class getVOP3SrcForVT<ValueType VT> {
		bit isFP = !if(!eq(VT.Value, f16.Value), 1,
		!if(!eq(VT.Value, f32.Value), 1,
		!if(!eq(VT.Value, f64.Value), 1,
		0)));
RegisterOperand ret =		RegisterOperand ret =
!if(!eq(VT.Size, 64),		!if(!eq(VT.Size, 64),
VCSrc_64,		!if(isFP,
		VCSrc_f64,
		VCSrc_b64),
!if(!eq(VT.Value, i1.Value),		!if(!eq(VT.Value, i1.Value),
SCSrc_64,		SCSrc_b64,
VCSrc_32		!if(isFP,
		VCSrc_f32,
		VCSrc_b32)
)		)
);		);
}		}

// Returns 1 if the source arguments have modifiers, 0 if they do not.		// Returns 1 if the source arguments have modifiers, 0 if they do not.
// XXX - do f16 instructions?		// XXX - do f16 instructions?
class hasModifiers<ValueType SrcVT> {		class hasModifiers<ValueType SrcVT> {
bit ret =		bit ret =
!if(!eq(SrcVT.Value, f32.Value), 1,		!if(!eq(SrcVT.Value, f32.Value), 1,
!if(!eq(SrcVT.Value, f64.Value), 1,		!if(!eq(SrcVT.Value, f64.Value), 1,
0));		0));
}		}

		// Return type of input modifiers operand for specified input operand
		class getSrcMod <ValueType VT> {
		bit isFP = !if(!eq(VT.Value, f16.Value), 1,
		!if(!eq(VT.Value, f32.Value), 1,
		!if(!eq(VT.Value, f64.Value), 1,
		0)));
		Operand ret = !if(!eq(VT.Size, 64),
		!if(isFP, FP64InputMods, Int64InputMods),
		!if(isFP, FP32InputMods, Int32InputMods));
		}

// Returns the input arguments for VOP[12C] instructions for the given SrcVT.		// Returns the input arguments for VOP[12C] instructions for the given SrcVT.
class getIns32 <RegisterOperand Src0RC, RegisterClass Src1RC, int NumSrcArgs> {		class getIns32 <RegisterOperand Src0RC, RegisterClass Src1RC, int NumSrcArgs> {
dag ret = !if(!eq(NumSrcArgs, 1), (ins Src0RC:$src0), // VOP1		dag ret = !if(!eq(NumSrcArgs, 1), (ins Src0RC:$src0), // VOP1
!if(!eq(NumSrcArgs, 2), (ins Src0RC:$src0, Src1RC:$src1), // VOP2		!if(!eq(NumSrcArgs, 2), (ins Src0RC:$src0, Src1RC:$src1), // VOP2
(ins)));		(ins)));
}		}

// Returns the input arguments for VOP3 instructions for the given SrcVT.		// Returns the input arguments for VOP3 instructions for the given SrcVT.
class getIns64 <RegisterOperand Src0RC, RegisterOperand Src1RC,		class getIns64 <RegisterOperand Src0RC, RegisterOperand Src1RC,
RegisterOperand Src2RC, int NumSrcArgs,		RegisterOperand Src2RC, int NumSrcArgs,
bit HasModifiers> {		bit HasModifiers, Operand Src0Mod, Operand Src1Mod,
		Operand Src2Mod> {

dag ret =		dag ret =
!if (!eq(NumSrcArgs, 0),		!if (!eq(NumSrcArgs, 0),
// VOP1 without input operands (V_NOP, V_CLREXCP)		// VOP1 without input operands (V_NOP, V_CLREXCP)
(ins),		(ins),
/* else */		/* else */
!if (!eq(NumSrcArgs, 1),		!if (!eq(NumSrcArgs, 1),
!if (!eq(HasModifiers, 1),		!if (!eq(HasModifiers, 1),
// VOP1 with modifiers		// VOP1 with modifiers
(ins FPInputMods:$src0_modifiers, Src0RC:$src0,		(ins Src0Mod:$src0_modifiers, Src0RC:$src0,
clampmod:$clamp, omod:$omod)		clampmod:$clamp, omod:$omod)
/* else */,		/* else */,
// VOP1 without modifiers		// VOP1 without modifiers
(ins Src0RC:$src0)		(ins Src0RC:$src0)
/* endif */ ),		/* endif */ ),
!if (!eq(NumSrcArgs, 2),		!if (!eq(NumSrcArgs, 2),
!if (!eq(HasModifiers, 1),		!if (!eq(HasModifiers, 1),
// VOP 2 with modifiers		// VOP 2 with modifiers
(ins FPInputMods:$src0_modifiers, Src0RC:$src0,		(ins Src0Mod:$src0_modifiers, Src0RC:$src0,
FPInputMods:$src1_modifiers, Src1RC:$src1,		Src1Mod:$src1_modifiers, Src1RC:$src1,
clampmod:$clamp, omod:$omod)		clampmod:$clamp, omod:$omod)
/* else */,		/* else */,
// VOP2 without modifiers		// VOP2 without modifiers
(ins Src0RC:$src0, Src1RC:$src1)		(ins Src0RC:$src0, Src1RC:$src1)
/* endif */ )		/* endif */ )
/* NumSrcArgs == 3 */,		/* NumSrcArgs == 3 */,
!if (!eq(HasModifiers, 1),		!if (!eq(HasModifiers, 1),
// VOP3 with modifiers		// VOP3 with modifiers
(ins FPInputMods:$src0_modifiers, Src0RC:$src0,		(ins Src0Mod:$src0_modifiers, Src0RC:$src0,
FPInputMods:$src1_modifiers, Src1RC:$src1,		Src1Mod:$src1_modifiers, Src1RC:$src1,
FPInputMods:$src2_modifiers, Src2RC:$src2,		Src2Mod:$src2_modifiers, Src2RC:$src2,
clampmod:$clamp, omod:$omod)		clampmod:$clamp, omod:$omod)
/* else */,		/* else */,
// VOP3 without modifiers		// VOP3 without modifiers
(ins Src0RC:$src0, Src1RC:$src1, Src2RC:$src2)		(ins Src0RC:$src0, Src1RC:$src1, Src2RC:$src2)
/* endif */ ))));		/* endif */ ))));
}		}

class getInsDPP <RegisterClass Src0RC, RegisterClass Src1RC, int NumSrcArgs,		class getInsDPP <RegisterClass Src0RC, RegisterClass Src1RC, int NumSrcArgs,
bit HasModifiers> {		bit HasModifiers, Operand Src0Mod, Operand Src1Mod> {

dag ret = !if (!eq(NumSrcArgs, 0),		dag ret = !if (!eq(NumSrcArgs, 0),
// VOP1 without input operands (V_NOP)		// VOP1 without input operands (V_NOP)
(ins dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,		(ins dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,
bank_mask:$bank_mask, bound_ctrl:$bound_ctrl),		bank_mask:$bank_mask, bound_ctrl:$bound_ctrl),
!if (!eq(NumSrcArgs, 1),		!if (!eq(NumSrcArgs, 1),
!if (!eq(HasModifiers, 1),		!if (!eq(HasModifiers, 1),
// VOP1_DPP with modifiers		// VOP1_DPP with modifiers
(ins FPInputMods:$src0_modifiers, Src0RC:$src0,		(ins Src0Mod:$src0_modifiers, Src0RC:$src0,
dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,		dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,
bank_mask:$bank_mask, bound_ctrl:$bound_ctrl)		bank_mask:$bank_mask, bound_ctrl:$bound_ctrl)
/* else */,		/* else */,
// VOP1_DPP without modifiers		// VOP1_DPP without modifiers
(ins Src0RC:$src0, dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,		(ins Src0RC:$src0, dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,
bank_mask:$bank_mask, bound_ctrl:$bound_ctrl)		bank_mask:$bank_mask, bound_ctrl:$bound_ctrl)
/* endif */)		/* endif */)
/* NumSrcArgs == 2 */,		/* NumSrcArgs == 2 */,
!if (!eq(HasModifiers, 1),		!if (!eq(HasModifiers, 1),
// VOP2_DPP with modifiers		// VOP2_DPP with modifiers
(ins FPInputMods:$src0_modifiers, Src0RC:$src0,		(ins Src0Mod:$src0_modifiers, Src0RC:$src0,
FPInputMods:$src1_modifiers, Src1RC:$src1,		Src1Mod:$src1_modifiers, Src1RC:$src1,
dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,		dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,
bank_mask:$bank_mask, bound_ctrl:$bound_ctrl)		bank_mask:$bank_mask, bound_ctrl:$bound_ctrl)
/* else */,		/* else */,
// VOP2_DPP without modifiers		// VOP2_DPP without modifiers
(ins Src0RC:$src0, Src1RC:$src1, dpp_ctrl:$dpp_ctrl,		(ins Src0RC:$src0, Src1RC:$src1, dpp_ctrl:$dpp_ctrl,
row_mask:$row_mask, bank_mask:$bank_mask,		row_mask:$row_mask, bank_mask:$bank_mask,
bound_ctrl:$bound_ctrl)		bound_ctrl:$bound_ctrl)
/* endif */)));		/* endif */)));
}		}

class getInsSDWA <RegisterClass Src0RC, RegisterClass Src1RC, int NumSrcArgs,		class getInsSDWA <RegisterClass Src0RC, RegisterClass Src1RC, int NumSrcArgs,
bit HasFloatModifiers, ValueType DstVT> {		bit HasFloatModifiers, Operand Src0Mod, Operand Src1Mod,
		ValueType DstVT> {

dag ret = !if(!eq(NumSrcArgs, 0),		dag ret = !if(!eq(NumSrcArgs, 0),
// VOP1 without input operands (V_NOP)		// VOP1 without input operands (V_NOP)
(ins),		(ins),
!if(!eq(NumSrcArgs, 1),		!if(!eq(NumSrcArgs, 1),
!if(HasFloatModifiers,		!if(HasFloatModifiers,
// VOP1_SDWA with float modifiers		// VOP1_SDWA with float modifiers
(ins FPInputMods:$src0_fmodifiers, Src0RC:$src0,		(ins Src0Mod:$src0_fmodifiers, Src0RC:$src0,
clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,		clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,
src0_sel:$src0_sel)		src0_sel:$src0_sel),
/* else */,		// VOP1_SDWA with int modifiers
// VOP1_SDWA with sext modifier		(ins Src0Mod:$src0_imodifiers, Src0RC:$src0,
(ins IntInputMods:$src0_imodifiers, Src0RC:$src0,
clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,		clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,
src0_sel:$src0_sel)		src0_sel:$src0_sel))
/* endif */)
/* NumSrcArgs == 2 */,		/* NumSrcArgs == 2 */,
!if(HasFloatModifiers,		!if(HasFloatModifiers,
!if(!eq(DstVT.Size, 1),		!if(!eq(DstVT.Size, 1),
// VOPC_SDWA with float modifiers		// VOPC_SDWA with float modifiers
(ins FPInputMods:$src0_fmodifiers, Src0RC:$src0,		(ins Src0Mod:$src0_fmodifiers, Src0RC:$src0,
FPInputMods:$src1_fmodifiers, Src1RC:$src1,		Src1Mod:$src1_fmodifiers, Src1RC:$src1,
clampmod:$clamp, src0_sel:$src0_sel, src1_sel:$src1_sel),		clampmod:$clamp, src0_sel:$src0_sel, src1_sel:$src1_sel),
// VOP2_SDWA or VOPC_SDWA with float modifiers		// VOP2_SDWA or VOPC_SDWA with float modifiers
(ins FPInputMods:$src0_fmodifiers, Src0RC:$src0,		(ins Src0Mod:$src0_fmodifiers, Src0RC:$src0,
FPInputMods:$src1_fmodifiers, Src1RC:$src1,		Src1Mod:$src1_fmodifiers, Src1RC:$src1,
clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,		clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,
src0_sel:$src0_sel, src1_sel:$src1_sel)		src0_sel:$src0_sel, src1_sel:$src1_sel)),
),
/* else */
!if(!eq(DstVT.Size, 1),		!if(!eq(DstVT.Size, 1),
// VOPC_SDWA with sext modifiers		// VOPC_SDWA with int modifiers
(ins IntInputMods:$src0_imodifiers, Src0RC:$src0,		(ins Src0Mod:$src0_imodifiers, Src0RC:$src0,
IntInputMods:$src1_imodifiers, Src1RC:$src1,		Src1Mod:$src1_imodifiers, Src1RC:$src1,
clampmod:$clamp, src0_sel:$src0_sel, src1_sel:$src1_sel),		clampmod:$clamp, src0_sel:$src0_sel, src1_sel:$src1_sel),
// VOP2_SDWA or VOPC_SDWA with sext modifier		// VOP2_SDWA or VOPC_SDWA with int modifiers
(ins IntInputMods:$src0_imodifiers, Src0RC:$src0,		(ins Src0Mod:$src0_imodifiers, Src0RC:$src0,
IntInputMods:$src1_imodifiers, Src1RC:$src1,		Src1Mod:$src1_imodifiers, Src1RC:$src1,
clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,		clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,
src0_sel:$src0_sel, src1_sel:$src1_sel)		src0_sel:$src0_sel, src1_sel:$src1_sel))
)
/* endif */)));		/* endif */)));
}		}

// Outs for DPP and SDWA		// Outs for DPP and SDWA
class getOutsExt <bit HasDst, ValueType DstVT, RegisterOperand DstRCDPP> {		class getOutsExt <bit HasDst, ValueType DstVT, RegisterOperand DstRCDPP> {
dag ret = !if(HasDst,		dag ret = !if(HasDst,
!if(!eq(DstVT.Size, 1),		!if(!eq(DstVT.Size, 1),
(outs), // no dst for VOPC, we use "vcc"-token as dst in SDWA VOPC instructions		(outs), // no dst for VOPC, we use "vcc"-token as dst in SDWA VOPC instructions
▲ Show 20 Lines • Show All 108 Lines • ▼ Show 20 Lines	class VOPProfile <list<ValueType> _ArgVT> {
field RegisterClass Src1RC32 = getVregSrcForVT<Src1VT>.ret;		field RegisterClass Src1RC32 = getVregSrcForVT<Src1VT>.ret;
field RegisterOperand Src0RC64 = getVOP3SrcForVT<Src0VT>.ret;		field RegisterOperand Src0RC64 = getVOP3SrcForVT<Src0VT>.ret;
field RegisterOperand Src1RC64 = getVOP3SrcForVT<Src1VT>.ret;		field RegisterOperand Src1RC64 = getVOP3SrcForVT<Src1VT>.ret;
field RegisterOperand Src2RC64 = getVOP3SrcForVT<Src2VT>.ret;		field RegisterOperand Src2RC64 = getVOP3SrcForVT<Src2VT>.ret;
field RegisterClass Src0DPP = getVregSrcForVT<Src0VT>.ret;		field RegisterClass Src0DPP = getVregSrcForVT<Src0VT>.ret;
field RegisterClass Src1DPP = getVregSrcForVT<Src1VT>.ret;		field RegisterClass Src1DPP = getVregSrcForVT<Src1VT>.ret;
field RegisterClass Src0SDWA = getVregSrcForVT<Src0VT>.ret;		field RegisterClass Src0SDWA = getVregSrcForVT<Src0VT>.ret;
field RegisterClass Src1SDWA = getVregSrcForVT<Src1VT>.ret;		field RegisterClass Src1SDWA = getVregSrcForVT<Src1VT>.ret;
		field Operand Src0Mod = getSrcMod<Src0VT>.ret;
		field Operand Src1Mod = getSrcMod<Src1VT>.ret;
		field Operand Src2Mod = getSrcMod<Src2VT>.ret;

field bit HasDst = !if(!eq(DstVT.Value, untyped.Value), 0, 1);		field bit HasDst = !if(!eq(DstVT.Value, untyped.Value), 0, 1);
field bit HasDst32 = HasDst;		field bit HasDst32 = HasDst;
field int NumSrcArgs = getNumSrcArgs<Src0VT, Src1VT, Src2VT>.ret;		field int NumSrcArgs = getNumSrcArgs<Src0VT, Src1VT, Src2VT>.ret;
field bit HasModifiers = hasModifiers<Src0VT>.ret;		field bit HasModifiers = hasModifiers<Src0VT>.ret;

field bit HasExt = getHasExt<NumSrcArgs, DstVT, Src0VT, Src1VT>.ret;		field bit HasExt = getHasExt<NumSrcArgs, DstVT, Src0VT, Src1VT>.ret;

field dag Outs = !if(HasDst,(outs DstRC:$vdst),(outs));		field dag Outs = !if(HasDst,(outs DstRC:$vdst),(outs));

// VOP3b instructions are a special case with a second explicit		// VOP3b instructions are a special case with a second explicit
// output. This is manually overridden for them.		// output. This is manually overridden for them.
field dag Outs32 = Outs;		field dag Outs32 = Outs;
field dag Outs64 = Outs;		field dag Outs64 = Outs;
field dag OutsDPP = getOutsExt<HasDst, DstVT, DstRCDPP>.ret;		field dag OutsDPP = getOutsExt<HasDst, DstVT, DstRCDPP>.ret;
field dag OutsSDWA = getOutsExt<HasDst, DstVT, DstRCDPP>.ret;		field dag OutsSDWA = getOutsExt<HasDst, DstVT, DstRCDPP>.ret;

field dag Ins32 = getIns32<Src0RC32, Src1RC32, NumSrcArgs>.ret;		field dag Ins32 = getIns32<Src0RC32, Src1RC32, NumSrcArgs>.ret;
field dag Ins64 = getIns64<Src0RC64, Src1RC64, Src2RC64, NumSrcArgs,		field dag Ins64 = getIns64<Src0RC64, Src1RC64, Src2RC64, NumSrcArgs,
HasModifiers>.ret;		HasModifiers, Src0Mod, Src1Mod, Src2Mod>.ret;
field dag InsDPP = getInsDPP<Src0DPP, Src1DPP, NumSrcArgs, HasModifiers>.ret;		field dag InsDPP = getInsDPP<Src0DPP, Src1DPP, NumSrcArgs,
field dag InsSDWA = getInsSDWA<Src0SDWA, Src1SDWA, NumSrcArgs, HasModifiers, DstVT>.ret;		HasModifiers, Src0Mod, Src1Mod>.ret;
		field dag InsSDWA = getInsSDWA<Src0SDWA, Src1SDWA, NumSrcArgs,
		HasModifiers, Src0Mod, Src1Mod, DstVT>.ret;

field string Asm32 = getAsm32<HasDst, NumSrcArgs, DstVT>.ret;		field string Asm32 = getAsm32<HasDst, NumSrcArgs, DstVT>.ret;
field string Asm64 = getAsm64<HasDst, NumSrcArgs, HasModifiers, DstVT>.ret;		field string Asm64 = getAsm64<HasDst, NumSrcArgs, HasModifiers, DstVT>.ret;
field string AsmDPP = getAsmDPP<HasDst, NumSrcArgs, HasModifiers, DstVT>.ret;		field string AsmDPP = getAsmDPP<HasDst, NumSrcArgs, HasModifiers, DstVT>.ret;
field string AsmSDWA = getAsmSDWA<HasDst, NumSrcArgs, HasModifiers, DstVT>.ret;		field string AsmSDWA = getAsmSDWA<HasDst, NumSrcArgs, HasModifiers, DstVT>.ret;
}		}

class VOP_NO_EXT <VOPProfile p> : VOPProfile <p.ArgVT> {		class VOP_NO_EXT <VOPProfile p> : VOPProfile <p.ArgVT> {
▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines
// to be a src operand. The custom inserter must add a tied implicit		// to be a src operand. The custom inserter must add a tied implicit
// def and use of the super register since there seems to be no way to		// def and use of the super register since there seems to be no way to
// add an implicit def of a virtual register in tablegen.		// add an implicit def of a virtual register in tablegen.
def VOP_MOVRELD : VOPProfile<[untyped, i32, untyped, untyped]> {		def VOP_MOVRELD : VOPProfile<[untyped, i32, untyped, untyped]> {
let Src0RC32 = VOPDstOperand<VGPR_32>;		let Src0RC32 = VOPDstOperand<VGPR_32>;
let Src0RC64 = VOPDstOperand<VGPR_32>;		let Src0RC64 = VOPDstOperand<VGPR_32>;

let Outs = (outs);		let Outs = (outs);
let Ins32 = (ins Src0RC32:$vdst, VSrc_32:$src0);		let Ins32 = (ins Src0RC32:$vdst, VSrc_b32:$src0);
let Ins64 = (ins Src0RC64:$vdst, VSrc_32:$src0);		let Ins64 = (ins Src0RC64:$vdst, VSrc_b32:$src0);

let InsDPP = (ins Src0RC32:$vdst, Src0RC32:$src0, dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,		let InsDPP = (ins Src0RC32:$vdst, Src0RC32:$src0, dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,
bank_mask:$bank_mask, bound_ctrl:$bound_ctrl);		bank_mask:$bank_mask, bound_ctrl:$bound_ctrl);
let InsSDWA = (ins Src0RC32:$vdst, IntInputMods:$src0_imodifiers, VCSrc_32:$src0,		let InsSDWA = (ins Src0RC32:$vdst, Int32InputMods:$src0_imodifiers, VCSrc_b32:$src0,
clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,		clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,
src0_sel:$src0_sel);		src0_sel:$src0_sel);

let Asm32 = getAsm32<1, 1>.ret;		let Asm32 = getAsm32<1, 1>.ret;
let Asm64 = getAsm64<1, 1, 0>.ret;		let Asm64 = getAsm64<1, 1, 0>.ret;
let AsmDPP = getAsmDPP<1, 1, 0>.ret;		let AsmDPP = getAsmDPP<1, 1, 0>.ret;
let AsmSDWA = getAsmSDWA<1, 1, 0>.ret;		let AsmSDWA = getAsmSDWA<1, 1, 0>.ret;

let HasExt = 0;		let HasExt = 0;
let HasDst = 0;		let HasDst = 0;
}		}

// Write out to vcc or arbitrary SGPR.		// Write out to vcc or arbitrary SGPR.
def VOP2b_I32_I1_I32_I32 : VOPProfile<[i32, i32, i32, untyped]> {		def VOP2b_I32_I1_I32_I32 : VOPProfile<[i32, i32, i32, untyped]> {
let Asm32 = "$vdst, vcc, $src0, $src1";		let Asm32 = "$vdst, vcc, $src0, $src1";
let Asm64 = "$vdst, $sdst, $src0, $src1";		let Asm64 = "$vdst, $sdst, $src0, $src1";
let Outs32 = (outs DstRC:$vdst);		let Outs32 = (outs DstRC:$vdst);
let Outs64 = (outs DstRC:$vdst, SReg_64:$sdst);		let Outs64 = (outs DstRC:$vdst, SReg_64:$sdst);
}		}

// Write out to vcc or arbitrary SGPR and read in from vcc or		// Write out to vcc or arbitrary SGPR and read in from vcc or
// arbitrary SGPR.		// arbitrary SGPR.
def VOP2b_I32_I1_I32_I32_I1 : VOPProfile<[i32, i32, i32, i1]> {		def VOP2b_I32_I1_I32_I32_I1 : VOPProfile<[i32, i32, i32, i1]> {
// We use VCSrc_32 to exclude literal constants, even though the		// We use VCSrc_b32 to exclude literal constants, even though the
// encoding normally allows them since the implicit VCC use means		// encoding normally allows them since the implicit VCC use means
// using one would always violate the constant bus		// using one would always violate the constant bus
// restriction. SGPRs are still allowed because it should		// restriction. SGPRs are still allowed because it should
// technically be possible to use VCC again as src0.		// technically be possible to use VCC again as src0.
let Src0RC32 = VCSrc_32;		let Src0RC32 = VCSrc_b32;
let Asm32 = "$vdst, vcc, $src0, $src1, vcc";		let Asm32 = "$vdst, vcc, $src0, $src1, vcc";
let Asm64 = "$vdst, $sdst, $src0, $src1, $src2";		let Asm64 = "$vdst, $sdst, $src0, $src1, $src2";
let Outs32 = (outs DstRC:$vdst);		let Outs32 = (outs DstRC:$vdst);
let Outs64 = (outs DstRC:$vdst, SReg_64:$sdst);		let Outs64 = (outs DstRC:$vdst, SReg_64:$sdst);

// Suppress src2 implied by type since the 32-bit encoding uses an		// Suppress src2 implied by type since the 32-bit encoding uses an
// implicit VCC use.		// implicit VCC use.
let Ins32 = (ins Src0RC32:$src0, Src1RC32:$src1);		let Ins32 = (ins Src0RC32:$src0, Src1RC32:$src1);
}		}

// Read in from vcc or arbitrary SGPR		// Read in from vcc or arbitrary SGPR
def VOP2e_I32_I32_I32_I1 : VOPProfile<[i32, i32, i32, i1]> {		def VOP2e_I32_I32_I32_I1 : VOPProfile<[i32, i32, i32, i1]> {
let Src0RC32 = VCSrc_32; // See comment in def VOP2b_I32_I1_I32_I32_I1 above.		let Src0RC32 = VCSrc_b32; // See comment in def VOP2b_I32_I1_I32_I32_I1 above.
let Asm32 = "$vdst, $src0, $src1, vcc";		let Asm32 = "$vdst, $src0, $src1, vcc";
let Asm64 = "$vdst, $src0, $src1, $src2";		let Asm64 = "$vdst, $src0, $src1, $src2";
let Outs32 = (outs DstRC:$vdst);		let Outs32 = (outs DstRC:$vdst);
let Outs64 = (outs DstRC:$vdst);		let Outs64 = (outs DstRC:$vdst);

// Suppress src2 implied by type since the 32-bit encoding uses an		// Suppress src2 implied by type since the 32-bit encoding uses an
// implicit VCC use.		// implicit VCC use.
let Ins32 = (ins Src0RC32:$src0, Src1RC32:$src1);		let Ins32 = (ins Src0RC32:$src0, Src1RC32:$src1);
Show All 20 Lines
class VOPC_Profile<ValueType vt0, ValueType vt1 = vt0> : VOPProfile <[i1, vt0, vt1, untyped]> {		class VOPC_Profile<ValueType vt0, ValueType vt1 = vt0> : VOPProfile <[i1, vt0, vt1, untyped]> {
let Asm32 = "vcc, $src0, $src1";		let Asm32 = "vcc, $src0, $src1";
// The destination for 32-bit encoding is implicit.		// The destination for 32-bit encoding is implicit.
let HasDst32 = 0;		let HasDst32 = 0;
let Outs64 = (outs DstRC:$sdst);		let Outs64 = (outs DstRC:$sdst);
}		}

class VOPC_Class_Profile<ValueType vt> : VOPC_Profile<vt, i32> {		class VOPC_Class_Profile<ValueType vt> : VOPC_Profile<vt, i32> {
let Ins64 = (ins FPInputMods:$src0_modifiers, Src0RC64:$src0, Src1RC64:$src1);		let Ins64 = (ins Src0Mod:$src0_modifiers, Src0RC64:$src0, Src1RC64:$src1);
let Asm64 = "$sdst, $src0_modifiers, $src1";		let Asm64 = "$sdst, $src0_modifiers, $src1";
let InsSDWA = (ins FPInputMods:$src0_fmodifiers, Src0RC64:$src0,		let InsSDWA = (ins Src0Mod:$src0_fmodifiers, Src0RC64:$src0,
IntInputMods:$src1_imodifiers, Src1RC64:$src1,		Int32InputMods:$src1_imodifiers, Src1RC64:$src1,
clampmod:$clamp, src0_sel:$src0_sel, src1_sel:$src1_sel);		clampmod:$clamp, src0_sel:$src0_sel, src1_sel:$src1_sel);
let AsmSDWA = " vcc, $src0_fmodifiers, $src1_imodifiers$clamp $src0_sel $src1_sel";		let AsmSDWA = " vcc, $src0_fmodifiers, $src1_imodifiers$clamp $src0_sel $src1_sel";

}		}

def VOPC_I1_F32_F32 : VOPC_Profile<f32>;		def VOPC_I1_F32_F32 : VOPC_Profile<f32>;
def VOPC_I1_F64_F64 : VOPC_Profile<f64>;		def VOPC_I1_F64_F64 : VOPC_Profile<f64>;
def VOPC_I1_I32_I32 : VOPC_Profile<i32>;		def VOPC_I1_I32_I32 : VOPC_Profile<i32>;
def VOPC_I1_I64_I64 : VOPC_Profile<i64>;		def VOPC_I1_I64_I64 : VOPC_Profile<i64>;

def VOPC_I1_F32_I32 : VOPC_Class_Profile<f32>;		def VOPC_I1_F32_I32 : VOPC_Class_Profile<f32>;
def VOPC_I1_F64_I32 : VOPC_Class_Profile<f64>;		def VOPC_I1_F64_I32 : VOPC_Class_Profile<f64>;

def VOP_I64_I64_I32 : VOPProfile <[i64, i64, i32, untyped]>;		def VOP_I64_I64_I32 : VOPProfile <[i64, i64, i32, untyped]>;
def VOP_I64_I32_I64 : VOPProfile <[i64, i32, i64, untyped]>;		def VOP_I64_I32_I64 : VOPProfile <[i64, i32, i64, untyped]>;
def VOP_I64_I64_I64 : VOPProfile <[i64, i64, i64, untyped]>;		def VOP_I64_I64_I64 : VOPProfile <[i64, i64, i64, untyped]>;

def VOP_F32_F32_F32_F32 : VOPProfile <[f32, f32, f32, f32]>;		def VOP_F32_F32_F32_F32 : VOPProfile <[f32, f32, f32, f32]>;
def VOP_MADAK : VOPProfile <[f32, f32, f32, f32]> {		def VOP_MADAK : VOPProfile <[f32, f32, f32, f32]> {
field dag Ins32 = (ins VCSrc_32:$src0, VGPR_32:$src1, u32kimm:$imm);		field dag Ins32 = (ins VCSrc_f32:$src0, VGPR_32:$src1, f32kimm:$imm);
field string Asm32 = "$vdst, $src0, $src1, $imm";		field string Asm32 = "$vdst, $src0, $src1, $imm";
field bit HasExt = 0;		field bit HasExt = 0;
}		}
def VOP_MADMK : VOPProfile <[f32, f32, f32, f32]> {		def VOP_MADMK : VOPProfile <[f32, f32, f32, f32]> {
field dag Ins32 = (ins VCSrc_32:$src0, u32kimm:$imm, VGPR_32:$src1);		field dag Ins32 = (ins VCSrc_f32:$src0, f32kimm:$imm, VGPR_32:$src1);
field string Asm32 = "$vdst, $src0, $imm, $src1";		field string Asm32 = "$vdst, $src0, $imm, $src1";
field bit HasExt = 0;		field bit HasExt = 0;
}		}
def VOP_MAC : VOPProfile <[f32, f32, f32, f32]> {		def VOP_MAC : VOPProfile <[f32, f32, f32, f32]> {
let Ins32 = (ins Src0RC32:$src0, Src1RC32:$src1, VGPR_32:$src2);		let Ins32 = (ins Src0RC32:$src0, Src1RC32:$src1, VGPR_32:$src2);
let Ins64 = getIns64<Src0RC64, Src1RC64, RegisterOperand<VGPR_32>, 3,		let Ins64 = getIns64<Src0RC64, Src1RC64, RegisterOperand<VGPR_32>, 3,
HasModifiers>.ret;		HasModifiers, Src0Mod, Src1Mod, Src2Mod>.ret;
let InsDPP = (ins FPInputMods:$src0_modifiers, Src0RC32:$src0,		let InsDPP = (ins FP32InputMods:$src0_modifiers, Src0RC32:$src0,
FPInputMods:$src1_modifiers, Src1RC32:$src1,		FP32InputMods:$src1_modifiers, Src1RC32:$src1,
VGPR_32:$src2, // stub argument		VGPR_32:$src2, // stub argument
dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,		dpp_ctrl:$dpp_ctrl, row_mask:$row_mask,
bank_mask:$bank_mask, bound_ctrl:$bound_ctrl);		bank_mask:$bank_mask, bound_ctrl:$bound_ctrl);
let InsSDWA = (ins FPInputMods:$src0_fmodifiers, Src0RC32:$src0,		let InsSDWA = (ins FP32InputMods:$src0_fmodifiers, Src0RC32:$src0,
FPInputMods:$src1_fmodifiers, Src1RC32:$src1,		FP32InputMods:$src1_fmodifiers, Src1RC32:$src1,
VGPR_32:$src2, // stub argument		VGPR_32:$src2, // stub argument
clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,		clampmod:$clamp, dst_sel:$dst_sel, dst_unused:$dst_unused,
src0_sel:$src0_sel, src1_sel:$src1_sel);		src0_sel:$src0_sel, src1_sel:$src1_sel);
let Asm32 = getAsm32<1, 2, f32>.ret;		let Asm32 = getAsm32<1, 2, f32>.ret;
let Asm64 = getAsm64<1, 2, HasModifiers, f32>.ret;		let Asm64 = getAsm64<1, 2, HasModifiers, f32>.ret;
let AsmDPP = getAsmDPP<1, 2, HasModifiers, f32>.ret;		let AsmDPP = getAsmDPP<1, 2, HasModifiers, f32>.ret;
let AsmSDWA = getAsmSDWA<1, 2, HasModifiers, f32>.ret;		let AsmSDWA = getAsmSDWA<1, 2, HasModifiers, f32>.ret;
}		}
▲ Show 20 Lines • Show All 857 Lines • ▼ Show 20 Lines

// Special case for v_div_fmas_{f32\|f64}, since it seems to be the		// Special case for v_div_fmas_{f32\|f64}, since it seems to be the
// only VOP instruction that implicitly reads VCC.		// only VOP instruction that implicitly reads VCC.
multiclass VOP3_VCC_Inst <vop3 op, string opName,		multiclass VOP3_VCC_Inst <vop3 op, string opName,
VOPProfile P,		VOPProfile P,
SDPatternOperator node = null_frag> : VOP3_Helper <		SDPatternOperator node = null_frag> : VOP3_Helper <
op, opName,		op, opName,
(outs P.DstRC.RegClass:$vdst),		(outs P.DstRC.RegClass:$vdst),
(ins FPInputMods:$src0_modifiers, P.Src0RC64:$src0,		P.Ins64,
FPInputMods:$src1_modifiers, P.Src1RC64:$src1,
FPInputMods:$src2_modifiers, P.Src2RC64:$src2,
clampmod:$clamp,
omod:$omod),
"$vdst, $src0_modifiers, $src1_modifiers, $src2_modifiers"#"$clamp"#"$omod",		"$vdst, $src0_modifiers, $src1_modifiers, $src2_modifiers"#"$clamp"#"$omod",
[(set P.DstVT:$vdst,		[(set P.DstVT:$vdst,
(node (P.Src0VT (VOP3Mods0 P.Src0VT:$src0, i32:$src0_modifiers,		(node (P.Src0VT (VOP3Mods0 P.Src0VT:$src0, i32:$src0_modifiers,
i1:$clamp, i32:$omod)),		i1:$clamp, i32:$omod)),
(P.Src1VT (VOP3Mods P.Src1VT:$src1, i32:$src1_modifiers)),		(P.Src1VT (VOP3Mods P.Src1VT:$src1, i32:$src1_modifiers)),
(P.Src2VT (VOP3Mods P.Src2VT:$src2, i32:$src2_modifiers)),		(P.Src2VT (VOP3Mods P.Src2VT:$src2, i32:$src2_modifiers)),
(i1 VCC)))],		(i1 VCC)))],
3, 1		3, 1
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines

let mayStore = 1, mayLoad = 0 in {		let mayStore = 1, mayLoad = 0 in {

multiclass MTBUF_Store_Helper <bits<3> op, string opName,		multiclass MTBUF_Store_Helper <bits<3> op, string opName,
RegisterClass regClass> : MTBUF_m <		RegisterClass regClass> : MTBUF_m <
op, opName, (outs),		op, opName, (outs),
(ins regClass:$vdata, u16imm:$offset, i1imm:$offen, i1imm:$idxen, i1imm:$glc,		(ins regClass:$vdata, u16imm:$offset, i1imm:$offen, i1imm:$idxen, i1imm:$glc,
i1imm:$addr64, i8imm:$dfmt, i8imm:$nfmt, VGPR_32:$vaddr,		i1imm:$addr64, i8imm:$dfmt, i8imm:$nfmt, VGPR_32:$vaddr,
SReg_128:$srsrc, i1imm:$slc, i1imm:$tfe, SCSrc_32:$soffset),		SReg_128:$srsrc, i1imm:$slc, i1imm:$tfe, SCSrc_b32:$soffset),
opName#" $vdata, $offset, $offen, $idxen, $glc, $addr64, $dfmt,"		opName#" $vdata, $offset, $offen, $idxen, $glc, $addr64, $dfmt,"
#" $nfmt, $vaddr, $srsrc, $slc, $tfe, $soffset", []		#" $nfmt, $vaddr, $srsrc, $slc, $tfe, $soffset", []
>;		>;

} // mayStore = 1, mayLoad = 0		} // mayStore = 1, mayLoad = 0

let mayLoad = 1, mayStore = 0 in {		let mayLoad = 1, mayStore = 0 in {

multiclass MTBUF_Load_Helper <bits<3> op, string opName,		multiclass MTBUF_Load_Helper <bits<3> op, string opName,
RegisterClass regClass> : MTBUF_m <		RegisterClass regClass> : MTBUF_m <
op, opName, (outs regClass:$dst),		op, opName, (outs regClass:$dst),
(ins u16imm:$offset, i1imm:$offen, i1imm:$idxen, i1imm:$glc, i1imm:$addr64,		(ins u16imm:$offset, i1imm:$offen, i1imm:$idxen, i1imm:$glc, i1imm:$addr64,
i8imm:$dfmt, i8imm:$nfmt, VGPR_32:$vaddr, SReg_128:$srsrc,		i8imm:$dfmt, i8imm:$nfmt, VGPR_32:$vaddr, SReg_128:$srsrc,
i1imm:$slc, i1imm:$tfe, SCSrc_32:$soffset),		i1imm:$slc, i1imm:$tfe, SCSrc_b32:$soffset),
opName#" $dst, $offset, $offen, $idxen, $glc, $addr64, $dfmt,"		opName#" $dst, $offset, $offen, $idxen, $glc, $addr64, $dfmt,"
#" $nfmt, $vaddr, $srsrc, $slc, $tfe, $soffset", []		#" $nfmt, $vaddr, $srsrc, $slc, $tfe, $soffset", []
>;		>;

} // mayLoad = 1, mayStore = 0		} // mayLoad = 1, mayStore = 0

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// MUBUF classes		// MUBUF classes
▲ Show 20 Lines • Show All 145 Lines • ▼ Show 20 Lines	let mayStore = 1, mayLoad = 1, hasPostISelHook = 1, hasSideEffects = 1,
DisableWQM = 1 in {		DisableWQM = 1 in {

// No return variants		// No return variants
let glc = 0, AsmMatchConverter = "cvtMubufAtomic" in {		let glc = 0, AsmMatchConverter = "cvtMubufAtomic" in {

defm _ADDR64 : MUBUFAtomicAddr64_m <		defm _ADDR64 : MUBUFAtomicAddr64_m <
op, name#"_addr64", (outs),		op, name#"_addr64", (outs),
(ins rc:$vdata, VReg_64:$vaddr, SReg_128:$srsrc,		(ins rc:$vdata, VReg_64:$vaddr, SReg_128:$srsrc,
SCSrc_32:$soffset, offset:$offset, slc:$slc),		SCSrc_b32:$soffset, offset:$offset, slc:$slc),
name#" $vdata, $vaddr, $srsrc, $soffset addr64$offset$slc", [], 0		name#" $vdata, $vaddr, $srsrc, $soffset addr64$offset$slc", [], 0
>;		>;

defm _OFFSET : MUBUFAtomicOffset_m <		defm _OFFSET : MUBUFAtomicOffset_m <
op, name#"_offset", (outs),		op, name#"_offset", (outs),
(ins rc:$vdata, SReg_128:$srsrc, SCSrc_32:$soffset, offset:$offset,		(ins rc:$vdata, SReg_128:$srsrc, SCSrc_b32:$soffset, offset:$offset,
slc:$slc),		slc:$slc),
name#" $vdata, off, $srsrc, $soffset$offset$slc", [], 0		name#" $vdata, off, $srsrc, $soffset$offset$slc", [], 0
>;		>;

let offen = 1, idxen = 0 in {		let offen = 1, idxen = 0 in {
defm _OFFEN : MUBUFAtomicOther_m <		defm _OFFEN : MUBUFAtomicOther_m <
op, name#"_offen", (outs),		op, name#"_offen", (outs),
(ins rc:$vdata, VGPR_32:$vaddr, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins rc:$vdata, VGPR_32:$vaddr, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, slc:$slc),		offset:$offset, slc:$slc),
name#" $vdata, $vaddr, $srsrc, $soffset offen$offset$slc", [], 0		name#" $vdata, $vaddr, $srsrc, $soffset offen$offset$slc", [], 0
>;		>;
}		}

let offen = 0, idxen = 1 in {		let offen = 0, idxen = 1 in {
defm _IDXEN : MUBUFAtomicOther_m <		defm _IDXEN : MUBUFAtomicOther_m <
op, name#"_idxen", (outs),		op, name#"_idxen", (outs),
(ins rc:$vdata, VGPR_32:$vaddr, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins rc:$vdata, VGPR_32:$vaddr, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, slc:$slc),		offset:$offset, slc:$slc),
name#" $vdata, $vaddr, $srsrc, $soffset idxen$offset$slc", [], 0		name#" $vdata, $vaddr, $srsrc, $soffset idxen$offset$slc", [], 0
>;		>;
}		}

let offen = 1, idxen = 1 in {		let offen = 1, idxen = 1 in {
defm _BOTHEN : MUBUFAtomicOther_m <		defm _BOTHEN : MUBUFAtomicOther_m <
op, name#"_bothen", (outs),		op, name#"_bothen", (outs),
(ins rc:$vdata, VReg_64:$vaddr, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins rc:$vdata, VReg_64:$vaddr, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, slc:$slc),		offset:$offset, slc:$slc),
name#" $vdata, $vaddr, $srsrc, $soffset idxen offen$offset$slc",		name#" $vdata, $vaddr, $srsrc, $soffset idxen offen$offset$slc",
[], 0		[], 0
>;		>;
}		}
} // glc = 0		} // glc = 0

// Variant that return values		// Variant that return values
let glc = 1, Constraints = "$vdata = $vdata_in",		let glc = 1, Constraints = "$vdata = $vdata_in",
AsmMatchConverter = "cvtMubufAtomicReturn",		AsmMatchConverter = "cvtMubufAtomicReturn",
DisableEncoding = "$vdata_in" in {		DisableEncoding = "$vdata_in" in {

defm _RTN_ADDR64 : MUBUFAtomicAddr64_m <		defm _RTN_ADDR64 : MUBUFAtomicAddr64_m <
op, name#"_rtn_addr64", (outs rc:$vdata),		op, name#"_rtn_addr64", (outs rc:$vdata),
(ins rc:$vdata_in, VReg_64:$vaddr, SReg_128:$srsrc,		(ins rc:$vdata_in, VReg_64:$vaddr, SReg_128:$srsrc,
SCSrc_32:$soffset, offset:$offset, slc:$slc),		SCSrc_b32:$soffset, offset:$offset, slc:$slc),
name#" $vdata, $vaddr, $srsrc, $soffset addr64$offset glc$slc",		name#" $vdata, $vaddr, $srsrc, $soffset addr64$offset glc$slc",
[(set vt:$vdata,		[(set vt:$vdata,
(atomic (MUBUFAddr64Atomic v4i32:$srsrc, i64:$vaddr, i32:$soffset,		(atomic (MUBUFAddr64Atomic v4i32:$srsrc, i64:$vaddr, i32:$soffset,
i16:$offset, i1:$slc), vt:$vdata_in))], 1		i16:$offset, i1:$slc), vt:$vdata_in))], 1
>;		>;

defm _RTN_OFFSET : MUBUFAtomicOffset_m <		defm _RTN_OFFSET : MUBUFAtomicOffset_m <
op, name#"_rtn_offset", (outs rc:$vdata),		op, name#"_rtn_offset", (outs rc:$vdata),
(ins rc:$vdata_in, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins rc:$vdata_in, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, slc:$slc),		offset:$offset, slc:$slc),
name#" $vdata, off, $srsrc, $soffset$offset glc$slc",		name#" $vdata, off, $srsrc, $soffset$offset glc$slc",
[(set vt:$vdata,		[(set vt:$vdata,
(atomic (MUBUFOffsetAtomic v4i32:$srsrc, i32:$soffset, i16:$offset,		(atomic (MUBUFOffsetAtomic v4i32:$srsrc, i32:$soffset, i16:$offset,
i1:$slc), vt:$vdata_in))], 1		i1:$slc), vt:$vdata_in))], 1
>;		>;

let offen = 1, idxen = 0 in {		let offen = 1, idxen = 0 in {
defm _RTN_OFFEN : MUBUFAtomicOther_m <		defm _RTN_OFFEN : MUBUFAtomicOther_m <
op, name#"_rtn_offen", (outs rc:$vdata),		op, name#"_rtn_offen", (outs rc:$vdata),
(ins rc:$vdata_in, VGPR_32:$vaddr, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins rc:$vdata_in, VGPR_32:$vaddr, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, slc:$slc),		offset:$offset, slc:$slc),
name#" $vdata, $vaddr, $srsrc, $soffset offen$offset glc$slc",		name#" $vdata, $vaddr, $srsrc, $soffset offen$offset glc$slc",
[], 1		[], 1
>;		>;
}		}

let offen = 0, idxen = 1 in {		let offen = 0, idxen = 1 in {
defm _RTN_IDXEN : MUBUFAtomicOther_m <		defm _RTN_IDXEN : MUBUFAtomicOther_m <
op, name#"_rtn_idxen", (outs rc:$vdata),		op, name#"_rtn_idxen", (outs rc:$vdata),
(ins rc:$vdata_in, VGPR_32:$vaddr, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins rc:$vdata_in, VGPR_32:$vaddr, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, slc:$slc),		offset:$offset, slc:$slc),
name#" $vdata, $vaddr, $srsrc, $soffset idxen$offset glc$slc",		name#" $vdata, $vaddr, $srsrc, $soffset idxen$offset glc$slc",
[], 1		[], 1
>;		>;
}		}

let offen = 1, idxen = 1 in {		let offen = 1, idxen = 1 in {
defm _RTN_BOTHEN : MUBUFAtomicOther_m <		defm _RTN_BOTHEN : MUBUFAtomicOther_m <
op, name#"_rtn_bothen", (outs rc:$vdata),		op, name#"_rtn_bothen", (outs rc:$vdata),
(ins rc:$vdata_in, VReg_64:$vaddr, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins rc:$vdata_in, VReg_64:$vaddr, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, slc:$slc),		offset:$offset, slc:$slc),
name#" $vdata, $vaddr, $srsrc, $soffset idxen offen$offset glc$slc",		name#" $vdata, $vaddr, $srsrc, $soffset idxen offen$offset glc$slc",
[], 1		[], 1
>;		>;
}		}
} // glc = 1		} // glc = 1

} // mayStore = 1, mayLoad = 1, hasPostISelHook = 1		} // mayStore = 1, mayLoad = 1, hasPostISelHook = 1
}		}

// FIXME: tfe can't be an operand because it requires a separate		// FIXME: tfe can't be an operand because it requires a separate
// opcode because it needs an N+1 register class dest register.		// opcode because it needs an N+1 register class dest register.
multiclass MUBUF_Load_Helper <mubuf op, string name, RegisterClass regClass,		multiclass MUBUF_Load_Helper <mubuf op, string name, RegisterClass regClass,
ValueType load_vt = i32,		ValueType load_vt = i32,
SDPatternOperator ld = null_frag> {		SDPatternOperator ld = null_frag> {

let mayLoad = 1, mayStore = 0 in {		let mayLoad = 1, mayStore = 0 in {
let offen = 0, idxen = 0, vaddr = 0 in {		let offen = 0, idxen = 0, vaddr = 0 in {
defm _OFFSET : MUBUF_m <op, name#"_offset", (outs regClass:$vdata),		defm _OFFSET : MUBUF_m <op, name#"_offset", (outs regClass:$vdata),
(ins SReg_128:$srsrc, SCSrc_32:$soffset,		(ins SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, glc:$glc, slc:$slc, tfe:$tfe),		offset:$offset, glc:$glc, slc:$slc, tfe:$tfe),
name#" $vdata, off, $srsrc, $soffset$offset$glc$slc$tfe",		name#" $vdata, off, $srsrc, $soffset$offset$glc$slc$tfe",
[(set load_vt:$vdata, (ld (MUBUFOffset v4i32:$srsrc,		[(set load_vt:$vdata, (ld (MUBUFOffset v4i32:$srsrc,
i32:$soffset, i16:$offset,		i32:$soffset, i16:$offset,
i1:$glc, i1:$slc, i1:$tfe)))]>;		i1:$glc, i1:$slc, i1:$tfe)))]>;
}		}

let offen = 1, idxen = 0 in {		let offen = 1, idxen = 0 in {
defm _OFFEN : MUBUF_m <op, name#"_offen", (outs regClass:$vdata),		defm _OFFEN : MUBUF_m <op, name#"_offen", (outs regClass:$vdata),
(ins VGPR_32:$vaddr, SReg_128:$srsrc,		(ins VGPR_32:$vaddr, SReg_128:$srsrc,
SCSrc_32:$soffset, offset:$offset, glc:$glc, slc:$slc,		SCSrc_b32:$soffset, offset:$offset, glc:$glc, slc:$slc,
tfe:$tfe),		tfe:$tfe),
name#" $vdata, $vaddr, $srsrc, $soffset offen$offset$glc$slc$tfe", []>;		name#" $vdata, $vaddr, $srsrc, $soffset offen$offset$glc$slc$tfe", []>;
}		}

let offen = 0, idxen = 1 in {		let offen = 0, idxen = 1 in {
defm _IDXEN : MUBUF_m <op, name#"_idxen", (outs regClass:$vdata),		defm _IDXEN : MUBUF_m <op, name#"_idxen", (outs regClass:$vdata),
(ins VGPR_32:$vaddr, SReg_128:$srsrc,		(ins VGPR_32:$vaddr, SReg_128:$srsrc,
SCSrc_32:$soffset, offset:$offset, glc:$glc,		SCSrc_b32:$soffset, offset:$offset, glc:$glc,
slc:$slc, tfe:$tfe),		slc:$slc, tfe:$tfe),
name#" $vdata, $vaddr, $srsrc, $soffset idxen$offset$glc$slc$tfe", []>;		name#" $vdata, $vaddr, $srsrc, $soffset idxen$offset$glc$slc$tfe", []>;
}		}

let offen = 1, idxen = 1 in {		let offen = 1, idxen = 1 in {
defm _BOTHEN : MUBUF_m <op, name#"_bothen", (outs regClass:$vdata),		defm _BOTHEN : MUBUF_m <op, name#"_bothen", (outs regClass:$vdata),
(ins VReg_64:$vaddr, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins VReg_64:$vaddr, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, glc:$glc, slc:$slc, tfe:$tfe),		offset:$offset, glc:$glc, slc:$slc, tfe:$tfe),
name#" $vdata, $vaddr, $srsrc, $soffset idxen offen$offset$glc$slc$tfe", []>;		name#" $vdata, $vaddr, $srsrc, $soffset idxen offen$offset$glc$slc$tfe", []>;
}		}

let offen = 0, idxen = 0 in {		let offen = 0, idxen = 0 in {
defm _ADDR64 : MUBUFAddr64_m <op, name#"_addr64", (outs regClass:$vdata),		defm _ADDR64 : MUBUFAddr64_m <op, name#"_addr64", (outs regClass:$vdata),
(ins VReg_64:$vaddr, SReg_128:$srsrc,		(ins VReg_64:$vaddr, SReg_128:$srsrc,
SCSrc_32:$soffset, offset:$offset,		SCSrc_b32:$soffset, offset:$offset,
glc:$glc, slc:$slc, tfe:$tfe),		glc:$glc, slc:$slc, tfe:$tfe),
name#" $vdata, $vaddr, $srsrc, $soffset addr64$offset$glc$slc$tfe",		name#" $vdata, $vaddr, $srsrc, $soffset addr64$offset$glc$slc$tfe",
[(set load_vt:$vdata, (ld (MUBUFAddr64 v4i32:$srsrc,		[(set load_vt:$vdata, (ld (MUBUFAddr64 v4i32:$srsrc,
i64:$vaddr, i32:$soffset,		i64:$vaddr, i32:$soffset,
i16:$offset, i1:$glc, i1:$slc,		i16:$offset, i1:$glc, i1:$slc,
i1:$tfe)))]>;		i1:$tfe)))]>;
}		}
}		}
}		}

multiclass MUBUF_Store_Helper <mubuf op, string name, RegisterClass vdataClass,		multiclass MUBUF_Store_Helper <mubuf op, string name, RegisterClass vdataClass,
ValueType store_vt = i32, SDPatternOperator st = null_frag> {		ValueType store_vt = i32, SDPatternOperator st = null_frag> {
let mayLoad = 0, mayStore = 1 in {		let mayLoad = 0, mayStore = 1 in {
let offen = 0, idxen = 0, vaddr = 0 in {		let offen = 0, idxen = 0, vaddr = 0 in {
defm _OFFSET : MUBUF_m <op, name#"_offset",(outs),		defm _OFFSET : MUBUF_m <op, name#"_offset",(outs),
(ins vdataClass:$vdata, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins vdataClass:$vdata, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, glc:$glc, slc:$slc, tfe:$tfe),		offset:$offset, glc:$glc, slc:$slc, tfe:$tfe),
name#" $vdata, off, $srsrc, $soffset$offset$glc$slc$tfe",		name#" $vdata, off, $srsrc, $soffset$offset$glc$slc$tfe",
[(st store_vt:$vdata, (MUBUFOffset v4i32:$srsrc, i32:$soffset,		[(st store_vt:$vdata, (MUBUFOffset v4i32:$srsrc, i32:$soffset,
i16:$offset, i1:$glc, i1:$slc, i1:$tfe))]>;		i16:$offset, i1:$glc, i1:$slc, i1:$tfe))]>;
} // offen = 0, idxen = 0, vaddr = 0		} // offen = 0, idxen = 0, vaddr = 0

let offen = 1, idxen = 0 in {		let offen = 1, idxen = 0 in {
defm _OFFEN : MUBUF_m <op, name#"_offen", (outs),		defm _OFFEN : MUBUF_m <op, name#"_offen", (outs),
(ins vdataClass:$vdata, VGPR_32:$vaddr, SReg_128:$srsrc,		(ins vdataClass:$vdata, VGPR_32:$vaddr, SReg_128:$srsrc,
SCSrc_32:$soffset, offset:$offset, glc:$glc,		SCSrc_b32:$soffset, offset:$offset, glc:$glc,
slc:$slc, tfe:$tfe),		slc:$slc, tfe:$tfe),
name#" $vdata, $vaddr, $srsrc, $soffset offen"#		name#" $vdata, $vaddr, $srsrc, $soffset offen"#
"$offset$glc$slc$tfe", []>;		"$offset$glc$slc$tfe", []>;
} // end offen = 1, idxen = 0		} // end offen = 1, idxen = 0

let offen = 0, idxen = 1 in {		let offen = 0, idxen = 1 in {
defm _IDXEN : MUBUF_m <op, name#"_idxen", (outs),		defm _IDXEN : MUBUF_m <op, name#"_idxen", (outs),
(ins vdataClass:$vdata, VGPR_32:$vaddr, SReg_128:$srsrc,		(ins vdataClass:$vdata, VGPR_32:$vaddr, SReg_128:$srsrc,
SCSrc_32:$soffset, offset:$offset, glc:$glc,		SCSrc_b32:$soffset, offset:$offset, glc:$glc,
slc:$slc, tfe:$tfe),		slc:$slc, tfe:$tfe),
name#" $vdata, $vaddr, $srsrc, $soffset idxen$offset$glc$slc$tfe", []>;		name#" $vdata, $vaddr, $srsrc, $soffset idxen$offset$glc$slc$tfe", []>;
}		}

let offen = 1, idxen = 1 in {		let offen = 1, idxen = 1 in {
defm _BOTHEN : MUBUF_m <op, name#"_bothen", (outs),		defm _BOTHEN : MUBUF_m <op, name#"_bothen", (outs),
(ins vdataClass:$vdata, VReg_64:$vaddr, SReg_128:$srsrc, SCSrc_32:$soffset,		(ins vdataClass:$vdata, VReg_64:$vaddr, SReg_128:$srsrc, SCSrc_b32:$soffset,
offset:$offset, glc:$glc, slc:$slc, tfe:$tfe),		offset:$offset, glc:$glc, slc:$slc, tfe:$tfe),
name#" $vdata, $vaddr, $srsrc, $soffset idxen offen$offset$glc$slc$tfe", []>;		name#" $vdata, $vaddr, $srsrc, $soffset idxen offen$offset$glc$slc$tfe", []>;
}		}

let offen = 0, idxen = 0 in {		let offen = 0, idxen = 0 in {
defm _ADDR64 : MUBUFAddr64_m <op, name#"_addr64", (outs),		defm _ADDR64 : MUBUFAddr64_m <op, name#"_addr64", (outs),
(ins vdataClass:$vdata, VReg_64:$vaddr, SReg_128:$srsrc,		(ins vdataClass:$vdata, VReg_64:$vaddr, SReg_128:$srsrc,
SCSrc_32:$soffset,		SCSrc_b32:$soffset,
offset:$offset, glc:$glc, slc:$slc,		offset:$offset, glc:$glc, slc:$slc,
tfe:$tfe),		tfe:$tfe),
name#" $vdata, $vaddr, $srsrc, $soffset addr64"#		name#" $vdata, $vaddr, $srsrc, $soffset addr64"#
"$offset$glc$slc$tfe",		"$offset$glc$slc$tfe",
[(st store_vt:$vdata,		[(st store_vt:$vdata,
(MUBUFAddr64 v4i32:$srsrc, i64:$vaddr,		(MUBUFAddr64 v4i32:$srsrc, i64:$vaddr,
i32:$soffset, i16:$offset,		i32:$soffset, i16:$offset,
i1:$glc, i1:$slc, i1:$tfe))]>;		i1:$glc, i1:$slc, i1:$tfe))]>;
▲ Show 20 Lines • Show All 128 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AMDGPU/SIInstructions.td

	Show First 20 Lines • Show All 842 Lines • ▼ Show 20 Lines

	// These are special and do not read the exec mask.			// These are special and do not read the exec mask.
	let isConvergent = 1, Uses = []<Register> in {			let isConvergent = 1, Uses = []<Register> in {

	defm V_READLANE_B32 : VOP2SI_3VI_m <			defm V_READLANE_B32 : VOP2SI_3VI_m <
	vop3 <0x001, 0x289>,			vop3 <0x001, 0x289>,
	"v_readlane_b32",			"v_readlane_b32",
	(outs SReg_32:$vdst),			(outs SReg_32:$vdst),
	(ins VGPR_32:$src0, SCSrc_32:$src1),			(ins VGPR_32:$src0, SCSrc_b32:$src1),
	"v_readlane_b32 $vdst, $src0, $src1",			"v_readlane_b32 $vdst, $src0, $src1",
	[(set i32:$vdst, (int_amdgcn_readlane i32:$src0, i32:$src1))]			[(set i32:$vdst, (int_amdgcn_readlane i32:$src0, i32:$src1))]
	>;			>;

	defm V_WRITELANE_B32 : VOP2SI_3VI_m <			defm V_WRITELANE_B32 : VOP2SI_3VI_m <
	vop3 <0x002, 0x28a>,			vop3 <0x002, 0x28a>,
	"v_writelane_b32",			"v_writelane_b32",
	(outs VGPR_32:$vdst),			(outs VGPR_32:$vdst),
	(ins SReg_32:$src0, SCSrc_32:$src1),			(ins SReg_32:$src0, SCSrc_b32:$src1),
	"v_writelane_b32 $vdst, $src0, $src1"			"v_writelane_b32 $vdst, $src0, $src1"
	>;			>;

	} // End isConvergent = 1			} // End isConvergent = 1

	// These instructions only exist on SI and CI			// These instructions only exist on SI and CI
	let SubtargetPredicate = isSICI in {			let SubtargetPredicate = isSICI in {

	▲ Show 20 Lines • Show All 305 Lines • ▼ Show 20 Lines
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Pseudo Instructions			// Pseudo Instructions
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	let hasSideEffects = 0, mayLoad = 0, mayStore = 0, Uses = [EXEC] in {			let hasSideEffects = 0, mayLoad = 0, mayStore = 0, Uses = [EXEC] in {

	// For use in patterns			// For use in patterns
	def V_CNDMASK_B64_PSEUDO : VOP3Common <(outs VReg_64:$vdst),			def V_CNDMASK_B64_PSEUDO : VOP3Common <(outs VReg_64:$vdst),
	(ins VSrc_64:$src0, VSrc_64:$src1, SSrc_64:$src2), "", []> {			(ins VSrc_b64:$src0, VSrc_b64:$src1, SSrc_b64:$src2), "", []> {
	let isPseudo = 1;			let isPseudo = 1;
	let isCodeGenOnly = 1;			let isCodeGenOnly = 1;
	let usesCustomInserter = 1;			let usesCustomInserter = 1;
	}			}

	// 64-bit vector move instruction. This is mainly used by the SIFoldOperands			// 64-bit vector move instruction. This is mainly used by the SIFoldOperands
	// pass to enable folding of inline immediates.			// pass to enable folding of inline immediates.
	def V_MOV_B64_PSEUDO : PseudoInstSI <(outs VReg_64:$vdst), (ins VSrc_64:$src0)> {			def V_MOV_B64_PSEUDO : PseudoInstSI <(outs VReg_64:$vdst), (ins VSrc_b64:$src0)> {
	let VALU = 1;			let VALU = 1;
	}			}
	} // End let hasSideEffects = 0, mayLoad = 0, mayStore = 0, Uses = [EXEC]			} // End let hasSideEffects = 0, mayLoad = 0, mayStore = 0, Uses = [EXEC]

	let usesCustomInserter = 1, SALU = 1 in {			let usesCustomInserter = 1, SALU = 1 in {
	def GET_GROUPSTATICSIZE : PseudoInstSI <(outs SReg_32:$sdst), (ins),			def GET_GROUPSTATICSIZE : PseudoInstSI <(outs SReg_32:$sdst), (ins),
	[(set SReg_32:$sdst, (int_amdgcn_groupstaticsize))]>;			[(set SReg_32:$sdst, (int_amdgcn_groupstaticsize))]>;
	} // End let usesCustomInserter = 1, SALU = 1			} // End let usesCustomInserter = 1, SALU = 1
	▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines
	def SI_ELSE_BREAK : CFPseudoInstSI <			def SI_ELSE_BREAK : CFPseudoInstSI <
	(outs SReg_64:$dst), (ins SReg_64:$src0, SReg_64:$src1),			(outs SReg_64:$dst), (ins SReg_64:$src0, SReg_64:$src1),
	[(set i64:$dst, (int_amdgcn_else_break i64:$src0, i64:$src1))]> {			[(set i64:$dst, (int_amdgcn_else_break i64:$src0, i64:$src1))]> {
	let Size = 4;			let Size = 4;
	}			}

	let Uses = [EXEC], Defs = [EXEC,VCC] in {			let Uses = [EXEC], Defs = [EXEC,VCC] in {
	def SI_KILL : PseudoInstSI <			def SI_KILL : PseudoInstSI <
	(outs), (ins VSrc_32:$src),			(outs), (ins VSrc_b32:$src),
	[(AMDGPUkill i32:$src)]> {			[(AMDGPUkill i32:$src)]> {
	let isConvergent = 1;			let isConvergent = 1;
	let usesCustomInserter = 1;			let usesCustomInserter = 1;
	}			}

	def SI_KILL_TERMINATOR : SPseudoInstSI <			def SI_KILL_TERMINATOR : SPseudoInstSI <
	(outs), (ins VSrc_32:$src)> {			(outs), (ins VSrc_b32:$src)> {
	let isTerminator = 1;			let isTerminator = 1;
	}			}

	} // End Uses = [EXEC], Defs = [EXEC,VCC]			} // End Uses = [EXEC], Defs = [EXEC,VCC]

	} // End mayLoad = 1, mayStore = 1, hasSideEffects = 1			} // End mayLoad = 1, mayStore = 1, hasSideEffects = 1

	def SI_PS_LIVE : PseudoInstSI <			def SI_PS_LIVE : PseudoInstSI <
	(outs SReg_64:$dst), (ins),			(outs SReg_64:$dst), (ins),
	[(set i1:$dst, (int_amdgcn_ps_live))]> {			[(set i1:$dst, (int_amdgcn_ps_live))]> {
	let SALU = 1;			let SALU = 1;
	}			}

	// Used as an isel pseudo to directly emit initialization with an			// Used as an isel pseudo to directly emit initialization with an
	// s_mov_b32 rather than a copy of another initialized			// s_mov_b32 rather than a copy of another initialized
	// register. MachineCSE skips copies, and we don't want to have to			// register. MachineCSE skips copies, and we don't want to have to
	// fold operands before it runs.			// fold operands before it runs.
	def SI_INIT_M0 : SPseudoInstSI <(outs), (ins SSrc_32:$src)> {			def SI_INIT_M0 : SPseudoInstSI <(outs), (ins SSrc_b32:$src)> {
	let Defs = [M0];			let Defs = [M0];
	let usesCustomInserter = 1;			let usesCustomInserter = 1;
	let isAsCheapAsAMove = 1;			let isAsCheapAsAMove = 1;
	let isReMaterializable = 1;			let isReMaterializable = 1;
	}			}

	def SI_RETURN : SPseudoInstSI <			def SI_RETURN : SPseudoInstSI <
	(outs), (ins variable_ops), [(AMDGPUreturn)]> {			(outs), (ins variable_ops), [(AMDGPUreturn)]> {
	▲ Show 20 Lines • Show All 1,141 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AMDGPU/SIRegisterInfo.cpp

Show First 20 Lines • Show All 785 Lines • ▼ Show 20 Lines	bool SIRegisterInfo::shouldRewriteCopySrc(
// We want to look through the COPY to find:		// We want to look through the COPY to find:
// => vreg3 = COPY vreg0		// => vreg3 = COPY vreg0

// Plain copy.		// Plain copy.
return getCommonSubClass(DefRC, SrcRC) != nullptr;		return getCommonSubClass(DefRC, SrcRC) != nullptr;
}		}

bool SIRegisterInfo::opCanUseLiteralConstant(unsigned OpType) const {		bool SIRegisterInfo::opCanUseLiteralConstant(unsigned OpType) const {
return OpType == AMDGPU::OPERAND_REG_IMM32;		return OpType == AMDGPU::OPERAND_REG_IMM32_INT \|\|
		OpType == AMDGPU::OPERAND_REG_IMM32_FP;
}		}

bool SIRegisterInfo::opCanUseInlineConstant(unsigned OpType) const {		bool SIRegisterInfo::opCanUseInlineConstant(unsigned OpType) const {
if (opCanUseLiteralConstant(OpType))		if (opCanUseLiteralConstant(OpType))
return true;		return true;

return OpType == AMDGPU::OPERAND_REG_INLINE_C;		return OpType == AMDGPU::OPERAND_REG_INLINE_C_INT \|\|
		OpType == AMDGPU::OPERAND_REG_INLINE_C_FP;
}		}

// FIXME: Most of these are flexible with HSA and we don't need to reserve them		// FIXME: Most of these are flexible with HSA and we don't need to reserve them
// as input registers if unused. Whether the dispatch ptr is necessary should be		// as input registers if unused. Whether the dispatch ptr is necessary should be
// easy to detect from used intrinsics. Scratch setup is harder to know.		// easy to detect from used intrinsics. Scratch setup is harder to know.
unsigned SIRegisterInfo::getPreloadedValue(const MachineFunction &MF,		unsigned SIRegisterInfo::getPreloadedValue(const MachineFunction &MF,
enum PreloadedValue Value) const {		enum PreloadedValue Value) const {

▲ Show 20 Lines • Show All 251 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AMDGPU/SIRegisterInfo.td

	Show First 20 Lines • Show All 352 Lines • ▼ Show 20 Lines
	def VS_64 : RegisterClass<"AMDGPU", [i64, f64], 32, (add VReg_64, SReg_64)> {			def VS_64 : RegisterClass<"AMDGPU", [i64, f64], 32, (add VReg_64, SReg_64)> {
	let isAllocatable = 0;			let isAllocatable = 0;
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Register operands			// Register operands
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	class RegImmOperand <RegisterClass rc> : RegisterOperand<rc> {			class RegImmMatcher<string name> : AsmOperandClass {
	let OperandNamespace = "AMDGPU";			let Name = name;
	let OperandType = "OPERAND_REG_IMM32";			let RenderMethod = "addRegOrImmOperands";
	}			}

	class RegInlineOperand <RegisterClass rc> : RegisterOperand<rc> {			multiclass SIRegOperand <string rc, string MatchName, string opType> {
	let OperandNamespace = "AMDGPU";			let OperandNamespace = "AMDGPU" in {
	let OperandType = "OPERAND_REG_INLINE_C";
				def _b32 : RegisterOperand<!cast<RegisterClass>(rc#"_32")> {
				let OperandType = opType#"_INT";
				let ParserMatchClass = RegImmMatcher<MatchName#"B32">;
	}			}

	class RegImmMatcher<string name> : AsmOperandClass {			def _f32 : RegisterOperand<!cast<RegisterClass>(rc#"_32")> {
	let Name = name;			let OperandType = opType#"_FP";
	let RenderMethod = "addRegOrImmOperands";			let ParserMatchClass = RegImmMatcher<MatchName#"F32">;
	}			}

				def _b64 : RegisterOperand<!cast<RegisterClass>(rc#"_64")> {
				let OperandType = opType#"_INT";
				let ParserMatchClass = RegImmMatcher<MatchName#"B64">;
				}

				def _f64 : RegisterOperand<!cast<RegisterClass>(rc#"_64")> {
				let OperandType = opType#"_FP";
				let ParserMatchClass = RegImmMatcher<MatchName#"F64">;
				}
				}
				}

				multiclass RegImmOperand <string rc, string MatchName>
				: SIRegOperand<rc, MatchName, "OPERAND_REG_IMM32">;

				multiclass RegInlineOperand <string rc, string MatchName>
				: SIRegOperand<rc, MatchName, "OPERAND_REG_INLINE_C">;

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// SSrc_* Operands with an SGPR or a 32-bit immediate			// SSrc_* Operands with an SGPR or a 32-bit immediate
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	def SSrc_32 : RegImmOperand<SReg_32> {			defm SSrc : RegImmOperand<"SReg", "SSrc">;
	let ParserMatchClass = RegImmMatcher<"SSrc32">;
	}

	def SSrc_64 : RegImmOperand<SReg_64> {
	let ParserMatchClass = RegImmMatcher<"SSrc64">;
	}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// SCSrc_* Operands with an SGPR or a inline constant			// SCSrc_* Operands with an SGPR or a inline constant
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	def SCSrc_32 : RegInlineOperand<SReg_32> {			defm SCSrc : RegInlineOperand<"SReg", "SCSrc"> ;
	let ParserMatchClass = RegImmMatcher<"SCSrc32">;
	}

	def SCSrc_64 : RegInlineOperand<SReg_64> {
	let ParserMatchClass = RegImmMatcher<"SCSrc64">;
	}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// VSrc_* Operands with an SGPR, VGPR or a 32-bit immediate			// VSrc_* Operands with an SGPR, VGPR or a 32-bit immediate
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	def VSrc_32 : RegImmOperand<VS_32> {			defm VSrc : RegImmOperand<"VS", "VSrc">;
	let ParserMatchClass = RegImmMatcher<"VSrc32">;
	}

	def VSrc_64 : RegImmOperand<VS_64> {
	let ParserMatchClass = RegImmMatcher<"VSrc64">;
	}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// VSrc_* Operands with an VGPR			// VSrc_* Operands with an VGPR
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	// This is for operands with the enum(9), VSrc encoding restriction,			// This is for operands with the enum(9), VSrc encoding restriction,
	// but only allows VGPRs.			// but only allows VGPRs.
	def VRegSrc_32 : RegisterOperand<VGPR_32> {			def VRegSrc_32 : RegisterOperand<VGPR_32> {
	//let ParserMatchClass = RegImmMatcher<"VRegSrc32">;			//let ParserMatchClass = RegImmMatcher<"VRegSrc32">;
	let DecoderMethod = "DecodeVS_32RegisterClass";			let DecoderMethod = "DecodeVS_32RegisterClass";
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// VCSrc_* Operands with an SGPR, VGPR or an inline constant			// VCSrc_* Operands with an SGPR, VGPR or an inline constant
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	def VCSrc_32 : RegInlineOperand<VS_32> {			defm VCSrc : RegInlineOperand<"VS", "VCSrc">;
	let ParserMatchClass = RegImmMatcher<"VCSrc32">;
	}

	def VCSrc_64 : RegInlineOperand<VS_64> {
	let ParserMatchClass = RegImmMatcher<"VCSrc64">;
	}

llvm/trunk/lib/Target/AMDGPU/SOPInstructions.td

Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	class SOP1_Real<bits<8> op, SOP1_Pseudo ps> :

let Inst{7-0} = !if(ps.has_src0, src0, ?);		let Inst{7-0} = !if(ps.has_src0, src0, ?);
let Inst{15-8} = op;		let Inst{15-8} = op;
let Inst{22-16} = !if(ps.has_sdst, sdst, ?);		let Inst{22-16} = !if(ps.has_sdst, sdst, ?);
let Inst{31-23} = 0x17d; //encoding;		let Inst{31-23} = 0x17d; //encoding;
}		}

class SOP1_32 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <		class SOP1_32 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <
opName, (outs SReg_32:$sdst), (ins SSrc_32:$src0),		opName, (outs SReg_32:$sdst), (ins SSrc_b32:$src0),
"$sdst, $src0", pattern		"$sdst, $src0", pattern
>;		>;

class SOP1_64 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <		class SOP1_64 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <
opName, (outs SReg_64:$sdst), (ins SSrc_64:$src0),		opName, (outs SReg_64:$sdst), (ins SSrc_b64:$src0),
"$sdst, $src0", pattern		"$sdst, $src0", pattern
>;		>;

// 64-bit input, 32-bit output.		// 64-bit input, 32-bit output.
class SOP1_32_64 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <		class SOP1_32_64 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <
opName, (outs SReg_32:$sdst), (ins SSrc_64:$src0),		opName, (outs SReg_32:$sdst), (ins SSrc_b64:$src0),
"$sdst, $src0", pattern		"$sdst, $src0", pattern
>;		>;

// 32-bit input, 64-bit output.		// 32-bit input, 64-bit output.
class SOP1_64_32 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <		class SOP1_64_32 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <
opName, (outs SReg_64:$sdst), (ins SSrc_32:$src0),		opName, (outs SReg_64:$sdst), (ins SSrc_b32:$src0),
"$sdst, $src0", pattern		"$sdst, $src0", pattern
>;		>;

// no input, 64-bit output.		// no input, 64-bit output.
class SOP1_64_0 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <		class SOP1_64_0 <string opName, list<dag> pattern=[]> : SOP1_Pseudo <
opName, (outs SReg_64:$sdst), (ins), "$sdst", pattern> {		opName, (outs SReg_64:$sdst), (ins), "$sdst", pattern> {
let has_src0 = 0;		let has_src0 = 0;
}		}
▲ Show 20 Lines • Show All 163 Lines • ▼ Show 20 Lines	class SOP2_Real<bits<7> op, SOP2_Pseudo ps> :
let Inst{15-8} = src1;		let Inst{15-8} = src1;
let Inst{22-16} = !if(ps.has_sdst, sdst, ?);		let Inst{22-16} = !if(ps.has_sdst, sdst, ?);
let Inst{29-23} = op;		let Inst{29-23} = op;
let Inst{31-30} = 0x2; // encoding		let Inst{31-30} = 0x2; // encoding
}		}


class SOP2_32 <string opName, list<dag> pattern=[]> : SOP2_Pseudo <		class SOP2_32 <string opName, list<dag> pattern=[]> : SOP2_Pseudo <
opName, (outs SReg_32:$sdst), (ins SSrc_32:$src0, SSrc_32:$src1),		opName, (outs SReg_32:$sdst), (ins SSrc_b32:$src0, SSrc_b32:$src1),
"$sdst, $src0, $src1", pattern		"$sdst, $src0, $src1", pattern
>;		>;

class SOP2_64 <string opName, list<dag> pattern=[]> : SOP2_Pseudo <		class SOP2_64 <string opName, list<dag> pattern=[]> : SOP2_Pseudo <
opName, (outs SReg_64:$sdst), (ins SSrc_64:$src0, SSrc_64:$src1),		opName, (outs SReg_64:$sdst), (ins SSrc_b64:$src0, SSrc_b64:$src1),
"$sdst, $src0, $src1", pattern		"$sdst, $src0, $src1", pattern
>;		>;

class SOP2_64_32 <string opName, list<dag> pattern=[]> : SOP2_Pseudo <		class SOP2_64_32 <string opName, list<dag> pattern=[]> : SOP2_Pseudo <
opName, (outs SReg_64:$sdst), (ins SSrc_64:$src0, SSrc_32:$src1),		opName, (outs SReg_64:$sdst), (ins SSrc_b64:$src0, SSrc_b32:$src1),
"$sdst, $src0, $src1", pattern		"$sdst, $src0, $src1", pattern
>;		>;

class SOP2_64_32_32 <string opName, list<dag> pattern=[]> : SOP2_Pseudo <		class SOP2_64_32_32 <string opName, list<dag> pattern=[]> : SOP2_Pseudo <
opName, (outs SReg_64:$sdst), (ins SSrc_32:$src0, SSrc_32:$src1),		opName, (outs SReg_64:$sdst), (ins SSrc_b32:$src0, SSrc_b32:$src1),
"$sdst, $src0, $src1", pattern		"$sdst, $src0, $src1", pattern
>;		>;

let Defs = [SCC] in { // Carry out goes to SCC		let Defs = [SCC] in { // Carry out goes to SCC
let isCommutable = 1 in {		let isCommutable = 1 in {
def S_ADD_U32 : SOP2_32 <"s_add_u32">;		def S_ADD_U32 : SOP2_32 <"s_add_u32">;
def S_ADD_I32 : SOP2_32 <"s_add_i32",		def S_ADD_I32 : SOP2_32 <"s_add_i32",
[(set i32:$sdst, (add SSrc_32:$src0, SSrc_32:$src1))]		[(set i32:$sdst, (add SSrc_b32:$src0, SSrc_b32:$src1))]
>;		>;
} // End isCommutable = 1		} // End isCommutable = 1

def S_SUB_U32 : SOP2_32 <"s_sub_u32">;		def S_SUB_U32 : SOP2_32 <"s_sub_u32">;
def S_SUB_I32 : SOP2_32 <"s_sub_i32",		def S_SUB_I32 : SOP2_32 <"s_sub_i32",
[(set i32:$sdst, (sub SSrc_32:$src0, SSrc_32:$src1))]		[(set i32:$sdst, (sub SSrc_b32:$src0, SSrc_b32:$src1))]
>;		>;

let Uses = [SCC] in { // Carry in comes from SCC		let Uses = [SCC] in { // Carry in comes from SCC
let isCommutable = 1 in {		let isCommutable = 1 in {
def S_ADDC_U32 : SOP2_32 <"s_addc_u32",		def S_ADDC_U32 : SOP2_32 <"s_addc_u32",
[(set i32:$sdst, (adde (i32 SSrc_32:$src0), (i32 SSrc_32:$src1)))]>;		[(set i32:$sdst, (adde (i32 SSrc_b32:$src0), (i32 SSrc_b32:$src1)))]>;
} // End isCommutable = 1		} // End isCommutable = 1

def S_SUBB_U32 : SOP2_32 <"s_subb_u32",		def S_SUBB_U32 : SOP2_32 <"s_subb_u32",
[(set i32:$sdst, (sube (i32 SSrc_32:$src0), (i32 SSrc_32:$src1)))]>;		[(set i32:$sdst, (sube (i32 SSrc_b32:$src0), (i32 SSrc_b32:$src1)))]>;
} // End Uses = [SCC]		} // End Uses = [SCC]


let isCommutable = 1 in {		let isCommutable = 1 in {
def S_MIN_I32 : SOP2_32 <"s_min_i32",		def S_MIN_I32 : SOP2_32 <"s_min_i32",
[(set i32:$sdst, (smin i32:$src0, i32:$src1))]		[(set i32:$sdst, (smin i32:$src0, i32:$src1))]
>;		>;
def S_MIN_U32 : SOP2_32 <"s_min_u32",		def S_MIN_U32 : SOP2_32 <"s_min_u32",
▲ Show 20 Lines • Show All 304 Lines • ▼ Show 20 Lines
}		}
class SOPC_Helper <bits<7> op, RegisterOperand rc, ValueType vt,		class SOPC_Helper <bits<7> op, RegisterOperand rc, ValueType vt,
string opName, PatLeaf cond> : SOPC_Base <		string opName, PatLeaf cond> : SOPC_Base <
op, rc, rc, opName,		op, rc, rc, opName,
[(set SCC, (si_setcc_uniform vt:$src0, vt:$src1, cond))] > {		[(set SCC, (si_setcc_uniform vt:$src0, vt:$src1, cond))] > {
}		}

class SOPC_CMP_32<bits<7> op, string opName, PatLeaf cond = COND_NULL>		class SOPC_CMP_32<bits<7> op, string opName, PatLeaf cond = COND_NULL>
: SOPC_Helper<op, SSrc_32, i32, opName, cond>;		: SOPC_Helper<op, SSrc_b32, i32, opName, cond>;

class SOPC_32<bits<7> op, string opName, list<dag> pattern = []>		class SOPC_32<bits<7> op, string opName, list<dag> pattern = []>
: SOPC_Base<op, SSrc_32, SSrc_32, opName, pattern>;		: SOPC_Base<op, SSrc_b32, SSrc_b32, opName, pattern>;

class SOPC_64_32<bits<7> op, string opName, list<dag> pattern = []>		class SOPC_64_32<bits<7> op, string opName, list<dag> pattern = []>
: SOPC_Base<op, SSrc_64, SSrc_32, opName, pattern>;		: SOPC_Base<op, SSrc_b64, SSrc_b32, opName, pattern>;


def S_CMP_EQ_I32 : SOPC_CMP_32 <0x00, "s_cmp_eq_i32", COND_EQ>;		def S_CMP_EQ_I32 : SOPC_CMP_32 <0x00, "s_cmp_eq_i32", COND_EQ>;
def S_CMP_LG_I32 : SOPC_CMP_32 <0x01, "s_cmp_lg_i32", COND_NE>;		def S_CMP_LG_I32 : SOPC_CMP_32 <0x01, "s_cmp_lg_i32", COND_NE>;
def S_CMP_GT_I32 : SOPC_CMP_32 <0x02, "s_cmp_gt_i32", COND_SGT>;		def S_CMP_GT_I32 : SOPC_CMP_32 <0x02, "s_cmp_gt_i32", COND_SGT>;
def S_CMP_GE_I32 : SOPC_CMP_32 <0x03, "s_cmp_ge_i32", COND_SGE>;		def S_CMP_GE_I32 : SOPC_CMP_32 <0x03, "s_cmp_ge_i32", COND_SGE>;
def S_CMP_LT_I32 : SOPC_CMP_32 <0x04, "s_cmp_lt_i32", COND_SLT>;		def S_CMP_LT_I32 : SOPC_CMP_32 <0x04, "s_cmp_lt_i32", COND_SLT>;
def S_CMP_LE_I32 : SOPC_CMP_32 <0x05, "s_cmp_le_i32", COND_SLE>;		def S_CMP_LE_I32 : SOPC_CMP_32 <0x05, "s_cmp_le_i32", COND_SLE>;
▲ Show 20 Lines • Show All 480 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h

	Show All 13 Lines
	#include "llvm/IR/CallingConv.h"			#include "llvm/IR/CallingConv.h"

	namespace llvm {			namespace llvm {

	class FeatureBitset;			class FeatureBitset;
	class Function;			class Function;
	class GlobalValue;			class GlobalValue;
	class MCContext;			class MCContext;
				class MCInstrDesc;
				class MCRegisterInfo;
	class MCSection;			class MCSection;
	class MCSubtargetInfo;			class MCSubtargetInfo;

	namespace AMDGPU {			namespace AMDGPU {

	struct IsaVersion {			struct IsaVersion {
	unsigned Major;			unsigned Major;
	unsigned Minor;			unsigned Minor;
	▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines
	bool isSI(const MCSubtargetInfo &STI);			bool isSI(const MCSubtargetInfo &STI);
	bool isCI(const MCSubtargetInfo &STI);			bool isCI(const MCSubtargetInfo &STI);
	bool isVI(const MCSubtargetInfo &STI);			bool isVI(const MCSubtargetInfo &STI);

	/// If \p Reg is a pseudo reg, return the correct hardware register given			/// If \p Reg is a pseudo reg, return the correct hardware register given
	/// \p STI otherwise return \p Reg.			/// \p STI otherwise return \p Reg.
	unsigned getMCReg(unsigned Reg, const MCSubtargetInfo &STI);			unsigned getMCReg(unsigned Reg, const MCSubtargetInfo &STI);

				/// \brief Can this operand also contain immediate values?
				bool isSISrcOperand(const MCInstrDesc &Desc, unsigned OpNo);

				/// \brief Is this floating-point operand?
				bool isSISrcFPOperand(const MCInstrDesc &Desc, unsigned OpNo);

				/// \brief Does this opearnd support only inlinable literals?
				bool isSISrcInlinableOperand(const MCInstrDesc &Desc, unsigned OpNo);

				/// \brief Get size of register operand
				unsigned getRegOperandSize(const MCRegisterInfo *MRI, const MCInstrDesc &Desc,
				unsigned OpNo);

				/// \brief Is this literal inlinable
				bool isInlinableLiteral64(int64_t Literal, bool IsVI);
				bool isInlinableLiteral32(int32_t Literal, bool IsVI);

	} // end namespace AMDGPU			} // end namespace AMDGPU
	} // end namespace llvm			} // end namespace llvm

	#endif			#endif

llvm/trunk/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp

//===-- AMDGPUBaseInfo.cpp - AMDGPU Base encoding information--------------===//		//===-- AMDGPUBaseInfo.cpp - AMDGPU Base encoding information--------------===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
#include "AMDGPUBaseInfo.h"		#include "AMDGPUBaseInfo.h"
#include "AMDGPU.h"		#include "AMDGPU.h"
		#include "SIDefines.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/IR/GlobalValue.h"		#include "llvm/IR/GlobalValue.h"
#include "llvm/MC/MCContext.h"		#include "llvm/MC/MCContext.h"
		#include "llvm/MC/MCInstrInfo.h"
		#include "llvm/MC/MCRegisterInfo.h"
#include "llvm/MC/MCSectionELF.h"		#include "llvm/MC/MCSectionELF.h"
#include "llvm/MC/MCSubtargetInfo.h"		#include "llvm/MC/MCSubtargetInfo.h"
#include "llvm/MC/SubtargetFeature.h"		#include "llvm/MC/SubtargetFeature.h"

#define GET_SUBTARGETINFO_ENUM		#define GET_SUBTARGETINFO_ENUM
#include "AMDGPUGenSubtargetInfo.inc"		#include "AMDGPUGenSubtargetInfo.inc"
#undef GET_SUBTARGETINFO_ENUM		#undef GET_SUBTARGETINFO_ENUM

▲ Show 20 Lines • Show All 172 Lines • ▼ Show 20 Lines	unsigned getMCReg(unsigned Reg, const MCSubtargetInfo &STI) {

case AMDGPU::FLAT_SCR_HI:		case AMDGPU::FLAT_SCR_HI:
assert(!isSI(STI));		assert(!isSI(STI));
return isCI(STI) ? AMDGPU::FLAT_SCR_HI_ci : AMDGPU::FLAT_SCR_HI_vi;		return isCI(STI) ? AMDGPU::FLAT_SCR_HI_ci : AMDGPU::FLAT_SCR_HI_vi;
}		}
return Reg;		return Reg;
}		}

		bool isSISrcOperand(const MCInstrDesc &Desc, unsigned OpNo) {
		unsigned OpType = Desc.OpInfo[OpNo].OperandType;

		return OpType == AMDGPU::OPERAND_REG_IMM32_INT \|\|
		OpType == AMDGPU::OPERAND_REG_IMM32_FP \|\|
		OpType == AMDGPU::OPERAND_REG_INLINE_C_INT \|\|
		OpType == AMDGPU::OPERAND_REG_INLINE_C_FP;
		}

		bool isSISrcFPOperand(const MCInstrDesc &Desc, unsigned OpNo) {
		unsigned OpType = Desc.OpInfo[OpNo].OperandType;

		return OpType == AMDGPU::OPERAND_REG_IMM32_FP \|\|
		OpType == AMDGPU::OPERAND_REG_INLINE_C_FP;
		}

		bool isSISrcInlinableOperand(const MCInstrDesc &Desc, unsigned OpNo) {
		unsigned OpType = Desc.OpInfo[OpNo].OperandType;

		return OpType == AMDGPU::OPERAND_REG_INLINE_C_INT \|\|
		OpType == AMDGPU::OPERAND_REG_INLINE_C_FP;
		}

		unsigned getRegOperandSize(const MCRegisterInfo *MRI, const MCInstrDesc &Desc,
		unsigned OpNo) {
		int RCID = Desc.OpInfo[OpNo].RegClass;
		const MCRegisterClass &RC = MRI->getRegClass(RCID);
		return RC.getSize();
		}

		bool isInlinableLiteral64(int64_t Literal, bool IsVI) {
		if (Literal >= -16 && Literal <= 64)
		return true;

		double D = BitsToDouble(Literal);

		if (D == 0.5 \|\| D == -0.5 \|\|
		D == 1.0 \|\| D == -1.0 \|\|
		D == 2.0 \|\| D == -2.0 \|\|
		D == 4.0 \|\| D == -4.0)
		return true;

		if (IsVI && Literal == 0x3fc45f306dc9c882)
		return true;

		return false;
		}

		bool isInlinableLiteral32(int32_t Literal, bool IsVI) {
		if (Literal >= -16 && Literal <= 64)
		return true;

		float F = BitsToFloat(Literal);

		if (F == 0.5 \|\| F == -0.5 \|\|
		F == 1.0 \|\| F == -1.0 \|\|
		F == 2.0 \|\| F == -2.0 \|\|
		F == 4.0 \|\| F == -4.0)
		return true;

		if (IsVI && Literal == 0x3e22f983)
		return true;

		return false;
		}


} // End namespace AMDGPU		} // End namespace AMDGPU
} // End namespace llvm		} // End namespace llvm

llvm/trunk/lib/Target/AMDGPU/VIInstructions.td

	Show First 20 Lines • Show All 84 Lines • ▼ Show 20 Lines
	}			}
	} // let DisableSIDecoder = 1			} // let DisableSIDecoder = 1

	// Aliases to simplify matching of floating-point instructions that			// Aliases to simplify matching of floating-point instructions that
	// are VOP2 on SI and VOP3 on VI.			// are VOP2 on SI and VOP3 on VI.

	class SI2_VI3Alias <string name, Instruction inst> : InstAlias <			class SI2_VI3Alias <string name, Instruction inst> : InstAlias <
	name#" $dst, $src0, $src1",			name#" $dst, $src0, $src1",
	(inst VGPR_32:$dst, 0, VCSrc_32:$src0, 0, VCSrc_32:$src1, 0, 0)			(inst VGPR_32:$dst, 0, VCSrc_f32:$src0, 0, VCSrc_f32:$src1, 0, 0)
	>, PredicateControl {			>, PredicateControl {
	let UseInstAsmMatchConverter = 0;			let UseInstAsmMatchConverter = 0;
	let AsmVariantName = AMDGPUAsmVariants.VOP3;			let AsmVariantName = AMDGPUAsmVariants.VOP3;
	}			}

	def : SI2_VI3Alias <"v_ldexp_f32", V_LDEXP_F32_e64_vi>;			def : SI2_VI3Alias <"v_ldexp_f32", V_LDEXP_F32_e64_vi>;
	def : SI2_VI3Alias <"v_cvt_pkaccum_u8_f32", V_CVT_PKACCUM_U8_F32_e64_vi>;			def : SI2_VI3Alias <"v_cvt_pkaccum_u8_f32", V_CVT_PKACCUM_U8_F32_e64_vi>;
	def : SI2_VI3Alias <"v_cvt_pknorm_i16_f32", V_CVT_PKNORM_I16_F32_e64_vi>;			def : SI2_VI3Alias <"v_cvt_pknorm_i16_f32", V_CVT_PKNORM_I16_F32_e64_vi>;
	Show All 23 Lines

llvm/trunk/test/MC/AMDGPU/literals.s

				// RUN: not llvm-mc -arch=amdgcn -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=SI --check-prefix=SICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=SI -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=SI --check-prefix=SICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=bonaire -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=SICI --check-prefix=CIVI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=tonga -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=CIVI --check-prefix=VI

				// RUN: not llvm-mc -arch=amdgcn -show-encoding %s 2>&1 \| FileCheck %s --check-prefix=NOSI --check-prefix=NOSICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=SI -show-encoding %s 2>&1 \| FileCheck %s --check-prefix=NOSI --check-prefix=NOSICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=bonaire -show-encoding %s 2>&1 \| FileCheck %s --check-prefix=NOSICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=tonga -show-encoding %s 2>&1 \| FileCheck %s -check-prefix=NOVI

				//---------------------------------------------------------------------------//
				// fp literal, expected fp operand
				//---------------------------------------------------------------------------//

				// SICI: v_fract_f64_e32 v[0:1], 0.5 ; encoding: [0xf0,0x7c,0x00,0x7e]
				// VI: v_fract_f64_e32 v[0:1], 0.5 ; encoding: [0xf0,0x64,0x00,0x7e]
				v_fract_f64 v[0:1], 0.5

				// SICI: v_sqrt_f64_e32 v[0:1], -4.0 ; encoding: [0xf7,0x68,0x00,0x7e]
				// VI: v_sqrt_f64_e32 v[0:1], -4.0 ; encoding: [0xf7,0x50,0x00,0x7e]
				v_sqrt_f64 v[0:1], -4.0

				// SICI: v_log_clamp_f32_e32 v1, 0.5 ; encoding: [0xf0,0x4c,0x02,0x7e]
				// NOVI: error: instruction not supported on this GPU
				v_log_clamp_f32 v1, 0.5

				// SICI: v_fract_f64_e32 v[0:1], 0.5 ; encoding: [0xf0,0x7c,0x00,0x7e]
				// VI: v_fract_f64_e32 v[0:1], 0.5 ; encoding: [0xf0,0x64,0x00,0x7e]
				v_fract_f64 v[0:1], 0.5

				// SICI: v_trunc_f32_e32 v0, 0.5 ; encoding: [0xf0,0x42,0x00,0x7e]
				// VI: v_trunc_f32_e32 v0, 0.5 ; encoding: [0xf0,0x38,0x00,0x7e]
				v_trunc_f32 v0, 0.5

				// SICI: v_fract_f64_e32 v[0:1], -1.0 ; encoding: [0xf3,0x7c,0x00,0x7e]
				// VI: v_fract_f64_e32 v[0:1], -1.0 ; encoding: [0xf3,0x64,0x00,0x7e]
				v_fract_f64 v[0:1], -1.0

				// SICI: v_trunc_f32_e32 v0, -1.0 ; encoding: [0xf3,0x42,0x00,0x7e]
				// VI: v_trunc_f32_e32 v0, -1.0 ; encoding: [0xf3,0x38,0x00,0x7e]
				v_trunc_f32 v0, -1.0

				// SICI: v_fract_f64_e32 v[0:1], 4.0 ; encoding: [0xf6,0x7c,0x00,0x7e]
				// VI: v_fract_f64_e32 v[0:1], 4.0 ; encoding: [0xf6,0x64,0x00,0x7e]
				v_fract_f64 v[0:1], 4.0

				// SICI: v_trunc_f32_e32 v0, 4.0 ; encoding: [0xf6,0x42,0x00,0x7e]
				// VI: v_trunc_f32_e32 v0, 4.0 ; encoding: [0xf6,0x38,0x00,0x7e]
				v_trunc_f32 v0, 4.0

				// SICI: v_fract_f64_e32 v[0:1], 0 ; encoding: [0x80,0x7c,0x00,0x7e]
				// VI: v_fract_f64_e32 v[0:1], 0 ; encoding: [0x80,0x64,0x00,0x7e]
				v_fract_f64 v[0:1], 0.0

				// SICI: v_trunc_f32_e32 v0, 0 ; encoding: [0x80,0x42,0x00,0x7e]
				// VI: v_trunc_f32_e32 v0, 0 ; encoding: [0x80,0x38,0x00,0x7e]
				v_trunc_f32 v0, 0.0

				// SICI: v_fract_f64_e32 v[0:1], 0x3ff80000 ; encoding: [0xff,0x7c,0x00,0x7e,0x00,0x00,0xf8,0x3f]
				// VI: v_fract_f64_e32 v[0:1], 0x3ff80000 ; encoding: [0xff,0x64,0x00,0x7e,0x00,0x00,0xf8,0x3f]
				v_fract_f64 v[0:1], 1.5

				// SICI: v_trunc_f32_e32 v0, 0x3fc00000 ; encoding: [0xff,0x42,0x00,0x7e,0x00,0x00,0xc0,0x3f]
				// VI: v_trunc_f32_e32 v0, 0x3fc00000 ; encoding: [0xff,0x38,0x00,0x7e,0x00,0x00,0xc0,0x3f]
				v_trunc_f32 v0, 1.5

				// SICI: v_fract_f64_e32 v[0:1], 0xc00921ca ; encoding: [0xff,0x7c,0x00,0x7e,0xca,0x21,0x09,0xc0]
				// VI: v_fract_f64_e32 v[0:1], 0xc00921ca ; encoding: [0xff,0x64,0x00,0x7e,0xca,0x21,0x09,0xc0]
				v_fract_f64 v[0:1], -3.1415

				// SICI: v_trunc_f32_e32 v0, 0xc0490e56 ; encoding: [0xff,0x42,0x00,0x7e,0x56,0x0e,0x49,0xc0]
				// VI: v_trunc_f32_e32 v0, 0xc0490e56 ; encoding: [0xff,0x38,0x00,0x7e,0x56,0x0e,0x49,0xc0]
				v_trunc_f32 v0, -3.1415

				// SICI: v_fract_f64_e32 v[0:1], 0x44b52d02 ; encoding: [0xff,0x7c,0x00,0x7e,0x02,0x2d,0xb5,0x44]
				// VI: v_fract_f64_e32 v[0:1], 0x44b52d02 ; encoding: [0xff,0x64,0x00,0x7e,0x02,0x2d,0xb5,0x44]
				v_fract_f64 v[0:1], 100000000000000000000000.0

				// SICI: v_trunc_f32_e32 v0, 0x65a96816 ; encoding: [0xff,0x42,0x00,0x7e,0x16,0x68,0xa9,0x65]
				// VI: v_trunc_f32_e32 v0, 0x65a96816 ; encoding: [0xff,0x38,0x00,0x7e,0x16,0x68,0xa9,0x65]
				v_trunc_f32 v0, 100000000000000000000000.0

				// SICI: v_fract_f64_e32 v[0:1], 0x416312d0 ; encoding: [0xff,0x7c,0x00,0x7e,0xd0,0x12,0x63,0x41]
				// VI: v_fract_f64_e32 v[0:1], 0x416312d0 ; encoding: [0xff,0x64,0x00,0x7e,0xd0,0x12,0x63,0x41]
				v_fract_f64 v[0:1], 10000000.0

				// SICI: v_trunc_f32_e32 v0, 0x4b189680 ; encoding: [0xff,0x42,0x00,0x7e,0x80,0x96,0x18,0x4b]
				// VI: v_trunc_f32_e32 v0, 0x4b189680 ; encoding: [0xff,0x38,0x00,0x7e,0x80,0x96,0x18,0x4b]
				v_trunc_f32 v0, 10000000.0

				// SICI: v_fract_f64_e32 v[0:1], 0x47efffff ; encoding: [0xff,0x7c,0x00,0x7e,0xff,0xff,0xef,0x47]
				// VI: v_fract_f64_e32 v[0:1], 0x47efffff ; encoding: [0xff,0x64,0x00,0x7e,0xff,0xff,0xef,0x47]
				v_fract_f64 v[0:1], 3.402823e+38

				// SICI: v_trunc_f32_e32 v0, 0x7f7ffffd ; encoding: [0xff,0x42,0x00,0x7e,0xfd,0xff,0x7f,0x7f]
				// VI: v_trunc_f32_e32 v0, 0x7f7ffffd ; encoding: [0xff,0x38,0x00,0x7e,0xfd,0xff,0x7f,0x7f]
				v_trunc_f32 v0, 3.402823e+38

				// SICI: v_fract_f64_e32 v[0:1], 0x381fffff ; encoding: [0xff,0x7c,0x00,0x7e,0xff,0xff,0x1f,0x38]
				// VI: v_fract_f64_e32 v[0:1], 0x381fffff ; encoding: [0xff,0x64,0x00,0x7e,0xff,0xff,0x1f,0x38]
				v_fract_f64 v[0:1], 2.3509886e-38

				// SICI: v_trunc_f32_e32 v0, 0xffffff ; encoding: [0xff,0x42,0x00,0x7e,0xff,0xff,0xff,0x00]
				// VI: v_trunc_f32_e32 v0, 0xffffff ; encoding: [0xff,0x38,0x00,0x7e,0xff,0xff,0xff,0x00]
				v_trunc_f32 v0, 2.3509886e-38

				// SICI: v_fract_f64_e32 v[0:1], 0x3179f623 ; encoding: [0xff,0x7c,0x00,0x7e,0x23,0xf6,0x79,0x31]
				// VI: v_fract_f64_e32 v[0:1], 0x3179f623 ; encoding: [0xff,0x64,0x00,0x7e,0x23,0xf6,0x79,0x31]
				v_fract_f64 v[0:1], 2.3509886e-70

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_trunc_f32 v0, 2.3509886e-70

				//---------------------------------------------------------------------------//
				// fp literal, expected int operand
				//---------------------------------------------------------------------------//

				// SICI: s_mov_b64 s[0:1], 0.5 ; encoding: [0xf0,0x04,0x80,0xbe]
				// VI: s_mov_b64 s[0:1], 0.5 ; encoding: [0xf0,0x01,0x80,0xbe]
				s_mov_b64_e32 s[0:1], 0.5

				// SICI: v_and_b32_e32 v0, 0.5, v1 ; encoding: [0xf0,0x02,0x00,0x36]
				// VI: v_and_b32_e32 v0, 0.5, v1 ; encoding: [0xf0,0x02,0x00,0x26]
				v_and_b32_e32 v0, 0.5, v1

				// SICI: v_and_b32_e64 v0, 0.5, v1 ; encoding: [0x00,0x00,0x36,0xd2,0xf0,0x02,0x02,0x00]
				// VI: v_and_b32_e64 v0, 0.5, v1 ; encoding: [0x00,0x00,0x13,0xd1,0xf0,0x02,0x02,0x00]
				v_and_b32_e64 v0, 0.5, v1

				// SICI: s_mov_b64 s[0:1], -1.0 ; encoding: [0xf3,0x04,0x80,0xbe]
				// VI: s_mov_b64 s[0:1], -1.0 ; encoding: [0xf3,0x01,0x80,0xbe]
				s_mov_b64_e32 s[0:1], -1.0

				// SICI: v_and_b32_e32 v0, -1.0, v1 ; encoding: [0xf3,0x02,0x00,0x36]
				// VI: v_and_b32_e32 v0, -1.0, v1 ; encoding: [0xf3,0x02,0x00,0x26]
				v_and_b32_e32 v0, -1.0, v1

				// SICI: v_and_b32_e64 v0, -1.0, v1 ; encoding: [0x00,0x00,0x36,0xd2,0xf3,0x02,0x02,0x00]
				// VI: v_and_b32_e64 v0, -1.0, v1 ; encoding: [0x00,0x00,0x13,0xd1,0xf3,0x02,0x02,0x00]
				v_and_b32_e64 v0, -1.0, v1

				// SICI: s_mov_b64 s[0:1], 4.0 ; encoding: [0xf6,0x04,0x80,0xbe]
				// VI: s_mov_b64 s[0:1], 4.0 ; encoding: [0xf6,0x01,0x80,0xbe]
				s_mov_b64_e32 s[0:1], 4.0

				// SICI: v_and_b32_e32 v0, 4.0, v1 ; encoding: [0xf6,0x02,0x00,0x36]
				// VI: v_and_b32_e32 v0, 4.0, v1 ; encoding: [0xf6,0x02,0x00,0x26]
				v_and_b32_e32 v0, 4.0, v1

				// SICI: v_and_b32_e64 v0, 4.0, v1 ; encoding: [0x00,0x00,0x36,0xd2,0xf6,0x02,0x02,0x00]
				// VI: v_and_b32_e64 v0, 4.0, v1 ; encoding: [0x00,0x00,0x13,0xd1,0xf6,0x02,0x02,0x00]
				v_and_b32_e64 v0, 4.0, v1

				// SICI: s_mov_b64 s[0:1], 0 ; encoding: [0x80,0x04,0x80,0xbe]
				// VI: s_mov_b64 s[0:1], 0 ; encoding: [0x80,0x01,0x80,0xbe]
				s_mov_b64_e32 s[0:1], 0.0

				// SICI: v_and_b32_e32 v0, 0, v1 ; encoding: [0x80,0x02,0x00,0x36]
				// VI: v_and_b32_e32 v0, 0, v1 ; encoding: [0x80,0x02,0x00,0x26]
				v_and_b32_e32 v0, 0.0, v1

				// SICI: v_and_b32_e64 v0, 0, v1 ; encoding: [0x00,0x00,0x36,0xd2,0x80,0x02,0x02,0x00]
				// VI: v_and_b32_e64 v0, 0, v1 ; encoding: [0x00,0x00,0x13,0xd1,0x80,0x02,0x02,0x00]
				v_and_b32_e64 v0, 0.0, v1

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				s_mov_b64_e32 s[0:1], 1.5

				// SICI: v_and_b32_e32 v0, 0x3fc00000, v1 ; encoding: [0xff,0x02,0x00,0x36,0x00,0x00,0xc0,0x3f]
				// VI: v_and_b32_e32 v0, 0x3fc00000, v1 ; encoding: [0xff,0x02,0x00,0x26,0x00,0x00,0xc0,0x3f]
				v_and_b32_e32 v0, 1.5, v1

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				s_mov_b64_e32 s[0:1], -3.1415

				// SICI: v_and_b32_e32 v0, 0xc0490e56, v1 ; encoding: [0xff,0x02,0x00,0x36,0x56,0x0e,0x49,0xc0]
				// VI: v_and_b32_e32 v0, 0xc0490e56, v1 ; encoding: [0xff,0x02,0x00,0x26,0x56,0x0e,0x49,0xc0]
				v_and_b32_e32 v0, -3.1415, v1

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				s_mov_b64_e32 s[0:1], 100000000000000000000000.0

				// SICI: v_and_b32_e32 v0, 0x65a96816, v1 ; encoding: [0xff,0x02,0x00,0x36,0x16,0x68,0xa9,0x65]
				// VI: v_and_b32_e32 v0, 0x65a96816, v1 ; encoding: [0xff,0x02,0x00,0x26,0x16,0x68,0xa9,0x65]
				v_and_b32_e32 v0, 100000000000000000000000.0, v1

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				s_mov_b64_e32 s[0:1], 10000000.0

				// SICI: v_and_b32_e32 v0, 0x4b189680, v1 ; encoding: [0xff,0x02,0x00,0x36,0x80,0x96,0x18,0x4b]
				// VI: v_and_b32_e32 v0, 0x4b189680, v1 ; encoding: [0xff,0x02,0x00,0x26,0x80,0x96,0x18,0x4b]
				v_and_b32_e32 v0, 10000000.0, v1

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				s_mov_b64_e32 s[0:1], 3.402823e+38

				// SICI: v_and_b32_e32 v0, 0x7f7ffffd, v1 ; encoding: [0xff,0x02,0x00,0x36,0xfd,0xff,0x7f,0x7f]
				// VI: v_and_b32_e32 v0, 0x7f7ffffd, v1 ; encoding: [0xff,0x02,0x00,0x26,0xfd,0xff,0x7f,0x7f]
				v_and_b32_e32 v0, 3.402823e+38, v1

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				s_mov_b64_e32 s[0:1], 2.3509886e-38

				// SICI: v_and_b32_e32 v0, 0xffffff, v1 ; encoding: [0xff,0x02,0x00,0x36,0xff,0xff,0xff,0x00]
				// VI: v_and_b32_e32 v0, 0xffffff, v1 ; encoding: [0xff,0x02,0x00,0x26,0xff,0xff,0xff,0x00]
				v_and_b32_e32 v0, 2.3509886e-38, v1

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				s_mov_b64_e32 s[0:1], 2.3509886e-70

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_and_b32_e32 v0, 2.3509886e-70, v1

				//---------------------------------------------------------------------------//
				// int literal, expected fp operand
				//---------------------------------------------------------------------------//

				// SICI: v_trunc_f32_e32 v0, 0 ; encoding: [0x80,0x42,0x00,0x7e]
				// VI: v_trunc_f32_e32 v0, 0 ; encoding: [0x80,0x38,0x00,0x7e]
				v_trunc_f32_e32 v0, 0

				// SICI: v_fract_f64_e32 v[0:1], 0 ; encoding: [0x80,0x7c,0x00,0x7e]
				// VI: v_fract_f64_e32 v[0:1], 0 ; encoding: [0x80,0x64,0x00,0x7e]
				v_fract_f64_e32 v[0:1], 0

				// SICI: v_trunc_f32_e64 v0, 0 ; encoding: [0x00,0x00,0x42,0xd3,0x80,0x00,0x00,0x00]
				// VI: v_trunc_f32_e64 v0, 0 ; encoding: [0x00,0x00,0x5c,0xd1,0x80,0x00,0x00,0x00]
				v_trunc_f32_e64 v0, 0

				// SICI: v_fract_f64_e64 v[0:1], 0 ; encoding: [0x00,0x00,0x7c,0xd3,0x80,0x00,0x00,0x00]
				// VI: v_fract_f64_e64 v[0:1], 0 ; encoding: [0x00,0x00,0x72,0xd1,0x80,0x00,0x00,0x00]
				v_fract_f64_e64 v[0:1], 0

				// SICI: v_trunc_f32_e32 v0, -13 ; encoding: [0xcd,0x42,0x00,0x7e]
				// VI: v_trunc_f32_e32 v0, -13 ; encoding: [0xcd,0x38,0x00,0x7e]
				v_trunc_f32_e32 v0, -13

				// SICI: v_fract_f64_e32 v[0:1], -13 ; encoding: [0xcd,0x7c,0x00,0x7e]
				// VI: v_fract_f64_e32 v[0:1], -13 ; encoding: [0xcd,0x64,0x00,0x7e]
				v_fract_f64_e32 v[0:1], -13

				// SICI: v_trunc_f32_e64 v0, -13 ; encoding: [0x00,0x00,0x42,0xd3,0x8d,0x00,0x00,0x20]
				// VI: v_trunc_f32_e64 v0, -13 ; encoding: [0x00,0x00,0x5c,0xd1,0x8d,0x00,0x00,0x20]
				v_trunc_f32_e64 v0, -13

				// SICI: v_fract_f64_e64 v[0:1], -13 ; encoding: [0x00,0x00,0x7c,0xd3,0x8d,0x00,0x00,0x20]
				// VI: v_fract_f64_e64 v[0:1], -13 ; encoding: [0x00,0x00,0x72,0xd1,0x8d,0x00,0x00,0x20]
				v_fract_f64_e64 v[0:1], -13

				// SICI: v_trunc_f32_e32 v0, 35 ; encoding: [0xa3,0x42,0x00,0x7e]
				// VI: v_trunc_f32_e32 v0, 35 ; encoding: [0xa3,0x38,0x00,0x7e]
				v_trunc_f32_e32 v0, 35

				// SICI: v_fract_f64_e32 v[0:1], 35 ; encoding: [0xa3,0x7c,0x00,0x7e]
				// VI: v_fract_f64_e32 v[0:1], 35 ; encoding: [0xa3,0x64,0x00,0x7e]
				v_fract_f64_e32 v[0:1], 35

				// SICI: v_trunc_f32_e64 v0, 35 ; encoding: [0x00,0x00,0x42,0xd3,0xa3,0x00,0x00,0x00]
				// VI: v_trunc_f32_e64 v0, 35 ; encoding: [0x00,0x00,0x5c,0xd1,0xa3,0x00,0x00,0x00]
				v_trunc_f32_e64 v0, 35

				// SICI: v_fract_f64_e64 v[0:1], 35 ; encoding: [0x00,0x00,0x7c,0xd3,0xa3,0x00,0x00,0x00]
				// VI: v_fract_f64_e64 v[0:1], 35 ; encoding: [0x00,0x00,0x72,0xd1,0xa3,0x00,0x00,0x00]
				v_fract_f64_e64 v[0:1], 35

				// SICI: v_trunc_f32_e32 v0, 0x4d2 ; encoding: [0xff,0x42,0x00,0x7e,0xd2,0x04,0x00,0x00]
				// VI: v_trunc_f32_e32 v0, 0x4d2 ; encoding: [0xff,0x38,0x00,0x7e,0xd2,0x04,0x00,0x00]
				v_trunc_f32_e32 v0, 1234

				// SICI: v_fract_f64_e32 v[0:1], 0x4d2 ; encoding: [0xff,0x7c,0x00,0x7e,0xd2,0x04,0x00,0x00]
				// VI: v_fract_f64_e32 v[0:1], 0x4d2 ; encoding: [0xff,0x64,0x00,0x7e,0xd2,0x04,0x00,0x00]
				v_fract_f64_e32 v[0:1], 1234

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_trunc_f32_e64 v0, 1234

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_fract_f64_e64 v[0:1], 1234

				// SICI: v_trunc_f32_e32 v0, 0xffff2bcf ; encoding: [0xff,0x42,0x00,0x7e,0xcf,0x2b,0xff,0xff]
				// VI: v_trunc_f32_e32 v0, 0xffff2bcf ; encoding: [0xff,0x38,0x00,0x7e,0xcf,0x2b,0xff,0xff]
				v_trunc_f32_e32 v0, -54321

				// SICI: v_fract_f64_e32 v[0:1], 0xffff2bcf ; encoding: [0xff,0x7c,0x00,0x7e,0xcf,0x2b,0xff,0xff]
				// VI: v_fract_f64_e32 v[0:1], 0xffff2bcf ; encoding: [0xff,0x64,0x00,0x7e,0xcf,0x2b,0xff,0xff]
				v_fract_f64_e32 v[0:1], -54321

				// SICI: v_trunc_f32_e32 v0, 0xdeadbeef ; encoding: [0xff,0x42,0x00,0x7e,0xef,0xbe,0xad,0xde]
				// VI: v_trunc_f32_e32 v0, 0xdeadbeef ; encoding: [0xff,0x38,0x00,0x7e,0xef,0xbe,0xad,0xde]
				v_trunc_f32_e32 v0, 0xdeadbeef

				// SICI: v_fract_f64_e32 v[0:1], 0xdeadbeef ; encoding: [0xff,0x7c,0x00,0x7e,0xef,0xbe,0xad,0xde]
				// VI: v_fract_f64_e32 v[0:1], 0xdeadbeef ; encoding: [0xff,0x64,0x00,0x7e,0xef,0xbe,0xad,0xde]
				v_fract_f64_e32 v[0:1], 0xdeadbeef

				// SICI: v_trunc_f32_e32 v0, -1 ; encoding: [0xc1,0x42,0x00,0x7e]
				// VI: v_trunc_f32_e32 v0, -1 ; encoding: [0xc1,0x38,0x00,0x7e]
				v_trunc_f32_e32 v0, 0xffffffff

				// SICI: v_fract_f64_e32 v[0:1], 0xffffffff ; encoding: [0xff,0x7c,0x00,0x7e,0xff,0xff,0xff,0xff]
				// VI: v_fract_f64_e32 v[0:1], 0xffffffff ; encoding: [0xff,0x64,0x00,0x7e,0xff,0xff,0xff,0xff]
				v_fract_f64_e32 v[0:1], 0xffffffff

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_trunc_f32_e32 v0, 0x123456789abcdef0

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_fract_f64_e32 v[0:1], 0x123456789abcdef0

				// SICI: v_trunc_f32_e32 v0, -1 ; encoding: [0xc1,0x42,0x00,0x7e]
				// VI: v_trunc_f32_e32 v0, -1 ; encoding: [0xc1,0x38,0x00,0x7e]
				v_trunc_f32_e32 v0, 0xffffffffffffffff

				// SICI: v_fract_f64_e32 v[0:1], -1 ; encoding: [0xc1,0x7c,0x00,0x7e]
				// VI: v_fract_f64_e32 v[0:1], -1 ; encoding: [0xc1,0x64,0x00,0x7e]
				v_fract_f64_e32 v[0:1], 0xffffffffffffffff

				//---------------------------------------------------------------------------//
				// int literal, expected int operand
				//---------------------------------------------------------------------------//

				// SICI: s_mov_b64 s[0:1], 0 ; encoding: [0x80,0x04,0x80,0xbe]
				// VI: s_mov_b64 s[0:1], 0 ; encoding: [0x80,0x01,0x80,0xbe]
				s_mov_b64_e32 s[0:1], 0

				// SICI: v_and_b32_e32 v0, 0, v1 ; encoding: [0x80,0x02,0x00,0x36]
				// VI: v_and_b32_e32 v0, 0, v1 ; encoding: [0x80,0x02,0x00,0x26]
				v_and_b32_e32 v0, 0, v1

				// SICI: v_and_b32_e64 v0, 0, v1 ; encoding: [0x00,0x00,0x36,0xd2,0x80,0x02,0x02,0x00]
				// VI: v_and_b32_e64 v0, 0, v1 ; encoding: [0x00,0x00,0x13,0xd1,0x80,0x02,0x02,0x00]
				v_and_b32_e64 v0, 0, v1

				// SICI: s_mov_b64 s[0:1], -13 ; encoding: [0xcd,0x04,0x80,0xbe]
				// VI: s_mov_b64 s[0:1], -13 ; encoding: [0xcd,0x01,0x80,0xbe]
				s_mov_b64_e32 s[0:1], -13

				// SICI: v_and_b32_e32 v0, -13, v1 ; encoding: [0xcd,0x02,0x00,0x36]
				// VI: v_and_b32_e32 v0, -13, v1 ; encoding: [0xcd,0x02,0x00,0x26]
				v_and_b32_e32 v0, -13, v1

				// SICI: v_and_b32_e64 v0, -13, v1 ; encoding: [0x00,0x00,0x36,0xd2,0xcd,0x02,0x02,0x00]
				// VI: v_and_b32_e64 v0, -13, v1 ; encoding: [0x00,0x00,0x13,0xd1,0xcd,0x02,0x02,0x00]
				v_and_b32_e64 v0, -13, v1

				// SICI: s_mov_b64 s[0:1], 35 ; encoding: [0xa3,0x04,0x80,0xbe]
				// VI: s_mov_b64 s[0:1], 35 ; encoding: [0xa3,0x01,0x80,0xbe]
				s_mov_b64_e32 s[0:1], 35

				// SICI: v_and_b32_e32 v0, 35, v1 ; encoding: [0xa3,0x02,0x00,0x36]
				// VI: v_and_b32_e32 v0, 35, v1 ; encoding: [0xa3,0x02,0x00,0x26]
				v_and_b32_e32 v0, 35, v1

				// SICI: v_and_b32_e64 v0, 35, v1 ; encoding: [0x00,0x00,0x36,0xd2,0xa3,0x02,0x02,0x00]
				// VI: v_and_b32_e64 v0, 35, v1 ; encoding: [0x00,0x00,0x13,0xd1,0xa3,0x02,0x02,0x00]
				v_and_b32_e64 v0, 35, v1

				// SICI: s_mov_b64 s[0:1], 0x4d2 ; encoding: [0xff,0x04,0x80,0xbe,0xd2,0x04,0x00,0x00]
				// VI: s_mov_b64 s[0:1], 0x4d2 ; encoding: [0xff,0x01,0x80,0xbe,0xd2,0x04,0x00,0x00]
				s_mov_b64_e32 s[0:1], 1234

				// SICI: v_and_b32_e32 v0, 0x4d2, v1 ; encoding: [0xff,0x02,0x00,0x36,0xd2,0x04,0x00,0x00]
				// VI: v_and_b32_e32 v0, 0x4d2, v1 ; encoding: [0xff,0x02,0x00,0x26,0xd2,0x04,0x00,0x00]
				v_and_b32_e32 v0, 1234, v1

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_and_b32_e64 v0, 1234, v1

				// SICI: s_mov_b64 s[0:1], 0xffff2bcf ; encoding: [0xff,0x04,0x80,0xbe,0xcf,0x2b,0xff,0xff]
				// VI: s_mov_b64 s[0:1], 0xffff2bcf ; encoding: [0xff,0x01,0x80,0xbe,0xcf,0x2b,0xff,0xff]
				s_mov_b64_e32 s[0:1], -54321

				// SICI: v_and_b32_e32 v0, 0xffff2bcf, v1 ; encoding: [0xff,0x02,0x00,0x36,0xcf,0x2b,0xff,0xff]
				// VI: v_and_b32_e32 v0, 0xffff2bcf, v1 ; encoding: [0xff,0x02,0x00,0x26,0xcf,0x2b,0xff,0xff]
				v_and_b32_e32 v0, -54321, v1

				// SICI: s_mov_b64 s[0:1], 0xdeadbeef ; encoding: [0xff,0x04,0x80,0xbe,0xef,0xbe,0xad,0xde]
				// VI: s_mov_b64 s[0:1], 0xdeadbeef ; encoding: [0xff,0x01,0x80,0xbe,0xef,0xbe,0xad,0xde]
				s_mov_b64_e32 s[0:1], 0xdeadbeef

				// SICI: v_and_b32_e32 v0, 0xdeadbeef, v1 ; encoding: [0xff,0x02,0x00,0x36,0xef,0xbe,0xad,0xde]
				// VI: v_and_b32_e32 v0, 0xdeadbeef, v1 ; encoding: [0xff,0x02,0x00,0x26,0xef,0xbe,0xad,0xde]
				v_and_b32_e32 v0, 0xdeadbeef, v1

				// SICI: s_mov_b64 s[0:1], 0xffffffff ; encoding: [0xff,0x04,0x80,0xbe,0xff,0xff,0xff,0xff]
				// VI: s_mov_b64 s[0:1], 0xffffffff ; encoding: [0xff,0x01,0x80,0xbe,0xff,0xff,0xff,0xff]
				s_mov_b64_e32 s[0:1], 0xffffffff

				// SICI: v_and_b32_e32 v0, -1, v1 ; encoding: [0xc1,0x02,0x00,0x36]
				// VI: v_and_b32_e32 v0, -1, v1 ; encoding: [0xc1,0x02,0x00,0x26]
				v_and_b32_e32 v0, 0xffffffff, v1

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				s_mov_b64_e32 s[0:1], 0x123456789abcdef0

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_and_b32_e32 v0, 0x123456789abcdef0, v1

				// SICI: s_mov_b64 s[0:1], -1 ; encoding: [0xc1,0x04,0x80,0xbe]
				// VI: s_mov_b64 s[0:1], -1 ; encoding: [0xc1,0x01,0x80,0xbe]
				s_mov_b64_e32 s[0:1], 0xffffffffffffffff

				// SICI: v_and_b32_e32 v0, -1, v1 ; encoding: [0xc1,0x02,0x00,0x36]
				// VI: v_and_b32_e32 v0, -1, v1 ; encoding: [0xc1,0x02,0x00,0x26]
				v_and_b32_e32 v0, 0xffffffffffffffff, v1

				//---------------------------------------------------------------------------//
				// 1/(2*PI)
				//---------------------------------------------------------------------------//

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_trunc_f32_e32 v0, 0x3fc45f306dc9c882

				// NOSICI: error: invalid operand for instruction
				// VI: v_fract_f64_e32 v[0:1], 0x3fc45f306dc9c882 ; encoding: [0xf8,0x64,0x00,0x7e]
				v_fract_f64_e32 v[0:1], 0x3fc45f306dc9c882

				// SICI: v_trunc_f32_e32 v0, 0x3e22f983 ; encoding: [0xff,0x42,0x00,0x7e,0x83,0xf9,0x22,0x3e]
				// VI: v_trunc_f32_e32 v0, 0x3e22f983 ; encoding: [0xf8,0x38,0x00,0x7e]
				v_trunc_f32_e32 v0, 0x3e22f983

				// SICI: v_fract_f64_e32 v[0:1], 0x3e22f983 ; encoding: [0xff,0x7c,0x00,0x7e,0x83,0xf9,0x22,0x3e]
				// VI: v_fract_f64_e32 v[0:1], 0x3e22f983 ; encoding: [0xff,0x64,0x00,0x7e,0x83,0xf9,0x22,0x3e]
				v_fract_f64_e32 v[0:1], 0x3e22f983

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_trunc_f32_e64 v0, 0x3fc45f306dc9c882

				// NOSICI: error: invalid operand for instruction
				// VI: v_fract_f64_e64 v[0:1], 0x3fc45f306dc9c882 ; encoding: [0x00,0x00,0x72,0xd1,0xf8,0x00,0x00,0x00]
				v_fract_f64_e64 v[0:1], 0x3fc45f306dc9c882

				// NOSICI: error: invalid operand for instruction
				// VI: v_trunc_f32_e64 v0, 0x3e22f983 ; encoding: [0x00,0x00,0x5c,0xd1,0xf8,0x00,0x00,0x00]
				v_trunc_f32_e64 v0, 0x3e22f983

				// NOSICI: error: invalid operand for instruction
				// NOVI: error: invalid operand for instruction
				v_fract_f64_e64 v[0:1], 0x3e22f983

				// NOSICI: error: invalid operand for instruction
				// VI: s_mov_b64 s[0:1], 0x3fc45f306dc9c882 ; encoding: [0xf8,0x01,0x80,0xbe]
				s_mov_b64_e32 s[0:1], 0.159154943091895317852646485335

				// SICI: v_and_b32_e32 v0, 0x3e22f983, v1 ; encoding: [0xff,0x02,0x00,0x36,0x83,0xf9,0x22,0x3e]
				// VI: v_and_b32_e32 v0, 0x3e22f983, v1 ; encoding: [0xf8,0x02,0x00,0x26]
				v_and_b32_e32 v0, 0.159154943091895317852646485335, v1

				// NOSICI: error: invalid operand for instruction
				// VI: v_and_b32_e64 v0, 0x3e22f983, v1 ; encoding: [0x00,0x00,0x13,0xd1,0xf8,0x02,0x02,0x00]
				v_and_b32_e64 v0, 0.159154943091895317852646485335, v1

				// SICI: v_fract_f64_e32 v[0:1], 0x3fc45f30 ; encoding: [0xff,0x7c,0x00,0x7e,0x30,0x5f,0xc4,0x3f]
				// VI: v_fract_f64_e32 v[0:1], 0x3fc45f306dc9c882 ; encoding: [0xf8,0x64,0x00,0x7e]
				v_fract_f64 v[0:1], 0.159154943091895317852646485335

				// SICI: v_trunc_f32_e32 v0, 0x3e22f983 ; encoding: [0xff,0x42,0x00,0x7e,0x83,0xf9,0x22,0x3e]
				// VI: v_trunc_f32_e32 v0, 0x3e22f983 ; encoding: [0xf8,0x38,0x00,0x7e]
				v_trunc_f32 v0, 0.159154943091895317852646485335
				No newline at end of file

llvm/trunk/test/MC/AMDGPU/reg-syntax-extra.s

	Show All 37 Lines
	s_mov_b64 [tba_lo,tba_hi], s[2:3]			s_mov_b64 [tba_lo,tba_hi], s[2:3]
	// SICI: s_mov_b64 tba, s[2:3] ; encoding: [0x02,0x04,0xec,0xbe]			// SICI: s_mov_b64 tba, s[2:3] ; encoding: [0x02,0x04,0xec,0xbe]
	// VI: s_mov_b64 tba, s[2:3] ; encoding: [0x02,0x01,0xec,0xbe]			// VI: s_mov_b64 tba, s[2:3] ; encoding: [0x02,0x01,0xec,0xbe]

	s_mov_b64 [tma_lo,tma_hi], s[2:3]			s_mov_b64 [tma_lo,tma_hi], s[2:3]
	// SICI: s_mov_b64 tma, s[2:3] ; encoding: [0x02,0x04,0xee,0xbe]			// SICI: s_mov_b64 tma, s[2:3] ; encoding: [0x02,0x04,0xee,0xbe]
	// VI: s_mov_b64 tma, s[2:3] ; encoding: [0x02,0x01,0xee,0xbe]			// VI: s_mov_b64 tma, s[2:3] ; encoding: [0x02,0x01,0xee,0xbe]

	v_mov_b32 [v1], [v2]			v_mov_b32_e32 [v1], [v2]
	// GCN: v_mov_b32_e32 v1, v2 ; encoding: [0x02,0x03,0x02,0x7e]			// GCN: v_mov_b32_e32 v1, v2 ; encoding: [0x02,0x03,0x02,0x7e]

	v_rcp_f64 [v1,v2], [v2,v3]			v_rcp_f64 [v1,v2], [v2,v3]
	// SICI: v_rcp_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x5f,0x02,0x7e]			// SICI: v_rcp_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x5f,0x02,0x7e]
	// VI: v_rcp_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x4b,0x02,0x7e]			// VI: v_rcp_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x4b,0x02,0x7e]

	buffer_load_dwordx4 [v1,v2,v3,v4], off, [s4,s5,s6,s7], s1			buffer_load_dwordx4 [v1,v2,v3,v4], off, [s4,s5,s6,s7], s1
	// SICI: buffer_load_dwordx4 v[1:4], off, s[4:7], s1 ; encoding: [0x00,0x00,0x38,0xe0,0x00,0x01,0x01,0x01]			// SICI: buffer_load_dwordx4 v[1:4], off, s[4:7], s1 ; encoding: [0x00,0x00,0x38,0xe0,0x00,0x01,0x01,0x01]
	▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

llvm/trunk/test/MC/AMDGPU/regression/bug28165.s

				// RUN: llvm-mc -arch=amdgcn -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=SI --check-prefix=SICI
				// RUN: llvm-mc -arch=amdgcn -mcpu=SI -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=SI --check-prefix=SICI
				// RUN: llvm-mc -arch=amdgcn -mcpu=bonaire -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=SICI --check-prefix=CIVI
				// RUN: llvm-mc -arch=amdgcn -mcpu=tonga -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=CIVI --check-prefix=VI

				// SICI: v_cmp_eq_f64_e32 vcc, 0.5, v[254:255] ; encoding: [0xf0,0xfc,0x45,0x7c]
				// VI: v_cmp_eq_f64_e32 vcc, 0.5, v[254:255] ; encoding: [0xf0,0xfc,0xc5,0x7c]
				v_cmp_eq_f64 vcc, 0.5, v[254:255]

				// GCN: v_cvt_f32_f64_e32 v0, 0.5 ; encoding: [0xf0,0x1e,0x00,0x7e]
				v_cvt_f32_f64 v0, 0.5

llvm/trunk/test/MC/AMDGPU/regression/bug28413.s

	Show All 13 Lines
	v_cmp_eq_i32 vcc, 3.125, v0			v_cmp_eq_i32 vcc, 3.125, v0
	// SICI: v_cmp_eq_i32_e32 vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x04,0x7d,0x00,0x00,0x48,0x40]			// SICI: v_cmp_eq_i32_e32 vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x04,0x7d,0x00,0x00,0x48,0x40]
	// VI: v_cmp_eq_i32_e32 vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x84,0x7d,0x00,0x00,0x48,0x40]			// VI: v_cmp_eq_i32_e32 vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x84,0x7d,0x00,0x00,0x48,0x40]

	v_cmpx_eq_u32 vcc, 3.125, v0			v_cmpx_eq_u32 vcc, 3.125, v0
	// SICI: v_cmpx_eq_u32_e32 vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0xa4,0x7d,0x00,0x00,0x48,0x40]			// SICI: v_cmpx_eq_u32_e32 vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0xa4,0x7d,0x00,0x00,0x48,0x40]
	// VI: v_cmpx_eq_u32_e32 vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0xb4,0x7d,0x00,0x00,0x48,0x40]			// VI: v_cmpx_eq_u32_e32 vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0xb4,0x7d,0x00,0x00,0x48,0x40]

	v_mov_b32 v0, 0.5			v_mov_b32_e32 v0, 0.5
	// GCN: v_mov_b32_e32 v0, 0.5 ; encoding: [0xf0,0x02,0x00,0x7e]			// GCN: v_mov_b32_e32 v0, 0.5 ; encoding: [0xf0,0x02,0x00,0x7e]

	v_mov_b32 v0, 3.125			v_mov_b32_e32 v0, 3.125
	// GCN: v_mov_b32_e32 v0, 0x40480000 ; encoding: [0xff,0x02,0x00,0x7e,0x00,0x00,0x48,0x40]			// GCN: v_mov_b32_e32 v0, 0x40480000 ; encoding: [0xff,0x02,0x00,0x7e,0x00,0x00,0x48,0x40]

	v_add_i32 v0, vcc, 0.5, v0			v_add_i32 v0, vcc, 0.5, v0
	// SICI: v_add_i32_e32 v0, vcc, 0.5, v0 ; encoding: [0xf0,0x00,0x00,0x4a]			// SICI: v_add_i32_e32 v0, vcc, 0.5, v0 ; encoding: [0xf0,0x00,0x00,0x4a]
	// VI: v_add_i32_e32 v0, vcc, 0.5, v0 ; encoding: [0xf0,0x00,0x00,0x32]			// VI: v_add_i32_e32 v0, vcc, 0.5, v0 ; encoding: [0xf0,0x00,0x00,0x32]

	v_add_i32 v0, vcc, 3.125, v0			v_add_i32 v0, vcc, 3.125, v0
	// SICI: v_add_i32_e32 v0, vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x00,0x4a,0x00,0x00,0x48,0x40]			// SICI: v_add_i32_e32 v0, vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x00,0x4a,0x00,0x00,0x48,0x40]
	// VI: v_add_i32_e32 v0, vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x00,0x32,0x00,0x00,0x48,0x40]			// VI: v_add_i32_e32 v0, vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x00,0x32,0x00,0x00,0x48,0x40]
	No newline at end of file			No newline at end of file

llvm/trunk/test/MC/AMDGPU/sop1-err.s

	Show All 20 Lines
	// GCN: error: not a valid operand			// GCN: error: not a valid operand

	s_mov_b64 s1, s[0:1]			s_mov_b64 s1, s[0:1]
	// GCN: error: invalid operand for instruction			// GCN: error: invalid operand for instruction

	s_mov_b64 s[0:1], s1			s_mov_b64 s[0:1], s1
	// GCN: error: invalid operand for instruction			// GCN: error: invalid operand for instruction

	// Immediate greater than 32-bits
	s_mov_b32 s1, 0xfffffffff
	// GCN: error: invalid immediate: only 32-bit values are legal

	// Immediate greater than 32-bits
	s_mov_b64 s[0:1], 0xfffffffff
	// GCN: error: invalid immediate: only 32-bit values are legal

	s_mov_b64 s[0:1], 0x0000000200000000
	// GCN: error: invalid immediate: only 32-bit values are legal

	// FIXME: This shoudl probably say failed to parse.			// FIXME: This shoudl probably say failed to parse.
	s_mov_b32 s			s_mov_b32 s
	// GCN: error: not a valid operand			// GCN: error: not a valid operand
	// Out of range register			// Out of range register

	s_mov_b32 s102, 1			s_mov_b32 s102, 1
	// VI: error: not a valid operand			// VI: error: not a valid operand
	// SI: s_mov_b32 s102, 1			// SI: s_mov_b32 s102, 1

	s_mov_b32 s103, 1			s_mov_b32 s103, 1
	// VI: error: not a valid operand			// VI: error: not a valid operand
	// SI: s_mov_b32 s103, 1			// SI: s_mov_b32 s103, 1

	s_mov_b64 s[102:103], -1			s_mov_b64 s[102:103], -1
	// VI: error: not a valid operand			// VI: error: not a valid operand
	// SI: s_mov_b64 s[102:103], -1			// SI: s_mov_b64 s[102:103], -1

llvm/trunk/test/MC/AMDGPU/trap.s

	Show First 20 Lines • Show All 56 Lines • ▼ Show 20 Lines
	s_cmp_eq_i32 ttmp8, 0x000000fe			s_cmp_eq_i32 ttmp8, 0x000000fe
	// SICI: s_cmp_eq_i32 ttmp8, 0xfe ; encoding: [0x78,0xff,0x00,0xbf,0xfe,0x00,0x00,0x00]			// SICI: s_cmp_eq_i32 ttmp8, 0xfe ; encoding: [0x78,0xff,0x00,0xbf,0xfe,0x00,0x00,0x00]
	// VI: s_cmp_eq_i32 ttmp8, 0xfe ; encoding: [0x78,0xff,0x00,0xbf,0xfe,0x00,0x00,0x00]			// VI: s_cmp_eq_i32 ttmp8, 0xfe ; encoding: [0x78,0xff,0x00,0xbf,0xfe,0x00,0x00,0x00]

	s_lshr_b32 ttmp8, ttmp8, 12			s_lshr_b32 ttmp8, ttmp8, 12
	// SICI: s_lshr_b32 ttmp8, ttmp8, 12 ; encoding: [0x78,0x8c,0x78,0x90]			// SICI: s_lshr_b32 ttmp8, ttmp8, 12 ; encoding: [0x78,0x8c,0x78,0x90]
	// VI: s_lshr_b32 ttmp8, ttmp8, 12 ; encoding: [0x78,0x8c,0x78,0x8f]			// VI: s_lshr_b32 ttmp8, ttmp8, 12 ; encoding: [0x78,0x8c,0x78,0x8f]

	v_mov_b32 v1, ttmp8			v_mov_b32_e32 v1, ttmp8
	// SICI: v_mov_b32_e32 v1, ttmp8 ; encoding: [0x78,0x02,0x02,0x7e]			// SICI: v_mov_b32_e32 v1, ttmp8 ; encoding: [0x78,0x02,0x02,0x7e]
	// VI: v_mov_b32_e32 v1, ttmp8 ; encoding: [0x78,0x02,0x02,0x7e]			// VI: v_mov_b32_e32 v1, ttmp8 ; encoding: [0x78,0x02,0x02,0x7e]

	s_mov_b32 m0, ttmp8			s_mov_b32 m0, ttmp8
	// SICI: s_mov_b32 m0, ttmp8 ; encoding: [0x78,0x03,0xfc,0xbe]			// SICI: s_mov_b32 m0, ttmp8 ; encoding: [0x78,0x03,0xfc,0xbe]
	// VI: s_mov_b32 m0, ttmp8 ; encoding: [0x78,0x00,0xfc,0xbe]			// VI: s_mov_b32 m0, ttmp8 ; encoding: [0x78,0x00,0xfc,0xbe]

	s_mov_b32 ttmp10, 0			s_mov_b32 ttmp10, 0
	▲ Show 20 Lines • Show All 71 Lines • Show Last 20 Lines

llvm/trunk/test/MC/AMDGPU/vop1.s

	Show All 26 Lines
	// Instructions			// Instructions
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//


	// GCN: v_nop ; encoding: [0x00,0x00,0x00,0x7e]			// GCN: v_nop ; encoding: [0x00,0x00,0x00,0x7e]
	v_nop			v_nop

	// GCN: v_mov_b32_e32 v1, v2 ; encoding: [0x02,0x03,0x02,0x7e]			// GCN: v_mov_b32_e32 v1, v2 ; encoding: [0x02,0x03,0x02,0x7e]
	v_mov_b32 v1, v2			v_mov_b32_e32 v1, v2

	// GCN: v_readfirstlane_b32 s1, v2 ; encoding: [0x02,0x05,0x02,0x7e]			// GCN: v_readfirstlane_b32 s1, v2 ; encoding: [0x02,0x05,0x02,0x7e]
	v_readfirstlane_b32 s1, v2			v_readfirstlane_b32 s1, v2

	// GCN: v_cvt_i32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x07,0x02,0x7e]			// GCN: v_cvt_i32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x07,0x02,0x7e]
	v_cvt_i32_f64 v1, v[2:3]			v_cvt_i32_f64_e32 v1, v[2:3]

	// GCN: v_cvt_f64_i32_e32 v[1:2], v2 ; encoding: [0x02,0x09,0x02,0x7e]			// GCN: v_cvt_f64_i32_e32 v[1:2], v2 ; encoding: [0x02,0x09,0x02,0x7e]
	v_cvt_f64_i32 v[1:2], v2			v_cvt_f64_i32_e32 v[1:2], v2

	// GCN: v_cvt_f32_i32_e32 v1, v2 ; encoding: [0x02,0x0b,0x02,0x7e]			// GCN: v_cvt_f32_i32_e32 v1, v2 ; encoding: [0x02,0x0b,0x02,0x7e]
	v_cvt_f32_i32 v1, v2			v_cvt_f32_i32_e32 v1, v2

	// GCN: v_cvt_f32_u32_e32 v1, v2 ; encoding: [0x02,0x0d,0x02,0x7e]			// GCN: v_cvt_f32_u32_e32 v1, v2 ; encoding: [0x02,0x0d,0x02,0x7e]
	v_cvt_f32_u32 v1, v2			v_cvt_f32_u32_e32 v1, v2

	// GCN: v_cvt_u32_f32_e32 v1, v2 ; encoding: [0x02,0x0f,0x02,0x7e			// GCN: v_cvt_u32_f32_e32 v1, v2 ; encoding: [0x02,0x0f,0x02,0x7e
	v_cvt_u32_f32 v1, v2			v_cvt_u32_f32_e32 v1, v2

	// GCN: v_cvt_i32_f32_e32 v1, v2 ; encoding: [0x02,0x11,0x02,0x7e]			// GCN: v_cvt_i32_f32_e32 v1, v2 ; encoding: [0x02,0x11,0x02,0x7e]
	v_cvt_i32_f32 v1, v2			v_cvt_i32_f32_e32 v1, v2

	// SICI: v_mov_fed_b32_e32 v1, v2 ; encoding: [0x02,0x13,0x02,0x7e]			// SICI: v_mov_fed_b32_e32 v1, v2 ; encoding: [0x02,0x13,0x02,0x7e]
	// NOVI: error: instruction not supported on this GPU			// NOVI: error: instruction not supported on this GPU
	v_mov_fed_b32 v1, v2			v_mov_fed_b32_e32 v1, v2

	// GCN: v_cvt_f16_f32_e32 v1, v2 ; encoding: [0x02,0x15,0x02,0x7e]			// GCN: v_cvt_f16_f32_e32 v1, v2 ; encoding: [0x02,0x15,0x02,0x7e]
	v_cvt_f16_f32 v1, v2			v_cvt_f16_f32_e32 v1, v2

	// GCN: v_cvt_f32_f16_e32 v1, v2 ; encoding: [0x02,0x17,0x02,0x7e]			// GCN: v_cvt_f32_f16_e32 v1, v2 ; encoding: [0x02,0x17,0x02,0x7e]
	v_cvt_f32_f16 v1, v2			v_cvt_f32_f16_e32 v1, v2

	// GCN: v_cvt_rpi_i32_f32_e32 v1, v2 ; encoding: [0x02,0x19,0x02,0x7e]			// GCN: v_cvt_rpi_i32_f32_e32 v1, v2 ; encoding: [0x02,0x19,0x02,0x7e]
	v_cvt_rpi_i32_f32 v1, v2			v_cvt_rpi_i32_f32_e32 v1, v2

	// GCN: v_cvt_flr_i32_f32_e32 v1, v2 ; encoding: [0x02,0x1b,0x02,0x7e]			// GCN: v_cvt_flr_i32_f32_e32 v1, v2 ; encoding: [0x02,0x1b,0x02,0x7e]
	v_cvt_flr_i32_f32 v1, v2			v_cvt_flr_i32_f32_e32 v1, v2

	// GCN: v_cvt_off_f32_i4_e32 v1, v2 ; encoding: [0x02,0x1d,0x02,0x7e]			// GCN: v_cvt_off_f32_i4_e32 v1, v2 ; encoding: [0x02,0x1d,0x02,0x7e]
	v_cvt_off_f32_i4_e32 v1, v2			v_cvt_off_f32_i4_e32 v1, v2

	// GCN: v_cvt_f32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x1f,0x02,0x7e]			// GCN: v_cvt_f32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x1f,0x02,0x7e]
	v_cvt_f32_f64 v1, v[2:3]			v_cvt_f32_f64_e32 v1, v[2:3]

	// GCN: v_cvt_f64_f32_e32 v[1:2], v2 ; encoding: [0x02,0x21,0x02,0x7e]			// GCN: v_cvt_f64_f32_e32 v[1:2], v2 ; encoding: [0x02,0x21,0x02,0x7e]
	v_cvt_f64_f32 v[1:2], v2			v_cvt_f64_f32_e32 v[1:2], v2

	// GCN: v_cvt_f32_ubyte0_e32 v1, v2 ; encoding: [0x02,0x23,0x02,0x7e]			// GCN: v_cvt_f32_ubyte0_e32 v1, v2 ; encoding: [0x02,0x23,0x02,0x7e]
	v_cvt_f32_ubyte0 v1, v2			v_cvt_f32_ubyte0_e32 v1, v2

	// GCN: v_cvt_f32_ubyte1_e32 v1, v2 ; encoding: [0x02,0x25,0x02,0x7e]			// GCN: v_cvt_f32_ubyte1_e32 v1, v2 ; encoding: [0x02,0x25,0x02,0x7e]
	v_cvt_f32_ubyte1_e32 v1, v2			v_cvt_f32_ubyte1_e32 v1, v2

	// GCN: v_cvt_f32_ubyte2_e32 v1, v2 ; encoding: [0x02,0x27,0x02,0x7e]			// GCN: v_cvt_f32_ubyte2_e32 v1, v2 ; encoding: [0x02,0x27,0x02,0x7e]
	v_cvt_f32_ubyte2 v1, v2			v_cvt_f32_ubyte2_e32 v1, v2

	// GCN: v_cvt_f32_ubyte3_e32 v1, v2 ; encoding: [0x02,0x29,0x02,0x7e]			// GCN: v_cvt_f32_ubyte3_e32 v1, v2 ; encoding: [0x02,0x29,0x02,0x7e]
	v_cvt_f32_ubyte3 v1, v2			v_cvt_f32_ubyte3_e32 v1, v2

	// GCN: v_cvt_u32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x2b,0x02,0x7e]			// GCN: v_cvt_u32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x2b,0x02,0x7e]
	v_cvt_u32_f64 v1, v[2:3]			v_cvt_u32_f64_e32 v1, v[2:3]

	// GCN: v_cvt_f64_u32_e32 v[1:2], v2 ; encoding: [0x02,0x2d,0x02,0x7e]			// GCN: v_cvt_f64_u32_e32 v[1:2], v2 ; encoding: [0x02,0x2d,0x02,0x7e]
	v_cvt_f64_u32 v[1:2], v2			v_cvt_f64_u32_e32 v[1:2], v2

	// NOSI: error: instruction not supported on this GPU			// NOSI: error: instruction not supported on this GPU
	// NOSI: v_trunc_f64_e32 v[1:2], v[2:3]			// NOSI: v_trunc_f64_e32 v[1:2], v[2:3]
	// CIVI: v_trunc_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x2f,0x02,0x7e]			// CIVI: v_trunc_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x2f,0x02,0x7e]
	v_trunc_f64_e32 v[1:2], v[2:3]			v_trunc_f64_e32 v[1:2], v[2:3]

	// NOSI: error: instruction not supported on this GPU			// NOSI: error: instruction not supported on this GPU
	// NOSI: v_ceil_f64_e32 v[1:2], v[2:3]			// NOSI: v_ceil_f64_e32 v[1:2], v[2:3]
	▲ Show 20 Lines • Show All 106 Lines • ▼ Show 20 Lines
	v_sin_f32 v1, v2			v_sin_f32 v1, v2

	// SICI: v_cos_f32_e32 v1, v2 ; encoding: [0x02,0x6d,0x02,0x7e]			// SICI: v_cos_f32_e32 v1, v2 ; encoding: [0x02,0x6d,0x02,0x7e]
	// VI: v_cos_f32_e32 v1, v2 ; encoding: [0x02,0x55,0x02,0x7e]			// VI: v_cos_f32_e32 v1, v2 ; encoding: [0x02,0x55,0x02,0x7e]
	v_cos_f32 v1, v2			v_cos_f32 v1, v2

	// SICI: v_not_b32_e32 v1, v2 ; encoding: [0x02,0x6f,0x02,0x7e]			// SICI: v_not_b32_e32 v1, v2 ; encoding: [0x02,0x6f,0x02,0x7e]
	// VI: v_not_b32_e32 v1, v2 ; encoding: [0x02,0x57,0x02,0x7e]			// VI: v_not_b32_e32 v1, v2 ; encoding: [0x02,0x57,0x02,0x7e]
	v_not_b32 v1, v2			v_not_b32_e32 v1, v2

	// SICI: v_bfrev_b32_e32 v1, v2 ; encoding: [0x02,0x71,0x02,0x7e]			// SICI: v_bfrev_b32_e32 v1, v2 ; encoding: [0x02,0x71,0x02,0x7e]
	// VI: v_bfrev_b32_e32 v1, v2 ; encoding: [0x02,0x59,0x02,0x7e]			// VI: v_bfrev_b32_e32 v1, v2 ; encoding: [0x02,0x59,0x02,0x7e]
	v_bfrev_b32 v1, v2			v_bfrev_b32_e32 v1, v2

	// SICI: v_ffbh_u32_e32 v1, v2 ; encoding: [0x02,0x73,0x02,0x7e]			// SICI: v_ffbh_u32_e32 v1, v2 ; encoding: [0x02,0x73,0x02,0x7e]
	// VI: v_ffbh_u32_e32 v1, v2 ; encoding: [0x02,0x5b,0x02,0x7e]			// VI: v_ffbh_u32_e32 v1, v2 ; encoding: [0x02,0x5b,0x02,0x7e]
	v_ffbh_u32 v1, v2			v_ffbh_u32_e32 v1, v2

	// SICI: v_ffbl_b32_e32 v1, v2 ; encoding: [0x02,0x75,0x02,0x7e]			// SICI: v_ffbl_b32_e32 v1, v2 ; encoding: [0x02,0x75,0x02,0x7e]
	// VI: v_ffbl_b32_e32 v1, v2 ; encoding: [0x02,0x5d,0x02,0x7e]			// VI: v_ffbl_b32_e32 v1, v2 ; encoding: [0x02,0x5d,0x02,0x7e]
	v_ffbl_b32 v1, v2			v_ffbl_b32_e32 v1, v2

	// SICI: v_ffbh_i32_e32 v1, v2 ; encoding: [0x02,0x77,0x02,0x7e]			// SICI: v_ffbh_i32_e32 v1, v2 ; encoding: [0x02,0x77,0x02,0x7e]
	// VI: v_ffbh_i32_e32 v1, v2 ; encoding: [0x02,0x5f,0x02,0x7e]			// VI: v_ffbh_i32_e32 v1, v2 ; encoding: [0x02,0x5f,0x02,0x7e]
	v_ffbh_i32_e32 v1, v2			v_ffbh_i32_e32 v1, v2

	// SICI: v_frexp_exp_i32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x79,0x02,0x7e]			// SICI: v_frexp_exp_i32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x79,0x02,0x7e]
	// VI: v_frexp_exp_i32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x61,0x02,0x7e]			// VI: v_frexp_exp_i32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x61,0x02,0x7e]
	v_frexp_exp_i32_f64 v1, v[2:3]			v_frexp_exp_i32_f64_e32 v1, v[2:3]

	// SICI: v_frexp_mant_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x7b,0x02,0x7e]			// SICI: v_frexp_mant_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x7b,0x02,0x7e]
	// VI; v_frexp_mant_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x63,0x02,0x7e]			// VI: v_frexp_mant_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x63,0x02,0x7e]
	v_frexp_mant_f64 v[1:2], v[2:3]			v_frexp_mant_f64_e32 v[1:2], v[2:3]

	// SICI: v_fract_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x7d,0x02,0x7e]			// SICI: v_fract_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x7d,0x02,0x7e]
	// VI: v_fract_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x65,0x02,0x7e]			// VI: v_fract_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x65,0x02,0x7e]
	v_fract_f64 v[1:2], v[2:3]			v_fract_f64_e32 v[1:2], v[2:3]

	// SICI: v_frexp_exp_i32_f32_e32 v1, v2 ; encoding: [0x02,0x7f,0x02,0x7e]			// SICI: v_frexp_exp_i32_f32_e32 v1, v2 ; encoding: [0x02,0x7f,0x02,0x7e]
	// VI: v_frexp_exp_i32_f32_e32 v1, v2 ; encoding: [0x02,0x67,0x02,0x7e]			// VI: v_frexp_exp_i32_f32_e32 v1, v2 ; encoding: [0x02,0x67,0x02,0x7e]
	v_frexp_exp_i32_f32 v1, v2			v_frexp_exp_i32_f32_e32 v1, v2

	// SICI: v_frexp_mant_f32_e32 v1, v2 ; encoding: [0x02,0x81,0x02,0x7e]			// SICI: v_frexp_mant_f32_e32 v1, v2 ; encoding: [0x02,0x81,0x02,0x7e]
	// VI: v_frexp_mant_f32_e32 v1, v2 ; encoding: [0x02,0x69,0x02,0x7e]			// VI: v_frexp_mant_f32_e32 v1, v2 ; encoding: [0x02,0x69,0x02,0x7e]
	v_frexp_mant_f32 v1, v2			v_frexp_mant_f32 v1, v2

	// SICI: v_clrexcp ; encoding: [0x00,0x82,0x00,0x7e]			// SICI: v_clrexcp ; encoding: [0x00,0x82,0x00,0x7e]
	// VI: v_clrexcp ; encoding: [0x00,0x6a,0x00,0x7e]			// VI: v_clrexcp ; encoding: [0x00,0x6a,0x00,0x7e]
	v_clrexcp			v_clrexcp_e32

	// SICI: v_movreld_b32_e32 v1, v2 ; encoding: [0x02,0x85,0x02,0x7e]			// SICI: v_movreld_b32_e32 v1, v2 ; encoding: [0x02,0x85,0x02,0x7e]
	// VI: v_movreld_b32_e32 v1, v2 ; encoding: [0x02,0x6d,0x02,0x7e]			// VI: v_movreld_b32_e32 v1, v2 ; encoding: [0x02,0x6d,0x02,0x7e]
	v_movreld_b32 v1, v2			v_movreld_b32_e32 v1, v2

	// SICI: v_movrels_b32_e32 v1, v2 ; encoding: [0x02,0x87,0x02,0x7e]			// SICI: v_movrels_b32_e32 v1, v2 ; encoding: [0x02,0x87,0x02,0x7e]
	// VI: v_movrels_b32_e32 v1, v2 ; encoding: [0x02,0x6f,0x02,0x7e]			// VI: v_movrels_b32_e32 v1, v2 ; encoding: [0x02,0x6f,0x02,0x7e]
	v_movrels_b32 v1, v2			v_movrels_b32_e32 v1, v2

	// SICI: v_movrelsd_b32_e32 v1, v2 ; encoding: [0x02,0x89,0x02,0x7e]			// SICI: v_movrelsd_b32_e32 v1, v2 ; encoding: [0x02,0x89,0x02,0x7e]
	// VI: v_movrelsd_b32_e32 v1, v2 ; encoding: [0x02,0x71,0x02,0x7e]			// VI: v_movrelsd_b32_e32 v1, v2 ; encoding: [0x02,0x71,0x02,0x7e]
	v_movrelsd_b32 v1, v2			v_movrelsd_b32_e32 v1, v2

	// NOSI: error: instruction not supported on this GPU			// NOSI: error: instruction not supported on this GPU
	// NOSI: v_log_legacy_f32 v1, v2			// NOSI: v_log_legacy_f32 v1, v2
	// CI: v_log_legacy_f32_e32 v1, v2 ; encoding: [0x02,0x8b,0x02,0x7e]			// CI: v_log_legacy_f32_e32 v1, v2 ; encoding: [0x02,0x8b,0x02,0x7e]
	// VI: v_log_legacy_f32_e32 v1, v2 ; encoding: [0x02,0x99,0x02,0x7e]			// VI: v_log_legacy_f32_e32 v1, v2 ; encoding: [0x02,0x99,0x02,0x7e]
	v_log_legacy_f32 v1, v2			v_log_legacy_f32 v1, v2

	// NOSI: error: instruction not supported on this GPU			// NOSI: error: instruction not supported on this GPU
	// NOSI: v_exp_legacy_f32 v1, v2			// NOSI: v_exp_legacy_f32 v1, v2
	// CI: v_exp_legacy_f32_e32 v1, v2 ; encoding: [0x02,0x8d,0x02,0x7e]			// CI: v_exp_legacy_f32_e32 v1, v2 ; encoding: [0x02,0x8d,0x02,0x7e]
	// VI: v_exp_legacy_f32_e32 v1, v2 ; encoding: [0x02,0x97,0x02,0x7e]			// VI: v_exp_legacy_f32_e32 v1, v2 ; encoding: [0x02,0x97,0x02,0x7e]
	v_exp_legacy_f32 v1, v2			v_exp_legacy_f32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_cvt_f16_u16 v1, v2			// NOSICI: v_cvt_f16_u16_e32 v1, v2
	// VI: v_cvt_f16_u16_e32 v1, v2 ; encoding: [0x02,0x73,0x02,0x7e]			// VI: v_cvt_f16_u16_e32 v1, v2 ; encoding: [0x02,0x73,0x02,0x7e]
	v_cvt_f16_u16 v1, v2			v_cvt_f16_u16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_cvt_f16_i16 v1, v2			// NOSICI: v_cvt_f16_i16_e32 v1, v2
	// VI: v_cvt_f16_i16_e32 v1, v2 ; encoding: [0x02,0x75,0x02,0x7e]			// VI: v_cvt_f16_i16_e32 v1, v2 ; encoding: [0x02,0x75,0x02,0x7e]
	v_cvt_f16_i16 v1, v2			v_cvt_f16_i16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_cvt_u16_f16 v1, v2			// NOSICI: v_cvt_u16_f16_e32 v1, v2
	// VI: v_cvt_u16_f16_e32 v1, v2 ; encoding: [0x02,0x77,0x02,0x7e]			// VI: v_cvt_u16_f16_e32 v1, v2 ; encoding: [0x02,0x77,0x02,0x7e]
	v_cvt_u16_f16 v1, v2			v_cvt_u16_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_cvt_i16_f16 v1, v2			// NOSICI: v_cvt_i16_f16_e32 v1, v2
	// VI: v_cvt_i16_f16_e32 v1, v2 ; encoding: [0x02,0x79,0x02,0x7e]			// VI: v_cvt_i16_f16_e32 v1, v2 ; encoding: [0x02,0x79,0x02,0x7e]
	v_cvt_i16_f16 v1, v2			v_cvt_i16_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_rcp_f16 v1, v2			// NOSICI: v_rcp_f16_e32 v1, v2
	// VI: v_rcp_f16_e32 v1, v2 ; encoding: [0x02,0x7b,0x02,0x7e]			// VI: v_rcp_f16_e32 v1, v2 ; encoding: [0x02,0x7b,0x02,0x7e]
	v_rcp_f16 v1, v2			v_rcp_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_sqrt_f16 v1, v2			// NOSICI: v_sqrt_f16_e32 v1, v2
	// VI: v_sqrt_f16_e32 v1, v2 ; encoding: [0x02,0x7d,0x02,0x7e]			// VI: v_sqrt_f16_e32 v1, v2 ; encoding: [0x02,0x7d,0x02,0x7e]
	v_sqrt_f16 v1, v2			v_sqrt_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_rsq_f16 v1, v2			// NOSICI: v_rsq_f16_e32 v1, v2
	// VI: v_rsq_f16_e32 v1, v2 ; encoding: [0x02,0x7f,0x02,0x7e]			// VI: v_rsq_f16_e32 v1, v2 ; encoding: [0x02,0x7f,0x02,0x7e]
	v_rsq_f16 v1, v2			v_rsq_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_log_f16 v1, v2			// NOSICI: v_log_f16_e32 v1, v2
	// VI: v_log_f16_e32 v1, v2 ; encoding: [0x02,0x81,0x02,0x7e]			// VI: v_log_f16_e32 v1, v2 ; encoding: [0x02,0x81,0x02,0x7e]
	v_log_f16 v1, v2			v_log_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_exp_f16 v1, v2			// NOSICI: v_exp_f16_e32 v1, v2
	// VI: v_exp_f16_e32 v1, v2 ; encoding: [0x02,0x83,0x02,0x7e]			// VI: v_exp_f16_e32 v1, v2 ; encoding: [0x02,0x83,0x02,0x7e]
	v_exp_f16 v1, v2			v_exp_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_frexp_mant_f16 v1, v2			// NOSICI: v_frexp_mant_f16_e32 v1, v2
	// VI: v_frexp_mant_f16_e32 v1, v2 ; encoding: [0x02,0x85,0x02,0x7e]			// VI: v_frexp_mant_f16_e32 v1, v2 ; encoding: [0x02,0x85,0x02,0x7e]
	v_frexp_mant_f16 v1, v2			v_frexp_mant_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_frexp_exp_i16_f16 v1, v2			// NOSICI: v_frexp_exp_i16_f16_e32 v1, v2
	// VI: v_frexp_exp_i16_f16_e32 v1, v2 ; encoding: [0x02,0x87,0x02,0x7e]			// VI: v_frexp_exp_i16_f16_e32 v1, v2 ; encoding: [0x02,0x87,0x02,0x7e]
	v_frexp_exp_i16_f16 v1, v2			v_frexp_exp_i16_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_floor_f16 v1, v2			// NOSICI: v_floor_f16_e32 v1, v2
	// VI: v_floor_f16_e32 v1, v2 ; encoding: [0x02,0x89,0x02,0x7e]			// VI: v_floor_f16_e32 v1, v2 ; encoding: [0x02,0x89,0x02,0x7e]
	v_floor_f16 v1, v2			v_floor_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_ceil_f16 v1, v2			// NOSICI: v_ceil_f16_e32 v1, v2
	// VI: v_ceil_f16_e32 v1, v2 ; encoding: [0x02,0x8b,0x02,0x7e]			// VI: v_ceil_f16_e32 v1, v2 ; encoding: [0x02,0x8b,0x02,0x7e]
	v_ceil_f16 v1, v2			v_ceil_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_trunc_f16 v1, v2			// NOSICI: v_trunc_f16_e32 v1, v2
	// VI: v_trunc_f16_e32 v1, v2 ; encoding: [0x02,0x8d,0x02,0x7e]			// VI: v_trunc_f16_e32 v1, v2 ; encoding: [0x02,0x8d,0x02,0x7e]
	v_trunc_f16 v1, v2			v_trunc_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_rndne_f16 v1, v2			// NOSICI: v_rndne_f16_e32 v1, v2
	// VI: v_rndne_f16_e32 v1, v2 ; encoding: [0x02,0x8f,0x02,0x7e]			// VI: v_rndne_f16_e32 v1, v2 ; encoding: [0x02,0x8f,0x02,0x7e]
	v_rndne_f16 v1, v2			v_rndne_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_fract_f16 v1, v2			// NOSICI: v_fract_f16_e32 v1, v2
	// VI: v_fract_f16_e32 v1, v2 ; encoding: [0x02,0x91,0x02,0x7e]			// VI: v_fract_f16_e32 v1, v2 ; encoding: [0x02,0x91,0x02,0x7e]
	v_fract_f16 v1, v2			v_fract_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_sin_f16 v1, v2			// NOSICI: v_sin_f16_e32 v1, v2
	// VI: v_sin_f16_e32 v1, v2 ; encoding: [0x02,0x93,0x02,0x7e]			// VI: v_sin_f16_e32 v1, v2 ; encoding: [0x02,0x93,0x02,0x7e]
	v_sin_f16 v1, v2			v_sin_f16_e32 v1, v2

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_cos_f16 v1, v2			// NOSICI: v_cos_f16_e32 v1, v2
	// VI: v_cos_f16_e32 v1, v2 ; encoding: [0x02,0x95,0x02,0x7e]			// VI: v_cos_f16_e32 v1, v2 ; encoding: [0x02,0x95,0x02,0x7e]
	v_cos_f16 v1, v2			v_cos_f16_e32 v1, v2

llvm/trunk/test/MC/AMDGPU/vop2.s

	Show First 20 Lines • Show All 42 Lines • ▼ Show 20 Lines
	v_mul_i32_i24_e32 v1, v2, v3			v_mul_i32_i24_e32 v1, v2, v3

	// _e64 suffix			// _e64 suffix
	// SICI: v_mul_i32_i24_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x07,0x02,0x00]			// SICI: v_mul_i32_i24_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x07,0x02,0x00]
	v_mul_i32_i24_e64 v1, v2, v3			v_mul_i32_i24_e64 v1, v2, v3

	// src0 inline			// src0 inline
	// SICI: v_mul_i32_i24_e32 v1, 3, v3 ; encoding: [0x83,0x06,0x02,0x12]			// SICI: v_mul_i32_i24_e32 v1, 3, v3 ; encoding: [0x83,0x06,0x02,0x12]
	v_mul_i32_i24 v1, 3, v3			v_mul_i32_i24_e32 v1, 3, v3

	// src0 negative inline			// src0 negative inline
	// SICI: v_mul_i32_i24_e32 v1, -3, v3 ; encoding: [0xc3,0x06,0x02,0x12]			// SICI: v_mul_i32_i24_e32 v1, -3, v3 ; encoding: [0xc3,0x06,0x02,0x12]
	v_mul_i32_i24 v1, -3, v3			v_mul_i32_i24_e32 v1, -3, v3

	// src1 inline			// src1 inline
	// SICI: v_mul_i32_i24_e64 v1, v2, 3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x07,0x01,0x00]			// SICI: v_mul_i32_i24_e64 v1, v2, 3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x07,0x01,0x00]
	v_mul_i32_i24 v1, v2, 3			v_mul_i32_i24_e64 v1, v2, 3

	// src1 negative inline			// src1 negative inline
	// SICI: v_mul_i32_i24_e64 v1, v2, -3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x87,0x01,0x00]			// SICI: v_mul_i32_i24_e64 v1, v2, -3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x87,0x01,0x00]
	v_mul_i32_i24 v1, v2, -3			v_mul_i32_i24_e64 v1, v2, -3

	// src0 literal			// src0 literal
	// SICI: v_mul_i32_i24_e32 v1, 0x64, v3 ; encoding: [0xff,0x06,0x02,0x12,0x64,0x00,0x00,0x00]			// SICI: v_mul_i32_i24_e32 v1, 0x64, v3 ; encoding: [0xff,0x06,0x02,0x12,0x64,0x00,0x00,0x00]
	v_mul_i32_i24 v1, 100, v3			v_mul_i32_i24_e32 v1, 100, v3

	// src1 negative literal			// src1 negative literal
	// SICI: v_mul_i32_i24_e32 v1, 0xffffff9c, v3 ; encoding: [0xff,0x06,0x02,0x12,0x9c,0xff,0xff,0xff]			// SICI: v_mul_i32_i24_e32 v1, 0xffffff9c, v3 ; encoding: [0xff,0x06,0x02,0x12,0x9c,0xff,0xff,0xff]
	v_mul_i32_i24 v1, -100, v3			v_mul_i32_i24_e32 v1, -100, v3

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Checks for legal operands			// Checks for legal operands
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	// src0 sgpr			// src0 sgpr
	// SICI: v_mul_i32_i24_e32 v1, s2, v3 ; encoding: [0x02,0x06,0x02,0x12]			// SICI: v_mul_i32_i24_e32 v1, s2, v3 ; encoding: [0x02,0x06,0x02,0x12]
	v_mul_i32_i24 v1, s2, v3			v_mul_i32_i24_e32 v1, s2, v3

	// src1 sgpr			// src1 sgpr
	// SICI: v_mul_i32_i24_e64 v1, v2, s3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x07,0x00,0x00]			// SICI: v_mul_i32_i24_e64 v1, v2, s3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x07,0x00,0x00]
	v_mul_i32_i24 v1, v2, s3			v_mul_i32_i24_e64 v1, v2, s3

	// src0, src1 same sgpr			// src0, src1 same sgpr
	// SICI: v_mul_i32_i24_e64 v1, s2, s2 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x04,0x00,0x00]			// SICI: v_mul_i32_i24_e64 v1, s2, s2 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x04,0x00,0x00]
	v_mul_i32_i24 v1, s2, s2			v_mul_i32_i24_e64 v1, s2, s2

	// src0 sgpr, src1 inline			// src0 sgpr, src1 inline
	// SICI: v_mul_i32_i24_e64 v1, s2, 3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x06,0x01,0x00]			// SICI: v_mul_i32_i24_e64 v1, s2, 3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x06,0x01,0x00]
	v_mul_i32_i24 v1, s2, 3			v_mul_i32_i24_e64 v1, s2, 3

	// src0 inline src1 sgpr			// src0 inline src1 sgpr
	// SICI: v_mul_i32_i24_e64 v1, 3, s3 ; encoding: [0x01,0x00,0x12,0xd2,0x83,0x06,0x00,0x00]			// SICI: v_mul_i32_i24_e64 v1, 3, s3 ; encoding: [0x01,0x00,0x12,0xd2,0x83,0x06,0x00,0x00]
	v_mul_i32_i24 v1, 3, s3			v_mul_i32_i24_e64 v1, 3, s3

	// SICI: v_add_i32_e32 v0, vcc, 0.5, v0 ; encoding: [0xf0,0x00,0x00,0x4a]			// SICI: v_add_i32_e32 v0, vcc, 0.5, v0 ; encoding: [0xf0,0x00,0x00,0x4a]
	// VI: v_add_i32_e32 v0, vcc, 0.5, v0 ; encoding: [0xf0,0x00,0x00,0x32]			// VI: v_add_i32_e32 v0, vcc, 0.5, v0 ; encoding: [0xf0,0x00,0x00,0x32]
	v_add_i32 v0, vcc, 0.5, v0			v_add_i32_e32 v0, vcc, 0.5, v0

	// SICI: v_add_i32_e32 v0, vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x00,0x4a,0x00,0x00,0x48,0x40]			// SICI: v_add_i32_e32 v0, vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x00,0x4a,0x00,0x00,0x48,0x40]
	// VI: v_add_i32_e32 v0, vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x00,0x32,0x00,0x00,0x48,0x40]			// VI: v_add_i32_e32 v0, vcc, 0x40480000, v0 ; encoding: [0xff,0x00,0x00,0x32,0x00,0x00,0x48,0x40]
	v_add_i32 v0, vcc, 3.125, v0			v_add_i32_e32 v0, vcc, 3.125, v0

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Instructions			// Instructions
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	// GCN: v_cndmask_b32_e32 v1, v2, v3, vcc ; encoding: [0x02,0x07,0x02,0x00]			// GCN: v_cndmask_b32_e32 v1, v2, v3, vcc ; encoding: [0x02,0x07,0x02,0x00]
	v_cndmask_b32 v1, v2, v3, vcc			v_cndmask_b32 v1, v2, v3, vcc

	Show All 30 Lines
	v_mul_legacy_f32_e32 v1, v2, v3			v_mul_legacy_f32_e32 v1, v2, v3

	// SICI: v_mul_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x10]			// SICI: v_mul_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x10]
	// VI: v_mul_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x0a]			// VI: v_mul_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x0a]
	v_mul_f32 v1, v2, v3			v_mul_f32 v1, v2, v3

	// SICI: v_mul_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x12]			// SICI: v_mul_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x12]
	// VI: v_mul_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x0c]			// VI: v_mul_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x0c]
	v_mul_i32_i24 v1, v2, v3			v_mul_i32_i24_e32 v1, v2, v3

	// SICI: v_mul_hi_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x14]			// SICI: v_mul_hi_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x14]
	// VI: v_mul_hi_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x0e]			// VI: v_mul_hi_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x0e]
	v_mul_hi_i32_i24 v1, v2, v3			v_mul_hi_i32_i24_e32 v1, v2, v3

	// SICI: v_mul_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x16]			// SICI: v_mul_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x16]
	// VI: v_mul_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x10]			// VI: v_mul_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x10]
	v_mul_u32_u24 v1, v2, v3			v_mul_u32_u24_e32 v1, v2, v3

	// SICI: v_mul_hi_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x18]			// SICI: v_mul_hi_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x18]
	// VI: v_mul_hi_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x12]			// VI: v_mul_hi_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x12]
	v_mul_hi_u32_u24 v1, v2, v3			v_mul_hi_u32_u24_e32 v1, v2, v3

	// SICI: v_min_legacy_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1a]			// SICI: v_min_legacy_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1a]
	// NOVI: error: instruction not supported on this GPU			// NOVI: error: instruction not supported on this GPU
	// NOVI: v_min_legacy_f32_e32 v1, v2, v3			// NOVI: v_min_legacy_f32_e32 v1, v2, v3
	v_min_legacy_f32_e32 v1, v2, v3			v_min_legacy_f32_e32 v1, v2, v3

	// SICI: v_max_legacy_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1c]			// SICI: v_max_legacy_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1c]
	// NOVI: error: instruction not supported on this GPU			// NOVI: error: instruction not supported on this GPU
	// NOVI: v_max_legacy_f32 v1, v2, v3			// NOVI: v_max_legacy_f32 v1, v2, v3
	v_max_legacy_f32 v1, v2, v3			v_max_legacy_f32 v1, v2, v3

	// SICI: v_min_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1e]			// SICI: v_min_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1e]
	// VI: v_min_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x14]			// VI: v_min_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x14]
	v_min_f32_e32 v1, v2, v3			v_min_f32_e32 v1, v2, v3

	// SICI: v_max_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x20]			// SICI: v_max_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x20]
	// VI: v_max_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x16]			// VI: v_max_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x16]
	v_max_f32 v1, v2 v3			v_max_f32 v1, v2 v3

	// SICI: v_min_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x22]			// SICI: v_min_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x22]
	// VI: v_min_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x18]			// VI: v_min_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x18]
	v_min_i32 v1, v2, v3			v_min_i32_e32 v1, v2, v3

	// SICI: v_max_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x24]			// SICI: v_max_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x24]
	// VI: v_max_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1a]			// VI: v_max_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1a]
	v_max_i32 v1, v2, v3			v_max_i32_e32 v1, v2, v3

	// SICI: v_min_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x26]			// SICI: v_min_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x26]
	// VI: v_min_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1c]			// VI: v_min_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1c]
	v_min_u32 v1, v2, v3			v_min_u32_e32 v1, v2, v3

	// SICI: v_max_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x28]			// SICI: v_max_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x28]
	// VI: v_max_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1e]			// VI: v_max_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1e]
	v_max_u32 v1, v2, v3			v_max_u32_e32 v1, v2, v3

	// SICI: v_lshr_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2a]			// SICI: v_lshr_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2a]
	// NOVI: error: instruction not supported on this GPU			// NOVI: error: instruction not supported on this GPU
	// NOVI: v_lshr_b32 v1, v2, v3			// NOVI: v_lshr_b32_e32 v1, v2, v3
	v_lshr_b32 v1, v2, v3			v_lshr_b32_e32 v1, v2, v3

	// SICI: v_lshrrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2c]			// SICI: v_lshrrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2c]
	// VI: v_lshrrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x20]			// VI: v_lshrrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x20]
	v_lshrrev_b32 v1, v2, v3			v_lshrrev_b32_e32 v1, v2, v3

	// SICI: v_ashr_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2e]			// SICI: v_ashr_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2e]
	// NOVI: error: instruction not supported on this GPU			// NOVI: error: instruction not supported on this GPU
	// NOVI: v_ashr_i32 v1, v2, v3			// NOVI: v_ashr_i32_e32 v1, v2, v3
	v_ashr_i32 v1, v2, v3			v_ashr_i32_e32 v1, v2, v3

	// SICI: v_ashrrev_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x30]			// SICI: v_ashrrev_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x30]
	// VI: v_ashrrev_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x22]			// VI: v_ashrrev_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x22]
	v_ashrrev_i32 v1, v2, v3			v_ashrrev_i32_e32 v1, v2, v3

	// SICI: v_lshl_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x32]			// SICI: v_lshl_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x32]
	// NOVI: error: instruction not supported on this GPU			// NOVI: error: instruction not supported on this GPU
	// NOVI: v_lshl_b32_e32 v1, v2, v3			// NOVI: v_lshl_b32_e32 v1, v2, v3
	v_lshl_b32_e32 v1, v2, v3			v_lshl_b32_e32 v1, v2, v3

	// SICI: v_lshlrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x34]			// SICI: v_lshlrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x34]
	// VI: v_lshlrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x24]			// VI: v_lshlrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x24]
	v_lshlrev_b32 v1, v2, v3			v_lshlrev_b32_e32 v1, v2, v3

	// SICI: v_and_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x36]			// SICI: v_and_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x36]
	// VI: v_and_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x26]			// VI: v_and_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x26]
	v_and_b32 v1, v2, v3			v_and_b32_e32 v1, v2, v3

	// SICI: v_or_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x38]			// SICI: v_or_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x38]
	// VI: v_or_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x28]			// VI: v_or_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x28]
	v_or_b32 v1, v2, v3			v_or_b32_e32 v1, v2, v3

	// SICI: v_xor_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3a]			// SICI: v_xor_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3a]
	// VI: v_xor_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2a]			// VI: v_xor_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2a]
	v_xor_b32 v1, v2, v3			v_xor_b32_e32 v1, v2, v3

	// SICI: v_bfm_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3c]			// SICI: v_bfm_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x3c,0xd2,0x02,0x07,0x02,0x00]
	// VI: v_bfm_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x93,0xd2,0x02,0x07,0x02,0x00]			// VI: v_bfm_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x93,0xd2,0x02,0x07,0x02,0x00]
	v_bfm_b32 v1, v2, v3			v_bfm_b32_e64 v1, v2, v3

	// SICI: v_mac_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3e]			// SICI: v_mac_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3e]
	// VI: v_mac_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2c]			// VI: v_mac_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2c]
	v_mac_f32 v1, v2, v3			v_mac_f32_e32 v1, v2, v3

	// SICI: v_madmk_f32_e32 v1, v2, 0x42800000, v3 ; encoding: [0x02,0x07,0x02,0x40,0x00,0x00,0x80,0x42]			// SICI: v_madmk_f32_e32 v1, v2, 0x42800000, v3 ; encoding: [0x02,0x07,0x02,0x40,0x00,0x00,0x80,0x42]
	// VI: v_madmk_f32_e32 v1, v2, 0x42800000, v3 ; encoding: [0x02,0x07,0x02,0x2e,0x00,0x00,0x80,0x42]			// VI: v_madmk_f32_e32 v1, v2, 0x42800000, v3 ; encoding: [0x02,0x07,0x02,0x2e,0x00,0x00,0x80,0x42]
	v_madmk_f32 v1, v2, 64.0, v3			v_madmk_f32_e32 v1, v2, 64.0, v3

	// SICI: v_madak_f32_e32 v1, v2, v3, 0x42800000 ; encoding: [0x02,0x07,0x02,0x42,0x00,0x00,0x80,0x42]			// SICI: v_madak_f32_e32 v1, v2, v3, 0x42800000 ; encoding: [0x02,0x07,0x02,0x42,0x00,0x00,0x80,0x42]
	// VI: v_madak_f32_e32 v1, v2, v3, 0x42800000 ; encoding: [0x02,0x07,0x02,0x30,0x00,0x00,0x80,0x42]			// VI: v_madak_f32_e32 v1, v2, v3, 0x42800000 ; encoding: [0x02,0x07,0x02,0x30,0x00,0x00,0x80,0x42]
	v_madak_f32 v1, v2, v3, 64.0			v_madak_f32_e32 v1, v2, v3, 64.0

	// SICI: v_bcnt_u32_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x44]			// SICI: v_bcnt_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x44,0xd2,0x02,0x07,0x02,0x00]
	// VI: v_bcnt_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x8b,0xd2,0x02,0x07,0x02,0x00]			// VI: v_bcnt_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x8b,0xd2,0x02,0x07,0x02,0x00]
	v_bcnt_u32_b32 v1, v2, v3			v_bcnt_u32_b32_e64 v1, v2, v3

	// SICI: v_mbcnt_lo_u32_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x46]			// SICI: v_mbcnt_lo_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x46,0xd2,0x02,0x07,0x02,0x00]
	// VI: v_mbcnt_lo_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x8c,0xd2,0x02,0x07,0x02,0x00]			// VI: v_mbcnt_lo_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x8c,0xd2,0x02,0x07,0x02,0x00]
	v_mbcnt_lo_u32_b32 v1, v2, v3			v_mbcnt_lo_u32_b32_e64 v1, v2, v3

	// SICI: v_mbcnt_hi_u32_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x48]			// SICI: v_mbcnt_hi_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x48,0xd2,0x02,0x07,0x02,0x00]
	// VI: v_mbcnt_hi_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x8d,0xd2,0x02,0x07,0x02,0x00]			// VI: v_mbcnt_hi_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x8d,0xd2,0x02,0x07,0x02,0x00]
	v_mbcnt_hi_u32_b32 v1, v2, v3			v_mbcnt_hi_u32_b32_e64 v1, v2, v3

	// SICI: v_add_i32_e32 v1, vcc, v2, v3 ; encoding: [0x02,0x07,0x02,0x4a]			// SICI: v_add_i32_e32 v1, vcc, v2, v3 ; encoding: [0x02,0x07,0x02,0x4a]
	// VI: v_add_i32_e32 v1, vcc, v2, v3 ; encoding: [0x02,0x07,0x02,0x32]			// VI: v_add_i32_e32 v1, vcc, v2, v3 ; encoding: [0x02,0x07,0x02,0x32]
	v_add_i32 v1, vcc, v2, v3			v_add_i32_e32 v1, vcc, v2, v3

	// SICI: v_add_i32_e64 v1, s[0:1], v2, v3 ; encoding: [0x01,0x00,0x4a,0xd2,0x02,0x07,0x02,0x00]			// SICI: v_add_i32_e64 v1, s[0:1], v2, v3 ; encoding: [0x01,0x00,0x4a,0xd2,0x02,0x07,0x02,0x00]
	// VI: v_add_i32_e64 v1, s[0:1], v2, v3 ; encoding: [0x01,0x00,0x19,0xd1,0x02,0x07,0x02,0x00]			// VI: v_add_i32_e64 v1, s[0:1], v2, v3 ; encoding: [0x01,0x00,0x19,0xd1,0x02,0x07,0x02,0x00]
	v_add_i32 v1, s[0:1], v2, v3			v_add_i32 v1, s[0:1], v2, v3

	// SICI: v_add_i32_e64 v1, s[0:1], v2, v3 ; encoding: [0x01,0x00,0x4a,0xd2,0x02,0x07,0x02,0x00]			// SICI: v_add_i32_e64 v1, s[0:1], v2, v3 ; encoding: [0x01,0x00,0x4a,0xd2,0x02,0x07,0x02,0x00]
	// VI: v_add_i32_e64 v1, s[0:1], v2, v3 ; encoding: [0x01,0x00,0x19,0xd1,0x02,0x07,0x02,0x00]			// VI: v_add_i32_e64 v1, s[0:1], v2, v3 ; encoding: [0x01,0x00,0x19,0xd1,0x02,0x07,0x02,0x00]
	v_add_i32_e64 v1, s[0:1], v2, v3			v_add_i32_e64 v1, s[0:1], v2, v3
	▲ Show 20 Lines • Show All 106 Lines • ▼ Show 20 Lines
	// SICI: v_cvt_pknorm_u16_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5c]			// SICI: v_cvt_pknorm_u16_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5c]
	// VI: v_cvt_pknorm_u16_f32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x95,0xd2,0x02,0x07,0x02,0x00]			// VI: v_cvt_pknorm_u16_f32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x95,0xd2,0x02,0x07,0x02,0x00]
	v_cvt_pknorm_u16_f32 v1, v2, v3			v_cvt_pknorm_u16_f32 v1, v2, v3

	// SICI: v_cvt_pkrtz_f16_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5e]			// SICI: v_cvt_pkrtz_f16_f32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5e]
	// VI: v_cvt_pkrtz_f16_f32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x96,0xd2,0x02,0x07,0x02,0x00]			// VI: v_cvt_pkrtz_f16_f32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x96,0xd2,0x02,0x07,0x02,0x00]
	v_cvt_pkrtz_f16_f32 v1, v2, v3			v_cvt_pkrtz_f16_f32 v1, v2, v3

	// SICI: v_cvt_pk_u16_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x60]			// SICI: v_cvt_pk_u16_u32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x60,0xd2,0x02,0x07,0x02,0x00]
	// VI: v_cvt_pk_u16_u32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x97,0xd2,0x02,0x07,0x02,0x00]			// VI: v_cvt_pk_u16_u32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x97,0xd2,0x02,0x07,0x02,0x00]
	v_cvt_pk_u16_u32 v1, v2, v3			v_cvt_pk_u16_u32_e64 v1, v2, v3

	// SICI: v_cvt_pk_i16_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x62]			// SICI: v_cvt_pk_i16_i32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x62,0xd2,0x02,0x07,0x02,0x00]
	// VI: v_cvt_pk_i16_i32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x98,0xd2,0x02,0x07,0x02,0x00]			// VI: v_cvt_pk_i16_i32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x98,0xd2,0x02,0x07,0x02,0x00]
	v_cvt_pk_i16_i32 v1, v2, v3			v_cvt_pk_i16_i32_e64 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_add_f16 v1, v2, v3			// NOSICI: v_add_f16_e32 v1, v2, v3
	// VI: v_add_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3e]			// VI: v_add_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3e]
	v_add_f16 v1, v2, v3			v_add_f16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_sub_f16 v1, v2, v3			// NOSICI: v_sub_f16_e32 v1, v2, v3
	// VI: v_sub_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x40]			// VI: v_sub_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x40]
	v_sub_f16 v1, v2, v3			v_sub_f16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_subrev_f16 v1, v2, v3			// NOSICI: v_subrev_f16_e32 v1, v2, v3
	// VI: v_subrev_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x42]			// VI: v_subrev_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x42]
	v_subrev_f16 v1, v2, v3			v_subrev_f16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_mul_f16 v1, v2, v3			// NOSICI: v_mul_f16_e32 v1, v2, v3
	// VI: v_mul_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x44]			// VI: v_mul_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x44]
	v_mul_f16 v1, v2, v3			v_mul_f16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_mac_f16 v1, v2, v3			// NOSICI: v_mac_f16_e32 v1, v2, v3
	// VI: v_mac_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x46]			// VI: v_mac_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x46]
	v_mac_f16 v1, v2, v3			v_mac_f16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_madmk_f16 v1, v2, 64.0, v3			// NOSICI: v_madmk_f16 v1, v2, 64.0, v3
	// VI: v_madmk_f16_e32 v1, v2, 0x42800000, v3 ; encoding: [0x02,0x07,0x02,0x48,0x00,0x00,0x80,0x42]			// VI: v_madmk_f16_e32 v1, v2, 0x42800000, v3 ; encoding: [0x02,0x07,0x02,0x48,0x00,0x00,0x80,0x42]
	v_madmk_f16 v1, v2, 64.0, v3			v_madmk_f16 v1, v2, 64.0, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_madak_f16 v1, v2, v3, 64.0			// NOSICI: v_madak_f16 v1, v2, v3, 64.0
	// VI: v_madak_f16_e32 v1, v2, v3, 0x42800000 ; encoding: [0x02,0x07,0x02,0x4a,0x00,0x00,0x80,0x42]			// VI: v_madak_f16_e32 v1, v2, v3, 0x42800000 ; encoding: [0x02,0x07,0x02,0x4a,0x00,0x00,0x80,0x42]
	v_madak_f16 v1, v2, v3, 64.0			v_madak_f16 v1, v2, v3, 64.0

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_add_u16 v1, v2, v3			// NOSICI: v_add_u16_e32 v1, v2, v3
	// VI: v_add_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x4c]			// VI: v_add_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x4c]
	v_add_u16 v1, v2, v3			v_add_u16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_sub_u16 v1, v2, v3			// NOSICI: v_sub_u16_e32 v1, v2, v3
	// VI: v_sub_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x4e]			// VI: v_sub_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x4e]
	v_sub_u16 v1, v2, v3			v_sub_u16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_subrev_u16 v1, v2, v3			// NOSICI: v_subrev_u16_e32 v1, v2, v3
	// VI: v_subrev_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x50]			// VI: v_subrev_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x50]
	v_subrev_u16 v1, v2, v3			v_subrev_u16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_mul_lo_u16 v1, v2, v3			// NOSICI: v_mul_lo_u16_e32 v1, v2, v3
	// VI: v_mul_lo_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x52]			// VI: v_mul_lo_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x52]
	v_mul_lo_u16 v1, v2, v3			v_mul_lo_u16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_lshlrev_b16 v1, v2, v3			// NOSICI: v_lshlrev_b16_e32 v1, v2, v3
	// VI: v_lshlrev_b16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x54]			// VI: v_lshlrev_b16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x54]
	v_lshlrev_b16 v1, v2, v3			v_lshlrev_b16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_lshrrev_b16 v1, v2, v3			// NOSICI: v_lshrrev_b16_e32 v1, v2, v3
	// VI: v_lshrrev_b16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x56]			// VI: v_lshrrev_b16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x56]
	v_lshrrev_b16 v1, v2, v3			v_lshrrev_b16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_ashrrev_b16 v1, v2, v3			// NOSICI: v_ashrrev_b16_e32 v1, v2, v3
	// VI: v_ashrrev_b16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x58]			// VI: v_ashrrev_b16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x58]
	v_ashrrev_b16 v1, v2, v3			v_ashrrev_b16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_max_f16 v1, v2, v3			// NOSICI: v_max_f16_e32 v1, v2, v3
	// VI: v_max_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5a]			// VI: v_max_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5a]
	v_max_f16 v1, v2, v3			v_max_f16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_min_f16 v1, v2, v3			// NOSICI: v_min_f16_e32 v1, v2, v3
	// VI: v_min_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5c]			// VI: v_min_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5c]
	v_min_f16 v1, v2, v3			v_min_f16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_max_u16 v1, v2, v3			// NOSICI: v_max_u16_e32 v1, v2, v3
	// VI: v_max_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5e]			// VI: v_max_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5e]
	v_max_u16 v1, v2, v3			v_max_u16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_max_i16 v1, v2, v3			// NOSICI: v_max_i16_e32 v1, v2, v3
	// VI: v_max_i16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x60]			// VI: v_max_i16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x60]
	v_max_i16 v1, v2, v3			v_max_i16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_min_u16 v1, v2, v3			// NOSICI: v_min_u16_e32 v1, v2, v3
	// VI: v_min_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x62]			// VI: v_min_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x62]
	v_min_u16 v1, v2, v3			v_min_u16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_min_i16 v1, v2, v3			// NOSICI: v_min_i16_e32 v1, v2, v3
	// VI: v_min_i16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x64]			// VI: v_min_i16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x64]
	v_min_i16 v1, v2, v3			v_min_i16_e32 v1, v2, v3

	// NOSICI: error: instruction not supported on this GPU			// NOSICI: error: instruction not supported on this GPU
	// NOSICI: v_ldexp_f16 v1, v2, v3			// NOSICI: v_ldexp_f16_e32 v1, v2, v3
	// VI: v_ldexp_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x66]			// VI: v_ldexp_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x66]
	v_ldexp_f16 v1, v2, v3			v_ldexp_f16_e32 v1, v2, v3

llvm/trunk/test/MC/AMDGPU/vop3-convert.s

				// RUN: not llvm-mc -arch=amdgcn -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=SICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=SI -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=SICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=bonaire -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=SICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=tonga -show-encoding %s \| FileCheck %s --check-prefix=GCN --check-prefix=CIVI --check-prefix=VI

				// RUN: not llvm-mc -arch=amdgcn -show-encoding %s 2>&1 \| FileCheck %s --check-prefix=NOSICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=SI -show-encoding %s 2>&1 \| FileCheck %s --check-prefix=NOSICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=bonaire -show-encoding %s 2>&1 \| FileCheck %s --check-prefix=NOSICI
				// RUN: not llvm-mc -arch=amdgcn -mcpu=tonga -show-encoding %s 2>&1 \| FileCheck %s -check-prefix=NOVI

				v_mov_b32 [v1], [v2]
				// GCN: v_mov_b32_e32 v1, v2 ; encoding: [0x02,0x03,0x02,0x7e]

				v_mov_b32 v0, 0.5
				// GCN: v_mov_b32_e32 v0, 0.5 ; encoding: [0xf0,0x02,0x00,0x7e]

				v_mov_b32_e32 v0, 3.125
				// GCN: v_mov_b32_e32 v0, 0x40480000 ; encoding: [0xff,0x02,0x00,0x7e,0x00,0x00,0x48,0x40]

				v_mov_b32 v1, ttmp8
				// SICI: v_mov_b32_e32 v1, ttmp8 ; encoding: [0x78,0x02,0x02,0x7e]
				// VI: v_mov_b32_e32 v1, ttmp8 ; encoding: [0x78,0x02,0x02,0x7e]

				// GCN: v_mov_b32_e32 v1, v2 ; encoding: [0x02,0x03,0x02,0x7e]
				v_mov_b32 v1, v2

				// SICI: v_not_b32_e32 v1, v2 ; encoding: [0x02,0x6f,0x02,0x7e]
				// VI: v_not_b32_e32 v1, v2 ; encoding: [0x02,0x57,0x02,0x7e]
				v_not_b32 v1, v2

				// SICI: v_bfrev_b32_e32 v1, v2 ; encoding: [0x02,0x71,0x02,0x7e]
				// VI: v_bfrev_b32_e32 v1, v2 ; encoding: [0x02,0x59,0x02,0x7e]
				v_bfrev_b32 v1, v2

				// SICI: v_ffbh_u32_e32 v1, v2 ; encoding: [0x02,0x73,0x02,0x7e]
				// VI: v_ffbh_u32_e32 v1, v2 ; encoding: [0x02,0x5b,0x02,0x7e]
				v_ffbh_u32 v1, v2

				// SICI: v_ffbl_b32_e32 v1, v2 ; encoding: [0x02,0x75,0x02,0x7e]
				// VI: v_ffbl_b32_e32 v1, v2 ; encoding: [0x02,0x5d,0x02,0x7e]
				v_ffbl_b32 v1, v2

				// SICI: v_ffbh_i32_e32 v1, v2 ; encoding: [0x02,0x77,0x02,0x7e]
				// VI: v_ffbh_i32_e32 v1, v2 ; encoding: [0x02,0x5f,0x02,0x7e]
				v_ffbh_i32_e32 v1, v2

				// SICI: v_frexp_exp_i32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x79,0x02,0x7e]
				// VI: v_frexp_exp_i32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x61,0x02,0x7e]
				v_frexp_exp_i32_f64 v1, v[2:3]

				// SICI: v_frexp_mant_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x7b,0x02,0x7e]
				// VI; v_frexp_mant_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x63,0x02,0x7e]
				v_frexp_mant_f64 v[1:2], v[2:3]

				// SICI: v_fract_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x7d,0x02,0x7e]
				// VI: v_fract_f64_e32 v[1:2], v[2:3] ; encoding: [0x02,0x65,0x02,0x7e]
				v_fract_f64 v[1:2], v[2:3]

				// SICI: v_frexp_exp_i32_f32_e32 v1, v2 ; encoding: [0x02,0x7f,0x02,0x7e]
				// VI: v_frexp_exp_i32_f32_e32 v1, v2 ; encoding: [0x02,0x67,0x02,0x7e]
				v_frexp_exp_i32_f32 v1, v2

				// SICI: v_frexp_mant_f32_e32 v1, v2 ; encoding: [0x02,0x81,0x02,0x7e]
				// VI: v_frexp_mant_f32_e32 v1, v2 ; encoding: [0x02,0x69,0x02,0x7e]
				v_frexp_mant_f32 v1, v2

				// SICI: v_clrexcp ; encoding: [0x00,0x82,0x00,0x7e]
				// VI: v_clrexcp ; encoding: [0x00,0x6a,0x00,0x7e]
				v_clrexcp

				// SICI: v_movreld_b32_e32 v1, v2 ; encoding: [0x02,0x85,0x02,0x7e]
				// VI: v_movreld_b32_e32 v1, v2 ; encoding: [0x02,0x6d,0x02,0x7e]
				v_movreld_b32 v1, v2

				// SICI: v_movrels_b32_e32 v1, v2 ; encoding: [0x02,0x87,0x02,0x7e]
				// VI: v_movrels_b32_e32 v1, v2 ; encoding: [0x02,0x6f,0x02,0x7e]
				v_movrels_b32 v1, v2

				// SICI: v_movrelsd_b32_e32 v1, v2 ; encoding: [0x02,0x89,0x02,0x7e]
				// VI: v_movrelsd_b32_e32 v1, v2 ; encoding: [0x02,0x71,0x02,0x7e]
				v_movrelsd_b32 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_cvt_f16_u16 v1, v2
				// VI: v_cvt_f16_u16_e32 v1, v2 ; encoding: [0x02,0x73,0x02,0x7e]
				v_cvt_f16_u16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_cvt_f16_i16 v1, v2
				// VI: v_cvt_f16_i16_e32 v1, v2 ; encoding: [0x02,0x75,0x02,0x7e]
				v_cvt_f16_i16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_cvt_u16_f16 v1, v2
				// VI: v_cvt_u16_f16_e32 v1, v2 ; encoding: [0x02,0x77,0x02,0x7e]
				v_cvt_u16_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_cvt_i16_f16 v1, v2
				// VI: v_cvt_i16_f16_e32 v1, v2 ; encoding: [0x02,0x79,0x02,0x7e]
				v_cvt_i16_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_rcp_f16 v1, v2
				// VI: v_rcp_f16_e32 v1, v2 ; encoding: [0x02,0x7b,0x02,0x7e]
				v_rcp_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_sqrt_f16 v1, v2
				// VI: v_sqrt_f16_e32 v1, v2 ; encoding: [0x02,0x7d,0x02,0x7e]
				v_sqrt_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_rsq_f16 v1, v2
				// VI: v_rsq_f16_e32 v1, v2 ; encoding: [0x02,0x7f,0x02,0x7e]
				v_rsq_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_log_f16 v1, v2
				// VI: v_log_f16_e32 v1, v2 ; encoding: [0x02,0x81,0x02,0x7e]
				v_log_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_exp_f16 v1, v2
				// VI: v_exp_f16_e32 v1, v2 ; encoding: [0x02,0x83,0x02,0x7e]
				v_exp_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_frexp_mant_f16 v1, v2
				// VI: v_frexp_mant_f16_e32 v1, v2 ; encoding: [0x02,0x85,0x02,0x7e]
				v_frexp_mant_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_frexp_exp_i16_f16 v1, v2
				// VI: v_frexp_exp_i16_f16_e32 v1, v2 ; encoding: [0x02,0x87,0x02,0x7e]
				v_frexp_exp_i16_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_floor_f16 v1, v2
				// VI: v_floor_f16_e32 v1, v2 ; encoding: [0x02,0x89,0x02,0x7e]
				v_floor_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_ceil_f16 v1, v2
				// VI: v_ceil_f16_e32 v1, v2 ; encoding: [0x02,0x8b,0x02,0x7e]
				v_ceil_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_trunc_f16 v1, v2
				// VI: v_trunc_f16_e32 v1, v2 ; encoding: [0x02,0x8d,0x02,0x7e]
				v_trunc_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_rndne_f16 v1, v2
				// VI: v_rndne_f16_e32 v1, v2 ; encoding: [0x02,0x8f,0x02,0x7e]
				v_rndne_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_fract_f16 v1, v2
				// VI: v_fract_f16_e32 v1, v2 ; encoding: [0x02,0x91,0x02,0x7e]
				v_fract_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_sin_f16 v1, v2
				// VI: v_sin_f16_e32 v1, v2 ; encoding: [0x02,0x93,0x02,0x7e]
				v_sin_f16 v1, v2

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_cos_f16 v1, v2
				// VI: v_cos_f16_e32 v1, v2 ; encoding: [0x02,0x95,0x02,0x7e]
				v_cos_f16 v1, v2

				// src0 inline
				// SICI: v_mul_i32_i24_e32 v1, 3, v3 ; encoding: [0x83,0x06,0x02,0x12]
				v_mul_i32_i24 v1, 3, v3

				// src0 negative inline
				// SICI: v_mul_i32_i24_e32 v1, -3, v3 ; encoding: [0xc3,0x06,0x02,0x12]
				v_mul_i32_i24 v1, -3, v3

				// src1 inline
				// SICI: v_mul_i32_i24_e64 v1, v2, 3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x07,0x01,0x00]
				v_mul_i32_i24 v1, v2, 3

				// src1 negative inline
				// SICI: v_mul_i32_i24_e64 v1, v2, -3 ; encoding: [0x01,0x00,0x12,0xd2,0x02,0x87,0x01,0x00]
				v_mul_i32_i24 v1, v2, -3

				// GCN: v_cvt_flr_i32_f32_e32 v1, v2 ; encoding: [0x02,0x1b,0x02,0x7e]
				v_cvt_flr_i32_f32 v1, v2

				// GCN: v_cvt_off_f32_i4_e32 v1, v2 ; encoding: [0x02,0x1d,0x02,0x7e]
				v_cvt_off_f32_i4_e32 v1, v2

				// GCN: v_cvt_f32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x1f,0x02,0x7e]
				v_cvt_f32_f64 v1, v[2:3]

				// GCN: v_cvt_f64_f32_e32 v[1:2], v2 ; encoding: [0x02,0x21,0x02,0x7e]
				v_cvt_f64_f32 v[1:2], v2

				// GCN: v_cvt_f32_ubyte0_e32 v1, v2 ; encoding: [0x02,0x23,0x02,0x7e]
				v_cvt_f32_ubyte0 v1, v2

				// GCN: v_cvt_f32_ubyte1_e32 v1, v2 ; encoding: [0x02,0x25,0x02,0x7e]
				v_cvt_f32_ubyte1_e32 v1, v2

				// GCN: v_cvt_f32_ubyte2_e32 v1, v2 ; encoding: [0x02,0x27,0x02,0x7e]
				v_cvt_f32_ubyte2 v1, v2

				// GCN: v_cvt_f32_ubyte3_e32 v1, v2 ; encoding: [0x02,0x29,0x02,0x7e]
				v_cvt_f32_ubyte3 v1, v2

				// GCN: v_cvt_u32_f64_e32 v1, v[2:3] ; encoding: [0x02,0x2b,0x02,0x7e]
				v_cvt_u32_f64 v1, v[2:3]

				// GCN: v_cvt_f64_u32_e32 v[1:2], v2 ; encoding: [0x02,0x2d,0x02,0x7e]
				v_cvt_f64_u32 v[1:2], v2

				// SICI: v_mul_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x12]
				// VI: v_mul_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x0c]
				v_mul_i32_i24 v1, v2, v3

				// SICI: v_mul_hi_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x14]
				// VI: v_mul_hi_i32_i24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x0e]
				v_mul_hi_i32_i24 v1, v2, v3

				// SICI: v_mul_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x16]
				// VI: v_mul_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x10]
				v_mul_u32_u24 v1, v2, v3

				// SICI: v_mul_hi_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x18]
				// VI: v_mul_hi_u32_u24_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x12]
				v_mul_hi_u32_u24 v1, v2, v3

				// SICI: v_min_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x22]
				// VI: v_min_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x18]
				v_min_i32 v1, v2, v3

				// SICI: v_max_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x24]
				// VI: v_max_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1a]
				v_max_i32 v1, v2, v3

				// SICI: v_min_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x26]
				// VI: v_min_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1c]
				v_min_u32 v1, v2, v3

				// SICI: v_max_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x28]
				// VI: v_max_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x1e]
				v_max_u32 v1, v2, v3

				// SICI: v_lshr_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2a]
				// NOVI: error: instruction not supported on this GPU
				// NOVI: v_lshr_b32 v1, v2, v3
				v_lshr_b32 v1, v2, v3

				// SICI: v_lshrrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2c]
				// VI: v_lshrrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x20]
				v_lshrrev_b32 v1, v2, v3

				// SICI: v_ashr_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2e]
				// NOVI: error: instruction not supported on this GPU
				// NOVI: v_ashr_i32 v1, v2, v3
				v_ashr_i32 v1, v2, v3

				// SICI: v_ashrrev_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x30]
				// VI: v_ashrrev_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x22]
				v_ashrrev_i32 v1, v2, v3

				// SICI: v_lshl_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x32]
				// NOVI: error: instruction not supported on this GPU
				// NOVI: v_lshl_b32_e32 v1, v2, v3
				v_lshl_b32_e32 v1, v2, v3

				// SICI: v_lshlrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x34]
				// VI: v_lshlrev_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x24]
				v_lshlrev_b32 v1, v2, v3

				// SICI: v_and_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x36]
				// VI: v_and_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x26]
				v_and_b32 v1, v2, v3

				// SICI: v_or_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x38]
				// VI: v_or_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x28]
				v_or_b32 v1, v2, v3

				// SICI: v_xor_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3a]
				// VI: v_xor_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x2a]
				v_xor_b32 v1, v2, v3

				// SICI: v_bfm_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3c]
				// VI: v_bfm_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x93,0xd2,0x02,0x07,0x02,0x00]
				v_bfm_b32 v1, v2, v3

				// SICI: v_bcnt_u32_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x44]
				// VI: v_bcnt_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x8b,0xd2,0x02,0x07,0x02,0x00]
				v_bcnt_u32_b32 v1, v2, v3

				// SICI: v_mbcnt_lo_u32_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x46]
				// VI: v_mbcnt_lo_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x8c,0xd2,0x02,0x07,0x02,0x00]
				v_mbcnt_lo_u32_b32 v1, v2, v3

				// SICI: v_mbcnt_hi_u32_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x48]
				// VI: v_mbcnt_hi_u32_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x8d,0xd2,0x02,0x07,0x02,0x00]
				v_mbcnt_hi_u32_b32 v1, v2, v3

				// SICI: v_cvt_pk_u16_u32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x60]
				// VI: v_cvt_pk_u16_u32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x97,0xd2,0x02,0x07,0x02,0x00]
				v_cvt_pk_u16_u32 v1, v2, v3

				// SICI: v_cvt_pk_i16_i32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x62]
				// VI: v_cvt_pk_i16_i32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x98,0xd2,0x02,0x07,0x02,0x00]
				v_cvt_pk_i16_i32 v1, v2, v3

				// SICI: v_bfm_b32_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3c]
				// VI: v_bfm_b32_e64 v1, v2, v3 ; encoding: [0x01,0x00,0x93,0xd2,0x02,0x07,0x02,0x00]
				v_bfm_b32 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_add_f16 v1, v2, v3
				// VI: v_add_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x3e]
				v_add_f16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_sub_f16 v1, v2, v3
				// VI: v_sub_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x40]
				v_sub_f16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_subrev_f16 v1, v2, v3
				// VI: v_subrev_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x42]
				v_subrev_f16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_mul_f16 v1, v2, v3
				// VI: v_mul_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x44]
				v_mul_f16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_mac_f16 v1, v2, v3
				// VI: v_mac_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x46]
				v_mac_f16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_add_u16 v1, v2, v3
				// VI: v_add_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x4c]
				v_add_u16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_sub_u16 v1, v2, v3
				// VI: v_sub_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x4e]
				v_sub_u16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_subrev_u16 v1, v2, v3
				// VI: v_subrev_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x50]
				v_subrev_u16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_mul_lo_u16 v1, v2, v3
				// VI: v_mul_lo_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x52]
				v_mul_lo_u16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_lshlrev_b16 v1, v2, v3
				// VI: v_lshlrev_b16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x54]
				v_lshlrev_b16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_lshrrev_b16 v1, v2, v3
				// VI: v_lshrrev_b16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x56]
				v_lshrrev_b16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_ashrrev_b16 v1, v2, v3
				// VI: v_ashrrev_b16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x58]
				v_ashrrev_b16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_max_f16 v1, v2, v3
				// VI: v_max_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5a]
				v_max_f16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_min_f16 v1, v2, v3
				// VI: v_min_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5c]
				v_min_f16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_max_u16 v1, v2, v3
				// VI: v_max_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x5e]
				v_max_u16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_max_i16 v1, v2, v3
				// VI: v_max_i16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x60]
				v_max_i16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_min_u16 v1, v2, v3
				// VI: v_min_u16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x62]
				v_min_u16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_min_i16 v1, v2, v3
				// VI: v_min_i16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x64]
				v_min_i16 v1, v2, v3

				// NOSICI: error: instruction not supported on this GPU
				// NOSICI: v_ldexp_f16 v1, v2, v3
				// VI: v_ldexp_f16_e32 v1, v2, v3 ; encoding: [0x02,0x07,0x02,0x66]
				v_ldexp_f16 v1, v2, v3

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Assembler: better support for immediate literals in assembler.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 70837

llvm/trunk/lib/Target/AMDGPU/AMDGPUInstructions.td

llvm/trunk/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp

llvm/trunk/lib/Target/AMDGPU/InstPrinter/AMDGPUInstPrinter.cpp

llvm/trunk/lib/Target/AMDGPU/MCTargetDesc/SIMCCodeEmitter.cpp

llvm/trunk/lib/Target/AMDGPU/SIDefines.h

llvm/trunk/lib/Target/AMDGPU/SIInstrInfo.cpp

llvm/trunk/lib/Target/AMDGPU/SIInstrInfo.td

llvm/trunk/lib/Target/AMDGPU/SIInstructions.td

llvm/trunk/lib/Target/AMDGPU/SIRegisterInfo.cpp

llvm/trunk/lib/Target/AMDGPU/SIRegisterInfo.td

llvm/trunk/lib/Target/AMDGPU/SOPInstructions.td

llvm/trunk/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.h

llvm/trunk/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp

llvm/trunk/lib/Target/AMDGPU/VIInstructions.td

llvm/trunk/test/MC/AMDGPU/literals.s

llvm/trunk/test/MC/AMDGPU/reg-syntax-extra.s

llvm/trunk/test/MC/AMDGPU/regression/bug28165.s

llvm/trunk/test/MC/AMDGPU/regression/bug28413.s

llvm/trunk/test/MC/AMDGPU/sop1-err.s

llvm/trunk/test/MC/AMDGPU/trap.s

llvm/trunk/test/MC/AMDGPU/vop1.s

llvm/trunk/test/MC/AMDGPU/vop2.s

llvm/trunk/test/MC/AMDGPU/vop3-convert.s

[AMDGPU] Assembler: better support for immediate literals in assembler.
ClosedPublic