This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Disassembler: Added basic disassembler for AMDGPU target.
ClosedPublic

Authored by SamWot on Jan 29 2016, 6:59 AM.

Details

Reviewers

• tstellarAMD
arsenm

Commits

rGe1818af8c533: [AMDGPU] Disassembler: Added basic disassembler for AMDGPU target
rL261185: [AMDGPU] Disassembler: Added basic disassembler for AMDGPU target

Summary

Changes:

Added disassembler project
Fixed all decoding conflicts in .td files
Added DecoderMethod=“NONE” option to Target.td that allows disabling decoder generation for an instruction
Created decoding functions for VS_32 and VReg_32 register classes.
Added stubs for decoding all register classes.
Added several tests for disassembler

Disassembler only supports:

VI subtarget
VOP1 instruction encoding
32-bit register operands and inline constants
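
The operand support listed above comes down to classifying the value of a source-operand field. The following is a minimal standalone sketch of that classification, using the value ranges that appear in the DecodeVS_32RegisterClass and DecodeLitInteger code of this diff. The `SrcKind` enum and the `classifySrc` and `inlineIntValue` functions are hypothetical names chosen for illustration; they are not part of the patch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical categories, for illustration only.
enum class SrcKind { SGPR, InlineInt, InlineFloat, Literal, VGPR, Other };

// Classify a VI source-operand field value using the ranges from this diff.
SrcKind classifySrc(unsigned Val) {
  if (Val <= 101) return SrcKind::SGPR;
  if (Val >= 128 && Val <= 208) return SrcKind::InlineInt;
  if (Val >= 240 && Val <= 248) return SrcKind::InlineFloat;
  if (Val == 255) return SrcKind::Literal;
  if (Val >= 256 && Val <= 511) return SrcKind::VGPR;
  return SrcKind::Other; // special registers, LDS direct, reserved values
}

// Inline integers: 128..192 encode 0..64, 193..208 encode -1..-16.
// The cast to a signed type avoids unsigned wrap-around in the negative range.
int64_t inlineIntValue(unsigned Val) {
  assert(Val >= 128 && Val <= 208);
  return Val <= 192 ? static_cast<int64_t>(Val) - 128
                    : 192 - static_cast<int64_t>(Val);
}
```

For example, under these assumptions a field value of 193 classifies as an inline integer and yields -1, while 256 classifies as the first VGPR.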

[Valery]

One point that requires attention is how decoder conflicts were resolved:

Groups of target instructions were separated by using different DecoderNamespace values (SICI, VI, CI), an approach similar to AssemblerPredicate.
There were conflicts in IMAGE_<> instructions caused by two different reasons:
1. dmask wasn’t specified for the output (fixed)
2. There are image instructions that differ only in the number of address components but have the same encoding according to the HW spec. The actual number of address components is determined by the HW at runtime using the image resource descriptor, starting from the VGPR encoded in an IMAGE instruction. This means that we should choose only one instruction from each conflicting group to be the rule for the decoder. I didn’t find a way to disable decoder generation for an arbitrary instruction and therefore made a one-line fix to the tablegen generator that suppresses decoder generation when DecoderMethod is set to “NONE”. This is a change that should be reviewed and submitted first. Otherwise I would need to specify a different DecoderNamespace for every instruction in the conflicting group. I haven’t checked yet whether DecoderMethod=“NONE” is already used in other targets.
3. IMAGE_GATHER decoder generation is for now disabled and to be done later.
  
  [/Valery]

Diff Detail

Repository: rL LLVM

Event Timeline

SamWot updated this revision to Diff 46379. Jan 29 2016, 6:59 AM

SamWot retitled this revision to [AMDGPU] Disassembler: Added basic disassembler for AMDGPU target..

SamWot updated this object.

SamWot added reviewers: • tstellarAMD, arsenm.

SamWot added subscribers: vpykhtin, nhaustov, llvm-commits.

Herald added a subscriber: arsenm. Jan 29 2016, 6:59 AM

• tstellarAMD added inline comments. Jan 29 2016, 7:55 AM

lib/Target/AMDGPU/Disassembler/LLVMBuild.txt
1 ↗	(On Diff #46379)	Copy and paste error: s/AArch64/AMDGPU/
lib/Target/AMDGPU/Disassembler/Makefile
1 ↗	(On Diff #46379)	Makefiles have been deleted, so you don't need to update this file.
lib/Target/AMDGPU/SIInstrInfo.td
3017 ↗	(On Diff #46379)	This can be removed.
utils/TableGen/FixedLenDecoderEmitter.cpp
1733 ↗	(On Diff #46379)	This change needs to be in a separate patch.

arsenm added inline comments. Jan 29 2016, 5:30 PM

lib/Target/AMDGPU/AMDGPU.td
306 ↗	(On Diff #46379)	Do we need this? Ideally we would only use the names
lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp
22 ↗	(On Diff #46379)	This should be the first included file
40–74 ↗	(On Diff #46379)	Can you move the definitions of these functions up to avoid needing to declare them separately? Can they also be moved down to be below the includes for the generated files?
113 ↗	(On Diff #46379)	It looks like this doesn't handle using one of the literal constants for a 64-bit/f64 operand, which does work.
lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.h
1 ↗	(On Diff #46379)	This should have a -*- C++ -*- added at the end of the first line
15 ↗	(On Diff #46379)	Remove this extra comment line
24–26 ↗	(On Diff #46379)	These should be alphabetized

vpykhtin added inline comments. Jan 31 2016, 3:48 AM

utils/TableGen/FixedLenDecoderEmitter.cpp
1733 ↗	(On Diff #46379)	I found out isAsmParserOnly=1 does the job, so I'll remove this.

Fixed some issues, reverted changes in core TableGen.

Unfortunately, the last uploaded diff contains new changes from LLVM master, so comparing it with the previous diff will also show the changes from master.

lib/Target/AMDGPU/AMDGPU.td
306 ↗	(On Diff #46379)	This is needed now. Otherwise LLVM generates invalid decoder methods for most instructions. It tries to match operand names against the names of the fields that specify operands in the instruction encoding. E.g. for V_MOV_B32 the operand names are "dst" and "src0" and the field names are "vdst" and "src0". So LLVM fails to match the destination operand ("vdst" vs. "dst") and doesn't generate a decoding method for it. At the same time it matches the "src0" operand and decodes it successfully. So until we fix the operand names, using this option is the only reasonable choice.
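
The name mismatch described in this comment can be illustrated with a toy model. This is only a sketch under the assumption that an operand is decoded when its name equals the name of an encoding field; `matchedOperands` is a hypothetical helper written for this illustration and is not TableGen's actual implementation.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Toy model: return the operands whose names also appear as encoding field
// names. Operands that are not returned would get no generated decoder call.
std::vector<std::string> matchedOperands(
    const std::vector<std::string> &OperandNames,
    const std::vector<std::string> &FieldNames) {
  std::vector<std::string> Matched;
  for (const std::string &Op : OperandNames)
    if (std::find(FieldNames.begin(), FieldNames.end(), Op) != FieldNames.end())
      Matched.push_back(Op);
  return Matched;
}
```

With operands {"dst", "src0"} and fields {"vdst", "src0"}, as in the V_MOV_B32 example, only "src0" is returned, so the destination would be left undecoded in this model.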
lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp
113 ↗	(On Diff #46379)	For now the disassembler supports only 32-bit operands. Support for 64-bit operands is planned as the next step.

• tstellarAMD added inline comments. Feb 1 2016, 8:07 AM

lib/Target/AMDGPU/AMDGPU.td
306 ↗	(On Diff #46525)	We should fix these name mismatches rather than enabling this bit. It is much safer to use the name based matching.

arsenm added inline comments. Feb 1 2016, 5:40 PM

lib/Target/AMDGPU/AMDGPU.td
306 ↗	(On Diff #46525)	I think we should change the names to vdst. The way it is now I think follows the names of the operands as they appear in the ISA documentation, but the document isn't perfectly consistent. This is because a few instructions have a second sdst

SamWot added inline comments. Feb 1 2016, 9:10 PM

lib/Target/AMDGPU/AMDGPU.td
306 ↗	(On Diff #46525)	The problem here is that renaming to vdst will affect all patterns that match this type of instruction. Also I'm not sure this is really needed, because LLVM will still match operands positionally but with the help of operand names.

arsenm added inline comments. Feb 1 2016, 9:57 PM

lib/Target/AMDGPU/AMDGPU.td
306 ↗	(On Diff #46525)	This shouldn't be difficult. It would be best to start out always using the names. It took a while to enable noNamedPositionallyEncodedOperands

Removed decodePositionallyEncodedOperands. Renamed $dst to $vdst operands for VOP instructions.

In D16723#344064, @SamWot wrote:

Removed decodePositionallyEncodedOperands. Renamed $dst to $vdst operands for VOP instructions.

The changes to the operand names should really be in a different patch so they can be tested separately.

vpykhtin added a child revision: D16998: [AMDGPU] llvm-objdump: disassembling amdgcn object file. Feb 11 2016, 10:00 AM

SamWot added a parent revision: D16920: [AMDGPU] Rename $dst operand to $vdst for VOP instructions.. Feb 12 2016, 1:58 AM

SamWot mentioned this in D17194: [AMDGPU] Disassembler: Support for all VOP1 instructions.. Feb 12 2016, 4:39 AM

SamWot added a child revision: D17194: [AMDGPU] Disassembler: Support for all VOP1 instructions..

SamWot edited edge metadata. Feb 12 2016, 6:29 AM

SamWot added a project: Restricted Project.

Can you rebase this on top of the latest LLVM code?

Updated diff with latest LLVM master code

The encoding of the dmask field is incorrect with this patch. You may want to split the dmask changes into a separate patch. Here is an example test case:

Good Encoding: image_sample v[0:2], 13, 0, 0, 0, 0, 0, 0, 0, v[2:3], s[12:19], s[0:3] ; F0800D00 00030002
Bad Encoding: image_sample v[0:2], 13, 0, 0, 0, 0, 0, 0, 0, v[2:3], s[12:19], s[0:3] ; F0800700 00030002

target triple = "amdgcn--"

define void @main([9 x <16 x i8>] addrspace(2)* byval, [17 x <16 x i8>] addrspace(2)* byval, [17 x <8 x i32>] addrspace(2)* byval, i32 addrspace(2)* byval, float inreg, i32 inreg, <2 x i32>, <2 x i32>, <2 x i32>, <3 x i32>, <2 x i32>, <2 x i32>, <2 x i32>, float, float, float, float, float, i32, i32, float, float) #0 {
main_body:

%22 = getelementptr [17 x <8 x i32>], [17 x <8 x i32>] addrspace(2)* %2, i64 0, i64 0
%23 = load <8 x i32>, <8 x i32> addrspace(2)* %22, align 32, !tbaa !0
%24 = bitcast [17 x <8 x i32>] addrspace(2)* %2 to [0 x <4 x i32>] addrspace(2)*
%25 = getelementptr [0 x <4 x i32>], [0 x <4 x i32>] addrspace(2)* %24, i64 0, i64 3
%26 = load <4 x i32>, <4 x i32> addrspace(2)* %25, align 16, !tbaa !0
%27 = call float @llvm.SI.fs.interp(i32 0, i32 0, i32 %5, <2 x i32> %7)
%28 = call float @llvm.SI.fs.interp(i32 1, i32 0, i32 %5, <2 x i32> %7)
%29 = bitcast float %27 to i32
%30 = bitcast float %28 to i32
%31 = insertelement <2 x i32> undef, i32 %29, i32 0
%32 = insertelement <2 x i32> %31, i32 %30, i32 1
%33 = call <4 x float> @llvm.SI.image.sample.v2i32(<2 x i32> %32, <8 x i32> %23, <4 x i32> %26, i32 15, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0, i32 0)
%34 = extractelement <4 x float> %33, i32 0
%35 = extractelement <4 x float> %33, i32 2
%36 = extractelement <4 x float> %33, i32 3
%37 = call i32 @llvm.SI.packf16(float %34, float 0.000000e+00)
%38 = bitcast i32 %37 to float
%39 = call i32 @llvm.SI.packf16(float %35, float %36)
%40 = bitcast i32 %39 to float
call void @llvm.SI.export(i32 15, i32 1, i32 1, i32 0, i32 1, float %38, float %40, float undef, float undef)
ret void

}

; Function Attrs: nounwind readnone
declare float @llvm.SI.fs.interp(i32, i32, i32, <2 x i32>) #1

; Function Attrs: nounwind readnone
declare <4 x float> @llvm.SI.image.sample.v2i32(<2 x i32>, <8 x i32>, <4 x i32>, i32, i32, i32, i32, i32, i32, i32, i32) #1

; Function Attrs: nounwind readnone
declare i32 @llvm.SI.packf16(float, float) #1

declare void @llvm.SI.export(i32, i32, i32, i32, i32, float, float, float, float)

attributes #0 = { "ShaderType"="0" }
attributes #1 = { nounwind readnone }

!0 = !{!"const", null, i32 1}
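
The difference between the two encodings quoted above can be checked by extracting the dmask field from the first dword. This is a minimal sketch assuming that the quoted hex words are the instruction dwords as values and that dmask occupies bits [11:8] of the first dword, as in the MIMG layout of the ISA documentation; `extractDMask` is a hypothetical helper and not part of the patch.

```cpp
#include <cstdint>

// Assumption: dmask lives in bits [11:8] of the first MIMG instruction dword.
constexpr uint32_t extractDMask(uint32_t FirstDword) {
  return (FirstDword >> 8) & 0xF;
}

// The good encoding carries dmask 13, matching the printed operand.
static_assert(extractDMask(0xF0800D00u) == 13, "expected dmask 13");
// The bad encoding carries dmask 7 although the printed operand is still 13.
static_assert(extractDMask(0xF0800700u) == 7, "expected dmask 7");
```

Under these assumptions, 13 is 0b1101 and 7 is 0b0111, so the two words enable different channel sets even though the printed instruction text is identical.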

This revision now requires changes to proceed. Feb 17 2016, 6:56 AM

Fix dmask encoding.

Closed by commit rL261185: [AMDGPU] Disassembler: Added basic disassembler for AMDGPU target (authored by tstellar). Feb 17 2016, 7:47 PM

This revision was automatically updated to reflect the committed changes.

I committed this with a few warning fixes.

Revision Contents

Path

Size

llvm/

trunk/

lib/

Target/

AMDGPU/

AMDGPUInstructions.td

8 lines

CIInstructions.td

6 lines

CMakeLists.txt

2 lines

Disassembler/

AMDGPUDisassembler.h

57 lines

AMDGPUDisassembler.cpp

302 lines

7 lines

23 lines

3 lines

6 lines

171 lines

11 lines

4 lines

test/

MC/

Disassembler/

AMDGPU/

lit.local.cfg

2 lines

mov.txt

31 lines

nop.txt

4 lines

Diff 48272

llvm/trunk/lib/Target/AMDGPU/AMDGPUInstructions.td

class AMDGPUInst <dag outs, dag ins, string asm, list<dag> pattern> : Instruction {

let Namespace = "AMDGPU";		let Namespace = "AMDGPU";
let OutOperandList = outs;		let OutOperandList = outs;
let InOperandList = ins;		let InOperandList = ins;
let AsmString = asm;		let AsmString = asm;
let Pattern = pattern;		let Pattern = pattern;
let Itinerary = NullALU;		let Itinerary = NullALU;

		// SoftFail is a field the disassembler can use to provide a way for
		// instructions to not match without killing the whole decode process. It is
		// mainly used for ARM, but Tablegen expects this field to exist or it fails
		// to build the decode table.
		field bits<64> SoftFail = 0;

		let DecoderNamespace = Namespace;

let TSFlags{63} = isRegisterLoad;		let TSFlags{63} = isRegisterLoad;
let TSFlags{62} = isRegisterStore;		let TSFlags{62} = isRegisterStore;
}		}

class AMDGPUShaderInst <dag outs, dag ins, string asm, list<dag> pattern>		class AMDGPUShaderInst <dag outs, dag ins, string asm, list<dag> pattern>
: AMDGPUInst<outs, ins, asm, pattern> {		: AMDGPUInst<outs, ins, asm, pattern> {

field bits<32> Inst = 0xffffffff;		field bits<32> Inst = 0xffffffff;

llvm/trunk/lib/Target/AMDGPU/CIInstructions.td


	defm S_DCACHE_INV_VOL : SMRD_Inval <smrd<0x1d, 0x22>,			defm S_DCACHE_INV_VOL : SMRD_Inval <smrd<0x1d, 0x22>,
	"s_dcache_inv_vol", int_amdgcn_s_dcache_inv_vol>;			"s_dcache_inv_vol", int_amdgcn_s_dcache_inv_vol>;

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// MUBUF Instructions			// MUBUF Instructions
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

				let DisableSIDecoder = 1 in {
	defm BUFFER_WBINVL1_VOL : MUBUF_Invalidate <mubuf<0x70, 0x3f>,			defm BUFFER_WBINVL1_VOL : MUBUF_Invalidate <mubuf<0x70, 0x3f>,
	"buffer_wbinvl1_vol", int_amdgcn_buffer_wbinvl1_vol			"buffer_wbinvl1_vol", int_amdgcn_buffer_wbinvl1_vol
	>;			>;
				}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Flat Instructions			// Flat Instructions
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	defm FLAT_LOAD_UBYTE : FLAT_Load_Helper <			defm FLAT_LOAD_UBYTE : FLAT_Load_Helper <
	flat<0x8, 0x10>, "flat_load_ubyte", VGPR_32			flat<0x8, 0x10>, "flat_load_ubyte", VGPR_32
	>;			>;
	...
	defm FLAT_ATOMIC_DEC_X2 : FLAT_ATOMIC <			defm FLAT_ATOMIC_DEC_X2 : FLAT_ATOMIC <
	flat<0x5d, 0x6c>, "flat_atomic_dec_x2", VReg_64			flat<0x5d, 0x6c>, "flat_atomic_dec_x2", VReg_64
	>;			>;

	} // End SubtargetPredicate = isCIVI			} // End SubtargetPredicate = isCIVI

	// CI Only flat instructions			// CI Only flat instructions

	let SubtargetPredicate = isCI, VIAssemblerPredicate = DisableInst in {			let SubtargetPredicate = isCI, VIAssemblerPredicate = DisableInst, DisableVIDecoder = 1 in {

	defm FLAT_ATOMIC_FCMPSWAP : FLAT_ATOMIC <			defm FLAT_ATOMIC_FCMPSWAP : FLAT_ATOMIC <
	flat<0x3e>, "flat_atomic_fcmpswap", VGPR_32, VReg_64			flat<0x3e>, "flat_atomic_fcmpswap", VGPR_32, VReg_64
	>;			>;
	defm FLAT_ATOMIC_FMIN : FLAT_ATOMIC <			defm FLAT_ATOMIC_FMIN : FLAT_ATOMIC <
	flat<0x3f>, "flat_atomic_fmin", VGPR_32			flat<0x3f>, "flat_atomic_fmin", VGPR_32
	>;			>;
	defm FLAT_ATOMIC_FMAX : FLAT_ATOMIC <			defm FLAT_ATOMIC_FMAX : FLAT_ATOMIC <
	flat<0x40>, "flat_atomic_fmax", VGPR_32			flat<0x40>, "flat_atomic_fmax", VGPR_32
	>;			>;
	defm FLAT_ATOMIC_FCMPSWAP_X2 : FLAT_ATOMIC <			defm FLAT_ATOMIC_FCMPSWAP_X2 : FLAT_ATOMIC <
	flat<0x5e>, "flat_atomic_fcmpswap_x2", VReg_64, VReg_128			flat<0x5e>, "flat_atomic_fcmpswap_x2", VReg_64, VReg_128
	>;			>;
	defm FLAT_ATOMIC_FMIN_X2 : FLAT_ATOMIC <			defm FLAT_ATOMIC_FMIN_X2 : FLAT_ATOMIC <
	flat<0x5f>, "flat_atomic_fmin_x2", VReg_64			flat<0x5f>, "flat_atomic_fmin_x2", VReg_64
	>;			>;
	defm FLAT_ATOMIC_FMAX_X2 : FLAT_ATOMIC <			defm FLAT_ATOMIC_FMAX_X2 : FLAT_ATOMIC <
	flat<0x60>, "flat_atomic_fmax_x2", VReg_64			flat<0x60>, "flat_atomic_fmax_x2", VReg_64
	>;			>;

	} // End SubtargetPredicate = isCI, VIAssemblerPredicate = DisableInst			} // End SubtargetPredicate = isCI, VIAssemblerPredicate = DisableInst, DisableVIDecoder = 1

	let Predicates = [isCI] in {			let Predicates = [isCI] in {

	// Convert (x - floor(x)) to fract(x)			// Convert (x - floor(x)) to fract(x)
	def : Pat <			def : Pat <
	(f32 (fsub (f32 (VOP3Mods f32:$x, i32:$mods)),			(f32 (fsub (f32 (VOP3Mods f32:$x, i32:$mods)),
	(f32 (ffloor (f32 (VOP3Mods f32:$x, i32:$mods)))))),			(f32 (ffloor (f32 (VOP3Mods f32:$x, i32:$mods)))))),
	(V_FRACT_F32_e64 $mods, $x, DSTCLAMP.NONE, DSTOMOD.NONE)			(V_FRACT_F32_e64 $mods, $x, DSTCLAMP.NONE, DSTOMOD.NONE)

llvm/trunk/lib/Target/AMDGPU/CMakeLists.txt

set(LLVM_TARGET_DEFINITIONS AMDGPU.td)		set(LLVM_TARGET_DEFINITIONS AMDGPU.td)

tablegen(LLVM AMDGPUGenRegisterInfo.inc -gen-register-info)		tablegen(LLVM AMDGPUGenRegisterInfo.inc -gen-register-info)
tablegen(LLVM AMDGPUGenInstrInfo.inc -gen-instr-info)		tablegen(LLVM AMDGPUGenInstrInfo.inc -gen-instr-info)
tablegen(LLVM AMDGPUGenDAGISel.inc -gen-dag-isel)		tablegen(LLVM AMDGPUGenDAGISel.inc -gen-dag-isel)
tablegen(LLVM AMDGPUGenCallingConv.inc -gen-callingconv)		tablegen(LLVM AMDGPUGenCallingConv.inc -gen-callingconv)
tablegen(LLVM AMDGPUGenSubtargetInfo.inc -gen-subtarget)		tablegen(LLVM AMDGPUGenSubtargetInfo.inc -gen-subtarget)
tablegen(LLVM AMDGPUGenIntrinsics.inc -gen-tgt-intrinsic)		tablegen(LLVM AMDGPUGenIntrinsics.inc -gen-tgt-intrinsic)
tablegen(LLVM AMDGPUGenMCCodeEmitter.inc -gen-emitter)		tablegen(LLVM AMDGPUGenMCCodeEmitter.inc -gen-emitter)
tablegen(LLVM AMDGPUGenDFAPacketizer.inc -gen-dfa-packetizer)		tablegen(LLVM AMDGPUGenDFAPacketizer.inc -gen-dfa-packetizer)
tablegen(LLVM AMDGPUGenAsmWriter.inc -gen-asm-writer)		tablegen(LLVM AMDGPUGenAsmWriter.inc -gen-asm-writer)
tablegen(LLVM AMDGPUGenAsmMatcher.inc -gen-asm-matcher)		tablegen(LLVM AMDGPUGenAsmMatcher.inc -gen-asm-matcher)
		tablegen(LLVM AMDGPUGenDisassemblerTables.inc -gen-disassembler)
add_public_tablegen_target(AMDGPUCommonTableGen)		add_public_tablegen_target(AMDGPUCommonTableGen)

add_llvm_target(AMDGPUCodeGen		add_llvm_target(AMDGPUCodeGen
AMDILCFGStructurizer.cpp		AMDILCFGStructurizer.cpp
AMDGPUAlwaysInlinePass.cpp		AMDGPUAlwaysInlinePass.cpp
AMDGPUAnnotateKernelFeatures.cpp		AMDGPUAnnotateKernelFeatures.cpp
AMDGPUAnnotateUniformValues.cpp		AMDGPUAnnotateUniformValues.cpp
AMDGPUAsmPrinter.cpp		AMDGPUAsmPrinter.cpp
...
SIMachineScheduler.cpp		SIMachineScheduler.cpp
SIRegisterInfo.cpp		SIRegisterInfo.cpp
SIShrinkInstructions.cpp		SIShrinkInstructions.cpp
SITypeRewriter.cpp		SITypeRewriter.cpp
)		)

add_subdirectory(AsmParser)		add_subdirectory(AsmParser)
add_subdirectory(InstPrinter)		add_subdirectory(InstPrinter)
		add_subdirectory(Disassembler)
add_subdirectory(TargetInfo)		add_subdirectory(TargetInfo)
add_subdirectory(MCTargetDesc)		add_subdirectory(MCTargetDesc)
add_subdirectory(Utils)		add_subdirectory(Utils)

llvm/trunk/lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.h

				//===-- AMDGPUDisassembler.hpp - Disassembler for AMDGPU ISA ---*- C++ -*--===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				/// \file
				///
				/// This file contains declaration for AMDGPU ISA disassembler
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_LIB_TARGET_AMDGPU_DISASSEMBLER_AMDGPUDISASSEMBLER_H
				#define LLVM_LIB_TARGET_AMDGPU_DISASSEMBLER_AMDGPUDISASSEMBLER_H

				#include "llvm/MC/MCDisassembler/MCDisassembler.h"

				namespace llvm {

				class MCContext;
				class MCInst;
				class MCSubtargetInfo;

				class AMDGPUDisassembler : public MCDisassembler {
				public:
				AMDGPUDisassembler(const MCSubtargetInfo &STI, MCContext &Ctx) :
				MCDisassembler(STI, Ctx) {}

				~AMDGPUDisassembler() {}

				DecodeStatus getInstruction(MCInst &MI, uint64_t &Size,
				ArrayRef<uint8_t> Bytes, uint64_t Address,
				raw_ostream &WS, raw_ostream &CS) const override;

				/// Decode inline float value in VSrc field
				DecodeStatus DecodeLitFloat(unsigned Imm, uint32_t& F) const;
				/// Decode inline integer value in VSrc field
				DecodeStatus DecodeLitInteger(unsigned Imm, int64_t& I) const;
				/// Decode VGPR register
				DecodeStatus DecodeVgprRegister(unsigned Val, unsigned& RegID) const;
				/// Decode SGPR register
				DecodeStatus DecodeSgprRegister(unsigned Val, unsigned& RegID) const;
				/// Decode register in VSrc field
				DecodeStatus DecodeSrcRegister(unsigned Val, unsigned& RegID) const;

				DecodeStatus DecodeVS_32RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr) const;

				DecodeStatus DecodeVGPR_32RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr) const;
				};
				} // namespace llvm

				#endif //LLVM_LIB_TARGET_AMDGPU_DISASSEMBLER_AMDGPUDISASSEMBLER_H

llvm/trunk/lib/Target/AMDGPU/Disassembler/AMDGPUDisassembler.cpp

				//===-- AMDGPUDisassembler.cpp - Disassembler for AMDGPU ISA --------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				//===----------------------------------------------------------------------===//
				//
				/// \file
				///
				/// This file contains definition for AMDGPU ISA disassembler
				//
				//===----------------------------------------------------------------------===//

				// ToDo: What to do with instruction suffixes (v_mov_b32 vs v_mov_b32_e32)?

				#include "AMDGPUDisassembler.h"
				#include "AMDGPU.h"
				#include "AMDGPURegisterInfo.h"
				#include "Utils/AMDGPUBaseInfo.h"

				#include "llvm/MC/MCFixedLenDisassembler.h"
				#include "llvm/MC/MCInst.h"
				#include "llvm/MC/MCInstrDesc.h"
				#include "llvm/MC/MCSubtargetInfo.h"
				#include "llvm/Support/Debug.h"
				#include "llvm/Support/TargetRegistry.h"


				using namespace llvm;

				#define DEBUG_TYPE "amdgpu-disassembler"

				typedef llvm::MCDisassembler::DecodeStatus DecodeStatus;


				static DecodeStatus DecodeVGPR_32RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				const AMDGPUDisassembler *Dis =
				static_cast<const AMDGPUDisassembler *>(Decoder);
				return Dis->DecodeVGPR_32RegisterClass(Inst, Imm, Addr);
				}

				static DecodeStatus DecodeVS_32RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				const AMDGPUDisassembler *Dis =
				static_cast<const AMDGPUDisassembler *>(Decoder);
				return Dis->DecodeVS_32RegisterClass(Inst, Imm, Addr);
				}

				static DecodeStatus DecodeVS_64RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				// ToDo
				return MCDisassembler::Fail;
				}

				static DecodeStatus DecodeVReg_64RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				// ToDo
				return MCDisassembler::Fail;
				}

				static DecodeStatus DecodeVReg_96RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				// ToDo
				return MCDisassembler::Fail;
				}

				static DecodeStatus DecodeVReg_128RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				// ToDo
				return MCDisassembler::Fail;
				}

				static DecodeStatus DecodeSReg_32RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				// ToDo
				return MCDisassembler::Fail;
				}

				static DecodeStatus DecodeSReg_64RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				// ToDo
				return MCDisassembler::Fail;
				}

				static DecodeStatus DecodeSReg_128RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				// ToDo
				return MCDisassembler::Fail;
				}

				static DecodeStatus DecodeSReg_256RegisterClass(MCInst &Inst, unsigned Imm,
				uint64_t Addr, const void *Decoder) {
				// ToDo
				return MCDisassembler::Fail;
				}


				#define GET_SUBTARGETINFO_ENUM
				#include "AMDGPUGenSubtargetInfo.inc"
				#undef GET_SUBTARGETINFO_ENUM

				#include "AMDGPUGenDisassemblerTables.inc"

				//===----------------------------------------------------------------------===//
				//
				//===----------------------------------------------------------------------===//

				DecodeStatus AMDGPUDisassembler::getInstruction(MCInst &MI, uint64_t &Size,
				ArrayRef<uint8_t> Bytes,
				uint64_t Address,
				raw_ostream &WS,
				raw_ostream &CS) const {
				CommentStream = &CS;

				// ToDo: AMDGPUDisassembler supports only VI ISA.
				assert(AMDGPU::isVI(STI) && "Can disassemble only VI ISA.");

				// Try decode 32-bit instruction
				if (Bytes.size() < 4) {
				Size = 0;
				return MCDisassembler::Fail;
				}
				uint32_t Insn =
				(Bytes[3] << 24) | (Bytes[2] << 16) | (Bytes[1] << 8) | (Bytes[0] << 0);

				// Calling the auto-generated decoder function.
				DecodeStatus Result =
				decodeInstruction(DecoderTableVI32, MI, Insn, Address, this, STI);
				if (Result != MCDisassembler::Success) {
				Size = 0;
				return MCDisassembler::Fail;
				}
				Size = 4;

				return MCDisassembler::Success;
				}

				DecodeStatus AMDGPUDisassembler::DecodeLitFloat(unsigned Imm, uint32_t& F) const {
				// ToDo: case 248: 1/(2*PI) - is allowed only on VI
				// ToDo: AMDGPUInstPrinter does not support 1/(2PI). It considers 1/(2PI) as
				// literal constant.
				switch(Imm) {
				case 240: F = FloatToBits(0.5f); return MCDisassembler::Success;
				case 241: F = FloatToBits(-0.5f); return MCDisassembler::Success;
				case 242: F = FloatToBits(1.0f); return MCDisassembler::Success;
				case 243: F = FloatToBits(-1.0f); return MCDisassembler::Success;
				case 244: F = FloatToBits(2.0f); return MCDisassembler::Success;
				case 245: F = FloatToBits(-2.0f); return MCDisassembler::Success;
				case 246: F = FloatToBits(4.0f); return MCDisassembler::Success;
				case 247: F = FloatToBits(-4.0f); return MCDisassembler::Success;
				case 248: F = 0x3e22f983; return MCDisassembler::Success; // 1/(2*PI)
				default: return MCDisassembler::Fail;
				}
				}

				DecodeStatus AMDGPUDisassembler::DecodeLitInteger(unsigned Imm,
				int64_t& I) const {
				if ((Imm >= 128) && (Imm <= 192)) {
				I = Imm - 128;
				return MCDisassembler::Success;
				} else if ((Imm >= 193) && (Imm <= 208)) {
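				// Cast to a signed type first: 192 - Imm would wrap around in unsigned arithmetic.
				I = 192 - static_cast<int64_t>(Imm);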
				return MCDisassembler::Success;
				}
				return MCDisassembler::Fail;
				}

				DecodeStatus AMDGPUDisassembler::DecodeVgprRegister(unsigned Val,
				unsigned& RegID) const {
				if (Val > 255) {
				return MCDisassembler::Fail;
				}
				RegID = AMDGPUMCRegisterClasses[AMDGPU::VGPR_32RegClassID].getRegister(Val);
				return MCDisassembler::Success;
				}

				DecodeStatus AMDGPUDisassembler::DecodeSgprRegister(unsigned Val,
				unsigned& RegID) const {
				// ToDo: SI/CI have 104 SGPRs, VI - 102
				if (Val > 101) {
				return MCDisassembler::Fail;
				}
				RegID = AMDGPUMCRegisterClasses[AMDGPU::SGPR_32RegClassID].getRegister(Val);
				return MCDisassembler::Success;
				}

				DecodeStatus AMDGPUDisassembler::DecodeSrcRegister(unsigned Val,
				unsigned& RegID) const {
				// ToDo: deal with out-of range registers
				using namespace AMDGPU;
				if (Val <= 101) {
				return DecodeSgprRegister(Val, RegID);
				} else if ((Val >= 256) && (Val <= 511)) {
				return DecodeVgprRegister(Val - 256, RegID);
				} else {
				switch(Val) {
				case 102: RegID = getMCReg(FLAT_SCR_LO, STI); return MCDisassembler::Success;
				case 103: RegID = getMCReg(FLAT_SCR_HI, STI); return MCDisassembler::Success;
				// ToDo: no support for xnack_mask_lo/_hi register
				case 104:
				case 105: return MCDisassembler::Fail;
				case 106: RegID = getMCReg(VCC_LO, STI); return MCDisassembler::Success;
				case 107: RegID = getMCReg(VCC_HI, STI); return MCDisassembler::Success;
				// ToDo: no support for tba_lo/_hi register
				case 108:
				case 109: return MCDisassembler::Fail;
				// ToDo: no support for tma_lo/_hi register
				case 110:
				case 111: return MCDisassembler::Fail;
				// ToDo: no support for ttmp[0:11] register
				case 112:
				case 113:
				case 114:
				case 115:
				case 116:
				case 117:
				case 118:
				case 119:
				case 120:
				case 121:
				case 122:
				case 123: return MCDisassembler::Fail;
				case 124: RegID = getMCReg(M0, STI); return MCDisassembler::Success;
				case 126: RegID = getMCReg(EXEC_LO, STI); return MCDisassembler::Success;
				case 127: RegID = getMCReg(EXEC_HI, STI); return MCDisassembler::Success;
				// ToDo: no support for vccz register
				case 251: return MCDisassembler::Fail;
				// ToDo: no support for execz register
				case 252: return MCDisassembler::Fail;
				case 253: RegID = getMCReg(SCC, STI); return MCDisassembler::Success;
				default: return MCDisassembler::Fail;
				}
				}
				return MCDisassembler::Fail;
				}

				DecodeStatus AMDGPUDisassembler::DecodeVGPR_32RegisterClass(llvm::MCInst &Inst,
				unsigned Imm,
				uint64_t Addr) const {
				unsigned RegID;
				if (DecodeVgprRegister(Imm, RegID) == MCDisassembler::Success) {
				Inst.addOperand(MCOperand::createReg(RegID));
				return MCDisassembler::Success;
				}
				return MCDisassembler::Fail;
				}

				DecodeStatus AMDGPUDisassembler::DecodeVS_32RegisterClass(MCInst &Inst,
				unsigned Imm,
				uint64_t Addr) const {
				// ToDo: different opcodes allow different formats of this operand
				if ((Imm >= 128) && (Imm <= 208)) {
				// immediate integer
				int64_t Val;
				if (DecodeLitInteger(Imm, Val) == MCDisassembler::Success) {
				Inst.addOperand(MCOperand::createImm(Val));
				return MCDisassembler::Success;
				}
				} else if ((Imm >= 240) && (Imm <= 248)) {
				// immediate float
				uint32_t Val;
				if (DecodeLitFloat(Imm, Val) == MCDisassembler::Success) {
				Inst.addOperand(MCOperand::createImm(Val));
				return MCDisassembler::Success;
				}
				} else if (Imm == 254) {
				// LDS direct
				// ToDo: implement LDS direct read
				} else if (Imm == 255) {
				// literal constant
				} else if ((Imm == 125) ||
				((Imm >= 209) && (Imm <= 239)) ||
				(Imm == 249) ||
				(Imm == 250) ||
				(Imm >= 512)) {
				// reserved
				return MCDisassembler::Fail;
				} else {
				// register
				unsigned RegID;
				if (DecodeSrcRegister(Imm, RegID) == MCDisassembler::Success) {
				Inst.addOperand(MCOperand::createReg(RegID));
				return MCDisassembler::Success;
				}
				}
				return MCDisassembler::Fail;
				}

				static MCDisassembler *createAMDGPUDisassembler(const Target &T,
				const MCSubtargetInfo &STI,
				MCContext &Ctx) {
				return new AMDGPUDisassembler(STI, Ctx);
				}

				extern "C" void LLVMInitializeAMDGPUDisassembler() {
				TargetRegistry::RegisterMCDisassembler(TheGCNTarget, createAMDGPUDisassembler);
				}

llvm/trunk/lib/Target/AMDGPU/Disassembler/CMakeLists.txt

				include_directories( ${CMAKE_CURRENT_BINARY_DIR}/.. ${CMAKE_CURRENT_SOURCE_DIR}/.. )

				add_llvm_library(LLVMAMDGPUDisassembler
				AMDGPUDisassembler.cpp
				)

				add_dependencies(LLVMAMDGPUDisassembler AMDGPUCommonTableGen)

llvm/trunk/lib/Target/AMDGPU/Disassembler/LLVMBuild.txt

				;===- ./lib/Target/AMDGPU/Disassembler/LLVMBuild.txt ------------- Conf ---===;
				;
				; The LLVM Compiler Infrastructure
				;
				; This file is distributed under the University of Illinois Open Source
				; License. See LICENSE.TXT for details.
				;
				;===------------------------------------------------------------------------===;
				;
				; This is an LLVMBuild description file for the components in this subdirectory.
				;
				; For more information on the LLVMBuild system, please see:
				;
				; http://llvm.org/docs/LLVMBuild.html
				;
				;===------------------------------------------------------------------------===;

				[component_0]
				type = Library
				name = AMDGPUDisassembler
				parent = AMDGPU
				required_libraries = AMDGPUDesc AMDGPUInfo AMDGPUUtils MC MCDisassembler Support
				add_to_library_groups = AMDGPU

llvm/trunk/lib/Target/AMDGPU/LLVMBuild.txt

	;			;
	; For more information on the LLVMBuild system, please see:			; For more information on the LLVMBuild system, please see:
	;			;
	; http://llvm.org/docs/LLVMBuild.html			; http://llvm.org/docs/LLVMBuild.html
	;			;
	;===------------------------------------------------------------------------===;			;===------------------------------------------------------------------------===;

	[common]			[common]
	subdirectories = AsmParser InstPrinter MCTargetDesc TargetInfo Utils			subdirectories = AsmParser Disassembler InstPrinter MCTargetDesc TargetInfo Utils

	[component_0]			[component_0]
	type = TargetGroup			type = TargetGroup
	name = AMDGPU			name = AMDGPU
	parent = Target			parent = Target
	has_asmparser = 1			has_asmparser = 1
	has_asmprinter = 1			has_asmprinter = 1
				has_disassembler = 1

	[component_1]			[component_1]
	type = Library			type = Library
	name = AMDGPUCodeGen			name = AMDGPUCodeGen
	parent = AMDGPU			parent = AMDGPU
	required_libraries = Analysis AsmPrinter CodeGen Core IPO MC AMDGPUAsmParser AMDGPUAsmPrinter AMDGPUDesc AMDGPUInfo AMDGPUUtils Scalar SelectionDAG Support Target TransformUtils			required_libraries = Analysis AsmPrinter CodeGen Core IPO MC AMDGPUAsmParser AMDGPUAsmPrinter AMDGPUDesc AMDGPUInfo AMDGPUUtils Scalar SelectionDAG Support Target TransformUtils
	add_to_library_groups = AMDGPU			add_to_library_groups = AMDGPU

llvm/trunk/lib/Target/AMDGPU/SIInstrFormats.td

class InstSI <dag outs, dag ins, string asm, list<dag> pattern> :
let TSFlags{17} = DS;		let TSFlags{17} = DS;
let TSFlags{18} = MIMG;		let TSFlags{18} = MIMG;
let TSFlags{19} = FLAT;		let TSFlags{19} = FLAT;
let TSFlags{20} = WQM;		let TSFlags{20} = WQM;
let TSFlags{21} = VGPRSpill;		let TSFlags{21} = VGPRSpill;
let TSFlags{22} = VOPAsmPrefer32Bit;		let TSFlags{22} = VOPAsmPrefer32Bit;

let SchedRW = [Write32Bit];		let SchedRW = [Write32Bit];

		field bits<1> DisableSIDecoder = 0;
		field bits<1> DisableVIDecoder = 0;
		field bits<1> DisableDecoder = 0;

		let isAsmParserOnly = !if(!eq(DisableDecoder{0}, {0}), 0, 1);
}		}

class Enc32 {		class Enc32 {
field bits<32> Inst;		field bits<32> Inst;
int Size = 4;		int Size = 4;
}		}

class Enc64 {		class Enc64 {
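The `DisableDecoder` fields added to `InstSI` above are routed into `isAsmParserOnly`, which appears to be how this patch keeps an instruction out of the generated decoder. The following is only a rough C++ sketch of that effect, not the TableGen emitter itself: `InstrRecord`, `hasDecoderEntry`, and `decodableNames` are hypothetical names, and the rule that pseudo, codegen-only, and asm-parser-only records are skipped is stated here as an assumption about the decoder generator.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for the few TableGen fields involved here.
struct InstrRecord {
  std::string Name;
  bool IsPseudo;
  bool IsCodeGenOnly;
  bool DisableDecoder; // mirrors `field bits<1> DisableDecoder`
};

// Mirrors `let isAsmParserOnly = !if(!eq(DisableDecoder{0}, {0}), 0, 1);`
inline bool isAsmParserOnly(const InstrRecord &I) { return I.DisableDecoder; }

// Assumption: the decoder generator skips pseudo, codegen-only, and
// asm-parser-only records; everything else gets a decoder entry.
inline bool hasDecoderEntry(const InstrRecord &I) {
  return !I.IsPseudo && !I.IsCodeGenOnly && !isAsmParserOnly(I);
}

// Returns the names of the records that would still receive a decoder entry.
inline std::vector<std::string>
decodableNames(const std::vector<InstrRecord> &Instrs) {
  std::vector<std::string> Names;
  for (const InstrRecord &I : Instrs) {
    assert(!I.Name.empty() && "every record is expected to have a name");
    if (hasDecoderEntry(I))
      Names.push_back(I.Name);
  }
  return Names;
}
```

Under these assumptions, a pseudo record and a real record with `DisableDecoder` set would both be filtered out, leaving only the real record whose flag stays at its default of 0.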

llvm/trunk/lib/Target/AMDGPU/SIInstrInfo.td

}		}

multiclass EXP_m {		multiclass EXP_m {

let isPseudo = 1, isCodeGenOnly = 1 in {		let isPseudo = 1, isCodeGenOnly = 1 in {
def "" : EXPCommon, SIMCInstr <"exp", SISubtarget.NONE> ;		def "" : EXPCommon, SIMCInstr <"exp", SISubtarget.NONE> ;
}		}

def _si : EXPCommon, SIMCInstr <"exp", SISubtarget.SI>, EXPe;		def _si : EXPCommon, SIMCInstr <"exp", SISubtarget.SI>, EXPe {
		let DecoderNamespace="SICI";
		let DisableDecoder = DisableSIDecoder;
		}

def _vi : EXPCommon, SIMCInstr <"exp", SISubtarget.VI>, EXPe_vi;		def _vi : EXPCommon, SIMCInstr <"exp", SISubtarget.VI>, EXPe_vi {
		let DecoderNamespace="VI";
		let DisableDecoder = DisableVIDecoder;
		}
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Scalar classes		// Scalar classes
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

class SOP1_Pseudo <string opName, dag outs, dag ins, list<dag> pattern> :		class SOP1_Pseudo <string opName, dag outs, dag ins, list<dag> pattern> :
SOP1 <outs, ins, "", pattern>,		SOP1 <outs, ins, "", pattern>,
SIMCInstr<opName, SISubtarget.NONE> {		SIMCInstr<opName, SISubtarget.NONE> {
let isPseudo = 1;		let isPseudo = 1;
let isCodeGenOnly = 1;		let isCodeGenOnly = 1;
}		}

class SOP1_Real_si <sop1 op, string opName, dag outs, dag ins, string asm> :		class SOP1_Real_si <sop1 op, string opName, dag outs, dag ins, string asm> :
SOP1 <outs, ins, asm, []>,		SOP1 <outs, ins, asm, []>,
SOP1e <op.SI>,		SOP1e <op.SI>,
SIMCInstr<opName, SISubtarget.SI> {		SIMCInstr<opName, SISubtarget.SI> {
let isCodeGenOnly = 0;		let isCodeGenOnly = 0;
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class SOP1_Real_vi <sop1 op, string opName, dag outs, dag ins, string asm> :		class SOP1_Real_vi <sop1 op, string opName, dag outs, dag ins, string asm> :
SOP1 <outs, ins, asm, []>,		SOP1 <outs, ins, asm, []>,
SOP1e <op.VI>,		SOP1e <op.VI>,
SIMCInstr<opName, SISubtarget.VI> {		SIMCInstr<opName, SISubtarget.VI> {
let isCodeGenOnly = 0;		let isCodeGenOnly = 0;
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}

multiclass SOP1_m <sop1 op, string opName, dag outs, dag ins, string asm,		multiclass SOP1_m <sop1 op, string opName, dag outs, dag ins, string asm,
list<dag> pattern> {		list<dag> pattern> {

def "" : SOP1_Pseudo <opName, outs, ins, pattern>;		def "" : SOP1_Pseudo <opName, outs, ins, pattern>;

def _si : SOP1_Real_si <op, opName, outs, ins, asm>;		def _si : SOP1_Real_si <op, opName, outs, ins, asm>;
class SOP2_Pseudo<string opName, dag outs, dag ins, list<dag> pattern> :
field bits<7> sdst = 0;		field bits<7> sdst = 0;
}		}

class SOP2_Real_si<sop2 op, string opName, dag outs, dag ins, string asm> :		class SOP2_Real_si<sop2 op, string opName, dag outs, dag ins, string asm> :
SOP2<outs, ins, asm, []>,		SOP2<outs, ins, asm, []>,
SOP2e<op.SI>,		SOP2e<op.SI>,
SIMCInstr<opName, SISubtarget.SI> {		SIMCInstr<opName, SISubtarget.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class SOP2_Real_vi<sop2 op, string opName, dag outs, dag ins, string asm> :		class SOP2_Real_vi<sop2 op, string opName, dag outs, dag ins, string asm> :
SOP2<outs, ins, asm, []>,		SOP2<outs, ins, asm, []>,
SOP2e<op.VI>,		SOP2e<op.VI>,
SIMCInstr<opName, SISubtarget.VI> {		SIMCInstr<opName, SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}

multiclass SOP2_m <sop2 op, string opName, dag outs, dag ins, string asm,		multiclass SOP2_m <sop2 op, string opName, dag outs, dag ins, string asm,
list<dag> pattern> {		list<dag> pattern> {

def "" : SOP2_Pseudo <opName, outs, ins, pattern>;		def "" : SOP2_Pseudo <opName, outs, ins, pattern>;

def _si : SOP2_Real_si <op, opName, outs, ins, asm>;		def _si : SOP2_Real_si <op, opName, outs, ins, asm>;
class SOPK_Pseudo <string opName, dag outs, dag ins, list<dag> pattern> :
let isCodeGenOnly = 1;		let isCodeGenOnly = 1;
}		}

class SOPK_Real_si <sopk op, string opName, dag outs, dag ins, string asm> :		class SOPK_Real_si <sopk op, string opName, dag outs, dag ins, string asm> :
SOPK <outs, ins, asm, []>,		SOPK <outs, ins, asm, []>,
SOPKe <op.SI>,		SOPKe <op.SI>,
SIMCInstr<opName, SISubtarget.SI> {		SIMCInstr<opName, SISubtarget.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
let isCodeGenOnly = 0;		let isCodeGenOnly = 0;
}		}

class SOPK_Real_vi <sopk op, string opName, dag outs, dag ins, string asm> :		class SOPK_Real_vi <sopk op, string opName, dag outs, dag ins, string asm> :
SOPK <outs, ins, asm, []>,		SOPK <outs, ins, asm, []>,
SOPKe <op.VI>,		SOPKe <op.VI>,
SIMCInstr<opName, SISubtarget.VI> {		SIMCInstr<opName, SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
let isCodeGenOnly = 0;		let isCodeGenOnly = 0;
}		}

multiclass SOPK_m <sopk op, string opName, dag outs, dag ins, string opAsm,		multiclass SOPK_m <sopk op, string opName, dag outs, dag ins, string opAsm,
string asm = opName#opAsm> {		string asm = opName#opAsm> {
def "" : SOPK_Pseudo <opName, outs, ins, []>;		def "" : SOPK_Pseudo <opName, outs, ins, []>;

def _si : SOPK_Real_si <op, opName, outs, ins, asm>;		def _si : SOPK_Real_si <op, opName, outs, ins, asm>;
multiclass SOPK_IMM32 <sopk op, string opName, dag outs, dag ins,
string argAsm, string asm = opName#argAsm> {		string argAsm, string asm = opName#argAsm> {

def "" : SOPK_Pseudo <opName, outs, ins, []>;		def "" : SOPK_Pseudo <opName, outs, ins, []>;

def _si : SOPK <outs, ins, asm, []>,		def _si : SOPK <outs, ins, asm, []>,
SOPK64e <op.SI>,		SOPK64e <op.SI>,
SIMCInstr<opName, SISubtarget.SI> {		SIMCInstr<opName, SISubtarget.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
let isCodeGenOnly = 0;		let isCodeGenOnly = 0;
}		}

def _vi : SOPK <outs, ins, asm, []>,		def _vi : SOPK <outs, ins, asm, []>,
SOPK64e <op.VI>,		SOPK64e <op.VI>,
SIMCInstr<opName, SISubtarget.VI> {		SIMCInstr<opName, SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
let isCodeGenOnly = 0;		let isCodeGenOnly = 0;
}		}
}		}
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// SMRD classes		// SMRD classes
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

class SMRD_Pseudo <string opName, dag outs, dag ins, list<dag> pattern> :		class SMRD_Pseudo <string opName, dag outs, dag ins, list<dag> pattern> :
SMRD <outs, ins, "", pattern>,		SMRD <outs, ins, "", pattern>,
SIMCInstr<opName, SISubtarget.NONE> {		SIMCInstr<opName, SISubtarget.NONE> {
let isPseudo = 1;		let isPseudo = 1;
let isCodeGenOnly = 1;		let isCodeGenOnly = 1;
}		}

class SMRD_Real_si <bits<5> op, string opName, bit imm, dag outs, dag ins,		class SMRD_Real_si <bits<5> op, string opName, bit imm, dag outs, dag ins,
string asm> :		string asm> :
SMRD <outs, ins, asm, []>,		SMRD <outs, ins, asm, []>,
SMRDe <op, imm>,		SMRDe <op, imm>,
SIMCInstr<opName, SISubtarget.SI> {		SIMCInstr<opName, SISubtarget.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class SMRD_Real_vi <bits<8> op, string opName, bit imm, dag outs, dag ins,		class SMRD_Real_vi <bits<8> op, string opName, bit imm, dag outs, dag ins,
string asm, list<dag> pattern = []> :		string asm, list<dag> pattern = []> :
SMRD <outs, ins, asm, pattern>,		SMRD <outs, ins, asm, pattern>,
SMEMe_vi <op, imm>,		SMEMe_vi <op, imm>,
SIMCInstr<opName, SISubtarget.VI> {		SIMCInstr<opName, SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}

multiclass SMRD_m <smrd op, string opName, bit imm, dag outs, dag ins,		multiclass SMRD_m <smrd op, string opName, bit imm, dag outs, dag ins,
string asm, list<dag> pattern> {		string asm, list<dag> pattern> {

def "" : SMRD_Pseudo <opName, outs, ins, pattern>;		def "" : SMRD_Pseudo <opName, outs, ins, pattern>;

def _si : SMRD_Real_si <op.SI, opName, imm, outs, ins, asm>;		def _si : SMRD_Real_si <op.SI, opName, imm, outs, ins, asm>;
defm _IMM : SMRD_m <
(ins baseClass:$sbase, smrd_offset:$offset),		(ins baseClass:$sbase, smrd_offset:$offset),
opName#" $dst, $sbase, $offset", []		opName#" $dst, $sbase, $offset", []
>;		>;

def _IMM_ci : SMRD <		def _IMM_ci : SMRD <
(outs dstClass:$dst), (ins baseClass:$sbase, smrd_literal_offset:$offset),		(outs dstClass:$dst), (ins baseClass:$sbase, smrd_literal_offset:$offset),
opName#" $dst, $sbase, $offset", []>, SMRD_IMMe_ci <op.SI> {		opName#" $dst, $sbase, $offset", []>, SMRD_IMMe_ci <op.SI> {
let AssemblerPredicates = [isCIOnly];		let AssemblerPredicates = [isCIOnly];
		let DecoderNamespace = "CI";
}		}

defm _SGPR : SMRD_m <		defm _SGPR : SMRD_m <
op, opName#"_SGPR", 0, (outs dstClass:$dst),		op, opName#"_SGPR", 0, (outs dstClass:$dst),
(ins baseClass:$sbase, SReg_32:$soff),		(ins baseClass:$sbase, SReg_32:$soff),
opName#" $dst, $sbase, $soff", []		opName#" $dst, $sbase, $soff", []
>;		>;
}		}
}		}

// Returns the input arguments for VOP3 instructions for the given SrcVT.		// Returns the input arguments for VOP3 instructions for the given SrcVT.
class getIns64 <RegisterOperand Src0RC, RegisterOperand Src1RC,		class getIns64 <RegisterOperand Src0RC, RegisterOperand Src1RC,
RegisterOperand Src2RC, int NumSrcArgs,		RegisterOperand Src2RC, int NumSrcArgs,
bit HasModifiers> {		bit HasModifiers> {

dag ret =		dag ret =
		!if (!eq(NumSrcArgs, 0),
		// VOP1 without input operands (V_NOP, V_CLREXCP)
		(ins),
		/* else */
!if (!eq(NumSrcArgs, 1),		!if (!eq(NumSrcArgs, 1),
!if (!eq(HasModifiers, 1),		!if (!eq(HasModifiers, 1),
// VOP1 with modifiers		// VOP1 with modifiers
(ins InputModsNoDefault:$src0_modifiers, Src0RC:$src0,		(ins InputModsNoDefault:$src0_modifiers, Src0RC:$src0,
ClampMod:$clamp, omod:$omod)		ClampMod:$clamp, omod:$omod)
/* else */,		/* else */,
// VOP1 without modifiers		// VOP1 without modifiers
(ins Src0RC:$src0)		(ins Src0RC:$src0)
/* NumSrcArgs == 3 */,
// VOP3 with modifiers		// VOP3 with modifiers
(ins InputModsNoDefault:$src0_modifiers, Src0RC:$src0,		(ins InputModsNoDefault:$src0_modifiers, Src0RC:$src0,
InputModsNoDefault:$src1_modifiers, Src1RC:$src1,		InputModsNoDefault:$src1_modifiers, Src1RC:$src1,
InputModsNoDefault:$src2_modifiers, Src2RC:$src2,		InputModsNoDefault:$src2_modifiers, Src2RC:$src2,
ClampMod:$clamp, omod:$omod)		ClampMod:$clamp, omod:$omod)
/* else */,		/* else */,
// VOP3 without modifiers		// VOP3 without modifiers
(ins Src0RC:$src0, Src1RC:$src1, Src2RC:$src2)		(ins Src0RC:$src0, Src1RC:$src1, Src2RC:$src2)
/* endif */ )));		/* endif */ ))));
}		}

class getInsDPP <RegisterClass Src0RC, RegisterClass Src1RC, int NumSrcArgs,		class getInsDPP <RegisterClass Src0RC, RegisterClass Src1RC, int NumSrcArgs,
bit HasModifiers> {		bit HasModifiers> {

dag ret = !if (!eq(NumSrcArgs, 1),		dag ret = !if (!eq(NumSrcArgs, 1),
!if (!eq(HasModifiers, 1),		!if (!eq(HasModifiers, 1),
// VOP1_DPP with modifiers		// VOP1_DPP with modifiers
class VOP1_Pseudo <dag outs, dag ins, list<dag> pattern, string opName> :
field bits<8> vdst;		field bits<8> vdst;
field bits<9> src0;		field bits<9> src0;
}		}

class VOP1_Real_si <string opName, vop1 op, dag outs, dag ins, string asm> :		class VOP1_Real_si <string opName, vop1 op, dag outs, dag ins, string asm> :
VOP1<op.SI, outs, ins, asm, []>,		VOP1<op.SI, outs, ins, asm, []>,
SIMCInstr <opName#"_e32", SISubtarget.SI> {		SIMCInstr <opName#"_e32", SISubtarget.SI> {
let AssemblerPredicate = SIAssemblerPredicate;		let AssemblerPredicate = SIAssemblerPredicate;
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class VOP1_Real_vi <string opName, vop1 op, dag outs, dag ins, string asm> :		class VOP1_Real_vi <string opName, vop1 op, dag outs, dag ins, string asm> :
VOP1<op.VI, outs, ins, asm, []>,		VOP1<op.VI, outs, ins, asm, []>,
SIMCInstr <opName#"_e32", SISubtarget.VI> {		SIMCInstr <opName#"_e32", SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}

multiclass VOP1_m <vop1 op, string opName, VOPProfile p, list<dag> pattern,		multiclass VOP1_m <vop1 op, string opName, VOPProfile p, list<dag> pattern,
string asm = opName#p.Asm32> {		string asm = opName#p.Asm32> {
def "" : VOP1_Pseudo <p.Outs, p.Ins32, pattern, opName>;		def "" : VOP1_Pseudo <p.Outs, p.Ins32, pattern, opName>;

def _si : VOP1_Real_si <opName, op, p.Outs, p.Ins32, asm>;		def _si : VOP1_Real_si <opName, op, p.Outs, p.Ins32, asm>;

class VOP2_Pseudo <dag outs, dag ins, list<dag> pattern, string opName> :
let isPseudo = 1;		let isPseudo = 1;
let isCodeGenOnly = 1;		let isCodeGenOnly = 1;
}		}

class VOP2_Real_si <string opName, vop2 op, dag outs, dag ins, string asm> :		class VOP2_Real_si <string opName, vop2 op, dag outs, dag ins, string asm> :
VOP2 <op.SI, outs, ins, opName#asm, []>,		VOP2 <op.SI, outs, ins, opName#asm, []>,
SIMCInstr <opName#"_e32", SISubtarget.SI> {		SIMCInstr <opName#"_e32", SISubtarget.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class VOP2_Real_vi <string opName, vop2 op, dag outs, dag ins, string asm> :		class VOP2_Real_vi <string opName, vop2 op, dag outs, dag ins, string asm> :
VOP2 <op.VI, outs, ins, opName#asm, []>,		VOP2 <op.VI, outs, ins, opName#asm, []>,
SIMCInstr <opName#"_e32", SISubtarget.VI> {		SIMCInstr <opName#"_e32", SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}

multiclass VOP2SI_m <vop2 op, string opName, VOPProfile p, list<dag> pattern,		multiclass VOP2SI_m <vop2 op, string opName, VOPProfile p, list<dag> pattern,
string revOp> {		string revOp> {

def "" : VOP2_Pseudo <p.Outs32, p.Ins32, pattern, opName>,		def "" : VOP2_Pseudo <p.Outs32, p.Ins32, pattern, opName>,
VOP2_REV<revOp#"_e32", !eq(revOp, opName)>;		VOP2_REV<revOp#"_e32", !eq(revOp, opName)>;

}		}

class VOP3_Real_si <bits<9> op, dag outs, dag ins, string asm, string opName,		class VOP3_Real_si <bits<9> op, dag outs, dag ins, string asm, string opName,
bit HasMods = 0, bit VOP3Only = 0> :		bit HasMods = 0, bit VOP3Only = 0> :
VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,		VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,
VOP3e <op>,		VOP3e <op>,
SIMCInstr<opName#"_e64", SISubtarget.SI> {		SIMCInstr<opName#"_e64", SISubtarget.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class VOP3_Real_vi <bits<10> op, dag outs, dag ins, string asm, string opName,		class VOP3_Real_vi <bits<10> op, dag outs, dag ins, string asm, string opName,
bit HasMods = 0, bit VOP3Only = 0> :		bit HasMods = 0, bit VOP3Only = 0> :
VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,		VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,
VOP3e_vi <op>,		VOP3e_vi <op>,
SIMCInstr <opName#"_e64", SISubtarget.VI> {		SIMCInstr <opName#"_e64", SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}

class VOP3_C_Real_si <bits<9> op, dag outs, dag ins, string asm, string opName,		class VOP3_C_Real_si <bits<9> op, dag outs, dag ins, string asm, string opName,
bit HasMods = 0, bit VOP3Only = 0> :		bit HasMods = 0, bit VOP3Only = 0> :
VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,		VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,
VOP3ce <op>,		VOP3ce <op>,
SIMCInstr<opName#"_e64", SISubtarget.SI> {		SIMCInstr<opName#"_e64", SISubtarget.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class VOP3_C_Real_vi <bits<10> op, dag outs, dag ins, string asm, string opName,		class VOP3_C_Real_vi <bits<10> op, dag outs, dag ins, string asm, string opName,
bit HasMods = 0, bit VOP3Only = 0> :		bit HasMods = 0, bit VOP3Only = 0> :
VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,		VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,
VOP3ce_vi <op>,		VOP3ce_vi <op>,
SIMCInstr <opName#"_e64", SISubtarget.VI> {		SIMCInstr <opName#"_e64", SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}

class VOP3b_Real_si <bits<9> op, dag outs, dag ins, string asm, string opName,		class VOP3b_Real_si <bits<9> op, dag outs, dag ins, string asm, string opName,
bit HasMods = 0, bit VOP3Only = 0> :		bit HasMods = 0, bit VOP3Only = 0> :
VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,		VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,
VOP3be <op>,		VOP3be <op>,
SIMCInstr<opName#"_e64", SISubtarget.SI> {		SIMCInstr<opName#"_e64", SISubtarget.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class VOP3b_Real_vi <bits<10> op, dag outs, dag ins, string asm, string opName,		class VOP3b_Real_vi <bits<10> op, dag outs, dag ins, string asm, string opName,
bit HasMods = 0, bit VOP3Only = 0> :		bit HasMods = 0, bit VOP3Only = 0> :
VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,		VOP3Common <outs, ins, asm, [], HasMods, VOP3Only>,
VOP3be_vi <op>,		VOP3be_vi <op>,
SIMCInstr <opName#"_e64", SISubtarget.VI> {		SIMCInstr <opName#"_e64", SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}

multiclass VOP3_m <vop op, dag outs, dag ins, string asm, list<dag> pattern,		multiclass VOP3_m <vop op, dag outs, dag ins, string asm, list<dag> pattern,
string opName, int NumSrcArgs, bit HasMods = 1, bit VOP3Only = 0> {		string opName, int NumSrcArgs, bit HasMods = 1, bit VOP3Only = 0> {

def "" : VOP3_Pseudo <outs, ins, pattern, opName>;		def "" : VOP3_Pseudo <outs, ins, pattern, opName>;

def _si : VOP3_Real_si <op.SI3, outs, ins, asm, opName, HasMods, VOP3Only>,		def _si : VOP3_Real_si <op.SI3, outs, ins, asm, opName, HasMods, VOP3Only>,
multiclass VOP2SI_3VI_m <vop3 op, string opName, dag outs, dag ins,
let isPseudo = 1, isCodeGenOnly = 1 in {		let isPseudo = 1, isCodeGenOnly = 1 in {
def "" : VOPAnyCommon <outs, ins, "", pattern>,		def "" : VOPAnyCommon <outs, ins, "", pattern>,
SIMCInstr<opName, SISubtarget.NONE>;		SIMCInstr<opName, SISubtarget.NONE>;
}		}

def _si : VOP2 <op.SI3{5-0}, outs, ins, asm, []>,		def _si : VOP2 <op.SI3{5-0}, outs, ins, asm, []>,
SIMCInstr <opName, SISubtarget.SI> {		SIMCInstr <opName, SISubtarget.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

def _vi : VOP3Common <outs, ins, asm, []>,		def _vi : VOP3Common <outs, ins, asm, []>,
VOP3e_vi <op.VI3>,		VOP3e_vi <op.VI3>,
VOP3DisableFields <1, 0, 0>,		VOP3DisableFields <1, 0, 0>,
SIMCInstr <opName, SISubtarget.VI> {		SIMCInstr <opName, SISubtarget.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}
}		}

multiclass VOP1_Helper <vop1 op, string opName, VOPProfile p, list<dag> pat32,		multiclass VOP1_Helper <vop1 op, string opName, VOPProfile p, list<dag> pat32,
list<dag> pat64> {		list<dag> pat64> {

defm _e32 : VOP1_m <op, opName, p, pat32>;		defm _e32 : VOP1_m <op, opName, p, pat32>;

multiclass VOP2MADK <vop2 op, string opName, list<dag> pattern = []> {
def "" : VOP2_Pseudo <VOP_MADK.Outs, VOP_MADK.Ins, pattern, opName>;		def "" : VOP2_Pseudo <VOP_MADK.Outs, VOP_MADK.Ins, pattern, opName>;

let isCodeGenOnly = 0 in {		let isCodeGenOnly = 0 in {
def _si : VOP2Common <VOP_MADK.Outs, VOP_MADK.Ins,		def _si : VOP2Common <VOP_MADK.Outs, VOP_MADK.Ins,
!strconcat(opName, VOP_MADK.Asm), []>,		!strconcat(opName, VOP_MADK.Asm), []>,
SIMCInstr <opName#"_e32", SISubtarget.SI>,		SIMCInstr <opName#"_e32", SISubtarget.SI>,
VOP2_MADKe <op.SI> {		VOP2_MADKe <op.SI> {
let AssemblerPredicates = [isSICI];		let AssemblerPredicates = [isSICI];
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

def _vi : VOP2Common <VOP_MADK.Outs, VOP_MADK.Ins,		def _vi : VOP2Common <VOP_MADK.Outs, VOP_MADK.Ins,
!strconcat(opName, VOP_MADK.Asm), []>,		!strconcat(opName, VOP_MADK.Asm), []>,
SIMCInstr <opName#"_e32", SISubtarget.VI>,		SIMCInstr <opName#"_e32", SISubtarget.VI>,
VOP2_MADKe <op.VI> {		VOP2_MADKe <op.VI> {
let AssemblerPredicates = [isVI];		let AssemblerPredicates = [isVI];
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}
} // End isCodeGenOnly = 0		} // End isCodeGenOnly = 0
}		}

class VOPC_Pseudo <dag ins, list<dag> pattern, string opName> :		class VOPC_Pseudo <dag ins, list<dag> pattern, string opName> :
VOPCCommon <ins, "", pattern>,		VOPCCommon <ins, "", pattern>,
VOP <opName>,		VOP <opName>,
SIMCInstr<opName#"_e32", SISubtarget.NONE> {		SIMCInstr<opName#"_e32", SISubtarget.NONE> {
multiclass VOPC_m <vopc op, dag ins, string op_asm, list<dag> pattern,
}		}

let AssemblerPredicates = [isSICI] in {		let AssemblerPredicates = [isSICI] in {
def _si : VOPC<op.SI, ins, asm, []>,		def _si : VOPC<op.SI, ins, asm, []>,
SIMCInstr <opName#"_e32", SISubtarget.SI> {		SIMCInstr <opName#"_e32", SISubtarget.SI> {
let Defs = !if(DefExec, [VCC, EXEC], [VCC]);		let Defs = !if(DefExec, [VCC, EXEC], [VCC]);
let hasSideEffects = DefExec;		let hasSideEffects = DefExec;
let SchedRW = sched;		let SchedRW = sched;
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

} // End AssemblerPredicates = [isSICI]		} // End AssemblerPredicates = [isSICI]

let AssemblerPredicates = [isVI] in {		let AssemblerPredicates = [isVI] in {
def _vi : VOPC<op.VI, ins, asm, []>,		def _vi : VOPC<op.VI, ins, asm, []>,
SIMCInstr <opName#"_e32", SISubtarget.VI> {		SIMCInstr <opName#"_e32", SISubtarget.VI> {
let Defs = !if(DefExec, [VCC, EXEC], [VCC]);		let Defs = !if(DefExec, [VCC, EXEC], [VCC]);
let hasSideEffects = DefExec;		let hasSideEffects = DefExec;
let SchedRW = sched;		let SchedRW = sched;
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
}		}

} // End AssemblerPredicates = [isVI]		} // End AssemblerPredicates = [isVI]

defm : SIInstAliasBuilder<alias_asm, p>;		defm : SIInstAliasBuilder<alias_asm, p>;
}		}

multiclass VOPC_Helper <vopc op, string opName, list<dag> pat32,		multiclass VOPC_Helper <vopc op, string opName, list<dag> pat32,
class VINTRP_Pseudo <string opName, dag outs, dag ins, list<dag> pattern> :
let isPseudo = 1;		let isPseudo = 1;
let isCodeGenOnly = 1;		let isCodeGenOnly = 1;
}		}

class VINTRP_Real_si <bits <2> op, string opName, dag outs, dag ins,		class VINTRP_Real_si <bits <2> op, string opName, dag outs, dag ins,
string asm> :		string asm> :
VINTRPCommon <outs, ins, asm, []>,		VINTRPCommon <outs, ins, asm, []>,
VINTRPe <op>,		VINTRPe <op>,
SIMCInstr<opName, SISubtarget.SI>;		SIMCInstr<opName, SISubtarget.SI> {
		let DecoderNamespace = "SICI";
		let DisableDecoder = DisableSIDecoder;
		}

class VINTRP_Real_vi <bits <2> op, string opName, dag outs, dag ins,		class VINTRP_Real_vi <bits <2> op, string opName, dag outs, dag ins,
string asm> :		string asm> :
VINTRPCommon <outs, ins, asm, []>,		VINTRPCommon <outs, ins, asm, []>,
VINTRPe_vi <op>,		VINTRPe_vi <op>,
SIMCInstr<opName, SISubtarget.VI>;		SIMCInstr<opName, SISubtarget.VI> {
		let DecoderNamespace = "VI";
		let DisableDecoder = DisableVIDecoder;
		}

multiclass VINTRP_m <bits <2> op, dag outs, dag ins, string asm,		multiclass VINTRP_m <bits <2> op, dag outs, dag ins, string asm,
list<dag> pattern = []> {		list<dag> pattern = []> {
def "" : VINTRP_Pseudo <NAME, outs, ins, pattern>;		def "" : VINTRP_Pseudo <NAME, outs, ins, pattern>;

def _si : VINTRP_Real_si <op, NAME, outs, ins, asm>;		def _si : VINTRP_Real_si <op, NAME, outs, ins, asm>;

def _vi : VINTRP_Real_vi <op, NAME, outs, ins, asm>;		def _vi : VINTRP_Real_vi <op, NAME, outs, ins, asm>;
class DS_Pseudo <string opName, dag outs, dag ins, list<dag> pattern> :
let isCodeGenOnly = 1;		let isCodeGenOnly = 1;
}		}

class DS_Real_si <bits<8> op, string opName, dag outs, dag ins, string asm> :		class DS_Real_si <bits<8> op, string opName, dag outs, dag ins, string asm> :
DS <outs, ins, asm, []>,		DS <outs, ins, asm, []>,
DSe <op>,		DSe <op>,
SIMCInstr <opName, SISubtarget.SI> {		SIMCInstr <opName, SISubtarget.SI> {
let isCodeGenOnly = 0;		let isCodeGenOnly = 0;
		let DecoderNamespace="SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class DS_Real_vi <bits<8> op, string opName, dag outs, dag ins, string asm> :		class DS_Real_vi <bits<8> op, string opName, dag outs, dag ins, string asm> :
DS <outs, ins, asm, []>,		DS <outs, ins, asm, []>,
DSe_vi <op>,		DSe_vi <op>,
SIMCInstr <opName, SISubtarget.VI>;		SIMCInstr <opName, SISubtarget.VI> {
		let DecoderNamespace="VI";
		let DisableDecoder = DisableVIDecoder;
		}

class DS_Off16_Real_si <bits<8> op, string opName, dag outs, dag ins, string asm> :		class DS_Off16_Real_si <bits<8> op, string opName, dag outs, dag ins, string asm> :
DS_Real_si <op,opName, outs, ins, asm> {		DS_Real_si <op,opName, outs, ins, asm> {

// Single load interpret the 2 i8imm operands as a single i16 offset.		// Single load interpret the 2 i8imm operands as a single i16 offset.
bits<16> offset;		bits<16> offset;
let offset0 = offset{7-0};		let offset0 = offset{7-0};
let offset1 = offset{15-8};		let offset1 = offset{15-8};
class MTBUF_Pseudo <string opName, dag outs, dag ins, list<dag> pattern> :
let isPseudo = 1;		let isPseudo = 1;
let isCodeGenOnly = 1;		let isCodeGenOnly = 1;
}		}

class MTBUF_Real_si <bits<3> op, string opName, dag outs, dag ins,		class MTBUF_Real_si <bits<3> op, string opName, dag outs, dag ins,
string asm> :		string asm> :
MTBUF <outs, ins, asm, []>,		MTBUF <outs, ins, asm, []>,
MTBUFe <op>,		MTBUFe <op>,
SIMCInstr<opName, SISubtarget.SI>;		SIMCInstr<opName, SISubtarget.SI> {
		let DecoderNamespace="SICI";
		let DisableDecoder = DisableSIDecoder;
		}

class MTBUF_Real_vi <bits<4> op, string opName, dag outs, dag ins, string asm> :		class MTBUF_Real_vi <bits<4> op, string opName, dag outs, dag ins, string asm> :
MTBUF <outs, ins, asm, []>,		MTBUF <outs, ins, asm, []>,
MTBUFe_vi <op>,		MTBUFe_vi <op>,
SIMCInstr <opName, SISubtarget.VI>;		SIMCInstr <opName, SISubtarget.VI> {
		let DecoderNamespace="VI";
		let DisableDecoder = DisableVIDecoder;
		}

multiclass MTBUF_m <bits<3> op, string opName, dag outs, dag ins, string asm,		multiclass MTBUF_m <bits<3> op, string opName, dag outs, dag ins, string asm,
list<dag> pattern> {		list<dag> pattern> {

def "" : MTBUF_Pseudo <opName, outs, ins, pattern>;		def "" : MTBUF_Pseudo <opName, outs, ins, pattern>;

def _si : MTBUF_Real_si <op, opName, outs, ins, asm>;		def _si : MTBUF_Real_si <op, opName, outs, ins, asm>;

}		}

class MUBUF_Real_si <mubuf op, string opName, dag outs, dag ins,		class MUBUF_Real_si <mubuf op, string opName, dag outs, dag ins,
string asm> :		string asm> :
MUBUF <outs, ins, asm, []>,		MUBUF <outs, ins, asm, []>,
MUBUFe <op.SI>,		MUBUFe <op.SI>,
SIMCInstr<opName, SISubtarget.SI> {		SIMCInstr<opName, SISubtarget.SI> {
let lds = 0;		let lds = 0;
		let DecoderNamespace="SICI";
		let DisableDecoder = DisableSIDecoder;
}		}

class MUBUF_Real_vi <mubuf op, string opName, dag outs, dag ins,		class MUBUF_Real_vi <mubuf op, string opName, dag outs, dag ins,
string asm> :		string asm> :
MUBUF <outs, ins, asm, []>,		MUBUF <outs, ins, asm, []>,
MUBUFe_vi <op.VI>,		MUBUFe_vi <op.VI>,
SIMCInstr<opName, SISubtarget.VI> {		SIMCInstr<opName, SISubtarget.VI> {
let lds = 0;		let lds = 0;
		let DecoderNamespace="VI";
		let DisableDecoder = DisableVIDecoder;
}		}

multiclass MUBUF_m <mubuf op, string opName, dag outs, dag ins, string asm,		multiclass MUBUF_m <mubuf op, string opName, dag outs, dag ins, string asm,
list<dag> pattern> {		list<dag> pattern> {

def "" : MUBUF_Pseudo <opName, outs, ins, pattern>,		def "" : MUBUF_Pseudo <opName, outs, ins, pattern>,
MUBUFAddr64Table <0>;		MUBUFAddr64Table <0>;

class FLAT_Pseudo <string opName, dag outs, dag ins, list<dag> pattern> :
let isPseudo = 1;		let isPseudo = 1;
let isCodeGenOnly = 1;		let isCodeGenOnly = 1;
}		}

class FLAT_Real_ci <bits<7> op, string opName, dag outs, dag ins, string asm> :		class FLAT_Real_ci <bits<7> op, string opName, dag outs, dag ins, string asm> :
FLAT <op, outs, ins, asm, []>,		FLAT <op, outs, ins, asm, []>,
SIMCInstr<opName, SISubtarget.SI> {		SIMCInstr<opName, SISubtarget.SI> {
let AssemblerPredicate = isCIOnly;		let AssemblerPredicate = isCIOnly;
		let DecoderNamespace="CI";
}		}

class FLAT_Real_vi <bits<7> op, string opName, dag outs, dag ins, string asm> :		class FLAT_Real_vi <bits<7> op, string opName, dag outs, dag ins, string asm> :
FLAT <op, outs, ins, asm, []>,		FLAT <op, outs, ins, asm, []>,
SIMCInstr<opName, SISubtarget.VI> {		SIMCInstr<opName, SISubtarget.VI> {
let AssemblerPredicate = VIAssemblerPredicate;		let AssemblerPredicate = VIAssemblerPredicate;
		let DecoderNamespace="VI";
		let DisableDecoder = DisableVIDecoder;
}		}

multiclass FLAT_AtomicRet_m <flat op, dag outs, dag ins, string asm,		multiclass FLAT_AtomicRet_m <flat op, dag outs, dag ins, string asm,
list<dag> pattern> {		list<dag> pattern> {
def "" : FLAT_Pseudo <NAME#"_RTN", outs, ins, pattern>,		def "" : FLAT_Pseudo <NAME#"_RTN", outs, ins, pattern>,
AtomicNoRet <NAME, 1>;		AtomicNoRet <NAME, 1>;

def _ci : FLAT_Real_ci <op.CI, NAME#"_RTN", outs, ins, asm>;		def _ci : FLAT_Real_ci <op.CI, NAME#"_RTN", outs, ins, asm>;
[64 lines not shown]	multiclass FLAT_ATOMIC <flat op, string asm_name, RegisterClass vdst_rc,
}		}
}		}

class MIMG_Mask <string op, int channels> {		class MIMG_Mask <string op, int channels> {
string Op = op;		string Op = op;
int Channels = channels;		int Channels = channels;
}		}

		class MIMG_Helper <bits<7> op, dag outs, dag ins, string asm,
		string dns=""> : MIMG<op, outs, ins, asm,[]> {
		let mayLoad = 1;
		let mayStore = 0;
		let hasPostISelHook = 1;
		let DecoderNamespace = dns;
		let isAsmParserOnly = !if(!eq(dns,""), 1, 0);
		}

class MIMG_NoSampler_Helper <bits<7> op, string asm,		class MIMG_NoSampler_Helper <bits<7> op, string asm,
RegisterClass dst_rc,		RegisterClass dst_rc,
RegisterClass src_rc> : MIMG <		RegisterClass src_rc,
		string dns=""> : MIMG_Helper <
op,		op,
(outs dst_rc:$vdata),		(outs dst_rc:$vdata),
(ins i32imm:$dmask, i1imm:$unorm, i1imm:$glc, i1imm:$da, i1imm:$r128,		(ins i32imm:$dmask, i1imm:$unorm, i1imm:$glc, i1imm:$da, i1imm:$r128,
i1imm:$tfe, i1imm:$lwe, i1imm:$slc, src_rc:$vaddr,		i1imm:$tfe, i1imm:$lwe, i1imm:$slc, src_rc:$vaddr,
SReg_256:$srsrc),		SReg_256:$srsrc),
asm#" $vdata, $dmask, $unorm, $glc, $da, $r128,"		asm#" $vdata, $dmask, $unorm, $glc, $da, $r128,"
#" $tfe, $lwe, $slc, $vaddr, $srsrc",		#" $tfe, $lwe, $slc, $vaddr, $srsrc",
[]> {		dns> {
let ssamp = 0;		let ssamp = 0;
let mayLoad = 1;
let mayStore = 0;
let hasPostISelHook = 1;
}		}

multiclass MIMG_NoSampler_Src_Helper <bits<7> op, string asm,		multiclass MIMG_NoSampler_Src_Helper <bits<7> op, string asm,
RegisterClass dst_rc,		RegisterClass dst_rc,
int channels> {		int channels> {
def _V1 : MIMG_NoSampler_Helper <op, asm, dst_rc, VGPR_32>,		def _V1 : MIMG_NoSampler_Helper <op, asm, dst_rc, VGPR_32,
		!if(!eq(channels, 1), "AMDGPU", "")>,
MIMG_Mask<asm#"_V1", channels>;		MIMG_Mask<asm#"_V1", channels>;
def _V2 : MIMG_NoSampler_Helper <op, asm, dst_rc, VReg_64>,		def _V2 : MIMG_NoSampler_Helper <op, asm, dst_rc, VReg_64>,
MIMG_Mask<asm#"_V2", channels>;		MIMG_Mask<asm#"_V2", channels>;
def _V4 : MIMG_NoSampler_Helper <op, asm, dst_rc, VReg_128>,		def _V4 : MIMG_NoSampler_Helper <op, asm, dst_rc, VReg_128>,
MIMG_Mask<asm#"_V4", channels>;		MIMG_Mask<asm#"_V4", channels>;
}		}

multiclass MIMG_NoSampler <bits<7> op, string asm> {		multiclass MIMG_NoSampler <bits<7> op, string asm> {
defm _V1 : MIMG_NoSampler_Src_Helper <op, asm, VGPR_32, 1>;		defm _V1 : MIMG_NoSampler_Src_Helper <op, asm, VGPR_32, 1>;
defm _V2 : MIMG_NoSampler_Src_Helper <op, asm, VReg_64, 2>;		defm _V2 : MIMG_NoSampler_Src_Helper <op, asm, VReg_64, 2>;
defm _V3 : MIMG_NoSampler_Src_Helper <op, asm, VReg_96, 3>;		defm _V3 : MIMG_NoSampler_Src_Helper <op, asm, VReg_96, 3>;
defm _V4 : MIMG_NoSampler_Src_Helper <op, asm, VReg_128, 4>;		defm _V4 : MIMG_NoSampler_Src_Helper <op, asm, VReg_128, 4>;
}		}
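The `dns` parameter threaded through `MIMG_Helper` leaves only one variant per image instruction visible to the decoder: the inner `_V1` def gets the `"AMDGPU"` namespace only when `channels` is 1, and every def with an empty `dns` is marked `isAsmParserOnly`. The following Python sketch enumerates the expansion of the new `MIMG_NoSampler` multiclass under two assumptions: def names are formed by plain concatenation of the `defm`/`def` suffixes, and `isAsmParserOnly = 1` means no decoder entry is generated. The function names and the `IMAGE_LOAD` prefix used in the usage note are illustrative only.

```python
def mimg_nosampler_variants(name):
    """Expand a MIMG_NoSampler-style multiclass into (def name, dns) pairs."""
    variants = []
    for channels in (1, 2, 3, 4):           # outer defm _V1 .. _V4 (dst size)
        for addr in ("_V1", "_V2", "_V4"):  # inner def _V1/_V2/_V4 (vaddr size)
            # Mirrors !if(!eq(channels, 1), "AMDGPU", "") on the inner _V1 def;
            # the _V2 and _V4 defs keep the default dns="".
            dns = "AMDGPU" if (addr == "_V1" and channels == 1) else ""
            variants.append((f"{name}_V{channels}{addr}", dns))
    return variants


def decodable(variants):
    """Keep only defs with a non-empty namespace.

    Mirrors isAsmParserOnly = !if(!eq(dns, ""), 1, 0): an empty dns is
    assumed to exclude the def from decoder generation.
    """
    return [def_name for def_name, dns in variants if dns != ""]
```

Calling `decodable(mimg_nosampler_variants("IMAGE_LOAD"))` yields a single name, which matches the intent of picking one instruction from each conflicting group as the rule for the decoder.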

class MIMG_Sampler_Helper <bits<7> op, string asm,		class MIMG_Sampler_Helper <bits<7> op, string asm,
RegisterClass dst_rc,		RegisterClass dst_rc,
RegisterClass src_rc, int wqm> : MIMG <		RegisterClass src_rc,
		int wqm,
		string dns=""> : MIMG_Helper <
op,		op,
(outs dst_rc:$vdata),		(outs dst_rc:$vdata),
(ins i32imm:$dmask, i1imm:$unorm, i1imm:$glc, i1imm:$da, i1imm:$r128,		(ins i32imm:$dmask, i1imm:$unorm, i1imm:$glc, i1imm:$da, i1imm:$r128,
i1imm:$tfe, i1imm:$lwe, i1imm:$slc, src_rc:$vaddr,		i1imm:$tfe, i1imm:$lwe, i1imm:$slc, src_rc:$vaddr,
SReg_256:$srsrc, SReg_128:$ssamp),		SReg_256:$srsrc, SReg_128:$ssamp),
asm#" $vdata, $dmask, $unorm, $glc, $da, $r128,"		asm#" $vdata, $dmask, $unorm, $glc, $da, $r128,"
#" $tfe, $lwe, $slc, $vaddr, $srsrc, $ssamp",		#" $tfe, $lwe, $slc, $vaddr, $srsrc, $ssamp",
[]> {		dns> {
let mayLoad = 1;
let mayStore = 0;
let hasPostISelHook = 1;
let WQM = wqm;		let WQM = wqm;
}		}

multiclass MIMG_Sampler_Src_Helper <bits<7> op, string asm,		multiclass MIMG_Sampler_Src_Helper <bits<7> op, string asm,
RegisterClass dst_rc,		RegisterClass dst_rc,
int channels, int wqm> {		int channels, int wqm> {
def _V1 : MIMG_Sampler_Helper <op, asm, dst_rc, VGPR_32, wqm>,		def _V1 : MIMG_Sampler_Helper <op, asm, dst_rc, VGPR_32, wqm,
		!if(!eq(channels, 1), "AMDGPU", "")>,
MIMG_Mask<asm#"_V1", channels>;		MIMG_Mask<asm#"_V1", channels>;
def _V2 : MIMG_Sampler_Helper <op, asm, dst_rc, VReg_64, wqm>,		def _V2 : MIMG_Sampler_Helper <op, asm, dst_rc, VReg_64, wqm>,
MIMG_Mask<asm#"_V2", channels>;		MIMG_Mask<asm#"_V2", channels>;
def _V4 : MIMG_Sampler_Helper <op, asm, dst_rc, VReg_128, wqm>,		def _V4 : MIMG_Sampler_Helper <op, asm, dst_rc, VReg_128, wqm>,
MIMG_Mask<asm#"_V4", channels>;		MIMG_Mask<asm#"_V4", channels>;
def _V8 : MIMG_Sampler_Helper <op, asm, dst_rc, VReg_256, wqm>,		def _V8 : MIMG_Sampler_Helper <op, asm, dst_rc, VReg_256, wqm>,
MIMG_Mask<asm#"_V8", channels>;		MIMG_Mask<asm#"_V8", channels>;
def _V16 : MIMG_Sampler_Helper <op, asm, dst_rc, VReg_512, wqm>,		def _V16 : MIMG_Sampler_Helper <op, asm, dst_rc, VReg_512, wqm>,
MIMG_Mask<asm#"_V16", channels>;		MIMG_Mask<asm#"_V16", channels>;
}		}

multiclass MIMG_Sampler <bits<7> op, string asm> {		multiclass MIMG_Sampler <bits<7> op, string asm, int wqm=0> {
defm _V1 : MIMG_Sampler_Src_Helper<op, asm, VGPR_32, 1, 0>;		defm _V1 : MIMG_Sampler_Src_Helper<op, asm, VGPR_32, 1, wqm>;
defm _V2 : MIMG_Sampler_Src_Helper<op, asm, VReg_64, 2, 0>;		defm _V2 : MIMG_Sampler_Src_Helper<op, asm, VReg_64, 2, wqm>;
defm _V3 : MIMG_Sampler_Src_Helper<op, asm, VReg_96, 3, 0>;		defm _V3 : MIMG_Sampler_Src_Helper<op, asm, VReg_96, 3, wqm>;
defm _V4 : MIMG_Sampler_Src_Helper<op, asm, VReg_128, 4, 0>;		defm _V4 : MIMG_Sampler_Src_Helper<op, asm, VReg_128, 4, wqm>;
}		}

multiclass MIMG_Sampler_WQM <bits<7> op, string asm> {		multiclass MIMG_Sampler_WQM <bits<7> op, string asm> : MIMG_Sampler<op, asm, 1>;
defm _V1 : MIMG_Sampler_Src_Helper<op, asm, VGPR_32, 1, 1>;
defm _V2 : MIMG_Sampler_Src_Helper<op, asm, VReg_64, 2, 1>;
defm _V3 : MIMG_Sampler_Src_Helper<op, asm, VReg_96, 3, 1>;
defm _V4 : MIMG_Sampler_Src_Helper<op, asm, VReg_128, 4, 1>;
}

class MIMG_Gather_Helper <bits<7> op, string asm,		class MIMG_Gather_Helper <bits<7> op, string asm,
RegisterClass dst_rc,		RegisterClass dst_rc,
RegisterClass src_rc, int wqm> : MIMG <		RegisterClass src_rc, int wqm> : MIMG <
op,		op,
(outs dst_rc:$vdata),		(outs dst_rc:$vdata),
(ins i32imm:$dmask, i1imm:$unorm, i1imm:$glc, i1imm:$da, i1imm:$r128,		(ins i32imm:$dmask, i1imm:$unorm, i1imm:$glc, i1imm:$da, i1imm:$r128,
i1imm:$tfe, i1imm:$lwe, i1imm:$slc, src_rc:$vaddr,		i1imm:$tfe, i1imm:$lwe, i1imm:$slc, src_rc:$vaddr,
[9 lines not shown]	class MIMG_Gather_Helper <bits<7> op, string asm,
// the component to fetch. The only useful DMASK values are		// the component to fetch. The only useful DMASK values are
// 1=red, 2=green, 4=blue, 8=alpha. (e.g. 1 returns		// 1=red, 2=green, 4=blue, 8=alpha. (e.g. 1 returns
// (red,red,red,red) etc.) The ISA document doesn't mention		// (red,red,red,red) etc.) The ISA document doesn't mention
// this.		// this.
// Therefore, disable all code which updates DMASK by setting these two:		// Therefore, disable all code which updates DMASK by setting these two:
let MIMG = 0;		let MIMG = 0;
let hasPostISelHook = 0;		let hasPostISelHook = 0;
let WQM = wqm;		let WQM = wqm;

		let isAsmParserOnly = 1; // TBD: fix it later
}		}

multiclass MIMG_Gather_Src_Helper <bits<7> op, string asm,		multiclass MIMG_Gather_Src_Helper <bits<7> op, string asm,
RegisterClass dst_rc,		RegisterClass dst_rc,
int channels, int wqm> {		int channels, int wqm> {
def _V1 : MIMG_Gather_Helper <op, asm, dst_rc, VGPR_32, wqm>,		def _V1 : MIMG_Gather_Helper <op, asm, dst_rc, VGPR_32, wqm>,
MIMG_Mask<asm#"_V1", channels>;		MIMG_Mask<asm#"_V1", channels>;
def _V2 : MIMG_Gather_Helper <op, asm, dst_rc, VReg_64, wqm>,		def _V2 : MIMG_Gather_Helper <op, asm, dst_rc, VReg_64, wqm>,
MIMG_Mask<asm#"_V2", channels>;		MIMG_Mask<asm#"_V2", channels>;
def _V4 : MIMG_Gather_Helper <op, asm, dst_rc, VReg_128, wqm>,		def _V4 : MIMG_Gather_Helper <op, asm, dst_rc, VReg_128, wqm>,
MIMG_Mask<asm#"_V4", channels>;		MIMG_Mask<asm#"_V4", channels>;
def _V8 : MIMG_Gather_Helper <op, asm, dst_rc, VReg_256, wqm>,		def _V8 : MIMG_Gather_Helper <op, asm, dst_rc, VReg_256, wqm>,
MIMG_Mask<asm#"_V8", channels>;		MIMG_Mask<asm#"_V8", channels>;
def _V16 : MIMG_Gather_Helper <op, asm, dst_rc, VReg_512, wqm>,		def _V16 : MIMG_Gather_Helper <op, asm, dst_rc, VReg_512, wqm>,
MIMG_Mask<asm#"_V16", channels>;		MIMG_Mask<asm#"_V16", channels>;
}		}

multiclass MIMG_Gather <bits<7> op, string asm> {		multiclass MIMG_Gather <bits<7> op, string asm, int wqm=0> {
defm _V1 : MIMG_Gather_Src_Helper<op, asm, VGPR_32, 1, 0>;		defm _V1 : MIMG_Gather_Src_Helper<op, asm, VGPR_32, 1, wqm>;
defm _V2 : MIMG_Gather_Src_Helper<op, asm, VReg_64, 2, 0>;		defm _V2 : MIMG_Gather_Src_Helper<op, asm, VReg_64, 2, wqm>;
defm _V3 : MIMG_Gather_Src_Helper<op, asm, VReg_96, 3, 0>;		defm _V3 : MIMG_Gather_Src_Helper<op, asm, VReg_96, 3, wqm>;
defm _V4 : MIMG_Gather_Src_Helper<op, asm, VReg_128, 4, 0>;		defm _V4 : MIMG_Gather_Src_Helper<op, asm, VReg_128, 4, wqm>;
}		}

multiclass MIMG_Gather_WQM <bits<7> op, string asm> {		multiclass MIMG_Gather_WQM <bits<7> op, string asm> : MIMG_Gather<op, asm, 1>;
defm _V1 : MIMG_Gather_Src_Helper<op, asm, VGPR_32, 1, 1>;
defm _V2 : MIMG_Gather_Src_Helper<op, asm, VReg_64, 2, 1>;
defm _V3 : MIMG_Gather_Src_Helper<op, asm, VReg_96, 3, 1>;
defm _V4 : MIMG_Gather_Src_Helper<op, asm, VReg_128, 4, 1>;
}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Vector instruction mappings		// Vector instruction mappings
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// Maps an opcode in e32 form to its e64 equivalent		// Maps an opcode in e32 form to its e64 equivalent
def getVOPe64 : InstrMapping {		def getVOPe64 : InstrMapping {
let FilterClass = "VOP";		let FilterClass = "VOP";
[remaining 96 lines not shown]

llvm/trunk/lib/Target/AMDGPU/SIInstructions.td

	[first 1,028 lines not shown]
	//def BUFFER_ATOMIC_OR_X2 : MUBUF_X2 <mubuf<0x5a, 0x69>, "buffer_atomic_or_x2", []>;			//def BUFFER_ATOMIC_OR_X2 : MUBUF_X2 <mubuf<0x5a, 0x69>, "buffer_atomic_or_x2", []>;
	//def BUFFER_ATOMIC_XOR_X2 : MUBUF_X2 <mubuf<0x5b, 0x6a>, "buffer_atomic_xor_x2", []>;			//def BUFFER_ATOMIC_XOR_X2 : MUBUF_X2 <mubuf<0x5b, 0x6a>, "buffer_atomic_xor_x2", []>;
	//def BUFFER_ATOMIC_INC_X2 : MUBUF_X2 <mubuf<0x5c, 0x6b>, "buffer_atomic_inc_x2", []>;			//def BUFFER_ATOMIC_INC_X2 : MUBUF_X2 <mubuf<0x5c, 0x6b>, "buffer_atomic_inc_x2", []>;
	//def BUFFER_ATOMIC_DEC_X2 : MUBUF_X2 <mubuf<0x5d, 0x6c>, "buffer_atomic_dec_x2", []>;			//def BUFFER_ATOMIC_DEC_X2 : MUBUF_X2 <mubuf<0x5d, 0x6c>, "buffer_atomic_dec_x2", []>;
	//def BUFFER_ATOMIC_FCMPSWAP_X2 : MUBUF_X2 <mubuf<0x5e>, "buffer_atomic_fcmpswap_x2", []>; // isn't on VI			//def BUFFER_ATOMIC_FCMPSWAP_X2 : MUBUF_X2 <mubuf<0x5e>, "buffer_atomic_fcmpswap_x2", []>; // isn't on VI
	//def BUFFER_ATOMIC_FMIN_X2 : MUBUF_X2 <mubuf<0x5f>, "buffer_atomic_fmin_x2", []>; // isn't on VI			//def BUFFER_ATOMIC_FMIN_X2 : MUBUF_X2 <mubuf<0x5f>, "buffer_atomic_fmin_x2", []>; // isn't on VI
	//def BUFFER_ATOMIC_FMAX_X2 : MUBUF_X2 <mubuf<0x60>, "buffer_atomic_fmax_x2", []>; // isn't on VI			//def BUFFER_ATOMIC_FMAX_X2 : MUBUF_X2 <mubuf<0x60>, "buffer_atomic_fmax_x2", []>; // isn't on VI

	let SubtargetPredicate = isSI in {			let SubtargetPredicate = isSI, DisableVIDecoder = 1 in {
	defm BUFFER_WBINVL1_SC : MUBUF_Invalidate <mubuf<0x70>, "buffer_wbinvl1_sc", int_amdgcn_buffer_wbinvl1_sc>; // isn't on CI & VI			defm BUFFER_WBINVL1_SC : MUBUF_Invalidate <mubuf<0x70>, "buffer_wbinvl1_sc", int_amdgcn_buffer_wbinvl1_sc>; // isn't on CI & VI
	}			}

	defm BUFFER_WBINVL1 : MUBUF_Invalidate <mubuf<0x71, 0x3e>, "buffer_wbinvl1", int_amdgcn_buffer_wbinvl1>;			defm BUFFER_WBINVL1 : MUBUF_Invalidate <mubuf<0x71, 0x3e>, "buffer_wbinvl1", int_amdgcn_buffer_wbinvl1>;

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// MTBUF Instructions			// MTBUF Instructions
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	[345 lines not shown]
	>;			>;

	let OtherPredicates = [has32BankLDS] in {			let OtherPredicates = [has32BankLDS] in {

	defm V_INTERP_P1_F32 : V_INTERP_P1_F32_m;			defm V_INTERP_P1_F32 : V_INTERP_P1_F32_m;

	} // End OtherPredicates = [has32BankLDS]			} // End OtherPredicates = [has32BankLDS]

	let OtherPredicates = [has16BankLDS], Constraints = "@earlyclobber $dst" in {			let OtherPredicates = [has16BankLDS], Constraints = "@earlyclobber $dst", isAsmParserOnly=1 in {

	defm V_INTERP_P1_F32_16bank : V_INTERP_P1_F32_m;			defm V_INTERP_P1_F32_16bank : V_INTERP_P1_F32_m;

	} // End OtherPredicates = [has32BankLDS], Constraints = "@earlyclobber $dst"			} // End OtherPredicates = [has32BankLDS], Constraints = "@earlyclobber $dst", isAsmParserOnly=1

	let DisableEncoding = "$src0", Constraints = "$src0 = $dst" in {			let DisableEncoding = "$src0", Constraints = "$src0 = $dst" in {

	defm V_INTERP_P2_F32 : VINTRP_m <			defm V_INTERP_P2_F32 : VINTRP_m <
	0x00000001,			0x00000001,
	(outs VGPR_32:$dst),			(outs VGPR_32:$dst),
	(ins VGPR_32:$src0, VGPR_32:$j, i32imm:$attr_chan, i32imm:$attr),			(ins VGPR_32:$src0, VGPR_32:$j, i32imm:$attr_chan, i32imm:$attr),
	"v_interp_p2_f32 $dst, [$src0], $j, $attr_chan, $attr, [m0]",			"v_interp_p2_f32 $dst, [$src0], $j, $attr_chan, $attr, [m0]",
	[342 lines not shown]

	defm V_MUL_LO_U32 : VOP3Inst <vop3<0x169, 0x285>, "v_mul_lo_u32",			defm V_MUL_LO_U32 : VOP3Inst <vop3<0x169, 0x285>, "v_mul_lo_u32",
	VOP_I32_I32_I32			VOP_I32_I32_I32
	>;			>;
	defm V_MUL_HI_U32 : VOP3Inst <vop3<0x16a, 0x286>, "v_mul_hi_u32",			defm V_MUL_HI_U32 : VOP3Inst <vop3<0x16a, 0x286>, "v_mul_hi_u32",
	VOP_I32_I32_I32, mulhu			VOP_I32_I32_I32, mulhu
	>;			>;

				let DisableVIDecoder=1 in { // removed from VI as identical to V_MUL_LO_U32
	defm V_MUL_LO_I32 : VOP3Inst <vop3<0x16b, 0x285>, "v_mul_lo_i32",			defm V_MUL_LO_I32 : VOP3Inst <vop3<0x16b, 0x285>, "v_mul_lo_i32",
	VOP_I32_I32_I32			VOP_I32_I32_I32
	>;			>;
				}
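The `DisableVIDecoder=1` wrapper above exists because `V_MUL_LO_U32` and `V_MUL_LO_I32` carry the same VI opcode in their `vop3<...>` arguments. A small Python sketch of how such a clash can be spotted is given below; the opcode pairs are copied from the four definitions in this hunk, while the `find_conflicts` helper and the tuple layout are hypothetical and only illustrate the check.

```python
from collections import defaultdict

# (name, SI opcode, VI opcode) as written in the vop3<si, vi> arguments above.
VOP3_MUL = [
    ("V_MUL_LO_U32", 0x169, 0x285),
    ("V_MUL_HI_U32", 0x16A, 0x286),
    ("V_MUL_LO_I32", 0x16B, 0x285),
    ("V_MUL_HI_I32", 0x16C, 0x287),
]

SI_COLUMN = 1
VI_COLUMN = 2


def find_conflicts(defs, column):
    """Return {opcode: [names]} for opcodes shared by more than one def."""
    by_opcode = defaultdict(list)
    for entry in defs:
        by_opcode[entry[column]].append(entry[0])
    return {op: names for op, names in by_opcode.items() if len(names) > 1}
```

With these four entries the SI column has no duplicates, while the VI column reports 0x285 for both `V_MUL_LO_*` instructions, which is the pair the diff resolves by disabling the VI decoder for `V_MUL_LO_I32`.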

	defm V_MUL_HI_I32 : VOP3Inst <vop3<0x16c, 0x287>, "v_mul_hi_i32",			defm V_MUL_HI_I32 : VOP3Inst <vop3<0x16c, 0x287>, "v_mul_hi_i32",
	VOP_I32_I32_I32, mulhs			VOP_I32_I32_I32, mulhs
	>;			>;

	} // End isCommutable = 1, SchedRW = [WriteQuarterRate32]			} // End isCommutable = 1, SchedRW = [WriteQuarterRate32]

	let SchedRW = [WriteFloatFMA, WriteSALU] in {			let SchedRW = [WriteFloatFMA, WriteSALU] in {
	defm V_DIV_SCALE_F32 : VOP3bInst <vop3<0x16d, 0x1e0>, "v_div_scale_f32",			defm V_DIV_SCALE_F32 : VOP3bInst <vop3<0x16d, 0x1e0>, "v_div_scale_f32",
	[52 lines not shown]
	defm V_LSHR_B64 : VOP3Inst <vop3<0x162>, "v_lshr_b64", VOP_I64_I64_I32>;			defm V_LSHR_B64 : VOP3Inst <vop3<0x162>, "v_lshr_b64", VOP_I64_I64_I32>;
	defm V_ASHR_I64 : VOP3Inst <vop3<0x163>, "v_ashr_i64", VOP_I64_I64_I32>;			defm V_ASHR_I64 : VOP3Inst <vop3<0x163>, "v_ashr_i64", VOP_I64_I64_I32>;

	defm V_MULLIT_F32 : VOP3Inst <vop3<0x150>, "v_mullit_f32",			defm V_MULLIT_F32 : VOP3Inst <vop3<0x150>, "v_mullit_f32",
	VOP_F32_F32_F32_F32>;			VOP_F32_F32_F32_F32>;

	} // End SubtargetPredicate = isSICI			} // End SubtargetPredicate = isSICI

	let SubtargetPredicate = isVI in {			let SubtargetPredicate = isVI, DisableSIDecoder = 1 in {

	defm V_LSHLREV_B64 : VOP3Inst <vop3<0, 0x28f>, "v_lshlrev_b64",			defm V_LSHLREV_B64 : VOP3Inst <vop3<0, 0x28f>, "v_lshlrev_b64",
	VOP_I64_I32_I64			VOP_I64_I32_I64
	>;			>;
	defm V_LSHRREV_B64 : VOP3Inst <vop3<0, 0x290>, "v_lshrrev_b64",			defm V_LSHRREV_B64 : VOP3Inst <vop3<0, 0x290>, "v_lshrrev_b64",
	VOP_I64_I32_I64			VOP_I64_I32_I64
	>;			>;
	defm V_ASHRREV_I64 : VOP3Inst <vop3<0, 0x291>, "v_ashrrev_i64",			defm V_ASHRREV_I64 : VOP3Inst <vop3<0, 0x291>, "v_ashrrev_i64",
	[remaining 1,361 lines not shown]

llvm/trunk/lib/Target/AMDGPU/VIInstructions.td

	//===-- VIInstructions.td - VI Instruction Defintions ---------------------===//			//===-- VIInstructions.td - VI Instruction Defintions ---------------------===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Instruction definitions for VI and newer.			// Instruction definitions for VI and newer.
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	let SIAssemblerPredicate = DisableInst, SubtargetPredicate = isVI in {			let SIAssemblerPredicate = DisableInst, SubtargetPredicate = isVI in {

				let DisableSIDecoder = 1 in {

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// VOP1 Instructions			// VOP1 Instructions
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	defm V_CVT_F16_U16 : VOP1Inst <vop1<0, 0x39>, "v_cvt_f16_u16", VOP_F16_I16>;			defm V_CVT_F16_U16 : VOP1Inst <vop1<0, 0x39>, "v_cvt_f16_u16", VOP_F16_I16>;
	defm V_CVT_F16_I16 : VOP1Inst <vop1<0, 0x3a>, "v_cvt_f16_i16", VOP_F16_I16>;			defm V_CVT_F16_I16 : VOP1Inst <vop1<0, 0x3a>, "v_cvt_f16_i16", VOP_F16_I16>;
	defm V_CVT_U16_F16 : VOP1Inst <vop1<0, 0x3b>, "v_cvt_u16_f16", VOP_I16_F16>;			defm V_CVT_U16_F16 : VOP1Inst <vop1<0, 0x3b>, "v_cvt_u16_f16", VOP_I16_F16>;
	defm V_CVT_I16_F16 : VOP1Inst <vop1<0, 0x3c>, "v_cvt_i16_f16", VOP_I16_F16>;			defm V_CVT_I16_F16 : VOP1Inst <vop1<0, 0x3c>, "v_cvt_i16_f16", VOP_I16_F16>;
	[46 lines not shown]
	defm V_MIN_F16 : VOP2Inst <vop2<0,0x2e>, "v_min_f16", VOP_F16_F16_F16>;			defm V_MIN_F16 : VOP2Inst <vop2<0,0x2e>, "v_min_f16", VOP_F16_F16_F16>;
	defm V_MAX_U16 : VOP2Inst <vop2<0,0x2f>, "v_max_u16", VOP_I16_I16_I16>;			defm V_MAX_U16 : VOP2Inst <vop2<0,0x2f>, "v_max_u16", VOP_I16_I16_I16>;
	defm V_MAX_I16 : VOP2Inst <vop2<0,0x30>, "v_max_i16", VOP_I16_I16_I16>;			defm V_MAX_I16 : VOP2Inst <vop2<0,0x30>, "v_max_i16", VOP_I16_I16_I16>;
	defm V_MIN_U16 : VOP2Inst <vop2<0,0x31>, "v_min_u16", VOP_I16_I16_I16>;			defm V_MIN_U16 : VOP2Inst <vop2<0,0x31>, "v_min_u16", VOP_I16_I16_I16>;
	defm V_MIN_I16 : VOP2Inst <vop2<0,0x32>, "v_min_i16", VOP_I16_I16_I16>;			defm V_MIN_I16 : VOP2Inst <vop2<0,0x32>, "v_min_i16", VOP_I16_I16_I16>;
	} // End isCommutable = 1			} // End isCommutable = 1
	defm V_LDEXP_F16 : VOP2Inst <vop2<0,0x33>, "v_ldexp_f16", VOP_F16_F16_I16>;			defm V_LDEXP_F16 : VOP2Inst <vop2<0,0x33>, "v_ldexp_f16", VOP_F16_F16_I16>;

				} // let DisableSIDecoder = 1

	// Aliases to simplify matching of floating-point instructions that			// Aliases to simplify matching of floating-point instructions that
	// are VOP2 on SI and VOP3 on VI.			// are VOP2 on SI and VOP3 on VI.

	class SI2_VI3Alias <string name, Instruction inst> : InstAlias <			class SI2_VI3Alias <string name, Instruction inst> : InstAlias <
	name#" $dst, $src0, $src1",			name#" $dst, $src0, $src1",
	(inst VGPR_32:$dst, 0, VCSrc_32:$src0, 0, VCSrc_32:$src1, 0, 0)			(inst VGPR_32:$dst, 0, VCSrc_32:$src0, 0, VCSrc_32:$src1, 0, 0)
	>, PredicateControl {			>, PredicateControl {
	let UseInstAsmMatchConverter = 0;			let UseInstAsmMatchConverter = 0;
	[40 lines not shown]

llvm/trunk/test/MC/Disassembler/AMDGPU/lit.local.cfg

				if not 'AMDGPU' in config.root.targets:
				    config.unsupported = True

llvm/trunk/test/MC/Disassembler/AMDGPU/mov.txt

				# RUN: llvm-mc -arch=amdgcn -mcpu=tonga -disassemble -show-encoding < %s | FileCheck %s

				# CHECK: v_mov_b32_e32 v2, v1 ; encoding: [0x01,0x03,0x04,0x7e]
				0x01 0x03 0x04 0x7e

				# CHECK: v_mov_b32_e32 v1, 0.5 ; encoding: [0xf0,0x02,0x02,0x7e]
				0xf0 0x02 0x02 0x7e

				# CHECK: v_mov_b32_e32 v15, s100 ; encoding: [0x64,0x02,0x1e,0x7e]
				0x64 0x02 0x1e 0x7e

				# CHECK: v_mov_b32_e32 v90, flat_scratch_lo ; encoding: [0x66,0x02,0xb4,0x7e]
				0x66 0x02 0xb4 0x7e

				# CHECK: v_mov_b32_e32 v150, vcc_lo ; encoding: [0x6a,0x02,0x2c,0x7f]
				0x6a 0x02 0x2c 0x7f

				# CHECK: v_mov_b32_e32 v199, exec_lo ; encoding: [0x7e,0x02,0x8e,0x7f]
				0x7e 0x02 0x8e 0x7f

				# CHECK: v_mov_b32_e32 v222, m0 ; encoding: [0x7c,0x02,0xbc,0x7f]
				0x7c 0x02 0xbc 0x7f

				# CHECK: v_mov_b32_e32 v255, -13 ; encoding: [0xcd,0x02,0xfe,0x7f]
				0xcd 0x02 0xfe 0x7f

				# CHECK: v_cvt_f32_i32_e32 v153, s98 ; encoding: [0x62,0x0a,0x32,0x7f]
				0x62 0x0a 0x32 0x7f

				# CHECK: v_cvt_f32_u32_e32 v33, -4.0 ; encoding: [0xf7,0x0c,0x42,0x7e]
				0xf7 0x0c 0x42 0x7e
				No newline at end of file
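The byte sequences in mov.txt can be checked by hand. The Python sketch below splits a 4-byte little-endian word into VOP1 fields, assuming the layout src0 in bits 8:0, op in bits 16:9, vdst in bits 24:17 and a fixed 0x3f in bits 31:25, which is consistent with every test vector above. The operand table covers only the operand kinds these tests exercise; `split_vop1`, `decode_src0` and the lookup tables are hypothetical helpers, not part of the disassembler in this patch.

```python
import struct

# Operand encodings limited to the cases exercised by mov.txt (VI numbering).
NAMED_SGPRS = {102: "flat_scratch_lo", 106: "vcc_lo", 124: "m0", 126: "exec_lo"}
INLINE_FLOATS = {240: "0.5", 241: "-0.5", 242: "1.0", 243: "-1.0",
                 244: "2.0", 245: "-2.0", 246: "4.0", 247: "-4.0"}


def split_vop1(raw):
    """Split four little-endian bytes into VOP1 fields."""
    (word,) = struct.unpack("<I", bytes(raw))
    if word >> 25 != 0x3F:
        raise ValueError("not a VOP1 encoding")
    return {
        "op": (word >> 9) & 0xFF,
        "vdst": (word >> 17) & 0xFF,
        "src0": word & 0x1FF,
    }


def decode_src0(value):
    """Render a 9-bit src0 field as an operand string."""
    if value >= 256:
        return f"v{value - 256}"    # VGPRs occupy 256..511
    if value in NAMED_SGPRS:
        return NAMED_SGPRS[value]
    if value <= 101:
        return f"s{value}"          # plain SGPRs
    if 128 <= value <= 192:
        return str(value - 128)     # inline integers 0..64
    if 193 <= value <= 208:
        return str(192 - value)     # inline integers -1..-16
    if value in INLINE_FLOATS:
        return INLINE_FLOATS[value]
    return f"<unhandled {value}>"   # anything else is out of scope here
```

For the first test vector, `split_vop1([0x01, 0x03, 0x04, 0x7e])` gives op 1, vdst 2 and src0 257, and `decode_src0(257)` renders as `v1`, matching `v_mov_b32_e32 v2, v1`.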

llvm/trunk/test/MC/Disassembler/AMDGPU/nop.txt

				# RUN: llvm-mc -arch=amdgcn -mcpu=tonga -disassemble -show-encoding < %s | FileCheck %s

				# CHECK: v_nop ; encoding: [0x00,0x00,0x00,0x7e]
				0x00 0x00 0x00 0x7e