This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Target/
-
llvm/
-
Target/
4/4
Target.td
-
test/TableGen/
-
TableGen/
-
VarLenDecoder.td
-
utils/TableGen/
-
TableGen/
1/1
DecoderEmitter.cpp
-
VarLenCodeEmitterGen.h
7/7
VarLenCodeEmitterGen.cpp

Differential D142079

[TableGen] Support custom decoders for variable length instructions
ClosedPublic

Authored by myhsu on Jan 18 2023, 8:42 PM.

Download Raw Diff

Details

Reviewers

0x59616e
RKSimon
jyknight
Paul-C-Anagnostopoulos

Commits

rG36c19eae27b2: [TableGen] Support custom decoders for variable length instructions

Summary

Just like the encoder directive for variable-length instructions, this patch adds a new decoder directive to allow custom decoder function on an operand.

Right now, due to the design of DecoderEmitter each operand can only have a single custom decoder in a given instruction.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

myhsu created this revision.Jan 18 2023, 8:42 PM

Herald added a project: Restricted Project. · View Herald TranscriptJan 18 2023, 8:42 PM

myhsu requested review of this revision.Jan 18 2023, 8:42 PM

Herald added a project: Restricted Project. · View Herald TranscriptJan 18 2023, 8:42 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

myhsu added a child revision: D142080: [M68k][Disassembler] Use custom decoder for 32-bit immediates.Jan 18 2023, 8:46 PM

myhsu mentioned this in D137902: [M68k][MC] Make immediate operands relocatable.Jan 18 2023, 8:50 PM

Harbormaster completed remote builds in B208651: Diff 490369.Jan 18 2023, 10:30 PM

0x59616e added inline comments.Jan 19 2023, 8:05 PM

llvm/utils/TableGen/DecoderEmitter.cpp
1896	nit: Use `!empty` for better readability.
llvm/utils/TableGen/VarLenCodeEmitterGen.cpp
165–170	nit: `getCustomCoders` will return empty `StringRef` automatically if `DI->getNumArgs()` == 2. So this if statement is unnecessary.
195–198	ditto
215–227	Use early exit to reduce indentation for better readability.
221–222	nit: Avoid declaring multiple variables in the same statement;

0x59616e added inline comments.Jan 19 2023, 8:12 PM

llvm/include/llvm/Target/Target.td
806–807	Under what circumstances will we need different decoders for the same operand ?

0x59616e added inline comments.Jan 19 2023, 8:20 PM

llvm/utils/TableGen/VarLenCodeEmitterGen.cpp
212	`getCustomCoders` seems unnecessary to be a member function of `VarLenInst` ?

Addressed feedbacks

llvm/include/llvm/Target/Target.td
806–807	I figure it's probably rare but for instance, the binary encoding of an operand was separated in two different places so the decoder has to puzzle them back.
llvm/utils/TableGen/VarLenCodeEmitterGen.cpp
165–170	good catch, thanks!

I'm not very familiar with varlen decoder, hence only general notes.

llvm/include/llvm/Target/Target.td
806–807	Is it diagnosed somewhere?
llvm/utils/TableGen/VarLenCodeEmitterGen.cpp
88	The guideline is to limit anonymous namespaces to class declarations and use `static` everywhere else. The function does not protect against users, it needs proper error checking.

Harbormaster completed remote builds in B209163: Diff 491092.Jan 21 2023, 1:06 PM

Should we add a description to llvm/docs/TableGen/ProgRef.rst?

The TableGen Programmer's Reference does not include descriptions of backend facilities.

In D142079#4071770, @Paul-C-Anagnostopoulos wrote:

The TableGen Programmer's Reference does not include descriptions of backend facilities.

So maybe BackGuide.rst ?

Yes, that makes sense.

In D142079#4071791, @RKSimon wrote:

In D142079#4071770, @Paul-C-Anagnostopoulos wrote:

The TableGen Programmer's Reference does not include descriptions of backend facilities.

So maybe BackGuide.rst ?

I don't think it will be a good place since it's for TableGen backend developers while this patch only adds new TG directives for disassembler developers. Right now, IMO, the most related document is actually Writing an LLVM Backend, which is the only place mentioning TG syntax for writing instruction encodings (for fixed-length instructions, of course). Instead of cluttering with Writing an LLVM Backend I feel like a better way will be creating a separate page for variable-length instruction encoding / decoding.

Pull getCustomCoders out of anonymous namespace
Report a fatal error if a encoder or decoder directive is not followed by a function name.

llvm/include/llvm/Target/Target.td
806–807	I double check with the code and found that it requires some works to report such diagnostics (e.g. print a warning if two different decoders are used on the same operand), which personally I think it's not worth it.

Harbormaster completed remote builds in B209282: Diff 491239.Jan 23 2023, 12:46 AM

LGTM. Thanks for this amazing work ;)

This revision is now accepted and ready to land.Jan 24 2023, 6:24 AM

This revision was landed with ongoing or failed builds.Jan 24 2023, 10:03 PM

Closed by commit rG36c19eae27b2: [TableGen] Support custom decoders for variable length instructions (authored by myhsu). · Explain Why

This revision was automatically updated to reflect the committed changes.

myhsu added a commit: rG36c19eae27b2: [TableGen] Support custom decoders for variable length instructions.

skan added a subscriber: skan.Feb 8 2023, 3:05 AM

Revision Contents

Path

Size

llvm/

include/

llvm/

Target/

Target.td

10 lines

test/

TableGen/

VarLenDecoder.td

9 lines

utils/

TableGen/

DecoderEmitter.cpp

2 lines

VarLenCodeEmitterGen.h

9 lines

VarLenCodeEmitterGen.cpp

51 lines

Diff 491994

llvm/include/llvm/Target/Target.td

	Show First 20 Lines • Show All 790 Lines • ▼ Show 20 Lines
	/// Which represents a 4-bit encoding for an instruction operand named `$src`.			/// Which represents a 4-bit encoding for an instruction operand named `$src`.
	def operand;			def operand;
	/// Similar to `operand`, we can reference only part of the operand's encoding:			/// Similar to `operand`, we can reference only part of the operand's encoding:
	/// (slice "$src", 6, 8)			/// (slice "$src", 6, 8)
	/// (slice "$src", 8, 6)			/// (slice "$src", 8, 6)
	/// Both DAG represent bit 6 to 8 (total of 3 bits) in the encoding of operand			/// Both DAG represent bit 6 to 8 (total of 3 bits) in the encoding of operand
	/// `$src`.			/// `$src`.
	def slice;			def slice;
	/// You can use `encoder` to specify a custom encoder function for a specific			/// You can use `encoder` or `decoder` to specify a custom encoder or decoder
	/// `operand` or `encoder` directive. For example:			/// function for a specific `operand` or `slice` directive. For example:
	/// (operand "$src", 4, (encoder "encodeMyImm"))			/// (operand "$src", 4, (encoder "encodeMyImm"))
	/// (slice "$src", 8, 6, (encoder "encodeMyReg"))			/// (slice "$src", 8, 6, (encoder "encodeMyReg"))
				/// (operand "$src", 4, (encoder "encodeMyImm"), (decoder "decodeMyImm"))
				/// The ordering of `encoder` and `decoder` in the same `operand` or `slice`
				/// doesn't matter.
				/// Note that currently we cannot assign different decoders in the same
				/// (instruction) operand.
				0x59616eUnsubmitted Done Reply Inline Actions Under what circumstances will we need different decoders for the same operand ? 0x59616e: Under what circumstances will we need different decoders for the same operand ?
				myhsuAuthorUnsubmitted Done Reply Inline Actions I figure it's probably rare but for instance, the binary encoding of an operand was separated in two different places so the decoder has to puzzle them back. myhsu: I figure it's probably rare but for instance, the binary encoding of an operand was separated…
				barannikov88Unsubmitted Done Reply Inline Actions Is it diagnosed somewhere? barannikov88: Is it diagnosed somewhere?
				myhsuAuthorUnsubmitted Done Reply Inline Actions I double check with the code and found that it requires some works to report such diagnostics (e.g. print a warning if two different decoders are used on the same operand), which personally I think it's not worth it. myhsu: I double check with the code and found that it requires some works to report such diagnostics…
	def encoder;			def encoder;
				def decoder;

	/// PointerLikeRegClass - Values that are designed to have pointer width are			/// PointerLikeRegClass - Values that are designed to have pointer width are
	/// derived from this. TableGen treats the register class as having a symbolic			/// derived from this. TableGen treats the register class as having a symbolic
	/// type that it doesn't know, and resolves the actual regclass to use by using			/// type that it doesn't know, and resolves the actual regclass to use by using
	/// the TargetRegisterInfo::getPointerRegClass() hook at codegen time.			/// the TargetRegisterInfo::getPointerRegClass() hook at codegen time.
	class PointerLikeRegClass<int Kind> {			class PointerLikeRegClass<int Kind> {
	int RegClassKind = Kind;			int RegClassKind = Kind;
	}			}
	▲ Show 20 Lines • Show All 1,035 Lines • Show Last 20 Lines

llvm/test/TableGen/VarLenDecoder.td

Show All 28 Lines	class MyVarInst<MyMemOperand memory_op> : Instruction {

let OutOperandList = (outs GR64:$dst);		let OutOperandList = (outs GR64:$dst);
let InOperandList = (ins memory_op:$src);		let InOperandList = (ins memory_op:$src);
}		}

def FOO16 : MyVarInst<MemOp16> {		def FOO16 : MyVarInst<MemOp16> {
let Inst = (ascend		let Inst = (ascend
(descend (operand "$dst", 3), 0b01000, (operand "$src.reg", 3)),		(descend (operand "$dst", 3), 0b01000, (operand "$src.reg", 3)),
(slice "$src.offset", 15, 0)		(slice "$src.offset", 15, 0, (decoder "myCustomDecoder"))
);		);
}		}
def FOO32 : MyVarInst<MemOp32> {		def FOO32 : MyVarInst<MemOp32> {
let Inst = (ascend		let Inst = (ascend
(descend (operand "$dst", 3), 0b01001, (operand "$src.reg", 3)),		(descend (operand "$dst", 3), 0b01001,
		(operand "$src.reg", 3, (decoder "myCustomDecoder"))),
(slice "$src.offset", 31, 16),		(slice "$src.offset", 31, 16),
(slice "$src.offset", 15, 0)		(slice "$src.offset", 15, 0)
);		);
}		}

// CHECK: MCD::OPC_ExtractField, 3, 5, // Inst{7-3} ...		// CHECK: MCD::OPC_ExtractField, 3, 5, // Inst{7-3} ...
// CHECK-NEXT: MCD::OPC_FilterValue, 8, 4, 0, 0, // Skip to: 12		// CHECK-NEXT: MCD::OPC_FilterValue, 8, 4, 0, 0, // Skip to: 12
// CHECK-NEXT: MCD::OPC_Decode, [[#OPCODE:]], 1, 0, // Opcode: FOO16		// CHECK-NEXT: MCD::OPC_Decode, [[#OPCODE:]], 1, 0, // Opcode: FOO16
// CHECK-NEXT: MCD::OPC_FilterValue, 9, 4, 0, 0, // Skip to: 21		// CHECK-NEXT: MCD::OPC_FilterValue, 9, 4, 0, 0, // Skip to: 21
// CHECK-NEXT: MCD::OPC_Decode, [[#OPCODE+1]], 1, 1, // Opcode: FOO32		// CHECK-NEXT: MCD::OPC_Decode, [[#OPCODE+1]], 1, 1, // Opcode: FOO32
// CHECK-NEXT: MCD::OPC_Fail,		// CHECK-NEXT: MCD::OPC_Fail,

// Instruction length table		// Instruction length table
// CHECK: 27,		// CHECK: 27,
// CHECK-NEXT: 43,		// CHECK-NEXT: 43,
// CHECK-NEXT: };		// CHECK-NEXT: };

// CHECK: case 0:		// CHECK: case 0:
// CHECK-NEXT: tmp = fieldFromInstruction(insn, 8, 3);		// CHECK-NEXT: tmp = fieldFromInstruction(insn, 8, 3);
// CHECK-NEXT: if (!Check(S, DecodeRegClassRegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; }		// CHECK-NEXT: if (!Check(S, DecodeRegClassRegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; }
// CHECK-NEXT: tmp = fieldFromInstruction(insn, 0, 3);		// CHECK-NEXT: tmp = fieldFromInstruction(insn, 0, 3);
// CHECK-NEXT: if (!Check(S, DecodeRegClassRegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; }		// CHECK-NEXT: if (!Check(S, DecodeRegClassRegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; }
// CHECK-NEXT: tmp = fieldFromInstruction(insn, 11, 16);		// CHECK-NEXT: tmp = fieldFromInstruction(insn, 11, 16);
// CHECK-NEXT: MI.addOperand(MCOperand::createImm(tmp));		// CHECK-NEXT: if (!Check(S, myCustomDecoder(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; }
// CHECK-NEXT: return S;		// CHECK-NEXT: return S;
// CHECK-NEXT: case 1:		// CHECK-NEXT: case 1:
// CHECK-NEXT: tmp = fieldFromInstruction(insn, 8, 3);		// CHECK-NEXT: tmp = fieldFromInstruction(insn, 8, 3);
// CHECK-NEXT: if (!Check(S, DecodeRegClassRegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; }		// CHECK-NEXT: if (!Check(S, DecodeRegClassRegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; }
// CHECK-NEXT: tmp = fieldFromInstruction(insn, 0, 3);		// CHECK-NEXT: tmp = fieldFromInstruction(insn, 0, 3);
// CHECK-NEXT: if (!Check(S, DecodeRegClassRegisterClass(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; }		// CHECK-NEXT: if (!Check(S, myCustomDecoder(MI, tmp, Address, Decoder))) { return MCDisassembler::Fail; }
// CHECK-NEXT: tmp = 0x0;		// CHECK-NEXT: tmp = 0x0;
// CHECK-NEXT: insertBits(tmp, fieldFromInstruction(insn, 11, 16), 16, 16);		// CHECK-NEXT: insertBits(tmp, fieldFromInstruction(insn, 11, 16), 16, 16);
// CHECK-NEXT: insertBits(tmp, fieldFromInstruction(insn, 27, 16), 0, 16);		// CHECK-NEXT: insertBits(tmp, fieldFromInstruction(insn, 27, 16), 0, 16);
// CHECK-NEXT: MI.addOperand(MCOperand::createImm(tmp));		// CHECK-NEXT: MI.addOperand(MCOperand::createImm(tmp));
// CHECK-NEXT: return S;		// CHECK-NEXT: return S;

// CHECK-LABEL: case MCD::OPC_ExtractField: {		// CHECK-LABEL: case MCD::OPC_ExtractField: {
// CHECK: makeUp(insn, Start + Len);		// CHECK: makeUp(insn, Start + Len);

// CHECK-LABEL: case MCD::OPC_CheckField: {		// CHECK-LABEL: case MCD::OPC_CheckField: {
// CHECK: makeUp(insn, Start + Len);		// CHECK: makeUp(insn, Start + Len);

// CHECK-LABEL: case MCD::OPC_Decode: {		// CHECK-LABEL: case MCD::OPC_Decode: {
// CHECK: Len = InstrLenTable[Opc];		// CHECK: Len = InstrLenTable[Opc];
// CHECK-NEXT: makeUp(insn, Len);		// CHECK-NEXT: makeUp(insn, Len);

llvm/utils/TableGen/DecoderEmitter.cpp

Show First 20 Lines • Show All 1,887 Lines • ▼ Show 20 Lines for (auto &EncodingSegment : VLI) {

} }

if (!OpName.empty()) { if (!OpName.empty()) {

auto OpSubOpPair = auto OpSubOpPair =

const_cast<CodeGenInstruction &>(CGI).Operands.ParseOperandName( const_cast<CodeGenInstruction &>(CGI).Operands.ParseOperandName(

OpName); OpName);

unsigned OpIdx = CGI.Operands.getFlattenedOperandNumber(OpSubOpPair); unsigned OpIdx = CGI.Operands.getFlattenedOperandNumber(OpSubOpPair);

Operands[OpIdx].addField(CurrBitPos, EncodingSegment.BitWidth, Offset); Operands[OpIdx].addField(CurrBitPos, EncodingSegment.BitWidth, Offset);

if (!EncodingSegment.CustomDecoder.empty())

0x59616eUnsubmitted

Done

Operands[OpIdx].addField(CurrBitPos, EncodingSegment.BitWidth, Offset);

- if (EncodingSegment.CustomDecoder.size())

+ if (!EncodingSegment.CustomDecoder.empty())

Operands[OpIdx].Decoder = EncodingSegment.CustomDecoder.str();

nit: Use !empty for better readability.

0x59616e: nit: Use `!empty` for better readability.

Operands[OpIdx].Decoder = EncodingSegment.CustomDecoder.str();

int TiedReg = TiedTo[OpSubOpPair.first]; int TiedReg = TiedTo[OpSubOpPair.first];

if (TiedReg != -1) { if (TiedReg != -1) {

unsigned OpIdx = CGI.Operands.getFlattenedOperandNumber( unsigned OpIdx = CGI.Operands.getFlattenedOperandNumber(

std::make_pair(TiedReg, OpSubOpPair.second)); std::make_pair(TiedReg, OpSubOpPair.second));

Operands[OpIdx].addField(CurrBitPos, EncodingSegment.BitWidth, Offset); Operands[OpIdx].addField(CurrBitPos, EncodingSegment.BitWidth, Offset);

} }

▲ Show 20 Lines • Show All 868 Lines • Show Last 20 Lines

llvm/utils/TableGen/VarLenCodeEmitterGen.h

	Show All 16 Lines
	#include "llvm/TableGen/Record.h"			#include "llvm/TableGen/Record.h"

	namespace llvm {			namespace llvm {

	struct EncodingSegment {			struct EncodingSegment {
	unsigned BitWidth;			unsigned BitWidth;
	const Init *Value;			const Init *Value;
	StringRef CustomEncoder = "";			StringRef CustomEncoder = "";
				StringRef CustomDecoder = "";
	};			};

	class VarLenInst {			class VarLenInst {
	const RecordVal *TheDef;			const RecordVal *TheDef;
	size_t NumBits;			size_t NumBits;

	// Set if any of the segment is not fixed value.			// Set if any of the segment is not fixed value.
	bool HasDynamicSegment;			bool HasDynamicSegment;

	SmallVector<EncodingSegment, 4> Segments;			SmallVector<EncodingSegment, 4> Segments;

	void buildRec(const DagInit *DI);			void buildRec(const DagInit *DI);

	StringRef getCustomEncoderName(const Init *EI) const {
	if (const auto *DI = dyn_cast<DagInit>(EI)) {
	if (DI->getNumArgs() && isa<StringInit>(DI->getArg(0)))
	return cast<StringInit>(DI->getArg(0))->getValue();
	}
	return "";
	}

	public:			public:
	VarLenInst() : TheDef(nullptr), NumBits(0U), HasDynamicSegment(false) {}			VarLenInst() : TheDef(nullptr), NumBits(0U), HasDynamicSegment(false) {}

	explicit VarLenInst(const DagInit DI, const RecordVal TheDef);			explicit VarLenInst(const DagInit DI, const RecordVal TheDef);

	/// Number of bits			/// Number of bits
	size_t size() const { return NumBits; }			size_t size() const { return NumBits; }

	Show All 13 Lines

llvm/utils/TableGen/VarLenCodeEmitterGen.cpp

Show First 20 Lines • Show All 77 Lines • ▼ Show 20 Lines class VarLenCodeEmitterGen {

std::string getInstructionCaseForEncoding(Record *R, Record *EncodingDef, std::string getInstructionCaseForEncoding(Record *R, Record *EncodingDef,

CodeGenTarget &Target); CodeGenTarget &Target);

public: public:

explicit VarLenCodeEmitterGen(RecordKeeper &R) : Records(R) {} explicit VarLenCodeEmitterGen(RecordKeeper &R) : Records(R) {}

void run(raw_ostream &OS); void run(raw_ostream &OS);

}; };

} // end anonymous namespace } // end anonymous namespace

// Get the name of custom encoder or decoder, if there is any.

barannikov88Unsubmitted

Done

The guideline is to limit anonymous namespaces to class declarations and use static everywhere else.
The function does not protect against users, it needs proper error checking.

barannikov88: * The guideline is to limit anonymous namespaces to class declarations and use `static`…

// Returns `{encoder name, decoder name}`.

static std::pair<StringRef, StringRef> getCustomCoders(ArrayRef<Init *> Args) {

std::pair<StringRef, StringRef> Result;

for (const auto *Arg : Args) {

const auto *DI = dyn_cast<DagInit>(Arg);

if (!DI)

continue;

const Init *Op = DI->getOperator();

if (!isa<DefInit>(Op))

continue;

// syntax: `(<encoder | decoder> "function name")`

StringRef OpName = cast<DefInit>(Op)->getDef()->getName();

if (OpName != "encoder" && OpName != "decoder")

continue;

if (!DI->getNumArgs() || !isa<StringInit>(DI->getArg(0)))

PrintFatalError("expected '" + OpName +

"' directive to be followed by a custom function name.");

StringRef FuncName = cast<StringInit>(DI->getArg(0))->getValue();

if (OpName == "encoder")

Result.first = FuncName;

else

Result.second = FuncName;

}

return Result;

}

VarLenInst::VarLenInst(const DagInit *DI, const RecordVal *TheDef) VarLenInst::VarLenInst(const DagInit *DI, const RecordVal *TheDef)

: TheDef(TheDef), NumBits(0U) { : TheDef(TheDef), NumBits(0U) {

buildRec(DI); buildRec(DI);

for (const auto &S : Segments) for (const auto &S : Segments)

NumBits += S.BitWidth; NumBits += S.BitWidth;

} }

void VarLenInst::buildRec(const DagInit *DI) { void VarLenInst::buildRec(const DagInit *DI) {

Show All 21 Lines for (; i != e; i += s) {

} else if (const auto *SubDI = dyn_cast<DagInit>(Arg)) { } else if (const auto *SubDI = dyn_cast<DagInit>(Arg)) {

buildRec(SubDI); buildRec(SubDI);

} else { } else {

PrintFatalError(TheDef->getLoc(), "Unrecognized type of argument in `" + PrintFatalError(TheDef->getLoc(), "Unrecognized type of argument in `" +

Op + "`: " + Arg->getAsString()); Op + "`: " + Arg->getAsString());

} }

} else if (Op == "operand") { } else if (Op == "operand") {

// (operand <operand name>, <# of bits>, [(encoder <custom encoder>)]) // (operand <operand name>, <# of bits>,

// [(encoder <custom encoder>)][, (decoder <custom decoder>)])

if (DI->getNumArgs() < 2) if (DI->getNumArgs() < 2)

PrintFatalError(TheDef->getLoc(), PrintFatalError(TheDef->getLoc(),

"Expecting at least 2 arguments for `operand`"); "Expecting at least 2 arguments for `operand`");

HasDynamicSegment = true; HasDynamicSegment = true;

const Init *OperandName = DI->getArg(0), *NumBits = DI->getArg(1); const Init *OperandName = DI->getArg(0), *NumBits = DI->getArg(1);

if (!isa<StringInit>(OperandName) || !isa<IntInit>(NumBits)) if (!isa<StringInit>(OperandName) || !isa<IntInit>(NumBits))

PrintFatalError(TheDef->getLoc(), "Invalid argument types for `operand`"); PrintFatalError(TheDef->getLoc(), "Invalid argument types for `operand`");

auto NumBitsVal = cast<IntInit>(NumBits)->getValue(); auto NumBitsVal = cast<IntInit>(NumBits)->getValue();

if (NumBitsVal <= 0) if (NumBitsVal <= 0)

PrintFatalError(TheDef->getLoc(), "Invalid number of bits for `operand`"); PrintFatalError(TheDef->getLoc(), "Invalid number of bits for `operand`");

StringRef CustomEncoder; auto [CustomEncoder, CustomDecoder] =

if (DI->getNumArgs() >= 3) getCustomCoders(DI->getArgs().slice(2));

CustomEncoder = getCustomEncoderName(DI->getArg(2)); Segments.push_back({static_cast<unsigned>(NumBitsVal), OperandName,

Segments.push_back( CustomEncoder, CustomDecoder});

{static_cast<unsigned>(NumBitsVal), OperandName, CustomEncoder});

} else if (Op == "slice") { } else if (Op == "slice") {

0x59616eUnsubmitted

Done

PrintFatalError(TheDef->getLoc(), "Invalid number of bits for `operand`");

- StringRef CustomEncoder, CustomDecoder;

- if (DI->getNumArgs() >= 3)

- std::tie(CustomEncoder, CustomDecoder) =

- getCustomCoders(DI->getArgs().slice(2));

+ auto [CustomEncoder, CustomDecoder] = getCustomCoders(DI->getArgs().slice(2));

Segments.push_back({static_cast<unsigned>(NumBitsVal), OperandName,

nit: getCustomCoders will return empty StringRef automatically if DI->getNumArgs() == 2. So this if statement is unnecessary.

0x59616e: nit: `getCustomCoders` will return empty `StringRef` automatically if `DI->getNumArgs()` == 2.

myhsuAuthorUnsubmitted

Done

good catch, thanks!

myhsu: good catch, thanks!

// (slice <operand name>, <high / low bit>, <low / high bit>, // (slice <operand name>, <high / low bit>, <low / high bit>,

// [(encoder <custom encoder>)]) // [(encoder <custom encoder>)][, (decoder <custom decoder>)])

if (DI->getNumArgs() < 3) if (DI->getNumArgs() < 3)

PrintFatalError(TheDef->getLoc(), PrintFatalError(TheDef->getLoc(),

"Expecting at least 3 arguments for `slice`"); "Expecting at least 3 arguments for `slice`");

HasDynamicSegment = true; HasDynamicSegment = true;

Init *OperandName = DI->getArg(0), *HiBit = DI->getArg(1), Init *OperandName = DI->getArg(0), *HiBit = DI->getArg(1),

*LoBit = DI->getArg(2); *LoBit = DI->getArg(2);

if (!isa<StringInit>(OperandName) || !isa<IntInit>(HiBit) || if (!isa<StringInit>(OperandName) || !isa<IntInit>(HiBit) ||

!isa<IntInit>(LoBit)) !isa<IntInit>(LoBit))

PrintFatalError(TheDef->getLoc(), "Invalid argument types for `slice`"); PrintFatalError(TheDef->getLoc(), "Invalid argument types for `slice`");

auto HiBitVal = cast<IntInit>(HiBit)->getValue(), auto HiBitVal = cast<IntInit>(HiBit)->getValue(),

LoBitVal = cast<IntInit>(LoBit)->getValue(); LoBitVal = cast<IntInit>(LoBit)->getValue();

if (HiBitVal < 0 || LoBitVal < 0) if (HiBitVal < 0 || LoBitVal < 0)

PrintFatalError(TheDef->getLoc(), "Invalid bit range for `slice`"); PrintFatalError(TheDef->getLoc(), "Invalid bit range for `slice`");

bool NeedSwap = false; bool NeedSwap = false;

unsigned NumBits = 0U; unsigned NumBits = 0U;

if (HiBitVal < LoBitVal) { if (HiBitVal < LoBitVal) {

NeedSwap = true; NeedSwap = true;

NumBits = static_cast<unsigned>(LoBitVal - HiBitVal + 1); NumBits = static_cast<unsigned>(LoBitVal - HiBitVal + 1);

} else { } else {

NumBits = static_cast<unsigned>(HiBitVal - LoBitVal + 1); NumBits = static_cast<unsigned>(HiBitVal - LoBitVal + 1);

} }

StringRef CustomEncoder; auto [CustomEncoder, CustomDecoder] =

if (DI->getNumArgs() >= 4) getCustomCoders(DI->getArgs().slice(3));

CustomEncoder = getCustomEncoderName(DI->getArg(3));

0x59616eUnsubmitted

Done

ditto

0x59616e: ditto

if (NeedSwap) { if (NeedSwap) {

// Normalization: Hi bit should always be the second argument. // Normalization: Hi bit should always be the second argument.

Init *const NewArgs[] = {OperandName, LoBit, HiBit}; Init *const NewArgs[] = {OperandName, LoBit, HiBit};

Segments.push_back({NumBits, Segments.push_back({NumBits,

DagInit::get(DI->getOperator(), nullptr, NewArgs, {}), DagInit::get(DI->getOperator(), nullptr, NewArgs, {}),

CustomEncoder}); CustomEncoder, CustomDecoder});

} else { } else {

Segments.push_back({NumBits, DI, CustomEncoder}); Segments.push_back({NumBits, DI, CustomEncoder, CustomDecoder});

} }

void VarLenCodeEmitterGen::run(raw_ostream &OS) { void VarLenCodeEmitterGen::run(raw_ostream &OS) {

CodeGenTarget Target(Records); CodeGenTarget Target(Records);

0x59616eUnsubmitted

Done

getCustomCoders seems unnecessary to be a member function of VarLenInst ?

0x59616e: `getCustomCoders` seems unnecessary to be a member function of `VarLenInst` ?

auto Insts = Records.getAllDerivedDefinitions("Instruction"); auto Insts = Records.getAllDerivedDefinitions("Instruction");

auto NumberedInstructions = Target.getInstructionsByEnumValue(); auto NumberedInstructions = Target.getInstructionsByEnumValue();

const CodeGenHwModes &HWM = Target.getHwModes(); const CodeGenHwModes &HWM = Target.getHwModes();

// The set of HwModes used by instruction encodings. // The set of HwModes used by instruction encodings.

std::set<unsigned> HwModes; std::set<unsigned> HwModes;

for (const CodeGenInstruction *CGI : NumberedInstructions) { for (const CodeGenInstruction *CGI : NumberedInstructions) {

Record *R = CGI->TheDef; Record *R = CGI->TheDef;

0x59616eUnsubmitted

Done

continue;

- StringRef OpName = cast<DefInit>(Op)->getDef()->getName(),

- FuncName = cast<StringInit>(DI->getArg(0))->getValue();

+ StringRef OpName = cast<DefInit>(Op)->getDef()->getName();

+ StringRef FuncName = cast<StringInit>(DI->getArg(0))->getValue();

if (OpName == "encoder")

nit: Avoid declaring multiple variables in the same statement;

0x59616e: nit: Avoid declaring multiple variables in the same statement;

// Create the corresponding VarLenInst instance. // Create the corresponding VarLenInst instance.

if (R->getValueAsString("Namespace") == "TargetOpcode" || if (R->getValueAsString("Namespace") == "TargetOpcode" ||

R->getValueAsBit("isPseudo")) R->getValueAsBit("isPseudo"))

continue; continue;

0x59616eUnsubmitted

Done

for (const auto *Arg : Args) {

- if (const auto *DI = dyn_cast<DagInit>(Arg)) {

- const Init *Op = DI->getOperator();

- // syntax: `(<encoder | decoder> "function name")`

- if (!isa<DefInit>(Op) || !DI->getNumArgs() ||

+ const auto *DI = dyn_cast<DagInit>(Arg);

+ if (!DI)

+ continue;

+ const Init *Op = DI->getOperator();

+ // syntax: `(<encoder | decoder> "function name")`

+ if (!isa<DefInit>(Op) || !DI->getNumArgs() ||

!isa<StringInit>(DI->getArg(0)))

- continue;

- StringRef OpName = cast<DefInit>(Op)->getDef()->getName(),

- FuncName = cast<StringInit>(DI->getArg(0))->getValue();

- if (OpName == "encoder")

- Result.first = FuncName;

- else if (OpName == "decoder")

- Result.second = FuncName;

- }

+ continue;

+ StringRef OpName = cast<DefInit>(Op)->getDef()->getName(),

+ FuncName = cast<StringInit>(DI->getArg(0))->getValue();

+ if (OpName == "encoder")

+ Result.first = FuncName;

+ else if (OpName == "decoder")

+ Result.second = FuncName;

}

return Result;

Use early exit to reduce indentation for better readability.

0x59616e: Use early exit to reduce indentation for better readability.

if (const RecordVal *RV = R->getValue("EncodingInfos")) { if (const RecordVal *RV = R->getValue("EncodingInfos")) {

if (auto *DI = dyn_cast_or_null<DefInit>(RV->getValue())) { if (auto *DI = dyn_cast_or_null<DefInit>(RV->getValue())) {

EncodingInfoByHwMode EBM(DI->getDef(), HWM); EncodingInfoByHwMode EBM(DI->getDef(), HWM);

for (auto &KV : EBM) { for (auto &KV : EBM) {

HwModes.insert(KV.first); HwModes.insert(KV.first);

Record *EncodingDef = KV.second; Record *EncodingDef = KV.second;

RecordVal *RV = EncodingDef->getValue("Inst"); RecordVal *RV = EncodingDef->getValue("Inst");

DagInit *DI = cast<DagInit>(RV->getValue()); DagInit *DI = cast<DagInit>(RV->getValue());

▲ Show 20 Lines • Show All 278 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[TableGen] Support custom decoders for variable length instructionsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 491994

llvm/include/llvm/Target/Target.td

llvm/test/TableGen/VarLenDecoder.td

llvm/utils/TableGen/DecoderEmitter.cpp

llvm/utils/TableGen/VarLenCodeEmitterGen.h

llvm/utils/TableGen/VarLenCodeEmitterGen.cpp

[TableGen] Support custom decoders for variable length instructions
ClosedPublic