Download Raw Diff

Details

Reviewers

jyknight
t.p.northover
theraven

Commits

rG24d9b13b36bd: [RISCV 4/10] Add basic RISCV{InstrFormats,InstrInfo,RegisterInfo,}.td
rL285769: [RISCV 4/10] Add basic RISCV{InstrFormats,InstrInfo,RegisterInfo,}.td

Summary

For now, only add instruction definitions for basic ALU operations. Our initial target is a working MC layer rather than codegen, so appropriate SelectionDAG patterns will come later.

Diff Detail

Repository: rL LLVM

Event Timeline

asb updated this revision to Diff 68187.Aug 16 2016, 8:25 AM

asb retitled this revision from to [RISCV 4/10] Add basic RISCV{InstrFormats,InstrInfo,RegisterInfo,}.td.

asb updated this object.

asb added reviewers: theraven, jyknight.

asb added a subscriber: llvm-commits.

emaste added a subscriber: emaste.Aug 16 2016, 2:42 PM

simoncook added a subscriber: simoncook.Aug 17 2016, 6:56 AM

jleidel added a subscriber: jleidel.Aug 17 2016, 10:50 AM

t.p.northover added a subscriber: t.p.northover.Aug 17 2016, 12:01 PM

t.p.northover added inline comments.

lib/Target/RISCV/RISCVInstrFormats.td

33 ↗

(On Diff #68187)

Nit: since the spec seems to list instructions from bit 31 down to 0 (as do TableGen 0bxxxx literals), definitions might read more naturally if you make the order "funct7, funct3, opcode".

97 ↗

(On Diff #68187)

This immediate doesn't appear to have a bit 0 in the spec (it fits into exactly the same fields as imm12 does).

So what we're really deciding here is what the immediate in the MCInst representation of "BEQ #x" should be. Valid answers are "x" and "the bits representing how x is encoded", with newer targets tending to favour the latter.

This is mostly because it makes it harder by design to construct an invalid MCInst when you're doing it manually. Other than that, it's just punting complexity around the various MC layer components:

MC component	with "x"	with bits
AsmParser	nop	encodes
Disassembler	decodes	nop
AsmPrinter	nop	decodes
Fixup handling	encodes	nop

Either way "imm13" is probably misleading as a name and it's probably a good idea to be consistent between formats if possible (I note FU has taken the second approach).

darthcloud added a subscriber: darthcloud.Aug 18 2016, 4:09 AM

asb added inline comments.Aug 18 2016, 8:09 AM

lib/Target/RISCV/RISCVInstrFormats.td
33 ↗	(On Diff #68187)	That's a good point. Obviously the advantage of listing the opcode first is it reads more logically when writing something in RISCVInstrInfo that inherits from it. i.e. you first specify the high-level opcode then might select a more specific options by varying the funct bits. More closely matching the presentation in the spec is attractive though.
97 ↗	(On Diff #68187)	I am incredibly glad you raised this issue, as it's something I've agonised back and forth over. I considered writing something up about this in the patch description, but thought it may be of limited interest. The same issue you raise applies to FUJ vs FU. I started going in the direction of `imm13` because: this better matches the spec which describes immediates in terms of slicing the integer logically represented the idea that the MCInst representation of 'BEQ #x` should be 'x' appealed to me. The logical value is a 13-bit immediate with the restriction that the LSB must be 0, but of course this is encoded in 12 bits. Arguably imm13m0 would be more descriptive (imm13, masked bit0) but this description is perhaps a little opaque to most. As you've spotted, I ended up going a different direction with FU. After deciding `imm32` was misleading, I tried out with `imm20shl12` as a description of the value that's logically a 20-bit immediate shifted left 12 places. However I think I found that for whatever reason the code manipulating this didn't read cleanly. The argument that now FU has its arguments defined in terms of the bit encoding, FSB and FUJ are inconsistent with it is persuasive. I'll modify so FSB takes an imm12 and FUJ takes an imm20.

reames added a subscriber: reames.Aug 21 2016, 12:08 PM

reames added inline comments.

lib/Target/RISCV/RISCVRegisterInfo.td
18 ↗	(On Diff #68187)	Minor: Ri might be overly terse. RegID? Or something which gives some context when seen elsewhere?

asb added inline comments.Aug 22 2016, 2:57 AM

lib/Target/RISCV/RISCVInstrFormats.td
97 ↗	(On Diff #68187)	Do you have any thoughts on a clearer naming convention? Suppose I change FUJ so it takes an imm20 (i.e. the bits to be encoded), there are a few choices for naming the operand (shown below in context: `def JAL : FUJ<0b1101111, (outs GPR:$rd), (ins simm21_mask1:$imm20)`. The argument for this would be that `simm21_mask1` describes the logical value rather than the physical encoding and we don't have to describe the value in terms of either encoding or decoding. As part of `def simm21_mask1 : Operand<i32>` we would add a `DecoderMethod` and `EncoderMethod` that converts to/from the imm20 bit encoding `def JAL : FUJ<0b1101111, (outs GPR:$rd), (ins simm20_lsl1:$imm20)`. This matches what the Mips backend has done. The name reflects how to go from the encoded 20-bit representation to the logical value that represents. `def JAL : FUJ<0b1101111, (outs GPR:$rd), (ins simm21_asr1:$imm20)`. The name reflects how to go from the logical value (the signed 21-bit offset) to the 20-bit encoding. It's not clear to me why it would be more logical to use option 2) vs option 3), which is what tempts me to stick with something along the lines of the current name. `simm21_mask1` describes the logical value (a signed 21-bit value with the LSB guaranteed to be zero, i.e. has been masked `& 1`.

jordy.potman.lists added a subscriber: jordy.potman.lists.Aug 24 2016, 10:25 AM

Changed the order of parameters in RISCVInstrFormats.td so it more closely matches the RISC-V spec. As usggested by @t.p.northover, changed the way of naming the immediate in FUJ and FSB (refer to them as imm20 and imm12 which matches the number of bits stored in the instruction). Address comment from @reames about Ri being rather terse.

lib/Target/RISCV/RISCVRegisterInfo.td
18 ↗	(On Diff #68187)	It's essentially never referred to again in user-written code - on the one hand this means it only needs to make sense in the context of this file, on the other it means there's little advantage in being terse. Other archs seem to call it ArchReg so I've renamed to RISCVReg.

Comments inline.

lib/Target/RISCV/RISCV.td
23 ↗	(On Diff #69349)	Should these not be `Processor`, rather than `ProcessorModel`. I was under the impression that `Processor` is intended for generic families, whereas `ProcessorModel` was intended for specific microarchitectures. It would be nice to have Rocket as a specific `ProcessorModel` here. It may be in a later review, but we should probably also have `SubtargetFeature` flags for floating point, atomics, and so on here too.

jordy.potman.lists added inline comments.Aug 26 2016, 10:36 AM

lib/Target/RISCV/RISCVInstrFormats.td
143 ↗	(On Diff #69349)	Shouldn't this now be bits<20> imm20; ?

Rename ProcessorModels to generic-rv32 and generic-rv64 and fix incorrect immediate size spotted by @jordy.potman.lists

asb added a subscriber: hfinkel.Aug 26 2016, 12:49 PM

asb added inline comments.

lib/Target/RISCV/RISCV.td
23 ↗	(On Diff #69349)	I think actually `ProcessorModel` is preferred as it exposes the more modern `SchedMachineModel` rather than the legacy `ProcessorItineraries`. At least X86, AArch64, and Lanai describe generic `ProcessorModel`s. See this email from @hfinkel, who can perhaps confirm my interpretation http://lists.llvm.org/pipermail/llvm-dev/2015-November/092214.html. It looks like an uppercase `ProcessorModel` name is uncommon though, so I will change these to `generic-rv32` and `generic-rv64`. Having a Rocket `ProcessorModel` and scheduling model is definitely part of the plan, and later patches will add appropriate `SubtargetFeature` flags.
lib/Target/RISCV/RISCVInstrFormats.td
143 ↗	(On Diff #69349)	It should be - thanks!

I think this looks reasonable now.

This revision is now accepted and ready to land.Aug 26 2016, 2:33 PM

jyknight added inline comments.Aug 26 2016, 9:39 PM

lib/Target/RISCV/RISCVRegisterInfo.td
27 ↗	(On Diff #68187)	Are the risc-v dwarf register numbers documented anywhere? (I assume not, just like the relocation types, but thought I'd ask anyways. Maybe you can start a list of Things That Ought To Be Documented...)
63 ↗	(On Diff #68187)	May want to use a different register allocation sequence. Typically backends put caller-save registers first, then callee save. In RISCV, we probably also want to prefer registers in the range X8-X15, since they're usable by the compressed instructions. How about putting the registers in the order: X10-X17 [a0-a7] , X5-X7 [t0-t2], X28-X31 [t3-t6], X8-X9 [s0-s1], X18-X27 [s2-s11], X0-X4 [zero, ra, sp, gp, tp] That is: caller-save between x10-15, then the remaining caller-save, then callee-save, then specials (which will be reserved, anyways)
66 ↗	(On Diff #68187)	I think using one set of registers for both the 32-bit arch and the 64-bit arch doesn't actually work right -- you rather want separate 32bit and 64bit versions of the registers (perhaps name them X{0..31}_32 and X{0..31}_64 for clarity), to make up the GPR32 and GPR64 register classes. The SPARC backend actually does something like you have here, but I'm pretty sure its 64-bit support is fairly busted and is the wrong way to do things -- better to look at, perhaps, the PPC backend. Could name the registers X{0..31}_32 and X{0..31}_64.

asb marked an inline comment as done.Aug 31 2016, 5:25 AM

asb added inline comments.

lib/Target/RISCV/RISCVRegisterInfo.td
28 ↗	(On Diff #69423)	They're not. I have added that to the list.
64 ↗	(On Diff #69423)	There have been some responses to this that haven't been picked up by phabricator: 1, 2. To restate the discussion there - I would like to initially focus on MC concerns in this patchset and only introduce codegen concerns when they can be tested. Would you be happy with a comment here explaining that the allocation order should be changed when codegen is implemented?
66 ↗	(On Diff #69423)	Could you elaborate on where things go wrong for SPARC? If it's conceptual failing of having a single set of register IDs, what precisely is the issue? There doesn't for instance seem to be an assumption that any given `Register` is a member of only a single `RegisterClass`. Or is it likely due to bugs in tablegen or elsewhere? In PPC and AArch64, the 32-bit registers are defined as subregisters of the 64-bit GPRs but I'm not sure if this is as correct for RISC-V. The compiler is either targeting RV32 or RV64, and when targeting RV64 there is no instruction that will modify only 32-bits of the register. Values are always held in sign-extended format so bit 31 will be copied in bits 32-63 if you for instance execute an `ADDW` or one of the other RV64 instructions defined to work on 32-bit quantities. Load instructions defined to either zero bits 32-63 or to sign-extend the loaded value.

asb marked an inline comment as done.Aug 31 2016, 7:47 AM

asb added inline comments.

lib/Target/RISCV/RISCVRegisterInfo.td
66 ↗	(On Diff #69423)	Ok, having had a look at this some more I think I mostly answered my own question. When writing to a register in RV64 (e.g. with `ADDW`) we do indeed need to model in-register sign extension, which I think is similar to MIPS. But the operands to `ADDW` or `SUBW` come from truncating the 64-bit register. Arguably the most sensible way of modelling this is by having a 32-bit register which is a subregister of the matching 64-bit GPR. What I really want is that almost all instructions are defined as taking register operands from the 'GPR' set, which depending on if the target is RV32 or RV64 may be GPR32 or GPR64 (or in the future with RV128, GPR128). A small number of instructions as `SLLW` are defined to take a 32-bit subregisters, and in the future with RV128 we would have `ADDD`, `SLLD` and friends which take the 64-bit subregister of one of the GPR128 register set. I'll have more of a think about what I want to do here.

jyknight added inline comments.Aug 31 2016, 3:39 PM

lib/Target/RISCV/RISCVRegisterInfo.td
64 ↗	(On Diff #69423)	Yes, a TODO note sounds fine. I just didn't want it to get forgotten just because a future change might not touch this line of code.
66 ↗	(On Diff #69423)	Originally, the SPARC 64bit support was written by reusing the 32bit instruction patterns in r178527. This definitely didn't work, because the instructions specified the 32-bit register class. It apparently "almost" worked though -- until the return value needed to be spilled, and got stored in 4-bytes of memory instead of 8. That particular thing been fixed since -- by adding separate 64-bit instructions -- but the legacy of specifying a single register set still remains. I'm not really sure what exact problems it causes, I've not really pushed on sparc64, since I only really care about sparc32. But I do think the right thing really is to specify separate 32bit and 64bit registers (with a subreg relationship), and separately specify the integer instructions for each mode. As far as modeling ADDW, I believe it's fine to model it as having the 32-bit subregs as input/outputs; that it mangles the upper bits while doing so should be fine.

Apologies for the slight delay, organising last week's LLVM Cauldron and other activities in my life had limited my time.

As discussed here, we really want to have two separate register classes for RV32 and RV64 and separate instruction definitions to match this. In RISC-V, an RV32 add and an RV64 add (and indeed, a future RV128 add) all use the same encoding. There seem to be a number of possible approaches to handling this:

As done by other backends such as Sparc would, ust define every instruction twice.
Make classes in RISCVInstrInfo like ALU_rr multiclasses that define both 32-bit and 64-bit versions of instructions.
You could also imagine extending TableGen with some sort of AST macro support or a new tablegen option that would handle substituting the register class for each defined instruction
A foreach at the top of RISCVInstrInfo that iterates over each register width (for now, 32 and 64-bit). Each def would be to be written like def ADD#x and some !cast or !if magic would be needed to get the appropriate RegisterClass based on the register width.
Move all class definitions to the top of RISCVInstrInfo and make each ALU_rr etch take a RegisterClass parameter. Then define a multiclass used to parametrise all instruction definitions.

I have trialled the last option from the list above. You can see a version of RISCVInstrInfo.td (based on the complete patchset) that implements this option at paste P7637. For those subscribing to this review - I'd really appreciate your comment on if you feel this is the best way to go about it (particularly TableGen wizards like @t.p.northover). Given the current patchset aims to focus on MC, I propose actually making this change in a later patch when RV32/RV64 codegen is introduced.

As done by other backends such as Sparc would, ust define every instruction twice.

Let's use PPC rather than Sparc as our baseline.

IMO, this simple thing of just defining the instructions twice probably makes the most sense to start out with. Don't forget you also need different types in the patterns for the different modes, not just different register classes -- afaik, you can't do that with a multiclass.

If, after everything's more finished, it does turn out to be easy to merge them with a multiclass macro, then OK. I'm skeptical that it will be easier that way, but that's fine. :)

On the other hand, it's also easy to split them apart later, so if you want to start with what you have in the pastebin and split it later on when/if becomes cumbersome that way, that's fine too. :)

In D23561#542222, @asb wrote:

As discussed here, we really want to have two separate register classes for RV32 and RV64 and separate instruction definitions to match this. In RISC-V, an RV32 add and an RV64 add (and indeed, a future RV128 add) all use the same encoding.

On a somewhat longer-term note:

We have the exact same situation with HVX: the vector registers can be 64 or 128-byte long, depending on the processor mode. The actual encodings are identical between the two modes, so it's possible to have a single binary that would work in both modes. We have every HVX instruction defined twice, and separate register classes for both types of registers. This is a pain and a mess, and I have been planning to get rid of that for quite some time now.

Since this type of situation now appears in several targets, I hope that this will be enough of a rationale to develop a proper support for this issue, namely register class with a non-constant register size/alignment. This should be fairly simple, actually, and I can develop a prototype for review, hopefully in a few days.

This shouldn't stop you from proceeding with a currently available solution, since I'm not sure how long it would take to develop a working approach that avoids the duplication.

In D23561#542479, @kparzysz wrote:

In D23561#542222, @asb wrote:

On a somewhat longer-term note:

We have the exact same situation with HVX: the vector registers can be 64 or 128-byte long, depending on the processor mode. The actual encodings are identical between the two modes, so it's possible to have a single binary that would work in both modes. We have every HVX instruction defined twice, and separate register classes for both types of registers. This is a pain and a mess, and I have been planning to get rid of that for quite some time now.

Since this type of situation now appears in several targets, I hope that this will be enough of a rationale to develop a proper support for this issue, namely register class with a non-constant register size/alignment. This should be fairly simple, actually, and I can develop a prototype for review, hopefully in a few days.

Please add me to the review thread for this. We have the same thing in MIPS and it's even worse in CHERI (where we have 128- and 256-bit variants of the ISA and currently conditionally compile for only one of them).

All of that sounds great! I think it also points strongly towards simply duplicating the instructions in the RISCV target for now, as other targets are doing today, rather than working on half-measures towards improving things here.

Once/if better infrastructure becomes available, it can be changed over then.

@jyknight: Yes, that's a good point, once codegen is added I'll need to thread through a ValueType in a similar way to how I currently pass RegisterClass. I can definitely see the argument that describing instructions twice and abstracting later might lead to the best solution. I think it's definitely worth considering the alternatives early on though, even if I do go ahead with the duplication approach.

@kparzysz: I'm glad to hear you're interested in working to solve this problem and I'm looking forward to studying your proposal. Given that the current patches in review for RISCV focus just on the MC layer without codegen, it may be that there is time for your proposal to be merged before deciding on an approach for the RV32/RV64/RV128 issue. If not, we always have the option of starting duplication or utilising multiclasses with the intent to move to a better approach further down the road.

I will add a comment to explain instruction definitions will need modifying for RV64, and modify the definitions to use GPR32.

kparzysz mentioned this in D24631: [RFC] Implement variable-width register classes, step 1: API changes.Sep 15 2016, 2:38 PM

I have updated RISCVRegisterInfo.td so it defines both the 32-bit and 64-bit GPRs, marking the 32-bit GPRs as subregs. As explained in comments, the hope is that we can later move to using something like D24631.

Herald added subscribers: mgorny, beanz. · View Herald TranscriptOct 8 2016, 6:02 AM

It only shows up upon a clean compile (CMake issue?), but I've just found the introduction of registers with the same assembly name now leads to an assertion during compilation within llvm-tblgen. I'll investigate a fix.

Fix the issue described in my previous comment by not defining an AsmName and AltName for the RISCV64 registers. With this approach, the generated MatchRegisterName and MatchAltRegisterName functions can be used. In the future, a little bit of extra logic in RISCVAsmParser.cpp can be added to coerce a parsed register from 32-bit to 64-bit when desired.

Looks good

jyknight mentioned this in D23566: [RISCV 8/10] Add support for all RV32I instructions.Oct 13 2016, 12:12 PM

Closed by commit rL285769: [RISCV 4/10] Add basic RISCV{InstrFormats,InstrInfo,RegisterInfo,}.td (authored by asb). · Explain WhyNov 1 2016, 4:50 PM

This revision was automatically updated to reflect the committed changes.

Diff 76654

llvm/trunk/lib/Target/RISCV/CMakeLists.txt

				set(LLVM_TARGET_DEFINITIONS RISCV.td)

				tablegen(LLVM RISCVGenRegisterInfo.inc -gen-register-info)
				tablegen(LLVM RISCVGenInstrInfo.inc -gen-instr-info)

				add_public_tablegen_target(RISCVCommonTableGen)

	add_llvm_target(RISCVCodeGen			add_llvm_target(RISCVCodeGen
	RISCVTargetMachine.cpp			RISCVTargetMachine.cpp
	)			)

	add_subdirectory(TargetInfo)			add_subdirectory(TargetInfo)

llvm/trunk/lib/Target/RISCV/RISCV.td

				//===-- RISCV.td - Describe the RISCV Target Machine -------- tablegen --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				include "llvm/Target/Target.td"

				include "RISCVRegisterInfo.td"
				include "RISCVInstrInfo.td"


				def RISCVInstrInfo : InstrInfo;

				def Feature64Bit : SubtargetFeature<"64bit", "HasRV64", "true",
				"Implements RV64">;

				def : ProcessorModel<"generic-rv32", NoSchedModel, []>;

				def : ProcessorModel<"generic-rv64", NoSchedModel, [Feature64Bit]>;

				def RISCV : Target {
				let InstructionSet = RISCVInstrInfo;
				}

llvm/trunk/lib/Target/RISCV/RISCVInstrFormats.td

				//===-- RISCVInstrFormats.td - RISCV Instruction Formats ---- tablegen --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				//===----------------------------------------------------------------------===//
				//
				// These instruction format definitions are structured to match the
				// description in the RISC-V User-Level ISA specification as closely as
				// possible. For instance, the specification describes instructions with the
				// MSB (31st bit) on the left and the LSB (0th bit) on the right. This is
				// reflected in the order of parameters to each instruction class.
				//
				// One area of divergence is in the description of immediates. The
				// specification describes immediate encoding in terms of bit-slicing
				// operations on the logical value represented. The immediate argument to
				// these instruction formats instead represents the bit sequence that will be
				// inserted into the instruction. e.g. although JAL's immediate is logically
				// a 21-bit value (where the LSB is always zero), we describe it as an imm20
				// to match how it is encoded.
				//
				//===----------------------------------------------------------------------===//

				class RISCVInst<dag outs, dag ins, string asmstr, list<dag> pattern>
				: Instruction {
				field bits<32> Inst;
				let Size = 4;

				bits<7> Opcode = 0;

				let Inst{6-0} = Opcode;

				let Namespace = "RISCV";

				dag OutOperandList = outs;
				dag InOperandList = ins;
				let AsmString = asmstr;
				let Pattern = pattern;
				}

				// Pseudo instructions
				class Pseudo<dag outs, dag ins, string asmstr, list<dag> pattern>
				: RISCVInst<outs, ins, asmstr, pattern> {
				let isPseudo = 1;
				}

				class FR<bits<7> funct7, bits<3> funct3, bits<7> opcode, dag outs, dag ins,
				string asmstr, list<dag> pattern> : RISCVInst<outs, ins, asmstr, pattern>
				{
				bits<5> rs2;
				bits<5> rs1;
				bits<5> rd;

				let Inst{31-25} = funct7;
				let Inst{24-20} = rs2;
				let Inst{19-15} = rs1;
				let Inst{14-12} = funct3;
				let Inst{11-7} = rd;
				let Opcode = opcode;
				}

				class FI<bits<3> funct3, bits<7> opcode, dag outs, dag ins, string asmstr, list<dag> pattern>
				: RISCVInst<outs, ins, asmstr, pattern>
				{
				bits<12> imm12;
				bits<5> rs1;
				bits<5> rd;

				let Inst{31-20} = imm12;
				let Inst{19-15} = rs1;
				let Inst{14-12} = funct3;
				let Inst{11-7} = rd;
				let Opcode = opcode;
				}

				class FI32Shift<bit arithshift, bits<3> funct3, bits<7> opcode, dag outs, dag ins, string asmstr, list<dag> pattern>
				: RISCVInst<outs, ins, asmstr, pattern>
				{
				bits<5> shamt;
				bits<5> rs1;
				bits<5> rd;

				let Inst{31} = 0;
				let Inst{30} = arithshift;
				let Inst{29-25} = 0;
				let Inst{24-20} = shamt;
				let Inst{19-15} = rs1;
				let Inst{14-12} = funct3;
				let Inst{11-7} = rd;
				let Opcode = opcode;
				}

				class FS<bits<3> funct3, bits<7> opcode, dag outs, dag ins, string asmstr, list<dag> pattern>
				: RISCVInst<outs, ins, asmstr, pattern>
				{
				bits<12> imm12;
				bits<5> rs2;
				bits<5> rs1;

				let Inst{31-25} = imm12{11-5};
				let Inst{24-20} = rs2;
				let Inst{19-15} = rs1;
				let Inst{14-12} = funct3;
				let Inst{11-7} = imm12{4-0};
				let Opcode = opcode;
				}

				class FSB<bits<3> funct3, bits<7> opcode, dag outs, dag ins, string asmstr, list<dag> pattern>
				: RISCVInst<outs, ins, asmstr, pattern>
				{
				bits<12> imm12;
				bits<5> rs2;
				bits<5> rs1;

				let Inst{31} = imm12{11};
				let Inst{30-25} = imm12{9-4};
				let Inst{24-20} = rs2;
				let Inst{19-15} = rs1;
				let Inst{14-12} = funct3;
				let Inst{11-8} = imm12{3-0};
				let Inst{7} = imm12{10};
				let Opcode = opcode;
				}

				class FU<bits<7> opcode, dag outs, dag ins, string asmstr, list<dag> pattern>
				: RISCVInst<outs, ins, asmstr, pattern>
				{
				bits<20> imm20;
				bits<5> rd;

				let Inst{31-12} = imm20;
				let Inst{11-7} = rd;
				let Opcode = opcode;
				}

				class FUJ<bits<7> opcode, dag outs, dag ins, string asmstr, list<dag> pattern>
				: RISCVInst<outs, ins, asmstr, pattern>
				{
				bits<20> imm20;
				bits<5> rd;

				let Inst{31} = imm20{19};
				let Inst{30-21} = imm20{9-0};
				let Inst{20} = imm20{10};
				let Inst{19-12} = imm20{18-11};
				let Inst{11-7} = rd;
				let Opcode = opcode;
				}

llvm/trunk/lib/Target/RISCV/RISCVInstrInfo.td

				//===-- RISCVInstrInfo.td - Target Description for RISCV ---- tablegen --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// This file describes the RISC-V instructions in TableGen format.
				//
				//===----------------------------------------------------------------------===//

				include "RISCVInstrFormats.td"

				def simm12 : Operand<i32>;

				// As noted in RISCVRegisterInfo.td, the hope is that support for
				// variable-sized register classes will mean that instruction definitions do
				// not need to be duplicated for 32-bit and 64-bit register classes. For now
				// we use 'GPR', which is 32-bit. When codegen for both RV32 and RV64 is
				// added, we will need to duplicate instruction definitions unless a proposal
				// like <http://lists.llvm.org/pipermail/llvm-dev/2016-September/105027.html>
				// is adopted.

				class ALU_ri<bits<3> funct3, string OpcodeStr> :
				FI<funct3, 0b0010011, (outs GPR:$rd), (ins GPR:$rs1, simm12:$imm12),
				OpcodeStr#"\t$rd, $rs1, $imm12", []>
				{
				}

				def ADDI : ALU_ri<0b000, "addi">;
				def SLTI : ALU_ri<0b010, "slti">;
				def SLTIU : ALU_ri<0b011, "sltiu">;
				def XORI : ALU_ri<0b100, "xori">;
				def ORI : ALU_ri<0b110, "ori">;
				def ANDI : ALU_ri<0b111, "andi">;

				class ALU_rr<bits<7> funct7, bits<3> funct3, string OpcodeStr> :
				FR<funct7, funct3, 0b0110011, (outs GPR:$rd), (ins GPR:$rs1, GPR:$rs2),
				OpcodeStr#"\t$rd, $rs1, $rs2", []>
				{
				}

				def ADD : ALU_rr<0b0000000, 0b000, "add">;
				def SUB : ALU_rr<0b0100000, 0b000, "sub">;
				def SLL : ALU_rr<0b0000000, 0b001, "sll">;
				def SLT : ALU_rr<0b0000000, 0b010, "slt">;
				def SLTU : ALU_rr<0b0000000, 0b011, "sltu">;
				def XOR : ALU_rr<0b0000000, 0b100, "xor">;
				def SRL : ALU_rr<0b0000000, 0b101, "srl">;
				def SRA : ALU_rr<0b0100000, 0b101, "sra">;
				def OR : ALU_rr<0b0000000, 0b110, "or">;
				def AND : ALU_rr<0b0000000, 0b111, "and">;

llvm/trunk/lib/Target/RISCV/RISCVRegisterInfo.td

				//===-- RISCVRegisterInfo.td - RISC-V Register defs --------- tablegen --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				//===----------------------------------------------------------------------===//
				// Declarations that describe the RISC-V register file
				//===----------------------------------------------------------------------===//

				let Namespace = "RISCV" in {
				def sub_32 : SubRegIndex<32>;

				class RISCVReg32<bits<5> Enc, string n, list<string> alt = []> : Register<n> {
				let HWEncoding{4-0} = Enc;
				let AltNames = alt;
				}

				// RISCV64 registers don't define an AsmName or AltName. If they specified
				// names aliasing the RISCVReg32 registers, the generation of the default
				// MatchRegisterName/MatchRegisterAltName would fail. When necessary,
				// RISCVAsmParser will need to convert a register number from a RISCVReg32
				// to the equivalent RISCVReg64.
				class RISCVReg64<RISCVReg32 subreg> : Register<""> {
				let HWEncoding{4-0} = subreg.HWEncoding{4-0};
				let SubRegs = [subreg];
				let SubRegIndices = [sub_32];
				}

				def ABIRegAltName : RegAltNameIndex;
				}

				// Integer registers
				let RegAltNameIndices = [ABIRegAltName] in {
				def X0_32 : RISCVReg32<0, "x0", ["zero"]>, DwarfRegNum<[0]>;
				def X1_32 : RISCVReg32<1, "x1", ["ra"]>, DwarfRegNum<[1]>;
				def X2_32 : RISCVReg32<2, "x2", ["sp"]>, DwarfRegNum<[2]>;
				def X3_32 : RISCVReg32<3, "x3", ["gp"]>, DwarfRegNum<[3]>;
				def X4_32 : RISCVReg32<4, "x4", ["tp"]>, DwarfRegNum<[4]>;
				def X5_32 : RISCVReg32<5, "x5", ["t0"]>, DwarfRegNum<[5]>;
				def X6_32 : RISCVReg32<6, "x6", ["t1"]>, DwarfRegNum<[6]>;
				def X7_32 : RISCVReg32<7, "x7", ["t2"]>, DwarfRegNum<[7]>;
				def X8_32 : RISCVReg32<8, "x8", ["s0"]>, DwarfRegNum<[8]>;
				def X9_32 : RISCVReg32<9, "x9", ["s1"]>, DwarfRegNum<[9]>;
				def X10_32 : RISCVReg32<10,"x10", ["a0"]>, DwarfRegNum<[10]>;
				def X11_32 : RISCVReg32<11,"x11", ["a1"]>, DwarfRegNum<[11]>;
				def X12_32 : RISCVReg32<12,"x12", ["a2"]>, DwarfRegNum<[12]>;
				def X13_32 : RISCVReg32<13,"x13", ["a3"]>, DwarfRegNum<[13]>;
				def X14_32 : RISCVReg32<14,"x14", ["a4"]>, DwarfRegNum<[14]>;
				def X15_32 : RISCVReg32<15,"x15", ["a5"]>, DwarfRegNum<[15]>;
				def X16_32 : RISCVReg32<16,"x16", ["a6"]>, DwarfRegNum<[16]>;
				def X17_32 : RISCVReg32<17,"x17", ["a7"]>, DwarfRegNum<[17]>;
				def X18_32 : RISCVReg32<18,"x18", ["s2"]>, DwarfRegNum<[18]>;
				def X19_32 : RISCVReg32<19,"x19", ["s3"]>, DwarfRegNum<[19]>;
				def X20_32 : RISCVReg32<20,"x20", ["s4"]>, DwarfRegNum<[20]>;
				def X21_32 : RISCVReg32<21,"x21", ["s5"]>, DwarfRegNum<[21]>;
				def X22_32 : RISCVReg32<22,"x22", ["s6"]>, DwarfRegNum<[22]>;
				def X23_32 : RISCVReg32<23,"x23", ["s7"]>, DwarfRegNum<[23]>;
				def X24_32 : RISCVReg32<24,"x24", ["s8"]>, DwarfRegNum<[24]>;
				def X25_32 : RISCVReg32<25,"x25", ["s9"]>, DwarfRegNum<[25]>;
				def X26_32 : RISCVReg32<26,"x26", ["s10"]>, DwarfRegNum<[26]>;
				def X27_32 : RISCVReg32<27,"x27", ["s11"]>, DwarfRegNum<[27]>;
				def X28_32 : RISCVReg32<28,"x28", ["t3"]>, DwarfRegNum<[28]>;
				def X29_32 : RISCVReg32<29,"x29", ["t4"]>, DwarfRegNum<[29]>;
				def X30_32 : RISCVReg32<30,"x30", ["t5"]>, DwarfRegNum<[30]>;
				def X31_32 : RISCVReg32<31,"x31", ["t6"]>, DwarfRegNum<[31]>;
				}

				foreach Index = 0-31 in {
				def X#Index#_64 : RISCVReg64<!cast<RISCVReg32>("X"#Index#"_32")>, DwarfRegNum<[Index]>;
				}

				// We currently define separate register classes for the 32-bit and 64-bit
				// GPRs. Once variable-sized register classes
				// <http://lists.llvm.org/pipermail/llvm-dev/2016-September/105027.html> or
				// similar are implemented, we can just use one 'GPR' class for most
				// instruction definitions.

				// TODO: once codegen is implemented, registers should be listed in an order
				// reflecting the preferred register allocation sequence.
				def GPR : RegisterClass<"RISCV", [i32], 32, (add
				(sequence "X%u_32", 0, 31)
				)>;

				def GPR64 : RegisterClass<"RISCV", [i64], 64, (add
				(sequence "X%u_64", 0, 31)
				)>;

This is an archive of the discontinued LLVM Phabricator instance.

[RISCV 4/10] Add basic RISCV{InstrFormats,InstrInfo,RegisterInfo,}.td
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 76654

llvm/trunk/lib/Target/RISCV/CMakeLists.txt

llvm/trunk/lib/Target/RISCV/RISCV.td

llvm/trunk/lib/Target/RISCV/RISCVInstrFormats.td

llvm/trunk/lib/Target/RISCV/RISCVInstrInfo.td

llvm/trunk/lib/Target/RISCV/RISCVRegisterInfo.td

This is an archive of the discontinued LLVM Phabricator instance.

[RISCV 4/10] Add basic RISCV{InstrFormats,InstrInfo,RegisterInfo,}.tdClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 76654

llvm/trunk/lib/Target/RISCV/CMakeLists.txt

llvm/trunk/lib/Target/RISCV/RISCV.td

llvm/trunk/lib/Target/RISCV/RISCVInstrFormats.td

llvm/trunk/lib/Target/RISCV/RISCVInstrInfo.td

llvm/trunk/lib/Target/RISCV/RISCVRegisterInfo.td

[RISCV 4/10] Add basic RISCV{InstrFormats,InstrInfo,RegisterInfo,}.td
ClosedPublic