This is an archive of the discontinued LLVM Phabricator instance.

[mips][p5600] Added P5600 processor and initial scheduler.
ClosedPublic

Authored by dsanders on Aug 20 2015, 6:05 AM.

Download Raw Diff

Details

Reviewers

Commits

rG7727e1098c69: [mips][p5600] Added P5600 processor and initial scheduler.
rL248725: [mips][p5600] Added P5600 processor and initial scheduler.

Summary

The P5600 is an out-of-order, superscalar implementation of the MIPS32R5
architecture.

The scheduler has a few missing details (see the 'Tricky Instructions'
section and some quirks of the P5600 are deliberately omitted due to
implementation difficulty and low chance of significant benefit (e.g. the
predicate on P5600WriteEitherALU). However, testing on SingleSource is
showing significant performance benefits on some apps (seven in the 10-30%
range) and only one significant regression (12%) when
-pre-RA-sched=linearize is given. Without -pre-RA-sched=linearize the
results are more variable. Some do even better (up to 55% improvement) but
increased numbers of copies are slowing others down (up to 12%).

Overall, the scheduler as it currently stands is a 2.4% win with
-pre-RA-sched=linearize and a 2.7% win without -pre-RA-sched=linearize.
I'm sure we can improve on this further.

For completeness, the FPGA this was tested on shows some failures with and
without the P5600 scheduler. These appear to be scheduling related since
the two test runs have fairly different sets of failing tests even after
accounting for other factors (e.g. spurious connection failures) however
it's not P5600 specific since we also get some for the generic scheduler.

Depends on D12190

Diff Detail

Event Timeline

dsanders updated this revision to Diff 32689.Aug 20 2015, 6:05 AM

dsanders retitled this revision from to [mips][p5600] Added P5600 processor and initial scheduler..

dsanders updated this object.

dsanders added a parent revision: D12190: [mips][sched] Added class for WSBH.

dsanders added subscribers: vkalintiris, atrick, llvm-commits.

vkalintiris added a reviewer: vkalintiris.Aug 20 2015, 8:58 AM

dsanders added a child revision: D12234: [mips][p5600] Add -mcpu=p5600 option..Aug 21 2015, 5:47 AM

vkalintiris mentioned this in D12188: [mips][sched] Temporarily rename IIAlu to IIM16Alu. NFC..Aug 23 2015, 3:03 PM

Generally, this LGTM, as far as I can tell. I added some comments inline.

Also, we should change the itinerary used in BC16_MMR6_DESC which was introduced in r246963.

IMHO, we should group together the ProcResourses, ItinRWs etc, in a later patch, as it improves the readability of the scheduler.

lib/Target/Mips/Mips.td
172	Is there a reason for using FeatureMips32r2 instead of FeatureMips32r5?
lib/Target/Mips/MipsScheduleP5600.td
16	Shouldn't we set this to zero? This is my understanding (from the relevant comment in the definition of CompleteModel) given that we don't define itineraries for every instruction.
lib/Target/Mips/MipsSubtarget.h
45	I didn't investigate the problem but I can compile this patch only with an unscoped enumeration.
45–53	Can we change these names to MipsCPUEnum and MipsProcImpl, or something similar in order to follow the naming convention of MipsArchEnum and MipsArchVersion?

This revision is now accepted and ready to land.Sep 16 2015, 2:43 PM

dsanders added inline comments.Sep 24 2015, 4:06 AM

lib/Target/Mips/Mips.td
172	I'll fix that. When this patch was first written (around a year ago) FeatureMips32r5 didn't exist.
lib/Target/Mips/MipsScheduleP5600.td
16	We should cover every instruction present in P5600 at this point. If I've missed any then that's a bug in the scheduler and it should be fixed. If CompleteModel is 1, we will assert on unexpected instructions but if it's 0 then we will silently assign some scheduling information.
lib/Target/Mips/MipsSubtarget.h
45	Could you tell me which compiler and version you are using and the error message you get?
45–53	I'd actually like to change MipsArchEnum and MipsArchVersion at some point. They're inside a class called MipsSubtarget so the repeated 'Mips' is redundant.

vkalintiris added inline comments.Sep 24 2015, 5:12 AM

lib/Target/Mips/MipsScheduleP5600.td
16	If I've missed any then that's a bug in the scheduler and it should be fixed. Okay then that sounds fine.
lib/Target/Mips/MipsSubtarget.h
45	GCC 4.8.4, here's the error: `In file included from /home/vk/repos/llvm/lib/Target/Mips/MipsSubtarget.cpp:32:0: lib/Target/Mips/MipsGenSubtargetInfo.inc: In member function ‘void llvm::MipsSubtarget::ParseSubtargetFeatures(llvm::StringRef, llvm::StringRef)’: lib/Target/Mips/MipsGenSubtargetInfo.inc:760:43: error: ‘CPU’ is not a class, namespace, or enumeration if (Bits[Mips::ImplP5600] && ProcImpl < CPU::P5600) ProcImpl = CPU::P5600; ^ lib/Target/Mips/MipsGenSubtargetInfo.inc:760:66: error: ‘CPU’ is not a class, namespace, or enumeration if (Bits[Mips::ImplP5600] && ProcImpl < CPU::P5600) ProcImpl = CPU::P5600; ^ [99/461] Building CXX object tools/clang/lib/Sema/CMakeFiles/clangSema.dir/SemaTemplate.cpp.o ninja: build stopped: subcommand failed.` And here's the quick fix I had in my branch: ` diff --git a/lib/Target/Mips/Mips.td b/lib/Target/Mips/Mips.td index 87e013d..ef28ebf 100644 a/lib/Target/Mips/Mips.td +++ b/lib/Target/Mips/Mips.td @@ -175,7 +175,7 @@ def FeatureUseTCCInDIV : SubtargetFeature< Mips processors supported. ===----------------------------------------------------------------------===// -def ImplP5600 : SubtargetFeature<"p5600", "ProcImpl", "CPU::P5600", +def ImplP5600 : SubtargetFeature<"p5600", "ProcImpl", "P5600", "The P5600 Processor", [FeatureMips32r2]>; class Proc<string Name, list<SubtargetFeature> Features> diff --git a/lib/Target/Mips/MipsSubtarget.h b/lib/Target/Mips/MipsSubtarget.h index 19e9788..fa6225c 100644 --- a/lib/Target/Mips/MipsSubtarget.h +++ b/lib/Target/Mips/MipsSubtarget.h @@ -42,14 +42,15 @@ class MipsSubtarget : public MipsGenSubtargetInfo { Mips3, Mips4, Mips5, Mips64, Mips64r2, Mips64r3, Mips64r5, Mips64r6 }; - enum class CPU { P5600 }; - // Mips architecture version MipsArchEnum MipsArchVersion; + enum CPU { P5600 }; + + CPU ProcImpl; + // Processor implementation (unused but required to exist by // tablegen-erated code). - CPU ProcImpl; // IsLittle - The target is Little Endian bool IsLittle;`
45–53	Agreed. Repeating the Mips-prefix is redundant, we can fix this with a later patch.

mpf added a subscriber: mpf.Sep 24 2015, 6:19 AM

mpf added inline comments.

lib/Target/Mips/MipsScheduleP5600.td
16	I have some concerns about this as we are potentially making it impossible to use the P5600 scheduler (in a release version of LLVM) because someone happens to hit an instruction that did not get triggered in any testing of p5600. 99% of the scheduler would be OK in that environment with just a few corner cases of bad scheduling information. Do other architectures set CompleteModel to '1'? If so then fine otherwise can you make a production/release build of the compiler just generate a warning if it hits a missing instruction (but continue) and have a debug compiler abort (or something along the same lines)? Just trying to make sure we don't end up with this scheduler being in place but unusable because of a corner case. Bearing in mind that if one file in an application can't build with the p5600 scheduler then many build systems will need to turn it off for all files.

dsanders marked 9 inline comments as done.Sep 24 2015, 6:39 AM

dsanders added inline comments.

lib/Target/Mips/MipsScheduleP5600.td
16	21 out of 25 currently have CompleteModel = 1. I just looked up the code for the error and there's an '#ifndef NDEBUG' around the check so release builds will invent some default scheduling information and carry on.
lib/Target/Mips/MipsSubtarget.h
45	Thanks. It's my understanding that we only support the last two major releases of GCC which is currently 4.9 and 5.0 so I don't think we should worry about this too much. That said, does 'MipsSubtarget::CPU::P5600' work?

vkalintiris added inline comments.Sep 24 2015, 7:19 AM

lib/Target/Mips/MipsSubtarget.h
45	Yes, it works.

dsanders closed this revision.Sep 28 2015, 11:25 AM

dsanders marked an inline comment as done.

dsanders marked 4 inline comments as done.Sep 28 2015, 11:26 AM

dsanders added inline comments.

lib/Target/Mips/Mips.td
172	Fixed in the commit
lib/Target/Mips/MipsSubtarget.h
45	Ok, I've made that change in the commit.

Revision Contents

Path

Size

lib/

Target/

Mips/

4 lines

2 lines

392 lines

6 lines

Diff 32689

lib/Target/Mips/Mips.td

	Show First 20 Lines • Show All 162 Lines • ▼ Show 20 Lines
	def FeatureCnMips : SubtargetFeature<"cnmips", "HasCnMips",			def FeatureCnMips : SubtargetFeature<"cnmips", "HasCnMips",
	"true", "Octeon cnMIPS Support",			"true", "Octeon cnMIPS Support",
	[FeatureMips64r2]>;			[FeatureMips64r2]>;

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Mips processors supported.			// Mips processors supported.
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

				def ImplP5600 : SubtargetFeature<"p5600", "ProcImpl", "CPU::P5600",
				"The P5600 Processor", [FeatureMips32r2]>;
				vkalintirisUnsubmitted Done Reply Inline Actions Is there a reason for using FeatureMips32r2 instead of FeatureMips32r5? vkalintiris: Is there a reason for using FeatureMips32r2 instead of FeatureMips32r5?
				dsandersAuthorUnsubmitted Not Done Reply Inline Actions I'll fix that. When this patch was first written (around a year ago) FeatureMips32r5 didn't exist. dsanders: I'll fix that. When this patch was first written (around a year ago) FeatureMips32r5 didn't…
				dsandersAuthorUnsubmitted Not Done Reply Inline Actions Fixed in the commit dsanders: Fixed in the commit

	class Proc<string Name, list<SubtargetFeature> Features>			class Proc<string Name, list<SubtargetFeature> Features>
	: Processor<Name, MipsGenericItineraries, Features>;			: Processor<Name, MipsGenericItineraries, Features>;

	def : Proc<"mips1", [FeatureMips1]>;			def : Proc<"mips1", [FeatureMips1]>;
	def : Proc<"mips2", [FeatureMips2]>;			def : Proc<"mips2", [FeatureMips2]>;
	def : Proc<"mips32", [FeatureMips32]>;			def : Proc<"mips32", [FeatureMips32]>;
	def : Proc<"mips32r2", [FeatureMips32r2]>;			def : Proc<"mips32r2", [FeatureMips32r2]>;
	def : Proc<"mips32r3", [FeatureMips32r3]>;			def : Proc<"mips32r3", [FeatureMips32r3]>;
	def : Proc<"mips32r5", [FeatureMips32r5]>;			def : Proc<"mips32r5", [FeatureMips32r5]>;
	def : Proc<"mips32r6", [FeatureMips32r6]>;			def : Proc<"mips32r6", [FeatureMips32r6]>;

	def : Proc<"mips3", [FeatureMips3]>;			def : Proc<"mips3", [FeatureMips3]>;
	def : Proc<"mips4", [FeatureMips4]>;			def : Proc<"mips4", [FeatureMips4]>;
	def : Proc<"mips5", [FeatureMips5]>;			def : Proc<"mips5", [FeatureMips5]>;
	def : Proc<"mips64", [FeatureMips64]>;			def : Proc<"mips64", [FeatureMips64]>;
	def : Proc<"mips64r2", [FeatureMips64r2]>;			def : Proc<"mips64r2", [FeatureMips64r2]>;
	def : Proc<"mips64r3", [FeatureMips64r3]>;			def : Proc<"mips64r3", [FeatureMips64r3]>;
	def : Proc<"mips64r5", [FeatureMips64r5]>;			def : Proc<"mips64r5", [FeatureMips64r5]>;
	def : Proc<"mips64r6", [FeatureMips64r6]>;			def : Proc<"mips64r6", [FeatureMips64r6]>;
	def : Proc<"mips16", [FeatureMips16]>;			def : Proc<"mips16", [FeatureMips16]>;
	def : Proc<"octeon", [FeatureMips64r2, FeatureCnMips]>;			def : Proc<"octeon", [FeatureMips64r2, FeatureCnMips]>;
				def : ProcessorModel<"p5600", MipsP5600Model, [ImplP5600]>;

	def MipsAsmParser : AsmParser {			def MipsAsmParser : AsmParser {
	let ShouldEmitMatchRegisterName = 0;			let ShouldEmitMatchRegisterName = 0;
	let MnemonicContainsDot = 1;			let MnemonicContainsDot = 1;
	}			}

	def MipsAsmParserVariant : AsmParserVariant {			def MipsAsmParserVariant : AsmParserVariant {
	int Variant = 0;			int Variant = 0;
	Show All 10 Lines

lib/Target/Mips/MipsSchedule.td

Show First 20 Lines • Show All 351 Lines • ▼ Show 20 Lines	def MipsGenericItineraries : ProcessorItineraries<[ALU, IMULDIV], [], [
InstrItinData<II_SUXC1 , [InstrStage<1, [ALU]>]>,		InstrItinData<II_SUXC1 , [InstrStage<1, [ALU]>]>,
InstrItinData<II_DMFC1 , [InstrStage<2, [ALU]>]>,		InstrItinData<II_DMFC1 , [InstrStage<2, [ALU]>]>,
InstrItinData<II_DMTC1 , [InstrStage<2, [ALU]>]>,		InstrItinData<II_DMTC1 , [InstrStage<2, [ALU]>]>,
InstrItinData<II_MFC1 , [InstrStage<2, [ALU]>]>,		InstrItinData<II_MFC1 , [InstrStage<2, [ALU]>]>,
InstrItinData<II_MTC1 , [InstrStage<2, [ALU]>]>,		InstrItinData<II_MTC1 , [InstrStage<2, [ALU]>]>,
InstrItinData<II_MFHC1 , [InstrStage<2, [ALU]>]>,		InstrItinData<II_MFHC1 , [InstrStage<2, [ALU]>]>,
InstrItinData<II_MTHC1 , [InstrStage<2, [ALU]>]>		InstrItinData<II_MTHC1 , [InstrStage<2, [ALU]>]>
]>;		]>;

		include "MipsScheduleP5600.td"

lib/Target/Mips/MipsScheduleP5600.td

This file was added.

				//==- MipsScheduleP5600.td - P5600 Scheduling Definitions --- tablegen --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				def MipsP5600Model : SchedMachineModel {
				int IssueWidth = 2; // 2x dispatched per cycle
				int MicroOpBufferSize = 48; // min(48, 48, 64)
				int LoadLatency = 4;
				int MispredictPenalty = 8; // TODO: Estimated

				let CompleteModel = 1;
				vkalintirisUnsubmitted Done Reply Inline Actions Shouldn't we set this to zero? This is my understanding (from the relevant comment in the definition of CompleteModel) given that we don't define itineraries for every instruction. vkalintiris: Shouldn't we set this to zero? This is my understanding (from the relevant comment in the…
				dsandersAuthorUnsubmitted Done Reply Inline Actions We should cover every instruction present in P5600 at this point. If I've missed any then that's a bug in the scheduler and it should be fixed. If CompleteModel is 1, we will assert on unexpected instructions but if it's 0 then we will silently assign some scheduling information. dsanders: We should cover every instruction present in P5600 at this point. If I've missed any then…
				vkalintirisUnsubmitted Done Reply Inline Actions If I've missed any then that's a bug in the scheduler and it should be fixed. Okay then that sounds fine. vkalintiris: > If I've missed any then that's a bug in the scheduler and it should be fixed. Okay then that…
				mpfUnsubmitted Not Done Reply Inline Actions I have some concerns about this as we are potentially making it impossible to use the P5600 scheduler (in a release version of LLVM) because someone happens to hit an instruction that did not get triggered in any testing of p5600. 99% of the scheduler would be OK in that environment with just a few corner cases of bad scheduling information. Do other architectures set CompleteModel to '1'? If so then fine otherwise can you make a production/release build of the compiler just generate a warning if it hits a missing instruction (but continue) and have a debug compiler abort (or something along the same lines)? Just trying to make sure we don't end up with this scheduler being in place but unusable because of a corner case. Bearing in mind that if one file in an application can't build with the p5600 scheduler then many build systems will need to turn it off for all files. mpf: I have some concerns about this as we are potentially making it impossible to use the P5600…
				dsandersAuthorUnsubmitted Not Done Reply Inline Actions 21 out of 25 currently have CompleteModel = 1. I just looked up the code for the error and there's an '#ifndef NDEBUG' around the check so release builds will invent some default scheduling information and carry on. dsanders: 21 out of 25 currently have CompleteModel = 1. I just looked up the code for the error and…
				}

				let SchedModel = MipsP5600Model in {

				// ALQ Pipelines
				// =============

				def P5600ALQ : ProcResource<1> { let BufferSize = 16; }
				def P5600IssueALU : ProcResource<1> { let Super = P5600ALQ; }

				// ALU Pipeline
				// ------------

				def P5600WriteALU : SchedWriteRes<[P5600IssueALU]>;

				// and, lui, nor, or, slti, sltiu, sub, subu, xor
				def : ItinRW<[P5600WriteALU],
				[II_AND, II_LUI, II_NOR, II_OR, II_SLTI_SLTIU, II_SUBU, II_XOR]>;

				// AGQ Pipelines
				// =============

				def P5600AGQ : ProcResource<3> { let BufferSize = 16; }
				def P5600IssueAL2 : ProcResource<1> { let Super = P5600AGQ; }
				def P5600IssueCTISTD : ProcResource<1> { let Super = P5600AGQ; }
				def P5600IssueLDST : ProcResource<1> { let Super = P5600AGQ; }

				def P5600AL2Div : ProcResource<1>;
				// Pseudo-resource used to block CTISTD when handling multi-pipeline splits.
				def P5600CTISTD : ProcResource<1>;

				// CTISTD Pipeline
				// ---------------

				def P5600WriteJump : SchedWriteRes<[P5600IssueCTISTD, P5600CTISTD]>;
				def P5600WriteJumpAndLink : SchedWriteRes<[P5600IssueCTISTD, P5600CTISTD]> {
				let Latency = 2;
				}

				// b, beq, beql, bg[et]z, bl[et]z, bne, bnel, j, syscall, jal, bltzal, jalx,
				// jalr, jr.hb, jr
				def : ItinRW<[P5600WriteJump], [II_B, II_BCC, II_BCCZ, II_BCCZAL, II_J, II_JR]>;
				def : ItinRW<[P5600WriteJumpAndLink], [II_JAL, II_JALR]>;

				// LDST Pipeline
				// -------------

				def P5600WriteLoad : SchedWriteRes<[P5600IssueLDST]> {
				let Latency = 4;
				}

				def P5600WriteLoadShifted : SchedWriteRes<[P5600IssueLDST, P5600CTISTD]> {
				let Latency = 4;
				}

				def P5600WritePref : SchedWriteRes<[P5600IssueLDST]>;

				def P5600WriteStore : SchedWriteRes<[P5600IssueLDST, P5600CTISTD]> {
				// FIXME: This is a bit pessimistic. P5600CTISTD is only used during cycle 2
				// not during 0, 1, and 2.
				let ResourceCycles = [ 1, 3 ];
				}

				def P5600WriteGPRFromBypass : SchedWriteRes<[P5600IssueLDST]> {
				let Latency = 2;
				}

				def P5600WriteStoreFromOtherUnits : SchedWriteRes<[P5600IssueLDST]>;
				def P5600WriteLoadToOtherUnits : SchedWriteRes<[P5600IssueLDST]> {
				let Latency = 0;
				}

				// l[bhw], l[bh]u, ll
				def : ItinRW<[P5600WriteLoad], [II_LB, II_LBU, II_LH, II_LHU, II_LW, II_LWU]>;

				// lw[lr]
				def : ItinRW<[P5600WriteLoadShifted], [II_LWL, II_LWR]>;

				// s[bhw], sw[lr]
				def : ItinRW<[P5600WriteStore], [II_SB, II_SH, II_SW, II_SWL, II_SWR]>;

				// pref
				// (this instruction does not exist in the backend yet)
				def : ItinRW<[P5600WritePref], []>;

				// sc
				// (this instruction does not exist in the backend yet)
				def : ItinRW<[P5600WriteStore], []>;

				// LDST is also used in moves from general purpose registers to floating point
				// and MSA.
				def P5600WriteMoveGPRToOtherUnits : SchedWriteRes<[P5600IssueLDST]> {
				let Latency = 0;
				}

				// AL2 Pipeline
				// ------------

				def P5600WriteAL2 : SchedWriteRes<[P5600IssueAL2]>;
				def P5600WriteAL2BitExt : SchedWriteRes<[P5600IssueAL2]> { let Latency = 2; }
				def P5600WriteAL2ShadowMov : SchedWriteRes<[P5600IssueAL2]> { let Latency = 2; }
				def P5600WriteAL2CondMov : SchedWriteRes<[P5600IssueAL2, P5600CTISTD]> {
				let Latency = 2;
				}
				def P5600WriteAL2Div : SchedWriteRes<[P5600IssueAL2, P5600AL2Div]> {
				// Estimated worst case
				let Latency = 34;
				let ResourceCycles = [1, 34];
				}
				def P5600WriteAL2DivU : SchedWriteRes<[P5600IssueAL2, P5600AL2Div]> {
				// Estimated worst case
				let Latency = 34;
				let ResourceCycles = [1, 34];
				}
				def P5600WriteAL2Mul : SchedWriteRes<[P5600IssueAL2]> { let Latency = 3; }
				def P5600WriteAL2Mult: SchedWriteRes<[P5600IssueAL2]> { let Latency = 5; }
				def P5600WriteAL2MAdd: SchedWriteRes<[P5600IssueAL2, P5600CTISTD]> {
				let Latency = 5;
				}

				// clo, clz, di, mfhi, mflo
				def : ItinRW<[P5600WriteAL2], [II_CLO, II_CLZ, II_MFHI_MFLO]>;

				// ehb, rdhwr, rdpgpr, wrpgpr, wsbh
				def : ItinRW<[P5600WriteAL2ShadowMov], [II_RDHWR]>;

				// mov[nz]
				def : ItinRW<[P5600WriteAL2CondMov], [II_MOVN, II_MOVZ]>;

				// divu?
				def : ItinRW<[P5600WriteAL2Div], [II_DIV]>;
				def : ItinRW<[P5600WriteAL2DivU], [II_DIVU]>;

				// mul
				def : ItinRW<[P5600WriteAL2Mul], [II_MUL]>;
				// multu?, multu?
				def : ItinRW<[P5600WriteAL2Mult], [II_MULT, II_MULTU]>;
				// maddu?, msubu?, mthi, mtlo
				def : ItinRW<[P5600WriteAL2MAdd],
				[II_MADD, II_MADDU, II_MSUB, II_MSUBU, II_MTHI_MTLO]>;

				// ext, ins
				def : ItinRW<[P5600WriteAL2BitExt],
				[II_EXT, II_INS]>;

				// Either ALU or AL2 Pipelines
				// ---------------------------
				//
				// Some instructions can choose between ALU and AL2, but once dispatched to
				// ALQ or AGQ respectively they are committed to that path.
				// The decision is based on the outcome of the most recent selection when the
				// choice was last available. For now, we assume ALU is always chosen.

				def P5600WriteEitherALU : SchedWriteVariant<
				// FIXME: Implement selection predicate
				[SchedVar<SchedPredicate<[{1}]>, [P5600WriteALU]>,
				SchedVar<SchedPredicate<[{0}]>, [P5600WriteAL2]>
				]>;

				// add, addi, addiu, addu, andi, ori, rotr, se[bh], sllv?, sr[al]v?, slt, sltu,
				// xori
				def : ItinRW<[P5600WriteEitherALU],
				[II_ADDI, II_ADDIU, II_ANDI, II_ORI, II_ROTR, II_SEB, II_SEH,
				II_SLT_SLTU, II_SLL, II_SRA, II_SRL, II_XORI, II_ADDU, II_SLLV,
				II_SRAV, II_SRLV]>;

				// FPU Pipelines
				// =============

				def P5600FPQ : ProcResource<3> { let BufferSize = 16; }
				def P5600IssueFPUS : ProcResource<1> { let Super = P5600FPQ; }
				def P5600IssueFPUL : ProcResource<1> { let Super = P5600FPQ; }
				def P5600IssueFPULoad : ProcResource<1> { let Super = P5600FPQ; }

				def P5600FPUDivSqrt : ProcResource<2>;

				def P5600WriteFPUS : SchedWriteRes<[P5600IssueFPUS]>;
				def P5600WriteFPUL : SchedWriteRes<[P5600IssueFPUL]> { let Latency = 4; }
				def P5600WriteFPUL_MADDSUB : SchedWriteRes<[P5600IssueFPUL]> { let Latency = 6; }
				def P5600WriteFPUDivS : SchedWriteRes<[P5600IssueFPUL, P5600FPUDivSqrt]> {
				// Best/Common/Worst case = 7 / 23 / 27
				let Latency = 23; // Using common case
				let ResourceCycles = [ 1, 23 ];
				}
				def P5600WriteFPUDivD : SchedWriteRes<[P5600IssueFPUL, P5600FPUDivSqrt]> {
				// Best/Common/Worst case = 7 / 31 / 35
				let Latency = 31; // Using common case
				let ResourceCycles = [ 1, 31 ];
				}
				def P5600WriteFPURcpS : SchedWriteRes<[P5600IssueFPUL, P5600FPUDivSqrt]> {
				// Best/Common/Worst case = 7 / 19 / 23
				let Latency = 19; // Using common case
				let ResourceCycles = [ 1, 19 ];
				}
				def P5600WriteFPURcpD : SchedWriteRes<[P5600IssueFPUL, P5600FPUDivSqrt]> {
				// Best/Common/Worst case = 7 / 27 / 31
				let Latency = 27; // Using common case
				let ResourceCycles = [ 1, 27 ];
				}
				def P5600WriteFPURsqrtS : SchedWriteRes<[P5600IssueFPUL, P5600FPUDivSqrt]> {
				// Best/Common/Worst case = 7 / 27 / 27
				let Latency = 27; // Using common case
				let ResourceCycles = [ 1, 27 ];
				}
				def P5600WriteFPURsqrtD : SchedWriteRes<[P5600IssueFPUL, P5600FPUDivSqrt]> {
				// Best/Common/Worst case = 7 / 27 / 31
				let Latency = 27; // Using common case
				let ResourceCycles = [ 1, 27 ];
				}
				def P5600WriteFPUSqrtS : SchedWriteRes<[P5600IssueFPUL, P5600FPUDivSqrt]> {
				// Best/Common/Worst case = 7 / 27 / 31
				let Latency = 27; // Using common case
				let ResourceCycles = [ 1, 27 ];
				}
				def P5600WriteFPUSqrtD : SchedWriteRes<[P5600IssueFPUL, P5600FPUDivSqrt]> {
				// Best/Common/Worst case = 7 / 35 / 39
				let Latency = 35; // Using common case
				let ResourceCycles = [ 1, 35 ];
				}
				def P5600WriteMSAShortLogic : SchedWriteRes<[P5600IssueFPUS]>;
				def P5600WriteMSAShortInt : SchedWriteRes<[P5600IssueFPUS]> { let Latency = 2; }
				def P5600WriteMoveOtherUnitsToFPU : SchedWriteRes<[P5600IssueFPUS]>;

				// FPUS is also used in moves from floating point and MSA registers to general
				// purpose registers.
				def P5600WriteMoveFPUSToOtherUnits : SchedWriteRes<[P5600IssueFPUS]> {
				let Latency = 0;
				}

				// FPUL is also used in moves from floating point and MSA registers to general
				// purpose registers.
				def P5600WriteMoveFPULToOtherUnits : SchedWriteRes<[P5600IssueFPUL]>;

				// Short Pipe
				// ----------
				//
				// abs.[ds], abs.ps, bc1[tf]l?, mov[tf].[ds], mov[tf], mov.[ds], [cm][ft]c1,
				// m[ft]hc1, neg.[ds], neg.ps, nor.v, nori.b, or.v, ori.b, xor.v, xori.b,
				// sdxc1, sdc1, st.[bhwd], swc1, swxc1
				def : ItinRW<[P5600WriteFPUS], [II_ABS, II_MOVF_D, II_MOVF_S, II_MOVT_D,
				II_MOVT_S, II_MOV_D, II_MOV_S, II_NEG]>;

				// adds_a.[bhwd], adds_[asu].[bhwd], addvi?.[bhwd], asub_[us].[bhwd],
				// aver?_[us].[bhwd]
				def : InstRW<[P5600WriteMSAShortInt], (instregex "^ADD_A_[BHWD]$")>;
				def : InstRW<[P5600WriteMSAShortInt], (instregex "^ADDS_[ASU]_[BHWD]$")>;
				// TODO: ADDVI_[BHW] might be 1 cycle latency rather than 2. Need to confirm it.
				def : InstRW<[P5600WriteMSAShortInt], (instregex "^ADDVI?_[BHWD]$")>;
				def : InstRW<[P5600WriteMSAShortInt], (instregex "^ASUB_[US].[BHWD]$")>;
				def : InstRW<[P5600WriteMSAShortInt], (instregex "^AVER?_[US].[BHWD]$")>;

				// and.v, andi.b, move.v, ldi.[bhwd]
				def : InstRW<[P5600WriteMSAShortLogic], (instregex "^MOVE_V$")>;
				def : InstRW<[P5600WriteMSAShortLogic], (instregex "^LDI_[BHWD]$")>;
				def : InstRW<[P5600WriteMSAShortLogic], (instregex "^(AND\|OR\|[XN]OR)_V$")>;
				def : InstRW<[P5600WriteMSAShortLogic], (instregex "^(AND\|OR\|[XN]OR)I_B$")>;

				// Long Pipe
				// ----------
				//
				// add.[ds], add.ps, cvt.d.[sw], cvt.s.[dw], cvt.w.[sd], cvt.[sw].ps,
				// cvt.ps.[sw], c.<cc>.[ds], c.<cc>.ps, mul.[ds], mul.ps, sub.[ds], sub.ps,
				// trunc.w.[ds], trunc.w.ps
				def : ItinRW<[P5600WriteFPUL],
				[II_ADD_D, II_ADD_S, II_CVT, II_C_CC_D, II_C_CC_S, II_MUL_D,
				II_MUL_S, II_SUB_D, II_SUB_S, II_TRUNC]>;

				// div.[ds], div.ps
				def : ItinRW<[P5600WriteFPUDivS], [II_DIV_S]>;
				def : ItinRW<[P5600WriteFPUDivD], [II_DIV_D]>;

				// sqrt.[ds], sqrt.ps
				def : ItinRW<[P5600WriteFPUSqrtS], [II_SQRT_S]>;
				def : ItinRW<[P5600WriteFPUSqrtD], [II_SQRT_D]>;

				// madd.[ds], msub.[ds], nmadd.[ds], nmsub.[ds],
				// Operand 0 is read on cycle 5. All other operands are read on operand 0.
				def : ItinRW<[SchedReadAdvance<5>, P5600WriteFPUL_MADDSUB],
				[II_MADD_D, II_MADD_S, II_MSUB_D, II_MSUB_S, II_NMADD_D,
				II_NMADD_S, II_NMSUB_D, II_NMSUB_S]>;

				// madd.ps, msub.ps, nmadd.ps, nmsub.ps
				// Operand 0 and 1 are read on cycle 5. All others are read on operand 0.
				// (none of these instructions exist in the backend yet)

				// Load Pipe
				// ---------
				//
				// This is typically used in conjunction with the load pipeline under the AGQ
				// All the instructions are in the 'Tricky Instructions' section.

				def P5600WriteLoadOtherUnitsToFPU : SchedWriteRes<[P5600IssueFPULoad]> {
				let Latency = 4;
				}

				// Tricky Instructions
				// ===================
				//
				// These instructions are split across multiple uops (in different pipelines)
				// that must cooperate to complete the operation

				// FIXME: This isn't quite right since the implementation of WriteSequence
				// current aggregates the resources and ignores the exact cycle they are
				// used.
				def P5600WriteMoveGPRToFPU : WriteSequence<[P5600WriteMoveGPRToOtherUnits,
				P5600WriteMoveOtherUnitsToFPU]>;

				// FIXME: This isn't quite right since the implementation of WriteSequence
				// current aggregates the resources and ignores the exact cycle they are
				// used.
				def P5600WriteMoveFPUToGPR : WriteSequence<[P5600WriteMoveFPUSToOtherUnits,
				P5600WriteGPRFromBypass]>;

				// FIXME: This isn't quite right since the implementation of WriteSequence
				// current aggregates the resources and ignores the exact cycle they are
				// used.
				def P5600WriteStoreFPUS : WriteSequence<[P5600WriteMoveFPUSToOtherUnits,
				P5600WriteStoreFromOtherUnits]>;

				// FIXME: This isn't quite right since the implementation of WriteSequence
				// current aggregates the resources and ignores the exact cycle they are
				// used.
				def P5600WriteStoreFPUL : WriteSequence<[P5600WriteMoveFPULToOtherUnits,
				P5600WriteStoreFromOtherUnits]>;

				// FIXME: This isn't quite right since the implementation of WriteSequence
				// current aggregates the resources and ignores the exact cycle they are
				// used.
				def P5600WriteLoadFPU : WriteSequence<[P5600WriteLoadToOtherUnits,
				P5600WriteLoadOtherUnitsToFPU]>;

				// ctc1, mtc1, mthc1
				def : ItinRW<[P5600WriteMoveGPRToFPU], [II_CTC1, II_MTC1, II_MTHC1]>;

				// bc1[ft], cfc1, mfc1, mfhc1, movf, movt
				def : ItinRW<[P5600WriteMoveFPUToGPR],
				[II_BC1F, II_BC1T, II_CFC1, II_MFC1, II_MFHC1, II_MOVF, II_MOVT]>;

				// swc1, swxc1, st.[bhwd]
				def : ItinRW<[P5600WriteStoreFPUS], [II_SWC1, II_SWXC1]>;
				def : InstRW<[P5600WriteStoreFPUS], (instregex "^ST_[BHWD]$")>;

				// movn.[ds], movz.[ds]
				def : ItinRW<[P5600WriteStoreFPUL], [II_MOVN_D, II_MOVN_S, II_MOVZ_D, II_MOVZ_S]>;

				// l[dw]x?c1, ld.[bhwd]
				def : ItinRW<[P5600WriteLoadFPU], [II_LDC1, II_LDXC1, II_LWC1, II_LWXC1]>;
				def : InstRW<[P5600WriteLoadFPU], (instregex "LD_[BHWD]")>;

				// Unsupported Instructions
				// ========================
				//
				// The following instruction classes are never valid on P5600.
				// II_DADDIU, II_DADDU, II_DMFC1, II_DMTC1, II_DMULT, II_DMULTU, II_DROTR,
				// II_DROTR32, II_DROTRV, II_DDIV, II_DSLL, II_DSLL32, II_DSLLV, II_DSRA,
				// II_DSRA32, II_DSRAV, II_DSRL, II_DSRL32, II_DSRLV, II_DSUBU, II_DDIVU,
				// II_JALRC, II_LD, II_LD[LR], II_LUXC1, II_RESTORE, II_SAVE, II_SD, II_SDC1,
				// II_SDL, II_SDR, II_SDXC1
				//
				// The following instructions are never valid on P5600.
				// addq.ph, rdhwr, repl.ph, repl.qb, subq.ph, subu_s.qb
				//
				// Guesswork
				// =========
				//
				// This section is largely temporary guesswork.

				// ceil.[lw].[ds], floor.[lw].[ds]
				// Reason behind guess: trunc.[lw].ds and the various cvt's are in FPUL
				def : ItinRW<[P5600WriteFPUL], [II_CEIL, II_FLOOR, II_ROUND]>;

				// rotrv
				// Reason behind guess: rotr is in the same category and the two register forms
				// generally follow the immediate forms in this category
				def : ItinRW<[P5600WriteEitherALU], [II_ROTRV]>;
				}

lib/Target/Mips/MipsSubtarget.h

Show All 36 Lines	class MipsSubtarget : public MipsGenSubtargetInfo {
virtual void anchor();		virtual void anchor();

enum MipsArchEnum {		enum MipsArchEnum {
MipsDefault,		MipsDefault,
Mips1, Mips2, Mips32, Mips32r2, Mips32r3, Mips32r5, Mips32r6, Mips32Max,		Mips1, Mips2, Mips32, Mips32r2, Mips32r3, Mips32r5, Mips32r6, Mips32Max,
Mips3, Mips4, Mips5, Mips64, Mips64r2, Mips64r3, Mips64r5, Mips64r6		Mips3, Mips4, Mips5, Mips64, Mips64r2, Mips64r3, Mips64r5, Mips64r6
};		};

		enum class CPU { P5600 };
		vkalintirisUnsubmitted Done Reply Inline Actions I didn't investigate the problem but I can compile this patch only with an unscoped enumeration. vkalintiris: I didn't investigate the problem but I can compile this patch only with an unscoped enumeration.
		dsandersAuthorUnsubmitted Done Reply Inline Actions Could you tell me which compiler and version you are using and the error message you get? dsanders: Could you tell me which compiler and version you are using and the error message you get?
		vkalintirisUnsubmitted Done Reply Inline Actions GCC 4.8.4, here's the error: `In file included from /home/vk/repos/llvm/lib/Target/Mips/MipsSubtarget.cpp:32:0: lib/Target/Mips/MipsGenSubtargetInfo.inc: In member function ‘void llvm::MipsSubtarget::ParseSubtargetFeatures(llvm::StringRef, llvm::StringRef)’: lib/Target/Mips/MipsGenSubtargetInfo.inc:760:43: error: ‘CPU’ is not a class, namespace, or enumeration if (Bits[Mips::ImplP5600] && ProcImpl < CPU::P5600) ProcImpl = CPU::P5600; ^ lib/Target/Mips/MipsGenSubtargetInfo.inc:760:66: error: ‘CPU’ is not a class, namespace, or enumeration if (Bits[Mips::ImplP5600] && ProcImpl < CPU::P5600) ProcImpl = CPU::P5600; ^ [99/461] Building CXX object tools/clang/lib/Sema/CMakeFiles/clangSema.dir/SemaTemplate.cpp.o ninja: build stopped: subcommand failed.` And here's the quick fix I had in my branch: ` diff --git a/lib/Target/Mips/Mips.td b/lib/Target/Mips/Mips.td index 87e013d..ef28ebf 100644 a/lib/Target/Mips/Mips.td +++ b/lib/Target/Mips/Mips.td @@ -175,7 +175,7 @@ def FeatureUseTCCInDIV : SubtargetFeature< Mips processors supported. ===----------------------------------------------------------------------===// -def ImplP5600 : SubtargetFeature<"p5600", "ProcImpl", "CPU::P5600", +def ImplP5600 : SubtargetFeature<"p5600", "ProcImpl", "P5600", "The P5600 Processor", [FeatureMips32r2]>; class Proc<string Name, list<SubtargetFeature> Features> diff --git a/lib/Target/Mips/MipsSubtarget.h b/lib/Target/Mips/MipsSubtarget.h index 19e9788..fa6225c 100644 --- a/lib/Target/Mips/MipsSubtarget.h +++ b/lib/Target/Mips/MipsSubtarget.h @@ -42,14 +42,15 @@ class MipsSubtarget : public MipsGenSubtargetInfo { Mips3, Mips4, Mips5, Mips64, Mips64r2, Mips64r3, Mips64r5, Mips64r6 }; - enum class CPU { P5600 }; - // Mips architecture version MipsArchEnum MipsArchVersion; + enum CPU { P5600 }; + + CPU ProcImpl; + // Processor implementation (unused but required to exist by // tablegen-erated code). - CPU ProcImpl; // IsLittle - The target is Little Endian bool IsLittle;` vkalintiris: GCC 4.8.4, here's the error: `In file included from…
		dsandersAuthorUnsubmitted Done Reply Inline Actions Thanks. It's my understanding that we only support the last two major releases of GCC which is currently 4.9 and 5.0 so I don't think we should worry about this too much. That said, does 'MipsSubtarget::CPU::P5600' work? dsanders: Thanks. It's my understanding that we only support the last two major releases of GCC which is…
		vkalintirisUnsubmitted Not Done Reply Inline Actions Yes, it works. vkalintiris: Yes, it works.
		dsandersAuthorUnsubmitted Not Done Reply Inline Actions Ok, I've made that change in the commit. dsanders: Ok, I've made that change in the commit.

// Mips architecture version		// Mips architecture version
MipsArchEnum MipsArchVersion;		MipsArchEnum MipsArchVersion;

		// Processor implementation (unused but required to exist by
		// tablegen-erated code).
		CPU ProcImpl;

		vkalintirisUnsubmitted Done Reply Inline Actions Can we change these names to MipsCPUEnum and MipsProcImpl, or something similar in order to follow the naming convention of MipsArchEnum and MipsArchVersion? vkalintiris: Can we change these names to MipsCPUEnum and MipsProcImpl, or something similar in order to…
		dsandersAuthorUnsubmitted Done Reply Inline Actions I'd actually like to change MipsArchEnum and MipsArchVersion at some point. They're inside a class called MipsSubtarget so the repeated 'Mips' is redundant. dsanders: I'd actually like to change MipsArchEnum and MipsArchVersion at some point. They're inside a…
		vkalintirisUnsubmitted Done Reply Inline Actions Agreed. Repeating the Mips-prefix is redundant, we can fix this with a later patch. vkalintiris: Agreed. Repeating the Mips-prefix is redundant, we can fix this with a later patch.
// IsLittle - The target is Little Endian		// IsLittle - The target is Little Endian
bool IsLittle;		bool IsLittle;

// IsSoftFloat - The target does not support any floating point instructions.		// IsSoftFloat - The target does not support any floating point instructions.
bool IsSoftFloat;		bool IsSoftFloat;

// IsSingleFloat - The target only supports single precision float		// IsSingleFloat - The target only supports single precision float
// point operations. This enable the target to use all 32 32-bit		// point operations. This enable the target to use all 32 32-bit
▲ Show 20 Lines • Show All 243 Lines • Show Last 20 Lines