This is an archive of the discontinued LLVM Phabricator instance.

include/llvm/MC/MCSchedule.h
191	Why do you need both?
199	What would `ProcResName` be is case of retired instruction for instance?
lib/MC/MCSchedule.cpp
152 ↗	(On Diff #168457)	I don't understand the error message.
utils/TableGen/CodeGenSchedule.cpp
783	join string literals and reapply formatting.
886	Same here
1204	Can you explain what the 6 is referring to?
1432	const
1463	join string literals
1942	Can you reformat
utils/TableGen/SubtargetEmitter.cpp
1792	llvm::format would help here OS << llvm::format("{0}ProcSchedKV, {0}ProcPfmKV, {0}WriteProcResTable, {0}WriteLatencyTable, {0}ReadAdvanceTable, ", Target);
1886	Same here.

@gchatelet a bunch of your comments are on old code touched by unwanted reformatting. I ignored these since I have reverted the formatted changes.

include/llvm/MC/MCSchedule.h
191	For consistency with MCSchedModel (see getValueForCpu() below).
199	the retired counter is a scheduler concept, not tied to proc resources. It would go with `CycleCounter` and `UopsCounter` above.
lib/MC/MCSchedule.cpp
152 ↗	(On Diff #168457)	This is ensuring that the value can be constructed by the linker without running any dynamic initialization.

use "dynamic initialization" instead of "static constructor".

address review comments

Harbormaster completed remote builds in B23513: Diff 168467.Oct 5 2018, 6:50 AM

Thanks

utils/TableGen/SubtargetEmitter.cpp
1792	I want to remain consistent with the rest of the file. I've reformatted to make it more readable.

RKSimon added a reviewer: andreadb.Oct 5 2018, 7:13 AM

RKSimon added inline comments.Oct 5 2018, 7:17 AM

lib/Target/X86/X86.td
978 ↗	(On Diff #168467)	Missing btver2 details?

add missing BtVer2PfmCounters to btver2.

lib/Target/X86/X86.td
978 ↗	(On Diff #168467)	Thanks for the catch.

Harbormaster completed remote builds in B23514: Diff 168470.Oct 5 2018, 7:24 AM

gchatelet accepted this revision.Oct 5 2018, 7:51 AM

This revision is now accepted and ready to land.Oct 5 2018, 7:51 AM

I’m not at work today, but I’d like a bit of time to review this patch.

I saw that it has already been accepted. However I am not entirely sure if I like the idea of adding another field to MCSubtargetInfo. I need to think about this.

RKSimon requested changes to this revision.Oct 5 2018, 8:13 AM

This revision now requires changes to proceed.Oct 5 2018, 8:13 AM

In D52932#1256560, @andreadb wrote:

I’m not at work today, but I’d like a bit of time to review this patch.

I saw that it has already been accepted. However I am not entirely sure if I like the idea of adding another field to MCSubtargetInfo. I need to think about this.

Sure, take your time.

RKSimon added a reviewer: craig.topper.Oct 5 2018, 1:12 PM

Thanks for waiting Clement.

I am now back from holiday, and I finally had time to look at your patch.

What concerns me about this approach is that MCSubtargetInfo describes a processor for the purpose of codegen.
At least for me (I know it may sound a bit philosophical), exposing the knowledge about PFMs to MCSubtargetInfo is not ideal. Ideally, the MCSubtargetInfo interface should be small and "abstract enough" to allow the description of different subprocessors for different targets. Anything perf related, should be described by other modules (i.e. in a separate class; alternatively, perf-related knowledge should be described by scheduling models).

What if instead we make PFM descriptors a customizable property tablegen class SchedMachineModel? Something that defaults to an empty set of descriptors.

On X86, knowledge about PFM counters of a vendor/family could be moved to a separate .td file (similarly to what you already do with lib/Target/X86/X86PfmCounters.td).
When a new model is created, people can either use let expressions to override the PFM set, or - alternatively - derive from a IntelSchedMachineModel class (vic. AMDSchedMachineModel) that sets some common "defaults".
Alternatively, we could introduce the concept of target vendor/processor family, if that helps mapping models to "default sets of" PFM counters.
As long as people are allowed to customize that set (either by using let expressions, or by overriding fiels using a a tablegen derived class), then it should be okay.

Not sure if it makes sense.

The bottom line is: I think we should try to keep the concept of PFMs separate from MCSubtargetInfo as much as possible.
That being said, I don't have a too strong opinion on this; if other devs think that I am wrong on this, then fine. I don't want to block the development on this.

I hope this helps.
Andrea

Hi Andrea, thanks for the comments.

I think there are two independent points that we're discussing here:
1 - Where we're putting the generated table
2 - How we specify the table.

Regarding (1):
The goal of this patch is to move the pfm counters out of the SchedModels (see PR39165), because different CPUs with different PFM counters can use the same sched model (e.g. sandybridge).
That being said, I'm not opposed to moving the table of CPU->MCPfmCountersInfo outside of the MCSubtargetInfo. One possible approach is to create a new PfmEmitter tablegen backend and PfmCounters library. If people think that's reasonable I can do that easily.

Regarding (2), I'm not sure I understand extactly what you're suggesting. Are you suggesting I do something like:

let PfmCounters = HaswellPfmCounters in {
def : HaswellProc<"haswell">;
def : HaswellProc<"core-avx2">; // Legacy alias.
}

Or more like:

// In X86.td:
def HaswellProcessorModel : HaswellProc<"haswell">;
def CoreAvx2ProcessorModel : HaswellProc<"core-avx2">; // Legacy alias.

// In X86PfmCounters.td:
def : ProcPfmCounters {
  let CycleCounter = UnhaltedCoreCyclesPfmCounter;
  let UopsCounter = UopsIssuedPfmCounter;
  let IssueCounters = [
    PfmIssueCounter<"HWPort0", "uops_dispatched_port:port_0">,
    PfmIssueCounter<"HWPort1", "uops_dispatched_port:port_1">,
    PfmIssueCounter<"HWPort2", "uops_dispatched_port:port_2">,
    PfmIssueCounter<"HWPort3", "uops_dispatched_port:port_3">,
    PfmIssueCounter<"HWPort4", "uops_dispatched_port:port_4">,
    PfmIssueCounter<"HWPort5", "uops_dispatched_port:port_5">,
    PfmIssueCounter<"HWPort6", "uops_dispatched_port:port_6">,
    PfmIssueCounter<"HWPort7", "uops_dispatched_port:port_7">
  ];
  let ProcModels = [HaswellProcessorModel, CoreAvx2ProcessorModel]
}

Thanks for the description Clement,

In D52932#1258740, @courbet wrote:

Hi Andrea, thanks for the comments.

I think there are two independent points that we're discussing here:
1 - Where we're putting the generated table
2 - How we specify the table.

Regarding (1):
The goal of this patch is to move the pfm counters out of the SchedModels (see PR39165), because different CPUs with different PFM counters can use the same sched model (e.g. sandybridge).

Sorry. I didn't realize this patch was trying to fix that issue.
The use case for this change was not very clear to me. I though it was just a simple: "how to inherit a default set of PMCs, so that I didn't have to specify the same set over and over..".

I still think it is a preferrable solution to not touch MCSubtargetInfo. That being said, I don't have a strong opinion; other reviewers are definitely more knowledgeable than me on exegesis..

Andrea

@RKSimon, any opinions ?

courbet requested review of this revision.Oct 15 2018, 11:59 PM

ping

Sorry for late reply!

It looks like we need to find a way to still embed this information in MCExtraProcessorInfo but allow for different CPUs PFM mappings.

One of the aims for PR39165 was to make it possible for llvm-exegesis to be run on CPUs with declared PFMs but without a model - allowing a report on raw resource usage for an instruction and to help create the model from scratch. @courbet @gchatelet do you think this would still be useful? Is it still an aim of llvm-exegesis to create models from scratch or just report on existing models?

Can you easily attach PFM mappings to the GenericModel/GenericPostRAModel?

In D52932#1270595, @RKSimon wrote:

It looks like we need to find a way to still embed this information in MCExtraProcessorInfo but allow for different CPUs PFM mappings.
Can you easily attach PFM mappings to the GenericModel/GenericPostRAModel?

I don't think an MCExtraProcessorInfo should store information for several CPUs, because MCExtraProcessorInfo is in MCSchedModel, which is selected by CPU, so that would be weird.

Before this change, we were putting the MCPfmCountersInfo inside MCSchedModel, which implicitly meant that there was a 1:1 mapping between them. This change decouples MCPfmCountersInfo from MCSchedModel.
A CPU chooses its SchedModel and PfmCounters independently.

Now the decision we have to make now is whether in TD files (see point (2) in my comment above):

(A) the CPU should declare its PfmCounters (this is the approach I'm taking here). Because that's very similar to what is done for MCSchedModels, I'm doing the same for PfmCounters, i.e. putting the table of Cpu->PfmCounter inside TargetSubtargetInfo, but that is actually independent from (see point (1) in my response to Andrea's comment above)
(B) the mapping of CPU->PfmCounter is kept separately (in X86PfmCounters.td).

One of the aims for PR39165 was to make it possible for llvm-exegesis to be run on CPUs with declared PFMs but without a model - allowing a report on raw resource usage for an instruction and to help create the model from scratch. @courbet @gchatelet do you think this would still be useful?
Is it still an aim of llvm-exegesis to create models from scratch or just report on existing models?

@lebedev.ri used llvm-exegesis to produce the bdverX model. At that point there was no SchedModel, so it still seems desirable to do this.

I don't think an MCExtraProcessorInfo should store information for several CPUs, because MCExtraProcessorInfo is in MCSchedModel, which is selected by CPU, so that would be weird.

You say that but we have multiple, very different, CPUs referencing the same model - Knights Landing uses the Haswell Model, the PileDriver model is likely to be used for the entire Bulldozer range, Sandy Bridge model is used as the default cpu - if you run llvm-exegesis in any of these cases apart for the 'main' cpu case it will just crash.

Before this change, we were putting the MCPfmCountersInfo inside MCSchedModel, which implicitly meant that there was a 1:1 mapping between them. This change decouples MCPfmCountersInfo from MCSchedModel.
A CPU chooses its SchedModel and PfmCounters independently.

Now the decision we have to make now is whether in TD files (see point (2) in my comment above):

(A) the CPU should declare its PfmCounters (this is the approach I'm taking here). Because that's very similar to what is done for MCSchedModels, I'm doing the same for PfmCounters, i.e. putting the table of Cpu->PfmCounter inside TargetSubtargetInfo, but that is actually independent from (see point (1) in my response to Andrea's comment above)

(B) the mapping of CPU->PfmCounter is kept separately (in X86PfmCounters.td).

(C) As the PFMs are only used by llvm-exegesis (and are likely to only every be) the PFM mappings should be moved entirely into llvm-exegesis code.

One of the aims for PR39165 was to make it possible for llvm-exegesis to be run on CPUs with declared PFMs but without a model - allowing a report on raw resource usage for an instruction and to help create the model from scratch. @courbet @gchatelet do you think this would still be useful?
Is it still an aim of llvm-exegesis to create models from scratch or just report on existing models?

@lebedev.ri used llvm-exegesis to produce the bdverX model. At that point there was no SchedModel, so it still seems desirable to do this.

IIRC @lebedev.ri had to create a copy of the btver2 model and then slowly iterate on it until it matched the bdver arch.

In D52932#1270712, @RKSimon wrote:

@lebedev.ri used llvm-exegesis to produce the bdverX model. At that point there was no SchedModel, so it still seems desirable to do this.

IIRC @lebedev.ri had to create a copy of the btver2 model and then slowly iterate on it

Correct.

until it matched the bdver arch.

In D52932#1270712, @RKSimon wrote:

I don't think an MCExtraProcessorInfo should store information for several CPUs, because MCExtraProcessorInfo is in MCSchedModel, which is selected by CPU, so that would be weird.

You say that but we have multiple, very different, CPUs referencing the same model - Knights Landing uses the Haswell Model, the PileDriver model is likely to be used for the entire Bulldozer range, Sandy Bridge model is used as the default cpu - if you run llvm-exegesis in any of these cases apart for the 'main' cpu case it will just crash.

That's exactly my point: Because we have very different CPUs referencing the same SchedModel, we should not store stuff that's going to vary from CPU to CPU (e.g. PfmCounters) inside the MCSchedModel.

Before this change, we were putting the MCPfmCountersInfo inside MCSchedModel, which implicitly meant that there was a 1:1 mapping between them. This change decouples MCPfmCountersInfo from MCSchedModel.
A CPU chooses its SchedModel and PfmCounters independently.

Now the decision we have to make now is whether in TD files (see point (2) in my comment above):

(A) the CPU should declare its PfmCounters (this is the approach I'm taking here). Because that's very similar to what is done for MCSchedModels, I'm doing the same for PfmCounters, i.e. putting the table of Cpu->PfmCounter inside TargetSubtargetInfo, but that is actually independent from (see point (1) in my response to Andrea's comment above)

(B) the mapping of CPU->PfmCounter is kept separately (in X86PfmCounters.td).

(C) As the PFMs are only used by llvm-exegesis (and are likely to only every be) the PFM mappings should be moved entirely into llvm-exegesis code.

This is choice (B) here, plus choice (1) in the comment I mentioned above, quoting myself:

"That being said, I'm not opposed to moving the table of CPU->MCPfmCountersInfo outside of the MCSubtargetInfo. One possible approach is to create a new PfmEmitter tablegen backend and PfmCounters library. If people think that's reasonable I can do that easily."

So it seems we have a plan now :)

One of the aims for PR39165 was to make it possible for llvm-exegesis to be run on CPUs with declared PFMs but without a model - allowing a report on raw resource usage for an instruction and to help create the model from scratch. @courbet @gchatelet do you think this would still be useful?
Is it still an aim of llvm-exegesis to create models from scratch or just report on existing models?

@lebedev.ri used llvm-exegesis to produce the bdverX model. At that point there was no SchedModel, so it still seems desirable to do this.

IIRC @lebedev.ri had to create a copy of the btver2 model and then slowly iterate on it until it matched the bdver arch.

Move pfm counters to ExegesisTarget. There is now a separate Exegesis TableGen backend.

Herald added subscribers: aheejin, javed.absar, mgorny, dschuff. · View Herald TranscriptOct 23 2018, 8:29 AM

Harbormaster completed remote builds in B24081: Diff 170656.Oct 23 2018, 8:30 AM

AArch64 is not yet tablegened.

Harbormaster completed remote builds in B24082: Diff 170658.Oct 23 2018, 8:38 AM

In D52932#1272557, @courbet wrote:

Move pfm counters to ExegesisTarget. There is now a separate Exegesis TableGen backend.

Thanks Clement!

I really like this approach. Overall, I think it is a much better design than the original one, since it avoid polluting MCSubtargetInfo.

Cheers
-Andrea

LGTM - thanks @courbet

This revision is now accepted and ready to land.Oct 24 2018, 11:22 AM

Thanks!

rebase

Harbormaster completed remote builds in B24188: Diff 171040.Oct 25 2018, 12:20 AM

Closed by commit rL345243: [MCSched] Bind PFM Counters to the CPUs instead of the SchedModel. (authored by courbet). · Explain WhyOct 25 2018, 12:46 AM

This revision was automatically updated to reflect the committed changes.

@courbet: Your commit seems to cause build issues:

http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/838

Thanks

In D52932#1275505, @CarlosAlbertoEnciso wrote:

@courbet: Your commit seems to cause build issues:

http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/838

Thanks

Simon fixed this issue at r345252
http://llvm.org/viewvc/llvm-project?view=revision&revision=345252

-Andrea

thakis added a subscriber: thakis.Dec 30 2018, 5:45 PM

thakis added inline comments.

llvm/trunk/lib/Target/X86/CMakeLists.txt
16 ↗	(On Diff #171043)	This list is in alphabetical order. Mind putting this entry in alphabetical order too?

thakis added inline comments.Dec 30 2018, 5:48 PM

llvm/trunk/lib/Target/X86/CMakeLists.txt
16 ↗	(On Diff #171043)	Also, it looks like X86GenExegesis.inc is only used on llvm/tools/lib/X86. Shouldn't this tablegen call be there?

thakis mentioned this in D56167: [gn build] Add some llvm/tools: llvm-exegesis, llvm-extract, llvm-link.Dec 30 2018, 5:56 PM

Diffusion mentioned this in rL350184: [gn build] Add some llvm/tools: llvm-exegesis, llvm-extract, llvm-link.Dec 31 2018, 3:52 PM

Revision Contents

Path

Size

docs/

CommandGuide/

llvm-exegesis.rst

4 lines

tblgen.rst

4 lines

include/

llvm/

MC/

MCSchedule.h

16 lines

Target/

Target.td

5 lines

TargetPfmCounters.td

46 lines

TargetSchedule.td

28 lines

lib/

Target/

X86/

CMakeLists.txt

1 line

X86PfmCounters.td

137 lines

tools/

llvm-exegesis/

lib/

AArch64/

4 lines

9 lines

8 lines

7 lines

39 lines

27 lines

20 lines

X86/

Target.cpp

6 lines

llvm-exegesis.cpp

11 lines

utils/

TableGen/

1 line

12 lines

31 lines

1 line

212 lines

86 lines

8 lines

1 line

Diff 171040

docs/CommandGuide/llvm-exegesis.rst

	Show First 20 Lines • Show All 218 Lines • ▼ Show 20 Lines

	Specify the numPoints parameters to be used for DBSCAN clustering			Specify the numPoints parameters to be used for DBSCAN clustering
	(`analysis` mode).			(`analysis` mode).

	.. option:: -ignore-invalid-sched-class=false			.. option:: -ignore-invalid-sched-class=false

	If set, ignore instructions that do not have a sched class (class idx = 0).			If set, ignore instructions that do not have a sched class (class idx = 0).

				.. option:: -mcpu=<cpu name>

				If set, measure the cpu characteristics using the counters for this CPU. This
				is useful when creating new sched models (the host CPU is unknown to LLVM).

	EXIT STATUS			EXIT STATUS
	-----------			-----------

	:program:`llvm-exegesis` returns 0 on success. Otherwise, an error message is			:program:`llvm-exegesis` returns 0 on success. Otherwise, an error message is
	printed to standard error, and the tool returns a non 0 value.			printed to standard error, and the tool returns a non 0 value.

docs/CommandGuide/tblgen.rst

	Show First 20 Lines • Show All 124 Lines • ▼ Show 20 Lines
	.. option:: -gen-tgt-intrinsic			.. option:: -gen-tgt-intrinsic

	Generate target intrinsic information.			Generate target intrinsic information.

	.. option:: -gen-enhanced-disassembly-info			.. option:: -gen-enhanced-disassembly-info

	Generate enhanced disassembly info.			Generate enhanced disassembly info.

				.. option:: -gen-exegesis

				Generate llvm-exegesis tables.

	.. option:: -version			.. option:: -version

	Show the version number of this program.			Show the version number of this program.

	EXIT STATUS			EXIT STATUS
	-----------			-----------

	If :program:`tblgen` succeeds, it will exit with 0. Otherwise, if an error			If :program:`tblgen` succeeds, it will exit with 0. Otherwise, if an error
	occurs, it will exit with a non-zero value.			occurs, it will exit with a non-zero value.

include/llvm/MC/MCSchedule.h

Show First 20 Lines • Show All 177 Lines • ▼ Show 20 Lines	struct MCExtraProcessorInfo {
// Actual size of the reorder buffer in hardware.		// Actual size of the reorder buffer in hardware.
unsigned ReorderBufferSize;		unsigned ReorderBufferSize;
// Number of instructions retired per cycle.		// Number of instructions retired per cycle.
unsigned MaxRetirePerCycle;		unsigned MaxRetirePerCycle;
const MCRegisterFileDesc *RegisterFiles;		const MCRegisterFileDesc *RegisterFiles;
unsigned NumRegisterFiles;		unsigned NumRegisterFiles;
const MCRegisterCostEntry *RegisterCostTable;		const MCRegisterCostEntry *RegisterCostTable;
unsigned NumRegisterCostEntries;		unsigned NumRegisterCostEntries;

struct PfmCountersInfo {
// An optional name of a performance counter that can be used to measure
// cycles.
const char *CycleCounter;

// An optional name of a performance counter that can be used to measure
// uops.
const char *UopsCounter;

// For each MCProcResourceDesc defined by the processor, an optional list of
// names of performance counters that can be used to measure the resource
// utilization.
const char **IssueCounters;
};
PfmCountersInfo PfmCounters;
};		};

/// Machine model for scheduling, bundling, and heuristics.		/// Machine model for scheduling, bundling, and heuristics.
///		///
/// The machine model directly provides basic information about the		/// The machine model directly provides basic information about the
/// microarchitecture to the scheduler in the form of properties. It also		/// microarchitecture to the scheduler in the form of properties. It also
		gchateletUnsubmitted Not Done Reply Inline Actions Why do you need both? gchatelet: Why do you need both?
		courbetAuthorUnsubmitted Not Done Reply Inline Actions For consistency with MCSchedModel (see getValueForCpu() below). courbet: For consistency with MCSchedModel (see getValueForCpu() below).
/// optionally refers to scheduler resource tables and itinerary		/// optionally refers to scheduler resource tables and itinerary
/// tables. Scheduler resource tables model the latency and cost for each		/// tables. Scheduler resource tables model the latency and cost for each
/// instruction type. Itinerary tables are an independent mechanism that		/// instruction type. Itinerary tables are an independent mechanism that
/// provides a detailed reservation table describing each cycle of instruction		/// provides a detailed reservation table describing each cycle of instruction
/// execution. Subtargets may define any or all of the above categories of data		/// execution. Subtargets may define any or all of the above categories of data
/// depending on the type of CPU and selected scheduler.		/// depending on the type of CPU and selected scheduler.
///		///
/// The machine independent properties defined here are used by the scheduler as		/// The machine independent properties defined here are used by the scheduler as
		gchateletUnsubmitted Not Done Reply Inline Actions What would `ProcResName` be is case of retired instruction for instance? gchatelet: What would `ProcResName` be is case of retired instruction for instance?
		courbetAuthorUnsubmitted Not Done Reply Inline Actions the retired counter is a scheduler concept, not tied to proc resources. It would go with `CycleCounter` and `UopsCounter` above. courbet: the retired counter is a scheduler concept, not tied to proc resources. It would go with…
/// an abstract machine model. A real micro-architecture has a number of		/// an abstract machine model. A real micro-architecture has a number of
/// buffers, queues, and stages. Declaring that a given machine-independent		/// buffers, queues, and stages. Declaring that a given machine-independent
/// abstract property corresponds to a specific physical property across all		/// abstract property corresponds to a specific physical property across all
/// subtargets can't be done. Nonetheless, the abstract model is		/// subtargets can't be done. Nonetheless, the abstract model is
/// useful. Futhermore, subtargets typically extend this model with processor		/// useful. Futhermore, subtargets typically extend this model with processor
/// specific resources to model any hardware features that can be exploited by		/// specific resources to model any hardware features that can be exploited by
/// sceduling heuristics and aren't sufficiently represented in the abstract.		/// sceduling heuristics and aren't sufficiently represented in the abstract.
///		///
▲ Show 20 Lines • Show All 171 Lines • Show Last 20 Lines

include/llvm/Target/Target.td

	Show First 20 Lines • Show All 1,549 Lines • ▼ Show 20 Lines
	// Pull in the common support for DAG isel generation.			// Pull in the common support for DAG isel generation.
	//			//
	include "llvm/Target/GlobalISel/Target.td"			include "llvm/Target/GlobalISel/Target.td"

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Pull in the common support for the Global ISel DAG-based selector generation.			// Pull in the common support for the Global ISel DAG-based selector generation.
	//			//
	include "llvm/Target/GlobalISel/SelectionDAGCompat.td"			include "llvm/Target/GlobalISel/SelectionDAGCompat.td"

				//===----------------------------------------------------------------------===//
				// Pull in the common support for Pfm Counters generation.
				//
				include "llvm/Target/TargetPfmCounters.td"

include/llvm/Target/TargetPfmCounters.td

This file was added.

				//===- TargetPfmCounters.td - Target Pfm Counters -- tablegen -----------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// This file defines the target-independent interfaces for performance counters.

				// Definition of a hardware counters from libpfm identifiers.
				class PfmCounter<string counter> {
				// The name of the counter that measures events.
				// The name can be "some_counter + some_other_counter", in which case the
				// measured value is the sum of events on these counters.
				string Counter = counter;
				}

				// Issue counters can be tied to a ProcResource
				class PfmIssueCounter<string resource_name, string counter>
				: PfmCounter<counter> {
				// The name of the ProcResource on which uops are issued. This is used by
				// llvm-exegesis to compare measurements with values in the SchedModels.
				// If the CPU has a sched model, this should correspond to the name of a
				// ProcResource.
				string ResourceName = resource_name;
				}

				def NoPfmCounter : PfmCounter <""> {}

				// Set of PfmCounters for measuring sched model characteristics.
				class ProcPfmCounters {
				// Processors can define how to measure cycles by defining a CycleCounter.
				PfmCounter CycleCounter = NoPfmCounter;
				// Processors can define how to measure uops by defining a UopsCounter.
				PfmCounter UopsCounter = NoPfmCounter;
				// Processors can define how to measure issued uops by defining IssueCounters.
				list<PfmIssueCounter> IssueCounters = [];
				}

				// A binding of a set of counters to a CPU.
				class PfmCountersBinding<string cpu_name, ProcPfmCounters counters> {
				string CpuName = cpu_name;
				ProcPfmCounters Counters = counters;
				}

include/llvm/Target/TargetSchedule.td

	Show First 20 Lines • Show All 551 Lines • ▼ Show 20 Lines
	// restrictions on the number of instructions retired per cycle".			// restrictions on the number of instructions retired per cycle".
	// Models can optionally specify up to one instance of RetireControlUnit per			// Models can optionally specify up to one instance of RetireControlUnit per
	// scheduling model.			// scheduling model.
	class RetireControlUnit<int bufferSize, int retirePerCycle> {			class RetireControlUnit<int bufferSize, int retirePerCycle> {
	int ReorderBufferSize = bufferSize;			int ReorderBufferSize = bufferSize;
	int MaxRetirePerCycle = retirePerCycle;			int MaxRetirePerCycle = retirePerCycle;
	SchedMachineModel SchedModel = ?;			SchedMachineModel SchedModel = ?;
	}			}

	// Allow the definition of hardware counters.
	class PfmCounter {
	SchedMachineModel SchedModel = ?;
	}

	// Each processor can define how to measure cycles by defining a
	// PfmCycleCounter.
	class PfmCycleCounter<string counter> : PfmCounter {
	string Counter = counter;
	}

	// Each ProcResourceUnits can define how to measure issued uops by defining
	// a PfmIssueCounter.
	class PfmIssueCounter<ProcResourceUnits resource, list<string> counters>
	: PfmCounter{
	// The resource units on which uops are issued.
	ProcResourceUnits Resource = resource;
	// The list of counters that measure issue events.
	list<string> Counters = counters;
	}

	// Each processor can define how to measure NumMicroOps by defining a
	// PfmUopsCounter.
	class PfmUopsCounter<string counter> : PfmCounter {
	string Counter = counter;
	}

lib/Target/X86/CMakeLists.txt

	set(LLVM_TARGET_DEFINITIONS X86.td)			set(LLVM_TARGET_DEFINITIONS X86.td)

	tablegen(LLVM X86GenAsmMatcher.inc -gen-asm-matcher)			tablegen(LLVM X86GenAsmMatcher.inc -gen-asm-matcher)
	tablegen(LLVM X86GenAsmWriter.inc -gen-asm-writer)			tablegen(LLVM X86GenAsmWriter.inc -gen-asm-writer)
	tablegen(LLVM X86GenAsmWriter1.inc -gen-asm-writer -asmwriternum=1)			tablegen(LLVM X86GenAsmWriter1.inc -gen-asm-writer -asmwriternum=1)
	tablegen(LLVM X86GenCallingConv.inc -gen-callingconv)			tablegen(LLVM X86GenCallingConv.inc -gen-callingconv)
	tablegen(LLVM X86GenDAGISel.inc -gen-dag-isel)			tablegen(LLVM X86GenDAGISel.inc -gen-dag-isel)
	tablegen(LLVM X86GenDisassemblerTables.inc -gen-disassembler)			tablegen(LLVM X86GenDisassemblerTables.inc -gen-disassembler)
	tablegen(LLVM X86GenEVEX2VEXTables.inc -gen-x86-EVEX2VEX-tables)			tablegen(LLVM X86GenEVEX2VEXTables.inc -gen-x86-EVEX2VEX-tables)
	tablegen(LLVM X86GenFastISel.inc -gen-fast-isel)			tablegen(LLVM X86GenFastISel.inc -gen-fast-isel)
	tablegen(LLVM X86GenGlobalISel.inc -gen-global-isel)			tablegen(LLVM X86GenGlobalISel.inc -gen-global-isel)
	tablegen(LLVM X86GenInstrInfo.inc -gen-instr-info)			tablegen(LLVM X86GenInstrInfo.inc -gen-instr-info)
	tablegen(LLVM X86GenRegisterBank.inc -gen-register-bank)			tablegen(LLVM X86GenRegisterBank.inc -gen-register-bank)
	tablegen(LLVM X86GenRegisterInfo.inc -gen-register-info)			tablegen(LLVM X86GenRegisterInfo.inc -gen-register-info)
	tablegen(LLVM X86GenSubtargetInfo.inc -gen-subtarget)			tablegen(LLVM X86GenSubtargetInfo.inc -gen-subtarget)
				tablegen(LLVM X86GenExegesis.inc -gen-exegesis)

	if (X86_GEN_FOLD_TABLES)			if (X86_GEN_FOLD_TABLES)
	tablegen(LLVM X86GenFoldTables.inc -gen-x86-fold-tables)			tablegen(LLVM X86GenFoldTables.inc -gen-x86-fold-tables)
	endif()			endif()

	add_public_tablegen_target(X86CommonTableGen)			add_public_tablegen_target(X86CommonTableGen)

	set(sources			set(sources
	▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

lib/Target/X86/X86PfmCounters.td

	//===-- X86PfmCounters.td - X86 Hardware Counters ----------- tablegen --===//			//===-- X86PfmCounters.td - X86 Hardware Counters ----------- tablegen --===//
	//			//
	// The LLVM Compiler Infrastructure			// The LLVM Compiler Infrastructure
	//			//
	// This file is distributed under the University of Illinois Open Source			// This file is distributed under the University of Illinois Open Source
	// License. See LICENSE.TXT for details.			// License. See LICENSE.TXT for details.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This describes the available hardware counters for various subtargets.			// This describes the available hardware counters for various subtargets.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	let SchedModel = SandyBridgeModel in {			def UnhaltedCoreCyclesPfmCounter : PfmCounter<"unhalted_core_cycles">;
	def SBCycleCounter : PfmCycleCounter<"unhalted_core_cycles">;			def UopsIssuedPfmCounter : PfmCounter<"uops_issued:any">;
	def SBPort0Counter : PfmIssueCounter<SBPort0, ["uops_dispatched_port:port_0"]>;
	def SBPort1Counter : PfmIssueCounter<SBPort1, ["uops_dispatched_port:port_1"]>;			def SandyBridgePfmCounters : ProcPfmCounters {
	def SBPort23Counter : PfmIssueCounter<SBPort23,			let CycleCounter = UnhaltedCoreCyclesPfmCounter;
	["uops_dispatched_port:port_2",			let UopsCounter = UopsIssuedPfmCounter;
	"uops_dispatched_port:port_3"]>;			let IssueCounters = [
	def SBPort4Counter : PfmIssueCounter<SBPort4, ["uops_dispatched_port:port_4"]>;			PfmIssueCounter<"SBPort0", "uops_dispatched_port:port_0">,
	def SBPort5Counter : PfmIssueCounter<SBPort5, ["uops_dispatched_port:port_5"]>;			PfmIssueCounter<"SBPort1", "uops_dispatched_port:port_1">,
	def SBUopsCounter : PfmUopsCounter<"uops_issued:any">;			PfmIssueCounter<"SBPort23", "uops_dispatched_port:port_2 + uops_dispatched_port:port_3">,
	}			PfmIssueCounter<"SBPort4", "uops_dispatched_port:port_4">,
				PfmIssueCounter<"SBPort5", "uops_dispatched_port:port_5">
	let SchedModel = HaswellModel in {			];
	def HWCycleCounter : PfmCycleCounter<"unhalted_core_cycles">;			}
	def HWPort0Counter : PfmIssueCounter<HWPort0, ["uops_dispatched_port:port_0"]>;			def : PfmCountersBinding<"sandybridge", SandyBridgePfmCounters>;
	def HWPort1Counter : PfmIssueCounter<HWPort1, ["uops_dispatched_port:port_1"]>;
	def HWPort2Counter : PfmIssueCounter<HWPort2, ["uops_dispatched_port:port_2"]>;			def HaswellPfmCounters : ProcPfmCounters {
	def HWPort3Counter : PfmIssueCounter<HWPort3, ["uops_dispatched_port:port_3"]>;			let CycleCounter = UnhaltedCoreCyclesPfmCounter;
	def HWPort4Counter : PfmIssueCounter<HWPort4, ["uops_dispatched_port:port_4"]>;			let UopsCounter = UopsIssuedPfmCounter;
	def HWPort5Counter : PfmIssueCounter<HWPort5, ["uops_dispatched_port:port_5"]>;			let IssueCounters = [
	def HWPort6Counter : PfmIssueCounter<HWPort6, ["uops_dispatched_port:port_6"]>;			PfmIssueCounter<"HWPort0", "uops_dispatched_port:port_0">,
	def HWPort7Counter : PfmIssueCounter<HWPort7, ["uops_dispatched_port:port_7"]>;			PfmIssueCounter<"HWPort1", "uops_dispatched_port:port_1">,
	def HWUopsCounter : PfmUopsCounter<"uops_issued:any">;			PfmIssueCounter<"HWPort2", "uops_dispatched_port:port_2">,
	}			PfmIssueCounter<"HWPort3", "uops_dispatched_port:port_3">,
				PfmIssueCounter<"HWPort4", "uops_dispatched_port:port_4">,
	let SchedModel = BroadwellModel in {			PfmIssueCounter<"HWPort5", "uops_dispatched_port:port_5">,
	def BWCycleCounter : PfmCycleCounter<"unhalted_core_cycles">;			PfmIssueCounter<"HWPort6", "uops_dispatched_port:port_6">,
	def BWPort0Counter : PfmIssueCounter<BWPort0, ["uops_executed_port:port_0"]>;			PfmIssueCounter<"HWPort7", "uops_dispatched_port:port_7">
	def BWPort1Counter : PfmIssueCounter<BWPort1, ["uops_executed_port:port_1"]>;			];
	def BWPort2Counter : PfmIssueCounter<BWPort2, ["uops_executed_port:port_2"]>;			}
	def BWPort3Counter : PfmIssueCounter<BWPort3, ["uops_executed_port:port_3"]>;			def : PfmCountersBinding<"haswell", HaswellPfmCounters>;
	def BWPort4Counter : PfmIssueCounter<BWPort4, ["uops_executed_port:port_4"]>;
	def BWPort5Counter : PfmIssueCounter<BWPort5, ["uops_executed_port:port_5"]>;			def BroadwellPfmCounters : ProcPfmCounters {
	def BWPort6Counter : PfmIssueCounter<BWPort6, ["uops_executed_port:port_6"]>;			let CycleCounter = UnhaltedCoreCyclesPfmCounter;
	def BWPort7Counter : PfmIssueCounter<BWPort7, ["uops_executed_port:port_7"]>;			let UopsCounter = UopsIssuedPfmCounter;
	def BWUopsCounter : PfmUopsCounter<"uops_issued:any">;			let IssueCounters = [
	}			PfmIssueCounter<"BWPort0", "uops_executed_port:port_0">,
				PfmIssueCounter<"BWPort1", "uops_executed_port:port_1">,
	let SchedModel = SkylakeClientModel in {			PfmIssueCounter<"BWPort2", "uops_executed_port:port_2">,
	def SKLCycleCounter : PfmCycleCounter<"unhalted_core_cycles">;			PfmIssueCounter<"BWPort3", "uops_executed_port:port_3">,
	def SKLPort0Counter : PfmIssueCounter<SKLPort0, ["uops_dispatched_port:port_0"]>;			PfmIssueCounter<"BWPort4", "uops_executed_port:port_4">,
	def SKLPort1Counter : PfmIssueCounter<SKLPort1, ["uops_dispatched_port:port_1"]>;			PfmIssueCounter<"BWPort5", "uops_executed_port:port_5">,
	def SKLPort2Counter : PfmIssueCounter<SKLPort2, ["uops_dispatched_port:port_2"]>;			PfmIssueCounter<"BWPort6", "uops_executed_port:port_6">,
	def SKLPort3Counter : PfmIssueCounter<SKLPort3, ["uops_dispatched_port:port_3"]>;			PfmIssueCounter<"BWPort7", "uops_executed_port:port_7">
	def SKLPort4Counter : PfmIssueCounter<SKLPort4, ["uops_dispatched_port:port_4"]>;			];
	def SKLPort5Counter : PfmIssueCounter<SKLPort5, ["uops_dispatched_port:port_5"]>;			}
	def SKLPort6Counter : PfmIssueCounter<SKLPort6, ["uops_dispatched_port:port_6"]>;			def : PfmCountersBinding<"broadwell", BroadwellPfmCounters>;
	def SKLPort7Counter : PfmIssueCounter<SKLPort7, ["uops_dispatched_port:port_7"]>;
	def SKLUopsCounter : PfmUopsCounter<"uops_issued:any">;			def SkylakeClientPfmCounters : ProcPfmCounters {
	}			let CycleCounter = UnhaltedCoreCyclesPfmCounter;
				let UopsCounter = UopsIssuedPfmCounter;
	let SchedModel = SkylakeServerModel in {			let IssueCounters = [
	def SKXCycleCounter : PfmCycleCounter<"unhalted_core_cycles">;			PfmIssueCounter<"SKLPort0", "uops_dispatched_port:port_0">,
	def SKXPort0Counter : PfmIssueCounter<SKXPort0, ["uops_dispatched_port:port_0"]>;			PfmIssueCounter<"SKLPort1", "uops_dispatched_port:port_1">,
	def SKXPort1Counter : PfmIssueCounter<SKXPort1, ["uops_dispatched_port:port_1"]>;			PfmIssueCounter<"SKLPort2", "uops_dispatched_port:port_2">,
	def SKXPort2Counter : PfmIssueCounter<SKXPort2, ["uops_dispatched_port:port_2"]>;			PfmIssueCounter<"SKLPort3", "uops_dispatched_port:port_3">,
	def SKXPort3Counter : PfmIssueCounter<SKXPort3, ["uops_dispatched_port:port_3"]>;			PfmIssueCounter<"SKLPort4", "uops_dispatched_port:port_4">,
	def SKXPort4Counter : PfmIssueCounter<SKXPort4, ["uops_dispatched_port:port_4"]>;			PfmIssueCounter<"SKLPort5", "uops_dispatched_port:port_5">,
	def SKXPort5Counter : PfmIssueCounter<SKXPort5, ["uops_dispatched_port:port_5"]>;			PfmIssueCounter<"SKLPort6", "uops_dispatched_port:port_6">,
	def SKXPort6Counter : PfmIssueCounter<SKXPort6, ["uops_dispatched_port:port_6"]>;			PfmIssueCounter<"SKLPort7", "uops_dispatched_port:port_7">
	def SKXPort7Counter : PfmIssueCounter<SKXPort7, ["uops_dispatched_port:port_7"]>;			];
	def SKXUopsCounter : PfmUopsCounter<"uops_issued:any">;			}
	}			def : PfmCountersBinding<"skylake", SkylakeClientPfmCounters>;

	let SchedModel = BtVer2Model in {			def SkylakeServerPfmCounters : ProcPfmCounters {
	def JCycleCounter : PfmCycleCounter<"cpu_clk_unhalted">;			let CycleCounter = UnhaltedCoreCyclesPfmCounter;
	def JUopsCounter : PfmUopsCounter<"retired_uops">;			let UopsCounter = UopsIssuedPfmCounter;
	def JFPU0Counter : PfmIssueCounter<JFPU0, ["dispatched_fpu:pipe0"]>;			let IssueCounters = [
	def JFPU1Counter : PfmIssueCounter<JFPU1, ["dispatched_fpu:pipe1"]>;			PfmIssueCounter<"SKXPort0", "uops_dispatched_port:port_0">,
				PfmIssueCounter<"SKXPort1", "uops_dispatched_port:port_1">,
				PfmIssueCounter<"SKXPort2", "uops_dispatched_port:port_2">,
				PfmIssueCounter<"SKXPort3", "uops_dispatched_port:port_3">,
				PfmIssueCounter<"SKXPort4", "uops_dispatched_port:port_4">,
				PfmIssueCounter<"SKXPort5", "uops_dispatched_port:port_5">,
				PfmIssueCounter<"SKXPort6", "uops_dispatched_port:port_6">,
				PfmIssueCounter<"SKXPort7", "uops_dispatched_port:port_7">
				];
				}
				def : PfmCountersBinding<"skylake-avx512", SkylakeServerPfmCounters>;

				def BtVer2PfmCounters : ProcPfmCounters {
				let CycleCounter = PfmCounter<"cpu_clk_unhalted">;
				let UopsCounter = PfmCounter<"retired_uops">;
				let IssueCounters = [
				PfmIssueCounter<"JFPU0", "dispatched_fpu:pipe0">,
				PfmIssueCounter<"JFPU1", "dispatched_fpu:pipe1">
				];
	}			}
				def : PfmCountersBinding<"btver2", BtVer2PfmCounters>;

tools/llvm-exegesis/lib/AArch64/Target.cpp

Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	static llvm::MCInst loadImmediate(unsigned Reg, unsigned RegBitWidth,
return llvm::MCInstBuilder(getLoadImmediateOpcode(RegBitWidth))		return llvm::MCInstBuilder(getLoadImmediateOpcode(RegBitWidth))
.addReg(Reg)		.addReg(Reg)
.addImm(Value.getZExtValue());		.addImm(Value.getZExtValue());
}		}

} // namespace		} // namespace

class ExegesisAArch64Target : public ExegesisTarget {		class ExegesisAArch64Target : public ExegesisTarget {
		public:
		ExegesisAArch64Target() : ExegesisTarget({}) {}

		private:
std::vector<llvm::MCInst> setRegTo(const llvm::MCSubtargetInfo &STI,		std::vector<llvm::MCInst> setRegTo(const llvm::MCSubtargetInfo &STI,
unsigned Reg,		unsigned Reg,
const llvm::APInt &Value) const override {		const llvm::APInt &Value) const override {
if (llvm::AArch64::GPR32RegClass.contains(Reg))		if (llvm::AArch64::GPR32RegClass.contains(Reg))
return {loadImmediate(Reg, 32, Value)};		return {loadImmediate(Reg, 32, Value)};
if (llvm::AArch64::GPR64RegClass.contains(Reg))		if (llvm::AArch64::GPR64RegClass.contains(Reg))
return {loadImmediate(Reg, 64, Value)};		return {loadImmediate(Reg, 64, Value)};
llvm::errs() << "setRegTo is not implemented, results will be unreliable\n";		llvm::errs() << "setRegTo is not implemented, results will be unreliable\n";
Show All 31 Lines

tools/llvm-exegesis/lib/Latency.cpp

//===-- Latency.cpp ---------------------------------------------- C++ --===//		//===-- Latency.cpp ---------------------------------------------- C++ --===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "Latency.h"		#include "Latency.h"

#include "Assembler.h"		#include "Assembler.h"
#include "BenchmarkRunner.h"		#include "BenchmarkRunner.h"
#include "MCInstrDescView.h"		#include "MCInstrDescView.h"
		#include "PerfHelper.h"
		#include "Target.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/MC/MCInst.h"		#include "llvm/MC/MCInst.h"
#include "llvm/MC/MCInstBuilder.h"		#include "llvm/MC/MCInstBuilder.h"
#include "llvm/Support/FormatVariadic.h"		#include "llvm/Support/FormatVariadic.h"

namespace llvm {		namespace llvm {
namespace exegesis {		namespace exegesis {

▲ Show 20 Lines • Show All 137 Lines • ▼ Show 20 Lines	LatencySnippetGenerator::generateCodeTemplates(const Instruction &Instr) const {
}		}
if (Results.empty())		if (Results.empty())
return llvm::make_error<BenchmarkFailure>(		return llvm::make_error<BenchmarkFailure>(
"No strategy found to make the execution serial");		"No strategy found to make the execution serial");
return std::move(Results);		return std::move(Results);
}		}

const char *LatencyBenchmarkRunner::getCounterName() const {		const char *LatencyBenchmarkRunner::getCounterName() const {
if (!State.getSubtargetInfo().getSchedModel().hasExtraProcessorInfo())		const char *CounterName = State.getPfmCounters().CycleCounter;
llvm::report_fatal_error("sched model is missing extra processor info!");
const char *CounterName = State.getSubtargetInfo()
.getSchedModel()
.getExtraProcessorInfo()
.PfmCounters.CycleCounter;
if (!CounterName)		if (!CounterName)
llvm::report_fatal_error("sched model does not define a cycle counter");		llvm::report_fatal_error("sched model does not define a cycle counter");
return CounterName;		return CounterName;
}		}

LatencyBenchmarkRunner::~LatencyBenchmarkRunner() = default;		LatencyBenchmarkRunner::~LatencyBenchmarkRunner() = default;

llvm::Expected<std::vector<BenchmarkMeasure>>		llvm::Expected<std::vector<BenchmarkMeasure>>
Show All 23 Lines

tools/llvm-exegesis/lib/LlvmState.h

	Show All 24 Lines
	#include "llvm/Target/TargetMachine.h"			#include "llvm/Target/TargetMachine.h"
	#include <memory>			#include <memory>
	#include <string>			#include <string>

	namespace llvm {			namespace llvm {
	namespace exegesis {			namespace exegesis {

	class ExegesisTarget;			class ExegesisTarget;
				class PfmCountersInfo;

	// An object to initialize LLVM and prepare objects needed to run the			// An object to initialize LLVM and prepare objects needed to run the
	// measurements.			// measurements.
	class LLVMState {			class LLVMState {
	public:			public:
	LLVMState();			// Uses the host triple. If CpuName is empty, uses the host CPU.
				LLVMState(const std::string &CpuName);

	LLVMState(const std::string &Triple,			LLVMState(const std::string &Triple,
	const std::string &CpuName); // For tests.			const std::string &CpuName); // For tests.

	const llvm::TargetMachine &getTargetMachine() const { return *TargetMachine; }			const llvm::TargetMachine &getTargetMachine() const { return *TargetMachine; }
	std::unique_ptr<llvm::LLVMTargetMachine> createTargetMachine() const;			std::unique_ptr<llvm::LLVMTargetMachine> createTargetMachine() const;

	const ExegesisTarget &getExegesisTarget() const { return *TheExegesisTarget; }			const ExegesisTarget &getExegesisTarget() const { return *TheExegesisTarget; }

	bool canAssemble(const llvm::MCInst &mc_inst) const;			bool canAssemble(const llvm::MCInst &mc_inst) const;

	// For convenience:			// For convenience:
	const llvm::MCInstrInfo &getInstrInfo() const {			const llvm::MCInstrInfo &getInstrInfo() const {
	return *TargetMachine->getMCInstrInfo();			return *TargetMachine->getMCInstrInfo();
	}			}
	const llvm::MCRegisterInfo &getRegInfo() const {			const llvm::MCRegisterInfo &getRegInfo() const {
	return *TargetMachine->getMCRegisterInfo();			return *TargetMachine->getMCRegisterInfo();
	}			}
	const llvm::MCSubtargetInfo &getSubtargetInfo() const {			const llvm::MCSubtargetInfo &getSubtargetInfo() const {
	return *TargetMachine->getMCSubtargetInfo();			return *TargetMachine->getMCSubtargetInfo();
	}			}

	const RegisterAliasingTrackerCache &getRATC() const { return *RATC; }			const RegisterAliasingTrackerCache &getRATC() const { return *RATC; }
	const InstructionsCache &getIC() const { return *IC; }			const InstructionsCache &getIC() const { return *IC; }

				const PfmCountersInfo &getPfmCounters() const { return *PfmCounters; }

	private:			private:
	const ExegesisTarget *TheExegesisTarget;			const ExegesisTarget *TheExegesisTarget;
	std::unique_ptr<const llvm::TargetMachine> TargetMachine;			std::unique_ptr<const llvm::TargetMachine> TargetMachine;
	std::unique_ptr<const RegisterAliasingTrackerCache> RATC;			std::unique_ptr<const RegisterAliasingTrackerCache> RATC;
	std::unique_ptr<const InstructionsCache> IC;			std::unique_ptr<const InstructionsCache> IC;
				const PfmCountersInfo *PfmCounters;
	};			};

	} // namespace exegesis			} // namespace exegesis
	} // namespace llvm			} // namespace llvm

	#endif // LLVM_TOOLS_LLVM_EXEGESIS_LLVMSTATE_H			#endif // LLVM_TOOLS_LLVM_EXEGESIS_LLVMSTATE_H

tools/llvm-exegesis/lib/LlvmState.cpp

Show All 30 Lines	LLVMState::LLVMState(const std::string &Triple, const std::string &CpuName) {
TargetMachine.reset(static_cast<llvm::LLVMTargetMachine *>(		TargetMachine.reset(static_cast<llvm::LLVMTargetMachine *>(
TheTarget->createTargetMachine(Triple, CpuName, /Features/ "", Options,		TheTarget->createTargetMachine(Triple, CpuName, /Features/ "", Options,
llvm::Reloc::Model::Static)));		llvm::Reloc::Model::Static)));
TheExegesisTarget = ExegesisTarget::lookup(TargetMachine->getTargetTriple());		TheExegesisTarget = ExegesisTarget::lookup(TargetMachine->getTargetTriple());
if (!TheExegesisTarget) {		if (!TheExegesisTarget) {
llvm::errs() << "no exegesis target for " << Triple << ", using default\n";		llvm::errs() << "no exegesis target for " << Triple << ", using default\n";
TheExegesisTarget = &ExegesisTarget::getDefault();		TheExegesisTarget = &ExegesisTarget::getDefault();
}		}
		PfmCounters = &TheExegesisTarget->getPfmCounters(CpuName);

RATC.reset(new RegisterAliasingTrackerCache(		RATC.reset(new RegisterAliasingTrackerCache(
getRegInfo(), getFunctionReservedRegs(getTargetMachine())));		getRegInfo(), getFunctionReservedRegs(getTargetMachine())));
IC.reset(new InstructionsCache(getInstrInfo(), getRATC()));		IC.reset(new InstructionsCache(getInstrInfo(), getRATC()));
}		}

LLVMState::LLVMState()		LLVMState::LLVMState(const std::string &CpuName)
: LLVMState(llvm::sys::getProcessTriple(),		: LLVMState(llvm::sys::getProcessTriple(),
llvm::sys::getHostCPUName().str()) {}		CpuName.empty() ? llvm::sys::getHostCPUName().str() : CpuName) {
		}

std::unique_ptr<llvm::LLVMTargetMachine>		std::unique_ptr<llvm::LLVMTargetMachine>
LLVMState::createTargetMachine() const {		LLVMState::createTargetMachine() const {
return std::unique_ptr<llvm::LLVMTargetMachine>(		return std::unique_ptr<llvm::LLVMTargetMachine>(
static_cast<llvm::LLVMTargetMachine *>(		static_cast<llvm::LLVMTargetMachine *>(
TargetMachine->getTarget().createTargetMachine(		TargetMachine->getTarget().createTargetMachine(
TargetMachine->getTargetTriple().normalize(),		TargetMachine->getTargetTriple().normalize(),
TargetMachine->getTargetCPU(),		TargetMachine->getTargetCPU(),
Show All 22 Lines

tools/llvm-exegesis/lib/Target.h

Show All 25 Lines
#include "llvm/IR/CallingConv.h"		#include "llvm/IR/CallingConv.h"
#include "llvm/IR/LegacyPassManager.h"		#include "llvm/IR/LegacyPassManager.h"
#include "llvm/MC/MCInst.h"		#include "llvm/MC/MCInst.h"
#include "llvm/MC/MCRegisterInfo.h"		#include "llvm/MC/MCRegisterInfo.h"

namespace llvm {		namespace llvm {
namespace exegesis {		namespace exegesis {

		struct PfmCountersInfo {
		// An optional name of a performance counter that can be used to measure
		// cycles.
		const char *const CycleCounter;

		// An optional name of a performance counter that can be used to measure
		// uops.
		const char *const UopsCounter;

		// An IssueCounter specifies how to measure uops issued to specific proc
		// resources.
		struct IssueCounter {
		const char *const Counter;
		// The name of the ProcResource that this counter measures.
		const char *const ProcResName;
		};
		// An optional list of IssueCounters.
		const IssueCounter *const IssueCounters;
		const unsigned NumIssueCounters;

		static const PfmCountersInfo Default;
		};

		struct CpuAndPfmCounters {
		const char *const CpuName;
		const PfmCountersInfo *const PCI;
		bool operator<(llvm::StringRef S) const {
		return llvm::StringRef(CpuName) < S;
		}
		};

class ExegesisTarget {		class ExegesisTarget {
public:		public:
		explicit ExegesisTarget(llvm::ArrayRef<CpuAndPfmCounters> CpuPfmCounters)
		: CpuPfmCounters(CpuPfmCounters) {}

// Targets can use this to add target-specific passes in assembleToStream();		// Targets can use this to add target-specific passes in assembleToStream();
virtual void addTargetSpecificPasses(llvm::PassManagerBase &PM) const {}		virtual void addTargetSpecificPasses(llvm::PassManagerBase &PM) const {}

// Generates code to move a constant into a the given register.		// Generates code to move a constant into a the given register.
// Precondition: Value must fit into Reg.		// Precondition: Value must fit into Reg.
virtual std::vector<llvm::MCInst>		virtual std::vector<llvm::MCInst>
setRegTo(const llvm::MCSubtargetInfo &STI, unsigned Reg,		setRegTo(const llvm::MCSubtargetInfo &STI, unsigned Reg,
const llvm::APInt &Value) const = 0;		const llvm::APInt &Value) const = 0;
Show All 34 Lines	public:
static const ExegesisTarget *lookup(llvm::Triple TT);		static const ExegesisTarget *lookup(llvm::Triple TT);
// Returns the default (unspecialized) ExegesisTarget.		// Returns the default (unspecialized) ExegesisTarget.
static const ExegesisTarget &getDefault();		static const ExegesisTarget &getDefault();
// Registers a target. Not thread safe.		// Registers a target. Not thread safe.
static void registerTarget(ExegesisTarget *T);		static void registerTarget(ExegesisTarget *T);

virtual ~ExegesisTarget();		virtual ~ExegesisTarget();

		// Returns the Pfm counters for the given CPU (or the default if no pfm
		// counters are defined for this CPU).
		const PfmCountersInfo &getPfmCounters(llvm::StringRef CpuName) const;

private:		private:
virtual bool matchesArch(llvm::Triple::ArchType Arch) const = 0;		virtual bool matchesArch(llvm::Triple::ArchType Arch) const = 0;

// Targets can implement their own snippet generators/benchmarks runners by		// Targets can implement their own snippet generators/benchmarks runners by
// implementing these.		// implementing these.
std::unique_ptr<SnippetGenerator> virtual createLatencySnippetGenerator(		std::unique_ptr<SnippetGenerator> virtual createLatencySnippetGenerator(
const LLVMState &State) const;		const LLVMState &State) const;
std::unique_ptr<SnippetGenerator> virtual createUopsSnippetGenerator(		std::unique_ptr<SnippetGenerator> virtual createUopsSnippetGenerator(
const LLVMState &State) const;		const LLVMState &State) const;
std::unique_ptr<BenchmarkRunner> virtual createLatencyBenchmarkRunner(		std::unique_ptr<BenchmarkRunner> virtual createLatencyBenchmarkRunner(
const LLVMState &State) const;		const LLVMState &State) const;
std::unique_ptr<BenchmarkRunner> virtual createUopsBenchmarkRunner(		std::unique_ptr<BenchmarkRunner> virtual createUopsBenchmarkRunner(
const LLVMState &State) const;		const LLVMState &State) const;

const ExegesisTarget *Next = nullptr;		const ExegesisTarget *Next = nullptr;
		const llvm::ArrayRef<CpuAndPfmCounters> CpuPfmCounters;
};		};

} // namespace exegesis		} // namespace exegesis
} // namespace llvm		} // namespace llvm

#endif // LLVM_TOOLS_LLVM_EXEGESIS_TARGET_H		#endif // LLVM_TOOLS_LLVM_EXEGESIS_TARGET_H

tools/llvm-exegesis/lib/Target.cpp

Show First 20 Lines • Show All 79 Lines • ▼ Show 20 Lines	ExegesisTarget::createLatencyBenchmarkRunner(const LLVMState &State) const {
return llvm::make_unique<LatencyBenchmarkRunner>(State);		return llvm::make_unique<LatencyBenchmarkRunner>(State);
}		}

std::unique_ptr<BenchmarkRunner>		std::unique_ptr<BenchmarkRunner>
ExegesisTarget::createUopsBenchmarkRunner(const LLVMState &State) const {		ExegesisTarget::createUopsBenchmarkRunner(const LLVMState &State) const {
return llvm::make_unique<UopsBenchmarkRunner>(State);		return llvm::make_unique<UopsBenchmarkRunner>(State);
}		}

		static_assert(std::is_pod<PfmCountersInfo>::value,
		"We shouldn't have dynamic initialization here");
		const PfmCountersInfo PfmCountersInfo::Default = {nullptr, nullptr, nullptr};

		const PfmCountersInfo &
		ExegesisTarget::getPfmCounters(llvm::StringRef CpuName) const {
		assert(std::is_sorted(
		CpuPfmCounters.begin(), CpuPfmCounters.end(),
		[](const CpuAndPfmCounters &LHS, const CpuAndPfmCounters &RHS) {
		return strcmp(LHS.CpuName, RHS.CpuName) < 0;
		}) &&
		"CpuPfmCounters table is not sorted");

		// Find entry
		auto Found =
		std::lower_bound(CpuPfmCounters.begin(), CpuPfmCounters.end(), CpuName);
		if (Found == CpuPfmCounters.end() \|\|
		llvm::StringRef(Found->CpuName) != CpuName) {
		return PfmCountersInfo::Default;
		}
		assert(Found->PCI && "Missing counters");
		return *Found->PCI;
		}

namespace {		namespace {

// Default implementation.		// Default implementation.
class ExegesisDefaultTarget : public ExegesisTarget {		class ExegesisDefaultTarget : public ExegesisTarget {
		public:
		ExegesisDefaultTarget() : ExegesisTarget({}) {}

private:		private:
std::vector<llvm::MCInst> setRegTo(const llvm::MCSubtargetInfo &STI,		std::vector<llvm::MCInst> setRegTo(const llvm::MCSubtargetInfo &STI,
unsigned Reg,		unsigned Reg,
const llvm::APInt &Value) const override {		const llvm::APInt &Value) const override {
llvm_unreachable("Not yet implemented");		llvm_unreachable("Not yet implemented");
}		}

bool matchesArch(llvm::Triple::ArchType Arch) const override {		bool matchesArch(llvm::Triple::ArchType Arch) const override {
Show All 14 Lines

tools/llvm-exegesis/lib/Uops.cpp

Show First 20 Lines • Show All 217 Lines • ▼ Show 20 Lines	CT.Info =
"instruction has no tied variables picking Uses different from defs";		"instruction has no tied variables picking Uses different from defs";
CT.Instructions.push_back(std::move(IT));		CT.Instructions.push_back(std::move(IT));
instantiateMemoryOperands(CT.ScratchSpacePointerInReg, CT.Instructions);		instantiateMemoryOperands(CT.ScratchSpacePointerInReg, CT.Instructions);
return getSingleton(std::move(CT));		return getSingleton(std::move(CT));
}		}

llvm::Expected<std::vector<BenchmarkMeasure>>		llvm::Expected<std::vector<BenchmarkMeasure>>
UopsBenchmarkRunner::runMeasurements(const FunctionExecutor &Executor) const {		UopsBenchmarkRunner::runMeasurements(const FunctionExecutor &Executor) const {
const auto &SchedModel = State.getSubtargetInfo().getSchedModel();

std::vector<BenchmarkMeasure> Result;		std::vector<BenchmarkMeasure> Result;
const auto &PfmCounters = SchedModel.getExtraProcessorInfo().PfmCounters;		const PfmCountersInfo &PCI = State.getPfmCounters();
// Uops per port.		// Uops per port.
for (unsigned ProcResIdx = 1;		for (const auto *IssueCounter = PCI.IssueCounters,
ProcResIdx < SchedModel.getNumProcResourceKinds(); ++ProcResIdx) {		*IssueCounterEnd = PCI.IssueCounters + PCI.NumIssueCounters;
const char *const Counters = PfmCounters.IssueCounters[ProcResIdx];		IssueCounter != IssueCounterEnd; ++IssueCounter) {
if (!Counters)		if (!IssueCounter->Counter)
continue;		continue;
auto ExpectedCounterValue = Executor.runAndMeasure(Counters);		auto ExpectedCounterValue = Executor.runAndMeasure(IssueCounter->Counter);
if (!ExpectedCounterValue)		if (!ExpectedCounterValue)
return ExpectedCounterValue.takeError();		return ExpectedCounterValue.takeError();
Result.push_back(BenchmarkMeasure::Create(		Result.push_back(BenchmarkMeasure::Create(IssueCounter->ProcResName,
SchedModel.getProcResource(ProcResIdx)->Name, *ExpectedCounterValue));		*ExpectedCounterValue));
}		}
// NumMicroOps.		// NumMicroOps.
if (const char *const UopsCounter = PfmCounters.UopsCounter) {		if (const char *const UopsCounter = PCI.UopsCounter) {
auto ExpectedCounterValue = Executor.runAndMeasure(UopsCounter);		auto ExpectedCounterValue = Executor.runAndMeasure(UopsCounter);
if (!ExpectedCounterValue)		if (!ExpectedCounterValue)
return ExpectedCounterValue.takeError();		return ExpectedCounterValue.takeError();
Result.push_back(		Result.push_back(
BenchmarkMeasure::Create("NumMicroOps", *ExpectedCounterValue));		BenchmarkMeasure::Create("NumMicroOps", *ExpectedCounterValue));
}		}
return std::move(Result);		return std::move(Result);
}		}

constexpr const size_t UopsSnippetGenerator::kMinNumDifferentAddresses;		constexpr const size_t UopsSnippetGenerator::kMinNumDifferentAddresses;

} // namespace exegesis		} // namespace exegesis
} // namespace llvm		} // namespace llvm

tools/llvm-exegesis/lib/X86/Target.cpp

Show First 20 Lines • Show All 323 Lines • ▼ Show 20 Lines	if (Bytes - ByteOffset >= 1)
llvm::X86::MOV8mi, ByteOffset,		llvm::X86::MOV8mi, ByteOffset,
WideConstant.extractBits(8, ByteOffset * 8).getZExtValue()));		WideConstant.extractBits(8, ByteOffset * 8).getZExtValue()));
}		}

llvm::APInt Constant_;		llvm::APInt Constant_;
std::vector<llvm::MCInst> Instructions;		std::vector<llvm::MCInst> Instructions;
};		};

		#include "X86GenExegesis.inc"

class ExegesisX86Target : public ExegesisTarget {		class ExegesisX86Target : public ExegesisTarget {
		public:
		ExegesisX86Target() : ExegesisTarget(X86CpuPfmCounters) {}

		private:
void addTargetSpecificPasses(llvm::PassManagerBase &PM) const override {		void addTargetSpecificPasses(llvm::PassManagerBase &PM) const override {
// Lowers FP pseudo-instructions, e.g. ABS_Fp32 -> ABS_F.		// Lowers FP pseudo-instructions, e.g. ABS_Fp32 -> ABS_F.
PM.add(llvm::createX86FloatingPointStackifierPass());		PM.add(llvm::createX86FloatingPointStackifierPass());
}		}

unsigned getScratchMemoryRegister(const llvm::Triple &TT) const override {		unsigned getScratchMemoryRegister(const llvm::Triple &TT) const override {
if (!TT.isArch64Bit()) {		if (!TT.isArch64Bit()) {
// FIXME: This would require popping from the stack, so we would have to		// FIXME: This would require popping from the stack, so we would have to
▲ Show 20 Lines • Show All 87 Lines • Show Last 20 Lines

tools/llvm-exegesis/llvm-exegesis.cpp

Show First 20 Lines • Show All 88 Lines • ▼ Show 20 Lines

static cl::opt<std::string>		static cl::opt<std::string>
AnalysisClustersOutputFile("analysis-clusters-output-file", cl::desc(""),		AnalysisClustersOutputFile("analysis-clusters-output-file", cl::desc(""),
cl::init("-"));		cl::init("-"));
static cl::opt<std::string>		static cl::opt<std::string>
AnalysisInconsistenciesOutputFile("analysis-inconsistencies-output-file",		AnalysisInconsistenciesOutputFile("analysis-inconsistencies-output-file",
cl::desc(""), cl::init("-"));		cl::desc(""), cl::init("-"));

		static cl::opt<std::string>
		CpuName("mcpu",
		cl::desc(
		"cpu name to use for pfm counters, leave empty to autodetect"),
		cl::init(""));


static ExitOnError ExitOnErr;		static ExitOnError ExitOnErr;

#ifdef LLVM_EXEGESIS_INITIALIZE_NATIVE_TARGET		#ifdef LLVM_EXEGESIS_INITIALIZE_NATIVE_TARGET
void LLVM_EXEGESIS_INITIALIZE_NATIVE_TARGET();		void LLVM_EXEGESIS_INITIALIZE_NATIVE_TARGET();
#endif		#endif

// Checks that only one of OpcodeNames, OpcodeIndex or SnippetsFile is provided,		// Checks that only one of OpcodeNames, OpcodeIndex or SnippetsFile is provided,
// and returns the opcode indices or {} if snippets should be read from		// and returns the opcode indices or {} if snippets should be read from
▲ Show 20 Lines • Show All 211 Lines • ▼ Show 20 Lines	void benchmarkMain() {

llvm::InitializeNativeTarget();		llvm::InitializeNativeTarget();
llvm::InitializeNativeTargetAsmPrinter();		llvm::InitializeNativeTargetAsmPrinter();
llvm::InitializeNativeTargetAsmParser();		llvm::InitializeNativeTargetAsmParser();
#ifdef LLVM_EXEGESIS_INITIALIZE_NATIVE_TARGET		#ifdef LLVM_EXEGESIS_INITIALIZE_NATIVE_TARGET
LLVM_EXEGESIS_INITIALIZE_NATIVE_TARGET();		LLVM_EXEGESIS_INITIALIZE_NATIVE_TARGET();
#endif		#endif

const LLVMState State;		const LLVMState State(CpuName);
const auto Opcodes = getOpcodesOrDie(State.getInstrInfo());		const auto Opcodes = getOpcodesOrDie(State.getInstrInfo());

std::vector<BenchmarkCode> Configurations;		std::vector<BenchmarkCode> Configurations;
if (!Opcodes.empty()) {		if (!Opcodes.empty()) {
for (const unsigned Opcode : Opcodes) {		for (const unsigned Opcode : Opcodes) {
// Ignore instructions without a sched class if		// Ignore instructions without a sched class if
// -ignore-invalid-sched-class is passed.		// -ignore-invalid-sched-class is passed.
if (IgnoreInvalidSchedClass &&		if (IgnoreInvalidSchedClass &&
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines
static void analysisMain() {		static void analysisMain() {
if (BenchmarkFile.empty())		if (BenchmarkFile.empty())
llvm::report_fatal_error("--benchmarks-file must be set.");		llvm::report_fatal_error("--benchmarks-file must be set.");

llvm::InitializeNativeTarget();		llvm::InitializeNativeTarget();
llvm::InitializeNativeTargetAsmPrinter();		llvm::InitializeNativeTargetAsmPrinter();
llvm::InitializeNativeTargetDisassembler();		llvm::InitializeNativeTargetDisassembler();
// Read benchmarks.		// Read benchmarks.
const LLVMState State;		const LLVMState State("");
const std::vector<InstructionBenchmark> Points =		const std::vector<InstructionBenchmark> Points =
ExitOnErr(InstructionBenchmark::readYamls(State, BenchmarkFile));		ExitOnErr(InstructionBenchmark::readYamls(State, BenchmarkFile));
llvm::outs() << "Parsed " << Points.size() << " benchmark points\n";		llvm::outs() << "Parsed " << Points.size() << " benchmark points\n";
if (Points.empty()) {		if (Points.empty()) {
llvm::errs() << "no benchmarks to analyze\n";		llvm::errs() << "no benchmarks to analyze\n";
return;		return;
}		}
// FIXME: Check that all points have the same triple/cpu.		// FIXME: Check that all points have the same triple/cpu.
▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines

utils/TableGen/CMakeLists.txt

Show All 15 Lines	add_tablegen(llvm-tblgen LLVM
CodeGenTarget.cpp		CodeGenTarget.cpp
DAGISelEmitter.cpp		DAGISelEmitter.cpp
DAGISelMatcherEmitter.cpp		DAGISelMatcherEmitter.cpp
DAGISelMatcherGen.cpp		DAGISelMatcherGen.cpp
DAGISelMatcherOpt.cpp		DAGISelMatcherOpt.cpp
DAGISelMatcher.cpp		DAGISelMatcher.cpp
DFAPacketizerEmitter.cpp		DFAPacketizerEmitter.cpp
DisassemblerEmitter.cpp		DisassemblerEmitter.cpp
		ExegesisEmitter.cpp
FastISelEmitter.cpp		FastISelEmitter.cpp
FixedLenDecoderEmitter.cpp		FixedLenDecoderEmitter.cpp
GlobalISelEmitter.cpp		GlobalISelEmitter.cpp
InfoByHwMode.cpp		InfoByHwMode.cpp
InstrInfoEmitter.cpp		InstrInfoEmitter.cpp
InstrDocsEmitter.cpp		InstrDocsEmitter.cpp
IntrinsicEmitter.cpp		IntrinsicEmitter.cpp
OptParserEmitter.cpp		OptParserEmitter.cpp
Show All 20 Lines

utils/TableGen/CodeGenSchedule.h

Show First 20 Lines • Show All 240 Lines • ▼ Show 20 Lines	struct CodeGenProcModel {
RecVec ProcResourceDefs;		RecVec ProcResourceDefs;

// List of Register Files.		// List of Register Files.
std::vector<CodeGenRegisterFile> RegisterFiles;		std::vector<CodeGenRegisterFile> RegisterFiles;

// Optional Retire Control Unit definition.		// Optional Retire Control Unit definition.
Record *RetireControlUnit;		Record *RetireControlUnit;

// List of PfmCounters.
RecVec PfmIssueCounterDefs;
Record *PfmCycleCounterDef = nullptr;
Record *PfmUopsCounterDef = nullptr;

CodeGenProcModel(unsigned Idx, std::string Name, Record *MDef,		CodeGenProcModel(unsigned Idx, std::string Name, Record *MDef,
Record *IDef) :		Record *IDef) :
Index(Idx), ModelName(std::move(Name)), ModelDef(MDef), ItinsDef(IDef),		Index(Idx), ModelName(std::move(Name)), ModelDef(MDef), ItinsDef(IDef),
RetireControlUnit(nullptr) {}		RetireControlUnit(nullptr) {}

bool hasItineraries() const {		bool hasItineraries() const {
return !ItinsDef->getValueAsListOfDefs("IID").empty();		return !ItinsDef->getValueAsListOfDefs("IID").empty();
}		}

bool hasInstrSchedModel() const {		bool hasInstrSchedModel() const {
return !WriteResDefs.empty() \|\| !ItinRWDefs.empty();		return !WriteResDefs.empty() \|\| !ItinRWDefs.empty();
}		}

bool hasExtraProcessorInfo() const {		bool hasExtraProcessorInfo() const {
return RetireControlUnit \|\| !RegisterFiles.empty() \|\|		return RetireControlUnit \|\| !RegisterFiles.empty();
!PfmIssueCounterDefs.empty() \|\|
PfmCycleCounterDef != nullptr \|\|
PfmUopsCounterDef != nullptr;
}		}

unsigned getProcResourceIdx(Record *PRDef) const;		unsigned getProcResourceIdx(Record *PRDef) const;

bool isUnsupported(const CodeGenInstruction &Inst) const;		bool isUnsupported(const CodeGenInstruction &Inst) const;

#ifndef NDEBUG		#ifndef NDEBUG
void dump() const;		void dump() const;
▲ Show 20 Lines • Show All 308 Lines • ▼ Show 20 Lines	private:
unsigned findRWForSequence(ArrayRef<unsigned> Seq, bool IsRead);		unsigned findRWForSequence(ArrayRef<unsigned> Seq, bool IsRead);

void collectSchedClasses();		void collectSchedClasses();

void collectRetireControlUnits();		void collectRetireControlUnits();

void collectRegisterFiles();		void collectRegisterFiles();

void collectPfmCounters();

void collectOptionalProcessorInfo();		void collectOptionalProcessorInfo();

std::string createSchedClassName(Record *ItinClassDef,		std::string createSchedClassName(Record *ItinClassDef,
ArrayRef<unsigned> OperWrites,		ArrayRef<unsigned> OperWrites,
ArrayRef<unsigned> OperReads);		ArrayRef<unsigned> OperReads);
std::string createSchedClassName(const RecVec &InstDefs);		std::string createSchedClassName(const RecVec &InstDefs);
void createInstRWClass(Record *InstRWDef);		void createInstRWClass(Record *InstRWDef);

▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

utils/TableGen/CodeGenSchedule.cpp

Show First 20 Lines • Show All 344 Lines • ▼ Show 20 Lines	for (const Record *EC : Classes) {
const Record *Pred = EC->getValueAsDef("Predicate");		const Record *Pred = EC->getValueAsDef("Predicate");
APInt PredMask(NumUniquePredicates, 0);		APInt PredMask(NumUniquePredicates, 0);
PredMask.setBit(Predicate2Index[Pred]);		PredMask.setBit(Predicate2Index[Pred]);

for (const Record *Opcode : Opcodes) {		for (const Record *Opcode : Opcodes) {
unsigned OpcodeIdx = Opcode2Index[Opcode];		unsigned OpcodeIdx = Opcode2Index[Opcode];
if (OpcodeMasks[OpcodeIdx].first[ProcIndex]) {		if (OpcodeMasks[OpcodeIdx].first[ProcIndex]) {
std::string Message =		std::string Message =
"Opcode " + Opcode->getName().str() +		"Opcode " + Opcode->getName().str() +
" used by multiple InstructionEquivalenceClass definitions.";		" used by multiple InstructionEquivalenceClass definitions.";
PrintFatalError(EC->getLoc(), Message);		PrintFatalError(EC->getLoc(), Message);
}		}
OpcodeMasks[OpcodeIdx].first \|= ProcMask;		OpcodeMasks[OpcodeIdx].first \|= ProcMask;
OpcodeMasks[OpcodeIdx].second \|= PredMask;		OpcodeMasks[OpcodeIdx].second \|= PredMask;
OpcodeInfo &OI = OpcodeMappings[OpcodeIdx].second;		OpcodeInfo &OI = OpcodeMappings[OpcodeIdx].second;

OI.addPredicateForProcModel(ProcMask, OperandMask, Pred);		OI.addPredicateForProcModel(ProcMask, OperandMask, Pred);
▲ Show 20 Lines • Show All 120 Lines • ▼ Show 20 Lines
/// Collect optional processor information.		/// Collect optional processor information.
void CodeGenSchedModels::collectOptionalProcessorInfo() {		void CodeGenSchedModels::collectOptionalProcessorInfo() {
// Find register file definitions for each processor.		// Find register file definitions for each processor.
collectRegisterFiles();		collectRegisterFiles();

// Collect processor RetireControlUnit descriptors if available.		// Collect processor RetireControlUnit descriptors if available.
collectRetireControlUnits();		collectRetireControlUnits();

// Find pfm counter definitions for each processor.
collectPfmCounters();

checkCompleteness();		checkCompleteness();
}		}

/// Gather all processor models.		/// Gather all processor models.
void CodeGenSchedModels::collectProcModels() {		void CodeGenSchedModels::collectProcModels() {
RecVec ProcRecords = Records.getAllDerivedDefinitions("Processor");		RecVec ProcRecords = Records.getAllDerivedDefinitions("Processor");
llvm::sort(ProcRecords, LessRecordFieldName());		llvm::sort(ProcRecords, LessRecordFieldName());

▲ Show 20 Lines • Show All 277 Lines • ▼ Show 20 Lines	if (Rec->getValueInit("SchedModel")->isComplete()) {
Record *ModelDef = Rec->getValueAsDef("SchedModel");		Record *ModelDef = Rec->getValueAsDef("SchedModel");
if (&getProcModel(ModelDef) != &ProcModel)		if (&getProcModel(ModelDef) != &ProcModel)
continue;		continue;
}		}
if (AliasDef)		if (AliasDef)
PrintFatalError(AliasRW.TheDef->getLoc(), "Multiple aliases "		PrintFatalError(AliasRW.TheDef->getLoc(), "Multiple aliases "
"defined for processor " + ProcModel.ModelName +		"defined for processor " + ProcModel.ModelName +
" Ensure only one SchedAlias exists per RW.");		" Ensure only one SchedAlias exists per RW.");
AliasDef = AliasRW.TheDef;		AliasDef = AliasRW.TheDef;
		gchateletUnsubmitted Not Done Reply Inline Actions join string literals and reapply formatting. gchatelet: join string literals and reapply formatting.
}		}
if (AliasDef) {		if (AliasDef) {
expandRWSeqForProc(getSchedRWIdx(AliasDef, IsRead),		expandRWSeqForProc(getSchedRWIdx(AliasDef, IsRead),
RWSeq, IsRead,ProcModel);		RWSeq, IsRead,ProcModel);
return;		return;
}		}
if (!SchedWrite.IsSequence) {		if (!SchedWrite.IsSequence) {
RWSeq.push_back(RWIdx);		RWSeq.push_back(RWIdx);
▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	if (!SCIdx) {
if (!Inst->hasNoSchedulingInfo)		if (!Inst->hasNoSchedulingInfo)
dbgs() << "No machine model for " << Inst->TheDef->getName() << '\n';		dbgs() << "No machine model for " << Inst->TheDef->getName() << '\n';
});		});
continue;		continue;
}		}
CodeGenSchedClass &SC = getSchedClass(SCIdx);		CodeGenSchedClass &SC = getSchedClass(SCIdx);
if (SC.ProcIndices[0] != 0)		if (SC.ProcIndices[0] != 0)
PrintFatalError(Inst->TheDef->getLoc(), "Instruction's sched class "		PrintFatalError(Inst->TheDef->getLoc(), "Instruction's sched class "
"must not be subtarget specific.");		"must not be subtarget specific.");
		gchateletUnsubmitted Not Done Reply Inline Actions Same here gchatelet: Same here

IdxVec ProcIndices;		IdxVec ProcIndices;
if (SC.ItinClassDef->getName() != "NoItinerary") {		if (SC.ItinClassDef->getName() != "NoItinerary") {
ProcIndices.push_back(0);		ProcIndices.push_back(0);
dbgs() << "Itinerary for " << InstName << ": "		dbgs() << "Itinerary for " << InstName << ": "
<< SC.ItinClassDef->getName() << '\n';		<< SC.ItinClassDef->getName() << '\n';
}		}
if (!SC.Writes.empty()) {		if (!SC.Writes.empty()) {
▲ Show 20 Lines • Show All 301 Lines • ▼ Show 20 Lines	for (unsigned Idx = 0; Idx != SchedClasses.size(); ++Idx) {
if (SchedClasses[Idx].ItinClassDef)		if (SchedClasses[Idx].ItinClassDef)
inferFromItinClass(SchedClasses[Idx].ItinClassDef, Idx);		inferFromItinClass(SchedClasses[Idx].ItinClassDef, Idx);
if (!SchedClasses[Idx].InstRWs.empty())		if (!SchedClasses[Idx].InstRWs.empty())
inferFromInstRWs(Idx);		inferFromInstRWs(Idx);
if (!SchedClasses[Idx].Writes.empty()) {		if (!SchedClasses[Idx].Writes.empty()) {
inferFromRW(SchedClasses[Idx].Writes, SchedClasses[Idx].Reads,		inferFromRW(SchedClasses[Idx].Writes, SchedClasses[Idx].Reads,
Idx, SchedClasses[Idx].ProcIndices);		Idx, SchedClasses[Idx].ProcIndices);
}		}
assert(SchedClasses.size() < (NumInstrSchedClasses*6) &&		assert(SchedClasses.size() < (NumInstrSchedClasses*6) &&
		gchateletUnsubmitted Not Done Reply Inline Actions Can you explain what the 6 is referring to? gchatelet: Can you explain what the 6 is referring to?
"too many SchedVariants");		"too many SchedVariants");
}		}
}		}

/// Infer classes from per-processor itinerary resources.		/// Infer classes from per-processor itinerary resources.
void CodeGenSchedModels::inferFromItinClass(Record *ItinClassDef,		void CodeGenSchedModels::inferFromItinClass(Record *ItinClassDef,
unsigned FromClassIdx) {		unsigned FromClassIdx) {
for (unsigned PIdx = 0, PEnd = ProcModels.size(); PIdx != PEnd; ++PIdx) {		for (unsigned PIdx = 0, PEnd = ProcModels.size(); PIdx != PEnd; ++PIdx) {
▲ Show 20 Lines • Show All 211 Lines • ▼ Show 20 Lines	if (AliasProcIdx == 0)
GenericRW = true;		GenericRW = true;
}		}
for (TransVariant &Variant : Variants) {		for (TransVariant &Variant : Variants) {
// Don't expand variants if the processor models don't intersect.		// Don't expand variants if the processor models don't intersect.
// A zero processor index means any processor.		// A zero processor index means any processor.
SmallVectorImpl<unsigned> &ProcIndices = TransVec[TransIdx].ProcIndices;		SmallVectorImpl<unsigned> &ProcIndices = TransVec[TransIdx].ProcIndices;
if (ProcIndices[0] && Variant.ProcIdx) {		if (ProcIndices[0] && Variant.ProcIdx) {
unsigned Cnt = std::count(ProcIndices.begin(), ProcIndices.end(),		unsigned Cnt = std::count(ProcIndices.begin(), ProcIndices.end(),
Variant.ProcIdx);		Variant.ProcIdx);
		gchateletUnsubmitted Not Done Reply Inline Actions const gchatelet: const
if (!Cnt)		if (!Cnt)
continue;		continue;
if (Cnt > 1) {		if (Cnt > 1) {
const CodeGenProcModel &PM =		const CodeGenProcModel &PM =
*(SchedModels.procModelBegin() + Variant.ProcIdx);		*(SchedModels.procModelBegin() + Variant.ProcIdx);
PrintFatalError(Variant.VarOrSeqDef->getLoc(),		PrintFatalError(Variant.VarOrSeqDef->getLoc(),
"Multiple variants defined for processor " +		"Multiple variants defined for processor " +
PM.ModelName +		PM.ModelName +
Show All 14 Lines	else {
// Push another copy of the current transition for more variants.		// Push another copy of the current transition for more variants.
Variant.TransVecIdx = TransVec.size();		Variant.TransVecIdx = TransVec.size();
IntersectingVariants.push_back(Variant);		IntersectingVariants.push_back(Variant);
TransVec.push_back(TransVec[TransIdx]);		TransVec.push_back(TransVec[TransIdx]);
}		}
}		}
if (GenericRW && IntersectingVariants.empty()) {		if (GenericRW && IntersectingVariants.empty()) {
PrintFatalError(SchedRW.TheDef->getLoc(), "No variant of this type has "		PrintFatalError(SchedRW.TheDef->getLoc(), "No variant of this type has "
"a matching predicate on any processor");		"a matching predicate on any processor");
		gchateletUnsubmitted Not Done Reply Inline Actions join string literals gchatelet: join string literals
}		}
}		}

// Push the Reads/Writes selected by this variant onto the PredTransition		// Push the Reads/Writes selected by this variant onto the PredTransition
// specified by VInfo.		// specified by VInfo.
void PredTransitions::		void PredTransitions::
pushVariant(const TransVariant &VInfo, bool IsRead) {		pushVariant(const TransVariant &VInfo, bool IsRead) {
PredTransition &Trans = TransVec[VInfo.TransVecIdx];		PredTransition &Trans = TransVec[VInfo.TransVecIdx];
▲ Show 20 Lines • Show All 309 Lines • ▼ Show 20 Lines	for (unsigned I = 0, E = RegisterClasses.size(); I < E; ++I) {
AllowMoveElim = Val->getValue();		AllowMoveElim = Val->getValue();
}		}

CGRF.Costs.emplace_back(RegisterClasses[I], Cost, AllowMoveElim);		CGRF.Costs.emplace_back(RegisterClasses[I], Cost, AllowMoveElim);
}		}
}		}
}		}

// Collect all the RegisterFile definitions available in this target.
void CodeGenSchedModels::collectPfmCounters() {
for (Record *Def : Records.getAllDerivedDefinitions("PfmIssueCounter")) {
CodeGenProcModel &PM = getProcModel(Def->getValueAsDef("SchedModel"));
PM.PfmIssueCounterDefs.emplace_back(Def);
}
for (Record *Def : Records.getAllDerivedDefinitions("PfmCycleCounter")) {
CodeGenProcModel &PM = getProcModel(Def->getValueAsDef("SchedModel"));
if (PM.PfmCycleCounterDef) {
PrintFatalError(Def->getLoc(),
"multiple cycle counters for " +
Def->getValueAsDef("SchedModel")->getName());
}
PM.PfmCycleCounterDef = Def;
}
for (Record *Def : Records.getAllDerivedDefinitions("PfmUopsCounter")) {
CodeGenProcModel &PM = getProcModel(Def->getValueAsDef("SchedModel"));
if (PM.PfmUopsCounterDef) {
PrintFatalError(Def->getLoc(),
"multiple uops counters for " +
Def->getValueAsDef("SchedModel")->getName());
}
PM.PfmUopsCounterDef = Def;
}
}

// Collect and sort WriteRes, ReadAdvance, and ProcResources.		// Collect and sort WriteRes, ReadAdvance, and ProcResources.
void CodeGenSchedModels::collectProcResources() {		void CodeGenSchedModels::collectProcResources() {
ProcResourceDefs = Records.getAllDerivedDefinitions("ProcResourceUnits");		ProcResourceDefs = Records.getAllDerivedDefinitions("ProcResourceUnits");
ProcResGroups = Records.getAllDerivedDefinitions("ProcResGroup");		ProcResGroups = Records.getAllDerivedDefinitions("ProcResGroup");

// Add any subtarget-specific SchedReadWrites that are directly associated		// Add any subtarget-specific SchedReadWrites that are directly associated
// with processor resources. Refer to the parent SchedClass's ProcIndices to		// with processor resources. Refer to the parent SchedClass's ProcIndices to
// determine which processors they apply to.		// determine which processors they apply to.
▲ Show 20 Lines • Show All 137 Lines • ▼ Show 20 Lines	if (!Complete) {
errs() << "\n\nIncomplete schedule models found.\n"		errs() << "\n\nIncomplete schedule models found.\n"
<< "- Consider setting 'CompleteModel = 0' while developing new models.\n"		<< "- Consider setting 'CompleteModel = 0' while developing new models.\n"
<< "- Pseudo instructions can be marked with 'hasNoSchedulingInfo = 1'.\n"		<< "- Pseudo instructions can be marked with 'hasNoSchedulingInfo = 1'.\n"
<< "- Instructions should usually have Sched<[...]> as a superclass, "		<< "- Instructions should usually have Sched<[...]> as a superclass, "
"you may temporarily use an empty list.\n"		"you may temporarily use an empty list.\n"
<< "- Instructions related to unsupported features can be excluded with "		<< "- Instructions related to unsupported features can be excluded with "
"list<Predicate> UnsupportedFeatures = [HasA,..,HasY]; in the "		"list<Predicate> UnsupportedFeatures = [HasA,..,HasY]; in the "
"processor model.\n\n";		"processor model.\n\n";
PrintFatalError("Incomplete schedule model");		PrintFatalError("Incomplete schedule model");
		gchateletUnsubmitted Not Done Reply Inline Actions Can you reformat gchatelet: Can you reformat
}		}
}		}

// Collect itinerary class resources for each processor.		// Collect itinerary class resources for each processor.
void CodeGenSchedModels::collectItinProcResources(Record *ItinClassDef) {		void CodeGenSchedModels::collectItinProcResources(Record *ItinClassDef) {
for (unsigned PIdx = 0, PEnd = ProcModels.size(); PIdx != PEnd; ++PIdx) {		for (unsigned PIdx = 0, PEnd = ProcModels.size(); PIdx != PEnd; ++PIdx) {
const CodeGenProcModel &PM = ProcModels[PIdx];		const CodeGenProcModel &PM = ProcModels[PIdx];
// For all ItinRW entries.		// For all ItinRW entries.
▲ Show 20 Lines • Show All 244 Lines • Show Last 20 Lines

utils/TableGen/CodeGenTarget.cpp

Show First 20 Lines • Show All 705 Lines • ▼ Show 20 Lines	CodeGenIntrinsic::CodeGenIntrinsic(Record *R) {
}		}

// Also record the SDPatternOperator Properties.		// Also record the SDPatternOperator Properties.
Properties = parseSDPatternOperatorProperties(R);		Properties = parseSDPatternOperatorProperties(R);

// Sort the argument attributes for later benefit.		// Sort the argument attributes for later benefit.
llvm::sort(ArgumentAttributes);		llvm::sort(ArgumentAttributes);
}		}

utils/TableGen/ExegesisEmitter.cpp

This file was added.

				//===- ExegesisEmitter.cpp - Generate exegesis target data ----------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// This tablegen backend emits llvm-exegesis information.
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/ADT/SmallSet.h"
				#include "llvm/ADT/StringRef.h"
				#include "llvm/Support/Debug.h"
				#include "llvm/Support/Format.h"
				#include "llvm/Support/raw_ostream.h"
				#include "llvm/TableGen/Error.h"
				#include "llvm/TableGen/Record.h"
				#include "llvm/TableGen/TableGenBackend.h"
				#include <algorithm>
				#include <cassert>
				#include <cstdint>
				#include <map>
				#include <string>
				#include <vector>

				using namespace llvm;

				#define DEBUG_TYPE "exegesis-emitter"

				namespace {

				class ExegesisEmitter {
				public:
				ExegesisEmitter(RecordKeeper &RK);

				void run(raw_ostream &OS) const;

				private:
				unsigned getPfmCounterId(llvm::StringRef Name) const {
				const auto It = PfmCounterNameTable.find(Name);
				if (It == PfmCounterNameTable.end())
				PrintFatalError("no pfm counter id for " + Name);
				return It->second;
				}

				// Collects all the ProcPfmCounters definitions available in this target.
				void emitPfmCounters(raw_ostream &OS) const;

				void emitPfmCountersInfo(const Record &Def,
				unsigned &IssueCountersTableOffset,
				raw_ostream &OS) const;

				void emitPfmCountersLookupTable(raw_ostream &OS) const;

				RecordKeeper &Records;
				std::string Target;

				// Table of counter name -> counter index.
				const std::map<llvm::StringRef, unsigned> PfmCounterNameTable;
				};

				static std::map<llvm::StringRef, unsigned>
				collectPfmCounters(const RecordKeeper &Records) {
				std::map<llvm::StringRef, unsigned> PfmCounterNameTable;
				const auto AddPfmCounterName = [&PfmCounterNameTable](
				const Record *PfmCounterDef) {
				const llvm::StringRef Counter = PfmCounterDef->getValueAsString("Counter");
				if (!Counter.empty())
				PfmCounterNameTable.emplace(Counter, 0);
				};
				for (Record *Def : Records.getAllDerivedDefinitions("ProcPfmCounters")) {
				// Check that ResourceNames are unique.
				llvm::SmallSet<llvm::StringRef, 16> Seen;
				for (const Record *IssueCounter :
				Def->getValueAsListOfDefs("IssueCounters")) {
				const llvm::StringRef ResourceName =
				IssueCounter->getValueAsString("ResourceName");
				if (ResourceName.empty())
				PrintFatalError(IssueCounter->getLoc(), "invalid empty ResourceName");
				if (!Seen.insert(ResourceName).second)
				PrintFatalError(IssueCounter->getLoc(),
				"duplicate ResourceName " + ResourceName);
				AddPfmCounterName(IssueCounter);
				}
				AddPfmCounterName(Def->getValueAsDef("CycleCounter"));
				AddPfmCounterName(Def->getValueAsDef("UopsCounter"));
				}
				unsigned Index = 0;
				for (auto &NameAndIndex : PfmCounterNameTable)
				NameAndIndex.second = Index++;
				return PfmCounterNameTable;
				}

				ExegesisEmitter::ExegesisEmitter(RecordKeeper &RK)
				: Records(RK), PfmCounterNameTable(collectPfmCounters(RK)) {
				std::vector<Record *> Targets = Records.getAllDerivedDefinitions("Target");
				if (Targets.size() == 0)
				PrintFatalError("ERROR: No 'Target' subclasses defined!");
				if (Targets.size() != 1)
				PrintFatalError("ERROR: Multiple subclasses of Target defined!");
				Target = Targets[0]->getName();
				}

				void ExegesisEmitter::emitPfmCountersInfo(const Record &Def,
				unsigned &IssueCountersTableOffset,
				raw_ostream &OS) const {
				const auto CycleCounter =
				Def.getValueAsDef("CycleCounter")->getValueAsString("Counter");
				const auto UopsCounter =
				Def.getValueAsDef("UopsCounter")->getValueAsString("Counter");
				const size_t NumIssueCounters =
				Def.getValueAsListOfDefs("IssueCounters").size();

				// This is the default, do not emit.
				if (CycleCounter.empty() && UopsCounter.empty() && NumIssueCounters == 0)
				return;

				OS << "\nstatic const PfmCountersInfo " << Target << Def.getName()
				<< " = {\n";

				// Cycle Counter.
				if (CycleCounter.empty())
				OS << " nullptr, // No cycle counter.\n";
				else
				OS << " " << Target << "PfmCounterNames[" << getPfmCounterId(CycleCounter)
				<< "], // Cycle counter\n";

				// Uops Counter.
				if (UopsCounter.empty())
				OS << " nullptr, // No uops counter.\n";
				else
				OS << " " << Target << "PfmCounterNames[" << getPfmCounterId(UopsCounter)
				<< "], // Uops counter\n";

				// Issue Counters
				if (NumIssueCounters == 0)
				OS << " nullptr, // No issue counters.\n 0\n";
				else
				OS << " " << Target << "PfmIssueCounters + " << IssueCountersTableOffset
				<< ", " << NumIssueCounters << " // Issue counters.\n";

				OS << "};\n";
				IssueCountersTableOffset += NumIssueCounters;
				}

				void ExegesisEmitter::emitPfmCounters(raw_ostream &OS) const {
				// Emit the counter name table.
				OS << "\nstatic const char* " << Target << "PfmCounterNames[] = {\n";
				for (const auto &NameAndIndex : PfmCounterNameTable)
				OS << " \"" << NameAndIndex.first << "\", // " << NameAndIndex.second
				<< "\n";
				OS << "};\n\n";

				// Emit the IssueCounters table.
				const auto PfmCounterDefs =
				Records.getAllDerivedDefinitions("ProcPfmCounters");
				OS << "static const PfmCountersInfo::IssueCounter " << Target
				<< "PfmIssueCounters[] = {\n";
				for (const Record *Def : PfmCounterDefs) {
				for (const Record *ICDef : Def->getValueAsListOfDefs("IssueCounters"))
				OS << " { " << Target << "PfmCounterNames["
				<< getPfmCounterId(ICDef->getValueAsString("Counter")) << "], \""
				<< ICDef->getValueAsString("ResourceName") << "\"},\n";
				}

				OS << "};\n";

				// Now generate the PfmCountersInfo.
				unsigned IssueCountersTableOffset = 0;
				for (const Record *Def : PfmCounterDefs)
				emitPfmCountersInfo(*Def, IssueCountersTableOffset, OS);

				OS << "\n";
				}

				void ExegesisEmitter::emitPfmCountersLookupTable(raw_ostream &OS) const {
				std::vector<Record *> Bindings =
				Records.getAllDerivedDefinitions("PfmCountersBinding");
				llvm::sort(Bindings, [](const Record L, const Record R) {
				return L->getValueAsString("CpuName") < R->getValueAsString("CpuName");
				});

				OS << "// Sorted (by CpuName) array of pfm counters.\n"
				<< "static const CpuAndPfmCounters " << Target << "CpuPfmCounters[] = {\n";
				for (Record *Binding : Bindings) {
				// Emit as { "cpu", procinit },
				OS << " { \"" //
				<< Binding->getValueAsString("CpuName") << "\"," //
				<< " &" << Target << Binding->getValueAsDef("Counters")->getName() //
				<< " },\n";
				}
				OS << "};\n\n";
				}

				void ExegesisEmitter::run(raw_ostream &OS) const {
				emitSourceFileHeader("Exegesis Tables", OS);
				emitPfmCounters(OS);
				emitPfmCountersLookupTable(OS);
				}

				} // end anonymous namespace

				namespace llvm {

				void EmitExegesis(RecordKeeper &RK, raw_ostream &OS) {
				ExegesisEmitter(RK).run(OS);
				}

				} // end namespace llvm

utils/TableGen/SubtargetEmitter.cpp

Show First 20 Lines • Show All 691 Lines • ▼ Show 20 Lines	OS << NumCostEntries << ", " << CostTblIndex << ", "
<< RD.AllowZeroMoveEliminationOnly << "},\n";		<< RD.AllowZeroMoveEliminationOnly << "},\n";
CostTblIndex += NumCostEntries;		CostTblIndex += NumCostEntries;
}		}
OS << "};\n";		OS << "};\n";

return CostTblIndex;		return CostTblIndex;
}		}

static bool EmitPfmIssueCountersTable(const CodeGenProcModel &ProcModel,
raw_ostream &OS) {
unsigned NumCounterDefs = 1 + ProcModel.ProcResourceDefs.size();
std::vector<const Record *> CounterDefs(NumCounterDefs);
bool HasCounters = false;
for (const Record *CounterDef : ProcModel.PfmIssueCounterDefs) {
const Record *&CD = CounterDefs[ProcModel.getProcResourceIdx(
CounterDef->getValueAsDef("Resource"))];
if (CD) {
PrintFatalError(CounterDef->getLoc(),
"multiple issue counters for " +
CounterDef->getValueAsDef("Resource")->getName());
}
CD = CounterDef;
HasCounters = true;
}
if (!HasCounters) {
return false;
}
OS << "\nstatic const char* " << ProcModel.ModelName
<< "PfmIssueCounters[] = {\n";
for (unsigned i = 0; i != NumCounterDefs; ++i) {
const Record *CounterDef = CounterDefs[i];
if (CounterDef) {
const auto PfmCounters = CounterDef->getValueAsListOfStrings("Counters");
if (PfmCounters.empty())
PrintFatalError(CounterDef->getLoc(), "empty counter list");
OS << " \"" << PfmCounters[0];
for (unsigned p = 1, e = PfmCounters.size(); p != e; ++p)
OS << ",\" \"" << PfmCounters[p];
OS << "\", // #" << i << " = ";
OS << CounterDef->getValueAsDef("Resource")->getName() << "\n";
} else {
OS << " nullptr, // #" << i << "\n";
}
}
OS << "};\n";
return true;
}

static void EmitPfmCounters(const CodeGenProcModel &ProcModel,
const bool HasPfmIssueCounters, raw_ostream &OS) {
OS << " {\n";
// Emit the cycle counter.
if (ProcModel.PfmCycleCounterDef)
OS << " \"" << ProcModel.PfmCycleCounterDef->getValueAsString("Counter")
<< "\", // Cycle counter.\n";
else
OS << " nullptr, // No cycle counter.\n";

// Emit the uops counter.
if (ProcModel.PfmUopsCounterDef)
OS << " \"" << ProcModel.PfmUopsCounterDef->getValueAsString("Counter")
<< "\", // Uops counter.\n";
else
OS << " nullptr, // No uops counter.\n";

// Emit a reference to issue counters table.
if (HasPfmIssueCounters)
OS << " " << ProcModel.ModelName << "PfmIssueCounters\n";
else
OS << " nullptr // No issue counters.\n";
OS << " }\n";
}

void SubtargetEmitter::EmitExtraProcessorInfo(const CodeGenProcModel &ProcModel,		void SubtargetEmitter::EmitExtraProcessorInfo(const CodeGenProcModel &ProcModel,
raw_ostream &OS) {		raw_ostream &OS) {
// Generate a table of register file descriptors (one entry per each user		// Generate a table of register file descriptors (one entry per each user
// defined register file), and a table of register costs.		// defined register file), and a table of register costs.
unsigned NumCostEntries = EmitRegisterFileTables(ProcModel, OS);		unsigned NumCostEntries = EmitRegisterFileTables(ProcModel, OS);

// Generate a table of ProcRes counter names.
const bool HasPfmIssueCounters = EmitPfmIssueCountersTable(ProcModel, OS);

// Now generate a table for the extra processor info.		// Now generate a table for the extra processor info.
OS << "\nstatic const llvm::MCExtraProcessorInfo " << ProcModel.ModelName		OS << "\nstatic const llvm::MCExtraProcessorInfo " << ProcModel.ModelName
<< "ExtraInfo = {\n ";		<< "ExtraInfo = {\n ";

// Add information related to the retire control unit.		// Add information related to the retire control unit.
EmitRetireControlUnitInfo(ProcModel, OS);		EmitRetireControlUnitInfo(ProcModel, OS);

// Add information related to the register files (i.e. where to find register		// Add information related to the register files (i.e. where to find register
// file descriptors and register costs).		// file descriptors and register costs).
EmitRegisterFileInfo(ProcModel, ProcModel.RegisterFiles.size(),		EmitRegisterFileInfo(ProcModel, ProcModel.RegisterFiles.size(),
NumCostEntries, OS);		NumCostEntries, OS);

EmitPfmCounters(ProcModel, HasPfmIssueCounters, OS);

OS << "};\n";		OS << "};\n";
}		}

void SubtargetEmitter::EmitProcessorResources(const CodeGenProcModel &ProcModel,		void SubtargetEmitter::EmitProcessorResources(const CodeGenProcModel &ProcModel,
raw_ostream &OS) {		raw_ostream &OS) {
EmitProcessorResourceSubUnits(ProcModel, OS);		EmitProcessorResourceSubUnits(ProcModel, OS);

OS << "\n// {Name, NumUnits, SuperIdx, BufferSize, SubUnitsIdxBegin}\n";		OS << "\n// {Name, NumUnits, SuperIdx, BufferSize, SubUnitsIdxBegin}\n";
▲ Show 20 Lines • Show All 609 Lines • ▼ Show 20 Lines	if (PM.hasExtraProcessorInfo())
OS << " &" << PM.ModelName << "ExtraInfo,\n";		OS << " &" << PM.ModelName << "ExtraInfo,\n";
else		else
OS << " nullptr // No extra processor descriptor\n";		OS << " nullptr // No extra processor descriptor\n";
OS << "};\n";		OS << "};\n";
}		}
}		}

//		//
// EmitProcessorLookup - generate cpu name to itinerary lookup table.		// EmitProcessorLookup - generate cpu name to sched model lookup tables.
//		//
void SubtargetEmitter::EmitProcessorLookup(raw_ostream &OS) {		void SubtargetEmitter::EmitProcessorLookup(raw_ostream &OS) {
// Gather and sort processor information		// Gather and sort processor information
std::vector<Record*> ProcessorList =		std::vector<Record*> ProcessorList =
Records.getAllDerivedDefinitions("Processor");		Records.getAllDerivedDefinitions("Processor");
llvm::sort(ProcessorList, LessRecordFieldName());		llvm::sort(ProcessorList, LessRecordFieldName());

// Begin processor table		// Begin processor->sched model table
OS << "\n";		OS << "\n";
OS << "// Sorted (by key) array of itineraries for CPU subtype.\n"		OS << "// Sorted (by key) array of sched model for CPU subtype.\n"
<< "extern const llvm::SubtargetInfoKV "		<< "extern const llvm::SubtargetInfoKV " << Target
<< Target << "ProcSchedKV[] = {\n";		<< "ProcSchedKV[] = {\n";

// For each processor		// For each processor
for (Record *Processor : ProcessorList) {		for (Record *Processor : ProcessorList) {
StringRef Name = Processor->getValueAsString("Name");		StringRef Name = Processor->getValueAsString("Name");
const std::string &ProcModelName =		const std::string &ProcModelName =
SchedModels.getModelForProc(Processor).ModelName;		SchedModels.getModelForProc(Processor).ModelName;

// Emit as { "cpu", procinit },		// Emit as { "cpu", procinit },
OS << " { \"" << Name << "\", (const void *)&" << ProcModelName << " },\n";		OS << " { \"" << Name << "\", (const void *)&" << ProcModelName << " },\n";
}		}
		// End processor->sched model table
// End processor table
OS << "};\n";		OS << "};\n";
}		}

//		//
// EmitSchedModel - Emits all scheduling model tables, folding common patterns.		// EmitSchedModel - Emits all scheduling model tables, folding common patterns.
//		//
void SubtargetEmitter::EmitSchedModel(raw_ostream &OS) {		void SubtargetEmitter::EmitSchedModel(raw_ostream &OS) {
OS << "#ifdef DBGFIELD\n"		OS << "#ifdef DBGFIELD\n"
▲ Show 20 Lines • Show All 224 Lines • ▼ Show 20 Lines	OS << "unsigned " << ClassName
<< "\n::resolveSchedClass(unsigned SchedClass, const MachineInstr *MI,"		<< "\n::resolveSchedClass(unsigned SchedClass, const MachineInstr *MI,"
<< " const TargetSchedModel *SchedModel) const {\n";		<< " const TargetSchedModel *SchedModel) const {\n";

// Emit the predicate prolog code.		// Emit the predicate prolog code.
emitPredicateProlog(Records, OS);		emitPredicateProlog(Records, OS);

// Emit target predicates.		// Emit target predicates.
emitSchedModelHelpersImpl(OS);		emitSchedModelHelpersImpl(OS);

OS << "} // " << ClassName << "::resolveSchedClass\n\n";		OS << "} // " << ClassName << "::resolveSchedClass\n\n";

OS << "unsigned " << ClassName		OS << "unsigned " << ClassName
<< "\n::resolveVariantSchedClass(unsigned SchedClass, const MCInst *MI,"		<< "\n::resolveVariantSchedClass(unsigned SchedClass, const MCInst *MI,"
<< " unsigned CPUID) const {\n"		<< " unsigned CPUID) const {\n"
<< " return " << Target << "_MC"		<< " return " << Target << "_MC"
<< "::resolveVariantSchedClassImpl(SchedClass, MI, CPUID);\n"		<< "::resolveVariantSchedClassImpl(SchedClass, MI, CPUID);\n"
<< "} // " << ClassName << "::resolveVariantSchedClass\n\n";		<< "} // " << ClassName << "::resolveVariantSchedClass\n\n";
▲ Show 20 Lines • Show All 169 Lines • ▼ Show 20 Lines	#endif
if (NumProcs)		if (NumProcs)
OS << Target << "SubTypeKV, ";		OS << Target << "SubTypeKV, ";
else		else
OS << "None, ";		OS << "None, ";
OS << '\n'; OS.indent(22);		OS << '\n'; OS.indent(22);
OS << Target << "ProcSchedKV, "		OS << Target << "ProcSchedKV, "
<< Target << "WriteProcResTable, "		<< Target << "WriteProcResTable, "
<< Target << "WriteLatencyTable, "		<< Target << "WriteLatencyTable, "
<< Target << "ReadAdvanceTable, ";		<< Target << "ReadAdvanceTable, ";
		gchateletUnsubmitted Not Done Reply Inline Actions llvm::format would help here OS << llvm::format("{0}ProcSchedKV, {0}ProcPfmKV, {0}WriteProcResTable, {0}WriteLatencyTable, {0}ReadAdvanceTable, ", Target); gchatelet: llvm::format would help here ``` OS << llvm::format("{0}ProcSchedKV, {0}ProcPfmKV…
		courbetAuthorUnsubmitted Not Done Reply Inline Actions I want to remain consistent with the rest of the file. I've reformatted to make it more readable. courbet: I want to remain consistent with the rest of the file. I've reformatted to make it more…
OS << '\n'; OS.indent(22);		OS << '\n'; OS.indent(22);
if (SchedModels.hasItineraries()) {		if (SchedModels.hasItineraries()) {
OS << Target << "Stages, "		OS << Target << "Stages, "
<< Target << "OperandCycles, "		<< Target << "OperandCycles, "
<< Target << "ForwardingPaths";		<< Target << "ForwardingPaths";
} else		} else
OS << "nullptr, nullptr, nullptr";		OS << "nullptr, nullptr, nullptr";
OS << ");\n}\n\n";		OS << ");\n}\n\n";
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines	#endif
if (NumProcs)		if (NumProcs)
OS << "makeArrayRef(" << Target << "SubTypeKV, " << NumProcs << "), ";		OS << "makeArrayRef(" << Target << "SubTypeKV, " << NumProcs << "), ";
else		else
OS << "None, ";		OS << "None, ";
OS << '\n'; OS.indent(24);		OS << '\n'; OS.indent(24);
OS << Target << "ProcSchedKV, "		OS << Target << "ProcSchedKV, "
<< Target << "WriteProcResTable, "		<< Target << "WriteProcResTable, "
<< Target << "WriteLatencyTable, "		<< Target << "WriteLatencyTable, "
<< Target << "ReadAdvanceTable, ";		<< Target << "ReadAdvanceTable, ";
		gchateletUnsubmitted Done Reply Inline Actions Same here. gchatelet: Same here.
OS << '\n'; OS.indent(24);		OS << '\n'; OS.indent(24);
if (SchedModels.hasItineraries()) {		if (SchedModels.hasItineraries()) {
OS << Target << "Stages, "		OS << Target << "Stages, "
<< Target << "OperandCycles, "		<< Target << "OperandCycles, "
<< Target << "ForwardingPaths";		<< Target << "ForwardingPaths";
} else		} else
OS << "nullptr, nullptr, nullptr";		OS << "nullptr, nullptr, nullptr";
OS << ") {}\n\n";		OS << ") {}\n\n";
Show All 19 Lines

utils/TableGen/TableGen.cpp

Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	enum ActionType {
GenOptParserDefs,		GenOptParserDefs,
GenCTags,		GenCTags,
GenAttributes,		GenAttributes,
GenSearchableTables,		GenSearchableTables,
GenGlobalISel,		GenGlobalISel,
GenX86EVEX2VEXTables,		GenX86EVEX2VEXTables,
GenX86FoldTables,		GenX86FoldTables,
GenRegisterBank,		GenRegisterBank,
		GenExegesis,
};		};

namespace {		namespace {
cl::opt<ActionType>		cl::opt<ActionType>
Action(cl::desc("Action to perform:"),		Action(cl::desc("Action to perform:"),
cl::values(clEnumValN(PrintRecords, "print-records",		cl::values(clEnumValN(PrintRecords, "print-records",
"Print all records to stdout (default)"),		"Print all records to stdout (default)"),
clEnumValN(DumpJSON, "dump-json",		clEnumValN(DumpJSON, "dump-json",
▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	Action(cl::desc("Action to perform:"),
"Generate generic binary-searchable table"),		"Generate generic binary-searchable table"),
clEnumValN(GenGlobalISel, "gen-global-isel",		clEnumValN(GenGlobalISel, "gen-global-isel",
"Generate GlobalISel selector"),		"Generate GlobalISel selector"),
clEnumValN(GenX86EVEX2VEXTables, "gen-x86-EVEX2VEX-tables",		clEnumValN(GenX86EVEX2VEXTables, "gen-x86-EVEX2VEX-tables",
"Generate X86 EVEX to VEX compress tables"),		"Generate X86 EVEX to VEX compress tables"),
clEnumValN(GenX86FoldTables, "gen-x86-fold-tables",		clEnumValN(GenX86FoldTables, "gen-x86-fold-tables",
"Generate X86 fold tables"),		"Generate X86 fold tables"),
clEnumValN(GenRegisterBank, "gen-register-bank",		clEnumValN(GenRegisterBank, "gen-register-bank",
"Generate registers bank descriptions")));		"Generate registers bank descriptions"),
		clEnumValN(GenExegesis, "gen-exegesis",
		"Generate llvm-exegesis tables")));

cl::OptionCategory PrintEnumsCat("Options for -print-enums");		cl::OptionCategory PrintEnumsCat("Options for -print-enums");
cl::opt<std::string>		cl::opt<std::string>
Class("class", cl::desc("Print Enum list for this class"),		Class("class", cl::desc("Print Enum list for this class"),
cl::value_desc("class name"), cl::cat(PrintEnumsCat));		cl::value_desc("class name"), cl::cat(PrintEnumsCat));

bool LLVMTableGenMain(raw_ostream &OS, RecordKeeper &Records) {		bool LLVMTableGenMain(raw_ostream &OS, RecordKeeper &Records) {
switch (Action) {		switch (Action) {
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	case GenRegisterBank:
EmitRegisterBank(Records, OS);		EmitRegisterBank(Records, OS);
break;		break;
case GenX86EVEX2VEXTables:		case GenX86EVEX2VEXTables:
EmitX86EVEX2VEXTables(Records, OS);		EmitX86EVEX2VEXTables(Records, OS);
break;		break;
case GenX86FoldTables:		case GenX86FoldTables:
EmitX86FoldTables(Records, OS);		EmitX86FoldTables(Records, OS);
break;		break;
		case GenExegesis:
		EmitExegesis(Records, OS);
		break;
}		}

return false;		return false;
}		}
}		}

int main(int argc, char **argv) {		int main(int argc, char **argv) {
sys::PrintStackTraceOnErrorSignal(argv[0]);		sys::PrintStackTraceOnErrorSignal(argv[0]);
Show All 16 Lines

utils/TableGen/TableGenBackends.h

	Show First 20 Lines • Show All 83 Lines • ▼ Show 20 Lines
	void EmitOptParser(RecordKeeper &RK, raw_ostream &OS);			void EmitOptParser(RecordKeeper &RK, raw_ostream &OS);
	void EmitCTags(RecordKeeper &RK, raw_ostream &OS);			void EmitCTags(RecordKeeper &RK, raw_ostream &OS);
	void EmitAttributes(RecordKeeper &RK, raw_ostream &OS);			void EmitAttributes(RecordKeeper &RK, raw_ostream &OS);
	void EmitSearchableTables(RecordKeeper &RK, raw_ostream &OS);			void EmitSearchableTables(RecordKeeper &RK, raw_ostream &OS);
	void EmitGlobalISel(RecordKeeper &RK, raw_ostream &OS);			void EmitGlobalISel(RecordKeeper &RK, raw_ostream &OS);
	void EmitX86EVEX2VEXTables(RecordKeeper &RK, raw_ostream &OS);			void EmitX86EVEX2VEXTables(RecordKeeper &RK, raw_ostream &OS);
	void EmitX86FoldTables(RecordKeeper &RK, raw_ostream &OS);			void EmitX86FoldTables(RecordKeeper &RK, raw_ostream &OS);
	void EmitRegisterBank(RecordKeeper &RK, raw_ostream &OS);			void EmitRegisterBank(RecordKeeper &RK, raw_ostream &OS);
				void EmitExegesis(RecordKeeper &RK, raw_ostream &OS);

	} // End llvm namespace			} // End llvm namespace

	#endif			#endif

This is an archive of the discontinued LLVM Phabricator instance.

[MCSched] Bind PFM Counters to the CPUs instead of the SchedModel.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 171040

docs/CommandGuide/llvm-exegesis.rst

docs/CommandGuide/tblgen.rst

include/llvm/MC/MCSchedule.h

include/llvm/Target/Target.td

include/llvm/Target/TargetPfmCounters.td

include/llvm/Target/TargetSchedule.td

lib/Target/X86/CMakeLists.txt

lib/Target/X86/X86PfmCounters.td

tools/llvm-exegesis/lib/AArch64/Target.cpp

tools/llvm-exegesis/lib/Latency.cpp

tools/llvm-exegesis/lib/LlvmState.h

tools/llvm-exegesis/lib/LlvmState.cpp

tools/llvm-exegesis/lib/Target.h

tools/llvm-exegesis/lib/Target.cpp

tools/llvm-exegesis/lib/Uops.cpp

tools/llvm-exegesis/lib/X86/Target.cpp

tools/llvm-exegesis/llvm-exegesis.cpp

utils/TableGen/CMakeLists.txt

utils/TableGen/CodeGenSchedule.h

utils/TableGen/CodeGenSchedule.cpp

utils/TableGen/CodeGenTarget.cpp

utils/TableGen/ExegesisEmitter.cpp

utils/TableGen/SubtargetEmitter.cpp

utils/TableGen/TableGen.cpp

utils/TableGen/TableGenBackends.h

[MCSched] Bind PFM Counters to the CPUs instead of the SchedModel.
ClosedPublic