This is an archive of the discontinued LLVM Phabricator instance.

Differential D52174

[TableGen][SubtargetEmitter] Add the ability for processor models to describe dependency breaking instructions.
Closed, Public

Authored by andreadb on Sep 17 2018, 7:39 AM.

Details

Reviewers

RKSimon
spatel
craig.topper
mattd
courbet
gchatelet
atrick
ab
lebedev.ri

Commits

rG8b6c314be192: [TableGen][SubtargetEmitter] Add the ability for processor models to describe…
rL342555: [TableGen][SubtargetEmitter] Add the ability for processor models to describe…

Summary

This patch adds the ability for processor models to describe dependency breaking instructions.

Different processors may specify a different set of dependency-breaking instructions.
That means, we cannot assume that all processors of the same target would use the same rules to classify dependency breaking instructions.

The main goal of this patch is to provide the means to describe dependency breaking instructions directly via tablegen, and to have the following TargetSubtargetInfo hooks overridden by the tablegen'd XXXGenSubtargetInfo classes (here, XXX is a target name).

virtual bool isZeroIdiom(const MachineInstr *MI, APInt &Mask) const {
  return false;
}

virtual bool isDependencyBreaking(const MachineInstr *MI, APInt &Mask) const {
  return isZeroIdiom(MI, Mask);
}

An instruction MI is a dependency-breaking instruction if a call to method isDependencyBreaking(MI) on the STI (TargetSubtargetInfo object) evaluates to true. Similarly, an instruction MI is a special case of zero-idiom dependency breaking instruction if a call to STI.isZeroIdiom(MI) returns true.
The extra APInt is used for those targets that may want to select which machine operands have their dependency broken (see comments in code).
Note that by default, subtargets don't know about the existence of dependency-breaking. In the absence of external information, those method calls would always return false.
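To make the mask convention concrete, here is a minimal, self-contained sketch of the two hooks and of how a caller might interpret the mask. It does not use the LLVM API: ToyInstr, ToySubtarget, ToyModel, OperandMask (a 64-bit stand-in for APInt) and isOperandIndependent are hypothetical names introduced only for illustration, and opcode 1 is an arbitrary toy zero-idiom.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical stand-in for MachineInstr: Regs[0] is the def, the rest are uses.
struct ToyInstr {
  unsigned Opcode;
  std::vector<unsigned> Regs;
};

// Hypothetical stand-in for APInt: one bit per operand, at most 64 operands.
using OperandMask = std::uint64_t;

struct ToySubtarget {
  virtual ~ToySubtarget() = default;

  // Default: the subtarget knows nothing about zero-idioms.
  virtual bool isZeroIdiom(const ToyInstr &, OperandMask &) const {
    return false;
  }

  // Default: the only known dependency-breaking instructions are zero-idioms.
  virtual bool isDependencyBreaking(const ToyInstr &MI,
                                    OperandMask &Mask) const {
    return isZeroIdiom(MI, Mask);
  }
};

// Toy processor model: opcode 1 is a zero-idiom when both sources match.
struct ToyModel : ToySubtarget {
  bool isZeroIdiom(const ToyInstr &MI, OperandMask &Mask) const override {
    if (MI.Opcode != 1 || MI.Regs.size() != 3)
      return false;
    Mask = 0; // All-zero mask: every explicit input operand is independent.
    return MI.Regs[1] == MI.Regs[2];
  }
};

// Interprets a mask returned by a hook that evaluated to true.
// An all-zero mask breaks the dependency on explicit operands only;
// otherwise an operand is independent only if its bit is set.
bool isOperandIndependent(OperandMask Mask, unsigned OpIdx, bool IsExplicit) {
  if (Mask == 0)
    return IsExplicit;
  return OpIdx < 64 && ((Mask >> OpIdx) & 1) != 0;
}
```

In this toy setup, an instruction with opcode 1 and identical sources is reported as dependency-breaking through the default isDependencyBreaking, which only delegates to isZeroIdiom; a model that also knows about non-zero-idiom cases would override isDependencyBreaking as well.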

A new tablegen class named STIPredicate has been added by this patch to let processor models classify instructions that have properties in common. The idea is that an MCInstPredicate definition can be used to "generate" an instruction equivalence class, such that all instructions of the same class have a property in common.

STIPredicate definitions are essentially a collection of instruction equivalence classes.
Also, different processor models can specify a different variant of the same STIPredicate with different rules (i.e. predicates) to classify instructions. Tablegen backends (in this particular case, the SubtargetEmitter) will be able to process STIPredicate definitions, and automatically generate functions in XXXGenSubtargetInfo.
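As a rough illustration of the data model described above, the sketch below represents an equivalence class as a set of opcodes plus a predicate, and an STIPredicate variant as a list of such classes tied to one processor. This is not the TableGen backend code: ToyInstr, EquivalenceClass and ToySTIPredicate are hypothetical names, and opcodes and processor IDs are plain integers chosen for the example.

```cpp
#include <functional>
#include <set>
#include <vector>

// Hypothetical stand-in for a machine instruction.
struct ToyInstr {
  unsigned Opcode;
  std::vector<unsigned> Regs;
};

// An instruction belongs to the class if and only if its opcode is in the
// set AND the predicate evaluates to true for it.
struct EquivalenceClass {
  std::set<unsigned> Opcodes;
  std::function<bool(const ToyInstr &)> Predicate;

  bool contains(const ToyInstr &MI) const {
    return Opcodes.count(MI.Opcode) != 0 && Predicate(MI);
  }
};

// One variant of a predicate, as defined by a single processor model.
struct ToySTIPredicate {
  unsigned ProcessorID;
  std::vector<EquivalenceClass> Classes;

  // True if MI falls into any class, for the processor that owns this variant.
  bool matches(unsigned CPU, const ToyInstr &MI) const {
    if (CPU != ProcessorID)
      return false;
    for (const EquivalenceClass &EC : Classes)
      if (EC.contains(MI))
        return true;
    return false;
  }
};
```

A second processor model would contribute its own ToySTIPredicate with a different ProcessorID and, possibly, different classes for the same opcodes.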

This patch introduces two special kinds of STIPredicate classes named IsZeroIdiomFunction and IsDepBreakingFunction in tablegen. It also adds a definition for those in the BtVer2 scheduling model only.

The definition of zero-idioms in BtVer2 is quite big. For simplicity, the example below only shows the GPR and AVX zero-idiom variants:

def : IsZeroIdiomFunction<[
  // GPR Zero-idioms.
  DepBreakingClass<[ SUB32rr, SUB64rr, XOR32rr, XOR64rr ], ZeroIdiomPredicate>,

  // AVX Zero-idioms.
  DepBreakingClass<[
    VPXORrr, VPANDNrr, VXORPSrr, VXORPDrr,
    VXORPSYrr, VXORPDYrr, VANDNPSrr, VANDNPDrr,
    VPSUBBrr, VPSUBDrr, VPSUBQrr, VPSUBWrr,
    VPCMPGTBrr, VPCMPGTDrr, VPCMPGTQrr, VPCMPGTWrr
  ], ZeroIdiomPredicate>
]>;

This is what the SubtargetEmitter generates for those variants:

bool X86GenSubtargetInfo::isZeroIdiom(const MachineInstr *MI, APInt &Mask) const {
  unsigned ProcessorID = getSchedModel().getProcessorID();
  switch(MI->getOpcode()) {
  default:
    break;
  case X86::SUB32rr:
  case X86::SUB64rr:
  case X86::XOR32rr:
  case X86::XOR64rr:
  case X86::VPXORrr:
  case X86::VPANDNrr:
  case X86::VXORPSrr:
  case X86::VXORPDrr:
  case X86::VXORPSYrr:
  case X86::VXORPDYrr:
  case X86::VANDNPSrr:
  case X86::VANDNPDrr:
  case X86::VPSUBBrr:
  case X86::VPSUBDrr:
  case X86::VPSUBQrr:
  case X86::VPSUBWrr:
  case X86::VPCMPGTBrr:
  case X86::VPCMPGTDrr:
  case X86::VPCMPGTQrr:
  case X86::VPCMPGTWrr:
    if (ProcessorID == 4) {
      Mask.clearAllBits();
      return MI->getOperand(1).getReg() == MI->getOperand(2).getReg();
    }
    break;
  }

  return false;
} // X86GenSubtargetInfo::isZeroIdiom

Rules for different zero-idioms are discriminated based on the processor identifier which comes from the scheduling model.
Note that a similar definition can be generated for MCInst. This patch shows how to do it, and those extra definitions are currently expanded into X86MCTargetDesc.cpp.
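The sketch below is a hand-written analogue of the generated function, meant to show how rules from more than one processor model could share a single opcode switch and be told apart by processor ID. It is an assumption about the general shape, not actual SubtargetEmitter output: the TOY_* opcodes, the processor IDs 1 and 2, and toyIsZeroIdiom are all made up for the example, and a 64-bit integer stands in for APInt.

```cpp
#include <cstdint>

// Hypothetical opcodes for the example.
enum ToyOpcode : unsigned { TOY_XOR, TOY_SUB, TOY_PXOR, TOY_ADD };

// Hypothetical three-operand instruction: one def, two register sources.
struct ToyInstr {
  ToyOpcode Opcode;
  unsigned Dst, Src1, Src2;
};

bool toyIsZeroIdiom(const ToyInstr &MI, std::uint64_t &Mask,
                    unsigned ProcessorID) {
  switch (MI.Opcode) {
  default:
    break;
  case TOY_XOR:
  case TOY_SUB:
    // Both toy processors treat these as zero-idioms.
    if (ProcessorID == 1 || ProcessorID == 2) {
      Mask = 0; // Like Mask.clearAllBits() in the generated code.
      return MI.Src1 == MI.Src2;
    }
    break;
  case TOY_PXOR:
    // Only toy processor 2 recognizes the vector variant.
    if (ProcessorID == 2) {
      Mask = 0;
      return MI.Src1 == MI.Src2;
    }
    break;
  }
  return false;
}
```

As in the generated code above, the mask is cleared as soon as an opcode/processor pair matches, even if the same-register check then fails, so callers should only read the mask when the function returns true.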

This patch supersedes the one committed at r338372 for D49310.

The main advantages are:

we can describe subtarget predicates via tablegen using STIPredicates.
we can describe zero-idioms / dep-breaking instructions directly via tablegen in the scheduling models.

In future, the STIPredicates framework can be used for solving other problems. For example:

teach how to identify optimizable register-register moves
teach how to identify slow LEA instructions (each subtarget defining its own concept of "slow" LEA).
teach how to identify instructions that have undocumented false dependencies on the output registers on some processors only.
etc.

It is also (in my opinion) an elegant way to expose knowledge to both external tools like llvm-mca, and codegen passes.
For example, machine schedulers in LLVM could reuse that information when internally constructing the data dependency graph for a code region.
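To illustrate the scheduler use case, the following sketch builds read-after-write edges for a straight-line region and skips the edges into an instruction that is reported as dependency-breaking. It is only a toy model of the idea, not how LLVM's schedulers are implemented: ToyInstr, toyIsDependencyBreaking and buildRAWEdges are hypothetical, and the oracle hard-codes opcode 1 with identical sources as the only dependency-breaking case.

```cpp
#include <cstddef>
#include <map>
#include <utility>
#include <vector>

// Hypothetical instruction: one defined register and a list of used registers.
struct ToyInstr {
  unsigned Opcode;
  unsigned Def;
  std::vector<unsigned> Uses;
};

// Hypothetical oracle standing in for a subtarget hook.
bool toyIsDependencyBreaking(const ToyInstr &MI) {
  return MI.Opcode == 1 && MI.Uses.size() == 2 && MI.Uses[0] == MI.Uses[1];
}

// Returns (producer index, consumer index) pairs for RAW dependencies.
std::vector<std::pair<std::size_t, std::size_t>>
buildRAWEdges(const std::vector<ToyInstr> &Region) {
  std::vector<std::pair<std::size_t, std::size_t>> Edges;
  std::map<unsigned, std::size_t> LastDef; // register -> index of last writer

  for (std::size_t I = 0; I < Region.size(); ++I) {
    const ToyInstr &MI = Region[I];
    // A dependency-breaking instruction does not wait for its inputs,
    // so no edge is added from the previous writers of its sources.
    if (!toyIsDependencyBreaking(MI)) {
      for (unsigned Reg : MI.Uses) {
        auto It = LastDef.find(Reg);
        if (It != LastDef.end())
          Edges.emplace_back(It->second, I);
      }
    }
    LastDef[MI.Def] = I;
  }
  return Edges;
}
```

For a region where a register is written, then zeroed by the toy idiom, then read, the only edge produced is from the zeroing instruction to the reader; the original writer no longer constrains the chain.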

This new design feature is also an "opt-in" feature. Processor models don't have to use the new STIPredicates. It has all been designed to be as unintrusive as possible.

Please let me know what you think.

Thanks,
Andrea

Diff Detail

Repository: rL LLVM

Event Timeline

andreadb created this revision. Sep 17 2018, 7:39 AM

Herald added subscribers: mgrang, gbedwell. Sep 17 2018, 7:39 AM

RKSimon added inline comments. Sep 17 2018, 8:11 AM

include/llvm/CodeGen/TargetSubtargetInfo.h
152 ↗	(On Diff #165750)	relationship
include/llvm/MC/MCInstrAnalysis.h
94 ↗	(On Diff #165750)	\param MI
114 ↗	(On Diff #165750)	Please can you make it clear that Mask's bitwidth is total number of operands or just register operands?

Some minors

lib/Target/X86/X86ScheduleBtVer2.td
716 ↗	(On Diff #165750)	VANDNPSYrr/VANDNPDYrr?
test/tools/llvm-mca/X86/BtVer2/zero-idioms-avx-256.s
68 ↗	(On Diff #165750)	Shouldn't this only take a single resource cycle (0.5 rtp)? IIRC dep-breaking 256-bit ops only needs to process the upper half
tools/llvm-mca/include/Instruction.h
319 ↗	(On Diff #165750)	Comments?
tools/llvm-mca/lib/Stages/DispatchStage.cpp
115 ↗	(On Diff #165750)	Does this comment need updating to be zero-idiom specific?
utils/TableGen/CodeGenSchedule.cpp
412 ↗	(On Diff #165750)	Pass by const ref?
utils/TableGen/CodeGenSchedule.h
311 ↗	(On Diff #165750)	Why SmallVector here - the other added classes use std::vector

andreadb marked 8 inline comments as done. Sep 17 2018, 9:33 AM

andreadb added inline comments.

lib/Target/X86/X86ScheduleBtVer2.td
716 ↗	(On Diff #165750)	Interestingly, those were missing in the original implementation of `X86MCInstrAnalysis::isDependencyBreaking()`. I have added them to the set. I have also added two extra tests for VANDNPSYrr/VANDNPDYrr in `test/tools/llvm-mca/X86/BtVer2/zero-idioms-avx-256.s`
test/tools/llvm-mca/X86/BtVer2/zero-idioms-avx-256.s
68 ↗	(On Diff #165750)	Nice catch. I think the latency/throughput of instructions should be fixed by a separate patch. This patch should only help to identify independent operands of an instruction. I will add a TODO to this test.
utils/TableGen/CodeGenSchedule.h
311 ↗	(On Diff #165750)	That vector is expected to be small. In the worst case scenario, it has exactly one element per processor model. It is initialized to 8, because most targets define less than 8 predicates. X86 is the target with most models (9). If you want, I can use a vector here for consistency.

Patch updated.

Addressed review comments.

Added a TODO comment to test zero-idioms-avx-256.s. Processor resource cycles consumed by VXORPSYrr and VXORPDYrr are wrongly set to 2cy (instead of 1cy).

I like this change. I have a few nits in the comments, but this looks good to me, caveat: I am not a tablegen expert.

include/llvm/Target/TargetInstrPredicate.td
245 ↗	(On Diff #165750)	*instructions
266 ↗	(On Diff #165750)	*remove one of the 'operands'

Patch updated.

Addressed review comments.

Patch rebased.

Nice! I only have one comment on the schema, the rest is cosmetic.

include/llvm/Target/TargetInstrPredicate.td
292 ↗	(On Diff #165985)	This feels like a double negation to me. What about "ExpandForMC"? It would also make it more future-proof if we want to expand to more stuff.
tools/llvm-mca/lib/InstrBuilder.cpp
458 ↗	(On Diff #165985)	This could use two temporary bool variables for readability.
utils/TableGen/CodeGenSchedule.cpp
257 ↗	(On Diff #165985)	*processSTIPredicate

andreadb marked 5 inline comments as done. Sep 19 2018, 5:02 AM

andreadb added inline comments.

include/llvm/Target/TargetInstrPredicate.td
292 ↗	(On Diff #165985)	Sure, I will change it. Thanks for the feedback!
306 ↗	(On Diff #165985)	I will update this comment too.
tools/llvm-mca/lib/InstrBuilder.cpp
458 ↗	(On Diff #165985)	I will change it. I tried to expand that if-stmt a bit, adding extra code comments. Hopefully, it is more readable with code comments. Let me know if you prefer it done in a different way.
utils/TableGen/CodeGenSchedule.cpp
257 ↗	(On Diff #165985)	I will fix it. Thanks.

Address review comments.

Also:

Improved the description of class PredicateInfo.
Removed two unused fields from tablegen class STIPredicate.
Added extra verification checks to: a) explicitly disallow InstructionEquivalenceClass definitions with an empty set of opcodes; b) avoid that an instruction opcode is used by multiple equivalence classes of a same STIPredicate.
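The two verification checks listed above could look roughly like the sketch below. This is an assumption about their intent, not the code added to CodeGenSchedule.cpp: verifyClasses and OpcodeSet are hypothetical names, and opcodes are represented as plain strings.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical representation: the opcodes of one equivalence class.
using OpcodeSet = std::vector<std::string>;

// Checks all equivalence classes of one predicate.
// Returns an empty string on success, or a description of the first problem.
std::string verifyClasses(const std::vector<OpcodeSet> &Classes) {
  std::map<std::string, std::size_t> Owner; // opcode -> index of its class

  for (std::size_t I = 0; I < Classes.size(); ++I) {
    // a) A class with no opcodes can never match anything.
    if (Classes[I].empty())
      return "empty opcode set";

    // b) An opcode may belong to at most one class of the same predicate.
    for (const std::string &Op : Classes[I]) {
      auto Res = Owner.emplace(Op, I);
      if (!Res.second && Res.first->second != I)
        return "opcode " + Op + " appears in more than one class";
    }
  }
  return "";
}
```

Rejecting an opcode that appears in two classes keeps the classification unambiguous: each opcode then maps to a single predicate and a single operand mask within a given predicate.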

LGTM - the vandps/vandpd btver2 zero-idiom cases need cleaning up but have confirmed with @andreadb offline that he'll do this as a follow-up

lib/Target/X86/X86ScheduleBtVer2.td
716 ↗	(On Diff #165750)	Possibly sort all these instructions so it's easier to find a specific one?

This revision is now accepted and ready to land. Sep 19 2018, 7:09 AM

Closed by commit rL342555: [TableGen][SubtargetEmitter] Add the ability for processor models to describe… (authored by adibiagio). Sep 19 2018, 8:59 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
llvm/trunk/include/llvm/CodeGen/TargetSubtargetInfo.h	26 lines
llvm/trunk/include/llvm/MC/MCInstrAnalysis.h	49 lines
llvm/trunk/include/llvm/Target/TargetInstrPredicate.td	98 lines
llvm/trunk/lib/MC/MCInstrAnalysis.cpp	5 lines
llvm/trunk/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp	78 lines
llvm/trunk/lib/Target/X86/X86ScheduleBtVer2.td	62 lines
llvm/trunk/test/tools/llvm-mca/X86/BtVer2/zero-idioms-avx-256.s	322 lines
llvm/trunk/tools/llvm-mca/lib/InstrBuilder.cpp	30 lines
llvm/trunk/utils/TableGen/CodeGenSchedule.h	141 lines
llvm/trunk/utils/TableGen/CodeGenSchedule.cpp	212 lines
llvm/trunk/utils/TableGen/PredicateExpander.h	33 lines
llvm/trunk/utils/TableGen/PredicateExpander.cpp	155 lines
llvm/trunk/utils/TableGen/SubtargetEmitter.cpp	45 lines

Diff 166138

llvm/trunk/include/llvm/CodeGen/TargetSubtargetInfo.h

//===- llvm/CodeGen/TargetSubtargetInfo.h - Target Information --- C++ --===//		//===- llvm/CodeGen/TargetSubtargetInfo.h - Target Information --- C++ --===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file describes the subtarget options of a Target machine.		// This file describes the subtarget options of a Target machine.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_CODEGEN_TARGETSUBTARGETINFO_H		#ifndef LLVM_CODEGEN_TARGETSUBTARGETINFO_H
#define LLVM_CODEGEN_TARGETSUBTARGETINFO_H		#define LLVM_CODEGEN_TARGETSUBTARGETINFO_H

		#include "llvm/ADT/APInt.h"
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/CodeGen/PBQPRAConstraint.h"		#include "llvm/CodeGen/PBQPRAConstraint.h"
#include "llvm/CodeGen/ScheduleDAGMutation.h"		#include "llvm/CodeGen/ScheduleDAGMutation.h"
#include "llvm/CodeGen/SchedulerRegistry.h"		#include "llvm/CodeGen/SchedulerRegistry.h"
#include "llvm/MC/MCSubtargetInfo.h"		#include "llvm/MC/MCSubtargetInfo.h"
#include "llvm/Support/CodeGen.h"		#include "llvm/Support/CodeGen.h"
(114 lines not shown)	public:
/// another variant SchedClass, but repeated invocation must quickly terminate		/// another variant SchedClass, but repeated invocation must quickly terminate
/// in a nonvariant SchedClass.		/// in a nonvariant SchedClass.
virtual unsigned resolveSchedClass(unsigned SchedClass,		virtual unsigned resolveSchedClass(unsigned SchedClass,
const MachineInstr *MI,		const MachineInstr *MI,
const TargetSchedModel *SchedModel) const {		const TargetSchedModel *SchedModel) const {
return 0;		return 0;
}		}

		/// Returns true if \param MI is a dependency breaking zero-idiom instruction
		/// for the subtarget.
		///
		/// This function also sets bits in \param Mask related to input operands that
		/// are not in a data dependency relationship. There is one bit for each
		/// machine operand; implicit operands follow explicit operands in the bit
		/// representation used for \param Mask. An empty \param Mask (i.e. a mask
		/// with all bits cleared) means: data dependencies are "broken" for all the
		/// explicit input machine operands of \param MI.
		virtual bool isZeroIdiom(const MachineInstr *MI, APInt &Mask) const {
		return false;
		}

		/// Returns true if \param MI is a dependency breaking instruction for the
		/// subtarget.
		///
		/// Similar in behavior to `isZeroIdiom`. However, it knows how to identify
		/// all dependency breaking instructions (i.e. not just zero-idioms).
		///
		/// As for `isZeroIdiom`, this method returns a mask of "broken" dependencies.
		/// (See method `isZeroIdiom` for a detailed description of \param Mask).
		virtual bool isDependencyBreaking(const MachineInstr *MI, APInt &Mask) const {
		return isZeroIdiom(MI, Mask);
		}

/// True if the subtarget should run MachineScheduler after aggressive		/// True if the subtarget should run MachineScheduler after aggressive
/// coalescing.		/// coalescing.
///		///
/// This currently replaces the SelectionDAG scheduler with the "source" order		/// This currently replaces the SelectionDAG scheduler with the "source" order
/// scheduler (though see below for an option to turn this off and use the		/// scheduler (though see below for an option to turn this off and use the
/// TargetLowering preference). It does not yet disable the postRA scheduler.		/// TargetLowering preference). It does not yet disable the postRA scheduler.
virtual bool enableMachineScheduler() const;		virtual bool enableMachineScheduler() const;

(107 lines not shown)

llvm/trunk/include/llvm/MC/MCInstrAnalysis.h

(82 lines not shown)	public:
///		///
/// The assumption is that the bit-width of the APInt is correctly set by		/// The assumption is that the bit-width of the APInt is correctly set by
/// the caller. The default implementation conservatively assumes that none of		/// the caller. The default implementation conservatively assumes that none of
/// the writes clears the upper portion of a super-register.		/// the writes clears the upper portion of a super-register.
virtual bool clearsSuperRegisters(const MCRegisterInfo &MRI,		virtual bool clearsSuperRegisters(const MCRegisterInfo &MRI,
const MCInst &Inst,		const MCInst &Inst,
APInt &Writes) const;		APInt &Writes) const;

/// Returns true if \param Inst is a dependency breaking instruction for the		/// Returns true if \param MI is a dependency breaking zero-idiom for the
/// given subtarget.		/// given subtarget.
///		///
		/// \param Mask is used to identify input operands that have their dependency
		/// broken. Each bit of the mask is associated with a specific input operand.
		/// Bits associated with explicit input operands are laid out first in the
		/// mask; implicit operands come after explicit operands.
		///
		/// Dependencies are broken only for operands that have their corresponding bit
		/// set. Operands that have their bit cleared, or that don't have a
		/// corresponding bit in the mask don't have their dependency broken.
		/// Note that \param Mask may not be big enough to describe all operands.
		/// The assumption for operands that don't have a correspondent bit in the
		/// mask is that those are still data dependent.
		///
		/// The only exception to the rule is for when \param Mask has all zeroes.
		/// A zero mask means: dependencies are broken for all explicit register
		/// operands.
		virtual bool isZeroIdiom(const MCInst &MI, APInt &Mask,
		unsigned CPUID) const {
		return false;
		}

		/// Returns true if \param MI is a dependency breaking instruction for the
		/// subtarget associated with \param CPUID.
		///
/// The value computed by a dependency breaking instruction is not dependent		/// The value computed by a dependency breaking instruction is not dependent
/// on the inputs. An example of dependency breaking instruction on X86 is		/// on the inputs. An example of dependency breaking instruction on X86 is
/// `XOR %eax, %eax`.		/// `XOR %eax, %eax`.
/// TODO: In future, we could implement an alternative approach where this		///
/// method returns `true` if the input instruction is not dependent on		/// If \param MI is a dependency breaking instruction for subtarget \param
/// some/all of its input operands. An APInt mask could then be used to		/// CPUID, then \param Mask can be inspected to identify independent operands.
/// identify independent operands.		///
virtual bool isDependencyBreaking(const MCSubtargetInfo &STI,		/// Essentially, each bit of the mask corresponds to an input operand.
const MCInst &Inst) const;		/// Explicit operands are laid out first in the mask; implicit operands follow
		/// explicit operands. Bits are set for operands that are independent.
		///
		/// Note that the number of bits in Mask may not be equivalent to the sum of
		/// explicit and implicit operands in \param MI. Operands that don't have a
		/// corresponding bit in Mask are assumed "not independente".
		///
		/// The only exception is for when \param Mask is all zeroes. That means:
		/// explicit input operands of \param MI are independent.
		virtual bool isDependencyBreaking(const MCInst &MI, APInt &Mask,
		unsigned CPUID) const {
		return isZeroIdiom(MI, Mask, CPUID);
		}

/// Given a branch instruction try to get the address the branch		/// Given a branch instruction try to get the address the branch
/// targets. Return true on success, and the address in Target.		/// targets. Return true on success, and the address in Target.
virtual bool		virtual bool
evaluateBranch(const MCInst &Inst, uint64_t Addr, uint64_t Size,		evaluateBranch(const MCInst &Inst, uint64_t Addr, uint64_t Size,
uint64_t &Target) const;		uint64_t &Target) const;

/// Returns (PLT virtual address, GOT virtual address) pairs for PLT entries.		/// Returns (PLT virtual address, GOT virtual address) pairs for PLT entries.
(10 lines not shown)

llvm/trunk/include/llvm/Target/TargetInstrPredicate.td

	(62 lines not shown)
	//			//
	// New MCInstPredicate classes must be added to this file. For each new class			// New MCInstPredicate classes must be added to this file. For each new class
	// XYZ, an "expandXYZ" method must be added to the PredicateExpander.			// XYZ, an "expandXYZ" method must be added to the PredicateExpander.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	// Forward declarations.			// Forward declarations.
	class Instruction;			class Instruction;
				class SchedMachineModel;

	// A generic machine instruction predicate.			// A generic machine instruction predicate.
	class MCInstPredicate;			class MCInstPredicate;

	class MCTrue : MCInstPredicate; // A predicate that always evaluates to True.			class MCTrue : MCInstPredicate; // A predicate that always evaluates to True.
	class MCFalse : MCInstPredicate; // A predicate that always evaluates to False.			class MCFalse : MCInstPredicate; // A predicate that always evaluates to False.
	def TruePred : MCTrue;			def TruePred : MCTrue;
	def FalsePred : MCFalse;			def FalsePred : MCFalse;
	(146 lines not shown)
	// a call to `MCInstFn`, or into a call to`MachineInstrFn` depending on whether			// a call to `MCInstFn`, or into a call to`MachineInstrFn` depending on whether
	// it is lowering predicates for MCInst or MachineInstr.			// it is lowering predicates for MCInst or MachineInstr.
	//			//
	// In this context, `MCInstFn` and `MachineInstrFn` are both function names.			// In this context, `MCInstFn` and `MachineInstrFn` are both function names.
	class CheckFunctionPredicate<string MCInstFn, string MachineInstrFn> : MCInstPredicate {			class CheckFunctionPredicate<string MCInstFn, string MachineInstrFn> : MCInstPredicate {
	string MCInstFnName = MCInstFn;			string MCInstFnName = MCInstFn;
	string MachineInstrFnName = MachineInstrFn;			string MachineInstrFnName = MachineInstrFn;
	}			}

				// Used to classify machine instructions based on a machine instruction
				// predicate.
				//
				// Let IC be an InstructionEquivalenceClass definition, and MI a machine
				// instruction. We say that MI belongs to the equivalence class described by IC
				// if and only if the following two conditions are met:
				// a) MI's opcode is in the `opcodes` set, and
				// b) `Predicate` evaluates to true when applied to MI.
				//
				// Instances of this class can be used by processor scheduling models to
				// describe instructions that have a property in common. For example,
				// InstructionEquivalenceClass definitions can be used to identify the set of
				// dependency breaking instructions for a processor model.
				//
				// An (optional) list of operand indices can be used to further describe
				// properties that apply to instruction operands. For example, it can be used to
				// identify register uses of a dependency breaking instructions that are not in
				// a RAW dependency.
				class InstructionEquivalenceClass<list<Instruction> opcodes,
				MCInstPredicate pred,
				list<int> operands = []> {
				list<Instruction> Opcodes = opcodes;
				MCInstPredicate Predicate = pred;
				list<int> OperandIndices = operands;
				}

				// Used by processor models to describe dependency breaking instructions.
				//
				// This is mainly an alias for InstructionEquivalenceClass. Input operand
				// `BrokenDeps` identifies the set of "broken dependencies". There is one bit
				// per each implicit and explicit input operand. An empty set of broken
				// dependencies means: "explicit input register operands are independent."
				class DepBreakingClass<list<Instruction> opcodes, MCInstPredicate pred,
				list<int> BrokenDeps = []>
				: InstructionEquivalenceClass<opcodes, pred, BrokenDeps>;

				// A function descriptor used to describe the signature of a predicate methods
				// which will be expanded by the STIPredicateExpander into a tablegen'd
				// XXXGenSubtargetInfo class member definition (here, XXX is a target name).
				//
				// It describes the signature of a TargetSubtarget hook, as well as a few extra
				// properties. Examples of extra properties are:
				// - The default return value for the auto-generate function hook.
				// - A list of subtarget hooks (Delegates) that are called from this function.
				//
				class STIPredicateDecl<string name, MCInstPredicate default = FalsePred,
				bit overrides = 1, bit expandForMC = 1,
				bit updatesOpcodeMask = 0,
				list<STIPredicateDecl> delegates = []> {
				string Name = name;

				MCInstPredicate DefaultReturnValue = default;

				// True if this method is declared as virtual in class TargetSubtargetInfo.
				bit OverridesBaseClassMember = overrides;

				// True if we need an equivalent predicate function in the MC layer.
				bit ExpandForMC = expandForMC;

				// True if the autogenerated method has a extra in/out APInt param used as a
				// mask of operands.
				bit UpdatesOpcodeMask = updatesOpcodeMask;

				// A list of STIPredicates used by this definition to delegate part of the
				// computation. For example, STIPredicateFunction `isDependencyBreaking()`
				// delegates to `isZeroIdiom()` part of its computation.
				list<STIPredicateDecl> Delegates = delegates;
				}

				// A predicate function definition member of class `XXXGenSubtargetInfo`.
				//
				// If `Declaration.ExpandForMC` is true, then SubtargetEmitter
				// will also expand another definition of this method that accepts a MCInst.
				class STIPredicate<STIPredicateDecl declaration,
				list<InstructionEquivalenceClass> classes> {
				STIPredicateDecl Declaration = declaration;
				list<InstructionEquivalenceClass> Classes = classes;
				SchedMachineModel SchedModel = ?;
				}

				// Convenience classes and definitions used by processor scheduling models to
				// describe dependency breaking instructions.
				let UpdatesOpcodeMask = 1 in {

				def IsZeroIdiomDecl : STIPredicateDecl<"isZeroIdiom">;

				let Delegates = [IsZeroIdiomDecl] in
				def IsDepBreakingDecl : STIPredicateDecl<"isDependencyBreaking">;

				} // UpdatesOpcodeMask

				class IsZeroIdiomFunction<list<DepBreakingClass> classes>
				: STIPredicate<IsZeroIdiomDecl, classes>;

				class IsDepBreakingFunction<list<DepBreakingClass> classes>
				: STIPredicate<IsDepBreakingDecl, classes>;

llvm/trunk/lib/MC/MCInstrAnalysis.cpp

	(18 lines not shown)

	bool MCInstrAnalysis::clearsSuperRegisters(const MCRegisterInfo &MRI,			bool MCInstrAnalysis::clearsSuperRegisters(const MCRegisterInfo &MRI,
	const MCInst &Inst,			const MCInst &Inst,
	APInt &Writes) const {			APInt &Writes) const {
	Writes.clearAllBits();			Writes.clearAllBits();
	return false;			return false;
	}			}

	bool MCInstrAnalysis::isDependencyBreaking(const MCSubtargetInfo &STI,
	const MCInst &Inst) const {
	return false;
	}

	bool MCInstrAnalysis::evaluateBranch(const MCInst &Inst, uint64_t Addr,			bool MCInstrAnalysis::evaluateBranch(const MCInst &Inst, uint64_t Addr,
	uint64_t Size, uint64_t &Target) const {			uint64_t Size, uint64_t &Target) const {
	if (Inst.getNumOperands() == 0 ||			if (Inst.getNumOperands() == 0 ||
	Info->get(Inst.getOpcode()).OpInfo[0].OperandType != MCOI::OPERAND_PCREL)			Info->get(Inst.getOpcode()).OpInfo[0].OperandType != MCOI::OPERAND_PCREL)
	return false;			return false;

	int64_t Imm = Inst.getOperand(0).getImm();			int64_t Imm = Inst.getOperand(0).getImm();
	Target = Addr+Size+Imm;			Target = Addr+Size+Imm;
	return true;			return true;
	}			}

llvm/trunk/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp

	(374 lines not shown)
	class X86MCInstrAnalysis : public MCInstrAnalysis {			class X86MCInstrAnalysis : public MCInstrAnalysis {
	X86MCInstrAnalysis(const X86MCInstrAnalysis &) = delete;			X86MCInstrAnalysis(const X86MCInstrAnalysis &) = delete;
	X86MCInstrAnalysis &operator=(const X86MCInstrAnalysis &) = delete;			X86MCInstrAnalysis &operator=(const X86MCInstrAnalysis &) = delete;
	virtual ~X86MCInstrAnalysis() = default;			virtual ~X86MCInstrAnalysis() = default;

	public:			public:
	X86MCInstrAnalysis(const MCInstrInfo *MCII) : MCInstrAnalysis(MCII) {}			X86MCInstrAnalysis(const MCInstrInfo *MCII) : MCInstrAnalysis(MCII) {}

	bool isDependencyBreaking(const MCSubtargetInfo &STI,			#define GET_STIPREDICATE_DECLS_FOR_MC_ANALYSIS
	const MCInst &Inst) const override;			#include "X86GenSubtargetInfo.inc"

	bool clearsSuperRegisters(const MCRegisterInfo &MRI, const MCInst &Inst,			bool clearsSuperRegisters(const MCRegisterInfo &MRI, const MCInst &Inst,
	APInt &Mask) const override;			APInt &Mask) const override;
	std::vector<std::pair<uint64_t, uint64_t>>			std::vector<std::pair<uint64_t, uint64_t>>
	findPltEntries(uint64_t PltSectionVA, ArrayRef<uint8_t> PltContents,			findPltEntries(uint64_t PltSectionVA, ArrayRef<uint8_t> PltContents,
	uint64_t GotSectionVA,			uint64_t GotSectionVA,
	const Triple &TargetTriple) const override;			const Triple &TargetTriple) const override;
	};			};

	bool X86MCInstrAnalysis::isDependencyBreaking(const MCSubtargetInfo &STI,			#define GET_STIPREDICATE_DEFS_FOR_MC_ANALYSIS
	const MCInst &Inst) const {			#include "X86GenSubtargetInfo.inc"
	if (STI.getCPU() == "btver2") {
	// Reference: Agner Fog's microarchitecture.pdf - Section 20 "AMD Bobcat and
	// Jaguar pipeline", subsection 8 "Dependency-breaking instructions".
	switch (Inst.getOpcode()) {
	default:
	return false;
	case X86::SUB32rr:
	case X86::SUB64rr:
	case X86::SBB32rr:
	case X86::SBB64rr:
	case X86::XOR32rr:
	case X86::XOR64rr:
	case X86::XORPSrr:
	case X86::XORPDrr:
	case X86::VXORPSrr:
	case X86::VXORPDrr:
	case X86::ANDNPSrr:
	case X86::VANDNPSrr:
	case X86::ANDNPDrr:
	case X86::VANDNPDrr:
	case X86::PXORrr:
	case X86::VPXORrr:
	case X86::PANDNrr:
	case X86::VPANDNrr:
	case X86::PSUBBrr:
	case X86::PSUBWrr:
	case X86::PSUBDrr:
	case X86::PSUBQrr:
	case X86::VPSUBBrr:
	case X86::VPSUBWrr:
	case X86::VPSUBDrr:
	case X86::VPSUBQrr:
	case X86::PCMPEQBrr:
	case X86::PCMPEQWrr:
	case X86::PCMPEQDrr:
	case X86::PCMPEQQrr:
	case X86::VPCMPEQBrr:
	case X86::VPCMPEQWrr:
	case X86::VPCMPEQDrr:
	case X86::VPCMPEQQrr:
	case X86::PCMPGTBrr:
	case X86::PCMPGTWrr:
	case X86::PCMPGTDrr:
	case X86::PCMPGTQrr:
	case X86::VPCMPGTBrr:
	case X86::VPCMPGTWrr:
	case X86::VPCMPGTDrr:
	case X86::VPCMPGTQrr:
	case X86::MMX_PXORirr:
	case X86::MMX_PANDNirr:
	case X86::MMX_PSUBBirr:
	case X86::MMX_PSUBDirr:
	case X86::MMX_PSUBQirr:
	case X86::MMX_PSUBWirr:
	case X86::MMX_PCMPGTBirr:
	case X86::MMX_PCMPGTDirr:
	case X86::MMX_PCMPGTWirr:
	case X86::MMX_PCMPEQBirr:
	case X86::MMX_PCMPEQDirr:
	case X86::MMX_PCMPEQWirr:
	return Inst.getOperand(1).getReg() == Inst.getOperand(2).getReg();
	case X86::CMP32rr:
	case X86::CMP64rr:
	return Inst.getOperand(0).getReg() == Inst.getOperand(1).getReg();
	}
	}

	return false;
	}

	bool X86MCInstrAnalysis::clearsSuperRegisters(const MCRegisterInfo &MRI,			bool X86MCInstrAnalysis::clearsSuperRegisters(const MCRegisterInfo &MRI,
	const MCInst &Inst,			const MCInst &Inst,
	APInt &Mask) const {			APInt &Mask) const {
	const MCInstrDesc &Desc = Info->get(Inst.getOpcode());			const MCInstrDesc &Desc = Info->get(Inst.getOpcode());
	unsigned NumDefs = Desc.getNumDefs();			unsigned NumDefs = Desc.getNumDefs();
	unsigned NumImplicitDefs = Desc.getNumImplicitDefs();			unsigned NumImplicitDefs = Desc.getNumImplicitDefs();
	assert(Mask.getBitWidth() == NumDefs + NumImplicitDefs &&			assert(Mask.getBitWidth() == NumDefs + NumImplicitDefs &&
	[336 lines not shown]

llvm/trunk/lib/Target/X86/X86ScheduleBtVer2.td

	[681 lines not shown]

	def JSlowLEA16r : SchedWriteRes<[JALU01]> {			def JSlowLEA16r : SchedWriteRes<[JALU01]> {
	let Latency = 3;			let Latency = 3;
	let ResourceCycles = [4];			let ResourceCycles = [4];
	}			}

	def : InstRW<[JSlowLEA16r], (instrs LEA16r)>;			def : InstRW<[JSlowLEA16r], (instrs LEA16r)>;

				///////////////////////////////////////////////////////////////////////////////
				// Dependency breaking instructions.
				///////////////////////////////////////////////////////////////////////////////

				def : IsZeroIdiomFunction<[
				// GPR Zero-idioms.
				DepBreakingClass<[ SUB32rr, SUB64rr, XOR32rr, XOR64rr ], ZeroIdiomPredicate>,

				// MMX Zero-idioms.
				DepBreakingClass<[
				MMX_PXORirr, MMX_PANDNirr, MMX_PSUBBirr,
				MMX_PSUBDirr, MMX_PSUBQirr, MMX_PSUBWirr,
				MMX_PCMPGTBirr, MMX_PCMPGTDirr, MMX_PCMPGTWirr
				], ZeroIdiomPredicate>,

				// SSE Zero-idioms.
				DepBreakingClass<[
				// fp variants.
				XORPSrr, XORPDrr, ANDNPSrr, ANDNPDrr,

				// int variants.
				PXORrr, PANDNrr,
				PSUBBrr, PSUBWrr, PSUBDrr, PSUBQrr,
				PCMPGTBrr, PCMPGTDrr, PCMPGTQrr, PCMPGTWrr
				], ZeroIdiomPredicate>,

				// AVX Zero-idioms.
				DepBreakingClass<[
				// xmm fp variants.
				VXORPSrr, VXORPDrr, VANDNPSrr, VANDNPDrr,

				// xmm int variants.
				VPXORrr, VPANDNrr,
				VPSUBBrr, VPSUBWrr, VPSUBDrr, VPSUBQrr,
				VPCMPGTBrr, VPCMPGTWrr, VPCMPGTDrr, VPCMPGTQrr,

				// ymm variants.
				VXORPSYrr, VXORPDYrr, VANDNPSYrr, VANDNPDYrr
				], ZeroIdiomPredicate>
				]>;

				def : IsDepBreakingFunction<[
				// GPR
				DepBreakingClass<[ SBB32rr, SBB64rr ], ZeroIdiomPredicate>,
				DepBreakingClass<[ CMP32rr, CMP64rr ], CheckSameRegOperand<0, 1> >,

				// MMX
				DepBreakingClass<[
				MMX_PCMPEQBirr, MMX_PCMPEQDirr, MMX_PCMPEQWirr
				], ZeroIdiomPredicate>,

				// SSE
				DepBreakingClass<[
				PCMPEQBrr, PCMPEQWrr, PCMPEQDrr, PCMPEQQrr
				], ZeroIdiomPredicate>,

				// AVX
				DepBreakingClass<[
				VPCMPEQBrr, VPCMPEQWrr, VPCMPEQDrr, VPCMPEQQrr
				], ZeroIdiomPredicate>
				]>;

	} // SchedModel			} // SchedModel
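
The two TableGen definitions above pair a list of opcodes with a predicate on register operands. As a rough, self-contained sketch of what such a table expresses (not the code that TableGen generates), the check can be modelled in plain C++. The names `ToyInst`, `ToyDepBreakingClass`, `matchesAnyClass` and `toyBtVer2Excerpt` are hypothetical, and the operand indices (1, 2) used for the zero-idiom entry are an assumption taken from the removed C++ switch, which compares operands 1 and 2.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for an MCInst: an opcode name plus one register id
// per operand.
struct ToyInst {
  std::string Opcode;
  std::vector<unsigned> Regs;
};

// Hypothetical stand-in for one DepBreakingClass entry: the listed opcodes
// match when operands LhsIdx and RhsIdx name the same register.
struct ToyDepBreakingClass {
  std::vector<std::string> Opcodes;
  std::size_t LhsIdx;
  std::size_t RhsIdx;
};

// Returns true if Inst is listed in some class and satisfies that class's
// same-register check.
bool matchesAnyClass(const ToyInst &Inst,
                     const std::vector<ToyDepBreakingClass> &Classes) {
  for (const ToyDepBreakingClass &C : Classes) {
    if (std::find(C.Opcodes.begin(), C.Opcodes.end(), Inst.Opcode) ==
        C.Opcodes.end())
      continue;
    // Skip classes whose operand indices do not exist on this instruction.
    if (C.LhsIdx >= Inst.Regs.size() || C.RhsIdx >= Inst.Regs.size())
      continue;
    if (Inst.Regs[C.LhsIdx] == Inst.Regs[C.RhsIdx])
      return true;
  }
  return false;
}

// A small excerpt of the BtVer2 table: GPR zero-idioms compare operands 1
// and 2 (assumed), CMP compares operands 0 and 1.
std::vector<ToyDepBreakingClass> toyBtVer2Excerpt() {
  return {
      {{"SUB32rr", "SUB64rr", "XOR32rr", "XOR64rr"}, 1, 2},
      {{"CMP32rr", "CMP64rr"}, 0, 1},
  };
}
```

Keeping the table as data is what lets each processor model contribute its own list, which is the point of moving this logic out of the hand-written switch.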

llvm/trunk/test/tools/llvm-mca/X86/BtVer2/zero-idioms-avx-256.s

				# NOTE: Assertions have been autogenerated by utils/update_mca_test_checks.py
				# RUN: llvm-mca -mtriple=x86_64-unknown-unknown -mcpu=btver2 -timeline -timeline-max-iterations=3 < %s | FileCheck %s

				# TODO: Fix the processor resource usage for zero-idiom YMM XOR instructions.
				# Those vector XOR instructions should only consume 1cy of JFPU1 (instead
				# of 2cy).

				# LLVM-MCA-BEGIN ZERO-IDIOM-1

				vaddps %ymm0, %ymm0, %ymm1
				vxorps %ymm1, %ymm1, %ymm1
				vblendps $2, %ymm1, %ymm2, %ymm3

				# LLVM-MCA-END

				# LLVM-MCA-BEGIN ZERO-IDIOM-2

				vaddpd %ymm0, %ymm0, %ymm1
				vxorpd %ymm1, %ymm1, %ymm1
				vblendpd $2, %ymm1, %ymm2, %ymm3

				# LLVM-MCA-END

				# LLVM-MCA-BEGIN ZERO-IDIOM-3
				vaddps %xmm0, %xmm1, %xmm2
				vandnps %xmm2, %xmm2, %xmm3
				# LLVM-MCA-END

				# LLVM-MCA-BEGIN ZERO-IDIOM-4
				vaddps %xmm0, %xmm1, %xmm2
				vandnps %xmm2, %xmm2, %xmm3
				# LLVM-MCA-END

				# CHECK: [0] Code Region - ZERO-IDIOM-1

				# CHECK: Iterations: 100
				# CHECK-NEXT: Instructions: 300
				# CHECK-NEXT: Total Cycles: 306
				# CHECK-NEXT: Total uOps: 600

				# CHECK: Dispatch Width: 2
				# CHECK-NEXT: uOps Per Cycle: 1.96
				# CHECK-NEXT: IPC: 0.98
				# CHECK-NEXT: Block RThroughput: 3.0

				# CHECK: Instruction Info:
				# CHECK-NEXT: [1]: #uOps
				# CHECK-NEXT: [2]: Latency
				# CHECK-NEXT: [3]: RThroughput
				# CHECK-NEXT: [4]: MayLoad
				# CHECK-NEXT: [5]: MayStore
				# CHECK-NEXT: [6]: HasSideEffects (U)

				# CHECK: [1] [2] [3] [4] [5] [6] Instructions:
				# CHECK-NEXT: 2 3 2.00 vaddps %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: 2 1 1.00 vxorps %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: 2 1 1.00 vblendps $2, %ymm1, %ymm2, %ymm3

				# CHECK: Resources:
				# CHECK-NEXT: [0] - JALU0
				# CHECK-NEXT: [1] - JALU1
				# CHECK-NEXT: [2] - JDiv
				# CHECK-NEXT: [3] - JFPA
				# CHECK-NEXT: [4] - JFPM
				# CHECK-NEXT: [5] - JFPU0
				# CHECK-NEXT: [6] - JFPU1
				# CHECK-NEXT: [7] - JLAGU
				# CHECK-NEXT: [8] - JMul
				# CHECK-NEXT: [9] - JSAGU
				# CHECK-NEXT: [10] - JSTC
				# CHECK-NEXT: [11] - JVALU0
				# CHECK-NEXT: [12] - JVALU1
				# CHECK-NEXT: [13] - JVIMUL

				# CHECK: Resource pressure per iteration:
				# CHECK-NEXT: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13]
				# CHECK-NEXT: - - - 3.00 3.00 3.00 3.00 - - - - - - -

				# CHECK: Resource pressure by instruction:
				# CHECK-NEXT: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] Instructions:
				# CHECK-NEXT: - - - 2.00 - 2.00 - - - - - - - - vaddps %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: - - - - 2.00 - 2.00 - - - - - - - vxorps %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: - - - 1.00 1.00 1.00 1.00 - - - - - - - vblendps $2, %ymm1, %ymm2, %ymm3

				# CHECK: Timeline view:
				# CHECK-NEXT: 012
				# CHECK-NEXT: Index 0123456789

				# CHECK: [0,0] DeeeER . . vaddps %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: [0,1] .DeE-R . . vxorps %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: [0,2] . DeE-R . . vblendps $2, %ymm1, %ymm2, %ymm3
				# CHECK-NEXT: [1,0] . D=eeeER. . vaddps %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: [1,1] . DeE--R. . vxorps %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: [1,2] . D=eE-R . vblendps $2, %ymm1, %ymm2, %ymm3
				# CHECK-NEXT: [2,0] . .DeeeER. vaddps %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: [2,1] . . D=eER. vxorps %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: [2,2] . . D=eER vblendps $2, %ymm1, %ymm2, %ymm3

				# CHECK: Average Wait times (based on the timeline view):
				# CHECK-NEXT: [0]: Executions
				# CHECK-NEXT: [1]: Average time spent waiting in a scheduler's queue
				# CHECK-NEXT: [2]: Average time spent waiting in a scheduler's queue while ready
				# CHECK-NEXT: [3]: Average time elapsed from WB until retire stage

				# CHECK: [0] [1] [2] [3]
				# CHECK-NEXT: 0. 3 1.3 1.3 0.0 vaddps %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: 1. 3 1.3 1.3 1.0 vxorps %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: 2. 3 1.7 0.3 0.7 vblendps $2, %ymm1, %ymm2, %ymm3

				# CHECK: [1] Code Region - ZERO-IDIOM-2

				# CHECK: Iterations: 100
				# CHECK-NEXT: Instructions: 300
				# CHECK-NEXT: Total Cycles: 306
				# CHECK-NEXT: Total uOps: 600

				# CHECK: Dispatch Width: 2
				# CHECK-NEXT: uOps Per Cycle: 1.96
				# CHECK-NEXT: IPC: 0.98
				# CHECK-NEXT: Block RThroughput: 3.0

				# CHECK: Instruction Info:
				# CHECK-NEXT: [1]: #uOps
				# CHECK-NEXT: [2]: Latency
				# CHECK-NEXT: [3]: RThroughput
				# CHECK-NEXT: [4]: MayLoad
				# CHECK-NEXT: [5]: MayStore
				# CHECK-NEXT: [6]: HasSideEffects (U)

				# CHECK: [1] [2] [3] [4] [5] [6] Instructions:
				# CHECK-NEXT: 2 3 2.00 vaddpd %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: 2 1 1.00 vxorpd %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: 2 1 1.00 vblendpd $2, %ymm1, %ymm2, %ymm3

				# CHECK: Resources:
				# CHECK-NEXT: [0] - JALU0
				# CHECK-NEXT: [1] - JALU1
				# CHECK-NEXT: [2] - JDiv
				# CHECK-NEXT: [3] - JFPA
				# CHECK-NEXT: [4] - JFPM
				# CHECK-NEXT: [5] - JFPU0
				# CHECK-NEXT: [6] - JFPU1
				# CHECK-NEXT: [7] - JLAGU
				# CHECK-NEXT: [8] - JMul
				# CHECK-NEXT: [9] - JSAGU
				# CHECK-NEXT: [10] - JSTC
				# CHECK-NEXT: [11] - JVALU0
				# CHECK-NEXT: [12] - JVALU1
				# CHECK-NEXT: [13] - JVIMUL

				# CHECK: Resource pressure per iteration:
				# CHECK-NEXT: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13]
				# CHECK-NEXT: - - - 3.00 3.00 3.00 3.00 - - - - - - -

				# CHECK: Resource pressure by instruction:
				# CHECK-NEXT: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] Instructions:
				# CHECK-NEXT: - - - 2.00 - 2.00 - - - - - - - - vaddpd %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: - - - - 2.00 - 2.00 - - - - - - - vxorpd %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: - - - 1.00 1.00 1.00 1.00 - - - - - - - vblendpd $2, %ymm1, %ymm2, %ymm3

				# CHECK: Timeline view:
				# CHECK-NEXT: 012
				# CHECK-NEXT: Index 0123456789

				# CHECK: [0,0] DeeeER . . vaddpd %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: [0,1] .DeE-R . . vxorpd %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: [0,2] . DeE-R . . vblendpd $2, %ymm1, %ymm2, %ymm3
				# CHECK-NEXT: [1,0] . D=eeeER. . vaddpd %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: [1,1] . DeE--R. . vxorpd %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: [1,2] . D=eE-R . vblendpd $2, %ymm1, %ymm2, %ymm3
				# CHECK-NEXT: [2,0] . .DeeeER. vaddpd %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: [2,1] . . D=eER. vxorpd %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: [2,2] . . D=eER vblendpd $2, %ymm1, %ymm2, %ymm3

				# CHECK: Average Wait times (based on the timeline view):
				# CHECK-NEXT: [0]: Executions
				# CHECK-NEXT: [1]: Average time spent waiting in a scheduler's queue
				# CHECK-NEXT: [2]: Average time spent waiting in a scheduler's queue while ready
				# CHECK-NEXT: [3]: Average time elapsed from WB until retire stage

				# CHECK: [0] [1] [2] [3]
				# CHECK-NEXT: 0. 3 1.3 1.3 0.0 vaddpd %ymm0, %ymm0, %ymm1
				# CHECK-NEXT: 1. 3 1.3 1.3 1.0 vxorpd %ymm1, %ymm1, %ymm1
				# CHECK-NEXT: 2. 3 1.7 0.3 0.7 vblendpd $2, %ymm1, %ymm2, %ymm3

				# CHECK: [2] Code Region - ZERO-IDIOM-3

				# CHECK: Iterations: 100
				# CHECK-NEXT: Instructions: 200
				# CHECK-NEXT: Total Cycles: 105
				# CHECK-NEXT: Total uOps: 200

				# CHECK: Dispatch Width: 2
				# CHECK-NEXT: uOps Per Cycle: 1.90
				# CHECK-NEXT: IPC: 1.90
				# CHECK-NEXT: Block RThroughput: 1.0

				# CHECK: Instruction Info:
				# CHECK-NEXT: [1]: #uOps
				# CHECK-NEXT: [2]: Latency
				# CHECK-NEXT: [3]: RThroughput
				# CHECK-NEXT: [4]: MayLoad
				# CHECK-NEXT: [5]: MayStore
				# CHECK-NEXT: [6]: HasSideEffects (U)

				# CHECK: [1] [2] [3] [4] [5] [6] Instructions:
				# CHECK-NEXT: 1 3 1.00 vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: 1 0 0.50 vandnps %xmm2, %xmm2, %xmm3

				# CHECK: Resources:
				# CHECK-NEXT: [0] - JALU0
				# CHECK-NEXT: [1] - JALU1
				# CHECK-NEXT: [2] - JDiv
				# CHECK-NEXT: [3] - JFPA
				# CHECK-NEXT: [4] - JFPM
				# CHECK-NEXT: [5] - JFPU0
				# CHECK-NEXT: [6] - JFPU1
				# CHECK-NEXT: [7] - JLAGU
				# CHECK-NEXT: [8] - JMul
				# CHECK-NEXT: [9] - JSAGU
				# CHECK-NEXT: [10] - JSTC
				# CHECK-NEXT: [11] - JVALU0
				# CHECK-NEXT: [12] - JVALU1
				# CHECK-NEXT: [13] - JVIMUL

				# CHECK: Resource pressure per iteration:
				# CHECK-NEXT: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13]
				# CHECK-NEXT: - - - 1.00 - 1.00 - - - - - - - -

				# CHECK: Resource pressure by instruction:
				# CHECK-NEXT: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] Instructions:
				# CHECK-NEXT: - - - 1.00 - 1.00 - - - - - - - - vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: - - - - - - - - - - - - - - vandnps %xmm2, %xmm2, %xmm3

				# CHECK: Timeline view:
				# CHECK-NEXT: Index 01234567

				# CHECK: [0,0] DeeeER . vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: [0,1] D----R . vandnps %xmm2, %xmm2, %xmm3
				# CHECK-NEXT: [1,0] .DeeeER. vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: [1,1] .D----R. vandnps %xmm2, %xmm2, %xmm3
				# CHECK-NEXT: [2,0] . DeeeER vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: [2,1] . D----R vandnps %xmm2, %xmm2, %xmm3

				# CHECK: Average Wait times (based on the timeline view):
				# CHECK-NEXT: [0]: Executions
				# CHECK-NEXT: [1]: Average time spent waiting in a scheduler's queue
				# CHECK-NEXT: [2]: Average time spent waiting in a scheduler's queue while ready
				# CHECK-NEXT: [3]: Average time elapsed from WB until retire stage

				# CHECK: [0] [1] [2] [3]
				# CHECK-NEXT: 0. 3 1.0 1.0 0.0 vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: 1. 3 0.0 0.0 4.0 vandnps %xmm2, %xmm2, %xmm3

				# CHECK: [3] Code Region - ZERO-IDIOM-4

				# CHECK: Iterations: 100
				# CHECK-NEXT: Instructions: 200
				# CHECK-NEXT: Total Cycles: 105
				# CHECK-NEXT: Total uOps: 200

				# CHECK: Dispatch Width: 2
				# CHECK-NEXT: uOps Per Cycle: 1.90
				# CHECK-NEXT: IPC: 1.90
				# CHECK-NEXT: Block RThroughput: 1.0

				# CHECK: Instruction Info:
				# CHECK-NEXT: [1]: #uOps
				# CHECK-NEXT: [2]: Latency
				# CHECK-NEXT: [3]: RThroughput
				# CHECK-NEXT: [4]: MayLoad
				# CHECK-NEXT: [5]: MayStore
				# CHECK-NEXT: [6]: HasSideEffects (U)

				# CHECK: [1] [2] [3] [4] [5] [6] Instructions:
				# CHECK-NEXT: 1 3 1.00 vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: 1 0 0.50 vandnps %xmm2, %xmm2, %xmm3

				# CHECK: Resources:
				# CHECK-NEXT: [0] - JALU0
				# CHECK-NEXT: [1] - JALU1
				# CHECK-NEXT: [2] - JDiv
				# CHECK-NEXT: [3] - JFPA
				# CHECK-NEXT: [4] - JFPM
				# CHECK-NEXT: [5] - JFPU0
				# CHECK-NEXT: [6] - JFPU1
				# CHECK-NEXT: [7] - JLAGU
				# CHECK-NEXT: [8] - JMul
				# CHECK-NEXT: [9] - JSAGU
				# CHECK-NEXT: [10] - JSTC
				# CHECK-NEXT: [11] - JVALU0
				# CHECK-NEXT: [12] - JVALU1
				# CHECK-NEXT: [13] - JVIMUL

				# CHECK: Resource pressure per iteration:
				# CHECK-NEXT: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13]
				# CHECK-NEXT: - - - 1.00 - 1.00 - - - - - - - -

				# CHECK: Resource pressure by instruction:
				# CHECK-NEXT: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] Instructions:
				# CHECK-NEXT: - - - 1.00 - 1.00 - - - - - - - - vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: - - - - - - - - - - - - - - vandnps %xmm2, %xmm2, %xmm3

				# CHECK: Timeline view:
				# CHECK-NEXT: Index 01234567

				# CHECK: [0,0] DeeeER . vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: [0,1] D----R . vandnps %xmm2, %xmm2, %xmm3
				# CHECK-NEXT: [1,0] .DeeeER. vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: [1,1] .D----R. vandnps %xmm2, %xmm2, %xmm3
				# CHECK-NEXT: [2,0] . DeeeER vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: [2,1] . D----R vandnps %xmm2, %xmm2, %xmm3

				# CHECK: Average Wait times (based on the timeline view):
				# CHECK-NEXT: [0]: Executions
				# CHECK-NEXT: [1]: Average time spent waiting in a scheduler's queue
				# CHECK-NEXT: [2]: Average time spent waiting in a scheduler's queue while ready
				# CHECK-NEXT: [3]: Average time elapsed from WB until retire stage

				# CHECK: [0] [1] [2] [3]
				# CHECK-NEXT: 0. 3 1.0 1.0 0.0 vaddps %xmm0, %xmm1, %xmm2
				# CHECK-NEXT: 1. 3 0.0 0.0 4.0 vandnps %xmm2, %xmm2, %xmm3
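
The summary figures in the CHECK lines above are related by simple division: IPC is instructions over total cycles, and uOps per cycle is total uOps over total cycles. The following sketch reproduces that arithmetic for the values in this test. The helper names `perCycle` and `roundTo2` are hypothetical, and rounding to two decimals is an assumption about presentation only (for these particular values, truncation would give the same digits).

```cpp
#include <cassert>
#include <cmath>

// Ratio of a per-region count (instructions or uOps) to the cycle count.
double perCycle(unsigned Count, unsigned TotalCycles) {
  assert(TotalCycles != 0 && "Cycle count must be non-zero");
  return static_cast<double>(Count) / TotalCycles;
}

// Rounds to two decimal places for comparison with the printed values.
double roundTo2(double V) { return std::round(V * 100.0) / 100.0; }
```

For ZERO-IDIOM-1, 600 uOps over 306 cycles gives 1.96 and 300 instructions over 306 cycles gives 0.98. For ZERO-IDIOM-3, 200 instructions over 105 cycles gives 1.90.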

llvm/trunk/tools/llvm-mca/lib/InstrBuilder.cpp

	[418 lines not shown]
	InstrBuilder::createInstruction(const MCInst &MCI) {			InstrBuilder::createInstruction(const MCInst &MCI) {
	Expected<const InstrDesc &> DescOrErr = getOrCreateInstrDesc(MCI);			Expected<const InstrDesc &> DescOrErr = getOrCreateInstrDesc(MCI);
	if (!DescOrErr)			if (!DescOrErr)
	return DescOrErr.takeError();			return DescOrErr.takeError();
	const InstrDesc &D = *DescOrErr;			const InstrDesc &D = *DescOrErr;
	std::unique_ptr<Instruction> NewIS = llvm::make_unique<Instruction>(D);			std::unique_ptr<Instruction> NewIS = llvm::make_unique<Instruction>(D);

	// Check if this is a dependency breaking instruction.			// Check if this is a dependency breaking instruction.
	bool IsDepBreaking = MCIA.isDependencyBreaking(STI, MCI);			APInt Mask;
	// FIXME: this is a temporary hack to identify zero-idioms.
	bool IsZeroIdiom = D.isZeroLatency() && IsDepBreaking;			unsigned ProcID = STI.getSchedModel().getProcessorID();
				bool IsZeroIdiom = MCIA.isZeroIdiom(MCI, Mask, ProcID);
				bool IsDepBreaking =
				IsZeroIdiom || MCIA.isDependencyBreaking(MCI, Mask, ProcID);

	// Initialize Reads first.			// Initialize Reads first.
	for (const ReadDescriptor &RD : D.Reads) {			for (const ReadDescriptor &RD : D.Reads) {
	int RegID = -1;			int RegID = -1;
	if (!RD.isImplicitRead()) {			if (!RD.isImplicitRead()) {
	// explicit read.			// explicit read.
	const MCOperand &Op = MCI.getOperand(RD.OpIndex);			const MCOperand &Op = MCI.getOperand(RD.OpIndex);
	// Skip non-register operands.			// Skip non-register operands.
	if (!Op.isReg())			if (!Op.isReg())
	continue;			continue;
	RegID = Op.getReg();			RegID = Op.getReg();
	} else {			} else {
	// Implicit read.			// Implicit read.
	RegID = RD.RegisterID;			RegID = RD.RegisterID;
	}			}

	// Skip invalid register operands.			// Skip invalid register operands.
	if (!RegID)			if (!RegID)
	continue;			continue;

	// Okay, this is a register operand. Create a ReadState for it.			// Okay, this is a register operand. Create a ReadState for it.
	assert(RegID > 0 && "Invalid register ID found!");			assert(RegID > 0 && "Invalid register ID found!");
	auto RS = llvm::make_unique<ReadState>(RD, RegID);			auto RS = llvm::make_unique<ReadState>(RD, RegID);

	if (IsDepBreaking && !RD.isImplicitRead())			if (IsDepBreaking) {
				// A mask of all zeroes means: all explicit input operands are
				// independent.
				if (Mask.isNullValue()) {
				if (!RD.isImplicitRead())
				RS->setIndependentFromDef();
				} else {
				// Check if this register operand is independent according to `Mask`.
				// Note that Mask may not have enough bits to describe all explicit and
				// implicit input operands. If this register operand doesn't have a
				// corresponding bit in Mask, then conservatively assume that it is
				// dependent.
				if (Mask.getBitWidth() > RD.UseIndex) {
				// Okay. This mask describes register use `RD.UseIndex`.
				if (Mask[RD.UseIndex])
	RS->setIndependentFromDef();			RS->setIndependentFromDef();
				}
				}
				}
	NewIS->getUses().emplace_back(std::move(RS));			NewIS->getUses().emplace_back(std::move(RS));
	}			}

	// Early exit if there are no writes.			// Early exit if there are no writes.
	if (D.Writes.empty())			if (D.Writes.empty())
	return std::move(NewIS);			return std::move(NewIS);

	// Track register writes that implicitly clear the upper portion of the			// Track register writes that implicitly clear the upper portion of the
	[28 lines not shown]
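
The new logic in the read loop above decides, per register read, whether the read is independent from its definition. A minimal stand-alone sketch of that decision follows, assuming a `std::vector<bool>` in place of `APInt`. The names `ToyRead` and `isReadIndependent` are hypothetical and not part of llvm-mca.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the fields of ReadDescriptor used by the check.
struct ToyRead {
  std::size_t UseIndex; // position of this read among the register uses
  bool IsImplicit;      // true for implicit reads
};

// Mirrors the branch structure of the patched loop:
//  - not dependency breaking: nothing is independent;
//  - all-zero mask: every explicit read is independent;
//  - otherwise: a read is independent only if its bit exists and is set.
bool isReadIndependent(const ToyRead &RD, bool IsDepBreaking,
                       const std::vector<bool> &Mask) {
  if (!IsDepBreaking)
    return false;

  bool AllZero = true;
  for (bool Bit : Mask)
    if (Bit)
      AllZero = false;
  if (AllZero)
    return !RD.IsImplicit;

  // A read without a corresponding bit is conservatively dependent.
  if (RD.UseIndex >= Mask.size())
    return false;
  return Mask[RD.UseIndex];
}
```

Note that in the non-zero-mask branch the implicit/explicit distinction plays no role: only the bit at `UseIndex` matters, as in the diff.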

llvm/trunk/utils/TableGen/CodeGenSchedule.h

[9 lines not shown]
// This file defines structures to encapsulate the machine model as described in		// This file defines structures to encapsulate the machine model as described in
// the target description.		// the target description.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_UTILS_TABLEGEN_CODEGENSCHEDULE_H		#ifndef LLVM_UTILS_TABLEGEN_CODEGENSCHEDULE_H
#define LLVM_UTILS_TABLEGEN_CODEGENSCHEDULE_H		#define LLVM_UTILS_TABLEGEN_CODEGENSCHEDULE_H

		#include "llvm/ADT/APInt.h"
#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/TableGen/Record.h"		#include "llvm/TableGen/Record.h"
#include "llvm/TableGen/SetTheory.h"		#include "llvm/TableGen/SetTheory.h"

namespace llvm {		namespace llvm {

[239 lines not shown]	struct CodeGenProcModel {

bool isUnsupported(const CodeGenInstruction &Inst) const;		bool isUnsupported(const CodeGenInstruction &Inst) const;

#ifndef NDEBUG		#ifndef NDEBUG
void dump() const;		void dump() const;
#endif		#endif
};		};

		/// Used to correlate instructions to MCInstPredicates specified by
		/// InstructionEquivalenceClass tablegen definitions.
		///
		/// Example: an XOR of a register with itself is a known zero-idiom for most
		/// X86 processors.
		///
		/// Each processor can use a (potentially different) InstructionEquivalenceClass
		/// definition to classify zero-idioms. That means, XORrr is likely to appear
		/// in more than one equivalence class (where each class definition is
		/// contributed by a different processor).
		///
		/// There is no guarantee that the same MCInstPredicate will be used to describe
		/// equivalence classes that identify XORrr as a zero-idiom.
		///
		/// To be more specific, the requirements for being a zero-idiom XORrr may be
		/// different for different processors.
		///
		/// Class PredicateInfo identifies a subset of processors that specify the same
		/// requirements (i.e. same MCInstPredicate and OperandMask) for an instruction
		/// opcode.
		///
		/// Back to the example. Field `ProcModelMask` will have one bit set for every
		/// processor model that sees XORrr as a zero-idiom, and that specifies the same
		/// set of constraints.
		///
		/// By construction, there can be multiple instances of PredicateInfo associated
		/// with the same instruction opcode. For example, different processors may define
		/// different constraints on the same opcode.
		///
		/// Field OperandMask can be used as an extra constraint.
		/// It may be used to describe conditions that apply only to a subset of the
		/// operands of a machine instruction, and the operands subset may not be the
		/// same for all processor models.
		struct PredicateInfo {
		llvm::APInt ProcModelMask; // A set of processor model indices.
		llvm::APInt OperandMask; // An operand mask.
		const Record *Predicate; // MCInstPredicate definition.
		PredicateInfo(llvm::APInt CpuMask, llvm::APInt Operands, const Record *Pred)
		: ProcModelMask(CpuMask), OperandMask(Operands), Predicate(Pred) {}

		bool operator==(const PredicateInfo &Other) const {
		return ProcModelMask == Other.ProcModelMask &&
		OperandMask == Other.OperandMask && Predicate == Other.Predicate;
		}
		};

		/// A collection of PredicateInfo objects.
		///
		/// There is at least one OpcodeInfo object for every opcode specified by a
		/// TIPredicate definition.
		class OpcodeInfo {
		llvm::SmallVector<PredicateInfo, 8> Predicates;

		OpcodeInfo(const OpcodeInfo &Other) = delete;
		OpcodeInfo &operator=(const OpcodeInfo &Other) = delete;

		public:
		OpcodeInfo() = default;
		OpcodeInfo &operator=(OpcodeInfo &&Other) = default;
		OpcodeInfo(OpcodeInfo &&Other) = default;

		ArrayRef<PredicateInfo> getPredicates() const { return Predicates; }

		void addPredicateForProcModel(const llvm::APInt &CpuMask,
		const llvm::APInt &OperandMask,
		const Record *Predicate);
		};

		/// Used to group together tablegen instruction definitions that are subject
		/// to the same set of constraints (identified by an instance of OpcodeInfo).
		class OpcodeGroup {
		OpcodeInfo Info;
		std::vector<const Record *> Opcodes;

		OpcodeGroup(const OpcodeGroup &Other) = delete;
		OpcodeGroup &operator=(const OpcodeGroup &Other) = delete;

		public:
		OpcodeGroup(OpcodeInfo &&OpInfo) : Info(std::move(OpInfo)) {}
		OpcodeGroup(OpcodeGroup &&Other) = default;

		void addOpcode(const Record *Opcode) {
		assert(std::find(Opcodes.begin(), Opcodes.end(), Opcode) == Opcodes.end() &&
		"Opcode already in set!");
		Opcodes.push_back(Opcode);
		}

		ArrayRef<const Record *> getOpcodes() const { return Opcodes; }
		const OpcodeInfo &getOpcodeInfo() const { return Info; }
		};

		/// An STIPredicateFunction descriptor used by tablegen backends to
		/// auto-generate the body of a predicate function as a member of tablegen'd
		/// class XXXGenSubtargetInfo.
		class STIPredicateFunction {
		const Record *FunctionDeclaration;

		std::vector<const Record *> Definitions;
		std::vector<OpcodeGroup> Groups;

		STIPredicateFunction(const STIPredicateFunction &Other) = delete;
		STIPredicateFunction &operator=(const STIPredicateFunction &Other) = delete;

		public:
		STIPredicateFunction(const Record *Rec) : FunctionDeclaration(Rec) {}
		STIPredicateFunction(STIPredicateFunction &&Other) = default;

		bool isCompatibleWith(const STIPredicateFunction &Other) const {
		return FunctionDeclaration == Other.FunctionDeclaration;
		}

		void addDefinition(const Record *Def) { Definitions.push_back(Def); }
		void addOpcode(const Record *OpcodeRec, OpcodeInfo &&Info) {
		if (Groups.empty() ||
		Groups.back().getOpcodeInfo().getPredicates() != Info.getPredicates())
		Groups.emplace_back(std::move(Info));
		Groups.back().addOpcode(OpcodeRec);
		}

		StringRef getName() const {
		return FunctionDeclaration->getValueAsString("Name");
		}
		const Record *getDefaultReturnPredicate() const {
		return FunctionDeclaration->getValueAsDef("DefaultReturnValue");
		}

		const Record *getDeclaration() const { return FunctionDeclaration; }
		ArrayRef<const Record *> getDefinitions() const { return Definitions; }
		ArrayRef<OpcodeGroup> getGroups() const { return Groups; }
		};

/// Top level container for machine model data.		/// Top level container for machine model data.
class CodeGenSchedModels {		class CodeGenSchedModels {
RecordKeeper &Records;		RecordKeeper &Records;
const CodeGenTarget &Target;		const CodeGenTarget &Target;

// Map dag expressions to Instruction lists.		// Map dag expressions to Instruction lists.
SetTheory Sets;		SetTheory Sets;

[17 lines not shown]	class CodeGenSchedModels {
RecVec ProcResourceDefs;		RecVec ProcResourceDefs;
RecVec ProcResGroups;		RecVec ProcResGroups;

// Map each instruction to its unique SchedClass index considering the		// Map each instruction to its unique SchedClass index considering the
// combination of its itinerary class, SchedRW list, and InstRW records.		// combination of its itinerary class, SchedRW list, and InstRW records.
using InstClassMapTy = DenseMap<Record*, unsigned>;		using InstClassMapTy = DenseMap<Record*, unsigned>;
InstClassMapTy InstrClassMap;		InstClassMapTy InstrClassMap;

		std::vector<STIPredicateFunction> STIPredicates;

public:		public:
CodeGenSchedModels(RecordKeeper& RK, const CodeGenTarget &TGT);		CodeGenSchedModels(RecordKeeper& RK, const CodeGenTarget &TGT);

// iterator access to the scheduling classes.		// iterator access to the scheduling classes.
using class_iterator = std::vector<CodeGenSchedClass>::iterator;		using class_iterator = std::vector<CodeGenSchedClass>::iterator;
using const_class_iterator = std::vector<CodeGenSchedClass>::const_iterator;		using const_class_iterator = std::vector<CodeGenSchedClass>::const_iterator;
class_iterator classes_begin() { return SchedClasses.begin(); }		class_iterator classes_begin() { return SchedClasses.begin(); }
const_class_iterator classes_begin() const { return SchedClasses.begin(); }		const_class_iterator classes_begin() const { return SchedClasses.begin(); }
[111 lines not shown]	unsigned addSchedClass(Record *ItinDef, ArrayRef<unsigned> OperWrites,
ArrayRef<unsigned> OperReads,		ArrayRef<unsigned> OperReads,
ArrayRef<unsigned> ProcIndices);		ArrayRef<unsigned> ProcIndices);

unsigned findOrInsertRW(ArrayRef<unsigned> Seq, bool IsRead);		unsigned findOrInsertRW(ArrayRef<unsigned> Seq, bool IsRead);

Record *findProcResUnits(Record *ProcResKind, const CodeGenProcModel &PM,		Record *findProcResUnits(Record *ProcResKind, const CodeGenProcModel &PM,
ArrayRef<SMLoc> Loc) const;		ArrayRef<SMLoc> Loc) const;

		ArrayRef<STIPredicateFunction> getSTIPredicates() const {
		return STIPredicates;
		}
private:		private:
void collectProcModels();		void collectProcModels();

// Initialize a new processor model if it is unique.		// Initialize a new processor model if it is unique.
void addProcModel(Record *ProcDef);		void addProcModel(Record *ProcDef);

void collectSchedRW();		void collectSchedRW();

[21 lines not shown]	private:
void collectProcItinRW();		void collectProcItinRW();

void collectProcUnsupportedFeatures();		void collectProcUnsupportedFeatures();

void inferSchedClasses();		void inferSchedClasses();

void checkMCInstPredicates() const;		void checkMCInstPredicates() const;

		void checkSTIPredicates() const;

		void collectSTIPredicates();

void checkCompleteness();		void checkCompleteness();

void inferFromRW(ArrayRef<unsigned> OperWrites, ArrayRef<unsigned> OperReads,		void inferFromRW(ArrayRef<unsigned> OperWrites, ArrayRef<unsigned> OperReads,
unsigned FromClassIdx, ArrayRef<unsigned> ProcIndices);		unsigned FromClassIdx, ArrayRef<unsigned> ProcIndices);
void inferFromItinClass(Record *ItinClassDef, unsigned FromClassIdx);		void inferFromItinClass(Record *ItinClassDef, unsigned FromClassIdx);
void inferFromInstRWs(unsigned SCIdx);		void inferFromInstRWs(unsigned SCIdx);

bool hasSuperGroup(RecVec &SubUnits, CodeGenProcModel &PM);		bool hasSuperGroup(RecVec &SubUnits, CodeGenProcModel &PM);
[23 lines not shown]
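
`STIPredicateFunction::addOpcode` in the header above appends an opcode to the last group when its predicate list equals that group's list, and opens a new group otherwise. The sketch below illustrates that behaviour with plain strings standing in for `PredicateInfo` objects. `ToyGroup` and `addOpcodeSketch` are hypothetical names used only for illustration.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for OpcodeGroup: one predicate list shared by a run
// of opcodes.
struct ToyGroup {
  std::vector<std::string> Predicates;
  std::vector<std::string> Opcodes;
};

// Appends Opcode to the last group if the predicate lists compare equal;
// otherwise starts a new group. Only the most recent group is inspected.
void addOpcodeSketch(std::vector<ToyGroup> &Groups, const std::string &Opcode,
                     const std::vector<std::string> &Predicates) {
  if (Groups.empty() || Groups.back().Predicates != Predicates)
    Groups.push_back(ToyGroup{Predicates, {}});
  Groups.back().Opcodes.push_back(Opcode);
}
```

Because only the last group is checked, opcodes with equal predicate lists end up together only when they are added consecutively, so the order in which opcodes are fed to this function determines how many groups result.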

llvm/trunk/utils/TableGen/CodeGenSchedule.cpp

[219 lines not shown]	CodeGenSchedModels::CodeGenSchedModels(RecordKeeper &RK,
collectProcResources();		collectProcResources();

// Collect optional processor description.		// Collect optional processor description.
collectOptionalProcessorInfo();		collectOptionalProcessorInfo();

// Check MCInstPredicate definitions.		// Check MCInstPredicate definitions.
checkMCInstPredicates();		checkMCInstPredicates();

		// Check STIPredicate definitions.
		checkSTIPredicates();

		// Find STIPredicate definitions for each processor model, and construct
		// STIPredicateFunction objects.
		collectSTIPredicates();

checkCompleteness();		checkCompleteness();
}		}

		void CodeGenSchedModels::checkSTIPredicates() const {
		DenseMap<StringRef, const Record *> Declarations;

		// There cannot be multiple declarations with the same name.
		const RecVec Decls = Records.getAllDerivedDefinitions("STIPredicateDecl");
		for (const Record *R : Decls) {
		StringRef Name = R->getValueAsString("Name");
		const auto It = Declarations.find(Name);
		if (It == Declarations.end()) {
		Declarations[Name] = R;
		continue;
		}

		PrintError(R->getLoc(), "STIPredicate " + Name + " multiply declared.");
		PrintNote(It->second->getLoc(), "Previous declaration was here.");
		PrintFatalError(R->getLoc(), "Invalid STIPredicateDecl found.");
		}

		// Disallow InstructionEquivalenceClasses with an empty instruction list.
		const RecVec Defs =
		Records.getAllDerivedDefinitions("InstructionEquivalenceClass");
		for (const Record *R : Defs) {
		RecVec Opcodes = R->getValueAsListOfDefs("Opcodes");
		if (Opcodes.empty()) {
		PrintFatalError(R->getLoc(), "Invalid InstructionEquivalenceClass "
		"defined with an empty opcode list.");
		}
		}
		}

		// Used by function `processSTIPredicate` to construct a mask of machine
		// instruction operands.
		static APInt constructOperandMask(ArrayRef<int64_t> Indices) {
		APInt OperandMask;
		if (Indices.empty())
		return OperandMask;

		int64_t MaxIndex = *std::max_element(Indices.begin(), Indices.end());
		assert(MaxIndex >= 0 && "Invalid negative indices in input!");
		OperandMask = OperandMask.zext(MaxIndex + 1);
		for (const int64_t Index : Indices) {
		assert(Index >= 0 && "Invalid negative indices!");
		OperandMask.setBit(Index);
		}

		return OperandMask;
		}
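
As an illustration of what `constructOperandMask` computes, here is a stand-alone sketch that uses `std::vector<bool>` in place of `APInt`. The name `constructOperandMaskSketch` is hypothetical. One deliberate difference: for an empty index list this sketch returns an empty vector, whereas the original returns a default-constructed `APInt`.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Builds a mask wide enough to hold the largest index, with one bit set for
// every index in the input.
std::vector<bool>
constructOperandMaskSketch(const std::vector<std::int64_t> &Indices) {
  std::vector<bool> Mask;
  if (Indices.empty())
    return Mask;

  std::int64_t MaxIndex = *std::max_element(Indices.begin(), Indices.end());
  assert(MaxIndex >= 0 && "Invalid negative indices in input!");
  Mask.assign(static_cast<std::size_t>(MaxIndex) + 1, false);
  for (std::int64_t Index : Indices) {
    assert(Index >= 0 && "Invalid negative indices!");
    Mask[static_cast<std::size_t>(Index)] = true;
  }
  return Mask;
}
```

For example, the index list {2, 0} yields a three-bit mask with bits 0 and 2 set.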

		static void
		processSTIPredicate(STIPredicateFunction &Fn,
		const DenseMap<Record *, unsigned> &ProcModelMap) {
		DenseMap<const Record *, unsigned> Opcode2Index;
		using OpcodeMapPair = std::pair<const Record *, OpcodeInfo>;
		std::vector<OpcodeMapPair> OpcodeMappings;
		std::vector<std::pair<APInt, APInt>> OpcodeMasks;

		DenseMap<const Record *, unsigned> Predicate2Index;
		unsigned NumUniquePredicates = 0;

		// Number unique predicates and opcodes used by InstructionEquivalenceClass
		// definitions. Each unique opcode will be associated with an OpcodeInfo
		// object.
		for (const Record *Def : Fn.getDefinitions()) {
		RecVec Classes = Def->getValueAsListOfDefs("Classes");
		for (const Record *EC : Classes) {
		const Record *Pred = EC->getValueAsDef("Predicate");
		if (Predicate2Index.find(Pred) == Predicate2Index.end())
		Predicate2Index[Pred] = NumUniquePredicates++;

		RecVec Opcodes = EC->getValueAsListOfDefs("Opcodes");
		for (const Record *Opcode : Opcodes) {
		if (Opcode2Index.find(Opcode) == Opcode2Index.end()) {
		Opcode2Index[Opcode] = OpcodeMappings.size();
		OpcodeMappings.emplace_back(Opcode, OpcodeInfo());
		}
		}
		}
		}

		// Initialize vector `OpcodeMasks` with default values. We want to keep track
		// of which processors "use" which opcodes. We also want to be able to
		// identify predicates that are used by different processors for the same
		// opcode.
		// This information is used later on by this algorithm to sort OpcodeMappings
		// elements based on their processor and predicate sets.
		OpcodeMasks.resize(OpcodeMappings.size());
		APInt DefaultProcMask(ProcModelMap.size(), 0);
		APInt DefaultPredMask(NumUniquePredicates, 0);
		for (std::pair<APInt, APInt> &MaskPair : OpcodeMasks)
		MaskPair = std::make_pair(DefaultProcMask, DefaultPredMask);

		// Construct an OpcodeInfo object for every unique opcode declared by an
		// InstructionEquivalenceClass definition.
		for (const Record *Def : Fn.getDefinitions()) {
		RecVec Classes = Def->getValueAsListOfDefs("Classes");
		const Record *SchedModel = Def->getValueAsDef("SchedModel");
		unsigned ProcIndex = ProcModelMap.find(SchedModel)->second;
		APInt ProcMask(ProcModelMap.size(), 0);
		ProcMask.setBit(ProcIndex);

		for (const Record *EC : Classes) {
		RecVec Opcodes = EC->getValueAsListOfDefs("Opcodes");

		std::vector<int64_t> OpIndices =
		EC->getValueAsListOfInts("OperandIndices");
		APInt OperandMask = constructOperandMask(OpIndices);

		const Record *Pred = EC->getValueAsDef("Predicate");
		APInt PredMask(NumUniquePredicates, 0);
		PredMask.setBit(Predicate2Index[Pred]);

		for (const Record *Opcode : Opcodes) {
		unsigned OpcodeIdx = Opcode2Index[Opcode];
		if (OpcodeMasks[OpcodeIdx].first[ProcIndex]) {
		std::string Message =
		"Opcode " + Opcode->getName().str() +
		" used by multiple InstructionEquivalenceClass definitions.";
		PrintFatalError(EC->getLoc(), Message);
		}
		OpcodeMasks[OpcodeIdx].first |= ProcMask;
		OpcodeMasks[OpcodeIdx].second |= PredMask;
		OpcodeInfo &OI = OpcodeMappings[OpcodeIdx].second;

		OI.addPredicateForProcModel(ProcMask, OperandMask, Pred);
		}
		}
		}

		// Sort OpcodeMappings elements based on their CPU and predicate masks.
		// As a last resort, order elements by opcode identifier.
		llvm::sort(OpcodeMappings.begin(), OpcodeMappings.end(),
		[&](const OpcodeMapPair &Lhs, const OpcodeMapPair &Rhs) {
		unsigned LhsIdx = Opcode2Index[Lhs.first];
		unsigned RhsIdx = Opcode2Index[Rhs.first];
		std::pair<APInt, APInt> &LhsMasks = OpcodeMasks[LhsIdx];
		std::pair<APInt, APInt> &RhsMasks = OpcodeMasks[RhsIdx];

		if (LhsMasks.first != RhsMasks.first) {
		if (LhsMasks.first.countPopulation() <
		RhsMasks.first.countPopulation())
		return true;
		return LhsMasks.first.countLeadingZeros() >
		RhsMasks.first.countLeadingZeros();
		}

		if (LhsMasks.second != RhsMasks.second) {
		if (LhsMasks.second.countPopulation() <
		RhsMasks.second.countPopulation())
		return true;
		return LhsMasks.second.countLeadingZeros() >
		RhsMasks.second.countLeadingZeros();
		}

		return LhsIdx < RhsIdx;
		});

		// Now construct opcode groups. Groups are used by the SubtargetEmitter when
		// expanding the body of a STIPredicate function. In particular, each opcode
		// group is expanded into a sequence of labels in a switch statement.
		// Each group identifies opcodes for which different processors define the
		// same predicates and the same opcode masks.
		for (OpcodeMapPair &Info : OpcodeMappings)
		Fn.addOpcode(Info.first, std::move(Info.second));
		}
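The mask comparison used by the sort lambda in `processSTIPredicate` can be examined in isolation. The sketch below models each mask as a `uint8_t` (an assumption for brevity; the patch uses `APInt`) and reproduces the two-step comparison in a hypothetical helper, `maskLessAsWritten`. For the pair `0x07` and `0x10` that helper returns true in both directions, which a comparator passed to a `std::sort`-style algorithm is not allowed to do. `maskLessStrict` is one possible alternative, not taken from the patch, that consults leading zeros only when the population counts are equal.

```cpp
#include <cassert>
#include <cstdint>

// Count set bits in an 8-bit mask.
unsigned popCount(uint8_t V) {
  unsigned N = 0;
  for (; V; V &= static_cast<uint8_t>(V - 1))
    ++N;
  return N;
}

// Count leading zero bits in an 8-bit mask (8 when V == 0).
unsigned leadingZeros(uint8_t V) {
  unsigned N = 0;
  for (uint8_t Bit = 0x80; Bit && !(V & Bit); Bit >>= 1)
    ++N;
  return N;
}

// Hypothetical helper mirroring the mask comparison in the lambda.
bool maskLessAsWritten(uint8_t L, uint8_t R) {
  if (L == R)
    return false; // The lambda moves on to its next key in this case.
  if (popCount(L) < popCount(R))
    return true;
  return leadingZeros(L) > leadingZeros(R);
}

// One possible strict weak ordering: compare population counts first and use
// leading zeros only to break ties.
bool maskLessStrict(uint8_t L, uint8_t R) {
  unsigned LP = popCount(L), RP = popCount(R);
  if (LP != RP)
    return LP < RP;
  return leadingZeros(L) > leadingZeros(R);
}

// Checks one concrete pair: 0x07 has 3 set bits and 5 leading zeros,
// 0x10 has 1 set bit and 3 leading zeros.
void checkExamplePair() {
  assert(maskLessAsWritten(0x07, 0x10) && maskLessAsWritten(0x10, 0x07));
  assert(!maskLessStrict(0x07, 0x10) && maskLessStrict(0x10, 0x07));
}
```

Whether such mask pairs occur in practice depends on the processor models involved, so this is a property of the comparator worth checking rather than a demonstrated failure.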

		void CodeGenSchedModels::collectSTIPredicates() {
		// Map STIPredicateDecl records to elements of vector
		// CodeGenSchedModels::STIPredicates.
		DenseMap<const Record *, unsigned> Decl2Index;

		RecVec RV = Records.getAllDerivedDefinitions("STIPredicate");
		for (const Record *R : RV) {
		const Record *Decl = R->getValueAsDef("Declaration");

		const auto It = Decl2Index.find(Decl);
		if (It == Decl2Index.end()) {
		Decl2Index[Decl] = STIPredicates.size();
		STIPredicateFunction Predicate(Decl);
		Predicate.addDefinition(R);
		STIPredicates.emplace_back(std::move(Predicate));
		continue;
		}

		STIPredicateFunction &PreviousDef = STIPredicates[It->second];
		PreviousDef.addDefinition(R);
		}

		for (STIPredicateFunction &Fn : STIPredicates)
		processSTIPredicate(Fn, ProcModelMap);
		}
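The grouping step in `collectSTIPredicates` (one `STIPredicateFunction` per declaration, with every matching definition attached to it) can be sketched without TableGen types. In the example below, records are reduced to strings, and `PredicateGroup` and `groupByDeclaration` are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical simplification of STIPredicateFunction: a declaration name
// plus the names of its definitions, kept in input order.
struct PredicateGroup {
  std::string Declaration;
  std::vector<std::string> Definitions;
};

// Each input pair is (declaration name, definition name).
std::vector<PredicateGroup> groupByDeclaration(
    const std::vector<std::pair<std::string, std::string>> &Defs) {
  std::map<std::string, std::size_t> Decl2Index;
  std::vector<PredicateGroup> Groups;
  for (const auto &Def : Defs) {
    auto It = Decl2Index.find(Def.first);
    if (It == Decl2Index.end()) {
      // First definition seen for this declaration: open a new group.
      Decl2Index[Def.first] = Groups.size();
      Groups.push_back({Def.first, {Def.second}});
      continue;
    }
    // Declaration already known: attach the definition to its group.
    Groups[It->second].Definitions.push_back(Def.second);
  }
  assert(Groups.size() == Decl2Index.size() && "one group per declaration");
  return Groups;
}
```

Groups come out in the order in which their declarations are first seen, which matches how the patch appends to `STIPredicates`.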

		void OpcodeInfo::addPredicateForProcModel(const llvm::APInt &CpuMask,
		const llvm::APInt &OperandMask,
		const Record *Predicate) {
		auto It = llvm::find_if(
		Predicates, [&OperandMask, &Predicate](const PredicateInfo &P) {
		return P.Predicate == Predicate && P.OperandMask == OperandMask;
		});
		if (It == Predicates.end()) {
		Predicates.emplace_back(CpuMask, OperandMask, Predicate);
		return;
		}
		It->ProcModelMask |= CpuMask;
		}
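To make the merging behaviour of `addPredicateForProcModel` concrete, here is a minimal standalone sketch. It assumes masks fit in a `uint64_t` and identifies a predicate by name; `SimplePredicateInfo` and the free-function form are assumptions for this example, while the patch itself works on `APInt` masks and `Record` pointers.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical, simplified PredicateInfo: masks are plain integers and the
// predicate record is identified by name.
struct SimplePredicateInfo {
  uint64_t ProcModelMask;
  uint64_t OperandMask;
  std::string Predicate;
};

// Adds (CpuMask, OperandMask, Predicate) to Predicates. An existing entry with
// the same predicate and operand mask is reused; only its processor mask grows.
void addPredicateForProcModel(std::vector<SimplePredicateInfo> &Predicates,
                              uint64_t CpuMask, uint64_t OperandMask,
                              const std::string &Predicate) {
  assert(CpuMask != 0 && "expected at least one processor bit");
  auto It = std::find_if(Predicates.begin(), Predicates.end(),
                         [&](const SimplePredicateInfo &P) {
                           return P.Predicate == Predicate &&
                                  P.OperandMask == OperandMask;
                         });
  if (It == Predicates.end()) {
    Predicates.push_back({CpuMask, OperandMask, Predicate});
    return;
  }
  It->ProcModelMask |= CpuMask;
}
```

Two processors that register the same predicate with the same operand mask therefore share a single entry, which is what later lets the emitter print one `if (ProcessorID == A || ProcessorID == B)` check for both.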

void CodeGenSchedModels::checkMCInstPredicates() const {		void CodeGenSchedModels::checkMCInstPredicates() const {
RecVec MCPredicates = Records.getAllDerivedDefinitions("TIIPredicate");		RecVec MCPredicates = Records.getAllDerivedDefinitions("TIIPredicate");
if (MCPredicates.empty())		if (MCPredicates.empty())
return;		return;

// A target cannot have multiple TIIPredicate definitions with the same name.		// A target cannot have multiple TIIPredicate definitions with the same name.
llvm::StringMap<const Record *> TIIPredicates(MCPredicates.size());		llvm::StringMap<const Record *> TIIPredicates(MCPredicates.size());
for (const Record *TIIPred : MCPredicates) {		for (const Record *TIIPred : MCPredicates) {
[...]

llvm/trunk/utils/TableGen/PredicateExpander.h

[...]
public:		public:
PredicateExpander(StringRef Target)		PredicateExpander(StringRef Target)
: EmitCallsByRef(true), NegatePredicate(false), ExpandForMC(false),		: EmitCallsByRef(true), NegatePredicate(false), ExpandForMC(false),
IndentLevel(1U), TargetName(Target) {}		IndentLevel(1U), TargetName(Target) {}
bool isByRef() const { return EmitCallsByRef; }		bool isByRef() const { return EmitCallsByRef; }
bool shouldNegate() const { return NegatePredicate; }		bool shouldNegate() const { return NegatePredicate; }
bool shouldExpandForMC() const { return ExpandForMC; }		bool shouldExpandForMC() const { return ExpandForMC; }
unsigned getIndentLevel() const { return IndentLevel; }		unsigned getIndentLevel() const { return IndentLevel; }
		StringRef getTargetName() const { return TargetName; }

void setByRef(bool Value) { EmitCallsByRef = Value; }		void setByRef(bool Value) { EmitCallsByRef = Value; }
void flipNegatePredicate() { NegatePredicate = !NegatePredicate; }		void flipNegatePredicate() { NegatePredicate = !NegatePredicate; }
void setNegatePredicate(bool Value) { NegatePredicate = Value; }		void setNegatePredicate(bool Value) { NegatePredicate = Value; }
void setExpandForMC(bool Value) { ExpandForMC = Value; }		void setExpandForMC(bool Value) { ExpandForMC = Value; }
		void setIndentLevel(unsigned Level) { IndentLevel = Level; }
void increaseIndentLevel() { ++IndentLevel; }		void increaseIndentLevel() { ++IndentLevel; }
void decreaseIndentLevel() { --IndentLevel; }		void decreaseIndentLevel() { --IndentLevel; }
void setIndentLevel(unsigned Level) { IndentLevel = Level; }

using RecVec = std::vector<Record *>;		using RecVec = std::vector<Record *>;
void expandTrue(raw_ostream &OS);		void expandTrue(raw_ostream &OS);
void expandFalse(raw_ostream &OS);		void expandFalse(raw_ostream &OS);
void expandCheckImmOperand(raw_ostream &OS, int OpIndex, int ImmVal);		void expandCheckImmOperand(raw_ostream &OS, int OpIndex, int ImmVal);
void expandCheckImmOperand(raw_ostream &OS, int OpIndex, StringRef ImmVal);		void expandCheckImmOperand(raw_ostream &OS, int OpIndex, StringRef ImmVal);
void expandCheckRegOperand(raw_ostream &OS, int OpIndex, const Record *Reg);		void expandCheckRegOperand(raw_ostream &OS, int OpIndex, const Record *Reg);
void expandCheckSameRegOperand(raw_ostream &OS, int First, int Second);		void expandCheckSameRegOperand(raw_ostream &OS, int First, int Second);
[...]	public:
void expandPredicate(raw_ostream &OS, const Record *Rec);		void expandPredicate(raw_ostream &OS, const Record *Rec);
void expandReturnStatement(raw_ostream &OS, const Record *Rec);		void expandReturnStatement(raw_ostream &OS, const Record *Rec);
void expandOpcodeSwitchCase(raw_ostream &OS, const Record *Rec);		void expandOpcodeSwitchCase(raw_ostream &OS, const Record *Rec);
void expandOpcodeSwitchStatement(raw_ostream &OS, const RecVec &Cases,		void expandOpcodeSwitchStatement(raw_ostream &OS, const RecVec &Cases,
const Record *Default);		const Record *Default);
void expandStatement(raw_ostream &OS, const Record *Rec);		void expandStatement(raw_ostream &OS, const Record *Rec);
};		};

		// Forward declarations.
		class STIPredicateFunction;
		class OpcodeGroup;

		class STIPredicateExpander : public PredicateExpander {
		StringRef ClassPrefix;
		bool ExpandDefinition;

		STIPredicateExpander(const PredicateExpander &) = delete;
		STIPredicateExpander &operator=(const PredicateExpander &) = delete;

		void expandHeader(raw_ostream &OS, const STIPredicateFunction &Fn);
		void expandPrologue(raw_ostream &OS, const STIPredicateFunction &Fn);
		void expandOpcodeGroup(raw_ostream &OS, const OpcodeGroup &Group,
		bool ShouldUpdateOpcodeMask);
		void expandBody(raw_ostream &OS, const STIPredicateFunction &Fn);
		void expandEpilogue(raw_ostream &OS, const STIPredicateFunction &Fn);

		public:
		STIPredicateExpander(StringRef Target)
		: PredicateExpander(Target), ClassPrefix(), ExpandDefinition(false) {}

		bool shouldExpandDefinition() const { return ExpandDefinition; }
		StringRef getClassPrefix() const { return ClassPrefix; }
		void setClassPrefix(StringRef S) { ClassPrefix = S; }
		void setExpandDefinition(bool Value) { ExpandDefinition = Value; }

		void expandSTIPredicate(raw_ostream &OS, const STIPredicateFunction &Fn);
		};

} // namespace llvm		} // namespace llvm

#endif		#endif

llvm/trunk/utils/TableGen/PredicateExpander.cpp

//===--------------------- PredicateExpander.cpp --------------------------===//		//===--------------------- PredicateExpander.cpp --------------------------===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
/// \file		/// \file
/// Functionalities used by the Tablegen backends to expand machine predicates.		/// Functionalities used by the Tablegen backends to expand machine predicates.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "PredicateExpander.h"		#include "PredicateExpander.h"
		#include "CodeGenSchedule.h" // Definition of STIPredicateFunction.

namespace llvm {		namespace llvm {

void PredicateExpander::expandTrue(raw_ostream &OS) { OS << "true"; }		void PredicateExpander::expandTrue(raw_ostream &OS) { OS << "true"; }
void PredicateExpander::expandFalse(raw_ostream &OS) { OS << "false"; }		void PredicateExpander::expandFalse(raw_ostream &OS) { OS << "false"; }

void PredicateExpander::expandCheckImmOperand(raw_ostream &OS, int OpIndex,		void PredicateExpander::expandCheckImmOperand(raw_ostream &OS, int OpIndex,
int ImmVal) {		int ImmVal) {
[...]	if (Rec->isSubClassOf("CheckNonPortable"))
return expandCheckNonPortable(OS, Rec->getValueAsString("CodeBlock"));		return expandCheckNonPortable(OS, Rec->getValueAsString("CodeBlock"));

if (Rec->isSubClassOf("TIIPredicate"))		if (Rec->isSubClassOf("TIIPredicate"))
return expandTIIFunctionCall(OS, Rec->getValueAsString("FunctionName"));		return expandTIIFunctionCall(OS, Rec->getValueAsString("FunctionName"));

llvm_unreachable("No known rules to expand this MCInstPredicate");		llvm_unreachable("No known rules to expand this MCInstPredicate");
}		}

		void STIPredicateExpander::expandHeader(raw_ostream &OS,
		const STIPredicateFunction &Fn) {
		const Record *Rec = Fn.getDeclaration();
		StringRef FunctionName = Rec->getValueAsString("Name");

		OS.indent(getIndentLevel() * 2);
		OS << "bool ";
		if (shouldExpandDefinition())
		OS << getClassPrefix() << "::";
		OS << FunctionName << "(";
		if (shouldExpandForMC())
		OS << "const MCInst " << (isByRef() ? "&" : "*") << "MI";
		else
		OS << "const MachineInstr " << (isByRef() ? "&" : "*") << "MI";
		if (Rec->getValueAsBit("UpdatesOpcodeMask"))
		OS << ", APInt &Mask";
		OS << (shouldExpandForMC() ? ", unsigned ProcessorID) const " : ") const ");
		if (shouldExpandDefinition()) {
		OS << "{\n";
		return;
		}

		if (Rec->getValueAsBit("OverridesBaseClassMember"))
		OS << "override";
		OS << ";\n";
		}
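The declaration strings produced by `expandHeader` can be previewed with a small string-building sketch. `formatHeader` is a hypothetical helper that takes the relevant flags as plain parameters; it covers only the declaration path (no class prefix, no opening brace) and omits the leading indentation that the patch writes via `OS.indent`.

```cpp
#include <cassert>
#include <string>

// Hypothetical mirror of the pieces emitted by expandHeader when a
// declaration, rather than a definition, is being expanded.
std::string formatHeader(const std::string &Name, bool ForMC, bool ByRef,
                         bool UpdatesOpcodeMask, bool Overrides) {
  assert(!Name.empty() && "function name must not be empty");
  std::string S = "bool " + Name + "(";
  S += ForMC ? "const MCInst " : "const MachineInstr ";
  S += ByRef ? "&" : "*";
  S += "MI";
  if (UpdatesOpcodeMask)
    S += ", APInt &Mask";
  // MC variants take the processor ID as an explicit trailing parameter.
  S += ForMC ? ", unsigned ProcessorID) const " : ") const ";
  if (Overrides)
    S += "override";
  S += ";";
  return S;
}
```

With `ForMC` and `ByRef` false and the other two flags true, this produces `bool isZeroIdiom(const MachineInstr *MI, APInt &Mask) const override;` for the name `isZeroIdiom`, which lines up with the hook signature quoted in the summary.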

		void STIPredicateExpander::expandPrologue(raw_ostream &OS,
		const STIPredicateFunction &Fn) {
		RecVec Delegates = Fn.getDeclaration()->getValueAsListOfDefs("Delegates");
		bool UpdatesOpcodeMask =
		Fn.getDeclaration()->getValueAsBit("UpdatesOpcodeMask");

		increaseIndentLevel();
		unsigned IndentLevel = getIndentLevel();
		for (const Record *Delegate : Delegates) {
		OS.indent(IndentLevel * 2);
		OS << "if (" << Delegate->getValueAsString("Name") << "(MI";
		if (UpdatesOpcodeMask)
		OS << ", Mask";
		if (shouldExpandForMC())
		OS << ", ProcessorID";
		OS << "))\n";
		OS.indent((1 + IndentLevel) * 2);
		OS << "return true;\n\n";
		}

		if (shouldExpandForMC())
		return;

		OS.indent(IndentLevel * 2);
		OS << "unsigned ProcessorID = getSchedModel().getProcessorID();\n";
		}

		void STIPredicateExpander::expandOpcodeGroup(raw_ostream &OS, const OpcodeGroup &Group,
		bool ShouldUpdateOpcodeMask) {
		const OpcodeInfo &OI = Group.getOpcodeInfo();
		for (const PredicateInfo &PI : OI.getPredicates()) {
		const APInt &ProcModelMask = PI.ProcModelMask;
		bool FirstProcID = true;
		for (unsigned I = 0, E = ProcModelMask.getActiveBits(); I < E; ++I) {
		if (!ProcModelMask[I])
		continue;

		if (FirstProcID) {
		OS.indent(getIndentLevel() * 2);
		OS << "if (ProcessorID == " << I;
		} else {
		OS << " || ProcessorID == " << I;
		}
		FirstProcID = false;
		}

		OS << ") {\n";

		increaseIndentLevel();
		OS.indent(getIndentLevel() * 2);
		if (ShouldUpdateOpcodeMask) {
		if (PI.OperandMask.isNullValue())
		OS << "Mask.clearAllBits();\n";
		else
		OS << "Mask = " << PI.OperandMask << ";\n";
		OS.indent(getIndentLevel() * 2);
		}
		OS << "return ";
		expandPredicate(OS, PI.Predicate);
		OS << ";\n";
		decreaseIndentLevel();
		OS.indent(getIndentLevel() * 2);
		OS << "}\n";
		}
		}

		void STIPredicateExpander::expandBody(raw_ostream &OS,
		const STIPredicateFunction &Fn) {
		bool UpdatesOpcodeMask =
		Fn.getDeclaration()->getValueAsBit("UpdatesOpcodeMask");

		unsigned IndentLevel = getIndentLevel();
		OS.indent(IndentLevel * 2);
		OS << "switch(MI" << (isByRef() ? "." : "->") << "getOpcode()) {\n";
		OS.indent(IndentLevel * 2);
		OS << "default:\n";
		OS.indent(IndentLevel * 2);
		OS << " break;";

		for (const OpcodeGroup &Group : Fn.getGroups()) {
		for (const Record *Opcode : Group.getOpcodes()) {
		OS << '\n';
		OS.indent(IndentLevel * 2);
		OS << "case " << getTargetName() << "::" << Opcode->getName() << ":";
		}

		OS << '\n';
		increaseIndentLevel();
		expandOpcodeGroup(OS, Group, UpdatesOpcodeMask);

		OS.indent(getIndentLevel() * 2);
		OS << "break;\n";
		decreaseIndentLevel();
		}

		OS.indent(IndentLevel * 2);
		OS << "}\n";
		}

		void STIPredicateExpander::expandEpilogue(raw_ostream &OS,
		const STIPredicateFunction &Fn) {
		OS << '\n';
		OS.indent(getIndentLevel() * 2);
		OS << "return ";
		expandPredicate(OS, Fn.getDefaultReturnPredicate());
		OS << ";\n";

		decreaseIndentLevel();
		OS.indent(getIndentLevel() * 2);
		StringRef FunctionName = Fn.getDeclaration()->getValueAsString("Name");
		OS << "} // " << ClassPrefix << "::" << FunctionName << "\n\n";
		}

		void STIPredicateExpander::expandSTIPredicate(raw_ostream &OS,
		const STIPredicateFunction &Fn) {
		const Record *Rec = Fn.getDeclaration();
		if (shouldExpandForMC() && !Rec->getValueAsBit("ExpandForMC"))
		return;

		expandHeader(OS, Fn);
		if (shouldExpandDefinition()) {
		expandPrologue(OS, Fn);
		expandBody(OS, Fn);
		expandEpilogue(OS, Fn);
		}
		}
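Putting the prologue, body, and epilogue together, an expanded definition has roughly the shape sketched below. Everything in this example is illustrative: the opcode names, the processor IDs 1 and 3, the `MockInstr` type, and the same-register check are assumptions, and a `uint64_t` stands in for `APInt`. In the patch, the non-MC variant reads `ProcessorID` from `getSchedModel().getProcessorID()` in the prologue, while the MC variant receives it as a parameter, as modelled here.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins; real generated code would use the target's opcode
// enum, MachineInstr or MCInst, and APInt.
enum Opcode : unsigned { XOR32rr, SUB32rr, ADD32rr };

struct MockInstr {
  unsigned Opc;
  unsigned Reg1, Reg2; // Register numbers of source operands 1 and 2.
  unsigned getOpcode() const { return Opc; }
};

// Shape of an expanded STIPredicate definition: a switch over opcode groups,
// one processor check per predicate entry, then the default return.
bool isZeroIdiomSketch(const MockInstr *MI, uint64_t &Mask,
                       unsigned ProcessorID) {
  assert(MI && "expected a valid instruction");
  switch (MI->getOpcode()) {
  default:
    break;
  case XOR32rr:
  case SUB32rr:
    if (ProcessorID == 1 || ProcessorID == 3) {
      Mask = 0; // Corresponds to the emitted Mask.clearAllBits().
      return MI->Reg1 == MI->Reg2;
    }
    break;
  }

  return false; // Default return predicate.
}
```

Note that `Mask` is written only on the path where a processor check matches; for any other opcode or processor the caller's value is left untouched.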

} // namespace llvm		} // namespace llvm

llvm/trunk/utils/TableGen/SubtargetEmitter.cpp

[...]	void GenSchedClassTables(const CodeGenProcModel &ProcModel,
SchedClassTables &SchedTables);		SchedClassTables &SchedTables);
void EmitSchedClassTables(SchedClassTables &SchedTables, raw_ostream &OS);		void EmitSchedClassTables(SchedClassTables &SchedTables, raw_ostream &OS);
void EmitProcessorModels(raw_ostream &OS);		void EmitProcessorModels(raw_ostream &OS);
void EmitProcessorLookup(raw_ostream &OS);		void EmitProcessorLookup(raw_ostream &OS);
void EmitSchedModelHelpers(const std::string &ClassName, raw_ostream &OS);		void EmitSchedModelHelpers(const std::string &ClassName, raw_ostream &OS);
void emitSchedModelHelpersImpl(raw_ostream &OS,		void emitSchedModelHelpersImpl(raw_ostream &OS,
bool OnlyExpandMCInstPredicates = false);		bool OnlyExpandMCInstPredicates = false);
void emitGenMCSubtargetInfo(raw_ostream &OS);		void emitGenMCSubtargetInfo(raw_ostream &OS);
		void EmitMCInstrAnalysisPredicateFunctions(raw_ostream &OS);

void EmitSchedModel(raw_ostream &OS);		void EmitSchedModel(raw_ostream &OS);
void EmitHwModeCheck(const std::string &ClassName, raw_ostream &OS);		void EmitHwModeCheck(const std::string &ClassName, raw_ostream &OS);
void ParseFeaturesFunction(raw_ostream &OS, unsigned NumFeatures,		void ParseFeaturesFunction(raw_ostream &OS, unsigned NumFeatures,
unsigned NumProcs);		unsigned NumProcs);

public:		public:
SubtargetEmitter(RecordKeeper &R, CodeGenTarget &TGT)		SubtargetEmitter(RecordKeeper &R, CodeGenTarget &TGT)
[...]	void SubtargetEmitter::EmitSchedModelHelpers(const std::string &ClassName,

OS << "} // " << ClassName << "::resolveSchedClass\n\n";		OS << "} // " << ClassName << "::resolveSchedClass\n\n";

OS << "unsigned " << ClassName		OS << "unsigned " << ClassName
<< "\n::resolveVariantSchedClass(unsigned SchedClass, const MCInst *MI,"		<< "\n::resolveVariantSchedClass(unsigned SchedClass, const MCInst *MI,"
<< " unsigned CPUID) const {\n"		<< " unsigned CPUID) const {\n"
<< " return " << Target << "_MC"		<< " return " << Target << "_MC"
<< "::resolveVariantSchedClassImpl(SchedClass, MI, CPUID);\n"		<< "::resolveVariantSchedClassImpl(SchedClass, MI, CPUID);\n"
<< "} // " << ClassName << "::resolveVariantSchedClass\n";		<< "} // " << ClassName << "::resolveVariantSchedClass\n\n";

		STIPredicateExpander PE(Target);
		PE.setClassPrefix(ClassName);
		PE.setExpandDefinition(true);
		PE.setByRef(false);
		PE.setIndentLevel(0);

		for (const STIPredicateFunction &Fn : SchedModels.getSTIPredicates())
		PE.expandSTIPredicate(OS, Fn);
}		}

void SubtargetEmitter::EmitHwModeCheck(const std::string &ClassName,		void SubtargetEmitter::EmitHwModeCheck(const std::string &ClassName,
raw_ostream &OS) {		raw_ostream &OS) {
const CodeGenHwModes &CGH = TGT.getHwModes();		const CodeGenHwModes &CGH = TGT.getHwModes();
assert(CGH.getNumModeIds() > 0);		assert(CGH.getNumModeIds() > 0);
if (CGH.getNumModeIds() == 1)		if (CGH.getNumModeIds() == 1)
return;		return;
[...]	OS << " " << Target << "GenMCSubtargetInfo(const Triple &TT, \n"
<< " unsigned resolveVariantSchedClass(unsigned SchedClass,\n"		<< " unsigned resolveVariantSchedClass(unsigned SchedClass,\n"
<< " const MCInst *MI, unsigned CPUID) const override {\n"		<< " const MCInst *MI, unsigned CPUID) const override {\n"
<< " return " << Target << "_MC"		<< " return " << Target << "_MC"
<< "::resolveVariantSchedClassImpl(SchedClass, MI, CPUID); \n";		<< "::resolveVariantSchedClassImpl(SchedClass, MI, CPUID); \n";
OS << " }\n";		OS << " }\n";
OS << "};\n";		OS << "};\n";
}		}

		void SubtargetEmitter::EmitMCInstrAnalysisPredicateFunctions(raw_ostream &OS) {
		OS << "\n#ifdef GET_STIPREDICATE_DECLS_FOR_MC_ANALYSIS\n";
		OS << "#undef GET_STIPREDICATE_DECLS_FOR_MC_ANALYSIS\n\n";

		STIPredicateExpander PE(Target);
		PE.setExpandForMC(true);
		PE.setByRef(true);
		for (const STIPredicateFunction &Fn : SchedModels.getSTIPredicates())
		PE.expandSTIPredicate(OS, Fn);

		OS << "#endif // GET_STIPREDICATE_DECLS_FOR_MC_ANALYSIS\n\n";

		OS << "\n#ifdef GET_STIPREDICATE_DEFS_FOR_MC_ANALYSIS\n";
		OS << "#undef GET_STIPREDICATE_DEFS_FOR_MC_ANALYSIS\n\n";

		std::string ClassPrefix = Target + "MCInstrAnalysis";
		PE.setExpandDefinition(true);
		PE.setClassPrefix(ClassPrefix);
		PE.setIndentLevel(0);
		for (const STIPredicateFunction &Fn : SchedModels.getSTIPredicates())
		PE.expandSTIPredicate(OS, Fn);

		OS << "#endif // GET_STIPREDICATE_DEFS_FOR_MC_ANALYSIS\n\n";
		}

//		//
// SubtargetEmitter::run - Main subtarget enumeration emitter.		// SubtargetEmitter::run - Main subtarget enumeration emitter.
//		//
void SubtargetEmitter::run(raw_ostream &OS) {		void SubtargetEmitter::run(raw_ostream &OS) {
emitSourceFileHeader("Subtarget Enumeration Source Fragment", OS);		emitSourceFileHeader("Subtarget Enumeration Source Fragment", OS);

OS << "\n#ifdef GET_SUBTARGETINFO_ENUM\n";		OS << "\n#ifdef GET_SUBTARGETINFO_ENUM\n";
OS << "#undef GET_SUBTARGETINFO_ENUM\n\n";		OS << "#undef GET_SUBTARGETINFO_ENUM\n\n";
[...]	OS << "struct " << ClassName << " : public TargetSubtargetInfo {\n"
<< " const MachineInstr *DefMI,"		<< " const MachineInstr *DefMI,"
<< " const TargetSchedModel *SchedModel) const override;\n"		<< " const TargetSchedModel *SchedModel) const override;\n"
<< " unsigned resolveVariantSchedClass(unsigned SchedClass,"		<< " unsigned resolveVariantSchedClass(unsigned SchedClass,"
<< " const MCInst *MI, unsigned CPUID) const override;\n"		<< " const MCInst *MI, unsigned CPUID) const override;\n"
<< " DFAPacketizer createDFAPacketizer(const InstrItineraryData IID)"		<< " DFAPacketizer createDFAPacketizer(const InstrItineraryData IID)"
<< " const;\n";		<< " const;\n";
if (TGT.getHwModes().getNumModeIds() > 1)		if (TGT.getHwModes().getNumModeIds() > 1)
OS << " unsigned getHwMode() const override;\n";		OS << " unsigned getHwMode() const override;\n";

		STIPredicateExpander PE(Target);
		PE.setByRef(false);
		for (const STIPredicateFunction &Fn : SchedModels.getSTIPredicates())
		PE.expandSTIPredicate(OS, Fn);

OS << "};\n"		OS << "};\n"
<< "} // end namespace llvm\n\n";		<< "} // end namespace llvm\n\n";

OS << "#endif // GET_SUBTARGETINFO_HEADER\n\n";		OS << "#endif // GET_SUBTARGETINFO_HEADER\n\n";

OS << "\n#ifdef GET_SUBTARGETINFO_CTOR\n";		OS << "\n#ifdef GET_SUBTARGETINFO_CTOR\n";
OS << "#undef GET_SUBTARGETINFO_CTOR\n\n";		OS << "#undef GET_SUBTARGETINFO_CTOR\n\n";

[...]	#endif
OS << ") {}\n\n";		OS << ") {}\n\n";

EmitSchedModelHelpers(ClassName, OS);		EmitSchedModelHelpers(ClassName, OS);
EmitHwModeCheck(ClassName, OS);		EmitHwModeCheck(ClassName, OS);

OS << "} // end namespace llvm\n\n";		OS << "} // end namespace llvm\n\n";

OS << "#endif // GET_SUBTARGETINFO_CTOR\n\n";		OS << "#endif // GET_SUBTARGETINFO_CTOR\n\n";

		EmitMCInstrAnalysisPredicateFunctions(OS);
}		}

namespace llvm {		namespace llvm {

void EmitSubtarget(RecordKeeper &RK, raw_ostream &OS) {		void EmitSubtarget(RecordKeeper &RK, raw_ostream &OS) {
CodeGenTarget CGTarget(RK);		CodeGenTarget CGTarget(RK);
SubtargetEmitter(RK, CGTarget).run(OS);		SubtargetEmitter(RK, CGTarget).run(OS);
}		}

} // end namespace llvm		} // end namespace llvm