This is an archive of the discontinued LLVM Phabricator instance.

[X86] Move HasNOPL to a subtarget feature bit. Plumb MCSubtargetInfo through the MCAsmBackend constructor
ClosedPublic

Authored by craig.topper on Jan 3 2018, 5:44 PM.

Download Raw Diff

Details

Reviewers

RKSimon
spatel
pcordes
andreadb

Commits

rG505f38a059de: [X86] Move HasNOPL to a subtarget feature bit. Plumb MCSubtargetInfo through…
rL322227: [X86] Move HasNOPL to a subtarget feature bit. Plumb MCSubtargetInfo through…

Summary

After D41349, we can no get a MCSubtargetInfo into the MCAsmBackend constructor. This allows us to get NOPL from a subtarget feature rather than a CPU name blacklist.

Diff Detail

Event Timeline

craig.topper created this revision.Jan 3 2018, 5:44 PM

Nice, I was half way through doing this myself....

PR22965 needs addressing as well - instead of driving off the silvermont feature bit - do we need a 'PreferNop15' feature bit for AMD (fam15/fam16/ryzen) and possibly recent Intel (???) targets and otherwise use 11 byte NOPs?

Limit NOP length to 10 on everything but AMD fam 16h and 17h. @RKSimon you mentioned fam 15h in your previous update, but the optimization manual I found for fam 15h only talks about 11 byte.

We could maybe go to 11 instead of 10. But Intel documentation only lists out to 9 bytes. And the current 10-15 byte sequences we do are different than the 10-15 byte sequences in AMD's manual. We use a CS prefix that's not mentioned in the optimization manuals. The CS prefix does appear in the binutils 10 byte sequence. I believe binutils stops at 10 bytes for all CPUs.

In D41721#967576, @craig.topper wrote:

Limit NOP length to 10 on everything but AMD fam 16h and 17h. @RKSimon you mentioned fam 15h in your previous update, but the optimization manual I found for fam 15h only talks about 11 byte.

Yes, I think you're right - the fam15 sog says 3 prefix bytes are the max to avoid a decode stall, so 11 bytes should be the limit, but going to just 10 bytes is probably OK. Sorry I was going off a comment in PR22965 without rechecking.

We could maybe go to 11 instead of 10. But Intel documentation only lists out to 9 bytes. And the current 10-15 byte sequences we do are different than the 10-15 byte sequences in AMD's manual. We use a CS prefix that's not mentioned in the optimization manuals. The CS prefix does appear in the binutils 10 byte sequence. I believe binutils stops at 10 bytes for all CPUs.

If we're limiting the 11-15 byte options to the AMD targets, I think it'd be better to follow their recommendations (which luckily are the same in both the 16h and 17h sogs).

I feel guilty for asking for this change as it looks like the max nop length change should be part of a separate patch as it might have performance effects that need reviewing.

@andreadb @pcordes do you have any suggestions?

Should I go back to my first patch that just does the plumbing for now?

In D41721#969305, @craig.topper wrote:

Should I go back to my first patch that just does the plumbing for now?

Yes please, and split off the NOP length changes to a seperate patch to allow for regression tests. That way I think this patch should have no effect on codegen and is just a cleanup?

Revert back to previous version of the patch which should be NFC

LGTM - thanks.

This revision is now accepted and ready to land.Jan 10 2018, 12:34 PM

Closed by commit rL322227: [X86] Move HasNOPL to a subtarget feature bit. Plumb MCSubtargetInfo through… (authored by ctopper). · Explain WhyJan 10 2018, 2:08 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

lib/

Target/

X86/

MCTargetDesc/

81 lines

49 lines

5 lines

1 line

Diff 128589

lib/Target/X86/MCTargetDesc/X86AsmBackend.cpp

Show First 20 Lines • Show All 61 Lines • ▼ Show 20 Lines
class X86ELFObjectWriter : public MCELFObjectTargetWriter {		class X86ELFObjectWriter : public MCELFObjectTargetWriter {
public:		public:
X86ELFObjectWriter(bool is64Bit, uint8_t OSABI, uint16_t EMachine,		X86ELFObjectWriter(bool is64Bit, uint8_t OSABI, uint16_t EMachine,
bool HasRelocationAddend, bool foobar)		bool HasRelocationAddend, bool foobar)
: MCELFObjectTargetWriter(is64Bit, OSABI, EMachine, HasRelocationAddend) {}		: MCELFObjectTargetWriter(is64Bit, OSABI, EMachine, HasRelocationAddend) {}
};		};

class X86AsmBackend : public MCAsmBackend {		class X86AsmBackend : public MCAsmBackend {
const StringRef CPU;		const MCSubtargetInfo &STI;
bool HasNopl;		public:
const uint64_t MaxNopLength;		X86AsmBackend(const Target &T, const MCSubtargetInfo &STI)
public:		: MCAsmBackend(), STI(STI) {}
X86AsmBackend(const Target &T, StringRef CPU)
: MCAsmBackend(), CPU(CPU),
MaxNopLength((CPU == "slm" \|\| CPU == "silvermont") ? 7 : 15) {
HasNopl = CPU != "generic" && CPU != "i386" && CPU != "i486" &&
CPU != "i586" && CPU != "pentium" && CPU != "pentium-mmx" &&
CPU != "i686" && CPU != "k6" && CPU != "k6-2" && CPU != "k6-3" &&
CPU != "geode" && CPU != "winchip-c6" && CPU != "winchip2" &&
CPU != "c3" && CPU != "c3-2" && CPU != "lakemont" && CPU != "";
}

unsigned getNumFixupKinds() const override {		unsigned getNumFixupKinds() const override {
return X86::NumTargetFixupKinds;		return X86::NumTargetFixupKinds;
}		}

const MCFixupKindInfo &getFixupKindInfo(MCFixupKind Kind) const override {		const MCFixupKindInfo &getFixupKindInfo(MCFixupKind Kind) const override {
const static MCFixupKindInfo Infos[X86::NumTargetFixupKinds] = {		const static MCFixupKindInfo Infos[X86::NumTargetFixupKinds] = {
{"reloc_riprel_4byte", 0, 32, MCFixupKindInfo::FKF_IsPCRel},		{"reloc_riprel_4byte", 0, 32, MCFixupKindInfo::FKF_IsPCRel},
▲ Show 20 Lines • Show All 250 Lines • ▼ Show 20 Lines	static const uint8_t Nops[10][10] = {
{0x0f, 0x1f, 0x84, 0x00, 0x00, 0x00, 0x00, 0x00},		{0x0f, 0x1f, 0x84, 0x00, 0x00, 0x00, 0x00, 0x00},
// nopw 0L(%[re]ax,%[re]ax,1)		// nopw 0L(%[re]ax,%[re]ax,1)
{0x66, 0x0f, 0x1f, 0x84, 0x00, 0x00, 0x00, 0x00, 0x00},		{0x66, 0x0f, 0x1f, 0x84, 0x00, 0x00, 0x00, 0x00, 0x00},
// nopw %cs:0L(%[re]ax,%[re]ax,1)		// nopw %cs:0L(%[re]ax,%[re]ax,1)
{0x66, 0x2e, 0x0f, 0x1f, 0x84, 0x00, 0x00, 0x00, 0x00, 0x00},		{0x66, 0x2e, 0x0f, 0x1f, 0x84, 0x00, 0x00, 0x00, 0x00, 0x00},
};		};

// This CPU doesn't support long nops. If needed add more.		// This CPU doesn't support long nops. If needed add more.
// FIXME: Can we get this from the subtarget somehow?
// FIXME: We could generated something better than plain 0x90.		// FIXME: We could generated something better than plain 0x90.
if (!HasNopl) {		if (!STI.getFeatureBits()[X86::FeatureNOPL]) {
for (uint64_t i = 0; i < Count; ++i)		for (uint64_t i = 0; i < Count; ++i)
OW->write8(0x90);		OW->write8(0x90);
return true;		return true;
}		}

		uint64_t MaxNopLength = STI.getFeatureBits()[X86::ProcIntelSLM] ? 7 : 15;

// 15 is the longest single nop instruction. Emit as many 15-byte nops as		// 15 is the longest single nop instruction. Emit as many 15-byte nops as
// needed, then emit a nop of the remaining length.		// needed, then emit a nop of the remaining length.
do {		do {
const uint8_t ThisNopLength = (uint8_t) std::min(Count, MaxNopLength);		const uint8_t ThisNopLength = (uint8_t) std::min(Count, MaxNopLength);
const uint8_t Prefixes = ThisNopLength <= 10 ? 0 : ThisNopLength - 10;		const uint8_t Prefixes = ThisNopLength <= 10 ? 0 : ThisNopLength - 10;
for (uint8_t i = 0; i < Prefixes; i++)		for (uint8_t i = 0; i < Prefixes; i++)
OW->write8(0x66);		OW->write8(0x66);
const uint8_t Rest = ThisNopLength - Prefixes;		const uint8_t Rest = ThisNopLength - Prefixes;
for (uint8_t i = 0; i < Rest; i++)		for (uint8_t i = 0; i < Rest; i++)
OW->write8(Nops[Rest - 1][i]);		OW->write8(Nops[Rest - 1][i]);
Count -= ThisNopLength;		Count -= ThisNopLength;
} while (Count != 0);		} while (Count != 0);

return true;		return true;
}		}

/* *** */		/* *** */

namespace {		namespace {

class ELFX86AsmBackend : public X86AsmBackend {		class ELFX86AsmBackend : public X86AsmBackend {
public:		public:
uint8_t OSABI;		uint8_t OSABI;
ELFX86AsmBackend(const Target &T, uint8_t OSABI, StringRef CPU)		ELFX86AsmBackend(const Target &T, uint8_t OSABI, const MCSubtargetInfo &STI)
: X86AsmBackend(T, CPU), OSABI(OSABI) {}		: X86AsmBackend(T, STI), OSABI(OSABI) {}
};		};

class ELFX86_32AsmBackend : public ELFX86AsmBackend {		class ELFX86_32AsmBackend : public ELFX86AsmBackend {
public:		public:
ELFX86_32AsmBackend(const Target &T, uint8_t OSABI, StringRef CPU)		ELFX86_32AsmBackend(const Target &T, uint8_t OSABI,
: ELFX86AsmBackend(T, OSABI, CPU) {}		const MCSubtargetInfo &STI)
		: ELFX86AsmBackend(T, OSABI, STI) {}

std::unique_ptr<MCObjectWriter>		std::unique_ptr<MCObjectWriter>
createObjectWriter(raw_pwrite_stream &OS) const override {		createObjectWriter(raw_pwrite_stream &OS) const override {
return createX86ELFObjectWriter(OS, /IsELF64/ false, OSABI, ELF::EM_386);		return createX86ELFObjectWriter(OS, /IsELF64/ false, OSABI, ELF::EM_386);
}		}
};		};

class ELFX86_X32AsmBackend : public ELFX86AsmBackend {		class ELFX86_X32AsmBackend : public ELFX86AsmBackend {
public:		public:
ELFX86_X32AsmBackend(const Target &T, uint8_t OSABI, StringRef CPU)		ELFX86_X32AsmBackend(const Target &T, uint8_t OSABI,
: ELFX86AsmBackend(T, OSABI, CPU) {}		const MCSubtargetInfo &STI)
		: ELFX86AsmBackend(T, OSABI, STI) {}

std::unique_ptr<MCObjectWriter>		std::unique_ptr<MCObjectWriter>
createObjectWriter(raw_pwrite_stream &OS) const override {		createObjectWriter(raw_pwrite_stream &OS) const override {
return createX86ELFObjectWriter(OS, /IsELF64/ false, OSABI,		return createX86ELFObjectWriter(OS, /IsELF64/ false, OSABI,
ELF::EM_X86_64);		ELF::EM_X86_64);
}		}
};		};

class ELFX86_IAMCUAsmBackend : public ELFX86AsmBackend {		class ELFX86_IAMCUAsmBackend : public ELFX86AsmBackend {
public:		public:
ELFX86_IAMCUAsmBackend(const Target &T, uint8_t OSABI, StringRef CPU)		ELFX86_IAMCUAsmBackend(const Target &T, uint8_t OSABI,
: ELFX86AsmBackend(T, OSABI, CPU) {}		const MCSubtargetInfo &STI)
		: ELFX86AsmBackend(T, OSABI, STI) {}

std::unique_ptr<MCObjectWriter>		std::unique_ptr<MCObjectWriter>
createObjectWriter(raw_pwrite_stream &OS) const override {		createObjectWriter(raw_pwrite_stream &OS) const override {
return createX86ELFObjectWriter(OS, /IsELF64/ false, OSABI,		return createX86ELFObjectWriter(OS, /IsELF64/ false, OSABI,
ELF::EM_IAMCU);		ELF::EM_IAMCU);
}		}
};		};

class ELFX86_64AsmBackend : public ELFX86AsmBackend {		class ELFX86_64AsmBackend : public ELFX86AsmBackend {
public:		public:
ELFX86_64AsmBackend(const Target &T, uint8_t OSABI, StringRef CPU)		ELFX86_64AsmBackend(const Target &T, uint8_t OSABI,
: ELFX86AsmBackend(T, OSABI, CPU) {}		const MCSubtargetInfo &STI)
		: ELFX86AsmBackend(T, OSABI, STI) {}

std::unique_ptr<MCObjectWriter>		std::unique_ptr<MCObjectWriter>
createObjectWriter(raw_pwrite_stream &OS) const override {		createObjectWriter(raw_pwrite_stream &OS) const override {
return createX86ELFObjectWriter(OS, /IsELF64/ true, OSABI, ELF::EM_X86_64);		return createX86ELFObjectWriter(OS, /IsELF64/ true, OSABI, ELF::EM_X86_64);
}		}
};		};

class WindowsX86AsmBackend : public X86AsmBackend {		class WindowsX86AsmBackend : public X86AsmBackend {
bool Is64Bit;		bool Is64Bit;

public:		public:
WindowsX86AsmBackend(const Target &T, bool is64Bit, StringRef CPU)		WindowsX86AsmBackend(const Target &T, bool is64Bit,
: X86AsmBackend(T, CPU)		const MCSubtargetInfo &STI)
		: X86AsmBackend(T, STI)
, Is64Bit(is64Bit) {		, Is64Bit(is64Bit) {
}		}

Optional<MCFixupKind> getFixupKind(StringRef Name) const override {		Optional<MCFixupKind> getFixupKind(StringRef Name) const override {
return StringSwitch<Optional<MCFixupKind>>(Name)		return StringSwitch<Optional<MCFixupKind>>(Name)
.Case("dir32", FK_Data_4)		.Case("dir32", FK_Data_4)
.Case("secrel32", FK_SecRel_4)		.Case("secrel32", FK_SecRel_4)
.Case("secidx", FK_SecRel_2)		.Case("secidx", FK_SecRel_2)
▲ Show 20 Lines • Show All 341 Lines • ▼ Show 20 Lines	uint32_t encodeCompactUnwindRegistersWithoutFrame(unsigned RegCount) const {
}		}

assert((permutationEncoding & 0x3FF) == permutationEncoding &&		assert((permutationEncoding & 0x3FF) == permutationEncoding &&
"Invalid compact register encoding!");		"Invalid compact register encoding!");
return permutationEncoding;		return permutationEncoding;
}		}

public:		public:
DarwinX86AsmBackend(const Target &T, const MCRegisterInfo &MRI, StringRef CPU,		DarwinX86AsmBackend(const Target &T, const MCRegisterInfo &MRI,
bool Is64Bit)		const MCSubtargetInfo &STI, bool Is64Bit)
: X86AsmBackend(T, CPU), MRI(MRI), Is64Bit(Is64Bit) {		: X86AsmBackend(T, STI), MRI(MRI), Is64Bit(Is64Bit) {
memset(SavedRegs, 0, sizeof(SavedRegs));		memset(SavedRegs, 0, sizeof(SavedRegs));
OffsetSize = Is64Bit ? 8 : 4;		OffsetSize = Is64Bit ? 8 : 4;
MoveInstrSize = Is64Bit ? 3 : 2;		MoveInstrSize = Is64Bit ? 3 : 2;
StackDivide = Is64Bit ? 8 : 4;		StackDivide = Is64Bit ? 8 : 4;
}		}
};		};

class DarwinX86_32AsmBackend : public DarwinX86AsmBackend {		class DarwinX86_32AsmBackend : public DarwinX86AsmBackend {
public:		public:
DarwinX86_32AsmBackend(const Target &T, const MCRegisterInfo &MRI,		DarwinX86_32AsmBackend(const Target &T, const MCRegisterInfo &MRI,
StringRef CPU)		const MCSubtargetInfo &STI)
: DarwinX86AsmBackend(T, MRI, CPU, false) {}		: DarwinX86AsmBackend(T, MRI, STI, false) {}

std::unique_ptr<MCObjectWriter>		std::unique_ptr<MCObjectWriter>
createObjectWriter(raw_pwrite_stream &OS) const override {		createObjectWriter(raw_pwrite_stream &OS) const override {
return createX86MachObjectWriter(OS, /Is64Bit=/false,		return createX86MachObjectWriter(OS, /Is64Bit=/false,
MachO::CPU_TYPE_I386,		MachO::CPU_TYPE_I386,
MachO::CPU_SUBTYPE_I386_ALL);		MachO::CPU_SUBTYPE_I386_ALL);
}		}

/// \brief Generate the compact unwind encoding for the CFI instructions.		/// \brief Generate the compact unwind encoding for the CFI instructions.
uint32_t generateCompactUnwindEncoding(		uint32_t generateCompactUnwindEncoding(
ArrayRef<MCCFIInstruction> Instrs) const override {		ArrayRef<MCCFIInstruction> Instrs) const override {
return generateCompactUnwindEncodingImpl(Instrs);		return generateCompactUnwindEncodingImpl(Instrs);
}		}
};		};

class DarwinX86_64AsmBackend : public DarwinX86AsmBackend {		class DarwinX86_64AsmBackend : public DarwinX86AsmBackend {
const MachO::CPUSubTypeX86 Subtype;		const MachO::CPUSubTypeX86 Subtype;
public:		public:
DarwinX86_64AsmBackend(const Target &T, const MCRegisterInfo &MRI,		DarwinX86_64AsmBackend(const Target &T, const MCRegisterInfo &MRI,
StringRef CPU, MachO::CPUSubTypeX86 st)		const MCSubtargetInfo &STI, MachO::CPUSubTypeX86 st)
: DarwinX86AsmBackend(T, MRI, CPU, true), Subtype(st) {}		: DarwinX86AsmBackend(T, MRI, STI, true), Subtype(st) {}

std::unique_ptr<MCObjectWriter>		std::unique_ptr<MCObjectWriter>
createObjectWriter(raw_pwrite_stream &OS) const override {		createObjectWriter(raw_pwrite_stream &OS) const override {
return createX86MachObjectWriter(OS, /Is64Bit=/true,		return createX86MachObjectWriter(OS, /Is64Bit=/true,
MachO::CPU_TYPE_X86_64, Subtype);		MachO::CPU_TYPE_X86_64, Subtype);
}		}

/// \brief Generate the compact unwind encoding for the CFI instructions.		/// \brief Generate the compact unwind encoding for the CFI instructions.
uint32_t generateCompactUnwindEncoding(		uint32_t generateCompactUnwindEncoding(
ArrayRef<MCCFIInstruction> Instrs) const override {		ArrayRef<MCCFIInstruction> Instrs) const override {
return generateCompactUnwindEncodingImpl(Instrs);		return generateCompactUnwindEncodingImpl(Instrs);
}		}
};		};

} // end anonymous namespace		} // end anonymous namespace

MCAsmBackend *llvm::createX86_32AsmBackend(const Target &T,		MCAsmBackend *llvm::createX86_32AsmBackend(const Target &T,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
const MCRegisterInfo &MRI,		const MCRegisterInfo &MRI,
const MCTargetOptions &Options) {		const MCTargetOptions &Options) {
const Triple &TheTriple = STI.getTargetTriple();		const Triple &TheTriple = STI.getTargetTriple();
StringRef CPU = STI.getCPU();
if (TheTriple.isOSBinFormatMachO())		if (TheTriple.isOSBinFormatMachO())
return new DarwinX86_32AsmBackend(T, MRI, CPU);		return new DarwinX86_32AsmBackend(T, MRI, STI);

if (TheTriple.isOSWindows() && TheTriple.isOSBinFormatCOFF())		if (TheTriple.isOSWindows() && TheTriple.isOSBinFormatCOFF())
return new WindowsX86AsmBackend(T, false, CPU);		return new WindowsX86AsmBackend(T, false, STI);

uint8_t OSABI = MCELFObjectTargetWriter::getOSABI(TheTriple.getOS());		uint8_t OSABI = MCELFObjectTargetWriter::getOSABI(TheTriple.getOS());

if (TheTriple.isOSIAMCU())		if (TheTriple.isOSIAMCU())
return new ELFX86_IAMCUAsmBackend(T, OSABI, CPU);		return new ELFX86_IAMCUAsmBackend(T, OSABI, STI);

return new ELFX86_32AsmBackend(T, OSABI, CPU);		return new ELFX86_32AsmBackend(T, OSABI, STI);
}		}

MCAsmBackend *llvm::createX86_64AsmBackend(const Target &T,		MCAsmBackend *llvm::createX86_64AsmBackend(const Target &T,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
const MCRegisterInfo &MRI,		const MCRegisterInfo &MRI,
const MCTargetOptions &Options) {		const MCTargetOptions &Options) {
const Triple &TheTriple = STI.getTargetTriple();		const Triple &TheTriple = STI.getTargetTriple();
StringRef CPU = STI.getCPU();
if (TheTriple.isOSBinFormatMachO()) {		if (TheTriple.isOSBinFormatMachO()) {
MachO::CPUSubTypeX86 CS =		MachO::CPUSubTypeX86 CS =
StringSwitch<MachO::CPUSubTypeX86>(TheTriple.getArchName())		StringSwitch<MachO::CPUSubTypeX86>(TheTriple.getArchName())
.Case("x86_64h", MachO::CPU_SUBTYPE_X86_64_H)		.Case("x86_64h", MachO::CPU_SUBTYPE_X86_64_H)
.Default(MachO::CPU_SUBTYPE_X86_64_ALL);		.Default(MachO::CPU_SUBTYPE_X86_64_ALL);
return new DarwinX86_64AsmBackend(T, MRI, CPU, CS);		return new DarwinX86_64AsmBackend(T, MRI, STI, CS);
}		}

if (TheTriple.isOSWindows() && TheTriple.isOSBinFormatCOFF())		if (TheTriple.isOSWindows() && TheTriple.isOSBinFormatCOFF())
return new WindowsX86AsmBackend(T, true, CPU);		return new WindowsX86AsmBackend(T, true, STI);

uint8_t OSABI = MCELFObjectTargetWriter::getOSABI(TheTriple.getOS());		uint8_t OSABI = MCELFObjectTargetWriter::getOSABI(TheTriple.getOS());

if (TheTriple.getEnvironment() == Triple::GNUX32)		if (TheTriple.getEnvironment() == Triple::GNUX32)
return new ELFX86_X32AsmBackend(T, OSABI, CPU);		return new ELFX86_X32AsmBackend(T, OSABI, STI);
return new ELFX86_64AsmBackend(T, OSABI, CPU);		return new ELFX86_64AsmBackend(T, OSABI, STI);
}		}

lib/Target/X86/X86.td

Show All 28 Lines

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// X86 Subtarget features		// X86 Subtarget features
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def FeatureX87 : SubtargetFeature<"x87","HasX87", "true",		def FeatureX87 : SubtargetFeature<"x87","HasX87", "true",
"Enable X87 float instructions">;		"Enable X87 float instructions">;

		def FeatureNOPL : SubtargetFeature<"nopl", "HasNOPL", "true",
		"Enable NOPL instruction">;

def FeatureCMOV : SubtargetFeature<"cmov","HasCMov", "true",		def FeatureCMOV : SubtargetFeature<"cmov","HasCMov", "true",
"Enable conditional move instructions">;		"Enable conditional move instructions">;

def FeaturePOPCNT : SubtargetFeature<"popcnt", "HasPOPCNT", "true",		def FeaturePOPCNT : SubtargetFeature<"popcnt", "HasPOPCNT", "true",
"Support POPCNT instruction">;		"Support POPCNT instruction">;

def FeatureFXSR : SubtargetFeature<"fxsr", "HasFXSR", "true",		def FeatureFXSR : SubtargetFeature<"fxsr", "HasFXSR", "true",
"Support fxsave/fxrestore instructions">;		"Support fxsave/fxrestore instructions">;
▲ Show 20 Lines • Show All 340 Lines • ▼ Show 20 Lines

def : Proc<"generic", [FeatureX87, FeatureSlowUAMem16]>;		def : Proc<"generic", [FeatureX87, FeatureSlowUAMem16]>;
def : Proc<"i386", [FeatureX87, FeatureSlowUAMem16]>;		def : Proc<"i386", [FeatureX87, FeatureSlowUAMem16]>;
def : Proc<"i486", [FeatureX87, FeatureSlowUAMem16]>;		def : Proc<"i486", [FeatureX87, FeatureSlowUAMem16]>;
def : Proc<"i586", [FeatureX87, FeatureSlowUAMem16]>;		def : Proc<"i586", [FeatureX87, FeatureSlowUAMem16]>;
def : Proc<"pentium", [FeatureX87, FeatureSlowUAMem16]>;		def : Proc<"pentium", [FeatureX87, FeatureSlowUAMem16]>;
def : Proc<"pentium-mmx", [FeatureX87, FeatureSlowUAMem16, FeatureMMX]>;		def : Proc<"pentium-mmx", [FeatureX87, FeatureSlowUAMem16, FeatureMMX]>;

foreach P = ["i686", "pentiumpro"] in {		def : Proc<"i686", [FeatureX87, FeatureSlowUAMem16, FeatureCMOV]>;
def : Proc<P, [FeatureX87, FeatureSlowUAMem16, FeatureCMOV]>;		def : Proc<"pentiumpro", [FeatureX87, FeatureSlowUAMem16, FeatureCMOV,
}		FeatureNOPL]>;

def : Proc<"pentium2", [FeatureX87, FeatureSlowUAMem16, FeatureMMX,		def : Proc<"pentium2", [FeatureX87, FeatureSlowUAMem16, FeatureMMX,
FeatureCMOV, FeatureFXSR]>;		FeatureCMOV, FeatureFXSR, FeatureNOPL]>;

foreach P = ["pentium3", "pentium3m"] in {		foreach P = ["pentium3", "pentium3m"] in {
def : Proc<P, [FeatureX87, FeatureSlowUAMem16, FeatureMMX, FeatureSSE1,		def : Proc<P, [FeatureX87, FeatureSlowUAMem16, FeatureMMX, FeatureSSE1,
FeatureFXSR]>;		FeatureFXSR, FeatureNOPL]>;
}		}

// Enable the PostRAScheduler for SSE2 and SSE3 class cpus.		// Enable the PostRAScheduler for SSE2 and SSE3 class cpus.
// The intent is to enable it for pentium4 which is the current default		// The intent is to enable it for pentium4 which is the current default
// processor in a vanilla 32-bit clang compilation when no specific		// processor in a vanilla 32-bit clang compilation when no specific
// architecture is specified. This generally gives a nice performance		// architecture is specified. This generally gives a nice performance
// increase on silvermont, with largely neutral behavior on other		// increase on silvermont, with largely neutral behavior on other
// contemporary large core processors.		// contemporary large core processors.
// pentium-m, pentium4m, prescott and nocona are included as a preventative		// pentium-m, pentium4m, prescott and nocona are included as a preventative
// measure to avoid performance surprises, in case clang's default cpu		// measure to avoid performance surprises, in case clang's default cpu
// changes slightly.		// changes slightly.

def : ProcessorModel<"pentium-m", GenericPostRAModel,		def : ProcessorModel<"pentium-m", GenericPostRAModel,
[FeatureX87, FeatureSlowUAMem16, FeatureMMX,		[FeatureX87, FeatureSlowUAMem16, FeatureMMX,
FeatureSSE2, FeatureFXSR]>;		FeatureSSE2, FeatureFXSR, FeatureNOPL]>;

foreach P = ["pentium4", "pentium4m"] in {		foreach P = ["pentium4", "pentium4m"] in {
def : ProcessorModel<P, GenericPostRAModel,		def : ProcessorModel<P, GenericPostRAModel,
[FeatureX87, FeatureSlowUAMem16, FeatureMMX,		[FeatureX87, FeatureSlowUAMem16, FeatureMMX,
FeatureSSE2, FeatureFXSR]>;		FeatureSSE2, FeatureFXSR, FeatureNOPL]>;
}		}

// Intel Quark.		// Intel Quark.
def : Proc<"lakemont", []>;		def : Proc<"lakemont", []>;

// Intel Core Duo.		// Intel Core Duo.
def : ProcessorModel<"yonah", SandyBridgeModel,		def : ProcessorModel<"yonah", SandyBridgeModel,
[FeatureX87, FeatureSlowUAMem16, FeatureMMX, FeatureSSE3,		[FeatureX87, FeatureSlowUAMem16, FeatureMMX, FeatureSSE3,
FeatureFXSR]>;		FeatureFXSR, FeatureNOPL]>;

// NetBurst.		// NetBurst.
def : ProcessorModel<"prescott", GenericPostRAModel,		def : ProcessorModel<"prescott", GenericPostRAModel,
[FeatureX87, FeatureSlowUAMem16, FeatureMMX, FeatureSSE3,		[FeatureX87, FeatureSlowUAMem16, FeatureMMX, FeatureSSE3,
FeatureFXSR]>;		FeatureFXSR, FeatureNOPL]>;
def : ProcessorModel<"nocona", GenericPostRAModel, [		def : ProcessorModel<"nocona", GenericPostRAModel, [
FeatureX87,		FeatureX87,
FeatureSlowUAMem16,		FeatureSlowUAMem16,
FeatureMMX,		FeatureMMX,
FeatureSSE3,		FeatureSSE3,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B		FeatureCMPXCHG16B
]>;		]>;

// Intel Core 2 Solo/Duo.		// Intel Core 2 Solo/Duo.
def : ProcessorModel<"core2", SandyBridgeModel, [		def : ProcessorModel<"core2", SandyBridgeModel, [
FeatureX87,		FeatureX87,
FeatureSlowUAMem16,		FeatureSlowUAMem16,
FeatureMMX,		FeatureMMX,
FeatureSSSE3,		FeatureSSSE3,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureLAHFSAHF,		FeatureLAHFSAHF,
FeatureMacroFusion		FeatureMacroFusion
]>;		]>;
def : ProcessorModel<"penryn", SandyBridgeModel, [		def : ProcessorModel<"penryn", SandyBridgeModel, [
FeatureX87,		FeatureX87,
FeatureSlowUAMem16,		FeatureSlowUAMem16,
FeatureMMX,		FeatureMMX,
FeatureSSE41,		FeatureSSE41,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureLAHFSAHF,		FeatureLAHFSAHF,
FeatureMacroFusion		FeatureMacroFusion
]>;		]>;

// Atom CPUs.		// Atom CPUs.
class BonnellProc<string Name> : ProcessorModel<Name, AtomModel, [		class BonnellProc<string Name> : ProcessorModel<Name, AtomModel, [
ProcIntelAtom,		ProcIntelAtom,
FeatureX87,		FeatureX87,
FeatureSlowUAMem16,		FeatureSlowUAMem16,
FeatureMMX,		FeatureMMX,
FeatureSSSE3,		FeatureSSSE3,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureMOVBE,		FeatureMOVBE,
FeatureLEAForSP,		FeatureLEAForSP,
FeatureSlowDivide32,		FeatureSlowDivide32,
FeatureSlowDivide64,		FeatureSlowDivide64,
FeatureSlowTwoMemOps,		FeatureSlowTwoMemOps,
FeatureLEAUsesAG,		FeatureLEAUsesAG,
FeaturePadShortFunctions,		FeaturePadShortFunctions,
FeatureLAHFSAHF		FeatureLAHFSAHF
]>;		]>;
def : BonnellProc<"bonnell">;		def : BonnellProc<"bonnell">;
def : BonnellProc<"atom">; // Pin the generic name to the baseline.		def : BonnellProc<"atom">; // Pin the generic name to the baseline.

class SilvermontProc<string Name> : ProcessorModel<Name, SLMModel, [		class SilvermontProc<string Name> : ProcessorModel<Name, SLMModel, [
ProcIntelSLM,		ProcIntelSLM,
FeatureX87,		FeatureX87,
FeatureMMX,		FeatureMMX,
FeatureSSE42,		FeatureSSE42,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureMOVBE,		FeatureMOVBE,
FeaturePOPCNT,		FeaturePOPCNT,
FeaturePCLMUL,		FeaturePCLMUL,
FeatureAES,		FeatureAES,
FeatureSlowDivide64,		FeatureSlowDivide64,
FeatureSlowTwoMemOps,		FeatureSlowTwoMemOps,
FeaturePRFCHW,		FeaturePRFCHW,
FeatureSlowLEA,		FeatureSlowLEA,
FeatureSlowIncDec,		FeatureSlowIncDec,
FeatureSlowPMULLD,		FeatureSlowPMULLD,
FeatureLAHFSAHF		FeatureLAHFSAHF
]>;		]>;
def : SilvermontProc<"silvermont">;		def : SilvermontProc<"silvermont">;
def : SilvermontProc<"slm">; // Legacy alias.		def : SilvermontProc<"slm">; // Legacy alias.

class GoldmontProc<string Name> : ProcessorModel<Name, SLMModel, [		class GoldmontProc<string Name> : ProcessorModel<Name, SLMModel, [
ProcIntelGLM,		ProcIntelGLM,
FeatureX87,		FeatureX87,
FeatureMMX,		FeatureMMX,
FeatureSSE42,		FeatureSSE42,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureMOVBE,		FeatureMOVBE,
FeaturePOPCNT,		FeaturePOPCNT,
FeaturePCLMUL,		FeaturePCLMUL,
FeatureAES,		FeatureAES,
FeaturePRFCHW,		FeaturePRFCHW,
FeatureSlowTwoMemOps,		FeatureSlowTwoMemOps,
FeatureSlowLEA,		FeatureSlowLEA,
Show All 13 Lines
def : GoldmontProc<"goldmont">;		def : GoldmontProc<"goldmont">;

// "Arrandale" along with corei3 and corei5		// "Arrandale" along with corei3 and corei5
class NehalemProc<string Name> : ProcessorModel<Name, SandyBridgeModel, [		class NehalemProc<string Name> : ProcessorModel<Name, SandyBridgeModel, [
FeatureX87,		FeatureX87,
FeatureMMX,		FeatureMMX,
FeatureSSE42,		FeatureSSE42,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeaturePOPCNT,		FeaturePOPCNT,
FeatureLAHFSAHF,		FeatureLAHFSAHF,
FeatureMacroFusion		FeatureMacroFusion
]>;		]>;
def : NehalemProc<"nehalem">;		def : NehalemProc<"nehalem">;
def : NehalemProc<"corei7">;		def : NehalemProc<"corei7">;

// Westmere is a similar machine to nehalem with some additional features.		// Westmere is a similar machine to nehalem with some additional features.
// Westmere is the corei3/i5/i7 path from nehalem to sandybridge		// Westmere is the corei3/i5/i7 path from nehalem to sandybridge
class WestmereProc<string Name> : ProcessorModel<Name, SandyBridgeModel, [		class WestmereProc<string Name> : ProcessorModel<Name, SandyBridgeModel, [
FeatureX87,		FeatureX87,
FeatureMMX,		FeatureMMX,
FeatureSSE42,		FeatureSSE42,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeaturePOPCNT,		FeaturePOPCNT,
FeatureAES,		FeatureAES,
FeaturePCLMUL,		FeaturePCLMUL,
FeatureLAHFSAHF,		FeatureLAHFSAHF,
FeatureMacroFusion		FeatureMacroFusion
]>;		]>;
def : WestmereProc<"westmere">;		def : WestmereProc<"westmere">;
Show All 10 Lines

// SSE is not listed here since llvm treats AVX as a reimplementation of SSE,		// SSE is not listed here since llvm treats AVX as a reimplementation of SSE,
// rather than a superset.		// rather than a superset.
def SNBFeatures : ProcessorFeatures<[], [		def SNBFeatures : ProcessorFeatures<[], [
FeatureX87,		FeatureX87,
FeatureMMX,		FeatureMMX,
FeatureAVX,		FeatureAVX,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeaturePOPCNT,		FeaturePOPCNT,
FeatureAES,		FeatureAES,
FeatureSlowDivide64,		FeatureSlowDivide64,
FeaturePCLMUL,		FeaturePCLMUL,
FeatureXSAVE,		FeatureXSAVE,
FeatureXSAVEOPT,		FeatureXSAVEOPT,
FeatureLAHFSAHF,		FeatureLAHFSAHF,
▲ Show 20 Lines • Show All 157 Lines • ▼ Show 20 Lines

// AMD CPUs.		// AMD CPUs.

def : Proc<"k6", [FeatureX87, FeatureSlowUAMem16, FeatureMMX]>;		def : Proc<"k6", [FeatureX87, FeatureSlowUAMem16, FeatureMMX]>;
def : Proc<"k6-2", [FeatureX87, FeatureSlowUAMem16, Feature3DNow]>;		def : Proc<"k6-2", [FeatureX87, FeatureSlowUAMem16, Feature3DNow]>;
def : Proc<"k6-3", [FeatureX87, FeatureSlowUAMem16, Feature3DNow]>;		def : Proc<"k6-3", [FeatureX87, FeatureSlowUAMem16, Feature3DNow]>;

foreach P = ["athlon", "athlon-tbird"] in {		foreach P = ["athlon", "athlon-tbird"] in {
def : Proc<P, [FeatureX87, FeatureSlowUAMem16, Feature3DNowA, FeatureSlowSHLD]>;		def : Proc<P, [FeatureX87, FeatureSlowUAMem16, Feature3DNowA,
		FeatureNOPL, FeatureSlowSHLD]>;
}		}

foreach P = ["athlon-4", "athlon-xp", "athlon-mp"] in {		foreach P = ["athlon-4", "athlon-xp", "athlon-mp"] in {
def : Proc<P, [FeatureX87, FeatureSlowUAMem16, FeatureSSE1,		def : Proc<P, [FeatureX87, FeatureSlowUAMem16, FeatureSSE1,
Feature3DNowA, FeatureFXSR, FeatureSlowSHLD]>;		Feature3DNowA, FeatureFXSR, FeatureNOPL, FeatureSlowSHLD]>;
}		}

foreach P = ["k8", "opteron", "athlon64", "athlon-fx"] in {		foreach P = ["k8", "opteron", "athlon64", "athlon-fx"] in {
def : Proc<P, [FeatureX87, FeatureSlowUAMem16, FeatureSSE2, Feature3DNowA,		def : Proc<P, [FeatureX87, FeatureSlowUAMem16, FeatureSSE2, Feature3DNowA,
FeatureFXSR, Feature64Bit, FeatureSlowSHLD]>;		FeatureFXSR, FeatureNOPL, Feature64Bit, FeatureSlowSHLD]>;
}		}

foreach P = ["k8-sse3", "opteron-sse3", "athlon64-sse3"] in {		foreach P = ["k8-sse3", "opteron-sse3", "athlon64-sse3"] in {
def : Proc<P, [FeatureX87, FeatureSlowUAMem16, FeatureSSE3, Feature3DNowA,		def : Proc<P, [FeatureX87, FeatureSlowUAMem16, FeatureSSE3, Feature3DNowA,
FeatureFXSR, FeatureCMPXCHG16B, FeatureSlowSHLD]>;		FeatureFXSR, FeatureNOPL, FeatureCMPXCHG16B, FeatureSlowSHLD]>;
}		}

foreach P = ["amdfam10", "barcelona"] in {		foreach P = ["amdfam10", "barcelona"] in {
def : Proc<P, [FeatureX87, FeatureSSE4A, Feature3DNowA, FeatureFXSR,		def : Proc<P, [FeatureX87, FeatureSSE4A, Feature3DNowA, FeatureFXSR,
FeatureCMPXCHG16B, FeatureLZCNT, FeaturePOPCNT,		FeatureNOPL, FeatureCMPXCHG16B, FeatureLZCNT, FeaturePOPCNT,
FeatureSlowSHLD, FeatureLAHFSAHF]>;		FeatureSlowSHLD, FeatureLAHFSAHF]>;
}		}

// Bobcat		// Bobcat
def : Proc<"btver1", [		def : Proc<"btver1", [
FeatureX87,		FeatureX87,
FeatureMMX,		FeatureMMX,
FeatureSSSE3,		FeatureSSSE3,
FeatureSSE4A,		FeatureSSE4A,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeaturePRFCHW,		FeaturePRFCHW,
FeatureLZCNT,		FeatureLZCNT,
FeaturePOPCNT,		FeaturePOPCNT,
FeatureSlowSHLD,		FeatureSlowSHLD,
FeatureLAHFSAHF		FeatureLAHFSAHF
]>;		]>;

// Jaguar		// Jaguar
def : ProcessorModel<"btver2", BtVer2Model, [		def : ProcessorModel<"btver2", BtVer2Model, [
FeatureX87,		FeatureX87,
FeatureMMX,		FeatureMMX,
FeatureAVX,		FeatureAVX,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureSSE4A,		FeatureSSE4A,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeaturePRFCHW,		FeaturePRFCHW,
FeatureAES,		FeatureAES,
FeaturePCLMUL,		FeaturePCLMUL,
FeatureBMI,		FeatureBMI,
FeatureF16C,		FeatureF16C,
FeatureMOVBE,		FeatureMOVBE,
Show All 14 Lines	def : Proc<"bdver1", [
FeatureFMA4,		FeatureFMA4,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureAES,		FeatureAES,
FeaturePRFCHW,		FeaturePRFCHW,
FeaturePCLMUL,		FeaturePCLMUL,
FeatureMMX,		FeatureMMX,
FeatureAVX,		FeatureAVX,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureSSE4A,		FeatureSSE4A,
FeatureLZCNT,		FeatureLZCNT,
FeaturePOPCNT,		FeaturePOPCNT,
FeatureXSAVE,		FeatureXSAVE,
FeatureLWP,		FeatureLWP,
FeatureSlowSHLD,		FeatureSlowSHLD,
FeatureLAHFSAHF,		FeatureLAHFSAHF,
FeatureMacroFusion		FeatureMacroFusion
]>;		]>;
// Piledriver		// Piledriver
def : Proc<"bdver2", [		def : Proc<"bdver2", [
FeatureX87,		FeatureX87,
FeatureXOP,		FeatureXOP,
FeatureFMA4,		FeatureFMA4,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureAES,		FeatureAES,
FeaturePRFCHW,		FeaturePRFCHW,
FeaturePCLMUL,		FeaturePCLMUL,
FeatureMMX,		FeatureMMX,
FeatureAVX,		FeatureAVX,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureSSE4A,		FeatureSSE4A,
FeatureF16C,		FeatureF16C,
FeatureLZCNT,		FeatureLZCNT,
FeaturePOPCNT,		FeaturePOPCNT,
FeatureXSAVE,		FeatureXSAVE,
FeatureBMI,		FeatureBMI,
FeatureTBM,		FeatureTBM,
FeatureLWP,		FeatureLWP,
Show All 10 Lines	def : Proc<"bdver3", [
FeatureFMA4,		FeatureFMA4,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureAES,		FeatureAES,
FeaturePRFCHW,		FeaturePRFCHW,
FeaturePCLMUL,		FeaturePCLMUL,
FeatureMMX,		FeatureMMX,
FeatureAVX,		FeatureAVX,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureSSE4A,		FeatureSSE4A,
FeatureF16C,		FeatureF16C,
FeatureLZCNT,		FeatureLZCNT,
FeaturePOPCNT,		FeaturePOPCNT,
FeatureXSAVE,		FeatureXSAVE,
FeatureBMI,		FeatureBMI,
FeatureTBM,		FeatureTBM,
FeatureLWP,		FeatureLWP,
FeatureFMA,		FeatureFMA,
FeatureXSAVEOPT,		FeatureXSAVEOPT,
FeatureSlowSHLD,		FeatureSlowSHLD,
FeatureFSGSBase,		FeatureFSGSBase,
FeatureLAHFSAHF,		FeatureLAHFSAHF,
FeatureMacroFusion		FeatureMacroFusion
]>;		]>;

// Excavator		// Excavator
def : Proc<"bdver4", [		def : Proc<"bdver4", [
FeatureX87,		FeatureX87,
FeatureMMX,		FeatureMMX,
FeatureAVX2,		FeatureAVX2,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureXOP,		FeatureXOP,
FeatureFMA4,		FeatureFMA4,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureAES,		FeatureAES,
FeaturePRFCHW,		FeaturePRFCHW,
FeaturePCLMUL,		FeaturePCLMUL,
FeatureF16C,		FeatureF16C,
FeatureLZCNT,		FeatureLZCNT,
Show All 21 Lines	def: ProcessorModel<"znver1", Znver1Model, [
FeatureBMI2,		FeatureBMI2,
FeatureCLFLUSHOPT,		FeatureCLFLUSHOPT,
FeatureCLZERO,		FeatureCLZERO,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureF16C,		FeatureF16C,
FeatureFMA,		FeatureFMA,
FeatureFSGSBase,		FeatureFSGSBase,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
FeatureFastLZCNT,		FeatureFastLZCNT,
FeatureLAHFSAHF,		FeatureLAHFSAHF,
FeatureLZCNT,		FeatureLZCNT,
FeatureMacroFusion,		FeatureMacroFusion,
FeatureMMX,		FeatureMMX,
FeatureMOVBE,		FeatureMOVBE,
FeatureMWAITX,		FeatureMWAITX,
FeaturePCLMUL,		FeaturePCLMUL,
Show All 28 Lines
// covers a huge swath of x86 processors. If there are specific scheduling		// covers a huge swath of x86 processors. If there are specific scheduling
// knobs which need to be tuned differently for AMD chips, we might consider		// knobs which need to be tuned differently for AMD chips, we might consider
// forming a common base for them.		// forming a common base for them.
def : ProcessorModel<"x86-64", SandyBridgeModel, [		def : ProcessorModel<"x86-64", SandyBridgeModel, [
FeatureX87,		FeatureX87,
FeatureMMX,		FeatureMMX,
FeatureSSE2,		FeatureSSE2,
FeatureFXSR,		FeatureFXSR,
		FeatureNOPL,
Feature64Bit,		Feature64Bit,
FeatureSlow3OpsLEA,		FeatureSlow3OpsLEA,
FeatureSlowIncDec,		FeatureSlowIncDec,
FeatureMacroFusion		FeatureMacroFusion
]>;		]>;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Calling Conventions		// Calling Conventions
▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

lib/Target/X86/X86Subtarget.h

Show First 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	protected:
X86SSEEnum X86SSELevel;		X86SSEEnum X86SSELevel;

/// MMX, 3DNow, 3DNow Athlon, or none supported.		/// MMX, 3DNow, 3DNow Athlon, or none supported.
X863DNowEnum X863DNowLevel;		X863DNowEnum X863DNowLevel;

/// True if the processor supports X87 instructions.		/// True if the processor supports X87 instructions.
bool HasX87;		bool HasX87;

		/// True if this processor has NOPL instruction
		/// (generally pentium pro+).
		bool HasNOPL;

/// True if this processor has conditional move instructions		/// True if this processor has conditional move instructions
/// (generally pentium pro+).		/// (generally pentium pro+).
bool HasCMov;		bool HasCMov;

/// True if the processor supports X86-64 instructions.		/// True if the processor supports X86-64 instructions.
bool HasX86_64;		bool HasX86_64;

/// True if the processor supports POPCNT.		/// True if the processor supports POPCNT.
▲ Show 20 Lines • Show All 361 Lines • ▼ Show 20 Lines	bool isTarget64BitLP64() const {
return In64BitMode && (TargetTriple.getEnvironment() != Triple::GNUX32 &&		return In64BitMode && (TargetTriple.getEnvironment() != Triple::GNUX32 &&
!TargetTriple.isOSNaCl());		!TargetTriple.isOSNaCl());
}		}

PICStyles::Style getPICStyle() const { return PICStyle; }		PICStyles::Style getPICStyle() const { return PICStyle; }
void setPICStyle(PICStyles::Style Style) { PICStyle = Style; }		void setPICStyle(PICStyles::Style Style) { PICStyle = Style; }

bool hasX87() const { return HasX87; }		bool hasX87() const { return HasX87; }
		bool hasNOPL() const { return HasNOPL; }
bool hasCMov() const { return HasCMov; }		bool hasCMov() const { return HasCMov; }
bool hasSSE1() const { return X86SSELevel >= SSE1; }		bool hasSSE1() const { return X86SSELevel >= SSE1; }
bool hasSSE2() const { return X86SSELevel >= SSE2; }		bool hasSSE2() const { return X86SSELevel >= SSE2; }
bool hasSSE3() const { return X86SSELevel >= SSE3; }		bool hasSSE3() const { return X86SSELevel >= SSE3; }
bool hasSSSE3() const { return X86SSELevel >= SSSE3; }		bool hasSSSE3() const { return X86SSELevel >= SSSE3; }
bool hasSSE41() const { return X86SSELevel >= SSE41; }		bool hasSSE41() const { return X86SSELevel >= SSE41; }
bool hasSSE42() const { return X86SSELevel >= SSE42; }		bool hasSSE42() const { return X86SSELevel >= SSE42; }
bool hasAVX() const { return X86SSELevel >= AVX; }		bool hasAVX() const { return X86SSELevel >= AVX; }
▲ Show 20 Lines • Show All 242 Lines • Show Last 20 Lines

lib/Target/X86/X86Subtarget.cpp

Show First 20 Lines • Show All 254 Lines • ▼ Show 20 Lines	void X86Subtarget::initSubtargetFeatures(StringRef CPU, StringRef FS) {
if (hasAVX512())		if (hasAVX512())
ScatterOverhead = 2;		ScatterOverhead = 2;
}		}

void X86Subtarget::initializeEnvironment() {		void X86Subtarget::initializeEnvironment() {
X86SSELevel = NoSSE;		X86SSELevel = NoSSE;
X863DNowLevel = NoThreeDNow;		X863DNowLevel = NoThreeDNow;
HasX87 = false;		HasX87 = false;
		HasNOPL = false;
HasCMov = false;		HasCMov = false;
HasX86_64 = false;		HasX86_64 = false;
HasPOPCNT = false;		HasPOPCNT = false;
HasSSE4A = false;		HasSSE4A = false;
HasAES = false;		HasAES = false;
HasVAES = false;		HasVAES = false;
HasFXSR = false;		HasFXSR = false;
HasXSAVE = false;		HasXSAVE = false;
▲ Show 20 Lines • Show All 138 Lines • Show Last 20 Lines