This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/
-
Basic/Targets/
-
Targets/
-
X86.h
-
X86.cpp
-
CodeGen/
1
CodeGenModule.cpp
-
test/CodeGen/
-
CodeGen/
-
attr-cpuspecific-avx-abi.c
8/10
attr-cpuspecific.c
-
llvm/
-
include/llvm/TargetParser/
-
llvm/
-
TargetParser/
-
X86TargetParser.h
-
X86TargetParser.def
-
lib/
-
Target/X86/
-
X86/
-
X86.td
-
TargetParser/
8/10
X86TargetParser.cpp
-
test/CodeGen/X86/
-
CodeGen/
-
X86/
5/6
cpus-intel.ll

Differential D151696

[X86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchMangling
ClosedPublic

Authored by FreddyYe on May 30 2023, 1:43 AM.

Download Raw Diff

Details

Reviewers

pengfei
LuoYuanke
RKSimon
erichkeane
craig.topper
skan

Commits

rG7717c0071d7c: [X86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchMangling

Summary

This refactor patch means to remove CPU_SPECIFIC* MACROs in X86TargetParser.def
and move those information into ProcInfo of X86TargetParser.cpp. Since these
two files both maintain a table with redundant info such as cpuname and its
features supported. CPU_SPECIFIC* MACROs define some different information. This
patch dealt with them in these ways when moving:
1.mangling
This is now moved to Mangling in ProcInfo and directly initialized at array of
Processors. CPUs don't support cpu_dispatch/specific are assigned '\0' as
mangling.
2.CPU alias
The alias cpu will also be initialized in array of Processors, its attributes
will be same as its alias target cpu. Same feature list, same mangling.
3.TUNE_NAME
Before my change, some cpu names support cpu_dispatch/specific are not
supported in X86.td, which means optimizer/backend doesn't recognize them. So
they use a different TUNE_NAME to generate in IR. In this patch, I added these
missing cpu support at X86.td by utilizing existing Features and XXXTunings, so
that each cpu name can directly use its own name as TUNE_NAME to be supported
by optimizer/backend.
4.Feature list
The feature list of one CPU maintained in X86TargetParser.def is not same as
the one in X86TargetParser.cpp. It only maintains part of features of one CPU
(features defined by X86_FEATURE_COMPAT). While X86TargetParser.cpp maintains
a complete one. This patch abandons the feature list maintained by CPU_SPECIFIC*
MACROs because assigning a CPU with a complete one doesn't affect the
functionality of cpu_dispatch/specific.
Except these four info, since some of CPUs supported by cpu_dispatch/specific
doesn's support clang options like -march, -mtune before, this patch also kept
this behavior still by adding another member OnlyForCPUDispatchSpecific in
ProcInfo.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

FreddyYe created this revision.May 30 2023, 1:43 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 30 2023, 1:43 AM

Herald added subscribers: pengfei, hiraditya. · View Herald Transcript

FreddyYe requested review of this revision.May 30 2023, 1:43 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptMay 30 2023, 1:43 AM

Herald added subscribers: llvm-commits, cfe-commits. · View Herald Transcript

FreddyYe retitled this revision from Remove CPU_SPECIFIC* MACROs and use unified getManglingForCPU to [WIP] Remove CPU_SPECIFIC* MACROs and use unified getManglingForCPU.May 30 2023, 1:43 AM

Harbormaster completed remote builds in B235238: Diff 526546.May 30 2023, 2:53 AM

FreddyYe mentioned this in D151537: [NFC] Update cpu_specific test to use a newer CPU.Jun 13 2023, 5:45 PM

Update to a more conservative change.

FreddyYe retitled this revision from [WIP] Remove CPU_SPECIFIC* MACROs and use unified getManglingForCPU to [x86] Remove CPU_SPECIFIC* MACROs and add getManglingForCPU.Jun 14 2023, 7:50 PM

Harbormaster completed remote builds in B239021: Diff 531596.Jun 14 2023, 8:59 PM

misc refine.

This patch means to remove CPU_SPECIFIC* MACROs in X86TargetParser.def and move that part of functionality into X86TargetParser.cpp.
Since these two files both maintain a table with cpuname, features of this cpu supported, ... This move can reduce the codes.

CPU_SPECIFIC* has some different and specific information maintained. This patch dealt with them in these ways when moving:

mangling This is now moved to Mangling in ProcInfo and directly initialized at array of Processors. CPUs don't support cpu_dispatch/specific are assigned '\0' as mangling. This patch also supports some of new intel cpus for cpu_dispatch/specific feature.
alias relationship The alias cpu will also be initialized in array of Processors, its attributes will be same as its alias target cpu. Same feature list, same mangling.
TUNE_NAME Before my change, some cpu names support cpu_dispatch/specific are not supported in X86.td, which means optimizer/backend doesn't recognize them. So they use a different TUNE_NAME to generate in IR. In this patch, I added these missing cpu support at X86.td by utilize existing Features and XXXTunings. So that each cpu name can directly use its own name as TUNE_NAME to be supported by optimizer/backend.
Feature list The feature list of one CPU maintained in X86TargetParser.def is not same as the one in X86TargetParser.cpp. It only maintains part of features of one CPU(Features defined by X86_FEATURE_COMPAT). While X86TargetParser.cpp maintains a complete one. This patch abandons the feature list maintained in X86TargetParser.def because assigning a CPU with a complete feature list in X86TargetParser.cpp doesn't affect the functionality of cpu_dispatch/specific. See the implement of llvm::X86::getCpuSupportsMask, it already masked out the features not defined by X86_FEATURE_COMPAT.

Beyond these four information, since some of CPUs supported by cpu_dispatch/specific doesn's support clang options like -march, -mtune before, this patch also kept this behavior still by adding another member OnlyForCPUSpecificDispath in ProcInfo.

FreddyYe added reviewers: pengfei, LuoYuanke, RKSimon, erichkeane, craig.topper, skan.Jun 14 2023, 10:48 PM

FreddyYe edited the summary of this revision. (Show Details)

FreddyYe added a parent revision: D152989: Pre-commit test for D151696..Jun 14 2023, 11:07 PM

Harbormaster completed remote builds in B239032: Diff 531615.Jun 15 2023, 12:11 AM

RKSimon added inline comments.Jun 15 2023, 6:21 AM

llvm/lib/TargetParser/X86TargetParser.cpp
15	(clang-format) - include order
313	Would it be better to move all of this into X86TargetParser.def ?
314	FeaturesPentiumMMX & ~FeaturesPentiumMMX ?
llvm/test/CodeGen/X86/cpus-intel.ll
34	Keeping all the aliased cpus variants RUN together will make maintenance easier (e.g. pentium_mmx)

FreddyYe added inline comments.Jun 15 2023, 7:11 AM

llvm/lib/TargetParser/X86TargetParser.cpp
313	Feels so. My next step plan it to furtherly combine the feature list table in X86.td together.
314	I think I was to initialize a null feature list but failed... Will refine.

Update generic and pentium Features to align with X86.td.
Readjust tests in cpus-intel.ll

Harbormaster completed remote builds in B239301: Diff 531993.Jun 15 2023, 11:07 PM

FreddyYe marked 2 inline comments as done.Jun 15 2023, 11:07 PM

FreddyYe added inline comments.

llvm/test/CodeGen/X86/cpus-intel.ll
34	Updated, not sure if I understood your point. pls review again.

RKSimon added inline comments.Jun 16 2023, 2:46 AM

llvm/test/CodeGen/X86/cpus-intel.ll

8–9

I meant like this:

; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium-mmx 2>&1 | FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_mmx 2>&1 | FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty

The idea is to keep RUN lines that test the equivalent cpus together, so its easier for any future edits to handle them together

Update cpus-intel.ll.

Harbormaster completed remote builds in B239687: Diff 532494.Jun 18 2023, 6:11 PM

FreddyYe marked 2 inline comments as done.Jun 18 2023, 6:28 PM

FreddyYe added inline comments.

llvm/test/CodeGen/X86/cpus-intel.ll
8–9	Woops, I really misunderstood. Updated.

Typo refine: OnlyForCPUSpecificDispath -> OnlyForCPUDispatchSpecific

Harbormaster completed remote builds in B239708: Diff 532517.Jun 18 2023, 8:45 PM

Thanks for @RKSimon 's review, I'd like also to mention that https://reviews.llvm.org/D152989 is supposed to be the base commit of here.

RKSimon added inline comments.Jun 19 2023, 7:58 AM

llvm/lib/TargetParser/X86TargetParser.cpp
110	Do we have a documented list of the mangling values anywhere? The values below look too much like magic numbers tbh.
llvm/test/CodeGen/X86/cpus-intel.ll
90	put this with the goldmonth checks?

FreddyYe added inline comments.Jun 19 2023, 6:03 PM

llvm/lib/TargetParser/X86TargetParser.cpp
110	I think not. I copied them from the old MACROs in X86TargetParser.def. I think the values are just used to do some distinguish from different CPUs. And 'a'-'Z' is about to run out of soon here. We probably need to extend the mangling, may be with another letter as prefix. @erichkeane may have a better answer here.
llvm/test/CodeGen/X86/cpus-intel.ll
90	I'll do the adjust. BTW I'd like also to mention below: Here are the rules I used to add features/tuning model in X86.td for missing cpu names in CPU_SPECIFIC Macros: RULE 1: Infer from the name itself. E.g. pentium_4 -> pentium4, pentium_iii -> pentium3 RULE 2: CPU_SPECIFIC_ALIAS can tell me its old name. Using the old name to continue to infer. RULE 3: Use the feature list in CPU_SPECIFIC to compare and append with existing CPU names: pentium_4_sse3 -> prescott core_2_duo_ssse3 -> core2 core_2_duo_sse4_1 -> penryn atom_sse4_2 -> silvermont core_i7_sse4_2 -> nehalem core_aes_pclmulqdq -> westmere core_5th_gen_avx_tsx -> broadwell There is only one exception: atom_sse4_2_movbe. For now I set it as goldmont features and silvermont tuning model according to its definition: CPU_SPECIFIC("atom_sse4_2_movbe", "silvermont", 'd', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt")

Adjust atom_sse4_2_movbe test.

Harbormaster completed remote builds in B239902: Diff 532772.Jun 19 2023, 6:14 PM

FreddyYe retitled this revision from [x86] Remove CPU_SPECIFIC* MACROs and add getManglingForCPU to [x86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchMangling.Jun 20 2023, 11:20 PM

pengfei added inline comments.Jun 20 2023, 11:49 PM

clang/test/CodeGen/attr-cpuspecific.c
342	Why `cmov` disappearing? This feature is supported by most recent targets. Same below.

FreddyYe added inline comments.Jun 20 2023, 11:57 PM

clang/test/CodeGen/attr-cpuspecific.c
342	Good catch. Seems like frontend codes missed CMOV from the very beginning, while X86.td doesn't have this issue. Will update.

Add missing FeatureCMOV in X8TargetParser.cpp

Harbormaster completed remote builds in B240171: Diff 533155.Jun 21 2023, 12:12 AM

debug typo.

Harbormaster completed remote builds in B240174: Diff 533161.Jun 21 2023, 12:26 AM

Please can you cleanup the summary, as it isn't very easy to understand at the moment. Possibly split into a series of bullet points describing the changes?

FreddyYe edited the summary of this revision. (Show Details)Jun 22 2023, 5:50 AM

FreddyYe edited the summary of this revision. (Show Details)

gentle ping _cpu_dispatch/specific feature owner @erichkeane. Any comments?

FreddyYe mentioned this in D152989: Pre-commit test for D151696..Jun 26 2023, 6:03 AM

I think this is OK, I have a slight fear we're losing a bit of the 'tune' functionality, but it is not impossible that we've never really cared about that. One concern I have is that the list was used for the resolver function, but I don't see any test changes for that? Are we properly filtering out the features list somehow?

clang/lib/CodeGen/CodeGenModule.cpp
2272	So my understanding here is that our intent was that the 'tune' cpu and the 'selected' cpu were not necessarily the same (either not the same name, OR not the same CPU!), right? Is that being lost here?

In D151696#4448573, @erichkeane wrote:

I think this is OK, I have a slight fear we're losing a bit of the 'tune' functionality, but it is not impossible that we've never really cared about that. One concern I have is that the list was used for the resolver function, but I don't see any test changes for that? Are we properly filtering out the features list somehow?

Yes! Now there are no tests yet testing the resolver function influenced by the feature list. I'll add another pre-commit test to show this.

FreddyYe mentioned this in D121410: Have cpu-specific variants set 'tune-cpu' as an optimization hint.Jun 27 2023, 2:30 AM

In D151696#4448680, @FreddyYe wrote:

In D151696#4448573, @erichkeane wrote:

I think this is OK, I have a slight fear we're losing a bit of the 'tune' functionality, but it is not impossible that we've never really cared about that. One concern I have is that the list was used for the resolver function, but I don't see any test changes for that? Are we properly filtering out the features list somehow?

Yes! Now there are no tests yet testing the resolver function influenced by the feature list. I'll add another pre-commit test to show this.

Added checks in https://reviews.llvm.org/D152989. pls help review. About the "-tune-cpu", I think it's ok to use the same name as the one that users specified in _cpu_specific(), after I supported those missing names in X86.td so that optimizer can now recognize them.

FreddyYe mentioned this in rG43baa5986c20: Pre-commit test for D151696..Jun 27 2023, 10:53 PM

Fixed copy errors of some CPU's manglings
Updated resolver function checkers due to more complete feature list changes.
Added more cpu checks in attr-cpuspecific-cpus.c

Notice that these two changes are both expected since a more complete feature list won't influence the _cpu_dispatch/specific multiversion to not work.

clang/test/CodeGen/attr-cpuspecific.c
45–46	This value change is because the feature list of ivybridge described in X86TargetParser.def before missed feature "pclmul".
51–52	This value change is because the feature list of knl described in X86TargetParser.def before missed feature "bmi2" and "aes".

Harbormaster completed remote builds in B241715: Diff 535277.Jun 28 2023, 1:54 AM

gentle ping

I have some concerns for RULE 3, especially core_aes_pclmulqdq -> westmere and atom_sse4_2_movbe -> silvermont.
Sometimes, we have minor feature differences in the same generation targets. I guess that's why we use arch_feature naming like core_2_duo_ssse3. Merging them into the same generation or the next generation might corrup the intention here. But I'm not expert in CPUDispatch, and I don't see any existing tests for them, so I won't block the patch since it's an improvement in general.
Please wait a few days for other reviewers' opinions.

clang/test/CodeGen/attr-cpuspecific.c
51–52	The comment is for TwoVersions?

This revision is now accepted and ready to land.Jun 28 2023, 11:46 PM

It looks to me the failed unit tests might be related to this patch, please take a look.

FreddyYe marked an inline comment as done.Jun 29 2023, 12:48 AM

FreddyYe added inline comments.

clang/test/CodeGen/attr-cpuspecific.c
51–52	Yes. Sorry for wrong point.

In D151696#4458443, @pengfei wrote:

It looks to me the failed unit tests might be related to this patch, please take a look.

This is due to FeatureCMOV adding. Should I split into another review?

In D151696#4458435, @pengfei wrote:

I have some concerns for RULE 3, especially core_aes_pclmulqdq -> westmere and atom_sse4_2_movbe -> silvermont.
Sometimes, we have minor feature differences in the same generation targets. I guess that's why we use arch_feature naming like core_2_duo_ssse3. Merging them into the same generation or the next generation might corrup the intention here. But I'm not expert in CPUDispatch, and I don't see any existing tests for them, so I won't block the patch since it's an improvement in general.
Please wait a few days for other reviewers' opinions.

You are right. For historical reasons, I can't find which product each cpu name string points to actually. From the old feature list in X86TargetParser.def, these three cpu names even share a same feature list:

atom_sse4_2
core_i7_sse4_2
core_aes_pclmulqdq

"core_5th_gen_avx_tsx" is also same as "broadwell". So I have a new proposal for RULE3, which can be considered to be more conservative:

pentium_4_sse3 -> prescott (FeatureSSE3)
    First introduce FeatureSSE3 like prescott
core_2_duo_ssse3 -> core2 (FeatureSSSE3)
    First introduce FeatureSSSE3 like core2
core_2_duo_sse4_1 -> penryn (FeatureSSE4_1)
    First introduce FeatureSSE4_1 like penryn
atom_sse4_2 -> nehalem (FeatureSSE4_2)
    First introduce FeatureSSE4_2 like nehalem
core_i7_sse4_2 -> nehalem (FeatureSSE4_2)
    First introduce FeatureSSE4_2 like nehalem
core_aes_pclmulqdq -> nehalem (FeatureSSE4_2)
    First introduce FeatureSSE4_2 like nehalem
core_5th_gen_avx_tsx -> broadwell 
    Same feature list as broadwell

Meanwhile, the fact above won't affect code changes in X86.td. I'll still define these new cpu names with the TUNE_NAME info in original source.

In D151696#4458751, @FreddyYe wrote:

In D151696#4458443, @pengfei wrote:

It looks to me the failed unit tests might be related to this patch, please take a look.

This is due to FeatureCMOV adding. Should I split into another review?

Yes, it looks like it would be best to split off and commit some of the fixes (cmov / the isa changes causing the attr-cpuspecific.c diffs etc.) first, before this refactor patch.

Yes, it looks like it would be best to split off and commit some of the fixes (cmov / the isa changes causing the attr-cpuspecific.c diffs etc.) first, before this refactor patch.

Done in https://reviews.llvm.org/D154181

Rebase for D154181

changes according to the new RULE3, pls review again.

Harbormaster completed remote builds in B242333: Diff 536118.Jun 30 2023, 12:09 AM

RKSimon added inline comments.Jun 30 2023, 1:22 AM

clang/test/CodeGen/attr-cpuspecific.c
45–46	Pull out an ivybridge fix into its own patch?

FreddyYe added inline comments.Jun 30 2023, 1:29 AM

clang/test/CodeGen/attr-cpuspecific.c
45–46	Emmm. Seems like a good idea. How about I change these two CPU's old feature list only? Since the successors of them will also have this issue, but no tests influenced.

RKSimon added inline comments.Jun 30 2023, 1:33 AM

clang/test/CodeGen/attr-cpuspecific.c
45–46	Sure, a single patch with multiple cpus' fixups is fine

FreddyYe mentioned this in D154209: [X86] Add missing features for ivybridge, sandybridge and knl in X86TargetParser.def..Jun 30 2023, 5:59 AM

FreddyYe marked an inline comment as done.Jun 30 2023, 6:00 AM

FreddyYe added inline comments.

clang/test/CodeGen/attr-cpuspecific.c
45–46	Done in https://reviews.llvm.org/D154209

FreddyYe mentioned this in rGb026c9eb1051: [X86] Add missing features for ivybridge, sandybridge and knl in….Jul 2 2023, 5:13 PM

Rebase, especially for D151696

Harbormaster completed remote builds in B242721: Diff 536645.Jul 2 2023, 7:48 PM

gentle ping. If no objections, I'll merge this tomorrow.

RKSimon added inline comments.Jul 3 2023, 9:08 AM

llvm/lib/TargetParser/X86TargetParser.cpp
378	I'm still not clear on what determines the mangling mode and cpu dispatch flag for cpu targets are supposedly the same? For example, none of these ivybridge equivalent configs have the same values.

FreddyYe marked an inline comment as done.Jul 3 2023, 7:29 PM

FreddyYe added inline comments.

clang/test/CodeGen/attr-cpuspecific-cpus.c
40 ↗	(On Diff #536645)	In this patch, I additionally supported some intel new CPU's _cpu_specific feature by creating a new mangling or copy some old ones (which means aliasing certain cpu). Maybe I should do this in a following patch?
llvm/lib/TargetParser/X86TargetParser.cpp
378	I assign them by following orders: Copy the mangling from the original CPU_SPEICIFC MACRO. If there's no way to copy, assign to '\0' by default, which means doesn't support __cpu_specific/dispatch feature. If cpu name contain ''-', assign the mangling as '\0', too. Because '-' cannot be correctly identified in _cpu_specific/dispatch(). set OnlyForCPUDispatch flag as `true` if this cpu name was not listed here, which means it doesn't support -march, -mtune and so on. This flag makes this cpu name only support cpu_dispatch/specific feature. E.g. core_3rd_gen_avx, core_4rd_gen_avx., ... And normally, these names are very old. So supporting them with -march=, -mtune= is not easy for now. And notice that new cpu names shouldn't set this flag as `true` since they should both support -march= and cpu_specific/dispatch feature by default.

RKSimon added inline comments.Jul 4 2023, 2:08 AM

clang/test/CodeGen/attr-cpuspecific-cpus.c
40 ↗	(On Diff #536645)	Yes please, that would great.
llvm/lib/TargetParser/X86TargetParser.cpp
378	OK - please can you add that to a comment above line 311 for future reference?

Add comments about Mangling and OnlyForCPUDispatchSpecific,
and remove the supporting more/new CPU names for cpu_specific/dispatch feature.

FreddyYe retitled this revision from [x86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchMangling to [X86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchMangling.Jul 4 2023, 5:45 AM

FreddyYe edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B243016: Diff 537065.Jul 4 2023, 6:33 AM

LGTM - cheers

This revision was landed with ongoing or failed builds.Jul 5 2023, 2:32 AM

Closed by commit rG7717c0071d7c: [X86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchMangling (authored by FreddyYe). · Explain Why

This revision was automatically updated to reflect the committed changes.

FreddyYe added a commit: rG7717c0071d7c: [X86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchMangling.

Thank so much on all your comments/review to make this happen. Cheers~!

Revision Contents

Path

Size

clang/

lib/

Basic/

Targets/

X86.h

2 lines

X86.cpp

36 lines

CodeGen/

CodeGenModule.cpp

3 lines

test/

CodeGen/

attr-cpuspecific-avx-abi.c

4 lines

attr-cpuspecific.c

6 lines

llvm/

include/

llvm/

TargetParser/

X86TargetParser.h

6 lines

X86TargetParser.def

46 lines

lib/

Target/

X86/

X86.td

96 lines

TargetParser/

X86TargetParser.cpp

254 lines

test/

CodeGen/

X86/

cpus-intel.ll

25 lines

Diff 532517

clang/lib/Basic/Targets/X86.h

Show First 20 Lines • Show All 212 Lines • ▼ Show 20 Lines	public:
bool validateCPUSpecificCPUDispatch(StringRef Name) const override;		bool validateCPUSpecificCPUDispatch(StringRef Name) const override;

char CPUSpecificManglingCharacter(StringRef Name) const override;		char CPUSpecificManglingCharacter(StringRef Name) const override;

void getCPUSpecificCPUDispatchFeatures(		void getCPUSpecificCPUDispatchFeatures(
StringRef Name,		StringRef Name,
llvm::SmallVectorImpl<StringRef> &Features) const override;		llvm::SmallVectorImpl<StringRef> &Features) const override;

StringRef getCPUSpecificTuneName(StringRef Name) const override;

std::optional<unsigned> getCPUCacheLineSize() const override;		std::optional<unsigned> getCPUCacheLineSize() const override;

bool validateAsmConstraint(const char *&Name,		bool validateAsmConstraint(const char *&Name,
TargetInfo::ConstraintInfo &info) const override;		TargetInfo::ConstraintInfo &info) const override;

bool validateGlobalRegisterVariable(StringRef RegName, unsigned RegSize,		bool validateGlobalRegisterVariable(StringRef RegName, unsigned RegSize,
bool &HasSizeMismatch) const override {		bool &HasSizeMismatch) const override {
// esp and ebp are the only 32-bit registers the x86 backend can currently		// esp and ebp are the only 32-bit registers the x86 backend can currently
▲ Show 20 Lines • Show All 764 Lines • Show Last 20 Lines

clang/lib/Basic/Targets/X86.cpp

Show First 20 Lines • Show All 1,153 Lines • ▼ Show 20 Lines	unsigned X86TargetInfo::multiVersionSortPriority(StringRef Name) const {
}		}

// Now we know we have a feature, so get its priority and shift it a few so		// Now we know we have a feature, so get its priority and shift it a few so
// that we have sufficient room for the CPUs (above).		// that we have sufficient room for the CPUs (above).
return getFeaturePriority(getFeature(Name)) << 1;		return getFeaturePriority(getFeature(Name)) << 1;
}		}

bool X86TargetInfo::validateCPUSpecificCPUDispatch(StringRef Name) const {		bool X86TargetInfo::validateCPUSpecificCPUDispatch(StringRef Name) const {
return llvm::StringSwitch<bool>(Name)		return llvm::X86::validateCPUSpecificCPUDispatch(Name);
#define CPU_SPECIFIC(NAME, TUNE_NAME, MANGLING, FEATURES) .Case(NAME, true)
#define CPU_SPECIFIC_ALIAS(NEW_NAME, TUNE_NAME, NAME) .Case(NEW_NAME, true)
#include "llvm/TargetParser/X86TargetParser.def"
.Default(false);
}

static StringRef CPUSpecificCPUDispatchNameDealias(StringRef Name) {
return llvm::StringSwitch<StringRef>(Name)
#define CPU_SPECIFIC_ALIAS(NEW_NAME, TUNE_NAME, NAME) .Case(NEW_NAME, NAME)
#include "llvm/TargetParser/X86TargetParser.def"
.Default(Name);
}		}

char X86TargetInfo::CPUSpecificManglingCharacter(StringRef Name) const {		char X86TargetInfo::CPUSpecificManglingCharacter(StringRef Name) const {
return llvm::StringSwitch<char>(CPUSpecificCPUDispatchNameDealias(Name))		return llvm::X86::getCPUDispatchMangling(Name);
#define CPU_SPECIFIC(NAME, TUNE_NAME, MANGLING, FEATURES) .Case(NAME, MANGLING)
#include "llvm/TargetParser/X86TargetParser.def"
.Default(0);
}		}

void X86TargetInfo::getCPUSpecificCPUDispatchFeatures(		void X86TargetInfo::getCPUSpecificCPUDispatchFeatures(
StringRef Name, llvm::SmallVectorImpl<StringRef> &Features) const {		StringRef Name, llvm::SmallVectorImpl<StringRef> &Features) const {
StringRef WholeList =		SmallVector<StringRef, 32> TargetCPUFeatures;
llvm::StringSwitch<StringRef>(CPUSpecificCPUDispatchNameDealias(Name))		llvm::X86::getFeaturesForCPU(Name, TargetCPUFeatures, true);
#define CPU_SPECIFIC(NAME, TUNE_NAME, MANGLING, FEATURES) .Case(NAME, FEATURES)		for (auto &F : TargetCPUFeatures)
#include "llvm/TargetParser/X86TargetParser.def"		Features.push_back(F);
.Default("");
WholeList.split(Features, ',', /MaxSplit=/-1, /KeepEmpty=/false);
}

StringRef X86TargetInfo::getCPUSpecificTuneName(StringRef Name) const {
return llvm::StringSwitch<StringRef>(Name)
#define CPU_SPECIFIC(NAME, TUNE_NAME, MANGLING, FEATURES) .Case(NAME, TUNE_NAME)
#define CPU_SPECIFIC_ALIAS(NEW_NAME, TUNE_NAME, NAME) .Case(NEW_NAME, TUNE_NAME)
#include "llvm/TargetParser/X86TargetParser.def"
.Default("");
}		}

// We can't use a generic validation scheme for the cpus accepted here		// We can't use a generic validation scheme for the cpus accepted here
// versus subtarget cpus accepted in the target attribute because the		// versus subtarget cpus accepted in the target attribute because the
// variables intitialized by the runtime only support the below currently		// variables intitialized by the runtime only support the below currently
// rather than the full range of cpus.		// rather than the full range of cpus.
bool X86TargetInfo::validateCpuIs(StringRef FeatureStr) const {		bool X86TargetInfo::validateCpuIs(StringRef FeatureStr) const {
return llvm::StringSwitch<bool>(FeatureStr)		return llvm::StringSwitch<bool>(FeatureStr)
▲ Show 20 Lines • Show All 415 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenModule.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,263 Lines • ▼ Show 20 Lines	if (TD) {
if (!ParsedAttr.Tune.empty() &&		if (!ParsedAttr.Tune.empty() &&
getTarget().isValidCPUName(ParsedAttr.Tune))		getTarget().isValidCPUName(ParsedAttr.Tune))
TuneCPU = ParsedAttr.Tune;		TuneCPU = ParsedAttr.Tune;
}		}

if (SD) {		if (SD) {
// Apply the given CPU name as the 'tune-cpu' so that the optimizer can		// Apply the given CPU name as the 'tune-cpu' so that the optimizer can
// favor this processor.		// favor this processor.
TuneCPU = getTarget().getCPUSpecificTuneName(		TuneCPU = SD->getCPUName(GD.getMultiVersionIndex())->getName();
		erichkeaneUnsubmitted Not Done Reply Inline Actions So my understanding here is that our intent was that the 'tune' cpu and the 'selected' cpu were not necessarily the same (either not the same name, OR not the same CPU!), right? Is that being lost here? erichkeane: So my understanding here is that our intent was that the 'tune' cpu and the 'selected' cpu were…
SD->getCPUName(GD.getMultiVersionIndex())->getName());
}		}
} else {		} else {
// Otherwise just add the existing target cpu and target features to the		// Otherwise just add the existing target cpu and target features to the
// function.		// function.
Features = getTarget().getTargetOpts().Features;		Features = getTarget().getTargetOpts().Features;
}		}

if (!TargetCPU.empty()) {		if (!TargetCPU.empty()) {
▲ Show 20 Lines • Show All 4,977 Lines • Show Last 20 Lines

clang/test/CodeGen/attr-cpuspecific-avx-abi.c

	Show All 18 Lines
	// CHECK: define{{.*}} @foo.A() #[[A:[0-9]+]]			// CHECK: define{{.*}} @foo.A() #[[A:[0-9]+]]

	__attribute__((cpu_specific(core_4th_gen_avx)))			__attribute__((cpu_specific(core_4th_gen_avx)))
	__m256d foo(void) { return bar_avx2(); }			__m256d foo(void) { return bar_avx2(); }
	// CHECK: define{{.*}} @foo.V() #[[V:[0-9]+]]			// CHECK: define{{.*}} @foo.V() #[[V:[0-9]+]]

	// CHECK: attributes #[[A]] = {{.*}}"target-features"="+avx,+crc32,+cx8,+mmx,+popcnt,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave"			// CHECK: attributes #[[A]] = {{.*}}"target-features"="+avx,+crc32,+cx8,+mmx,+popcnt,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave"
	// CHECK-SAME: "tune-cpu"="generic"			// CHECK-SAME: "tune-cpu"="generic"
	// CHECK: attributes #[[V]] = {{.*}}"target-features"="+avx,+avx2,+bmi,+cmov,+crc32,+cx8,+f16c,+fma,+lzcnt,+mmx,+movbe,+popcnt,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave"			// CHECK: attributes #[[V]] = {{.*}}"target-features"="+avx,+avx2,+bmi,+bmi2,+crc32,+cx16,+cx8,+f16c,+fma,+fsgsbase,+fxsr,+invpcid,+lzcnt,+mmx,+movbe,+pclmul,+popcnt,+rdrnd,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave,+xsaveopt"
	// CHECK-SAME: "tune-cpu"="haswell"			// CHECK-SAME: "tune-cpu"="core_4th_gen_avx"

clang/test/CodeGen/attr-cpuspecific.c

	Show All 36 Lines
	ATTR(cpu_specific(ivybridge))			ATTR(cpu_specific(ivybridge))
	void SingleVersion(void){}			void SingleVersion(void){}
	// LINUX: define{{.*}} void @SingleVersion.S() #[[S:[0-9]+]]			// LINUX: define{{.*}} void @SingleVersion.S() #[[S:[0-9]+]]
	// WINDOWS: define dso_local void @SingleVersion.S() #[[S:[0-9]+]]			// WINDOWS: define dso_local void @SingleVersion.S() #[[S:[0-9]+]]

	ATTR(cpu_dispatch(ivybridge))			ATTR(cpu_dispatch(ivybridge))
	void SingleVersion(void);			void SingleVersion(void);
	// LINUX: define weak_odr ptr @SingleVersion.resolver()			// LINUX: define weak_odr ptr @SingleVersion.resolver()
	// LINUX: call void @__cpu_indicator_init			// LINUX: call void @__cpu_indicator_init
	// LINUX: ret ptr @SingleVersion.S			// LINUX: ret ptr @SingleVersion.S
				FreddyYeAuthorUnsubmitted Done Reply Inline Actions This value change is because the feature list of ivybridge described in X86TargetParser.def before missed feature "pclmul". FreddyYe: This value change is because the feature list of ivybridge described in X86TargetParser.def…
				RKSimonUnsubmitted Not Done Reply Inline Actions Pull out an ivybridge fix into its own patch? RKSimon: Pull out an ivybridge fix into its own patch?
				FreddyYeAuthorUnsubmitted Done Reply Inline Actions Emmm. Seems like a good idea. How about I change these two CPU's old feature list only? Since the successors of them will also have this issue, but no tests influenced. FreddyYe: Emmm. Seems like a good idea. How about I change these two CPU's old feature list only? Since…
				RKSimonUnsubmitted Done Reply Inline Actions Sure, a single patch with multiple cpus' fixups is fine RKSimon: Sure, a single patch with multiple cpus' fixups is fine
				FreddyYeAuthorUnsubmitted Done Reply Inline Actions Done in https://reviews.llvm.org/D154209 FreddyYe: Done in https://reviews.llvm.org/D154209
	// LINUX: call void @llvm.trap			// LINUX: call void @llvm.trap
	// LINUX: unreachable			// LINUX: unreachable

	// WINDOWS: define weak_odr dso_local void @SingleVersion() comdat			// WINDOWS: define weak_odr dso_local void @SingleVersion() comdat
	// WINDOWS: call void @__cpu_indicator_init()			// WINDOWS: call void @__cpu_indicator_init()
	// WINDOWS: call void @SingleVersion.S()			// WINDOWS: call void @SingleVersion.S()
				FreddyYeAuthorUnsubmitted Done Reply Inline Actions This value change is because the feature list of knl described in X86TargetParser.def before missed feature "bmi2" and "aes". FreddyYe: This value change is because the feature list of knl described in X86TargetParser.def before…
				pengfeiUnsubmitted Done Reply Inline Actions The comment is for TwoVersions? pengfei: The comment is for TwoVersions?
				FreddyYeAuthorUnsubmitted Done Reply Inline Actions Yes. Sorry for wrong point. FreddyYe: Yes. Sorry for wrong point.
	// WINDOWS-NEXT: ret void			// WINDOWS-NEXT: ret void
	// WINDOWS: call void @llvm.trap			// WINDOWS: call void @llvm.trap
	// WINDOWS: unreachable			// WINDOWS: unreachable

	ATTR(cpu_specific(ivybridge))			ATTR(cpu_specific(ivybridge))
	void NotCalled(void){}			void NotCalled(void){}
	// LINUX: define{{.*}} void @NotCalled.S() #[[S]]			// LINUX: define{{.*}} void @NotCalled.S() #[[S]]
	// WINDOWS: define dso_local void @NotCalled.S() #[[S:[0-9]+]]			// WINDOWS: define dso_local void @NotCalled.S() #[[S:[0-9]+]]
	▲ Show 20 Lines • Show All 273 Lines • ▼ Show 20 Lines
	// LINUX: ret i32 1			// LINUX: ret i32 1

	// WINDOWS: define dso_local i32 @DispatchFirst.B			// WINDOWS: define dso_local i32 @DispatchFirst.B
	// WINDOWS: ret i32 1			// WINDOWS: ret i32 1

	ATTR(cpu_specific(knl))			ATTR(cpu_specific(knl))
	void OrderDispatchUsageSpecific(void) {}			void OrderDispatchUsageSpecific(void) {}

	// CHECK: attributes #[[S]] = {{.*}}"target-features"="+avx,+cmov,+crc32,+cx8,+f16c,+mmx,+popcnt,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave"			// CHECK: attributes #[[S]] = {{.*}}"target-features"="+avx,+crc32,+cx16,+cx8,+f16c,+fsgsbase,+fxsr,+mmx,+pclmul,+popcnt,+rdrnd,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave,+xsaveopt"
	pengfeiUnsubmitted Not Done Reply Inline Actions Why `cmov` disappearing? This feature is supported by most recent targets. Same below. pengfei: Why `cmov` disappearing? This feature is supported by most recent targets. Same below.
	FreddyYeAuthorUnsubmitted Done Reply Inline Actions Good catch. Seems like frontend codes missed CMOV from the very beginning, while X86.td doesn't have this issue. Will update. FreddyYe: Good catch. Seems like frontend codes missed CMOV from the very beginning, while X86.td doesn't…
	// CHECK-SAME: "tune-cpu"="ivybridge"			// CHECK-SAME: "tune-cpu"="ivybridge"
	// CHECK: attributes #[[K]] = {{.*}}"target-features"="+adx,+avx,+avx2,+avx512cd,+avx512er,+avx512f,+avx512pf,+bmi,+cmov,+crc32,+cx8,+f16c,+fma,+lzcnt,+mmx,+movbe,+popcnt,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave"			// CHECK: attributes #[[K]] = {{.*}}"target-features"="+adx,+aes,+avx,+avx2,+avx512cd,+avx512er,+avx512f,+avx512pf,+bmi,+bmi2,+crc32,+cx16,+cx8,+f16c,+fma,+fsgsbase,+fxsr,+invpcid,+lzcnt,+mmx,+movbe,+pclmul,+popcnt,+prefetchwt1,+prfchw,+rdrnd,+rdseed,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave,+xsaveopt"
	// CHECK-SAME: "tune-cpu"="knl"			// CHECK-SAME: "tune-cpu"="knl"
	// CHECK: attributes #[[O]] = {{.*}}"target-features"="+cmov,+cx8,+mmx,+movbe,+sse,+sse2,+sse3,+ssse3,+x87"			// CHECK: attributes #[[O]] = {{.*}}"target-features"="+cx16,+cx8,+fxsr,+mmx,+movbe,+sahf,+sse,+sse2,+sse3,+ssse3,+x87"
	// CHECK-SAME: "tune-cpu"="atom"			// CHECK-SAME: "tune-cpu"="atom"

llvm/include/llvm/TargetParser/X86TargetParser.h

	Show First 20 Lines • Show All 149 Lines • ▼ Show 20 Lines
	/// Provide a list of valid -mtune names.			/// Provide a list of valid -mtune names.
	void fillValidTuneCPUList(SmallVectorImpl<StringRef> &Values,			void fillValidTuneCPUList(SmallVectorImpl<StringRef> &Values,
	bool Only64Bit = false);			bool Only64Bit = false);

	/// Get the key feature prioritizing target multiversioning.			/// Get the key feature prioritizing target multiversioning.
	ProcessorFeatures getKeyFeature(CPUKind Kind);			ProcessorFeatures getKeyFeature(CPUKind Kind);

	/// Fill in the features that \p CPU supports into \p Features.			/// Fill in the features that \p CPU supports into \p Features.
	void getFeaturesForCPU(StringRef CPU, SmallVectorImpl<StringRef> &Features);			/// "+" will be append in front of each feature if IfNeedPlus is true.
				void getFeaturesForCPU(StringRef CPU, SmallVectorImpl<StringRef> &Features,
				bool IfNeedPlus = false);

	/// Set or clear entries in \p Features that are implied to be enabled/disabled			/// Set or clear entries in \p Features that are implied to be enabled/disabled
	/// by the provided \p Feature.			/// by the provided \p Feature.
	void updateImpliedFeatures(StringRef Feature, bool Enabled,			void updateImpliedFeatures(StringRef Feature, bool Enabled,
	StringMap<bool> &Features);			StringMap<bool> &Features);

				char getCPUDispatchMangling(StringRef Name);
				bool validateCPUSpecificCPUDispatch(StringRef Name);
	uint64_t getCpuSupportsMask(ArrayRef<StringRef> FeatureStrs);			uint64_t getCpuSupportsMask(ArrayRef<StringRef> FeatureStrs);
	unsigned getFeaturePriority(ProcessorFeatures Feat);			unsigned getFeaturePriority(ProcessorFeatures Feat);

	} // namespace X86			} // namespace X86
	} // namespace llvm			} // namespace llvm

	#endif			#endif

llvm/include/llvm/TargetParser/X86TargetParser.def

	Show First 20 Lines • Show All 229 Lines • ▼ Show 20 Lines
	// These features aren't really CPU features, but the frontend can set them.			// These features aren't really CPU features, but the frontend can set them.
	X86_FEATURE (RETPOLINE_EXTERNAL_THUNK, "retpoline-external-thunk")			X86_FEATURE (RETPOLINE_EXTERNAL_THUNK, "retpoline-external-thunk")
	X86_FEATURE (RETPOLINE_INDIRECT_BRANCHES, "retpoline-indirect-branches")			X86_FEATURE (RETPOLINE_INDIRECT_BRANCHES, "retpoline-indirect-branches")
	X86_FEATURE (RETPOLINE_INDIRECT_CALLS, "retpoline-indirect-calls")			X86_FEATURE (RETPOLINE_INDIRECT_CALLS, "retpoline-indirect-calls")
	X86_FEATURE (LVI_CFI, "lvi-cfi")			X86_FEATURE (LVI_CFI, "lvi-cfi")
	X86_FEATURE (LVI_LOAD_HARDENING, "lvi-load-hardening")			X86_FEATURE (LVI_LOAD_HARDENING, "lvi-load-hardening")
	#undef X86_FEATURE_COMPAT			#undef X86_FEATURE_COMPAT
	#undef X86_FEATURE			#undef X86_FEATURE

	#ifndef CPU_SPECIFIC
	#define CPU_SPECIFIC(NAME, TUNE_NAME, MANGLING, FEATURES)
	#endif

	#ifndef CPU_SPECIFIC_ALIAS
	#define CPU_SPECIFIC_ALIAS(NEW_NAME, TUNE_NAME, NAME)
	#endif

	CPU_SPECIFIC("generic", "generic", 'A', "")
	CPU_SPECIFIC("pentium", "pentium", 'B', "")
	CPU_SPECIFIC("pentium_pro", "pentiumpro", 'C', "+cmov")
	CPU_SPECIFIC("pentium_mmx", "pentium-mmx", 'D', "+mmx")
	CPU_SPECIFIC("pentium_ii", "pentium2", 'E', "+cmov,+mmx")
	CPU_SPECIFIC("pentium_iii", "pentium3", 'H', "+cmov,+mmx,+sse")
	CPU_SPECIFIC_ALIAS("pentium_iii_no_xmm_regs", "pentium3", "pentium_iii")
	CPU_SPECIFIC("pentium_4", "pentium4", 'J', "+cmov,+mmx,+sse,+sse2")
	CPU_SPECIFIC("pentium_m", "pentium-m", 'K', "+cmov,+mmx,+sse,+sse2")
	CPU_SPECIFIC("pentium_4_sse3", "prescott", 'L', "+cmov,+mmx,+sse,+sse2,+sse3")
	CPU_SPECIFIC("core_2_duo_ssse3", "core2", 'M', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3")
	CPU_SPECIFIC("core_2_duo_sse4_1", "penryn", 'N', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1")
	CPU_SPECIFIC("atom", "atom", 'O', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+movbe")
	CPU_SPECIFIC("atom_sse4_2", "silvermont", 'c', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+popcnt")
	CPU_SPECIFIC("core_i7_sse4_2", "nehalem", 'P', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+popcnt")
	CPU_SPECIFIC("core_aes_pclmulqdq", "westmere", 'Q', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+popcnt")
	CPU_SPECIFIC("atom_sse4_2_movbe", "silvermont", 'd', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt")
	CPU_SPECIFIC("goldmont", "goldmont", 'i', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt")
	CPU_SPECIFIC("sandybridge", "sandybridge", 'R', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+popcnt,+avx")
	CPU_SPECIFIC_ALIAS("core_2nd_gen_avx", "sandybridge", "sandybridge")
	CPU_SPECIFIC("ivybridge", "ivybridge", 'S', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+popcnt,+f16c,+avx")
	CPU_SPECIFIC_ALIAS("core_3rd_gen_avx", "ivybridge", "ivybridge")
	CPU_SPECIFIC("haswell", "haswell", 'V', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt,+f16c,+avx,+fma,+bmi,+lzcnt,+avx2")
	CPU_SPECIFIC_ALIAS("core_4th_gen_avx", "haswell", "haswell")
	CPU_SPECIFIC("core_4th_gen_avx_tsx", "haswell", 'W', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt,+f16c,+avx,+fma,+bmi,+lzcnt,+avx2")
	CPU_SPECIFIC("broadwell", "broadwell", 'X', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt,+f16c,+avx,+fma,+bmi,+lzcnt,+avx2,+adx")
	CPU_SPECIFIC_ALIAS("core_5th_gen_avx", "broadwell", "broadwell")
	CPU_SPECIFIC("core_5th_gen_avx_tsx", "broadwell", 'Y', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt,+f16c,+avx,+fma,+bmi,+lzcnt,+avx2,+adx")
	CPU_SPECIFIC("knl", "knl", 'Z', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt,+f16c,+avx,+fma,+bmi,+lzcnt,+avx2,+avx512f,+adx,+avx512er,+avx512pf,+avx512cd")
	CPU_SPECIFIC_ALIAS("mic_avx512", "knl", "knl")
	CPU_SPECIFIC("skylake", "skylake", 'b', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt,+f16c,+avx,+fma,+bmi,+lzcnt,+avx2,+adx,+mpx")
	CPU_SPECIFIC( "skylake_avx512", "skylake-avx512", 'a', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt,+f16c,+avx,+fma,+bmi,+lzcnt,+avx2,+avx512dq,+avx512f,+adx,+avx512cd,+avx512bw,+avx512vl,+clwb")
	CPU_SPECIFIC("cannonlake", "cannonlake", 'e', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt,+f16c,+avx,+fma,+bmi,+lzcnt,+avx2,+avx512dq,+avx512f,+adx,+avx512ifma,+avx512cd,+avx512bw,+avx512vl,+avx512vbmi")
	CPU_SPECIFIC("knm", "knm", 'j', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt,+f16c,+avx,+fma,+bmi,+lzcnt,+avx2,+avx512f,+adx,+avx512er,+avx512pf,+avx512cd,+avx5124fmaps,+avx5124vnniw,+avx512vpopcntdq")

	#undef CPU_SPECIFIC_ALIAS
	#undef CPU_SPECIFIC

llvm/lib/Target/X86/X86.td

	Show First 20 Lines • Show All 1,461 Lines • ▼ Show 20 Lines
	def : Proc<"i386", [FeatureX87],			def : Proc<"i386", [FeatureX87],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
	def : Proc<"i486", [FeatureX87],			def : Proc<"i486", [FeatureX87],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
	def : Proc<"i586", [FeatureX87, FeatureCX8],			def : Proc<"i586", [FeatureX87, FeatureCX8],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
	def : Proc<"pentium", [FeatureX87, FeatureCX8],			def : Proc<"pentium", [FeatureX87, FeatureCX8],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
	def : Proc<"pentium-mmx", [FeatureX87, FeatureCX8, FeatureMMX],			foreach P = ["pentium-mmx", "pentium_mmx"] in {
				def : Proc<P, [FeatureX87, FeatureCX8, FeatureMMX],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
				}
	def : Proc<"i686", [FeatureX87, FeatureCX8, FeatureCMOV],			def : Proc<"i686", [FeatureX87, FeatureCX8, FeatureCMOV],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
	def : Proc<"pentiumpro", [FeatureX87, FeatureCX8, FeatureCMOV,			foreach P = ["pentiumpro", "pentium_pro"] in {
	FeatureNOPL],			def : Proc<P, [FeatureX87, FeatureCX8, FeatureCMOV, FeatureNOPL],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
				}
	def : Proc<"pentium2", [FeatureX87, FeatureCX8, FeatureMMX, FeatureCMOV,			foreach P = ["pentium2", "pentium_ii"] in {
				def : Proc<P, [FeatureX87, FeatureCX8, FeatureMMX, FeatureCMOV,
	FeatureFXSR, FeatureNOPL],			FeatureFXSR, FeatureNOPL],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
				}
	foreach P = ["pentium3", "pentium3m"] in {			foreach P = ["pentium3", "pentium3m", "pentium_iii_no_xmm_regs", "pentium_iii"] in {
	def : Proc<P, [FeatureX87, FeatureCX8, FeatureMMX,			def : Proc<P, [FeatureX87, FeatureCX8, FeatureMMX,
	FeatureSSE1, FeatureFXSR, FeatureNOPL, FeatureCMOV],			FeatureSSE1, FeatureFXSR, FeatureNOPL, FeatureCMOV],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
	}			}

	// Enable the PostRAScheduler for SSE2 and SSE3 class cpus.			// Enable the PostRAScheduler for SSE2 and SSE3 class cpus.
	// The intent is to enable it for pentium4 which is the current default			// The intent is to enable it for pentium4 which is the current default
	// processor in a vanilla 32-bit clang compilation when no specific			// processor in a vanilla 32-bit clang compilation when no specific
	// architecture is specified. This generally gives a nice performance			// architecture is specified. This generally gives a nice performance
	// increase on silvermont, with largely neutral behavior on other			// increase on silvermont, with largely neutral behavior on other
	// contemporary large core processors.			// contemporary large core processors.
	// pentium-m, pentium4m, prescott and nocona are included as a preventative			// pentium-m, pentium4m, prescott and nocona are included as a preventative
	// measure to avoid performance surprises, in case clang's default cpu			// measure to avoid performance surprises, in case clang's default cpu
	// changes slightly.			// changes slightly.

	def : ProcModel<"pentium-m", GenericPostRAModel,			foreach P = ["pentium_m", "pentium-m"] in {
				def : ProcModel<P, GenericPostRAModel,
	[FeatureX87, FeatureCX8, FeatureMMX, FeatureSSE2,			[FeatureX87, FeatureCX8, FeatureMMX, FeatureSSE2,
	FeatureFXSR, FeatureNOPL, FeatureCMOV],			FeatureFXSR, FeatureNOPL, FeatureCMOV],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
				}

	foreach P = ["pentium4", "pentium4m"] in {			foreach P = ["pentium4", "pentium4m", "pentium_4"] in {
	def : ProcModel<P, GenericPostRAModel,			def : ProcModel<P, GenericPostRAModel,
	[FeatureX87, FeatureCX8, FeatureMMX, FeatureSSE2,			[FeatureX87, FeatureCX8, FeatureMMX, FeatureSSE2,
	FeatureFXSR, FeatureNOPL, FeatureCMOV],			FeatureFXSR, FeatureNOPL, FeatureCMOV],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
	}			}

	// Intel Quark.			// Intel Quark.
	def : Proc<"lakemont", [FeatureCX8],			def : Proc<"lakemont", [FeatureCX8],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;

	// Intel Core Duo.			// Intel Core Duo.
	def : ProcModel<"yonah", SandyBridgeModel,			def : ProcModel<"yonah", SandyBridgeModel,
	[FeatureX87, FeatureCX8, FeatureMMX, FeatureSSE3,			[FeatureX87, FeatureCX8, FeatureMMX, FeatureSSE3,
	FeatureFXSR, FeatureNOPL, FeatureCMOV],			FeatureFXSR, FeatureNOPL, FeatureCMOV],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;

	// NetBurst.			// NetBurst.
	def : ProcModel<"prescott", GenericPostRAModel,			foreach P = ["prescott", "pentium_4_sse3"] in {
				def : ProcModel<P, GenericPostRAModel,
	[FeatureX87, FeatureCX8, FeatureMMX, FeatureSSE3,			[FeatureX87, FeatureCX8, FeatureMMX, FeatureSSE3,
	FeatureFXSR, FeatureNOPL, FeatureCMOV],			FeatureFXSR, FeatureNOPL, FeatureCMOV],
	[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;			[TuningSlowUAMem16, TuningInsertVZEROUPPER]>;
				}
	def : ProcModel<"nocona", GenericPostRAModel, [			def : ProcModel<"nocona", GenericPostRAModel, [
	FeatureX87,			FeatureX87,
	FeatureCX8,			FeatureCX8,
	FeatureCMOV,			FeatureCMOV,
	FeatureMMX,			FeatureMMX,
	FeatureSSE3,			FeatureSSE3,
	FeatureFXSR,			FeatureFXSR,
	FeatureNOPL,			FeatureNOPL,
	FeatureX86_64,			FeatureX86_64,
	FeatureCX16,			FeatureCX16,
	],			],
	[			[
	TuningSlowUAMem16,			TuningSlowUAMem16,
	TuningInsertVZEROUPPER			TuningInsertVZEROUPPER
	]>;			]>;

	// Intel Core 2 Solo/Duo.			// Intel Core 2 Solo/Duo.
	def : ProcModel<"core2", SandyBridgeModel, [			foreach P = ["core2", "core_2_duo_ssse3"] in {
				def : ProcModel<P, SandyBridgeModel, [
	FeatureX87,			FeatureX87,
	FeatureCX8,			FeatureCX8,
	FeatureCMOV,			FeatureCMOV,
	FeatureMMX,			FeatureMMX,
	FeatureSSSE3,			FeatureSSSE3,
	FeatureFXSR,			FeatureFXSR,
	FeatureNOPL,			FeatureNOPL,
	FeatureX86_64,			FeatureX86_64,
	FeatureCX16,			FeatureCX16,
	FeatureLAHFSAHF64			FeatureLAHFSAHF64
	],			],
	[			[
	TuningMacroFusion,			TuningMacroFusion,
	TuningSlowUAMem16,			TuningSlowUAMem16,
	TuningInsertVZEROUPPER			TuningInsertVZEROUPPER
	]>;			]>;
	def : ProcModel<"penryn", SandyBridgeModel, [			}
				foreach P = ["penryn", "core_2_duo_sse4_1"] in {
				def : ProcModel<P, SandyBridgeModel, [
	FeatureX87,			FeatureX87,
	FeatureCX8,			FeatureCX8,
	FeatureCMOV,			FeatureCMOV,
	FeatureMMX,			FeatureMMX,
	FeatureSSE41,			FeatureSSE41,
	FeatureFXSR,			FeatureFXSR,
	FeatureNOPL,			FeatureNOPL,
	FeatureX86_64,			FeatureX86_64,
	FeatureCX16,			FeatureCX16,
	FeatureLAHFSAHF64			FeatureLAHFSAHF64
	],			],
	[			[
	TuningMacroFusion,			TuningMacroFusion,
	TuningSlowUAMem16,			TuningSlowUAMem16,
	TuningInsertVZEROUPPER			TuningInsertVZEROUPPER
	]>;			]>;
				}

	// Atom CPUs.			// Atom CPUs.
	foreach P = ["bonnell", "atom"] in {			foreach P = ["bonnell", "atom"] in {
	def : ProcModel<P, AtomModel, ProcessorFeatures.AtomFeatures,			def : ProcModel<P, AtomModel, ProcessorFeatures.AtomFeatures,
	ProcessorFeatures.AtomTuning>;			ProcessorFeatures.AtomTuning>;
	}			}

	foreach P = ["silvermont", "slm"] in {			foreach P = ["silvermont", "slm", "atom_sse4_2"] in {
	def : ProcModel<P, SLMModel, ProcessorFeatures.SLMFeatures,			def : ProcModel<P, SLMModel, ProcessorFeatures.SLMFeatures,
	ProcessorFeatures.SLMTuning>;			ProcessorFeatures.SLMTuning>;
	}			}

				def : ProcModel<"atom_sse4_2_movbe", SLMModel, ProcessorFeatures.GLMFeatures,
				ProcessorFeatures.SLMTuning>;
	def : ProcModel<"goldmont", SLMModel, ProcessorFeatures.GLMFeatures,			def : ProcModel<"goldmont", SLMModel, ProcessorFeatures.GLMFeatures,
	ProcessorFeatures.GLMTuning>;			ProcessorFeatures.GLMTuning>;
	def : ProcModel<"goldmont-plus", SLMModel, ProcessorFeatures.GLPFeatures,			foreach P = ["goldmont_plus", "goldmont-plus"] in {
				def : ProcModel<P, SLMModel, ProcessorFeatures.GLPFeatures,
	ProcessorFeatures.GLPTuning>;			ProcessorFeatures.GLPTuning>;
				}
	def : ProcModel<"tremont", SLMModel, ProcessorFeatures.TRMFeatures,			def : ProcModel<"tremont", SLMModel, ProcessorFeatures.TRMFeatures,
	ProcessorFeatures.TRMTuning>;			ProcessorFeatures.TRMTuning>;
	def : ProcModel<"sierraforest", AlderlakePModel, ProcessorFeatures.SRFFeatures,			def : ProcModel<"sierraforest", AlderlakePModel, ProcessorFeatures.SRFFeatures,
	ProcessorFeatures.TRMTuning>;			ProcessorFeatures.TRMTuning>;
	def : ProcModel<"grandridge", AlderlakePModel, ProcessorFeatures.GRRFeatures,			def : ProcModel<"grandridge", AlderlakePModel, ProcessorFeatures.GRRFeatures,
	ProcessorFeatures.TRMTuning>;			ProcessorFeatures.TRMTuning>;

	// "Arrandale" along with corei3 and corei5			// "Arrandale" along with corei3 and corei5
	foreach P = ["nehalem", "corei7"] in {			foreach P = ["nehalem", "corei7", "core_i7_sse4_2"] in {
	def : ProcModel<P, SandyBridgeModel, ProcessorFeatures.NHMFeatures,			def : ProcModel<P, SandyBridgeModel, ProcessorFeatures.NHMFeatures,
	ProcessorFeatures.NHMTuning>;			ProcessorFeatures.NHMTuning>;
	}			}

	// Westmere is the corei3/i5/i7 path from nehalem to sandybridge			// Westmere is the corei3/i5/i7 path from nehalem to sandybridge
	def : ProcModel<"westmere", SandyBridgeModel, ProcessorFeatures.WSMFeatures,			foreach P = ["westmere", "core_aes_pclmulqdq"] in {
				def : ProcModel<P, SandyBridgeModel, ProcessorFeatures.WSMFeatures,
	ProcessorFeatures.WSMTuning>;			ProcessorFeatures.WSMTuning>;
				}

	foreach P = ["sandybridge", "corei7-avx"] in {			foreach P = ["sandybridge", "corei7-avx", "core_2nd_gen_avx"] in {
	def : ProcModel<P, SandyBridgeModel, ProcessorFeatures.SNBFeatures,			def : ProcModel<P, SandyBridgeModel, ProcessorFeatures.SNBFeatures,
	ProcessorFeatures.SNBTuning>;			ProcessorFeatures.SNBTuning>;
	}			}

	foreach P = ["ivybridge", "core-avx-i"] in {			foreach P = ["ivybridge", "core-avx-i", "core_3rd_gen_avx"] in {
	def : ProcModel<P, SandyBridgeModel, ProcessorFeatures.IVBFeatures,			def : ProcModel<P, SandyBridgeModel, ProcessorFeatures.IVBFeatures,
	ProcessorFeatures.IVBTuning>;			ProcessorFeatures.IVBTuning>;
	}			}

	foreach P = ["haswell", "core-avx2"] in {			foreach P = ["haswell", "core-avx2", "core_4th_gen_avx", "core_4th_gen_avx_tsx"] in {
	def : ProcModel<P, HaswellModel, ProcessorFeatures.HSWFeatures,			def : ProcModel<P, HaswellModel, ProcessorFeatures.HSWFeatures,
	ProcessorFeatures.HSWTuning>;			ProcessorFeatures.HSWTuning>;
	}			}

	def : ProcModel<"broadwell", BroadwellModel, ProcessorFeatures.BDWFeatures,			foreach P = ["broadwell", "core_5th_gen_avx", "core_5th_gen_avx_tsx"] in {
				def : ProcModel<P, BroadwellModel, ProcessorFeatures.BDWFeatures,
	ProcessorFeatures.BDWTuning>;			ProcessorFeatures.BDWTuning>;
				}

	def : ProcModel<"skylake", SkylakeClientModel, ProcessorFeatures.SKLFeatures,			def : ProcModel<"skylake", SkylakeClientModel, ProcessorFeatures.SKLFeatures,
	ProcessorFeatures.SKLTuning>;			ProcessorFeatures.SKLTuning>;

	// FIXME: define KNL scheduler model			// FIXME: define KNL scheduler model
	def : ProcModel<"knl", HaswellModel, ProcessorFeatures.KNLFeatures,			foreach P = ["knl", "mic_avx512"] in {
				def : ProcModel<P, HaswellModel, ProcessorFeatures.KNLFeatures,
	ProcessorFeatures.KNLTuning>;			ProcessorFeatures.KNLTuning>;
				}
	def : ProcModel<"knm", HaswellModel, ProcessorFeatures.KNMFeatures,			def : ProcModel<"knm", HaswellModel, ProcessorFeatures.KNMFeatures,
	ProcessorFeatures.KNLTuning>;			ProcessorFeatures.KNLTuning>;

	foreach P = ["skylake-avx512", "skx"] in {			foreach P = ["skylake-avx512", "skx", "skylake_avx512"] in {
	def : ProcModel<P, SkylakeServerModel, ProcessorFeatures.SKXFeatures,			def : ProcModel<P, SkylakeServerModel, ProcessorFeatures.SKXFeatures,
	ProcessorFeatures.SKXTuning>;			ProcessorFeatures.SKXTuning>;
	}			}

	def : ProcModel<"cascadelake", SkylakeServerModel,			def : ProcModel<"cascadelake", SkylakeServerModel,
	ProcessorFeatures.CLXFeatures, ProcessorFeatures.CLXTuning>;			ProcessorFeatures.CLXFeatures, ProcessorFeatures.CLXTuning>;
	def : ProcModel<"cooperlake", SkylakeServerModel,			def : ProcModel<"cooperlake", SkylakeServerModel,
	ProcessorFeatures.CPXFeatures, ProcessorFeatures.CPXTuning>;			ProcessorFeatures.CPXFeatures, ProcessorFeatures.CPXTuning>;
	def : ProcModel<"cannonlake", SkylakeServerModel,			def : ProcModel<"cannonlake", SkylakeServerModel,
	ProcessorFeatures.CNLFeatures, ProcessorFeatures.CNLTuning>;			ProcessorFeatures.CNLFeatures, ProcessorFeatures.CNLTuning>;
	def : ProcModel<"icelake-client", IceLakeModel,			foreach P = ["icelake-client", "icelake_client"] in {
				def : ProcModel<P, IceLakeModel,
	ProcessorFeatures.ICLFeatures, ProcessorFeatures.ICLTuning>;			ProcessorFeatures.ICLFeatures, ProcessorFeatures.ICLTuning>;
				}
	def : ProcModel<"rocketlake", IceLakeModel,			def : ProcModel<"rocketlake", IceLakeModel,
	ProcessorFeatures.ICLFeatures, ProcessorFeatures.ICLTuning>;			ProcessorFeatures.ICLFeatures, ProcessorFeatures.ICLTuning>;
	def : ProcModel<"icelake-server", IceLakeModel,			foreach P = ["icelake-server", "icelake_server"] in {
				def : ProcModel<P, IceLakeModel,
	ProcessorFeatures.ICXFeatures, ProcessorFeatures.ICXTuning>;			ProcessorFeatures.ICXFeatures, ProcessorFeatures.ICXTuning>;
				}
	def : ProcModel<"tigerlake", IceLakeModel,			def : ProcModel<"tigerlake", IceLakeModel,
	ProcessorFeatures.TGLFeatures, ProcessorFeatures.TGLTuning>;			ProcessorFeatures.TGLFeatures, ProcessorFeatures.TGLTuning>;
	def : ProcModel<"sapphirerapids", SkylakeServerModel,			def : ProcModel<"sapphirerapids", SkylakeServerModel,
	ProcessorFeatures.SPRFeatures, ProcessorFeatures.SPRTuning>;			ProcessorFeatures.SPRFeatures, ProcessorFeatures.SPRTuning>;
	def : ProcModel<"alderlake", AlderlakePModel,			def : ProcModel<"alderlake", AlderlakePModel,
	ProcessorFeatures.ADLFeatures, ProcessorFeatures.ADLTuning>;			ProcessorFeatures.ADLFeatures, ProcessorFeatures.ADLTuning>;
	def : ProcModel<"raptorlake", AlderlakePModel,			def : ProcModel<"raptorlake", AlderlakePModel,
	ProcessorFeatures.ADLFeatures, ProcessorFeatures.ADLTuning>;			ProcessorFeatures.ADLFeatures, ProcessorFeatures.ADLTuning>;
	▲ Show 20 Lines • Show All 177 Lines • Show Last 20 Lines

llvm/lib/TargetParser/X86TargetParser.cpp

//===-- X86TargetParser - Parser for X86 features ---------------- C++ --===//		//===-- X86TargetParser - Parser for X86 features ---------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file implements a target parser to recognise X86 hardware features.		// This file implements a target parser to recognise X86 hardware features.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/TargetParser/X86TargetParser.h"		#include "llvm/TargetParser/X86TargetParser.h"
#include "llvm/ADT/StringSwitch.h"		#include "llvm/ADT/StringSwitch.h"
#include <numeric>		#include <numeric>
		RKSimonUnsubmitted Done Reply Inline Actions (clang-format) - include order RKSimon: (clang-format) - include order

using namespace llvm;		using namespace llvm;
using namespace llvm::X86;		using namespace llvm::X86;

namespace {		namespace {

/// Container class for CPU features.		/// Container class for CPU features.
/// This is a constexpr reimplementation of a subset of std::bitset. It would be		/// This is a constexpr reimplementation of a subset of std::bitset. It would be
▲ Show 20 Lines • Show All 78 Lines • ▼ Show 20 Lines	public:
}		}
};		};

struct ProcInfo {		struct ProcInfo {
StringLiteral Name;		StringLiteral Name;
X86::CPUKind Kind;		X86::CPUKind Kind;
unsigned KeyFeature;		unsigned KeyFeature;
FeatureBitset Features;		FeatureBitset Features;
		char Mangling;
		RKSimonUnsubmitted Not Done Reply Inline Actions Do we have a documented list of the mangling values anywhere? The values below look too much like magic numbers tbh. RKSimon: Do we have a documented list of the mangling values anywhere? The values below look too much…
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions I think not. I copied them from the old MACROs in X86TargetParser.def. I think the values are just used to do some distinguish from different CPUs. And 'a'-'Z' is about to run out of soon here. We probably need to extend the mangling, may be with another letter as prefix. @erichkeane may have a better answer here. FreddyYe: I think not. I copied them from the old MACROs in X86TargetParser.def. I think the values are…
		bool OnlyForCPUDispatchSpecific;
};		};

struct FeatureInfo {		struct FeatureInfo {
StringLiteral Name;		StringLiteral Name;
FeatureBitset ImpliedFeatures;		FeatureBitset ImpliedFeatures;
};		};

} // end anonymous namespace		} // end anonymous namespace
▲ Show 20 Lines • Show All 184 Lines • ▼ Show 20 Lines
static constexpr FeatureBitset FeaturesZNVER4 =		static constexpr FeatureBitset FeaturesZNVER4 =
FeaturesZNVER3 \| FeatureAVX512F \| FeatureAVX512CD \| FeatureAVX512DQ \|		FeaturesZNVER3 \| FeatureAVX512F \| FeatureAVX512CD \| FeatureAVX512DQ \|
FeatureAVX512BW \| FeatureAVX512VL \| FeatureAVX512IFMA \| FeatureAVX512VBMI \|		FeatureAVX512BW \| FeatureAVX512VL \| FeatureAVX512IFMA \| FeatureAVX512VBMI \|
FeatureAVX512VBMI2 \| FeatureAVX512VNNI \| FeatureAVX512BITALG \|		FeatureAVX512VBMI2 \| FeatureAVX512VNNI \| FeatureAVX512BITALG \|
FeatureAVX512VPOPCNTDQ \| FeatureAVX512BF16 \| FeatureGFNI \|		FeatureAVX512VPOPCNTDQ \| FeatureAVX512BF16 \| FeatureGFNI \|
FeatureSHSTK;		FeatureSHSTK;

constexpr ProcInfo Processors[] = {		constexpr ProcInfo Processors[] = {
// Empty processor. Include X87 and CMPXCHG8 for backwards compatibility.		// Empty processor. Include X87 and CMPXCHG8 for backwards compatibility.
{ {""}, CK_None, ~0U, FeatureX87 \| FeatureCMPXCHG8B },		{ {""}, CK_None, ~0U, FeatureX87 \| FeatureCMPXCHG8B, '\0', false },
		RKSimonUnsubmitted Not Done Reply Inline Actions Would it be better to move all of this into X86TargetParser.def ? RKSimon: Would it be better to move all of this into X86TargetParser.def ?
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions Feels so. My next step plan it to furtherly combine the feature list table in X86.td together. FreddyYe: Feels so. My next step plan it to furtherly combine the feature list table in X86.td together.
		{ {"generic"}, CK_None, ~0U, FeatureX87 \| FeatureCMPXCHG8B \| Feature64BIT, 'A', true },
		RKSimonUnsubmitted Done Reply Inline Actions FeaturesPentiumMMX & ~FeaturesPentiumMMX ? RKSimon: FeaturesPentiumMMX & ~FeaturesPentiumMMX ?
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions I think I was to initialize a null feature list but failed... Will refine. FreddyYe: I think I was to initialize a null feature list but failed... Will refine.
// i386-generation processors.		// i386-generation processors.
{ {"i386"}, CK_i386, ~0U, FeatureX87 },		{ {"i386"}, CK_i386, ~0U, FeatureX87, '\0', false },
// i486-generation processors.		// i486-generation processors.
{ {"i486"}, CK_i486, ~0U, FeatureX87 },		{ {"i486"}, CK_i486, ~0U, FeatureX87, '\0', false },
{ {"winchip-c6"}, CK_WinChipC6, ~0U, FeaturesPentiumMMX },		{ {"winchip-c6"}, CK_WinChipC6, ~0U, FeaturesPentiumMMX, '\0', false },
{ {"winchip2"}, CK_WinChip2, ~0U, FeaturesPentiumMMX \| Feature3DNOW },		{ {"winchip2"}, CK_WinChip2, ~0U, FeaturesPentiumMMX \| Feature3DNOW, '\0', false },
{ {"c3"}, CK_C3, ~0U, FeaturesPentiumMMX \| Feature3DNOW },		{ {"c3"}, CK_C3, ~0U, FeaturesPentiumMMX \| Feature3DNOW, '\0', false },
// i586-generation processors, P5 microarchitecture based.		// i586-generation processors, P5 microarchitecture based.
{ {"i586"}, CK_i586, ~0U, FeatureX87 \| FeatureCMPXCHG8B },		{ {"i586"}, CK_i586, ~0U, FeatureX87 \| FeatureCMPXCHG8B, '\0', false },
{ {"pentium"}, CK_Pentium, ~0U, FeatureX87 \| FeatureCMPXCHG8B },		{ {"pentium"}, CK_Pentium, ~0U, FeatureX87 \| FeatureCMPXCHG8B, 'B', false },
{ {"pentium-mmx"}, CK_PentiumMMX, ~0U, FeaturesPentiumMMX },		{ {"pentium-mmx"}, CK_PentiumMMX, ~0U, FeaturesPentiumMMX, 'D', false },
		{ {"pentium_mmx"}, CK_PentiumMMX, ~0U, FeaturesPentiumMMX, 'D', true },
// i686-generation processors, P6 / Pentium M microarchitecture based.		// i686-generation processors, P6 / Pentium M microarchitecture based.
{ {"pentiumpro"}, CK_PentiumPro, ~0U, FeatureX87 \| FeatureCMPXCHG8B },		{ {"pentiumpro"}, CK_PentiumPro, ~0U, FeatureX87 \| FeatureCMPXCHG8B, 'C', false },
{ {"i686"}, CK_i686, ~0U, FeatureX87 \| FeatureCMPXCHG8B },		{ {"pentium_pro"}, CK_PentiumPro, ~0U, FeatureX87 \| FeatureCMPXCHG8B, 'C', true },
{ {"pentium2"}, CK_Pentium2, ~0U, FeaturesPentium2 },		{ {"i686"}, CK_i686, ~0U, FeatureX87 \| FeatureCMPXCHG8B, '\0', false },
{ {"pentium3"}, CK_Pentium3, ~0U, FeaturesPentium3 },		{ {"pentium2"}, CK_Pentium2, ~0U, FeaturesPentium2, 'E', false },
{ {"pentium3m"}, CK_Pentium3, ~0U, FeaturesPentium3 },		{ {"pentium_ii"}, CK_Pentium2, ~0U, FeaturesPentium2, 'E', true },
{ {"pentium-m"}, CK_PentiumM, ~0U, FeaturesPentium4 },		{ {"pentium3"}, CK_Pentium3, ~0U, FeaturesPentium3, 'H', false },
{ {"c3-2"}, CK_C3_2, ~0U, FeaturesPentium3 },		{ {"pentium3m"}, CK_Pentium3, ~0U, FeaturesPentium3, 'H', false },
{ {"yonah"}, CK_Yonah, ~0U, FeaturesPrescott },		{ {"pentium_iii"}, CK_Pentium3, ~0U, FeaturesPentium3, 'H', true },
		{ {"pentium_iii_no_xmm_regs"}, CK_Pentium3, ~0U, FeaturesPentium3, 'H', true },
		{ {"pentium-m"}, CK_PentiumM, ~0U, FeaturesPentium4, 'K', false },
		{ {"pentium_m"}, CK_PentiumM, ~0U, FeaturesPentium4, 'K', true },
		{ {"c3-2"}, CK_C3_2, ~0U, FeaturesPentium3, '\0', false },
		{ {"yonah"}, CK_Yonah, ~0U, FeaturesPrescott, 'L', false },
// Netburst microarchitecture based processors.		// Netburst microarchitecture based processors.
{ {"pentium4"}, CK_Pentium4, ~0U, FeaturesPentium4 },		{ {"pentium4"}, CK_Pentium4, ~0U, FeaturesPentium4, 'J', false },
{ {"pentium4m"}, CK_Pentium4, ~0U, FeaturesPentium4 },		{ {"pentium4m"}, CK_Pentium4, ~0U, FeaturesPentium4, 'J', false },
{ {"prescott"}, CK_Prescott, ~0U, FeaturesPrescott },		{ {"pentium_4"}, CK_Pentium4, ~0U, FeaturesPentium4, 'J', true },
{ {"nocona"}, CK_Nocona, ~0U, FeaturesNocona },		{ {"pentium_4_sse3"}, CK_Prescott, ~0U, FeaturesPrescott, 'L', true },
		{ {"prescott"}, CK_Prescott, ~0U, FeaturesPrescott, 'L', false },
		{ {"nocona"}, CK_Nocona, ~0U, FeaturesNocona, 'L', false },
// Core microarchitecture based processors.		// Core microarchitecture based processors.
{ {"core2"}, CK_Core2, FEATURE_SSSE3, FeaturesCore2 },		{ {"core2"}, CK_Core2, FEATURE_SSSE3, FeaturesCore2, 'M', false },
{ {"penryn"}, CK_Penryn, ~0U, FeaturesPenryn },		{ {"core_2_duo_ssse3"}, CK_Core2, ~0U, FeaturesCore2, 'N', true },
		{ {"penryn"}, CK_Penryn, ~0U, FeaturesPenryn, 'N', false },
		{ {"core_2_duo_sse4_1"}, CK_Penryn, ~0U, FeaturesPenryn, 'N', true },
// Atom processors		// Atom processors
{ {"bonnell"}, CK_Bonnell, FEATURE_SSSE3, FeaturesBonnell },		{ {"bonnell"}, CK_Bonnell, FEATURE_SSSE3, FeaturesBonnell, 'a', false },
{ {"atom"}, CK_Bonnell, FEATURE_SSSE3, FeaturesBonnell },		{ {"atom"}, CK_Bonnell, FEATURE_SSSE3, FeaturesBonnell, 'O', false },
{ {"silvermont"}, CK_Silvermont, FEATURE_SSE4_2, FeaturesSilvermont },		{ {"silvermont"}, CK_Silvermont, FEATURE_SSE4_2, FeaturesSilvermont, 'c', false },
{ {"slm"}, CK_Silvermont, FEATURE_SSE4_2, FeaturesSilvermont },		{ {"slm"}, CK_Silvermont, FEATURE_SSE4_2, FeaturesSilvermont, 'c', false },
{ {"goldmont"}, CK_Goldmont, FEATURE_SSE4_2, FeaturesGoldmont },		{ {"atom_sse4_2"}, CK_Silvermont, FEATURE_SSE4_2, FeaturesSilvermont, 'c', true },
{ {"goldmont-plus"}, CK_GoldmontPlus, FEATURE_SSE4_2, FeaturesGoldmontPlus },		{ {"atom_sse4_2_movbe"}, CK_Goldmont, FEATURE_SSE4_2, FeaturesGoldmont, 'c', true },
{ {"tremont"}, CK_Tremont, FEATURE_SSE4_2, FeaturesTremont },		{ {"goldmont"}, CK_Goldmont, FEATURE_SSE4_2, FeaturesGoldmont, 'i', false },
		{ {"goldmont-plus"}, CK_GoldmontPlus, FEATURE_SSE4_2, FeaturesGoldmontPlus, 'd', false },
		{ {"goldmont_plus"}, CK_GoldmontPlus, FEATURE_SSE4_2, FeaturesGoldmontPlus, 'd', true },
		{ {"tremont"}, CK_Tremont, FEATURE_SSE4_2, FeaturesTremont, 'd', false },
// Nehalem microarchitecture based processors.		// Nehalem microarchitecture based processors.
{ {"nehalem"}, CK_Nehalem, FEATURE_SSE4_2, FeaturesNehalem },		{ {"nehalem"}, CK_Nehalem, FEATURE_SSE4_2, FeaturesNehalem, 'P', false },
{ {"corei7"}, CK_Nehalem, FEATURE_SSE4_2, FeaturesNehalem },		{ {"core_i7_sse4_2"}, CK_Nehalem, FEATURE_SSE4_2, FeaturesNehalem, 'P', true },
		{ {"corei7"}, CK_Nehalem, FEATURE_SSE4_2, FeaturesNehalem, 'P', false },
// Westmere microarchitecture based processors.		// Westmere microarchitecture based processors.
{ {"westmere"}, CK_Westmere, FEATURE_PCLMUL, FeaturesWestmere },		{ {"westmere"}, CK_Westmere, FEATURE_PCLMUL, FeaturesWestmere, 'Q', false },
		{ {"core_aes_pclmulqdq"}, CK_Westmere, FEATURE_PCLMUL, FeaturesWestmere, 'Q', true },
// Sandy Bridge microarchitecture based processors.		// Sandy Bridge microarchitecture based processors.
{ {"sandybridge"}, CK_SandyBridge, FEATURE_AVX, FeaturesSandyBridge },		{ {"sandybridge"}, CK_SandyBridge, FEATURE_AVX, FeaturesSandyBridge, 'R', false },
{ {"corei7-avx"}, CK_SandyBridge, FEATURE_AVX, FeaturesSandyBridge },		{ {"core_2nd_gen_avx"}, CK_SandyBridge, FEATURE_AVX, FeaturesSandyBridge, 'R', true },
		{ {"corei7-avx"}, CK_SandyBridge, FEATURE_AVX, FeaturesSandyBridge, 'R', false },
// Ivy Bridge microarchitecture based processors.		// Ivy Bridge microarchitecture based processors.
{ {"ivybridge"}, CK_IvyBridge, FEATURE_AVX, FeaturesIvyBridge },		{ {"ivybridge"}, CK_IvyBridge, FEATURE_AVX, FeaturesIvyBridge, 'S', false },
{ {"core-avx-i"}, CK_IvyBridge, FEATURE_AVX, FeaturesIvyBridge },		{ {"core_3rd_gen_avx"}, CK_IvyBridge, FEATURE_AVX, FeaturesIvyBridge, 'S', true },
		{ {"core-avx-i"}, CK_IvyBridge, FEATURE_AVX, FeaturesIvyBridge, 'S', false },
		RKSimonUnsubmitted Done Reply Inline Actions I'm still not clear on what determines the mangling mode and cpu dispatch flag for cpu targets are supposedly the same? For example, none of these ivybridge equivalent configs have the same values. RKSimon: I'm still not clear on what determines the mangling mode and cpu dispatch flag for cpu targets…
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions I assign them by following orders: Copy the mangling from the original CPU_SPEICIFC MACRO. If there's no way to copy, assign to '\0' by default, which means doesn't support __cpu_specific/dispatch feature. If cpu name contain ''-', assign the mangling as '\0', too. Because '-' cannot be correctly identified in _cpu_specific/dispatch(). set OnlyForCPUDispatch flag as `true` if this cpu name was not listed here, which means it doesn't support -march, -mtune and so on. This flag makes this cpu name only support cpu_dispatch/specific feature. E.g. core_3rd_gen_avx, core_4rd_gen_avx., ... And normally, these names are very old. So supporting them with -march=, -mtune= is not easy for now. And notice that new cpu names shouldn't set this flag as `true` since they should both support -march= and cpu_specific/dispatch feature by default. FreddyYe: I assign them by following orders: 1. Copy the mangling from the original CPU_SPEICIFC MACRO. 2.
		RKSimonUnsubmitted Done Reply Inline Actions OK - please can you add that to a comment above line 311 for future reference? RKSimon: OK - please can you add that to a comment above line 311 for future reference?
// Haswell microarchitecture based processors.		// Haswell microarchitecture based processors.
{ {"haswell"}, CK_Haswell, FEATURE_AVX2, FeaturesHaswell },		{ {"haswell"}, CK_Haswell, FEATURE_AVX2, FeaturesHaswell, 'V', false },
{ {"core-avx2"}, CK_Haswell, FEATURE_AVX2, FeaturesHaswell },		{ {"core-avx2"}, CK_Haswell, FEATURE_AVX2, FeaturesHaswell, 'V', false },
		{ {"core_4th_gen_avx"}, CK_Haswell, FEATURE_AVX2, FeaturesHaswell, 'V', true },
		{ {"core_4th_gen_avx_tsx"}, CK_Haswell, FEATURE_AVX2, FeaturesHaswell, 'V', true },
// Broadwell microarchitecture based processors.		// Broadwell microarchitecture based processors.
{ {"broadwell"}, CK_Broadwell, FEATURE_AVX2, FeaturesBroadwell },		{ {"broadwell"}, CK_Broadwell, FEATURE_AVX2, FeaturesBroadwell, 'X', false },
		{ {"core_5th_gen_avx"}, CK_Broadwell, FEATURE_AVX2, FeaturesBroadwell, 'X', true },
		{ {"core_5th_gen_avx_tsx"}, CK_Broadwell, FEATURE_AVX2, FeaturesBroadwell, 'X', true },
// Skylake client microarchitecture based processors.		// Skylake client microarchitecture based processors.
{ {"skylake"}, CK_SkylakeClient, FEATURE_AVX2, FeaturesSkylakeClient },		{ {"skylake"}, CK_SkylakeClient, FEATURE_AVX2, FeaturesSkylakeClient, 'b', false },
// Skylake server microarchitecture based processors.		// Skylake server microarchitecture based processors.
{ {"skylake-avx512"}, CK_SkylakeServer, FEATURE_AVX512F, FeaturesSkylakeServer },		{ {"skylake-avx512"}, CK_SkylakeServer, FEATURE_AVX512F, FeaturesSkylakeServer, 'a', false },
{ {"skx"}, CK_SkylakeServer, FEATURE_AVX512F, FeaturesSkylakeServer },		{ {"skx"}, CK_SkylakeServer, FEATURE_AVX512F, FeaturesSkylakeServer, 'a', false },
		{ {"skylake_avx512"}, CK_SkylakeServer, FEATURE_AVX512F, FeaturesSkylakeServer, 'a', true },
// Cascadelake Server microarchitecture based processors.		// Cascadelake Server microarchitecture based processors.
{ {"cascadelake"}, CK_Cascadelake, FEATURE_AVX512VNNI, FeaturesCascadeLake },		{ {"cascadelake"}, CK_Cascadelake, FEATURE_AVX512VNNI, FeaturesCascadeLake, 'o', false },
// Cooperlake Server microarchitecture based processors.		// Cooperlake Server microarchitecture based processors.
{ {"cooperlake"}, CK_Cooperlake, FEATURE_AVX512BF16, FeaturesCooperLake },		{ {"cooperlake"}, CK_Cooperlake, FEATURE_AVX512BF16, FeaturesCooperLake, 'o', false },
// Cannonlake client microarchitecture based processors.		// Cannonlake client microarchitecture based processors.
{ {"cannonlake"}, CK_Cannonlake, FEATURE_AVX512VBMI, FeaturesCannonlake },		{ {"cannonlake"}, CK_Cannonlake, FEATURE_AVX512VBMI, FeaturesCannonlake, 'e', false },
// Icelake client microarchitecture based processors.		// Icelake client microarchitecture based processors.
{ {"icelake-client"}, CK_IcelakeClient, FEATURE_AVX512VBMI2, FeaturesICLClient },		{ {"icelake-client"}, CK_IcelakeClient, FEATURE_AVX512VBMI2, FeaturesICLClient, 'k', false },
		{ {"icelake_client"}, CK_IcelakeClient, FEATURE_AVX512VBMI2, FeaturesICLClient, 'k', true },
// Rocketlake microarchitecture based processors.		// Rocketlake microarchitecture based processors.
{ {"rocketlake"}, CK_Rocketlake, FEATURE_AVX512VBMI2, FeaturesRocketlake },		{ {"rocketlake"}, CK_Rocketlake, FEATURE_AVX512VBMI2, FeaturesRocketlake, 'k', false },
// Icelake server microarchitecture based processors.		// Icelake server microarchitecture based processors.
{ {"icelake-server"}, CK_IcelakeServer, FEATURE_AVX512VBMI2, FeaturesICLServer },		{ {"icelake-server"}, CK_IcelakeServer, FEATURE_AVX512VBMI2, FeaturesICLServer, 'k', false },
		{ {"icelake_server"}, CK_IcelakeServer, FEATURE_AVX512VBMI2, FeaturesICLServer, 'k', true },
// Tigerlake microarchitecture based processors.		// Tigerlake microarchitecture based processors.
{ {"tigerlake"}, CK_Tigerlake, FEATURE_AVX512VP2INTERSECT, FeaturesTigerlake },		{ {"tigerlake"}, CK_Tigerlake, FEATURE_AVX512VP2INTERSECT, FeaturesTigerlake, 'l', false },
// Sapphire Rapids microarchitecture based processors.		// Sapphire Rapids microarchitecture based processors.
{ {"sapphirerapids"}, CK_SapphireRapids, FEATURE_AVX512BF16, FeaturesSapphireRapids },		{ {"sapphirerapids"}, CK_SapphireRapids, FEATURE_AVX512BF16, FeaturesSapphireRapids, 'n', false },
// Alderlake microarchitecture based processors.		// Alderlake microarchitecture based processors.
{ {"alderlake"}, CK_Alderlake, FEATURE_AVX2, FeaturesAlderlake },		{ {"alderlake"}, CK_Alderlake, FEATURE_AVX2, FeaturesAlderlake, 'p', false },
// Raptorlake microarchitecture based processors.		// Raptorlake microarchitecture based processors.
{ {"raptorlake"}, CK_Raptorlake, FEATURE_AVX2, FeaturesAlderlake },		{ {"raptorlake"}, CK_Raptorlake, FEATURE_AVX2, FeaturesAlderlake, 'p', false },
// Meteorlake microarchitecture based processors.		// Meteorlake microarchitecture based processors.
{ {"meteorlake"}, CK_Meteorlake, FEATURE_AVX2, FeaturesAlderlake },		{ {"meteorlake"}, CK_Meteorlake, FEATURE_AVX2, FeaturesAlderlake, 'p', false },
// Sierraforest microarchitecture based processors.		// Sierraforest microarchitecture based processors.
{ {"sierraforest"}, CK_Sierraforest, FEATURE_AVX2, FeaturesSierraforest },		{ {"sierraforest"}, CK_Sierraforest, FEATURE_AVX2, FeaturesSierraforest, 'p', false },
// Grandridge microarchitecture based processors.		// Grandridge microarchitecture based processors.
{ {"grandridge"}, CK_Grandridge, FEATURE_AVX2, FeaturesGrandridge },		{ {"grandridge"}, CK_Grandridge, FEATURE_AVX2, FeaturesGrandridge, 'p', false },
// Granite Rapids microarchitecture based processors.		// Granite Rapids microarchitecture based processors.
{ {"graniterapids"}, CK_Graniterapids, FEATURE_AVX512BF16, FeaturesGraniteRapids },		{ {"graniterapids"}, CK_Graniterapids, FEATURE_AVX512BF16, FeaturesGraniteRapids, 'n', false },
// Emerald Rapids microarchitecture based processors.		// Emerald Rapids microarchitecture based processors.
{ {"emeraldrapids"}, CK_Emeraldrapids, FEATURE_AVX512BF16, FeaturesSapphireRapids },		{ {"emeraldrapids"}, CK_Emeraldrapids, FEATURE_AVX512BF16, FeaturesSapphireRapids, 'n', false },
// Knights Landing processor.		// Knights Landing processor.
{ {"knl"}, CK_KNL, FEATURE_AVX512F, FeaturesKNL },		{ {"knl"}, CK_KNL, FEATURE_AVX512F, FeaturesKNL, 'Z', false },
		{ {"mic_avx512"}, CK_KNL, FEATURE_AVX512F, FeaturesKNL, 'Z', true },
// Knights Mill processor.		// Knights Mill processor.
{ {"knm"}, CK_KNM, FEATURE_AVX5124FMAPS, FeaturesKNM },		{ {"knm"}, CK_KNM, FEATURE_AVX5124FMAPS, FeaturesKNM, 'j', false },
// Lakemont microarchitecture based processors.		// Lakemont microarchitecture based processors.
{ {"lakemont"}, CK_Lakemont, ~0U, FeatureCMPXCHG8B },		{ {"lakemont"}, CK_Lakemont, ~0U, FeatureCMPXCHG8B, '\0', false },
// K6 architecture processors.		// K6 architecture processors.
{ {"k6"}, CK_K6, ~0U, FeaturesK6 },		{ {"k6"}, CK_K6, ~0U, FeaturesK6, '\0', false },
{ {"k6-2"}, CK_K6_2, ~0U, FeaturesK6 \| Feature3DNOW },		{ {"k6-2"}, CK_K6_2, ~0U, FeaturesK6 \| Feature3DNOW, '\0', false },
{ {"k6-3"}, CK_K6_3, ~0U, FeaturesK6 \| Feature3DNOW },		{ {"k6-3"}, CK_K6_3, ~0U, FeaturesK6 \| Feature3DNOW, '\0', false },
// K7 architecture processors.		// K7 architecture processors.
{ {"athlon"}, CK_Athlon, ~0U, FeaturesAthlon },		{ {"athlon"}, CK_Athlon, ~0U, FeaturesAthlon, '\0', false },
{ {"athlon-tbird"}, CK_Athlon, ~0U, FeaturesAthlon },		{ {"athlon-tbird"}, CK_Athlon, ~0U, FeaturesAthlon, '\0', false },
{ {"athlon-xp"}, CK_AthlonXP, ~0U, FeaturesAthlonXP },		{ {"athlon-xp"}, CK_AthlonXP, ~0U, FeaturesAthlonXP, '\0', false },
{ {"athlon-mp"}, CK_AthlonXP, ~0U, FeaturesAthlonXP },		{ {"athlon-mp"}, CK_AthlonXP, ~0U, FeaturesAthlonXP, '\0', false },
{ {"athlon-4"}, CK_AthlonXP, ~0U, FeaturesAthlonXP },		{ {"athlon-4"}, CK_AthlonXP, ~0U, FeaturesAthlonXP, '\0', false },
// K8 architecture processors.		// K8 architecture processors.
{ {"k8"}, CK_K8, ~0U, FeaturesK8 },		{ {"k8"}, CK_K8, ~0U, FeaturesK8, '\0', false },
{ {"athlon64"}, CK_K8, ~0U, FeaturesK8 },		{ {"athlon64"}, CK_K8, ~0U, FeaturesK8, '\0', false },
{ {"athlon-fx"}, CK_K8, ~0U, FeaturesK8 },		{ {"athlon-fx"}, CK_K8, ~0U, FeaturesK8, '\0', false },
{ {"opteron"}, CK_K8, ~0U, FeaturesK8 },		{ {"opteron"}, CK_K8, ~0U, FeaturesK8, '\0', false },
{ {"k8-sse3"}, CK_K8SSE3, ~0U, FeaturesK8SSE3 },		{ {"k8-sse3"}, CK_K8SSE3, ~0U, FeaturesK8SSE3, '\0', false },
{ {"athlon64-sse3"}, CK_K8SSE3, ~0U, FeaturesK8SSE3 },		{ {"athlon64-sse3"}, CK_K8SSE3, ~0U, FeaturesK8SSE3, '\0', false },
{ {"opteron-sse3"}, CK_K8SSE3, ~0U, FeaturesK8SSE3 },		{ {"opteron-sse3"}, CK_K8SSE3, ~0U, FeaturesK8SSE3, '\0', false },
{ {"amdfam10"}, CK_AMDFAM10, FEATURE_SSE4_A, FeaturesAMDFAM10 },		{ {"amdfam10"}, CK_AMDFAM10, FEATURE_SSE4_A, FeaturesAMDFAM10, '\0', false },
{ {"barcelona"}, CK_AMDFAM10, FEATURE_SSE4_A, FeaturesAMDFAM10 },		{ {"barcelona"}, CK_AMDFAM10, FEATURE_SSE4_A, FeaturesAMDFAM10, '\0', false },
// Bobcat architecture processors.		// Bobcat architecture processors.
{ {"btver1"}, CK_BTVER1, FEATURE_SSE4_A, FeaturesBTVER1 },		{ {"btver1"}, CK_BTVER1, FEATURE_SSE4_A, FeaturesBTVER1, '\0', false },
{ {"btver2"}, CK_BTVER2, FEATURE_BMI, FeaturesBTVER2 },		{ {"btver2"}, CK_BTVER2, FEATURE_BMI, FeaturesBTVER2, '\0', false },
// Bulldozer architecture processors.		// Bulldozer architecture processors.
{ {"bdver1"}, CK_BDVER1, FEATURE_XOP, FeaturesBDVER1 },		{ {"bdver1"}, CK_BDVER1, FEATURE_XOP, FeaturesBDVER1, '\0', false },
{ {"bdver2"}, CK_BDVER2, FEATURE_FMA, FeaturesBDVER2 },		{ {"bdver2"}, CK_BDVER2, FEATURE_FMA, FeaturesBDVER2, '\0', false },
{ {"bdver3"}, CK_BDVER3, FEATURE_FMA, FeaturesBDVER3 },		{ {"bdver3"}, CK_BDVER3, FEATURE_FMA, FeaturesBDVER3, '\0', false },
{ {"bdver4"}, CK_BDVER4, FEATURE_AVX2, FeaturesBDVER4 },		{ {"bdver4"}, CK_BDVER4, FEATURE_AVX2, FeaturesBDVER4, '\0', false },
// Zen architecture processors.		// Zen architecture processors.
{ {"znver1"}, CK_ZNVER1, FEATURE_AVX2, FeaturesZNVER1 },		{ {"znver1"}, CK_ZNVER1, FEATURE_AVX2, FeaturesZNVER1, '\0', false },
{ {"znver2"}, CK_ZNVER2, FEATURE_AVX2, FeaturesZNVER2 },		{ {"znver2"}, CK_ZNVER2, FEATURE_AVX2, FeaturesZNVER2, '\0', false },
{ {"znver3"}, CK_ZNVER3, FEATURE_AVX2, FeaturesZNVER3 },		{ {"znver3"}, CK_ZNVER3, FEATURE_AVX2, FeaturesZNVER3, '\0', false },
{ {"znver4"}, CK_ZNVER4, FEATURE_AVX512VBMI2, FeaturesZNVER4 },		{ {"znver4"}, CK_ZNVER4, FEATURE_AVX512VBMI2, FeaturesZNVER4, '\0', false },
// Generic 64-bit processor.		// Generic 64-bit processor.
{ {"x86-64"}, CK_x86_64, ~0U, FeaturesX86_64 },		{ {"x86-64"}, CK_x86_64, ~0U, FeaturesX86_64, '\0', false },
{ {"x86-64-v2"}, CK_x86_64_v2, ~0U, FeaturesX86_64_V2 },		{ {"x86-64-v2"}, CK_x86_64_v2, ~0U, FeaturesX86_64_V2, '\0', false },
{ {"x86-64-v3"}, CK_x86_64_v3, ~0U, FeaturesX86_64_V3 },		{ {"x86-64-v3"}, CK_x86_64_v3, ~0U, FeaturesX86_64_V3, '\0', false },
{ {"x86-64-v4"}, CK_x86_64_v4, ~0U, FeaturesX86_64_V4 },		{ {"x86-64-v4"}, CK_x86_64_v4, ~0U, FeaturesX86_64_V4, '\0', false },
// Geode processors.		// Geode processors.
{ {"geode"}, CK_Geode, ~0U, FeaturesGeode },		{ {"geode"}, CK_Geode, ~0U, FeaturesGeode, '\0', false },
};		};

constexpr const char *NoTuneList[] = {"x86-64-v2", "x86-64-v3", "x86-64-v4"};		constexpr const char *NoTuneList[] = {"x86-64-v2", "x86-64-v3", "x86-64-v4"};

X86::CPUKind llvm::X86::parseArchX86(StringRef CPU, bool Only64Bit) {		X86::CPUKind llvm::X86::parseArchX86(StringRef CPU, bool Only64Bit) {
for (const auto &P : Processors)		for (const auto &P : Processors)
if (P.Name == CPU && (P.Features[FEATURE_64BIT] \|\| !Only64Bit))		if (!P.OnlyForCPUDispatchSpecific && P.Name == CPU &&
		(P.Features[FEATURE_64BIT] \|\| !Only64Bit))
return P.Kind;		return P.Kind;

return CK_None;		return CK_None;
}		}

X86::CPUKind llvm::X86::parseTuneCPU(StringRef CPU, bool Only64Bit) {		X86::CPUKind llvm::X86::parseTuneCPU(StringRef CPU, bool Only64Bit) {
if (llvm::is_contained(NoTuneList, CPU))		if (llvm::is_contained(NoTuneList, CPU))
return CK_None;		return CK_None;
return parseArchX86(CPU, Only64Bit);		return parseArchX86(CPU, Only64Bit);
}		}

void llvm::X86::fillValidCPUArchList(SmallVectorImpl<StringRef> &Values,		void llvm::X86::fillValidCPUArchList(SmallVectorImpl<StringRef> &Values,
bool Only64Bit) {		bool Only64Bit) {
for (const auto &P : Processors)		for (const auto &P : Processors)
if (!P.Name.empty() && (P.Features[FEATURE_64BIT] \|\| !Only64Bit))		if (!P.OnlyForCPUDispatchSpecific && !P.Name.empty() &&
		(P.Features[FEATURE_64BIT] \|\| !Only64Bit))
Values.emplace_back(P.Name);		Values.emplace_back(P.Name);
}		}

void llvm::X86::fillValidTuneCPUList(SmallVectorImpl<StringRef> &Values,		void llvm::X86::fillValidTuneCPUList(SmallVectorImpl<StringRef> &Values,
bool Only64Bit) {		bool Only64Bit) {
for (const ProcInfo &P : Processors)		for (const ProcInfo &P : Processors)
if (!P.Name.empty() && (P.Features[FEATURE_64BIT] \|\| !Only64Bit) &&		if (!P.OnlyForCPUDispatchSpecific && !P.Name.empty() &&
		(P.Features[FEATURE_64BIT] \|\| !Only64Bit) &&
!llvm::is_contained(NoTuneList, P.Name))		!llvm::is_contained(NoTuneList, P.Name))
Values.emplace_back(P.Name);		Values.emplace_back(P.Name);
}		}

ProcessorFeatures llvm::X86::getKeyFeature(X86::CPUKind Kind) {		ProcessorFeatures llvm::X86::getKeyFeature(X86::CPUKind Kind) {
// FIXME: Can we avoid a linear search here? The table might be sorted by		// FIXME: Can we avoid a linear search here? The table might be sorted by
// CPUKind so we could binary search?		// CPUKind so we could binary search?
for (const auto &P : Processors) {		for (const auto &P : Processors) {
▲ Show 20 Lines • Show All 143 Lines • ▼ Show 20 Lines
// AVXVNNI Features		// AVXVNNI Features
constexpr FeatureBitset ImpliedFeaturesAVXVNNI = FeatureAVX2;		constexpr FeatureBitset ImpliedFeaturesAVXVNNI = FeatureAVX2;

constexpr FeatureInfo FeatureInfos[X86::CPU_FEATURE_MAX] = {		constexpr FeatureInfo FeatureInfos[X86::CPU_FEATURE_MAX] = {
#define X86_FEATURE(ENUM, STR) {{STR}, ImpliedFeatures##ENUM},		#define X86_FEATURE(ENUM, STR) {{STR}, ImpliedFeatures##ENUM},
#include "llvm/TargetParser/X86TargetParser.def"		#include "llvm/TargetParser/X86TargetParser.def"
};		};

		constexpr FeatureInfo FeatureInfos_WithPLUS[X86::CPU_FEATURE_MAX] = {
		#define X86_FEATURE(ENUM, STR) {{"+" STR}, ImpliedFeatures##ENUM},
		#include "llvm/TargetParser/X86TargetParser.def"
		};

void llvm::X86::getFeaturesForCPU(StringRef CPU,		void llvm::X86::getFeaturesForCPU(StringRef CPU,
SmallVectorImpl<StringRef> &EnabledFeatures) {		SmallVectorImpl<StringRef> &EnabledFeatures,
		bool IfNeedPlus) {
auto I = llvm::find_if(Processors,		auto I = llvm::find_if(Processors,
[&](const ProcInfo &P) { return P.Name == CPU; });		[&](const ProcInfo &P) { return P.Name == CPU; });
assert(I != std::end(Processors) && "Processor not found!");		assert(I != std::end(Processors) && "Processor not found!");

FeatureBitset Bits = I->Features;		FeatureBitset Bits = I->Features;

// Remove the 64-bit feature which we only use to validate if a CPU can		// Remove the 64-bit feature which we only use to validate if a CPU can
// be used with 64-bit mode.		// be used with 64-bit mode.
Bits &= ~Feature64BIT;		Bits &= ~Feature64BIT;

// Add the string version of all set bits.		// Add the string version of all set bits.
for (unsigned i = 0; i != CPU_FEATURE_MAX; ++i)		for (unsigned i = 0; i != CPU_FEATURE_MAX; ++i)
if (Bits[i] && !FeatureInfos[i].Name.empty())		if (Bits[i] && !FeatureInfos[i].Name.empty() &&
EnabledFeatures.push_back(FeatureInfos[i].Name);		!FeatureInfos_WithPLUS[i].Name.empty()){
		EnabledFeatures.push_back(IfNeedPlus ? FeatureInfos_WithPLUS[i].Name
		: FeatureInfos[i].Name);
		}
}		}

// For each feature that is (transitively) implied by this feature, set it.		// For each feature that is (transitively) implied by this feature, set it.
static void getImpliedEnabledFeatures(FeatureBitset &Bits,		static void getImpliedEnabledFeatures(FeatureBitset &Bits,
const FeatureBitset &Implies) {		const FeatureBitset &Implies) {
// Fast path: Implies is often empty.		// Fast path: Implies is often empty.
if (!Implies.any())		if (!Implies.any())
return;		return;
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines	getImpliedDisabledFeatures(ImpliedBits,
std::distance(std::begin(FeatureInfos), I));		std::distance(std::begin(FeatureInfos), I));

// Update the map entry for all implied features.		// Update the map entry for all implied features.
for (unsigned i = 0; i != CPU_FEATURE_MAX; ++i)		for (unsigned i = 0; i != CPU_FEATURE_MAX; ++i)
if (ImpliedBits[i] && !FeatureInfos[i].Name.empty())		if (ImpliedBits[i] && !FeatureInfos[i].Name.empty())
Features[FeatureInfos[i].Name] = Enabled;		Features[FeatureInfos[i].Name] = Enabled;
}		}

		char llvm::X86::getCPUDispatchMangling(StringRef CPU) {
		auto I = llvm::find_if(Processors,
		[&](const ProcInfo &P) { return P.Name == CPU; });
		assert(I != std::end(Processors) && "Processor not found!");
		assert(I->Mangling != '\0' && "Processor dooesn't support function multiversion!");
		return I->Mangling;
		}

		bool llvm::X86::validateCPUSpecificCPUDispatch(StringRef Name) {
		auto I = llvm::find_if(Processors,
		[&](const ProcInfo &P) { return P.Name == Name; });
		return I != std::end(Processors);
		}

uint64_t llvm::X86::getCpuSupportsMask(ArrayRef<StringRef> FeatureStrs) {		uint64_t llvm::X86::getCpuSupportsMask(ArrayRef<StringRef> FeatureStrs) {
// Processor features and mapping to processor feature value.		// Processor features and mapping to processor feature value.
uint64_t FeaturesMask = 0;		uint64_t FeaturesMask = 0;
for (const StringRef &FeatureStr : FeatureStrs) {		for (const StringRef &FeatureStr : FeatureStrs) {
unsigned Feature = StringSwitch<unsigned>(FeatureStr)		unsigned Feature = StringSwitch<unsigned>(FeatureStr)
#define X86_FEATURE_COMPAT(ENUM, STR, PRIORITY) \		#define X86_FEATURE_COMPAT(ENUM, STR, PRIORITY) \
.Case(STR, llvm::X86::FEATURE_##ENUM)		.Case(STR, llvm::X86::FEATURE_##ENUM)
#include "llvm/TargetParser/X86TargetParser.def"		#include "llvm/TargetParser/X86TargetParser.def"
Show All 33 Lines

llvm/test/CodeGen/X86/cpus-intel.ll

	; Test that the CPU names work.			; Test that the CPU names work.
	; CHECK-NO-ERROR-NOT: not a recognized processor for this target			; CHECK-NO-ERROR-NOT: not a recognized processor for this target

	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=i386 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=i386 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=i486 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=i486 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=i586 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=i586 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium-mmx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium-mmx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_mmx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				RKSimonUnsubmitted Done Reply Inline Actions I meant like this: ; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium-mmx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty ; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_mmx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty The idea is to keep RUN lines that test the equivalent cpus together, so its easier for any future edits to handle them together RKSimon: I meant like this: ``` ; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium…
				FreddyYeAuthorUnsubmitted Done Reply Inline Actions Woops, I really misunderstood. Updated. FreddyYe: Woops, I really misunderstood. Updated.
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=i686 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=i686 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentiumpro 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentiumpro 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_pro 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium2 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium2 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_ii 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium3 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium3 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium3m 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium3m 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_iii 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_iii_no_xmm_regs 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_m 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium-m 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium-m 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium4 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium4 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium4m 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium4m 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_4 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=yonah 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=yonah 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=prescott 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=prescott 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=pentium_4_sse3 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=lakemont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=lakemont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=raptorlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=raptorlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=meteorlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=meteorlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=sierraforest 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=sierraforest 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=grandridge 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=grandridge 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=graniterapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=graniterapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=emeraldrapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=i686-unknown-unknown -mcpu=emeraldrapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty

				RKSimonUnsubmitted Done Reply Inline Actions Keeping all the aliased cpus variants RUN together will make maintenance easier (e.g. pentium_mmx) RKSimon: Keeping all the aliased cpus variants RUN together will make maintenance easier (e.g.
				FreddyYeAuthorUnsubmitted Done Reply Inline Actions Updated, not sure if I understood your point. pls review again. FreddyYe: Updated, not sure if I understood your point. pls review again.
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=nocona 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=nocona 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core2 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core2 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_2_duo_ssse3 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=penryn 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=penryn 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_2_duo_sse4_1 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=nehalem 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=nehalem 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=corei7 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=corei7 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_i7_sse4_2 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=westmere 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=westmere 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_aes_pclmulqdq 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_5th_gen_avx_tsx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=sandybridge 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=sandybridge 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=corei7-avx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=corei7-avx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_2nd_gen_avx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=ivybridge 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=ivybridge 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core-avx-i 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core-avx-i 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_3rd_gen_avx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=haswell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=haswell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core-avx2 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core-avx2 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_4th_gen_avx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_4th_gen_avx_tsx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=broadwell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=broadwell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=core_5th_gen_avx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skylake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skylake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skylake-avx512 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skylake-avx512 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skylake_avx512 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cascadelake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cascadelake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cooperlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cooperlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cannonlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cannonlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake-client 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake-client 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake_client 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=rocketlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=rocketlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake-server 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake-server 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake_server 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=tigerlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=tigerlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=sapphirerapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=sapphirerapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=alderlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=alderlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=atom 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=atom 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=bonnell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=bonnell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=silvermont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=silvermont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=slm 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=slm 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=atom_sse4_2 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=goldmont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=goldmont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=goldmont-plus 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=goldmont-plus 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=goldmont_plus 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=tremont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=tremont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=knl 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=knl 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=mic_avx512 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=knm 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=knm 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=raptorlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=raptorlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=meteorlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=meteorlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=sierraforest 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=sierraforest 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=grandridge 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=grandridge 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=graniterapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=graniterapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=atom_sse4_2_movbe 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				RKSimonUnsubmitted Not Done Reply Inline Actions put this with the goldmonth checks? RKSimon: put this with the goldmonth checks?
				FreddyYeAuthorUnsubmitted Done Reply Inline Actions I'll do the adjust. BTW I'd like also to mention below: Here are the rules I used to add features/tuning model in X86.td for missing cpu names in CPU_SPECIFIC Macros: RULE 1: Infer from the name itself. E.g. pentium_4 -> pentium4, pentium_iii -> pentium3 RULE 2: CPU_SPECIFIC_ALIAS can tell me its old name. Using the old name to continue to infer. RULE 3: Use the feature list in CPU_SPECIFIC to compare and append with existing CPU names: pentium_4_sse3 -> prescott core_2_duo_ssse3 -> core2 core_2_duo_sse4_1 -> penryn atom_sse4_2 -> silvermont core_i7_sse4_2 -> nehalem core_aes_pclmulqdq -> westmere core_5th_gen_avx_tsx -> broadwell There is only one exception: atom_sse4_2_movbe. For now I set it as goldmont features and silvermont tuning model according to its definition: CPU_SPECIFIC("atom_sse4_2_movbe", "silvermont", 'd', "+cmov,+mmx,+sse,+sse2,+sse3,+ssse3,+sse4.1,+sse4.2,+movbe,+popcnt") FreddyYe: I'll do the adjust. BTW I'd like also to mention below: Here are the rules I used to add…

	define void @foo() {			define void @foo() {
	ret void			ret void
	}			}

This is an archive of the discontinued LLVM Phabricator instance.

[X86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchManglingClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 532517

clang/lib/Basic/Targets/X86.h

clang/lib/Basic/Targets/X86.cpp

clang/lib/CodeGen/CodeGenModule.cpp

clang/test/CodeGen/attr-cpuspecific-avx-abi.c

clang/test/CodeGen/attr-cpuspecific.c

llvm/include/llvm/TargetParser/X86TargetParser.h

llvm/include/llvm/TargetParser/X86TargetParser.def

llvm/lib/Target/X86/X86.td

llvm/lib/TargetParser/X86TargetParser.cpp

llvm/test/CodeGen/X86/cpus-intel.ll

[X86] Remove CPU_SPECIFIC* MACROs and add getCPUDispatchMangling
ClosedPublic