This is an archive of the discontinued LLVM Phabricator instance.

clang/test/Preprocessor/predefined-arch-macros.c
1404	The file may need some refactoring first. You can let RUN lines share some common check prefixes, instead of adding a bunch of defines for every new processor. // CHECK_X86_64_V2: ... // CHECK_X86_64_V2: ... // CHECK_X86_64_V3: ... // CHECK_PROCESSOR1_M32: // CHECK_PROCESSOR1_M64: // CHECK_PROCESSOR2_M32: // CHECK_PROCESSOR2_M64:

Harbormaster completed remote builds in B97642: Diff 335986.Apr 7 2021, 9:15 PM

craig.topper added inline comments.Apr 7 2021, 10:00 PM

compiler-rt/lib/builtins/cpu_model.c
101	This order is defined by libgcc. We can't insert in the middle unless ZNVER3 was in the wrong place Why this not referenced in the switch the select subtype?
llvm/lib/Target/X86/X86.td
767	Is this list this long because SKL includes SGX but RKL doesn't?

THX for review!

clang/test/Preprocessor/predefined-arch-macros.c
1404	I agree. I'll do it
compiler-rt/lib/builtins/cpu_model.c
101	This is a mistake. I'll modify. And reference is missing in two switch. I'll add.
llvm/lib/Target/X86/X86.td
767	Yes. And I don't know any simple ways to exclude SGX here, any suggestions?

Updating according to comments. Test refactoring not done.

Hi @MaskRay, I tried to refactor, but met some difficulties. Since these defines are dictionary ordered, a new #define may insert into a common CHECK. So it is difficult to let different RUN share common CHECKs.

// RUN: %clang -march=pentium-mmx -m32 -E -dM %s -o - 2>&1 \
// RUN:     -target i386-unknown-linux \
// RUN:   | FileCheck -match-full-lines %s -check-prefixes=CHECK_I386_M32,CHECK_I586_M32,CHECK_PENTIUM_MMX_M32

// CHECK_I386_M32: #define __i386 1
// CHECK_I386_M32: #define __i386__ 1
// CHECK_I386_M32: #define i386 1

// CHECK_I586_M32: #define __i586 1
// CHECK_I586_M32: #define __i586__ 1
// CHECK_I586_M32: #define __pentium 1
// CHECK_I586_M32: #define __pentium__ 1
// CHECK_I586_M32: #define __tune_i586__ 1
// CHECK_I586_M32: #define __tune_pentium__ 1

// CHECK_PENTIUM_MMX_M32: #define __MMX__ 1
// CHECK_PENTIUM_MMX_M32: #define __pentium_mmx__ 1
// CHECK_PENTIUM_MMX_M32: #define __tune_pentium_mmx__ 1

The example above will destroy the original order of the define list. Do you have some good suggestions?

In D100085#2675919, @FreddyYe wrote:

Hi @MaskRay, I tried to refactor, but met some difficulties. Since these defines are dictionary ordered, a new #define may insert into a common CHECK. So it is difficult to let different RUN share common CHECKs.

Check prefixes of the same kind do not need to be contiguous.

// A:
// B:
// A:

Harbormaster completed remote builds in B97657: Diff 336007.Apr 7 2021, 11:55 PM

In D100085#2675965, @MaskRay wrote:
In D100085#2675919, @FreddyYe wrote:

Hi @MaskRay, I tried to refactor, but met some difficulties. Since these defines are dictionary ordered, a new #define may insert into a common CHECK. So it is difficult to let different RUN share common CHECKs.

Check prefixes of the same kind do not need to be contiguous.
// A:
// B:
// A:

I see. And It works. And I'll update. And thank you!

RKSimon added a subscriber: RKSimon.Apr 8 2021, 1:59 AM

RKSimon added inline comments.

llvm/lib/Target/X86/X86.td
790	Using ICLTuning suggests we should still be avoiding 512-bit ops (FeaturePrefer256Bit) - is this still true for RKL (or anything past CNL...)? I posted PR48336 but never got any response, but from what others have reported (Travis Downs, Phoronix etc) its mainly a power issue these days, not a perf issue due to big freq drops.

update lit test and clang-format

Herald added a subscriber: jfb. · View Herald TranscriptApr 8 2021, 8:44 AM

Hi @MaskRay , I tried to refactor the test file by assembling the common CHECKs. But I found it will lead to too many check-prefixes ine one RUN line. For example,

// RUN: %clang -march=c3 -m32 -E -dM %s -o - 2>&1 \
// RUN:     -target i386-unknown-linux \
// RUN:   | FileCheck -match-full-lines %s -check-prefixes=CHECK_I386_M32C,CHECK_I486_M32C,CHECK_I486_M32S,CHECK_PENTIUM_MMX_M32S,CHECK_WINCHIP2_M32S

// CHECK_WINCHIP2_M32S: #define __3dNOW__ 1
// CHECK_PENTIUM_MMX_M32S:    #define __MMX__ 1
// CHECK_WINCHIP_C6_M32S: #define __MMX__ 1
// CHECK_I386_M32C:             #define __i386 1
// CHECK_I386_M32C:             #define __i386__ 1
// CHECK_I486_M32C:             #define __i486 1
// CHECK_I486_M32C:             #define __i486__ 1
// CHECK_I586_M32C:             #define __i586 1
// CHECK_I586_M32C:             #define __i586__ 1
// CHECK_I586_M32C:             #define __pentium 1
// CHECK_I586_M32C:             #define __pentium__ 1
// CHECK_PENTIUM_MMX_M32S:      #define __pentium_mmx__ 1
// CHECK_I386_M32S:         #define __tune_i386__ 1
// CHECK_I486_M32S:         #define __tune_i486__ 1
// CHECK_PENTIUM_MMX_M32S:      #define __tune_pentium_mmx__ 1
// CHECK_I386_M32C:             #define i386 1

And that CPU is still a very old CPU. It is easy to imagine how long for example skylake's RUN line is. Generally speaking, X86's ISA evolution between different chips is not very clear. So this refactoring work is beyond my ETA. I uploaded a new version to reuse the most similar CHECKS. Does that look good to you?

Harbormaster completed remote builds in B97754: Diff 336133.Apr 8 2021, 9:44 AM

craig.topper added inline comments.Apr 8 2021, 10:20 AM

llvm/lib/Target/X86/X86.td
767	Nothing pretty. Guess it depends on if SGX is going to not appear in more future CPUs or if this is a one off case. If it's going to continue then we could remove it from the inheritance and just give it to SKL, ICL, CNL, etc. individually. Or we could just not default SGX on for any CPU. It's probably not all that useful in the backend anyway. Clang will put it in the target-feature attribute anyway. Having it in the backend feature lists doesn't really do anything since I don't think we have any IR intrinsics for SGX.

FreddyYe added inline comments.Apr 8 2021, 6:58 PM

llvm/lib/Target/X86/X86.td
790	We need more tests on such as SPEC to see whether we can default enable FeaturePrefer512bit.

FreddyYe added inline comments.Apr 8 2021, 7:05 PM

llvm/lib/Target/X86/X86.td
767	Agree. Like we did in https://reviews.llvm.org/D88006. SGX is also not useful in the backend.

delete FeatureSGX in the backend since there are no IR intrinsics for SGX.

skan added inline comments.Apr 8 2021, 7:37 PM

llvm/lib/Support/X86TargetParser.cpp
414–415	It's not correct to format here in this patch and do not mix tab with space.

craig.topper added inline comments.Apr 8 2021, 7:46 PM

llvm/lib/Target/X86/X86.td
271	Clang still puts it in target-features attribute so you can’t delete this or you’ll get a warning that the feature doesn’t exist.

skan added inline comments.Apr 8 2021, 7:49 PM

llvm/lib/Target/X86/X86.td
271–273	If you delete the definition of FeatureSGX, you need to remove the related code in X86Subtarget.h too. BTW, I don't think "there are no IR intrinsics for a feature" is a good reason to remove a feature.

craig.topper added inline comments.Apr 8 2021, 7:58 PM

llvm/lib/Target/X86/X86.td
271–273	I only said to remove it from the CPUs because for llc -march=skylake it doesn’t matter if we enable SGX because there’s nothing you can test from llc.

cancel the clang-format in constexpr ProcInfo Processors[] = {}

Harbormaster completed remote builds in B97865: Diff 336295.Apr 8 2021, 8:10 PM

revert clang-format and revert deleting FeatureSGX def.

Hi @craig.topper and @skan , THX for review! I tested that deleting sgx indeed leads to not generating "+sgx" in 'target-features', didn't know before:)

'+sgx' is not a recognized feature for this target (ignoring feature)

Harbormaster completed remote builds in B97873: Diff 336303.Apr 8 2021, 9:04 PM

Harbormaster completed remote builds in B97872: Diff 336301.Apr 8 2021, 9:33 PM

craig.topper added inline comments.Apr 9 2021, 9:48 AM

llvm/lib/Target/X86/X86.td
741–742	I'm not sure that rocketlake has CLWB. Can you double check that? It's not listed in the cpuinfo dump on the 11700K that I found with a google search here https://www.pugetsystems.com/labs/hpc/Intel-Rocket-Lake-Compute-Performance-Results-HPL-HPCG-NAMD-and-Numpy-2116/

FreddyYe added inline comments.Apr 11 2021, 2:49 AM

llvm/lib/Target/X86/X86.td
741–742	For now I have only an icelake-client machine and found that CLWB is not there, too. Guess I can do that modification in this patch? Rocketlake may probably lose CLWB. I'll double check.

craig.topper added inline comments.Apr 11 2021, 9:15 AM

llvm/lib/Target/X86/X86.td
741–742	What’s the model number for you ice lake client CPU?

FreddyYe marked an inline comment as done.Apr 11 2021, 6:50 PM

FreddyYe added inline comments.

llvm/lib/Target/X86/X86.td
741–742	It is 0x7e. And I've double checked that rkl also hasn't CLWB.

craig.topper added inline comments.Apr 11 2021, 6:58 PM

llvm/lib/Target/X86/X86.td
741–742	Sorry I meant the marketing name like "Intel® Core™ i7-1065G7"

FreddyYe marked an inline comment as done.Apr 11 2021, 7:14 PM

FreddyYe added inline comments.

llvm/lib/Target/X86/X86.td
741–742	It is `Intel(R) Core(TM) i7-1065G7 CPU @ 1.30GHz`

Thanks. I also found this https://github.com/gcc-mirror/gcc/commit/c422e5f81f42a0fc197f0715f4fcd81f1be90bff can you create a new patch to do the same for llvm/clang and rebase this patch on top of it.

In D100085#2682080, @craig.topper wrote:

Thanks. I also found this https://github.com/gcc-mirror/gcc/commit/c422e5f81f42a0fc197f0715f4fcd81f1be90bff can you create a new patch to do the same for llvm/clang and rebase this patch on top of it.

OK, I'll create it.

rebase

Harbormaster completed remote builds in B98205: Diff 336738.Apr 11 2021, 9:56 PM

Hi @MaskRay, @craig.topper, @skan, reviewers, I've addressed your comments. Any more concerns?

LGTM

This revision is now accepted and ready to land.Apr 12 2021, 9:55 AM

MaskRay accepted this revision.Apr 12 2021, 3:44 PM

skan added inline comments.Apr 12 2021, 6:06 PM

llvm/lib/Support/X86TargetParser.cpp
176	Shouldn't the FeatureSGX be removed here?
180	Remove `~FeatureSGX` here?
197	Remove `~FeatureSGX` here?

craig.topper added inline comments.Apr 12 2021, 6:07 PM

llvm/lib/Support/X86TargetParser.cpp
176	That would change the frontend behavior which would require a wider discussion. My suggestion was only to change the backend behavior since there was nothing testable with llc anyway.

skan accepted this revision.Apr 12 2021, 6:09 PM

skan added inline comments.

llvm/lib/Support/X86TargetParser.cpp
176	Okay, it makes sense to me.

This revision was landed with ongoing or failed builds.Apr 12 2021, 6:48 PM

Closed by commit rG3fc1fe8db830: [X86] Support -march=rocketlake (authored by FreddyYe). · Explain Why

This revision was automatically updated to reflect the committed changes.

FreddyYe added a commit: rG3fc1fe8db830: [X86] Support -march=rocketlake.

Revision Contents

Path

Size

clang/

lib/

Basic/

Targets/

X86.cpp

2 lines

test/

CodeGen/

attr-target-mv.c

3 lines

target-builtin-noerror.c

1 line

Driver/

x86-march.c

4 lines

Misc/

target-invalid-cpu-note.c

8 lines

Preprocessor/

predefined-arch-macros.c

16 lines

compiler-rt/

lib/

builtins/

cpu_model.c

7 lines

llvm/

include/

llvm/

Support/

X86TargetParser.h

1 line

X86TargetParser.def

1 line

lib/

Support/

Host.cpp

7 lines

X86TargetParser.cpp

3 lines

Target/

X86/

X86.td

12 lines

test/

CodeGen/

X86/

cpus-intel.ll

1 line

Diff 337020

clang/lib/Basic/Targets/X86.cpp

Show First 20 Lines • Show All 461 Lines • ▼ Show 20 Lines	void X86TargetInfo::getTargetDefines(const LangOptions &Opts,
case CK_Haswell:		case CK_Haswell:
case CK_Broadwell:		case CK_Broadwell:
case CK_SkylakeClient:		case CK_SkylakeClient:
case CK_SkylakeServer:		case CK_SkylakeServer:
case CK_Cascadelake:		case CK_Cascadelake:
case CK_Cooperlake:		case CK_Cooperlake:
case CK_Cannonlake:		case CK_Cannonlake:
case CK_IcelakeClient:		case CK_IcelakeClient:
		case CK_Rocketlake:
case CK_IcelakeServer:		case CK_IcelakeServer:
case CK_Tigerlake:		case CK_Tigerlake:
case CK_SapphireRapids:		case CK_SapphireRapids:
case CK_Alderlake:		case CK_Alderlake:
// FIXME: Historically, we defined this legacy name, it would be nice to		// FIXME: Historically, we defined this legacy name, it would be nice to
// remove it at some point. We've never exposed fine-grained names for		// remove it at some point. We've never exposed fine-grained names for
// recent primary x86 CPUs, and we should keep it that way.		// recent primary x86 CPUs, and we should keep it that way.
defineCPUMacros(Builder, "corei7");		defineCPUMacros(Builder, "corei7");
▲ Show 20 Lines • Show All 831 Lines • ▼ Show 20 Lines	switch (CPU) {
case CK_SkylakeServer:		case CK_SkylakeServer:
case CK_Cascadelake:		case CK_Cascadelake:
case CK_Nehalem:		case CK_Nehalem:
case CK_Cooperlake:		case CK_Cooperlake:
case CK_Cannonlake:		case CK_Cannonlake:
case CK_Tigerlake:		case CK_Tigerlake:
case CK_SapphireRapids:		case CK_SapphireRapids:
case CK_IcelakeClient:		case CK_IcelakeClient:
		case CK_Rocketlake:
case CK_IcelakeServer:		case CK_IcelakeServer:
case CK_Alderlake:		case CK_Alderlake:
case CK_KNL:		case CK_KNL:
case CK_KNM:		case CK_KNM:
// K7		// K7
case CK_Athlon:		case CK_Athlon:
case CK_AthlonXP:		case CK_AthlonXP:
// K8		// K8
▲ Show 20 Lines • Show All 184 Lines • Show Last 20 Lines

clang/test/CodeGen/attr-target-mv.c

	// RUN: %clang_cc1 -triple x86_64-linux-gnu -emit-llvm %s -o - \| FileCheck %s --check-prefix=LINUX			// RUN: %clang_cc1 -triple x86_64-linux-gnu -emit-llvm %s -o - \| FileCheck %s --check-prefix=LINUX
	// RUN: %clang_cc1 -triple x86_64-windows-pc -emit-llvm %s -o - \| FileCheck %s --check-prefix=WINDOWS			// RUN: %clang_cc1 -triple x86_64-windows-pc -emit-llvm %s -o - \| FileCheck %s --check-prefix=WINDOWS

	int __attribute__((target("sse4.2"))) foo(void) { return 0; }			int __attribute__((target("sse4.2"))) foo(void) { return 0; }
	int __attribute__((target("arch=sandybridge"))) foo(void);			int __attribute__((target("arch=sandybridge"))) foo(void);
	int __attribute__((target("arch=ivybridge"))) foo(void) {return 1;}			int __attribute__((target("arch=ivybridge"))) foo(void) {return 1;}
	int __attribute__((target("arch=goldmont"))) foo(void) {return 3;}			int __attribute__((target("arch=goldmont"))) foo(void) {return 3;}
	int __attribute__((target("arch=goldmont-plus"))) foo(void) {return 4;}			int __attribute__((target("arch=goldmont-plus"))) foo(void) {return 4;}
	int __attribute__((target("arch=tremont"))) foo(void) {return 5;}			int __attribute__((target("arch=tremont"))) foo(void) {return 5;}
	int __attribute__((target("arch=icelake-client"))) foo(void) {return 6;}			int __attribute__((target("arch=icelake-client"))) foo(void) {return 6;}
	int __attribute__((target("arch=icelake-server"))) foo(void) {return 7;}			int __attribute__((target("arch=icelake-server"))) foo(void) {return 7;}
	int __attribute__((target("arch=cooperlake"))) foo(void) {return 8;}			int __attribute__((target("arch=cooperlake"))) foo(void) {return 8;}
	int __attribute__((target("arch=tigerlake"))) foo(void) {return 9;}			int __attribute__((target("arch=tigerlake"))) foo(void) {return 9;}
	int __attribute__((target("arch=sapphirerapids"))) foo(void) {return 10;}			int __attribute__((target("arch=sapphirerapids"))) foo(void) {return 10;}
	int __attribute__((target("arch=alderlake"))) foo(void) {return 11;}			int __attribute__((target("arch=alderlake"))) foo(void) {return 11;}
				int __attribute__((target("arch=rocketlake"))) foo(void) {return 12;}
	int __attribute__((target("default"))) foo(void) { return 2; }			int __attribute__((target("default"))) foo(void) { return 2; }

	int bar() {			int bar() {
	return foo();			return foo();
	}			}

	inline int __attribute__((target("sse4.2"))) foo_inline(void) { return 0; }			inline int __attribute__((target("sse4.2"))) foo_inline(void) { return 0; }
	inline int __attribute__((target("arch=sandybridge"))) foo_inline(void);			inline int __attribute__((target("arch=sandybridge"))) foo_inline(void);
	▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines
	// LINUX: define{{.*}} i32 @foo.arch_cooperlake()			// LINUX: define{{.*}} i32 @foo.arch_cooperlake()
	// LINUX: ret i32 8			// LINUX: ret i32 8
	// LINUX: define{{.*}} i32 @foo.arch_tigerlake()			// LINUX: define{{.*}} i32 @foo.arch_tigerlake()
	// LINUX: ret i32 9			// LINUX: ret i32 9
	// LINUX: define{{.*}} i32 @foo.arch_sapphirerapids()			// LINUX: define{{.*}} i32 @foo.arch_sapphirerapids()
	// LINUX: ret i32 10			// LINUX: ret i32 10
	// LINUX: define{{.*}} i32 @foo.arch_alderlake()			// LINUX: define{{.*}} i32 @foo.arch_alderlake()
	// LINUX: ret i32 11			// LINUX: ret i32 11
				// LINUX: define{{.*}} i32 @foo.arch_rocketlake()
				// LINUX: ret i32 12
	// LINUX: define{{.*}} i32 @foo()			// LINUX: define{{.*}} i32 @foo()
	// LINUX: ret i32 2			// LINUX: ret i32 2
	// LINUX: define{{.*}} i32 @bar()			// LINUX: define{{.*}} i32 @bar()
	// LINUX: call i32 @foo.ifunc()			// LINUX: call i32 @foo.ifunc()

	// WINDOWS: define dso_local i32 @foo.sse4.2()			// WINDOWS: define dso_local i32 @foo.sse4.2()
	// WINDOWS: ret i32 0			// WINDOWS: ret i32 0
	// WINDOWS: define dso_local i32 @foo.arch_ivybridge()			// WINDOWS: define dso_local i32 @foo.arch_ivybridge()
	▲ Show 20 Lines • Show All 192 Lines • Show Last 20 Lines

clang/test/CodeGen/target-builtin-noerror.c

Show First 20 Lines • Show All 110 Lines • ▼ Show 20 Lines	void verifycpustrings() {
(void)__builtin_cpu_is("icelake-client");		(void)__builtin_cpu_is("icelake-client");
(void)__builtin_cpu_is("icelake-server");		(void)__builtin_cpu_is("icelake-server");
(void)__builtin_cpu_is("intel");		(void)__builtin_cpu_is("intel");
(void)__builtin_cpu_is("istanbul");		(void)__builtin_cpu_is("istanbul");
(void)__builtin_cpu_is("ivybridge");		(void)__builtin_cpu_is("ivybridge");
(void)__builtin_cpu_is("knl");		(void)__builtin_cpu_is("knl");
(void)__builtin_cpu_is("knm");		(void)__builtin_cpu_is("knm");
(void)__builtin_cpu_is("nehalem");		(void)__builtin_cpu_is("nehalem");
		(void)__builtin_cpu_is("rocketlake");
(void)__builtin_cpu_is("sandybridge");		(void)__builtin_cpu_is("sandybridge");
(void)__builtin_cpu_is("shanghai");		(void)__builtin_cpu_is("shanghai");
(void)__builtin_cpu_is("silvermont");		(void)__builtin_cpu_is("silvermont");
(void)__builtin_cpu_is("skylake");		(void)__builtin_cpu_is("skylake");
(void)__builtin_cpu_is("skylake-avx512");		(void)__builtin_cpu_is("skylake-avx512");
(void)__builtin_cpu_is("slm");		(void)__builtin_cpu_is("slm");
(void)__builtin_cpu_is("tigerlake");		(void)__builtin_cpu_is("tigerlake");
(void)__builtin_cpu_is("sapphirerapids");		(void)__builtin_cpu_is("sapphirerapids");
(void)__builtin_cpu_is("tremont");		(void)__builtin_cpu_is("tremont");
(void)__builtin_cpu_is("westmere");		(void)__builtin_cpu_is("westmere");
(void)__builtin_cpu_is("znver1");		(void)__builtin_cpu_is("znver1");
(void)__builtin_cpu_is("znver2");		(void)__builtin_cpu_is("znver2");
(void)__builtin_cpu_is("znver3");		(void)__builtin_cpu_is("znver3");
}		}

clang/test/Driver/x86-march.c

	Show First 20 Lines • Show All 66 Lines • ▼ Show 20 Lines
	// RUN: %clang -target x86_64-unknown-unknown -c -### %s -march=cannonlake 2>&1 \			// RUN: %clang -target x86_64-unknown-unknown -c -### %s -march=cannonlake 2>&1 \
	// RUN: \| FileCheck %s -check-prefix=cannonlake			// RUN: \| FileCheck %s -check-prefix=cannonlake
	// cannonlake: "-target-cpu" "cannonlake"			// cannonlake: "-target-cpu" "cannonlake"
	//			//
	// RUN: %clang -target x86_64-unknown-unknown -c -### %s -march=icelake-client 2>&1 \			// RUN: %clang -target x86_64-unknown-unknown -c -### %s -march=icelake-client 2>&1 \
	// RUN: \| FileCheck %s -check-prefix=icelake-client			// RUN: \| FileCheck %s -check-prefix=icelake-client
	// icelake-client: "-target-cpu" "icelake-client"			// icelake-client: "-target-cpu" "icelake-client"
	//			//
				// RUN: %clang -target x86_64-unknown-unknown -c -### %s -march=rocketlake 2>&1 \
				// RUN: \| FileCheck %s -check-prefix=rocketlake
				// rocketlake: "-target-cpu" "rocketlake"
				//
	// RUN: %clang -target x86_64-unknown-unknown -c -### %s -march=icelake-server 2>&1 \			// RUN: %clang -target x86_64-unknown-unknown -c -### %s -march=icelake-server 2>&1 \
	// RUN: \| FileCheck %s -check-prefix=icelake-server			// RUN: \| FileCheck %s -check-prefix=icelake-server
	// icelake-server: "-target-cpu" "icelake-server"			// icelake-server: "-target-cpu" "icelake-server"
	//			//
	// RUN: %clang -target x86_64-unknown-unknown -c -### %s -march=tigerlake 2>&1 \			// RUN: %clang -target x86_64-unknown-unknown -c -### %s -march=tigerlake 2>&1 \
	// RUN: \| FileCheck %s -check-prefix=tigerlake			// RUN: \| FileCheck %s -check-prefix=tigerlake
	// tigerlake: "-target-cpu" "tigerlake"			// tigerlake: "-target-cpu" "tigerlake"
	//			//
	▲ Show 20 Lines • Show All 112 Lines • Show Last 20 Lines

clang/test/Misc/target-invalid-cpu-note.c

	Show All 15 Lines
	// RUN: not %clang_cc1 -triple i386--- -target-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix X86			// RUN: not %clang_cc1 -triple i386--- -target-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix X86
	// X86: error: unknown target CPU 'not-a-cpu'			// X86: error: unknown target CPU 'not-a-cpu'
	// X86: note: valid target CPU values are: i386, i486, winchip-c6, winchip2, c3,			// X86: note: valid target CPU values are: i386, i486, winchip-c6, winchip2, c3,
	// X86-SAME: i586, pentium, pentium-mmx, pentiumpro, i686, pentium2, pentium3,			// X86-SAME: i586, pentium, pentium-mmx, pentiumpro, i686, pentium2, pentium3,
	// X86-SAME: pentium3m, pentium-m, c3-2, yonah, pentium4, pentium4m, prescott,			// X86-SAME: pentium3m, pentium-m, c3-2, yonah, pentium4, pentium4m, prescott,
	// X86-SAME: nocona, core2, penryn, bonnell, atom, silvermont, slm, goldmont, goldmont-plus, tremont,			// X86-SAME: nocona, core2, penryn, bonnell, atom, silvermont, slm, goldmont, goldmont-plus, tremont,
	// X86-SAME: nehalem, corei7, westmere, sandybridge, corei7-avx, ivybridge,			// X86-SAME: nehalem, corei7, westmere, sandybridge, corei7-avx, ivybridge,
	// X86-SAME: core-avx-i, haswell, core-avx2, broadwell, skylake, skylake-avx512,			// X86-SAME: core-avx-i, haswell, core-avx2, broadwell, skylake, skylake-avx512,
	// X86-SAME: skx, cascadelake, cooperlake, cannonlake, icelake-client, icelake-server, tigerlake, sapphirerapids, alderlake, knl, knm, lakemont, k6, k6-2, k6-3,			// X86-SAME: skx, cascadelake, cooperlake, cannonlake, icelake-client, rocketlake, icelake-server, tigerlake, sapphirerapids, alderlake, knl, knm, lakemont, k6, k6-2, k6-3,
	// X86-SAME: athlon, athlon-tbird, athlon-xp, athlon-mp, athlon-4, k8, athlon64,			// X86-SAME: athlon, athlon-tbird, athlon-xp, athlon-mp, athlon-4, k8, athlon64,
	// X86-SAME: athlon-fx, opteron, k8-sse3, athlon64-sse3, opteron-sse3, amdfam10,			// X86-SAME: athlon-fx, opteron, k8-sse3, athlon64-sse3, opteron-sse3, amdfam10,
	// X86-SAME: barcelona, btver1, btver2, bdver1, bdver2, bdver3, bdver4, znver1, znver2, znver3,			// X86-SAME: barcelona, btver1, btver2, bdver1, bdver2, bdver3, bdver4, znver1, znver2, znver3,
	// X86-SAME: x86-64, x86-64-v2, x86-64-v3, x86-64-v4, geode{{$}}			// X86-SAME: x86-64, x86-64-v2, x86-64-v3, x86-64-v4, geode{{$}}

	// RUN: not %clang_cc1 -triple x86_64--- -target-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix X86_64			// RUN: not %clang_cc1 -triple x86_64--- -target-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix X86_64
	// X86_64: error: unknown target CPU 'not-a-cpu'			// X86_64: error: unknown target CPU 'not-a-cpu'
	// X86_64: note: valid target CPU values are: nocona, core2, penryn, bonnell,			// X86_64: note: valid target CPU values are: nocona, core2, penryn, bonnell,
	// X86_64-SAME: atom, silvermont, slm, goldmont, goldmont-plus, tremont, nehalem, corei7, westmere,			// X86_64-SAME: atom, silvermont, slm, goldmont, goldmont-plus, tremont, nehalem, corei7, westmere,
	// X86_64-SAME: sandybridge, corei7-avx, ivybridge, core-avx-i, haswell,			// X86_64-SAME: sandybridge, corei7-avx, ivybridge, core-avx-i, haswell,
	// X86_64-SAME: core-avx2, broadwell, skylake, skylake-avx512, skx, cascadelake, cooperlake, cannonlake,			// X86_64-SAME: core-avx2, broadwell, skylake, skylake-avx512, skx, cascadelake, cooperlake, cannonlake,
	// X86_64-SAME: icelake-client, icelake-server, tigerlake, sapphirerapids, alderlake, knl, knm, k8, athlon64, athlon-fx, opteron, k8-sse3,			// X86_64-SAME: icelake-client, rocketlake, icelake-server, tigerlake, sapphirerapids, alderlake, knl, knm, k8, athlon64, athlon-fx, opteron, k8-sse3,
	// X86_64-SAME: athlon64-sse3, opteron-sse3, amdfam10, barcelona, btver1,			// X86_64-SAME: athlon64-sse3, opteron-sse3, amdfam10, barcelona, btver1,
	// X86_64-SAME: btver2, bdver1, bdver2, bdver3, bdver4, znver1, znver2, znver3,			// X86_64-SAME: btver2, bdver1, bdver2, bdver3, bdver4, znver1, znver2, znver3,
	// X86_64-SAME: x86-64, x86-64-v2, x86-64-v3, x86-64-v4{{$}}			// X86_64-SAME: x86-64, x86-64-v2, x86-64-v3, x86-64-v4{{$}}

	// RUN: not %clang_cc1 -triple i386--- -tune-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix TUNE_X86			// RUN: not %clang_cc1 -triple i386--- -tune-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix TUNE_X86
	// TUNE_X86: error: unknown target CPU 'not-a-cpu'			// TUNE_X86: error: unknown target CPU 'not-a-cpu'
	// TUNE_X86: note: valid target CPU values are: i386, i486, winchip-c6, winchip2, c3,			// TUNE_X86: note: valid target CPU values are: i386, i486, winchip-c6, winchip2, c3,
	// TUNE_X86-SAME: i586, pentium, pentium-mmx, pentiumpro, i686, pentium2, pentium3,			// TUNE_X86-SAME: i586, pentium, pentium-mmx, pentiumpro, i686, pentium2, pentium3,
	// TUNE_X86-SAME: pentium3m, pentium-m, c3-2, yonah, pentium4, pentium4m, prescott,			// TUNE_X86-SAME: pentium3m, pentium-m, c3-2, yonah, pentium4, pentium4m, prescott,
	// TUNE_X86-SAME: nocona, core2, penryn, bonnell, atom, silvermont, slm, goldmont, goldmont-plus, tremont,			// TUNE_X86-SAME: nocona, core2, penryn, bonnell, atom, silvermont, slm, goldmont, goldmont-plus, tremont,
	// TUNE_X86-SAME: nehalem, corei7, westmere, sandybridge, corei7-avx, ivybridge,			// TUNE_X86-SAME: nehalem, corei7, westmere, sandybridge, corei7-avx, ivybridge,
	// TUNE_X86-SAME: core-avx-i, haswell, core-avx2, broadwell, skylake, skylake-avx512,			// TUNE_X86-SAME: core-avx-i, haswell, core-avx2, broadwell, skylake, skylake-avx512,
	// TUNE_X86-SAME: skx, cascadelake, cooperlake, cannonlake, icelake-client, icelake-server, tigerlake, sapphirerapids, alderlake, knl, knm, lakemont, k6, k6-2, k6-3,			// TUNE_X86-SAME: skx, cascadelake, cooperlake, cannonlake, icelake-client, rocketlake, icelake-server, tigerlake, sapphirerapids, alderlake, knl, knm, lakemont, k6, k6-2, k6-3,
	// TUNE_X86-SAME: athlon, athlon-tbird, athlon-xp, athlon-mp, athlon-4, k8, athlon64,			// TUNE_X86-SAME: athlon, athlon-tbird, athlon-xp, athlon-mp, athlon-4, k8, athlon64,
	// TUNE_X86-SAME: athlon-fx, opteron, k8-sse3, athlon64-sse3, opteron-sse3, amdfam10,			// TUNE_X86-SAME: athlon-fx, opteron, k8-sse3, athlon64-sse3, opteron-sse3, amdfam10,
	// TUNE_X86-SAME: barcelona, btver1, btver2, bdver1, bdver2, bdver3, bdver4, znver1, znver2, znver3,			// TUNE_X86-SAME: barcelona, btver1, btver2, bdver1, bdver2, bdver3, bdver4, znver1, znver2, znver3,
	// TUNE_X86-SAME: x86-64, geode{{$}}			// TUNE_X86-SAME: x86-64, geode{{$}}

	// RUN: not %clang_cc1 -triple x86_64--- -tune-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix TUNE_X86_64			// RUN: not %clang_cc1 -triple x86_64--- -tune-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix TUNE_X86_64
	// TUNE_X86_64: error: unknown target CPU 'not-a-cpu'			// TUNE_X86_64: error: unknown target CPU 'not-a-cpu'
	// TUNE_X86_64: note: valid target CPU values are: i386, i486, winchip-c6, winchip2, c3,			// TUNE_X86_64: note: valid target CPU values are: i386, i486, winchip-c6, winchip2, c3,
	// TUNE_X86_64-SAME: i586, pentium, pentium-mmx, pentiumpro, i686, pentium2, pentium3,			// TUNE_X86_64-SAME: i586, pentium, pentium-mmx, pentiumpro, i686, pentium2, pentium3,
	// TUNE_X86_64-SAME: pentium3m, pentium-m, c3-2, yonah, pentium4, pentium4m, prescott,			// TUNE_X86_64-SAME: pentium3m, pentium-m, c3-2, yonah, pentium4, pentium4m, prescott,
	// TUNE_X86_64-SAME: nocona, core2, penryn, bonnell, atom, silvermont, slm, goldmont, goldmont-plus, tremont,			// TUNE_X86_64-SAME: nocona, core2, penryn, bonnell, atom, silvermont, slm, goldmont, goldmont-plus, tremont,
	// TUNE_X86_64-SAME: nehalem, corei7, westmere, sandybridge, corei7-avx, ivybridge,			// TUNE_X86_64-SAME: nehalem, corei7, westmere, sandybridge, corei7-avx, ivybridge,
	// TUNE_X86_64-SAME: core-avx-i, haswell, core-avx2, broadwell, skylake, skylake-avx512,			// TUNE_X86_64-SAME: core-avx-i, haswell, core-avx2, broadwell, skylake, skylake-avx512,
	// TUNE_X86_64-SAME: skx, cascadelake, cooperlake, cannonlake, icelake-client, icelake-server, tigerlake, sapphirerapids, alderlake, knl, knm, lakemont, k6, k6-2, k6-3,			// TUNE_X86_64-SAME: skx, cascadelake, cooperlake, cannonlake, icelake-client, rocketlake, icelake-server, tigerlake, sapphirerapids, alderlake, knl, knm, lakemont, k6, k6-2, k6-3,
	// TUNE_X86_64-SAME: athlon, athlon-tbird, athlon-xp, athlon-mp, athlon-4, k8, athlon64,			// TUNE_X86_64-SAME: athlon, athlon-tbird, athlon-xp, athlon-mp, athlon-4, k8, athlon64,
	// TUNE_X86_64-SAME: athlon-fx, opteron, k8-sse3, athlon64-sse3, opteron-sse3, amdfam10,			// TUNE_X86_64-SAME: athlon-fx, opteron, k8-sse3, athlon64-sse3, opteron-sse3, amdfam10,
	// TUNE_X86_64-SAME: barcelona, btver1, btver2, bdver1, bdver2, bdver3, bdver4, znver1, znver2, znver3,			// TUNE_X86_64-SAME: barcelona, btver1, btver2, bdver1, bdver2, bdver3, bdver4, znver1, znver2, znver3,
	// TUNE_X86_64-SAME: x86-64, geode{{$}}			// TUNE_X86_64-SAME: x86-64, geode{{$}}

	// RUN: not %clang_cc1 -triple nvptx--- -target-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix NVPTX			// RUN: not %clang_cc1 -triple nvptx--- -target-cpu not-a-cpu -fsyntax-only %s 2>&1 \| FileCheck %s --check-prefix NVPTX
	// NVPTX: error: unknown target CPU 'not-a-cpu'			// NVPTX: error: unknown target CPU 'not-a-cpu'
	// NVPTX: note: valid target CPU values are: sm_20, sm_21, sm_30, sm_32, sm_35,			// NVPTX: note: valid target CPU values are: sm_20, sm_21, sm_30, sm_32, sm_35,
	▲ Show 20 Lines • Show All 136 Lines • Show Last 20 Lines

clang/test/Preprocessor/predefined-arch-macros.c

	Show First 20 Lines • Show All 1,274 Lines • ▼ Show 20 Lines
	// CHECK_CNL_M64: #define __corei7 1			// CHECK_CNL_M64: #define __corei7 1
	// CHECK_CNL_M64: #define __corei7__ 1			// CHECK_CNL_M64: #define __corei7__ 1
	// CHECK_CNL_M64: #define __tune_corei7__ 1			// CHECK_CNL_M64: #define __tune_corei7__ 1
	// CHECK_CNL_M64: #define __x86_64 1			// CHECK_CNL_M64: #define __x86_64 1
	// CHECK_CNL_M64: #define __x86_64__ 1			// CHECK_CNL_M64: #define __x86_64__ 1

	// RUN: %clang -march=icelake-client -m32 -E -dM %s -o - 2>&1 \			// RUN: %clang -march=icelake-client -m32 -E -dM %s -o - 2>&1 \
	// RUN: -target i386-unknown-linux \			// RUN: -target i386-unknown-linux \
	// RUN: \| FileCheck -match-full-lines %s -check-prefix=CHECK_ICL_M32			// RUN: \| FileCheck -match-full-lines %s -check-prefixes=CHECK_ICL_M32,CHECK_ICL_M32S
				// RUN: %clang -march=rocketlake -m32 -E -dM %s -o - 2>&1 \
				// RUN: -target i386-unknown-linux \
				// RUN: \| FileCheck -match-full-lines %s -check-prefixes=CHECK_ICL_M32,CHECK_RKL_M32S
	// CHECK_ICL_M32: #define __AES__ 1			// CHECK_ICL_M32: #define __AES__ 1
	// CHECK_ICL_M32: #define __AVX2__ 1			// CHECK_ICL_M32: #define __AVX2__ 1
	// CHECK_ICL_M32: #define __AVX512BITALG__ 1			// CHECK_ICL_M32: #define __AVX512BITALG__ 1
	// CHECK_ICL_M32: #define __AVX512BW__ 1			// CHECK_ICL_M32: #define __AVX512BW__ 1
	// CHECK_ICL_M32: #define __AVX512CD__ 1			// CHECK_ICL_M32: #define __AVX512CD__ 1
	// CHECK_ICL_M32: #define __AVX512DQ__ 1			// CHECK_ICL_M32: #define __AVX512DQ__ 1
	// CHECK_ICL_M32: #define __AVX512F__ 1			// CHECK_ICL_M32: #define __AVX512F__ 1
	// CHECK_ICL_M32: #define __AVX512IFMA__ 1			// CHECK_ICL_M32: #define __AVX512IFMA__ 1
	Show All 16 Lines
	// CHECK_ICL_M32: #define __MOVBE__ 1			// CHECK_ICL_M32: #define __MOVBE__ 1
	// CHECK_ICL_M32: #define __PCLMUL__ 1			// CHECK_ICL_M32: #define __PCLMUL__ 1
	// CHECK_ICL_M32: #define __PKU__ 1			// CHECK_ICL_M32: #define __PKU__ 1
	// CHECK_ICL_M32: #define __POPCNT__ 1			// CHECK_ICL_M32: #define __POPCNT__ 1
	// CHECK_ICL_M32: #define __PRFCHW__ 1			// CHECK_ICL_M32: #define __PRFCHW__ 1
	// CHECK_ICL_M32: #define __RDPID__ 1			// CHECK_ICL_M32: #define __RDPID__ 1
	// CHECK_ICL_M32: #define __RDRND__ 1			// CHECK_ICL_M32: #define __RDRND__ 1
	// CHECK_ICL_M32: #define __RDSEED__ 1			// CHECK_ICL_M32: #define __RDSEED__ 1
	// CHECK_ICL_M32: #define __SGX__ 1			// CHECK_ICL_M32S: #define __SGX__ 1
				// CHECK_RKL_M32S-NOT: #define __SGX__ 1
	// CHECK_ICL_M32: #define __SHA__ 1			// CHECK_ICL_M32: #define __SHA__ 1
	// CHECK_ICL_M32: #define __SSE2__ 1			// CHECK_ICL_M32: #define __SSE2__ 1
	// CHECK_ICL_M32: #define __SSE3__ 1			// CHECK_ICL_M32: #define __SSE3__ 1
	// CHECK_ICL_M32: #define __SSE4_1__ 1			// CHECK_ICL_M32: #define __SSE4_1__ 1
	// CHECK_ICL_M32: #define __SSE4_2__ 1			// CHECK_ICL_M32: #define __SSE4_2__ 1
	// CHECK_ICL_M32: #define __SSE__ 1			// CHECK_ICL_M32: #define __SSE__ 1
	// CHECK_ICL_M32: #define __SSSE3__ 1			// CHECK_ICL_M32: #define __SSSE3__ 1
	// CHECK_ICL_M32: #define __VAES__ 1			// CHECK_ICL_M32: #define __VAES__ 1
	// CHECK_ICL_M32: #define __VPCLMULQDQ__ 1			// CHECK_ICL_M32: #define __VPCLMULQDQ__ 1
	// CHECK_ICL_M32-NOT: #define __WBNOINVD__ 1			// CHECK_ICL_M32-NOT: #define __WBNOINVD__ 1
	// CHECK_ICL_M32: #define __XSAVEC__ 1			// CHECK_ICL_M32: #define __XSAVEC__ 1
	// CHECK_ICL_M32: #define __XSAVEOPT__ 1			// CHECK_ICL_M32: #define __XSAVEOPT__ 1
	// CHECK_ICL_M32: #define __XSAVES__ 1			// CHECK_ICL_M32: #define __XSAVES__ 1
	// CHECK_ICL_M32: #define __XSAVE__ 1			// CHECK_ICL_M32: #define __XSAVE__ 1
	// CHECK_ICL_M32: #define __corei7 1			// CHECK_ICL_M32: #define __corei7 1
	// CHECK_ICL_M32: #define __corei7__ 1			// CHECK_ICL_M32: #define __corei7__ 1
	// CHECK_ICL_M32: #define __i386 1			// CHECK_ICL_M32: #define __i386 1
	// CHECK_ICL_M32: #define __i386__ 1			// CHECK_ICL_M32: #define __i386__ 1
	// CHECK_ICL_M32: #define __tune_corei7__ 1			// CHECK_ICL_M32: #define __tune_corei7__ 1
	// CHECK_ICL_M32: #define i386 1			// CHECK_ICL_M32: #define i386 1

	// RUN: %clang -march=icelake-client -m64 -E -dM %s -o - 2>&1 \			// RUN: %clang -march=icelake-client -m64 -E -dM %s -o - 2>&1 \
	// RUN: -target i386-unknown-linux \			// RUN: -target i386-unknown-linux \
	// RUN: \| FileCheck -match-full-lines %s -check-prefix=CHECK_ICL_M64			// RUN: \| FileCheck -match-full-lines %s -check-prefixes=CHECK_ICL_M64,CHECK_ICL_M64S
				// RUN: %clang -march=rocketlake -m64 -E -dM %s -o - 2>&1 \
				// RUN: -target i386-unknown-linux \
				// RUN: \| FileCheck -match-full-lines %s -check-prefixes=CHECK_ICL_M64,CHECK_RKL_M64S
	// CHECK_ICL_M64: #define __AES__ 1			// CHECK_ICL_M64: #define __AES__ 1
	// CHECK_ICL_M64: #define __AVX2__ 1			// CHECK_ICL_M64: #define __AVX2__ 1
	// CHECK_ICL_M64: #define __AVX512BITALG__ 1			// CHECK_ICL_M64: #define __AVX512BITALG__ 1
	// CHECK_ICL_M64: #define __AVX512BW__ 1			// CHECK_ICL_M64: #define __AVX512BW__ 1
	// CHECK_ICL_M64: #define __AVX512CD__ 1			// CHECK_ICL_M64: #define __AVX512CD__ 1
	// CHECK_ICL_M64: #define __AVX512DQ__ 1			// CHECK_ICL_M64: #define __AVX512DQ__ 1
	// CHECK_ICL_M64: #define __AVX512F__ 1			// CHECK_ICL_M64: #define __AVX512F__ 1
	// CHECK_ICL_M64: #define __AVX512IFMA__ 1			// CHECK_ICL_M64: #define __AVX512IFMA__ 1
	Show All 16 Lines
	// CHECK_ICL_M64: #define __MOVBE__ 1			// CHECK_ICL_M64: #define __MOVBE__ 1
	// CHECK_ICL_M64: #define __PCLMUL__ 1			// CHECK_ICL_M64: #define __PCLMUL__ 1
	// CHECK_ICL_M64: #define __PKU__ 1			// CHECK_ICL_M64: #define __PKU__ 1
	// CHECK_ICL_M64: #define __POPCNT__ 1			// CHECK_ICL_M64: #define __POPCNT__ 1
	// CHECK_ICL_M64: #define __PRFCHW__ 1			// CHECK_ICL_M64: #define __PRFCHW__ 1
	// CHECK_ICL_M64: #define __RDPID__ 1			// CHECK_ICL_M64: #define __RDPID__ 1
	// CHECK_ICL_M64: #define __RDRND__ 1			// CHECK_ICL_M64: #define __RDRND__ 1
	// CHECK_ICL_M64: #define __RDSEED__ 1			// CHECK_ICL_M64: #define __RDSEED__ 1
	// CHECK_ICL_M64: #define __SGX__ 1			// CHECK_ICL_M64S: #define __SGX__ 1
				// CHECK_RKL_M64S-NOT: #define __SGX__ 1
	// CHECK_ICL_M64: #define __SHA__ 1			// CHECK_ICL_M64: #define __SHA__ 1
	// CHECK_ICL_M64: #define __SSE2__ 1			// CHECK_ICL_M64: #define __SSE2__ 1
	// CHECK_ICL_M64: #define __SSE3__ 1			// CHECK_ICL_M64: #define __SSE3__ 1
	// CHECK_ICL_M64: #define __SSE4_1__ 1			// CHECK_ICL_M64: #define __SSE4_1__ 1
	// CHECK_ICL_M64: #define __SSE4_2__ 1			// CHECK_ICL_M64: #define __SSE4_2__ 1
	// CHECK_ICL_M64: #define __SSE__ 1			// CHECK_ICL_M64: #define __SSE__ 1
	// CHECK_ICL_M64: #define __SSSE3__ 1			// CHECK_ICL_M64: #define __SSSE3__ 1
	// CHECK_ICL_M64: #define __VAES__ 1			// CHECK_ICL_M64: #define __VAES__ 1
	// CHECK_ICL_M64: #define __VPCLMULQDQ__ 1			// CHECK_ICL_M64: #define __VPCLMULQDQ__ 1
	// CHECK_ICL_M64-NOT: #define __WBNOINVD__ 1			// CHECK_ICL_M64-NOT: #define __WBNOINVD__ 1
	// CHECK_ICL_M64: #define __XSAVEC__ 1			// CHECK_ICL_M64: #define __XSAVEC__ 1
	// CHECK_ICL_M64: #define __XSAVEOPT__ 1			// CHECK_ICL_M64: #define __XSAVEOPT__ 1
	// CHECK_ICL_M64: #define __XSAVES__ 1			// CHECK_ICL_M64: #define __XSAVES__ 1
	// CHECK_ICL_M64: #define __XSAVE__ 1			// CHECK_ICL_M64: #define __XSAVE__ 1
	// CHECK_ICL_M64: #define __amd64 1			// CHECK_ICL_M64: #define __amd64 1
	// CHECK_ICL_M64: #define __amd64__ 1			// CHECK_ICL_M64: #define __amd64__ 1
	// CHECK_ICL_M64: #define __corei7 1			// CHECK_ICL_M64: #define __corei7 1
	// CHECK_ICL_M64: #define __corei7__ 1			// CHECK_ICL_M64: #define __corei7__ 1
	// CHECK_ICL_M64: #define __tune_corei7__ 1			// CHECK_ICL_M64: #define __tune_corei7__ 1
	// CHECK_ICL_M64: #define __x86_64 1			// CHECK_ICL_M64: #define __x86_64 1
	// CHECK_ICL_M64: #define __x86_64__ 1			// CHECK_ICL_M64: #define __x86_64__ 1

	// RUN: %clang -march=icelake-server -m32 -E -dM %s -o - 2>&1 \			// RUN: %clang -march=icelake-server -m32 -E -dM %s -o - 2>&1 \
				MaskRayUnsubmitted Not Done Reply Inline Actions The file may need some refactoring first. You can let RUN lines share some common check prefixes, instead of adding a bunch of defines for every new processor. // CHECK_X86_64_V2: ... // CHECK_X86_64_V2: ... // CHECK_X86_64_V3: ... // CHECK_PROCESSOR1_M32: // CHECK_PROCESSOR1_M64: // CHECK_PROCESSOR2_M32: // CHECK_PROCESSOR2_M64: MaskRay: The file may need some refactoring first. You can let RUN lines share some common check…
				FreddyYeAuthorUnsubmitted Done Reply Inline Actions I agree. I'll do it FreddyYe: I agree. I'll do it
	// RUN: -target i386-unknown-linux \			// RUN: -target i386-unknown-linux \
	// RUN: \| FileCheck -match-full-lines %s -check-prefix=CHECK_ICX_M32			// RUN: \| FileCheck -match-full-lines %s -check-prefix=CHECK_ICX_M32
	// CHECK_ICX_M32: #define __AES__ 1			// CHECK_ICX_M32: #define __AES__ 1
	// CHECK_ICX_M32: #define __AVX2__ 1			// CHECK_ICX_M32: #define __AVX2__ 1
	// CHECK_ICX_M32: #define __AVX512BITALG__ 1			// CHECK_ICX_M32: #define __AVX512BITALG__ 1
	// CHECK_ICX_M32: #define __AVX512BW__ 1			// CHECK_ICX_M32: #define __AVX512BW__ 1
	// CHECK_ICX_M32: #define __AVX512CD__ 1			// CHECK_ICX_M32: #define __AVX512CD__ 1
	// CHECK_ICX_M32: #define __AVX512DQ__ 1			// CHECK_ICX_M32: #define __AVX512DQ__ 1
	▲ Show 20 Lines • Show All 2,316 Lines • Show Last 20 Lines

compiler-rt/lib/builtins/cpu_model.c

Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines	enum ProcessorSubtypes {
INTEL_COREI7_ICELAKE_CLIENT,		INTEL_COREI7_ICELAKE_CLIENT,
INTEL_COREI7_ICELAKE_SERVER,		INTEL_COREI7_ICELAKE_SERVER,
AMDFAM17H_ZNVER2,		AMDFAM17H_ZNVER2,
INTEL_COREI7_CASCADELAKE,		INTEL_COREI7_CASCADELAKE,
INTEL_COREI7_TIGERLAKE,		INTEL_COREI7_TIGERLAKE,
INTEL_COREI7_COOPERLAKE,		INTEL_COREI7_COOPERLAKE,
INTEL_COREI7_SAPPHIRERAPIDS,		INTEL_COREI7_SAPPHIRERAPIDS,
INTEL_COREI7_ALDERLAKE,		INTEL_COREI7_ALDERLAKE,
AMDFAM19H_ZNVER3,		AMDFAM19H_ZNVER3,
		craig.topperUnsubmitted Not Done Reply Inline Actions This order is defined by libgcc. We can't insert in the middle unless ZNVER3 was in the wrong place Why this not referenced in the switch the select subtype? craig.topper: This order is defined by libgcc. We can't insert in the middle unless ZNVER3 was in the wrong…
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions This is a mistake. I'll modify. And reference is missing in two switch. I'll add. FreddyYe: This is a mistake. I'll modify. And reference is missing in two switch. I'll add.
		INTEL_COREI7_ROCKETLAKE,
CPU_SUBTYPE_MAX		CPU_SUBTYPE_MAX
};		};

enum ProcessorFeatures {		enum ProcessorFeatures {
FEATURE_CMOV = 0,		FEATURE_CMOV = 0,
FEATURE_MMX,		FEATURE_MMX,
FEATURE_POPCNT,		FEATURE_POPCNT,
FEATURE_SSE,		FEATURE_SSE,
▲ Show 20 Lines • Show All 269 Lines • ▼ Show 20 Lines	case 6:
case 0x9e: // Kaby Lake desktop		case 0x9e: // Kaby Lake desktop
case 0xa5: // Comet Lake-H/S		case 0xa5: // Comet Lake-H/S
case 0xa6: // Comet Lake-U		case 0xa6: // Comet Lake-U
CPU = "skylake";		CPU = "skylake";
*Type = INTEL_COREI7;		*Type = INTEL_COREI7;
*Subtype = INTEL_COREI7_SKYLAKE;		*Subtype = INTEL_COREI7_SKYLAKE;
break;		break;

		// Rocketlake:
		case 0xa7:
		CPU = "rocketlake";
		*Type = INTEL_COREI7;
		*Subtype = INTEL_COREI7_ROCKETLAKE;

// Skylake Xeon:		// Skylake Xeon:
case 0x55:		case 0x55:
*Type = INTEL_COREI7;		*Type = INTEL_COREI7;
if (testFeature(FEATURE_AVX512BF16)) {		if (testFeature(FEATURE_AVX512BF16)) {
CPU = "cooperlake";		CPU = "cooperlake";
*Subtype = INTEL_COREI7_COOPERLAKE;		*Subtype = INTEL_COREI7_COOPERLAKE;
} else if (testFeature(FEATURE_AVX512VNNI)) {		} else if (testFeature(FEATURE_AVX512VNNI)) {
CPU = "cascadelake";		CPU = "cascadelake";
▲ Show 20 Lines • Show All 389 Lines • Show Last 20 Lines

llvm/include/llvm/Support/X86TargetParser.h

Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines	enum CPUKind {
CK_Haswell,		CK_Haswell,
CK_Broadwell,		CK_Broadwell,
CK_SkylakeClient,		CK_SkylakeClient,
CK_SkylakeServer,		CK_SkylakeServer,
CK_Cascadelake,		CK_Cascadelake,
CK_Cooperlake,		CK_Cooperlake,
CK_Cannonlake,		CK_Cannonlake,
CK_IcelakeClient,		CK_IcelakeClient,
		CK_Rocketlake,
CK_IcelakeServer,		CK_IcelakeServer,
CK_Tigerlake,		CK_Tigerlake,
CK_SapphireRapids,		CK_SapphireRapids,
CK_Alderlake,		CK_Alderlake,
CK_KNL,		CK_KNL,
CK_KNM,		CK_KNM,
CK_Lakemont,		CK_Lakemont,
CK_K6,		CK_K6,
▲ Show 20 Lines • Show All 51 Lines • Show Last 20 Lines

llvm/include/llvm/Support/X86TargetParser.def

	Show First 20 Lines • Show All 82 Lines • ▼ Show 20 Lines
	X86_CPU_SUBTYPE(INTEL_COREI7_ICELAKE_SERVER, "icelake-server")			X86_CPU_SUBTYPE(INTEL_COREI7_ICELAKE_SERVER, "icelake-server")
	X86_CPU_SUBTYPE(AMDFAM17H_ZNVER2, "znver2")			X86_CPU_SUBTYPE(AMDFAM17H_ZNVER2, "znver2")
	X86_CPU_SUBTYPE(INTEL_COREI7_CASCADELAKE, "cascadelake")			X86_CPU_SUBTYPE(INTEL_COREI7_CASCADELAKE, "cascadelake")
	X86_CPU_SUBTYPE(INTEL_COREI7_TIGERLAKE, "tigerlake")			X86_CPU_SUBTYPE(INTEL_COREI7_TIGERLAKE, "tigerlake")
	X86_CPU_SUBTYPE(INTEL_COREI7_COOPERLAKE, "cooperlake")			X86_CPU_SUBTYPE(INTEL_COREI7_COOPERLAKE, "cooperlake")
	X86_CPU_SUBTYPE(INTEL_COREI7_SAPPHIRERAPIDS, "sapphirerapids")			X86_CPU_SUBTYPE(INTEL_COREI7_SAPPHIRERAPIDS, "sapphirerapids")
	X86_CPU_SUBTYPE(INTEL_COREI7_ALDERLAKE, "alderlake")			X86_CPU_SUBTYPE(INTEL_COREI7_ALDERLAKE, "alderlake")
	X86_CPU_SUBTYPE(AMDFAM19H_ZNVER3, "znver3")			X86_CPU_SUBTYPE(AMDFAM19H_ZNVER3, "znver3")
				X86_CPU_SUBTYPE(INTEL_COREI7_ROCKETLAKE, "rocketlake")
	#undef X86_CPU_SUBTYPE			#undef X86_CPU_SUBTYPE


	// This macro is used for cpu types present in compiler-rt/libgcc.			// This macro is used for cpu types present in compiler-rt/libgcc.
	#ifndef X86_FEATURE_COMPAT			#ifndef X86_FEATURE_COMPAT
	#define X86_FEATURE_COMPAT(ENUM, STR) X86_FEATURE(ENUM, STR)			#define X86_FEATURE_COMPAT(ENUM, STR) X86_FEATURE(ENUM, STR)
	#endif			#endif

	▲ Show 20 Lines • Show All 105 Lines • Show Last 20 Lines

llvm/lib/Support/Host.cpp

Show First 20 Lines • Show All 702 Lines • ▼ Show 20 Lines	case 6:
case 0x9e: // Kaby Lake desktop		case 0x9e: // Kaby Lake desktop
case 0xa5: // Comet Lake-H/S		case 0xa5: // Comet Lake-H/S
case 0xa6: // Comet Lake-U		case 0xa6: // Comet Lake-U
CPU = "skylake";		CPU = "skylake";
*Type = X86::INTEL_COREI7;		*Type = X86::INTEL_COREI7;
*Subtype = X86::INTEL_COREI7_SKYLAKE;		*Subtype = X86::INTEL_COREI7_SKYLAKE;
break;		break;

		// Rocketlake:
		case 0xa7:
		CPU = "rocketlake";
		*Type = X86::INTEL_COREI7;
		*Subtype = X86::INTEL_COREI7_ROCKETLAKE;
		break;

// Skylake Xeon:		// Skylake Xeon:
case 0x55:		case 0x55:
*Type = X86::INTEL_COREI7;		*Type = X86::INTEL_COREI7;
if (testFeature(X86::FEATURE_AVX512BF16)) {		if (testFeature(X86::FEATURE_AVX512BF16)) {
CPU = "cooperlake";		CPU = "cooperlake";
*Subtype = X86::INTEL_COREI7_COOPERLAKE;		*Subtype = X86::INTEL_COREI7_COOPERLAKE;
} else if (testFeature(X86::FEATURE_AVX512VNNI)) {		} else if (testFeature(X86::FEATURE_AVX512VNNI)) {
CPU = "cascadelake";		CPU = "cascadelake";
▲ Show 20 Lines • Show All 963 Lines • Show Last 20 Lines

llvm/lib/Support/X86TargetParser.cpp

Show First 20 Lines • Show All 167 Lines • ▼ Show 20 Lines
constexpr FeatureBitset FeaturesKNL =		constexpr FeatureBitset FeaturesKNL =
FeaturesBroadwell \| FeatureAES \| FeatureAVX512F \| FeatureAVX512CD \|		FeaturesBroadwell \| FeatureAES \| FeatureAVX512F \| FeatureAVX512CD \|
FeatureAVX512ER \| FeatureAVX512PF \| FeaturePREFETCHWT1;		FeatureAVX512ER \| FeatureAVX512PF \| FeaturePREFETCHWT1;
constexpr FeatureBitset FeaturesKNM = FeaturesKNL \| FeatureAVX512VPOPCNTDQ;		constexpr FeatureBitset FeaturesKNM = FeaturesKNL \| FeatureAVX512VPOPCNTDQ;

// Intel Skylake processors.		// Intel Skylake processors.
constexpr FeatureBitset FeaturesSkylakeClient =		constexpr FeatureBitset FeaturesSkylakeClient =
FeaturesBroadwell \| FeatureAES \| FeatureCLFLUSHOPT \| FeatureXSAVEC \|		FeaturesBroadwell \| FeatureAES \| FeatureCLFLUSHOPT \| FeatureXSAVEC \|
FeatureXSAVES \| FeatureSGX;		FeatureXSAVES \| FeatureSGX;
		skanUnsubmitted Not Done Reply Inline Actions Shouldn't the FeatureSGX be removed here? skan: Shouldn't the FeatureSGX be removed here?
		craig.topperUnsubmitted Not Done Reply Inline Actions That would change the frontend behavior which would require a wider discussion. My suggestion was only to change the backend behavior since there was nothing testable with llc anyway. craig.topper: That would change the frontend behavior which would require a wider discussion. My suggestion…
		skanUnsubmitted Not Done Reply Inline Actions Okay, it makes sense to me. skan: Okay, it makes sense to me.
// SkylakeServer inherits all SkylakeClient features except SGX.		// SkylakeServer inherits all SkylakeClient features except SGX.
// FIXME: That doesn't match gcc.		// FIXME: That doesn't match gcc.
constexpr FeatureBitset FeaturesSkylakeServer =		constexpr FeatureBitset FeaturesSkylakeServer =
(FeaturesSkylakeClient & ~FeatureSGX) \| FeatureAVX512F \| FeatureAVX512CD \|		(FeaturesSkylakeClient & ~FeatureSGX) \| FeatureAVX512F \| FeatureAVX512CD \|
		skanUnsubmitted Not Done Reply Inline Actions Remove `~FeatureSGX` here? skan: Remove `~FeatureSGX` here?
FeatureAVX512DQ \| FeatureAVX512BW \| FeatureAVX512VL \| FeatureCLWB \|		FeatureAVX512DQ \| FeatureAVX512BW \| FeatureAVX512VL \| FeatureCLWB \|
FeaturePKU;		FeaturePKU;
constexpr FeatureBitset FeaturesCascadeLake =		constexpr FeatureBitset FeaturesCascadeLake =
FeaturesSkylakeServer \| FeatureAVX512VNNI;		FeaturesSkylakeServer \| FeatureAVX512VNNI;
constexpr FeatureBitset FeaturesCooperLake =		constexpr FeatureBitset FeaturesCooperLake =
FeaturesCascadeLake \| FeatureAVX512BF16;		FeaturesCascadeLake \| FeatureAVX512BF16;

// Intel 10nm processors.		// Intel 10nm processors.
constexpr FeatureBitset FeaturesCannonlake =		constexpr FeatureBitset FeaturesCannonlake =
FeaturesSkylakeClient \| FeatureAVX512F \| FeatureAVX512CD \| FeatureAVX512DQ \|		FeaturesSkylakeClient \| FeatureAVX512F \| FeatureAVX512CD \| FeatureAVX512DQ \|
FeatureAVX512BW \| FeatureAVX512VL \| FeatureAVX512IFMA \| FeatureAVX512VBMI \|		FeatureAVX512BW \| FeatureAVX512VL \| FeatureAVX512IFMA \| FeatureAVX512VBMI \|
FeaturePKU \| FeatureSHA;		FeaturePKU \| FeatureSHA;
constexpr FeatureBitset FeaturesICLClient =		constexpr FeatureBitset FeaturesICLClient =
FeaturesCannonlake \| FeatureAVX512BITALG \| FeatureAVX512VBMI2 \|		FeaturesCannonlake \| FeatureAVX512BITALG \| FeatureAVX512VBMI2 \|
FeatureAVX512VNNI \| FeatureAVX512VPOPCNTDQ \| FeatureGFNI \| FeatureRDPID \|		FeatureAVX512VNNI \| FeatureAVX512VPOPCNTDQ \| FeatureGFNI \| FeatureRDPID \|
FeatureVAES \| FeatureVPCLMULQDQ;		FeatureVAES \| FeatureVPCLMULQDQ;
		constexpr FeatureBitset FeaturesRocketlake = FeaturesICLClient & ~FeatureSGX;
		skanUnsubmitted Not Done Reply Inline Actions Remove `~FeatureSGX` here? skan: Remove `~FeatureSGX` here?
constexpr FeatureBitset FeaturesICLServer =		constexpr FeatureBitset FeaturesICLServer =
FeaturesICLClient \| FeatureCLWB \| FeaturePCONFIG \| FeatureWBNOINVD;		FeaturesICLClient \| FeatureCLWB \| FeaturePCONFIG \| FeatureWBNOINVD;
constexpr FeatureBitset FeaturesTigerlake =		constexpr FeatureBitset FeaturesTigerlake =
FeaturesICLClient \| FeatureAVX512VP2INTERSECT \| FeatureMOVDIR64B \|		FeaturesICLClient \| FeatureAVX512VP2INTERSECT \| FeatureMOVDIR64B \|
FeatureCLWB \| FeatureMOVDIRI \| FeatureSHSTK \| FeatureKL \| FeatureWIDEKL;		FeatureCLWB \| FeatureMOVDIRI \| FeatureSHSTK \| FeatureKL \| FeatureWIDEKL;
constexpr FeatureBitset FeaturesSapphireRapids =		constexpr FeatureBitset FeaturesSapphireRapids =
FeaturesICLServer \| FeatureAMX_TILE \| FeatureAMX_INT8 \| FeatureAMX_BF16 \|		FeaturesICLServer \| FeatureAMX_TILE \| FeatureAMX_INT8 \| FeatureAMX_BF16 \|
FeatureAVX512BF16 \| FeatureAVX512VP2INTERSECT \| FeatureCLDEMOTE \|		FeatureAVX512BF16 \| FeatureAVX512VP2INTERSECT \| FeatureCLDEMOTE \|
▲ Show 20 Lines • Show All 146 Lines • ▼ Show 20 Lines	constexpr ProcInfo Processors[] = {
// Cascadelake Server microarchitecture based processors.		// Cascadelake Server microarchitecture based processors.
{ {"cascadelake"}, CK_Cascadelake, FEATURE_AVX512VNNI, FeaturesCascadeLake },		{ {"cascadelake"}, CK_Cascadelake, FEATURE_AVX512VNNI, FeaturesCascadeLake },
// Cooperlake Server microarchitecture based processors.		// Cooperlake Server microarchitecture based processors.
{ {"cooperlake"}, CK_Cooperlake, FEATURE_AVX512BF16, FeaturesCooperLake },		{ {"cooperlake"}, CK_Cooperlake, FEATURE_AVX512BF16, FeaturesCooperLake },
// Cannonlake client microarchitecture based processors.		// Cannonlake client microarchitecture based processors.
{ {"cannonlake"}, CK_Cannonlake, FEATURE_AVX512VBMI, FeaturesCannonlake },		{ {"cannonlake"}, CK_Cannonlake, FEATURE_AVX512VBMI, FeaturesCannonlake },
// Icelake client microarchitecture based processors.		// Icelake client microarchitecture based processors.
{ {"icelake-client"}, CK_IcelakeClient, FEATURE_AVX512VBMI2, FeaturesICLClient },		{ {"icelake-client"}, CK_IcelakeClient, FEATURE_AVX512VBMI2, FeaturesICLClient },
		// Rocketlake microarchitecture based processors.
		{ {"rocketlake"}, CK_Rocketlake, FEATURE_AVX512VBMI2, FeaturesRocketlake },
// Icelake server microarchitecture based processors.		// Icelake server microarchitecture based processors.
{ {"icelake-server"}, CK_IcelakeServer, FEATURE_AVX512VBMI2, FeaturesICLServer },		{ {"icelake-server"}, CK_IcelakeServer, FEATURE_AVX512VBMI2, FeaturesICLServer },
// Tigerlake microarchitecture based processors.		// Tigerlake microarchitecture based processors.
{ {"tigerlake"}, CK_Tigerlake, FEATURE_AVX512VP2INTERSECT, FeaturesTigerlake },		{ {"tigerlake"}, CK_Tigerlake, FEATURE_AVX512VP2INTERSECT, FeaturesTigerlake },
// Sapphire Rapids microarchitecture based processors.		// Sapphire Rapids microarchitecture based processors.
{ {"sapphirerapids"}, CK_SapphireRapids, FEATURE_AVX512VP2INTERSECT, FeaturesSapphireRapids },		{ {"sapphirerapids"}, CK_SapphireRapids, FEATURE_AVX512VP2INTERSECT, FeaturesSapphireRapids },
// Alderlake microarchitecture based processors.		// Alderlake microarchitecture based processors.
{ {"alderlake"}, CK_Alderlake, FEATURE_AVX2, FeaturesAlderlake },		{ {"alderlake"}, CK_Alderlake, FEATURE_AVX2, FeaturesAlderlake },
Show All 36 Lines	constexpr ProcInfo Processors[] = {
{ {"znver2"}, CK_ZNVER2, FEATURE_AVX2, FeaturesZNVER2 },		{ {"znver2"}, CK_ZNVER2, FEATURE_AVX2, FeaturesZNVER2 },
{ {"znver3"}, CK_ZNVER3, FEATURE_AVX2, FeaturesZNVER3 },		{ {"znver3"}, CK_ZNVER3, FEATURE_AVX2, FeaturesZNVER3 },
// Generic 64-bit processor.		// Generic 64-bit processor.
{ {"x86-64"}, CK_x86_64, ~0U, FeaturesX86_64 },		{ {"x86-64"}, CK_x86_64, ~0U, FeaturesX86_64 },
{ {"x86-64-v2"}, CK_x86_64_v2, ~0U, FeaturesX86_64_V2 },		{ {"x86-64-v2"}, CK_x86_64_v2, ~0U, FeaturesX86_64_V2 },
{ {"x86-64-v3"}, CK_x86_64_v3, ~0U, FeaturesX86_64_V3 },		{ {"x86-64-v3"}, CK_x86_64_v3, ~0U, FeaturesX86_64_V3 },
{ {"x86-64-v4"}, CK_x86_64_v4, ~0U, FeaturesX86_64_V4 },		{ {"x86-64-v4"}, CK_x86_64_v4, ~0U, FeaturesX86_64_V4 },
// Geode processors.		// Geode processors.
{ {"geode"}, CK_Geode, ~0U, FeaturesGeode },		{ {"geode"}, CK_Geode, ~0U, FeaturesGeode },
};		};
		skanUnsubmitted Not Done Reply Inline Actions It's not correct to format here in this patch and do not mix tab with space. skan: It's not correct to format here in this patch and do not mix tab with space.

constexpr const char *NoTuneList[] = {"x86-64-v2", "x86-64-v3", "x86-64-v4"};		constexpr const char *NoTuneList[] = {"x86-64-v2", "x86-64-v3", "x86-64-v4"};

X86::CPUKind llvm::X86::parseArchX86(StringRef CPU, bool Only64Bit) {		X86::CPUKind llvm::X86::parseArchX86(StringRef CPU, bool Only64Bit) {
for (const auto &P : Processors)		for (const auto &P : Processors)
if (P.Name == CPU && (P.Features[FEATURE_64BIT] \|\| !Only64Bit))		if (P.Name == CPU && (P.Features[FEATURE_64BIT] \|\| !Only64Bit))
return P.Kind;		return P.Kind;

▲ Show 20 Lines • Show All 239 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86.td

Show First 20 Lines • Show All 262 Lines • ▼ Show 20 Lines
def FeatureSlowDivide64 : SubtargetFeature<"idivq-to-divl",		def FeatureSlowDivide64 : SubtargetFeature<"idivq-to-divl",
"HasSlowDivide64", "true",		"HasSlowDivide64", "true",
"Use 32-bit divide for positive values less than 2^32">;		"Use 32-bit divide for positive values less than 2^32">;
def FeaturePadShortFunctions : SubtargetFeature<"pad-short-functions",		def FeaturePadShortFunctions : SubtargetFeature<"pad-short-functions",
"PadShortFunctions", "true",		"PadShortFunctions", "true",
"Pad short functions">;		"Pad short functions">;
def FeatureINVPCID : SubtargetFeature<"invpcid", "HasINVPCID", "true",		def FeatureINVPCID : SubtargetFeature<"invpcid", "HasINVPCID", "true",
"Invalidate Process-Context Identifier">;		"Invalidate Process-Context Identifier">;
def FeatureSGX : SubtargetFeature<"sgx", "HasSGX", "true",		def FeatureSGX : SubtargetFeature<"sgx", "HasSGX", "true",
craig.topperUnsubmitted Not Done Reply Inline Actions Clang still puts it in target-features attribute so you can’t delete this or you’ll get a warning that the feature doesn’t exist. craig.topper: Clang still puts it in target-features attribute so you can’t delete this or you’ll get a…
"Enable Software Guard Extensions">;		"Enable Software Guard Extensions">;
def FeatureCLFLUSHOPT : SubtargetFeature<"clflushopt", "HasCLFLUSHOPT", "true",		def FeatureCLFLUSHOPT : SubtargetFeature<"clflushopt", "HasCLFLUSHOPT", "true",
skanUnsubmitted Not Done Reply Inline Actions If you delete the definition of FeatureSGX, you need to remove the related code in X86Subtarget.h too. BTW, I don't think "there are no IR intrinsics for a feature" is a good reason to remove a feature. skan: If you delete the definition of FeatureSGX, you need to remove the related code in X86Subtarget.
craig.topperUnsubmitted Not Done Reply Inline Actions I only said to remove it from the CPUs because for llc -march=skylake it doesn’t matter if we enable SGX because there’s nothing you can test from llc. craig.topper: I only said to remove it from the CPUs because for llc -march=skylake it doesn’t matter if we…
"Flush A Cache Line Optimized">;		"Flush A Cache Line Optimized">;
def FeatureCLWB : SubtargetFeature<"clwb", "HasCLWB", "true",		def FeatureCLWB : SubtargetFeature<"clwb", "HasCLWB", "true",
"Cache Line Write Back">;		"Cache Line Write Back">;
def FeatureWBNOINVD : SubtargetFeature<"wbnoinvd", "HasWBNOINVD", "true",		def FeatureWBNOINVD : SubtargetFeature<"wbnoinvd", "HasWBNOINVD", "true",
"Write Back No Invalidate">;		"Write Back No Invalidate">;
def FeatureRDPID : SubtargetFeature<"rdpid", "HasRDPID", "true",		def FeatureRDPID : SubtargetFeature<"rdpid", "HasRDPID", "true",
"Support RDPID instructions">;		"Support RDPID instructions">;
def FeatureWAITPKG : SubtargetFeature<"waitpkg", "HasWAITPKG", "true",		def FeatureWAITPKG : SubtargetFeature<"waitpkg", "HasWAITPKG", "true",
▲ Show 20 Lines • Show All 366 Lines • ▼ Show 20 Lines	def ProcessorFeatures {
list<SubtargetFeature> BDWTuning = HSWTuning;		list<SubtargetFeature> BDWTuning = HSWTuning;
list<SubtargetFeature> BDWFeatures =		list<SubtargetFeature> BDWFeatures =
!listconcat(HSWFeatures, BDWAdditionalFeatures);		!listconcat(HSWFeatures, BDWAdditionalFeatures);

// Skylake		// Skylake
list<SubtargetFeature> SKLAdditionalFeatures = [FeatureAES,		list<SubtargetFeature> SKLAdditionalFeatures = [FeatureAES,
FeatureXSAVEC,		FeatureXSAVEC,
FeatureXSAVES,		FeatureXSAVES,
FeatureCLFLUSHOPT,		FeatureCLFLUSHOPT];
FeatureSGX];
list<SubtargetFeature> SKLTuning = [FeatureHasFastGather,		list<SubtargetFeature> SKLTuning = [FeatureHasFastGather,
FeatureMacroFusion,		FeatureMacroFusion,
FeatureSlow3OpsLEA,		FeatureSlow3OpsLEA,
FeatureSlowDivide64,		FeatureSlowDivide64,
FeatureFastScalarFSQRT,		FeatureFastScalarFSQRT,
FeatureFastVectorFSQRT,		FeatureFastVectorFSQRT,
FeatureFastSHLDRotate,		FeatureFastSHLDRotate,
FeatureFast15ByteNOP,		FeatureFast15ByteNOP,
▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	def ProcessorFeatures {

// Icelake		// Icelake
list<SubtargetFeature> ICLAdditionalFeatures = [FeatureBITALG,		list<SubtargetFeature> ICLAdditionalFeatures = [FeatureBITALG,
FeatureVAES,		FeatureVAES,
FeatureVBMI2,		FeatureVBMI2,
FeatureVNNI,		FeatureVNNI,
FeatureVPCLMULQDQ,		FeatureVPCLMULQDQ,
FeatureVPOPCNTDQ,		FeatureVPOPCNTDQ,
FeatureGFNI,		FeatureGFNI,
FeatureRDPID,		FeatureRDPID,
		craig.topperUnsubmitted Not Done Reply Inline Actions I'm not sure that rocketlake has CLWB. Can you double check that? It's not listed in the cpuinfo dump on the 11700K that I found with a google search here https://www.pugetsystems.com/labs/hpc/Intel-Rocket-Lake-Compute-Performance-Results-HPL-HPCG-NAMD-and-Numpy-2116/ craig.topper: I'm not sure that rocketlake has CLWB. Can you double check that? It's not listed in the…
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions For now I have only an icelake-client machine and found that CLWB is not there, too. Guess I can do that modification in this patch? Rocketlake may probably lose CLWB. I'll double check. FreddyYe: For now I have only an icelake-client machine and found that CLWB is not there, too. Guess I…
		craig.topperUnsubmitted Done Reply Inline Actions What’s the model number for you ice lake client CPU? craig.topper: What’s the model number for you ice lake client CPU?
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions It is 0x7e. And I've double checked that rkl also hasn't CLWB. FreddyYe: It is 0x7e. And I've double checked that rkl also hasn't CLWB.
		craig.topperUnsubmitted Not Done Reply Inline Actions Sorry I meant the marketing name like "Intel® Core™ i7-1065G7" craig.topper: Sorry I meant the marketing name like "Intel® Core™ i7-1065G7"
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions It is `Intel(R) Core(TM) i7-1065G7 CPU @ 1.30GHz` FreddyYe: It is `Intel(R) Core(TM) i7-1065G7 CPU @ 1.30GHz`
FeatureFSRM];		FeatureFSRM];
list<SubtargetFeature> ICLTuning = CNLTuning;		list<SubtargetFeature> ICLTuning = CNLTuning;
list<SubtargetFeature> ICLFeatures =		list<SubtargetFeature> ICLFeatures =
!listconcat(CNLFeatures, ICLAdditionalFeatures);		!listconcat(CNLFeatures, ICLAdditionalFeatures);

// Icelake Server		// Icelake Server
list<SubtargetFeature> ICXAdditionalFeatures = [FeaturePCONFIG,		list<SubtargetFeature> ICXAdditionalFeatures = [FeaturePCONFIG,
FeatureCLWB,		FeatureCLWB,
FeatureWBNOINVD];		FeatureWBNOINVD];
list<SubtargetFeature> ICXTuning = CNLTuning;		list<SubtargetFeature> ICXTuning = CNLTuning;
list<SubtargetFeature> ICXFeatures =		list<SubtargetFeature> ICXFeatures =
!listconcat(ICLFeatures, ICXAdditionalFeatures);		!listconcat(ICLFeatures, ICXAdditionalFeatures);

//Tigerlake		// Tigerlake
list<SubtargetFeature> TGLAdditionalFeatures = [FeatureVP2INTERSECT,		list<SubtargetFeature> TGLAdditionalFeatures = [FeatureVP2INTERSECT,
FeatureCLWB,		FeatureCLWB,
FeatureMOVDIRI,		FeatureMOVDIRI,
FeatureMOVDIR64B,		FeatureMOVDIR64B,
FeatureSHSTK];		FeatureSHSTK];
list<SubtargetFeature> TGLTuning = CNLTuning;		list<SubtargetFeature> TGLTuning = CNLTuning;
list<SubtargetFeature> TGLFeatures =		list<SubtargetFeature> TGLFeatures =
!listconcat(ICLFeatures, TGLAdditionalFeatures );		!listconcat(ICLFeatures, TGLAdditionalFeatures );

//Sapphirerapids		// Sapphirerapids
list<SubtargetFeature> SPRAdditionalFeatures = [FeatureAMXTILE,		list<SubtargetFeature> SPRAdditionalFeatures = [FeatureAMXTILE,
		craig.topperUnsubmitted Not Done Reply Inline Actions Is this list this long because SKL includes SGX but RKL doesn't? craig.topper: Is this list this long because SKL includes SGX but RKL doesn't?
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions Yes. And I don't know any simple ways to exclude SGX here, any suggestions? FreddyYe: Yes. And I don't know any simple ways to exclude SGX here, any suggestions?
		craig.topperUnsubmitted Not Done Reply Inline Actions Nothing pretty. Guess it depends on if SGX is going to not appear in more future CPUs or if this is a one off case. If it's going to continue then we could remove it from the inheritance and just give it to SKL, ICL, CNL, etc. individually. Or we could just not default SGX on for any CPU. It's probably not all that useful in the backend anyway. Clang will put it in the target-feature attribute anyway. Having it in the backend feature lists doesn't really do anything since I don't think we have any IR intrinsics for SGX. craig.topper: Nothing pretty. Guess it depends on if SGX is going to not appear in more future CPUs or if…
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions Agree. Like we did in https://reviews.llvm.org/D88006. SGX is also not useful in the backend. FreddyYe: Agree. Like we did in https://reviews.llvm.org/D88006. SGX is also not useful in the backend.
FeatureAMXINT8,		FeatureAMXINT8,
FeatureAMXBF16,		FeatureAMXBF16,
FeatureBF16,		FeatureBF16,
FeatureSERIALIZE,		FeatureSERIALIZE,
FeatureCLDEMOTE,		FeatureCLDEMOTE,
FeatureWAITPKG,		FeatureWAITPKG,
FeaturePTWRITE,		FeaturePTWRITE,
FeatureAVXVNNI,		FeatureAVXVNNI,
FeatureTSXLDTRK,		FeatureTSXLDTRK,
FeatureENQCMD,		FeatureENQCMD,
FeatureSHSTK,		FeatureSHSTK,
FeatureVP2INTERSECT,		FeatureVP2INTERSECT,
FeatureMOVDIRI,		FeatureMOVDIRI,
FeatureMOVDIR64B,		FeatureMOVDIR64B,
FeatureUINTR];		FeatureUINTR];
list<SubtargetFeature> SPRTuning = ICXTuning;		list<SubtargetFeature> SPRTuning = ICXTuning;
list<SubtargetFeature> SPRFeatures =		list<SubtargetFeature> SPRFeatures =
!listconcat(ICXFeatures, SPRAdditionalFeatures);		!listconcat(ICXFeatures, SPRAdditionalFeatures);

// Atom		// Atom
list<SubtargetFeature> AtomFeatures = [FeatureX87,		list<SubtargetFeature> AtomFeatures = [FeatureX87,
FeatureCMPXCHG8B,		FeatureCMPXCHG8B,
FeatureCMOV,		FeatureCMOV,
		RKSimonUnsubmitted Not Done Reply Inline Actions Using ICLTuning suggests we should still be avoiding 512-bit ops (FeaturePrefer256Bit) - is this still true for RKL (or anything past CNL...)? I posted PR48336 but never got any response, but from what others have reported (Travis Downs, Phoronix etc) its mainly a power issue these days, not a perf issue due to big freq drops. RKSimon: Using ICLTuning suggests we should still be avoiding 512-bit ops (FeaturePrefer256Bit) - is…
		FreddyYeAuthorUnsubmitted Done Reply Inline Actions We need more tests on such as SPEC to see whether we can default enable FeaturePrefer512bit. FreddyYe: We need more tests on such as SPEC to see whether we can default enable FeaturePrefer512bit.
FeatureMMX,		FeatureMMX,
FeatureSSSE3,		FeatureSSSE3,
FeatureFXSR,		FeatureFXSR,
FeatureNOPL,		FeatureNOPL,
Feature64Bit,		Feature64Bit,
FeatureCMPXCHG16B,		FeatureCMPXCHG16B,
FeatureMOVBE,		FeatureMOVBE,
FeatureLAHFSAHF];		FeatureLAHFSAHF];
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines	list<SubtargetFeature> GLMTuning = [FeatureUseGLMDivSqrtCosts,
FeatureSlowIncDec,		FeatureSlowIncDec,
FeaturePOPCNTFalseDeps,		FeaturePOPCNTFalseDeps,
FeatureInsertVZEROUPPER];		FeatureInsertVZEROUPPER];
list<SubtargetFeature> GLMFeatures =		list<SubtargetFeature> GLMFeatures =
!listconcat(SLMFeatures, GLMAdditionalFeatures);		!listconcat(SLMFeatures, GLMAdditionalFeatures);

// Goldmont Plus		// Goldmont Plus
list<SubtargetFeature> GLPAdditionalFeatures = [FeaturePTWRITE,		list<SubtargetFeature> GLPAdditionalFeatures = [FeaturePTWRITE,
FeatureRDPID,		FeatureRDPID];
FeatureSGX];
list<SubtargetFeature> GLPTuning = [FeatureUseGLMDivSqrtCosts,		list<SubtargetFeature> GLPTuning = [FeatureUseGLMDivSqrtCosts,
FeatureSlowTwoMemOps,		FeatureSlowTwoMemOps,
FeatureSlowLEA,		FeatureSlowLEA,
FeatureSlowIncDec,		FeatureSlowIncDec,
FeatureInsertVZEROUPPER];		FeatureInsertVZEROUPPER];
list<SubtargetFeature> GLPFeatures =		list<SubtargetFeature> GLPFeatures =
!listconcat(GLMFeatures, GLPAdditionalFeatures);		!listconcat(GLMFeatures, GLPAdditionalFeatures);

▲ Show 20 Lines • Show All 444 Lines • ▼ Show 20 Lines
def : ProcModel<"cascadelake", SkylakeServerModel,		def : ProcModel<"cascadelake", SkylakeServerModel,
ProcessorFeatures.CLXFeatures, ProcessorFeatures.CLXTuning>;		ProcessorFeatures.CLXFeatures, ProcessorFeatures.CLXTuning>;
def : ProcModel<"cooperlake", SkylakeServerModel,		def : ProcModel<"cooperlake", SkylakeServerModel,
ProcessorFeatures.CPXFeatures, ProcessorFeatures.CPXTuning>;		ProcessorFeatures.CPXFeatures, ProcessorFeatures.CPXTuning>;
def : ProcModel<"cannonlake", SkylakeServerModel,		def : ProcModel<"cannonlake", SkylakeServerModel,
ProcessorFeatures.CNLFeatures, ProcessorFeatures.CNLTuning>;		ProcessorFeatures.CNLFeatures, ProcessorFeatures.CNLTuning>;
def : ProcModel<"icelake-client", SkylakeServerModel,		def : ProcModel<"icelake-client", SkylakeServerModel,
ProcessorFeatures.ICLFeatures, ProcessorFeatures.ICLTuning>;		ProcessorFeatures.ICLFeatures, ProcessorFeatures.ICLTuning>;
		def : ProcModel<"rocketlake", SkylakeServerModel,
		ProcessorFeatures.ICLFeatures, ProcessorFeatures.ICLTuning>;
def : ProcModel<"icelake-server", SkylakeServerModel,		def : ProcModel<"icelake-server", SkylakeServerModel,
ProcessorFeatures.ICXFeatures, ProcessorFeatures.ICXTuning>;		ProcessorFeatures.ICXFeatures, ProcessorFeatures.ICXTuning>;
def : ProcModel<"tigerlake", SkylakeServerModel,		def : ProcModel<"tigerlake", SkylakeServerModel,
ProcessorFeatures.TGLFeatures, ProcessorFeatures.TGLTuning>;		ProcessorFeatures.TGLFeatures, ProcessorFeatures.TGLTuning>;
def : ProcModel<"sapphirerapids", SkylakeServerModel,		def : ProcModel<"sapphirerapids", SkylakeServerModel,
ProcessorFeatures.SPRFeatures, ProcessorFeatures.SPRTuning>;		ProcessorFeatures.SPRFeatures, ProcessorFeatures.SPRTuning>;
def : ProcModel<"alderlake", SkylakeClientModel,		def : ProcModel<"alderlake", SkylakeClientModel,
ProcessorFeatures.ADLFeatures, ProcessorFeatures.ADLTuning>;		ProcessorFeatures.ADLFeatures, ProcessorFeatures.ADLTuning>;
▲ Show 20 Lines • Show All 176 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/cpus-intel.ll

	Show All 32 Lines
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=broadwell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=broadwell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skylake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skylake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skylake-avx512 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skylake-avx512 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=skx 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cascadelake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cascadelake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cooperlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cooperlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cannonlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=cannonlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake-client 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake-client 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
				; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=rocketlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake-server 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=icelake-server 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=tigerlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=tigerlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=sapphirerapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=sapphirerapids 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=alderlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=alderlake 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=atom 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=atom 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=bonnell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=bonnell 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=silvermont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=silvermont 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=slm 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty			; RUN: llc < %s -o /dev/null -mtriple=x86_64-unknown-unknown -mcpu=slm 2>&1 \| FileCheck %s --check-prefix=CHECK-NO-ERROR --allow-empty
	Show All 9 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[X86] Support -march=rocketlakeClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 337020

clang/lib/Basic/Targets/X86.cpp

clang/test/CodeGen/attr-target-mv.c

clang/test/CodeGen/target-builtin-noerror.c

clang/test/Driver/x86-march.c

clang/test/Misc/target-invalid-cpu-note.c

clang/test/Preprocessor/predefined-arch-macros.c

compiler-rt/lib/builtins/cpu_model.c

llvm/include/llvm/Support/X86TargetParser.h

llvm/include/llvm/Support/X86TargetParser.def

llvm/lib/Support/Host.cpp

llvm/lib/Support/X86TargetParser.cpp

llvm/lib/Target/X86/X86.td

llvm/test/CodeGen/X86/cpus-intel.ll

[X86] Support -march=rocketlake
ClosedPublic