This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
docs/
-
ReleaseNotes.rst
-
include/clang/Driver/
-
clang/
-
Driver/
-
Options.td
-
lib/
-
Basic/Targets/
-
Targets/
1/2
X86.h
1/2
X86.cpp
-
CodeGen/
-
CGBuiltin.cpp
-
CodeGenFunction.h
1/2
CodeGenFunction.cpp
-
Targets/
-
X86.cpp
-
Driver/ToolChains/Arch/
-
ToolChains/
-
Arch/
1/2
X86.cpp
-
test/
-
CodeGen/
-
X86/
-
avx10-error.c
-
attr-target-x86.c
-
target-avx-abi-diag.c
-
Driver/
-
x86-target-features.c
-
Preprocessor/
-
x86_target_features.c
-
llvm/
-
docs/
-
ReleaseNotes.rst
-
include/llvm/TargetParser/
-
llvm/
-
TargetParser/
-
X86TargetParser.def
-
lib/
-
IR/
-
Verifier.cpp
-
Target/X86/
-
X86/
-
MCTargetDesc/
2/4
X86MCCodeEmitter.cpp
-
X86.td
-
X86InstrInfo.td
-
X86RegisterInfo.cpp
-
X86Subtarget.h
-
TargetParser/
-
Host.cpp
-
X86TargetParser.cpp
-
test/CodeGen/X86/
-
CodeGen/
-
X86/
-
avx512-arith.ll
-
avx512-broadcast-arith.ll
-
avx512bw-arith.ll
-
avx512bwvl-arith.ll
-
avx512fp16-arith.ll
-
avx512vl-arith.ll

Differential D157485

[X86][RFC] Support new feature AVX10
Changes PlannedPublic

Authored by pengfei on Aug 9 2023, 3:06 AM.

Download Raw Diff

Details

Reviewers

RKSimon
craig.topper
skan
e-kud

Summary

AVX10 Architecture Specification: https://cdrdv2.intel.com/v1/dl/getContent/784267
AVX10 Technical Paper: https://cdrdv2.intel.com/v1/dl/getContent/784343
RFC: https://discourse.llvm.org/t/rfc-design-for-avx10-feature-support/72661

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

pengfei created this revision.Aug 9 2023, 3:06 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 9 2023, 3:06 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

pengfei requested review of this revision.Aug 9 2023, 3:06 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptAug 9 2023, 3:06 AM

Herald added subscribers: llvm-commits, cfe-commits, MaskRay. · View Herald Transcript

pengfei edited the summary of this revision. (Show Details)Aug 9 2023, 3:08 AM

Harbormaster completed remote builds in B251333: Diff 548541.Aug 9 2023, 5:48 AM

goldstein.w.n added a subscriber: goldstein.w.n.Aug 9 2023, 8:22 AM

goldstein.w.n added inline comments.

clang/lib/Basic/Targets/X86.h
99	Maybe should be HasAVX10_1_512? As brought up the rfc, there might be an avx10.2-512 Likewise elsewhere, or is this to match GCC?

Matt added a subscriber: Matt.Aug 9 2023, 3:10 PM

pengfei added inline comments.Aug 10 2023, 7:23 AM

clang/lib/Basic/Targets/X86.h
99	No. `HasAVX10_512BIT` is a single feature which can be combined with `HasAVX10_1`, `HasAVX10_2` etc. AFAIK, GCC chooses the similar idea here.

Ping~ It looks to me there's no concern about this solution in the RFC. I think we can move forward to land it.

skan added inline comments.Aug 15 2023, 11:33 PM

clang/lib/CodeGen/CodeGenFunction.cpp
2581	Minor suggestion. The code here may be refined to be better extended by other targets, like llvm::Triple::ArchType ArchType = getContext().getTargetInfo().getTriple().getArch(); switch (ArchType) { case llvm::Triple::x86: case llvm::Triple::x86_64: { .... } default: return;
llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
924	I think what you need here is `TSFlags & X86II::EVEX_L2` instead of `getL2()`. The class `X86OpcodePrefixHelper` is designed for encoding only. The bit `L2` can be set in other cases so it may blur the meaning of 512-bit here you use the getter.
926	-mavx10.1 does not work for assembler. So if such instruction is generated w/o AVX10-512BIT support, it must be compiler's issue instead of user's. An `assert` should be more appropriate here.

skan added inline comments.Aug 15 2023, 11:54 PM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
926	-mavx10.1 does not work for assembler. So if such instruction is generated w/o AVX10-512BIT support, it must be compiler's issue instead of user's. An `assert` should be more appropriate here. Reference: https://llvm.org/docs/CodingStandards.html#assert-liberally

Address comment.

clang/lib/CodeGen/CodeGenFunction.cpp
2581	We have a few place code using `isX86`. I think it's more convenient to use a single condition. We can refactor when necessary.
llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
926	We need to report fatal error for this case even if it's a compiler bug. Otherwise, user may observe the crash issue in runtime and hard to find the reason.

Harbormaster completed remote builds in B252888: Diff 550669.Aug 16 2023, 5:43 AM

LGTM

This revision is now accepted and ready to land.Aug 16 2023, 6:17 PM

MaskRay added inline comments.Aug 16 2023, 6:26 PM

clang/lib/Basic/Targets/X86.cpp
739	This is untested?

MaskRay added inline comments.Aug 16 2023, 7:22 PM

clang/lib/Driver/ToolChains/Arch/X86.cpp
261	`warn_drv_overriding_flag_option` is under the group `-Woverriding-t-option`, which was intended for clang-cl `/T*` options (`D1290`). I created D158137 to add `-Woverriding-option`.

Just curious, in RFC we have -mavx10.x-256/-mavx10.x-512 but here we refer to -mavx10.x/-mavx10.x,-mavx10-512bit. Is it compliant with GCC, or the revision is just for the illustrative purpose?

pengfei mentioned this in D159250: [X86][RFC] Add new option `-m[no-]evex512` to disable ZMM and 64-bit mask instructions for AVX512 features.Aug 30 2023, 11:50 PM

In D157485#4597603, @e-kud wrote:

Just curious, in RFC we have -mavx10.x-256/-mavx10.x-512 but here we refer to -mavx10.x/-mavx10.x,-mavx10-512bit. Is it compliant with GCC, or the revision is just for the illustrative purpose?

Sorry for the late reply. We have received a couple concerns about how to interpret these options, especially when used together with AVX512 options. We decided not to provide AVX10.1 options at the present, instead, we just provide -m[no-]evex512 to disable ZMM and 64-bit mask instructions for AVX512 features. For more details, lease take a look at D159250.

clang/lib/Basic/Targets/X86.cpp
739	Done in an alternative reversion D159250.
clang/lib/Driver/ToolChains/Arch/X86.cpp
261	Thanks! This is not needed in the new version.

pengfei mentioned this in rG7dd48cc24de2: [X86][RFC] Add new option `-m[no-]evex512` to disable ZMM and 64-bit mask….Sep 7 2023, 6:38 AM

pengfei mentioned this in rG24194090e17b: [X86][RFC] Add new option `-m[no-]evex512` to disable ZMM and 64-bit mask….Sep 8 2023, 7:47 AM

Revision Contents

Path

Size

clang/

docs/

ReleaseNotes.rst

2 lines

include/

clang/

Driver/

Options.td

4 lines

lib/

Basic/

Targets/

X86.h

2 lines

X86.cpp

25 lines

CodeGen/

CGBuiltin.cpp

5 lines

CodeGenFunction.h

2 lines

CodeGenFunction.cpp

14 lines

Targets/

X86.cpp

22 lines

Driver/

ToolChains/

Arch/

X86.cpp

42 lines

test/

CodeGen/

X86/

avx10-error.c

9 lines

attr-target-x86.c

4 lines

target-avx-abi-diag.c

7 lines

Driver/

x86-target-features.c

19 lines

Preprocessor/

x86_target_features.c

12 lines

llvm/

docs/

ReleaseNotes.rst

2 lines

include/

llvm/

TargetParser/

X86TargetParser.def

2 lines

lib/

IR/

Verifier.cpp

22 lines

Target/

X86/

MCTargetDesc/

7 lines

7 lines

2 lines

9 lines

3 lines

TargetParser/

Host.cpp

5 lines

X86TargetParser.cpp

6 lines

test/

CodeGen/

X86/

avx512-arith.ll

1 line

avx512-broadcast-arith.ll

1 line

1 line

1 line

1 line

1 line

Diff 550669

clang/docs/ReleaseNotes.rst

	Show First 20 Lines • Show All 154 Lines • ▼ Show 20 Lines
	-----------------------			-----------------------

	AMDGPU Support			AMDGPU Support
	^^^^^^^^^^^^^^			^^^^^^^^^^^^^^

	X86 Support			X86 Support
	^^^^^^^^^^^			^^^^^^^^^^^

				- Support ISA of ``AVX10.1``.

	Arm and AArch64 Support			Arm and AArch64 Support
	^^^^^^^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^^^^^^^

	Windows Support			Windows Support
	^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^

	LoongArch Support			LoongArch Support
	^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^
	▲ Show 20 Lines • Show All 68 Lines • Show Last 20 Lines

clang/include/clang/Driver/Options.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 4,933 Lines • ▼ Show 20 Lines
	// -mno-sse4 turns off sse4.1 which has the effect of turning off everything			// -mno-sse4 turns off sse4.1 which has the effect of turning off everything
	// later than 4.1. -msse4 turns on 4.2 which has the effect of turning on			// later than 4.1. -msse4 turns on 4.2 which has the effect of turning on
	// everything earlier than 4.2.			// everything earlier than 4.2.
	def mno_sse4 : Flag<["-"], "mno-sse4">, Alias<mno_sse4_1>;			def mno_sse4 : Flag<["-"], "mno-sse4">, Alias<mno_sse4_1>;
	def msse4a : Flag<["-"], "msse4a">, Group<m_x86_Features_Group>;			def msse4a : Flag<["-"], "msse4a">, Group<m_x86_Features_Group>;
	def mno_sse4a : Flag<["-"], "mno-sse4a">, Group<m_x86_Features_Group>;			def mno_sse4a : Flag<["-"], "mno-sse4a">, Group<m_x86_Features_Group>;
	def mavx : Flag<["-"], "mavx">, Group<m_x86_Features_Group>;			def mavx : Flag<["-"], "mavx">, Group<m_x86_Features_Group>;
	def mno_avx : Flag<["-"], "mno-avx">, Group<m_x86_Features_Group>;			def mno_avx : Flag<["-"], "mno-avx">, Group<m_x86_Features_Group>;
				def mavx10_1 : Flag<["-"], "mavx10.1">, Group<m_x86_Features_Group>;
				def mno_avx10_1 : Flag<["-"], "mno-avx10.1">, Group<m_x86_Features_Group>;
				def mavx10_1_256 : Flag<["-"], "mavx10.1-256">, Group<m_x86_Features_Group>;
				def mavx10_1_512 : Flag<["-"], "mavx10.1-512">, Group<m_x86_Features_Group>;
	def mavx2 : Flag<["-"], "mavx2">, Group<m_x86_Features_Group>;			def mavx2 : Flag<["-"], "mavx2">, Group<m_x86_Features_Group>;
	def mno_avx2 : Flag<["-"], "mno-avx2">, Group<m_x86_Features_Group>;			def mno_avx2 : Flag<["-"], "mno-avx2">, Group<m_x86_Features_Group>;
	def mavx512f : Flag<["-"], "mavx512f">, Group<m_x86_Features_Group>;			def mavx512f : Flag<["-"], "mavx512f">, Group<m_x86_Features_Group>;
	def mno_avx512f : Flag<["-"], "mno-avx512f">, Group<m_x86_Features_Group>;			def mno_avx512f : Flag<["-"], "mno-avx512f">, Group<m_x86_Features_Group>;
	def mavx512bf16 : Flag<["-"], "mavx512bf16">, Group<m_x86_Features_Group>;			def mavx512bf16 : Flag<["-"], "mavx512bf16">, Group<m_x86_Features_Group>;
	def mno_avx512bf16 : Flag<["-"], "mno-avx512bf16">, Group<m_x86_Features_Group>;			def mno_avx512bf16 : Flag<["-"], "mno-avx512bf16">, Group<m_x86_Features_Group>;
	def mavx512bitalg : Flag<["-"], "mavx512bitalg">, Group<m_x86_Features_Group>;			def mavx512bitalg : Flag<["-"], "mavx512bitalg">, Group<m_x86_Features_Group>;
	def mno_avx512bitalg : Flag<["-"], "mno-avx512bitalg">, Group<m_x86_Features_Group>;			def mno_avx512bitalg : Flag<["-"], "mno-avx512bitalg">, Group<m_x86_Features_Group>;
	▲ Show 20 Lines • Show All 2,525 Lines • Show Last 20 Lines

clang/lib/Basic/Targets/X86.h

Show First 20 Lines • Show All 89 Lines • ▼ Show 20 Lines	class LLVM_LIBRARY_VISIBILITY X86TargetInfo : public TargetInfo {
bool HasRTM = false;		bool HasRTM = false;
bool HasPRFCHW = false;		bool HasPRFCHW = false;
bool HasRDSEED = false;		bool HasRDSEED = false;
bool HasADX = false;		bool HasADX = false;
bool HasTBM = false;		bool HasTBM = false;
bool HasLWP = false;		bool HasLWP = false;
bool HasFMA = false;		bool HasFMA = false;
bool HasF16C = false;		bool HasF16C = false;
		bool HasAVX10_1 = false;
		bool HasAVX10_512BIT = false;
		goldstein.w.nUnsubmitted Not Done Reply Inline Actions Maybe should be HasAVX10_1_512? As brought up the rfc, there might be an avx10.2-512 Likewise elsewhere, or is this to match GCC? goldstein.w.n: Maybe should be HasAVX10_1_512? As brought up the rfc, there might be an avx10.2-512 Likewise…
		pengfeiAuthorUnsubmitted Done Reply Inline Actions No. `HasAVX10_512BIT` is a single feature which can be combined with `HasAVX10_1`, `HasAVX10_2` etc. AFAIK, GCC chooses the similar idea here. pengfei: No. `HasAVX10_512BIT` is a single feature which can be combined with `HasAVX10_1`, `HasAVX10_2`…
bool HasAVX512CD = false;		bool HasAVX512CD = false;
bool HasAVX512VPOPCNTDQ = false;		bool HasAVX512VPOPCNTDQ = false;
bool HasAVX512VNNI = false;		bool HasAVX512VNNI = false;
bool HasAVX512FP16 = false;		bool HasAVX512FP16 = false;
bool HasAVX512BF16 = false;		bool HasAVX512BF16 = false;
bool HasAVX512ER = false;		bool HasAVX512ER = false;
bool HasAVX512PF = false;		bool HasAVX512PF = false;
bool HasAVX512DQ = false;		bool HasAVX512DQ = false;
▲ Show 20 Lines • Show All 890 Lines • Show Last 20 Lines

clang/lib/Basic/Targets/X86.cpp

Show First 20 Lines • Show All 222 Lines • ▼ Show 20 Lines	for (const auto &Feature : Features) {
} else if (Feature == "+lwp") {		} else if (Feature == "+lwp") {
HasLWP = true;		HasLWP = true;
} else if (Feature == "+fma") {		} else if (Feature == "+fma") {
HasFMA = true;		HasFMA = true;
} else if (Feature == "+f16c") {		} else if (Feature == "+f16c") {
HasF16C = true;		HasF16C = true;
} else if (Feature == "+gfni") {		} else if (Feature == "+gfni") {
HasGFNI = true;		HasGFNI = true;
		} else if (Feature == "+avx10.1") {
		HasAVX10_1 = true;
		} else if (Feature == "+avx10-512bit") {
		HasAVX10_512BIT = true;
} else if (Feature == "+avx512cd") {		} else if (Feature == "+avx512cd") {
HasAVX512CD = true;		HasAVX512CD = true;
} else if (Feature == "+avx512vpopcntdq") {		} else if (Feature == "+avx512vpopcntdq") {
HasAVX512VPOPCNTDQ = true;		HasAVX512VPOPCNTDQ = true;
} else if (Feature == "+avx512vnni") {		} else if (Feature == "+avx512vnni") {
HasAVX512VNNI = true;		HasAVX512VNNI = true;
} else if (Feature == "+avx512bf16") {		} else if (Feature == "+avx512bf16") {
HasAVX512BF16 = true;		HasAVX512BF16 = true;
▲ Show 20 Lines • Show All 485 Lines • ▼ Show 20 Lines	if (HasFMA)
Builder.defineMacro("__FMA__");		Builder.defineMacro("__FMA__");

if (HasF16C)		if (HasF16C)
Builder.defineMacro("__F16C__");		Builder.defineMacro("__F16C__");

if (HasGFNI)		if (HasGFNI)
Builder.defineMacro("__GFNI__");		Builder.defineMacro("__GFNI__");

		if (HasAVX10_1)
		Builder.defineMacro("__AVX10_1__");
		if (HasAVX10_512BIT)
		Builder.defineMacro("__AVX10_512BIT__");
		MaskRayUnsubmitted Not Done Reply Inline Actions This is untested? MaskRay: This is untested?
		pengfeiAuthorUnsubmitted Done Reply Inline Actions Done in an alternative reversion D159250. pengfei: Done in an alternative reversion D159250.

if (HasAVX512CD)		if (HasAVX512CD)
Builder.defineMacro("__AVX512CD__");		Builder.defineMacro("__AVX512CD__");
if (HasAVX512VPOPCNTDQ)		if (HasAVX512VPOPCNTDQ)
Builder.defineMacro("__AVX512VPOPCNTDQ__");		Builder.defineMacro("__AVX512VPOPCNTDQ__");
if (HasAVX512VNNI)		if (HasAVX512VNNI)
Builder.defineMacro("__AVX512VNNI__");		Builder.defineMacro("__AVX512VNNI__");
if (HasAVX512BF16)		if (HasAVX512BF16)
Builder.defineMacro("__AVX512BF16__");		Builder.defineMacro("__AVX512BF16__");
▲ Show 20 Lines • Show All 207 Lines • ▼ Show 20 Lines	return llvm::StringSwitch<bool>(Name)
.Case("adx", true)		.Case("adx", true)
.Case("aes", true)		.Case("aes", true)
.Case("amx-bf16", true)		.Case("amx-bf16", true)
.Case("amx-complex", true)		.Case("amx-complex", true)
.Case("amx-fp16", true)		.Case("amx-fp16", true)
.Case("amx-int8", true)		.Case("amx-int8", true)
.Case("amx-tile", true)		.Case("amx-tile", true)
.Case("avx", true)		.Case("avx", true)
		.Case("avx10-512bit", true)
		.Case("avx10.1", true)
.Case("avx2", true)		.Case("avx2", true)
.Case("avx512f", true)		.Case("avx512f", true)
.Case("avx512cd", true)		.Case("avx512cd", true)
.Case("avx512vpopcntdq", true)		.Case("avx512vpopcntdq", true)
.Case("avx512vnni", true)		.Case("avx512vnni", true)
.Case("avx512bf16", true)		.Case("avx512bf16", true)
.Case("avx512er", true)		.Case("avx512er", true)
.Case("avx512fp16", true)		.Case("avx512fp16", true)
▲ Show 20 Lines • Show All 90 Lines • ▼ Show 20 Lines	return llvm::StringSwitch<bool>(Feature)
.Case("adx", HasADX)		.Case("adx", HasADX)
.Case("aes", HasAES)		.Case("aes", HasAES)
.Case("amx-bf16", HasAMXBF16)		.Case("amx-bf16", HasAMXBF16)
.Case("amx-complex", HasAMXCOMPLEX)		.Case("amx-complex", HasAMXCOMPLEX)
.Case("amx-fp16", HasAMXFP16)		.Case("amx-fp16", HasAMXFP16)
.Case("amx-int8", HasAMXINT8)		.Case("amx-int8", HasAMXINT8)
.Case("amx-tile", HasAMXTILE)		.Case("amx-tile", HasAMXTILE)
.Case("avx", SSELevel >= AVX)		.Case("avx", SSELevel >= AVX)
		.Case("avx10-512bit", HasAVX10_512BIT)
		.Case("avx10.1", HasAVX10_1)
.Case("avx2", SSELevel >= AVX2)		.Case("avx2", SSELevel >= AVX2)
.Case("avx512f", SSELevel >= AVX512F)		.Case("avx512f", SSELevel >= AVX512F)
.Case("avx512cd", HasAVX512CD)		.Case("avx512cd", HasAVX512CD)
.Case("avx512vpopcntdq", HasAVX512VPOPCNTDQ)		.Case("avx512vpopcntdq", HasAVX512VPOPCNTDQ)
.Case("avx512vnni", HasAVX512VNNI)		.Case("avx512vnni", HasAVX512VNNI)
.Case("avx512bf16", HasAVX512BF16)		.Case("avx512bf16", HasAVX512BF16)
.Case("avx512er", HasAVX512ER)		.Case("avx512er", HasAVX512ER)
.Case("avx512fp16", HasAVX512FP16)		.Case("avx512fp16", HasAVX512FP16)
▲ Show 20 Lines • Show All 455 Lines • ▼ Show 20 Lines	case 'Y':
default:		default:
return false;		return false;
case 'm':		case 'm':
// 'Ym' is synonymous with 'y'.		// 'Ym' is synonymous with 'y'.
case 'k':		case 'k':
return Size <= 64;		return Size <= 64;
case 'z':		case 'z':
// XMM0/YMM/ZMM0		// XMM0/YMM/ZMM0
if (hasFeatureEnabled(FeatureMap, "avx512f"))		if (hasFeatureEnabled(FeatureMap, "avx10.1") &&
		!hasFeatureEnabled(FeatureMap, "avx10-512bit"))
		// ZMM0 cannot be used if target only supports AVX10.x.
		return Size <= 256U;
		else if (hasFeatureEnabled(FeatureMap, "avx512f"))
// ZMM0 can be used if target supports AVX512F.		// ZMM0 can be used if target supports AVX512F.
return Size <= 512U;		return Size <= 512U;
else if (hasFeatureEnabled(FeatureMap, "avx"))		else if (hasFeatureEnabled(FeatureMap, "avx"))
// YMM0 can be used if target supports AVX.		// YMM0 can be used if target supports AVX.
return Size <= 256U;		return Size <= 256U;
else if (hasFeatureEnabled(FeatureMap, "sse"))		else if (hasFeatureEnabled(FeatureMap, "sse"))
return Size <= 128U;		return Size <= 128U;
return false;		return false;
case 'i':		case 'i':
case 't':		case 't':
case '2':		case '2':
// 'Yi','Yt','Y2' are synonymous with 'x' when SSE2 is enabled.		// 'Yi','Yt','Y2' are synonymous with 'x' when SSE2 is enabled.
if (SSELevel < SSE2)		if (SSELevel < SSE2)
return false;		return false;
break;		break;
}		}
break;		break;
case 'v':		case 'v':
case 'x':		case 'x':
if (hasFeatureEnabled(FeatureMap, "avx512f"))		if (hasFeatureEnabled(FeatureMap, "avx10.1") &&
		!hasFeatureEnabled(FeatureMap, "avx10-512bit"))
		// 512-bit zmm registers cannot be used if target only supports AVX10.x.
		return Size <= 256U;
		else if (hasFeatureEnabled(FeatureMap, "avx512f"))
// 512-bit zmm registers can be used if target supports AVX512F.		// 512-bit zmm registers can be used if target supports AVX512F.
return Size <= 512U;		return Size <= 512U;
else if (hasFeatureEnabled(FeatureMap, "avx"))		else if (hasFeatureEnabled(FeatureMap, "avx"))
// 256-bit ymm registers can be used if target supports AVX.		// 256-bit ymm registers can be used if target supports AVX.
return Size <= 256U;		return Size <= 256U;
return Size <= 128U;		return Size <= 128U;

}		}
▲ Show 20 Lines • Show All 81 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGBuiltin.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 5,408 Lines • ▼ Show 20 Lines

	// Check that a call to a target specific builtin has the correct target			// Check that a call to a target specific builtin has the correct target
	// features.			// features.
	// This is down here to avoid non-target specific builtins, however, if			// This is down here to avoid non-target specific builtins, however, if
	// generic builtins start to require generic target features then we			// generic builtins start to require generic target features then we
	// can move this up to the beginning of the function.			// can move this up to the beginning of the function.
	checkTargetFeatures(E, FD);			checkTargetFeatures(E, FD);

	if (unsigned VectorWidth = getContext().BuiltinInfo.getRequiredVectorWidth(BuiltinID))			if (unsigned VectorWidth =
				getContext().BuiltinInfo.getRequiredVectorWidth(BuiltinID)) {
				checkTargetVectorWidth(E, FD, VectorWidth);
	LargestVectorWidth = std::max(LargestVectorWidth, VectorWidth);			LargestVectorWidth = std::max(LargestVectorWidth, VectorWidth);
				}

	// See if we have a target specific intrinsic.			// See if we have a target specific intrinsic.
	StringRef Name = getContext().BuiltinInfo.getName(BuiltinID);			StringRef Name = getContext().BuiltinInfo.getName(BuiltinID);
	Intrinsic::ID IntrinsicID = Intrinsic::not_intrinsic;			Intrinsic::ID IntrinsicID = Intrinsic::not_intrinsic;
	StringRef Prefix =			StringRef Prefix =
	llvm::Triple::getArchTypePrefix(getTarget().getTriple().getArch());			llvm::Triple::getArchTypePrefix(getTarget().getTriple().getArch());
	if (!Prefix.empty()) {			if (!Prefix.empty()) {
	IntrinsicID = Intrinsic::getIntrinsicForClangBuiltin(Prefix.data(), Name);			IntrinsicID = Intrinsic::getIntrinsicForClangBuiltin(Prefix.data(), Name);
	▲ Show 20 Lines • Show All 15,056 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenFunction.h

Show First 20 Lines • Show All 4,061 Lines • ▼ Show 20 Lines	RValue EmitCall(QualType FnType, const CGCallee &Callee, const CallExpr *E,
ReturnValueSlot ReturnValue, llvm::Value *Chain = nullptr);		ReturnValueSlot ReturnValue, llvm::Value *Chain = nullptr);
RValue EmitCallExpr(const CallExpr *E,		RValue EmitCallExpr(const CallExpr *E,
ReturnValueSlot ReturnValue = ReturnValueSlot());		ReturnValueSlot ReturnValue = ReturnValueSlot());
RValue EmitSimpleCallExpr(const CallExpr *E, ReturnValueSlot ReturnValue);		RValue EmitSimpleCallExpr(const CallExpr *E, ReturnValueSlot ReturnValue);
CGCallee EmitCallee(const Expr *E);		CGCallee EmitCallee(const Expr *E);

void checkTargetFeatures(const CallExpr E, const FunctionDecl TargetDecl);		void checkTargetFeatures(const CallExpr E, const FunctionDecl TargetDecl);
void checkTargetFeatures(SourceLocation Loc, const FunctionDecl *TargetDecl);		void checkTargetFeatures(SourceLocation Loc, const FunctionDecl *TargetDecl);
		void checkTargetVectorWidth(const CallExpr E, const FunctionDecl TargetDecl,
		unsigned VectorWidth);

llvm::CallInst *EmitRuntimeCall(llvm::FunctionCallee callee,		llvm::CallInst *EmitRuntimeCall(llvm::FunctionCallee callee,
const Twine &name = "");		const Twine &name = "");
llvm::CallInst *EmitRuntimeCall(llvm::FunctionCallee callee,		llvm::CallInst *EmitRuntimeCall(llvm::FunctionCallee callee,
ArrayRef<llvm::Value *> args,		ArrayRef<llvm::Value *> args,
const Twine &name = "");		const Twine &name = "");
llvm::CallInst *EmitNounwindRuntimeCall(llvm::FunctionCallee callee,		llvm::CallInst *EmitNounwindRuntimeCall(llvm::FunctionCallee callee,
const Twine &name = "");		const Twine &name = "");
▲ Show 20 Lines • Show All 867 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenFunction.cpp

	Show First 20 Lines • Show All 2,567 Lines • ▼ Show 20 Lines

	// Emits an error if we don't have a valid set of target features for the			// Emits an error if we don't have a valid set of target features for the
	// called function.			// called function.
	void CodeGenFunction::checkTargetFeatures(const CallExpr *E,			void CodeGenFunction::checkTargetFeatures(const CallExpr *E,
	const FunctionDecl *TargetDecl) {			const FunctionDecl *TargetDecl) {
	return checkTargetFeatures(E->getBeginLoc(), TargetDecl);			return checkTargetFeatures(E->getBeginLoc(), TargetDecl);
	}			}

				// Emits an error if the builtin's vector width >= 512 and avx10-512bit
				// feature is not set.
				void CodeGenFunction::checkTargetVectorWidth(const CallExpr *E,
				const FunctionDecl *TargetDecl,
				unsigned VectorWidth) {
				if (!getTarget().getTriple().isX86() \|\| VectorWidth < 512)
				skanUnsubmitted Not Done Reply Inline Actions Minor suggestion. The code here may be refined to be better extended by other targets, like llvm::Triple::ArchType ArchType = getContext().getTargetInfo().getTriple().getArch(); switch (ArchType) { case llvm::Triple::x86: case llvm::Triple::x86_64: { .... } default: return; skan: Minor suggestion. The code here may be refined to be better extended by other targets, like ```…
				pengfeiAuthorUnsubmitted Done Reply Inline Actions We have a few place code using `isX86`. I think it's more convenient to use a single condition. We can refactor when necessary. pengfei: We have a few place code using `isX86`. I think it's more convenient to use a single condition.
				return;
				llvm::StringMap<bool> FeatureMap;
				CGM.getContext().getFunctionFeatureMap(FeatureMap, TargetDecl);
				if (FeatureMap.lookup("avx10.1") && !FeatureMap.lookup("avx10-512bit"))
				CGM.getDiags().Report(E->getBeginLoc(), diag::err_builtin_needs_feature)
				<< TargetDecl->getDeclName() << "avx10-512bit";
				}

	// Emits an error if we don't have a valid set of target features for the			// Emits an error if we don't have a valid set of target features for the
	// called function.			// called function.
	void CodeGenFunction::checkTargetFeatures(SourceLocation Loc,			void CodeGenFunction::checkTargetFeatures(SourceLocation Loc,
	const FunctionDecl *TargetDecl) {			const FunctionDecl *TargetDecl) {
	// Early exit if this is an indirect call.			// Early exit if this is an indirect call.
	if (!TargetDecl)			if (!TargetDecl)
	return;			return;

	▲ Show 20 Lines • Show All 340 Lines • Show Last 20 Lines

clang/lib/CodeGen/Targets/X86.cpp

Show First 20 Lines • Show All 1,480 Lines • ▼ Show 20 Lines	if (CalleeMap.empty() && CallerMap.empty()) {
// The caller is potentially nullptr in the case where the call isn't in a		// The caller is potentially nullptr in the case where the call isn't in a
// function. In this case, the getFunctionFeatureMap ensures we just get		// function. In this case, the getFunctionFeatureMap ensures we just get
// the TU level setting (since it cannot be modified by 'target'..		// the TU level setting (since it cannot be modified by 'target'..
Ctx.getFunctionFeatureMap(CallerMap, Caller);		Ctx.getFunctionFeatureMap(CallerMap, Caller);
Ctx.getFunctionFeatureMap(CalleeMap, Callee);		Ctx.getFunctionFeatureMap(CalleeMap, Callee);
}		}
}		}

		static bool checkAVX10ParamFeature(DiagnosticsEngine &Diag,
		SourceLocation CallLoc,
		const llvm::StringMap<bool> &CallerMap,
		const llvm::StringMap<bool> &CalleeMap,
		QualType Ty, bool IsArgument) {
		bool CallerAVX256 =
		CallerMap.lookup("avx10.1") && !CallerMap.lookup("avx10-512bit");
		bool CalleeAVX256 =
		CallerMap.lookup("avx10.1") && !CallerMap.lookup("avx10-512bit");

		// Forbid 512-bit or large vector pass or return on AVX10 256-bit targets.
		if (CallerAVX256 \|\| CalleeAVX256)
		return Diag.Report(CallLoc, diag::err_avx_calling_convention)
		<< IsArgument << Ty << "avx10.x-256";

		return false;
		}

static bool checkAVXParamFeature(DiagnosticsEngine &Diag,		static bool checkAVXParamFeature(DiagnosticsEngine &Diag,
SourceLocation CallLoc,		SourceLocation CallLoc,
const llvm::StringMap<bool> &CallerMap,		const llvm::StringMap<bool> &CallerMap,
const llvm::StringMap<bool> &CalleeMap,		const llvm::StringMap<bool> &CalleeMap,
QualType Ty, StringRef Feature,		QualType Ty, StringRef Feature,
bool IsArgument) {		bool IsArgument) {
bool CallerHasFeat = CallerMap.lookup(Feature);		bool CallerHasFeat = CallerMap.lookup(Feature);
bool CalleeHasFeat = CalleeMap.lookup(Feature);		bool CalleeHasFeat = CalleeMap.lookup(Feature);
Show All 13 Lines

static bool checkAVXParam(DiagnosticsEngine &Diag, ASTContext &Ctx,		static bool checkAVXParam(DiagnosticsEngine &Diag, ASTContext &Ctx,
SourceLocation CallLoc,		SourceLocation CallLoc,
const llvm::StringMap<bool> &CallerMap,		const llvm::StringMap<bool> &CallerMap,
const llvm::StringMap<bool> &CalleeMap, QualType Ty,		const llvm::StringMap<bool> &CalleeMap, QualType Ty,
bool IsArgument) {		bool IsArgument) {
uint64_t Size = Ctx.getTypeSize(Ty);		uint64_t Size = Ctx.getTypeSize(Ty);
if (Size > 256)		if (Size > 256)
return checkAVXParamFeature(Diag, CallLoc, CallerMap, CalleeMap, Ty,		return checkAVX10ParamFeature(Diag, CallLoc, CallerMap, CalleeMap, Ty,
		IsArgument) \|\|
		checkAVXParamFeature(Diag, CallLoc, CallerMap, CalleeMap, Ty,
"avx512f", IsArgument);		"avx512f", IsArgument);

if (Size > 128)		if (Size > 128)
return checkAVXParamFeature(Diag, CallLoc, CallerMap, CalleeMap, Ty, "avx",		return checkAVXParamFeature(Diag, CallLoc, CallerMap, CalleeMap, Ty, "avx",
IsArgument);		IsArgument);

return false;		return false;
}		}
▲ Show 20 Lines • Show All 1,884 Lines • Show Last 20 Lines

clang/lib/Driver/ToolChains/Arch/X86.cpp

Show First 20 Lines • Show All 225 Lines • ▼ Show 20 Lines	if (SpectreOpt != clang::driver::options::ID::OPT_INVALID &&
LVIOpt != clang::driver::options::ID::OPT_INVALID) {		LVIOpt != clang::driver::options::ID::OPT_INVALID) {
D.Diag(diag::err_drv_argument_not_allowed_with)		D.Diag(diag::err_drv_argument_not_allowed_with)
<< D.getOpts().getOptionName(SpectreOpt)		<< D.getOpts().getOptionName(SpectreOpt)
<< D.getOpts().getOptionName(LVIOpt);		<< D.getOpts().getOptionName(LVIOpt);
}		}

// Now add any that the user explicitly requested on the command line,		// Now add any that the user explicitly requested on the command line,
// which may override the defaults.		// which may override the defaults.
		bool HasAVX10x = false;
		int AVXVecSize = 0;
		std::vector<StringRef> AVX512Cand;
for (const Arg *A : Args.filtered(options::OPT_m_x86_Features_Group,		for (const Arg *A : Args.filtered(options::OPT_m_x86_Features_Group,
options::OPT_mgeneral_regs_only)) {		options::OPT_mgeneral_regs_only)) {
StringRef Name = A->getOption().getName();		StringRef Name = A->getOption().getName();
A->claim();		A->claim();

// Skip over "-m".		// Skip over "-m".
assert(Name.startswith("m") && "Invalid feature name.");		assert(Name.startswith("m") && "Invalid feature name.");
Name = Name.substr(1);		Name = Name.substr(1);

// Replace -mgeneral-regs-only with -x87, -mmx, -sse		// Replace -mgeneral-regs-only with -x87, -mmx, -sse
if (A->getOption().getID() == options::OPT_mgeneral_regs_only) {		if (A->getOption().getID() == options::OPT_mgeneral_regs_only) {
Features.insert(Features.end(), {"-x87", "-mmx", "-sse"});		Features.insert(Features.end(), {"-x87", "-mmx", "-sse"});
continue;		continue;
}		}

bool IsNegative = Name.startswith("no-");		bool IsNegative = Name.startswith("no-");
if (IsNegative)		if (IsNegative)
Name = Name.substr(3);		Name = Name.substr(3);
Features.push_back(Args.MakeArgString((IsNegative ? "-" : "+") + Name));		if (Name.startswith("avx10.")) {
		HasAVX10x = true;
		StringRef VecSizeStr;
		std::tie(Name, VecSizeStr) = Name.split('-');
		if (VecSizeStr == "512") {
		if (AVXVecSize == 256)
		D.Diag(diag::warn_drv_overriding_flag_option) << "AVX10-256"
		MaskRayUnsubmitted Not Done Reply Inline Actions `warn_drv_overriding_flag_option` is under the group `-Woverriding-t-option`, which was intended for clang-cl `/T` options (`D1290`). I created D158137 to add `-Woverriding-option`. MaskRay:* `warn_drv_overriding_flag_option` is under the group `-Woverriding-t-option`, which was…
		pengfeiAuthorUnsubmitted Done Reply Inline Actions Thanks! This is not needed in the new version. pengfei: Thanks! This is not needed in the new version.
		<< "AVX10-512";
		AVXVecSize = 512;
		} else if (VecSizeStr == "256") {
		if (AVXVecSize == 512)
		D.Diag(diag::warn_drv_overriding_flag_option) << "AVX10-512"
		<< "AVX10-256";
		AVXVecSize = 256;
		} else if (VecSizeStr != "") {
		D.Diag(diag::err_drv_unsupported_opt_with_suggestion)
		<< A->getOption().getName() << Name;
		}
		}
		StringRef ArgString = Args.MakeArgString((IsNegative ? "-" : "+") + Name);
		if (Name.startswith("avx512"))
		AVX512Cand.push_back(ArgString);
		else
		Features.push_back(ArgString);
		}

		// If -mavx10.x is specified, clear all -m[no-]avx512xxx options and emit a
		// warning.
		if (HasAVX10x) {
		if (AVX512Cand.size())
		D.Diag(diag::warn_drv_overriding_flag_option) << "avx512*"
		<< "avx10.*";
		if (AVXVecSize == 256)
		Features.push_back("-avx10-512bit");
		if (AVXVecSize == 512)
		Features.push_back("+avx10-512bit");
		} else {
		Features.insert(Features.end(), AVX512Cand.begin(), AVX512Cand.end());
}		}

// Enable/disable straight line speculation hardening.		// Enable/disable straight line speculation hardening.
if (Arg *A = Args.getLastArg(options::OPT_mharden_sls_EQ)) {		if (Arg *A = Args.getLastArg(options::OPT_mharden_sls_EQ)) {
StringRef Scope = A->getValue();		StringRef Scope = A->getValue();
if (Scope == "all") {		if (Scope == "all") {
Features.push_back("+harden-sls-ijmp");		Features.push_back("+harden-sls-ijmp");
Features.push_back("+harden-sls-ret");		Features.push_back("+harden-sls-ret");
Show All 10 Lines

clang/test/CodeGen/X86/avx10-error.c

This file was added.

				// RUN: %clang_cc1 %s -ffreestanding -triple=x86_64-unknown-unknown -target-feature +avx10.1 -emit-llvm -verify

				#include <immintrin.h>

				__m512d test_mm512_sqrt_pd(__m512d a)
				{
				// CHECK-LABEL: @test_mm512_sqrt_pd
				return __builtin_ia32_sqrtpd512(a, _MM_FROUND_CUR_DIRECTION); // expected-error {{'__builtin_ia32_sqrtpd512' needs target feature avx10-512bit}}
				}

clang/test/CodeGen/attr-target-x86.c

	Show First 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	// CHECK: qax{{.*}} #5			// CHECK: qax{{.*}} #5
	// CHECK: qq{{.*}} #6			// CHECK: qq{{.*}} #6
	// CHECK: lake{{.*}} #7			// CHECK: lake{{.*}} #7
	// CHECK: use_before_def{{.*}} #7			// CHECK: use_before_def{{.*}} #7
	// CHECK: walrus{{.*}} #8			// CHECK: walrus{{.*}} #8
	// CHECK: #0 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87" "tune-cpu"="i686"			// CHECK: #0 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87" "tune-cpu"="i686"
	// CHECK: #1 = {{.*}}"target-cpu"="ivybridge" "target-features"="+avx,+cmov,+crc32,+cx16,+cx8,+f16c,+fsgsbase,+fxsr,+mmx,+pclmul,+popcnt,+rdrnd,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave,+xsaveopt"			// CHECK: #1 = {{.*}}"target-cpu"="ivybridge" "target-features"="+avx,+cmov,+crc32,+cx16,+cx8,+f16c,+fsgsbase,+fxsr,+mmx,+pclmul,+popcnt,+rdrnd,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave,+xsaveopt"
	// CHECK-NOT: tune-cpu			// CHECK-NOT: tune-cpu
	// CHECK: #2 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87,-aes,-avx,-avx2,-avx512bf16,-avx512bitalg,-avx512bw,-avx512cd,-avx512dq,-avx512er,-avx512f,-avx512fp16,-avx512ifma,-avx512pf,-avx512vbmi,-avx512vbmi2,-avx512vl,-avx512vnni,-avx512vp2intersect,-avx512vpopcntdq,-avxifma,-avxneconvert,-avxvnni,-avxvnniint16,-avxvnniint8,-f16c,-fma,-fma4,-gfni,-kl,-pclmul,-sha,-sha512,-sm3,-sm4,-sse2,-sse3,-sse4.1,-sse4.2,-sse4a,-ssse3,-vaes,-vpclmulqdq,-widekl,-xop" "tune-cpu"="i686"			// CHECK: #2 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87,-aes,-avx,-avx10.1,-avx2,-avx512bf16,-avx512bitalg,-avx512bw,-avx512cd,-avx512dq,-avx512er,-avx512f,-avx512fp16,-avx512ifma,-avx512pf,-avx512vbmi,-avx512vbmi2,-avx512vl,-avx512vnni,-avx512vp2intersect,-avx512vpopcntdq,-avxifma,-avxneconvert,-avxvnni,-avxvnniint16,-avxvnniint8,-f16c,-fma,-fma4,-gfni,-kl,-pclmul,-sha,-sha512,-sm3,-sm4,-sse2,-sse3,-sse4.1,-sse4.2,-sse4a,-ssse3,-vaes,-vpclmulqdq,-widekl,-xop" "tune-cpu"="i686"
	// CHECK: #3 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+crc32,+cx8,+mmx,+popcnt,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87" "tune-cpu"="i686"			// CHECK: #3 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+crc32,+cx8,+mmx,+popcnt,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87" "tune-cpu"="i686"
	// CHECK: #4 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87,-avx,-avx2,-avx512bf16,-avx512bitalg,-avx512bw,-avx512cd,-avx512dq,-avx512er,-avx512f,-avx512fp16,-avx512ifma,-avx512pf,-avx512vbmi,-avx512vbmi2,-avx512vl,-avx512vnni,-avx512vp2intersect,-avx512vpopcntdq,-avxifma,-avxneconvert,-avxvnni,-avxvnniint16,-avxvnniint8,-f16c,-fma,-fma4,-sha512,-sm3,-sm4,-sse4.1,-sse4.2,-vaes,-vpclmulqdq,-xop" "tune-cpu"="i686"			// CHECK: #4 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87,-avx,-avx10.1,-avx2,-avx512bf16,-avx512bitalg,-avx512bw,-avx512cd,-avx512dq,-avx512er,-avx512f,-avx512fp16,-avx512ifma,-avx512pf,-avx512vbmi,-avx512vbmi2,-avx512vl,-avx512vnni,-avx512vp2intersect,-avx512vpopcntdq,-avxifma,-avxneconvert,-avxvnni,-avxvnniint16,-avxvnniint8,-f16c,-fma,-fma4,-sha512,-sm3,-sm4,-sse4.1,-sse4.2,-vaes,-vpclmulqdq,-xop" "tune-cpu"="i686"
	// CHECK: #5 = {{.*}}"target-cpu"="ivybridge" "target-features"="+avx,+cmov,+crc32,+cx16,+cx8,+f16c,+fsgsbase,+fxsr,+mmx,+pclmul,+popcnt,+rdrnd,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave,+xsaveopt,-aes,-vaes"			// CHECK: #5 = {{.*}}"target-cpu"="ivybridge" "target-features"="+avx,+cmov,+crc32,+cx16,+cx8,+f16c,+fsgsbase,+fxsr,+mmx,+pclmul,+popcnt,+rdrnd,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave,+xsaveopt,-aes,-vaes"
	// CHECK-NOT: tune-cpu			// CHECK-NOT: tune-cpu
	// CHECK: #6 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87,-3dnow,-3dnowa,-mmx"			// CHECK: #6 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87,-3dnow,-3dnowa,-mmx"
	// CHECK: #7 = {{.*}}"target-cpu"="lakemont" "target-features"="+cx8,+mmx"			// CHECK: #7 = {{.*}}"target-cpu"="lakemont" "target-features"="+cx8,+mmx"
	// CHECK-NOT: tune-cpu			// CHECK-NOT: tune-cpu
	// CHECK: #8 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87" "tune-cpu"="sandybridge"			// CHECK: #8 = {{.*}}"target-cpu"="i686" "target-features"="+cmov,+cx8,+x87" "tune-cpu"="sandybridge"

	// CHECK: "target-cpu"="x86-64-v2"			// CHECK: "target-cpu"="x86-64-v2"
	// CHECK-SAME: "target-features"="+cmov,+crc32,+cx16,+cx8,+fxsr,+mmx,+popcnt,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87"			// CHECK-SAME: "target-features"="+cmov,+crc32,+cx16,+cx8,+fxsr,+mmx,+popcnt,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87"
	// CHECK: "target-cpu"="x86-64-v3"			// CHECK: "target-cpu"="x86-64-v3"
	// CHECK-SAME: "target-features"="+avx,+avx2,+bmi,+bmi2,+cmov,+crc32,+cx16,+cx8,+f16c,+fma,+fxsr,+lzcnt,+mmx,+movbe,+popcnt,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave"			// CHECK-SAME: "target-features"="+avx,+avx2,+bmi,+bmi2,+cmov,+crc32,+cx16,+cx8,+f16c,+fma,+fxsr,+lzcnt,+mmx,+movbe,+popcnt,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave"
	// CHECK: "target-cpu"="x86-64-v4"			// CHECK: "target-cpu"="x86-64-v4"
	// CHECK-SAME: "target-features"="+avx,+avx2,+avx512bw,+avx512cd,+avx512dq,+avx512f,+avx512vl,+bmi,+bmi2,+cmov,+crc32,+cx16,+cx8,+f16c,+fma,+fxsr,+lzcnt,+mmx,+movbe,+popcnt,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave"			// CHECK-SAME: "target-features"="+avx,+avx2,+avx512bw,+avx512cd,+avx512dq,+avx512f,+avx512vl,+bmi,+bmi2,+cmov,+crc32,+cx16,+cx8,+f16c,+fma,+fxsr,+lzcnt,+mmx,+movbe,+popcnt,+sahf,+sse,+sse2,+sse3,+sse4.1,+sse4.2,+ssse3,+x87,+xsave"

clang/test/CodeGen/target-avx-abi-diag.c

	// RUN: %clang_cc1 %s -triple=x86_64-linux-gnu -verify=no256,no512 -o - -S			// RUN: %clang_cc1 %s -triple=x86_64-linux-gnu -verify=no256,no512 -o - -S
	// RUN: %clang_cc1 %s -triple=x86_64-linux-gnu -target-feature +avx -verify=no512 -o - -S			// RUN: %clang_cc1 %s -triple=x86_64-linux-gnu -target-feature +avx -verify=no512 -o - -S
	// RUN: %clang_cc1 %s -triple=x86_64-linux-gnu -target-feature +avx512f -verify=both -o - -S			// RUN: %clang_cc1 %s -triple=x86_64-linux-gnu -target-feature +avx512f -verify=both -o - -S
				// RUN: %clang_cc1 %s -triple=x86_64-linux-gnu -target-feature +avx10.1 -DAVX10_256 -verify=avx10-256 -o - -S
				// RUN: %clang_cc1 %s -triple=x86_64-linux-gnu -target-feature +avx10.1 -target-feature +avx10-512bit -verify=both -o - -S
	// REQUIRES: x86-registered-target			// REQUIRES: x86-registered-target

	// both-no-diagnostics			// both-no-diagnostics

	typedef short avx512fType __attribute__((vector_size(64)));			typedef short avx512fType __attribute__((vector_size(64)));
	typedef short avx256Type __attribute__((vector_size(32)));			typedef short avx256Type __attribute__((vector_size(32)));

	__attribute__((target("avx"))) void takesAvx256(avx256Type t);			__attribute__((target("avx"))) void takesAvx256(avx256Type t);
	__attribute__((target("avx512f"))) void takesAvx512(avx512fType t);			__attribute__((target("avx512f"))) void takesAvx512(avx512fType t);
	void takesAvx256_no_target(avx256Type t);			void takesAvx256_no_target(avx256Type t);
	void takesAvx512_no_target(avx512fType t);			void takesAvx512_no_target(avx512fType t);

	void variadic(int i, ...);			void variadic(int i, ...);
	__attribute__((target("avx512f"))) void variadic_err(int i, ...);			__attribute__((target("avx512f"))) void variadic_err(int i, ...);

				#ifndef AVX10_256
	// If neither side has an attribute, warn.			// If neither side has an attribute, warn.
	void call_warn(void) {			void call_warn(void) {
	avx256Type t1;			avx256Type t1;
	takesAvx256_no_target(t1); // no256-warning {{AVX vector argument of type 'avx256Type' (vector of 16 'short' values) without 'avx' enabled changes the ABI}}			takesAvx256_no_target(t1); // no256-warning {{AVX vector argument of type 'avx256Type' (vector of 16 'short' values) without 'avx' enabled changes the ABI}}

	avx512fType t2;			avx512fType t2;
	takesAvx512_no_target(t2); // no512-warning {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx512f' enabled changes the ABI}}			takesAvx512_no_target(t2); // no512-warning {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx512f' enabled changes the ABI}}

	variadic(1, t1); // no256-warning {{AVX vector argument of type 'avx256Type' (vector of 16 'short' values) without 'avx' enabled changes the ABI}}			variadic(1, t1); // no256-warning {{AVX vector argument of type 'avx256Type' (vector of 16 'short' values) without 'avx' enabled changes the ABI}}
	variadic(3, t2); // no512-warning {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx512f' enabled changes the ABI}}			variadic(3, t2); // no512-warning {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx512f' enabled changes the ABI}}
	}			}
				#endif

	// If only 1 side has an attribute, error.			// If only 1 side has an attribute, error.
	void call_errors(void) {			void call_errors(void) {
	avx256Type t1;			avx256Type t1;
	takesAvx256(t1); // no256-error {{AVX vector argument of type 'avx256Type' (vector of 16 'short' values) without 'avx' enabled changes the ABI}}			takesAvx256(t1); // no256-error {{AVX vector argument of type 'avx256Type' (vector of 16 'short' values) without 'avx' enabled changes the ABI}}
	avx512fType t2;			avx512fType t2;

				// avx10-256-error@+1 {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx10.x-256' enabled changes the ABI}}
	takesAvx512(t2); // no512-error {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx512f' enabled changes the ABI}}			takesAvx512(t2); // no512-error {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx512f' enabled changes the ABI}}

	variadic_err(1, t1); // no256-error {{AVX vector argument of type 'avx256Type' (vector of 16 'short' values) without 'avx' enabled changes the ABI}}			variadic_err(1, t1); // no256-error {{AVX vector argument of type 'avx256Type' (vector of 16 'short' values) without 'avx' enabled changes the ABI}}
				// avx10-256-error@+1 {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx10.x-256' enabled changes the ABI}}
	variadic_err(3, t2); // no512-error {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx512f' enabled changes the ABI}}			variadic_err(3, t2); // no512-error {{AVX vector argument of type 'avx512fType' (vector of 32 'short' values) without 'avx512f' enabled changes the ABI}}
	}			}

	// These two don't diagnose anything, since these are valid calls.			// These two don't diagnose anything, since these are valid calls.
	__attribute__((target("avx"))) void call_avx256_ok(void) {			__attribute__((target("avx"))) void call_avx256_ok(void) {
	avx256Type t;			avx256Type t;
	takesAvx256(t);			takesAvx256(t);
	}			}

	__attribute__((target("avx512f"))) void call_avx512_ok(void) {			__attribute__((target("avx512f"))) void call_avx512_ok(void) {
	avx512fType t;			avx512fType t;
	takesAvx512(t);			takesAvx512(t);
	}			}

clang/test/Driver/x86-target-features.c

	Show First 20 Lines • Show All 363 Lines • ▼ Show 20 Lines
	// SM4: "-target-feature" "+sm4"			// SM4: "-target-feature" "+sm4"
	// NO-SM4: "-target-feature" "-sm4"			// NO-SM4: "-target-feature" "-sm4"

	// RUN: %clang --target=i386 -mavxvnniint16 %s -### -o %t.o 2>&1 \| FileCheck -check-prefix=AVXVNNIINT16 %s			// RUN: %clang --target=i386 -mavxvnniint16 %s -### -o %t.o 2>&1 \| FileCheck -check-prefix=AVXVNNIINT16 %s
	// RUN: %clang --target=i386 -mno-avxvnniint16 %s -### -o %t.o 2>&1 \| FileCheck -check-prefix=NO-AVXVNNIINT16 %s			// RUN: %clang --target=i386 -mno-avxvnniint16 %s -### -o %t.o 2>&1 \| FileCheck -check-prefix=NO-AVXVNNIINT16 %s
	// AVXVNNIINT16: "-target-feature" "+avxvnniint16"			// AVXVNNIINT16: "-target-feature" "+avxvnniint16"
	// NO-AVXVNNIINT16: "-target-feature" "-avxvnniint16"			// NO-AVXVNNIINT16: "-target-feature" "-avxvnniint16"

				// RUN: %clang --target=i386 -mavx10.1 %s -### -o %t.o 2>&1 \| FileCheck -check-prefix=AVX10_1 %s
				// RUN: %clang --target=i386 -mavx10.1 -mavx512f %s -### -o %t.o 2>&1 \| FileCheck -check-prefixes=AVX10_1,AVX10_WARN %s
				// RUN: %clang --target=i386 -mavx10.1 -mno-avx512f %s -### -o %t.o 2>&1 \| FileCheck -check-prefixes=AVX10_1,AVX10_WARN %s
				// RUN: %clang --target=i386 -mno-avx10.1 %s -### -o %t.o 2>&1 \| FileCheck -check-prefix=NO-AVX10_1 %s
				// RUN: %clang --target=i386 -mno-avx10.1 -mavx512f %s -### -o %t.o 2>&1 \| FileCheck -check-prefixes=NO-AVX10_1,AVX10_WARN %s
				// RUN: %clang --target=i386 -mno-avx10.1 -mno-avx512f %s -### -o %t.o 2>&1 \| FileCheck -check-prefixes=NO-AVX10_1,AVX10_WARN %s
				// AVX10_WARN: clang: warning: overriding 'avx512' option with 'avx10.' [-Woverriding-t-option]
				// AVX10_1: "-target-feature" "+avx10.1"
				// NO-AVX10_1: "-target-feature" "-avx10.1"

				// RUN: %clang --target=i386 -mavx10.1-512 %s -### -o %t.o 2>&1 \| FileCheck -check-prefix=AVX10_512BIT %s
				// RUN: %clang --target=i386 -mavx10.1-256 %s -### -o %t.o 2>&1 \| FileCheck -check-prefix=NO-AVX10_512BIT %s
				// RUN: %clang --target=i386 -mavx10.1-256 -mavx10.1-512 %s -### -o %t.o 2>&1 \| FileCheck -check-prefixes=AVX10_512BIT,OVER256_WARN %s
				// RUN: %clang --target=i386 -mavx10.1-512 -mavx10.1-256 %s -### -o %t.o 2>&1 \| FileCheck -check-prefixes=NO-AVX10_512BIT,OVER512_WARN %s
				// OVER256_WARN: clang: warning: overriding 'AVX10-256' option with 'AVX10-512' [-Woverriding-t-option]
				// OVER512_WARN: clang: warning: overriding 'AVX10-512' option with 'AVX10-256' [-Woverriding-t-option]
				// AVX10_512BIT: "-target-feature" "+avx10-512bit"
				// NO-AVX10_512BIT: "-target-feature" "-avx10-512bit"

	// RUN: %clang --target=i386 -march=i386 -mcrc32 %s -### 2>&1 \| FileCheck -check-prefix=CRC32 %s			// RUN: %clang --target=i386 -march=i386 -mcrc32 %s -### 2>&1 \| FileCheck -check-prefix=CRC32 %s
	// RUN: %clang --target=i386 -march=i386 -mno-crc32 %s -### 2>&1 \| FileCheck -check-prefix=NO-CRC32 %s			// RUN: %clang --target=i386 -march=i386 -mno-crc32 %s -### 2>&1 \| FileCheck -check-prefix=NO-CRC32 %s
	// CRC32: "-target-feature" "+crc32"			// CRC32: "-target-feature" "+crc32"
	// NO-CRC32: "-target-feature" "-crc32"			// NO-CRC32: "-target-feature" "-crc32"

	// RUN: not %clang -### --target=aarch64 -mcrc32 -msse4.1 -msse4.2 -mno-sgx %s 2>&1 \| FileCheck --check-prefix=NONX86 %s			// RUN: not %clang -### --target=aarch64 -mcrc32 -msse4.1 -msse4.2 -mno-sgx %s 2>&1 \| FileCheck --check-prefix=NONX86 %s
	// NONX86: error: unsupported option '-mcrc32' for target 'aarch64'			// NONX86: error: unsupported option '-mcrc32' for target 'aarch64'
	// NONX86-NEXT: error: unsupported option '-msse4.1' for target 'aarch64'			// NONX86-NEXT: error: unsupported option '-msse4.1' for target 'aarch64'
	Show All 17 Lines

clang/test/Preprocessor/x86_target_features.c

	Show First 20 Lines • Show All 708 Lines • ▼ Show 20 Lines

	// NOAVXVNNIINT16-NOT: #define __AVXVNNIINT16__ 1			// NOAVXVNNIINT16-NOT: #define __AVXVNNIINT16__ 1

	// RUN: %clang -target i686-unknown-linux-gnu -march=atom -mavxvnniint16 -mno-avx2 -x c -E -dM -o - %s \| FileCheck -check-prefix=AVXVNNIINT16NOAVX2 %s			// RUN: %clang -target i686-unknown-linux-gnu -march=atom -mavxvnniint16 -mno-avx2 -x c -E -dM -o - %s \| FileCheck -check-prefix=AVXVNNIINT16NOAVX2 %s

	// AVXVNNIINT16NOAVX2-NOT: #define __AVX2__ 1			// AVXVNNIINT16NOAVX2-NOT: #define __AVX2__ 1
	// AVXVNNIINT16NOAVX2-NOT: #define __AVXVNNIINT16__ 1			// AVXVNNIINT16NOAVX2-NOT: #define __AVXVNNIINT16__ 1

				// RUN: %clang -target i686-unknown-linux-gnu -march=atom -mavx10.1 -x c -E -dM -o - %s \| FileCheck -check-prefix=AVX10_1 %s
				// RUN: %clang -target i686-unknown-linux-gnu -march=atom -mavx10.1 -mno-avx512f -x c -E -dM -o - %s \| FileCheck -check-prefix=AVX10_1 %s

				// AVX10_1: #define __AVX10_1__ 1
				// AVX10_1: #define __AVX512F__ 1

				// RUN: %clang -target i686-unknown-linux-gnu -march=atom -mno-avx10.1 -x c -E -dM -o - %s \| FileCheck -check-prefix=NOAVX10_1 %s
				// RUN: %clang -target i686-unknown-linux-gnu -march=atom -mno-avx10.1 -mavx512f -x c -E -dM -o - %s \| FileCheck -check-prefix=NOAVX10_1 %s

				// NOAVX10_1-NOT: #define __AVX10_1__ 1
				// NOAVX10_1-NOT: #define __AVX512F__ 1

	// RUN: %clang -target i386-unknown-linux-gnu -march=i386 -mcrc32 -x c -E -dM -o - %s \| FileCheck -check-prefix=CRC32 %s			// RUN: %clang -target i386-unknown-linux-gnu -march=i386 -mcrc32 -x c -E -dM -o - %s \| FileCheck -check-prefix=CRC32 %s

	// CRC32: #define __CRC32__ 1			// CRC32: #define __CRC32__ 1

	// RUN: %clang -target i386-unknown-linux-gnu -march=i386 -mno-crc32 -x c -E -dM -o - %s \| FileCheck -check-prefix=NOCRC32 %s			// RUN: %clang -target i386-unknown-linux-gnu -march=i386 -mno-crc32 -x c -E -dM -o - %s \| FileCheck -check-prefix=NOCRC32 %s

	// NOCRC32-NOT: #define __CRC32__ 1			// NOCRC32-NOT: #define __CRC32__ 1

	// RUN: %clang -target i386-unknown-linux-gnu -march=i386 -mrdpru -x c -E -dM -o - %s \| FileCheck -check-prefix=RDPRU %s			// RUN: %clang -target i386-unknown-linux-gnu -march=i386 -mrdpru -x c -E -dM -o - %s \| FileCheck -check-prefix=RDPRU %s

	// RDPRU: #define __RDPRU__ 1			// RDPRU: #define __RDPRU__ 1

	// RUN: %clang -target i386-unknown-linux-gnu -march=i386 -mno-rdpru -x c -E -dM -o - %s \| FileCheck -check-prefix=NORDPRU %s			// RUN: %clang -target i386-unknown-linux-gnu -march=i386 -mno-rdpru -x c -E -dM -o - %s \| FileCheck -check-prefix=NORDPRU %s

	// NORDPRU-NOT: #define __RDPRU__ 1			// NORDPRU-NOT: #define __RDPRU__ 1

llvm/docs/ReleaseNotes.rst

	Show First 20 Lines • Show All 98 Lines • ▼ Show 20 Lines
	----------------------------------			----------------------------------

	Changes to the Windows Target			Changes to the Windows Target
	-----------------------------			-----------------------------

	Changes to the X86 Backend			Changes to the X86 Backend
	--------------------------			--------------------------

				* Support ISA of ``AVX10.1``.

	Changes to the OCaml bindings			Changes to the OCaml bindings
	-----------------------------			-----------------------------

	Changes to the Python bindings			Changes to the Python bindings
	------------------------------			------------------------------

	* The python bindings have been removed.			* The python bindings have been removed.

	▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines

llvm/include/llvm/TargetParser/X86TargetParser.def

	Show First 20 Lines • Show All 229 Lines • ▼ Show 20 Lines
	X86_FEATURE (AVXNECONVERT, "avxneconvert")			X86_FEATURE (AVXNECONVERT, "avxneconvert")
	X86_FEATURE (AVXVNNI, "avxvnni")			X86_FEATURE (AVXVNNI, "avxvnni")
	X86_FEATURE (AVXIFMA, "avxifma")			X86_FEATURE (AVXIFMA, "avxifma")
	X86_FEATURE (AVXVNNIINT8, "avxvnniint8")			X86_FEATURE (AVXVNNIINT8, "avxvnniint8")
	X86_FEATURE (SHA512, "sha512")			X86_FEATURE (SHA512, "sha512")
	X86_FEATURE (SM3, "sm3")			X86_FEATURE (SM3, "sm3")
	X86_FEATURE (SM4, "sm4")			X86_FEATURE (SM4, "sm4")
	X86_FEATURE (AVXVNNIINT16, "avxvnniint16")			X86_FEATURE (AVXVNNIINT16, "avxvnniint16")
				X86_FEATURE (AVX10_1, "avx10.1")
				X86_FEATURE (AVX10_512BIT, "avx10-512bit")
	// These features aren't really CPU features, but the frontend can set them.			// These features aren't really CPU features, but the frontend can set them.
	X86_FEATURE (RETPOLINE_EXTERNAL_THUNK, "retpoline-external-thunk")			X86_FEATURE (RETPOLINE_EXTERNAL_THUNK, "retpoline-external-thunk")
	X86_FEATURE (RETPOLINE_INDIRECT_BRANCHES, "retpoline-indirect-branches")			X86_FEATURE (RETPOLINE_INDIRECT_BRANCHES, "retpoline-indirect-branches")
	X86_FEATURE (RETPOLINE_INDIRECT_CALLS, "retpoline-indirect-calls")			X86_FEATURE (RETPOLINE_INDIRECT_CALLS, "retpoline-indirect-calls")
	X86_FEATURE (LVI_CFI, "lvi-cfi")			X86_FEATURE (LVI_CFI, "lvi-cfi")
	X86_FEATURE (LVI_LOAD_HARDENING, "lvi-load-hardening")			X86_FEATURE (LVI_LOAD_HARDENING, "lvi-load-hardening")
	#undef X86_FEATURE_COMPAT			#undef X86_FEATURE_COMPAT
	#undef X86_FEATURE			#undef X86_FEATURE

llvm/lib/IR/Verifier.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,025 Lines • ▼ Show 20 Lines	void Verifier::verifyFunctionAttrs(FunctionType *FT, AttributeList Attrs,
AttributeSet RetAttrs = Attrs.getRetAttrs();		AttributeSet RetAttrs = Attrs.getRetAttrs();
for (Attribute RetAttr : RetAttrs)		for (Attribute RetAttr : RetAttrs)
Check(RetAttr.isStringAttribute() \|\|		Check(RetAttr.isStringAttribute() \|\|
Attribute::canUseAsRetAttr(RetAttr.getKindAsEnum()),		Attribute::canUseAsRetAttr(RetAttr.getKindAsEnum()),
"Attribute '" + RetAttr.getAsString() +		"Attribute '" + RetAttr.getAsString() +
"' does not apply to function return values",		"' does not apply to function return values",
V);		V);

		unsigned MaxParameterWidth = 0;
		auto GetMaxParameterWidth = [&MaxParameterWidth](Type *Ty) {
		if (Ty->isVectorTy()) {
		if (auto *VT = dyn_cast<FixedVectorType>(Ty)) {
		unsigned Size = VT->getPrimitiveSizeInBits().getFixedValue();
		if (Size > MaxParameterWidth)
		MaxParameterWidth = Size;
		}
		}
		};
		GetMaxParameterWidth(FT->getReturnType());
verifyParameterAttrs(RetAttrs, FT->getReturnType(), V);		verifyParameterAttrs(RetAttrs, FT->getReturnType(), V);

// Verify parameter attributes.		// Verify parameter attributes.
for (unsigned i = 0, e = FT->getNumParams(); i != e; ++i) {		for (unsigned i = 0, e = FT->getNumParams(); i != e; ++i) {
Type *Ty = FT->getParamType(i);		Type *Ty = FT->getParamType(i);
AttributeSet ArgAttrs = Attrs.getParamAttrs(i);		AttributeSet ArgAttrs = Attrs.getParamAttrs(i);

if (!IsIntrinsic) {		if (!IsIntrinsic) {
Check(!ArgAttrs.hasAttribute(Attribute::ImmArg),		Check(!ArgAttrs.hasAttribute(Attribute::ImmArg),
"immarg attribute only applies to intrinsics", V);		"immarg attribute only applies to intrinsics", V);
if (!IsInlineAsm)		if (!IsInlineAsm)
Check(!ArgAttrs.hasAttribute(Attribute::ElementType),		Check(!ArgAttrs.hasAttribute(Attribute::ElementType),
"Attribute 'elementtype' can only be applied to intrinsics"		"Attribute 'elementtype' can only be applied to intrinsics"
" and inline asm.",		" and inline asm.",
V);		V);
}		}

verifyParameterAttrs(ArgAttrs, Ty, V);		verifyParameterAttrs(ArgAttrs, Ty, V);
		GetMaxParameterWidth(Ty);

if (ArgAttrs.hasAttribute(Attribute::Nest)) {		if (ArgAttrs.hasAttribute(Attribute::Nest)) {
Check(!SawNest, "More than one parameter has attribute nest!", V);		Check(!SawNest, "More than one parameter has attribute nest!", V);
SawNest = true;		SawNest = true;
}		}

if (ArgAttrs.hasAttribute(Attribute::Returned)) {		if (ArgAttrs.hasAttribute(Attribute::Returned)) {
Check(!SawReturned, "More than one parameter has attribute returned!", V);		Check(!SawReturned, "More than one parameter has attribute returned!", V);
▲ Show 20 Lines • Show All 139 Lines • ▼ Show 20 Lines	void Verifier::verifyFunctionAttrs(FunctionType *FT, AttributeList Attrs,
}		}

if (Attrs.hasFnAttr("frame-pointer")) {		if (Attrs.hasFnAttr("frame-pointer")) {
StringRef FP = Attrs.getFnAttr("frame-pointer").getValueAsString();		StringRef FP = Attrs.getFnAttr("frame-pointer").getValueAsString();
if (FP != "all" && FP != "non-leaf" && FP != "none")		if (FP != "all" && FP != "non-leaf" && FP != "none")
CheckFailed("invalid value for 'frame-pointer' attribute: " + FP, V);		CheckFailed("invalid value for 'frame-pointer' attribute: " + FP, V);
}		}

		// Check AVX10 512-bit feature.
		if (MaxParameterWidth >= 512 && Attrs.hasFnAttr("target-features")) {
		Triple T(M.getTargetTriple());
		if (T.isX86()) {
		StringRef TF = Attrs.getFnAttr("target-features").getValueAsString();
		Check(!TF.contains("+avx10.1") \|\| TF.contains("+avx10-512bit"),
		"512-bit vector arguments require 'avx10-512bit' for AVX10", V);
		}
		}

checkUnsignedBaseTenFuncAttr(Attrs, "patchable-function-prefix", V);		checkUnsignedBaseTenFuncAttr(Attrs, "patchable-function-prefix", V);
checkUnsignedBaseTenFuncAttr(Attrs, "patchable-function-entry", V);		checkUnsignedBaseTenFuncAttr(Attrs, "patchable-function-entry", V);
checkUnsignedBaseTenFuncAttr(Attrs, "warn-stack-size", V);		checkUnsignedBaseTenFuncAttr(Attrs, "warn-stack-size", V);
}		}

void Verifier::verifyFunctionMetadata(		void Verifier::verifyFunctionMetadata(
ArrayRef<std::pair<unsigned, MDNode *>> MDs) {		ArrayRef<std::pair<unsigned, MDNode *>> MDs) {
for (const auto &Pair : MDs) {		for (const auto &Pair : MDs) {
▲ Show 20 Lines • Show All 4,798 Lines • Show Last 20 Lines

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp

Show First 20 Lines • Show All 279 Lines • ▼ Show 20 Lines	void emitMemModRMByte(const MCInst &MI, unsigned Op, unsigned RegOpcodeField,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
bool ForceSIB = false) const;		bool ForceSIB = false) const;

PrefixKind emitPrefixImpl(unsigned &CurOp, const MCInst &MI,		PrefixKind emitPrefixImpl(unsigned &CurOp, const MCInst &MI,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
SmallVectorImpl<char> &CB) const;		SmallVectorImpl<char> &CB) const;

PrefixKind emitVEXOpcodePrefix(int MemOperand, const MCInst &MI,		PrefixKind emitVEXOpcodePrefix(int MemOperand, const MCInst &MI,
		const MCSubtargetInfo &STI,
SmallVectorImpl<char> &CB) const;		SmallVectorImpl<char> &CB) const;

void emitSegmentOverridePrefix(unsigned SegOperand, const MCInst &MI,		void emitSegmentOverridePrefix(unsigned SegOperand, const MCInst &MI,
SmallVectorImpl<char> &CB) const;		SmallVectorImpl<char> &CB) const;

PrefixKind emitOpcodePrefix(int MemOperand, const MCInst &MI,		PrefixKind emitOpcodePrefix(int MemOperand, const MCInst &MI,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
SmallVectorImpl<char> &CB) const;		SmallVectorImpl<char> &CB) const;
▲ Show 20 Lines • Show All 540 Lines • ▼ Show 20 Lines	case X86II::RawFrmMemOffs: {
emitSegmentOverridePrefix(1, MI, CB);		emitSegmentOverridePrefix(1, MI, CB);
break;		break;
}		}
}		}

// REX prefix is optional, but if used must be immediately before the opcode		// REX prefix is optional, but if used must be immediately before the opcode
// Encoding type for this instruction.		// Encoding type for this instruction.
return (TSFlags & X86II::EncodingMask)		return (TSFlags & X86II::EncodingMask)
? emitVEXOpcodePrefix(MemoryOperand, MI, CB)		? emitVEXOpcodePrefix(MemoryOperand, MI, STI, CB)
: emitOpcodePrefix(MemoryOperand, MI, STI, CB);		: emitOpcodePrefix(MemoryOperand, MI, STI, CB);
}		}

// AVX instructions are encoded using an encoding scheme that combines		// AVX instructions are encoded using an encoding scheme that combines
// prefix bytes, opcode extension field, operand encoding fields, and vector		// prefix bytes, opcode extension field, operand encoding fields, and vector
// length encoding capability into a new prefix, referred to as VEX.		// length encoding capability into a new prefix, referred to as VEX.

// The majority of the AVX-512 family of instructions (operating on		// The majority of the AVX-512 family of instructions (operating on
// 512/256/128-bit vector register operands) are encoded using a new prefix		// 512/256/128-bit vector register operands) are encoded using a new prefix
// (called EVEX).		// (called EVEX).

// XOP is a revised subset of what was originally intended as SSE5. It was		// XOP is a revised subset of what was originally intended as SSE5. It was
// changed to be similar but not overlapping with AVX.		// changed to be similar but not overlapping with AVX.

/// Emit XOP, VEX2, VEX3 or EVEX prefix.		/// Emit XOP, VEX2, VEX3 or EVEX prefix.
/// \returns the used prefix.		/// \returns the used prefix.
PrefixKind		PrefixKind
X86MCCodeEmitter::emitVEXOpcodePrefix(int MemOperand, const MCInst &MI,		X86MCCodeEmitter::emitVEXOpcodePrefix(int MemOperand, const MCInst &MI,
		const MCSubtargetInfo &STI,
SmallVectorImpl<char> &CB) const {		SmallVectorImpl<char> &CB) const {
const MCInstrDesc &Desc = MCII.get(MI.getOpcode());		const MCInstrDesc &Desc = MCII.get(MI.getOpcode());
uint64_t TSFlags = Desc.TSFlags;		uint64_t TSFlags = Desc.TSFlags;

assert(!(TSFlags & X86II::LOCK) && "Can't have LOCK VEX.");		assert(!(TSFlags & X86II::LOCK) && "Can't have LOCK VEX.");

X86OpcodePrefixHelper Prefix(*Ctx.getRegisterInfo());		X86OpcodePrefixHelper Prefix(*Ctx.getRegisterInfo());
switch (TSFlags & X86II::EncodingMask) {		switch (TSFlags & X86II::EncodingMask) {
▲ Show 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	case X86II::T_MAP5:
break;		break;
case X86II::T_MAP6:		case X86II::T_MAP6:
Prefix.set5M(0x6);		Prefix.set5M(0x6);
break;		break;
}		}

Prefix.setL(TSFlags & X86II::VEX_L);		Prefix.setL(TSFlags & X86II::VEX_L);
Prefix.setL2(TSFlags & X86II::EVEX_L2);		Prefix.setL2(TSFlags & X86II::EVEX_L2);
		if ((TSFlags & X86II::EVEX_L2) && STI.hasFeature(X86::FeatureAVX10_1) &&
		skanUnsubmitted Done Reply Inline Actions I think what you need here is `TSFlags & X86II::EVEX_L2` instead of `getL2()`. The class `X86OpcodePrefixHelper` is designed for encoding only. The bit `L2` can be set in other cases so it may blur the meaning of 512-bit here you use the getter. skan: I think what you need here is `TSFlags & X86II::EVEX_L2` instead of `getL2()`. The class…
		!STI.hasFeature(X86::FeatureAVX10_512bit))
		report_fatal_error("ZMM registers are not supported without AVX10-512BIT");
		skanUnsubmitted Not Done Reply Inline Actions -mavx10.1 does not work for assembler. So if such instruction is generated w/o AVX10-512BIT support, it must be compiler's issue instead of user's. An `assert` should be more appropriate here. skan: -mavx10.1 does not work for assembler. So if such instruction is generated w/o AVX10-512BIT…
		skanUnsubmitted Not Done Reply Inline Actions -mavx10.1 does not work for assembler. So if such instruction is generated w/o AVX10-512BIT support, it must be compiler's issue instead of user's. An `assert` should be more appropriate here. Reference: https://llvm.org/docs/CodingStandards.html#assert-liberally skan: > -mavx10.1 does not work for assembler. So if such instruction is generated w/o AVX10-512BIT…
		pengfeiAuthorUnsubmitted Done Reply Inline Actions We need to report fatal error for this case even if it's a compiler bug. Otherwise, user may observe the crash issue in runtime and hard to find the reason. pengfei: We need to report fatal error for this case even if it's a compiler bug. Otherwise, user may…
switch (TSFlags & X86II::OpPrefixMask) {		switch (TSFlags & X86II::OpPrefixMask) {
case X86II::PD:		case X86II::PD:
Prefix.setPP(0x1); // 66		Prefix.setPP(0x1); // 66
break;		break;
case X86II::XS:		case X86II::XS:
Prefix.setPP(0x2); // F3		Prefix.setPP(0x2); // F3
break;		break;
case X86II::XD:		case X86II::XD:
▲ Show 20 Lines • Show All 868 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86.td

	Show First 20 Lines • Show All 181 Lines • ▼ Show 20 Lines
	// FIXME: FP16 scalar intrinsics use the type v8f16, which is supposed to be			// FIXME: FP16 scalar intrinsics use the type v8f16, which is supposed to be
	// guarded under condition hasVLX. So we imply it in FeatureFP16 currently.			// guarded under condition hasVLX. So we imply it in FeatureFP16 currently.
	// FIXME: FP16 conversion between f16 and i64 customize type v8i64, which is			// FIXME: FP16 conversion between f16 and i64 customize type v8i64, which is
	// supposed to be guarded under condition hasDQI. So we imply it in FeatureFP16			// supposed to be guarded under condition hasDQI. So we imply it in FeatureFP16
	// currently.			// currently.
	def FeatureFP16 : SubtargetFeature<"avx512fp16", "HasFP16", "true",			def FeatureFP16 : SubtargetFeature<"avx512fp16", "HasFP16", "true",
	"Support 16-bit floating point",			"Support 16-bit floating point",
	[FeatureBWI, FeatureVLX, FeatureDQI]>;			[FeatureBWI, FeatureVLX, FeatureDQI]>;
				def FeatureAVX10_1 : SubtargetFeature<"avx10.1", "HasAVX10_1", "true",
				"Enable AVX10.1 instructions",
				[FeatureFP16, FeatureCDI, FeatureBF16,
				FeatureBITALG, FeatureIFMA, FeatureVNNI,
				FeatureVPOPCNTDQ, FeatureVBMI, FeatureVBMI2]>;
				def FeatureAVX10_512bit : SubtargetFeature<"avx10-512bit", "HasAVX10_512BIT", "true",
				"Enable AVX10 512-bit Instructions">;
	def FeatureAVXVNNIINT8 : SubtargetFeature<"avxvnniint8",			def FeatureAVXVNNIINT8 : SubtargetFeature<"avxvnniint8",
	"HasAVXVNNIINT8", "true",			"HasAVXVNNIINT8", "true",
	"Enable AVX-VNNI-INT8",			"Enable AVX-VNNI-INT8",
	[FeatureAVX2]>;			[FeatureAVX2]>;
	def FeatureAVXVNNIINT16 : SubtargetFeature<"avxvnniint16",			def FeatureAVXVNNIINT16 : SubtargetFeature<"avxvnniint16",
	"HasAVXVNNIINT16", "true",			"HasAVXVNNIINT16", "true",
	"Enable AVX-VNNI-INT16",			"Enable AVX-VNNI-INT16",
	[FeatureAVX2]>;			[FeatureAVX2]>;
	▲ Show 20 Lines • Show All 1,715 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86InstrInfo.td

	Show First 20 Lines • Show All 896 Lines • ▼ Show 20 Lines
	def NoSSE41 : Predicate<"!Subtarget->hasSSE41()">;			def NoSSE41 : Predicate<"!Subtarget->hasSSE41()">;
	def UseSSE41 : Predicate<"Subtarget->hasSSE41() && !Subtarget->hasAVX()">;			def UseSSE41 : Predicate<"Subtarget->hasSSE41() && !Subtarget->hasAVX()">;
	def HasSSE42 : Predicate<"Subtarget->hasSSE42()">;			def HasSSE42 : Predicate<"Subtarget->hasSSE42()">;
	def UseSSE42 : Predicate<"Subtarget->hasSSE42() && !Subtarget->hasAVX()">;			def UseSSE42 : Predicate<"Subtarget->hasSSE42() && !Subtarget->hasAVX()">;
	def HasSSE4A : Predicate<"Subtarget->hasSSE4A()">;			def HasSSE4A : Predicate<"Subtarget->hasSSE4A()">;
	def NoAVX : Predicate<"!Subtarget->hasAVX()">;			def NoAVX : Predicate<"!Subtarget->hasAVX()">;
	def HasAVX : Predicate<"Subtarget->hasAVX()">;			def HasAVX : Predicate<"Subtarget->hasAVX()">;
	def HasAVX2 : Predicate<"Subtarget->hasAVX2()">;			def HasAVX2 : Predicate<"Subtarget->hasAVX2()">;
				def HasAVX10_1 : Predicate<"Subtarget->hasAVX10_1()">;
				def HasAVX10_512BIT : Predicate<"Subtarget->hasAVX10_512BIT()">;
	def HasAVX1Only : Predicate<"Subtarget->hasAVX() && !Subtarget->hasAVX2()">;			def HasAVX1Only : Predicate<"Subtarget->hasAVX() && !Subtarget->hasAVX2()">;
	def HasAVX512 : Predicate<"Subtarget->hasAVX512()">;			def HasAVX512 : Predicate<"Subtarget->hasAVX512()">;
	def UseAVX : Predicate<"Subtarget->hasAVX() && !Subtarget->hasAVX512()">;			def UseAVX : Predicate<"Subtarget->hasAVX() && !Subtarget->hasAVX512()">;
	def UseAVX2 : Predicate<"Subtarget->hasAVX2() && !Subtarget->hasAVX512()">;			def UseAVX2 : Predicate<"Subtarget->hasAVX2() && !Subtarget->hasAVX512()">;
	def NoAVX512 : Predicate<"!Subtarget->hasAVX512()">;			def NoAVX512 : Predicate<"!Subtarget->hasAVX512()">;
	def HasCDI : Predicate<"Subtarget->hasCDI()">;			def HasCDI : Predicate<"Subtarget->hasCDI()">;
	def HasVPOPCNTDQ : Predicate<"Subtarget->hasVPOPCNTDQ()">;			def HasVPOPCNTDQ : Predicate<"Subtarget->hasVPOPCNTDQ()">;
	def HasPFI : Predicate<"Subtarget->hasPFI()">;			def HasPFI : Predicate<"Subtarget->hasPFI()">;
	▲ Show 20 Lines • Show All 557 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86RegisterInfo.cpp

Show First 20 Lines • Show All 1,024 Lines • ▼ Show 20 Lines	bool X86RegisterInfo::getRegAllocationHints(Register VirtReg,
const MachineFunction &MF,		const MachineFunction &MF,
const VirtRegMap *VRM,		const VirtRegMap *VRM,
const LiveRegMatrix *Matrix) const {		const LiveRegMatrix *Matrix) const {
const MachineRegisterInfo *MRI = &MF.getRegInfo();		const MachineRegisterInfo *MRI = &MF.getRegInfo();
const TargetRegisterClass &RC = *MRI->getRegClass(VirtReg);		const TargetRegisterClass &RC = *MRI->getRegClass(VirtReg);
bool BaseImplRetVal = TargetRegisterInfo::getRegAllocationHints(		bool BaseImplRetVal = TargetRegisterInfo::getRegAllocationHints(
VirtReg, Order, Hints, MF, VRM, Matrix);		VirtReg, Order, Hints, MF, VRM, Matrix);

if (RC.getID() != X86::TILERegClassID)		unsigned ID = RC.getID();
		const X86Subtarget &Subtarget = MF.getSubtarget<X86Subtarget>();
		if ((ID == X86::VK64RegClassID \|\| ID == X86::VK64WMRegClassID) &&
		Subtarget.hasAVX10_1() && !Subtarget.hasAVX10_512BIT())
		report_fatal_error(
		"64-bit mask registers are not supported without AVX10-512BIT");

		if (ID != X86::TILERegClassID)
return BaseImplRetVal;		return BaseImplRetVal;

ShapeT VirtShape = getTileShape(VirtReg, const_cast<VirtRegMap *>(VRM), MRI);		ShapeT VirtShape = getTileShape(VirtReg, const_cast<VirtRegMap *>(VRM), MRI);
auto AddHint = [&](MCPhysReg PhysReg) {		auto AddHint = [&](MCPhysReg PhysReg) {
Register VReg = Matrix->getOneVReg(PhysReg);		Register VReg = Matrix->getOneVReg(PhysReg);
if (VReg == MCRegister::NoRegister) { // Not allocated yet		if (VReg == MCRegister::NoRegister) { // Not allocated yet
Hints.push_back(PhysReg);		Hints.push_back(PhysReg);
return;		return;
Show All 31 Lines

llvm/lib/Target/X86/X86Subtarget.h

Show First 20 Lines • Show All 257 Lines • ▼ Show 20 Lines	#include "X86GenSubtargetInfo.inc"
}		}
bool hasNoDomainDelayShuffle() const {		bool hasNoDomainDelayShuffle() const {
return hasNoDomainDelay() \|\| NoDomainDelayShuffle;		return hasNoDomainDelay() \|\| NoDomainDelayShuffle;
}		}

// If there are no 512-bit vectors and we prefer not to use 512-bit registers,		// If there are no 512-bit vectors and we prefer not to use 512-bit registers,
// disable them in the legalizer.		// disable them in the legalizer.
bool useAVX512Regs() const {		bool useAVX512Regs() const {
		if (hasAVX10_1())
		return hasAVX10_512BIT() &&
		(getPreferVectorWidth() >= 512 \|\| RequiredVectorWidth > 256);
return hasAVX512() && (canExtendTo512DQ() \|\| RequiredVectorWidth > 256);		return hasAVX512() && (canExtendTo512DQ() \|\| RequiredVectorWidth > 256);
}		}

bool useLight256BitInstructions() const {		bool useLight256BitInstructions() const {
return getPreferVectorWidth() >= 256 \|\| AllowLight256Bit;		return getPreferVectorWidth() >= 256 \|\| AllowLight256Bit;
}		}

bool useBWIRegs() const {		bool useBWIRegs() const {
▲ Show 20 Lines • Show All 167 Lines • Show Last 20 Lines

llvm/lib/TargetParser/Host.cpp

Show First 20 Lines • Show All 1,785 Lines • ▼ Show 20 Lines	#endif
Features["cmpccxadd"] = HasLeaf7Subleaf1 && ((EAX >> 7) & 1);		Features["cmpccxadd"] = HasLeaf7Subleaf1 && ((EAX >> 7) & 1);
Features["hreset"] = HasLeaf7Subleaf1 && ((EAX >> 22) & 1);		Features["hreset"] = HasLeaf7Subleaf1 && ((EAX >> 22) & 1);
Features["avxifma"] = HasLeaf7Subleaf1 && ((EAX >> 23) & 1) && HasAVXSave;		Features["avxifma"] = HasLeaf7Subleaf1 && ((EAX >> 23) & 1) && HasAVXSave;
Features["avxvnniint8"] = HasLeaf7Subleaf1 && ((EDX >> 4) & 1) && HasAVXSave;		Features["avxvnniint8"] = HasLeaf7Subleaf1 && ((EDX >> 4) & 1) && HasAVXSave;
Features["avxneconvert"] = HasLeaf7Subleaf1 && ((EDX >> 5) & 1) && HasAVXSave;		Features["avxneconvert"] = HasLeaf7Subleaf1 && ((EDX >> 5) & 1) && HasAVXSave;
Features["amx-complex"] = HasLeaf7Subleaf1 && ((EDX >> 8) & 1) && HasAMXSave;		Features["amx-complex"] = HasLeaf7Subleaf1 && ((EDX >> 8) & 1) && HasAMXSave;
Features["avxvnniint16"] = HasLeaf7Subleaf1 && ((EDX >> 10) & 1) && HasAVXSave;		Features["avxvnniint16"] = HasLeaf7Subleaf1 && ((EDX >> 10) & 1) && HasAVXSave;
Features["prefetchi"] = HasLeaf7Subleaf1 && ((EDX >> 14) & 1);		Features["prefetchi"] = HasLeaf7Subleaf1 && ((EDX >> 14) & 1);
		Features["avx10.1"] = HasLeaf7Subleaf1 && ((EDX >> 19) & 1);

bool HasLeafD = MaxLevel >= 0xd &&		bool HasLeafD = MaxLevel >= 0xd &&
!getX86CpuIDAndInfoEx(0xd, 0x1, &EAX, &EBX, &ECX, &EDX);		!getX86CpuIDAndInfoEx(0xd, 0x1, &EAX, &EBX, &ECX, &EDX);

// Only enable XSAVE if OS has enabled support for saving YMM state.		// Only enable XSAVE if OS has enabled support for saving YMM state.
Features["xsaveopt"] = HasLeafD && ((EAX >> 0) & 1) && HasAVXSave;		Features["xsaveopt"] = HasLeafD && ((EAX >> 0) & 1) && HasAVXSave;
Features["xsavec"] = HasLeafD && ((EAX >> 1) & 1) && HasAVXSave;		Features["xsavec"] = HasLeafD && ((EAX >> 1) & 1) && HasAVXSave;
Features["xsaves"] = HasLeafD && ((EAX >> 3) & 1) && HasAVXSave;		Features["xsaves"] = HasLeafD && ((EAX >> 3) & 1) && HasAVXSave;

bool HasLeaf14 = MaxLevel >= 0x14 &&		bool HasLeaf14 = MaxLevel >= 0x14 &&
!getX86CpuIDAndInfoEx(0x14, 0x0, &EAX, &EBX, &ECX, &EDX);		!getX86CpuIDAndInfoEx(0x14, 0x0, &EAX, &EBX, &ECX, &EDX);

Features["ptwrite"] = HasLeaf14 && ((EBX >> 4) & 1);		Features["ptwrite"] = HasLeaf14 && ((EBX >> 4) & 1);

bool HasLeaf19 =		bool HasLeaf19 =
MaxLevel >= 0x19 && !getX86CpuIDAndInfo(0x19, &EAX, &EBX, &ECX, &EDX);		MaxLevel >= 0x19 && !getX86CpuIDAndInfo(0x19, &EAX, &EBX, &ECX, &EDX);
Features["widekl"] = HasLeaf7 && HasLeaf19 && ((EBX >> 2) & 1);		Features["widekl"] = HasLeaf7 && HasLeaf19 && ((EBX >> 2) & 1);

		bool HasLeaf24 =
		MaxLevel >= 0x24 && !getX86CpuIDAndInfo(0x24, &EAX, &EBX, &ECX, &EDX);
		Features["avx10-512bit"] = HasLeaf24 && ((EBX >> 18) & 1);

return true;		return true;
}		}
#elif defined(__linux__) && (defined(__arm__) \|\| defined(__aarch64__))		#elif defined(__linux__) && (defined(__arm__) \|\| defined(__aarch64__))
bool sys::getHostCPUFeatures(StringMap<bool> &Features) {		bool sys::getHostCPUFeatures(StringMap<bool> &Features) {
std::unique_ptr<llvm::MemoryBuffer> P = getProcCpuinfoContent();		std::unique_ptr<llvm::MemoryBuffer> P = getProcCpuinfoContent();
if (!P)		if (!P)
return false;		return false;

▲ Show 20 Lines • Show All 150 Lines • Show Last 20 Lines

llvm/lib/TargetParser/X86TargetParser.cpp

Show First 20 Lines • Show All 672 Lines • ▼ Show 20 Lines	constexpr FeatureBitset ImpliedFeaturesAVX512FP16 =
FeatureAVX512BW \| FeatureAVX512DQ \| FeatureAVX512VL;		FeatureAVX512BW \| FeatureAVX512DQ \| FeatureAVX512VL;
// Key Locker Features		// Key Locker Features
constexpr FeatureBitset ImpliedFeaturesKL = FeatureSSE2;		constexpr FeatureBitset ImpliedFeaturesKL = FeatureSSE2;
constexpr FeatureBitset ImpliedFeaturesWIDEKL = FeatureKL;		constexpr FeatureBitset ImpliedFeaturesWIDEKL = FeatureKL;

// AVXVNNI Features		// AVXVNNI Features
constexpr FeatureBitset ImpliedFeaturesAVXVNNI = FeatureAVX2;		constexpr FeatureBitset ImpliedFeaturesAVXVNNI = FeatureAVX2;

		constexpr FeatureBitset ImpliedFeaturesAVX10_1 =
		FeatureAVX512FP16 \| FeatureAVX512CD \| FeatureAVX512BF16 \|
		FeatureAVX512BITALG \| FeatureAVX512IFMA \| FeatureAVX512VNNI \|
		FeatureAVX512VPOPCNTDQ \| FeatureAVX512VBMI \| FeatureAVX512VBMI2;
		constexpr FeatureBitset ImpliedFeaturesAVX10_512BIT = {};

constexpr FeatureInfo FeatureInfos[X86::CPU_FEATURE_MAX] = {		constexpr FeatureInfo FeatureInfos[X86::CPU_FEATURE_MAX] = {
#define X86_FEATURE(ENUM, STR) {{STR}, ImpliedFeatures##ENUM},		#define X86_FEATURE(ENUM, STR) {{STR}, ImpliedFeatures##ENUM},
#include "llvm/TargetParser/X86TargetParser.def"		#include "llvm/TargetParser/X86TargetParser.def"
};		};

constexpr FeatureInfo FeatureInfos_WithPLUS[X86::CPU_FEATURE_MAX] = {		constexpr FeatureInfo FeatureInfos_WithPLUS[X86::CPU_FEATURE_MAX] = {
#define X86_FEATURE(ENUM, STR) {{"+" STR}, ImpliedFeatures##ENUM},		#define X86_FEATURE(ENUM, STR) {{"+" STR}, ImpliedFeatures##ENUM},
#include "llvm/TargetParser/X86TargetParser.def"		#include "llvm/TargetParser/X86TargetParser.def"
▲ Show 20 Lines • Show All 134 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/avx512-arith.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512f \| FileCheck %s --check-prefix=CHECK --check-prefix=AVX512F			; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512f \| FileCheck %s --check-prefix=CHECK --check-prefix=AVX512F
	; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512vl \| FileCheck %s --check-prefix=CHECK --check-prefix=AVX512VL			; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512vl \| FileCheck %s --check-prefix=CHECK --check-prefix=AVX512VL
	; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512bw \| FileCheck %s --check-prefix=CHECK --check-prefix=AVX512BW			; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512bw \| FileCheck %s --check-prefix=CHECK --check-prefix=AVX512BW
	; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512dq \| FileCheck %s --check-prefix=CHECK --check-prefix=AVX512DQ			; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512dq \| FileCheck %s --check-prefix=CHECK --check-prefix=AVX512DQ
	; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512dq,+avx512bw,+avx512vl \| FileCheck %s --check-prefix=CHECK --check-prefix=SKX			; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512dq,+avx512bw,+avx512vl \| FileCheck %s --check-prefix=CHECK --check-prefix=SKX
				; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx10.1,+avx10-512bit \| FileCheck %s --check-prefix=CHECK --check-prefix=SKX

	define <8 x double> @addpd512(<8 x double> %y, <8 x double> %x) {			define <8 x double> @addpd512(<8 x double> %y, <8 x double> %x) {
	; CHECK-LABEL: addpd512:			; CHECK-LABEL: addpd512:
	; CHECK: # %bb.0: # %entry			; CHECK: # %bb.0: # %entry
	; CHECK-NEXT: vaddpd %zmm0, %zmm1, %zmm0			; CHECK-NEXT: vaddpd %zmm0, %zmm1, %zmm0
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	entry:			entry:
	%add.i = fadd <8 x double> %x, %y			%add.i = fadd <8 x double> %x, %y
	▲ Show 20 Lines • Show All 1,167 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/avx512-broadcast-arith.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx512f \| FileCheck %s --check-prefixes=AVX512F			; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx512f \| FileCheck %s --check-prefixes=AVX512F
	; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx512f,+avx512bw \| FileCheck %s --check-prefixes=AVX512BW			; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx512f,+avx512bw \| FileCheck %s --check-prefixes=AVX512BW
				; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx10.1,+avx10-512bit \| FileCheck %s --check-prefixes=AVX512BW

	; PR34666			; PR34666
	define <64 x i8> @add_v64i8_broadcasts(<64 x i8> %a0, i64 %a1, i8 %a2) {			define <64 x i8> @add_v64i8_broadcasts(<64 x i8> %a0, i64 %a1, i8 %a2) {
	; AVX512F-LABEL: add_v64i8_broadcasts:			; AVX512F-LABEL: add_v64i8_broadcasts:
	; AVX512F: # %bb.0:			; AVX512F: # %bb.0:
	; AVX512F-NEXT: movq %rdi, %rax			; AVX512F-NEXT: movq %rdi, %rax
	; AVX512F-NEXT: movl %edi, %ecx			; AVX512F-NEXT: movl %edi, %ecx
	; AVX512F-NEXT: kmovw %edi, %k1			; AVX512F-NEXT: kmovw %edi, %k1
	▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/avx512bw-arith.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512bw \| FileCheck %s			; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512bw \| FileCheck %s
				; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx10.1,+avx10-512bit \| FileCheck %s

	define <64 x i8> @vpaddb512_test(<64 x i8> %i, <64 x i8> %j) nounwind readnone {			define <64 x i8> @vpaddb512_test(<64 x i8> %i, <64 x i8> %j) nounwind readnone {
	; CHECK-LABEL: vpaddb512_test:			; CHECK-LABEL: vpaddb512_test:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: vpaddb %zmm1, %zmm0, %zmm0			; CHECK-NEXT: vpaddb %zmm1, %zmm0, %zmm0
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%x = add <64 x i8> %i, %j			%x = add <64 x i8> %i, %j
	ret <64 x i8> %x			ret <64 x i8> %x
	▲ Show 20 Lines • Show All 108 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/avx512bwvl-arith.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512bw,+avx512vl \| FileCheck %s			; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512bw,+avx512vl \| FileCheck %s
				; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx10.1 \| FileCheck %s

	; 256-bit			; 256-bit

	define <32 x i8> @vpaddb256_test(<32 x i8> %i, <32 x i8> %j) nounwind readnone {			define <32 x i8> @vpaddb256_test(<32 x i8> %i, <32 x i8> %j) nounwind readnone {
	; CHECK-LABEL: vpaddb256_test:			; CHECK-LABEL: vpaddb256_test:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: vpaddb %ymm1, %ymm0, %ymm0			; CHECK-NEXT: vpaddb %ymm1, %ymm0, %ymm0
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	▲ Show 20 Lines • Show All 227 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/avx512fp16-arith.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=x86_64-apple-darwin -mcpu=skx -mattr=+avx512fp16 \| FileCheck %s			; RUN: llc < %s -mtriple=x86_64-apple-darwin -mcpu=skx -mattr=+avx512fp16 \| FileCheck %s
				; RUN: llc < %s -mtriple=x86_64-apple-darwin -mattr=+avx10.1,+avx10-512bit \| FileCheck %s

	define <32 x half> @vaddph_512_test(<32 x half> %i, <32 x half> %j) nounwind readnone {			define <32 x half> @vaddph_512_test(<32 x half> %i, <32 x half> %j) nounwind readnone {
	; CHECK-LABEL: vaddph_512_test:			; CHECK-LABEL: vaddph_512_test:
	; CHECK: ## %bb.0:			; CHECK: ## %bb.0:
	; CHECK-NEXT: vaddph %zmm1, %zmm0, %zmm0			; CHECK-NEXT: vaddph %zmm1, %zmm0, %zmm0
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%x = fadd <32 x half> %i, %j			%x = fadd <32 x half> %i, %j
	ret <32 x half> %x			ret <32 x half> %x
	▲ Show 20 Lines • Show All 648 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/avx512vl-arith.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=x86_64-apple-darwin -mcpu=knl -mattr=+avx512vl --show-mc-encoding\| FileCheck %s			; RUN: llc < %s -mtriple=x86_64-apple-darwin -mcpu=knl -mattr=+avx512vl --show-mc-encoding\| FileCheck %s
				; RUN: llc < %s -mtriple=x86_64-apple-darwin -mattr=+avx10.1 --show-mc-encoding\| FileCheck %s

	; 256-bit			; 256-bit

	define <4 x i64> @vpaddq256_test(<4 x i64> %i, <4 x i64> %j) nounwind readnone {			define <4 x i64> @vpaddq256_test(<4 x i64> %i, <4 x i64> %j) nounwind readnone {
	; CHECK-LABEL: vpaddq256_test:			; CHECK-LABEL: vpaddq256_test:
	; CHECK: ## %bb.0:			; CHECK: ## %bb.0:
	; CHECK-NEXT: vpaddq %ymm1, %ymm0, %ymm0 ## EVEX TO VEX Compression encoding: [0xc5,0xfd,0xd4,0xc1]			; CHECK-NEXT: vpaddq %ymm1, %ymm0, %ymm0 ## EVEX TO VEX Compression encoding: [0xc5,0xfd,0xd4,0xc1]
	; CHECK-NEXT: retq ## encoding: [0xc3]			; CHECK-NEXT: retq ## encoding: [0xc3]
	▲ Show 20 Lines • Show All 854 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[X86][RFC] Support new feature AVX10Changes PlannedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 550669

clang/docs/ReleaseNotes.rst

clang/include/clang/Driver/Options.td

clang/lib/Basic/Targets/X86.h

clang/lib/Basic/Targets/X86.cpp

clang/lib/CodeGen/CGBuiltin.cpp

clang/lib/CodeGen/CodeGenFunction.h

clang/lib/CodeGen/CodeGenFunction.cpp

clang/lib/CodeGen/Targets/X86.cpp

clang/lib/Driver/ToolChains/Arch/X86.cpp

clang/test/CodeGen/X86/avx10-error.c

clang/test/CodeGen/attr-target-x86.c

clang/test/CodeGen/target-avx-abi-diag.c

clang/test/Driver/x86-target-features.c

clang/test/Preprocessor/x86_target_features.c

llvm/docs/ReleaseNotes.rst

llvm/include/llvm/TargetParser/X86TargetParser.def

llvm/lib/IR/Verifier.cpp

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp

llvm/lib/Target/X86/X86.td

llvm/lib/Target/X86/X86InstrInfo.td

llvm/lib/Target/X86/X86RegisterInfo.cpp

llvm/lib/Target/X86/X86Subtarget.h

llvm/lib/TargetParser/Host.cpp

llvm/lib/TargetParser/X86TargetParser.cpp

llvm/test/CodeGen/X86/avx512-arith.ll

llvm/test/CodeGen/X86/avx512-broadcast-arith.ll

llvm/test/CodeGen/X86/avx512bw-arith.ll

llvm/test/CodeGen/X86/avx512bwvl-arith.ll

llvm/test/CodeGen/X86/avx512fp16-arith.ll

llvm/test/CodeGen/X86/avx512vl-arith.ll

[X86][RFC] Support new feature AVX10
Changes PlannedPublic