Support for neoverse-512tvb mirrors the equivalent option available in GCC[1].
The option has no functional effect yet.
This patch ensures the driver accepts "-mcpu=neoverse-512tvb" and puts enough
plumbing in place for the new option to be used in the future.
Event Timeline
This is a bit of a shame. I was hoping we wouldn't need the same hacks as GCC. The LLVM cost modelling can at times work quite differently from GCC's, and I didn't think we were close enough to optimal code to need to worry about these kinds of differences. I guess having the option is useful for consistency.
llvm/lib/Support/Host.cpp | ||
---|---|---|
216 ↗ | (On Diff #381853) | This doesn't sound right - for a fake cpu to work with -mcpu=native. |
llvm/lib/Target/AArch64/AArch64Subtarget.cpp | ||
---|---|---|
168 | Should this have Loop Alignment too? Is the interleave factor higher due to the 512-bit vector bandwidth? | |
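For readers unfamiliar with the knobs mentioned in the comment above: MaxInterleaveFactor, loop alignment, and VScaleForTuning are per-CPU tuning properties that the AArch64 subtarget sets up for each processor family. The snippet below is a minimal, self-contained mock of that idea, with a hypothetical CpuTuning struct and purely illustrative values; it is not the code from this patch.

```cpp
#include <cstdio>

// Illustrative stand-in for the per-CPU tuning knobs discussed above. The
// struct name, members, and values are placeholders, not the ones set by
// the patch in AArch64Subtarget.cpp.
struct CpuTuning {
  unsigned MaxInterleaveFactor = 2; // interleave (unroll) factor the vectorizer may use
  unsigned PrefLoopAlignment = 0;   // preferred loop alignment, 0 = target default
  unsigned VScaleForTuning = 1;     // vscale value the cost model assumes for SVE
};

int main() {
  // Hypothetical settings for a core advertising wide total vector bandwidth.
  CpuTuning Tuning;
  Tuning.MaxInterleaveFactor = 4;
  Tuning.VScaleForTuning = 1;
  std::printf("interleave=%u vscale=%u loop-align=%u\n",
              Tuning.MaxInterleaveFactor, Tuning.VScaleForTuning,
              Tuning.PrefLoopAlignment);
  return 0;
}
```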
llvm/lib/Target/ARM/ARM.td | ||
---|---|---|
1412 ↗ | (On Diff #381853) | Are we sure gcc has a -mcpu=neoverse-512tvb option for Arm? Or is it AArch64 only? |
This one might get a VScaleForTuning:
https://reviews.llvm.org/D112459
Do you need this as well?
Thank you for your review @dmgreen and @tschuett.
I have rebased the patch; VScaleForTuning is now being set.
I also removed support for neoverse-512tvb from 32-bit Arm, since
@paulwalker-arm pointed out that neoverse-512tvb is only supported on AArch64.
llvm/lib/Support/Host.cpp | ||
---|---|---|
216 ↗ | (On Diff #381853) | Good catch, I should have removed that. |
Thanks. If the cpu has 512-bit total vector bandwidth, should VScaleForTuning be 1 or 2 (or higher)? LLVM doesn't usually deal with total bandwidth much, perhaps not as much as it should.
@david-arm, any thoughts?
The total vector bandwidth includes unrolling, so currently having VScaleForTuning=1 and MaxInterleaveFactor=4 implies a 512-bit TVB. If the target has >128-bit vectors, then vector loops will likely have more work than they can handle in parallel, but as long as that does not negatively affect register pressure it shouldn't be a problem.
> The total vector bandwidth includes unrolling, so currently having VScaleForTuning=1 and MaxInterleaveFactor=4 implies a 512-bit TVB. If the target has >128-bit vectors, then vector loops will likely have more work than they can handle in parallel, but as long as that does not negatively affect register pressure it shouldn't be a problem.
That doesn't fit with my understanding of how VScaleForTuning is currently used, and vectorizing/unrolling too far can easily cause the vector part to be skipped for many loop counts, falling back to the scalar part. But that all sounds fine to me for what this is. Cheers.
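To spell out the arithmetic behind the exchange above: with a 128-bit base vector length, VScaleForTuning=1 and MaxInterleaveFactor=4 account for 128 * 1 * 4 = 512 bits of vector work per iteration, which is where the 512-bit-TVB reading comes from. A minimal sketch of that calculation follows; the helper function is hypothetical, not an LLVM API.

```cpp
#include <cstdio>

// Hypothetical helper (not an LLVM API): bits of vector work issued per
// vectorized loop iteration, assuming a 128-bit base vector length scaled by
// vscale and multiplied by the interleave (unroll) factor.
unsigned totalVectorBandwidthBits(unsigned VScaleForTuning,
                                  unsigned MaxInterleaveFactor) {
  const unsigned BaseVectorBits = 128; // Neon / minimum SVE vector length
  return BaseVectorBits * VScaleForTuning * MaxInterleaveFactor;
}

int main() {
  // VScaleForTuning=1, MaxInterleaveFactor=4 -> 512 bits, matching the
  // reading above.
  std::printf("%u\n", totalVectorBandwidthBits(1, 4)); // prints 512
  // A core with 256-bit SVE vectors (vscale 2) would reach 512 bits with an
  // interleave factor of 2 instead.
  std::printf("%u\n", totalVectorBandwidthBits(2, 2)); // prints 512
  return 0;
}
```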
nit: s/512/512-TVB/