This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/test/
-
test/
-
Driver/
-
aarch64-cpus.c
-
Preprocessor/
-
aarch64-target-features.c
-
llvm/
-
include/llvm/Support/
-
llvm/
-
Support/
-
AArch64TargetParser.def
-
lib/
-
Support/
-
Host.cpp
-
Target/AArch64/
-
AArch64/
-
AArch64.td
-
AArch64Subtarget.h
1
AArch64Subtarget.cpp
-
test/CodeGen/AArch64/
-
CodeGen/
-
AArch64/
-
cpus.ll
-
preferred-function-alignment.ll
-
unittests/Support/
-
Support/
-
Host.cpp
-
TargetParserTest.cpp

Differential D75594

[AArch64] Add support for Fujitsu A64FX
ClosedPublic

Authored by kawashima-fj on Mar 3 2020, 11:12 PM.

Download Raw Diff

Details

Reviewers

t.p.northover
sdesmalen
SjoerdMeijer
dmgreen

Commits

rGc8cd1a994d28: [AArch64] Add support for Fujitsu A64FX

Summary

A64FX is used in FUJITSU Supercomputer PRIMEHPC FX1000, PRIMEHPC FX700,
and supercomputer Fugaku.

https://www.fujitsu.com/global/products/computing/servers/supercomputer/specifications/

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

kawashima-fj created this revision.Mar 3 2020, 11:12 PM

Herald added subscribers: llvm-commits, jfb, hiraditya, kristof.beyls. · View Herald TranscriptMar 3 2020, 11:12 PM

Harbormaster failed remote builds in B48016: Diff 248109!Mar 4 2020, 12:06 AM

An unnecessary comment line is removed and indentation is aligned.

huntergr added a subscriber: huntergr.Mar 4 2020, 1:42 AM

Sounds good. Should there be some clang tests? For example in clang/test/Driver/aarch64-cpus.c

The technical paper mentions dotprod. I presume it means SVE dotprod, not AEK_DOTPROD?

llvm/lib/Target/AArch64/AArch64Subtarget.cpp
94	A 32byte loop alignment sounds very high. Are you sure executing that many NOP's will be beneficial?

willlovett added a subscriber: willlovett.Mar 4 2020, 6:01 AM

Looks good, I agree with @dmgreen that a clang driver test would be nice.

I think AEK_DOTPROD was introduced with 8.4 (and backported to 8.2 as an optional feature?) so I suspect the dot product support is just for SVE; it certainly isn't present in the cpuinfo feature flags.

As far as the loop alignment goes, would the A64FX benefit from planting an unconditional branch at the start of a series of alignment nops to skip actually executing them? (Not a change I'm requesting in this patch, just wondering if it would help with performance if we did have to plant lots of nops)

@huntergr @dmgreen Thanks for your reviews.

I added tests to clang/test/Driver/aarch64-cpus.c and clang/test/Preprocessor/aarch64-target-features.c.

Yes, you are correct. 'Dot product' in Fujitsu techincal papers denotes 'integer dot product' in SVE, not 'SIMD dot product' in ARMv8.2-DotProd.

My colleague measured SPEC CPU2017 with PrefLoopLogAlignment = 2 and 5. The performance effect varies depending on benchmarks, and we could determine the best parameter. I want to submit this patch with 5 and revisit it later (possibly with the scheduling model).

I want to submit this patch with 5 and revisit it later (possibly with the scheduling model).

Sounds sensible. LGTM.

This revision is now accepted and ready to land.Mar 5 2020, 11:34 PM

Closed by commit rGc8cd1a994d28: [AArch64] Add support for Fujitsu A64FX (authored by kawashima-fj). · Explain WhyMar 9 2020, 3:44 AM

This revision was automatically updated to reflect the committed changes.

Herald added a project: Restricted Project. · View Herald TranscriptMar 9 2020, 3:44 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Publicly available information on A64FX:

https://github.com/fujitsu/A64FX

This revision is now accepted and ready to land.Mar 17 2020, 3:28 PM

Yes, https://github.com/fujitsu/A64FX contains the official microarchitecture information of A64FX. I wanted to include the URL in the Git commit message but the disclosure was not ready for it at the time.

In D75594#1927959, @kawashima-fj wrote:

Yes, https://github.com/fujitsu/A64FX contains the official microarchitecture information of A64FX. I wanted to include the URL in the Git commit message but the disclosure was not ready for it at the time.

Can you do it at the next commit opportunity as this reference manual should be broadly read by the Arm developer community?

In D75594#1927988, @ikitayama wrote:

In D75594#1927959, @kawashima-fj wrote:

Yes, https://github.com/fujitsu/A64FX contains the official microarchitecture information of A64FX. I wanted to include the URL in the Git commit message but the disclosure was not ready for it at the time.

Can you do it at the next commit opportunity as this reference manual should be broadly read by the Arm developer community?

Sure. I'll do.

kawashima-fj mentioned this in D93791: [AArch64] Add Fujitsu A64FX scheduling model.Dec 23 2020, 6:25 PM

kawashima-fj mentioned this in rGb54337070b19: [AArch64] Add Fujitsu A64FX scheduling model.Jan 15 2021, 12:20 AM

In D75594#1927988, @ikitayama wrote:

In D75594#1927959, @kawashima-fj wrote:

Yes, https://github.com/fujitsu/A64FX contains the official microarchitecture information of A64FX. I wanted to include the URL in the Git commit message but the disclosure was not ready for it at the time.

Can you do it at the next commit opportunity as this reference manual should be broadly read by the Arm developer community?

Done in b54337070b198cf66356a4ee3e420666151a2023 .

Herald added subscribers: dexonsmith, danielkiss. · View Herald TranscriptJan 15 2021, 4:32 AM

Matt added a subscriber: Matt.Jan 19 2021, 9:13 AM

timsmith78 added a subscriber: timsmith78.Jan 21 2021, 9:21 AM

Revision Contents

Path

Size

clang/

test/

Driver/

aarch64-cpus.c

14 lines

Preprocessor/

aarch64-target-features.c

2 lines

llvm/

include/

llvm/

Support/

AArch64TargetParser.def

2 lines

lib/

Support/

Host.cpp

10 lines

Target/

AArch64/

AArch64.td

17 lines

AArch64Subtarget.h

1 line

AArch64Subtarget.cpp

5 lines

test/

CodeGen/

AArch64/

cpus.ll

1 line

preferred-function-alignment.ll

1 line

unittests/

Support/

Host.cpp

13 lines

TargetParserTest.cpp

14 lines

Diff 249051

clang/test/Driver/aarch64-cpus.c

	Show First 20 Lines • Show All 263 Lines • ▼ Show 20 Lines
	// RUN: %clang -target arm64 -mcpu=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-THUNDERX2T99 %s			// RUN: %clang -target arm64 -mcpu=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-THUNDERX2T99 %s
	// RUN: %clang -target arm64 -mlittle-endian -mcpu=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-THUNDERX2T99 %s			// RUN: %clang -target arm64 -mlittle-endian -mcpu=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-THUNDERX2T99 %s
	// RUN: %clang -target arm64 -mtune=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-THUNDERX2T99-TUNE %s			// RUN: %clang -target arm64 -mtune=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-THUNDERX2T99-TUNE %s
	// RUN: %clang -target arm64 -mlittle-endian -mtune=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-THUNDERX2T99-TUNE %s			// RUN: %clang -target arm64 -mlittle-endian -mtune=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-THUNDERX2T99-TUNE %s
	// ARM64-THUNDERX2T99: "-cc1"{{.}} "-triple" "arm64{{.}}" "-target-cpu" "thunderx2t99" "-target-feature" "+v8.1a"			// ARM64-THUNDERX2T99: "-cc1"{{.}} "-triple" "arm64{{.}}" "-target-cpu" "thunderx2t99" "-target-feature" "+v8.1a"
	// ARM64-THUNDERX2T99-TUNE: "-cc1"{{.}} "-triple" "arm64{{.}}" "-target-cpu" "generic"			// ARM64-THUNDERX2T99-TUNE: "-cc1"{{.}} "-triple" "arm64{{.}}" "-target-cpu" "generic"
	// ARM64-THUNDERX2T99-TUNE-NOT: +v8.1a			// ARM64-THUNDERX2T99-TUNE-NOT: +v8.1a

				// RUN: %clang -target aarch64 -mcpu=a64fx -### -c %s 2>&1 \| FileCheck -check-prefix=A64FX %s
				// RUN: %clang -target aarch64 -mlittle-endian -mcpu=a64fx -### -c %s 2>&1 \| FileCheck -check-prefix=A64FX %s
				// RUN: %clang -target aarch64 -mtune=a64fx -### -c %s 2>&1 \| FileCheck -check-prefix=A64FX-TUNE %s
				// RUN: %clang -target aarch64 -mlittle-endian -mtune=a64fx -### -c %s 2>&1 \| FileCheck -check-prefix=A64FX-TUNE %s
				// A64FX: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-cpu" "a64fx"
				// A64FX-TUNE: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-cpu" "generic"

				// RUN: %clang -target arm64 -mcpu=a64fx -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-A64FX %s
				// RUN: %clang -target arm64 -mlittle-endian -mcpu=a64fx -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-A64FX %s
				// RUN: %clang -target arm64 -mtune=a64fx -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-A64FX-TUNE %s
				// RUN: %clang -target arm64 -mlittle-endian -mtune=a64fx -### -c %s 2>&1 \| FileCheck -check-prefix=ARM64-A64FX-TUNE %s
				// ARM64-A64FX: "-cc1"{{.}} "-triple" "arm64{{.}}" "-target-cpu" "a64fx"
				// ARM64-A64FX-TUNE: "-cc1"{{.}} "-triple" "arm64{{.}}" "-target-cpu" "generic"

	// RUN: %clang -target aarch64_be -### -c %s 2>&1 \| FileCheck -check-prefix=GENERIC-BE %s			// RUN: %clang -target aarch64_be -### -c %s 2>&1 \| FileCheck -check-prefix=GENERIC-BE %s
	// RUN: %clang -target aarch64 -mbig-endian -### -c %s 2>&1 \| FileCheck -check-prefix=GENERIC-BE %s			// RUN: %clang -target aarch64 -mbig-endian -### -c %s 2>&1 \| FileCheck -check-prefix=GENERIC-BE %s
	// RUN: %clang -target aarch64_be -mbig-endian -### -c %s 2>&1 \| FileCheck -check-prefix=GENERIC-BE %s			// RUN: %clang -target aarch64_be -mbig-endian -### -c %s 2>&1 \| FileCheck -check-prefix=GENERIC-BE %s
	// GENERIC-BE: "-cc1"{{.}} "-triple" "aarch64_be{{.}}" "-target-cpu" "generic"			// GENERIC-BE: "-cc1"{{.}} "-triple" "aarch64_be{{.}}" "-target-cpu" "generic"

	// RUN: %clang -target aarch64_be -mcpu=cortex-a35 -### -c %s 2>&1 \| FileCheck -check-prefix=CA35-BE %s			// RUN: %clang -target aarch64_be -mcpu=cortex-a35 -### -c %s 2>&1 \| FileCheck -check-prefix=CA35-BE %s
	// RUN: %clang -target aarch64 -mbig-endian -mcpu=cortex-a35 -### -c %s 2>&1 \| FileCheck -check-prefix=CA35-BE %s			// RUN: %clang -target aarch64 -mbig-endian -mcpu=cortex-a35 -### -c %s 2>&1 \| FileCheck -check-prefix=CA35-BE %s
	// RUN: %clang -target aarch64_be -mbig-endian -mcpu=cortex-a35 -### -c %s 2>&1 \| FileCheck -check-prefix=CA35-BE %s			// RUN: %clang -target aarch64_be -mbig-endian -mcpu=cortex-a35 -### -c %s 2>&1 \| FileCheck -check-prefix=CA35-BE %s
	▲ Show 20 Lines • Show All 361 Lines • Show Last 20 Lines

clang/test/Preprocessor/aarch64-target-features.c

	Show First 20 Lines • Show All 154 Lines • ▼ Show 20 Lines
	// RUN: %clang -target aarch64 -mcpu=cortex-a57 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-A57 %s			// RUN: %clang -target aarch64 -mcpu=cortex-a57 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-A57 %s
	// RUN: %clang -target aarch64 -mcpu=cortex-a72 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-A72 %s			// RUN: %clang -target aarch64 -mcpu=cortex-a72 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-A72 %s
	// RUN: %clang -target aarch64 -mcpu=cortex-a73 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-CORTEX-A73 %s			// RUN: %clang -target aarch64 -mcpu=cortex-a73 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-CORTEX-A73 %s
	// RUN: %clang -target aarch64 -mcpu=exynos-m3 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-M1 %s			// RUN: %clang -target aarch64 -mcpu=exynos-m3 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-M1 %s
	// RUN: %clang -target aarch64 -mcpu=exynos-m4 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-M4 %s			// RUN: %clang -target aarch64 -mcpu=exynos-m4 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-M4 %s
	// RUN: %clang -target aarch64 -mcpu=exynos-m5 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-M4 %s			// RUN: %clang -target aarch64 -mcpu=exynos-m5 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-M4 %s
	// RUN: %clang -target aarch64 -mcpu=kryo -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-KRYO %s			// RUN: %clang -target aarch64 -mcpu=kryo -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-KRYO %s
	// RUN: %clang -target aarch64 -mcpu=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-THUNDERX2T99 %s			// RUN: %clang -target aarch64 -mcpu=thunderx2t99 -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-THUNDERX2T99 %s
				// RUN: %clang -target aarch64 -mcpu=a64fx -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MCPU-A64FX %s
	// CHECK-MCPU-APPLE-A7: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crypto" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"			// CHECK-MCPU-APPLE-A7: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crypto" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"
	// CHECK-MCPU-APPLE-A10: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+rdm" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"			// CHECK-MCPU-APPLE-A10: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+rdm" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"
	// CHECK-MCPU-APPLE-A11: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+v8.2a" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+ras" "-target-feature" "+lse" "-target-feature" "+rdm" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"			// CHECK-MCPU-APPLE-A11: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+v8.2a" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+ras" "-target-feature" "+lse" "-target-feature" "+rdm" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"
	// CHECK-MCPU-APPLE-A12: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+v8.3a" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+fullfp16" "-target-feature" "+ras" "-target-feature" "+lse" "-target-feature" "+rdm" "-target-feature" "+rcpc" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"			// CHECK-MCPU-APPLE-A12: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+v8.3a" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+fullfp16" "-target-feature" "+ras" "-target-feature" "+lse" "-target-feature" "+rdm" "-target-feature" "+rcpc" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"
	// CHECK-MCPU-A34: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc"			// CHECK-MCPU-A34: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc"
	// CHECK-MCPU-APPLE-A13: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+v8.4a" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+dotprod" "-target-feature" "+fullfp16" "-target-feature" "+ras" "-target-feature" "+lse" "-target-feature" "+rdm" "-target-feature" "+rcpc" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+fp16fml" "-target-feature" "+sm4" "-target-feature" "+sha3" "-target-feature" "+sha2" "-target-feature" "+aes"			// CHECK-MCPU-APPLE-A13: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+v8.4a" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+dotprod" "-target-feature" "+fullfp16" "-target-feature" "+ras" "-target-feature" "+lse" "-target-feature" "+rdm" "-target-feature" "+rcpc" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+fp16fml" "-target-feature" "+sm4" "-target-feature" "+sha3" "-target-feature" "+sha2" "-target-feature" "+aes"
	// CHECK-MCPU-A35: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"			// CHECK-MCPU-A35: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"
	// CHECK-MCPU-A53: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"			// CHECK-MCPU-A53: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"
	// CHECK-MCPU-A57: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"			// CHECK-MCPU-A57: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"
	// CHECK-MCPU-A72: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"			// CHECK-MCPU-A72: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"
	// CHECK-MCPU-CORTEX-A73: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"			// CHECK-MCPU-CORTEX-A73: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"
	// CHECK-MCPU-M1: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"			// CHECK-MCPU-M1: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"
	// CHECK-MCPU-M4: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+dotprod" "-target-feature" "+fullfp16"			// CHECK-MCPU-M4: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+dotprod" "-target-feature" "+fullfp16"
	// CHECK-MCPU-KRYO: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"			// CHECK-MCPU-KRYO: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"
	// CHECK-MCPU-THUNDERX2T99: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"			// CHECK-MCPU-THUNDERX2T99: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto"
				// CHECK-MCPU-A64FX: "-cc1"{{.}} "-triple" "aarch64{{.}}" "-target-feature" "+v8.2a" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+fullfp16" "-target-feature" "+ras" "-target-feature" "+lse" "-target-feature" "+rdm" "-target-feature" "+sve" "-target-feature" "+sha2"

	// RUN: %clang -target x86_64-apple-macosx -arch arm64 -### -c %s 2>&1 \| FileCheck --check-prefix=CHECK-ARCH-ARM64 %s			// RUN: %clang -target x86_64-apple-macosx -arch arm64 -### -c %s 2>&1 \| FileCheck --check-prefix=CHECK-ARCH-ARM64 %s
	// CHECK-ARCH-ARM64: "-target-cpu" "apple-a7" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crypto" "-target-feature" "+zcm" "-target-feature" "+zcz"			// CHECK-ARCH-ARM64: "-target-cpu" "apple-a7" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crypto" "-target-feature" "+zcm" "-target-feature" "+zcz"

	// RUN: %clang -target x86_64-apple-macosx -arch arm64_32 -### -c %s 2>&1 \| FileCheck --check-prefix=CHECK-ARCH-ARM64_32 %s			// RUN: %clang -target x86_64-apple-macosx -arch arm64_32 -### -c %s 2>&1 \| FileCheck --check-prefix=CHECK-ARCH-ARM64_32 %s
	// CHECK-ARCH-ARM64_32: "-target-cpu" "apple-s4" "-target-feature" "+v8.3a" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+fullfp16" "-target-feature" "+ras" "-target-feature" "+lse" "-target-feature" "+rdm" "-target-feature" "+rcpc" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"			// CHECK-ARCH-ARM64_32: "-target-cpu" "apple-s4" "-target-feature" "+v8.3a" "-target-feature" "+fp-armv8" "-target-feature" "+neon" "-target-feature" "+crc" "-target-feature" "+crypto" "-target-feature" "+fullfp16" "-target-feature" "+ras" "-target-feature" "+lse" "-target-feature" "+rdm" "-target-feature" "+rcpc" "-target-feature" "+zcm" "-target-feature" "+zcz" "-target-feature" "+sha2" "-target-feature" "+aes"

	// RUN: %clang -target aarch64 -march=armv8-a+fp+simd+crc+crypto -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MARCH-1 %s			// RUN: %clang -target aarch64 -march=armv8-a+fp+simd+crc+crypto -### -c %s 2>&1 \| FileCheck -check-prefix=CHECK-MARCH-1 %s
	▲ Show 20 Lines • Show All 149 Lines • Show Last 20 Lines

llvm/include/llvm/Support/AArch64TargetParser.def

	Show First 20 Lines • Show All 160 Lines • ▼ Show 20 Lines
	AARCH64_CPU_NAME("thunderxt81", ARMV8A, FK_CRYPTO_NEON_FP_ARMV8, false,			AARCH64_CPU_NAME("thunderxt81", ARMV8A, FK_CRYPTO_NEON_FP_ARMV8, false,
	(AArch64::AEK_CRC \| AArch64::AEK_PROFILE))			(AArch64::AEK_CRC \| AArch64::AEK_PROFILE))
	AARCH64_CPU_NAME("thunderxt83", ARMV8A, FK_CRYPTO_NEON_FP_ARMV8, false,			AARCH64_CPU_NAME("thunderxt83", ARMV8A, FK_CRYPTO_NEON_FP_ARMV8, false,
	(AArch64::AEK_CRC \| AArch64::AEK_PROFILE))			(AArch64::AEK_CRC \| AArch64::AEK_PROFILE))
	AARCH64_CPU_NAME("tsv110", ARMV8_2A, FK_CRYPTO_NEON_FP_ARMV8, false,			AARCH64_CPU_NAME("tsv110", ARMV8_2A, FK_CRYPTO_NEON_FP_ARMV8, false,
	(AArch64::AEK_DOTPROD \|			(AArch64::AEK_DOTPROD \|
	AArch64::AEK_FP16 \| AArch64::AEK_FP16FML \|			AArch64::AEK_FP16 \| AArch64::AEK_FP16FML \|
	AArch64::AEK_PROFILE))			AArch64::AEK_PROFILE))
				AARCH64_CPU_NAME("a64fx", ARMV8_2A, FK_CRYPTO_NEON_FP_ARMV8, false,
				(AArch64::AEK_FP16 \| AArch64::AEK_SVE))
	// Invalid CPU			// Invalid CPU
	AARCH64_CPU_NAME("invalid", INVALID, FK_INVALID, true, AArch64::AEK_INVALID)			AARCH64_CPU_NAME("invalid", INVALID, FK_INVALID, true, AArch64::AEK_INVALID)
	#undef AARCH64_CPU_NAME			#undef AARCH64_CPU_NAME

llvm/lib/Support/Host.cpp

Show First 20 Lines • Show All 213 Lines • ▼ Show 20 Lines	for (unsigned I = 0, E = Lines.size(); I != E; ++I) {
.Case("0x0af", "thunderx2t99")		.Case("0x0af", "thunderx2t99")
.Case("0xa1", "thunderxt88")		.Case("0xa1", "thunderxt88")
.Case("0x0a1", "thunderxt88")		.Case("0x0a1", "thunderxt88")
.Default("generic");		.Default("generic");
}		}
}		}
}		}

		if (Implementer == "0x46") { // Fujitsu Ltd.
		for (unsigned I = 0, E = Lines.size(); I != E; ++I) {
		if (Lines[I].startswith("CPU part")) {
		return StringSwitch<const char *>(Lines[I].substr(8).ltrim("\t :"))
		.Case("0x001", "a64fx")
		.Default("generic");
		}
		}
		}

if (Implementer == "0x48") // HiSilicon Technologies, Inc.		if (Implementer == "0x48") // HiSilicon Technologies, Inc.
// Look for the CPU part line.		// Look for the CPU part line.
for (unsigned I = 0, E = Lines.size(); I != E; ++I)		for (unsigned I = 0, E = Lines.size(); I != E; ++I)
if (Lines[I].startswith("CPU part"))		if (Lines[I].startswith("CPU part"))
// The CPU part is a 3 digit hexadecimal number with a 0x prefix. The		// The CPU part is a 3 digit hexadecimal number with a 0x prefix. The
// values correspond to the "Part number" in the CP15/c0 register. The		// values correspond to the "Part number" in the CP15/c0 register. The
// contents are specified in the various processor manuals.		// contents are specified in the various processor manuals.
return StringSwitch<const char *>(Lines[I].substr(8).ltrim("\t :"))		return StringSwitch<const char *>(Lines[I].substr(8).ltrim("\t :"))
▲ Show 20 Lines • Show All 1,361 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64.td

Show First 20 Lines • Show All 557 Lines • ▼ Show 20 Lines	def ProcA76 : SubtargetFeature<"a76", "ARMProcFamily", "CortexA76",
FeatureNEON,		FeatureNEON,
FeatureRCPC,		FeatureRCPC,
FeatureCrypto,		FeatureCrypto,
FeatureFullFP16,		FeatureFullFP16,
FeatureDotProd,		FeatureDotProd,
FeatureSSBS		FeatureSSBS
]>;		]>;

		def ProcA64FX : SubtargetFeature<"a64fx", "ARMProcFamily", "A64FX",
		"Fujitsu A64FX processors", [
		HasV8_2aOps,
		FeatureFPARMv8,
		FeatureNEON,
		FeatureSHA2,
		FeaturePerfMon,
		FeatureFullFP16,
		FeatureSVE,
		FeaturePostRAScheduler,
		FeatureComplxNum
		]>;

// Note that cyclone does not fuse AES instructions, but newer apple chips do		// Note that cyclone does not fuse AES instructions, but newer apple chips do
// perform the fusion and cyclone is used by default when targetting apple OSes.		// perform the fusion and cyclone is used by default when targetting apple OSes.
def ProcAppleA7 : SubtargetFeature<"apple-a7", "ARMProcFamily", "AppleA7",		def ProcAppleA7 : SubtargetFeature<"apple-a7", "ARMProcFamily", "AppleA7",
"Apple A7 (the CPU formerly known as Cyclone)", [		"Apple A7 (the CPU formerly known as Cyclone)", [
FeatureAlternateSExtLoadCVTF32Pattern,		FeatureAlternateSExtLoadCVTF32Pattern,
FeatureArithmeticBccFusion,		FeatureArithmeticBccFusion,
FeatureArithmeticCbzFusion,		FeatureArithmeticCbzFusion,
FeatureCrypto,		FeatureCrypto,
▲ Show 20 Lines • Show All 322 Lines • ▼ Show 20 Lines

// watch CPUs.		// watch CPUs.
def : ProcessorModel<"apple-s4", CycloneModel, [ProcAppleA12]>;		def : ProcessorModel<"apple-s4", CycloneModel, [ProcAppleA12]>;
def : ProcessorModel<"apple-s5", CycloneModel, [ProcAppleA12]>;		def : ProcessorModel<"apple-s5", CycloneModel, [ProcAppleA12]>;

// Alias for the latest Apple processor model supported by LLVM.		// Alias for the latest Apple processor model supported by LLVM.
def : ProcessorModel<"apple-latest", CycloneModel, [ProcAppleA13]>;		def : ProcessorModel<"apple-latest", CycloneModel, [ProcAppleA13]>;

		// Fujitsu A64FX
		// FIXME: Scheduling model is not implemented yet.
		def : ProcessorModel<"a64fx", NoSchedModel, [ProcA64FX]>;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Assembly parser		// Assembly parser
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def GenericAsmParserVariant : AsmParserVariant {		def GenericAsmParserVariant : AsmParserVariant {
int Variant = 0;		int Variant = 0;
string Name = "generic";		string Name = "generic";
string BreakCharacters = ".";		string BreakCharacters = ".";
▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64Subtarget.h

	Show All 32 Lines
	class GlobalValue;			class GlobalValue;
	class StringRef;			class StringRef;
	class Triple;			class Triple;

	class AArch64Subtarget final : public AArch64GenSubtargetInfo {			class AArch64Subtarget final : public AArch64GenSubtargetInfo {
	public:			public:
	enum ARMProcFamilyEnum : uint8_t {			enum ARMProcFamilyEnum : uint8_t {
	Others,			Others,
				A64FX,
	AppleA7,			AppleA7,
	AppleA10,			AppleA10,
	AppleA11,			AppleA11,
	AppleA12,			AppleA12,
	AppleA13,			AppleA13,
	CortexA35,			CortexA35,
	CortexA53,			CortexA53,
	CortexA55,			CortexA55,
	▲ Show 20 Lines • Show All 455 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64Subtarget.cpp

Show First 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	case CortexA65:
PrefFunctionLogAlignment = 3;		PrefFunctionLogAlignment = 3;
break;		break;
case CortexA72:		case CortexA72:
case CortexA73:		case CortexA73:
case CortexA75:		case CortexA75:
case CortexA76:		case CortexA76:
PrefFunctionLogAlignment = 4;		PrefFunctionLogAlignment = 4;
break;		break;
		case A64FX:
		CacheLineSize = 256;
		PrefFunctionLogAlignment = 5;
		PrefLoopLogAlignment = 5;
		dmgreenUnsubmitted Not Done Reply Inline Actions A 32byte loop alignment sounds very high. Are you sure executing that many NOP's will be beneficial? dmgreen: A 32byte loop alignment sounds very high. Are you sure executing that many NOP's will be…
		break;
case AppleA7:		case AppleA7:
case AppleA10:		case AppleA10:
case AppleA11:		case AppleA11:
case AppleA12:		case AppleA12:
case AppleA13:		case AppleA13:
CacheLineSize = 64;		CacheLineSize = 64;
PrefetchDistance = 280;		PrefetchDistance = 280;
MinPrefetchStride = 2048;		MinPrefetchStride = 2048;
▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/cpus.ll

	Show All 19 Lines
	; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=exynos-m4 2>&1 \| FileCheck %s			; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=exynos-m4 2>&1 \| FileCheck %s
	; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=exynos-m5 2>&1 \| FileCheck %s			; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=exynos-m5 2>&1 \| FileCheck %s
	; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=falkor 2>&1 \| FileCheck %s			; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=falkor 2>&1 \| FileCheck %s
	; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=saphira 2>&1 \| FileCheck %s			; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=saphira 2>&1 \| FileCheck %s
	; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=kryo 2>&1 \| FileCheck %s			; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=kryo 2>&1 \| FileCheck %s
	; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=thunderx2t99 2>&1 \| FileCheck %s			; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=thunderx2t99 2>&1 \| FileCheck %s
	; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=tsv110 2>&1 \| FileCheck %s			; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=tsv110 2>&1 \| FileCheck %s
	; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=apple-latest 2>&1 \| FileCheck %s			; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=apple-latest 2>&1 \| FileCheck %s
				; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=a64fx 2>&1 \| FileCheck %s
	; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=invalidcpu 2>&1 \| FileCheck %s --check-prefix=INVALID			; RUN: llc < %s -mtriple=arm64-unknown-unknown -mcpu=invalidcpu 2>&1 \| FileCheck %s --check-prefix=INVALID

	; CHECK-NOT: {{.*}} is not a recognized processor for this target			; CHECK-NOT: {{.*}} is not a recognized processor for this target
	; INVALID: {{.*}} is not a recognized processor for this target			; INVALID: {{.*}} is not a recognized processor for this target

	define i32 @f(i64 %z) {			define i32 @f(i64 %z) {
	ret i32 0			ret i32 0
	}			}

llvm/test/CodeGen/AArch64/preferred-function-alignment.ll

	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=generic < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=generic < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a35 < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a35 < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a53 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a53 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a57 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a57 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a65 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a65 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a65ae < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a65ae < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a72 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a72 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a73 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a73 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a75 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a75 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a76 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cortex-a76 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s
				; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=a64fx < %s \| FileCheck --check-prefixes=ALIGN5,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cyclone < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=cyclone < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=falkor < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=falkor < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=kryo < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=kryo < %s \| FileCheck --check-prefixes=ALIGN2,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=neoverse-e1 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=neoverse-e1 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=neoverse-n1 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=neoverse-n1 < %s \| FileCheck --check-prefixes=ALIGN4,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=thunderx < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=thunderx < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=thunderxt81 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=thunderxt81 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s
	; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=thunderxt83 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s			; RUN: llc -mtriple=aarch64-unknown-linux -mcpu=thunderxt83 < %s \| FileCheck --check-prefixes=ALIGN3,CHECK %s
	Show All 20 Lines

llvm/unittests/Support/Host.cpp

Show First 20 Lines • Show All 243 Lines • ▼ Show 20 Lines	EXPECT_EQ(sys::detail::getHostCPUNameForARM(ThunderXT88ProcCpuInfo +
"CPU implementer : 0x43\n"		"CPU implementer : 0x43\n"
"CPU part : 0xa1"),		"CPU part : 0xa1"),
"thunderxt88");		"thunderxt88");

// Verify HiSilicon processors.		// Verify HiSilicon processors.
EXPECT_EQ(sys::detail::getHostCPUNameForARM("CPU implementer : 0x48\n"		EXPECT_EQ(sys::detail::getHostCPUNameForARM("CPU implementer : 0x48\n"
"CPU part : 0xd01"),		"CPU part : 0xd01"),
"tsv110");		"tsv110");

		// Verify A64FX.
		const std::string A64FXProcCpuInfo = R"(
		processor : 0
		BogoMIPS : 200.00
		Features : fp asimd evtstrm sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm fcma dcpop sve
		CPU implementer : 0x46
		CPU architecture: 8
		CPU variant : 0x1
		CPU part : 0x001
		)";

		EXPECT_EQ(sys::detail::getHostCPUNameForARM(A64FXProcCpuInfo), "a64fx");
}		}

#if defined(__APPLE__) \|\| defined(_AIX)		#if defined(__APPLE__) \|\| defined(_AIX)
static bool runAndGetCommandOutput(		static bool runAndGetCommandOutput(
const char *ExePath, ArrayRef<llvm::StringRef> argv,		const char *ExePath, ArrayRef<llvm::StringRef> argv,
std::unique_ptr<char[]> &Buffer, off_t &Size) {		std::unique_ptr<char[]> &Buffer, off_t &Size) {
bool Success = false;		bool Success = false;
[ExePath, argv, &Buffer, &Size, &Success] {		[ExePath, argv, &Buffer, &Size, &Success] {
▲ Show 20 Lines • Show All 110 Lines • Show Last 20 Lines

llvm/unittests/Support/TargetParserTest.cpp

Show First 20 Lines • Show All 958 Lines • ▼ Show 20 Lines	EXPECT_TRUE(testAArch64CPU(
"8-A"));		"8-A"));
EXPECT_TRUE(testAArch64CPU(		EXPECT_TRUE(testAArch64CPU(
"tsv110", "armv8.2-a", "crypto-neon-fp-armv8",		"tsv110", "armv8.2-a", "crypto-neon-fp-armv8",
AArch64::AEK_CRC \| AArch64::AEK_CRYPTO \| AArch64::AEK_FP \|		AArch64::AEK_CRC \| AArch64::AEK_CRYPTO \| AArch64::AEK_FP \|
AArch64::AEK_SIMD \| AArch64::AEK_RAS \| AArch64::AEK_LSE \|		AArch64::AEK_SIMD \| AArch64::AEK_RAS \| AArch64::AEK_LSE \|
AArch64::AEK_RDM \| AArch64::AEK_PROFILE \| AArch64::AEK_FP16 \|		AArch64::AEK_RDM \| AArch64::AEK_PROFILE \| AArch64::AEK_FP16 \|
AArch64::AEK_FP16FML \| AArch64::AEK_DOTPROD,		AArch64::AEK_FP16FML \| AArch64::AEK_DOTPROD,
"8.2-A"));		"8.2-A"));
		EXPECT_TRUE(testAArch64CPU(
		"a64fx", "armv8.2-a", "crypto-neon-fp-armv8",
		AArch64::AEK_CRC \| AArch64::AEK_CRYPTO \| AArch64::AEK_FP \|
		AArch64::AEK_SIMD \| AArch64::AEK_FP16 \| AArch64::AEK_RAS \|
		AArch64::AEK_LSE \| AArch64::AEK_SVE \| AArch64::AEK_RDM,
		"8.2-A"));
}		}

static constexpr unsigned NumAArch64CPUArchs = 36;		static constexpr unsigned NumAArch64CPUArchs = 37;

TEST(TargetParserTest, testAArch64CPUArchList) {		TEST(TargetParserTest, testAArch64CPUArchList) {
SmallVector<StringRef, NumAArch64CPUArchs> List;		SmallVector<StringRef, NumAArch64CPUArchs> List;
AArch64::fillValidCPUArchList(List);		AArch64::fillValidCPUArchList(List);

// No list exists for these in this test suite, so ensure all are		// No list exists for these in this test suite, so ensure all are
// valid, and match the expected 'magic' count.		// valid, and match the expected 'magic' count.
EXPECT_EQ(List.size(), NumAArch64CPUArchs);		EXPECT_EQ(List.size(), NumAArch64CPUArchs);
▲ Show 20 Lines • Show All 124 Lines • ▼ Show 20 Lines	TEST(TargetParserTest, testAArch64Extension) {
EXPECT_TRUE(testAArch64Extension("tsv110",		EXPECT_TRUE(testAArch64Extension("tsv110",
AArch64::ArchKind::INVALID, "profile"));		AArch64::ArchKind::INVALID, "profile"));
EXPECT_TRUE(testAArch64Extension("tsv110",		EXPECT_TRUE(testAArch64Extension("tsv110",
AArch64::ArchKind::INVALID, "fp16"));		AArch64::ArchKind::INVALID, "fp16"));
EXPECT_TRUE(testAArch64Extension("tsv110",		EXPECT_TRUE(testAArch64Extension("tsv110",
AArch64::ArchKind::INVALID, "fp16fml"));		AArch64::ArchKind::INVALID, "fp16fml"));
EXPECT_TRUE(testAArch64Extension("tsv110",		EXPECT_TRUE(testAArch64Extension("tsv110",
AArch64::ArchKind::INVALID, "dotprod"));		AArch64::ArchKind::INVALID, "dotprod"));
		EXPECT_TRUE(testAArch64Extension("a64fx",
		AArch64::ArchKind::INVALID, "fp16"));
		EXPECT_TRUE(testAArch64Extension("a64fx",
		AArch64::ArchKind::INVALID, "sve"));
		EXPECT_FALSE(testAArch64Extension("a64fx",
		AArch64::ArchKind::INVALID, "sve2"));

EXPECT_FALSE(testAArch64Extension(		EXPECT_FALSE(testAArch64Extension(
"generic", AArch64::ArchKind::ARMV8A, "ras"));		"generic", AArch64::ArchKind::ARMV8A, "ras"));
EXPECT_FALSE(testAArch64Extension(		EXPECT_FALSE(testAArch64Extension(
"generic", AArch64::ArchKind::ARMV8_1A, "ras"));		"generic", AArch64::ArchKind::ARMV8_1A, "ras"));
EXPECT_FALSE(testAArch64Extension(		EXPECT_FALSE(testAArch64Extension(
"generic", AArch64::ArchKind::ARMV8_2A, "profile"));		"generic", AArch64::ArchKind::ARMV8_2A, "profile"));
EXPECT_FALSE(testAArch64Extension(		EXPECT_FALSE(testAArch64Extension(
▲ Show 20 Lines • Show All 108 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] Add support for Fujitsu A64FXClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 249051

clang/test/Driver/aarch64-cpus.c

clang/test/Preprocessor/aarch64-target-features.c

llvm/include/llvm/Support/AArch64TargetParser.def

llvm/lib/Support/Host.cpp

llvm/lib/Target/AArch64/AArch64.td

llvm/lib/Target/AArch64/AArch64Subtarget.h

llvm/lib/Target/AArch64/AArch64Subtarget.cpp

llvm/test/CodeGen/AArch64/cpus.ll

llvm/test/CodeGen/AArch64/preferred-function-alignment.ll

llvm/unittests/Support/Host.cpp

llvm/unittests/Support/TargetParserTest.cpp

[AArch64] Add support for Fujitsu A64FX
ClosedPublic