This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/Basic/
-
clang/
-
Basic/
1/1
AArch64SVETypeFlags.h
-
BuiltinsAArch64.def
-
BuiltinsSVE.def
1
CMakeLists.txt
-
TargetBuiltins.h
3/5
arm_sve.td
-
lib/
-
Basic/Targets/
-
Targets/
-
AArch64.cpp
-
CodeGen/
1/3
CGBuiltin.cpp
-
CodeGenFunction.h
-
utils/TableGen/
-
TableGen/
9/12
SveEmitter.cpp
1/3
TableGen.cpp
-
TableGenBackends.h

Differential D75470

[SVE] Auto-generate builtins and header for svld1.
ClosedPublic

Authored by sdesmalen on Mar 2 2020, 9:25 AM.

Download Raw Diff

Details

Reviewers

efriedma
rovka
SjoerdMeijer
rsandifo-arm
rengolin

Commits

rG8b409eabaf75: [SVE] Auto-generate builtins and header for svld1.

Summary

This is a first patch in a series for the SveEmitter to generate the arm_sve.h
header file and builtins.

I've tried my best to strip down this patch as best as I could, but there
are still a few changes that are not necessarily exercised by the load intrinsics
in this patch, mostly around the SVEType class which has some common logic to
represent types from a type and prototype string. I thought it didn't make
much sense to remove that from this patch and split it up.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sdesmalen created this revision.Mar 2 2020, 9:25 AM

Herald added a reviewer: rengolin. · View Herald TranscriptMar 2 2020, 9:25 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: psnobl, rkruppe, mgrang and 3 others. · View Herald Transcript

sdesmalen added a parent revision: D75298: [Clang][SVE] Parse builtin type string for scalable vectors.Mar 2 2020, 9:25 AM

sdesmalen marked an inline comment as done.

sdesmalen added inline comments.

clang/include/clang/Basic/AArch64SVETypeFlags.h
2	I just see that this comment will need updating, as that line seems copied from SveEmitter.cpp.

Harbormaster failed remote builds in B47809: Diff 247673!Mar 2 2020, 10:21 AM

Big patch, I only did a first scan, but looks very reasonable in general. Just a first round of nits and 2 questions.

clang/include/clang/Basic/arm_sve.td
121	This encoding, e.g, this is "csilUcUsUiUlhfd", is such a monstrosity. It's a very efficient encoding, but of course completely unreadable. I know there is prior art, and know that this is how it's been done, but just curious if you have given it thoughts how to do this in a normal way, a bit more c++y. I don't want to de-rail this work, but if we are adding a new emitter, perhaps now is the time to give it a thought, so was just curious.
clang/utils/TableGen/SveEmitter.cpp
1	I wanted to add the nit that SveEmiiter.cpp should perhaps be SVEEmitter.cpp, but then I saw at the bottom that MVE is spelled Mve, so perhaps this is fine then.
64	why a default of 128? Will this gives problems for SVE implementions with> 128 bits?
119	nit: for readability, perhaps don't abbreviate some of these member names? R -> Record BaseTS -> BaseTypeSpec CK -> ClassKind
260	don't need this
264	just a return here

Renamed CK and BaseTS
Refactored switch statementsd in SVEType::getTypeFlags()

clang/include/clang/Basic/arm_sve.td
121	Haha, its a bit of a monstrosity indeed. The only thing I can think of here would be having something like: class TypeSpecs<list<string> val> { list<string> v = val; } def All_Int_Float_Ty : TypeSpecs<["c", "s", "i", "l", "Uc", "Us", "Ul", "h", "f", "d">; def SVLD1 : Minst<"svld1[_{2}]", "dPc", All_Int_Float_Ty, [IsLoad]>; But I suspect this gets a bit awkward because of the many permutations, I count more than 40. Not sure if that would really improve the readability.
clang/utils/TableGen/SveEmitter.cpp
64	SVE vectors are n x 128bits, so the 128 is scalable here.
119	`Record` and `ClassKind` are also the names of the enum though. Perhaps I can rename CK to `Class`?
264	Good catch!

SjoerdMeijer added inline comments.Mar 5 2020, 5:52 AM

clang/include/clang/Basic/arm_sve.td
121	I would personally welcome any improvement here, even the smallest. But if you think it's tricky, then fair enough! I've managed to completely ignore the MVE intrinsics work so far, but understood there were some innovations here and there (e.g. in tablegen). Probably because it is dealing with similar problems: a lot of intrinsics, some of them overloaded with different types. I'm going to have a little look now to see if there's anything we can borrow from that, or if that is unrelated....
clang/utils/TableGen/SveEmitter.cpp
64	ah, okay, fair enough, didn't realise that.

Adding @simon_tatham in case he feels wants to have a look too.

simon_tatham added inline comments.Mar 5 2020, 7:45 AM

clang/include/clang/Basic/arm_sve.td
121	In the MVE intrinsics implementation I completely avoided that entire system of string-based type specifications. There's another completely different way you can set up the types of builtins, and I used that instead. You can declare the function for `Builtins.def` purposes with no type specification at all, and then you fill in its type signature using a declaration in the header file, with the unusual combination of `__inline__` and no function body: static __inline__ int32x4_t __builtin_arm_foo_bar(int16x8_t, float23x7t); // or whatever In fact I went one step further: the user-facing names for the MVE intrinsics are declared in `arm_mve.h` with a special attribute indicating that they're aliases for clang builtins. And the MVE polymorphic intrinsics are done by adding `__attribute__((overloadable))` to the declaration, which allows C++-style overloading based on parameter types even when compiling in C. So when the user invokes an MVE intrinsic by its polymorphic name, the compiler first does overload resolution to decide which declaration in the header file to select; then it looks at the builtin-alias attribute and discovers which internal builtin id it corresponds to; and then it can do codegen for that builtin directly, without a wrapper function in the header. Pros of doing it this way: if the builtin requires some of its arguments to be compile-time constants, then you don't run into the problem that a wrapper function in the header fails to pass through the constantness. (In NEON this is worked around by making some wrapper functions be wrapper macros instead – but NEON doesn't have to deal with polymorphism.) declaring a builtin's type signature in the header file means that it can include definitions that the header file has created beforehand. For example, one of the arguments to the MVE `vld2q` family involves a small `struct` containing 2 or 4 vectors, and it would be tricky to get that struct type into the `Builtins.def` type specification before the header file can tell clang it exists. doing polymorphism like this, rather than making the polymorphic function be a macro expanding to something involving C11 `_Generic`, means the error messages are orders of magnitude more friendly when the user messes up a call. (Also it's remarkably fiddly to use `_Generic` in complicated cases, because of the requirement that even its untaken branches not provoke any type-checking or semantic errors.) I don't know of any way that the preprocessor + `_Generic` approach can avoid expanding its macro arguments at least twice. It can avoid evaluating twice, so that's safe in the side-effect sense, but you still have the problem that you get exponential inflation of the size of preprocessed output if calls to these macros are lexically nested too deeply. Cons: you have to do all of your codegen inside the clang binary, starting from the function operands you're given, and ending up with LLVM IR. You don't get to do the tedious parts (like unpacking structs, or dereferencing pointer arguments for passed-by-reference parameters) in the wrapper function in the header, because there isn't one. I had to invent a whole system in MveEmitter to allow the IR generation to be specified in a not-too-verbose way. if the builtins don't have type declarations until the header is included, then users can't call them without the header file. Probably this is fine for SVE intrinsics the same way it is for MVE, where the builtins are a detail of that particular compiler's implementation and users are intended to use the compiler-independent public API. But in cases where the builtin itself was intended to be called directly by the end user (in the way that `__builtin_clz` is, for example), you'd probably want it to work everywhere. if you do polymorphism using `__attribute__((overloadable))` then all the things you're overloading between have to be real functions. You can't make some of them be macros, with the extra flexibility a macro gives you. (But then, making them builtins rather than genuine functions restores some of that flexibility.) Off the top of my head I don't know whether all these ideas can be separated from each other. It feels to me as if all the choices I made are leaning on each other and making a mutually supporting whole, and it's quite possible that if you tried to cherry-pick just one of these design decisions into an otherwise more conventional approach, it might all come crashing down. But I haven't tried it :-)

In D75470#1907562, @SjoerdMeijer wrote:

Adding @simon_tatham in case he feels wants to have a look too.

Thanks Sjoerd! @simon_tatham and I had a chat about this offline today.

The SVE implementation now does more or less the same thing the MVE implementation; arm_sve.h also uses __attribute__((overloadable)) and __attribute__((arm_sve_alias("__builtin_..."))), the latter only to declare the overloaded intrinsics. That means we get the same benefits as Simon described.

There are a few details that are different:

The MVE implementation *does not* use the type string in the actual Builtins.def. For example, the type string below is not "V16ScV16ScV16Sci", but rather "":

TARGET_HEADER_BUILTIN(__builtin_arm_mve_vcaddq_rot270_f16, "", "n", "arm_mve.h", ALL_LANGUAGES, "")

This means that the implementation relies solely on the declaration in arm_mve.h to define the intrinsic prototype.
If I understood this correctly, this is currently needed to represent the tuple types (which cannot yet be expressed in the bulitin type string format) returned from structured loads like ld2/ld3/ld4.

The SVE implementation *does* use the type string in Builtins.def.

For SVE, the structured loads will just return a wider vector with 2, 3 or 4 times the number of elements, so the lack of support in the builtin type-string format is not an issue.
An advantage of that is that we can use a #define svadd_u8(...) __builtin_svadd_u8(...) for the non-overloaded builtins which helps compile-time performance.

If all the intrinsics can be described *with* a builtin type string, we can actually work to define all builtins internally in Clang (as @efriedma suggested in D75298) rather than having to rely on the header file to define the prototypes, hopefully get rid of the expensive header file.
When we do this for SVE, MVE can probably follow the same approach.

clang/include/clang/Basic/arm_sve.td
121	Thanks for sharing some background here @simon_tatham!

Herald added a subscriber: danielkiss. · View Herald TranscriptMar 6 2020, 10:14 AM

simon_tatham mentioned this in D75850: [ARM,CDE] Generalize MVE intrinsics infrastructure to support CDE.Mar 9 2020, 9:31 AM

In D75470#1910071, @sdesmalen wrote:

The SVE implementation now does more or less the same thing the MVE implementation; arm_sve.h also uses __attribute__((overloadable)) and __attribute__((arm_sve_alias("__builtin_..."))), the latter only to declare the overloaded intrinsics. That means we get the same benefits as Simon described.

FYI: we are currently working on another M-profile extension, CDE. In my patches am reusing some of the MVE intrinsic machinery, and I decided to rename __clang_arm_mve_alias to __clang_arm_builtin_alias. You might want to reuse the same attribute name to reduce duplication, see https://reviews.llvm.org/D75850

SjoerdMeijer added inline comments.Mar 9 2020, 2:32 PM

clang/lib/CodeGen/CGBuiltin.cpp
5296	I am wondering if it is confusing/correct to use NeonInstrinsicInfo here?
7473	and the same here: findNeon...

s/NeonIntrinsicInfo/ARMVectorIntrinsicInfo/
s/findNeonIntrinsicInMap/findARMVectorIntrinsicInMap/

sdesmalen marked an inline comment as done.Mar 10 2020, 4:34 AM

sdesmalen added inline comments.

clang/lib/CodeGen/CGBuiltin.cpp
5296	We can reuse the same info-struct and find function, but the names are indeed misleading. I've renamed these.

Cheers, I think this looks very reasonable.

This revision is now accepted and ready to land.Mar 10 2020, 6:20 AM

Closed by commit rG8b409eabaf75: [SVE] Auto-generate builtins and header for svld1. (authored by sdesmalen). · Explain WhyMar 16 2020, 3:54 AM

This revision was automatically updated to reflect the committed changes.

thakis added a subscriber: thakis.Mar 16 2020, 6:49 AM

thakis added inline comments.

clang/include/clang/Basic/CMakeLists.txt
44	Update comment to also say "SVE" and "CDE" (or just say "# ARM builtin headers")
clang/utils/TableGen/TableGen.cpp
196	Any reason these aren't called `-gen-arm-sve-builtin-def` and `-gen-arm-sve-builtin-codegen` for consistency with CDE and MVE?

thakis added inline comments.Mar 16 2020, 6:52 AM

clang/utils/TableGen/SveEmitter.cpp
32	Including stuff from `clang/Basic` in clang/utils/TableGen is conceptually a layering violation: clang-tblgen is used to generate headers included in clang/Basic. In this case it happens to work, but it's because you're lucky, and it could break for subtle reasons if the TypeFlags header starts including some other header in Basic that happens to include something generated. Please restructure this so that the TableGen code doesn't need an include from Basic.

sdesmalen marked 2 inline comments as done.Mar 16 2020, 7:40 AM

sdesmalen added inline comments.

clang/utils/TableGen/SveEmitter.cpp
32	Thanks for pointing out! The only directory that seems to have common includes between Clang TableGen/CodeGen is the llvm/Support directory, any objection to me moving the header there?
clang/utils/TableGen/TableGen.cpp
196	Not really, I can change that.

This patch broke the Clang build with enabled modules (which is used by the LLDB bot, but every other bot is also dead: http://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/ )

thakis added inline comments.Mar 16 2020, 9:04 AM

clang/utils/TableGen/SveEmitter.cpp
32	That seems like a strange place for this header. Maybe you can rework things so that you don't have to share a header between clang's tablegen and clang's Basic? No other tablegen output so far has needed that. (see e.g. the `/// These must be kept in sync with the flags in utils/TableGen/NeonEmitter.h.` line in TargetBuiltins.h). If that isn't possible at all, I suppose you could put the .h file in clang/utils/TableGen and also make clang-tblgen write the .h file and use the written .h file in Basic.

sdesmalen mentioned this in rGc5b81466c2bc: Reland D75470 [SVE] Auto-generate builtins and header for svld1..Mar 18 2020, 4:19 AM

thakis added inline comments.Mar 18 2020, 8:35 AM

clang/utils/TableGen/TableGen.cpp
196	Looks like you renamed the files to be consistent (thanks!), but not the flag names. Can you make those consistent too?

Revision Contents

Path

Size

clang/

include/

clang/

Basic/

AArch64SVETypeFlags.h

67 lines

13 lines

20 lines

7 lines

11 lines

107 lines

lib/

Basic/

Targets/

AArch64.cpp

4 lines

CodeGen/

CGBuiltin.cpp

106 lines

CodeGenFunction.h

1 line

utils/

TableGen/

SveEmitter.cpp

649 lines

TableGen.cpp

12 lines

TableGenBackends.h

2 lines

Diff 250509

clang/include/clang/Basic/AArch64SVETypeFlags.h

This file was added.

				//===- AArch64SVETypeFlags.h - Flags used to generate ACLE builtins- C++ -*-===//
				//
				sdesmalenAuthorUnsubmitted Done Reply Inline Actions I just see that this comment will need updating, as that line seems copied from SveEmitter.cpp. sdesmalen: I just see that this comment will need updating, as that line seems copied from SveEmitter.cpp.
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_CLANG_BASIC_AARCH64SVETYPEFLAGS_H
				#define LLVM_CLANG_BASIC_AARCH64SVETYPEFLAGS_H

				#include <stdint.h>

				namespace clang {

				/// Flags to identify the types for overloaded SVE builtins.
				class SVETypeFlags {
				uint64_t Flags;

				public:
				/// These must be kept in sync with the flags in
				/// include/clang/Basic/arm_sve.td.
				static const uint64_t MemEltTypeOffset = 4; // Bit offset of MemEltTypeMask
				static const uint64_t EltTypeMask = 0x00000000000f;
				static const uint64_t MemEltTypeMask = 0x000000000070;
				static const uint64_t IsLoad = 0x000000000080;

				enum EltType {
				Invalid,
				Int8,
				Int16,
				Int32,
				Int64,
				Float16,
				Float32,
				Float64,
				Bool8,
				Bool16,
				Bool32,
				Bool64
				};

				enum MemEltTy {
				MemEltTyDefault,
				MemEltTyInt8,
				MemEltTyInt16,
				MemEltTyInt32,
				MemEltTyInt64
				};

				SVETypeFlags(uint64_t F) : Flags(F) { }
				SVETypeFlags(EltType ET, bool IsUnsigned) : Flags(ET) { }

				EltType getEltType() const { return (EltType)(Flags & EltTypeMask); }
				MemEltTy getMemEltType() const {
				return (MemEltTy)((Flags & MemEltTypeMask) >> MemEltTypeOffset);
				}

				bool isLoad() const { return Flags & IsLoad; }

				uint64_t getBits() const { return Flags; }
				bool isFlagSet(uint64_t Flag) const { return Flags & Flag; }
				};

				} // end namespace clang

				#endif

clang/include/clang/Basic/BuiltinsAArch64.def

	Show First 20 Lines • Show All 93 Lines • ▼ Show 20 Lines
	BUILTIN(__builtin_sponentry, "v*", "c")			BUILTIN(__builtin_sponentry, "v*", "c")

	// Transactional Memory Extension			// Transactional Memory Extension
	BUILTIN(__builtin_arm_tstart, "WUi", "nj")			BUILTIN(__builtin_arm_tstart, "WUi", "nj")
	BUILTIN(__builtin_arm_tcommit, "v", "n")			BUILTIN(__builtin_arm_tcommit, "v", "n")
	BUILTIN(__builtin_arm_tcancel, "vWUIi", "n")			BUILTIN(__builtin_arm_tcancel, "vWUIi", "n")
	BUILTIN(__builtin_arm_ttest, "WUi", "nc")			BUILTIN(__builtin_arm_ttest, "WUi", "nc")

	// SVE
	BUILTIN(__builtin_sve_svld1_s16, "q8sq16bSsC*", "n")
	BUILTIN(__builtin_sve_svld1_s32, "q4iq16bSiC*", "n")
	BUILTIN(__builtin_sve_svld1_s64, "q2Wiq16bSWiC*", "n")
	BUILTIN(__builtin_sve_svld1_s8, "q16Scq16bScC*", "n")
	BUILTIN(__builtin_sve_svld1_u16, "q8Usq16bUsC*", "n")
	BUILTIN(__builtin_sve_svld1_u32, "q4Uiq16bUiC*", "n")
	BUILTIN(__builtin_sve_svld1_u64, "q2UWiq16bUWiC*", "n")
	BUILTIN(__builtin_sve_svld1_u8, "q16Ucq16bUcC*", "n")
	BUILTIN(__builtin_sve_svld1_f64, "q2dq16bdC*", "n")
	BUILTIN(__builtin_sve_svld1_f32, "q4fq16bfC*", "n")
	BUILTIN(__builtin_sve_svld1_f16, "q8hq16bhC*", "n")

	TARGET_HEADER_BUILTIN(_BitScanForward, "UcUNi*UNi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")			TARGET_HEADER_BUILTIN(_BitScanForward, "UcUNi*UNi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")
	TARGET_HEADER_BUILTIN(_BitScanReverse, "UcUNi*UNi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")			TARGET_HEADER_BUILTIN(_BitScanReverse, "UcUNi*UNi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")
	TARGET_HEADER_BUILTIN(_BitScanForward64, "UcUNi*ULLi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")			TARGET_HEADER_BUILTIN(_BitScanForward64, "UcUNi*ULLi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")
	TARGET_HEADER_BUILTIN(_BitScanReverse64, "UcUNi*ULLi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")			TARGET_HEADER_BUILTIN(_BitScanReverse64, "UcUNi*ULLi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")

	TARGET_HEADER_BUILTIN(_InterlockedAdd, "NiNiD*Ni", "nh", "intrin.h", ALL_MS_LANGUAGES, "")			TARGET_HEADER_BUILTIN(_InterlockedAdd, "NiNiD*Ni", "nh", "intrin.h", ALL_MS_LANGUAGES, "")
	TARGET_HEADER_BUILTIN(_InterlockedAnd64, "LLiLLiD*LLi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")			TARGET_HEADER_BUILTIN(_InterlockedAnd64, "LLiLLiD*LLi", "nh", "intrin.h", ALL_MS_LANGUAGES, "")
	TARGET_HEADER_BUILTIN(_InterlockedDecrement64, "LLiLLiD*", "nh", "intrin.h", ALL_MS_LANGUAGES, "")			TARGET_HEADER_BUILTIN(_InterlockedDecrement64, "LLiLLiD*", "nh", "intrin.h", ALL_MS_LANGUAGES, "")
	▲ Show 20 Lines • Show All 114 Lines • Show Last 20 Lines

clang/include/clang/Basic/BuiltinsSVE.def

This file was added.

				//===--- BuiltinsSVE.def - SVE Builtin function database --------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file defines the SVE-specific builtin function database. Users of
				// this file must define the BUILTIN macro to make use of this information.
				//
				//===----------------------------------------------------------------------===//

				// The format of this database matches clang/Basic/Builtins.def.

				#define GET_SVE_BUILTINS
				#include "clang/Basic/arm_sve_builtins.inc"
				#undef GET_SVE_BUILTINS

				#undef BUILTIN

clang/include/clang/Basic/CMakeLists.txt

Show All 35 Lines	clang_tablegen(AttrSubMatchRulesList.inc -gen-clang-attr-subject-match-rule-list
TARGET ClangAttrSubjectMatchRuleList)		TARGET ClangAttrSubjectMatchRuleList)

clang_tablegen(AttrHasAttributeImpl.inc -gen-clang-attr-has-attribute-impl		clang_tablegen(AttrHasAttributeImpl.inc -gen-clang-attr-has-attribute-impl
-I ${CMAKE_CURRENT_SOURCE_DIR}/../../		-I ${CMAKE_CURRENT_SOURCE_DIR}/../../
SOURCE Attr.td		SOURCE Attr.td
TARGET ClangAttrHasAttributeImpl		TARGET ClangAttrHasAttributeImpl
)		)

# ARM NEON and MVE		# ARM NEON and MVE
thakisUnsubmitted Not Done Reply Inline Actions Update comment to also say "SVE" and "CDE" (or just say "# ARM builtin headers") thakis: Update comment to also say "SVE" and "CDE" (or just say "# ARM builtin headers")
clang_tablegen(arm_neon.inc -gen-arm-neon-sema		clang_tablegen(arm_neon.inc -gen-arm-neon-sema
SOURCE arm_neon.td		SOURCE arm_neon.td
TARGET ClangARMNeon)		TARGET ClangARMNeon)
clang_tablegen(arm_fp16.inc -gen-arm-neon-sema		clang_tablegen(arm_fp16.inc -gen-arm-neon-sema
SOURCE arm_fp16.td		SOURCE arm_fp16.td
TARGET ClangARMFP16)		TARGET ClangARMFP16)
clang_tablegen(arm_mve_builtins.inc -gen-arm-mve-builtin-def		clang_tablegen(arm_mve_builtins.inc -gen-arm-mve-builtin-def
SOURCE arm_mve.td		SOURCE arm_mve.td
TARGET ClangARMMveBuiltinsDef)		TARGET ClangARMMveBuiltinsDef)
clang_tablegen(arm_mve_builtin_cg.inc -gen-arm-mve-builtin-codegen		clang_tablegen(arm_mve_builtin_cg.inc -gen-arm-mve-builtin-codegen
SOURCE arm_mve.td		SOURCE arm_mve.td
TARGET ClangARMMveBuiltinCG)		TARGET ClangARMMveBuiltinCG)
clang_tablegen(arm_mve_builtin_sema.inc -gen-arm-mve-builtin-sema		clang_tablegen(arm_mve_builtin_sema.inc -gen-arm-mve-builtin-sema
SOURCE arm_mve.td		SOURCE arm_mve.td
TARGET ClangARMMveBuiltinSema)		TARGET ClangARMMveBuiltinSema)
clang_tablegen(arm_mve_builtin_aliases.inc -gen-arm-mve-builtin-aliases		clang_tablegen(arm_mve_builtin_aliases.inc -gen-arm-mve-builtin-aliases
SOURCE arm_mve.td		SOURCE arm_mve.td
TARGET ClangARMMveBuiltinAliases)		TARGET ClangARMMveBuiltinAliases)
		clang_tablegen(arm_sve_builtins.inc -gen-arm-sve-builtins
		SOURCE arm_sve.td
		TARGET ClangARMSveBuiltins)
		clang_tablegen(arm_sve_codegenmap.inc -gen-arm-sve-codegenmap
		SOURCE arm_sve.td
		TARGET ClangARMSveCodeGenMap)
clang_tablegen(arm_cde_builtins.inc -gen-arm-cde-builtin-def		clang_tablegen(arm_cde_builtins.inc -gen-arm-cde-builtin-def
SOURCE arm_cde.td		SOURCE arm_cde.td
TARGET ClangARMCdeBuiltinsDef)		TARGET ClangARMCdeBuiltinsDef)
clang_tablegen(arm_cde_builtin_cg.inc -gen-arm-cde-builtin-codegen		clang_tablegen(arm_cde_builtin_cg.inc -gen-arm-cde-builtin-codegen
SOURCE arm_cde.td		SOURCE arm_cde.td
TARGET ClangARMCdeBuiltinCG)		TARGET ClangARMCdeBuiltinCG)
clang_tablegen(arm_cde_builtin_sema.inc -gen-arm-cde-builtin-sema		clang_tablegen(arm_cde_builtin_sema.inc -gen-arm-cde-builtin-sema
SOURCE arm_cde.td		SOURCE arm_cde.td
TARGET ClangARMCdeBuiltinSema)		TARGET ClangARMCdeBuiltinSema)
clang_tablegen(arm_cde_builtin_aliases.inc -gen-arm-cde-builtin-aliases		clang_tablegen(arm_cde_builtin_aliases.inc -gen-arm-cde-builtin-aliases
SOURCE arm_cde.td		SOURCE arm_cde.td
TARGET ClangARMCdeBuiltinAliases)		TARGET ClangARMCdeBuiltinAliases)

clang/include/clang/Basic/TargetBuiltins.h

Show All 35 Lines	enum {
LastTIBuiltin = clang::Builtin::FirstTSBuiltin-1,		LastTIBuiltin = clang::Builtin::FirstTSBuiltin-1,
LastNEONBuiltin = NEON::FirstTSBuiltin - 1,		LastNEONBuiltin = NEON::FirstTSBuiltin - 1,
#define BUILTIN(ID, TYPE, ATTRS) BI##ID,		#define BUILTIN(ID, TYPE, ATTRS) BI##ID,
#include "clang/Basic/BuiltinsARM.def"		#include "clang/Basic/BuiltinsARM.def"
LastTSBuiltin		LastTSBuiltin
};		};
}		}

		namespace SVE {
		enum {
		LastNEONBuiltin = NEON::FirstTSBuiltin - 1,
		#define BUILTIN(ID, TYPE, ATTRS) BI##ID,
		#include "clang/Basic/BuiltinsSVE.def"
		FirstTSBuiltin,
		};
		}

/// AArch64 builtins		/// AArch64 builtins
namespace AArch64 {		namespace AArch64 {
enum {		enum {
LastTIBuiltin = clang::Builtin::FirstTSBuiltin - 1,		LastTIBuiltin = clang::Builtin::FirstTSBuiltin - 1,
LastNEONBuiltin = NEON::FirstTSBuiltin - 1,		LastNEONBuiltin = NEON::FirstTSBuiltin - 1,
		FirstSVEBuiltin = NEON::FirstTSBuiltin,
		LastSVEBuiltin = SVE::FirstTSBuiltin - 1,
#define BUILTIN(ID, TYPE, ATTRS) BI##ID,		#define BUILTIN(ID, TYPE, ATTRS) BI##ID,
#include "clang/Basic/BuiltinsAArch64.def"		#include "clang/Basic/BuiltinsAArch64.def"
LastTSBuiltin		LastTSBuiltin
};		};
}		}

/// BPF builtins		/// BPF builtins
namespace BPF {		namespace BPF {
▲ Show 20 Lines • Show All 158 Lines • Show Last 20 Lines

clang/include/clang/Basic/arm_sve.td

	//===--- arm_sve.td - ARM SVE compiler interface ------------------------===//			//===--- arm_sve.td - ARM SVE compiler interface ------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file defines the TableGen definitions from which the ARM SVE header			// This file defines the TableGen definitions from which the ARM SVE header
	// file will be generated. See:			// file will be generated. See:
	//			//
	// https://developer.arm.com/architectures/system-architectures/software-standards/acle			// https://developer.arm.com/architectures/system-architectures/software-standards/acle
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

				//===----------------------------------------------------------------------===//
				// Instruction definitions
				//===----------------------------------------------------------------------===//
				// Every intrinsic subclasses "Inst". An intrinsic has a name, a prototype and
				// a sequence of typespecs.
				//
				// The name is the base name of the intrinsic, for example "svld1". This is
				// then mangled by the tblgen backend to add type information ("svld1_s16").
				//
				// A typespec is a sequence of uppercase characters (modifiers) followed by one
				// lowercase character. A typespec encodes a particular "base type" of the
				// intrinsic.
				//
				// An example typespec is "Us" - unsigned short - svuint16_t. The available
				// typespec codes are given below.
				//
				// The string given to an Inst class is a sequence of typespecs. The intrinsic
				// is instantiated for every typespec in the sequence. For example "sdUsUd".
				//
				// The prototype is a string that defines the return type of the intrinsic
				// and the type of each argument. The return type and every argument gets a
				// "modifier" that can change in some way the "base type" of the intrinsic.
				//
				// The modifier 'd' means "default" and does not modify the base type in any
				// way. The available modifiers are given below.
				//
				// Typespecs
				// ---------
				// c: char
				// s: short
				// i: int
				// l: long
				// f: float
				// h: half-float
				// d: double

				// Typespec modifiers
				// ------------------
				// P: boolean
				// U: unsigned

				// Prototype modifiers
				// -------------------
				// prototype: return (arg, arg, ...)
				//
				// d: default
				// c: const pointer type
				// P: predicate type

				class MergeType<int val> {
				int Value = val;
				}
				def MergeNone : MergeType<0>;
				def MergeAny : MergeType<1>;
				def MergeOp1 : MergeType<2>;
				def MergeZero : MergeType<3>;
				def MergeAnyExp : MergeType<4>; // Use merged builtin with explicit
				def MergeZeroExp : MergeType<5>; // generation of its inactive argument.

				class MemEltTy<int val> {
				int Value = val;
				}
				def MemEltTyDefault : MemEltTy<0>;
				def MemEltTyInt8 : MemEltTy<1>;
				def MemEltTyInt16 : MemEltTy<2>;
				def MemEltTyInt32 : MemEltTy<3>;
				def MemEltTyInt64 : MemEltTy<4>;

				class FlagType<int val> {
				int Value = val;
				}

				// These must be kept in sync with the flags in utils/TableGen/SveEmitter.h
				// and include/clang/Basic/TargetBuiltins.h
				def NoFlags : FlagType<0x00000000>;
				// 0x00000001 => EltType
				// ...
				// 0x0000000f => EltType
				// 0x00000010 => MemEltType
				// ...
				// 0x00000070 => MemEltType
				def IsLoad : FlagType<0x00000080>;

				// Every intrinsic subclasses Inst.
				class Inst<string n, string p, string t, MergeType mt, string i,
				list<FlagType> ft, MemEltTy met> {
				string Name = n;
				string Prototype = p;
				string Types = t;
				string ArchGuard = "";
				int Merge = mt.Value;
				string LLVMIntrinsic = i;
				list<FlagType> Flags = ft;
				int MemEltType = met.Value;
				}

				// MInst: Instructions which access memory
				class MInst<string n, string p, string t, list<FlagType> f,
				MemEltTy met=MemEltTyDefault, string i="">
				: Inst<n, p, t, MergeNone, i, f, met> {}

				////////////////////////////////////////////////////////////////////////////////
				// Loads

				// Load one vector (scalar base)
				def SVLD1 : MInst<"svld1[_{2}]", "dPc", "csilUcUsUiUlhfd", [IsLoad]>;
				SjoerdMeijerUnsubmitted Done Reply Inline Actions This encoding, e.g, this is "csilUcUsUiUlhfd", is such a monstrosity. It's a very efficient encoding, but of course completely unreadable. I know there is prior art, and know that this is how it's been done, but just curious if you have given it thoughts how to do this in a normal way, a bit more c++y. I don't want to de-rail this work, but if we are adding a new emitter, perhaps now is the time to give it a thought, so was just curious. SjoerdMeijer: This encoding, e.g, this is "csilUcUsUiUlhfd", is such a monstrosity. It's a very efficient…
				sdesmalenAuthorUnsubmitted Done Reply Inline Actions Haha, its a bit of a monstrosity indeed. The only thing I can think of here would be having something like: class TypeSpecs<list<string> val> { list<string> v = val; } def All_Int_Float_Ty : TypeSpecs<["c", "s", "i", "l", "Uc", "Us", "Ul", "h", "f", "d">; def SVLD1 : Minst<"svld1[_{2}]", "dPc", All_Int_Float_Ty, [IsLoad]>; But I suspect this gets a bit awkward because of the many permutations, I count more than 40. Not sure if that would really improve the readability. sdesmalen: Haha, its a bit of a monstrosity indeed. The only thing I can think of here would be having…
				SjoerdMeijerUnsubmitted Not Done Reply Inline Actions I would personally welcome any improvement here, even the smallest. But if you think it's tricky, then fair enough! I've managed to completely ignore the MVE intrinsics work so far, but understood there were some innovations here and there (e.g. in tablegen). Probably because it is dealing with similar problems: a lot of intrinsics, some of them overloaded with different types. I'm going to have a little look now to see if there's anything we can borrow from that, or if that is unrelated.... SjoerdMeijer: I would personally welcome any improvement here, even the smallest. But if you think it's…
				simon_tathamUnsubmitted Not Done Reply Inline Actions In the MVE intrinsics implementation I completely avoided that entire system of string-based type specifications. There's another completely different way you can set up the types of builtins, and I used that instead. You can declare the function for `Builtins.def` purposes with no type specification at all, and then you fill in its type signature using a declaration in the header file, with the unusual combination of `__inline__` and no function body: static __inline__ int32x4_t __builtin_arm_foo_bar(int16x8_t, float23x7t); // or whatever In fact I went one step further: the user-facing names for the MVE intrinsics are declared in `arm_mve.h` with a special attribute indicating that they're aliases for clang builtins. And the MVE polymorphic intrinsics are done by adding `__attribute__((overloadable))` to the declaration, which allows C++-style overloading based on parameter types even when compiling in C. So when the user invokes an MVE intrinsic by its polymorphic name, the compiler first does overload resolution to decide which declaration in the header file to select; then it looks at the builtin-alias attribute and discovers which internal builtin id it corresponds to; and then it can do codegen for that builtin directly, without a wrapper function in the header. Pros of doing it this way: if the builtin requires some of its arguments to be compile-time constants, then you don't run into the problem that a wrapper function in the header fails to pass through the constantness. (In NEON this is worked around by making some wrapper functions be wrapper macros instead – but NEON doesn't have to deal with polymorphism.) declaring a builtin's type signature in the header file means that it can include definitions that the header file has created beforehand. For example, one of the arguments to the MVE `vld2q` family involves a small `struct` containing 2 or 4 vectors, and it would be tricky to get that struct type into the `Builtins.def` type specification before the header file can tell clang it exists. doing polymorphism like this, rather than making the polymorphic function be a macro expanding to something involving C11 `_Generic`, means the error messages are orders of magnitude more friendly when the user messes up a call. (Also it's remarkably fiddly to use `_Generic` in complicated cases, because of the requirement that even its untaken branches not provoke any type-checking or semantic errors.) I don't know of any way that the preprocessor + `_Generic` approach can avoid expanding its macro arguments at least twice. It can avoid evaluating twice, so that's safe in the side-effect sense, but you still have the problem that you get exponential inflation of the size of preprocessed output if calls to these macros are lexically nested too deeply. Cons: you have to do all of your codegen inside the clang binary, starting from the function operands you're given, and ending up with LLVM IR. You don't get to do the tedious parts (like unpacking structs, or dereferencing pointer arguments for passed-by-reference parameters) in the wrapper function in the header, because there isn't one. I had to invent a whole system in MveEmitter to allow the IR generation to be specified in a not-too-verbose way. if the builtins don't have type declarations until the header is included, then users can't call them without the header file. Probably this is fine for SVE intrinsics the same way it is for MVE, where the builtins are a detail of that particular compiler's implementation and users are intended to use the compiler-independent public API. But in cases where the builtin itself was intended to be called directly by the end user (in the way that `__builtin_clz` is, for example), you'd probably want it to work everywhere. if you do polymorphism using `__attribute__((overloadable))` then all the things you're overloading between have to be real functions. You can't make some of them be macros, with the extra flexibility a macro gives you. (But then, making them builtins rather than genuine functions restores some of that flexibility.) Off the top of my head I don't know whether all these ideas can be separated from each other. It feels to me as if all the choices I made are leaning on each other and making a mutually supporting whole, and it's quite possible that if you tried to cherry-pick just one of these design decisions into an otherwise more conventional approach, it might all come crashing down. But I haven't tried it :-) simon_tatham: In the MVE intrinsics implementation I completely avoided that entire system of string-based…
				sdesmalenAuthorUnsubmitted Done Reply Inline Actions Thanks for sharing some background here @simon_tatham! sdesmalen: Thanks for sharing some background here @simon_tatham!

clang/lib/Basic/Targets/AArch64.cpp

	Show All 22 Lines

	const Builtin::Info AArch64TargetInfo::BuiltinInfo[] = {			const Builtin::Info AArch64TargetInfo::BuiltinInfo[] = {
	#define BUILTIN(ID, TYPE, ATTRS) \			#define BUILTIN(ID, TYPE, ATTRS) \
	{#ID, TYPE, ATTRS, nullptr, ALL_LANGUAGES, nullptr},			{#ID, TYPE, ATTRS, nullptr, ALL_LANGUAGES, nullptr},
	#include "clang/Basic/BuiltinsNEON.def"			#include "clang/Basic/BuiltinsNEON.def"

	#define BUILTIN(ID, TYPE, ATTRS) \			#define BUILTIN(ID, TYPE, ATTRS) \
	{#ID, TYPE, ATTRS, nullptr, ALL_LANGUAGES, nullptr},			{#ID, TYPE, ATTRS, nullptr, ALL_LANGUAGES, nullptr},
				#include "clang/Basic/BuiltinsSVE.def"

				#define BUILTIN(ID, TYPE, ATTRS) \
				{#ID, TYPE, ATTRS, nullptr, ALL_LANGUAGES, nullptr},
	#define LANGBUILTIN(ID, TYPE, ATTRS, LANG) \			#define LANGBUILTIN(ID, TYPE, ATTRS, LANG) \
	{#ID, TYPE, ATTRS, nullptr, LANG, nullptr},			{#ID, TYPE, ATTRS, nullptr, LANG, nullptr},
	#define TARGET_HEADER_BUILTIN(ID, TYPE, ATTRS, HEADER, LANGS, FEATURE) \			#define TARGET_HEADER_BUILTIN(ID, TYPE, ATTRS, HEADER, LANGS, FEATURE) \
	{#ID, TYPE, ATTRS, HEADER, LANGS, FEATURE},			{#ID, TYPE, ATTRS, HEADER, LANGS, FEATURE},
	#include "clang/Basic/BuiltinsAArch64.def"			#include "clang/Basic/BuiltinsAArch64.def"
	};			};

	AArch64TargetInfo::AArch64TargetInfo(const llvm::Triple &Triple,			AArch64TargetInfo::AArch64TargetInfo(const llvm::Triple &Triple,
	▲ Show 20 Lines • Show All 693 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGBuiltin.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show All 17 Lines
#include "CodeGenModule.h"		#include "CodeGenModule.h"
#include "ConstantEmitter.h"		#include "ConstantEmitter.h"
#include "PatternInit.h"		#include "PatternInit.h"
#include "TargetInfo.h"		#include "TargetInfo.h"
#include "clang/AST/ASTContext.h"		#include "clang/AST/ASTContext.h"
#include "clang/AST/Attr.h"		#include "clang/AST/Attr.h"
#include "clang/AST/Decl.h"		#include "clang/AST/Decl.h"
#include "clang/AST/OSLog.h"		#include "clang/AST/OSLog.h"
		#include "clang/Basic/AArch64SVETypeFlags.h"
#include "clang/Basic/TargetBuiltins.h"		#include "clang/Basic/TargetBuiltins.h"
#include "clang/Basic/TargetInfo.h"		#include "clang/Basic/TargetInfo.h"
#include "clang/CodeGen/CGFunctionInfo.h"		#include "clang/CodeGen/CGFunctionInfo.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/InlineAsm.h"		#include "llvm/IR/InlineAsm.h"
#include "llvm/IR/Intrinsics.h"		#include "llvm/IR/Intrinsics.h"
▲ Show 20 Lines • Show All 4,537 Lines • ▼ Show 20 Lines	enum {
VectorRet = AddRetType \| VectorizeRetType,		VectorRet = AddRetType \| VectorizeRetType,
VectorRetGetArgs01 =		VectorRetGetArgs01 =
AddRetType \| Add2ArgTypes \| VectorizeRetType \| VectorizeArgTypes,		AddRetType \| Add2ArgTypes \| VectorizeRetType \| VectorizeArgTypes,
FpCmpzModifiers =		FpCmpzModifiers =
AddRetType \| VectorizeRetType \| Add1ArgType \| InventFloatType		AddRetType \| VectorizeRetType \| Add1ArgType \| InventFloatType
};		};

namespace {		namespace {
struct NeonIntrinsicInfo {		struct ARMVectorIntrinsicInfo {
const char *NameHint;		const char *NameHint;
unsigned BuiltinID;		unsigned BuiltinID;
unsigned LLVMIntrinsic;		unsigned LLVMIntrinsic;
unsigned AltLLVMIntrinsic;		unsigned AltLLVMIntrinsic;
unsigned TypeModifier;		unsigned TypeModifier;

bool operator<(unsigned RHSBuiltinID) const {		bool operator<(unsigned RHSBuiltinID) const {
return BuiltinID < RHSBuiltinID;		return BuiltinID < RHSBuiltinID;
}		}
bool operator<(const NeonIntrinsicInfo &TE) const {		bool operator<(const ARMVectorIntrinsicInfo &TE) const {
return BuiltinID < TE.BuiltinID;		return BuiltinID < TE.BuiltinID;
}		}
};		};
} // end anonymous namespace		} // end anonymous namespace

#define NEONMAP0(NameBase) \		#define NEONMAP0(NameBase) \
{ #NameBase, NEON::BI__builtin_neon_ ## NameBase, 0, 0, 0 }		{ #NameBase, NEON::BI__builtin_neon_ ## NameBase, 0, 0, 0 }

#define NEONMAP1(NameBase, LLVMIntrinsic, TypeModifier) \		#define NEONMAP1(NameBase, LLVMIntrinsic, TypeModifier) \
{ #NameBase, NEON:: BI__builtin_neon_ ## NameBase, \		{ #NameBase, NEON:: BI__builtin_neon_ ## NameBase, \
Intrinsic::LLVMIntrinsic, 0, TypeModifier }		Intrinsic::LLVMIntrinsic, 0, TypeModifier }

#define NEONMAP2(NameBase, LLVMIntrinsic, AltLLVMIntrinsic, TypeModifier) \		#define NEONMAP2(NameBase, LLVMIntrinsic, AltLLVMIntrinsic, TypeModifier) \
{ #NameBase, NEON:: BI__builtin_neon_ ## NameBase, \		{ #NameBase, NEON:: BI__builtin_neon_ ## NameBase, \
Intrinsic::LLVMIntrinsic, Intrinsic::AltLLVMIntrinsic, \		Intrinsic::LLVMIntrinsic, Intrinsic::AltLLVMIntrinsic, \
TypeModifier }		TypeModifier }

static const NeonIntrinsicInfo ARMSIMDIntrinsicMap [] = {		static const ARMVectorIntrinsicInfo ARMSIMDIntrinsicMap [] = {
NEONMAP2(vabd_v, arm_neon_vabdu, arm_neon_vabds, Add1ArgType \| UnsignedAlts),		NEONMAP2(vabd_v, arm_neon_vabdu, arm_neon_vabds, Add1ArgType \| UnsignedAlts),
NEONMAP2(vabdq_v, arm_neon_vabdu, arm_neon_vabds, Add1ArgType \| UnsignedAlts),		NEONMAP2(vabdq_v, arm_neon_vabdu, arm_neon_vabds, Add1ArgType \| UnsignedAlts),
NEONMAP1(vabs_v, arm_neon_vabs, 0),		NEONMAP1(vabs_v, arm_neon_vabs, 0),
NEONMAP1(vabsq_v, arm_neon_vabs, 0),		NEONMAP1(vabsq_v, arm_neon_vabs, 0),
NEONMAP0(vaddhn_v),		NEONMAP0(vaddhn_v),
NEONMAP1(vaesdq_v, arm_neon_aesd, 0),		NEONMAP1(vaesdq_v, arm_neon_aesd, 0),
NEONMAP1(vaeseq_v, arm_neon_aese, 0),		NEONMAP1(vaeseq_v, arm_neon_aese, 0),
NEONMAP1(vaesimcq_v, arm_neon_aesimc, 0),		NEONMAP1(vaesimcq_v, arm_neon_aesimc, 0),
▲ Show 20 Lines • Show All 264 Lines • ▼ Show 20 Lines	static const ARMVectorIntrinsicInfo ARMSIMDIntrinsicMap [] = {
NEONMAP0(vtst_v),		NEONMAP0(vtst_v),
NEONMAP0(vtstq_v),		NEONMAP0(vtstq_v),
NEONMAP0(vuzp_v),		NEONMAP0(vuzp_v),
NEONMAP0(vuzpq_v),		NEONMAP0(vuzpq_v),
NEONMAP0(vzip_v),		NEONMAP0(vzip_v),
NEONMAP0(vzipq_v)		NEONMAP0(vzipq_v)
};		};

static const NeonIntrinsicInfo AArch64SIMDIntrinsicMap[] = {		static const ARMVectorIntrinsicInfo AArch64SIMDIntrinsicMap[] = {
NEONMAP1(vabs_v, aarch64_neon_abs, 0),		NEONMAP1(vabs_v, aarch64_neon_abs, 0),
NEONMAP1(vabsq_v, aarch64_neon_abs, 0),		NEONMAP1(vabsq_v, aarch64_neon_abs, 0),
NEONMAP0(vaddhn_v),		NEONMAP0(vaddhn_v),
NEONMAP1(vaesdq_v, aarch64_crypto_aesd, 0),		NEONMAP1(vaesdq_v, aarch64_crypto_aesd, 0),
NEONMAP1(vaeseq_v, aarch64_crypto_aese, 0),		NEONMAP1(vaeseq_v, aarch64_crypto_aese, 0),
NEONMAP1(vaesimcq_v, aarch64_crypto_aesimc, 0),		NEONMAP1(vaesimcq_v, aarch64_crypto_aesimc, 0),
NEONMAP1(vaesmcq_v, aarch64_crypto_aesmc, 0),		NEONMAP1(vaesmcq_v, aarch64_crypto_aesmc, 0),
NEONMAP1(vcadd_rot270_v, aarch64_neon_vcadd_rot270, Add1ArgType),		NEONMAP1(vcadd_rot270_v, aarch64_neon_vcadd_rot270, Add1ArgType),
▲ Show 20 Lines • Show All 152 Lines • ▼ Show 20 Lines	static const ARMVectorIntrinsicInfo AArch64SIMDIntrinsicMap[] = {
NEONMAP1(vst1q_x2_v, aarch64_neon_st1x2, 0),		NEONMAP1(vst1q_x2_v, aarch64_neon_st1x2, 0),
NEONMAP1(vst1q_x3_v, aarch64_neon_st1x3, 0),		NEONMAP1(vst1q_x3_v, aarch64_neon_st1x3, 0),
NEONMAP1(vst1q_x4_v, aarch64_neon_st1x4, 0),		NEONMAP1(vst1q_x4_v, aarch64_neon_st1x4, 0),
NEONMAP0(vsubhn_v),		NEONMAP0(vsubhn_v),
NEONMAP0(vtst_v),		NEONMAP0(vtst_v),
NEONMAP0(vtstq_v),		NEONMAP0(vtstq_v),
};		};

static const NeonIntrinsicInfo AArch64SISDIntrinsicMap[] = {		static const ARMVectorIntrinsicInfo AArch64SISDIntrinsicMap[] = {
NEONMAP1(vabdd_f64, aarch64_sisd_fabd, Add1ArgType),		NEONMAP1(vabdd_f64, aarch64_sisd_fabd, Add1ArgType),
NEONMAP1(vabds_f32, aarch64_sisd_fabd, Add1ArgType),		NEONMAP1(vabds_f32, aarch64_sisd_fabd, Add1ArgType),
NEONMAP1(vabsd_s64, aarch64_neon_abs, Add1ArgType),		NEONMAP1(vabsd_s64, aarch64_neon_abs, Add1ArgType),
NEONMAP1(vaddlv_s32, aarch64_neon_saddlv, AddRetType \| Add1ArgType),		NEONMAP1(vaddlv_s32, aarch64_neon_saddlv, AddRetType \| Add1ArgType),
NEONMAP1(vaddlv_u32, aarch64_neon_uaddlv, AddRetType \| Add1ArgType),		NEONMAP1(vaddlv_u32, aarch64_neon_uaddlv, AddRetType \| Add1ArgType),
NEONMAP1(vaddlvq_s32, aarch64_neon_saddlv, AddRetType \| Add1ArgType),		NEONMAP1(vaddlvq_s32, aarch64_neon_saddlv, AddRetType \| Add1ArgType),
NEONMAP1(vaddlvq_u32, aarch64_neon_uaddlv, AddRetType \| Add1ArgType),		NEONMAP1(vaddlvq_u32, aarch64_neon_uaddlv, AddRetType \| Add1ArgType),
NEONMAP1(vaddv_f32, aarch64_neon_faddv, AddRetType \| Add1ArgType),		NEONMAP1(vaddv_f32, aarch64_neon_faddv, AddRetType \| Add1ArgType),
▲ Show 20 Lines • Show All 213 Lines • ▼ Show 20 Lines	static const ARMVectorIntrinsicInfo AArch64SISDIntrinsicMap[] = {
NEONMAP1(vrsqrteh_f16, aarch64_neon_frsqrte, Add1ArgType),		NEONMAP1(vrsqrteh_f16, aarch64_neon_frsqrte, Add1ArgType),
NEONMAP1(vrsqrtsh_f16, aarch64_neon_frsqrts, Add1ArgType),		NEONMAP1(vrsqrtsh_f16, aarch64_neon_frsqrts, Add1ArgType),
};		};

#undef NEONMAP0		#undef NEONMAP0
#undef NEONMAP1		#undef NEONMAP1
#undef NEONMAP2		#undef NEONMAP2

		#define SVEMAP1(NameBase, LLVMIntrinsic, TypeModifier) \
		{ \
		#NameBase, SVE::BI__builtin_sve_##NameBase, Intrinsic::LLVMIntrinsic, 0, \
		TypeModifier \
		}

		#define SVEMAP2(NameBase, TypeModifier) \
		{ #NameBase, SVE::BI__builtin_sve_##NameBase, 0, 0, TypeModifier }
		static const ARMVectorIntrinsicInfo AArch64SVEIntrinsicMap[] = {
		SjoerdMeijerUnsubmitted Not Done Reply Inline Actions I am wondering if it is confusing/correct to use NeonInstrinsicInfo here? SjoerdMeijer: I am wondering if it is confusing/correct to use NeonInstrinsicInfo here?
		sdesmalenAuthorUnsubmitted Done Reply Inline Actions We can reuse the same info-struct and find function, but the names are indeed misleading. I've renamed these. sdesmalen: We can reuse the same info-struct and find function, but the names are indeed misleading. I've…
		#define GET_SVE_LLVM_INTRINSIC_MAP
		#include "clang/Basic/arm_sve_codegenmap.inc"
		#undef GET_SVE_LLVM_INTRINSIC_MAP
		};

		#undef SVEMAP1
		#undef SVEMAP2

static bool NEONSIMDIntrinsicsProvenSorted = false;		static bool NEONSIMDIntrinsicsProvenSorted = false;

static bool AArch64SIMDIntrinsicsProvenSorted = false;		static bool AArch64SIMDIntrinsicsProvenSorted = false;
static bool AArch64SISDIntrinsicsProvenSorted = false;		static bool AArch64SISDIntrinsicsProvenSorted = false;
		static bool AArch64SVEIntrinsicsProvenSorted = false;

		static const ARMVectorIntrinsicInfo *
static const NeonIntrinsicInfo *		findARMVectorIntrinsicInMap(ArrayRef<ARMVectorIntrinsicInfo> IntrinsicMap,
findNeonIntrinsicInMap(ArrayRef<NeonIntrinsicInfo> IntrinsicMap,
unsigned BuiltinID, bool &MapProvenSorted) {		unsigned BuiltinID, bool &MapProvenSorted) {

#ifndef NDEBUG		#ifndef NDEBUG
if (!MapProvenSorted) {		if (!MapProvenSorted) {
assert(std::is_sorted(std::begin(IntrinsicMap), std::end(IntrinsicMap)));		assert(std::is_sorted(std::begin(IntrinsicMap), std::end(IntrinsicMap)));
MapProvenSorted = true;		MapProvenSorted = true;
}		}
#endif		#endif

const NeonIntrinsicInfo *Builtin = llvm::lower_bound(IntrinsicMap, BuiltinID);		const ARMVectorIntrinsicInfo *Builtin =
		llvm::lower_bound(IntrinsicMap, BuiltinID);

if (Builtin != IntrinsicMap.end() && Builtin->BuiltinID == BuiltinID)		if (Builtin != IntrinsicMap.end() && Builtin->BuiltinID == BuiltinID)
return Builtin;		return Builtin;

return nullptr;		return nullptr;
}		}

Function *CodeGenFunction::LookupNeonLLVMIntrinsic(unsigned IntrinsicID,		Function *CodeGenFunction::LookupNeonLLVMIntrinsic(unsigned IntrinsicID,
Show All 30 Lines	if (Modifier & Add2ArgTypes)
Tys.push_back(ArgType);		Tys.push_back(ArgType);

if (Modifier & InventFloatType)		if (Modifier & InventFloatType)
Tys.push_back(FloatTy);		Tys.push_back(FloatTy);

return CGM.getIntrinsic(IntrinsicID, Tys);		return CGM.getIntrinsic(IntrinsicID, Tys);
}		}

static Value *EmitCommonNeonSISDBuiltinExpr(CodeGenFunction &CGF,		static Value *EmitCommonNeonSISDBuiltinExpr(
const NeonIntrinsicInfo &SISDInfo,		CodeGenFunction &CGF, const ARMVectorIntrinsicInfo &SISDInfo,
SmallVectorImpl<Value *> &Ops,		SmallVectorImpl<Value > &Ops, const CallExpr E) {
const CallExpr *E) {
unsigned BuiltinID = SISDInfo.BuiltinID;		unsigned BuiltinID = SISDInfo.BuiltinID;
unsigned int Int = SISDInfo.LLVMIntrinsic;		unsigned int Int = SISDInfo.LLVMIntrinsic;
unsigned Modifier = SISDInfo.TypeModifier;		unsigned Modifier = SISDInfo.TypeModifier;
const char *s = SISDInfo.NameHint;		const char *s = SISDInfo.NameHint;

switch (BuiltinID) {		switch (BuiltinID) {
case NEON::BI__builtin_neon_vcled_s64:		case NEON::BI__builtin_neon_vcled_s64:
case NEON::BI__builtin_neon_vcled_u64:		case NEON::BI__builtin_neon_vcled_u64:
▲ Show 20 Lines • Show All 1,496 Lines • ▼ Show 20 Lines	llvm::VectorType *VTy = GetNeonType(this, Type,
getTarget().hasLegalHalfType());		getTarget().hasLegalHalfType());
llvm::Type *Ty = VTy;		llvm::Type *Ty = VTy;
if (!Ty)		if (!Ty)
return nullptr;		return nullptr;

// Many NEON builtins have identical semantics and uses in ARM and		// Many NEON builtins have identical semantics and uses in ARM and
// AArch64. Emit these in a single function.		// AArch64. Emit these in a single function.
auto IntrinsicMap = makeArrayRef(ARMSIMDIntrinsicMap);		auto IntrinsicMap = makeArrayRef(ARMSIMDIntrinsicMap);
const NeonIntrinsicInfo *Builtin = findNeonIntrinsicInMap(		const ARMVectorIntrinsicInfo *Builtin = findARMVectorIntrinsicInMap(
IntrinsicMap, BuiltinID, NEONSIMDIntrinsicsProvenSorted);		IntrinsicMap, BuiltinID, NEONSIMDIntrinsicsProvenSorted);
if (Builtin)		if (Builtin)
return EmitCommonNeonBuiltinExpr(		return EmitCommonNeonBuiltinExpr(
Builtin->BuiltinID, Builtin->LLVMIntrinsic, Builtin->AltLLVMIntrinsic,		Builtin->BuiltinID, Builtin->LLVMIntrinsic, Builtin->AltLLVMIntrinsic,
Builtin->NameHint, Builtin->TypeModifier, E, Ops, PtrOp0, PtrOp1, Arch);		Builtin->NameHint, Builtin->TypeModifier, E, Ops, PtrOp0, PtrOp1, Arch);

unsigned Int;		unsigned Int;
switch (BuiltinID) {		switch (BuiltinID) {
▲ Show 20 Lines • Show All 555 Lines • ▼ Show 20 Lines	Value CodeGenFunction::EmitSVEMaskedLoad(llvm::Type ReturnTy,
Value *Predicate = EmitSVEPredicateCast(Ops[0], MemoryTy);		Value *Predicate = EmitSVEPredicateCast(Ops[0], MemoryTy);
Value *BasePtr = Builder.CreateBitCast(Ops[1], MemoryTy->getPointerTo());		Value *BasePtr = Builder.CreateBitCast(Ops[1], MemoryTy->getPointerTo());
BasePtr = Builder.CreateGEP(MemoryTy, BasePtr, Offset);		BasePtr = Builder.CreateGEP(MemoryTy, BasePtr, Offset);

Value *Splat0 = Constant::getNullValue(MemoryTy);		Value *Splat0 = Constant::getNullValue(MemoryTy);
return Builder.CreateMaskedLoad(BasePtr, Align(1), Predicate, Splat0);		return Builder.CreateMaskedLoad(BasePtr, Align(1), Predicate, Splat0);
}		}

		Value *CodeGenFunction::EmitAArch64SVEBuiltinExpr(unsigned BuiltinID,
		const CallExpr *E) {
		// Find out if any arguments are required to be integer constant expressions.
		unsigned ICEArguments = 0;
		ASTContext::GetBuiltinTypeError Error;
		getContext().GetBuiltinType(BuiltinID, Error, &ICEArguments);
		assert(Error == ASTContext::GE_None && "Should not codegen an error");

		llvm::SmallVector<Value *, 4> Ops;
		for (unsigned i = 0, e = E->getNumArgs(); i != e; i++) {
		if ((ICEArguments & (1 << i)) == 0)
		Ops.push_back(EmitScalarExpr(E->getArg(i)));
		else
		llvm_unreachable("Not yet implemented");
		}

		auto *Builtin = findARMVectorIntrinsicInMap(AArch64SVEIntrinsicMap, BuiltinID,
		SjoerdMeijerUnsubmitted Not Done Reply Inline Actions and the same here: findNeon... SjoerdMeijer: and the same here: findNeon...
		AArch64SVEIntrinsicsProvenSorted);
		SVETypeFlags TypeFlags(Builtin->TypeModifier);
		llvm::Type *Ty = ConvertType(E->getType());
		if (TypeFlags.isLoad())
		return EmitSVEMaskedLoad(Ty, Ops);

		/// Should not happen
		return nullptr;
		}

Value *CodeGenFunction::EmitAArch64BuiltinExpr(unsigned BuiltinID,		Value *CodeGenFunction::EmitAArch64BuiltinExpr(unsigned BuiltinID,
const CallExpr *E,		const CallExpr *E,
llvm::Triple::ArchType Arch) {		llvm::Triple::ArchType Arch) {
		if (BuiltinID >= AArch64::FirstSVEBuiltin &&
		BuiltinID <= AArch64::LastSVEBuiltin)
		return EmitAArch64SVEBuiltinExpr(BuiltinID, E);

unsigned HintID = static_cast<unsigned>(-1);		unsigned HintID = static_cast<unsigned>(-1);
switch (BuiltinID) {		switch (BuiltinID) {
default: break;		default: break;
case AArch64::BI__builtin_arm_nop:		case AArch64::BI__builtin_arm_nop:
HintID = 0;		HintID = 0;
break;		break;
case AArch64::BI__builtin_arm_yield:		case AArch64::BI__builtin_arm_yield:
case AArch64::BI__yield:		case AArch64::BI__yield:
Show All 17 Lines	case AArch64::BI__sevl:
break;		break;
}		}

if (HintID != static_cast<unsigned>(-1)) {		if (HintID != static_cast<unsigned>(-1)) {
Function *F = CGM.getIntrinsic(Intrinsic::aarch64_hint);		Function *F = CGM.getIntrinsic(Intrinsic::aarch64_hint);
return Builder.CreateCall(F, llvm::ConstantInt::get(Int32Ty, HintID));		return Builder.CreateCall(F, llvm::ConstantInt::get(Int32Ty, HintID));
}		}

switch (BuiltinID) {
case AArch64::BI__builtin_sve_svld1_u8:
case AArch64::BI__builtin_sve_svld1_u16:
case AArch64::BI__builtin_sve_svld1_u32:
case AArch64::BI__builtin_sve_svld1_u64:
case AArch64::BI__builtin_sve_svld1_s8:
case AArch64::BI__builtin_sve_svld1_s16:
case AArch64::BI__builtin_sve_svld1_s32:
case AArch64::BI__builtin_sve_svld1_s64:
case AArch64::BI__builtin_sve_svld1_f16:
case AArch64::BI__builtin_sve_svld1_f32:
case AArch64::BI__builtin_sve_svld1_f64: {
llvm::SmallVector<Value *, 4> Ops = {EmitScalarExpr(E->getArg(0)),
EmitScalarExpr(E->getArg(1))};
llvm::Type *Ty = ConvertType(E->getType());
return EmitSVEMaskedLoad(Ty, Ops);
}
default:
break;
}

if (BuiltinID == AArch64::BI__builtin_arm_prefetch) {		if (BuiltinID == AArch64::BI__builtin_arm_prefetch) {
Value *Address = EmitScalarExpr(E->getArg(0));		Value *Address = EmitScalarExpr(E->getArg(0));
Value *RW = EmitScalarExpr(E->getArg(1));		Value *RW = EmitScalarExpr(E->getArg(1));
Value *CacheLevel = EmitScalarExpr(E->getArg(2));		Value *CacheLevel = EmitScalarExpr(E->getArg(2));
Value *RetentionPolicy = EmitScalarExpr(E->getArg(3));		Value *RetentionPolicy = EmitScalarExpr(E->getArg(3));
Value *IsData = EmitScalarExpr(E->getArg(4));		Value *IsData = EmitScalarExpr(E->getArg(4));

Value *Locality = nullptr;		Value *Locality = nullptr;
▲ Show 20 Lines • Show All 382 Lines • ▼ Show 20 Lines	if ((ICEArguments & (1 << i)) == 0) {
bool IsConst = E->getArg(i)->isIntegerConstantExpr(Result, getContext());		bool IsConst = E->getArg(i)->isIntegerConstantExpr(Result, getContext());
assert(IsConst && "Constant arg isn't actually constant?");		assert(IsConst && "Constant arg isn't actually constant?");
(void)IsConst;		(void)IsConst;
Ops.push_back(llvm::ConstantInt::get(getLLVMContext(), Result));		Ops.push_back(llvm::ConstantInt::get(getLLVMContext(), Result));
}		}
}		}

auto SISDMap = makeArrayRef(AArch64SISDIntrinsicMap);		auto SISDMap = makeArrayRef(AArch64SISDIntrinsicMap);
const NeonIntrinsicInfo *Builtin = findNeonIntrinsicInMap(		const ARMVectorIntrinsicInfo *Builtin = findARMVectorIntrinsicInMap(
SISDMap, BuiltinID, AArch64SISDIntrinsicsProvenSorted);		SISDMap, BuiltinID, AArch64SISDIntrinsicsProvenSorted);

if (Builtin) {		if (Builtin) {
Ops.push_back(EmitScalarExpr(E->getArg(E->getNumArgs() - 1)));		Ops.push_back(EmitScalarExpr(E->getArg(E->getNumArgs() - 1)));
Value Result = EmitCommonNeonSISDBuiltinExpr(this, *Builtin, Ops, E);		Value Result = EmitCommonNeonSISDBuiltinExpr(this, *Builtin, Ops, E);
assert(Result && "SISD intrinsic should have been handled");		assert(Result && "SISD intrinsic should have been handled");
return Result;		return Result;
}		}
▲ Show 20 Lines • Show All 823 Lines • ▼ Show 20 Lines

llvm::VectorType *VTy = GetNeonType(this, Type);		llvm::VectorType *VTy = GetNeonType(this, Type);
llvm::Type *Ty = VTy;		llvm::Type *Ty = VTy;
if (!Ty)		if (!Ty)
return nullptr;		return nullptr;

// Not all intrinsics handled by the common case work for AArch64 yet, so only		// Not all intrinsics handled by the common case work for AArch64 yet, so only
// defer to common code if it's been added to our special map.		// defer to common code if it's been added to our special map.
Builtin = findNeonIntrinsicInMap(AArch64SIMDIntrinsicMap, BuiltinID,		Builtin = findARMVectorIntrinsicInMap(AArch64SIMDIntrinsicMap, BuiltinID,
AArch64SIMDIntrinsicsProvenSorted);		AArch64SIMDIntrinsicsProvenSorted);

if (Builtin)		if (Builtin)
return EmitCommonNeonBuiltinExpr(		return EmitCommonNeonBuiltinExpr(
Builtin->BuiltinID, Builtin->LLVMIntrinsic, Builtin->AltLLVMIntrinsic,		Builtin->BuiltinID, Builtin->LLVMIntrinsic, Builtin->AltLLVMIntrinsic,
Builtin->NameHint, Builtin->TypeModifier, E, Ops,		Builtin->NameHint, Builtin->TypeModifier, E, Ops,
/never use addresses/ Address::invalid(), Address::invalid(), Arch);		/never use addresses/ Address::invalid(), Address::invalid(), Arch);

if (Value V = EmitAArch64TblBuiltinExpr(this, BuiltinID, E, Ops, Arch))		if (Value V = EmitAArch64TblBuiltinExpr(this, BuiltinID, E, Ops, Arch))
▲ Show 20 Lines • Show All 6,682 Lines • Show Last 20 Lines

clang/lib/CodeGen/CodeGenFunction.h

Show First 20 Lines • Show All 3,898 Lines • ▼ Show 20 Lines	llvm::Value EmitNeonShiftVector(llvm::Value V, llvm::Type *Ty,
bool negateForRightShift);		bool negateForRightShift);
llvm::Value EmitNeonRShiftImm(llvm::Value Vec, llvm::Value *Amt,		llvm::Value EmitNeonRShiftImm(llvm::Value Vec, llvm::Value *Amt,
llvm::Type Ty, bool usgn, const char name);		llvm::Type Ty, bool usgn, const char name);
llvm::Value vectorWrapScalar16(llvm::Value Op);		llvm::Value vectorWrapScalar16(llvm::Value Op);

llvm::Value EmitSVEPredicateCast(llvm::Value Pred, llvm::VectorType *VTy);		llvm::Value EmitSVEPredicateCast(llvm::Value Pred, llvm::VectorType *VTy);
llvm::Value EmitSVEMaskedLoad(llvm::Type ReturnTy,		llvm::Value EmitSVEMaskedLoad(llvm::Type ReturnTy,
SmallVectorImpl<llvm::Value *> &Ops);		SmallVectorImpl<llvm::Value *> &Ops);
		llvm::Value EmitAArch64SVEBuiltinExpr(unsigned BuiltinID, const CallExpr E);

llvm::Value EmitAArch64BuiltinExpr(unsigned BuiltinID, const CallExpr E,		llvm::Value EmitAArch64BuiltinExpr(unsigned BuiltinID, const CallExpr E,
llvm::Triple::ArchType Arch);		llvm::Triple::ArchType Arch);
llvm::Value EmitBPFBuiltinExpr(unsigned BuiltinID, const CallExpr E);		llvm::Value EmitBPFBuiltinExpr(unsigned BuiltinID, const CallExpr E);

llvm::Value BuildVector(ArrayRef<llvm::Value> Ops);		llvm::Value BuildVector(ArrayRef<llvm::Value> Ops);
llvm::Value EmitX86BuiltinExpr(unsigned BuiltinID, const CallExpr E);		llvm::Value EmitX86BuiltinExpr(unsigned BuiltinID, const CallExpr E);
llvm::Value EmitPPCBuiltinExpr(unsigned BuiltinID, const CallExpr E);		llvm::Value EmitPPCBuiltinExpr(unsigned BuiltinID, const CallExpr E);
▲ Show 20 Lines • Show All 630 Lines • Show Last 20 Lines

clang/utils/TableGen/SveEmitter.cpp

	//===- SveEmitter.cpp - Generate arm_sve.h for use with clang -- C++ --===//			//===- SveEmitter.cpp - Generate arm_sve.h for use with clang -- C++ --===//
				SjoerdMeijerUnsubmitted Done Reply Inline Actions I wanted to add the nit that SveEmiiter.cpp should perhaps be SVEEmitter.cpp, but then I saw at the bottom that MVE is spelled Mve, so perhaps this is fine then. SjoerdMeijer: I wanted to add the nit that SveEmiiter.cpp should perhaps be SVEEmitter.cpp, but then I saw at…
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This tablegen backend is responsible for emitting arm_sve.h, which includes			// This tablegen backend is responsible for emitting arm_sve.h, which includes
	Show All 14 Lines
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "llvm/ADT/STLExtras.h"			#include "llvm/ADT/STLExtras.h"
	#include "llvm/ADT/DenseMap.h"			#include "llvm/ADT/DenseMap.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/StringExtras.h"			#include "llvm/ADT/StringExtras.h"
	#include "llvm/TableGen/Record.h"			#include "llvm/TableGen/Record.h"
	#include "llvm/TableGen/Error.h"			#include "llvm/TableGen/Error.h"
				#include "clang/Basic/AArch64SVETypeFlags.h"
				thakisUnsubmitted Not Done Reply Inline Actions Including stuff from `clang/Basic` in clang/utils/TableGen is conceptually a layering violation: clang-tblgen is used to generate headers included in clang/Basic. In this case it happens to work, but it's because you're lucky, and it could break for subtle reasons if the TypeFlags header starts including some other header in Basic that happens to include something generated. Please restructure this so that the TableGen code doesn't need an include from Basic. thakis: Including stuff from `clang/Basic` in clang/utils/TableGen is conceptually a layering violation…
				sdesmalenAuthorUnsubmitted Done Reply Inline Actions Thanks for pointing out! The only directory that seems to have common includes between Clang TableGen/CodeGen is the llvm/Support directory, any objection to me moving the header there? sdesmalen: Thanks for pointing out! The only directory that seems to have common includes between Clang…
				thakisUnsubmitted Not Done Reply Inline Actions That seems like a strange place for this header. Maybe you can rework things so that you don't have to share a header between clang's tablegen and clang's Basic? No other tablegen output so far has needed that. (see e.g. the `/// These must be kept in sync with the flags in utils/TableGen/NeonEmitter.h.` line in TargetBuiltins.h). If that isn't possible at all, I suppose you could put the .h file in clang/utils/TableGen and also make clang-tblgen write the .h file and use the written .h file in Basic. thakis: That seems like a strange place for this header. Maybe you can rework things so that you don't…
	#include <string>			#include <string>
	#include <sstream>			#include <sstream>
	#include <set>			#include <set>
	#include <cctype>			#include <cctype>

	using namespace llvm;			using namespace llvm;

	//===----------------------------------------------------------------------===//			enum ClassKind {
	// SVEEmitter			ClassNone,
	//===----------------------------------------------------------------------===//			ClassS, // signed/unsigned, e.g., "_s8", "_u8" suffix
				ClassG, // Overloaded name without type suffix
				};

				using TypeSpec = std::string;
				using SVETypeFlags = clang::SVETypeFlags;

	namespace {			namespace {

				class SVEType {
				TypeSpec TS;
				bool Float, Signed, Immediate, Void, Constant, Pointer;
				bool DefaultType, IsScalable, Predicate, PredicatePattern, PrefetchOp;
				unsigned Bitwidth, ElementBitwidth, NumVectors;

				public:
				SVEType() : SVEType(TypeSpec(), 'v') {}

				SVEType(TypeSpec TS, char CharMod)
				: TS(TS), Float(false), Signed(true), Immediate(false), Void(false),
				Constant(false), Pointer(false), DefaultType(false), IsScalable(true),
				Predicate(false), PredicatePattern(false), PrefetchOp(false),
				Bitwidth(128), ElementBitwidth(~0U), NumVectors(1) {
				SjoerdMeijerUnsubmitted Done Reply Inline Actions why a default of 128? Will this gives problems for SVE implementions with> 128 bits? SjoerdMeijer: why a default of 128? Will this gives problems for SVE implementions with> 128 bits?
				sdesmalenAuthorUnsubmitted Done Reply Inline Actions SVE vectors are n x 128bits, so the 128 is scalable here. sdesmalen: SVE vectors are n x 128bits, so the 128 is scalable here.
				SjoerdMeijerUnsubmitted Not Done Reply Inline Actions ah, okay, fair enough, didn't realise that. SjoerdMeijer: ah, okay, fair enough, didn't realise that.
				if (!TS.empty())
				applyTypespec();
				applyModifier(CharMod);
				}

				/// Return the value in SVETypeFlags for this type.
				unsigned getTypeFlags() const;

				bool isPointer() const { return Pointer; }
				bool isVoidPointer() const { return Pointer && Void; }
				bool isSigned() const { return Signed; }
				bool isImmediate() const { return Immediate; }
				bool isScalar() const { return NumVectors == 0; }
				bool isVector() const { return NumVectors > 0; }
				bool isScalableVector() const { return isVector() && IsScalable; }
				bool isChar() const { return ElementBitwidth == 8; }
				bool isVoid() const { return Void & !Pointer; }
				bool isDefault() const { return DefaultType; }
				bool isFloat() const { return Float; }
				bool isInteger() const { return !Float && !Predicate; }
				bool isScalarPredicate() const { return !Float && ElementBitwidth == 1; }
				bool isPredicateVector() const { return Predicate; }
				bool isPredicatePattern() const { return PredicatePattern; }
				bool isPrefetchOp() const { return PrefetchOp; }
				bool isConstant() const { return Constant; }
				unsigned getElementSizeInBits() const { return ElementBitwidth; }
				unsigned getNumVectors() const { return NumVectors; }

				unsigned getNumElements() const {
				assert(ElementBitwidth != ~0U);
				return Bitwidth / ElementBitwidth;
				}
				unsigned getSizeInBits() const {
				return Bitwidth;
				}

				/// Return the string representation of a type, which is an encoded
				/// string for passing to the BUILTIN() macro in Builtins.def.
				std::string builtin_str() const;

				private:
				/// Creates the type based on the typespec string in TS.
				void applyTypespec();

				/// Applies a prototype modifier to the type.
				void applyModifier(char Mod);
				};


				class SVEEmitter;

				/// The main grunt class. This represents an instantiation of an intrinsic with
				/// a particular typespec and prototype.
				class Intrinsic {
				/// The unmangled name.
				SjoerdMeijerUnsubmitted Done Reply Inline Actions nit: for readability, perhaps don't abbreviate some of these member names? R -> Record BaseTS -> BaseTypeSpec CK -> ClassKind SjoerdMeijer: nit: for readability, perhaps don't abbreviate some of these member names? R -> Record…
				sdesmalenAuthorUnsubmitted Done Reply Inline Actions `Record` and `ClassKind` are also the names of the enum though. Perhaps I can rename CK to `Class`? sdesmalen: `Record` and `ClassKind` are also the names of the enum though. Perhaps I can rename CK to…
				std::string Name;

				/// The name of the corresponding LLVM IR intrinsic.
				std::string LLVMName;

				/// Intrinsic prototype.
				std::string Proto;

				/// The base type spec for this intrinsic.
				TypeSpec BaseTypeSpec;

				/// The base class kind. Most intrinsics use ClassS, which has full type
				/// info for integers (_s32/_u32), or ClassG which is used for overloaded
				/// intrinsics.
				ClassKind Class;

				/// The architectural #ifdef guard.
				std::string Guard;

				/// The types of return value [0] and parameters [1..].
				std::vector<SVEType> Types;

				/// The "base type", which is VarType('d', BaseTypeSpec).
				SVEType BaseType;

				/// The type of the memory element
				enum MemEltType {
				MemEltTypeDefault,
				MemEltTypeInt8,
				MemEltTypeInt16,
				MemEltTypeInt32,
				MemEltTypeInt64,
				MemEltTypeInvalid
				} MemEltTy;

				SVETypeFlags Flags;

				public:
				/// The type of predication.
				enum MergeType {
				MergeNone,
				MergeAny,
				MergeOp1,
				MergeZero,
				MergeAnyExp,
				MergeZeroExp,
				MergeInvalid
				} Merge;

				Intrinsic(StringRef Name, StringRef Proto, int64_t MT, int64_t MET,
				StringRef LLVMName, SVETypeFlags Flags, TypeSpec BT, ClassKind Class,
				SVEEmitter &Emitter, StringRef Guard)
				: Name(Name.str()), LLVMName(LLVMName), Proto(Proto.str()),
				BaseTypeSpec(BT), Class(Class), Guard(Guard.str()), BaseType(BT, 'd'),
				MemEltTy(MemEltType(MET)), Flags(Flags), Merge(MergeType(MT)) {
				// Types[0] is the return value.
				for (unsigned I = 0; I < Proto.size(); ++I)
				Types.emplace_back(BaseTypeSpec, Proto[I]);
				}

				~Intrinsic()=default;

				std::string getName() const { return Name; }
				std::string getLLVMName() const { return LLVMName; }
				std::string getProto() const { return Proto; }
				TypeSpec getBaseTypeSpec() const { return BaseTypeSpec; }
				SVEType getBaseType() const { return BaseType; }

				StringRef getGuard() const { return Guard; }
				ClassKind getClassKind() const { return Class; }
				MergeType getMergeType() const { return Merge; }

				SVEType getReturnType() const { return Types[0]; }
				ArrayRef<SVEType> getTypes() const { return Types; }
				SVEType getParamType(unsigned I) const { return Types[I + 1]; }
				unsigned getNumParams() const { return Proto.size() - 1; }

				SVETypeFlags getFlags() const { return Flags; }
				bool isFlagSet(uint64_t Flag) const { return Flags.isFlagSet(Flag);}

				int64_t getMemEltTypeEnum() const {
				int64_t METEnum = (MemEltTy << SVETypeFlags::MemEltTypeOffset);
				assert((METEnum &~ SVETypeFlags::MemEltTypeMask) == 0 && "Bad MemEltTy");
				return METEnum;
				}

				/// Return the type string for a BUILTIN() macro in Builtins.def.
				std::string getBuiltinTypeStr();

				/// Return the name, mangled with type information. The name is mangled for
				/// ClassS, so will add type suffixes such as _u32/_s32.
				std::string getMangledName() const { return mangleName(ClassS); }

				/// Returns true if the intrinsic is overloaded, in that it should also generate
				/// a short form without the type-specifiers, e.g. 'svld1(..)' instead of
				/// 'svld1_u32(..)'.
				static bool isOverloadedIntrinsic(StringRef Name) {
				auto BrOpen = Name.find("[");
				auto BrClose = Name.find(']');
				return BrOpen != std::string::npos && BrClose != std::string::npos;
				}

				/// Emits the intrinsic declaration to the ostream.
				void emitIntrinsic(raw_ostream &OS) const;

				private:
				std::string getMergeSuffix() const;
				std::string mangleName(ClassKind LocalCK) const;
				std::string replaceTemplatedArgs(std::string Name, TypeSpec TS,
				std::string Proto) const;
				};

	class SVEEmitter {			class SVEEmitter {
				private:
				RecordKeeper &Records;

	public:			public:
	// run - Emit arm_sve.h			SVEEmitter(RecordKeeper &R) : Records(R) {}
	void run(raw_ostream &o);
				/// Emit arm_sve.h.
				void createHeader(raw_ostream &o);

				/// Emit all the __builtin prototypes and code needed by Sema.
				void createBuiltins(raw_ostream &o);

				/// Emit all the information needed to map builtin -> LLVM IR intrinsic.
				void createCodeGenMap(raw_ostream &o);

				/// Create intrinsic and add it to \p Out
				void createIntrinsic(Record *R, SmallVectorImpl<std::unique_ptr<Intrinsic>> &Out);
	};			};

	} // end anonymous namespace			} // end anonymous namespace


	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
				// Type implementation
				//===----------------------------------------------------------------------===//

				unsigned SVEType::getTypeFlags() const {
				if (isFloat()) {
				SjoerdMeijerUnsubmitted Done Reply Inline Actions don't need this SjoerdMeijer: don't need this
				switch (ElementBitwidth) {
				case 16: return SVETypeFlags::Float16;
				case 32: return SVETypeFlags::Float32;
				case 64: return SVETypeFlags::Float64;
				SjoerdMeijerUnsubmitted Done Reply Inline Actions just a return here SjoerdMeijer: just a return here
				sdesmalenAuthorUnsubmitted Done Reply Inline Actions Good catch! sdesmalen: Good catch!
				default: llvm_unreachable("Unhandled float element bitwidth!");
				}
				}

				if (isPredicateVector()) {
				switch (ElementBitwidth) {
				case 8: return SVETypeFlags::Bool8;
				case 16: return SVETypeFlags::Bool16;
				case 32: return SVETypeFlags::Bool32;
				case 64: return SVETypeFlags::Bool64;
				default: llvm_unreachable("Unhandled predicate element bitwidth!");
				}
				}

				switch (ElementBitwidth) {
				case 8: return SVETypeFlags::Int8;
				case 16: return SVETypeFlags::Int16;
				case 32: return SVETypeFlags::Int32;
				case 64: return SVETypeFlags::Int64;
				default: llvm_unreachable("Unhandled integer element bitwidth!");
				}
				}

				std::string SVEType::builtin_str() const {
				std::string S;
				if (isVoid())
				return "v";

				if (isVoidPointer())
				S += "v";
				else if (!Float)
				switch (ElementBitwidth) {
				case 1: S += "b"; break;
				case 8: S += "c"; break;
				case 16: S += "s"; break;
				case 32: S += "i"; break;
				case 64: S += "Wi"; break;
				case 128: S += "LLLi"; break;
				default: llvm_unreachable("Unhandled case!");
				}
				else
				switch (ElementBitwidth) {
				case 16: S += "h"; break;
				case 32: S += "f"; break;
				case 64: S += "d"; break;
				default: llvm_unreachable("Unhandled case!");
				}

				if (!isFloat()) {
				if ((isChar() \|\| isPointer()) && !isVoidPointer()) {
				// Make chars and typed pointers explicitly signed.
				if (Signed)
				S = "S" + S;
				else if (!Signed)
				S = "U" + S;
				} else if (!isVoidPointer() && !Signed) {
				S = "U" + S;
				}
				}

				// Constant indices are "int", but have the "constant expression" modifier.
				if (isImmediate()) {
				assert(!isFloat() && "fp immediates are not supported");
				S = "I" + S;
				}

				if (isScalar()) {
				if (Constant) S += "C";
				if (Pointer) S += "*";
				return S;
				}

				assert(isScalableVector() && "Unsupported type");
				return "q" + utostr(getNumElements() * NumVectors) + S;
				}

				void SVEType::applyTypespec() {
				for (char I : TS) {
				switch (I) {
				case 'P':
				Predicate = true;
				ElementBitwidth = 1;
				break;
				case 'U':
				Signed = false;
				break;
				case 'c':
				ElementBitwidth = 8;
				break;
				case 's':
				ElementBitwidth = 16;
				break;
				case 'i':
				ElementBitwidth = 32;
				break;
				case 'l':
				ElementBitwidth = 64;
				break;
				case 'h':
				Float = true;
				ElementBitwidth = 16;
				break;
				case 'f':
				Float = true;
				ElementBitwidth = 32;
				break;
				case 'd':
				Float = true;
				ElementBitwidth = 64;
				break;
				default:
				llvm_unreachable("Unhandled type code!");
				}
				}
				assert(ElementBitwidth != ~0U && "Bad element bitwidth!");
				}

				void SVEType::applyModifier(char Mod) {
				switch (Mod) {
				case 'v':
				Void = true;
				break;
				case 'd':
				DefaultType = true;
				break;
				case 'c':
				Constant = true;
				LLVM_FALLTHROUGH;
				case 'p':
				Pointer = true;
				Bitwidth = ElementBitwidth;
				NumVectors = 0;
				break;
				case 'P':
				Signed = true;
				Float = false;
				Predicate = true;
				Bitwidth = 16;
				ElementBitwidth = 1;
				break;
				default:
				llvm_unreachable("Unhandled character!");
				}
				}


				//===----------------------------------------------------------------------===//
				// Intrinsic implementation
				//===----------------------------------------------------------------------===//

				std::string Intrinsic::getBuiltinTypeStr() {
				std::string S;

				SVEType RetT = getReturnType();
				// Since the return value must be one type, return a vector type of the
				// appropriate width which we will bitcast. An exception is made for
				// returning structs of 2, 3, or 4 vectors which are returned in a sret-like
				// fashion, storing them to a pointer arg.
				if (RetT.getNumVectors() > 1) {
				S += "vv"; // void result with void first argument
				} else
				S += RetT.builtin_str();

				for (unsigned I = 0; I < getNumParams(); ++I)
				S += getParamType(I).builtin_str();

				return S;
				}

				std::string Intrinsic::replaceTemplatedArgs(std::string Name, TypeSpec TS,
				std::string Proto) const {
				std::string Ret = Name;
				while (Ret.find('{') != std::string::npos) {
				size_t Pos = Ret.find('{');
				size_t End = Ret.find('}');
				unsigned NumChars = End - Pos + 1;
				assert(NumChars == 3 && "Unexpected template argument");

				SVEType T;
				char C = Ret[Pos+1];
				switch(C) {
				default:
				llvm_unreachable("Unknown predication specifier");
				case 'd':
				T = SVEType(TS, 'd');
				break;
				case '0':
				case '1':
				case '2':
				case '3':
				T = SVEType(TS, Proto[C - '0']);
				break;
				}

				// Replace templated arg with the right suffix (e.g. u32)
				std::string TypeCode;
				if (T.isInteger())
				TypeCode = T.isSigned() ? 's' : 'u';
				else if (T.isPredicateVector())
				TypeCode = 'b';
				else
				TypeCode = 'f';
				Ret.replace(Pos, NumChars, TypeCode + utostr(T.getElementSizeInBits()));
				}

				return Ret;
				}

				// ACLE function names have a merge style postfix.
				std::string Intrinsic::getMergeSuffix() const {
				switch (getMergeType()) {
				default:
				llvm_unreachable("Unknown predication specifier");
				case MergeNone: return "";
				case MergeAny:
				case MergeAnyExp: return "_x";
				case MergeOp1: return "_m";
				case MergeZero:
				case MergeZeroExp: return "_z";
				}
				}

				std::string Intrinsic::mangleName(ClassKind LocalCK) const {
				std::string S = getName();

				if (LocalCK == ClassG) {
				// Remove the square brackets and everything in between.
				while (S.find("[") != std::string::npos) {
				auto Start = S.find("[");
				auto End = S.find(']');
				S.erase(Start, (End-Start)+1);
				}
				} else {
				// Remove the square brackets.
				while (S.find("[") != std::string::npos) {
				auto BrPos = S.find('[');
				if (BrPos != std::string::npos)
				S.erase(BrPos, 1);
				BrPos = S.find(']');
				if (BrPos != std::string::npos)
				S.erase(BrPos, 1);
				}
				}

				// Replace all {d} like expressions with e.g. 'u32'
				return replaceTemplatedArgs(S, getBaseTypeSpec(), getProto()) +
				getMergeSuffix();
				}

				void Intrinsic::emitIntrinsic(raw_ostream &OS) const {
				// Use the preprocessor to enable the non-overloaded builtins.
				if (getClassKind() != ClassG \|\| getProto().size() <= 1) {
				OS << "#define " << mangleName(getClassKind())
				<< "(...) __builtin_sve_" << mangleName(ClassS)
				<< "(__VA_ARGS__)\n";
				} else {
				llvm_unreachable("Not yet implemented. Overloaded intrinsics will follow "
				"in a future patch");
				}
				}

				//===----------------------------------------------------------------------===//
	// SVEEmitter implementation			// SVEEmitter implementation
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
				void SVEEmitter::createIntrinsic(
				Record *R, SmallVectorImpl<std::unique_ptr<Intrinsic>> &Out) {
				StringRef Name = R->getValueAsString("Name");
				StringRef Proto = R->getValueAsString("Prototype");
				StringRef Types = R->getValueAsString("Types");
				StringRef Guard = R->getValueAsString("ArchGuard");
				StringRef LLVMName = R->getValueAsString("LLVMIntrinsic");
				int64_t Merge = R->getValueAsInt("Merge");
				std::vector<Record*> FlagsList = R->getValueAsListOfDefs("Flags");
				int64_t MemEltType = R->getValueAsInt("MemEltType");

				int64_t Flags = 0;
				for (auto FlagRec : FlagsList)
				Flags \|= FlagRec->getValueAsInt("Value");

				// Extract type specs from string
				SmallVector<TypeSpec, 8> TypeSpecs;
				TypeSpec Acc;
				for (char I : Types) {
				Acc.push_back(I);
				if (islower(I)) {
				TypeSpecs.push_back(TypeSpec(Acc));
				Acc.clear();
				}
				}

				// Remove duplicate type specs.
				std::sort(TypeSpecs.begin(), TypeSpecs.end());
				TypeSpecs.erase(std::unique(TypeSpecs.begin(), TypeSpecs.end()),
				TypeSpecs.end());

				// Create an Intrinsic for each type spec.
				for (auto TS : TypeSpecs) {
				Out.push_back(std::make_unique<Intrinsic>(Name, Proto, Merge, MemEltType,
				LLVMName, Flags, TS, ClassS,
				*this, Guard));
				}
				}

	void SVEEmitter::run(raw_ostream &OS) {			void SVEEmitter::createHeader(raw_ostream &OS) {
	OS << "/*===---- arm_sve.h - ARM SVE intrinsics "			OS << "/*===---- arm_sve.h - ARM SVE intrinsics "
	"-----------------------------------===\n"			"-----------------------------------===\n"
	" *\n"			" *\n"
	" *\n"			" *\n"
	" * Part of the LLVM Project, under the Apache License v2.0 with LLVM "			" * Part of the LLVM Project, under the Apache License v2.0 with LLVM "
	"Exceptions.\n"			"Exceptions.\n"
	" * See https://llvm.org/LICENSE.txt for license information.\n"			" * See https://llvm.org/LICENSE.txt for license information.\n"
	" * SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception\n"			" * SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception\n"
	" *\n"			" *\n"
	" *===-----------------------------------------------------------------"			" *===-----------------------------------------------------------------"
	"------===\n"			"------===\n"
	" */\n\n";			" */\n\n";

	OS << "#ifndef __ARM_SVE_H\n";			OS << "#ifndef __ARM_SVE_H\n";
	OS << "#define __ARM_SVE_H\n\n";			OS << "#define __ARM_SVE_H\n\n";

	OS << "#if !defined(__ARM_FEATURE_SVE)\n";			OS << "#if !defined(__ARM_FEATURE_SVE)\n";
	OS << "#error \"SVE support not enabled\"\n";			OS << "#error \"SVE support not enabled\"\n";
	OS << "#else\n\n";			OS << "#else\n\n";

	OS << "#include <stdint.h>\n\n";			OS << "#include <stdint.h>\n\n";
	OS << "#ifndef __cplusplus\n";			OS << "#ifdef __cplusplus\n";
				OS << "extern \"C\" {\n";
				OS << "#else\n";
	OS << "#include <stdbool.h>\n";			OS << "#include <stdbool.h>\n";
	OS << "#endif\n\n";			OS << "#endif\n\n";

	OS << "typedef __fp16 float16_t;\n";			OS << "typedef __fp16 float16_t;\n";
	OS << "typedef float float32_t;\n";			OS << "typedef float float32_t;\n";
	OS << "typedef double float64_t;\n";			OS << "typedef double float64_t;\n";
	OS << "typedef bool bool_t;\n\n";			OS << "typedef bool bool_t;\n\n";

	OS << "typedef __SVInt8_t svint8_t;\n";			OS << "typedef __SVInt8_t svint8_t;\n";
	OS << "typedef __SVInt16_t svint16_t;\n";			OS << "typedef __SVInt16_t svint16_t;\n";
	OS << "typedef __SVInt32_t svint32_t;\n";			OS << "typedef __SVInt32_t svint32_t;\n";
	OS << "typedef __SVInt64_t svint64_t;\n";			OS << "typedef __SVInt64_t svint64_t;\n";
	OS << "typedef __SVUint8_t svuint8_t;\n";			OS << "typedef __SVUint8_t svuint8_t;\n";
	OS << "typedef __SVUint16_t svuint16_t;\n";			OS << "typedef __SVUint16_t svuint16_t;\n";
	OS << "typedef __SVUint32_t svuint32_t;\n";			OS << "typedef __SVUint32_t svuint32_t;\n";
	OS << "typedef __SVUint64_t svuint64_t;\n";			OS << "typedef __SVUint64_t svuint64_t;\n";
	OS << "typedef __SVFloat16_t svfloat16_t;\n";			OS << "typedef __SVFloat16_t svfloat16_t;\n";
	OS << "typedef __SVFloat32_t svfloat32_t;\n";			OS << "typedef __SVFloat32_t svfloat32_t;\n";
	OS << "typedef __SVFloat64_t svfloat64_t;\n";			OS << "typedef __SVFloat64_t svfloat64_t;\n";
	OS << "typedef __SVBool_t svbool_t;\n\n";			OS << "typedef __SVBool_t svbool_t;\n\n";

	OS << "#define svld1_u8(...) __builtin_sve_svld1_u8(__VA_ARGS__)\n";			SmallVector<std::unique_ptr<Intrinsic>, 128> Defs;
	OS << "#define svld1_u16(...) __builtin_sve_svld1_u16(__VA_ARGS__)\n";			std::vector<Record *> RV = Records.getAllDerivedDefinitions("Inst");
	OS << "#define svld1_u32(...) __builtin_sve_svld1_u32(__VA_ARGS__)\n";			for (auto *R : RV)
	OS << "#define svld1_u64(...) __builtin_sve_svld1_u64(__VA_ARGS__)\n";			createIntrinsic(R, Defs);
	OS << "#define svld1_s8(...) __builtin_sve_svld1_s8(__VA_ARGS__)\n";
	OS << "#define svld1_s16(...) __builtin_sve_svld1_s16(__VA_ARGS__)\n";			// Sort intrinsics in header file by following order/priority:
	OS << "#define svld1_s32(...) __builtin_sve_svld1_s32(__VA_ARGS__)\n";			// - Architectural guard (i.e. does it require SVE2 or SVE2_AES)
	OS << "#define svld1_s64(...) __builtin_sve_svld1_s64(__VA_ARGS__)\n";			// - Class (is intrinsic overloaded or not)
	OS << "#define svld1_f16(...) __builtin_sve_svld1_f16(__VA_ARGS__)\n";			// - Intrinsic name
	OS << "#define svld1_f32(...) __builtin_sve_svld1_f32(__VA_ARGS__)\n";			std::stable_sort(
	OS << "#define svld1_f64(...) __builtin_sve_svld1_f64(__VA_ARGS__)\n";			Defs.begin(), Defs.end(), [](const std::unique_ptr<Intrinsic> &A,
				const std::unique_ptr<Intrinsic> &B) {
				return A->getGuard() < B->getGuard() \|\|
				(unsigned)A->getClassKind() < (unsigned)B->getClassKind() \|\|
				A->getName() < B->getName();
				});

				StringRef InGuard = "";
				for (auto &I : Defs) {
				// Emit #endif/#if pair if needed.
				if (I->getGuard() != InGuard) {
				if (!InGuard.empty())
				OS << "#endif //" << InGuard << "\n";
				InGuard = I->getGuard();
				if (!InGuard.empty())
				OS << "\n#if " << InGuard << "\n";
				}

				// Actually emit the intrinsic declaration.
				I->emitIntrinsic(OS);
				}

	OS << "#endif /__ARM_FEATURE_SVE /\n";			if (!InGuard.empty())
				OS << "#endif //" << InGuard << "\n";

				OS << "#ifdef __cplusplus\n";
				OS << "} // extern \"C\"\n";
				OS << "#endif\n\n";
				OS << "#endif /__ARM_FEATURE_SVE /\n\n";
	OS << "#endif /* __ARM_SVE_H */\n";			OS << "#endif /* __ARM_SVE_H */\n";
	}			}

				void SVEEmitter::createBuiltins(raw_ostream &OS) {
				std::vector<Record *> RV = Records.getAllDerivedDefinitions("Inst");
				SmallVector<std::unique_ptr<Intrinsic>, 128> Defs;
				for (auto *R : RV)
				createIntrinsic(R, Defs);

				// The mappings must be sorted based on BuiltinID.
				llvm::sort(Defs, [](const std::unique_ptr<Intrinsic> &A,
				const std::unique_ptr<Intrinsic> &B) {
				return A->getMangledName() < B->getMangledName();
				});

				OS << "#ifdef GET_SVE_BUILTINS\n";
				for (auto &Def : Defs) {
				// Only create BUILTINs for non-overloaded intrinsics, as overloaded
				// declarations only live in the header file.
				if (Def->getClassKind() != ClassG)
				OS << "BUILTIN(__builtin_sve_" << Def->getMangledName() << ", \""
				<< Def->getBuiltinTypeStr() << "\", \"n\")\n";
				}
				OS << "#endif\n\n";
				}

				void SVEEmitter::createCodeGenMap(raw_ostream &OS) {
				std::vector<Record *> RV = Records.getAllDerivedDefinitions("Inst");
				SmallVector<std::unique_ptr<Intrinsic>, 128> Defs;
				for (auto *R : RV)
				createIntrinsic(R, Defs);

				// The mappings must be sorted based on BuiltinID.
				llvm::sort(Defs, [](const std::unique_ptr<Intrinsic> &A,
				const std::unique_ptr<Intrinsic> &B) {
				return A->getMangledName() < B->getMangledName();
				});

				OS << "#ifdef GET_SVE_LLVM_INTRINSIC_MAP\n";
				for (auto &Def : Defs) {
				// Builtins only exist for non-overloaded intrinsics, overloaded
				// declarations only live in the header file.
				if (Def->getClassKind() == ClassG)
				continue;

				assert(!Def->isFlagSet(SVETypeFlags::EltTypeMask) &&
				!Def->isFlagSet(SVETypeFlags::MemEltTypeMask) &&
				"Unexpected mask value");
				uint64_t Flags = Def->getFlags().getBits() \|
				Def->getBaseType().getTypeFlags() \|
				Def->getMemEltTypeEnum();
				auto FlagString = std::to_string(Flags);

				std::string LLVMName = Def->getLLVMName();
				std::string Builtin = Def->getMangledName();
				if (!LLVMName.empty())
				OS << "SVEMAP1(" << Builtin << ", " << LLVMName << ", " << FlagString
				<< "),\n";
				else
				OS << "SVEMAP2(" << Builtin << ", " << FlagString << "),\n";
				}
				OS << "#endif\n\n";
				}

	namespace clang {			namespace clang {
	void EmitSveHeader(RecordKeeper &Records, raw_ostream &OS) {			void EmitSveHeader(RecordKeeper &Records, raw_ostream &OS) {
	SVEEmitter().run(OS);			SVEEmitter(Records).createHeader(OS);
				}

				void EmitSveBuiltins(RecordKeeper &Records, raw_ostream &OS) {
				SVEEmitter(Records).createBuiltins(OS);
				}

				void EmitSveCodeGenMap(RecordKeeper &Records, raw_ostream &OS) {
				SVEEmitter(Records).createCodeGenMap(OS);
	}			}

	} // End namespace clang			} // End namespace clang

clang/utils/TableGen/TableGen.cpp

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	enum ActionType {
GenArmNeonSema,		GenArmNeonSema,
GenArmNeonTest,		GenArmNeonTest,
GenArmMveHeader,		GenArmMveHeader,
GenArmMveBuiltinDef,		GenArmMveBuiltinDef,
GenArmMveBuiltinSema,		GenArmMveBuiltinSema,
GenArmMveBuiltinCG,		GenArmMveBuiltinCG,
GenArmMveBuiltinAliases,		GenArmMveBuiltinAliases,
GenArmSveHeader,		GenArmSveHeader,
		GenArmSveBuiltins,
		GenArmSveCodeGenMap,
GenArmCdeHeader,		GenArmCdeHeader,
GenArmCdeBuiltinDef,		GenArmCdeBuiltinDef,
GenArmCdeBuiltinSema,		GenArmCdeBuiltinSema,
GenArmCdeBuiltinCG,		GenArmCdeBuiltinCG,
GenArmCdeBuiltinAliases,		GenArmCdeBuiltinAliases,
GenAttrDocs,		GenAttrDocs,
GenDiagDocs,		GenDiagDocs,
GenOptDocs,		GenOptDocs,
▲ Show 20 Lines • Show All 101 Lines • ▼ Show 20 Lines	cl::values(
clEnumValN(GenArmNeon, "gen-arm-neon", "Generate arm_neon.h for clang"),		clEnumValN(GenArmNeon, "gen-arm-neon", "Generate arm_neon.h for clang"),
clEnumValN(GenArmFP16, "gen-arm-fp16", "Generate arm_fp16.h for clang"),		clEnumValN(GenArmFP16, "gen-arm-fp16", "Generate arm_fp16.h for clang"),
clEnumValN(GenArmNeonSema, "gen-arm-neon-sema",		clEnumValN(GenArmNeonSema, "gen-arm-neon-sema",
"Generate ARM NEON sema support for clang"),		"Generate ARM NEON sema support for clang"),
clEnumValN(GenArmNeonTest, "gen-arm-neon-test",		clEnumValN(GenArmNeonTest, "gen-arm-neon-test",
"Generate ARM NEON tests for clang"),		"Generate ARM NEON tests for clang"),
clEnumValN(GenArmSveHeader, "gen-arm-sve-header",		clEnumValN(GenArmSveHeader, "gen-arm-sve-header",
"Generate arm_sve.h for clang"),		"Generate arm_sve.h for clang"),
		clEnumValN(GenArmSveBuiltins, "gen-arm-sve-builtins",
		"Generate arm_sve_builtins.inc for clang"),
		clEnumValN(GenArmSveCodeGenMap, "gen-arm-sve-codegenmap",
		"Generate arm_sve_codegenmap.inc for clang"),
		thakisUnsubmitted Not Done Reply Inline Actions Any reason these aren't called `-gen-arm-sve-builtin-def` and `-gen-arm-sve-builtin-codegen` for consistency with CDE and MVE? thakis: Any reason these aren't called `-gen-arm-sve-builtin-def` and `-gen-arm-sve-builtin-codegen`…
		sdesmalenAuthorUnsubmitted Done Reply Inline Actions Not really, I can change that. sdesmalen: Not really, I can change that.
		thakisUnsubmitted Not Done Reply Inline Actions Looks like you renamed the files to be consistent (thanks!), but not the flag names. Can you make those consistent too? thakis: Looks like you renamed the files to be consistent (thanks!), but not the flag names. Can you…
clEnumValN(GenArmMveHeader, "gen-arm-mve-header",		clEnumValN(GenArmMveHeader, "gen-arm-mve-header",
"Generate arm_mve.h for clang"),		"Generate arm_mve.h for clang"),
clEnumValN(GenArmMveBuiltinDef, "gen-arm-mve-builtin-def",		clEnumValN(GenArmMveBuiltinDef, "gen-arm-mve-builtin-def",
"Generate ARM MVE builtin definitions for clang"),		"Generate ARM MVE builtin definitions for clang"),
clEnumValN(GenArmMveBuiltinSema, "gen-arm-mve-builtin-sema",		clEnumValN(GenArmMveBuiltinSema, "gen-arm-mve-builtin-sema",
"Generate ARM MVE builtin sema checks for clang"),		"Generate ARM MVE builtin sema checks for clang"),
clEnumValN(GenArmMveBuiltinCG, "gen-arm-mve-builtin-codegen",		clEnumValN(GenArmMveBuiltinCG, "gen-arm-mve-builtin-codegen",
"Generate ARM MVE builtin code-generator for clang"),		"Generate ARM MVE builtin code-generator for clang"),
▲ Show 20 Lines • Show All 168 Lines • ▼ Show 20 Lines	case GenArmMveBuiltinCG:
EmitMveBuiltinCG(Records, OS);		EmitMveBuiltinCG(Records, OS);
break;		break;
case GenArmMveBuiltinAliases:		case GenArmMveBuiltinAliases:
EmitMveBuiltinAliases(Records, OS);		EmitMveBuiltinAliases(Records, OS);
break;		break;
case GenArmSveHeader:		case GenArmSveHeader:
EmitSveHeader(Records, OS);		EmitSveHeader(Records, OS);
break;		break;
		case GenArmSveBuiltins:
		EmitSveBuiltins(Records, OS);
		break;
		case GenArmSveCodeGenMap:
		EmitSveCodeGenMap(Records, OS);
		break;
case GenArmCdeHeader:		case GenArmCdeHeader:
EmitCdeHeader(Records, OS);		EmitCdeHeader(Records, OS);
break;		break;
case GenArmCdeBuiltinDef:		case GenArmCdeBuiltinDef:
EmitCdeBuiltinDef(Records, OS);		EmitCdeBuiltinDef(Records, OS);
break;		break;
case GenArmCdeBuiltinSema:		case GenArmCdeBuiltinSema:
EmitCdeBuiltinSema(Records, OS);		EmitCdeBuiltinSema(Records, OS);
▲ Show 20 Lines • Show All 46 Lines • Show Last 20 Lines

clang/utils/TableGen/TableGenBackends.h

	Show First 20 Lines • Show All 86 Lines • ▼ Show 20 Lines
	void EmitFP16(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitFP16(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	void EmitNeonSema(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitNeonSema(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	void EmitNeonTest(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitNeonTest(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	void EmitNeon2(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitNeon2(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	void EmitNeonSema2(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitNeonSema2(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	void EmitNeonTest2(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitNeonTest2(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);

	void EmitSveHeader(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitSveHeader(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
				void EmitSveBuiltins(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
				void EmitSveCodeGenMap(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);

	void EmitMveHeader(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitMveHeader(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	void EmitMveBuiltinDef(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitMveBuiltinDef(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	void EmitMveBuiltinSema(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitMveBuiltinSema(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	void EmitMveBuiltinCG(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitMveBuiltinCG(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	void EmitMveBuiltinAliases(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitMveBuiltinAliases(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);

	void EmitCdeHeader(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);			void EmitCdeHeader(llvm::RecordKeeper &Records, llvm::raw_ostream &OS);
	Show All 21 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[SVE] Auto-generate builtins and header for svld1.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 250509

clang/include/clang/Basic/AArch64SVETypeFlags.h

clang/include/clang/Basic/BuiltinsAArch64.def

clang/include/clang/Basic/BuiltinsSVE.def

clang/include/clang/Basic/CMakeLists.txt

clang/include/clang/Basic/TargetBuiltins.h

clang/include/clang/Basic/arm_sve.td

clang/lib/Basic/Targets/AArch64.cpp

clang/lib/CodeGen/CGBuiltin.cpp

clang/lib/CodeGen/CodeGenFunction.h

clang/utils/TableGen/SveEmitter.cpp

clang/utils/TableGen/TableGen.cpp

clang/utils/TableGen/TableGenBackends.h

[SVE] Auto-generate builtins and header for svld1.
ClosedPublic