This is an archive of the discontinued LLVM Phabricator instance.

Differential D85128

[Prototype][SVE] Support arm_sve_vector_bits attribute
Abandoned · Public

Authored by c-rhodes on Aug 3 2020, 5:41 AM.

Details

Reviewers

sdesmalen
paulwalker-arm
rsandifo-arm
efriedma
ctetreau
cameron.mcinally
rengolin
aaron.ballman

Summary

This patch is a prototype demonstrating an alternative approach to D83553 which
turned out to be an unviable solution.

In that approach vector-length-specific types (VLSTs) defined by the attribute
were by default represented as scalable vectors except in constructs where
scalable vectors aren't supported in IR, such as globals and structs, where they
were represented as fixed-length arrays. When loading from a VLST to a
vector-length-agnostic type (VLAT), or when storing a VLAT to a VLST, the
address was bitcast, e.g.

bitcast [N x i8]* %addr.ptr to <vscale x 16 x i8>*

The issue with that approach was that VLSTs were represented as AttributedType
in the AST and were not part of the canonical type. This was problematic in
places such as CodeGenTypes that look at the canonical type, because special
handling was required for types such as ConstantArrayType that needed to be
lowered to fixed-length arrays. See D83553 for more information on the issues
with that approach.

In this implementation VLSTs are represented as VectorType in the AST and
fixed-length vectors in the IR everywhere except in function args/return.
Predicates are represented with i8 as they were in D83553 to avoid layout
issues in structs. For example, in the following C code:

#if __ARM_FEATURE_SVE_BITS==512
typedef svbool_t fixed_bool_t __attribute__((arm_sve_vector_bits(512)));
#endif

fixed_bool_t becomes <8 x i8> in the IR.
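
The arithmetic behind that lowering can be sketched as follows. An SVE predicate holds one bit per byte of the data vector, so a 512-bit vector has a 64-bit predicate, which fits in eight i8 elements. The helper names are hypothetical and are not taken from the patch; this is only a worked illustration of the numbers above.

```cpp
#include <cassert>

// Hypothetical helper: a predicate has one bit for every byte of the
// corresponding data vector.
inline unsigned predicateBits(unsigned VectorBits) {
  assert(VectorBits % 128 == 0 &&
         "SVE vector lengths are multiples of 128 bits");
  return VectorBits / 8;
}

// Hypothetical helper: number of i8 elements needed to hold the predicate,
// matching the <8 x i8> example for a 512-bit vector.
inline unsigned predicateI8Elements(unsigned VectorBits) {
  return predicateBits(VectorBits) / 8;
}
```

For example, predicateI8Elements(512) gives 8 and predicateI8Elements(128) gives 2.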

In function args/return VLSTs are coerced from fixed to scalable vectors. This
is implemented through the AArch64 ABI in TargetInfo. As BuiltinType::SveBool
and BuiltinType::SveUint8 are both represented as a VectorType of element
type BuiltinType::UChar, to support this in the ABI two new vector kinds were
required to distinguish between predicates and data vectors.
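
A minimal standalone sketch of why the extra vector kinds are needed. The enumerator names follow the VectorKind values added in Type.h and the type names follow the mangler in this diff, but EltKind and sveTypeName are hypothetical stand-ins, not Clang's API.

```cpp
#include <cassert>
#include <string>

// Hypothetical, reduced model of the element type and vector kind.
enum class EltKind { SChar, UChar };
enum class VectorKind {
  SveFixedLengthDataVector,
  SveFixedLengthPredicateVector
};

// With an unsigned char element type, only the vector kind tells a data
// vector (svuint8_t) apart from a predicate (svbool_t).
inline std::string sveTypeName(EltKind Elt, VectorKind Kind) {
  if (Elt == EltKind::SChar) {
    assert(Kind == VectorKind::SveFixedLengthDataVector &&
           "predicates use unsigned char elements");
    return "__SVInt8_t";
  }
  return Kind == VectorKind::SveFixedLengthDataVector ? "__SVUint8_t"
                                                      : "__SVBool_t";
}
```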

Casting between VLAT/VLST is handled by the CK_BitCast operation and this has
been extended in CodeGen to support the new vector kinds, where the cast is
implemented through memory rather than a bitcast, which is unsupported.
Implementing this as a normal bitcast would require relaxing checks in LLVM to
allow bitcasting between scalable and fixed types. Another option was adding
target-specific intrinsics, although codegen support would need to be added for
these intrinsics. Given this, casting through memory seemed like the best
approach as it's supported today and existing optimisations may remove
unnecessary loads/stores, although there is room for improvement here.
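
As a rough analogy in plain C++, casting through memory is a store of the source value followed by a load of the destination type. The two types below are hypothetical stand-ins of equal size. They are not SVE types, and this is not the code the patch emits.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical stand-ins: two different types that occupy the same 64 bytes.
using FixedBytes = std::array<std::uint8_t, 64>;
struct OtherView {
  std::uint32_t Lanes[16];
};

// Convert by copying the object representation through memory instead of
// reinterpreting the value directly.
inline OtherView castThroughMemory(const FixedBytes &Src) {
  static_assert(sizeof(OtherView) == sizeof(FixedBytes), "sizes must match");
  OtherView Dst;
  std::memcpy(&Dst, Src.data(), sizeof(Dst));
  return Dst;
}

// The reverse direction works the same way.
inline FixedBytes castBack(const OtherView &Src) {
  FixedBytes Dst;
  std::memcpy(Dst.data(), &Src, sizeof(Src));
  return Dst;
}
```

A round trip through both helpers returns the original bytes, which is the property the cast relies on.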

The semantics implemented in D83551 are changed as the AttributedType has been
replaced by VectorType in the AST. When VLSTs were represented as sizeless
types only minimal changes in Sema were necessary to permit them in places such
as structs, and no changes were required for implicit casting as the canonical
types were the same.

The AArch64 ACLE states VLSTs defined by the attribute map to the same AAPCS64
types as the sizeless variants. In the previous approach mangling wasn't
necessary as the canonical types were the same. The mangling scheme is defined
in the appendices to the Procedure Call Standard for the Arm Architecture, see

https://github.com/ARM-software/abi-aa/blob/master/aapcs64/aapcs64.rst#appendix-c-mangling
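
For illustration, a sketch of the encoding, assuming the sizeless SVE types use the Itanium vendor-extended builtin form ('u', then the length of the name, then the name), which is how AAPCS64 lists them (for example u10__SVInt8_t). The function name is hypothetical and is not the mangler code from this patch.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper: vendor-extended builtin type encoding,
// 'u' <length of name> <name>.
inline std::string mangleVendorBuiltin(const std::string &TypeName) {
  assert(!TypeName.empty() && "type name must not be empty");
  return "u" + std::to_string(TypeName.size()) + TypeName;
}
```

Under that assumption a fixed-length svint8_t typedef and the sizeless svint8_t would both mangle via __SVInt8_t.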

For more information on the attribute see:

https://developer.arm.com/documentation/100987/latest

NOTE: This patch is intended as a prototype to demonstrate the approach. It is quite a large patch containing a number of changes; if the approach is considered valid, the plan is to break it up into separate patches.

Diff Detail

Unit Tests: Failed

	Time	Test
	110 ms	windows > LLVM.Transforms/InstCombine::xor.ll

Event Timeline

c-rhodes created this revision. Aug 3 2020, 5:41 AM

Herald added a reviewer: rengolin. Aug 3 2020, 5:41 AM

Herald added a reviewer: aaron.ballman.

Herald added projects: Restricted Project, Restricted Project.

Herald added subscribers: aaron.ballman, danielkiss, psnobl and 5 others.

c-rhodes requested review of this revision. Aug 3 2020, 5:41 AM

c-rhodes mentioned this in D83553: [PATCH 3/4][Sema][AArch64] Add codegen for arm_sve_vector_bits attribute. Aug 3 2020, 5:47 AM

Harbormaster completed remote builds in B66762: Diff 282560. Aug 3 2020, 6:40 AM

Stupid questions.

Is it for convenience? You get arrays, global variables, structs, ... . Vectorization becomes easier ...
Are there any potential performance benefits over scalable vectors?
Is it compatible with GCC?

Not going to write detailed review comments, but this looks like the right approach in general.

One high-level thing to consider: we could still decide that in IR generation, we want to represent VLSTs in registers using scalable vector types, like the original patch did. This would allow avoiding the awkward "bitcast" implementation. That interacts with a relatively narrow slice of clang CodeGen, though; we could easily change it later without impacting the rest of the changes.

In D85128#2191108, @tschuett wrote:

Stupid questions.

Is it for convenience? You get arrays, global variables, structs, ... . Vectorization becomes easier ...

Yes, this allows the definition of types that can be used in constructs where sizeless types cannot. In earlier revisions of the ACLE there was the concept of sizeless structs, defined by a __sizeless_struct keyword, that could have members of sizeless type in addition to members of sized type. There was pushback on the idea of sizeless aggregates in general, and that idea was dropped. If you're interested in the background there's more information here [1][2].

[1] https://gcc.gnu.org/legacy-ml/gcc/2019-11/msg00088.html
[2] https://gcc.gnu.org/legacy-ml/gcc-patches/2018-10/msg00868.html

Are there any potential performance benefits over scalable vectors?

If VLSTs are represented as fixed-length vectors in LLVM as they are in this prototype then hopefully we can take advantage of existing optimisations, although I think there is work to be done there especially around the interaction with scalable vectors and supporting those in existing passes. This patch required a few changes to existing passes to bail out for scalable vectors so we're already hitting parts of the codebase we've yet to hit that would be good candidates for optimisation work. This also ties into the fixed-length code generation work @paulwalker-arm has been doing which is still early days. I'm not sure if that answers your question, but ultimately the compiler should have more information about these types given the vector size is explicit so it should be able to do a better job at optimisation.

Is it compatible with GCC?

Support for this attribute landed in GCC 10 and it's more complete than what this patch implements. We've yet to implement the behaviour guarded by the __ARM_FEATURE_SVE_VECTOR_OPERATORS and __ARM_FEATURE_SVE_PREDICATE_OPERATORS feature macros, so the GNU __attribute__((vector_size)) extension is not available and operators such as binary '+' are not supported for VLSTs. Support for this is intended to be addressed by later patches.
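
For reference, this is what the GNU vector extension and its operators look like on a generic fixed-size vector. This is only a sketch of the extension itself on a non-SVE type; the typedef and function names are hypothetical, and it does not show VLST behaviour.

```cpp
#include <cassert>
#include <cstdint>

// GNU vector extension (GCC and Clang): a generic 128-bit vector of four
// 32-bit integers. This is not an SVE type.
typedef std::int32_t v4i32 __attribute__((vector_size(16)));

// Binary '+' on such vectors is applied element by element.
inline v4i32 addVectors(v4i32 A, v4i32 B) { return A + B; }
```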

In D85128#2191401, @efriedma wrote:

Not going to write detailed review comments, but this looks like the right approach in general.

Thanks for taking a look! I'll split this up into separate patches soon.

One high-level thing to consider: we could still decide that in IR generation, we want to represent VLSTs in registers using scalable vector types, like the original patch did. This would allow avoiding the awkward "bitcast" implementation. That interacts with a relatively narrow slice of clang CodeGen, though; we could easily change it later without impacting the rest of the changes.

Yeah, now that the VLST is part of the canonical type with the new vector kinds, we have more information if we were to go the CodeGenTypes route, if that's what you're referring to as the narrow slice of CodeGen. That would still require converting between VLAT/VLST. I quite like this approach as it gives me more confidence we're not missing bitcasts when doing it as part of a cast operation. I guess with what you're suggesting the bitcast could still be emitted there but the cast operations could be limited in Sema to cases where ultimately ConvertType would return a type that requires bitcasting, or are you saying that could be avoided completely?

In D85128#2192867, @c-rhodes wrote:

In D85128#2191108, @tschuett wrote:

Is it compatible with GCC?

Support for this attribute landed in GCC 10 and it's more complete than what this patch implements. We've yet to implement the behaviour guarded by the __ARM_FEATURE_SVE_VECTOR_OPERATORS and __ARM_FEATURE_SVE_PREDICATE_OPERATORS feature macros, so the GNU __attribute__((vector_size)) extension is not available and operators such as binary '+' are not supported for VLSTs. Support for this is intended to be addressed by later patches.

Just to clarify, GCC doesn't have support for vectors of booleans or operations on them (__ARM_FEATURE_SVE_PREDICATE_OPERATORS) yet either but does support the behaviour indicated by the other macro.

Sorry. I meant ABI. Can I link GCC .o files with Clang .o files that use the attribute?

I guess with what you're suggesting the bitcast could still be emitted there but the cast operations could be limited in Sema to cases where ultimately ConvertType would return a type that requires bitcasting, or are you saying that could be avoided completely?

The bitcast operation would exist in Sema either way; it's necessary for the types to stay consistent. My suggestion is just that bitcasting between a VLAT and the corresponding VLST would be a no-op in CodeGen.

My suggestion is similar to the way the "bool" type works: in memory, it's an i8, but when you load it, it's truncated to i1.
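
The "bool" analogy can be sketched in plain C++ as two representations of the same value, one for memory and one for use after a load. The helper names are hypothetical and this only models the idea; it is not Clang's CodeGen.

```cpp
#include <cassert>
#include <cstdint>

// Memory form: a bool is stored as a whole byte holding 0 or 1
// (the i8 in the analogy).
inline std::uint8_t toMemory(bool Value) { return Value ? 1 : 0; }

// Value form: after the load only the low bit is kept
// (the truncation to i1 in the analogy).
inline bool fromMemory(std::uint8_t Byte) {
  assert(Byte <= 1 && "a stored bool is expected to be 0 or 1");
  return (Byte & 1u) != 0;
}
```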

In D85128#2193309, @tschuett wrote:

Sorry. I meant ABI. Can I link GCC .o files with Clang .o files that use the attribute?

Yes, they should be compatible. The machine-level ABI distinguishes 4 types of SVE vector [1]:

VG×64-bit vector of 8-bit elements
VG×64-bit vector of 16-bit elements
VG×64-bit vector of 32-bit elements
VG×64-bit vector of 64-bit elements

The VLS and VLA types are defined by the ACLE to map to the same machine-level SVE vectors (in both compilers). Hence the mangling changes in this patch.

[1] https://github.com/ARM-software/abi-aa/blob/master/aapcs64/aapcs64.rst#fundamental-data-types
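
To make the VG notation above concrete: VG is the vector length in 64-bit granules, so a 512-bit vector has VG = 8. The sketch below works out the lane counts for a given vector length; the helper names are hypothetical.

```cpp
#include <cassert>

// Hypothetical helper: number of 64-bit granules (VG) in a vector.
inline unsigned granules(unsigned VectorBits) {
  assert(VectorBits % 128 == 0 &&
         "SVE vector lengths are multiples of 128 bits");
  return VectorBits / 64;
}

// Hypothetical helper: number of elements in a VG x 64-bit vector with the
// given element width.
inline unsigned lanes(unsigned VectorBits, unsigned ElementBits) {
  assert((ElementBits == 8 || ElementBits == 16 || ElementBits == 32 ||
          ElementBits == 64) &&
         "SVE data elements are 8, 16, 32 or 64 bits wide");
  return granules(VectorBits) * 64 / ElementBits;
}
```

With a 512-bit vector this gives 64, 32, 16 and 8 lanes for 8-, 16-, 32- and 64-bit elements respectively.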

c-rhodes mentioned this in D85725: [Transforms][SROA] Skip uses of allocas where the type is scalable. Aug 11 2020, 5:56 AM

c-rhodes mentioned this in D85736: [Sema][AArch64] Support arm_sve_vector_bits attribute. Aug 11 2020, 7:59 AM

c-rhodes mentioned this in D85743: [CodeGen][AArch64] Support arm_sve_vector_bits attribute. Aug 11 2020, 9:01 AM

c-rhodes mentioned this in rG511d5aaca37a: [Transforms][SROA] Skip uses of allocas where the type is scalable. Aug 12 2020, 2:36 AM

c-rhodes mentioned this in D85848: [InlineCost] Fix scalable vectors in visitAlloca. Aug 12 2020, 11:16 AM

c-rhodes mentioned this in rG2ccde3c96b78: [InlineCost] Fix scalable vectors in visitAlloca. Aug 17 2020, 3:35 AM

c-rhodes mentioned this in rGfeed5a7239d8: [Sema][AArch64] Support arm_sve_vector_bits attribute. Aug 27 2020, 3:39 AM

Closing this now the prototype has been split into separate patches that have landed.

Revision Contents

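Path	Size
clang/include/clang/AST/ASTContext.h	4 lines
clang/include/clang/AST/Type.h	14 lines
clang/include/clang/Basic/Attr.td	2 lines
clang/include/clang/Basic/DiagnosticSemaKinds.td	2 lines
clang/include/clang/Sema/Sema.h	10 lines
clang/lib/AST/ASTContext.cpp	52 lines
clang/lib/AST/ItaniumMangle.cpp	81 lines
clang/lib/AST/JSONNodeDumper.cpp	3 lines
clang/lib/AST/TextNodeDumper.cpp	3 lines
clang/lib/AST/Type.cpp	39 lines
clang/lib/AST/TypePrinter.cpp	39 lines
clang/lib/CodeGen/CGCall.cpp	44 lines
clang/lib/CodeGen/CGExprScalar.cpp	28 lines
clang/lib/CodeGen/TargetInfo.cpp	123 lines
clang/lib/Sema/SemaDecl.cpp	2 lines
clang/lib/Sema/SemaExpr.cpp	52 lines
clang/lib/Sema/SemaType.cpp	28 lines
clang/test/CodeGen/attr-arm-sve-vector-bits-bitcast.c	278 lines
clang/test/CodeGen/attr-arm-sve-vector-bits-call.c	337 lines
clang/test/CodeGen/attr-arm-sve-vector-bits-cast.c	109 lines
clang/test/CodeGen/attr-arm-sve-vector-bits-codegen.c	117 lines
clang/test/CodeGen/attr-arm-sve-vector-bits-globals.c	120 lines
clang/test/CodeGen/attr-arm-sve-vector-bits-types.c	581 lines
clang/test/CodeGenCXX/aarch64-sve-fixedtypeinfo.cpp	75 lines
clang/test/Sema/attr-arm-sve-vector-bits.c	18 lines
llvm/lib/Analysis/InlineCost.cpp	4 lines
llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp	4 lines
llvm/lib/Transforms/Scalar/SROA.cpp	8 lines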

Diff 282560

clang/include/clang/AST/ASTContext.h

@@ ... @@ if (Ty->isIncompleteType() || Ty->isDependentType())
       return None;
     return getTypeSizeInChars(Ty);
   }

   Optional<CharUnits> getTypeSizeInCharsIfKnown(const Type *Ty) const {
     return getTypeSizeInCharsIfKnown(QualType(Ty, 0));
   }

-  /// Returns the bitwidth of \p T, an SVE type attributed with
-  /// 'arm_sve_vector_bits'. Should only be called if T->isVLST().
-  unsigned getBitwidthForAttributedSveType(const Type *T) const;
-
   /// Return the ABI-specified alignment of a (complete) type \p T, in
   /// bits.
   unsigned getTypeAlign(QualType T) const { return getTypeInfo(T).Align; }
   unsigned getTypeAlign(const Type *T) const { return getTypeInfo(T).Align; }

   /// Return the ABI-specified natural alignment of a (complete) type \p T,
   /// before alignment adjustments, in bits.
   ///

clang/include/clang/AST/Type.h

@@ ... @@ public:
   /// every type is one or the other. Standard types are all sized;
   /// sizeless types are purely an extension.
   ///
   /// Sizeless types contain data with no specified size, alignment,
   /// or layout.
   bool isSizelessType() const;
   bool isSizelessBuiltinType() const;

-  /// Determines if this is a vector-length-specific type (VLST), i.e. a
-  /// sizeless type with the 'arm_sve_vector_bits' attribute applied.
-  bool isVLST() const;
   /// Determines if this is a sizeless type supported by the
   /// 'arm_sve_vector_bits' type attribute, which can be applied to a single
   /// SVE vector or predicate, excluding tuple types such as svint32x4_t.
   bool isVLSTBuiltinType() const;

+  /// Returns the representive type for the element of an SVE builtin type.
+  /// This is used to represent fixed-length SVE vectors created with the
+  /// 'arm_sve_vector_bits' type attribute as VectorType.
+  QualType getFixedLengthSveEltType(const ASTContext &Ctx) const;
+
   /// Types are partitioned into 3 broad categories (C99 6.2.5p1):
   /// object types, function types, and incomplete types.

   /// Return true if this is an incomplete type.
   /// A type that can describe objects, but which lacks information needed to
   /// determine its size (e.g. void, or a fwd declared struct). Clients of this
   /// routine will need to determine if the size is actually required.
   ///
@@ ... @@ enum VectorKind {

     /// is AltiVec 'vector bool ...'
     AltiVecBool,

     /// is ARM Neon vector
     NeonVector,

     /// is ARM Neon polynomial vector
-    NeonPolyVector
+    NeonPolyVector,
+
+    /// is ARM fixed-length scalable vector
+    SveFixedLengthDataVector,
+    SveFixedLengthPredicateVector
   };

 protected:
   friend class ASTContext; // ASTContext creates these.

   /// The element type of the vector.
   QualType ElementType;

clang/include/clang/Basic/Attr.td

@@ ... @@
 }

 def ArmSveVectorBits : TypeAttr {
   let Spellings = [GNU<"arm_sve_vector_bits">];
   let Subjects = SubjectList<[TypedefName], ErrorDiag>;
   let Args = [UnsignedArgument<"NumBits">];
   let Documentation = [ArmSveVectorBitsDocs];
   let PragmaAttributeSupport = 0;
+  // Represented as VectorType instead.
+  let ASTNode = 0;
 }

 def ArmMveStrictPolymorphism : TypeAttr, TargetSpecificAttr<TargetARM> {
   let Spellings = [Clang<"__clang_arm_mve_strict_polymorphism">];
   let Documentation = [ArmMveStrictPolymorphismDocs];
 }

 def NoUniqueAddress : InheritableAttr, TargetSpecificAttr<TargetItaniumCXXABI> {

clang/include/clang/Basic/DiagnosticSemaKinds.td

@@ ... @@
 def err_callback_implicit_this_not_available : Error<
   "'callback' argument at position %0 references unavailable implicit 'this'">;
 def err_init_method_bad_return_type : Error<
   "init methods must return an object pointer type, not %0">;
 def err_attribute_invalid_size : Error<
   "vector size not an integral multiple of component size">;
 def err_attribute_zero_size : Error<"zero %0 size">;
 def err_attribute_size_too_large : Error<"%0 size too large">;
+def err_typecheck_vector_not_convertable_sizeless : Error<
+  "cannot convert between fixed-length and sizeless vector (%0 and %1)">;
 def err_typecheck_vector_not_convertable_implict_truncation : Error<
   "cannot convert between %select{scalar|vector}0 type %1 and vector type"
   " %2 as implicit conversion would cause truncation">;
 def err_typecheck_vector_not_convertable : Error<
   "cannot convert between vector values of different size (%0 and %1)">;
 def err_typecheck_vector_not_convertable_non_scalar : Error<
   "cannot convert between vector and non-scalar values (%0 and %1)">;
 def err_typecheck_vector_lengths_not_equal : Error<

clang/include/clang/Sema/Sema.h

@@ ... @@ bool RequireCompleteType(SourceLocation Loc, QualType T, unsigned DiagID,
     BoundTypeDiagnoser<Ts...> Diagnoser(DiagID, Args...);
     return RequireCompleteType(Loc, T, Diagnoser);
   }

   template <typename... Ts>
   bool RequireCompleteSizedType(SourceLocation Loc, QualType T, unsigned DiagID,
                                 const Ts &... Args) {
     SizelessTypeDiagnoser<Ts...> Diagnoser(DiagID, Args...);
-    CompleteTypeKind Kind = CompleteTypeKind::Normal;
-    if (T->isVLST())
-      Kind = CompleteTypeKind::AcceptSizeless;
-    return RequireCompleteType(Loc, T, Kind, Diagnoser);
+    return RequireCompleteType(Loc, T, CompleteTypeKind::Normal, Diagnoser);
   }

   void completeExprArrayBound(Expr *E);
   bool RequireCompleteExprType(Expr *E, CompleteTypeKind Kind,
                                TypeDiagnoser &Diagnoser);
   bool RequireCompleteExprType(Expr *E, unsigned DiagID);

   template <typename... Ts>
   bool RequireCompleteExprType(Expr *E, unsigned DiagID, const Ts &...Args) {
     BoundTypeDiagnoser<Ts...> Diagnoser(DiagID, Args...);
     return RequireCompleteExprType(E, CompleteTypeKind::Default, Diagnoser);
   }

   template <typename... Ts>
   bool RequireCompleteSizedExprType(Expr *E, unsigned DiagID,
                                     const Ts &... Args) {
     SizelessTypeDiagnoser<Ts...> Diagnoser(DiagID, Args...);
-    CompleteTypeKind Kind = CompleteTypeKind::Normal;
-    if (E->getType()->isVLST())
-      Kind = CompleteTypeKind::AcceptSizeless;
-    return RequireCompleteExprType(E, Kind, Diagnoser);
+    return RequireCompleteExprType(E, CompleteTypeKind::Normal, Diagnoser);
   }

   bool RequireLiteralType(SourceLocation Loc, QualType T,
                           TypeDiagnoser &Diagnoser);
   bool RequireLiteralType(SourceLocation Loc, QualType T, unsigned DiagID);

   template <typename... Ts>
   bool RequireLiteralType(SourceLocation Loc, QualType T, unsigned DiagID,

clang/lib/AST/ASTContext.cpp

@@ ... @@ if (I != MemoizedTypeInfo.end())
     return I->second;

   // This call can invalidate MemoizedTypeInfo[T], so we need a second lookup.
   TypeInfo TI = getTypeInfoImpl(T);
   MemoizedTypeInfo[T] = TI;
   return TI;
 }

-static unsigned getSveVectorWidth(const Type *T) {
-  // Get the vector size from the 'arm_sve_vector_bits' attribute via the
-  // AttributedTypeLoc associated with the typedef decl.
-  if (const auto *TT = T->getAs<TypedefType>()) {
-    const TypedefNameDecl *Typedef = TT->getDecl();
-    TypeSourceInfo *TInfo = Typedef->getTypeSourceInfo();
-    TypeLoc TL = TInfo->getTypeLoc();
-    if (AttributedTypeLoc ATL = TL.getAs<AttributedTypeLoc>())
-      if (const auto *Attr = ATL.getAttrAs<ArmSveVectorBitsAttr>())
-        return Attr->getNumBits();
-  }
-
-  llvm_unreachable("bad 'arm_sve_vector_bits' attribute!");
-}
-
-static unsigned getSvePredWidth(const ASTContext &Context, const Type *T) {
-  return getSveVectorWidth(T) / Context.getCharWidth();
-}
-
-unsigned ASTContext::getBitwidthForAttributedSveType(const Type *T) const {
-  assert(T->isVLST() &&
-         "getBitwidthForAttributedSveType called for non-attributed type!");
-
-  switch (T->castAs<BuiltinType>()->getKind()) {
-  default:
-    llvm_unreachable("unknown builtin type!");
-  case BuiltinType::SveInt8:
-  case BuiltinType::SveInt16:
-  case BuiltinType::SveInt32:
-  case BuiltinType::SveInt64:
-  case BuiltinType::SveUint8:
-  case BuiltinType::SveUint16:
-  case BuiltinType::SveUint32:
-  case BuiltinType::SveUint64:
-  case BuiltinType::SveFloat16:
-  case BuiltinType::SveFloat32:
-  case BuiltinType::SveFloat64:
-  case BuiltinType::SveBFloat16:
-    return getSveVectorWidth(T);
-  case BuiltinType::SveBool:
-    return getSvePredWidth(*this, T);
-  }
-}
-
 /// getTypeInfoImpl - Return the size of the specified type, in bits. This
 /// method does not work on incomplete types.
 ///
 /// FIXME: Pointers into different addr spaces could have different sizes and
 /// alignment requirements: getPointerInfo should take an AddrSpace, this
 /// should take a QualType, &c.
 TypeInfo ASTContext::getTypeInfoImpl(const Type *T) const {
   uint64_t Width = 0;
@@ ... @@ case Type::Vector: {
     if (Align & (Align-1)) {
       Align = llvm::NextPowerOf2(Align);
       Width = llvm::alignTo(Width, Align);
     }
     // Adjust the alignment based on the target max.
     uint64_t TargetVectorAlign = Target->getMaxVectorAlign();
     if (TargetVectorAlign && TargetVectorAlign < Align)
       Align = TargetVectorAlign;
+    // Adjust the alignment for fixed-length SVE predicates.
+    if (VT->getVectorKind() == VectorType::SveFixedLengthPredicateVector)
+      Align = 16;
     break;
   }

   case Type::ConstantMatrix: {
     const auto *MT = cast<ConstantMatrixType>(T);
     TypeInfo ElementInfo = getTypeInfo(MT->getElementType());
     // The internal layout of a matrix value is implementation defined.
     // Initially be ABI compatible with arrays with respect to alignment and
@@ ... @@ case Type::Typedef: {
     // attribute(aligned) can only round up) but matches its implementation.
     if (unsigned AttrAlign = Typedef->getMaxAlignment()) {
       Align = AttrAlign;
       AlignIsRequired = true;
     } else {
       Align = Info.Align;
       AlignIsRequired = Info.AlignIsRequired;
     }
-    if (T->isVLST())
-      Width = getBitwidthForAttributedSveType(T);
-    else
-      Width = Info.Width;
+    Width = Info.Width;
     break;
   }

   case Type::Elaborated:
     return getTypeInfo(cast<ElaboratedType>(T)->getNamedType().getTypePtr());

   case Type::Attributed:
     return getTypeInfo(

clang/lib/AST/ItaniumMangle.cpp

Show First 20 Lines • Show All 525 Lines • ▼ Show 20 Lines	#include "clang/AST/TypeNodes.inc"
void mangleExtParameterInfo(FunctionProtoType::ExtParameterInfo info);		void mangleExtParameterInfo(FunctionProtoType::ExtParameterInfo info);
void mangleExtFunctionInfo(const FunctionType *T);		void mangleExtFunctionInfo(const FunctionType *T);
void mangleBareFunctionType(const FunctionProtoType *T, bool MangleReturnType,		void mangleBareFunctionType(const FunctionProtoType *T, bool MangleReturnType,
const FunctionDecl *FD = nullptr);		const FunctionDecl *FD = nullptr);
void mangleNeonVectorType(const VectorType *T);		void mangleNeonVectorType(const VectorType *T);
void mangleNeonVectorType(const DependentVectorType *T);		void mangleNeonVectorType(const DependentVectorType *T);
void mangleAArch64NeonVectorType(const VectorType *T);		void mangleAArch64NeonVectorType(const VectorType *T);
void mangleAArch64NeonVectorType(const DependentVectorType *T);		void mangleAArch64NeonVectorType(const DependentVectorType *T);
		void mangleAArch64FixedSveVectorType(const VectorType *T);
		void mangleAArch64FixedSveVectorType(const DependentVectorType *T);

void mangleIntegerLiteral(QualType T, const llvm::APSInt &Value);		void mangleIntegerLiteral(QualType T, const llvm::APSInt &Value);
void mangleMemberExprBase(const Expr *base, bool isArrow);		void mangleMemberExprBase(const Expr *base, bool isArrow);
void mangleMemberExpr(const Expr *base, bool isArrow,		void mangleMemberExpr(const Expr *base, bool isArrow,
NestedNameSpecifier *qualifier,		NestedNameSpecifier *qualifier,
NamedDecl *firstQualifierLookup,		NamedDecl *firstQualifierLookup,
DeclarationName name,		DeclarationName name,
const TemplateArgumentLoc *TemplateArgs,		const TemplateArgumentLoc *TemplateArgs,
▲ Show 20 Lines • Show All 2,751 Lines • ▼ Show 20 Lines
void CXXNameMangler::mangleAArch64NeonVectorType(const DependentVectorType *T) {		void CXXNameMangler::mangleAArch64NeonVectorType(const DependentVectorType *T) {
DiagnosticsEngine &Diags = Context.getDiags();		DiagnosticsEngine &Diags = Context.getDiags();
unsigned DiagID = Diags.getCustomDiagID(		unsigned DiagID = Diags.getCustomDiagID(
DiagnosticsEngine::Error,		DiagnosticsEngine::Error,
"cannot mangle this dependent neon vector type yet");		"cannot mangle this dependent neon vector type yet");
Diags.Report(T->getAttributeLoc(), DiagID);		Diags.Report(T->getAttributeLoc(), DiagID);
}		}

		// The AArch64 ACLE specifies that fixed-length SVE vector and predicate types
		// defined with the 'arm_sve_vector_bits' attribute map to the same AAPCS64
		// type as the sizeless variants. The mangling scheme is defined in the
		// appendices to the Procedure Call Standard for the Arm Architecture, see:
		// https://github.com/ARM-software/abi-aa/blob/master/aapcs64/aapcs64.rst#appendix-c-mangling
		void CXXNameMangler::mangleAArch64FixedSveVectorType(const VectorType *T) {
		assert((T->getVectorKind() == VectorType::SveFixedLengthDataVector ||
		T->getVectorKind() == VectorType::SveFixedLengthPredicateVector) &&
		"expected fixed-length SVE vector!");
		QualType EltType = T->getElementType();
		assert(EltType->isBuiltinType() &&
		"expected builtin type for fixed-length SVE vector!");

		StringRef TypeName;
		switch (cast<BuiltinType>(EltType)->getKind()) {
		case BuiltinType::SChar:
		TypeName = "__SVInt8_t";
		break;
		case BuiltinType::UChar: {
		if (T->getVectorKind() == VectorType::SveFixedLengthDataVector)
		TypeName = "__SVUint8_t";
		else
		TypeName = "__SVBool_t";
		break;
		}
		case BuiltinType::Short:
		TypeName = "__SVInt16_t";
		break;
		case BuiltinType::UShort:
		TypeName = "__SVUint16_t";
		break;
		case BuiltinType::Int:
		TypeName = "__SVInt32_t";
		break;
		case BuiltinType::UInt:
		TypeName = "__SVUint32_t";
		break;
		case BuiltinType::Long:
		TypeName = "__SVInt64_t";
		break;
		case BuiltinType::ULong:
		TypeName = "__SVUint64_t";
		break;
		case BuiltinType::Float16:
		TypeName = "__SVFloat16_t";
		break;
		case BuiltinType::Float:
		TypeName = "__SVFloat32_t";
		break;
		case BuiltinType::Double:
		TypeName = "__SVFloat64_t";
		break;
		case BuiltinType::BFloat16:
		TypeName = "__SVBfloat16_t";
		break;
		default:
		llvm_unreachable("unexpected element type for fixed-length SVE vector!");
		}

		Out << 'u' << TypeName.size() << TypeName;
		}

		void CXXNameMangler::mangleAArch64FixedSveVectorType(
		const DependentVectorType *T) {
		DiagnosticsEngine &Diags = Context.getDiags();
		unsigned DiagID = Diags.getCustomDiagID(
		DiagnosticsEngine::Error,
		"cannot mangle this dependent fixed-length SVE vector type yet");
		Diags.Report(T->getAttributeLoc(), DiagID);
		}

// GNU extension: vector types		// GNU extension: vector types
// <type> ::= <vector-type>		// <type> ::= <vector-type>
// <vector-type> ::= Dv <positive dimension number> _		// <vector-type> ::= Dv <positive dimension number> _
// <extended element type>		// <extended element type>
// ::= Dv [<dimension expression>] _ <element type>		// ::= Dv [<dimension expression>] _ <element type>
// <extended element type> ::= <element type>		// <extended element type> ::= <element type>
// ::= p # AltiVec vector pixel		// ::= p # AltiVec vector pixel
// ::= b # Altivec vector bool		// ::= b # Altivec vector bool
void CXXNameMangler::mangleType(const VectorType *T) {		void CXXNameMangler::mangleType(const VectorType *T) {
if ((T->getVectorKind() == VectorType::NeonVector ||		if ((T->getVectorKind() == VectorType::NeonVector ||
T->getVectorKind() == VectorType::NeonPolyVector)) {		T->getVectorKind() == VectorType::NeonPolyVector)) {
llvm::Triple Target = getASTContext().getTargetInfo().getTriple();		llvm::Triple Target = getASTContext().getTargetInfo().getTriple();
llvm::Triple::ArchType Arch =		llvm::Triple::ArchType Arch =
getASTContext().getTargetInfo().getTriple().getArch();		getASTContext().getTargetInfo().getTriple().getArch();
if ((Arch == llvm::Triple::aarch64 ||		if ((Arch == llvm::Triple::aarch64 ||
Arch == llvm::Triple::aarch64_be) && !Target.isOSDarwin())		Arch == llvm::Triple::aarch64_be) && !Target.isOSDarwin())
mangleAArch64NeonVectorType(T);		mangleAArch64NeonVectorType(T);
else		else
mangleNeonVectorType(T);		mangleNeonVectorType(T);
return;		return;
		} else if (T->getVectorKind() == VectorType::SveFixedLengthDataVector ||
		T->getVectorKind() == VectorType::SveFixedLengthPredicateVector) {
		mangleAArch64FixedSveVectorType(T);
		return;
}		}
Out << "Dv" << T->getNumElements() << '_';		Out << "Dv" << T->getNumElements() << '_';
if (T->getVectorKind() == VectorType::AltiVecPixel)		if (T->getVectorKind() == VectorType::AltiVecPixel)
Out << 'p';		Out << 'p';
else if (T->getVectorKind() == VectorType::AltiVecBool)		else if (T->getVectorKind() == VectorType::AltiVecBool)
Out << 'b';		Out << 'b';
else		else
mangleType(T->getElementType());		mangleType(T->getElementType());
}		}

void CXXNameMangler::mangleType(const DependentVectorType *T) {		void CXXNameMangler::mangleType(const DependentVectorType *T) {
if ((T->getVectorKind() == VectorType::NeonVector ||		if ((T->getVectorKind() == VectorType::NeonVector ||
T->getVectorKind() == VectorType::NeonPolyVector)) {		T->getVectorKind() == VectorType::NeonPolyVector)) {
llvm::Triple Target = getASTContext().getTargetInfo().getTriple();		llvm::Triple Target = getASTContext().getTargetInfo().getTriple();
llvm::Triple::ArchType Arch =		llvm::Triple::ArchType Arch =
getASTContext().getTargetInfo().getTriple().getArch();		getASTContext().getTargetInfo().getTriple().getArch();
if ((Arch == llvm::Triple::aarch64 || Arch == llvm::Triple::aarch64_be) &&		if ((Arch == llvm::Triple::aarch64 || Arch == llvm::Triple::aarch64_be) &&
!Target.isOSDarwin())		!Target.isOSDarwin())
mangleAArch64NeonVectorType(T);		mangleAArch64NeonVectorType(T);
else		else
mangleNeonVectorType(T);		mangleNeonVectorType(T);
return;		return;
		} else if (T->getVectorKind() == VectorType::SveFixedLengthDataVector ||
		T->getVectorKind() == VectorType::SveFixedLengthPredicateVector) {
		mangleAArch64FixedSveVectorType(T);
		return;
}		}

Out << "Dv";		Out << "Dv";
mangleExpression(T->getSizeExpr());		mangleExpression(T->getSizeExpr());
Out << '_';		Out << '_';
if (T->getVectorKind() == VectorType::AltiVecPixel)		if (T->getVectorKind() == VectorType::AltiVecPixel)
Out << 'p';		Out << 'p';
else if (T->getVectorKind() == VectorType::AltiVecBool)		else if (T->getVectorKind() == VectorType::AltiVecBool)
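
The encoding emitted by `mangleAArch64FixedSveVectorType` in the ItaniumMangle.cpp hunk above is `u`, then the length of the type name, then the name itself. A minimal standalone sketch of that string construction follows. `mangleVendorType` is a hypothetical helper used only for illustration; it is not part of Clang and does not touch the real `CXXNameMangler`.

```cpp
#include <string>

// Hypothetical helper (not Clang API). Mirrors the statement
//   Out << 'u' << TypeName.size() << TypeName;
// from the patch, building the result as a std::string instead of
// writing to the mangler's output stream.
std::string mangleVendorType(const std::string &TypeName) {
  return "u" + std::to_string(TypeName.size()) + TypeName;
}
```

Under this sketch, a fixed-length vector whose element type is `signed char` would be encoded from the name `__SVInt8_t`, the same name used for the sizeless `svint8_t`, which is how the patch maps both to one AAPCS64 type.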

clang/lib/AST/JSONNodeDumper.cpp

Show First 20 Lines • Show All 610 Lines • ▼ Show 20 Lines	case VectorType::AltiVecBool:
JOS.attribute("vectorKind", "altivec bool");		JOS.attribute("vectorKind", "altivec bool");
break;		break;
case VectorType::NeonVector:		case VectorType::NeonVector:
JOS.attribute("vectorKind", "neon");		JOS.attribute("vectorKind", "neon");
break;		break;
case VectorType::NeonPolyVector:		case VectorType::NeonPolyVector:
JOS.attribute("vectorKind", "neon poly");		JOS.attribute("vectorKind", "neon poly");
break;		break;
		case VectorType::SveFixedLengthDataVector:
		case VectorType::SveFixedLengthPredicateVector:
		JOS.attribute("vectorKind", "fixed-length sve");
}		}
}		}

void JSONNodeDumper::VisitUnresolvedUsingType(const UnresolvedUsingType *UUT) {		void JSONNodeDumper::VisitUnresolvedUsingType(const UnresolvedUsingType *UUT) {
JOS.attribute("decl", createBareDeclRef(UUT->getDecl()));		JOS.attribute("decl", createBareDeclRef(UUT->getDecl()));
}		}

void JSONNodeDumper::VisitUnaryTransformType(const UnaryTransformType *UTT) {		void JSONNodeDumper::VisitUnaryTransformType(const UnaryTransformType *UTT) {

clang/lib/AST/TextNodeDumper.cpp

Show First 20 Lines • Show All 1,383 Lines • ▼ Show 20 Lines	case VectorType::AltiVecBool:
OS << " altivec bool";		OS << " altivec bool";
break;		break;
case VectorType::NeonVector:		case VectorType::NeonVector:
OS << " neon";		OS << " neon";
break;		break;
case VectorType::NeonPolyVector:		case VectorType::NeonPolyVector:
OS << " neon poly";		OS << " neon poly";
break;		break;
		case VectorType::SveFixedLengthDataVector:
		case VectorType::SveFixedLengthPredicateVector:
		OS << " fixed-length sve";
}		}
OS << " " << T->getNumElements();		OS << " " << T->getNumElements();
}		}

void TextNodeDumper::VisitFunctionType(const FunctionType *T) {		void TextNodeDumper::VisitFunctionType(const FunctionType *T) {
auto EI = T->getExtInfo();		auto EI = T->getExtInfo();
if (EI.getNoReturn())		if (EI.getNoReturn())
OS << " noreturn";		OS << " noreturn";

clang/lib/AST/Type.cpp

Show First 20 Lines • Show All 2,312 Lines • ▼ Show 20 Lines	case BuiltinType::SveBool:
return true;		return true;
default:		default:
return false;		return false;
}		}
}		}
return false;		return false;
}		}

bool Type::isVLST() const {		QualType Type::getFixedLengthSveEltType(const ASTContext &Ctx) const {
if (!isVLSTBuiltinType())		assert(isVLSTBuiltinType() && "unsupported type!");
return false;

return hasAttr(attr::ArmSveVectorBits);		const BuiltinType *BTy = getAs<BuiltinType>();
		switch (BTy->getKind()) {
		default:
		llvm_unreachable("Unknown builtin SVE type!");
		case BuiltinType::SveInt8:
		return Ctx.SignedCharTy;
		case BuiltinType::SveUint8:
		case BuiltinType::SveBool:
		// Represent predicates as i8 rather than i1 to avoid any layout issues.
		// The type is bitcasted to a scalable predicate type when casting between
		// scalable and fixed-length vectors.
		return Ctx.UnsignedCharTy;
		case BuiltinType::SveInt16:
		return Ctx.ShortTy;
		case BuiltinType::SveUint16:
		return Ctx.UnsignedShortTy;
		case BuiltinType::SveInt32:
		return Ctx.IntTy;
		case BuiltinType::SveUint32:
		return Ctx.UnsignedIntTy;
		case BuiltinType::SveInt64:
		return Ctx.LongTy;
		case BuiltinType::SveUint64:
		return Ctx.UnsignedLongTy;
		case BuiltinType::SveFloat16:
		return Ctx.Float16Ty;
		case BuiltinType::SveBFloat16:
		return Ctx.BFloat16Ty;
		case BuiltinType::SveFloat32:
		return Ctx.FloatTy;
		case BuiltinType::SveFloat64:
		return Ctx.DoubleTy;
		}
}		}

bool QualType::isPODType(const ASTContext &Context) const {		bool QualType::isPODType(const ASTContext &Context) const {
// C++11 has a more relaxed definition of POD.		// C++11 has a more relaxed definition of POD.
if (Context.getLangOpts().CPlusPlus11)		if (Context.getLangOpts().CPlusPlus11)
return isCXX11PODType(Context);		return isCXX11PODType(Context);

return isCXX98PODType(Context);		return isCXX98PODType(Context);

clang/lib/AST/TypePrinter.cpp

Show First 20 Lines • Show All 649 Lines • ▼ Show 20 Lines	case VectorType::GenericVector: {
OS << "__attribute__((__vector_size__("		OS << "__attribute__((__vector_size__("
<< T->getNumElements()		<< T->getNumElements()
<< " * sizeof(";		<< " * sizeof(";
print(T->getElementType(), OS, StringRef());		print(T->getElementType(), OS, StringRef());
OS << ")))) ";		OS << ")))) ";
printBefore(T->getElementType(), OS);		printBefore(T->getElementType(), OS);
break;		break;
}		}
		case VectorType::SveFixedLengthDataVector:
		case VectorType::SveFixedLengthPredicateVector:
		// FIXME: We prefer to print the size directly here, but have no way
		// to get the size of the type.
		OS << "__attribute__((__arm_sve_vector_bits__(";

		if (T->getVectorKind() == VectorType::SveFixedLengthPredicateVector)
		// Predicates take a bit per byte of the vector size, multiply by 8 to
		// get the number of bits passed to the attribute.
		OS << T->getNumElements() * 8;
		else
		OS << T->getNumElements();

		OS << " * sizeof(";
		print(T->getElementType(), OS, StringRef());
		// Multiply by 8 for the number of bits.
		OS << ") * 8))) ";
		printBefore(T->getElementType(), OS);
}		}
}		}

void TypePrinter::printVectorAfter(const VectorType *T, raw_ostream &OS) {		void TypePrinter::printVectorAfter(const VectorType *T, raw_ostream &OS) {
printAfter(T->getElementType(), OS);		printAfter(T->getElementType(), OS);
}		}

void TypePrinter::printDependentVectorBefore(		void TypePrinter::printDependentVectorBefore(
Show All 31 Lines	case VectorType::GenericVector: {
if (T->getSizeExpr())		if (T->getSizeExpr())
T->getSizeExpr()->printPretty(OS, nullptr, Policy);		T->getSizeExpr()->printPretty(OS, nullptr, Policy);
OS << " * sizeof(";		OS << " * sizeof(";
print(T->getElementType(), OS, StringRef());		print(T->getElementType(), OS, StringRef());
OS << ")))) ";		OS << ")))) ";
printBefore(T->getElementType(), OS);		printBefore(T->getElementType(), OS);
break;		break;
}		}
		case VectorType::SveFixedLengthDataVector:
		case VectorType::SveFixedLengthPredicateVector:
		// FIXME: We prefer to print the size directly here, but have no way
		// to get the size of the type.
		OS << "__attribute__((__arm_sve_vector_bits__(";
		if (T->getSizeExpr()) {
		T->getSizeExpr()->printPretty(OS, nullptr, Policy);
		if (T->getVectorKind() == VectorType::SveFixedLengthPredicateVector)
		// Predicates take a bit per byte of the vector size, multiply by 8 to
		// get the number of bits passed to the attribute.
		OS << " * 8";
		OS << " * sizeof(";
		print(T->getElementType(), OS, StringRef());
		// Multiply by 8 for the number of bits.
		OS << ") * 8";
		}
		OS << "))) ";
		printBefore(T->getElementType(), OS);
}		}
}		}

void TypePrinter::printDependentVectorAfter(		void TypePrinter::printDependentVectorAfter(
const DependentVectorType *T, raw_ostream &OS) {		const DependentVectorType *T, raw_ostream &OS) {
printAfter(T->getElementType(), OS);		printAfter(T->getElementType(), OS);
}		}

▲ Show 20 Lines • Show All 914 Lines • ▼ Show 20 Lines	case attr::NoDeref:
OS << "noderef";		OS << "noderef";
break;		break;
case attr::AcquireHandle:		case attr::AcquireHandle:
OS << "acquire_handle";		OS << "acquire_handle";
break;		break;
case attr::ArmMveStrictPolymorphism:		case attr::ArmMveStrictPolymorphism:
OS << "__clang_arm_mve_strict_polymorphism";		OS << "__clang_arm_mve_strict_polymorphism";
break;		break;
case attr::ArmSveVectorBits:
OS << "arm_sve_vector_bits";
break;
}		}
OS << "))";		OS << "))";
}		}

void TypePrinter::printObjCInterfaceBefore(const ObjCInterfaceType *T,		void TypePrinter::printObjCInterfaceBefore(const ObjCInterfaceType *T,
raw_ostream &OS) {		raw_ostream &OS) {
OS << T->getDecl()->getName();		OS << T->getDecl()->getName();
spaceBeforePlaceHolder(OS);		spaceBeforePlaceHolder(OS);
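
The `printVectorBefore` change in the TypePrinter.cpp hunk above prints the attribute argument as an expression rather than a literal: the element count (multiplied by 8 for predicates), times `sizeof(element)`, times 8. The arithmetic can be sketched as below. `printedSveVectorBits` is a hypothetical helper, and the 512-bit figures in the checks are an assumed example configuration, not values taken from the patch.

```cpp
#include <cstdint>

// Hypothetical helper (not Clang API): evaluates the expression that the
// patch prints inside __arm_sve_vector_bits__(...).
//   NumElements    stands in for T->getNumElements()
//   EltSizeInBytes stands in for sizeof(element type)
//   IsPredicate    stands in for the SveFixedLengthPredicateVector kind
constexpr std::uint64_t printedSveVectorBits(std::uint64_t NumElements,
                                             std::uint64_t EltSizeInBytes,
                                             bool IsPredicate) {
  // Predicates hold one bit per byte of the vector, so scale the count by 8.
  const std::uint64_t Count = IsPredicate ? NumElements * 8 : NumElements;
  // The final factor of 8 converts bytes to bits.
  return Count * EltSizeInBytes * 8;
}

// Assumed example: a 512-bit vector of 32-bit ints has 16 elements.
static_assert(printedSveVectorBits(16, 4, false) == 512, "data vector");
// Assumed example: the matching predicate is stored as 8 unsigned chars.
static_assert(printedSveVectorBits(8, 1, true) == 512, "predicate vector");
```

Both forms recover the same attribute argument, which is consistent with the FIXME in the patch noting that the size would ideally be printed directly.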

clang/lib/CodeGen/CGCall.cpp

Show First 20 Lines • Show All 1,113 Lines • ▼ Show 20 Lines	if (IRCallArgPos < IRFuncTy->getNumParams() &&
V = Builder.CreateBitCast(V, IRFuncTy->getParamType(IRCallArgPos));		V = Builder.CreateBitCast(V, IRFuncTy->getParamType(IRCallArgPos));

IRCallArgs[IRCallArgPos++] = V;		IRCallArgs[IRCallArgPos++] = V;
}		}
}		}

/// Create a temporary allocation for the purposes of coercion.		/// Create a temporary allocation for the purposes of coercion.
static Address CreateTempAllocaForCoercion(CodeGenFunction &CGF, llvm::Type *Ty,		static Address CreateTempAllocaForCoercion(CodeGenFunction &CGF, llvm::Type *Ty,
CharUnits MinAlign) {		CharUnits MinAlign,
		const Twine &Name = "tmp") {
// Don't use an alignment that's worse than what LLVM would prefer.		// Don't use an alignment that's worse than what LLVM would prefer.
auto PrefAlign = CGF.CGM.getDataLayout().getPrefTypeAlignment(Ty);		auto PrefAlign = CGF.CGM.getDataLayout().getPrefTypeAlignment(Ty);
CharUnits Align = std::max(MinAlign, CharUnits::fromQuantity(PrefAlign));		CharUnits Align = std::max(MinAlign, CharUnits::fromQuantity(PrefAlign));

return CGF.CreateTempAlloca(Ty, Align);		return CGF.CreateTempAlloca(Ty, Align, Name + ".coerce");
}		}

/// EnterStructPointerForCoercedAccess - Given a struct pointer that we are		/// EnterStructPointerForCoercedAccess - Given a struct pointer that we are
/// accessing some number of bytes out of it, try to gep into the struct to get		/// accessing some number of bytes out of it, try to gep into the struct to get
/// at its inner goodness. Dive as deep as possible without entering an element		/// at its inner goodness. Dive as deep as possible without entering an element
/// with an in-memory size smaller than DstSize.		/// with an in-memory size smaller than DstSize.
static Address		static Address
EnterStructPointerForCoercedAccess(Address SrcPtr,		EnterStructPointerForCoercedAccess(Address SrcPtr,
▲ Show 20 Lines • Show All 89 Lines • ▼ Show 20 Lines
static llvm::Value *CreateCoercedLoad(Address Src, llvm::Type *Ty,		static llvm::Value *CreateCoercedLoad(Address Src, llvm::Type *Ty,
CodeGenFunction &CGF) {		CodeGenFunction &CGF) {
llvm::Type *SrcTy = Src.getElementType();		llvm::Type *SrcTy = Src.getElementType();

// If SrcTy and Ty are the same, just do a load.		// If SrcTy and Ty are the same, just do a load.
if (SrcTy == Ty)		if (SrcTy == Ty)
return CGF.Builder.CreateLoad(Src);		return CGF.Builder.CreateLoad(Src);

uint64_t DstSize = CGF.CGM.getDataLayout().getTypeAllocSize(Ty);		llvm::TypeSize DstSize = CGF.CGM.getDataLayout().getTypeAllocSize(Ty);

if (llvm::StructType *SrcSTy = dyn_cast<llvm::StructType>(SrcTy)) {		if (llvm::StructType *SrcSTy = dyn_cast<llvm::StructType>(SrcTy)) {
Src = EnterStructPointerForCoercedAccess(Src, SrcSTy, DstSize, CGF);		Src = EnterStructPointerForCoercedAccess(Src, SrcSTy,
		DstSize.getKnownMinSize(), CGF);
SrcTy = Src.getElementType();		SrcTy = Src.getElementType();
}		}

uint64_t SrcSize = CGF.CGM.getDataLayout().getTypeAllocSize(SrcTy);		llvm::TypeSize SrcSize = CGF.CGM.getDataLayout().getTypeAllocSize(SrcTy);

// If the source and destination are integer or pointer types, just do an		// If the source and destination are integer or pointer types, just do an
// extension or truncation to the desired type.		// extension or truncation to the desired type.
if ((isa<llvm::IntegerType>(Ty) || isa<llvm::PointerType>(Ty)) &&		if ((isa<llvm::IntegerType>(Ty) || isa<llvm::PointerType>(Ty)) &&
(isa<llvm::IntegerType>(SrcTy) || isa<llvm::PointerType>(SrcTy))) {		(isa<llvm::IntegerType>(SrcTy) || isa<llvm::PointerType>(SrcTy))) {
llvm::Value *Load = CGF.Builder.CreateLoad(Src);		llvm::Value *Load = CGF.Builder.CreateLoad(Src);
return CoerceIntOrPtrToIntOrPtr(Load, Ty, CGF);		return CoerceIntOrPtrToIntOrPtr(Load, Ty, CGF);
}		}

// If load is legal, just bitcast the src pointer.		// If load is legal, just bitcast the src pointer.
if (SrcSize >= DstSize) {		if ((!SrcSize.isScalable() && !DstSize.isScalable()) &&
		SrcSize.getKnownMinSize() >= DstSize.getKnownMinSize()) {
// Generally SrcSize is never greater than DstSize, since this means we are		// Generally SrcSize is never greater than DstSize, since this means we are
// losing bits. However, this can happen in cases where the structure has		// losing bits. However, this can happen in cases where the structure has
// additional padding, for example due to a user specified alignment.		// additional padding, for example due to a user specified alignment.
//		//
// FIXME: Assert that we aren't truncating non-padding bits when have access		// FIXME: Assert that we aren't truncating non-padding bits when have access
// to that information.		// to that information.
Src = CGF.Builder.CreateBitCast(Src,		Src = CGF.Builder.CreateBitCast(Src,
Ty->getPointerTo(Src.getAddressSpace()));		Ty->getPointerTo(Src.getAddressSpace()));
return CGF.Builder.CreateLoad(Src);		return CGF.Builder.CreateLoad(Src);
}		}

// Otherwise do coercion through memory. This is stupid, but simple.		// Otherwise do coercion through memory. This is stupid, but simple.
Address Tmp = CreateTempAllocaForCoercion(CGF, Ty, Src.getAlignment());		Address Tmp =
CGF.Builder.CreateMemCpy(Tmp.getPointer(), Tmp.getAlignment().getAsAlign(),		CreateTempAllocaForCoercion(CGF, Ty, Src.getAlignment(), Src.getName());
Src.getPointer(), Src.getAlignment().getAsAlign(),		CGF.Builder.CreateMemCpy(
llvm::ConstantInt::get(CGF.IntPtrTy, SrcSize));		Tmp.getPointer(), Tmp.getAlignment().getAsAlign(), Src.getPointer(),
		Src.getAlignment().getAsAlign(),
		llvm::ConstantInt::get(CGF.IntPtrTy, SrcSize.getKnownMinSize()));
return CGF.Builder.CreateLoad(Tmp);		return CGF.Builder.CreateLoad(Tmp);
}		}

// Function to store a first-class aggregate into memory. We prefer to		// Function to store a first-class aggregate into memory. We prefer to
// store the elements rather than the aggregate to be more friendly to		// store the elements rather than the aggregate to be more friendly to
// fast-isel.		// fast-isel.
// FIXME: Do we need to recurse here?		// FIXME: Do we need to recurse here?
void CodeGenFunction::EmitAggregateStore(llvm::Value *Val, Address Dest,		void CodeGenFunction::EmitAggregateStore(llvm::Value *Val, Address Dest,
Show All 22 Lines	static void CreateCoercedStore(llvm::Value *Src,
CodeGenFunction &CGF) {		CodeGenFunction &CGF) {
llvm::Type *SrcTy = Src->getType();		llvm::Type *SrcTy = Src->getType();
llvm::Type *DstTy = Dst.getElementType();		llvm::Type *DstTy = Dst.getElementType();
if (SrcTy == DstTy) {		if (SrcTy == DstTy) {
CGF.Builder.CreateStore(Src, Dst, DstIsVolatile);		CGF.Builder.CreateStore(Src, Dst, DstIsVolatile);
return;		return;
}		}

uint64_t SrcSize = CGF.CGM.getDataLayout().getTypeAllocSize(SrcTy);		llvm::TypeSize SrcSize = CGF.CGM.getDataLayout().getTypeAllocSize(SrcTy);

if (llvm::StructType *DstSTy = dyn_cast<llvm::StructType>(DstTy)) {		if (llvm::StructType *DstSTy = dyn_cast<llvm::StructType>(DstTy)) {
Dst = EnterStructPointerForCoercedAccess(Dst, DstSTy, SrcSize, CGF);		Dst = EnterStructPointerForCoercedAccess(Dst, DstSTy,
		SrcSize.getKnownMinSize(), CGF);
DstTy = Dst.getElementType();		DstTy = Dst.getElementType();
}		}

llvm::PointerType *SrcPtrTy = llvm::dyn_cast<llvm::PointerType>(SrcTy);		llvm::PointerType *SrcPtrTy = llvm::dyn_cast<llvm::PointerType>(SrcTy);
llvm::PointerType *DstPtrTy = llvm::dyn_cast<llvm::PointerType>(DstTy);		llvm::PointerType *DstPtrTy = llvm::dyn_cast<llvm::PointerType>(DstTy);
if (SrcPtrTy && DstPtrTy &&		if (SrcPtrTy && DstPtrTy &&
SrcPtrTy->getAddressSpace() != DstPtrTy->getAddressSpace()) {		SrcPtrTy->getAddressSpace() != DstPtrTy->getAddressSpace()) {
Src = CGF.Builder.CreatePointerBitCastOrAddrSpaceCast(Src, DstTy);		Src = CGF.Builder.CreatePointerBitCastOrAddrSpaceCast(Src, DstTy);
CGF.Builder.CreateStore(Src, Dst, DstIsVolatile);		CGF.Builder.CreateStore(Src, Dst, DstIsVolatile);
return;		return;
}		}

// If the source and destination are integer or pointer types, just do an		// If the source and destination are integer or pointer types, just do an
// extension or truncation to the desired type.		// extension or truncation to the desired type.
if ((isa<llvm::IntegerType>(SrcTy) || isa<llvm::PointerType>(SrcTy)) &&		if ((isa<llvm::IntegerType>(SrcTy) || isa<llvm::PointerType>(SrcTy)) &&
(isa<llvm::IntegerType>(DstTy) || isa<llvm::PointerType>(DstTy))) {		(isa<llvm::IntegerType>(DstTy) || isa<llvm::PointerType>(DstTy))) {
Src = CoerceIntOrPtrToIntOrPtr(Src, DstTy, CGF);		Src = CoerceIntOrPtrToIntOrPtr(Src, DstTy, CGF);
CGF.Builder.CreateStore(Src, Dst, DstIsVolatile);		CGF.Builder.CreateStore(Src, Dst, DstIsVolatile);
return;		return;
}		}

uint64_t DstSize = CGF.CGM.getDataLayout().getTypeAllocSize(DstTy);		llvm::TypeSize DstSize = CGF.CGM.getDataLayout().getTypeAllocSize(DstTy);

// If store is legal, just bitcast the src pointer.		// If store is legal, just bitcast the src pointer.
if (SrcSize <= DstSize) {		// FIXME: does this check for scalable vectors need to be more conservative?
		if (SrcSize.getKnownMinSize() <= DstSize.getKnownMinSize() ||
		(isa<llvm::ScalableVectorType>(SrcTy) ||
		isa<llvm::ScalableVectorType>(DstTy))) {
Dst = CGF.Builder.CreateElementBitCast(Dst, SrcTy);		Dst = CGF.Builder.CreateElementBitCast(Dst, SrcTy);
CGF.EmitAggregateStore(Src, Dst, DstIsVolatile);		CGF.EmitAggregateStore(Src, Dst, DstIsVolatile);
} else {		} else {
// Otherwise do coercion through memory. This is stupid, but		// Otherwise do coercion through memory. This is stupid, but
// simple.		// simple.

// Generally SrcSize is never greater than DstSize, since this means we are		// Generally SrcSize is never greater than DstSize, since this means we are
// losing bits. However, this can happen in cases where the structure has		// losing bits. However, this can happen in cases where the structure has
// additional padding, for example due to a user specified alignment.		// additional padding, for example due to a user specified alignment.
//		//
// FIXME: Assert that we aren't truncating non-padding bits when have access		// FIXME: Assert that we aren't truncating non-padding bits when have access
// to that information.		// to that information.
Address Tmp = CreateTempAllocaForCoercion(CGF, SrcTy, Dst.getAlignment());		Address Tmp = CreateTempAllocaForCoercion(CGF, SrcTy, Dst.getAlignment());
CGF.Builder.CreateStore(Src, Tmp);		CGF.Builder.CreateStore(Src, Tmp);
CGF.Builder.CreateMemCpy(Dst.getPointer(), Dst.getAlignment().getAsAlign(),		CGF.Builder.CreateMemCpy(
Tmp.getPointer(), Tmp.getAlignment().getAsAlign(),		Dst.getPointer(), Dst.getAlignment().getAsAlign(), Tmp.getPointer(),
llvm::ConstantInt::get(CGF.IntPtrTy, DstSize));		Tmp.getAlignment().getAsAlign(),
		llvm::ConstantInt::get(CGF.IntPtrTy, DstSize.getKnownMinSize()));
}		}
}		}

static Address emitAddressAtOffset(CodeGenFunction &CGF, Address addr,		static Address emitAddressAtOffset(CodeGenFunction &CGF, Address addr,
const ABIArgInfo &info) {		const ABIArgInfo &info) {
if (unsigned offset = info.getDirectOffset()) {		if (unsigned offset = info.getDirectOffset()) {
addr = CGF.Builder.CreateElementBitCast(addr, CGF.Int8Ty);		addr = CGF.Builder.CreateElementBitCast(addr, CGF.Int8Ty);
addr = CGF.Builder.CreateConstInBoundsByteGEP(addr,		addr = CGF.Builder.CreateConstInBoundsByteGEP(addr,
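
The `CreateCoercedLoad` change in the CGCall.cpp hunk above only takes the pointer-bitcast path when neither size is scalable and the source is at least as large as the destination; any scalable size falls through to the coercion through a temporary. A standalone sketch of that predicate follows. `SizeInfo` and `canLoadViaPointerBitcast` are hypothetical stand-ins for `llvm::TypeSize` and the patched condition, written without LLVM so the logic can be read in isolation.

```cpp
#include <cstdint>

// Hypothetical stand-in for llvm::TypeSize: a known minimum size in bytes
// plus a flag saying whether the real size is a runtime multiple of it.
struct SizeInfo {
  std::uint64_t KnownMinBytes;
  bool Scalable;
};

// Mirrors the patched condition:
//   (!SrcSize.isScalable() && !DstSize.isScalable()) &&
//   SrcSize.getKnownMinSize() >= DstSize.getKnownMinSize()
constexpr bool canLoadViaPointerBitcast(SizeInfo Src, SizeInfo Dst) {
  return !Src.Scalable && !Dst.Scalable &&
         Src.KnownMinBytes >= Dst.KnownMinBytes;
}
```

In this sketch a fixed 64-byte source loaded as a scalable destination is rejected even though 64 is larger than the destination's known minimum, because the minimum says nothing about the destination's actual runtime size.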

clang/lib/CodeGen/CGExprScalar.cpp

Show First 20 Lines • Show All 2,073 Lines • ▼ Show 20 Lines	if (auto *CI = dyn_cast<llvm::CallBase>(Src)) {
if (CI->getMetadata("heapallocsite") && isa<ExplicitCastExpr>(CE)) {		if (CI->getMetadata("heapallocsite") && isa<ExplicitCastExpr>(CE)) {
QualType PointeeType = DestTy->getPointeeType();		QualType PointeeType = DestTy->getPointeeType();
if (!PointeeType.isNull())		if (!PointeeType.isNull())
CGF.getDebugInfo()->addHeapAllocSiteMetadata(CI, PointeeType,		CGF.getDebugInfo()->addHeapAllocSiteMetadata(CI, PointeeType,
CE->getExprLoc());		CE->getExprLoc());
}		}
}		}

		// Perform VLAT <-> VLST bitcast through memory.
		if ((isa<llvm::FixedVectorType>(SrcTy) &&
		isa<llvm::ScalableVectorType>(DstTy)) ||
		(isa<llvm::ScalableVectorType>(SrcTy) &&
		isa<llvm::FixedVectorType>(DstTy))) {
		if (const CallExpr *CE = dyn_cast<CallExpr>(E)) {
		// Call expressions can't have a scalar return unless the return type
		// is a reference type so an lvalue can't be emitted. Create a temp
		// alloca to store the call, bitcast the address then load.
		QualType RetTy = CE->getCallReturnType(CGF.getContext());
		Address Addr =
		CGF.CreateDefaultAlignTempAlloca(SrcTy, "saved-call-rvalue");
		LValue LV = CGF.MakeAddrLValue(Addr, RetTy);
		CGF.EmitStoreOfScalar(Src, LV);
		Addr = Builder.CreateElementBitCast(Addr, CGF.ConvertTypeForMem(DestTy),
		"castFixedSve");
		LValue DestLV = CGF.MakeAddrLValue(Addr, DestTy);
		DestLV.setTBAAInfo(TBAAAccessInfo::getMayAliasInfo());
		return EmitLoadOfLValue(DestLV, CE->getExprLoc());
		}

		Address Addr = EmitLValue(E).getAddress(CGF);
		Addr = Builder.CreateElementBitCast(Addr, CGF.ConvertTypeForMem(DestTy));
		LValue DestLV = CGF.MakeAddrLValue(Addr, DestTy);
		DestLV.setTBAAInfo(TBAAAccessInfo::getMayAliasInfo());
		return EmitLoadOfLValue(DestLV, CE->getExprLoc());
		}

return Builder.CreateBitCast(Src, DstTy);		return Builder.CreateBitCast(Src, DstTy);
}		}
case CK_AddressSpaceConversion: {		case CK_AddressSpaceConversion: {
Expr::EvalResult Result;		Expr::EvalResult Result;
if (E->EvaluateAsRValue(Result, CGF.getContext()) &&		if (E->EvaluateAsRValue(Result, CGF.getContext()) &&
Result.Val.isNullPointer()) {		Result.Val.isNullPointer()) {
// If E has side effect, it is emitted even if its final result is a		// If E has side effect, it is emitted even if its final result is a
// null pointer. In that case, a DCE pass should be able to		// null pointer. In that case, a DCE pass should be able to

clang/lib/CodeGen/TargetInfo.cpp


Show First 20 Lines • Show All 5,442 Lines • ▼ Show 20 Lines	AArch64ABIInfo(CodeGenTypes &CGT, ABIKind Kind)
: SwiftABIInfo(CGT), Kind(Kind) {}		: SwiftABIInfo(CGT), Kind(Kind) {}

private:		private:
ABIKind getABIKind() const { return Kind; }		ABIKind getABIKind() const { return Kind; }
bool isDarwinPCS() const { return Kind == DarwinPCS; }		bool isDarwinPCS() const { return Kind == DarwinPCS; }

ABIArgInfo classifyReturnType(QualType RetTy, bool IsVariadic) const;		ABIArgInfo classifyReturnType(QualType RetTy, bool IsVariadic) const;
ABIArgInfo classifyArgumentType(QualType RetTy) const;		ABIArgInfo classifyArgumentType(QualType RetTy) const;
		ABIArgInfo coerceIllegalVector(QualType Ty) const;
bool isHomogeneousAggregateBaseType(QualType Ty) const override;		bool isHomogeneousAggregateBaseType(QualType Ty) const override;
bool isHomogeneousAggregateSmallEnough(const Type *Ty,		bool isHomogeneousAggregateSmallEnough(const Type *Ty,
uint64_t Members) const override;		uint64_t Members) const override;

bool isIllegalVectorType(QualType Ty) const;		bool isIllegalVectorType(QualType Ty) const;

void computeInfo(CGFunctionInfo &FI) const override {		void computeInfo(CGFunctionInfo &FI) const override {
if (!::classifyReturnType(getCXXABI(), FI, *this))		if (!::classifyReturnType(getCXXABI(), FI, *this))
▲ Show 20 Lines • Show All 117 Lines • ▼ Show 20 Lines	void WindowsAArch64TargetCodeGenInfo::setTargetAttributes(
const Decl *D, llvm::GlobalValue *GV, CodeGen::CodeGenModule &CGM) const {		const Decl *D, llvm::GlobalValue *GV, CodeGen::CodeGenModule &CGM) const {
AArch64TargetCodeGenInfo::setTargetAttributes(D, GV, CGM);		AArch64TargetCodeGenInfo::setTargetAttributes(D, GV, CGM);
if (GV->isDeclaration())		if (GV->isDeclaration())
return;		return;
addStackProbeTargetAttributes(D, GV, CGM);		addStackProbeTargetAttributes(D, GV, CGM);
}		}
}		}

ABIArgInfo AArch64ABIInfo::classifyArgumentType(QualType Ty) const {		ABIArgInfo AArch64ABIInfo::coerceIllegalVector(QualType Ty) const {
Ty = useFirstFieldIfTransparentUnion(Ty);		assert(Ty->isVectorType() && "expected vector type!");

		const auto *VT = Ty->castAs<VectorType>();
		if (VT->getVectorKind() == VectorType::SveFixedLengthPredicateVector) {
		assert(VT->getElementType()->isBuiltinType() && "expected builtin type!");
		assert(VT->getElementType()->castAs<BuiltinType>()->getKind() ==
		BuiltinType::UChar &&
		"unexpected builtin type for SVE predicate!");
		return ABIArgInfo::getDirect(llvm::ScalableVectorType::get(
		llvm::Type::getInt1Ty(getVMContext()), 16));
		}

		if (VT->getVectorKind() == VectorType::SveFixedLengthDataVector) {
		assert(VT->getElementType()->isBuiltinType() && "expected builtin type!");

		const auto *BT = VT->getElementType()->castAs<BuiltinType>();
		llvm::ScalableVectorType *ResType = nullptr;
		switch (BT->getKind()) {
		default:
		llvm_unreachable("unexpected builtin type for SVE vector!");
		case BuiltinType::SChar:
		case BuiltinType::UChar:
		ResType = llvm::ScalableVectorType::get(
		llvm::Type::getInt8Ty(getVMContext()), 16);
		break;
		case BuiltinType::Short:
		case BuiltinType::UShort:
		ResType = llvm::ScalableVectorType::get(
		llvm::Type::getInt16Ty(getVMContext()), 8);
		break;
		case BuiltinType::Int:
		case BuiltinType::UInt:
		ResType = llvm::ScalableVectorType::get(
		llvm::Type::getInt32Ty(getVMContext()), 4);
		break;
		case BuiltinType::Long:
		case BuiltinType::ULong:
		ResType = llvm::ScalableVectorType::get(
		llvm::Type::getInt64Ty(getVMContext()), 2);
		break;
		case BuiltinType::Float16:
		ResType = llvm::ScalableVectorType::get(
		llvm::Type::getHalfTy(getVMContext()), 8);
		break;
		case BuiltinType::Float:
		ResType = llvm::ScalableVectorType::get(
		llvm::Type::getFloatTy(getVMContext()), 4);
		break;
		case BuiltinType::Double:
		ResType = llvm::ScalableVectorType::get(
		llvm::Type::getDoubleTy(getVMContext()), 2);
		break;
		case BuiltinType::BFloat16:
		ResType = llvm::ScalableVectorType::get(
		llvm::Type::getBFloatTy(getVMContext()), 8);
		break;
		}
		return ABIArgInfo::getDirect(ResType);
		}

// Handle illegal vector types here.
if (isIllegalVectorType(Ty)) {
uint64_t Size = getContext().getTypeSize(Ty);		uint64_t Size = getContext().getTypeSize(Ty);
// Android promotes <2 x i8> to i16, not i32		// Android promotes <2 x i8> to i16, not i32
if (isAndroid() && (Size <= 16)) {		if (isAndroid() && (Size <= 16)) {
llvm::Type *ResType = llvm::Type::getInt16Ty(getVMContext());		llvm::Type *ResType = llvm::Type::getInt16Ty(getVMContext());
return ABIArgInfo::getDirect(ResType);		return ABIArgInfo::getDirect(ResType);
}		}
if (Size <= 32) {		if (Size <= 32) {
llvm::Type *ResType = llvm::Type::getInt32Ty(getVMContext());		llvm::Type *ResType = llvm::Type::getInt32Ty(getVMContext());
return ABIArgInfo::getDirect(ResType);		return ABIArgInfo::getDirect(ResType);
}		}
if (Size == 64) {		if (Size == 64) {
auto *ResType =		auto *ResType =
llvm::FixedVectorType::get(llvm::Type::getInt32Ty(getVMContext()), 2);		llvm::FixedVectorType::get(llvm::Type::getInt32Ty(getVMContext()), 2);
return ABIArgInfo::getDirect(ResType);		return ABIArgInfo::getDirect(ResType);
}		}
if (Size == 128) {		if (Size == 128) {
auto *ResType =		auto *ResType =
llvm::FixedVectorType::get(llvm::Type::getInt32Ty(getVMContext()), 4);		llvm::FixedVectorType::get(llvm::Type::getInt32Ty(getVMContext()), 4);
return ABIArgInfo::getDirect(ResType);		return ABIArgInfo::getDirect(ResType);
}		}
return getNaturalAlignIndirect(Ty, /*ByVal=*/false);		return getNaturalAlignIndirect(Ty, /*ByVal=*/false);
}		}

		ABIArgInfo AArch64ABIInfo::classifyArgumentType(QualType Ty) const {
		Ty = useFirstFieldIfTransparentUnion(Ty);

		// Handle illegal vector types here.
		if (isIllegalVectorType(Ty))
		return coerceIllegalVector(Ty);

if (!isAggregateTypeForABI(Ty)) {		if (!isAggregateTypeForABI(Ty)) {
// Treat an enum type as its underlying type.		// Treat an enum type as its underlying type.
if (const EnumType *EnumTy = Ty->getAs<EnumType>())		if (const EnumType *EnumTy = Ty->getAs<EnumType>())
Ty = EnumTy->getDecl()->getIntegerType();		Ty = EnumTy->getDecl()->getIntegerType();

if (const auto *EIT = Ty->getAs<ExtIntType>())		if (const auto *EIT = Ty->getAs<ExtIntType>())
if (EIT->getNumBits() > 128)		if (EIT->getNumBits() > 128)
return getNaturalAlignIndirect(Ty);		return getNaturalAlignIndirect(Ty);
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	ABIArgInfo AArch64ABIInfo::classifyArgumentType(QualType Ty) const {
return getNaturalAlignIndirect(Ty, /*ByVal=*/false);		return getNaturalAlignIndirect(Ty, /*ByVal=*/false);
}		}

ABIArgInfo AArch64ABIInfo::classifyReturnType(QualType RetTy,		ABIArgInfo AArch64ABIInfo::classifyReturnType(QualType RetTy,
bool IsVariadic) const {		bool IsVariadic) const {
if (RetTy->isVoidType())		if (RetTy->isVoidType())
return ABIArgInfo::getIgnore();		return ABIArgInfo::getIgnore();

		if (const auto *VT = RetTy->getAs<VectorType>()) {
		if (VT->getVectorKind() == VectorType::SveFixedLengthDataVector ||
		VT->getVectorKind() == VectorType::SveFixedLengthPredicateVector)
		return coerceIllegalVector(RetTy);
		}

// Large vector types should be returned via memory.		// Large vector types should be returned via memory.
if (RetTy->isVectorType() && getContext().getTypeSize(RetTy) > 128)		if (RetTy->isVectorType() && getContext().getTypeSize(RetTy) > 128)
return getNaturalAlignIndirect(RetTy);		return getNaturalAlignIndirect(RetTy);

if (!isAggregateTypeForABI(RetTy)) {		if (!isAggregateTypeForABI(RetTy)) {
// Treat an enum type as its underlying type.		// Treat an enum type as its underlying type.
if (const EnumType *EnumTy = RetTy->getAs<EnumType>())		if (const EnumType *EnumTy = RetTy->getAs<EnumType>())
RetTy = EnumTy->getDecl()->getIntegerType();		RetTy = EnumTy->getDecl()->getIntegerType();
Show All 39 Lines	ABIArgInfo AArch64ABIInfo::classifyReturnType(QualType RetTy,
}		}

return getNaturalAlignIndirect(RetTy);		return getNaturalAlignIndirect(RetTy);
}		}

/// isIllegalVectorType - check whether the vector type is legal for AArch64.		/// isIllegalVectorType - check whether the vector type is legal for AArch64.
bool AArch64ABIInfo::isIllegalVectorType(QualType Ty) const {		bool AArch64ABIInfo::isIllegalVectorType(QualType Ty) const {
if (const VectorType *VT = Ty->getAs<VectorType>()) {		if (const VectorType *VT = Ty->getAs<VectorType>()) {
		// Check whether VT is a fixed-length SVE vector. These types are
		// represented as scalable vectors in function args/return and must be
		// coerced from fixed vectors.
		if (VT->getVectorKind() == VectorType::SveFixedLengthDataVector ||
		VT->getVectorKind() == VectorType::SveFixedLengthPredicateVector)
		return true;

// Check whether VT is legal.		// Check whether VT is legal.
unsigned NumElements = VT->getNumElements();		unsigned NumElements = VT->getNumElements();
uint64_t Size = getContext().getTypeSize(VT);		uint64_t Size = getContext().getTypeSize(VT);
// NumElements should be power of 2.		// NumElements should be power of 2.
if (!llvm::isPowerOf2_32(NumElements))		if (!llvm::isPowerOf2_32(NumElements))
return true;		return true;

// arm64_32 has to be compatible with the ARM logic here, which allows huge		// arm64_32 has to be compatible with the ARM logic here, which allows huge
▲ Show 20 Lines • Show All 5,341 Lines • Show Last 20 Lines
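
As a reading aid for the coerceIllegalVector hunk above: each data-vector case picks an element count such that element width times count is 128 bits, the minimum SVE vector length, and the predicate case uses 16 lanes of i1, one per byte of that 128-bit granule. The following is a minimal standalone sketch of that arithmetic. It is not part of the patch, uses no LLVM headers, and the names kSveGranuleBits, minEltsForDataVector and minEltsForPredicate are hypothetical, introduced only for illustration.

```cpp
// Sketch only: a plain C++ model of the element counts chosen in the hunk.
// All names here are hypothetical and do not exist in Clang or LLVM.

// SVE vector lengths are multiples of 128 bits; <vscale x N x T> describes
// one such 128-bit granule.
constexpr unsigned kSveGranuleBits = 128;

// Minimum element count N in <vscale x N x T> for a data element of
// EltBits bits (8, 16, 32 or 64).
constexpr unsigned minEltsForDataVector(unsigned EltBits) {
  return kSveGranuleBits / EltBits;
}

// Predicates carry one bit per byte of vector data, hence 16 x i1.
constexpr unsigned minEltsForPredicate() { return kSveGranuleBits / 8; }

// These match the constants passed to ScalableVectorType::get in the hunk.
static_assert(minEltsForDataVector(8) == 16, "i8 -> <vscale x 16 x i8>");
static_assert(minEltsForDataVector(16) == 8, "i16/half/bfloat -> 8 lanes");
static_assert(minEltsForDataVector(32) == 4, "i32/float -> 4 lanes");
static_assert(minEltsForDataVector(64) == 2, "i64/double -> 2 lanes");
static_assert(minEltsForPredicate() == 16, "svbool_t -> <vscale x 16 x i1>");
```

The switch in the patch spells these constants out per builtin kind; the sketch only shows that they all follow from the same 128-bit granule.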

clang/lib/Sema/SemaDecl.cpp

Show First 20 Lines • Show All 7,998 Lines • ▼ Show 20 Lines	void Sema::CheckVariableDeclarationType(VarDecl *NewVD) {
}		}

if (!NewVD->hasLocalStorage() && NewVD->hasAttr<BlocksAttr>()) {		if (!NewVD->hasLocalStorage() && NewVD->hasAttr<BlocksAttr>()) {
Diag(NewVD->getLocation(), diag::err_block_on_nonlocal);		Diag(NewVD->getLocation(), diag::err_block_on_nonlocal);
NewVD->setInvalidDecl();		NewVD->setInvalidDecl();
return;		return;
}		}

if (!NewVD->hasLocalStorage() && T->isSizelessType() && !T->isVLST()) {		if (!NewVD->hasLocalStorage() && T->isSizelessType()) {
Diag(NewVD->getLocation(), diag::err_sizeless_nonlocal) << T;		Diag(NewVD->getLocation(), diag::err_sizeless_nonlocal) << T;
NewVD->setInvalidDecl();		NewVD->setInvalidDecl();
return;		return;
}		}

if (isVM && NewVD->hasAttr<BlocksAttr>()) {		if (isVM && NewVD->hasAttr<BlocksAttr>()) {
Diag(NewVD->getLocation(), diag::err_block_on_vm);		Diag(NewVD->getLocation(), diag::err_block_on_vm);
NewVD->setInvalidDecl();		NewVD->setInvalidDecl();
▲ Show 20 Lines • Show All 10,266 Lines • Show Last 20 Lines

clang/lib/Sema/SemaExpr.cpp

Show First 20 Lines • Show All 8,890 Lines • ▼ Show 20 Lines
/// This helper function returns true if QT is a vector type that has element		/// This helper function returns true if QT is a vector type that has element
/// type ElementType.		/// type ElementType.
static bool isVector(QualType QT, QualType ElementType) {		static bool isVector(QualType QT, QualType ElementType) {
if (const VectorType *VT = QT->getAs<VectorType>())		if (const VectorType *VT = QT->getAs<VectorType>())
return VT->getElementType().getCanonicalType() == ElementType;		return VT->getElementType().getCanonicalType() == ElementType;
return false;		return false;
}		}

		/// This helper function returns true if LHSType is an SVE builtin type and
		/// RHSType is a valid fixed-length representation of LHSType, and vice versa.
		static bool areCompatibleSveTypes(QualType LHSType, QualType RHSType,
		ASTContext &Context) {
		auto IsValidCast = [](QualType LHSType, QualType RHSType,
		ASTContext &Context) {
		if (const auto *BT = LHSType->getAs<BuiltinType>()) {
		if (const auto *VT = RHSType->getAs<VectorType>()) {
		// Predicates have the same representation as uint8 so we also have to
		// check the kind to make these types incompatible.
		if (BT->getKind() == BuiltinType::SveBool &&
		VT->getVectorKind() == VectorType::SveFixedLengthPredicateVector &&
		isVector(RHSType, LHSType->getFixedLengthSveEltType(Context)))
		return true;

		if (VT->getVectorKind() == VectorType::SveFixedLengthDataVector &&
		isVector(RHSType, LHSType->getFixedLengthSveEltType(Context)))
		return true;
		}
		}
		return false;
		};

		return IsValidCast(LHSType, RHSType, Context) ||
		IsValidCast(RHSType, LHSType, Context);
		}

/// CheckAssignmentConstraints (C99 6.5.16) - This routine currently		/// CheckAssignmentConstraints (C99 6.5.16) - This routine currently
/// has code to accommodate several GCC extensions when type checking		/// has code to accommodate several GCC extensions when type checking
/// pointers. Here are some objectionable examples that GCC considers warnings:		/// pointers. Here are some objectionable examples that GCC considers warnings:
///		///
/// int a, *pint;		/// int a, *pint;
/// short *pshort;		/// short *pshort;
/// struct foo *pfoo;		/// struct foo *pfoo;
///		///
▲ Show 20 Lines • Show All 94 Lines • ▼ Show 20 Lines	if (LHSType->isScalarType()) {
isLaxVectorConversion(RHSType, LHSType)) {		isLaxVectorConversion(RHSType, LHSType)) {
ExprResult *VecExpr = &RHS;		ExprResult *VecExpr = &RHS;
*VecExpr = ImpCastExprToType(VecExpr->get(), LHSType, CK_BitCast);		*VecExpr = ImpCastExprToType(VecExpr->get(), LHSType, CK_BitCast);
Kind = CK_BitCast;		Kind = CK_BitCast;
return Compatible;		return Compatible;
}		}
}		}

		if ((LHSType->isSizelessBuiltinType() && RHSType->isVectorType()) ||
		(LHSType->isVectorType() && RHSType->isSizelessBuiltinType())) {
		// Allow assignments between fixed-length and sizeless SVE vectors.
		if (areCompatibleSveTypes(LHSType, RHSType, Context)) {
		Kind = CK_BitCast;
		return Compatible;
		}
		}

return Incompatible;		return Incompatible;
}		}

// Diagnose attempts to convert between __float128 and long double where		// Diagnose attempts to convert between __float128 and long double where
// such conversions currently can't be handled.		// such conversions currently can't be handled.
if (unsupportedTypeConversion(*this, LHSType, RHSType))		if (unsupportedTypeConversion(*this, LHSType, RHSType))
return Incompatible;		return Incompatible;

▲ Show 20 Lines • Show All 870 Lines • ▼ Show 20 Lines	// <1 x T> -> T. The result is also a vector type.
ExprResult *RHSExpr = &RHS;		ExprResult *RHSExpr = &RHS;
*RHSExpr = ImpCastExprToType(RHSExpr->get(), LHSType, CK_BitCast);		*RHSExpr = ImpCastExprToType(RHSExpr->get(), LHSType, CK_BitCast);
return VecType;		return VecType;
}		}
}		}

// Okay, the expression is invalid.		// Okay, the expression is invalid.

		// If there's a sizeless operand, diagnose that.
		if ((LHSVecType &&
		((LHSVecType->getVectorKind() == VectorType::SveFixedLengthDataVector) ||
		(LHSVecType->getVectorKind() ==
		VectorType::SveFixedLengthPredicateVector)) &&
		RHSType->isSizelessBuiltinType()) ||
		(RHSVecType &&
		((RHSVecType->getVectorKind() == VectorType::SveFixedLengthDataVector) ||
		(RHSVecType->getVectorKind() ==
		VectorType::SveFixedLengthPredicateVector)) &&
		LHSType->isSizelessBuiltinType())) {
		Diag(Loc, diag::err_typecheck_vector_not_convertable_sizeless)
		<< LHSType << RHSType;
		return QualType();
		}

// If there's a non-vector, non-real operand, diagnose that.		// If there's a non-vector, non-real operand, diagnose that.
if ((!RHSVecType && !RHSType->isRealType()) ||		if ((!RHSVecType && !RHSType->isRealType()) ||
(!LHSVecType && !LHSType->isRealType())) {		(!LHSVecType && !LHSType->isRealType())) {
Diag(Loc, diag::err_typecheck_vector_not_convertable_non_scalar)		Diag(Loc, diag::err_typecheck_vector_not_convertable_non_scalar)
<< LHSType << RHSType		<< LHSType << RHSType
<< LHS.get()->getSourceRange() << RHS.get()->getSourceRange();		<< LHS.get()->getSourceRange() << RHS.get()->getSourceRange();
return QualType();		return QualType();
}		}
▲ Show 20 Lines • Show All 9,299 Lines • Show Last 20 Lines
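
The areCompatibleSveTypes helper added above applies a one-directional check twice, once per operand order, so that assignment works in both directions. The sketch below models that shape with plain structs instead of QualType. It is a simplified stand-in, not the Clang code: Elt, Ty, isValidCast and areCompatible are hypothetical names, and only three element types are modelled.

```cpp
// Sketch only: hypothetical stand-ins for the Clang types, not the real API.
enum class Elt { UChar, Int, Double };

struct Ty {
  bool IsSizelessBuiltin; // true: svint32_t-like; false: fixed-length vector
  bool IsPredicate;       // svbool_t, or the fixed-length predicate kind
  Elt Element;            // element type; predicates are modelled as UChar
};

// One direction: L must be the sizeless builtin, R the fixed-length vector.
// Mirrors the two branches of the lambda in the patch.
constexpr bool isValidCast(Ty L, Ty R) {
  return L.IsSizelessBuiltin && !R.IsSizelessBuiltin &&
         L.Element == R.Element &&
         ((L.IsPredicate && R.IsPredicate) || !R.IsPredicate);
}

// Symmetric wrapper, as in the patch: try both operand orders.
constexpr bool areCompatible(Ty A, Ty B) {
  return isValidCast(A, B) || isValidCast(B, A);
}

constexpr Ty SvInt32{true, false, Elt::Int};
constexpr Ty FixedInt32{false, false, Elt::Int};
constexpr Ty SvUint8{true, false, Elt::UChar};
constexpr Ty SvBool{true, true, Elt::UChar};
constexpr Ty FixedBool{false, true, Elt::UChar};

static_assert(areCompatible(SvInt32, FixedInt32), "sizeless -> fixed");
static_assert(areCompatible(FixedInt32, SvInt32), "fixed -> sizeless");
static_assert(areCompatible(SvBool, FixedBool), "predicate pair");
// Same element type (UChar), but the vector kind keeps them apart.
static_assert(!areCompatible(SvUint8, FixedBool), "uint8 vs predicate");
static_assert(!areCompatible(FixedInt32, FixedInt32), "needs one sizeless");
```

The SvUint8 and FixedBool case is the one the comment in the patch calls out: both use an 8-bit unsigned element, so the element-type comparison alone would not distinguish them.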

clang/lib/Sema/SemaType.cpp

Show First 20 Lines • Show All 2,298 Lines • ▼ Show 20 Lines	QualType Sema::BuildArrayType(QualType T, ArrayType::ArraySizeModifier ASM,
} else {		} else {
// C99 6.7.5.2p1: If the element type is an incomplete or function type,		// C99 6.7.5.2p1: If the element type is an incomplete or function type,
// reject it (e.g. void ary[7], struct foo ary[7], void ary[7]())		// reject it (e.g. void ary[7], struct foo ary[7], void ary[7]())
if (RequireCompleteSizedType(Loc, T,		if (RequireCompleteSizedType(Loc, T,
diag::err_array_incomplete_or_sizeless_type))		diag::err_array_incomplete_or_sizeless_type))
return QualType();		return QualType();
}		}

if (T->isSizelessType() && !T->isVLST()) {		if (T->isSizelessType()) {
Diag(Loc, diag::err_array_incomplete_or_sizeless_type) << 1 << T;		Diag(Loc, diag::err_array_incomplete_or_sizeless_type) << 1 << T;
return QualType();		return QualType();
}		}

if (T->isFunctionType()) {		if (T->isFunctionType()) {
Diag(Loc, diag::err_illegal_decl_array_of_functions)		Diag(Loc, diag::err_illegal_decl_array_of_functions)
<< getPrintableNameForEntity(Entity) << T;		<< getPrintableNameForEntity(Entity) << T;
return QualType();		return QualType();
▲ Show 20 Lines • Show All 5,433 Lines • ▼ Show 20 Lines	static void HandleNeonVectorTypeAttr(QualType &CurType, const ParsedAttr &Attr,
}		}

CurType = S.Context.getVectorType(CurType, numElts, VecKind);		CurType = S.Context.getVectorType(CurType, numElts, VecKind);
}		}

/// HandleArmSveVectorBitsTypeAttr - The "arm_sve_vector_bits" attribute is		/// HandleArmSveVectorBitsTypeAttr - The "arm_sve_vector_bits" attribute is
/// used to create fixed-length versions of sizeless SVE types defined by		/// used to create fixed-length versions of sizeless SVE types defined by
/// the ACLE, such as svint32_t and svbool_t.		/// the ACLE, such as svint32_t and svbool_t.
static void HandleArmSveVectorBitsTypeAttr(TypeProcessingState &State,		static void HandleArmSveVectorBitsTypeAttr(QualType &CurType, ParsedAttr &Attr,
QualType &CurType,		Sema &S) {
ParsedAttr &Attr) {
Sema &S = State.getSema();
ASTContext &Ctx = S.Context;

// Target must have SVE.		// Target must have SVE.
if (!Ctx.getTargetInfo().hasFeature("sve")) {		if (!S.Context.getTargetInfo().hasFeature("sve")) {
S.Diag(Attr.getLoc(), diag::err_attribute_unsupported) << Attr;		S.Diag(Attr.getLoc(), diag::err_attribute_unsupported) << Attr;
Attr.setInvalid();		Attr.setInvalid();
return;		return;
}		}

// Attribute is unsupported if '-msve-vector-bits=<bits>' isn't specified.		// Attribute is unsupported if '-msve-vector-bits=<bits>' isn't specified.
if (!S.getLangOpts().ArmSveVectorBits) {		if (!S.getLangOpts().ArmSveVectorBits) {
S.Diag(Attr.getLoc(), diag::err_attribute_arm_feature_sve_bits_unsupported)		S.Diag(Attr.getLoc(), diag::err_attribute_arm_feature_sve_bits_unsupported)
Show All 28 Lines	static void HandleArmSveVectorBitsTypeAttr(QualType &CurType, ParsedAttr &Attr,
// Attribute can only be attached to a single SVE vector or predicate type.		// Attribute can only be attached to a single SVE vector or predicate type.
if (!CurType->isVLSTBuiltinType()) {		if (!CurType->isVLSTBuiltinType()) {
S.Diag(Attr.getLoc(), diag::err_attribute_invalid_sve_type)		S.Diag(Attr.getLoc(), diag::err_attribute_invalid_sve_type)
<< Attr << CurType;		<< Attr << CurType;
Attr.setInvalid();		Attr.setInvalid();
return;		return;
}		}

auto *A = ::new (Ctx) ArmSveVectorBitsAttr(Ctx, Attr, VecSize);		const auto *BT = CurType->castAs<BuiltinType>();
CurType = State.getAttributedType(A, CurType, CurType);
		QualType EltType = CurType->getFixedLengthSveEltType(S.Context);
		unsigned TypeSize = S.Context.getTypeSize(EltType);
		VectorType::VectorKind VecKind = VectorType::SveFixedLengthDataVector;
		if (BT->getKind() == BuiltinType::SveBool) {
		// Predicates are represented as i8
		VecSize /= S.Context.getCharWidth() * S.Context.getCharWidth();
		VecKind = VectorType::SveFixedLengthPredicateVector;
		} else
		VecSize /= TypeSize;
		CurType = S.Context.getVectorType(EltType, VecSize, VecKind);
}		}

static void HandleArmMveStrictPolymorphismAttr(TypeProcessingState &State,		static void HandleArmMveStrictPolymorphismAttr(TypeProcessingState &State,
QualType &CurType,		QualType &CurType,
ParsedAttr &Attr) {		ParsedAttr &Attr) {
const VectorType *VT = dyn_cast<VectorType>(CurType);		const VectorType *VT = dyn_cast<VectorType>(CurType);
if (!VT || VT->getVectorKind() != VectorType::NeonVector) {		if (!VT || VT->getVectorKind() != VectorType::NeonVector) {
State.getSema().Diag(Attr.getLoc(),		State.getSema().Diag(Attr.getLoc(),
▲ Show 20 Lines • Show All 250 Lines • ▼ Show 20 Lines	case ParsedAttr::AT_NeonVectorType:
attr.setUsedAsTypeAttr();		attr.setUsedAsTypeAttr();
break;		break;
case ParsedAttr::AT_NeonPolyVectorType:		case ParsedAttr::AT_NeonPolyVectorType:
HandleNeonVectorTypeAttr(type, attr, state.getSema(),		HandleNeonVectorTypeAttr(type, attr, state.getSema(),
VectorType::NeonPolyVector);		VectorType::NeonPolyVector);
attr.setUsedAsTypeAttr();		attr.setUsedAsTypeAttr();
break;		break;
case ParsedAttr::AT_ArmSveVectorBits:		case ParsedAttr::AT_ArmSveVectorBits:
HandleArmSveVectorBitsTypeAttr(state, type, attr);		HandleArmSveVectorBitsTypeAttr(type, attr, state.getSema());
attr.setUsedAsTypeAttr();		attr.setUsedAsTypeAttr();
break;		break;
case ParsedAttr::AT_ArmMveStrictPolymorphism: {		case ParsedAttr::AT_ArmMveStrictPolymorphism: {
HandleArmMveStrictPolymorphismAttr(state, type, attr);		HandleArmMveStrictPolymorphismAttr(state, type, attr);
attr.setUsedAsTypeAttr();		attr.setUsedAsTypeAttr();
break;		break;
}		}
case ParsedAttr::AT_OpenCLAccess:		case ParsedAttr::AT_OpenCLAccess:
▲ Show 20 Lines • Show All 888 Lines • Show Last 20 Lines
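
The element-count arithmetic at the end of the new HandleArmSveVectorBitsTypeAttr can be checked in isolation. For data vectors the count is the vector size in bits divided by the element size. For predicates there is one bit per byte of vector, and those bits are stored in i8 elements, which gives the division by getCharWidth() * getCharWidth(). The sketch below is a standalone model under the assumption that the char width is 8; kCharWidth, fixedDataElts and fixedPredicateElts are hypothetical names, not Clang API.

```cpp
// Sketch only: models the VecSize computation from the hunk above.
// Names are hypothetical; the char width of 8 bits is an assumption.
constexpr unsigned kCharWidth = 8;

// Data vectors: N-bit vector of EltBits-bit elements.
constexpr unsigned fixedDataElts(unsigned VecBits, unsigned EltBits) {
  return VecBits / EltBits;
}

// Predicates: VecBits / 8 predicate bits, packed into 8-bit elements.
constexpr unsigned fixedPredicateElts(unsigned VecBits) {
  return VecBits / (kCharWidth * kCharWidth);
}

static_assert(fixedDataElts(128, 64) == 2, "fixed_int64_t at 128 bits");
static_assert(fixedDataElts(512, 16) == 32, "fixed_bfloat16_t at 512 bits");
static_assert(fixedPredicateElts(128) == 2, "fixed_bool_t at 128 bits");
static_assert(fixedPredicateElts(256) == 4, "fixed_bool_t at 256 bits");
static_assert(fixedPredicateElts(512) == 8, "fixed_bool_t at 512 bits");
```

These counts agree with the fixed vector types that appear in the CHECK lines of attr-arm-sve-vector-bits-bitcast.c, for example <2 x i8>, <4 x i8> and <8 x i8> for fixed_bool_t.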

clang/test/CodeGen/attr-arm-sve-vector-bits-bitcast.c

This file was added.

				// NOTE: Assertions have been autogenerated by utils/update_cc_test_checks.py
				// REQUIRES: aarch64-registered-target
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=128 -fallow-half-arguments-and-returns -S -O1 -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-128
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=256 -fallow-half-arguments-and-returns -S -O1 -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-256
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=512 -fallow-half-arguments-and-returns -S -O1 -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-512

				#include <arm_sve.h>

				#define N __ARM_FEATURE_SVE_BITS_EXPERIMENTAL

				typedef svint64_t fixed_int64_t __attribute__((arm_sve_vector_bits(N)));
				typedef svfloat64_t fixed_float64_t __attribute__((arm_sve_vector_bits(N)));
				typedef svbfloat16_t fixed_bfloat16_t __attribute__((arm_sve_vector_bits(N)));
				typedef svbool_t fixed_bool_t __attribute__((arm_sve_vector_bits(N)));

				#define DEFINE_STRUCT(ty) \
				struct struct_##ty { \
				fixed_##ty##_t x, y[3]; \
				} struct_##ty;

				DEFINE_STRUCT(int64)
				DEFINE_STRUCT(float64)
				DEFINE_STRUCT(bfloat16)
				DEFINE_STRUCT(bool)

				//===----------------------------------------------------------------------===//
				// int64
				//===----------------------------------------------------------------------===//

				// CHECK-128-LABEL: @read_int64(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_INT64:%.*]], %struct.struct_int64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <2 x i64>* [[ARRAYIDX]] to <vscale x 2 x i64>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <vscale x 2 x i64>, <vscale x 2 x i64>* [[TMP0]], align 16, !tbaa !2
				// CHECK-128-NEXT: ret <vscale x 2 x i64> [[TMP1]]
				//
				// CHECK-256-LABEL: @read_int64(
				// CHECK-256-NEXT: entry:
				// CHECK-256-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_INT64:%.*]], %struct.struct_int64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-256-NEXT: [[TMP0:%.*]] = bitcast <4 x i64>* [[ARRAYIDX]] to <vscale x 2 x i64>*
				// CHECK-256-NEXT: [[TMP1:%.*]] = load <vscale x 2 x i64>, <vscale x 2 x i64>* [[TMP0]], align 16, !tbaa !2
				// CHECK-256-NEXT: ret <vscale x 2 x i64> [[TMP1]]
				//
				// CHECK-512-LABEL: @read_int64(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_INT64:%.*]], %struct.struct_int64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <8 x i64>* [[ARRAYIDX]] to <vscale x 2 x i64>*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load <vscale x 2 x i64>, <vscale x 2 x i64>* [[TMP0]], align 16, !tbaa !2
				// CHECK-512-NEXT: ret <vscale x 2 x i64> [[TMP1]]
				//
				svint64_t read_int64(struct struct_int64 *s) {
				return s->y[0];
				}

				// CHECK-128-LABEL: @write_int64(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 2 x i64>, align 16
				// CHECK-128-NEXT: store <vscale x 2 x i64> [[X:%.*]], <vscale x 2 x i64>* [[X_ADDR]], align 16, !tbaa !5
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <vscale x 2 x i64>* [[X_ADDR]] to <2 x i64>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <2 x i64>, <2 x i64>* [[TMP0]], align 16, !tbaa !2
				// CHECK-128-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_INT64:%.*]], %struct.struct_int64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-128-NEXT: store <2 x i64> [[TMP1]], <2 x i64>* [[ARRAYIDX]], align 16, !tbaa !2
				// CHECK-128-NEXT: ret void
				//
				// CHECK-256-LABEL: @write_int64(
				// CHECK-256-NEXT: entry:
				// CHECK-256-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 2 x i64>, align 16
				// CHECK-256-NEXT: store <vscale x 2 x i64> [[X:%.*]], <vscale x 2 x i64>* [[X_ADDR]], align 16, !tbaa !5
				// CHECK-256-NEXT: [[TMP0:%.*]] = bitcast <vscale x 2 x i64>* [[X_ADDR]] to <4 x i64>*
				// CHECK-256-NEXT: [[TMP1:%.*]] = load <4 x i64>, <4 x i64>* [[TMP0]], align 16, !tbaa !2
				// CHECK-256-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_INT64:%.*]], %struct.struct_int64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-256-NEXT: store <4 x i64> [[TMP1]], <4 x i64>* [[ARRAYIDX]], align 16, !tbaa !2
				// CHECK-256-NEXT: ret void
				//
				// CHECK-512-LABEL: @write_int64(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 2 x i64>, align 16
				// CHECK-512-NEXT: store <vscale x 2 x i64> [[X:%.*]], <vscale x 2 x i64>* [[X_ADDR]], align 16, !tbaa !5
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <vscale x 2 x i64>* [[X_ADDR]] to <8 x i64>*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load <8 x i64>, <8 x i64>* [[TMP0]], align 16, !tbaa !2
				// CHECK-512-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_INT64:%.*]], %struct.struct_int64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-512-NEXT: store <8 x i64> [[TMP1]], <8 x i64>* [[ARRAYIDX]], align 16, !tbaa !2
				// CHECK-512-NEXT: ret void
				//
				void write_int64(struct struct_int64 *s, svint64_t x) {
				s->y[0] = x;
				}

				//===----------------------------------------------------------------------===//
				// float64
				//===----------------------------------------------------------------------===//

				// CHECK-128-LABEL: @read_float64(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_FLOAT64:%.*]], %struct.struct_float64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <2 x double>* [[ARRAYIDX]] to <vscale x 2 x double>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[TMP0]], align 16, !tbaa !2
				// CHECK-128-NEXT: ret <vscale x 2 x double> [[TMP1]]
				//
				// CHECK-256-LABEL: @read_float64(
				// CHECK-256-NEXT: entry:
				// CHECK-256-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_FLOAT64:%.*]], %struct.struct_float64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-256-NEXT: [[TMP0:%.*]] = bitcast <4 x double>* [[ARRAYIDX]] to <vscale x 2 x double>*
				// CHECK-256-NEXT: [[TMP1:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[TMP0]], align 16, !tbaa !2
				// CHECK-256-NEXT: ret <vscale x 2 x double> [[TMP1]]
				//
				// CHECK-512-LABEL: @read_float64(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_FLOAT64:%.*]], %struct.struct_float64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <8 x double>* [[ARRAYIDX]] to <vscale x 2 x double>*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[TMP0]], align 16, !tbaa !2
				// CHECK-512-NEXT: ret <vscale x 2 x double> [[TMP1]]
				//
				svfloat64_t read_float64(struct struct_float64 *s) {
				return s->y[0];
				}

				// CHECK-128-LABEL: @write_float64(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-128-NEXT: store <vscale x 2 x double> [[X:%.*]], <vscale x 2 x double>* [[X_ADDR]], align 16, !tbaa !7
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <vscale x 2 x double>* [[X_ADDR]] to <2 x double>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <2 x double>, <2 x double>* [[TMP0]], align 16, !tbaa !2
				// CHECK-128-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_FLOAT64:%.*]], %struct.struct_float64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-128-NEXT: store <2 x double> [[TMP1]], <2 x double>* [[ARRAYIDX]], align 16, !tbaa !2
				// CHECK-128-NEXT: ret void
				//
				// CHECK-256-LABEL: @write_float64(
				// CHECK-256-NEXT: entry:
				// CHECK-256-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-256-NEXT: store <vscale x 2 x double> [[X:%.*]], <vscale x 2 x double>* [[X_ADDR]], align 16, !tbaa !7
				// CHECK-256-NEXT: [[TMP0:%.*]] = bitcast <vscale x 2 x double>* [[X_ADDR]] to <4 x double>*
				// CHECK-256-NEXT: [[TMP1:%.*]] = load <4 x double>, <4 x double>* [[TMP0]], align 16, !tbaa !2
				// CHECK-256-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_FLOAT64:%.*]], %struct.struct_float64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-256-NEXT: store <4 x double> [[TMP1]], <4 x double>* [[ARRAYIDX]], align 16, !tbaa !2
				// CHECK-256-NEXT: ret void
				//
				// CHECK-512-LABEL: @write_float64(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-512-NEXT: store <vscale x 2 x double> [[X:%.*]], <vscale x 2 x double>* [[X_ADDR]], align 16, !tbaa !7
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <vscale x 2 x double>* [[X_ADDR]] to <8 x double>*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load <8 x double>, <8 x double>* [[TMP0]], align 16, !tbaa !2
				// CHECK-512-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_FLOAT64:%.*]], %struct.struct_float64* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-512-NEXT: store <8 x double> [[TMP1]], <8 x double>* [[ARRAYIDX]], align 16, !tbaa !2
				// CHECK-512-NEXT: ret void
				//
				void write_float64(struct struct_float64 *s, svfloat64_t x) {
				s->y[0] = x;
				}

				//===----------------------------------------------------------------------===//
				// bfloat16
				//===----------------------------------------------------------------------===//

				// CHECK-128-LABEL: @read_bfloat16(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BFLOAT16:%.*]], %struct.struct_bfloat16* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <8 x bfloat>* [[ARRAYIDX]] to <vscale x 8 x bfloat>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <vscale x 8 x bfloat>, <vscale x 8 x bfloat>* [[TMP0]], align 16, !tbaa !2
				// CHECK-128-NEXT: ret <vscale x 8 x bfloat> [[TMP1]]
				//
				// CHECK-256-LABEL: @read_bfloat16(
				// CHECK-256-NEXT: entry:
				// CHECK-256-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BFLOAT16:%.*]], %struct.struct_bfloat16* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-256-NEXT: [[TMP0:%.*]] = bitcast <16 x bfloat>* [[ARRAYIDX]] to <vscale x 8 x bfloat>*
				// CHECK-256-NEXT: [[TMP1:%.*]] = load <vscale x 8 x bfloat>, <vscale x 8 x bfloat>* [[TMP0]], align 16, !tbaa !2
				// CHECK-256-NEXT: ret <vscale x 8 x bfloat> [[TMP1]]
				//
				// CHECK-512-LABEL: @read_bfloat16(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BFLOAT16:%.*]], %struct.struct_bfloat16* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <32 x bfloat>* [[ARRAYIDX]] to <vscale x 8 x bfloat>*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load <vscale x 8 x bfloat>, <vscale x 8 x bfloat>* [[TMP0]], align 16, !tbaa !2
				// CHECK-512-NEXT: ret <vscale x 8 x bfloat> [[TMP1]]
				//
				svbfloat16_t read_bfloat16(struct struct_bfloat16 *s) {
				return s->y[0];
				}

				// CHECK-128-LABEL: @write_bfloat16(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 8 x bfloat>, align 16
				// CHECK-128-NEXT: store <vscale x 8 x bfloat> [[X:%.*]], <vscale x 8 x bfloat>* [[X_ADDR]], align 16, !tbaa !9
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <vscale x 8 x bfloat>* [[X_ADDR]] to <8 x bfloat>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <8 x bfloat>, <8 x bfloat>* [[TMP0]], align 16, !tbaa !2
				// CHECK-128-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BFLOAT16:%.*]], %struct.struct_bfloat16* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-128-NEXT: store <8 x bfloat> [[TMP1]], <8 x bfloat>* [[ARRAYIDX]], align 16, !tbaa !2
				// CHECK-128-NEXT: ret void
				//
				// CHECK-256-LABEL: @write_bfloat16(
				// CHECK-256-NEXT: entry:
				// CHECK-256-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 8 x bfloat>, align 16
				// CHECK-256-NEXT: store <vscale x 8 x bfloat> [[X:%.*]], <vscale x 8 x bfloat>* [[X_ADDR]], align 16, !tbaa !9
				// CHECK-256-NEXT: [[TMP0:%.*]] = bitcast <vscale x 8 x bfloat>* [[X_ADDR]] to <16 x bfloat>*
				// CHECK-256-NEXT: [[TMP1:%.*]] = load <16 x bfloat>, <16 x bfloat>* [[TMP0]], align 16, !tbaa !2
				// CHECK-256-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BFLOAT16:%.*]], %struct.struct_bfloat16* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-256-NEXT: store <16 x bfloat> [[TMP1]], <16 x bfloat>* [[ARRAYIDX]], align 16, !tbaa !2
				// CHECK-256-NEXT: ret void
				//
				// CHECK-512-LABEL: @write_bfloat16(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 8 x bfloat>, align 16
				// CHECK-512-NEXT: store <vscale x 8 x bfloat> [[X:%.*]], <vscale x 8 x bfloat>* [[X_ADDR]], align 16, !tbaa !9
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <vscale x 8 x bfloat>* [[X_ADDR]] to <32 x bfloat>*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load <32 x bfloat>, <32 x bfloat>* [[TMP0]], align 16, !tbaa !2
				// CHECK-512-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BFLOAT16:%.*]], %struct.struct_bfloat16* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-512-NEXT: store <32 x bfloat> [[TMP1]], <32 x bfloat>* [[ARRAYIDX]], align 16, !tbaa !2
				// CHECK-512-NEXT: ret void
				//
				void write_bfloat16(struct struct_bfloat16 *s, svbfloat16_t x) {
				s->y[0] = x;
				}

				//===----------------------------------------------------------------------===//
				// bool
				//===----------------------------------------------------------------------===//

				// CHECK-128-LABEL: @read_bool(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.*]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <2 x i8>* [[ARRAYIDX]] to <vscale x 16 x i1>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[TMP0]], align 2, !tbaa !2
				// CHECK-128-NEXT: ret <vscale x 16 x i1> [[TMP1]]
				//
				// CHECK-256-LABEL: @read_bool(
				// CHECK-256-NEXT: entry:
				// CHECK-256-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.*]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-256-NEXT: [[TMP0:%.*]] = bitcast <4 x i8>* [[ARRAYIDX]] to <vscale x 16 x i1>*
				// CHECK-256-NEXT: [[TMP1:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[TMP0]], align 2, !tbaa !2
				// CHECK-256-NEXT: ret <vscale x 16 x i1> [[TMP1]]
				//
				// CHECK-512-LABEL: @read_bool(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.*]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <8 x i8>* [[ARRAYIDX]] to <vscale x 16 x i1>*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[TMP0]], align 2, !tbaa !2
				// CHECK-512-NEXT: ret <vscale x 16 x i1> [[TMP1]]
				//
				svbool_t read_bool(struct struct_bool *s) {
				return s->y[0];
				}

				// CHECK-128-LABEL: @write_bool(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-128-NEXT: store <vscale x 16 x i1> [[X:%.*]], <vscale x 16 x i1>* [[X_ADDR]], align 16, !tbaa !11
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <vscale x 16 x i1>* [[X_ADDR]] to <2 x i8>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <2 x i8>, <2 x i8>* [[TMP0]], align 16, !tbaa !2
				// CHECK-128-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.*]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1, i64 0
				// CHECK-128-NEXT: store <2 x i8> [[TMP1]], <2 x i8>* [[ARRAYIDX]], align 2, !tbaa !2
				// CHECK-128-NEXT: ret void
				//
				// CHECK-256-LABEL: @write_bool(
				// CHECK-256-NEXT: entry:
				// CHECK-256-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-256-NEXT: store <vscale x 16 x i1> [[X:%.*]], <vscale x 16 x i1>* [[X_ADDR]], align 16, !tbaa !11
				// CHECK-256-NEXT: [[TMP0:%.*]] = bitcast <vscale x 16 x i1>* [[X_ADDR]] to i32*
				// CHECK-256-NEXT: [[TMP1:%.*]] = load i32, i32* [[TMP0]], align 16, !tbaa !2
				// CHECK-256-NEXT: [[Y:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.*]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1
				// CHECK-256-NEXT: [[TMP2:%.*]] = bitcast [3 x <4 x i8>]* [[Y]] to i32*
				// CHECK-256-NEXT: store i32 [[TMP1]], i32* [[TMP2]], align 2, !tbaa !2
				// CHECK-256-NEXT: ret void
				//
				// CHECK-512-LABEL: @write_bool(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-512-NEXT: store <vscale x 16 x i1> [[X:%.*]], <vscale x 16 x i1>* [[X_ADDR]], align 16, !tbaa !11
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <vscale x 16 x i1>* [[X_ADDR]] to i64*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load i64, i64* [[TMP0]], align 16, !tbaa !2
				// CHECK-512-NEXT: [[Y:%.*]] = getelementptr inbounds [[STRUCT_STRUCT_BOOL:%.*]], %struct.struct_bool* [[S:%.*]], i64 0, i32 1
				// CHECK-512-NEXT: [[TMP2:%.*]] = bitcast [3 x <8 x i8>]* [[Y]] to i64*
				// CHECK-512-NEXT: store i64 [[TMP1]], i64* [[TMP2]], align 2, !tbaa !2
				// CHECK-512-NEXT: ret void
				//
				void write_bool(struct struct_bool *s, svbool_t x) {
				s->y[0] = x;
				}

clang/test/CodeGen/attr-arm-sve-vector-bits-call.c

This file was added.

				// NOTE: Assertions have been autogenerated by utils/update_cc_test_checks.py
				// REQUIRES: aarch64-registered-target
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -msve-vector-bits=512 -fallow-half-arguments-and-returns -S -O1 -emit-llvm -o - %s | FileCheck %s

				#include <arm_sve.h>

				#define N __ARM_FEATURE_SVE_BITS_EXPERIMENTAL

				typedef svint32_t fixed_int32_t __attribute__((arm_sve_vector_bits(N)));
				typedef svfloat64_t fixed_float64_t __attribute__((arm_sve_vector_bits(N)));
				typedef svbool_t fixed_bool_t __attribute__((arm_sve_vector_bits(N)));

				//===----------------------------------------------------------------------===//
				// Test caller/callee with VLST <-> VLAT
				//===----------------------------------------------------------------------===//

				// CHECK-LABEL: @sizeless_callee(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: ret <vscale x 4 x i32> [[X:%.*]]
				//
				svint32_t sizeless_callee(svint32_t x) {
				return x;
				}

				// CHECK-LABEL: @fixed_caller(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[X:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[X_ADDR:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <16 x i32>* [[X]] to <vscale x 4 x i32>*
				// CHECK-NEXT: store <vscale x 4 x i32> [[X_COERCE:%.*]], <vscale x 4 x i32>* [[TMP0]], align 16
				// CHECK-NEXT: [[X1:%.*]] = load <16 x i32>, <16 x i32>* [[X]], align 16, !tbaa !2
				// CHECK-NEXT: store <16 x i32> [[X1]], <16 x i32>* [[X_ADDR]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <16 x i32>* [[X_ADDR]] to <vscale x 4 x i32>*
				// CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[TMP1]], align 16, !tbaa !2
				// CHECK-NEXT: store <vscale x 4 x i32> [[TMP2]], <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !5
				// CHECK-NEXT: [[CASTFIXEDSVE:%.*]] = bitcast <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]] to <16 x i32>*
				// CHECK-NEXT: [[TMP3:%.*]] = load <16 x i32>, <16 x i32>* [[CASTFIXEDSVE]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to <16 x i32>*
				// CHECK-NEXT: store <16 x i32> [[TMP3]], <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP4:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP4]]
				//
				fixed_int32_t fixed_caller(fixed_int32_t x) {
				return sizeless_callee(x);
				}

				// CHECK-LABEL: @fixed_callee(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[X:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <16 x i32>* [[X]] to <vscale x 4 x i32>*
				// CHECK-NEXT: store <vscale x 4 x i32> [[X_COERCE:%.*]], <vscale x 4 x i32>* [[TMP0]], align 16
				// CHECK-NEXT: [[X1:%.*]] = load <16 x i32>, <16 x i32>* [[X]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to <16 x i32>*
				// CHECK-NEXT: store <16 x i32> [[X1]], <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP1:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP1]]
				//
				fixed_int32_t fixed_callee(fixed_int32_t x) {
				return x;
				}

				// CHECK-LABEL: @sizeless_caller(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[X_ADDR:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[COERCE_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[COERCE1:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <16 x i32>, align 64
				// CHECK-NEXT: store <vscale x 4 x i32> [[X:%.*]], <vscale x 4 x i32>* [[X_ADDR]], align 16, !tbaa !5
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <vscale x 4 x i32>* [[X_ADDR]] to <16 x i32>*
				// CHECK-NEXT: [[TMP1:%.*]] = load <16 x i32>, <16 x i32>* [[TMP0]], align 16, !tbaa !2
				// CHECK-NEXT: [[COERCE_0__SROA_CAST:%.*]] = bitcast <vscale x 4 x i32>* [[COERCE_COERCE]] to <16 x i32>*
				// CHECK-NEXT: store <16 x i32> [[TMP1]], <16 x i32>* [[COERCE_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[COERCE_COERCE]], align 16
				// CHECK-NEXT: [[CALL:%.*]] = call <vscale x 4 x i32> @fixed_callee(<vscale x 4 x i32> [[TMP2]])
				// CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i32>* [[COERCE1]] to <vscale x 4 x i32>*
				// CHECK-NEXT: store <vscale x 4 x i32> [[CALL]], <vscale x 4 x i32>* [[TMP3]], align 16
				// CHECK-NEXT: [[TMP4:%.*]] = load <16 x i32>, <16 x i32>* [[COERCE1]], align 16, !tbaa !2
				// CHECK-NEXT: store <16 x i32> [[TMP4]], <16 x i32>* [[SAVED_CALL_RVALUE]], align 64, !tbaa !2
				// CHECK-NEXT: [[CASTFIXEDSVE:%.*]] = bitcast <16 x i32>* [[SAVED_CALL_RVALUE]] to <vscale x 4 x i32>*
				// CHECK-NEXT: [[TMP5:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[CASTFIXEDSVE]], align 64, !tbaa !2
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP5]]
				//
				svint32_t sizeless_caller(svint32_t x) {
				return fixed_callee(x);
				}

				//===----------------------------------------------------------------------===//
				// fixed, fixed
				//===----------------------------------------------------------------------===//

				// CHECK-LABEL: @call_int32_ff(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[OP1:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[OP2:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[OP2_ADDR:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <16 x i32>* [[OP1]] to <vscale x 4 x i32>*
				// CHECK-NEXT: store <vscale x 4 x i32> [[OP1_COERCE:%.*]], <vscale x 4 x i32>* [[TMP0]], align 16
				// CHECK-NEXT: [[OP11:%.*]] = load <16 x i32>, <16 x i32>* [[OP1]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <16 x i32>* [[OP2]] to <vscale x 4 x i32>*
				// CHECK-NEXT: store <vscale x 4 x i32> [[OP2_COERCE:%.*]], <vscale x 4 x i32>* [[TMP1]], align 16
				// CHECK-NEXT: [[OP22:%.*]] = load <16 x i32>, <16 x i32>* [[OP2]], align 16, !tbaa !2
				// CHECK-NEXT: store <16 x i32> [[OP11]], <16 x i32>* [[OP1_ADDR]], align 16, !tbaa !2
				// CHECK-NEXT: store <16 x i32> [[OP22]], <16 x i32>* [[OP2_ADDR]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP2:%.*]] = bitcast <16 x i32>* [[OP1_ADDR]] to <vscale x 4 x i32>*
				// CHECK-NEXT: [[TMP3:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[TMP2]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP4:%.*]] = bitcast <16 x i32>* [[OP2_ADDR]] to <vscale x 4 x i32>*
				// CHECK-NEXT: [[TMP5:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[TMP4]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP6:%.*]] = call <vscale x 4 x i1> @llvm.aarch64.sve.convert.from.svbool.nxv4i1(<vscale x 16 x i1> [[PG:%.*]])
				// CHECK-NEXT: [[TMP7:%.*]] = call <vscale x 4 x i32> @llvm.aarch64.sve.sel.nxv4i32(<vscale x 4 x i1> [[TMP6]], <vscale x 4 x i32> [[TMP3]], <vscale x 4 x i32> [[TMP5]])
				// CHECK-NEXT: store <vscale x 4 x i32> [[TMP7]], <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !5
				// CHECK-NEXT: [[CASTFIXEDSVE:%.*]] = bitcast <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]] to <16 x i32>*
				// CHECK-NEXT: [[TMP8:%.*]] = load <16 x i32>, <16 x i32>* [[CASTFIXEDSVE]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to <16 x i32>*
				// CHECK-NEXT: store <16 x i32> [[TMP8]], <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP9:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP9]]
				//
				fixed_int32_t call_int32_ff(svbool_t pg, fixed_int32_t op1, fixed_int32_t op2) {
				return svsel(pg, op1, op2);
				}

				// CHECK-LABEL: @call_float64_ff(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[OP1:%.*]] = alloca <8 x double>, align 16
				// CHECK-NEXT: [[OP2:%.*]] = alloca <8 x double>, align 16
				// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <8 x double>, align 16
				// CHECK-NEXT: [[OP2_ADDR:%.*]] = alloca <8 x double>, align 16
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <8 x double>* [[OP1]] to <vscale x 2 x double>*
				// CHECK-NEXT: store <vscale x 2 x double> [[OP1_COERCE:%.*]], <vscale x 2 x double>* [[TMP0]], align 16
				// CHECK-NEXT: [[OP11:%.*]] = load <8 x double>, <8 x double>* [[OP1]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <8 x double>* [[OP2]] to <vscale x 2 x double>*
				// CHECK-NEXT: store <vscale x 2 x double> [[OP2_COERCE:%.*]], <vscale x 2 x double>* [[TMP1]], align 16
				// CHECK-NEXT: [[OP22:%.*]] = load <8 x double>, <8 x double>* [[OP2]], align 16, !tbaa !2
				// CHECK-NEXT: store <8 x double> [[OP11]], <8 x double>* [[OP1_ADDR]], align 16, !tbaa !2
				// CHECK-NEXT: store <8 x double> [[OP22]], <8 x double>* [[OP2_ADDR]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP2:%.*]] = bitcast <8 x double>* [[OP1_ADDR]] to <vscale x 2 x double>*
				// CHECK-NEXT: [[TMP3:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[TMP2]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP4:%.*]] = bitcast <8 x double>* [[OP2_ADDR]] to <vscale x 2 x double>*
				// CHECK-NEXT: [[TMP5:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[TMP4]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP6:%.*]] = call <vscale x 2 x i1> @llvm.aarch64.sve.convert.from.svbool.nxv2i1(<vscale x 16 x i1> [[PG:%.*]])
				// CHECK-NEXT: [[TMP7:%.*]] = call <vscale x 2 x double> @llvm.aarch64.sve.sel.nxv2f64(<vscale x 2 x i1> [[TMP6]], <vscale x 2 x double> [[TMP3]], <vscale x 2 x double> [[TMP5]])
				// CHECK-NEXT: store <vscale x 2 x double> [[TMP7]], <vscale x 2 x double>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !7
				// CHECK-NEXT: [[CASTFIXEDSVE:%.*]] = bitcast <vscale x 2 x double>* [[SAVED_CALL_RVALUE]] to <8 x double>*
				// CHECK-NEXT: [[TMP8:%.*]] = load <8 x double>, <8 x double>* [[CASTFIXEDSVE]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 2 x double>* [[RETVAL_COERCE]] to <8 x double>*
				// CHECK-NEXT: store <8 x double> [[TMP8]], <8 x double>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP9:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 2 x double> [[TMP9]]
				//
				fixed_float64_t call_float64_ff(svbool_t pg, fixed_float64_t op1, fixed_float64_t op2) {
				return svsel(pg, op1, op2);
				}

				// CHECK-LABEL: @call_bool_ff(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[OP1:%.*]] = alloca <8 x i8>, align 16
				// CHECK-NEXT: [[OP2:%.*]] = alloca <8 x i8>, align 16
				// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <8 x i8>, align 16
				// CHECK-NEXT: [[OP2_ADDR:%.*]] = alloca <8 x i8>, align 16
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <8 x i8>* [[OP1]] to <vscale x 16 x i1>*
				// CHECK-NEXT: store <vscale x 16 x i1> [[OP1_COERCE:%.*]], <vscale x 16 x i1>* [[TMP0]], align 16
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <8 x i8>* [[OP1]] to i64*
				// CHECK-NEXT: [[OP113:%.*]] = load i64, i64* [[TMP1]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP2:%.*]] = bitcast <8 x i8>* [[OP2]] to <vscale x 16 x i1>*
				// CHECK-NEXT: store <vscale x 16 x i1> [[OP2_COERCE:%.*]], <vscale x 16 x i1>* [[TMP2]], align 16
				// CHECK-NEXT: [[TMP3:%.*]] = bitcast <8 x i8>* [[OP2]] to i64*
				// CHECK-NEXT: [[OP224:%.*]] = load i64, i64* [[TMP3]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP4:%.*]] = bitcast <8 x i8>* [[OP1_ADDR]] to i64*
				// CHECK-NEXT: store i64 [[OP113]], i64* [[TMP4]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP5:%.*]] = bitcast <8 x i8>* [[OP2_ADDR]] to i64*
				// CHECK-NEXT: store i64 [[OP224]], i64* [[TMP5]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP6:%.*]] = bitcast <8 x i8>* [[OP1_ADDR]] to <vscale x 16 x i1>*
				// CHECK-NEXT: [[TMP7:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[TMP6]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP8:%.*]] = bitcast <8 x i8>* [[OP2_ADDR]] to <vscale x 16 x i1>*
				// CHECK-NEXT: [[TMP9:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[TMP8]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP10:%.*]] = call <vscale x 16 x i1> @llvm.aarch64.sve.sel.nxv16i1(<vscale x 16 x i1> [[PG:%.*]], <vscale x 16 x i1> [[TMP7]], <vscale x 16 x i1> [[TMP9]])
				// CHECK-NEXT: store <vscale x 16 x i1> [[TMP10]], <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !9
				// CHECK-NEXT: [[TMP11:%.*]] = bitcast <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]] to i64*
				// CHECK-NEXT: [[TMP12:%.*]] = load i64, i64* [[TMP11]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP13:%.*]] = bitcast <vscale x 16 x i1>* [[RETVAL_COERCE]] to i64*
				// CHECK-NEXT: store i64 [[TMP12]], i64* [[TMP13]], align 16
				// CHECK-NEXT: [[TMP14:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP14]]
				//
				fixed_bool_t call_bool_ff(svbool_t pg, fixed_bool_t op1, fixed_bool_t op2) {
				return svsel(pg, op1, op2);
				}

				//===----------------------------------------------------------------------===//
				// fixed, scalable
				//===----------------------------------------------------------------------===//

				// CHECK-LABEL: @call_int32_fs(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[OP1:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <16 x i32>* [[OP1]] to <vscale x 4 x i32>*
				// CHECK-NEXT: store <vscale x 4 x i32> [[OP1_COERCE:%.*]], <vscale x 4 x i32>* [[TMP0]], align 16
				// CHECK-NEXT: [[OP11:%.*]] = load <16 x i32>, <16 x i32>* [[OP1]], align 16, !tbaa !2
				// CHECK-NEXT: store <16 x i32> [[OP11]], <16 x i32>* [[OP1_ADDR]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <16 x i32>* [[OP1_ADDR]] to <vscale x 4 x i32>*
				// CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[TMP1]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP3:%.*]] = call <vscale x 4 x i1> @llvm.aarch64.sve.convert.from.svbool.nxv4i1(<vscale x 16 x i1> [[PG:%.*]])
				// CHECK-NEXT: [[TMP4:%.*]] = call <vscale x 4 x i32> @llvm.aarch64.sve.sel.nxv4i32(<vscale x 4 x i1> [[TMP3]], <vscale x 4 x i32> [[TMP2]], <vscale x 4 x i32> [[OP2:%.*]])
				// CHECK-NEXT: store <vscale x 4 x i32> [[TMP4]], <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !5
				// CHECK-NEXT: [[CASTFIXEDSVE:%.*]] = bitcast <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]] to <16 x i32>*
				// CHECK-NEXT: [[TMP5:%.*]] = load <16 x i32>, <16 x i32>* [[CASTFIXEDSVE]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to <16 x i32>*
				// CHECK-NEXT: store <16 x i32> [[TMP5]], <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP6:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP6]]
				//
				fixed_int32_t call_int32_fs(svbool_t pg, fixed_int32_t op1, svint32_t op2) {
				return svsel(pg, op1, op2);
				}

				// CHECK-LABEL: @call_float64_fs(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[OP1:%.*]] = alloca <8 x double>, align 16
				// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <8 x double>, align 16
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <8 x double>* [[OP1]] to <vscale x 2 x double>*
				// CHECK-NEXT: store <vscale x 2 x double> [[OP1_COERCE:%.*]], <vscale x 2 x double>* [[TMP0]], align 16
				// CHECK-NEXT: [[OP11:%.*]] = load <8 x double>, <8 x double>* [[OP1]], align 16, !tbaa !2
				// CHECK-NEXT: store <8 x double> [[OP11]], <8 x double>* [[OP1_ADDR]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <8 x double>* [[OP1_ADDR]] to <vscale x 2 x double>*
				// CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[TMP1]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP3:%.*]] = call <vscale x 2 x i1> @llvm.aarch64.sve.convert.from.svbool.nxv2i1(<vscale x 16 x i1> [[PG:%.*]])
				// CHECK-NEXT: [[TMP4:%.*]] = call <vscale x 2 x double> @llvm.aarch64.sve.sel.nxv2f64(<vscale x 2 x i1> [[TMP3]], <vscale x 2 x double> [[TMP2]], <vscale x 2 x double> [[OP2:%.*]])
				// CHECK-NEXT: store <vscale x 2 x double> [[TMP4]], <vscale x 2 x double>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !7
				// CHECK-NEXT: [[CASTFIXEDSVE:%.*]] = bitcast <vscale x 2 x double>* [[SAVED_CALL_RVALUE]] to <8 x double>*
				// CHECK-NEXT: [[TMP5:%.*]] = load <8 x double>, <8 x double>* [[CASTFIXEDSVE]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 2 x double>* [[RETVAL_COERCE]] to <8 x double>*
				// CHECK-NEXT: store <8 x double> [[TMP5]], <8 x double>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP6:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 2 x double> [[TMP6]]
				//
				fixed_float64_t call_float64_fs(svbool_t pg, fixed_float64_t op1, svfloat64_t op2) {
				return svsel(pg, op1, op2);
				}

				// CHECK-LABEL: @call_bool_fs(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[OP1:%.*]] = alloca <8 x i8>, align 16
				// CHECK-NEXT: [[OP1_ADDR:%.*]] = alloca <8 x i8>, align 16
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <8 x i8>* [[OP1]] to <vscale x 16 x i1>*
				// CHECK-NEXT: store <vscale x 16 x i1> [[OP1_COERCE:%.*]], <vscale x 16 x i1>* [[TMP0]], align 16
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <8 x i8>* [[OP1]] to i64*
				// CHECK-NEXT: [[OP112:%.*]] = load i64, i64* [[TMP1]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP2:%.*]] = bitcast <8 x i8>* [[OP1_ADDR]] to i64*
				// CHECK-NEXT: store i64 [[OP112]], i64* [[TMP2]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP3:%.*]] = bitcast <8 x i8>* [[OP1_ADDR]] to <vscale x 16 x i1>*
				// CHECK-NEXT: [[TMP4:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[TMP3]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP5:%.*]] = call <vscale x 16 x i1> @llvm.aarch64.sve.sel.nxv16i1(<vscale x 16 x i1> [[PG:%.*]], <vscale x 16 x i1> [[TMP4]], <vscale x 16 x i1> [[OP2:%.*]])
				// CHECK-NEXT: store <vscale x 16 x i1> [[TMP5]], <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !9
				// CHECK-NEXT: [[TMP6:%.*]] = bitcast <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]] to i64*
				// CHECK-NEXT: [[TMP7:%.*]] = load i64, i64* [[TMP6]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP8:%.*]] = bitcast <vscale x 16 x i1>* [[RETVAL_COERCE]] to i64*
				// CHECK-NEXT: store i64 [[TMP7]], i64* [[TMP8]], align 16
				// CHECK-NEXT: [[TMP9:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP9]]
				//
				fixed_bool_t call_bool_fs(svbool_t pg, fixed_bool_t op1, svbool_t op2) {
				return svsel(pg, op1, op2);
				}

				//===----------------------------------------------------------------------===//
				// scalable, scalable
				//===----------------------------------------------------------------------===//

				// CHECK-LABEL: @call_int32_ss(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = call <vscale x 4 x i1> @llvm.aarch64.sve.convert.from.svbool.nxv4i1(<vscale x 16 x i1> [[PG:%.*]])
				// CHECK-NEXT: [[TMP1:%.*]] = call <vscale x 4 x i32> @llvm.aarch64.sve.sel.nxv4i32(<vscale x 4 x i1> [[TMP0]], <vscale x 4 x i32> [[OP1:%.*]], <vscale x 4 x i32> [[OP2:%.*]])
				// CHECK-NEXT: store <vscale x 4 x i32> [[TMP1]], <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !5
				// CHECK-NEXT: [[CASTFIXEDSVE:%.*]] = bitcast <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]] to <16 x i32>*
				// CHECK-NEXT: [[TMP2:%.*]] = load <16 x i32>, <16 x i32>* [[CASTFIXEDSVE]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to <16 x i32>*
				// CHECK-NEXT: store <16 x i32> [[TMP2]], <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP3:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP3]]
				//
				fixed_int32_t call_int32_ss(svbool_t pg, svint32_t op1, svint32_t op2) {
				return svsel(pg, op1, op2);
				}

				// CHECK-LABEL: @call_float64_ss(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = call <vscale x 2 x i1> @llvm.aarch64.sve.convert.from.svbool.nxv2i1(<vscale x 16 x i1> [[PG:%.*]])
				// CHECK-NEXT: [[TMP1:%.*]] = call <vscale x 2 x double> @llvm.aarch64.sve.sel.nxv2f64(<vscale x 2 x i1> [[TMP0]], <vscale x 2 x double> [[OP1:%.*]], <vscale x 2 x double> [[OP2:%.*]])
				// CHECK-NEXT: store <vscale x 2 x double> [[TMP1]], <vscale x 2 x double>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !7
				// CHECK-NEXT: [[CASTFIXEDSVE:%.*]] = bitcast <vscale x 2 x double>* [[SAVED_CALL_RVALUE]] to <8 x double>*
				// CHECK-NEXT: [[TMP2:%.*]] = load <8 x double>, <8 x double>* [[CASTFIXEDSVE]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 2 x double>* [[RETVAL_COERCE]] to <8 x double>*
				// CHECK-NEXT: store <8 x double> [[TMP2]], <8 x double>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP3:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 2 x double> [[TMP3]]
				//
				fixed_float64_t call_float64_ss(svbool_t pg, svfloat64_t op1, svfloat64_t op2) {
				return svsel(pg, op1, op2);
				}

				// CHECK-LABEL: @call_bool_ss(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = call <vscale x 16 x i1> @llvm.aarch64.sve.sel.nxv16i1(<vscale x 16 x i1> [[PG:%.*]], <vscale x 16 x i1> [[OP1:%.*]], <vscale x 16 x i1> [[OP2:%.*]])
				// CHECK-NEXT: store <vscale x 16 x i1> [[TMP0]], <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]], align 16, !tbaa !9
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <vscale x 16 x i1>* [[SAVED_CALL_RVALUE]] to i64*
				// CHECK-NEXT: [[TMP2:%.*]] = load i64, i64* [[TMP1]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP3:%.*]] = bitcast <vscale x 16 x i1>* [[RETVAL_COERCE]] to i64*
				// CHECK-NEXT: store i64 [[TMP2]], i64* [[TMP3]], align 16
				// CHECK-NEXT: [[TMP4:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP4]]
				//
				fixed_bool_t call_bool_ss(svbool_t pg, svbool_t op1, svbool_t op2) {
				return svsel(pg, op1, op2);
				}

clang/test/CodeGen/attr-arm-sve-vector-bits-cast.c

This file was added.

				// NOTE: Assertions have been autogenerated by utils/update_cc_test_checks.py
				// REQUIRES: aarch64-registered-target
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -msve-vector-bits=512 -fallow-half-arguments-and-returns -S -O1 -emit-llvm -o - %s | FileCheck %s

				#include <arm_sve.h>

				#define N __ARM_FEATURE_SVE_BITS_EXPERIMENTAL

				typedef svint32_t fixed_int32_t __attribute__((arm_sve_vector_bits(N)));
				typedef svfloat64_t fixed_float64_t __attribute__((arm_sve_vector_bits(N)));
				typedef svbool_t fixed_bool_t __attribute__((arm_sve_vector_bits(N)));

				// CHECK-LABEL: @to_svint32_t(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TYPE:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <16 x i32>* [[TYPE]] to <vscale x 4 x i32>*
				// CHECK-NEXT: store <vscale x 4 x i32> [[TYPE_COERCE:%.*]], <vscale x 4 x i32>* [[TMP0]], align 16
				// CHECK-NEXT: [[TYPE1:%.*]] = load <16 x i32>, <16 x i32>* [[TYPE]], align 16, !tbaa !2
				// CHECK-NEXT: store <16 x i32> [[TYPE1]], <16 x i32>* [[TYPE_ADDR]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <16 x i32>* [[TYPE_ADDR]] to <vscale x 4 x i32>*
				// CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[TMP1]], align 16, !tbaa !2
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP2]]
				//
				svint32_t to_svint32_t(fixed_int32_t type) {
				return type;
				}

				// CHECK-LABEL: @from_svint32_t(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: store <vscale x 4 x i32> [[TYPE:%.*]], <vscale x 4 x i32>* [[TYPE_ADDR]], align 16, !tbaa !5
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <vscale x 4 x i32>* [[TYPE_ADDR]] to <16 x i32>*
				// CHECK-NEXT: [[TMP1:%.*]] = load <16 x i32>, <16 x i32>* [[TMP0]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to <16 x i32>*
				// CHECK-NEXT: store <16 x i32> [[TMP1]], <16 x i32>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP2]]
				//
				fixed_int32_t from_svint32_t(svint32_t type) {
				return type;
				}

				// CHECK-LABEL: @to_svfloat64_t(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TYPE:%.*]] = alloca <8 x double>, align 16
				// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <8 x double>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <8 x double>* [[TYPE]] to <vscale x 2 x double>*
				// CHECK-NEXT: store <vscale x 2 x double> [[TYPE_COERCE:%.*]], <vscale x 2 x double>* [[TMP0]], align 16
				// CHECK-NEXT: [[TYPE1:%.*]] = load <8 x double>, <8 x double>* [[TYPE]], align 16, !tbaa !2
				// CHECK-NEXT: store <8 x double> [[TYPE1]], <8 x double>* [[TYPE_ADDR]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <8 x double>* [[TYPE_ADDR]] to <vscale x 2 x double>*
				// CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[TMP1]], align 16, !tbaa !2
				// CHECK-NEXT: ret <vscale x 2 x double> [[TMP2]]
				//
				svfloat64_t to_svfloat64_t(fixed_float64_t type) {
				return type;
				}

				// CHECK-LABEL: @from_svfloat64_t(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 2 x double>, align 16
				// CHECK-NEXT: store <vscale x 2 x double> [[TYPE:%.*]], <vscale x 2 x double>* [[TYPE_ADDR]], align 16, !tbaa !7
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <vscale x 2 x double>* [[TYPE_ADDR]] to <8 x double>*
				// CHECK-NEXT: [[TMP1:%.*]] = load <8 x double>, <8 x double>* [[TMP0]], align 16, !tbaa !2
				// CHECK-NEXT: [[RETVAL_0__SROA_CAST:%.*]] = bitcast <vscale x 2 x double>* [[RETVAL_COERCE]] to <8 x double>*
				// CHECK-NEXT: store <8 x double> [[TMP1]], <8 x double>* [[RETVAL_0__SROA_CAST]], align 16
				// CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 2 x double>, <vscale x 2 x double>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 2 x double> [[TMP2]]
				//
				fixed_float64_t from_svfloat64_t(svfloat64_t type) {
				return type;
				}

				// CHECK-LABEL: @to_svbool_t(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TYPE:%.*]] = alloca <8 x i8>, align 16
				// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <8 x i8>, align 16
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <8 x i8>* [[TYPE]] to <vscale x 16 x i1>*
				// CHECK-NEXT: store <vscale x 16 x i1> [[TYPE_COERCE:%.*]], <vscale x 16 x i1>* [[TMP0]], align 16
				// CHECK-NEXT: [[TMP1:%.*]] = bitcast <8 x i8>* [[TYPE]] to i64*
				// CHECK-NEXT: [[TYPE12:%.*]] = load i64, i64* [[TMP1]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP2:%.*]] = bitcast <8 x i8>* [[TYPE_ADDR]] to i64*
				// CHECK-NEXT: store i64 [[TYPE12]], i64* [[TMP2]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP3:%.*]] = bitcast <8 x i8>* [[TYPE_ADDR]] to <vscale x 16 x i1>*
				// CHECK-NEXT: [[TMP4:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[TMP3]], align 16, !tbaa !2
				// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP4]]
				//
				svbool_t to_svbool_t(fixed_bool_t type) {
				return type;
				}

				// CHECK-LABEL: @from_svbool_t(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[TYPE_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-NEXT: store <vscale x 16 x i1> [[TYPE:%.*]], <vscale x 16 x i1>* [[TYPE_ADDR]], align 16, !tbaa !9
				// CHECK-NEXT: [[TMP0:%.*]] = bitcast <vscale x 16 x i1>* [[TYPE_ADDR]] to i64*
				// CHECK-NEXT: [[TMP1:%.*]] = load i64, i64* [[TMP0]], align 16, !tbaa !2
				// CHECK-NEXT: [[TMP2:%.*]] = bitcast <vscale x 16 x i1>* [[RETVAL_COERCE]] to i64*
				// CHECK-NEXT: store i64 [[TMP1]], i64* [[TMP2]], align 16
				// CHECK-NEXT: [[TMP3:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP3]]
				//
				fixed_bool_t from_svbool_t(svbool_t type) {
				return type;
				}

clang/test/CodeGen/attr-arm-sve-vector-bits-codegen.c

This file was added.

				// NOTE: Assertions have been autogenerated by utils/update_cc_test_checks.py
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=512 -fallow-half-arguments-and-returns -S -disable-llvm-passes -emit-llvm -o - %s | FileCheck %s

				#include <arm_sve.h>

				#define N __ARM_FEATURE_SVE_BITS_EXPERIMENTAL

				typedef svint32_t fixed_int32_t __attribute__((arm_sve_vector_bits(N)));
				typedef svbool_t fixed_bool_t __attribute__((arm_sve_vector_bits(N)));

				fixed_bool_t global_pred;
				fixed_int32_t global_vec;

				// CHECK-LABEL: @foo(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[RETVAL:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[PRED_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 2
				// CHECK-NEXT: [[VEC_ADDR:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[PG:%.*]] = alloca <vscale x 16 x i1>, align 2
				// CHECK-NEXT: [[SAVED_CALL_RVALUE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: store <vscale x 16 x i1> [[PRED:%.*]], <vscale x 16 x i1>* [[PRED_ADDR]], align 2
				// CHECK-NEXT: store <vscale x 4 x i32> [[VEC:%.*]], <vscale x 4 x i32>* [[VEC_ADDR]], align 16
				// CHECK-NEXT: [[TMP0:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[PRED_ADDR]], align 2
				// CHECK-NEXT: [[TMP1:%.*]] = load <8 x i8>, <8 x i8>* @global_pred, align 2
				// CHECK-NEXT: [[TMP2:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* bitcast (<8 x i8>* @global_pred to <vscale x 16 x i1>*), align 2
				// CHECK-NEXT: [[TMP3:%.*]] = load <8 x i8>, <8 x i8>* @global_pred, align 2
				// CHECK-NEXT: [[TMP4:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* bitcast (<8 x i8>* @global_pred to <vscale x 16 x i1>*), align 2
				// CHECK-NEXT: [[TMP5:%.*]] = call <vscale x 16 x i1> @llvm.aarch64.sve.and.z.nxv16i1(<vscale x 16 x i1> [[TMP0]], <vscale x 16 x i1> [[TMP2]], <vscale x 16 x i1> [[TMP4]])
				// CHECK-NEXT: store <vscale x 16 x i1> [[TMP5]], <vscale x 16 x i1>* [[PG]], align 2
				// CHECK-NEXT: [[TMP6:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[PG]], align 2
				// CHECK-NEXT: [[TMP7:%.*]] = load <16 x i32>, <16 x i32>* @global_vec, align 16
				// CHECK-NEXT: [[TMP8:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* bitcast (<16 x i32>* @global_vec to <vscale x 4 x i32>*), align 16
				// CHECK-NEXT: [[TMP9:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[VEC_ADDR]], align 16
				// CHECK-NEXT: [[TMP10:%.*]] = call <vscale x 4 x i1> @llvm.aarch64.sve.convert.from.svbool.nxv4i1(<vscale x 16 x i1> [[TMP6]])
				// CHECK-NEXT: [[TMP11:%.*]] = call <vscale x 4 x i32> @llvm.aarch64.sve.add.nxv4i32(<vscale x 4 x i1> [[TMP10]], <vscale x 4 x i32> [[TMP8]], <vscale x 4 x i32> [[TMP9]])
				// CHECK-NEXT: store <vscale x 4 x i32> [[TMP11]], <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]], align 16
				// CHECK-NEXT: [[CASTFIXEDSVE:%.*]] = bitcast <vscale x 4 x i32>* [[SAVED_CALL_RVALUE]] to <16 x i32>*
				// CHECK-NEXT: [[TMP12:%.*]] = load <16 x i32>, <16 x i32>* [[CASTFIXEDSVE]], align 16
				// CHECK-NEXT: store <16 x i32> [[TMP12]], <16 x i32>* [[RETVAL]], align 16
				// CHECK-NEXT: [[TMP13:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to i8*
				// CHECK-NEXT: [[TMP14:%.*]] = bitcast <16 x i32>* [[RETVAL]] to i8*
				// CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[TMP13]], i8* align 16 [[TMP14]], i64 64, i1 false)
				// CHECK-NEXT: [[TMP15:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP15]]
				//
				fixed_int32_t foo(svbool_t pred, svint32_t vec) {
				svbool_t pg = svand_z(pred, global_pred, global_pred);
				return svadd_m(pg, global_vec, vec);
				}

				// CHECK-LABEL: @test_ptr_to_global(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[RETVAL:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[GLOBAL_VEC_PTR:%.*]] = alloca <16 x i32>*, align 8
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: store <16 x i32>* @global_vec, <16 x i32>** [[GLOBAL_VEC_PTR]], align 8
				// CHECK-NEXT: [[TMP0:%.*]] = load <16 x i32>*, <16 x i32>** [[GLOBAL_VEC_PTR]], align 8
				// CHECK-NEXT: [[TMP1:%.*]] = load <16 x i32>, <16 x i32>* [[TMP0]], align 16
				// CHECK-NEXT: store <16 x i32> [[TMP1]], <16 x i32>* [[RETVAL]], align 16
				// CHECK-NEXT: [[TMP2:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to i8*
				// CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i32>* [[RETVAL]] to i8*
				// CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[TMP2]], i8* align 16 [[TMP3]], i64 64, i1 false)
				// CHECK-NEXT: [[TMP4:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP4]]
				//
				fixed_int32_t test_ptr_to_global() {
				fixed_int32_t *global_vec_ptr;
				global_vec_ptr = &global_vec;
				return *global_vec_ptr;
				}

				//
				// Test casting pointer from fixed-length array to scalable vector.
				// CHECK-LABEL: @array_arg(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[RETVAL:%.*]] = alloca <16 x i32>, align 16
				// CHECK-NEXT: [[ARR_ADDR:%.*]] = alloca <16 x i32>*, align 8
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 4 x i32>, align 16
				// CHECK-NEXT: store <16 x i32>* [[ARR:%.*]], <16 x i32>** [[ARR_ADDR]], align 8
				// CHECK-NEXT: [[TMP0:%.*]] = load <16 x i32>*, <16 x i32>** [[ARR_ADDR]], align 8
				// CHECK-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds <16 x i32>, <16 x i32>* [[TMP0]], i64 0
				// CHECK-NEXT: [[TMP1:%.*]] = load <16 x i32>, <16 x i32>* [[ARRAYIDX]], align 16
				// CHECK-NEXT: store <16 x i32> [[TMP1]], <16 x i32>* [[RETVAL]], align 16
				// CHECK-NEXT: [[TMP2:%.*]] = bitcast <vscale x 4 x i32>* [[RETVAL_COERCE]] to i8*
				// CHECK-NEXT: [[TMP3:%.*]] = bitcast <16 x i32>* [[RETVAL]] to i8*
				// CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[TMP2]], i8* align 16 [[TMP3]], i64 64, i1 false)
				// CHECK-NEXT: [[TMP4:%.*]] = load <vscale x 4 x i32>, <vscale x 4 x i32>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 4 x i32> [[TMP4]]
				//
				fixed_int32_t array_arg(fixed_int32_t arr[]) {
				return arr[0];
				}

				// CHECK-LABEL: @address_of_array_idx(
				// CHECK-NEXT: entry:
				// CHECK-NEXT: [[RETVAL:%.*]] = alloca <8 x i8>, align 2
				// CHECK-NEXT: [[ARR:%.*]] = alloca [3 x <8 x i8>], align 2
				// CHECK-NEXT: [[PARR:%.*]] = alloca <8 x i8>*, align 8
				// CHECK-NEXT: [[RETVAL_COERCE:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-NEXT: [[ARRAYIDX:%.*]] = getelementptr inbounds [3 x <8 x i8>], [3 x <8 x i8>]* [[ARR]], i64 0, i64 0
				// CHECK-NEXT: store <8 x i8>* [[ARRAYIDX]], <8 x i8>** [[PARR]], align 8
				// CHECK-NEXT: [[TMP0:%.*]] = load <8 x i8>*, <8 x i8>** [[PARR]], align 8
				// CHECK-NEXT: [[TMP1:%.*]] = load <8 x i8>, <8 x i8>* [[TMP0]], align 2
				// CHECK-NEXT: store <8 x i8> [[TMP1]], <8 x i8>* [[RETVAL]], align 2
				// CHECK-NEXT: [[TMP2:%.*]] = bitcast <vscale x 16 x i1>* [[RETVAL_COERCE]] to i8*
				// CHECK-NEXT: [[TMP3:%.*]] = bitcast <8 x i8>* [[RETVAL]] to i8*
				// CHECK-NEXT: call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 16 [[TMP2]], i8* align 2 [[TMP3]], i64 8, i1 false)
				// CHECK-NEXT: [[TMP4:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* [[RETVAL_COERCE]], align 16
				// CHECK-NEXT: ret <vscale x 16 x i1> [[TMP4]]
				//
				fixed_bool_t address_of_array_idx() {
				fixed_bool_t arr[3];
				fixed_bool_t *parr;
				parr = &arr[0];
				return *parr;
				}

clang/test/CodeGen/attr-arm-sve-vector-bits-globals.c

This file was added.

				// NOTE: Assertions have been autogenerated by utils/update_cc_test_checks.py
				// REQUIRES: aarch64-registered-target
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=128 -fallow-half-arguments-and-returns -S -O1 -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-128
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=512 -fallow-half-arguments-and-returns -S -O1 -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-512

				#include <arm_sve.h>

				#define N __ARM_FEATURE_SVE_BITS_EXPERIMENTAL

				typedef svint64_t fixed_int64_t __attribute__((arm_sve_vector_bits(N)));
				typedef svbfloat16_t fixed_bfloat16_t __attribute__((arm_sve_vector_bits(N)));
				typedef svbool_t fixed_bool_t __attribute__((arm_sve_vector_bits(N)));

				fixed_int64_t global_i64;
				fixed_bfloat16_t global_bf16;
				fixed_bool_t global_bool;

				//===----------------------------------------------------------------------===//
				// WRITES
				//===----------------------------------------------------------------------===//

				// CHECK-128-LABEL: @write_global_i64(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[V_ADDR:%.*]] = alloca <vscale x 2 x i64>, align 16
				// CHECK-128-NEXT: store <vscale x 2 x i64> [[V:%.*]], <vscale x 2 x i64>* [[V_ADDR]], align 16, !tbaa !2
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <vscale x 2 x i64>* [[V_ADDR]] to <2 x i64>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <2 x i64>, <2 x i64>* [[TMP0]], align 16, !tbaa !6
				// CHECK-128-NEXT: store <2 x i64> [[TMP1]], <2 x i64>* @global_i64, align 16, !tbaa !6
				// CHECK-128-NEXT: ret void
				//
				// CHECK-512-LABEL: @write_global_i64(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[V_ADDR:%.*]] = alloca <vscale x 2 x i64>, align 16
				// CHECK-512-NEXT: store <vscale x 2 x i64> [[V:%.*]], <vscale x 2 x i64>* [[V_ADDR]], align 16, !tbaa !2
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <vscale x 2 x i64>* [[V_ADDR]] to <8 x i64>*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load <8 x i64>, <8 x i64>* [[TMP0]], align 16, !tbaa !6
				// CHECK-512-NEXT: store <8 x i64> [[TMP1]], <8 x i64>* @global_i64, align 16, !tbaa !6
				// CHECK-512-NEXT: ret void
				//
				void write_global_i64(svint64_t v) { global_i64 = v; }

				// CHECK-128-LABEL: @write_global_bf16(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[V_ADDR:%.*]] = alloca <vscale x 8 x bfloat>, align 16
				// CHECK-128-NEXT: store <vscale x 8 x bfloat> [[V:%.*]], <vscale x 8 x bfloat>* [[V_ADDR]], align 16, !tbaa !7
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <vscale x 8 x bfloat>* [[V_ADDR]] to <8 x bfloat>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <8 x bfloat>, <8 x bfloat>* [[TMP0]], align 16, !tbaa !6
				// CHECK-128-NEXT: store <8 x bfloat> [[TMP1]], <8 x bfloat>* @global_bf16, align 16, !tbaa !6
				// CHECK-128-NEXT: ret void
				//
				// CHECK-512-LABEL: @write_global_bf16(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[V_ADDR:%.*]] = alloca <vscale x 8 x bfloat>, align 16
				// CHECK-512-NEXT: store <vscale x 8 x bfloat> [[V:%.*]], <vscale x 8 x bfloat>* [[V_ADDR]], align 16, !tbaa !7
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <vscale x 8 x bfloat>* [[V_ADDR]] to <32 x bfloat>*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load <32 x bfloat>, <32 x bfloat>* [[TMP0]], align 16, !tbaa !6
				// CHECK-512-NEXT: store <32 x bfloat> [[TMP1]], <32 x bfloat>* @global_bf16, align 16, !tbaa !6
				// CHECK-512-NEXT: ret void
				//
				void write_global_bf16(svbfloat16_t v) { global_bf16 = v; }

				// CHECK-128-LABEL: @write_global_bool(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[V_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-128-NEXT: store <vscale x 16 x i1> [[V:%.*]], <vscale x 16 x i1>* [[V_ADDR]], align 16, !tbaa !9
				// CHECK-128-NEXT: [[TMP0:%.*]] = bitcast <vscale x 16 x i1>* [[V_ADDR]] to <2 x i8>*
				// CHECK-128-NEXT: [[TMP1:%.*]] = load <2 x i8>, <2 x i8>* [[TMP0]], align 16, !tbaa !6
				// CHECK-128-NEXT: store <2 x i8> [[TMP1]], <2 x i8>* @global_bool, align 2, !tbaa !6
				// CHECK-128-NEXT: ret void
				//
				// CHECK-512-LABEL: @write_global_bool(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[V_ADDR:%.*]] = alloca <vscale x 16 x i1>, align 16
				// CHECK-512-NEXT: store <vscale x 16 x i1> [[V:%.*]], <vscale x 16 x i1>* [[V_ADDR]], align 16, !tbaa !9
				// CHECK-512-NEXT: [[TMP0:%.*]] = bitcast <vscale x 16 x i1>* [[V_ADDR]] to i64*
				// CHECK-512-NEXT: [[TMP1:%.*]] = load i64, i64* [[TMP0]], align 16, !tbaa !6
				// CHECK-512-NEXT: store i64 [[TMP1]], i64* bitcast (<8 x i8>* @global_bool to i64*), align 2, !tbaa !6
				// CHECK-512-NEXT: ret void
				//
				void write_global_bool(svbool_t v) { global_bool = v; }

				//===----------------------------------------------------------------------===//
				// READS
				//===----------------------------------------------------------------------===//

				// CHECK-128-LABEL: @read_global_i64(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[TMP0:%.*]] = load <vscale x 2 x i64>, <vscale x 2 x i64>* bitcast (<2 x i64>* @global_i64 to <vscale x 2 x i64>*), align 16, !tbaa !6
				// CHECK-128-NEXT: ret <vscale x 2 x i64> [[TMP0]]
				//
				// CHECK-512-LABEL: @read_global_i64(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[TMP0:%.*]] = load <vscale x 2 x i64>, <vscale x 2 x i64>* bitcast (<8 x i64>* @global_i64 to <vscale x 2 x i64>*), align 16, !tbaa !6
				// CHECK-512-NEXT: ret <vscale x 2 x i64> [[TMP0]]
				//
				svint64_t read_global_i64() { return global_i64; }

				// CHECK-128-LABEL: @read_global_bf16(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[TMP0:%.*]] = load <vscale x 8 x bfloat>, <vscale x 8 x bfloat>* bitcast (<8 x bfloat>* @global_bf16 to <vscale x 8 x bfloat>*), align 16, !tbaa !6
				// CHECK-128-NEXT: ret <vscale x 8 x bfloat> [[TMP0]]
				//
				// CHECK-512-LABEL: @read_global_bf16(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[TMP0:%.*]] = load <vscale x 8 x bfloat>, <vscale x 8 x bfloat>* bitcast (<32 x bfloat>* @global_bf16 to <vscale x 8 x bfloat>*), align 16, !tbaa !6
				// CHECK-512-NEXT: ret <vscale x 8 x bfloat> [[TMP0]]
				//
				svbfloat16_t read_global_bf16() { return global_bf16; }

				// CHECK-128-LABEL: @read_global_bool(
				// CHECK-128-NEXT: entry:
				// CHECK-128-NEXT: [[TMP0:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* bitcast (<2 x i8>* @global_bool to <vscale x 16 x i1>*), align 2, !tbaa !6
				// CHECK-128-NEXT: ret <vscale x 16 x i1> [[TMP0]]
				//
				// CHECK-512-LABEL: @read_global_bool(
				// CHECK-512-NEXT: entry:
				// CHECK-512-NEXT: [[TMP0:%.*]] = load <vscale x 16 x i1>, <vscale x 16 x i1>* bitcast (<8 x i8>* @global_bool to <vscale x 16 x i1>*), align 2, !tbaa !6
				// CHECK-512-NEXT: ret <vscale x 16 x i1> [[TMP0]]
				//
				svbool_t read_global_bool() { return global_bool; }

clang/test/CodeGen/attr-arm-sve-vector-bits-types.c

This file was added.

				// REQUIRES: aarch64-registered-target
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=128 -fallow-half-arguments-and-returns -S -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-128
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=256 -fallow-half-arguments-and-returns -S -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-256
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=512 -fallow-half-arguments-and-returns -S -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-512
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=1024 -fallow-half-arguments-and-returns -S -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-1024
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu -target-feature +sve -target-feature +bf16 -msve-vector-bits=2048 -fallow-half-arguments-and-returns -S -emit-llvm -o - %s | FileCheck %s --check-prefix=CHECK-2048

				#include <arm_sve.h>

				#define N __ARM_FEATURE_SVE_BITS_EXPERIMENTAL

				typedef svint8_t fixed_int8_t __attribute__((arm_sve_vector_bits(N)));
				typedef svint16_t fixed_int16_t __attribute__((arm_sve_vector_bits(N)));
				typedef svint32_t fixed_int32_t __attribute__((arm_sve_vector_bits(N)));
				typedef svint64_t fixed_int64_t __attribute__((arm_sve_vector_bits(N)));

				typedef svuint8_t fixed_uint8_t __attribute__((arm_sve_vector_bits(N)));
				typedef svuint16_t fixed_uint16_t __attribute__((arm_sve_vector_bits(N)));
				typedef svuint32_t fixed_uint32_t __attribute__((arm_sve_vector_bits(N)));
				typedef svuint64_t fixed_uint64_t __attribute__((arm_sve_vector_bits(N)));

				typedef svfloat16_t fixed_float16_t __attribute__((arm_sve_vector_bits(N)));
				typedef svfloat32_t fixed_float32_t __attribute__((arm_sve_vector_bits(N)));
				typedef svfloat64_t fixed_float64_t __attribute__((arm_sve_vector_bits(N)));

				typedef svbfloat16_t fixed_bfloat16_t __attribute__((arm_sve_vector_bits(N)));

				typedef svbool_t fixed_bool_t __attribute__((arm_sve_vector_bits(N)));

				//===----------------------------------------------------------------------===//
				// Structs and unions
				//===----------------------------------------------------------------------===//
				#define DEFINE_STRUCT(ty) \
				struct struct_##ty { \
				fixed_##ty##_t x; \
				} struct_##ty;

				#define DEFINE_UNION(ty) \
				union union_##ty { \
				fixed_##ty##_t x; \
				} union_##ty;

				DEFINE_STRUCT(int8)
				DEFINE_STRUCT(int16)
				DEFINE_STRUCT(int32)
				DEFINE_STRUCT(int64)
				DEFINE_STRUCT(uint8)
				DEFINE_STRUCT(uint16)
				DEFINE_STRUCT(uint32)
				DEFINE_STRUCT(uint64)
				DEFINE_STRUCT(float16)
				DEFINE_STRUCT(float32)
				DEFINE_STRUCT(float64)
				DEFINE_STRUCT(bfloat16)
				DEFINE_STRUCT(bool)

				DEFINE_UNION(int8)
				DEFINE_UNION(int16)
				DEFINE_UNION(int32)
				DEFINE_UNION(int64)
				DEFINE_UNION(uint8)
				DEFINE_UNION(uint16)
				DEFINE_UNION(uint32)
				DEFINE_UNION(uint64)
				DEFINE_UNION(float16)
				DEFINE_UNION(float32)
				DEFINE_UNION(float64)
				DEFINE_UNION(bfloat16)
				DEFINE_UNION(bool)

				//===----------------------------------------------------------------------===//
				// Global variables
				//===----------------------------------------------------------------------===//
				fixed_int8_t global_i8;
				fixed_int16_t global_i16;
				fixed_int32_t global_i32;
				fixed_int64_t global_i64;

				fixed_uint8_t global_u8;
				fixed_uint16_t global_u16;
				fixed_uint32_t global_u32;
				fixed_uint64_t global_u64;

				fixed_float16_t global_f16;
				fixed_float32_t global_f32;
				fixed_float64_t global_f64;

				fixed_bfloat16_t global_bf16;

				fixed_bool_t global_bool;

				//===----------------------------------------------------------------------===//
				// Global arrays
				//===----------------------------------------------------------------------===//
				fixed_int8_t global_arr_i8[3];
				fixed_int16_t global_arr_i16[3];
				fixed_int32_t global_arr_i32[3];
				fixed_int64_t global_arr_i64[3];

				fixed_uint8_t global_arr_u8[3];
				fixed_uint16_t global_arr_u16[3];
				fixed_uint32_t global_arr_u32[3];
				fixed_uint64_t global_arr_u64[3];

				fixed_float16_t global_arr_f16[3];
				fixed_float32_t global_arr_f32[3];
				fixed_float64_t global_arr_f64[3];

				fixed_bfloat16_t global_arr_bf16[3];

				fixed_bool_t global_arr_bool[3];

				//===----------------------------------------------------------------------===//
				// Locals
				//===----------------------------------------------------------------------===//
				void f() {
				// Variables
				fixed_int8_t local_i8;
				fixed_int16_t local_i16;
				fixed_int32_t local_i32;
				fixed_int64_t local_i64;
				fixed_uint8_t local_u8;
				fixed_uint16_t local_u16;
				fixed_uint32_t local_u32;
				fixed_uint64_t local_u64;
				fixed_float16_t local_f16;
				fixed_float32_t local_f32;
				fixed_float64_t local_f64;
				fixed_bfloat16_t local_bf16;
				fixed_bool_t local_bool;

				// Arrays
				fixed_int8_t local_arr_i8[3];
				fixed_int16_t local_arr_i16[3];
				fixed_int32_t local_arr_i32[3];
				fixed_int64_t local_arr_i64[3];
				fixed_uint8_t local_arr_u8[3];
				fixed_uint16_t local_arr_u16[3];
				fixed_uint32_t local_arr_u32[3];
				fixed_uint64_t local_arr_u64[3];
				fixed_float16_t local_arr_f16[3];
				fixed_float32_t local_arr_f32[3];
				fixed_float64_t local_arr_f64[3];
				fixed_bfloat16_t local_arr_bf16[3];
				fixed_bool_t local_arr_bool[3];
				}

				//===----------------------------------------------------------------------===//
				// Structs and unions
				//===----------------------------------------------------------------------===//
				// CHECK-128: %struct.struct_int8 = type { <16 x i8> }
				// CHECK-128-NEXT: %struct.struct_int16 = type { <8 x i16> }
				// CHECK-128-NEXT: %struct.struct_int32 = type { <4 x i32> }
				// CHECK-128-NEXT: %struct.struct_int64 = type { <2 x i64> }
				// CHECK-128-NEXT: %struct.struct_uint8 = type { <16 x i8> }
				// CHECK-128-NEXT: %struct.struct_uint16 = type { <8 x i16> }
				// CHECK-128-NEXT: %struct.struct_uint32 = type { <4 x i32> }
				// CHECK-128-NEXT: %struct.struct_uint64 = type { <2 x i64> }
				// CHECK-128-NEXT: %struct.struct_float16 = type { <8 x half> }
				// CHECK-128-NEXT: %struct.struct_float32 = type { <4 x float> }
				// CHECK-128-NEXT: %struct.struct_float64 = type { <2 x double> }
				// CHECK-128-NEXT: %struct.struct_bfloat16 = type { <8 x bfloat> }
				// CHECK-128-NEXT: %struct.struct_bool = type { <2 x i8> }

				// CHECK-256: %struct.struct_int8 = type { <32 x i8> }
				// CHECK-256-NEXT: %struct.struct_int16 = type { <16 x i16> }
				// CHECK-256-NEXT: %struct.struct_int32 = type { <8 x i32> }
				// CHECK-256-NEXT: %struct.struct_int64 = type { <4 x i64> }
				// CHECK-256-NEXT: %struct.struct_uint8 = type { <32 x i8> }
				// CHECK-256-NEXT: %struct.struct_uint16 = type { <16 x i16> }
				// CHECK-256-NEXT: %struct.struct_uint32 = type { <8 x i32> }
				// CHECK-256-NEXT: %struct.struct_uint64 = type { <4 x i64> }
				// CHECK-256-NEXT: %struct.struct_float16 = type { <16 x half> }
				// CHECK-256-NEXT: %struct.struct_float32 = type { <8 x float> }
				// CHECK-256-NEXT: %struct.struct_float64 = type { <4 x double> }
				// CHECK-256-NEXT: %struct.struct_bfloat16 = type { <16 x bfloat> }
				// CHECK-256-NEXT: %struct.struct_bool = type { <4 x i8> }

				// CHECK-512: %struct.struct_int8 = type { <64 x i8> }
				// CHECK-512-NEXT: %struct.struct_int16 = type { <32 x i16> }
				// CHECK-512-NEXT: %struct.struct_int32 = type { <16 x i32> }
				// CHECK-512-NEXT: %struct.struct_int64 = type { <8 x i64> }
				// CHECK-512-NEXT: %struct.struct_uint8 = type { <64 x i8> }
				// CHECK-512-NEXT: %struct.struct_uint16 = type { <32 x i16> }
				// CHECK-512-NEXT: %struct.struct_uint32 = type { <16 x i32> }
				// CHECK-512-NEXT: %struct.struct_uint64 = type { <8 x i64> }
				// CHECK-512-NEXT: %struct.struct_float16 = type { <32 x half> }
				// CHECK-512-NEXT: %struct.struct_float32 = type { <16 x float> }
				// CHECK-512-NEXT: %struct.struct_float64 = type { <8 x double> }
				// CHECK-512-NEXT: %struct.struct_bfloat16 = type { <32 x bfloat> }
				// CHECK-512-NEXT: %struct.struct_bool = type { <8 x i8> }

				// CHECK-1024: %struct.struct_int8 = type { <128 x i8> }
				// CHECK-1024-NEXT: %struct.struct_int16 = type { <64 x i16> }
				// CHECK-1024-NEXT: %struct.struct_int32 = type { <32 x i32> }
				// CHECK-1024-NEXT: %struct.struct_int64 = type { <16 x i64> }
				// CHECK-1024-NEXT: %struct.struct_uint8 = type { <128 x i8> }
				// CHECK-1024-NEXT: %struct.struct_uint16 = type { <64 x i16> }
				// CHECK-1024-NEXT: %struct.struct_uint32 = type { <32 x i32> }
				// CHECK-1024-NEXT: %struct.struct_uint64 = type { <16 x i64> }
				// CHECK-1024-NEXT: %struct.struct_float16 = type { <64 x half> }
				// CHECK-1024-NEXT: %struct.struct_float32 = type { <32 x float> }
				// CHECK-1024-NEXT: %struct.struct_float64 = type { <16 x double> }
				// CHECK-1024-NEXT: %struct.struct_bfloat16 = type { <64 x bfloat> }
				// CHECK-1024-NEXT: %struct.struct_bool = type { <16 x i8> }

				// CHECK-2048: %struct.struct_int8 = type { <256 x i8> }
				// CHECK-2048-NEXT: %struct.struct_int16 = type { <128 x i16> }
				// CHECK-2048-NEXT: %struct.struct_int32 = type { <64 x i32> }
				// CHECK-2048-NEXT: %struct.struct_int64 = type { <32 x i64> }
				// CHECK-2048-NEXT: %struct.struct_uint8 = type { <256 x i8> }
				// CHECK-2048-NEXT: %struct.struct_uint16 = type { <128 x i16> }
				// CHECK-2048-NEXT: %struct.struct_uint32 = type { <64 x i32> }
				// CHECK-2048-NEXT: %struct.struct_uint64 = type { <32 x i64> }
				// CHECK-2048-NEXT: %struct.struct_float16 = type { <128 x half> }
				// CHECK-2048-NEXT: %struct.struct_float32 = type { <64 x float> }
				// CHECK-2048-NEXT: %struct.struct_float64 = type { <32 x double> }
				// CHECK-2048-NEXT: %struct.struct_bfloat16 = type { <128 x bfloat> }
				// CHECK-2048-NEXT: %struct.struct_bool = type { <32 x i8> }

				// CHECK-128: %union.union_int8 = type { <16 x i8> }
				// CHECK-128-NEXT: %union.union_int16 = type { <8 x i16> }
				// CHECK-128-NEXT: %union.union_int32 = type { <4 x i32> }
				// CHECK-128-NEXT: %union.union_int64 = type { <2 x i64> }
				// CHECK-128-NEXT: %union.union_uint8 = type { <16 x i8> }
				// CHECK-128-NEXT: %union.union_uint16 = type { <8 x i16> }
				// CHECK-128-NEXT: %union.union_uint32 = type { <4 x i32> }
				// CHECK-128-NEXT: %union.union_uint64 = type { <2 x i64> }
				// CHECK-128-NEXT: %union.union_float16 = type { <8 x half> }
				// CHECK-128-NEXT: %union.union_float32 = type { <4 x float> }
				// CHECK-128-NEXT: %union.union_float64 = type { <2 x double> }
				// CHECK-128-NEXT: %union.union_bfloat16 = type { <8 x bfloat> }
				// CHECK-128-NEXT: %union.union_bool = type { <2 x i8> }

				// CHECK-256: %union.union_int8 = type { <32 x i8> }
				// CHECK-256-NEXT: %union.union_int16 = type { <16 x i16> }
				// CHECK-256-NEXT: %union.union_int32 = type { <8 x i32> }
				// CHECK-256-NEXT: %union.union_int64 = type { <4 x i64> }
				// CHECK-256-NEXT: %union.union_uint8 = type { <32 x i8> }
				// CHECK-256-NEXT: %union.union_uint16 = type { <16 x i16> }
				// CHECK-256-NEXT: %union.union_uint32 = type { <8 x i32> }
				// CHECK-256-NEXT: %union.union_uint64 = type { <4 x i64> }
				// CHECK-256-NEXT: %union.union_float16 = type { <16 x half> }
				// CHECK-256-NEXT: %union.union_float32 = type { <8 x float> }
				// CHECK-256-NEXT: %union.union_float64 = type { <4 x double> }
				// CHECK-256-NEXT: %union.union_bfloat16 = type { <16 x bfloat> }
				// CHECK-256-NEXT: %union.union_bool = type { <4 x i8> }

				// CHECK-512: %union.union_int8 = type { <64 x i8> }
				// CHECK-512-NEXT: %union.union_int16 = type { <32 x i16> }
				// CHECK-512-NEXT: %union.union_int32 = type { <16 x i32> }
				// CHECK-512-NEXT: %union.union_int64 = type { <8 x i64> }
				// CHECK-512-NEXT: %union.union_uint8 = type { <64 x i8> }
				// CHECK-512-NEXT: %union.union_uint16 = type { <32 x i16> }
				// CHECK-512-NEXT: %union.union_uint32 = type { <16 x i32> }
				// CHECK-512-NEXT: %union.union_uint64 = type { <8 x i64> }
				// CHECK-512-NEXT: %union.union_float16 = type { <32 x half> }
				// CHECK-512-NEXT: %union.union_float32 = type { <16 x float> }
				// CHECK-512-NEXT: %union.union_float64 = type { <8 x double> }
				// CHECK-512-NEXT: %union.union_bfloat16 = type { <32 x bfloat> }
				// CHECK-512-NEXT: %union.union_bool = type { <8 x i8> }

				// CHECK-1024: %union.union_int8 = type { <128 x i8> }
				// CHECK-1024-NEXT: %union.union_int16 = type { <64 x i16> }
				// CHECK-1024-NEXT: %union.union_int32 = type { <32 x i32> }
				// CHECK-1024-NEXT: %union.union_int64 = type { <16 x i64> }
				// CHECK-1024-NEXT: %union.union_uint8 = type { <128 x i8> }
				// CHECK-1024-NEXT: %union.union_uint16 = type { <64 x i16> }
				// CHECK-1024-NEXT: %union.union_uint32 = type { <32 x i32> }
				// CHECK-1024-NEXT: %union.union_uint64 = type { <16 x i64> }
				// CHECK-1024-NEXT: %union.union_float16 = type { <64 x half> }
				// CHECK-1024-NEXT: %union.union_float32 = type { <32 x float> }
				// CHECK-1024-NEXT: %union.union_float64 = type { <16 x double> }
				// CHECK-1024-NEXT: %union.union_bfloat16 = type { <64 x bfloat> }
				// CHECK-1024-NEXT: %union.union_bool = type { <16 x i8> }

				// CHECK-2048: %union.union_int8 = type { <256 x i8> }
				// CHECK-2048-NEXT: %union.union_int16 = type { <128 x i16> }
				// CHECK-2048-NEXT: %union.union_int32 = type { <64 x i32> }
				// CHECK-2048-NEXT: %union.union_int64 = type { <32 x i64> }
				// CHECK-2048-NEXT: %union.union_uint8 = type { <256 x i8> }
				// CHECK-2048-NEXT: %union.union_uint16 = type { <128 x i16> }
				// CHECK-2048-NEXT: %union.union_uint32 = type { <64 x i32> }
				// CHECK-2048-NEXT: %union.union_uint64 = type { <32 x i64> }
				// CHECK-2048-NEXT: %union.union_float16 = type { <128 x half> }
				// CHECK-2048-NEXT: %union.union_float32 = type { <64 x float> }
				// CHECK-2048-NEXT: %union.union_float64 = type { <32 x double> }
				// CHECK-2048-NEXT: %union.union_bfloat16 = type { <128 x bfloat> }
				// CHECK-2048-NEXT: %union.union_bool = type { <32 x i8> }

				//===----------------------------------------------------------------------===//
				// Global variables
				//===----------------------------------------------------------------------===//
				// CHECK-128: @global_i8 = global <16 x i8> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_i16 = global <8 x i16> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_i32 = global <4 x i32> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_i64 = global <2 x i64> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_u8 = global <16 x i8> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_u16 = global <8 x i16> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_u32 = global <4 x i32> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_u64 = global <2 x i64> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_f16 = global <8 x half> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_f32 = global <4 x float> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_f64 = global <2 x double> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_bf16 = global <8 x bfloat> zeroinitializer, align 16
				// CHECK-128-NEXT: @global_bool = global <2 x i8> zeroinitializer, align 2

				// CHECK-256: @global_i8 = global <32 x i8> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_i16 = global <16 x i16> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_i32 = global <8 x i32> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_i64 = global <4 x i64> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_u8 = global <32 x i8> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_u16 = global <16 x i16> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_u32 = global <8 x i32> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_u64 = global <4 x i64> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_f16 = global <16 x half> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_f32 = global <8 x float> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_f64 = global <4 x double> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_bf16 = global <16 x bfloat> zeroinitializer, align 16
				// CHECK-256-NEXT: @global_bool = global <4 x i8> zeroinitializer, align 2

				// CHECK-512: @global_i8 = global <64 x i8> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_i16 = global <32 x i16> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_i32 = global <16 x i32> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_i64 = global <8 x i64> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_u8 = global <64 x i8> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_u16 = global <32 x i16> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_u32 = global <16 x i32> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_u64 = global <8 x i64> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_f16 = global <32 x half> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_f32 = global <16 x float> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_f64 = global <8 x double> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_bf16 = global <32 x bfloat> zeroinitializer, align 16
				// CHECK-512-NEXT: @global_bool = global <8 x i8> zeroinitializer, align 2

				// CHECK-1024: @global_i8 = global <128 x i8> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_i16 = global <64 x i16> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_i32 = global <32 x i32> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_i64 = global <16 x i64> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_u8 = global <128 x i8> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_u16 = global <64 x i16> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_u32 = global <32 x i32> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_u64 = global <16 x i64> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_f16 = global <64 x half> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_f32 = global <32 x float> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_f64 = global <16 x double> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_bf16 = global <64 x bfloat> zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_bool = global <16 x i8> zeroinitializer, align 2

				// CHECK-2048: @global_i8 = global <256 x i8> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_i16 = global <128 x i16> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_i32 = global <64 x i32> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_i64 = global <32 x i64> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_u8 = global <256 x i8> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_u16 = global <128 x i16> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_u32 = global <64 x i32> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_u64 = global <32 x i64> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_f16 = global <128 x half> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_f32 = global <64 x float> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_f64 = global <32 x double> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_bf16 = global <128 x bfloat> zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_bool = global <32 x i8> zeroinitializer, align 2

				//===----------------------------------------------------------------------===//
				// Global arrays
				//===----------------------------------------------------------------------===//
				// CHECK-128: @global_arr_i8 = global [3 x <16 x i8>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_i16 = global [3 x <8 x i16>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_i32 = global [3 x <4 x i32>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_i64 = global [3 x <2 x i64>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_u8 = global [3 x <16 x i8>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_u16 = global [3 x <8 x i16>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_u32 = global [3 x <4 x i32>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_u64 = global [3 x <2 x i64>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_f16 = global [3 x <8 x half>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_f32 = global [3 x <4 x float>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_f64 = global [3 x <2 x double>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_bf16 = global [3 x <8 x bfloat>] zeroinitializer, align 16
				// CHECK-128-NEXT: @global_arr_bool = global [3 x <2 x i8>] zeroinitializer, align 2

				// CHECK-256: @global_arr_i8 = global [3 x <32 x i8>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_i16 = global [3 x <16 x i16>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_i32 = global [3 x <8 x i32>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_i64 = global [3 x <4 x i64>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_u8 = global [3 x <32 x i8>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_u16 = global [3 x <16 x i16>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_u32 = global [3 x <8 x i32>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_u64 = global [3 x <4 x i64>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_f16 = global [3 x <16 x half>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_f32 = global [3 x <8 x float>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_f64 = global [3 x <4 x double>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_bf16 = global [3 x <16 x bfloat>] zeroinitializer, align 16
				// CHECK-256-NEXT: @global_arr_bool = global [3 x <4 x i8>] zeroinitializer, align 2

				// CHECK-512: @global_arr_i8 = global [3 x <64 x i8>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_i16 = global [3 x <32 x i16>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_i32 = global [3 x <16 x i32>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_i64 = global [3 x <8 x i64>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_u8 = global [3 x <64 x i8>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_u16 = global [3 x <32 x i16>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_u32 = global [3 x <16 x i32>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_u64 = global [3 x <8 x i64>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_f16 = global [3 x <32 x half>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_f32 = global [3 x <16 x float>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_f64 = global [3 x <8 x double>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_bf16 = global [3 x <32 x bfloat>] zeroinitializer, align 16
				// CHECK-512-NEXT: @global_arr_bool = global [3 x <8 x i8>] zeroinitializer, align 2

				// CHECK-1024: @global_arr_i8 = global [3 x <128 x i8>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_i16 = global [3 x <64 x i16>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_i32 = global [3 x <32 x i32>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_i64 = global [3 x <16 x i64>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_u8 = global [3 x <128 x i8>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_u16 = global [3 x <64 x i16>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_u32 = global [3 x <32 x i32>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_u64 = global [3 x <16 x i64>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_f16 = global [3 x <64 x half>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_f32 = global [3 x <32 x float>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_f64 = global [3 x <16 x double>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_bf16 = global [3 x <64 x bfloat>] zeroinitializer, align 16
				// CHECK-1024-NEXT: @global_arr_bool = global [3 x <16 x i8>] zeroinitializer, align 2

				// CHECK-2048: @global_arr_i8 = global [3 x <256 x i8>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_i16 = global [3 x <128 x i16>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_i32 = global [3 x <64 x i32>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_i64 = global [3 x <32 x i64>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_u8 = global [3 x <256 x i8>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_u16 = global [3 x <128 x i16>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_u32 = global [3 x <64 x i32>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_u64 = global [3 x <32 x i64>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_f16 = global [3 x <128 x half>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_f32 = global [3 x <64 x float>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_f64 = global [3 x <32 x double>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_bf16 = global [3 x <128 x bfloat>] zeroinitializer, align 16
				// CHECK-2048-NEXT: @global_arr_bool = global [3 x <32 x i8>] zeroinitializer, align 2

				//===----------------------------------------------------------------------===//
				// Local variables
				//===----------------------------------------------------------------------===//
				// CHECK-128: %local_i8 = alloca <16 x i8>, align 16
				// CHECK-128-NEXT: %local_i16 = alloca <8 x i16>, align 16
				// CHECK-128-NEXT: %local_i32 = alloca <4 x i32>, align 16
				// CHECK-128-NEXT: %local_i64 = alloca <2 x i64>, align 16
				// CHECK-128-NEXT: %local_u8 = alloca <16 x i8>, align 16
				// CHECK-128-NEXT: %local_u16 = alloca <8 x i16>, align 16
				// CHECK-128-NEXT: %local_u32 = alloca <4 x i32>, align 16
				// CHECK-128-NEXT: %local_u64 = alloca <2 x i64>, align 16
				// CHECK-128-NEXT: %local_f16 = alloca <8 x half>, align 16
				// CHECK-128-NEXT: %local_f32 = alloca <4 x float>, align 16
				// CHECK-128-NEXT: %local_f64 = alloca <2 x double>, align 16
				// CHECK-128-NEXT: %local_bf16 = alloca <8 x bfloat>, align 16
				// CHECK-128-NEXT: %local_bool = alloca <2 x i8>, align 2

				// CHECK-256: %local_i8 = alloca <32 x i8>, align 16
				// CHECK-256-NEXT: %local_i16 = alloca <16 x i16>, align 16
				// CHECK-256-NEXT: %local_i32 = alloca <8 x i32>, align 16
				// CHECK-256-NEXT: %local_i64 = alloca <4 x i64>, align 16
				// CHECK-256-NEXT: %local_u8 = alloca <32 x i8>, align 16
				// CHECK-256-NEXT: %local_u16 = alloca <16 x i16>, align 16
				// CHECK-256-NEXT: %local_u32 = alloca <8 x i32>, align 16
				// CHECK-256-NEXT: %local_u64 = alloca <4 x i64>, align 16
				// CHECK-256-NEXT: %local_f16 = alloca <16 x half>, align 16
				// CHECK-256-NEXT: %local_f32 = alloca <8 x float>, align 16
				// CHECK-256-NEXT: %local_f64 = alloca <4 x double>, align 16
				// CHECK-256-NEXT: %local_bf16 = alloca <16 x bfloat>, align 16
				// CHECK-256-NEXT: %local_bool = alloca <4 x i8>, align 2

				// CHECK-512: %local_i8 = alloca <64 x i8>, align 16
				// CHECK-512-NEXT: %local_i16 = alloca <32 x i16>, align 16
				// CHECK-512-NEXT: %local_i32 = alloca <16 x i32>, align 16
				// CHECK-512-NEXT: %local_i64 = alloca <8 x i64>, align 16
				// CHECK-512-NEXT: %local_u8 = alloca <64 x i8>, align 16
				// CHECK-512-NEXT: %local_u16 = alloca <32 x i16>, align 16
				// CHECK-512-NEXT: %local_u32 = alloca <16 x i32>, align 16
				// CHECK-512-NEXT: %local_u64 = alloca <8 x i64>, align 16
				// CHECK-512-NEXT: %local_f16 = alloca <32 x half>, align 16
				// CHECK-512-NEXT: %local_f32 = alloca <16 x float>, align 16
				// CHECK-512-NEXT: %local_f64 = alloca <8 x double>, align 16
				// CHECK-512-NEXT: %local_bf16 = alloca <32 x bfloat>, align 16
				// CHECK-512-NEXT: %local_bool = alloca <8 x i8>, align 2

				// CHECK-1024: %local_i8 = alloca <128 x i8>, align 16
				// CHECK-1024-NEXT: %local_i16 = alloca <64 x i16>, align 16
				// CHECK-1024-NEXT: %local_i32 = alloca <32 x i32>, align 16
				// CHECK-1024-NEXT: %local_i64 = alloca <16 x i64>, align 16
				// CHECK-1024-NEXT: %local_u8 = alloca <128 x i8>, align 16
				// CHECK-1024-NEXT: %local_u16 = alloca <64 x i16>, align 16
				// CHECK-1024-NEXT: %local_u32 = alloca <32 x i32>, align 16
				// CHECK-1024-NEXT: %local_u64 = alloca <16 x i64>, align 16
				// CHECK-1024-NEXT: %local_f16 = alloca <64 x half>, align 16
				// CHECK-1024-NEXT: %local_f32 = alloca <32 x float>, align 16
				// CHECK-1024-NEXT: %local_f64 = alloca <16 x double>, align 16
				// CHECK-1024-NEXT: %local_bf16 = alloca <64 x bfloat>, align 16
				// CHECK-1024-NEXT: %local_bool = alloca <16 x i8>, align 2

				// CHECK-2048: %local_i8 = alloca <256 x i8>, align 16
				// CHECK-2048-NEXT: %local_i16 = alloca <128 x i16>, align 16
				// CHECK-2048-NEXT: %local_i32 = alloca <64 x i32>, align 16
				// CHECK-2048-NEXT: %local_i64 = alloca <32 x i64>, align 16
				// CHECK-2048-NEXT: %local_u8 = alloca <256 x i8>, align 16
				// CHECK-2048-NEXT: %local_u16 = alloca <128 x i16>, align 16
				// CHECK-2048-NEXT: %local_u32 = alloca <64 x i32>, align 16
				// CHECK-2048-NEXT: %local_u64 = alloca <32 x i64>, align 16
				// CHECK-2048-NEXT: %local_f16 = alloca <128 x half>, align 16
				// CHECK-2048-NEXT: %local_f32 = alloca <64 x float>, align 16
				// CHECK-2048-NEXT: %local_f64 = alloca <32 x double>, align 16
				// CHECK-2048-NEXT: %local_bf16 = alloca <128 x bfloat>, align 16
				// CHECK-2048-NEXT: %local_bool = alloca <32 x i8>, align 2

				//===----------------------------------------------------------------------===//
				// Local arrays
				//===----------------------------------------------------------------------===//
				// CHECK-128: %local_arr_i8 = alloca [3 x <16 x i8>], align 16
				// CHECK-128-NEXT: %local_arr_i16 = alloca [3 x <8 x i16>], align 16
				// CHECK-128-NEXT: %local_arr_i32 = alloca [3 x <4 x i32>], align 16
				// CHECK-128-NEXT: %local_arr_i64 = alloca [3 x <2 x i64>], align 16
				// CHECK-128-NEXT: %local_arr_u8 = alloca [3 x <16 x i8>], align 16
				// CHECK-128-NEXT: %local_arr_u16 = alloca [3 x <8 x i16>], align 16
				// CHECK-128-NEXT: %local_arr_u32 = alloca [3 x <4 x i32>], align 16
				// CHECK-128-NEXT: %local_arr_u64 = alloca [3 x <2 x i64>], align 16
				// CHECK-128-NEXT: %local_arr_f16 = alloca [3 x <8 x half>], align 16
				// CHECK-128-NEXT: %local_arr_f32 = alloca [3 x <4 x float>], align 16
				// CHECK-128-NEXT: %local_arr_f64 = alloca [3 x <2 x double>], align 16
				// CHECK-128-NEXT: %local_arr_bf16 = alloca [3 x <8 x bfloat>], align 16
				// CHECK-128-NEXT: %local_arr_bool = alloca [3 x <2 x i8>], align 2

				// CHECK-256: %local_arr_i8 = alloca [3 x <32 x i8>], align 16
				// CHECK-256-NEXT: %local_arr_i16 = alloca [3 x <16 x i16>], align 16
				// CHECK-256-NEXT: %local_arr_i32 = alloca [3 x <8 x i32>], align 16
				// CHECK-256-NEXT: %local_arr_i64 = alloca [3 x <4 x i64>], align 16
				// CHECK-256-NEXT: %local_arr_u8 = alloca [3 x <32 x i8>], align 16
				// CHECK-256-NEXT: %local_arr_u16 = alloca [3 x <16 x i16>], align 16
				// CHECK-256-NEXT: %local_arr_u32 = alloca [3 x <8 x i32>], align 16
				// CHECK-256-NEXT: %local_arr_u64 = alloca [3 x <4 x i64>], align 16
				// CHECK-256-NEXT: %local_arr_f16 = alloca [3 x <16 x half>], align 16
				// CHECK-256-NEXT: %local_arr_f32 = alloca [3 x <8 x float>], align 16
				// CHECK-256-NEXT: %local_arr_f64 = alloca [3 x <4 x double>], align 16
				// CHECK-256-NEXT: %local_arr_bf16 = alloca [3 x <16 x bfloat>], align 16
				// CHECK-256-NEXT: %local_arr_bool = alloca [3 x <4 x i8>], align 2

				// CHECK-512: %local_arr_i8 = alloca [3 x <64 x i8>], align 16
				// CHECK-512-NEXT: %local_arr_i16 = alloca [3 x <32 x i16>], align 16
				// CHECK-512-NEXT: %local_arr_i32 = alloca [3 x <16 x i32>], align 16
				// CHECK-512-NEXT: %local_arr_i64 = alloca [3 x <8 x i64>], align 16
				// CHECK-512-NEXT: %local_arr_u8 = alloca [3 x <64 x i8>], align 16
				// CHECK-512-NEXT: %local_arr_u16 = alloca [3 x <32 x i16>], align 16
				// CHECK-512-NEXT: %local_arr_u32 = alloca [3 x <16 x i32>], align 16
				// CHECK-512-NEXT: %local_arr_u64 = alloca [3 x <8 x i64>], align 16
				// CHECK-512-NEXT: %local_arr_f16 = alloca [3 x <32 x half>], align 16
				// CHECK-512-NEXT: %local_arr_f32 = alloca [3 x <16 x float>], align 16
				// CHECK-512-NEXT: %local_arr_f64 = alloca [3 x <8 x double>], align 16
				// CHECK-512-NEXT: %local_arr_bf16 = alloca [3 x <32 x bfloat>], align 16
				// CHECK-512-NEXT: %local_arr_bool = alloca [3 x <8 x i8>], align 2

				// CHECK-1024: %local_arr_i8 = alloca [3 x <128 x i8>], align 16
				// CHECK-1024-NEXT: %local_arr_i16 = alloca [3 x <64 x i16>], align 16
				// CHECK-1024-NEXT: %local_arr_i32 = alloca [3 x <32 x i32>], align 16
				// CHECK-1024-NEXT: %local_arr_i64 = alloca [3 x <16 x i64>], align 16
				// CHECK-1024-NEXT: %local_arr_u8 = alloca [3 x <128 x i8>], align 16
				// CHECK-1024-NEXT: %local_arr_u16 = alloca [3 x <64 x i16>], align 16
				// CHECK-1024-NEXT: %local_arr_u32 = alloca [3 x <32 x i32>], align 16
				// CHECK-1024-NEXT: %local_arr_u64 = alloca [3 x <16 x i64>], align 16
				// CHECK-1024-NEXT: %local_arr_f16 = alloca [3 x <64 x half>], align 16
				// CHECK-1024-NEXT: %local_arr_f32 = alloca [3 x <32 x float>], align 16
				// CHECK-1024-NEXT: %local_arr_f64 = alloca [3 x <16 x double>], align 16
				// CHECK-1024-NEXT: %local_arr_bf16 = alloca [3 x <64 x bfloat>], align 16
				// CHECK-1024-NEXT: %local_arr_bool = alloca [3 x <16 x i8>], align 2

				// CHECK-2048: %local_arr_i8 = alloca [3 x <256 x i8>], align 16
				// CHECK-2048-NEXT: %local_arr_i16 = alloca [3 x <128 x i16>], align 16
				// CHECK-2048-NEXT: %local_arr_i32 = alloca [3 x <64 x i32>], align 16
				// CHECK-2048-NEXT: %local_arr_i64 = alloca [3 x <32 x i64>], align 16
				// CHECK-2048-NEXT: %local_arr_u8 = alloca [3 x <256 x i8>], align 16
				// CHECK-2048-NEXT: %local_arr_u16 = alloca [3 x <128 x i16>], align 16
				// CHECK-2048-NEXT: %local_arr_u32 = alloca [3 x <64 x i32>], align 16
				// CHECK-2048-NEXT: %local_arr_u64 = alloca [3 x <32 x i64>], align 16
				// CHECK-2048-NEXT: %local_arr_f16 = alloca [3 x <128 x half>], align 16
				// CHECK-2048-NEXT: %local_arr_f32 = alloca [3 x <64 x float>], align 16
				// CHECK-2048-NEXT: %local_arr_f64 = alloca [3 x <32 x double>], align 16
				// CHECK-2048-NEXT: %local_arr_bf16 = alloca [3 x <128 x bfloat>], align 16
				// CHECK-2048-NEXT: %local_arr_bool = alloca [3 x <32 x i8>], align 2
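
The lane counts in the checks above follow a simple pattern: a data vector has N / element-width lanes, and a predicate is stored as N / 64 lanes of i8. A minimal sketch of that arithmetic, assuming hypothetical helper names (`dataLanes`, `predicateLanes`) that are not part of the patch:

```cpp
#include <cassert>

// Hypothetical helpers, not part of the patch: they only restate the
// arithmetic implied by the CHECK lines for -msve-vector-bits=N.

// Lanes of a fixed-length data vector: N bits split into equal elements.
constexpr unsigned dataLanes(unsigned vectorBits, unsigned elementBits) {
  return vectorBits / elementBits;
}

// Lanes of the i8 vector that holds a predicate: N / 64.
constexpr unsigned predicateLanes(unsigned vectorBits) {
  return vectorBits / 64;
}

// Spot checks against values that appear in the test expectations.
static_assert(dataLanes(128, 8) == 16, "<16 x i8> at 128 bits");
static_assert(dataLanes(2048, 64) == 32, "<32 x i64> at 2048 bits");
static_assert(predicateLanes(128) == 2, "<2 x i8> at 128 bits");
static_assert(predicateLanes(2048) == 32, "<32 x i8> at 2048 bits");

// Runtime version of the same checks for the 512-bit row.
inline void checkLaneCounts512() {
  assert(dataLanes(512, 8) == 64);
  assert(dataLanes(512, 16) == 32);
  assert(dataLanes(512, 32) == 16);
  assert(dataLanes(512, 64) == 8);
  assert(predicateLanes(512) == 8);
}
```

Note that the alignment in the checks does not scale with N: data vectors stay at `align 16` and predicates at `align 2` for every vector length shown.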

clang/test/CodeGenCXX/aarch64-sve-fixedtypeinfo.cpp

This file was added.

				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu %s -emit-llvm -o - \
				// RUN: -target-feature +sve -target-feature +bf16 \
				// RUN: -D__ARM_FEATURE_SVE -msve-vector-bits=128 | FileCheck %s
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu %s -emit-llvm -o - \
				// RUN: -target-feature +sve -target-feature +bf16 \
				// RUN: -D__ARM_FEATURE_SVE -msve-vector-bits=256 | FileCheck %s
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu %s -emit-llvm -o - \
				// RUN: -target-feature +sve -target-feature +bf16 \
				// RUN: -D__ARM_FEATURE_SVE -msve-vector-bits=512 | FileCheck %s
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu %s -emit-llvm -o - \
				// RUN: -target-feature +sve -target-feature +bf16 \
				// RUN: -D__ARM_FEATURE_SVE -msve-vector-bits=1024 | FileCheck %s
				// RUN: %clang_cc1 -triple aarch64-none-linux-gnu %s -emit-llvm -o - \
				// RUN: -target-feature +sve -target-feature +bf16 \
				// RUN: -D__ARM_FEATURE_SVE -msve-vector-bits=2048 | FileCheck %s

				// This test verifies fixed-length vectors defined with the
				// 'arm_sve_vector_bits' attribute map to the same AAPCS64 ABI type as the
				// sizeless variants.

				#define N __ARM_FEATURE_SVE_BITS_EXPERIMENTAL

				namespace std {
				class type_info;
				};

				typedef __SVInt8_t fixed_int8_t __attribute__((arm_sve_vector_bits(N)));
				typedef __SVInt16_t fixed_int16_t __attribute__((arm_sve_vector_bits(N)));
				typedef __SVInt32_t fixed_int32_t __attribute__((arm_sve_vector_bits(N)));
				typedef __SVInt64_t fixed_int64_t __attribute__((arm_sve_vector_bits(N)));

				typedef __SVUint8_t fixed_uint8_t __attribute__((arm_sve_vector_bits(N)));
				typedef __SVUint16_t fixed_uint16_t __attribute__((arm_sve_vector_bits(N)));
				typedef __SVUint32_t fixed_uint32_t __attribute__((arm_sve_vector_bits(N)));
				typedef __SVUint64_t fixed_uint64_t __attribute__((arm_sve_vector_bits(N)));

				typedef __SVFloat16_t fixed_float16_t __attribute__((arm_sve_vector_bits(N)));
				typedef __SVFloat32_t fixed_float32_t __attribute__((arm_sve_vector_bits(N)));
				typedef __SVFloat64_t fixed_float64_t __attribute__((arm_sve_vector_bits(N)));

				typedef __SVBFloat16_t fixed_bfloat16_t __attribute__((arm_sve_vector_bits(N)));

				typedef __SVBool_t fixed_bool_t __attribute__((arm_sve_vector_bits(N)));

				auto &fs8 = typeid(fixed_int8_t);
				auto &fs16 = typeid(fixed_int16_t);
				auto &fs32 = typeid(fixed_int32_t);
				auto &fs64 = typeid(fixed_int64_t);

				auto &fu8 = typeid(fixed_uint8_t);
				auto &fu16 = typeid(fixed_uint16_t);
				auto &fu32 = typeid(fixed_uint32_t);
				auto &fu64 = typeid(fixed_uint64_t);

				auto &ff16 = typeid(fixed_float16_t);
				auto &ff32 = typeid(fixed_float32_t);
				auto &ff64 = typeid(fixed_float64_t);

				auto &fbf16 = typeid(fixed_bfloat16_t);

				auto &fb8 = typeid(fixed_bool_t);

				// CHECK-DAG: @_ZTIu10__SVInt8_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu10__SVInt8_t
				// CHECK-DAG: @_ZTIu11__SVInt16_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu11__SVInt16_t
				// CHECK-DAG: @_ZTIu11__SVInt32_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu11__SVInt32_t
				// CHECK-DAG: @_ZTIu11__SVInt64_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu11__SVInt64_t
				// CHECK-DAG: @_ZTIu11__SVUint8_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu11__SVUint8_t
				// CHECK-DAG: @_ZTIu12__SVUint16_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu12__SVUint16_t
				// CHECK-DAG: @_ZTIu12__SVUint32_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu12__SVUint32_t
				// CHECK-DAG: @_ZTIu12__SVUint64_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu12__SVUint64_t
				// CHECK-DAG: @_ZTIu13__SVFloat16_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu13__SVFloat16_t
				// CHECK-DAG: @_ZTIu13__SVFloat32_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu13__SVFloat32_t
				// CHECK-DAG: @_ZTIu13__SVFloat64_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu13__SVFloat64_t
				// CHECK-DAG: @_ZTIu14__SVBfloat16_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu14__SVBfloat16_t
				// CHECK-DAG: @_ZTIu10__SVBool_t = {{.*}} @_ZTVN10__cxxabiv123__fundamental_type_infoE, {{.*}} @_ZTSu10__SVBool_t
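
The symbol names in these CHECK-DAG lines follow the Itanium C++ ABI scheme for vendor-extended builtin types: `u`, then the length of the name, then the name, prefixed by `_ZTI` for the typeinfo object and `_ZTS` for the typeinfo name. A small sketch that rebuilds them, assuming hypothetical helper names that do not come from the patch:

```cpp
#include <cassert>
#include <string>

// Hypothetical helpers, not part of the patch. They rebuild the expected
// symbol names from the sizeless type's mangled spelling.

// Vendor-extended builtin type: "u" <length of name> <name>.
inline std::string mangleVendorType(const std::string &name) {
  return "u" + std::to_string(name.size()) + name;
}

// "_ZTI" <type> names the typeinfo object for the type.
inline std::string typeinfoSymbol(const std::string &name) {
  return "_ZTI" + mangleVendorType(name);
}

// "_ZTS" <type> names the typeinfo name string for the type.
inline std::string typeinfoNameSymbol(const std::string &name) {
  return "_ZTS" + mangleVendorType(name);
}

// Example from the test: the fixed-length int8 typedef is expected to
// produce the same symbol as the sizeless __SVInt8_t.
inline void checkInt8Symbols() {
  assert(typeinfoSymbol("__SVInt8_t") == "_ZTIu10__SVInt8_t");
  assert(typeinfoNameSymbol("__SVInt8_t") == "_ZTSu10__SVInt8_t");
}
```

The bfloat16 expectation uses the spelling `__SVBfloat16_t` (lowercase `f`) in the mangled name, while the typedef in the test is written against `__SVBFloat16_t`.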

clang/test/Sema/attr-arm-sve-vector-bits.c

	int alignof_int8_var = __alignof__(*extern_int8_ptr);			int alignof_int8_var = __alignof__(*extern_int8_ptr);
	int alignof_int8_var_ptr = __alignof__(extern_int8_ptr);			int alignof_int8_var_ptr = __alignof__(extern_int8_ptr);

	void f(int c) {			void f(int c) {
	fixed_int8_t fs8;			fixed_int8_t fs8;
	svint8_t ss8;			svint8_t ss8;

	void *sel __attribute__((unused));			void *sel __attribute__((unused));
	sel = c ? ss8 : fs8; // expected-error {{incompatible operand types ('svint8_t' (aka '__SVInt8_t') and 'fixed_int8_t' (aka '__SVInt8_t'))}}			sel = c ? ss8 : fs8; // expected-error {{cannot convert between fixed-length and sizeless vector}}
	sel = c ? fs8 : ss8; // expected-error {{incompatible operand types ('fixed_int8_t' (aka '__SVInt8_t') and 'svint8_t' (aka '__SVInt8_t'))}}			sel = c ? fs8 : ss8; // expected-error {{cannot convert between fixed-length and sizeless vector}}

				sel = fs8 + ss8; // expected-error {{cannot convert between fixed-length and sizeless vector}}
				sel = ss8 + fs8; // expected-error {{cannot convert between fixed-length and sizeless vector}}
	}			}

	// --------------------------------------------------------------------------//			// --------------------------------------------------------------------------//
	// Sizeof			// Sizeof

	#define VECTOR_SIZE ((N / 8))			#define VECTOR_SIZE ((N / 8))
	#define PRED_SIZE ((N / 64))			#define PRED_SIZE ((N / 64))

	TEST_CAST(uint64)			TEST_CAST(uint64)
	TEST_CAST(float16)			TEST_CAST(float16)
	TEST_CAST(float32)			TEST_CAST(float32)
	TEST_CAST(float64)			TEST_CAST(float64)
	TEST_CAST(bfloat16)			TEST_CAST(bfloat16)
	TEST_CAST(bool)			TEST_CAST(bool)

	// Test the implicit conversion only applies to valid types			// Test the implicit conversion only applies to valid types
	fixed_int8_t to_fixed_int8_t__from_svuint8_t(svuint8_t x) { return x; } // expected-error {{returning 'svuint8_t' (aka '__SVUint8_t') from a function with incompatible result type 'fixed_int8_t' (aka '__SVInt8_t')}}			fixed_int8_t to_fixed_int8_t__from_svuint8_t(svuint8_t x) { return x; } // expected-error-re {{returning 'svuint8_t' (aka '__SVUint8_t') from a function with incompatible result type 'fixed_int8_t' (vector of {{[0-9]+}} 'signed char' values)}}
	fixed_bool_t to_fixed_bool_t__from_svint32_t(svint32_t x) { return x; } // expected-error {{returning 'svint32_t' (aka '__SVInt32_t') from a function with incompatible result type 'fixed_bool_t' (aka '__SVBool_t')}}			fixed_bool_t to_fixed_bool_t__from_svint32_t(svint32_t x) { return x; } // expected-error-re {{returning 'svint32_t' (aka '__SVInt32_t') from a function with incompatible result type 'fixed_bool_t' (vector of {{[0-9]+}} 'unsigned char' values)}}
				// Test conversion between predicate and int8 is invalid, both have the same
				// memory representation.
				fixed_bool_t to_fixed_bool_t__from_svint8_t(svint8_t x) { return x; } // expected-error-re {{returning 'svint8_t' (aka '__SVInt8_t') from a function with incompatible result type 'fixed_bool_t' (vector of {{[0-9]+}} 'unsigned char' values)}}

	// Test the implicit conversion only applies to fixed-length types			// Test the implicit conversion only applies to fixed-length types
	typedef signed int vSInt32 __attribute__((__vector_size__(16)));			typedef signed int vSInt32 __attribute__((__vector_size__(16)));
	svint32_t to_svint32_t_from_gnut(vSInt32 x) { return x; } // expected-error {{returning 'vSInt32' (vector of 4 'int' values) from a function with incompatible result type 'svint32_t' (aka '__SVInt32_t')}}			svint32_t to_svint32_t_from_gnut(vSInt32 x) { return x; } // expected-error-re {{returning 'vSInt32' (vector of {{[0-9]+}} 'int' values) from a function with incompatible result type 'svint32_t' (aka '__SVInt32_t')}}

	vSInt32 to_gnut_from_svint32_t(svint32_t x) { return x; } // expected-error {{returning 'svint32_t' (aka '__SVInt32_t') from a function with incompatible result type 'vSInt32' (vector of 4 'int' values)}}			vSInt32 to_gnut_from_svint32_t(svint32_t x) { return x; } // expected-error-re {{returning 'svint32_t' (aka '__SVInt32_t') from a function with incompatible result type 'vSInt32' (vector of {{[0-9]+}} 'int' values)}}

	// --------------------------------------------------------------------------//			// --------------------------------------------------------------------------//
	// Test the scalable and fixed-length types can be used interchangeably			// Test the scalable and fixed-length types can be used interchangeably

	svint32_t __attribute__((overloadable)) svfunc(svint32_t op1, svint32_t op2);			svint32_t __attribute__((overloadable)) svfunc(svint32_t op1, svint32_t op2);
	svfloat64_t __attribute__((overloadable)) svfunc(svfloat64_t op1, svfloat64_t op2);			svfloat64_t __attribute__((overloadable)) svfunc(svfloat64_t op1, svfloat64_t op2);
	svbool_t __attribute__((overloadable)) svfunc(svbool_t op1, svbool_t op2);			svbool_t __attribute__((overloadable)) svfunc(svbool_t op1, svbool_t op2);
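
The updated expectations above switch from `expected-error` to `expected-error-re` and replace the literal lane count with `{{[0-9]+}}`, so one expectation can cover every value of N. A sketch of that matching idea using `std::regex`, assuming a hypothetical helper name (`matchesVectorNote`); it is only an illustration and not how clang's `-verify` mode is implemented:

```cpp
#include <cassert>
#include <regex>
#include <string>

// Hypothetical helper, not part of the patch: returns true when a diagnostic
// contains "vector of <digits> '<elemType>' values", whatever the lane count.
// elemType is assumed to contain no regex metacharacters.
inline bool matchesVectorNote(const std::string &diag,
                              const std::string &elemType) {
  const std::regex pattern("vector of [0-9]+ '" + elemType + "' values");
  return std::regex_search(diag, pattern);
}

// The same pattern accepts the 128-bit and 2048-bit spellings of fixed_int8_t.
inline void checkLaneCountIndependence() {
  assert(matchesVectorNote(
      "incompatible result type 'fixed_int8_t' (vector of 16 'signed char' values)",
      "signed char"));
  assert(matchesVectorNote(
      "incompatible result type 'fixed_int8_t' (vector of 256 'signed char' values)",
      "signed char"));
}
```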


llvm/lib/Analysis/InlineCost.cpp

for (User::op_iterator I = GEP.idx_begin(), E = GEP.idx_end(); I != E; ++I)
else		else
Operands.push_back(*I);		Operands.push_back(*I);
return TargetTransformInfo::TCC_Free ==		return TargetTransformInfo::TCC_Free ==
TTI.getUserCost(&GEP, Operands,		TTI.getUserCost(&GEP, Operands,
TargetTransformInfo::TCK_SizeAndLatency);		TargetTransformInfo::TCK_SizeAndLatency);
}		}

bool CallAnalyzer::visitAlloca(AllocaInst &I) {		bool CallAnalyzer::visitAlloca(AllocaInst &I) {
		// FIXME: Support scalable vector types.
		if (isa<ScalableVectorType>(I.getAllocatedType()))
		return false;

// Check whether inlining will turn a dynamic alloca into a static		// Check whether inlining will turn a dynamic alloca into a static
// alloca and handle that case.		// alloca and handle that case.
if (I.isArrayAllocation()) {		if (I.isArrayAllocation()) {
Constant *Size = SimplifiedValues.lookup(I.getArraySize());		Constant *Size = SimplifiedValues.lookup(I.getArraySize());
if (auto *AllocSize = dyn_cast_or_null<ConstantInt>(Size)) {		if (auto *AllocSize = dyn_cast_or_null<ConstantInt>(Size)) {
// Sometimes a dynamic alloca could be converted into a static alloca		// Sometimes a dynamic alloca could be converted into a static alloca
// after this constant prop, and become a huge static alloca on an		// after this constant prop, and become a huge static alloca on an
// unconditional CFG path. Avoid inlining if this is going to happen above		// unconditional CFG path. Avoid inlining if this is going to happen above

llvm/lib/Transforms/InstCombine/InstCombineCasts.cpp

Instruction *InstCombinerImpl::PromoteCastOfAllocation(BitCastInst &CI,
IRBuilderBase::InsertPointGuard Guard(Builder);		IRBuilderBase::InsertPointGuard Guard(Builder);
Builder.SetInsertPoint(&AI);		Builder.SetInsertPoint(&AI);

// Get the type really allocated and the type casted to.		// Get the type really allocated and the type casted to.
Type *AllocElTy = AI.getAllocatedType();		Type *AllocElTy = AI.getAllocatedType();
Type *CastElTy = PTy->getElementType();		Type *CastElTy = PTy->getElementType();
if (!AllocElTy->isSized() \|\| !CastElTy->isSized()) return nullptr;		if (!AllocElTy->isSized() \|\| !CastElTy->isSized()) return nullptr;

		// FIXME: Support scalable vector types.
		if (isa<ScalableVectorType>(AllocElTy) \|\| isa<ScalableVectorType>(CastElTy))
		return nullptr;

Align AllocElTyAlign = DL.getABITypeAlign(AllocElTy);		Align AllocElTyAlign = DL.getABITypeAlign(AllocElTy);
Align CastElTyAlign = DL.getABITypeAlign(CastElTy);		Align CastElTyAlign = DL.getABITypeAlign(CastElTy);
if (CastElTyAlign < AllocElTyAlign) return nullptr;		if (CastElTyAlign < AllocElTyAlign) return nullptr;

// If the allocation has multiple uses, only promote it if we are strictly		// If the allocation has multiple uses, only promote it if we are strictly
// increasing the alignment of the resultant allocation. If we keep it the		// increasing the alignment of the resultant allocation. If we keep it the
// same, we open the door to infinite loops of various kinds.		// same, we open the door to infinite loops of various kinds.
if (!AI.hasOneUse() && CastElTyAlign == AllocElTyAlign) return nullptr;		if (!AI.hasOneUse() && CastElTyAlign == AllocElTyAlign) return nullptr;
▲ Show 20 Lines • Show All 2,591 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/SROA.cpp

@@ ... @@ void visitLoadInst(LoadInst &LI) {

     if (!IsOffsetKnown)
       return PI.setAborted(&LI);

     if (LI.isVolatile() &&
         LI.getPointerAddressSpace() != DL.getAllocaAddrSpace())
       return PI.setAborted(&LI);

+    if (isa<ScalableVectorType>(LI.getType()))
+      return PI.setAborted(&LI);
+
     uint64_t Size = DL.getTypeStoreSize(LI.getType()).getFixedSize();
     return handleLoadOrStore(LI.getType(), LI, Offset, Size, LI.isVolatile());
   }

   void visitStoreInst(StoreInst &SI) {
     Value *ValOp = SI.getValueOperand();
     if (ValOp == *U)
       return PI.setEscapedAndAborted(&SI);
     if (!IsOffsetKnown)
       return PI.setAborted(&SI);

     if (SI.isVolatile() &&
         SI.getPointerAddressSpace() != DL.getAllocaAddrSpace())
       return PI.setAborted(&SI);

+    if (isa<ScalableVectorType>(ValOp->getType()))
+      return PI.setAborted(&SI);
+
     uint64_t Size = DL.getTypeStoreSize(ValOp->getType()).getFixedSize();

     // If this memory access can be shown to statically extend outside the
     // bounds of the allocation, it's behavior is undefined, so simply
     // ignore it. Note that this is more strict than the generic clamping
     // behavior of insertUse. We also try to handle cases which might run the
     // risk of overflow.
     // FIXME: We should instead consider the pointer to have escaped if this
@@ ... @@ static Value *getNaturalGEPWithOffset(IRBuilderTy &IRB, const DataLayout &DL,
   PointerType *Ty = cast<PointerType>(Ptr->getType());

   // Don't consider any GEPs through an i8* as natural unless the TargetTy is
   // an i8.
   if (Ty == IRB.getInt8PtrTy(Ty->getAddressSpace()) && TargetTy->isIntegerTy(8))
     return nullptr;

   Type *ElementTy = Ty->getElementType();
+  if (isa<ScalableVectorType>(ElementTy))
+    return nullptr;
   if (!ElementTy->isSized())
     return nullptr; // We can't GEP through an unsized element.
   APInt ElementSize(Offset.getBitWidth(),
                     DL.getTypeAllocSize(ElementTy).getFixedSize());
   if (ElementSize == 0)
     return nullptr; // Zero-length arrays can't help us build a natural GEP.
   APInt NumSkippedElements = Offset.sdiv(ElementSize);

@@ ... @@
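
Each guard added to SROA.cpp sits directly before a call to getFixedSize(). The sketch below illustrates why that ordering matters. It assumes that asking a scalable size for a fixed value trips an assertion, which is my understanding of how LLVM's TypeSize behaved at the time of this patch. SketchTypeSize and sizeForSlicing are hypothetical names invented for this illustration. They are not LLVM APIs, and the sketch does not reproduce the patch's logic beyond the early-return shape.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Hypothetical stand-in for a size that may be a multiple of an unknown
// runtime factor (vscale). This is NOT llvm::TypeSize.
struct SketchTypeSize {
  uint64_t MinBytes; // known minimum size in bytes
  bool Scalable;     // true if the real size is MinBytes * vscale

  // Only meaningful for fixed sizes; asserts if the size is scalable.
  uint64_t getFixedSize() const {
    assert(!Scalable && "size is not known at compile time");
    return MinBytes;
  }
};

// Mirrors the shape of the guards in the hunk: give up (std::nullopt) on a
// scalable size before ever requesting a fixed size.
std::optional<uint64_t> sizeForSlicing(const SketchTypeSize &TS) {
  if (TS.Scalable)
    return std::nullopt; // plays the role of "return PI.setAborted(...)"
  return TS.getFixedSize();
}
```

With the early return in place, a call such as sizeForSlicing({16, true}) yields an empty optional instead of reaching the assertion, while sizeForSlicing({16, false}) yields 16.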

Diff Detail

Unit Tests: Failed

Diff 282560
