This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
docs/
-
LanguageExtensions.rst
-
lib/
-
Basic/Targets/
-
Targets/
-
X86.h
-
X86.cpp
-
CodeGen/
-
TargetInfo.cpp
-
test/
-
CodeGen/X86/
-
X86/
-
bfloat-abi.c
-
bfloat-half-abi.c
-
bfloat-mangle.cpp
-
Sema/
-
vector-decl-crash.c
-
llvm/include/llvm/IR/
-
include/
-
llvm/
-
IR/
-
Type.h

Differential D130964

[X86][BF16] Enable __bf16 for x86 targets.
ClosedPublic

Authored by FreddyYe on Aug 1 2022, 10:46 PM.

Download Raw Diff

Details

Reviewers

pengfei
craig.topper
andrew.w.kaylor
bkramer
RKSimon
LuoYuanke
rjmccall

Commits

rGe4888a37d367: [X86][BF16] Enable __bf16 for x86 targets.

Summary

X86 psABI has updated to support __bf16 type, the ABI of which is the
same as FP16. See https://discourse.llvm.org/t/patch-add-optional-bfloat16-support/63149

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

FreddyYe created this revision.Aug 1 2022, 10:46 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 1 2022, 10:46 PM

Herald added subscribers: luke957, pengfei, s.egerton, simoncook. · View Herald Transcript

FreddyYe requested review of this revision.Aug 1 2022, 10:46 PM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptAug 1 2022, 10:46 PM

Herald added subscribers: llvm-commits, cfe-commits, • pcwang-thead. · View Herald Transcript

FreddyYe retitled this revision from Enable __bf16 for x86 targets. to [X86][BF16] Enable __bf16 for x86 targets..Aug 1 2022, 10:49 PM

FreddyYe added reviewers: pengfei, craig.topper, andrew.w.kaylor, bkramer, RKSimon.

Herald added a subscriber: StephenFan. · View Herald TranscriptAug 1 2022, 10:49 PM

FreddyYe added a reviewer: LuoYuanke.Aug 1 2022, 10:52 PM

FreddyYe added a parent revision: D130832: [X86][BF16] Make backend type bf16 to follow the psABI.Aug 1 2022, 10:56 PM

Harbormaster completed remote builds in B178690: Diff 449187.Aug 1 2022, 11:32 PM

Add to ReleaseNotes.rst as well.

How are you actually implementing __bf16 on these targets? There isn't even hardware support for conversions.

In D130964#3694473, @rjmccall wrote:

How are you actually implementing __bf16 on these targets? There isn't even hardware support for conversions.

bf16 -> float is really just a bit shift. The other direction gets lowered to a libcall, compiler-rt has a conversion function with proper rounding. I added some support to make the backend promote all other arithmetic to float, but I think that's only enabled on x86 so far.

In D130964#3694473, @rjmccall wrote:

How are you actually implementing __bf16 on these targets? There isn't even hardware support for conversions.

We support float -> bf16 in AVX512BF16. https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#avx512techs=AVX512_BF16
And we found some problems in how to represent bf16 types in intrinsics. For example, we currently defined __bfloat16 as unsigned short. We cannot stop user e.g., adding 2 __bfloat16 in C code and getting the wrong result. So we want to introduce the type on X86. For more information, please see the discussions in D120395,

In D130964#3694540, @bkramer wrote:

In D130964#3694473, @rjmccall wrote:

How are you actually implementing __bf16 on these targets? There isn't even hardware support for conversions.

bf16 -> float is really just a bit shift. The other direction gets lowered to a libcall, compiler-rt has a conversion function with proper rounding. I added some support to make the backend promote all other arithmetic to float, but I think that's only enabled on x86 so far.

Yes, we can view x86 backend has been dealing with __bf16. And with https://reviews.llvm.org/D130832, it will complete follow psABI. About hardware support, x86 actually has supported bf16 since AVX512BF16 (https://reviews.llvm.org/D60552), which has vector conversion support between float and bf16. However, at that time we chose a typedef short as C type. In the future, we can support backend lowering for those instructions: VCVTNE2PS2BF16, VCVTNEPS2BF16 and DPBF16PS

In D130964#3695408, @FreddyYe wrote:

In D130964#3694540, @bkramer wrote:

In D130964#3694473, @rjmccall wrote:

How are you actually implementing __bf16 on these targets? There isn't even hardware support for conversions.

bf16 -> float is really just a bit shift. The other direction gets lowered to a libcall, compiler-rt has a conversion function with proper rounding. I added some support to make the backend promote all other arithmetic to float, but I think that's only enabled on x86 so far.

About hardware support, x86 actually has supported bf16 since AVX512BF16 (https://reviews.llvm.org/D60552), which has vector conversion support between float and bf16.

Right, but this patch is adding x86 support whenever SSE2 is available. AVX512BF16 is available on a *very* small slice of processors. In contrast, e.g. F16C is relatively broadly available, although I understand that we formally support _Float16 all the way back to SSE2 and thus on some processors that lack F16C.

But okay, pure intrinsic support is fine if that's what we're doing.

I think the patch looks fine.

Right, but this patch is adding x86 support whenever SSE2 is available. AVX512BF16 is available on a *very* small slice of processors. In contrast, e.g. F16C is relatively broadly available, although I understand that we formally support _Float16 all the way back to SSE2 and thus on some processors that lack F16C.

But okay, pure intrinsic support is fine if that's what we're doing.

I think the patch looks fine.

Yes. This type is for pure intrinsic support. Thanks for your review. Let's wait for the backend patch to land first.

pengfei added a child revision: D131147: [compiler-rt][BF16] Do not compile truncdfbf2.c and truncsfbf2.c if `__bf16` is not supported.Aug 4 2022, 2:11 AM

pengfei mentioned this in D130832: [X86][BF16] Make backend type bf16 to follow the psABI.Aug 4 2022, 2:16 AM

LGTM.

This revision is now accepted and ready to land.Aug 4 2022, 7:37 PM

pengfei removed a child revision: D131147: [compiler-rt][BF16] Do not compile truncdfbf2.c and truncsfbf2.c if `__bf16` is not supported.Aug 9 2022, 6:12 PM

This revision was landed with ongoing or failed builds.Aug 9 2022, 6:41 PM

Closed by commit rGe4888a37d367: [X86][BF16] Enable __bf16 for x86 targets. (authored by FreddyYe, committed by pengfei). · Explain Why

This revision was automatically updated to reflect the committed changes.

pengfei added a commit: rGe4888a37d367: [X86][BF16] Enable __bf16 for x86 targets..

pengfei mentioned this in D132329: [X86][RFC] Using `__bf16` for AVX512_BF16 intrinsics.Aug 21 2022, 8:47 AM

pengfei mentioned this in rGbc1819389fb4: [X86][RFC] Using `__bf16` for AVX512_BF16 intrinsics.Oct 19 2022, 8:47 AM

Revision Contents

Path

Size

clang/

docs/

LanguageExtensions.rst

4 lines

lib/

Basic/

Targets/

X86.h

4 lines

X86.cpp

2 lines

CodeGen/

TargetInfo.cpp

13 lines

test/

CodeGen/

X86/

bfloat-abi.c

149 lines

bfloat-half-abi.c

149 lines

bfloat-mangle.cpp

5 lines

Sema/

vector-decl-crash.c

2 lines

llvm/

include/

llvm/

IR/

Type.h

5 lines

Diff 451331

clang/docs/LanguageExtensions.rst

	Show First 20 Lines • Show All 750 Lines • ▼ Show 20 Lines
	Otherwise, ``_Float16`` arithmetic is performed by promoting to ``float``,			Otherwise, ``_Float16`` arithmetic is performed by promoting to ``float``,
	performing the operation, and then truncating to ``_Float16``.			performing the operation, and then truncating to ``_Float16``.

	``_Float16`` will be supported on more targets as they define ABIs for it.			``_Float16`` will be supported on more targets as they define ABIs for it.

	``__bf16`` is purely a storage format; it is currently only supported on the following targets:			``__bf16`` is purely a storage format; it is currently only supported on the following targets:
	* 32-bit ARM			* 32-bit ARM
	* 64-bit ARM (AArch64)			* 64-bit ARM (AArch64)
				* X86 (see below)

				On X86 targets, ``__bf16`` is supported as long as SSE2 is available, which
				includes all 64-bit and all recent 32-bit processors.

	``__fp16`` is a storage and interchange format only. This means that values of			``__fp16`` is a storage and interchange format only. This means that values of
	``__fp16`` are immediately promoted to (at least) ``float`` when used in arithmetic			``__fp16`` are immediately promoted to (at least) ``float`` when used in arithmetic
	operations, so that e.g. the result of adding two ``__fp16`` values has type ``float``.			operations, so that e.g. the result of adding two ``__fp16`` values has type ``float``.
	The behavior of ``__fp16`` is specified by the ARM C Language Extensions (`ACLE <http://infocenter.arm.com/help/topic/com.arm.doc.ihi0053d/IHI0053D_acle_2_1.pdf>`_).			The behavior of ``__fp16`` is specified by the ARM C Language Extensions (`ACLE <http://infocenter.arm.com/help/topic/com.arm.doc.ihi0053d/IHI0053D_acle_2_1.pdf>`_).
	Clang uses the ``binary16`` format from IEEE 754-2008 for ``__fp16``, not the ARM			Clang uses the ``binary16`` format from IEEE 754-2008 for ``__fp16``, not the ARM
	alternative format.			alternative format.

	▲ Show 20 Lines • Show All 3,945 Lines • Show Last 20 Lines

clang/lib/Basic/Targets/X86.h

Show First 20 Lines • Show All 150 Lines • ▼ Show 20 Lines
protected:		protected:
llvm::X86::CPUKind CPU = llvm::X86::CK_None;		llvm::X86::CPUKind CPU = llvm::X86::CK_None;

enum FPMathKind { FP_Default, FP_SSE, FP_387 } FPMath = FP_Default;		enum FPMathKind { FP_Default, FP_SSE, FP_387 } FPMath = FP_Default;

public:		public:
X86TargetInfo(const llvm::Triple &Triple, const TargetOptions &)		X86TargetInfo(const llvm::Triple &Triple, const TargetOptions &)
: TargetInfo(Triple) {		: TargetInfo(Triple) {
		BFloat16Width = BFloat16Align = 16;
		BFloat16Format = &llvm::APFloat::BFloat();
LongDoubleFormat = &llvm::APFloat::x87DoubleExtended();		LongDoubleFormat = &llvm::APFloat::x87DoubleExtended();
AddrSpaceMap = &X86AddrSpaceMap;		AddrSpaceMap = &X86AddrSpaceMap;
HasStrictFP = true;		HasStrictFP = true;

bool IsWinCOFF =		bool IsWinCOFF =
getTriple().isOSWindows() && getTriple().isOSBinFormatCOFF();		getTriple().isOSWindows() && getTriple().isOSBinFormatCOFF();
if (IsWinCOFF)		if (IsWinCOFF)
MaxVectorAlign = MaxTLSAlign = 8192u * getCharWidth();		MaxVectorAlign = MaxTLSAlign = 8192u * getCharWidth();
▲ Show 20 Lines • Show All 224 Lines • ▼ Show 20 Lines	uint64_t getPointerWidthV(unsigned AddrSpace) const override {
if (AddrSpace == ptr64)		if (AddrSpace == ptr64)
return 64;		return 64;
return PointerWidth;		return PointerWidth;
}		}

uint64_t getPointerAlignV(unsigned AddrSpace) const override {		uint64_t getPointerAlignV(unsigned AddrSpace) const override {
return getPointerWidthV(AddrSpace);		return getPointerWidthV(AddrSpace);
}		}

		const char *getBFloat16Mangling() const override { return "u6__bf16"; };
};		};

// X86-32 generic target		// X86-32 generic target
class LLVM_LIBRARY_VISIBILITY X86_32TargetInfo : public X86TargetInfo {		class LLVM_LIBRARY_VISIBILITY X86_32TargetInfo : public X86TargetInfo {
public:		public:
X86_32TargetInfo(const llvm::Triple &Triple, const TargetOptions &Opts)		X86_32TargetInfo(const llvm::Triple &Triple, const TargetOptions &Opts)
: X86TargetInfo(Triple, Opts) {		: X86TargetInfo(Triple, Opts) {
DoubleAlign = LongLongAlign = 32;		DoubleAlign = LongLongAlign = 32;
▲ Show 20 Lines • Show All 538 Lines • Show Last 20 Lines

clang/lib/Basic/Targets/X86.cpp

Show First 20 Lines • Show All 352 Lines • ▼ Show 20 Lines	X86SSEEnum Level = llvm::StringSwitch<X86SSEEnum>(Feature)
.Case("+sse3", SSE3)		.Case("+sse3", SSE3)
.Case("+sse2", SSE2)		.Case("+sse2", SSE2)
.Case("+sse", SSE1)		.Case("+sse", SSE1)
.Default(NoSSE);		.Default(NoSSE);
SSELevel = std::max(SSELevel, Level);		SSELevel = std::max(SSELevel, Level);

HasFloat16 = SSELevel >= SSE2;		HasFloat16 = SSELevel >= SSE2;

		HasBFloat16 = SSELevel >= SSE2;

MMX3DNowEnum ThreeDNowLevel = llvm::StringSwitch<MMX3DNowEnum>(Feature)		MMX3DNowEnum ThreeDNowLevel = llvm::StringSwitch<MMX3DNowEnum>(Feature)
.Case("+3dnowa", AMD3DNowAthlon)		.Case("+3dnowa", AMD3DNowAthlon)
.Case("+3dnow", AMD3DNow)		.Case("+3dnow", AMD3DNow)
.Case("+mmx", MMX)		.Case("+mmx", MMX)
.Default(NoMMX3DNow);		.Default(NoMMX3DNow);
MMX3DNowLevel = std::max(MMX3DNowLevel, ThreeDNowLevel);		MMX3DNowLevel = std::max(MMX3DNowLevel, ThreeDNowLevel);

XOPEnum XLevel = llvm::StringSwitch<XOPEnum>(Feature)		XOPEnum XLevel = llvm::StringSwitch<XOPEnum>(Feature)
▲ Show 20 Lines • Show All 1,187 Lines • Show Last 20 Lines

clang/lib/CodeGen/TargetInfo.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,855 Lines • ▼ Show 20 Lines	if (const BuiltinType *BT = Ty->getAs<BuiltinType>()) {
if (k == BuiltinType::Void) {		if (k == BuiltinType::Void) {
Current = NoClass;		Current = NoClass;
} else if (k == BuiltinType::Int128 \|\| k == BuiltinType::UInt128) {		} else if (k == BuiltinType::Int128 \|\| k == BuiltinType::UInt128) {
Lo = Integer;		Lo = Integer;
Hi = Integer;		Hi = Integer;
} else if (k >= BuiltinType::Bool && k <= BuiltinType::LongLong) {		} else if (k >= BuiltinType::Bool && k <= BuiltinType::LongLong) {
Current = Integer;		Current = Integer;
} else if (k == BuiltinType::Float \|\| k == BuiltinType::Double \|\|		} else if (k == BuiltinType::Float \|\| k == BuiltinType::Double \|\|
k == BuiltinType::Float16) {		k == BuiltinType::Float16 \|\| k == BuiltinType::BFloat16) {
Current = SSE;		Current = SSE;
} else if (k == BuiltinType::LongDouble) {		} else if (k == BuiltinType::LongDouble) {
const llvm::fltSemantics *LDF = &getTarget().getLongDoubleFormat();		const llvm::fltSemantics *LDF = &getTarget().getLongDoubleFormat();
if (LDF == &llvm::APFloat::IEEEquad()) {		if (LDF == &llvm::APFloat::IEEEquad()) {
Lo = SSE;		Lo = SSE;
Hi = SSEUp;		Hi = SSEUp;
} else if (LDF == &llvm::APFloat::x87DoubleExtended()) {		} else if (LDF == &llvm::APFloat::x87DoubleExtended()) {
Lo = X87;		Lo = X87;
▲ Show 20 Lines • Show All 114 Lines • ▼ Show 20 Lines	if (const ComplexType *CT = Ty->getAs<ComplexType>()) {
QualType ET = getContext().getCanonicalType(CT->getElementType());		QualType ET = getContext().getCanonicalType(CT->getElementType());

uint64_t Size = getContext().getTypeSize(Ty);		uint64_t Size = getContext().getTypeSize(Ty);
if (ET->isIntegralOrEnumerationType()) {		if (ET->isIntegralOrEnumerationType()) {
if (Size <= 64)		if (Size <= 64)
Current = Integer;		Current = Integer;
else if (Size <= 128)		else if (Size <= 128)
Lo = Hi = Integer;		Lo = Hi = Integer;
} else if (ET->isFloat16Type() \|\| ET == getContext().FloatTy) {		} else if (ET->isFloat16Type() \|\| ET == getContext().FloatTy \|\|
		ET->isBFloat16Type()) {
Current = SSE;		Current = SSE;
} else if (ET == getContext().DoubleTy) {		} else if (ET == getContext().DoubleTy) {
Lo = Hi = SSE;		Lo = Hi = SSE;
} else if (ET == getContext().LongDoubleTy) {		} else if (ET == getContext().LongDoubleTy) {
const llvm::fltSemantics *LDF = &getTarget().getLongDoubleFormat();		const llvm::fltSemantics *LDF = &getTarget().getLongDoubleFormat();
if (LDF == &llvm::APFloat::IEEEquad())		if (LDF == &llvm::APFloat::IEEEquad())
Current = Memory;		Current = Memory;
else if (LDF == &llvm::APFloat::x87DoubleExtended())		else if (LDF == &llvm::APFloat::x87DoubleExtended())
▲ Show 20 Lines • Show All 455 Lines • ▼ Show 20 Lines	if (!T0 \|\| T0->isDoubleTy())
return llvm::Type::getDoubleTy(getVMContext());		return llvm::Type::getDoubleTy(getVMContext());

// Get the adjacent FP type.		// Get the adjacent FP type.
llvm::Type *T1 = nullptr;		llvm::Type *T1 = nullptr;
unsigned T0Size = TD.getTypeAllocSize(T0);		unsigned T0Size = TD.getTypeAllocSize(T0);
if (SourceSize > T0Size)		if (SourceSize > T0Size)
T1 = getFPTypeAtOffset(IRType, IROffset + T0Size, TD);		T1 = getFPTypeAtOffset(IRType, IROffset + T0Size, TD);
if (T1 == nullptr) {		if (T1 == nullptr) {
// Check if IRType is a half + float. float type will be in IROffset+4 due		// Check if IRType is a half/bfloat + float. float type will be in IROffset+4 due
// to its alignment.		// to its alignment.
if (T0->isHalfTy() && SourceSize > 4)		if (T0->is16bitFPTy() && SourceSize > 4)
T1 = getFPTypeAtOffset(IRType, IROffset + 4, TD);		T1 = getFPTypeAtOffset(IRType, IROffset + 4, TD);
// If we can't get a second FP type, return a simple half or float.		// If we can't get a second FP type, return a simple half or float.
// avx512fp16-abi.c:pr51813_2 shows it works to return float for		// avx512fp16-abi.c:pr51813_2 shows it works to return float for
// {float, i8} too.		// {float, i8} too.
if (T1 == nullptr)		if (T1 == nullptr)
return T0;		return T0;
}		}

if (T0->isFloatTy() && T1->isFloatTy())		if (T0->isFloatTy() && T1->isFloatTy())
return llvm::FixedVectorType::get(T0, 2);		return llvm::FixedVectorType::get(T0, 2);

if (T0->isHalfTy() && T1->isHalfTy()) {		if (T0->is16bitFPTy() && T1->is16bitFPTy()) {
llvm::Type *T2 = nullptr;		llvm::Type *T2 = nullptr;
if (SourceSize > 4)		if (SourceSize > 4)
T2 = getFPTypeAtOffset(IRType, IROffset + 4, TD);		T2 = getFPTypeAtOffset(IRType, IROffset + 4, TD);
if (T2 == nullptr)		if (T2 == nullptr)
return llvm::FixedVectorType::get(T0, 2);		return llvm::FixedVectorType::get(T0, 2);
return llvm::FixedVectorType::get(T0, 4);		return llvm::FixedVectorType::get(T0, 4);
}		}

if (T0->isHalfTy() \|\| T1->isHalfTy())		if (T0->is16bitFPTy() \|\| T1->is16bitFPTy())
return llvm::FixedVectorType::get(llvm::Type::getHalfTy(getVMContext()), 4);		return llvm::FixedVectorType::get(llvm::Type::getHalfTy(getVMContext()), 4);

return llvm::Type::getDoubleTy(getVMContext());		return llvm::Type::getDoubleTy(getVMContext());
}		}


/// GetINTEGERTypeAtOffset - The ABI specifies that a value should be passed in		/// GetINTEGERTypeAtOffset - The ABI specifies that a value should be passed in
/// an 8-byte GPR. This means that we either have a scalar or we are talking		/// an 8-byte GPR. This means that we either have a scalar or we are talking
▲ Show 20 Lines • Show All 8,334 Lines • Show Last 20 Lines

clang/test/CodeGen/X86/bfloat-abi.c

This file was added.

				// RUN: %clang_cc1 -triple x86_64-linux -emit-llvm -target-feature +sse2 < %s \| FileCheck %s --check-prefixes=CHECK

				struct bfloat1 {
				__bf16 a;
				};

				struct bfloat1 h1(__bf16 a) {
				// CHECK: define{{.*}}bfloat @
				struct bfloat1 x;
				x.a = a;
				return x;
				}

				struct bfloat2 {
				__bf16 a;
				__bf16 b;
				};

				struct bfloat2 h2(__bf16 a, __bf16 b) {
				// CHECK: define{{.*}}<2 x bfloat> @
				struct bfloat2 x;
				x.a = a;
				x.b = b;
				return x;
				}

				struct bfloat3 {
				__bf16 a;
				__bf16 b;
				__bf16 c;
				};

				struct bfloat3 h3(__bf16 a, __bf16 b, __bf16 c) {
				// CHECK: define{{.*}}<4 x bfloat> @
				struct bfloat3 x;
				x.a = a;
				x.b = b;
				x.c = c;
				return x;
				}

				struct bfloat4 {
				__bf16 a;
				__bf16 b;
				__bf16 c;
				__bf16 d;
				};

				struct bfloat4 h4(__bf16 a, __bf16 b, __bf16 c, __bf16 d) {
				// CHECK: define{{.*}}<4 x bfloat> @
				struct bfloat4 x;
				x.a = a;
				x.b = b;
				x.c = c;
				x.d = d;
				return x;
				}

				struct floatbfloat {
				float a;
				__bf16 b;
				};

				struct floatbfloat fh(float a, __bf16 b) {
				// CHECK: define{{.*}}<4 x half> @
				struct floatbfloat x;
				x.a = a;
				x.b = b;
				return x;
				}

				struct floatbfloat2 {
				float a;
				__bf16 b;
				__bf16 c;
				};

				struct floatbfloat2 fh2(float a, __bf16 b, __bf16 c) {
				// CHECK: define{{.*}}<4 x half> @
				struct floatbfloat2 x;
				x.a = a;
				x.b = b;
				x.c = c;
				return x;
				}

				struct bfloatfloat {
				__bf16 a;
				float b;
				};

				struct bfloatfloat hf(__bf16 a, float b) {
				// CHECK: define{{.*}}<4 x half> @
				struct bfloatfloat x;
				x.a = a;
				x.b = b;
				return x;
				}

				struct bfloat2float {
				__bf16 a;
				__bf16 b;
				float c;
				};

				struct bfloat2float h2f(__bf16 a, __bf16 b, float c) {
				// CHECK: define{{.*}}<4 x bfloat> @
				struct bfloat2float x;
				x.a = a;
				x.b = b;
				x.c = c;
				return x;
				}

				struct floatbfloat3 {
				float a;
				__bf16 b;
				__bf16 c;
				__bf16 d;
				};

				struct floatbfloat3 fh3(float a, __bf16 b, __bf16 c, __bf16 d) {
				// CHECK: define{{.*}}{ <4 x half>, bfloat } @
				struct floatbfloat3 x;
				x.a = a;
				x.b = b;
				x.c = c;
				x.d = d;
				return x;
				}

				struct bfloat5 {
				__bf16 a;
				__bf16 b;
				__bf16 c;
				__bf16 d;
				__bf16 e;
				};

				struct bfloat5 h5(__bf16 a, __bf16 b, __bf16 c, __bf16 d, __bf16 e) {
				// CHECK: define{{.*}}{ <4 x bfloat>, bfloat } @
				struct bfloat5 x;
				x.a = a;
				x.b = b;
				x.c = c;
				x.d = d;
				x.e = e;
				return x;
				}

clang/test/CodeGen/X86/bfloat-half-abi.c

This file was added.

				// RUN: %clang_cc1 -triple x86_64-linux -emit-llvm -target-feature +sse2 < %s \| FileCheck %s --check-prefixes=CHECK

				struct bfloat1 {
				__bf16 a;
				};

				struct bfloat1 h1(__bf16 a) {
				// CHECK: define{{.*}}bfloat @
				struct bfloat1 x;
				x.a = a;
				return x;
				}

				struct bfloat2 {
				__bf16 a;
				__bf16 b;
				};

				struct bfloat2 h2(__bf16 a, __bf16 b) {
				// CHECK: define{{.*}}<2 x bfloat> @
				struct bfloat2 x;
				x.a = a;
				x.b = b;
				return x;
				}

				struct bfloat3 {
				__bf16 a;
				__bf16 b;
				__bf16 c;
				};

				struct bfloat3 h3(__bf16 a, __bf16 b, __bf16 c) {
				// CHECK: define{{.*}}<4 x bfloat> @
				struct bfloat3 x;
				x.a = a;
				x.b = b;
				x.c = c;
				return x;
				}

				struct bfloat4 {
				__bf16 a;
				__bf16 b;
				__bf16 c;
				__bf16 d;
				};

				struct bfloat4 h4(__bf16 a, __bf16 b, __bf16 c, __bf16 d) {
				// CHECK: define{{.*}}<4 x bfloat> @
				struct bfloat4 x;
				x.a = a;
				x.b = b;
				x.c = c;
				x.d = d;
				return x;
				}

				struct floatbfloat {
				float a;
				__bf16 b;
				};

				struct floatbfloat fh(float a, __bf16 b) {
				// CHECK: define{{.*}}<4 x half> @
				struct floatbfloat x;
				x.a = a;
				x.b = b;
				return x;
				}

				struct floatbfloat2 {
				float a;
				__bf16 b;
				__bf16 c;
				};

				struct floatbfloat2 fh2(float a, __bf16 b, __bf16 c) {
				// CHECK: define{{.*}}<4 x half> @
				struct floatbfloat2 x;
				x.a = a;
				x.b = b;
				x.c = c;
				return x;
				}

				struct bfloatfloat {
				__bf16 a;
				float b;
				};

				struct bfloatfloat hf(__bf16 a, float b) {
				// CHECK: define{{.*}}<4 x half> @
				struct bfloatfloat x;
				x.a = a;
				x.b = b;
				return x;
				}

				struct bfloat2float {
				__bf16 a;
				__bf16 b;
				float c;
				};

				struct bfloat2float h2f(__bf16 a, __bf16 b, float c) {
				// CHECK: define{{.*}}<4 x bfloat> @
				struct bfloat2float x;
				x.a = a;
				x.b = b;
				x.c = c;
				return x;
				}

				struct floatbfloat3 {
				float a;
				__bf16 b;
				__bf16 c;
				__bf16 d;
				};

				struct floatbfloat3 fh3(float a, __bf16 b, __bf16 c, __bf16 d) {
				// CHECK: define{{.*}}{ <4 x half>, bfloat } @
				struct floatbfloat3 x;
				x.a = a;
				x.b = b;
				x.c = c;
				x.d = d;
				return x;
				}

				struct bfloat5 {
				__bf16 a;
				__bf16 b;
				__bf16 c;
				__bf16 d;
				__bf16 e;
				};

				struct bfloat5 h5(__bf16 a, __bf16 b, __bf16 c, __bf16 d, __bf16 e) {
				// CHECK: define{{.*}}{ <4 x bfloat>, bfloat } @
				struct bfloat5 x;
				x.a = a;
				x.b = b;
				x.c = c;
				x.d = d;
				x.e = e;
				return x;
				}

clang/test/CodeGen/X86/bfloat-mangle.cpp

This file was added.

				// RUN: %clang_cc1 -triple i386-unknown-unknown -target-feature +sse2 -emit-llvm -o - %s \| FileCheck %s
				// RUN: %clang_cc1 -triple x86_64-unknown-unknown -target-feature +sse2 -emit-llvm -o - %s \| FileCheck %s

				// CHECK: define {{.*}}void @_Z3foou6__bf16(bfloat noundef %b)
				void foo(__bf16 b) {}

clang/test/Sema/vector-decl-crash.c

	// RUN: %clang_cc1 %s -fsyntax-only -verify -triple x86_64-unknown-unknown			// RUN: %clang_cc1 %s -fsyntax-only -verify -triple riscv64-unknown-unknown

	// GH50171			// GH50171
	// This would previously crash when __bf16 was not a supported type.			// This would previously crash when __bf16 was not a supported type.
	__bf16 v64bf __attribute__((vector_size(128))); // expected-error {{__bf16 is not supported on this target}} \			__bf16 v64bf __attribute__((vector_size(128))); // expected-error {{__bf16 is not supported on this target}} \
	expected-error {{vector size not an integral multiple of component size}}			expected-error {{vector size not an integral multiple of component size}}

llvm/include/llvm/IR/Type.h

Show First 20 Lines • Show All 138 Lines • ▼ Show 20 Lines	public:
bool isVoidTy() const { return getTypeID() == VoidTyID; }		bool isVoidTy() const { return getTypeID() == VoidTyID; }

/// Return true if this is 'half', a 16-bit IEEE fp type.		/// Return true if this is 'half', a 16-bit IEEE fp type.
bool isHalfTy() const { return getTypeID() == HalfTyID; }		bool isHalfTy() const { return getTypeID() == HalfTyID; }

/// Return true if this is 'bfloat', a 16-bit bfloat type.		/// Return true if this is 'bfloat', a 16-bit bfloat type.
bool isBFloatTy() const { return getTypeID() == BFloatTyID; }		bool isBFloatTy() const { return getTypeID() == BFloatTyID; }

		/// Return true if this is a 16-bit float type.
		bool is16bitFPTy() const {
		return getTypeID() == BFloatTyID \|\| getTypeID() == HalfTyID;
		}

/// Return true if this is 'float', a 32-bit IEEE fp type.		/// Return true if this is 'float', a 32-bit IEEE fp type.
bool isFloatTy() const { return getTypeID() == FloatTyID; }		bool isFloatTy() const { return getTypeID() == FloatTyID; }

/// Return true if this is 'double', a 64-bit IEEE fp type.		/// Return true if this is 'double', a 64-bit IEEE fp type.
bool isDoubleTy() const { return getTypeID() == DoubleTyID; }		bool isDoubleTy() const { return getTypeID() == DoubleTyID; }

/// Return true if this is x86 long double.		/// Return true if this is x86 long double.
bool isX86_FP80Ty() const { return getTypeID() == X86_FP80TyID; }		bool isX86_FP80Ty() const { return getTypeID() == X86_FP80TyID; }
▲ Show 20 Lines • Show All 358 Lines • Show Last 20 Lines