This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
docs/
1/1
LanguageExtensions.rst
-
ReleaseNotes.rst
-
lib/Basic/Targets/
-
Basic/
-
Targets/
5/5
X86.cpp
-
test/
-
CodeGen/X86/
-
X86/
-
Float16-arithmetic.c
-
avx512fp16-abi.c
-
avx512fp16-complex.c
1/1
fp16-abi.c
1/1
fp16-complex.c
-
Sema/
-
Float16.c
-
conversion-target-dep.c
-
SemaCXX/
1/1
Float16.cpp

Differential D114099

Enable `_Float16` type support on X86 without the avx512fp16 flag
AbandonedPublic

Authored by pengfei on Nov 17 2021, 8:59 AM.

Download Raw Diff

Details

Reviewers

rjmccall
andrew.w.kaylor
zahiraam

Summary

The _Float16 type is supported on x86 systems with SSE2 enabled. Operations are emulated by software emulation and “float” instructions. This patch is allowing the support of _Float16 type without the use of -max512fp16 flag. The final goal being, perform _Float16 emulation for all arithmetic expressions.

Diff Detail

Event Timeline

zahiraam requested review of this revision.Nov 17 2021, 8:59 AM

zahiraam created this revision.

Herald added a project: Restricted Project. · View Herald TranscriptNov 17 2021, 8:59 AM

Harbormaster completed remote builds in B134753: Diff 387959.Nov 17 2021, 9:34 AM

rjmccall added inline comments.Nov 17 2021, 4:03 PM

clang/lib/Basic/Targets/X86.cpp
242	We should probably be setting `HasLegalHalfType` here.
373	Out of curiosity, why SSE2? SSE2 adds double-precision, but we just need single-precision, which is in the original SSE. It doesn't much matter anymore, of course, since both are ubiquitous.

pengfei added inline comments.Nov 17 2021, 6:13 PM

clang/lib/Basic/Targets/X86.cpp
372	sse2?
373	https://lists.llvm.org/pipermail/llvm-dev/2021-July/151618.html
clang/test/CodeGen/X86/avx512fp16-abi.c
1	Why don't move it to fp16-abi.c directly?
clang/test/CodeGen/X86/avx512fp16-complex.c
1	Change the file name?

rjmccall added inline comments.Nov 17 2021, 6:23 PM

clang/lib/Basic/Targets/X86.cpp
373	Ah, makes sense.

zahiraam updated this revision to Diff 388179.Nov 18 2021, 6:23 AM

zahiraam marked 7 inline comments as done.

tschuett added a subscriber: tschuett.Nov 18 2021, 6:36 AM

tschuett added inline comments.

clang/docs/LanguageExtensions.rst
676	Would something like SSE2 and up help understanding?

Harbormaster completed remote builds in B134887: Diff 388179.Nov 18 2021, 6:53 AM

zahiraam updated this revision to Diff 388189.Nov 18 2021, 7:02 AM

zahiraam marked an inline comment as done.

Harbormaster completed remote builds in B134897: Diff 388189.Nov 18 2021, 7:33 AM

rjmccall requested changes to this revision.Nov 18 2021, 8:37 AM

rjmccall added inline comments.

clang/test/SemaCXX/Float16.cpp
6	This test (and Float16.c) should test at least one target that doesn't have `_Float16` support, so please just add `-DHAVE` to the x86_64 line and add, I dunno, a generic i386 or SPARC line.

This revision now requires changes to proceed.Nov 18 2021, 8:37 AM

zahiraam updated this revision to Diff 388273.Nov 18 2021, 11:20 AM

zahiraam marked an inline comment as done.

Harbormaster completed remote builds in B134953: Diff 388273.Nov 18 2021, 11:52 AM

Thanks, LGTM

This revision is now accepted and ready to land.Nov 18 2021, 10:35 PM

rG6623c02d70c3

As mentioned in
https://github.com/llvm/llvm-project/commit/6623c02d70c3732dbea59c6d79c69501baf9627b#commitcomment-60741407

This change is breaking build of compiler-rt on Ubuntu bionic and others on amd64:

"/build/llvm-toolchain-snapshot-14~++20211119100719+d729f4c38fca/build-llvm/./bin/clang" --target=x86_64-pc-linux-gnu -DVISIBILITY_HIDDEN  -fstack-protector-strong -Wformat -Werror=format-security -Wno-unused-command-line-argument -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG  -m32 -DCOMPILER_RT_HAS_FLOAT16 -std=c11 -fPIC -fno-builtin -fvisibility=hidden -fomit-frame-pointer -MD -MT CMakeFiles/clang_rt.builtins-i386.dir/extendhfsf2.c.o -MF CMakeFiles/clang_rt.builtins-i386.dir/extendhfsf2.c.o.d -o CMakeFiles/clang_rt.builtins-i386.dir/extendhfsf2.c.o -c '/build/llvm-toolchain-snapshot-14~++20211119100719+d729f4c38fca/compiler-rt/lib/builtins/extendhfsf2.c'
In file included from /build/llvm-toolchain-snapshot-14~++20211119100719+d729f4c38fca/compiler-rt/lib/builtins/extendhfsf2.c:11:
In file included from /build/llvm-toolchain-snapshot-14~++20211119100719+d729f4c38fca/compiler-rt/lib/builtins/fp_extend_impl.inc:38:
/build/llvm-toolchain-snapshot-14~++20211119100719+d729f4c38fca/compiler-rt/lib/builtins/fp_extend.h:44:9: error: _Float16 is not supported on this target
typedef _Float16 src_t;
        ^
1 error generated.

Full log:
https://llvm-jenkins.debian.net/view/Debian%20sid/job/llvm-toolchain-binaries/architecture=amd64,distribution=unstable,label=amd64/104/consoleFull

Actually, it breaks on all Debian.
Could you please revert it?

In D114099#3148631, @sylvestre.ledru wrote:

Actually, it breaks on all Debian.
Could you please revert it?

Done.

In D114099#3148665, @zahiraam wrote:

In D114099#3148631, @sylvestre.ledru wrote:

Actually, it breaks on all Debian.
Could you please revert it?

Done.

I have reverted this patch but would like to push it in at some point (may be after the back end changes https://reviews.llvm.org/D107082 will be merged in.
But I see in the command above that it is compiling with -DCOMPILER_RT_HAS_FLOAT16. @sylvestre.ledru is this flag really supposed to be on? Was it the case before this patch? @rjmccall isn't this because we turned on HasLegalHalfType?

@zahiraam

Sorry, i missed your questions

But I see in the command above that it is compiling with -DCOMPILER_RT_HAS_FLOAT16. @sylvestre.ledru is this flag really supposed to be on?

dunno, I guess this is set by the build system (I didn't set it)

The issue is back but mostly on ubuntu bionic on amd64:

"/build/llvm-toolchain-snapshot-15~++20220702091600+23ee84f43201/build-llvm/./bin/clang" --target=x86_64-pc-linux-gnu -DVISIBILITY_HIDDEN  -fstack-protector-strong -Wformat -Werror=format-security -Wno-unused-command-line-argument -Wdate-time -D_FORTIFY_SOURCE=2 -O3 -DNDEBUG -m32 -DCOMPILER_RT_HAS_FLOAT16 -std=c11 -fPIC -fno-builtin -fvisibility=hidden -fomit-frame-pointer -MD -MT CMakeFiles/clang_rt.builtins-i386.dir/extendhfsf2.c.o -MF CMakeFiles/clang_rt.builtins-i386.dir/extendhfsf2.c.o.d -o CMakeFiles/clang_rt.builtins-i386.dir/extendhfsf2.c.o -c '/build/llvm-toolchain-snapshot-15~++20220702091600+23ee84f43201/compiler-rt/lib/builtins/extendhfsf2.c'
In file included from /build/llvm-toolchain-snapshot-15~++20220702091600+23ee84f43201/compiler-rt/lib/builtins/extendhfsf2.c:11:
In file included from /build/llvm-toolchain-snapshot-15~++20220702091600+23ee84f43201/compiler-rt/lib/builtins/fp_extend_impl.inc:38:
/build/llvm-toolchain-snapshot-15~++20220702091600+23ee84f43201/compiler-rt/lib/builtins/fp_extend.h:44:9: error: _Float16 is not supported on this target
typedef _Float16 src_t;
        ^
1 error generated.

Herald added a project: Restricted Project. · View Herald TranscriptJul 2 2022, 3:45 AM

Herald added a subscriber: jsji. · View Herald Transcript

sylvestre.ledru mentioned this in D107082: [X86][RFC] Enable `_Float16` type support on X86 following the psABI.Jul 2 2022, 4:07 AM

This patch was replaced by D128571. Let me commandeer and abandon it.

pengfei abandoned this revision.Jul 2 2022, 5:06 AM

Revision Contents

Path

Size

clang/

docs/

LanguageExtensions.rst

2 lines

ReleaseNotes.rst

1 line

lib/

Basic/

Targets/

X86.cpp

4 lines

test/

CodeGen/

X86/

Float16-arithmetic.c

73 lines

avx512fp16-abi.c

avx512fp16-complex.c

	fp16-abi.c
	avx512fp16-abi.c

2 lines

	fp16-complex.c
	avx512fp16-complex.c

1 line

Sema/

Float16.c

3 lines

conversion-target-dep.c

2 lines

SemaCXX/

Float16.cpp

3 lines

Diff 388273

clang/docs/LanguageExtensions.rst

	Show First 20 Lines • Show All 667 Lines • ▼ Show 20 Lines
	``__fp16`` is supported on every target, as it is purely a storage format; see below.			``__fp16`` is supported on every target, as it is purely a storage format; see below.
	``_Float16`` is currently only supported on the following targets, with further			``_Float16`` is currently only supported on the following targets, with further
	targets pending ABI standardization:			targets pending ABI standardization:

	* 32-bit ARM			* 32-bit ARM
	* 64-bit ARM (AArch64)			* 64-bit ARM (AArch64)
	* AMDGPU			* AMDGPU
	* SPIR			* SPIR
	* X86 (Only available under feature AVX512-FP16)			* X86 (Available with feature SSE2 and up enabled)
				tschuettUnsubmitted Done Reply Inline Actions Would something like SSE2 and up help understanding? tschuett: Would something like SSE2 and up help understanding?

	``_Float16`` will be supported on more targets as they define ABIs for it.			``_Float16`` will be supported on more targets as they define ABIs for it.

	``__bf16`` is purely a storage format; it is currently only supported on the following targets:			``__bf16`` is purely a storage format; it is currently only supported on the following targets:
	* 32-bit ARM			* 32-bit ARM
	* 64-bit ARM (AArch64)			* 64-bit ARM (AArch64)

	The ``__bf16`` type is only available when supported in hardware.			The ``__bf16`` type is only available when supported in hardware.
	▲ Show 20 Lines • Show All 3,542 Lines • Show Last 20 Lines

clang/docs/ReleaseNotes.rst

	Show First 20 Lines • Show All 181 Lines • ▼ Show 20 Lines
	---------------------			---------------------

	- ...			- ...

	X86 Support in Clang			X86 Support in Clang
	--------------------			--------------------

	- Support for ``AVX512-FP16`` instructions has been added.			- Support for ``AVX512-FP16`` instructions has been added.
				- Support for ``_Float16`` type has been added.

	Arm and AArch64 Support in Clang			Arm and AArch64 Support in Clang
	--------------------------------			--------------------------------

	- Support has been added for the following processors (command-line identifiers in parentheses):			- Support has been added for the following processors (command-line identifiers in parentheses):
	- Arm Cortex-A510 (``cortex-a510``)			- Arm Cortex-A510 (``cortex-a510``)
	- Arm Cortex-X2 (``cortex-x2``)			- Arm Cortex-X2 (``cortex-x2``)

	▲ Show 20 Lines • Show All 116 Lines • Show Last 20 Lines

clang/lib/Basic/Targets/X86.cpp

Show First 20 Lines • Show All 233 Lines • ▼ Show 20 Lines	for (const auto &Feature : Features) {
} else if (Feature == "+avx512vnni") {		} else if (Feature == "+avx512vnni") {
HasAVX512VNNI = true;		HasAVX512VNNI = true;
} else if (Feature == "+avx512bf16") {		} else if (Feature == "+avx512bf16") {
HasAVX512BF16 = true;		HasAVX512BF16 = true;
} else if (Feature == "+avx512er") {		} else if (Feature == "+avx512er") {
HasAVX512ER = true;		HasAVX512ER = true;
} else if (Feature == "+avx512fp16") {		} else if (Feature == "+avx512fp16") {
HasAVX512FP16 = true;		HasAVX512FP16 = true;
HasFloat16 = true;
rjmccallUnsubmitted Done Reply Inline Actions We should probably be setting `HasLegalHalfType` here. rjmccall: We should probably be setting `HasLegalHalfType` here.
} else if (Feature == "+avx512pf") {		} else if (Feature == "+avx512pf") {
HasAVX512PF = true;		HasAVX512PF = true;
		HasLegalHalfType = true;
} else if (Feature == "+avx512dq") {		} else if (Feature == "+avx512dq") {
HasAVX512DQ = true;		HasAVX512DQ = true;
} else if (Feature == "+avx512bitalg") {		} else if (Feature == "+avx512bitalg") {
HasAVX512BITALG = true;		HasAVX512BITALG = true;
} else if (Feature == "+avx512bw") {		} else if (Feature == "+avx512bw") {
HasAVX512BW = true;		HasAVX512BW = true;
} else if (Feature == "+avx512vl") {		} else if (Feature == "+avx512vl") {
HasAVX512VL = true;		HasAVX512VL = true;
▲ Show 20 Lines • Show All 111 Lines • ▼ Show 20 Lines	for (const auto &Feature : Features) {

XOPEnum XLevel = llvm::StringSwitch<XOPEnum>(Feature)		XOPEnum XLevel = llvm::StringSwitch<XOPEnum>(Feature)
.Case("+xop", XOP)		.Case("+xop", XOP)
.Case("+fma4", FMA4)		.Case("+fma4", FMA4)
.Case("+sse4a", SSE4A)		.Case("+sse4a", SSE4A)
.Default(NoXOP);		.Default(NoXOP);
XOPLevel = std::max(XOPLevel, XLevel);		XOPLevel = std::max(XOPLevel, XLevel);
}		}
		// Turn on _float16 for x86 (feature sse2)
		pengfeiAuthorUnsubmitted Done Reply Inline Actions sse2? pengfei: sse2?
		HasFloat16 = SSELevel >= SSE2;
		rjmccallUnsubmitted Done Reply Inline Actions Out of curiosity, why SSE2? SSE2 adds double-precision, but we just need single-precision, which is in the original SSE. It doesn't much matter anymore, of course, since both are ubiquitous. rjmccall: Out of curiosity, why SSE2? SSE2 adds double-precision, but we just need single-precision…
		pengfeiAuthorUnsubmitted Done Reply Inline Actions https://lists.llvm.org/pipermail/llvm-dev/2021-July/151618.html pengfei: https://lists.llvm.org/pipermail/llvm-dev/2021-July/151618.html
		rjmccallUnsubmitted Done Reply Inline Actions Ah, makes sense. rjmccall: Ah, makes sense.

// LLVM doesn't have a separate switch for fpmath, so only accept it if it		// LLVM doesn't have a separate switch for fpmath, so only accept it if it
// matches the selected sse level.		// matches the selected sse level.
if ((FPMath == FP_SSE && SSELevel < SSE1) \|\|		if ((FPMath == FP_SSE && SSELevel < SSE1) \|\|
(FPMath == FP_387 && SSELevel >= SSE1)) {		(FPMath == FP_387 && SSELevel >= SSE1)) {
Diags.Report(diag::err_target_unsupported_fpmath)		Diags.Report(diag::err_target_unsupported_fpmath)
<< (FPMath == FP_SSE ? "sse" : "387");		<< (FPMath == FP_SSE ? "sse" : "387");
return false;		return false;
▲ Show 20 Lines • Show All 1,163 Lines • Show Last 20 Lines

clang/test/CodeGen/X86/Float16-arithmetic.c

This file was added.

				// RUN: %clang_cc1 -triple x86_64-unknown-unknown -emit-llvm \
				// RUN: < %s \| FileCheck %s --check-prefixes=CHECK

				_Float16 add1(_Float16 a, _Float16 b) {
				// CHECK-LABEL: define{{.*}} half @add1
				// CHECK: alloca half
				// CHECK: alloca half
				// CHECK: store half {{.}}, half
				// CHECK: store half {{.}}, half
				// CHECK: load half, half*
				// CHECK: load half, half* {{.*}}
				// CHECK: fadd half {{.}}, {{.}}
				// CHECK: ret half
				return a + b;
				}

				_Float16 add2(_Float16 a, _Float16 b, _Float16 c) {
				// CHECK-LABEL: define{{.*}} half @add2
				// CHECK: alloca half
				// CHECK: alloca half
				// CHECK: alloca half
				// CHECK: store half {{.}}, half
				// CHECK: store half {{.}}, half
				// CHECK: store half {{.}}, half
				// CHECK: load half, half* {{.*}}
				// CHECK: load half, half* {{.*}}
				// CHECK: fadd half {{.}}, {{.}}
				// CHECK: load half, half* {{.*}}
				// CHECK: fadd half {{.}}, {{.}}
				// CHECK: ret half
				return a + b + c;
				}

				_Float16 sub(_Float16 a, _Float16 b) {
				// CHECK-LABEL: define{{.*}} half @sub
				// CHECK: alloca half
				// CHECK: alloca half
				// CHECK: store half {{.}}, half
				// CHECK: store half {{.}}, half
				// CHECK: load half, half*
				// CHECK: load half, half* {{.*}}
				// CHECK: fsub half {{.}}, {{.}}
				// CHECK: ret half
				return a - b;
				}

				_Float16 div(_Float16 a, _Float16 b) {
				// CHECK-LABEL: define{{.*}} half @div
				// CHECK: alloca half
				// CHECK: alloca half
				// CHECK: store half {{.}}, half
				// CHECK: store half {{.}}, half
				// CHECK: load half, half* {{.*}}
				// CHECK: load half, half* {{.*}}
				// CHECK: fdiv half {{.}}, {{.}}
				// CHECK: ret half
				return a / b;
				}

				_Float16 mul(_Float16 a, _Float16 b) {
				// CHECK-LABEL: define{{.*}} half @mul
				// CHECK: alloca half
				// CHECK: alloca half
				// CHECK: store half {{.}}, half
				// CHECK: store half {{.}}, half
				// CHECK: load half, half* {{.*}}
				// CHECK: load half, half* {{.*}}
				// CHECK: fmul half {{.}}, {{.}}
				// CHECK: ret half
				return a * b;
				}

clang/test/CodeGen/X86/avx512fp16-abi.c

This file was moved to clang/test/CodeGen/X86/fp16-abi.c.

clang/test/CodeGen/X86/avx512fp16-complex.c

This file was moved to clang/test/CodeGen/X86/fp16-complex.c.

clang/test/CodeGen/X86/fp16-abi.c

This file was moved from clang/test/CodeGen/X86/avx512fp16-abi.c.

	// RUN: %clang_cc1 -triple x86_64-linux -emit-llvm -target-feature +avx512fp16 < %s \| FileCheck %s --check-prefixes=CHECK,CHECK-C			// RUN: %clang_cc1 -triple x86_64-linux -emit-llvm -target-feature +avx512fp16 < %s \| FileCheck %s --check-prefixes=CHECK,CHECK-C
				// RUN: %clang_cc1 -triple x86_64-linux -emit-llvm < %s \| FileCheck %s --check-prefixes=CHECK,CHECK-C
				pengfeiAuthorUnsubmitted Done Reply Inline Actions Why don't move it to fp16-abi.c directly? pengfei: Why don't move it to fp16-abi.c directly?
	// RUN: %clang_cc1 -triple x86_64-linux -emit-llvm -target-feature +avx512fp16 -x c++ -std=c++11 < %s \| FileCheck %s --check-prefixes=CHECK,CHECK-CPP			// RUN: %clang_cc1 -triple x86_64-linux -emit-llvm -target-feature +avx512fp16 -x c++ -std=c++11 < %s \| FileCheck %s --check-prefixes=CHECK,CHECK-CPP
				// RUN: %clang_cc1 -triple x86_64-linux -emit-llvm -x c++ -std=c++11 < %s \| FileCheck %s --check-prefixes=CHECK,CHECK-CPP

	struct half1 {			struct half1 {
	_Float16 a;			_Float16 a;
	};			};

	struct half1 h1(_Float16 a) {			struct half1 h1(_Float16 a) {
	// CHECK: define{{.*}}half @			// CHECK: define{{.*}}half @
	struct half1 x;			struct half1 x;
	▲ Show 20 Lines • Show All 230 Lines • Show Last 20 Lines

clang/test/CodeGen/X86/fp16-complex.c

This file was moved from clang/test/CodeGen/X86/avx512fp16-complex.c.

	// RUN: %clang_cc1 %s -O0 -emit-llvm -triple x86_64-unknown-unknown -target-feature +avx512fp16 -o - \| FileCheck %s --check-prefix=X86			// RUN: %clang_cc1 %s -O0 -emit-llvm -triple x86_64-unknown-unknown -target-feature +avx512fp16 -o - \| FileCheck %s --check-prefix=X86
				// RUN: %clang_cc1 %s -O0 -emit-llvm -triple x86_64-unknown-unknown -o - \| FileCheck %s --check-prefix=X86
				pengfeiAuthorUnsubmitted Done Reply Inline Actions Change the file name? pengfei: Change the file name?

	_Float16 _Complex add_half_rr(_Float16 a, _Float16 b) {			_Float16 _Complex add_half_rr(_Float16 a, _Float16 b) {
	// X86-LABEL: @add_half_rr(			// X86-LABEL: @add_half_rr(
	// X86: fadd			// X86: fadd
	// X86-NOT: fadd			// X86-NOT: fadd
	// X86: ret			// X86: ret
	return a + b;			return a + b;
	}			}
	▲ Show 20 Lines • Show All 124 Lines • Show Last 20 Lines

clang/test/Sema/Float16.c

	// RUN: %clang_cc1 -fsyntax-only -verify -triple x86_64-linux-pc %s			// RUN: %clang_cc1 -fsyntax-only -verify -triple x86_64-linux-pc %s -DHAVE
	// RUN: %clang_cc1 -fsyntax-only -verify -triple x86_64-linux-pc -target-feature +avx512fp16 %s -DHAVE			// RUN: %clang_cc1 -fsyntax-only -verify -triple x86_64-linux-pc -target-feature +avx512fp16 %s -DHAVE
	// RUN: %clang_cc1 -fsyntax-only -verify -triple spir-unknown-unknown %s -DHAVE			// RUN: %clang_cc1 -fsyntax-only -verify -triple spir-unknown-unknown %s -DHAVE
	// RUN: %clang_cc1 -fsyntax-only -verify -triple armv7a-linux-gnu %s -DHAVE			// RUN: %clang_cc1 -fsyntax-only -verify -triple armv7a-linux-gnu %s -DHAVE
	// RUN: %clang_cc1 -fsyntax-only -verify -triple aarch64-linux-gnu %s -DHAVE			// RUN: %clang_cc1 -fsyntax-only -verify -triple aarch64-linux-gnu %s -DHAVE
				// RUN: %clang_cc1 -fsyntax-only -verify -triple i386-pc-linux-gnu %s

	#ifndef HAVE			#ifndef HAVE
	// expected-error@+2{{_Float16 is not supported on this target}}			// expected-error@+2{{_Float16 is not supported on this target}}
	#endif // HAVE			#endif // HAVE
	_Float16 f;			_Float16 f;

	#ifdef HAVE			#ifdef HAVE
	_Complex _Float16 a;			_Complex _Float16 a;
	void builtin_complex() {			void builtin_complex() {
	_Float16 a = 0;			_Float16 a = 0;
	(void)__builtin_complex(a, a); // expected-error {{'_Complex _Float16' is invalid}}			(void)__builtin_complex(a, a); // expected-error {{'_Complex _Float16' is invalid}}
	}			}
	#endif			#endif

clang/test/Sema/conversion-target-dep.c

	// RUN: %clang_cc1 -Wdouble-promotion -Wimplicit-float-conversion %s -triple x86_64-apple-macosx10.12 -verify=x86,expected			// RUN: %clang_cc1 -Wdouble-promotion -Wimplicit-float-conversion %s -triple x86_64-apple-macosx10.12 -verify=x86,expected
	// RUN: %clang_cc1 -Wdouble-promotion -Wimplicit-float-conversion %s -triple armv7-apple-ios9.0 -verify=arm,expected			// RUN: %clang_cc1 -Wdouble-promotion -Wimplicit-float-conversion %s -triple armv7-apple-ios9.0 -verify=arm,expected

	// On ARM, long double and double both map to double precision 754s, so there			// On ARM, long double and double both map to double precision 754s, so there
	// isn't any reason to warn on conversions back and forth.			// isn't any reason to warn on conversions back and forth.

	long double ld;			long double ld;
	double d;			double d;
	_Float16 f16; // x86-error {{_Float16 is not supported on this target}}			_Float16 f16;

	int main() {			int main() {
	ld = d; // x86-warning {{implicit conversion increases floating-point precision: 'double' to 'long double'}}			ld = d; // x86-warning {{implicit conversion increases floating-point precision: 'double' to 'long double'}}
	d = ld; // x86-warning {{implicit conversion loses floating-point precision: 'long double' to 'double'}}			d = ld; // x86-warning {{implicit conversion loses floating-point precision: 'long double' to 'double'}}

	ld += d; // x86-warning {{implicit conversion increases floating-point precision: 'double' to 'long double'}}			ld += d; // x86-warning {{implicit conversion increases floating-point precision: 'double' to 'long double'}}
	d += ld; // x86-warning {{implicit conversion when assigning computation result loses floating-point precision: 'long double' to 'double'}}			d += ld; // x86-warning {{implicit conversion when assigning computation result loses floating-point precision: 'long double' to 'double'}}

	f16 = ld; // expected-warning {{implicit conversion loses floating-point precision: 'long double' to '_Float16'}}			f16 = ld; // expected-warning {{implicit conversion loses floating-point precision: 'long double' to '_Float16'}}
	ld = f16; // expected-warning {{implicit conversion increases floating-point precision: '_Float16' to 'long double'}}			ld = f16; // expected-warning {{implicit conversion increases floating-point precision: '_Float16' to 'long double'}}

	f16 += ld; // expected-warning {{implicit conversion when assigning computation result loses floating-point precision: 'long double' to '_Float16'}}			f16 += ld; // expected-warning {{implicit conversion when assigning computation result loses floating-point precision: 'long double' to '_Float16'}}
	ld += f16; // expected-warning {{implicit conversion increases floating-point precision: '_Float16' to 'long double'}}			ld += f16; // expected-warning {{implicit conversion increases floating-point precision: '_Float16' to 'long double'}}
	}			}

clang/test/SemaCXX/Float16.cpp

	// RUN: %clang_cc1 -fsyntax-only -verify -triple x86_64-linux-pc %s			// RUN: %clang_cc1 -fsyntax-only -verify -triple x86_64-linux-pc %s -DHAVE
	// RUN: %clang_cc1 -fsyntax-only -verify -triple spir-unknown-unknown %s -DHAVE			// RUN: %clang_cc1 -fsyntax-only -verify -triple spir-unknown-unknown %s -DHAVE
	// RUN: %clang_cc1 -fsyntax-only -verify -triple armv7a-linux-gnu %s -DHAVE			// RUN: %clang_cc1 -fsyntax-only -verify -triple armv7a-linux-gnu %s -DHAVE
	// RUN: %clang_cc1 -fsyntax-only -verify -triple aarch64-linux-gnu %s -DHAVE			// RUN: %clang_cc1 -fsyntax-only -verify -triple aarch64-linux-gnu %s -DHAVE
				// RUN: %clang_cc1 -fsyntax-only -verify -triple i386-pc-linux-gnu %s

				rjmccallUnsubmitted Done Reply Inline Actions This test (and Float16.c) should test at least one target that doesn't have `_Float16` support, so please just add `-DHAVE` to the x86_64 line and add, I dunno, a generic i386 or SPARC line. rjmccall: This test (and Float16.c) should test at least one target that doesn't have `_Float16` support…
	#ifdef HAVE			#ifdef HAVE
	// expected-no-diagnostics			// expected-no-diagnostics
	#endif // HAVE			#endif // HAVE

	#ifndef HAVE			#ifndef HAVE
	// expected-error@+2{{_Float16 is not supported on this target}}			// expected-error@+2{{_Float16 is not supported on this target}}
	#endif // !HAVE			#endif // !HAVE
	_Float16 f;			_Float16 f;

	#ifndef HAVE			#ifndef HAVE
	// expected-error@+2{{invalid suffix 'F16' on floating constant}}			// expected-error@+2{{invalid suffix 'F16' on floating constant}}
	#endif // !HAVE			#endif // !HAVE
	const auto g = 1.1F16;			const auto g = 1.1F16;

This is an archive of the discontinued LLVM Phabricator instance.

Enable `_Float16` type support on X86 without the avx512fp16 flagAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 388273

clang/docs/LanguageExtensions.rst

clang/docs/ReleaseNotes.rst

clang/lib/Basic/Targets/X86.cpp

clang/test/CodeGen/X86/Float16-arithmetic.c

clang/test/CodeGen/X86/avx512fp16-abi.c

clang/test/CodeGen/X86/avx512fp16-complex.c

clang/test/CodeGen/X86/fp16-abi.c

clang/test/CodeGen/X86/fp16-complex.c

clang/test/Sema/Float16.c

clang/test/Sema/conversion-target-dep.c

clang/test/SemaCXX/Float16.cpp

Enable `_Float16` type support on X86 without the avx512fp16 flag
AbandonedPublic