This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
compiler-rt/
-
cmake/
-
builtin-config-ix.cmake
-
lib/builtins/
-
builtins/
-
CMakeLists.txt
-
extendhfsf2.c
-
extendhftf2.c
-
fp_extend.h
-
fp_trunc.h
-
truncdfhf2.c
-
truncsfhf2.c
-
trunctfhf2.c
-
test/builtins/
-
builtins/
-
CMakeLists.txt
-
Unit/
-
extendhfsf2_test.c
-
extendhftf2_test.c
-
fp_test.h
-
truncdfhf2_test.c
-
truncsfhf2_test.c
-
trunctfhf2_test.c
-
llvm/
-
include/llvm/IR/
-
llvm/
-
IR/
-
RuntimeLibcalls.def
-
lib/
-
CodeGen/
-
TargetLoweringBase.cpp
-
Target/AArch64/
-
AArch64/
-
AArch64ISelLowering.cpp
-
test/CodeGen/AArch64/
-
CodeGen/
-
AArch64/
1
arm64-fp128.ll

Differential D90175

[AArch64] Support conversion between fp16 and fp128
AbandonedPublic

Authored by zatrazz on Oct 26 2020, 11:00 AM.

Download Raw Diff

Details

Reviewers

atrosinenko
MaskRay
psmith
efriedma

Summary

This is an updated version of https://reviews.llvm.org/D86453

This patch adds both extendhftf2 and trunctfhf2 to support
conversion between half-precision and quad-precision floating-point
values. They are enabled iff the compiler supports _Float16.

It also adjust the extendhfsf2, truncdfhf2 __truncsfhf2 to use
_Float16 when compiler supports it. On AArch64 it allows use the
native FP16 ABI, while on other architectures the expected current
semantic is preserved (arm for instance).

Diff Detail

Unit TestsFailed

	Time	Test
	460 ms	linux > HWAddressSanitizer-x86_64.TestCases::sizes.cpp
	300 ms	windows > lld.ELF/invalid::symtab-sh-info.s

Event Timeline

zatrazz created this revision.Oct 26 2020, 11:00 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 26 2020, 11:00 AM

Herald added subscribers: Restricted Project, danielkiss, hiraditya and 2 others. · View Herald Transcript

zatrazz requested review of this revision.Oct 26 2020, 11:00 AM

Harbormaster completed remote builds in B76441: Diff 300739.Oct 26 2020, 11:44 AM

bryanpkc added a subscriber: bryanpkc.Oct 27 2020, 12:23 PM

Ping.

Herald added a subscriber: dexonsmith. · View Herald TranscriptNov 2 2020, 9:47 AM

Ping.

Ping (x2).

Apologies for my slow response.

Patches need to have a large context (see https://llvm.org/docs/Phabricator.html#requesting-a-review-via-the-web-interface I prefer arc diff)
The code generation part and the compiler-rt/lib/builtins part are logically independent and thus splitting the patch makes sense. The compiler-rt parts reasonable.

Adding folks with better AArch64 knowledge...

llvm/test/CodeGen/AArch64/arm64-fp128.ll
234	Add `-NEXT` whenever appropriate

MaskRay edited reviewers, added: psmith, efriedma; removed: howard.hinnant.Nov 17 2020, 10:27 PM

They are enabled iff the compiler supports _Float16.

Why are these not enabled for compilers which don't support _Float16? There is also the __fp16 type, which has been supported for much longer and could also generate code like this.

It also adjust the extendhfsf2, truncdfhf2 __truncsfhf2 to use _Float16 when compiler supports it.

This would be better (easier to review, bisect, etc) if split into a separate patch.

On AArch64 it allows use the native FP16 ABI,.

Why is changing the ABI of these functions (including the existing 16<->32 and 16<->64 bit ones) OK? Wouldn't this cause problems if linking clang-compiled code against libgcc? I also don't see any changes to tell the compiler that the ABI has changed.

while on other architectures the expected current semantic is preserved (arm for instance).

Are you sure about this? ARM has a hard-float ABI which passes fp16 values in FP registers, and I'd expect some other architectures to have different calling conventions for uint16_t vs _Float16.

In D90175#2401747, @MaskRay wrote:

Apologies for my slow response.

Patches need to have a large context (see https://llvm.org/docs/Phabricator.html#requesting-a-review-via-the-web-interface I prefer arc diff)

I keep forgetting this.

The code generation part and the compiler-rt/lib/builtins part are logically independent and thus splitting the patch makes sense. The compiler-rt parts reasonable.

Ok, I will split it.

Adding folks with better AArch64 knowledge...

Thanks.

In D90175#2402021, @ostannard wrote:

They are enabled iff the compiler supports _Float16.

Why are these not enabled for compilers which don't support _Float16? There is also the __fp16 type, which has been supported for much longer and could also generate code like this.

Because fp16 is an ARM C extension and tying the builtin generation to ISO extended floating-point type allows to more generic code (once the backend supports the type the builtin will be built instead of tying to the target ABI, such as extendhfsf2.c with builds only for ARM_EABI__).

It also adjust the extendhfsf2, truncdfhf2 __truncsfhf2 to use _Float16 when compiler supports it.

This would be better (easier to review, bisect, etc) if split into a separate patch.

Right, this could be a subsequent path.

On AArch64 it allows use the native FP16 ABI,.

Why is changing the ABI of these functions (including the existing 16<->32 and 16<->64 bit ones) OK? Wouldn't this cause problems if linking clang-compiled code against libgcc? I also don't see any changes to tell the compiler that the ABI has changed.

While fp16 is supported on all architectures, _Float16 is supported only for 32-bit ARM, 64-bit ARM, and SPIR (as indicated by clang/docs/LanguageExtensions.rst). Also, fp16 is a storage format and promote to 'float' for argument passing and 64-bit ARM supports floating-point convert precision to half as base armv8.1-a instruction. It means that although extendhfsf2, truncdfhf2 __truncsfhf2 will be built for 64-bit ARM, they will be never used in practice (compiler won't emit libcall to them). And the patch does not change the ABI for 32-bit ARM, it will continue to pass _Float16 as uint16.

while on other architectures the expected current semantic is preserved (arm for instance).

Are you sure about this? ARM has a hard-float ABI which passes fp16 values in FP registers, and I'd expect some other architectures to have different calling conventions for uint16_t vs _Float16.

Even for the case where either Armv8.2-A 16-bit floating point extensions are available or for 32-bit ARM if neon-fp16 fpu is selected, the libcalls won't be generated (both have support instruction support for float conversion).

In fact, compiler-rt has an issue where gnu_d2h, gnu_f2h, gnu_h2d, gnu_h2f (same for __extendhfsf2, and related functions) rely on the target floating point ABI (it uses 'float' type, where libgcc uses the expected int types). As a noted in a previous patch review, I think these routines should be moved to 32-arm specific folder with the expected sematic (regardless of the float ABI).

In D90175#2402021, @ostannard wrote:

They are enabled iff the compiler supports _Float16.

Why are these not enabled for compilers which don't support _Float16? There is also the __fp16 type, which has been supported for much longer and could also generate code like this.

Because fp16 is only supported for argument passing and function return only on specific 32-bit ARM FPU modes, otherwise it is a only a storage format. Even ACLE has deprecated it over _Float16, so I think the generic implementation for compiler-rt should focus on _Float16 support and make fp16 support (if required, since for architecture where it has argument/function return support such ARM there is no need to fp convertions libcalls).

To summarize, I will split the patch in:

Add extendhftf2 and trunctfhf2
extendhfsf2, truncdfhf2 __truncsfhf2 to use _Float16 when compiler supports it and maybe disable on AArch64 (since there is no need to provide them)
Maybe fix the arm gnu_d2h, gnu_f2h, gnu_h2d, gnu_h2f

[1] https://gcc.gnu.org/legacy-ml/gcc-patches/2016-04/msg01766.html

zatrazz abandoned this revision.Nov 18 2020, 11:36 AM

Revision Contents

Path

Size

compiler-rt/

cmake/

builtin-config-ix.cmake

7 lines

lib/

builtins/

4 lines

6 lines

23 lines

4 lines

4 lines

4 lines

6 lines

21 lines

test/

builtins/

CMakeLists.txt

2 lines

Unit/

98 lines

95 lines

30 lines

8 lines

6 lines

125 lines

llvm/

include/

llvm/

IR/

RuntimeLibcalls.def

1 line

lib/

CodeGen/

TargetLoweringBase.cpp

2 lines

Target/

AArch64/

AArch64ISelLowering.cpp

2 lines

test/

CodeGen/

AArch64/

arm64-fp128.ll

14 lines

Diff 300739

compiler-rt/cmake/builtin-config-ix.cmake

Context not available.
	}	}
	")	")

		builtin_check_c_compiler_source(COMPILER_RT_HAS_FLOAT16
		"
		_Float16 foo(_Float16 x) {
		return x;
		}
		"
		)

	set(ARM64 aarch64)	set(ARM64 aarch64)
	set(ARM32 arm armhf armv6m armv7m armv7em armv7 armv7s armv7k)	set(ARM32 arm armhf armv6m armv7m armv7em armv7 armv7s armv7k)
Context not available.

compiler-rt/lib/builtins/CMakeLists.txt

Context not available.
	divtc3.c	divtc3.c
	divtf3.c	divtf3.c
	extenddftf2.c	extenddftf2.c
		extendhftf2.c
	extendsftf2.c	extendsftf2.c
	fixtfdi.c	fixtfdi.c
	fixtfsi.c	fixtfsi.c
Context not available.
	powitf2.c	powitf2.c
	subtf3.c	subtf3.c
	trunctfdf2.c	trunctfdf2.c
		trunctfhf2.c
	trunctfsf2.c	trunctfsf2.c
	)	)

Context not available.
	else ()	else ()
	set(BUILTIN_CFLAGS "")	set(BUILTIN_CFLAGS "")

		append_list_if(COMPILER_RT_HAS_FLOAT16 -DCOMPILER_RT_HAS_FLOAT16 BUILTIN_CFLAGS)

	append_list_if(COMPILER_RT_HAS_STD_C11_FLAG -std=c11 BUILTIN_CFLAGS)	append_list_if(COMPILER_RT_HAS_STD_C11_FLAG -std=c11 BUILTIN_CFLAGS)

	# These flags would normally be added to CMAKE_C_FLAGS by the llvm	# These flags would normally be added to CMAKE_C_FLAGS by the llvm
Context not available.

compiler-rt/lib/builtins/extendhfsf2.c

Context not available.

	// Use a forwarding definition and noinline to implement a poor man's alias,	// Use a forwarding definition and noinline to implement a poor man's alias,
	// as there isn't a good cross-platform way of defining one.	// as there isn't a good cross-platform way of defining one.
	COMPILER_RT_ABI NOINLINE float __extendhfsf2(uint16_t a) {	COMPILER_RT_ABI NOINLINE float __extendhfsf2(src_t a) {
	return __extendXfYf2__(a);	return __extendXfYf2__(a);
	}	}

	COMPILER_RT_ABI float __gnu_h2f_ieee(uint16_t a) { return __extendhfsf2(a); }	COMPILER_RT_ABI float __gnu_h2f_ieee(src_t a) { return __extendhfsf2(a); }

	#if defined(__ARM_EABI__)	#if defined(__ARM_EABI__)
	#if defined(COMPILER_RT_ARMHF_TARGET)	#if defined(COMPILER_RT_ARMHF_TARGET)
	AEABI_RTABI float __aeabi_h2f(uint16_t a) { return __extendhfsf2(a); }	AEABI_RTABI float __aeabi_h2f(src_t a) { return __extendhfsf2(a); }
	#else	#else
	COMPILER_RT_ALIAS(__extendhfsf2, __aeabi_h2f)	COMPILER_RT_ALIAS(__extendhfsf2, __aeabi_h2f)
	#endif	#endif
Context not available.

compiler-rt/lib/builtins/extendhftf2.c

This file was added.

				//===-- lib/extendhftf2.c - half -> quad conversion ---------------- C --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is dual licensed under the MIT and the University of Illinois Open
				// Source Licenses. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#define QUAD_PRECISION
				#include "fp_lib.h"

				#if defined(CRT_HAS_128BIT) && defined(CRT_LDBL_128BIT) && \
				defined(COMPILER_RT_HAS_FLOAT16)
				#define SRC_HALF
				#define DST_QUAD
				#include "fp_extend_impl.inc"

				COMPILER_RT_ABI long double __extendhftf2(src_t a) {
				return __extendXfYf2__(a);
				}

				#endif

compiler-rt/lib/builtins/fp_extend.h

Context not available.
	}	}

	#elif defined SRC_HALF	#elif defined SRC_HALF
		#ifdef COMPILER_RT_HAS_FLOAT16
		typedef _Float16 src_t;
		#else
	typedef uint16_t src_t;	typedef uint16_t src_t;
		#endif
	typedef uint16_t src_rep_t;	typedef uint16_t src_rep_t;
	#define SRC_REP_C UINT16_C	#define SRC_REP_C UINT16_C
	static const int srcSigBits = 10;	static const int srcSigBits = 10;
Context not available.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: Source should be half, single, or double precision! [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: Source should be half, single, or double precision! [clang-diagnostic-error]…
		Lint: Pre-merge checks Inline Actions clang-tidy: error: Destination should be single, double, or quad precision! [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: Destination should be single, double, or quad precision! [clang-diagnostic…
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'src_rep_t' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'src_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'src_rep_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'src_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'src_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'src_rep_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'src_rep_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'dst_t' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'dst_rep_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'dst_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'dst_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'dst_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'dst_rep_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'dst_rep_t' [clang-diagnostic-error] [[https://github.

compiler-rt/lib/builtins/fp_trunc.h

Context not available.
	static const int dstSigBits = 23;	static const int dstSigBits = 23;

	#elif defined DST_HALF	#elif defined DST_HALF
		#ifdef COMPILER_RT_HAS_FLOAT16
		typedef _Float16 dst_t;
		#else
	typedef uint16_t dst_t;	typedef uint16_t dst_t;
		#endif
	typedef uint16_t dst_rep_t;	typedef uint16_t dst_rep_t;
	#define DST_REP_C UINT16_C	#define DST_REP_C UINT16_C
	static const int dstSigBits = 10;	static const int dstSigBits = 10;
Context not available.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: Destination should be single precision or double precision! [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: Destination should be single precision or double precision! [clang…
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'src_rep_t' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'src_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'src_rep_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'src_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'src_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'src_rep_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'src_rep_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'dst_t' [clang-diagnostic-error] not useful clang-tidy: error: unknown type name 'dst_rep_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'dst_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'dst_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'dst_t' [clang-diagnostic-error] [[https://github.
		Lint: Pre-merge checks Inline Actions clang-tidy: error: unknown type name 'dst_rep_t' [clang-diagnostic-error] not useful Lint: Pre-merge checks: clang-tidy: error: unknown type name 'dst_rep_t' [clang-diagnostic-error] [[https://github.

compiler-rt/lib/builtins/truncdfhf2.c

Context not available.
	#define DST_HALF	#define DST_HALF
	#include "fp_trunc_impl.inc"	#include "fp_trunc_impl.inc"

	COMPILER_RT_ABI uint16_t __truncdfhf2(double a) { return __truncXfYf2__(a); }	COMPILER_RT_ABI dst_t __truncdfhf2(double a) { return __truncXfYf2__(a); }

	#if defined(__ARM_EABI__)	#if defined(__ARM_EABI__)
	#if defined(COMPILER_RT_ARMHF_TARGET)	#if defined(COMPILER_RT_ARMHF_TARGET)
	AEABI_RTABI uint16_t __aeabi_d2h(double a) { return __truncdfhf2(a); }	AEABI_RTABI dst_t __aeabi_d2h(double a) { return __truncdfhf2(a); }
	#else	#else
	COMPILER_RT_ALIAS(__truncdfhf2, __aeabi_d2h)	COMPILER_RT_ALIAS(__truncdfhf2, __aeabi_d2h)
	#endif	#endif
Context not available.

compiler-rt/lib/builtins/truncsfhf2.c

Context not available.

	// Use a forwarding definition and noinline to implement a poor man's alias,	// Use a forwarding definition and noinline to implement a poor man's alias,
	// as there isn't a good cross-platform way of defining one.	// as there isn't a good cross-platform way of defining one.
	COMPILER_RT_ABI NOINLINE uint16_t __truncsfhf2(float a) {	COMPILER_RT_ABI NOINLINE dst_t __truncsfhf2(float a) {
	return __truncXfYf2__(a);	return __truncXfYf2__(a);
	}	}

	COMPILER_RT_ABI uint16_t __gnu_f2h_ieee(float a) { return __truncsfhf2(a); }	COMPILER_RT_ABI dst_t __gnu_f2h_ieee(float a) { return __truncsfhf2(a); }

	#if defined(__ARM_EABI__)	#if defined(__ARM_EABI__)
	#if defined(COMPILER_RT_ARMHF_TARGET)	#if defined(COMPILER_RT_ARMHF_TARGET)
	AEABI_RTABI uint16_t __aeabi_f2h(float a) { return __truncsfhf2(a); }	AEABI_RTABI dst_t __aeabi_f2h(float a) { return __truncsfhf2(a); }
	#else	#else
	COMPILER_RT_ALIAS(__truncsfhf2, __aeabi_f2h)	COMPILER_RT_ALIAS(__truncsfhf2, __aeabi_f2h)
	#endif	#endif
Context not available.

compiler-rt/lib/builtins/trunctfhf2.c

This file was added.

				//===-- lib/trunctfhf2.c - quad -> half conversion ----------------- C --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is dual licensed under the MIT and the University of Illinois Open
				// Source Licenses. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#define QUAD_PRECISION
				#include "fp_lib.h"

				#if defined(CRT_HAS_128BIT) && defined(CRT_LDBL_128BIT) && \
				defined(COMPILER_RT_HAS_FLOAT16)
				#define SRC_QUAD
				#define DST_HALF
				#include "fp_trunc_impl.inc"

				COMPILER_RT_ABI dst_t __trunctfhf2(long double a) { return __truncXfYf2__(a); }

				#endif

compiler-rt/test/builtins/CMakeLists.txt

Context not available.
	string(REPLACE ";" " " BUILTINS_TEST_TARGET_CFLAGS "${BUILTINS_TEST_TARGET_CFLAGS}")	string(REPLACE ";" " " BUILTINS_TEST_TARGET_CFLAGS "${BUILTINS_TEST_TARGET_CFLAGS}")
	endif()	endif()

		append_list_if(COMPILER_RT_HAS_FLOAT16 -DCOMPILER_RT_HAS_FLOAT16 BUILTINS_TEST_TARGET_CFLAGS)

	# Compute builtins available in library and add them as lit features.	# Compute builtins available in library and add them as lit features.
	if(APPLE)	if(APPLE)
	# TODO: Support other Apple platforms.	# TODO: Support other Apple platforms.
Context not available.

compiler-rt/test/builtins/Unit/extendhfsf2_test.c

Context not available.

	#include "fp_test.h"	#include "fp_test.h"

	float __extendhfsf2(uint16_t a);	float __extendhfsf2(TYPE_FP16 a);

	int test__extendhfsf2(uint16_t a, float expected)	int test__extendhfsf2(TYPE_FP16 a, uint32_t expected)
	{	{
	float x = __extendhfsf2(a);	float x = __extendhfsf2(a);
	int ret = compareResultH(x, expected);	int ret = compareResultF(x, expected);

	if (ret){	if (ret){
	printf("error in test__extendhfsf2(%#.4x) = %f, "	printf("error in test__extendhfsf2(%#.4x) = %f, "
	"expected %f\n", a, x, expected);	"expected %f\n", toRep16(a), x, fromRep32(expected));
	}	}
	return ret;	return ret;
	}	}

	char assumption_1[sizeof(__fp16) * CHAR_BIT == 16] = {0};	char assumption_1[sizeof(TYPE_FP16) * CHAR_BIT == 16] = {0};

	int main()	int main()
	{	{
	// qNaN	// qNaN
	if (test__extendhfsf2(UINT16_C(0x7e00),	if (test__extendhfsf2(fromRep16(0x7e00),
	makeQNaN32()))	UINT32_C(0x7fc00000)))
	return 1;	return 1;
	// NaN	// NaN
	if (test__extendhfsf2(UINT16_C(0x7e00),	if (test__extendhfsf2(fromRep16(0x7f80),
	makeNaN32(UINT32_C(0x8000))))	UINT32_C(0x7ff00000)))
	return 1;	return 1;
	// inf	// inf
	if (test__extendhfsf2(UINT16_C(0x7c00),	if (test__extendhfsf2(fromRep16(0x7c00),
	makeInf32()))	UINT32_C(0x7f800000)))
	return 1;	return 1;
	if (test__extendhfsf2(UINT16_C(0xfc00),	// -inf
	-makeInf32()))	if (test__extendhfsf2(fromRep16(0xfc00),
		UINT32_C(0xff800000)))
	return 1;	return 1;
	// zero	// zero
	if (test__extendhfsf2(UINT16_C(0x0),	if (test__extendhfsf2(fromRep16(0x0),
	0.0f))	UINT32_C(0x00000000)))
	return 1;	return 1;
	if (test__extendhfsf2(UINT16_C(0x8000),	// -zero
	-0.0f))	if (test__extendhfsf2(fromRep16(0x8000),
		UINT32_C(0x80000000)))
	return 1;	return 1;
		if (test__extendhfsf2(fromRep16(0x4248),
	if (test__extendhfsf2(UINT16_C(0x4248),	UINT32_C(0x40490000)))
	3.1415926535f))
	return 1;
	if (test__extendhfsf2(UINT16_C(0xc248),
	-3.1415926535f))
	return 1;	return 1;
	if (test__extendhfsf2(UINT16_C(0x7c00),	if (test__extendhfsf2(fromRep16(0xc248),
	0x1.987124876876324p+100f))	UINT32_C(0xc0490000)))
	return 1;	return 1;
	if (test__extendhfsf2(UINT16_C(0x6e62),	if (test__extendhfsf2(fromRep16(0x6e62),
	0x1.988p+12f))	UINT32_C(0x45cc4000)))
	return 1;	return 1;
	if (test__extendhfsf2(UINT16_C(0x3c00),	if (test__extendhfsf2(fromRep16(0x3c00),
	0x1.0p+0f))	UINT32_C(0x3f800000)))
	return 1;	return 1;
	if (test__extendhfsf2(UINT16_C(0x0400),	if (test__extendhfsf2(fromRep16(0x0400),
	0x1.0p-14f))	UINT32_C(0x38800000)))
	return 1;	return 1;
	// denormal	// denormal
	if (test__extendhfsf2(UINT16_C(0x0010),	if (test__extendhfsf2(fromRep16(0x0010),
	0x1.0p-20f))	UINT32_C(0x35800000)))
	return 1;
	if (test__extendhfsf2(UINT16_C(0x0001),
	0x1.0p-24f))
	return 1;	return 1;
	if (test__extendhfsf2(UINT16_C(0x8001),	if (test__extendhfsf2(fromRep16(0x0001),
	-0x1.0p-24f))	UINT32_C(0x33800000)))
	return 1;	return 1;
	if (test__extendhfsf2(UINT16_C(0x0001),	if (test__extendhfsf2(fromRep16(0x8001),
	0x1.5p-25f))	UINT32_C(0xb3800000)))
	return 1;	return 1;
	// and back to zero	if (test__extendhfsf2(fromRep16(0x0001),
	if (test__extendhfsf2(UINT16_C(0x0000),	UINT32_C(0x33800000)))
	0x1.0p-25f))
	return 1;
	if (test__extendhfsf2(UINT16_C(0x8000),
	-0x1.0p-25f))
	return 1;	return 1;
	// max (precise)	// max (precise)
	if (test__extendhfsf2(UINT16_C(0x7bff),	if (test__extendhfsf2(fromRep16(0x7bff),
	65504.0f))	UINT32_C(0x477fe000)))
	return 1;	return 1;
	// max (rounded)	// max (rounded)
	if (test__extendhfsf2(UINT16_C(0x7bff),	if (test__extendhfsf2(fromRep16(0x7bff),
	65504.0f))	UINT32_C(0x477fe000)))
	return 1;
	// max (to +inf)
	if (test__extendhfsf2(UINT16_C(0x7c00),
	makeInf32()))
	return 1;
	if (test__extendhfsf2(UINT16_C(0xfc00),
	-makeInf32()))
	return 1;	return 1;
	return 0;	return 0;
	}	}
Context not available.

compiler-rt/test/builtins/Unit/extendhftf2_test.c

This file was added.

				// RUN: %clang_builtins %s %librt -o %t && %run %t
				// REQUIRES: librt_has_extendhftf2

				#include "int_lib.h"
				#include <stdio.h>

				#if __LDBL_MANT_DIG__ == 113 && defined(COMPILER_RT_HAS_FLOAT16)

				#include "fp_test.h"

				COMPILER_RT_ABI long double __extendhftf2(TYPE_FP16 a);

				int test__extendhftf2(TYPE_FP16 a, uint64_t expectedHi, uint64_t expectedLo) {
				long double x = __extendhftf2(a);
				int ret = compareResultLD(x, expectedHi, expectedLo);

				if (ret) {
				printf("error in test__extendhftf2(%#.4x) = %.20Lf, "
				"expected %.20Lf\n",
				toRep16(a), x,
				fromRep128(expectedHi, expectedLo));
				}
				return ret;
				}

				char assumption_1[sizeof(TYPE_FP16) * CHAR_BIT == 16] = {0};

				#endif

				int main() {
				#if __LDBL_MANT_DIG__ == 113 && defined(COMPILER_RT_HAS_FLOAT16)
				// qNaN
				if (test__extendhftf2(makeQNaN16(),
				UINT64_C(0x7fff800000000000),
				UINT64_C(0x0)))
				return 1;
				// NaN
				if (test__extendhftf2(makeNaN16(UINT16_C(0x0100)),
				UINT64_C(0x7fff400000000000),
				UINT64_C(0x0)))
				return 1;
				// inf
				if (test__extendhftf2(makeInf16(),
				UINT64_C(0x7fff000000000000),
				UINT64_C(0x0)))
				return 1;
				if (test__extendhftf2(-makeInf16(),
				UINT64_C(0xffff000000000000),
				UINT64_C(0x0)))
				return 1;
				// zero
				if (test__extendhftf2(fromRep16(0x0U),
				UINT64_C(0x0), UINT64_C(0x0)))
				return 1;
				if (test__extendhftf2(fromRep16(0x8000U),
				UINT64_C(0x8000000000000000),
				UINT64_C(0x0)))
				return 1;
				// denormal
				if (test__extendhftf2(fromRep16(0x0010U),
				UINT64_C(0x3feb000000000000),
				UINT64_C(0x0000000000000000)))
				return 1;
				if (test__extendhftf2(fromRep16(0x0001U),
				UINT64_C(0x3fe7000000000000),
				UINT64_C(0x0000000000000000)))
				return 1;
				if (test__extendhftf2(fromRep16(0x8001U),
				UINT64_C(0xbfe7000000000000),
				UINT64_C(0x0000000000000000)))
				return 1;

				// pi
				if (test__extendhftf2(fromRep16(0x4248U),
				UINT64_C(0x4000920000000000),
				UINT64_C(0x0000000000000000)))
				return 1;
				if (test__extendhftf2(fromRep16(0xc248U),
				UINT64_C(0xc000920000000000),
				UINT64_C(0x0000000000000000)))
				return 1;

				if (test__extendhftf2(fromRep16(0x508cU),
				UINT64_C(0x4004230000000000),
				UINT64_C(0x0)))
				return 1;
				if (test__extendhftf2(fromRep16(0x1bb7U),
				UINT64_C(0x3ff6edc000000000),
				UINT64_C(0x0)))
				return 1;
				#else
				printf("skipped\n");
				#endif
				return 0;
				}

compiler-rt/test/builtins/Unit/fp_test.h

Context not available.
	#include <string.h>	#include <string.h>
	#include <stdint.h>	#include <stdint.h>

		#ifdef COMPILER_RT_HAS_FLOAT16
		#define TYPE_FP16 _Float16
		#else
		#define TYPE_FP16 uint16_t
		#endif

	enum EXPECTED_RESULT {	enum EXPECTED_RESULT {
	LESS_0, LESS_EQUAL_0, EQUAL_0, GREATER_0, GREATER_EQUAL_0, NEQUAL_0	LESS_0, LESS_EQUAL_0, EQUAL_0, GREATER_0, GREATER_EQUAL_0, NEQUAL_0
	};	};

	static inline uint16_t fromRep16(uint16_t x)	static inline TYPE_FP16 fromRep16(uint16_t x)
	{	{
		#ifdef COMPILER_RT_HAS_FLOAT16
		TYPE_FP16 ret;
		memcpy(&ret, &x, sizeof(ret));
		return ret;
		#else
	return x;	return x;
		#endif
	}	}

	static inline float fromRep32(uint32_t x)	static inline float fromRep32(uint32_t x)
Context not available.
	}	}
	#endif	#endif

	static inline uint16_t toRep16(uint16_t x)	static inline uint16_t toRep16(TYPE_FP16 x)
	{	{
		#ifdef COMPILER_RT_HAS_FLOAT16
		uint16_t ret;
		memcpy(&ret, &x, sizeof(ret));
		return ret;
		#else
	return x;	return x;
		#endif
	}	}

	static inline uint32_t toRep32(float x)	static inline uint32_t toRep32(float x)
Context not available.
	}	}
	#endif	#endif

	static inline int compareResultH(uint16_t result,	static inline int compareResultH(TYPE_FP16 result,
	uint16_t expected)	uint16_t expected)
	{	{
	uint16_t rep = toRep16(result);	uint16_t rep = toRep16(result);
Context not available.
	return "";	return "";
	}	}

	static inline uint16_t makeQNaN16(void)	static inline TYPE_FP16 makeQNaN16(void)
	{	{
	return fromRep16(0x7e00U);	return fromRep16(0x7e00U);
	}	}
Context not available.
	}	}
	#endif	#endif

	static inline uint16_t makeNaN16(uint16_t rand)	static inline TYPE_FP16 makeNaN16(uint16_t rand)
	{	{
	return fromRep16(0x7c00U \| (rand & 0x7fffU));	return fromRep16(0x7c00U \| (rand & 0x7fffU));
	}	}
Context not available.
	}	}
	#endif	#endif

	static inline uint16_t makeInf16(void)	static inline TYPE_FP16 makeInf16(void)
	{	{
	return fromRep16(0x7c00U);	return fromRep16(0x7c00U);
	}	}
Context not available.

compiler-rt/test/builtins/Unit/truncdfhf2_test.c

Context not available.

	#include "fp_test.h"	#include "fp_test.h"

	uint16_t __truncdfhf2(double a);	TYPE_FP16 __truncdfhf2(double a);

	int test__truncdfhf2(double a, uint16_t expected)	int test__truncdfhf2(double a, uint16_t expected)
	{	{
	uint16_t x = __truncdfhf2(a);	TYPE_FP16 x = __truncdfhf2(a);
	int ret = compareResultH(x, expected);	int ret = compareResultH(x, expected);

	if (ret){	if (ret){
	printf("error in test__truncdfhf2(%f) = %#.4x, "	printf("error in test__truncdfhf2(%lf) = %#.4x, "
	"expected %#.4x\n", a, x, fromRep16(expected));	"expected %#.4x\n", a, toRep16(x), expected);
	}	}
	return ret;	return ret;
	}	}
Context not available.

compiler-rt/test/builtins/Unit/truncsfhf2_test.c

Context not available.

	#include "fp_test.h"	#include "fp_test.h"

	uint16_t __truncsfhf2(float a);	TYPE_FP16 __truncsfhf2(float a);

	int test__truncsfhf2(float a, uint16_t expected)	int test__truncsfhf2(float a, uint16_t expected)
	{	{
	uint16_t x = __truncsfhf2(a);	TYPE_FP16 x = __truncsfhf2(a);
	int ret = compareResultH(x, expected);	int ret = compareResultH(x, expected);

	if (ret){	if (ret){
	printf("error in test__truncsfhf2(%f) = %#.4x, "	printf("error in test__truncsfhf2(%f) = %#.4x, "
	"expected %#.4x\n", a, x, fromRep16(expected));	"expected %#.4x\n", a, toRep16(x), expected);
	}	}
	return ret;	return ret;
	}	}
Context not available.

compiler-rt/test/builtins/Unit/trunctfhf2_test.c

This file was added.

				// RUN: %clang_builtins %s %librt -o %t && %run %t
				// REQUIRES: librt_has_trunctfhf2

				#include "int_lib.h"
				#include <stdio.h>

				#if __LDBL_MANT_DIG__ == 113 && defined(COMPILER_RT_HAS_FLOAT16)

				#include "fp_test.h"

				TYPE_FP16 __trunctfhf2(long double a);

				int test__trunctfhf2(long double a, uint16_t expected) {
				TYPE_FP16 x = __trunctfhf2(a);
				int ret = compareResultH(x, expected);

				if (ret) {
				printf("error in test__trunctfhf2(%.20Lf) = %#.4x, "
				"expected %#.4x\n",
				a, toRep16(x), expected);
				}
				return ret;
				}

				char assumption_1[sizeof(TYPE_FP16) * CHAR_BIT == 16] = {0};

				#endif

				int main() {
				#if __LDBL_MANT_DIG__ == 113 && defined(COMPILER_RT_HAS_FLOAT16)
				// qNaN
				if (test__trunctfhf2(makeQNaN128(),
				UINT16_C(0x7e00)))
				return 1;
				// NaN
				if (test__trunctfhf2(makeNaN128(UINT64_C(0x810000000000)),
				UINT16_C(0x7e00)))
				return 1;
				// inf
				if (test__trunctfhf2(makeInf128(),
				UINT16_C(0x7c00)))
				return 1;
				if (test__trunctfhf2(-makeInf128(),
				UINT16_C(0xfc00)))
				return 1;
				// zero
				if (test__trunctfhf2(0.0L, UINT16_C(0x0)))
				return 1;
				if (test__trunctfhf2(-0.0L, UINT16_C(0x8000)))
				return 1;

				if (test__trunctfhf2(3.1415926535L,
				UINT16_C(0x4248)))
				return 1;
				if (test__trunctfhf2(-3.1415926535L,
				UINT16_C(0xc248)))
				return 1;
				if (test__trunctfhf2(0x1.987124876876324p+100L,
				UINT16_C(0x7c00)))
				return 1;
				if (test__trunctfhf2(0x1.987124876876324p+12L,
				UINT16_C(0x6e62)))
				return 1;
				if (test__trunctfhf2(0x1.0p+0L,
				UINT16_C(0x3c00)))
				return 1;
				if (test__trunctfhf2(0x1.0p-14L,
				UINT16_C(0x0400)))
				return 1;
				// denormal
				if (test__trunctfhf2(0x1.0p-20L,
				UINT16_C(0x0010)))
				return 1;
				if (test__trunctfhf2(0x1.0p-24L,
				UINT16_C(0x0001)))
				return 1;
				if (test__trunctfhf2(-0x1.0p-24L,
				UINT16_C(0x8001)))
				return 1;
				if (test__trunctfhf2(0x1.5p-25L,
				UINT16_C(0x0001)))
				return 1;
				// and back to zero
				if (test__trunctfhf2(0x1.0p-25L,
				UINT16_C(0x0000)))
				return 1;
				if (test__trunctfhf2(-0x1.0p-25L,
				UINT16_C(0x8000)))
				return 1;
				// max (precise)
				if (test__trunctfhf2(65504.0L,
				UINT16_C(0x7bff)))
				return 1;
				// max (rounded)
				if (test__trunctfhf2(65519.0L,
				UINT16_C(0x7bff)))
				return 1;
				// max (to +inf)
				if (test__trunctfhf2(65520.0L,
				UINT16_C(0x7c00)))
				return 1;
				if (test__trunctfhf2(65536.0L,
				UINT16_C(0x7c00)))
				return 1;
				if (test__trunctfhf2(-65520.0L,
				UINT16_C(0xfc00)))
				return 1;

				if (test__trunctfhf2(0x1.23a2abb4a2ddee355f36789abcdep+5L,
				UINT16_C(0x508f)))
				return 1;
				if (test__trunctfhf2(0x1.e3d3c45bd3abfd98b76a54cc321fp-9L,
				UINT16_C(0x1b8f)))
				return 1;
				if (test__trunctfhf2(0x1.234eebb5faa678f4488693abcdefp+453L,
				UINT16_C(0x7c00)))
				return 1;
				if (test__trunctfhf2(0x1.edcba9bb8c76a5a43dd21f334634p-43L,
				UINT16_C(0x0)))
				return 1;
				#else
				printf("skipped\n");
				#endif
				return 0;
				}

llvm/include/llvm/IR/RuntimeLibcalls.def

Context not available.
	HANDLE_LIBCALL(FPEXT_F80_F128, "__extendxftf2")	HANDLE_LIBCALL(FPEXT_F80_F128, "__extendxftf2")
	HANDLE_LIBCALL(FPEXT_F64_F128, "__extenddftf2")	HANDLE_LIBCALL(FPEXT_F64_F128, "__extenddftf2")
	HANDLE_LIBCALL(FPEXT_F32_F128, "__extendsftf2")	HANDLE_LIBCALL(FPEXT_F32_F128, "__extendsftf2")
		HANDLE_LIBCALL(FPEXT_F16_F128, "__extendhftf2")
	HANDLE_LIBCALL(FPEXT_F32_F64, "__extendsfdf2")	HANDLE_LIBCALL(FPEXT_F32_F64, "__extendsfdf2")
	HANDLE_LIBCALL(FPEXT_F16_F32, "__gnu_h2f_ieee")	HANDLE_LIBCALL(FPEXT_F16_F32, "__gnu_h2f_ieee")
	HANDLE_LIBCALL(FPROUND_F32_F16, "__gnu_f2h_ieee")	HANDLE_LIBCALL(FPROUND_F32_F16, "__gnu_f2h_ieee")
Context not available.

llvm/lib/CodeGen/TargetLoweringBase.cpp

Context not available.
	if (OpVT == MVT::f16) {	if (OpVT == MVT::f16) {
	if (RetVT == MVT::f32)	if (RetVT == MVT::f32)
	return FPEXT_F16_F32;	return FPEXT_F16_F32;
		if (RetVT == MVT::f128)
		return FPEXT_F16_F128;
	} else if (OpVT == MVT::f32) {	} else if (OpVT == MVT::f32) {
	if (RetVT == MVT::f64)	if (RetVT == MVT::f64)
	return FPEXT_F32_F64;	return FPEXT_F32_F64;
Context not available.

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

Context not available.
	setOperationAction(ISD::STRICT_UINT_TO_FP, MVT::i32, Custom);	setOperationAction(ISD::STRICT_UINT_TO_FP, MVT::i32, Custom);
	setOperationAction(ISD::STRICT_UINT_TO_FP, MVT::i64, Custom);	setOperationAction(ISD::STRICT_UINT_TO_FP, MVT::i64, Custom);
	setOperationAction(ISD::STRICT_UINT_TO_FP, MVT::i128, Custom);	setOperationAction(ISD::STRICT_UINT_TO_FP, MVT::i128, Custom);
		setOperationAction(ISD::FP_ROUND, MVT::f16, Custom);
	setOperationAction(ISD::FP_ROUND, MVT::f32, Custom);	setOperationAction(ISD::FP_ROUND, MVT::f32, Custom);
	setOperationAction(ISD::FP_ROUND, MVT::f64, Custom);	setOperationAction(ISD::FP_ROUND, MVT::f64, Custom);
		setOperationAction(ISD::STRICT_FP_ROUND, MVT::f16, Custom);
	setOperationAction(ISD::STRICT_FP_ROUND, MVT::f32, Custom);	setOperationAction(ISD::STRICT_FP_ROUND, MVT::f32, Custom);
	setOperationAction(ISD::STRICT_FP_ROUND, MVT::f64, Custom);	setOperationAction(ISD::STRICT_FP_ROUND, MVT::f64, Custom);

Context not available.

llvm/test/CodeGen/AArch64/arm64-fp128.ll

Context not available.
	; CHECK: ret	; CHECK: ret
	}	}

		@varhalf = global half 0.0, align 2
	@varfloat = global float 0.0, align 4	@varfloat = global float 0.0, align 4
	@vardouble = global double 0.0, align 8	@vardouble = global double 0.0, align 8

Context not available.

	%val = load fp128, fp128* @lhs, align 16	%val = load fp128, fp128* @lhs, align 16

		%half = fptrunc fp128 %val to half
		store half %half, half* @varhalf, align 2
		; CHECK: ldr q0, [{{x[0-9]+}}, :lo12:lhs]
		; CHECK: bl __trunctfhf2
		MaskRayUnsubmitted Not Done Reply Inline Actions Add `-NEXT` whenever appropriate MaskRay: Add `-NEXT` whenever appropriate
		; CHECK: str h0, [{{x[0-9]+}}, :lo12:varhalf]

	%float = fptrunc fp128 %val to float	%float = fptrunc fp128 %val to float
	store float %float, float* @varfloat, align 4	store float %float, float* @varfloat, align 4
	; CHECK: bl __trunctfsf2	; CHECK: bl __trunctfsf2
Context not available.

	%val = load fp128, fp128* @lhs, align 16	%val = load fp128, fp128* @lhs, align 16

		%half = load half, half* @varhalf
		%fromhalf = fpext half %half to fp128
		store volatile fp128 %fromhalf, fp128* @lhs, align 16
		; CHECK: ldr h0, [{{x[0-9]+}}, :lo12:varhalf]
		; CHECK: bl __extendhftf2
		; CHECK: str q0, [{{x[0-9]+}}, :lo12:lhs]

	%float = load float, float* @varfloat	%float = load float, float* @varfloat
	%fromfloat = fpext float %float to fp128	%fromfloat = fpext float %float to fp128
	store volatile fp128 %fromfloat, fp128* @lhs, align 16	store volatile fp128 %fromfloat, fp128* @lhs, align 16
Context not available.

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] Support conversion between fp16 and fp128AbandonedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 300739

compiler-rt/cmake/builtin-config-ix.cmake

compiler-rt/lib/builtins/CMakeLists.txt

compiler-rt/lib/builtins/extendhfsf2.c

compiler-rt/lib/builtins/extendhftf2.c

compiler-rt/lib/builtins/fp_extend.h

compiler-rt/lib/builtins/fp_trunc.h

compiler-rt/lib/builtins/truncdfhf2.c

compiler-rt/lib/builtins/truncsfhf2.c

compiler-rt/lib/builtins/trunctfhf2.c

compiler-rt/test/builtins/CMakeLists.txt

compiler-rt/test/builtins/Unit/extendhfsf2_test.c

compiler-rt/test/builtins/Unit/extendhftf2_test.c

compiler-rt/test/builtins/Unit/fp_test.h

compiler-rt/test/builtins/Unit/truncdfhf2_test.c

compiler-rt/test/builtins/Unit/truncsfhf2_test.c

compiler-rt/test/builtins/Unit/trunctfhf2_test.c

llvm/include/llvm/IR/RuntimeLibcalls.def

llvm/lib/CodeGen/TargetLoweringBase.cpp

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

llvm/test/CodeGen/AArch64/arm64-fp128.ll

[AArch64] Support conversion between fp16 and fp128
AbandonedPublic