This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Target/ARM/
-
Target/
-
ARM/
3/5
ARMISelLowering.cpp
-
test/CodeGen/ARM/
-
CodeGen/
-
ARM/
-
fp16.ll

Differential D94557

[ARM] Fixed incorrect lowering when using GNUEABI (libgcc) and 16bit floats
Needs ReviewPublic

Authored by kevinpeizner on Jan 12 2021, 2:44 PM.

Download Raw Diff

Details

Reviewers

rengolin
compnerd
peter.smith

Summary

When compiling code that uses 16bit floating point and the compilation
targets GNUEABI, the intrinsic __gnu_d2h_ieee from libgcc should be
used rather than __aeabi_d2h which is not available in libgcc.

An example of this issue can be found in this Compiler Explorer
instance (using clang trunk as of Jan 04, 2020):
https://godbolt.org/z/anj1Y6

In the Compiler Explorer example, if you switch the target between
armv7-unknown-linux-gnueabihf and armv7-unknown-linux-eabihf you can
see many intrinsics are correctly swapped except for __aeabi_d2h.

This commit makes correct the code comment (line 710 in
ARMISelLowering.cpp as of commit 32c47ebef18):

  // In EABI, these functions have an __aeabi_ prefix, but in GNUEABI they have
  // a __gnu_ prefix (which is the default).

Diff Detail

Event Timeline

kevinpeizner created this revision.Jan 12 2021, 2:44 PM

Herald added subscribers: danielkiss, hiraditya, kristof.beyls. · View Herald TranscriptJan 12 2021, 2:44 PM

kevinpeizner requested review of this revision.Jan 12 2021, 2:44 PM

Herald added a project: Restricted Project. · View Herald TranscriptJan 12 2021, 2:44 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B84927: Diff 316243.Jan 12 2021, 4:16 PM

Also, please use diff -U9999 as documented: https://llvm.org/docs/Phabricator.html#requesting-a-review-via-the-web-interface

llvm/lib/Target/ARM/ARMISelLowering.cpp
601	This doesn't look like the right fix. You're just replacing for all, no? Your changes to the tests don't show it can still emit EABI intrinsics with the right ABI flag.

Updated patch with diff -U9999

kevinpeizner added inline comments.Jan 13 2021, 9:15 AM

llvm/lib/Target/ARM/ARMISelLowering.cpp
601	The `i16 @test_to_fp16(double %in)` in `llvm/test/CodeGen/ARM/fp16.ll` test checks for different intrinsics based on the EABI flag. For example, when the test runs with `; RUN: llc -mtriple=armv7a--none-eabi < %s \| FileCheck --check-prefix=CHECK --check-prefix=CHECK-HARDFLOAT-EABI %s` then the test will verify that `bl __aeabi_d2h` is emitted rather than `bl __gnu_d2h_ieee`. I interpret this to mean that my changes still emit EABI intrinsics with the right ABI flag. This is my first contribution to the LLVM Project so please let me know if this is incorrect. The intent is to emit `__gnu_d2h_ieee` by default while allowing `__aeabi_d2h` to be emitted when EABI is targeted.
720	Here `__gnu_d2h_ieee` is replaced with `__aeabi_d2h` if and only if EABI is targeted.

rengolin added reviewers: compnerd, peter.smith.Jan 13 2021, 9:41 AM

rengolin added inline comments.

llvm/lib/Target/ARM/ARMISelLowering.cpp
601	You're right, I didn't catch the other tests due to the lack of context. But this still looks like the wrong place. It's been a long time I touched this part of the code, but it being the only "gnu" replacement where all the rest is still "eabi" looks wrong. There must be another place where the "gnu" calls are being replaced and probably just adding "d2h" there would do the trick?

kevinpeizner added inline comments.Jan 13 2021, 11:36 AM

llvm/lib/Target/ARM/ARMISelLowering.cpp
601	Grepping for `isTargetGNUAEABI` shows that there is no conditional block for GNUAEABI && !AEABI. On the other hand, grepping for `isTargetAEABI` shows that there is one conditional block for AEABI && !GNUAEABI (see line 713). I can add a new conditional block to achieve the same result, if that is preferred. However, based on the comment at line 712, I thought changing the default intrinsic to `__gnu_d2h_ieee` made the most sense and minimized code change.

I agree with Renato, this seems incorrect. In the EABI mode, we should prefer the EABI symbols.

In D94557#2498984, @compnerd wrote:

I agree with Renato, this seems incorrect. In the EABI mode, we should prefer the EABI symbols.

Maybe I'm miss understanding, but this change ensures EABI symbols are still used when EABI is targeted. The fp16.ll regression test verifies this is the case, no?

Would it be better to default to EABI symbols and conditionally use GNU symbols based on the target? I can certain switch to that approach, but I didn't go that way initially since this code block implies GNU symbols should be the default:

// In EABI, these functions have an __aeabi_ prefix, but in GNUEABI they have
// a __gnu_ prefix (which is the default).
if (Subtarget->isTargetAEABI()) {
  static const struct {
    const RTLIB::Libcall Op;
    const char * const Name;
    const CallingConv::ID CC;
  } LibraryCalls[] = {
    { RTLIB::FPROUND_F32_F16, "__aeabi_f2h", CallingConv::ARM_AAPCS },
    { RTLIB::FPROUND_F64_F16, "__aeabi_d2h", CallingConv::ARM_AAPCS },
    { RTLIB::FPEXT_F16_F32, "__aeabi_h2f", CallingConv::ARM_AAPCS },
  };

Here's some observations that might help figure out what path to take

According to the discussion on this ticket , there are only four arm __aeabi prefixed functions in compiter-rt that aren't present in libgcc:

__aeabi_memcpy
__aeabi_memmove
__aeabi_memset
__aeabi_d2h

memcpy, memmove and memset default to being lowered to the non __aebi prefixed functions in include/llvm/IR/RuntimeLibcalls.def:

HANDLE_LIBCALL(MEMCPY, "memcpy")
HANDLE_LIBCALL(MEMMOVE, "memmove")
HANDLE_LIBCALL(MEMSET, "memset")

A code path in ARMISelLowering.cpp switches them to __aeabi_memcpy, __aeabi_memmove and __aeabi_memset if the "EABIVersion" option is set to EABI4 or EABI5.

Also in include/llvm/IR/RuntimeLibcalls.def, the FPROUND_F32_F16 and FPROUND_F32_F16 library calls default to __gnu prefixed functions:

HANDLE_LIBCALL(FPEXT_F16_F32, "__gnu_h2f_ieee")
HANDLE_LIBCALL(FPROUND_F32_F16, "__gnu_f2h_ieee")
HANDLE_LIBCALL(FPROUND_F64_F16, "__truncdfhf2")

Code in ARMISelLowering.cpp is currently unconditionally switching the FPROUND_F64_F16 default lowering to __aeabi_d2h. Kevin's change would make it default to __gnu_d2h_ieee

There is a code path in ARMISelLowering.cpp that changes FPROUND_F32_F16, FPROUND_F64_F16, and FPEXT_F16_F32 to the __eabui prefixed functions when EABI4 or EABI5 is requested (mentioned in Kevin's last comment)

To me it seems the proposed change would make the default FPROUND_F64_F16 lowering more consistent with that of FPEXT_F16_F32 and FPROUND_F32_F16

That said, I don't know if making FPROUND_F64_F16 consistent with FPEXT_F16_F32 and FPROUND_F32_F16 is necessarily the right thing to do

Being explicit with a Subtarget->isTargetGNUAEABI() code branch also seems like a good option and would minimize change to other subtargets

For example, I'm not sure what the android triplet is, but the swift stdlib is already adding an implementation for __aeabi_d2h for android armv7 platforms: https://github.com/apple/swift/blob/cd2d4cc287814c39f33ee1f57ec05aac3f0537e4/stdlib/public/runtime/Float16Support.cpp#L168

As I understand it, this is about what library call should be made when there is an eabi target such as arm-none-eabi which should target the Run Time ABI for the Arm Architecture https://github.com/ARM-software/abi-aa/blob/master/rtabi32/rtabi32.rst and other Arm targets such as Linux or Android. I think if we're arguing from the specification then clang is compliant with the eabi, so relocatable objects produced by clang should be compatible with eabi compliant toolchains that supply the eabi functions, this includes non GNU toolchains as well like IAR.

I believe that in this case GNUEABI is intending to be compliant with eabi, as it does supply aeabi prefixed functions. I think in this case it is likely an omission in gcc/libgcc as even arm-none-eabi-gcc with -mfp16-format=ieee which is supposed to be targeting the bare-metal eabi uses __gnu_d2h_ieee however even if this is fixed then there will be lots of libgcc libraries without the function.

I've asked the Arm GNU team if they can remember why the aeabi functions were not used for FP16 in GCC https://gcc.gnu.org/pipermail/gcc-patches/2016-November/462046.html

Having said that, even if this is working around a bug in libgcc, if this is restricted to GNUEABI then it may be the right thing to do as there will be many existing libgcc libraries without it.

As an aside, the __aeabi_memcpy, __aeabi_memmove, __aeabi_memset functions are a bit of a special case in that these are intended to be "optimized" functions, all libraries have memcpy, memmove and memset so it is always safe to put out a call to one of these and be compatible with everything. Emitting a call to __gnu_d2h_ieee is unlikely to be present in a toolchain not trying to be compatible with GCC so it is more of a risk to do so.

Revision Contents

Path

Size

llvm/

lib/

Target/

ARM/

ARMISelLowering.cpp

2 lines

test/

CodeGen/

ARM/

fp16.ll

4 lines

Diff 316419

llvm/lib/Target/ARM/ARMISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 592 Lines • ▼ Show 20 Lines	static const struct {
{ RTLIB::FPTOSINT_F32_I32, "__aeabi_f2iz", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::FPTOSINT_F32_I32, "__aeabi_f2iz", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },
{ RTLIB::FPTOUINT_F32_I32, "__aeabi_f2uiz", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::FPTOUINT_F32_I32, "__aeabi_f2uiz", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },
{ RTLIB::FPTOSINT_F32_I64, "__aeabi_f2lz", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::FPTOSINT_F32_I64, "__aeabi_f2lz", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },
{ RTLIB::FPTOUINT_F32_I64, "__aeabi_f2ulz", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::FPTOUINT_F32_I64, "__aeabi_f2ulz", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },

// Conversions between floating types.		// Conversions between floating types.
// RTABI chapter 4.1.2, Table 7		// RTABI chapter 4.1.2, Table 7
{ RTLIB::FPROUND_F64_F32, "__aeabi_d2f", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::FPROUND_F64_F32, "__aeabi_d2f", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },
{ RTLIB::FPROUND_F64_F16, "__aeabi_d2h", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::FPROUND_F64_F16, "__gnu_d2h_ieee", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },
		rengolinUnsubmitted Not Done Reply Inline Actions This doesn't look like the right fix. You're just replacing for all, no? Your changes to the tests don't show it can still emit EABI intrinsics with the right ABI flag. rengolin: This doesn't look like the right fix. You're just replacing for all, no? Your changes to the…
		kevinpeiznerAuthorUnsubmitted Done Reply Inline Actions The `i16 @test_to_fp16(double %in)` in `llvm/test/CodeGen/ARM/fp16.ll` test checks for different intrinsics based on the EABI flag. For example, when the test runs with `; RUN: llc -mtriple=armv7a--none-eabi < %s \| FileCheck --check-prefix=CHECK --check-prefix=CHECK-HARDFLOAT-EABI %s` then the test will verify that `bl __aeabi_d2h` is emitted rather than `bl __gnu_d2h_ieee`. I interpret this to mean that my changes still emit EABI intrinsics with the right ABI flag. This is my first contribution to the LLVM Project so please let me know if this is incorrect. The intent is to emit `__gnu_d2h_ieee` by default while allowing `__aeabi_d2h` to be emitted when EABI is targeted. kevinpeizner: The `i16 @test_to_fp16(double %in)` in `llvm/test/CodeGen/ARM/fp16.ll` test checks for…
		rengolinUnsubmitted Not Done Reply Inline Actions You're right, I didn't catch the other tests due to the lack of context. But this still looks like the wrong place. It's been a long time I touched this part of the code, but it being the only "gnu" replacement where all the rest is still "eabi" looks wrong. There must be another place where the "gnu" calls are being replaced and probably just adding "d2h" there would do the trick? rengolin: You're right, I didn't catch the other tests due to the lack of context. But this still looks…
		kevinpeiznerAuthorUnsubmitted Done Reply Inline Actions Grepping for `isTargetGNUAEABI` shows that there is no conditional block for GNUAEABI && !AEABI. On the other hand, grepping for `isTargetAEABI` shows that there is one conditional block for AEABI && !GNUAEABI (see line 713). I can add a new conditional block to achieve the same result, if that is preferred. However, based on the comment at line 712, I thought changing the default intrinsic to `__gnu_d2h_ieee` made the most sense and minimized code change. kevinpeizner: Grepping for `isTargetGNUAEABI` shows that there is no conditional block for GNUAEABI && !AEABI.
{ RTLIB::FPEXT_F32_F64, "__aeabi_f2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::FPEXT_F32_F64, "__aeabi_f2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },

// Integer to floating-point conversions.		// Integer to floating-point conversions.
// RTABI chapter 4.1.2, Table 8		// RTABI chapter 4.1.2, Table 8
{ RTLIB::SINTTOFP_I32_F64, "__aeabi_i2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::SINTTOFP_I32_F64, "__aeabi_i2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },
{ RTLIB::UINTTOFP_I32_F64, "__aeabi_ui2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::UINTTOFP_I32_F64, "__aeabi_ui2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },
{ RTLIB::SINTTOFP_I64_F64, "__aeabi_l2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::SINTTOFP_I64_F64, "__aeabi_l2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },
{ RTLIB::UINTTOFP_I64_F64, "__aeabi_ul2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },		{ RTLIB::UINTTOFP_I64_F64, "__aeabi_ul2d", CallingConv::ARM_AAPCS, ISD::SETCC_INVALID },
▲ Show 20 Lines • Show All 102 Lines • ▼ Show 20 Lines	ARMTargetLowering::ARMTargetLowering(const TargetMachine &TM,
// a __gnu_ prefix (which is the default).		// a __gnu_ prefix (which is the default).
if (Subtarget->isTargetAEABI()) {		if (Subtarget->isTargetAEABI()) {
static const struct {		static const struct {
const RTLIB::Libcall Op;		const RTLIB::Libcall Op;
const char * const Name;		const char * const Name;
const CallingConv::ID CC;		const CallingConv::ID CC;
} LibraryCalls[] = {		} LibraryCalls[] = {
{ RTLIB::FPROUND_F32_F16, "__aeabi_f2h", CallingConv::ARM_AAPCS },		{ RTLIB::FPROUND_F32_F16, "__aeabi_f2h", CallingConv::ARM_AAPCS },
{ RTLIB::FPROUND_F64_F16, "__aeabi_d2h", CallingConv::ARM_AAPCS },		{ RTLIB::FPROUND_F64_F16, "__aeabi_d2h", CallingConv::ARM_AAPCS },
		kevinpeiznerAuthorUnsubmitted Done Reply Inline Actions Here `__gnu_d2h_ieee` is replaced with `__aeabi_d2h` if and only if EABI is targeted. kevinpeizner: Here `__gnu_d2h_ieee` is replaced with `__aeabi_d2h` if and only if EABI is targeted.
{ RTLIB::FPEXT_F16_F32, "__aeabi_h2f", CallingConv::ARM_AAPCS },		{ RTLIB::FPEXT_F16_F32, "__aeabi_h2f", CallingConv::ARM_AAPCS },
};		};

for (const auto &LC : LibraryCalls) {		for (const auto &LC : LibraryCalls) {
setLibcallName(LC.Op, LC.Name);		setLibcallName(LC.Op, LC.Name);
setLibcallCallingConv(LC.Op, LC.CC);		setLibcallCallingConv(LC.Op, LC.CC);
}		}
}		}
▲ Show 20 Lines • Show All 18,636 Lines • Show Last 20 Lines

llvm/test/CodeGen/ARM/fp16.ll

Show First 20 Lines • Show All 76 Lines • ▼ Show 20 Lines	; CHECK-SOFTFLOAT-GNU: bl __aeabi_f2d
ret double %val		ret double %val
}		}

define i16 @test_to_fp16(double %in) {		define i16 @test_to_fp16(double %in) {
; CHECK-LABEL: test_to_fp16:		; CHECK-LABEL: test_to_fp16:
%val = call i16 @llvm.convert.to.fp16.f64(double %in)		%val = call i16 @llvm.convert.to.fp16.f64(double %in)
; CHECK-HARDFLOAT-EABI: bl __aeabi_d2h		; CHECK-HARDFLOAT-EABI: bl __aeabi_d2h

; CHECK-HARDFLOAT-GNU: bl __aeabi_d2h		; CHECK-HARDFLOAT-GNU: bl __gnu_d2h_ieee

; CHECK-FP16-SAFE: bl __aeabi_d2h		; CHECK-FP16-SAFE: bl __aeabi_d2h

; CHECK-FP16-UNSAFE: vcvt.f32.f64 s0, d0		; CHECK-FP16-UNSAFE: vcvt.f32.f64 s0, d0
; CHECK-FP16-UNSAFE-NEXT: vcvtb.f16.f32 s0, s0		; CHECK-FP16-UNSAFE-NEXT: vcvtb.f16.f32 s0, s0

; CHECK-ARMV8: vcvtb.f16.f64 [[TMP:s[0-9]+]], d0		; CHECK-ARMV8: vcvtb.f16.f64 [[TMP:s[0-9]+]], d0
; CHECK-ARMV8: vmov r0, [[TMP]]		; CHECK-ARMV8: vmov r0, [[TMP]]

; CHECK-SOFTFLOAT-EABI: bl __aeabi_d2h		; CHECK-SOFTFLOAT-EABI: bl __aeabi_d2h

; CHECK-SOFTFLOAT-GNU: bl __aeabi_d2h		; CHECK-SOFTFLOAT-GNU: bl __gnu_d2h_ieee
ret i16 %val		ret i16 %val
}		}

declare float @llvm.convert.from.fp16.f32(i16) nounwind readnone		declare float @llvm.convert.from.fp16.f32(i16) nounwind readnone
declare double @llvm.convert.from.fp16.f64(i16) nounwind readnone		declare double @llvm.convert.from.fp16.f64(i16) nounwind readnone

declare i16 @llvm.convert.to.fp16.f32(float) nounwind readnone		declare i16 @llvm.convert.to.fp16.f32(float) nounwind readnone
declare i16 @llvm.convert.to.fp16.f64(double) nounwind readnone		declare i16 @llvm.convert.to.fp16.f64(double) nounwind readnone