This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
libc/
-
config/linux/
-
linux/
-
aarch64/
-
entrypoints.txt
-
x86_64/
-
entrypoints.txt
-
src/math/
-
math/
-
CMakeLists.txt
-
fma.h
-
fma.cpp
-
fmaf.cpp
-
generic/
-
CMakeLists.txt
-
fmaf.cpp
-
test/src/math/
-
src/
-
math/
-
CMakeLists.txt
-
fma_test.cpp
-
utils/FPUtil/
-
FPUtil/
1/2
FMA.h
-
aarch64/
-
FMA.h
-
generic/
-
FMA.h
-
README.md
-
x86_64/
-
FMA.h

Differential D100811

[libc] Add hardware implementations of fma and fmaf for x86_64 and aarch64.
ClosedPublic

Authored by sivachandra on Apr 19 2021, 9:03 PM.

Download Raw Diff

Details

Reviewers

lntue

Commits

rG95934c3a37cb: [libc] Add hardware implementations of fma and fmaf for x86_64 and aarch64.

Summary

The current generic implementation of the fmaf function has been moved
to the FPUtil directory. This allows one use the fma operation from
implementations of other math functions like the trignometric functions
without depending on/requiring the fma/fmaf/fmal function targets. If
this pattern ends being convenient, we will switch all generic math
implementations to this pattern.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sivachandra created this revision.Apr 19 2021, 9:03 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 19 2021, 9:03 PM

Herald added subscribers: libc-commits, ecnelises, danielkiss and 3 others. · View Herald Transcript

sivachandra requested review of this revision.Apr 19 2021, 9:03 PM

I have not yet fully tested this patch on x86_64. Will update soon if it requires any changes.

Add fma to the x86_64 entrypoint list.

Harbormaster completed remote builds in B99608: Diff 338701.Apr 19 2021, 9:34 PM

Harbormaster completed remote builds in B99611: Diff 338705.Apr 19 2021, 9:44 PM

lntue added inline comments.Apr 19 2021, 10:04 PM

libc/utils/FPUtil/FMA.h
15	Do we also need to check for _ _FMA__ flag?

Stupid question: why assembler instead of builtins:
https://github.com/gcc-mirror/gcc/blob/master/gcc/testsuite/gcc.target/ia64/builtin-fma-1.c
I guess there is a tradeoff between control over the process and helping the register allocator.

In D100811#2700415, @tschuett wrote:

Stupid question: why assembler instead of builtins:
https://github.com/gcc-mirror/gcc/blob/master/gcc/testsuite/gcc.target/ia64/builtin-fma-1.c
I guess there is a tradeoff between control over the process and helping the register allocator.

Builtins can potentially silently call into the libc. I preferred inline assembly to avoid such a possibility. I think using target specific compiler intrinsics is a better option but they don't have convenient ways to convert from intrinsic types like __m128 to and from float/double.

libc/utils/FPUtil/FMA.h
15	I agree that not checking on `__FMA__` is not ideal. But, I preferred not to condition on any other thing because it complicates the build system. For example, if `__FMA__` was not defined, then we wouldn't get the hardware `fma` and `fmaf` but only get the integer `fmaf` implementation. But, other places will expect that `fma` and `fmaf` both are available. We can of course teach the build system to check for `__FMA__` and propagate that info suitably to the rest of the libc build. That is a change we can take up separately if really required. I would prefer to see more examples of that pattern before building the appropriate machinery in to the build system.

lntue accepted this revision.Apr 20 2021, 2:43 PM

This revision is now accepted and ready to land.Apr 20 2021, 2:43 PM

This revision was landed with ongoing or failed builds.Apr 20 2021, 9:31 PM

Closed by commit rG95934c3a37cb: [libc] Add hardware implementations of fma and fmaf for x86_64 and aarch64. (authored by sivachandra). · Explain Why

This revision was automatically updated to reflect the committed changes.

sivachandra added a commit: rG95934c3a37cb: [libc] Add hardware implementations of fma and fmaf for x86_64 and aarch64..

Revision Contents

Path

Size

libc/

config/

linux/

aarch64/

entrypoints.txt

1 line

x86_64/

entrypoints.txt

1 line

src/

math/

26 lines

18 lines

20 lines

20 lines

generic/

CMakeLists.txt

15 lines

fmaf.cpp

test/

src/

math/

CMakeLists.txt

13 lines

fma_test.cpp

23 lines

utils/

FPUtil/

FMA.h

37 lines

aarch64/

FMA.h

38 lines

generic/

README.md

28 lines

x86_64/

FMA.h

40 lines

	utils/	FPUtil/	generic/
	src/	math/	generic/

	FMA.h
	fmaf.cpp

21 lines

Diff 339095

libc/config/linux/aarch64/entrypoints.txt

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	set(TARGET_LIBM_ENTRYPOINTS
libc.src.math.fabsf		libc.src.math.fabsf
libc.src.math.fabsl		libc.src.math.fabsl
libc.src.math.fdim		libc.src.math.fdim
libc.src.math.fdimf		libc.src.math.fdimf
libc.src.math.fdiml		libc.src.math.fdiml
libc.src.math.floor		libc.src.math.floor
libc.src.math.floorf		libc.src.math.floorf
libc.src.math.floorl		libc.src.math.floorl
		libc.src.math.fma
libc.src.math.fmaf		libc.src.math.fmaf
libc.src.math.fmax		libc.src.math.fmax
libc.src.math.fmaxf		libc.src.math.fmaxf
libc.src.math.fmaxl		libc.src.math.fmaxl
libc.src.math.fmin		libc.src.math.fmin
libc.src.math.fminf		libc.src.math.fminf
libc.src.math.fminl		libc.src.math.fminl
libc.src.math.frexp		libc.src.math.frexp
▲ Show 20 Lines • Show All 69 Lines • Show Last 20 Lines

libc/config/linux/x86_64/entrypoints.txt

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	set(TARGET_LIBM_ENTRYPOINTS
libc.src.math.fabsf		libc.src.math.fabsf
libc.src.math.fabsl		libc.src.math.fabsl
libc.src.math.fdim		libc.src.math.fdim
libc.src.math.fdimf		libc.src.math.fdimf
libc.src.math.fdiml		libc.src.math.fdiml
libc.src.math.floor		libc.src.math.floor
libc.src.math.floorf		libc.src.math.floorf
libc.src.math.floorl		libc.src.math.floorl
		libc.src.math.fma
libc.src.math.fmaf		libc.src.math.fmaf
libc.src.math.fmin		libc.src.math.fmin
libc.src.math.fminf		libc.src.math.fminf
libc.src.math.fminl		libc.src.math.fminl
libc.src.math.fmax		libc.src.math.fmax
libc.src.math.fmaxf		libc.src.math.fmaxf
libc.src.math.fmaxl		libc.src.math.fmaxl
libc.src.math.frexp		libc.src.math.frexp
▲ Show 20 Lines • Show All 104 Lines • Show Last 20 Lines

libc/src/math/CMakeLists.txt

Show All 26 Lines	add_entrypoint_object(
.generic.${name}		.generic.${name}
)		)
return()		return()
endif()		endif()

message(FATAL_ERROR "No machine specific or generic implementation found for ${name}.")		message(FATAL_ERROR "No machine specific or generic implementation found for ${name}.")
endfunction()		endfunction()

		add_entrypoint_object(
		fmaf
		SRCS
		fmaf.cpp
		HDRS
		fmaf.h
		DEPENDS
		libc.utils.FPUtil.fputil
		COMPILE_OPTIONS
		-O2
		)

		add_entrypoint_object(
		fma
		SRCS
		fma.cpp
		HDRS
		fma.h
		DEPENDS
		libc.utils.FPUtil.fputil
		COMPILE_OPTIONS
		-O2
		)

add_math_entrypoint_object(ceil)		add_math_entrypoint_object(ceil)
add_math_entrypoint_object(ceilf)		add_math_entrypoint_object(ceilf)
add_math_entrypoint_object(ceill)		add_math_entrypoint_object(ceill)

add_math_entrypoint_object(copysign)		add_math_entrypoint_object(copysign)
add_math_entrypoint_object(copysignf)		add_math_entrypoint_object(copysignf)
add_math_entrypoint_object(copysignl)		add_math_entrypoint_object(copysignl)

Show All 10 Lines
add_math_entrypoint_object(fdim)		add_math_entrypoint_object(fdim)
add_math_entrypoint_object(fdimf)		add_math_entrypoint_object(fdimf)
add_math_entrypoint_object(fdiml)		add_math_entrypoint_object(fdiml)

add_math_entrypoint_object(floor)		add_math_entrypoint_object(floor)
add_math_entrypoint_object(floorf)		add_math_entrypoint_object(floorf)
add_math_entrypoint_object(floorl)		add_math_entrypoint_object(floorl)

add_math_entrypoint_object(fmaf)

add_math_entrypoint_object(fmax)		add_math_entrypoint_object(fmax)
add_math_entrypoint_object(fmaxf)		add_math_entrypoint_object(fmaxf)
add_math_entrypoint_object(fmaxl)		add_math_entrypoint_object(fmaxl)

add_math_entrypoint_object(fmin)		add_math_entrypoint_object(fmin)
add_math_entrypoint_object(fminf)		add_math_entrypoint_object(fminf)
add_math_entrypoint_object(fminl)		add_math_entrypoint_object(fminl)

▲ Show 20 Lines • Show All 74 Lines • Show Last 20 Lines

libc/src/math/fma.h

This file was added.

				//===-- Implementation header for fma ---------------------------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_LIBC_SRC_MATH_FMA_H
				#define LLVM_LIBC_SRC_MATH_FMA_H

				namespace __llvm_libc {

				double fma(double x, double y, double z);

				} // namespace __llvm_libc

				#endif // LLVM_LIBC_SRC_MATH_FMA_H

libc/src/math/fma.cpp

This file was added.

				//===-- Implementation of fma function ------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "src/math/fma.h"
				#include "src/__support/common.h"

				#include "utils/FPUtil/FMA.h"

				namespace __llvm_libc {

				LLVM_LIBC_FUNCTION(double, fma, (double x, double y, double z)) {
				return fputil::fma(x, y, z);
				}

				} // namespace __llvm_libc

libc/src/math/fmaf.cpp

This file was added.

				//===-- Implementation of fmaf function -----------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "src/math/fmaf.h"
				#include "src/__support/common.h"

				#include "utils/FPUtil/FMA.h"

				namespace __llvm_libc {

				LLVM_LIBC_FUNCTION(float, fmaf, (float x, float y, float z)) {
				return fputil::fma(x, y, z);
				}

				} // namespace __llvm_libc

libc/src/math/generic/CMakeLists.txt

Show First 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	add_entrypoint_object(
SRCS		SRCS
sinf.cpp		sinf.cpp
HDRS		HDRS
../sinf.h		../sinf.h
DEPENDS		DEPENDS
.sincosf_utils		.sincosf_utils
libc.include.math		libc.include.math
libc.src.errno.__errno_location		libc.src.errno.__errno_location
		libc.utils.FPUtil.fputil
		COMPILE_OPTIONS
		-O3
)		)

add_entrypoint_object(		add_entrypoint_object(
sincosf		sincosf
SRCS		SRCS
sincosf.cpp		sincosf.cpp
HDRS		HDRS
../sincosf.h		../sincosf.h
▲ Show 20 Lines • Show All 883 Lines • ▼ Show 20 Lines	SRCS
nextafterl.cpp		nextafterl.cpp
HDRS		HDRS
../nextafterl.h		../nextafterl.h
DEPENDS		DEPENDS
libc.utils.FPUtil.fputil		libc.utils.FPUtil.fputil
COMPILE_OPTIONS		COMPILE_OPTIONS
-O2		-O2
)		)

add_entrypoint_object(
fmaf
SRCS
fmaf.cpp
HDRS
../fmaf.h
DEPENDS
libc.utils.FPUtil.fputil
COMPILE_OPTIONS
-O2
)

libc/src/math/generic/fmaf.cpp

This file was moved to libc/utils/FPUtil/generic/FMA.h.

libc/test/src/math/CMakeLists.txt

Show First 20 Lines • Show All 1,113 Lines • ▼ Show 20 Lines	add_fp_unittest(
SRCS		SRCS
fmaf_test.cpp		fmaf_test.cpp
DEPENDS		DEPENDS
libc.include.math		libc.include.math
libc.src.math.fmaf		libc.src.math.fmaf
libc.utils.FPUtil.fputil		libc.utils.FPUtil.fputil
)		)

		add_fp_unittest(
		fma_test
		NEED_MPFR
		SUITE
		libc_math_unittests
		SRCS
		fma_test.cpp
		DEPENDS
		libc.include.math
		libc.src.math.fma
		libc.utils.FPUtil.fputil
		)

add_subdirectory(generic)		add_subdirectory(generic)
add_subdirectory(exhaustive)		add_subdirectory(exhaustive)

libc/test/src/math/fma_test.cpp

This file was added.

				//===-- Unittests for fma ------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "FmaTest.h"

				#include "src/math/fma.h"

				using LlvmLibcFmaTest = FmaTestTemplate<double>;

				TEST_F(LlvmLibcFmaTest, SpecialNumbers) {
				testSpecialNumbers(&__llvm_libc::fma);
				}

				TEST_F(LlvmLibcFmaTest, SubnormalRange) {
				testSubnormalRange(&__llvm_libc::fma);
				}

				TEST_F(LlvmLibcFmaTest, NormalRange) { testNormalRange(&__llvm_libc::fma); }

libc/utils/FPUtil/FMA.h

This file was added.

				//===-- Common header for FMA implementations -------------------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_LIBC_UTILS_FPUTIL_FMA_H
				#define LLVM_LIBC_UTILS_FPUTIL_FMA_H

				#include "utils/CPP/TypeTraits.h"

				#ifdef __x86_64__
				#include "x86_64/FMA.h"
				lntueUnsubmitted Not Done Reply Inline Actions Do we also need to check for _ _FMA__ flag? lntue: Do we also need to check for _ _FMA__ flag?
				sivachandraAuthorUnsubmitted Done Reply Inline Actions I agree that not checking on `__FMA__` is not ideal. But, I preferred not to condition on any other thing because it complicates the build system. For example, if `__FMA__` was not defined, then we wouldn't get the hardware `fma` and `fmaf` but only get the integer `fmaf` implementation. But, other places will expect that `fma` and `fmaf` both are available. We can of course teach the build system to check for `__FMA__` and propagate that info suitably to the rest of the libc build. That is a change we can take up separately if really required. I would prefer to see more examples of that pattern before building the appropriate machinery in to the build system. sivachandra: I agree that not checking on `__FMA__` is not ideal. But, I preferred not to condition on any…
				#elif defined(__aarch64__)
				#include "aarch64/FMA.h"
				#else
				#include "generic/FMA.h"

				namespace __llvm_libc {
				namespace fputil {

				// We have a generic implementation available only for single precision fma os
				// we restrict it to float values for now.
				template <typename T>
				static inline cpp::EnableIfType<cpp::IsSame<T, float>::Value, T> fma(T x, T y,
				T z) {
				return generic::fma(x, y, z);
				}

				} // namespace fputil
				} // namespace __llvm_libc

				#endif

				#endif // LLVM_LIBC_UTILS_FPUTIL_FMA_H

libc/utils/FPUtil/aarch64/FMA.h

This file was added.

				//===-- Aarch64 implementations of the fma function -------------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_LIBC_UTILS_FPUTIL_AARCH64_FMA_H
				#define LLVM_LIBC_UTILS_FPUTIL_AARCH64_FMA_H

				#include "utils/CPP/TypeTraits.h"

				namespace __llvm_libc {
				namespace fputil {

				template <typename T>
				cpp::EnableIfType<cpp::IsSame<T, float>::Value, T> fma(T x, T y, T z) {
				float result;
				__asm__ __volatile__("fmadd %s0, %s1, %s2, %s3\n\t"
				: "=w"(result)
				: "w"(x), "w"(y), "w"(z));
				return result;
				}

				template <typename T>
				cpp::EnableIfType<cpp::IsSame<T, double>::Value, T> fma(T x, T y, T z) {
				double result;
				__asm__ __volatile__("fmadd %d0, %d1, %d2, %d3\n\t"
				: "=w"(result)
				: "w"(x), "w"(y), "w"(z));
				return result;
				}

				} // namespace fputil
				} // namespace __llvm_libc

				#endif // LLVM_LIBC_UTILS_FPUTIL_AARCH64_FMA_H

libc/utils/FPUtil/generic/FMA.h

This file was moved from libc/src/math/generic/fmaf.cpp.

//===-- Implementation of fmaf function -----------------------------------===//		//===-- Common header for FMA implementations -------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "src/math/fmaf.h"		#ifndef LLVM_LIBC_UTILS_FPUTIL_GENERIC_FMA_H
#include "src/__support/common.h"		#define LLVM_LIBC_UTILS_FPUTIL_GENERIC_FMA_H

#include "utils/FPUtil/FEnv.h"		#include "utils/CPP/TypeTraits.h"
#include "utils/FPUtil/FPBits.h"

namespace __llvm_libc {		namespace __llvm_libc {
		namespace fputil {
		namespace generic {

LLVM_LIBC_FUNCTION(float, fmaf, (float x, float y, float z)) {		template <typename T>
		static inline cpp::EnableIfType<cpp::IsSame<T, float>::Value, T> fma(T x, T y,
		T z) {
// Product is exact.		// Product is exact.
double prod = static_cast<double>(x) * static_cast<double>(y);		double prod = static_cast<double>(x) * static_cast<double>(y);
double z_d = static_cast<double>(z);		double z_d = static_cast<double>(z);
double sum = prod + z_d;		double sum = prod + z_d;
fputil::FPBits<double> bit_prod(prod), bitz(z_d), bit_sum(sum);		fputil::FPBits<double> bit_prod(prod), bitz(z_d), bit_sum(sum);

if (!(bit_sum.isInfOrNaN() \|\| bit_sum.isZero())) {		if (!(bit_sum.isInfOrNaN() \|\| bit_sum.isZero())) {
// Since the sum is computed in double precision, rounding might happen		// Since the sum is computed in double precision, rounding might happen
Show All 31 Lines	if (!t.isZero() && ((bit_sum.encoding.mantissa & 0xfff'ffffULL) == 0)) {
--bit_sum.encoding.mantissa;		--bit_sum.encoding.mantissa;
}		}
}		}
}		}

return static_cast<float>(static_cast<double>(bit_sum));		return static_cast<float>(static_cast<double>(bit_sum));
}		}

		} // namespace generic
		} // namespace fputil
} // namespace __llvm_libc		} // namespace __llvm_libc

		#endif // Generic fma implementations

		#endif // LLVM_LIBC_UTILS_FPUTIL_GENERIC_FMA_H

libc/utils/FPUtil/generic/README.md

This file was added.

				This directory contains machine independent implementations of floating point
				operations. The implementations are nested in the namespace
				`__llvm_libc::fputil::generic`. This is to facilitate calling these generic
				implementations from machine dependent implementations. Consider the example of
				the fuse-multiply-add operation (FMA). The C standard library requires three
				different flavors, `fma` which operates double precsion numbers, `fmaf` which
				operates on single precision numbers, and `fmal` which operates on `lond double`
				numbers. On Aarch64, there are hardware instructions which implement the single
				and double precision flavors but not the `long double` flavor. For such targets,
				we want to be able to call the generic `long double` implementation from the
				`long double` flavor. By putting the generic implementations in a separate
				nested namespace, we will be to call them as follows:

				```
				namespace __llvm_libc {
				namespace fputil {

				long double fmal(long double x, long double y, long double z) {
				return generic::fmal(x, y, z);
				}

				} // namespace fputil
				} // namespace __llvm_libc
				```

				Note that actual code might not be as straightforward as above (for example,
				we might want to prevent implicit type promotions by using some template
				facilities). But, the general idea is very similar.

libc/utils/FPUtil/x86_64/FMA.h

This file was added.

				//===-- x86_64 implementations of the fma function --------------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_LIBC_UTILS_FPUTIL_X86_64_FMA_H
				#define LLVM_LIBC_UTILS_FPUTIL_X86_64_FMA_H

				#include "utils/CPP/TypeTraits.h"

				namespace __llvm_libc {
				namespace fputil {

				template <typename T>
				static inline cpp::EnableIfType<cpp::IsSame<T, float>::Value, T> fma(T x, T y,
				T z) {
				float result = x;
				__asm__ __volatile__("vfmadd213ss %x2, %x1, %x0"
				: "+x"(result)
				: "x"(y), "x"(z));
				return result;
				}

				template <typename T>
				static inline cpp::EnableIfType<cpp::IsSame<T, double>::Value, T> fma(T x, T y,
				T z) {
				double result = x;
				__asm__ __volatile__("vfmadd213sd %x2, %x1, %x0"
				: "+x"(result)
				: "x"(y), "x"(z));
				return result;
				}

				} // namespace fputil
				} // namespace __llvm_libc

				#endif // LLVM_LIBC_UTILS_FPUTIL_X86_64_FMA_H

This is an archive of the discontinued LLVM Phabricator instance.

[libc] Add hardware implementations of fma and fmaf for x86_64 and aarch64.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 339095

libc/config/linux/aarch64/entrypoints.txt

libc/config/linux/x86_64/entrypoints.txt

libc/src/math/CMakeLists.txt

libc/src/math/fma.h

libc/src/math/fma.cpp

libc/src/math/fmaf.cpp

libc/src/math/generic/CMakeLists.txt

libc/src/math/generic/fmaf.cpp

libc/test/src/math/CMakeLists.txt

libc/test/src/math/fma_test.cpp

libc/utils/FPUtil/FMA.h

libc/utils/FPUtil/aarch64/FMA.h

libc/utils/FPUtil/generic/FMA.h

libc/utils/FPUtil/generic/README.md

libc/utils/FPUtil/x86_64/FMA.h

[libc] Add hardware implementations of fma and fmaf for x86_64 and aarch64.
ClosedPublic