This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
libc/
-
docs/
-
math.rst
-
src/math/generic/
-
math/
-
generic/
-
CMakeLists.txt
-
exp2f.cpp
-
explogxf.h
-
test/src/math/
-
src/
-
math/
-
exhaustive/
-
exp2f_test.cpp
-
explogxf_test.cpp

Differential D133870

[libc][math] Improve exp2f performance.
ClosedPublic

Authored by lntue on Sep 14 2022, 8:51 AM.

Download Raw Diff

Details

Reviewers

michaelrj
sivachandra
orex
zimmermann6

Commits

rGe6226e6b7234: [libc][math] Improve exp2f performance.

Summary

Reduce the number of subintervals that need lookup table and optimize
the evaluation steps.

Currently, exp2f is computed by reducing to 2^hi * 2^mid * 2^lo where
-16/32 <= mid <= 15/32 and -1/64 <= lo <= 1/64, and 2^lo is then
approximated by a degree 6 polynomial.

Experiment with Sollya showed that by using a degree 6 polynomial, we
can approximate 2^lo for a bigger range with reasonable errors:

> P = fpminimax((2^x - 1)/x, 5, [|D...|], [-1/64, 1/64]);
> dirtyinfnorm(2^x - 1 - x*P, [-1/64, 1/64]);
0x1.e18a1bc09114def49eb851655e2e5c4dd08075ac2p-63

> P = fpminimax((2^x - 1)/x, 5, [|D...|], [-1/32, 1/32]);
> dirtyinfnorm(2^x - 1 - x*P, [-1/32, 1/32]);
0x1.05627b6ed48ca417fe53e3495f7df4baf84a05e2ap-56

So we can optimize the implementation a bit with:

Reduce the range to mid = i/16 for i = 0..15 and -1/32 <= lo <= 1/32
Store the table 2^mid in bits, and add hi directly to its exponent field to compute 2^hi * 2^mid
Rearrange the order of evaluating the polynomial approximating 2^lo.

Performance benchmark using perf tool from the CORE-MATH project on Ryzen 1700:

$ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh exp2f
GNU libc version: 2.35
GNU libc release: stable
CORE-MATH reciprocal throughput   : 9.534
System LIBC reciprocal throughput : 6.229

BEFORE:
LIBC reciprocal throughput        : 21.405
LIBC reciprocal throughput        : 15.241    (with `-msse4.2` flag)
LIBC reciprocal throughput        : 11.111    (with `-mfma` flag)

AFTER:
LIBC reciprocal throughput        : 18.617
LIBC reciprocal throughput        : 12.852    (with `-msse4.2` flag)
LIBC reciprocal throughput        : 9.253     (with `-mfma` flag)

$ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh exp2f --latency
GNU libc version: 2.35
GNU libc release: stable
CORE-MATH latency   : 40.869
System LIBC latency : 30.580

BEFORE
LIBC latency        : 64.888
LIBC latency        : 61.027    (with `-msse4.2` flag)
LIBC latency        : 48.778    (with `-mfma` flag)

AFTER
LIBC latency        : 48.803
LIBC latency        : 45.047    (with `-msse4.2` flag)
LIBC latency        : 37.487    (with `-mfma` flag)

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

lntue created this revision.Sep 14 2022, 8:51 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptSep 14 2022, 8:51 AM

Herald added subscribers: ecnelises, tschuett, mgorny. · View Herald Transcript

lntue requested review of this revision.Sep 14 2022, 8:51 AM

Harbormaster completed remote builds in B186645: Diff 460121.Sep 14 2022, 8:58 AM

lntue edited the summary of this revision. (Show Details)Sep 14 2022, 9:26 AM

orex accepted this revision.Sep 14 2022, 11:22 AM

This revision is now accepted and ready to land.Sep 14 2022, 11:22 AM

sivachandra accepted this revision.Sep 14 2022, 11:25 AM

Closed by commit rGe6226e6b7234: [libc][math] Improve exp2f performance. (authored by lntue). · Explain WhySep 14 2022, 11:45 AM

This revision was automatically updated to reflect the committed changes.

lntue added a commit: rGe6226e6b7234: [libc][math] Improve exp2f performance..

lntue mentioned this in D133913: [libc][math] Improve sinhf and coshf performance..Sep 14 2022, 10:46 PM

I confirm the improvement, and the function is still correctly rounded.

lntue mentioned this in rG1c89ae71ea69: [libc][math] Improve sinhf and coshf performance..Sep 15 2022, 6:21 AM

lntue mentioned this in D134002: [libc][math] Improve tanhf performance..Sep 15 2022, 5:53 PM

lntue mentioned this in rG4973eee12286: [libc][math] Improve tanhf performance..Sep 19 2022, 5:43 AM

Revision Contents

Path

Size

libc/

docs/

math.rst

4 lines

src/

math/

generic/

CMakeLists.txt

2 lines

exp2f.cpp

77 lines

explogxf.h

21 lines

test/

src/

math/

exhaustive/

exp2f_test.cpp

8 lines

explogxf_test.cpp

9 lines

Diff 460168

libc/docs/math.rst

	Show First 20 Lines • Show All 213 Lines • ▼ Show 20 Lines
	\| atanhf \| 20 \| 66 \| 71 \| 133 \| :math:`[-1, 1]` \| Ryzen 1700 \| Ubuntu 22.04 LTS x86_64 \| Clang 14.0.0 \| FMA \|			\| atanhf \| 20 \| 66 \| 71 \| 133 \| :math:`[-1, 1]` \| Ryzen 1700 \| Ubuntu 22.04 LTS x86_64 \| Clang 14.0.0 \| FMA \|
	+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+			+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+
	\| cosf \| 13 \| 32 \| 53 \| 59 \| :math:`[0, 2\pi]` \| Ryzen 1700 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|			\| cosf \| 13 \| 32 \| 53 \| 59 \| :math:`[0, 2\pi]` \| Ryzen 1700 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|
	+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+			+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+
	\| coshf \| 23 \| 20 \| 73 \| 49 \| :math:`[-10, 10]` \| Ryzen 1700 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|			\| coshf \| 23 \| 20 \| 73 \| 49 \| :math:`[-10, 10]` \| Ryzen 1700 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|
	+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+			+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+
	\| expf \| 9 \| 7 \| 44 \| 38 \| :math:`[-10, 10]` \| Ryzen 1700 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|			\| expf \| 9 \| 7 \| 44 \| 38 \| :math:`[-10, 10]` \| Ryzen 1700 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|
	+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+			+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+
	\| exp2f \| 8 \| 6 \| 35 \| 23 \| :math:`[-10, 10]` \| i5-1135G7 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|			\| exp2f \| 9 \| 6 \| 37 \| 31 \| :math:`[-10, 10]` \| Ryzen 1700 \| Ubuntu 22.04 LTS x86_64 \| Clang 14.0.0 \| FMA \|
	+ +-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+
	\| \| 11 \| 6 \| 49 \| 31 \| :math:`[-10, 10]` \| Ryzen 1700 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|
	+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+			+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+
	\| expm1f \| 9 \| 44 \| 42 \| 121 \| :math:`[-10, 10]` \| Ryzen 1700 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|			\| expm1f \| 9 \| 44 \| 42 \| 121 \| :math:`[-10, 10]` \| Ryzen 1700 \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| FMA \|
	+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+			+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+
	\| fmodf \| 73 \| 263 \| - \| - \| [MIN_NORMAL, MAX_NORMAL] \| i5 mobile \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| \|			\| fmodf \| 73 \| 263 \| - \| - \| [MIN_NORMAL, MAX_NORMAL] \| i5 mobile \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| \|
	\| +-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+			\| +-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+
	\| \| 9 \| 11 \| - \| - \| [0, MAX_SUBNORMAL] \| i5 mobile \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| \|			\| \| 9 \| 11 \| - \| - \| [0, MAX_SUBNORMAL] \| i5 mobile \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| \|
	+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+			+--------------+-----------+-------------------+-----------+-------------------+-------------------------------------+------------+-------------------------+--------------+---------------+
	\| fmod \| 595 \| 3297 \| - \| - \| [MIN_NORMAL, MAX_NORMAL] \| i5 mobile \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| \|			\| fmod \| 595 \| 3297 \| - \| - \| [MIN_NORMAL, MAX_NORMAL] \| i5 mobile \| Ubuntu 20.04 LTS x86_64 \| Clang 12.0.0 \| \|
	Show All 33 Lines

libc/src/math/generic/CMakeLists.txt

	Show First 20 Lines • Show All 543 Lines • ▼ Show 20 Lines

	add_entrypoint_object(			add_entrypoint_object(
	exp2f			exp2f
	SRCS			SRCS
	exp2f.cpp			exp2f.cpp
	HDRS			HDRS
	../exp2f.h			../exp2f.h
	DEPENDS			DEPENDS
	.common_constants
	.explogxf
	libc.src.__support.FPUtil.fenv_impl			libc.src.__support.FPUtil.fenv_impl
	libc.src.__support.FPUtil.fp_bits			libc.src.__support.FPUtil.fp_bits
	libc.src.__support.FPUtil.multiply_add			libc.src.__support.FPUtil.multiply_add
	libc.src.__support.FPUtil.nearest_integer			libc.src.__support.FPUtil.nearest_integer
	libc.src.__support.FPUtil.polyeval			libc.src.__support.FPUtil.polyeval
	libc.include.errno			libc.include.errno
	libc.src.errno.errno			libc.src.errno.errno
	libc.include.math			libc.include.math
	▲ Show 20 Lines • Show All 777 Lines • Show Last 20 Lines

libc/src/math/generic/exp2f.cpp

	//===-- Single-precision 2^x function -------------------------------------===//			//===-- Single-precision 2^x function -------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "src/math/exp2f.h"			#include "src/math/exp2f.h"
	#include "common_constants.h"
	#include "explogxf.h"
	#include "src/__support/FPUtil/FEnvImpl.h"			#include "src/__support/FPUtil/FEnvImpl.h"
	#include "src/__support/FPUtil/FPBits.h"			#include "src/__support/FPUtil/FPBits.h"
	#include "src/__support/FPUtil/PolyEval.h"			#include "src/__support/FPUtil/PolyEval.h"
	#include "src/__support/FPUtil/multiply_add.h"			#include "src/__support/FPUtil/multiply_add.h"
	#include "src/__support/FPUtil/nearest_integer.h"			#include "src/__support/FPUtil/nearest_integer.h"
	#include "src/__support/common.h"			#include "src/__support/common.h"

	#include <errno.h>			#include <errno.h>

	namespace __llvm_libc {			namespace __llvm_libc {

	constexpr uint32_t exval1 = 0x3b42'9d37U;			constexpr uint32_t exval1 = 0x3b42'9d37U;
	constexpr uint32_t exval2 = 0xbcf3'a937U;			constexpr uint32_t exval2 = 0xbcf3'a937U;
	constexpr uint32_t exval_mask = exval1 & exval2;			constexpr uint32_t exval_mask = exval1 & exval2;

				// Look up table for bit fields of 2^(i/16) for i = 0..15, generated by Sollya
				// with:
				// > for i from 0 to 15 do printdouble(round(2^(i/16), D, RN));
				constexpr int64_t EXP_2_M[16] = {
				0x3ff0000000000000, 0x3ff0b5586cf9890f, 0x3ff172b83c7d517b,
				0x3ff2387a6e756238, 0x3ff306fe0a31b715, 0x3ff3dea64c123422,
				0x3ff4bfdad5362a27, 0x3ff5ab07dd485429, 0x3ff6a09e667f3bcd,
				0x3ff7a11473eb0187, 0x3ff8ace5422aa0db, 0x3ff9c49182a3f090,
				0x3ffae89f995ad3ad, 0x3ffc199bdd85529c, 0x3ffd5818dcfba487,
				0x3ffea4afa2a490da};

	LLVM_LIBC_FUNCTION(float, exp2f, (float x)) {			LLVM_LIBC_FUNCTION(float, exp2f, (float x)) {
	using FPBits = typename fputil::FPBits<float>;			using FPBits = typename fputil::FPBits<float>;
	FPBits xbits(x);			FPBits xbits(x);

	uint32_t x_u = xbits.uintval();			uint32_t x_u = xbits.uintval();
	uint32_t x_abs = x_u & 0x7fff'ffffU;			uint32_t x_abs = x_u & 0x7fff'ffffU;

	// // When \|x\| >= 128, \|x\| < 2^-25, or x is nan
	if (unlikely(x_abs >= 0x4300'0000U \|\| x_abs <= 0x3280'0000U)) {
	// \|x\| < 2^-25			// \|x\| < 2^-25
	if (x_abs <= 0x3280'0000U) {			if (unlikely(x_abs <= 0x3280'0000U)) {
	return 1.0f + x;			return 1.0f + x;
	}			}

				// // When \|x\| >= 128, or x is nan
				if (unlikely(x_abs >= 0x4300'0000U)) {

	// x >= 128			// x >= 128
	if (!xbits.get_sign()) {			if (!xbits.get_sign()) {
	// x is finite			// x is finite
	if (x_u < 0x7f80'0000U) {			if (x_u < 0x7f80'0000U) {
	int rounding = fputil::get_round();			int rounding = fputil::get_round();
	if (rounding == FE_DOWNWARD \|\| rounding == FE_TOWARDZERO)			if (rounding == FE_DOWNWARD \|\| rounding == FE_TOWARDZERO)
	return static_cast<float>(FPBits(FPBits::MAX_NORMAL));			return static_cast<float>(FPBits(FPBits::MAX_NORMAL));

	errno = ERANGE;			errno = ERANGE;
	}			}
	// x is +inf or nan			// x is +inf or nan
	return x + FPBits::inf().get_val();			return x + FPBits::inf().get_val();
	}			}
	// x < -150			// x <= -150
	if (x_u >= 0xc316'0000U) {			if (x_u >= 0xc316'0000U) {
	// exp(-Inf) = 0			// exp(-Inf) = 0
	if (xbits.is_inf())			if (xbits.is_inf())
	return 0.0f;			return 0.0f;
	// exp(nan) = nan			// exp(nan) = nan
	if (xbits.is_nan())			if (xbits.is_nan())
	return x;			return x;
	if (fputil::get_round() == FE_UPWARD)			if (fputil::get_round() == FE_UPWARD)
	return FPBits(FPBits::MIN_SUBNORMAL).get_val();			return FPBits(FPBits::MIN_SUBNORMAL).get_val();
	if (x != 0.0f)			if (x != 0.0f)
	errno = ERANGE;			errno = ERANGE;
	return 0.0f;			return 0.0f;
	}			}
	}			}

				// Check exceptional values.
	if (unlikely(x_u & exval_mask) == exval_mask) {			if (unlikely(x_u & exval_mask) == exval_mask) {
	if (unlikely(x_u == exval1)) { // x = 0x1.853a6ep-9f			if (unlikely(x_u == exval1)) { // x = 0x1.853a6ep-9f
	if (fputil::get_round() == FE_TONEAREST)			if (fputil::get_round() == FE_TONEAREST)
	return 0x1.00870ap+0f;			return 0x1.00870ap+0f;
	} else if (unlikely(x_u == exval2)) { // x = -0x1.e7526ep-6f			} else if (unlikely(x_u == exval2)) { // x = -0x1.e7526ep-6f
	if (fputil::get_round() == FE_TONEAREST)			if (fputil::get_round() == FE_TONEAREST)
	return 0x1.f58d62p-1f;			return 0x1.f58d62p-1f;
	}			}
	}			}

	return exp2_eval(x);			// For -150 < x < 128, to compute 2^x, we perform the following range
				// reduction: find hi, mid, lo such that:
				// x = hi + mid + lo, in which
				// hi is an integer,
				// 0 <= mid * 2^4 < 16 is an integer
				// -2^(-5) <= lo <= 2^-5.
				// In particular,
				// hi + mid = round(x * 2^4) * 2^(-4).
				// Then,
				// 2^x = 2^(hi + mid + lo) = 2^hi * 2^mid * 2^lo.
				// 2^mid is stored in the lookup table EXP_2_M of 16 elements.
				// 2^lo is computed using a degree-6 minimax polynomial
				// generated by Sollya.
				// We perform 2^hi * 2^lo by simply add hi to the exponent field
				// of 2^mid.

				// kf = (hi + mid) * 2^4 = round(x * 2^4)
				float kf = fputil::nearest_integer(x * 16.0f);
				// dx = lo = x - (hi + mid) = x - kf * 2^(-4)
				double dx = fputil::multiply_add(-0x1.0p-4f, kf, x);

				int k = static_cast<int>(kf);
				// hi = floor(kf * 2^(-4))
				// exp_hi = shift hi to the exponent field of double precision.
				int64_t exp_hi = static_cast<int64_t>(k >> 4)
				<< fputil::FloatProperties<double>::MANTISSA_WIDTH;
				// mh = 2^hi * 2^mid
				// mh_bits = bit field of mh
				int64_t mh_bits = EXP_2_M[k & 15] + exp_hi;
				double mh = fputil::FPBits<double>(uint64_t(mh_bits)).get_val();

				// Degree-5 polynomial approximating (2^x - 1)/x generating by Sollya with:
				// > P = fpminimax((2^x - 1)/x, 5, [\|D...\|], [-1/32. 1/32]);
				constexpr double COEFFS[6] = {0x1.62e42fefa39f3p-1, 0x1.ebfbdff82c57bp-3,
				0x1.c6b08d6f2d7aap-5, 0x1.3b2ab6fc92f5dp-7,
				0x1.5d897cfe27125p-10, 0x1.43090e61e6af1p-13};
				double dx_sq = dx * dx;
				double c1 = fputil::multiply_add(dx, COEFFS[1], COEFFS[0]);
				double c2 = fputil::multiply_add(dx, COEFFS[3], COEFFS[2]);
				double c3 = fputil::multiply_add(dx, COEFFS[5], COEFFS[4]);
				double p = fputil::polyeval(dx_sq, c1, c2, c3);
				// 2^x = 2^(hi + mid + lo)
				// = 2^(hi + mid) * 2^lo
				// ~ mh * (1 + lo * P(lo))
				// = mh + (mhlo) P(lo)
				return fputil::multiply_add(p, dx * mh, mh);
	}			}

	} // namespace __llvm_libc			} // namespace __llvm_libc

libc/src/math/generic/explogxf.h

Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines	inline static exe_eval_result_t exp_eval(double x) {
double pe = dx * fputil::polyeval(dx, 1.0, 0x1.0p-1, 0x1.5555555555555p-3,		double pe = dx * fputil::polyeval(dx, 1.0, 0x1.0p-1, 0x1.5555555555555p-3,
0x1.5555555555555p-5, 0x1.1111111111111p-7,		0x1.5555555555555p-5, 0x1.1111111111111p-7,
0x1.6c16c16c16c17p-10);		0x1.6c16c16c16c17p-10);

double r = fputil::multiply_add(ml, pe, pe) + ml;		double r = fputil::multiply_add(ml, pe, pe) + ml;
return {mult_e1, r};		return {mult_e1, r};
}		}

inline static double exp2_eval(double x) {
double kf = fputil::nearest_integer(x * mlp);
double dx = fputil::multiply_add(mmld, kf, x);
double mult_f, ml;
{
uint32_t ps = static_cast<int>(kf) + (1 << (EXP_bits_p - 1)) +
(fputil::FPBits<double>::EXPONENT_BIAS << EXP_bits_p);
fputil::FPBits<double> bs;
bs.set_unbiased_exponent(ps >> EXP_bits_p);
ml = 1.0 + EXP_2_POW[ps & (EXP_num_p - 1)];
mult_f = bs.get_val();
}

// Taylor series coefficients for 2^x
double pe = fputil::polyeval(
dx, 1.0, 0x1.62e42fefa39efp-1, 0x1.ebfbdff82c58fp-3, 0x1.c6b08d704a0c0p-5,
0x1.3b2ab6fba4e77p-7, 0x1.5d87fe78a6731p-10, 0x1.430912f86c787p-13);

return mult_f * ml * pe;
}

// x should be positive, normal finite value		// x should be positive, normal finite value
inline static double log2_eval(double x) {		inline static double log2_eval(double x) {
using FPB = fputil::FPBits<double>;		using FPB = fputil::FPBits<double>;
FPB bs(x);		FPB bs(x);

double result = 0;		double result = 0;
result += bs.get_exponent();		result += bs.get_exponent();

Show All 29 Lines

libc/test/src/math/exhaustive/exp2f_test.cpp

Show All 26 Lines	do {
float x = float(xbits);		float x = float(xbits);
result &= EXPECT_MPFR_MATCH(mpfr::Operation::Exp2, x,		result &= EXPECT_MPFR_MATCH(mpfr::Operation::Exp2, x,
__llvm_libc::exp2f(x), 0.5, rounding);		__llvm_libc::exp2f(x), 0.5, rounding);
} while (bits++ < stop);		} while (bits++ < stop);
return result;		return result;
}		}
};		};

// Range: [0, 128];		// Range: [0, +Inf];
static constexpr uint32_t POS_START = 0x0000'0000U;		static constexpr uint32_t POS_START = 0x0000'0000U;
static constexpr uint32_t POS_STOP = 0x4300'0000U;		static constexpr uint32_t POS_STOP = 0x7f80'0000U;

TEST_F(LlvmLibcExp2fExhaustiveTest, PostiveRangeRoundNearestTieToEven) {		TEST_F(LlvmLibcExp2fExhaustiveTest, PostiveRangeRoundNearestTieToEven) {
test_full_range(POS_START, POS_STOP, mpfr::RoundingMode::Nearest);		test_full_range(POS_START, POS_STOP, mpfr::RoundingMode::Nearest);
}		}

TEST_F(LlvmLibcExp2fExhaustiveTest, PostiveRangeRoundUp) {		TEST_F(LlvmLibcExp2fExhaustiveTest, PostiveRangeRoundUp) {
test_full_range(POS_START, POS_STOP, mpfr::RoundingMode::Upward);		test_full_range(POS_START, POS_STOP, mpfr::RoundingMode::Upward);
}		}

TEST_F(LlvmLibcExp2fExhaustiveTest, PostiveRangeRoundDown) {		TEST_F(LlvmLibcExp2fExhaustiveTest, PostiveRangeRoundDown) {
test_full_range(POS_START, POS_STOP, mpfr::RoundingMode::Downward);		test_full_range(POS_START, POS_STOP, mpfr::RoundingMode::Downward);
}		}

TEST_F(LlvmLibcExp2fExhaustiveTest, PostiveRangeRoundTowardZero) {		TEST_F(LlvmLibcExp2fExhaustiveTest, PostiveRangeRoundTowardZero) {
test_full_range(POS_START, POS_STOP, mpfr::RoundingMode::TowardZero);		test_full_range(POS_START, POS_STOP, mpfr::RoundingMode::TowardZero);
}		}

// Range: [-150, 0];		// Range: [-Inf, 0];
static constexpr uint32_t NEG_START = 0x8000'0000U;		static constexpr uint32_t NEG_START = 0x8000'0000U;
static constexpr uint32_t NEG_STOP = 0xc316'0000U;		static constexpr uint32_t NEG_STOP = 0xff80'0000U;

TEST_F(LlvmLibcExp2fExhaustiveTest, NegativeRangeRoundNearestTieToEven) {		TEST_F(LlvmLibcExp2fExhaustiveTest, NegativeRangeRoundNearestTieToEven) {
test_full_range(NEG_START, NEG_STOP, mpfr::RoundingMode::Nearest);		test_full_range(NEG_START, NEG_STOP, mpfr::RoundingMode::Nearest);
}		}

TEST_F(LlvmLibcExp2fExhaustiveTest, NegativeRangeRoundUp) {		TEST_F(LlvmLibcExp2fExhaustiveTest, NegativeRangeRoundUp) {
test_full_range(NEG_START, NEG_STOP, mpfr::RoundingMode::Upward);		test_full_range(NEG_START, NEG_STOP, mpfr::RoundingMode::Upward);
}		}

TEST_F(LlvmLibcExp2fExhaustiveTest, NegativeRangeRoundDown) {		TEST_F(LlvmLibcExp2fExhaustiveTest, NegativeRangeRoundDown) {
test_full_range(NEG_START, NEG_STOP, mpfr::RoundingMode::Downward);		test_full_range(NEG_START, NEG_STOP, mpfr::RoundingMode::Downward);
}		}

TEST_F(LlvmLibcExp2fExhaustiveTest, NegativeRangeRoundTowardZero) {		TEST_F(LlvmLibcExp2fExhaustiveTest, NegativeRangeRoundTowardZero) {
test_full_range(NEG_START, NEG_STOP, mpfr::RoundingMode::TowardZero);		test_full_range(NEG_START, NEG_STOP, mpfr::RoundingMode::TowardZero);
}		}

libc/test/src/math/explogxf_test.cpp

Show All 33 Lines	TEST(LlvmLibcExpxfTest, InFloatRange) {
auto f_check = [](float x) -> bool {		auto f_check = [](float x) -> bool {
return !(		return !(
(isnan(x) \|\| isinf(x) \|\| x < -70 \|\| x > 70 \|\| fabsf(x) < 0x1.0p-10));		(isnan(x) \|\| isinf(x) \|\| x < -70 \|\| x > 70 \|\| fabsf(x) < 0x1.0p-10));
};		};
CHECK_DATA(0.0f, neg_inf, mpfr::Operation::Exp, fx, f_check, def_count,		CHECK_DATA(0.0f, neg_inf, mpfr::Operation::Exp, fx, f_check, def_count,
def_prec);		def_prec);
}		}

TEST(LlvmLibcExp2xfTest, InFloatRange) {
auto f_check = [](float x) -> bool {
return !(
(isnan(x) \|\| isinf(x) \|\| x < -130 \|\| x > 130 \|\| fabsf(x) < 0x1.0p-10));
};
CHECK_DATA(0.0f, neg_inf, mpfr::Operation::Exp2, __llvm_libc::exp2_eval,
f_check, def_count, def_prec);
}

TEST(LlvmLibcLog2xfTest, InFloatRange) {		TEST(LlvmLibcLog2xfTest, InFloatRange) {
CHECK_DATA(0.0f, inf, mpfr::Operation::Log2, __llvm_libc::log2_eval, f_normal,		CHECK_DATA(0.0f, inf, mpfr::Operation::Log2, __llvm_libc::log2_eval, f_normal,
def_count, def_prec);		def_count, def_prec);
}		}

TEST(LlvmLibcLogxfTest, InFloatRange) {		TEST(LlvmLibcLogxfTest, InFloatRange) {
CHECK_DATA(0.0f, inf, mpfr::Operation::Log, __llvm_libc::log_eval, f_normal,		CHECK_DATA(0.0f, inf, mpfr::Operation::Log, __llvm_libc::log_eval, f_normal,
def_count, def_prec);		def_count, def_prec);
}		}