This is an archive of the discontinued LLVM Phabricator instance.


Differential D105561

[libc] Capture floating point encoding and arrange it sequentially in memory
ClosedPublic

Authored by hedingarcia on Jul 7 2021, 10:12 AM.

Details

Reviewers

aeubanks
sivachandra
lntue

Commits

rGa5a337e55ed2: [libc] Capture floating point encoding and arrange it sequentially in memory

Summary

Redefined FPBits.h and LongDoubleBitsX86.h so that the implementation works on both Windows
and Linux while keeping the floating point encoding packed in memory, so that its size
in memory is the same as that of the corresponding floating point type.
This change was necessary because the previous __attribute__((packed)) specification on the struct did not work
on Windows the way it did on Linux, and consequently the static_asserts in FPBits.h were failing.
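
The idea in the summary can be sketched as follows: rather than relying on a packed bit-field struct, the whole encoding is kept in one unsigned integer of the same size as the floating point type, and the fields are read with shifts and masks. This is only a minimal sketch for `float`; the name `FloatBitsSketch` is hypothetical and the code is not the FPBits implementation from this patch. It assumes `float` is IEEE 754 single precision.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical stand-in, NOT the FPBits from this patch.
// Assumes `float` is IEEE 754 binary32: 23 mantissa bits, 8 exponent bits,
// 1 sign bit.
struct FloatBitsSketch {
  static constexpr unsigned mantissaWidth = 23;
  static constexpr unsigned exponentWidth = 8;
  static constexpr uint32_t mantissaMask = (uint32_t(1) << mantissaWidth) - 1;
  static constexpr uint32_t exponentMask = (uint32_t(1) << exponentWidth) - 1;

  // The only data member: the raw encoding.
  uint32_t bits = 0;

  explicit FloatBitsSketch(float x) {
    // memcpy copies the object representation without type punning.
    std::memcpy(&bits, &x, sizeof(bits));
  }

  uint32_t getMantissa() const { return bits & mantissaMask; }

  // Returns the stored exponent field, using the method name from the diff.
  uint16_t getUnbiasedExponent() const {
    return uint16_t((bits >> mantissaWidth) & exponentMask);
  }

  bool getSign() const {
    return (bits >> (mantissaWidth + exponentWidth)) != 0;
  }
};

// The size no longer depends on how a compiler packs bit-fields.
static_assert(sizeof(FloatBitsSketch) == sizeof(float),
              "encoding must be as large as the float type");
```

Because the layout is spelled out with shifts and masks, the size check holds without any compiler-specific packing attribute.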

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hedingarcia created this revision. Jul 7 2021, 10:12 AM

Herald added a project: Restricted Project. Jul 7 2021, 10:12 AM

Herald added subscribers: libc-commits, ecnelises, tschuett, pengfei.

hedingarcia requested review of this revision. Jul 7 2021, 10:12 AM

hedingarcia edited the summary of this revision. Jul 7 2021, 10:19 AM

hedingarcia added a subscriber: sivachandra.

Harbormaster completed remote builds in B112807: Diff 356989. Jul 7 2021, 10:42 AM

Fixed the error raised when running the tests in LdExpTest.h by ensuring that the value passed when setting the mantissa
does not overflow into the exponent bits in FPBits.h and LongDoubleBitsX86.h.
Added asserts to detect those unintentional overflows in setMantissa() and setExponent().

the title should say something about creating a custom packed int, the getter and setter part is a small detail that shouldn't be in the title

libc/utils/FPUtil/FPBits.h
21–22	since we're basically just forwarding MantissaWidth to FloatProperties::mantissaWidth, we can delete all the specializations below and do template <typename T> struct MantissaWidth { static constexpr unsigned value = FloatProperties<T>::mantissaWidth; }; Or even better, delete these and only use FloatProperties::mantissaWidth. But maybe that can be done in a follow up patch.
45	I think we don't need the `__llvm_libc::fputil::` since we're already inside that namespace
45–78	we should replace all uses of `integer` with `encoding.valueFP`
48	I don't really like `valueFP`, how about something like `bits`
54	these comments should probably be deleted, they're not super useful
79	this is very confusing, can we just use `mantissaWidth`?
91	`bool`?
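
The forwarding suggested in the comment on lines 21–22 can be sketched like this. The struct `FloatPropertiesSketch` is a hypothetical, trimmed-down stand-in for `FloatProperties` (only the members needed here are shown), and `MantissaWidthSketch` is likewise a made-up name; the widths are the ones that appear in the old specializations in the diff.

```cpp
#include <cstdint>

// Hypothetical stand-in for FloatProperties; the real struct is not
// reproduced here.
template <typename T> struct FloatPropertiesSketch {};

template <> struct FloatPropertiesSketch<float> {
  using BitsType = uint32_t;
  static constexpr unsigned mantissaWidth = 23;
  static constexpr unsigned exponentWidth = 8;
};

template <> struct FloatPropertiesSketch<double> {
  using BitsType = uint64_t;
  static constexpr unsigned mantissaWidth = 52;
  static constexpr unsigned exponentWidth = 11;
};

// A single primary template forwards to the properties struct, so no
// per-type specializations of the width trait are needed.
template <typename T> struct MantissaWidthSketch {
  static constexpr unsigned value = FloatPropertiesSketch<T>::mantissaWidth;
};

static_assert(MantissaWidthSketch<float>::value == 23, "float mantissa width");
static_assert(MantissaWidthSketch<double>::value == 52, "double mantissa width");
```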

hedingarcia retitled this revision from "[libc] (WIP) Modify the struct that captures floating point encoding to have setters and getters" to "[libc] Creating a struct that captures floating point encoding and manually arranges it sequentially in memory". Jul 8 2021, 9:37 AM

hedingarcia edited the summary of this revision.

hedingarcia added reviewers: sivachandra, lntue.

hedingarcia removed a subscriber: sivachandra.

Harbormaster completed remote builds in B113002: Diff 357245. Jul 8 2021, 10:10 AM

[libc] Fix of the patch, refactored FPBits.h to have template structs

Made one template struct for MantissaWidth, one for ExponentWidth, and another for FPUIntType.
Removed namespaced calls, comments and changed variable name valueFP to bits in FPBits.h and LongDoubleBitsX86.h.
Changed the return data type of getSign() from bool to uint8_t.

the commit message is a bit verbose:

no need to say that the patch was fixed
no need to say that you updated all getters/setters of exponent/mantissa/sign, that is a direct and obvious consequence of FPBits work
no need to say that we're using FloatProperties.h, that's obvious if you look at the patch
the ultimate reason for this patch isn't that static_asserts are failing, it's that __attribute__((packed)) isn't portable and doesn't work on Windows

in general the commit message should be more of a "why" rather than a "what are the steps I took to create this patch"; try to figure out exactly what a reader of this patch needs to know and don't over-explain details since that clutters the message
if something tricky is being done then that should be explained of course, but most of this patch is self-explanatory if you know the reasoning behind the change

libc/utils/FPUtil/FPBits.h
56	@sivachandra is this the right way to assert?
83	not sure if this should be a bool or a uint8_t... let's see what the others say

hedingarcia marked 6 inline comments as done. Jul 8 2021, 2:28 PM

hedingarcia added inline comments.

libc/utils/FPUtil/FPBits.h
79	The reason we cannot use mantissaWidth is that in the 80-bit long double implementation the value of that field is 63 instead of 64. Bit 63 holds an explicit bit, the integer part of the significand (https://en.wikipedia.org/wiki/Extended_precision ). This exception leads to the long expression in getExponent(): with only the value of mantissaWidth, the shift would still need to move one more bit in order to return all the bits of the exponent.

aeubanks added inline comments. Jul 8 2021, 2:32 PM

libc/utils/FPUtil/FPBits.h
79	I thought that only applied to the version in LongDoubleBitsX86

hedingarcia added inline comments. Jul 8 2021, 2:44 PM

libc/utils/FPUtil/FPBits.h
79	Yes. However, should the implementation in FPBits.h be kept the same as the one in LongDoubleBitsX86.h?

hedingarcia edited the summary of this revision. Jul 8 2021, 2:55 PM

hedingarcia edited the summary of this revision. Jul 8 2021, 3:01 PM

Harbormaster completed remote builds in B113086: Diff 357356. Jul 8 2021, 3:22 PM

hedingarcia edited the summary of this revision. Jul 9 2021, 6:19 AM

[libc] Fixing shift in getExponent() and setExponent()

Refactoring the previous shift to retrieve and set the exponent
with the mantissaWidth value in FPBits.h

hedingarcia marked 2 inline comments as done. Jul 9 2021, 6:36 AM

hedingarcia added inline comments.

libc/utils/FPUtil/FPBits.h
79	I see what you mean now. Since LongDoubleBitsX86.h takes care of the long double implementation, only that file needs to take the explicit bit into consideration. FPBits.h does not have to work around this case since it is used for single and double precision. Thank you for catching that detail.
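
To make the explicit-bit discussion concrete, here is a rough sketch of the 80-bit x87 layout held in a 128-bit integer. The names `X87BitsSketch` and `makeX87One` are hypothetical, the code is not the LongDoubleBitsX86.h implementation, and it assumes a compiler that provides `unsigned __int128` (GCC and Clang do on 64-bit targets). It shows why shifting by mantissaWidth alone is not enough for this format.

```cpp
#include <cstdint>

// Assumption: the compiler supports `unsigned __int128`.
using UInt128 = unsigned __int128;

// Hypothetical illustration of the x87 80-bit layout:
//   bits 0..62  fraction (mantissaWidth = 63)
//   bit  63     explicit integer bit of the significand
//   bits 64..78 exponent (15 bits)
//   bit  79     sign
struct X87BitsSketch {
  static constexpr unsigned mantissaWidth = 63;
  static constexpr unsigned exponentWidth = 15;
  static constexpr UInt128 exponentFieldMask =
      (UInt128(1) << exponentWidth) - 1;

  UInt128 bits;

  bool getExplicitBit() const { return ((bits >> mantissaWidth) & 1) != 0; }

  // Correct: skip the fraction bits AND the explicit bit.
  uint16_t getUnbiasedExponent() const {
    return uint16_t((bits >> (mantissaWidth + 1)) & exponentFieldMask);
  }

  // Shifting by mantissaWidth only leaves the explicit bit in the result.
  uint16_t exponentShiftedByMantissaWidthOnly() const {
    return uint16_t((bits >> mantissaWidth) & exponentFieldMask);
  }
};

// Encoding of 1.0 in this format: sign 0, exponent field 0x3FFF,
// explicit bit set, fraction bits zero.
inline X87BitsSketch makeX87One() {
  return X87BitsSketch{(UInt128(0x3FFF) << 64) | (UInt128(1) << 63)};
}
```

For the encoding of 1.0, the correct shift yields 0x3FFF, while shifting by mantissaWidth alone yields 0x7FFF because the explicit bit lands in the lowest position of the result. For `float` and `double` there is no explicit bit, so a plain shift by mantissaWidth is sufficient there.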

Harbormaster completed remote builds in B113189: Diff 357496. Jul 9 2021, 7:08 AM

hedingarcia retitled this revision from "[libc] Creating a struct that captures floating point encoding and manually arranges it sequentially in memory" to "[libc] Capture floating point encoding and arrange it sequentially in memory with structs". Jul 9 2021, 8:23 AM

mostly looks good

libc/utils/FPUtil/FPBits.h
44–45	`FloatProp::BitsType` seems more consistent (you can move the `using FloatProp` here)
55	the next assert will never fail with this line (same for setExponent())

[libc] Rearranging alias declarations and removing asserts

hedingarcia marked 2 inline comments as done. Jul 9 2021, 10:57 AM

Harbormaster completed remote builds in B113245: Diff 357573. Jul 9 2021, 12:03 PM

Sorry for the delay in the review. I have gone through the history and it mostly looks good. I have a few comments about the cosmetics.

libc/utils/FPUtil/FPBits.h
29–30	Can this struct be removed now?
44–78	I would actually prefer if we got rid of `encoding` completely [1]. Adding the new methods to `FPBits` directly would reduce the verbosity when calling them. The integer value is currently called `integer`, but as @aeubanks suggests elsewhere, `bits` may be more appropriate. However, to reduce the churn, you can choose to keep it as `integer` or `bits` depending on what's convenient for you. [1] - We had a separate `encoding` field because it was essentially that when we could use bit-fields. Moreover, it helped us distinguish it from the other fields. With that gone, we can choose to remove it and eliminate that "complexity" and verbosity.
45	For consistency, I would name `UIntType` as `BitsType` here as well. But, you can choose to add a `TODO` here and do that "cleanup" in a follow up change.
53	Why are these methods required? Probably because you got rid of the `integer` field of `FPBits`? If you add the rest of the methods to `FPBits` directly (see below), this should not be required as the `integer` / `bits` value can be directly set or read.
56	Not sure what line this pertains to. But, after this change, the only assert that would be required is something like: assert(sizeof(T) == sizeof(UIntVal), ...);
69	You should probably name this method `unbiasedExponent` to distinguish it from the real exponent value returned by `getExponent()`.
83	Which line was this comment originally for?

aeubanks added inline comments. Jul 12 2021, 9:21 AM

libc/utils/FPUtil/FPBits.h
29–30	I'd say clean this, along with Exponent/MantissaWidth up in a later patch, there are lots of uses of these
56	oh, I think the code was deleted for some reason. I spent a while debugging a failure Hedin was running into. Turns out that sometimes, when setting the mantissa/exponent, the value was actually larger than the max value (IIRC at the very bottom of NormalFloat.h). The struct bit-field automatically took care of that, but here we needed to mask out the extra bits. The question is whether to do that in the setter or in the caller. If we do it in the setter, we preserve the existing behavior. But I'm slightly in favor of forcing callers to handle that and adding an assert in the setter that the bits don't overflow. This is probably a bit faster since we don't have to do the bitmask on every setter call, and it may catch future issues. The question about the assert was: is `assert(cond);` the right way to assert? As in, in debug modes it'll run the assert and crash, and in release modes the assert won't be there for perf reasons. Is `<assert.h>` the right thing to include?
83	this was for `getSign()`

sivachandra added inline comments. Jul 12 2021, 10:23 AM

libc/utils/FPUtil/FPBits.h
56	Ah, OK! I should have said `static_assert` in my comment. The general rule we follow is: Include freestanding C headers if required. Include the header file corresponding to the implementation if required. So, in fputils, we can include `math.h`. Do not include other libc public headers unless they are related. Some of the above are checked by the libc lint rules implemented as part of clang-tidy. They are only run on the full build builders: https://lab.llvm.org/buildbot/#/workers/120 So, we cannot include `assert.h` or use `assert`. Whether the value should be checked in the caller or the setter, there are a few algorithms which assume the setters are doing it (there is a deliberate intention to overflow the mantissa.) So, I would say do the masking in the setter to retain functionality. In practice, it isn't a big runtime penalty as the setting usually happens at the very end of a complex algorithm.
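
The "mask in the setter" option discussed above might look roughly like the following for a 32-bit encoding. This is a sketch under assumptions, not the code that landed: `MaskedSetterSketch` and its members are hypothetical names, and the field widths are those of IEEE 754 single precision.

```cpp
#include <cstdint>

// Hypothetical illustration of setters that mask their argument, so a value
// wider than the field cannot spill into neighbouring fields.
struct MaskedSetterSketch {
  static constexpr unsigned mantissaWidth = 23;
  static constexpr uint32_t mantissaMask = (uint32_t(1) << mantissaWidth) - 1;
  static constexpr uint32_t exponentMask = uint32_t(0xFF) << mantissaWidth;

  uint32_t bits = 0;

  // Extra high bits in mantVal are dropped, which mirrors what assigning to
  // a 23-bit bit-field did implicitly.
  void setMantissa(uint32_t mantVal) {
    bits = (bits & ~mantissaMask) | (mantVal & mantissaMask);
  }

  // Same idea for the exponent field: only 8 bits of expVal are kept.
  void setUnbiasedExponent(uint32_t expVal) {
    bits = (bits & ~exponentMask) | ((expVal << mantissaWidth) & exponentMask);
  }

  uint32_t getMantissa() const { return bits & mantissaMask; }

  uint32_t getUnbiasedExponent() const {
    return (bits & exponentMask) >> mantissaWidth;
  }
};
```

With this shape, a caller that deliberately passes a mantissa with bit 23 set leaves the exponent field untouched, which keeps the behavior the bit-field version had. The alternative raised in the thread (no mask, with a check in the setter) would instead require every caller to pass an in-range value.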

hedingarcia retitled this revision from "[libc] Capture floating point encoding and arrange it sequentially in memory with structs" to "[libc] Capture floating point encoding and arrange it sequentially in memory". Jul 12 2021, 11:39 AM

hedingarcia edited the summary of this revision.

[libc] Removed encoding/FPUIntType struct and renamed getExponent()

Eliminated encoding struct and moved its data and member fields into FPBits.
Removed the FPUIntType struct since it was not being referenced.

hedingarcia marked 10 inline comments as done. Jul 12 2021, 11:58 AM

Harbormaster completed remote builds in B113550: Diff 358011. Jul 12 2021, 12:26 PM

lgtm besides the question for Siva of uint8_t vs bool for getSign()

This revision is now accepted and ready to land. Jul 12 2021, 12:54 PM

In D105561#2872036, @aeubanks wrote:

lgtm besides the question for Siva of uint8_t vs bool for getSign()

Logically, it doesn't matter. Are there benefits wrt compiler driven optimizations? If yes, then suggest the one which should be preferred?

Otherwise, LGTM as well. Thanks for patiently resolving the comments.

In D105561#2872173, @sivachandra wrote:

In D105561#2872036, @aeubanks wrote:

lgtm besides the question for Siva of uint8_t vs bool for getSign()

Logically, it doesn't matter. Are there benefits wrt compiler driven optimizations? If yes, then suggest the one which should be preferred?

Otherwise, LGTM as well. Thanks for patiently resolving the comments.

Probably bool is better since the compiler can assume it's only 0 or 1.

In D105561#2872187, @aeubanks wrote:

Probably bool is better since the compiler can assume it's only 0 or 1.

Okay, I will change the return type to bool.

[libc] Changed the return type of getSign()

hedingarcia marked 3 inline comments as done. Jul 12 2021, 2:07 PM

lgtm

Harbormaster completed remote builds in B113585: Diff 358060. Jul 12 2021, 3:15 PM

lntue added inline comments. Jul 12 2021, 10:32 PM

libc/utils/FPUtil/FPBits.h
76	Sorry for the late comments! Since you already updated to get* and set*, can you also add getBits and setBits? I think they would be better than the current uintval() (and there is technically no equivalent of setBits).
libc/utils/FPUtil/NextAfterLongDoubleX86.h
53	I'm a bit curious about the generated assembly for this line. @sivachandra: can we check whether there is any regression with this one? I think no regression at O2 and O3 would be good enough.

hedingarcia added inline comments. Jul 13 2021, 9:42 AM

libc/utils/FPUtil/FPBits.h
76	These get/setBits members existed previously under a different name, but they are no longer required since the `bits` value can be set or read directly on FPBits.
libc/utils/FPUtil/NextAfterLongDoubleX86.h
53	The object files generated before and after this patch are the same; at least, the check showed no difference between the files at the O2 optimization level.

This revision was landed with ongoing or failed builds. Jul 13 2021, 1:44 PM

Closed by commit rGa5a337e55ed2: [libc] Capture floating point encoding and arrange it sequentially in memory (authored by hedingarcia).

This revision was automatically updated to reflect the committed changes.

hedingarcia added a commit: rGa5a337e55ed2: [libc] Capture floating point encoding and arrange it sequentially in memory.

amccarth added a subscriber: amccarth. Aug 12 2021, 2:45 PM

amccarth added inline comments.

libc/utils/FPUtil/FPBits.h
49	I'm rather surprised that this change was necessary. When working with a specific bit layout, using an unsigned integer type and bitwise operations can be a good idea--I don't think the language standards give many guarantees about how the bits are allocated (e.g., MSB first or LSB first?). But I'm not sure why the compiler (we are talking about clang here, right?) didn't pack the structures the same way on all platforms. Perhaps when targeting Windows, clang is trying too hard to be compatible with MSVC. With MSVC, when defining bit fields, they will pack by default--but only if the underlying types are all the same. Since the old types were UIntType, uint16_t, and uint8_t--all of which have different alignment requirements--MSVC doesn't pack those. But if they had all been defined as uint32_t, MSVC would certainly have done the right thing. And thus, maybe clang would have as well.
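
The point about underlying types can be illustrated with two bit-field structs. This is only a sketch with hypothetical names; bit-field packing and allocation order are implementation-defined, so the size of the mixed-type struct is deliberately not asserted to equal any particular value, and the size check on the uniform struct reflects common ABIs rather than a language guarantee.

```cpp
#include <cstdint>

// All fields share one underlying type. On common ABIs the three fields
// (23 + 8 + 1 = 32 bits) share a single 32-bit storage unit.
struct UniformFieldsSketch {
  uint32_t mantissa : 23;
  uint32_t exponent : 8;
  uint32_t sign : 1;
};

// Fields with different underlying types, similar in spirit to the old
// UIntType / uint16_t / uint8_t combination. Whether these share storage
// depends on the ABI, so the total size can differ between platforms.
struct MixedFieldsSketch {
  uint32_t mantissa : 23;
  uint16_t exponent : 8;
  uint8_t sign : 1;
};

// Assumption: holds on mainstream compilers, not guaranteed by the standard.
static_assert(sizeof(UniformFieldsSketch) == sizeof(float),
              "uniform bit-fields are expected to fit in 4 bytes");

// Helper to inspect the mixed-type size on the current platform.
inline unsigned mixedFieldsSize() { return sizeof(MixedFieldsSketch); }
```

Even where the sizes match, the order in which fields are allocated within the storage unit is still up to the implementation, which is one argument for the explicit shift-and-mask approach taken in this patch.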

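Revision Contents

Path	Size
libc/test/src/math/LdExpTest.h	2 lines
libc/test/src/math/NextAfterTest.h	22 lines
libc/test/src/math/RoundToIntegerTest.h	13 lines
libc/test/src/math/SqrtTest.h	2 lines
libc/utils/FPUtil/BasicOperations.h	10 lines
libc/utils/FPUtil/DivisionAndRemainderOperations.h	5 lines
libc/utils/FPUtil/FPBits.h	108 lines
libc/utils/FPUtil/Hypot.h	26 lines
libc/utils/FPUtil/LongDoubleBitsX86.h	106 lines
libc/utils/FPUtil/ManipulationFunctions.h	11 lines
libc/utils/FPUtil/NearestIntegerOperations.h	34 lines
libc/utils/FPUtil/NextAfterLongDoubleX86.h	20 lines
libc/utils/FPUtil/NormalFloat.h	73 lines
libc/utils/FPUtil/Sqrt.h	8 lines
libc/utils/FPUtil/SqrtLongDoubleX86.h	16 lines
libc/utils/FPUtil/TestHelpers.cpp	9 lines
libc/utils/FPUtil/generic/FMA.h	12 lines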

Diff 358418

libc/test/src/math/LdExpTest.h

Show First 20 Lines • Show All 115 Lines • ▼ Show 20 Lines	void testNormalOperation(LdExpFunc func) {
// Start with a normal number high exponent but pass a very low number for		// Start with a normal number high exponent but pass a very low number for
// exp. The result should be a subnormal number.		// exp. The result should be a subnormal number.
x = NormalFloat(FPBits::exponentBias, NormalFloat::one, 0);		x = NormalFloat(FPBits::exponentBias, NormalFloat::one, 0);
int exp = -FPBits::maxExponent - 5;		int exp = -FPBits::maxExponent - 5;
T result = func(x, exp);		T result = func(x, exp);
FPBits resultBits(result);		FPBits resultBits(result);
ASSERT_FALSE(resultBits.isZero());		ASSERT_FALSE(resultBits.isZero());
// Verify that the result is indeed subnormal.		// Verify that the result is indeed subnormal.
ASSERT_EQ(resultBits.encoding.exponent, uint16_t(0));		ASSERT_EQ(resultBits.getUnbiasedExponent(), uint16_t(0));
// But if the exp is so less that normalization leads to zero, then		// But if the exp is so less that normalization leads to zero, then
// the result should be zero.		// the result should be zero.
result = func(x, -FPBits::maxExponent - int(mantissaWidth) - 5);		result = func(x, -FPBits::maxExponent - int(mantissaWidth) - 5);
ASSERT_TRUE(FPBits(result).isZero());		ASSERT_TRUE(FPBits(result).isZero());

// Start with a subnormal number but pass a very high number for exponent.		// Start with a subnormal number but pass a very high number for exponent.
// The result should not be infinity.		// The result should not be infinity.
x = NormalFloat(-FPBits::exponentBias + 1, NormalFloat::one >> 10, 0);		x = NormalFloat(-FPBits::exponentBias + 1, NormalFloat::one >> 10, 0);
Show All 23 Lines

libc/test/src/math/NextAfterTest.h

Show First 20 Lines • Show All 157 Lines • ▼ Show 20 Lines	void testBoundaries(NextAfterFunc func) {
ASSERT_FP_EQ(result, expected);		ASSERT_FP_EQ(result, expected);
ASSERT_FP_EQ(func(x, negInf), negInf);		ASSERT_FP_EQ(func(x, negInf), negInf);

// 'from' is a power of 2.		// 'from' is a power of 2.
x = T(32.0);		x = T(32.0);
result = func(x, 0);		result = func(x, 0);
FPBits xBits = FPBits(x);		FPBits xBits = FPBits(x);
FPBits resultBits = FPBits(result);		FPBits resultBits = FPBits(result);
ASSERT_EQ(resultBits.encoding.exponent,		ASSERT_EQ(resultBits.getUnbiasedExponent(),
uint16_t(xBits.encoding.exponent - 1));		uint16_t(xBits.getUnbiasedExponent() - 1));
ASSERT_EQ(resultBits.encoding.mantissa,		ASSERT_EQ(resultBits.getMantissa(),
(UIntType(1) << MantissaWidth::value) - 1);		(UIntType(1) << MantissaWidth::value) - 1);

result = func(x, T(33.0));		result = func(x, T(33.0));
resultBits = FPBits(result);		resultBits = FPBits(result);
ASSERT_EQ(resultBits.encoding.exponent, xBits.encoding.exponent);		ASSERT_EQ(resultBits.getUnbiasedExponent(), xBits.getUnbiasedExponent());
ASSERT_EQ(resultBits.encoding.mantissa,		ASSERT_EQ(resultBits.getMantissa(), xBits.getMantissa() + UIntType(1));
xBits.encoding.mantissa + UIntType(1));

x = -x;		x = -x;

result = func(x, 0);		result = func(x, 0);
resultBits = FPBits(result);		resultBits = FPBits(result);
ASSERT_EQ(resultBits.encoding.exponent,		ASSERT_EQ(resultBits.getUnbiasedExponent(),
uint16_t(xBits.encoding.exponent - 1));		uint16_t(xBits.getUnbiasedExponent() - 1));
ASSERT_EQ(resultBits.encoding.mantissa,		ASSERT_EQ(resultBits.getMantissa(),
(UIntType(1) << MantissaWidth::value) - 1);		(UIntType(1) << MantissaWidth::value) - 1);

result = func(x, T(-33.0));		result = func(x, T(-33.0));
resultBits = FPBits(result);		resultBits = FPBits(result);
ASSERT_EQ(resultBits.encoding.exponent, xBits.encoding.exponent);		ASSERT_EQ(resultBits.getUnbiasedExponent(), xBits.getUnbiasedExponent());
ASSERT_EQ(resultBits.encoding.mantissa,		ASSERT_EQ(resultBits.getMantissa(), xBits.getMantissa() + UIntType(1));
xBits.encoding.mantissa + UIntType(1));
}		}
};		};

#define LIST_NEXTAFTER_TESTS(T, func) \		#define LIST_NEXTAFTER_TESTS(T, func) \
using LlvmLibcNextAfterTest = NextAfterTestTemplate<T>; \		using LlvmLibcNextAfterTest = NextAfterTestTemplate<T>; \
TEST_F(LlvmLibcNextAfterTest, TestNaN) { testNaN(&func); } \		TEST_F(LlvmLibcNextAfterTest, TestNaN) { testNaN(&func); } \
TEST_F(LlvmLibcNextAfterTest, TestBoundaries) { testBoundaries(&func); }		TEST_F(LlvmLibcNextAfterTest, TestBoundaries) { testBoundaries(&func); }

#endif // LLVM_LIBC_TEST_SRC_MATH_NEXTAFTERTEST_H		#endif // LLVM_LIBC_TEST_SRC_MATH_NEXTAFTERTEST_H

libc/test/src/math/RoundToIntegerTest.h

Show First 20 Lines • Show All 129 Lines • ▼ Show 20 Lines	void doRoundNumbersTest(RoundToIntegerFunc func) {
// long.		// long.
if (sizeof(I) > sizeof(long))		if (sizeof(I) > sizeof(long))
return;		return;

constexpr int exponentLimit = sizeof(I) * 8 - 1;		constexpr int exponentLimit = sizeof(I) * 8 - 1;
// We start with 1.0 so that the implicit bit for x86 long doubles		// We start with 1.0 so that the implicit bit for x86 long doubles
// is set.		// is set.
FPBits bits(F(1.0));		FPBits bits(F(1.0));
bits.encoding.exponent = exponentLimit + FPBits::exponentBias;		bits.setUnbiasedExponent(exponentLimit + FPBits::exponentBias);
bits.encoding.sign = 1;		bits.setSign(1);
bits.encoding.mantissa = 0;		bits.setMantissa(0);

F x = F(bits);		F x = F(bits);
long mpfrResult;		long mpfrResult;
bool erangeflag = mpfr::RoundToLong(x, mpfrResult);		bool erangeflag = mpfr::RoundToLong(x, mpfrResult);
ASSERT_FALSE(erangeflag);		ASSERT_FALSE(erangeflag);
testOneInput(func, x, mpfrResult, false);		testOneInput(func, x, mpfrResult, false);
}		}

▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	void testIntegerOverflow(RoundToIntegerFunc func) {
// that of long.		// that of long.
if (sizeof(I) > sizeof(long))		if (sizeof(I) > sizeof(long))
return;		return;

constexpr int exponentLimit = sizeof(I) * 8 - 1;		constexpr int exponentLimit = sizeof(I) * 8 - 1;
// We start with 1.0 so that the implicit bit for x86 long doubles		// We start with 1.0 so that the implicit bit for x86 long doubles
// is set.		// is set.
FPBits bits(F(1.0));		FPBits bits(F(1.0));
bits.encoding.exponent = exponentLimit + FPBits::exponentBias;		bits.setUnbiasedExponent(exponentLimit + FPBits::exponentBias);
bits.encoding.sign = 1;		bits.setSign(1);
bits.encoding.mantissa =		bits.setMantissa(UIntType(0x1) << (__llvm_libc::fputil::MantissaWidth<F>::value - 1));
UIntType(0x1) << (__llvm_libc::fputil::MantissaWidth<F>::value - 1);

F x = F(bits);		F x = F(bits);
if (TestModes) {		if (TestModes) {
for (int m : roundingModes) {		for (int m : roundingModes) {
__llvm_libc::fputil::setRound(m);		__llvm_libc::fputil::setRound(m);
long mpfrLongResult;		long mpfrLongResult;
bool erangeflag =		bool erangeflag =
mpfr::RoundToLong(x, toMPFRRoundingMode(m), mpfrLongResult);		mpfr::RoundToLong(x, toMPFRRoundingMode(m), mpfrLongResult);
▲ Show 20 Lines • Show All 116 Lines • Show Last 20 Lines

libc/test/src/math/SqrtTest.h

Show All 33 Lines	void testSpecialNumbers(SqrtFunc func) {
ASSERT_FP_EQ(T(1.0), func(T(1.0)));		ASSERT_FP_EQ(T(1.0), func(T(1.0)));
ASSERT_FP_EQ(T(2.0), func(T(4.0)));		ASSERT_FP_EQ(T(2.0), func(T(4.0)));
ASSERT_FP_EQ(T(3.0), func(T(9.0)));		ASSERT_FP_EQ(T(3.0), func(T(9.0)));
}		}

void testDenormalValues(SqrtFunc func) {		void testDenormalValues(SqrtFunc func) {
for (UIntType mant = 1; mant < HiddenBit; mant <<= 1) {		for (UIntType mant = 1; mant < HiddenBit; mant <<= 1) {
FPBits denormal(T(0.0));		FPBits denormal(T(0.0));
denormal.encoding.mantissa = mant;		denormal.setMantissa(mant);

ASSERT_MPFR_MATCH(mpfr::Operation::Sqrt, T(denormal), func(T(denormal)),		ASSERT_MPFR_MATCH(mpfr::Operation::Sqrt, T(denormal), func(T(denormal)),
T(0.5));		T(0.5));
}		}

constexpr UIntType count = 1'000'001;		constexpr UIntType count = 1'000'001;
constexpr UIntType step = HiddenBit / count;		constexpr UIntType step = HiddenBit / count;
for (UIntType i = 0, v = 0; i <= count; ++i, v += step) {		for (UIntType i = 0, v = 0; i <= count; ++i, v += step) {
Show All 23 Lines

libc/utils/FPUtil/BasicOperations.h

	Show All 14 Lines

	namespace __llvm_libc {			namespace __llvm_libc {
	namespace fputil {			namespace fputil {

	template <typename T,			template <typename T,
	cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>			cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
	static inline T abs(T x) {			static inline T abs(T x) {
	FPBits<T> bits(x);			FPBits<T> bits(x);
	bits.encoding.sign = 0;			bits.setSign(0);
	return T(bits);			return T(bits);
	}			}

	template <typename T,			template <typename T,
	cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>			cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
	static inline T fmin(T x, T y) {			static inline T fmin(T x, T y) {
	FPBits<T> bitx(x), bity(y);			FPBits<T> bitx(x), bity(y);

	if (bitx.isNaN()) {			if (bitx.isNaN()) {
	return y;			return y;
	} else if (bity.isNaN()) {			} else if (bity.isNaN()) {
	return x;			return x;
	} else if (bitx.encoding.sign != bity.encoding.sign) {			} else if (bitx.getSign() != bity.getSign()) {
	// To make sure that fmin(+0, -0) == -0 == fmin(-0, +0), whenever x and			// To make sure that fmin(+0, -0) == -0 == fmin(-0, +0), whenever x and
	// y has different signs and both are not NaNs, we return the number			// y has different signs and both are not NaNs, we return the number
	// with negative sign.			// with negative sign.
	return (bitx.encoding.sign ? x : y);			return (bitx.getSign() ? x : y);
	} else {			} else {
	return (x < y ? x : y);			return (x < y ? x : y);
	}			}
	}			}

	template <typename T,			template <typename T,
	cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>			cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
	static inline T fmax(T x, T y) {			static inline T fmax(T x, T y) {
	FPBits<T> bitx(x), bity(y);			FPBits<T> bitx(x), bity(y);

	if (bitx.isNaN()) {			if (bitx.isNaN()) {
	return y;			return y;
	} else if (bity.isNaN()) {			} else if (bity.isNaN()) {
	return x;			return x;
	} else if (bitx.encoding.sign != bity.encoding.sign) {			} else if (bitx.getSign() != bity.getSign()) {
	// To make sure that fmax(+0, -0) == +0 == fmax(-0, +0), whenever x and			// To make sure that fmax(+0, -0) == +0 == fmax(-0, +0), whenever x and
	// y has different signs and both are not NaNs, we return the number			// y has different signs and both are not NaNs, we return the number
	// with positive sign.			// with positive sign.
	return (bitx.encoding.sign ? y : x);			return (bitx.getSign() ? y : x);
	} else {			} else {
	return (x > y ? x : y);			return (x > y ? x : y);
	}			}
	}			}

	template <typename T,			template <typename T,
	cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>			cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
	static inline T fdim(T x, T y) {			static inline T fdim(T x, T y) {
	Show All 17 Lines

libc/utils/FPUtil/DivisionAndRemainderOperations.h

Show All 37 Lines	if (xbits.isZero()) {
return __llvm_libc::fputil::copysign(T(0.0), x);		return __llvm_libc::fputil::copysign(T(0.0), x);
}		}

if (ybits.isInf()) {		if (ybits.isInf()) {
q = 0;		q = 0;
return x;		return x;
}		}

bool resultSign = (xbits.encoding.sign == ybits.encoding.sign ? false : true);		bool resultSign = (xbits.getSign() == ybits.getSign() ? false : true);

// Once we know the sign of the result, we can just operate on the absolute		// Once we know the sign of the result, we can just operate on the absolute
// values. The correct sign can be applied to the result after the result		// values. The correct sign can be applied to the result after the result
// is evaluated.		// is evaluated.
xbits.encoding.sign = ybits.encoding.sign = 0;		xbits.setSign(0);
		ybits.setSign(0);

NormalFloat<T> normalx(xbits), normaly(ybits);		NormalFloat<T> normalx(xbits), normaly(ybits);
int exp = normalx.exponent - normaly.exponent;		int exp = normalx.exponent - normaly.exponent;
typename NormalFloat<T>::UIntType mx = normalx.mantissa,		typename NormalFloat<T>::UIntType mx = normalx.mantissa,
my = normaly.mantissa;		my = normaly.mantissa;

q = 0;		q = 0;
while (exp >= 0) {		while (exp >= 0) {
▲ Show 20 Lines • Show All 59 Lines • Show Last 20 Lines

libc/utils/FPUtil/FPBits.h

	//===-- Abstract class for bit manipulation of float numbers. ---- C++ --===//			//===-- Abstract class for bit manipulation of float numbers. ---- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_LIBC_UTILS_FPUTIL_FP_BITS_H			#ifndef LLVM_LIBC_UTILS_FPUTIL_FP_BITS_H
	#define LLVM_LIBC_UTILS_FPUTIL_FP_BITS_H			#define LLVM_LIBC_UTILS_FPUTIL_FP_BITS_H

	#include "PlatformDefs.h"			#include "PlatformDefs.h"

	#include "utils/CPP/TypeTraits.h"			#include "utils/CPP/TypeTraits.h"

				#include "FloatProperties.h"
	#include <stdint.h>			#include <stdint.h>

	namespace __llvm_libc {			namespace __llvm_libc {
	namespace fputil {			namespace fputil {

	template <typename T> struct MantissaWidth {};			template <typename T> struct MantissaWidth {
	template <> struct MantissaWidth<float> {			static constexpr unsigned value = FloatProperties<T>::mantissaWidth;
	static constexpr unsigned value = 23;
	};
	template <> struct MantissaWidth<double> {
	static constexpr unsigned value = 52;
	};

	template <typename T> struct ExponentWidth {};
	template <> struct ExponentWidth<float> {
	static constexpr unsigned value = 8;
	};
	template <> struct ExponentWidth<double> {
	static constexpr unsigned value = 11;
	};
	template <> struct ExponentWidth<long double> {
	static constexpr unsigned value = 15;
	};			};

	template <typename T> struct FPUIntType {};			template <typename T> struct ExponentWidth {
	template <> struct FPUIntType<float> { using Type = uint32_t; };			static constexpr unsigned value = FloatProperties<T>::exponentWidth;
	template <> struct FPUIntType<double> { using Type = uint64_t; };

	#ifdef LONG_DOUBLE_IS_DOUBLE
	template <> struct MantissaWidth<long double> {
	static constexpr unsigned value = MantissaWidth<double>::value;
	};			};
	template <> struct FPUIntType<long double> {
	using Type = FPUIntType<double>::Type;
	};
	#elif !defined(SPECIAL_X86_LONG_DOUBLE)
	template <> struct MantissaWidth<long double> {
	static constexpr unsigned value = 112;
	};
	template <> struct FPUIntType<long double> { using Type = __uint128_t; };
	#endif

	// A generic class to represent single precision, double precision, and quad			// A generic class to represent single precision, double precision, and quad
	// precision IEEE 754 floating point formats.			// precision IEEE 754 floating point formats.
	// On most platforms, the 'float' type corresponds to single precision floating			// On most platforms, the 'float' type corresponds to single precision floating
	// point numbers, the 'double' type corresponds to double precision floating			// point numbers, the 'double' type corresponds to double precision floating
	// point numers, and the 'long double' type corresponds to the quad precision			// point numers, and the 'long double' type corresponds to the quad precision
	// floating numbers. On x86 platforms however, the 'long double' type maps to			// floating numbers. On x86 platforms however, the 'long double' type maps to
	// an x87 floating point format. This format is an IEEE 754 extension format.			// an x87 floating point format. This format is an IEEE 754 extension format.
	// It is handled as an explicit specialization of this class.			// It is handled as an explicit specialization of this class.
	template <typename T> union FPBits {			template <typename T> union FPBits {
	static_assert(cpp::IsFloatingPointType<T>::Value,			static_assert(cpp::IsFloatingPointType<T>::Value,
	"FPBits instantiated with invalid type.");			"FPBits instantiated with invalid type.");

	// Reinterpreting bits as an integer value and interpreting the bits of an			// Reinterpreting bits as an integer value and interpreting the bits of an
	// integer value as a floating point value is used in tests. So, a convenient			// integer value as a floating point value is used in tests. So, a convenient
	// type is provided for such reinterpretations.			// type is provided for such reinterpretations.
	using UIntType = typename FPUIntType<T>::Type;			using FloatProp = FloatProperties<T>;
				aeubanks (Done): I think we don't need the `__llvm_libc::fputil::` since we're already inside that namespace
				aeubanks (Done): `FloatProp::BitsType` seems more consistent (you can move the `using FloatProp` here)
				sivachandra (Done): For consistency, I would name `UIntType` as `BitsType` here as well. But, you can choose to add a `TODO` here and do that "cleanup" in a follow-up change.
				// TODO: Change UintType name to BitsType for consistency.
				using UIntType = typename FloatProp::BitsType;

				aeubanks (Done): I don't really like `valueFP`, how about something like `bits`
	struct __attribute__((packed)) {			UIntType bits;
				amccarth (Not Done): I'm rather surprised that this change was necessary. When working with a specific bit layout, using an unsigned integer type and bitwise operations can be a good idea--I don't think the language standards give many guarantees about how the bits are allocated (e.g., MSB first or LSB first?). But I'm not sure why the compiler (we are talking about clang here, right?) didn't pack the structures the same way on all platforms.
				Perhaps when targeting Windows, clang is trying too hard to be compatible with MSVC. With MSVC, when defining bit fields, they will pack by default--but only if the underlying types are all the same. Since the old types were UIntType, uint16_t, and uint8_t--all of which have different alignment requirements--MSVC doesn't pack those.
				But if they had all been defined as uint32_t, MSVC would certainly have done the right thing. And thus, maybe clang would have as well.
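A small sketch of the contrast described in the comment above, assuming a 32-bit single precision layout. The struct names are made up for this illustration. Bit-field layout is implementation-defined, so only the uniform-type case is asserted; the mixed-type struct is shown for comparison and is typically larger than 4 bytes under an MSVC-compatible layout, because a change in the underlying type's size starts a new allocation unit there.

```cpp
#include <cstdint>

// Mixed underlying types, as in the old FPBits encoding struct (without the
// packed attribute). The resulting size depends on the ABI.
struct MixedFields {
  uint32_t mantissa : 23;
  uint16_t exponent : 8;
  uint8_t sign : 1;
};

// Same underlying type for every field: the 23 + 8 + 1 bits share one
// 32-bit allocation unit on the mainstream ABIs.
struct UniformFields {
  uint32_t mantissa : 23;
  uint32_t exponent : 8;
  uint32_t sign : 1;
};

static_assert(sizeof(UniformFields) == sizeof(uint32_t),
              "uniform bit-fields are expected to fit in one 32-bit unit");
```

Even with uniform types, the order of the fields within the unit is not guaranteed by the language, which is one reason the patch moves to an integer plus explicit masks.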
	UIntType mantissa : MantissaWidth<T>::value;
	uint16_t exponent : ExponentWidth<T>::value;			void setMantissa(UIntType mantVal) {
	uint8_t sign : 1;			mantVal &= (FloatProp::mantissaMask);
	} encoding;			bits &= ~(FloatProp::mantissaMask);
				sivachandra (Done): Why are these methods required? Probably because you got rid of the `integer` field of `FPBits`? If you add the rest of the methods to `FPBits` directly (see below), this should not be required as the `integer` / `bits` value can be directly set or read.
	UIntType integer;			bits |= mantVal;
				aeubanks (Done): these comments should probably be deleted, they're not super useful
				}
				aeubanks (Done): the next assert will never fail with this line (same for setExponent())

				aeubanks (Done): @sivachandra is this the right way to assert?
				sivachandra (Done): Not sure what line this pertains to. But, after this change, the only assert that would be required is something like: `assert(sizeof(T) == sizeof(UIntVal), ...);`
				aeubanks (Done): oh, I think the code was deleted for some reason
				I spent a while debugging a failure Hedin was running into. Turns out sometimes when setting the mantissa/exponent, it was actually larger than the max value (IIRC at the very bottom of NormalFloat.h). With the struct bitfield it automatically took care of that, but here we needed to mask out the extra bits. The question is whether to do that in the setter or do that in the caller. If we do that in the setter we preserve the existing behavior. But I'm slightly in favor of forcing callers to handle that and adding an assert in the setter that the bits don't overflow. This is probably a bit faster since we don't have to do the bitmask on every setter, and may catch future issues.
				The question about the assert was, is `assert(cond);` the right way to assert? As in, in debug modes it'll run the assert and crash, and in release modes the assert won't be there for perf reasons. Is `<assert.h>` the right thing to include?
				sivachandra (Done): Ah, OK! I should have said `static_assert` in my comment. The general rule we follow is:
				1. Include freestanding C headers if required.
				2. Include the header file corresponding to the implementation if required. So, in fputils, we can include `math.h`.
				3. Do not include other libc public headers unless they are related.
				Some of the above are checked by the libc lint rules implemented as part of clang-tidy. They are only run on the full build builders: https://lab.llvm.org/buildbot/#/workers/120
				So, we cannot include `assert.h` or use `assert`.
				Whether the value should be checked in the caller or the setter, there are a few algorithms which assume the setters are doing it (there is a deliberate intention to overflow the mantissa.) So, I would say do the masking in the setter to retain functionality. In practice, it isn't a big runtime penalty as the setting usually happens at the very end of a complex algorithm.
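The masking behaviour discussed in this thread can be sketched as follows, assuming a 32-bit single precision layout. `FloatBitsSketch` and the `k...` constants are hypothetical names for this illustration, not the patch's `FPBits` or `FloatProperties`. The point is that a bit-field assignment silently dropped overflowing bits, so the integer-based setters have to mask explicitly to keep the same behaviour.

```cpp
#include <cstdint>

// Hypothetical constants for IEEE 754 single precision.
constexpr uint32_t kMantissaWidth = 23;
constexpr uint32_t kMantissaMask = (uint32_t(1) << kMantissaWidth) - 1;
constexpr uint32_t kExponentMask = uint32_t(0xFF) << kMantissaWidth;

struct FloatBitsSketch {
  uint32_t bits = 0;

  void setMantissa(uint32_t mantVal) {
    mantVal &= kMantissaMask; // drop anything above the mantissa field
    bits &= ~kMantissaMask;   // clear the old mantissa
    bits |= mantVal;
  }
  uint32_t getMantissa() const { return bits & kMantissaMask; }

  void setUnbiasedExponent(uint32_t expVal) {
    expVal = (expVal << kMantissaWidth) & kExponentMask;
    bits &= ~kExponentMask; // clear the old exponent field
    bits |= expVal;
  }
  uint32_t getUnbiasedExponent() const {
    return (bits & kExponentMask) >> kMantissaWidth;
  }
};
```

Calling `setMantissa(0x00800001)` after `setUnbiasedExponent(127)` leaves the exponent field at 127 and stores a mantissa of 1: without the first mask, bit 23 of the argument would have leaked into the exponent field.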
				UIntType getMantissa() const { return bits & FloatProp::mantissaMask; }

				void setUnbiasedExponent(UIntType expVal) {
				expVal = (expVal << (FloatProp::mantissaWidth)) & FloatProp::exponentMask;
				bits &= ~(FloatProp::exponentMask);
				bits |= expVal;
				}

				uint16_t getUnbiasedExponent() const {
				return uint16_t((bits & FloatProp::exponentMask) >>
				(FloatProp::mantissaWidth));
				}

				sivachandra (Done): You should probably name this method `unbiasedExponent` to distinguish it from the real exponent value returned by `getExponent()`.
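To make the distinction in the comment above concrete, here is a minimal sketch that assumes `float` is IEEE 754 single precision. The helper names are hypothetical. `exponentField` returns the raw stored field (what the patch calls the "unbiased" exponent), and `realExponent` subtracts the bias of 127, which is what `getExponent()` does with `exponentBias`.

```cpp
#include <cstdint>
#include <cstring>

// Copy the object representation of a float into an integer.
inline uint32_t bitsOf(float x) {
  uint32_t u;
  std::memcpy(&u, &x, sizeof u);
  return u;
}

// Raw 8-bit exponent field, exactly as stored in the encoding.
inline uint16_t exponentField(float x) {
  return uint16_t((bitsOf(x) >> 23) & 0xFF);
}

// Real exponent of a normal number: stored field minus the bias.
inline int realExponent(float x) { return int(exponentField(x)) - 127; }
```

For example, `1.0f` stores 127 in the field and has a real exponent of 0.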
				void setSign(bool signVal) {
				bits &= ~(FloatProp::signMask);
				UIntType sign = UIntType(signVal) << (FloatProp::bitWidth - 1);
				bits |= sign;
				}

				bool getSign() const {
				lntue (Not Done): Sorry for late comments! Since you already updated to get* and set*, can you also add getBits and setBits? I think they would be better than the current uintval() (and technically no equivalent of setBits).
				hedingarcia (Author, Done): These get/setBits members were used previously with a different name but now they are not required anymore since the `bits` value can be directly set or read from FPBits.
				return ((bits & FloatProp::signMask) >> (FloatProp::bitWidth - 1));
				}
				aeubanks (Done): we should replace all uses of `integer` with `encoding.valueFP`
				sivachandra (Done): I would actually prefer if we got rid of `encoding` completely [1]. Adding the new methods to `FPBits` directly would reduce the verbosity when calling them. The integer value is currently called `integer`, but as @aeubanks suggests elsewhere, `bits` is maybe more appropriate. However, to reduce the churn, you can choose to keep it as `integer` or `bits` depending on what's convenient to you.
				[1] - We had a separate `encoding` field because it was essentially that when we could use bit-fields. Moreover, it helped us distinguish it from the other fields. With that gone, we can choose to remove it and eliminate that "complexity" and verbosity.
	T val;			T val;
				aeubanks (Done): this is very confusing, can we just use `mantissaWidth`?
				hedingarcia (Author, Done): The reason why we cannot use mantissaWidth is that in the 80-bit long double implementation the value for that field is 63 instead of 64. An explicit bit at position 63 is treated as the integer part of the significand (https://en.wikipedia.org/wiki/Extended_precision). This exception leads to that long expression in getExponent(), because with only the value of mantissaWidth the shift would still need to move one more bit in order to return all the bits of the exponent.
				aeubanks (Done): I thought that only applied to the version in LongDoubleBitsX86
				hedingarcia (Author, Done): Yes, however should the implementation in LongDoubleBitsX86.h be kept the same even in FPBits.h?
				hedingarcia (Author, Done): I see what you mean now. Yes, since LongDoubleBitsX86.h takes care of the long double implementation, only that file should take the explicit bit into consideration. FPBits.h does not have to work around this case since it is used for single and double. Thank you for catching that detail.
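The layout issue in this thread can be sketched without `__uint128_t` by splitting the 80 significant bits into two integers. `X87Bits` and the helpers below are hypothetical names for this illustration; the patch itself keeps everything in one `UIntType bits` and shifts by `bitWidth - 1 - exponentWidth`. The sketch assumes the x87 extended format: 63 fraction bits, one explicit integer bit at position 63, 15 exponent bits, and one sign bit.

```cpp
#include <cstdint>

// Hypothetical split view of the low 80 bits of an x87 long double.
struct X87Bits {
  uint64_t significand; // bits 0..62: fraction, bit 63: explicit integer bit
  uint16_t signExp;     // bits 64..78: exponent field, bit 79: sign
};

inline uint64_t fraction(const X87Bits &b) {
  return b.significand & ((uint64_t(1) << 63) - 1);
}
inline bool integerBit(const X87Bits &b) { return (b.significand >> 63) != 0; }
inline uint16_t exponentField(const X87Bits &b) { return b.signExp & 0x7FFF; }
inline bool signBit(const X87Bits &b) { return (b.signExp >> 15) != 0; }
```

Because the integer bit sits between the fraction and the exponent, the exponent field starts at bit 64, one position above `mantissaWidth` (63). In the single and double formats the integer bit is implicit, so the exponent field starts directly at `mantissaWidth`.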

	static_assert(sizeof(encoding) == sizeof(UIntType),			static_assert(sizeof(T) == sizeof(UIntType),
	"Encoding and integral representation have different sizes.");			"Data type and integral representation have different sizes.");
	static_assert(sizeof(integer) == sizeof(UIntType),
	"Integral representation and value type have different sizes.");

				aeubanks (Done): not sure if this should be a bool or a uint8_t... let's see what the others say
				sivachandra (Done): Which line was this comment originally for?
				aeubanks (Done): this was for `getSign()`
	static constexpr int exponentBias = (1 << (ExponentWidth<T>::value - 1)) - 1;			static constexpr int exponentBias = (1 << (ExponentWidth<T>::value - 1)) - 1;
	static constexpr int maxExponent = (1 << ExponentWidth<T>::value) - 1;			static constexpr int maxExponent = (1 << ExponentWidth<T>::value) - 1;

	static constexpr UIntType minSubnormal = UIntType(1);			static constexpr UIntType minSubnormal = UIntType(1);
	static constexpr UIntType maxSubnormal =			static constexpr UIntType maxSubnormal =
	(UIntType(1) << MantissaWidth<T>::value) - 1;			(UIntType(1) << MantissaWidth<T>::value) - 1;
	static constexpr UIntType minNormal =			static constexpr UIntType minNormal =
	(UIntType(1) << MantissaWidth<T>::value);			(UIntType(1) << MantissaWidth<T>::value);
				aeubanks (Done): `bool`?
	static constexpr UIntType maxNormal =			static constexpr UIntType maxNormal =
	((UIntType(maxExponent) - 1) << MantissaWidth<T>::value) | maxSubnormal;			((UIntType(maxExponent) - 1) << MantissaWidth<T>::value) | maxSubnormal;

	// We don't want accidental type promotions/conversions so we require exact			// We don't want accidental type promotions/conversions so we require exact
	// type match.			// type match.
	template <typename XType,			template <typename XType,
	cpp::EnableIfType<cpp::IsSame<T, XType>::Value, int> = 0>			cpp::EnableIfType<cpp::IsSame<T, XType>::Value, int> = 0>
	explicit FPBits(XType x) : val(x) {}			explicit FPBits(XType x) : val(x) {}

	template <typename XType,			template <typename XType,
	cpp::EnableIfType<cpp::IsSame<XType, UIntType>::Value, int> = 0>			cpp::EnableIfType<cpp::IsSame<XType, UIntType>::Value, int> = 0>
	explicit FPBits(XType x) : integer(x) {}			explicit FPBits(XType x) : bits(x) {}

	FPBits() : integer(0) {}			FPBits() : bits(0) {}

	explicit operator T() { return val; }			explicit operator T() { return val; }

	UIntType uintval() const { return integer; }			UIntType uintval() const { return bits; }

	int getExponent() const { return int(encoding.exponent) - exponentBias; }			int getExponent() const { return int(getUnbiasedExponent()) - exponentBias; }

	bool isZero() const {			bool isZero() const {
	return encoding.mantissa == 0 && encoding.exponent == 0;			return getMantissa() == 0 && getUnbiasedExponent() == 0;
	}			}

	bool isInf() const {			bool isInf() const {
	return encoding.mantissa == 0 && encoding.exponent == maxExponent;			return getMantissa() == 0 && getUnbiasedExponent() == maxExponent;
	}			}

	bool isNaN() const {			bool isNaN() const {
	return encoding.exponent == maxExponent && encoding.mantissa != 0;			return getUnbiasedExponent() == maxExponent && getMantissa() != 0;
	}			}

	bool isInfOrNaN() const { return encoding.exponent == maxExponent; }			bool isInfOrNaN() const { return getUnbiasedExponent() == maxExponent; }

	static FPBits<T> zero() { return FPBits(); }			static FPBits<T> zero() { return FPBits(); }

	static FPBits<T> negZero() {			static FPBits<T> negZero() {
	return FPBits(UIntType(1) << (sizeof(UIntType) * 8 - 1));			return FPBits(UIntType(1) << (sizeof(UIntType) * 8 - 1));
	}			}

	static FPBits<T> inf() {			static FPBits<T> inf() {
	FPBits<T> bits;			FPBits<T> bits;
	bits.encoding.exponent = maxExponent;			bits.setUnbiasedExponent(maxExponent);
	return bits;			return bits;
	}			}

	static FPBits<T> negInf() {			static FPBits<T> negInf() {
	FPBits<T> bits = inf();			FPBits<T> bits = inf();
	bits.encoding.sign = 1;			bits.setSign(1);
	return bits;			return bits;
	}			}

	static T buildNaN(UIntType v) {			static T buildNaN(UIntType v) {
	FPBits<T> bits = inf();			FPBits<T> bits = inf();
	bits.encoding.mantissa = v;			bits.setMantissa(v);
	return T(bits);			return T(bits);
	}			}
	};			};

	} // namespace fputil			} // namespace fputil
	} // namespace __llvm_libc			} // namespace __llvm_libc

	#ifdef SPECIAL_X86_LONG_DOUBLE			#ifdef SPECIAL_X86_LONG_DOUBLE
	#include "utils/FPUtil/LongDoubleBitsX86.h"			#include "utils/FPUtil/LongDoubleBitsX86.h"
	#endif			#endif

	#endif // LLVM_LIBC_UTILS_FPUTIL_FP_BITS_H			#endif // LLVM_LIBC_UTILS_FPUTIL_FP_BITS_H

libc/utils/FPUtil/Hypot.h

[133 lines not shown]	if (y_bits.isNaN()) {
return y;		return y;
}		}

uint16_t a_exp, b_exp, out_exp;		uint16_t a_exp, b_exp, out_exp;
UIntType a_mant, b_mant;		UIntType a_mant, b_mant;
DUIntType a_mant_sq, b_mant_sq;		DUIntType a_mant_sq, b_mant_sq;
bool sticky_bits;		bool sticky_bits;

if ((x_bits.encoding.exponent >=		if ((x_bits.getUnbiasedExponent() >=
y_bits.encoding.exponent + MantissaWidth<T>::value + 2) ||		y_bits.getUnbiasedExponent() + MantissaWidth<T>::value + 2) ||
(y == 0)) {		(y == 0)) {
return abs(x);		return abs(x);
} else if ((y_bits.encoding.exponent >=		} else if ((y_bits.getUnbiasedExponent() >=
x_bits.encoding.exponent + MantissaWidth<T>::value + 2) ||		x_bits.getUnbiasedExponent() + MantissaWidth<T>::value + 2) ||
(x == 0)) {		(x == 0)) {
y_bits.encoding.sign = 0;		y_bits.setSign(0);
return abs(y);		return abs(y);
}		}

if (x >= y) {		if (x >= y) {
a_exp = x_bits.encoding.exponent;		a_exp = x_bits.getUnbiasedExponent();
a_mant = x_bits.encoding.mantissa;		a_mant = x_bits.getMantissa();
b_exp = y_bits.encoding.exponent;		b_exp = y_bits.getUnbiasedExponent();
b_mant = y_bits.encoding.mantissa;		b_mant = y_bits.getMantissa();
} else {		} else {
a_exp = y_bits.encoding.exponent;		a_exp = y_bits.getUnbiasedExponent();
a_mant = y_bits.encoding.mantissa;		a_mant = y_bits.getMantissa();
b_exp = x_bits.encoding.exponent;		b_exp = x_bits.getUnbiasedExponent();
b_mant = x_bits.encoding.mantissa;		b_mant = x_bits.getMantissa();
}		}

out_exp = a_exp;		out_exp = a_exp;

// Add an extra bit to simplify the final rounding bit computation.		// Add an extra bit to simplify the final rounding bit computation.
constexpr UIntType one = UIntType(1) << (MantissaWidth<T>::value + 1);		constexpr UIntType one = UIntType(1) << (MantissaWidth<T>::value + 1);

a_mant <<= 1;		a_mant <<= 1;
[98 lines not shown]

libc/utils/FPUtil/LongDoubleBitsX86.h

	[10 lines not shown]

	#include "FPBits.h"			#include "FPBits.h"

	#include <stdint.h>			#include <stdint.h>

	namespace __llvm_libc {			namespace __llvm_libc {
	namespace fputil {			namespace fputil {

	template <> struct MantissaWidth<long double> {
	static constexpr unsigned value = 63;
	};

	template <unsigned Width> struct Padding;			template <unsigned Width> struct Padding;

	// i386 padding.			// i386 padding.
	template <> struct Padding<4> { static constexpr unsigned value = 16; };			template <> struct Padding<4> { static constexpr unsigned value = 16; };

	// x86_64 padding.			// x86_64 padding.
	template <> struct Padding<8> { static constexpr unsigned value = 48; };			template <> struct Padding<8> { static constexpr unsigned value = 48; };

	template <> union FPBits<long double> {			template <> union FPBits<long double> {
	using UIntType = __uint128_t;			using UIntType = __uint128_t;

	static constexpr int exponentBias = 0x3FFF;			static constexpr int exponentBias = 0x3FFF;
	static constexpr int maxExponent = 0x7FFF;			static constexpr int maxExponent = 0x7FFF;
	static constexpr UIntType minSubnormal = UIntType(1);			static constexpr UIntType minSubnormal = UIntType(1);
	// Subnormal numbers include the implicit bit in x86 long double formats.			// Subnormal numbers include the implicit bit in x86 long double formats.
	static constexpr UIntType maxSubnormal =			static constexpr UIntType maxSubnormal =
	(UIntType(1) << (MantissaWidth<long double>::value)) - 1;			(UIntType(1) << (MantissaWidth<long double>::value)) - 1;
	static constexpr UIntType minNormal =			static constexpr UIntType minNormal =
	(UIntType(3) << MantissaWidth<long double>::value);			(UIntType(3) << MantissaWidth<long double>::value);
	static constexpr UIntType maxNormal =			static constexpr UIntType maxNormal =
	((UIntType(maxExponent) - 1) << (MantissaWidth<long double>::value + 1)) |			((UIntType(maxExponent) - 1) << (MantissaWidth<long double>::value + 1)) |
	(UIntType(1) << MantissaWidth<long double>::value) | maxSubnormal;			(UIntType(1) << MantissaWidth<long double>::value) | maxSubnormal;

	struct __attribute__((packed)) {			using FloatProp = FloatProperties<long double>;
	UIntType mantissa : MantissaWidth<long double>::value;
	uint8_t implicitBit : 1;			UIntType bits;
	uint16_t exponent : ExponentWidth<long double>::value;
	uint8_t sign : 1;			void setMantissa(UIntType mantVal) {
	uint64_t padding : Padding<sizeof(uintptr_t)>::value;			mantVal &= (FloatProp::mantissaMask);
	} encoding;			bits &= ~(FloatProp::mantissaMask);
	UIntType integer;			bits |= mantVal;
				}

				UIntType getMantissa() const { return bits & FloatProp::mantissaMask; }

				void setUnbiasedExponent(UIntType expVal) {
				expVal = (expVal << (FloatProp::bitWidth - 1 - FloatProp::exponentWidth)) &
				FloatProp::exponentMask;
				bits &= ~(FloatProp::exponentMask);
				bits |= expVal;
				}

				uint16_t getUnbiasedExponent() const {
				return uint16_t((bits & FloatProp::exponentMask) >>
				(FloatProp::bitWidth - 1 - FloatProp::exponentWidth));
				}

				void setImplicitBit(bool implicitVal) {
				bits &= ~(UIntType(1) << FloatProp::mantissaWidth);
				bits |= (UIntType(implicitVal) << FloatProp::mantissaWidth);
				}

				bool getImplicitBit() const {
				return ((bits & (UIntType(1) << FloatProp::mantissaWidth)) >>
				FloatProp::mantissaWidth);
				}

				void setSign(bool signVal) {
				bits &= ~(FloatProp::signMask);
				UIntType sign1 = UIntType(signVal) << (FloatProp::bitWidth - 1);
				bits |= sign1;
				}

				bool getSign() const {
				return ((bits & FloatProp::signMask) >> (FloatProp::bitWidth - 1));
				}

	long double val;			long double val;

	FPBits() : integer(0) {}			FPBits() : bits(0) {}

	template <typename XType,			template <typename XType,
	cpp::EnableIfType<cpp::IsSame<long double, XType>::Value, int> = 0>			cpp::EnableIfType<cpp::IsSame<long double, XType>::Value, int> = 0>
	explicit FPBits<long double>(XType x) : val(x) {}			explicit FPBits(XType x) : val(x) {}

	template <typename XType,			template <typename XType,
	cpp::EnableIfType<cpp::IsSame<XType, UIntType>::Value, int> = 0>			cpp::EnableIfType<cpp::IsSame<XType, UIntType>::Value, int> = 0>
	explicit FPBits(XType x) : integer(x) {}			explicit FPBits(XType x) : bits(x) {}

	operator long double() { return val; }			operator long double() { return val; }

	UIntType uintval() {			UIntType uintval() {
	// We zero the padding bits as they can contain garbage.			// We zero the padding bits as they can contain garbage.
	static constexpr UIntType mask =			static constexpr UIntType mask =
	(UIntType(1) << (sizeof(long double) * 8 -			(UIntType(1) << (sizeof(long double) * 8 -
	Padding<sizeof(uintptr_t)>::value)) -			Padding<sizeof(uintptr_t)>::value)) -
	1;			1;
	return integer & mask;			return bits & mask;
	}			}

	int getExponent() const {			int getExponent() const {
	if (encoding.exponent == 0)			if (getUnbiasedExponent() == 0)
	return int(1) - exponentBias;			return int(1) - exponentBias;
	return int(encoding.exponent) - exponentBias;			return int(getUnbiasedExponent()) - exponentBias;
	}			}

	bool isZero() const {			bool isZero() const {
	return encoding.exponent == 0 && encoding.mantissa == 0 &&			return getUnbiasedExponent() == 0 && getMantissa() == 0 &&
	encoding.implicitBit == 0;			getImplicitBit() == 0;
	}			}

	bool isInf() const {			bool isInf() const {
	return encoding.exponent == maxExponent && encoding.mantissa == 0 &&			return getUnbiasedExponent() == maxExponent && getMantissa() == 0 &&
	encoding.implicitBit == 1;			getImplicitBit() == 1;
	}			}

	bool isNaN() const {			bool isNaN() const {
	if (encoding.exponent == maxExponent) {			if (getUnbiasedExponent() == maxExponent) {
	return (encoding.implicitBit == 0) || encoding.mantissa != 0;			return (getImplicitBit() == 0) || getMantissa() != 0;
	} else if (encoding.exponent != 0) {			} else if (getUnbiasedExponent() != 0) {
	return encoding.implicitBit == 0;			return getImplicitBit() == 0;
	}			}
	return false;			return false;
	}			}

	bool isInfOrNaN() const {			bool isInfOrNaN() const {
	return (encoding.exponent == maxExponent) ||			return (getUnbiasedExponent() == maxExponent) ||
	(encoding.exponent != 0 && encoding.implicitBit == 0);			(getUnbiasedExponent() != 0 && getImplicitBit() == 0);
	}			}

	// Methods below this are used by tests.			// Methods below this are used by tests.

	static FPBits<long double> zero() { return FPBits<long double>(0.0l); }			static FPBits<long double> zero() { return FPBits<long double>(0.0l); }

	static FPBits<long double> negZero() {			static FPBits<long double> negZero() {
	FPBits<long double> bits(0.0l);			FPBits<long double> bits(0.0l);
	bits.encoding.sign = 1;			bits.setSign(1);
	return bits;			return bits;
	}			}

	static FPBits<long double> inf() {			static FPBits<long double> inf() {
	FPBits<long double> bits(0.0l);			FPBits<long double> bits(0.0l);
	bits.encoding.exponent = maxExponent;			bits.setUnbiasedExponent(maxExponent);
	bits.encoding.implicitBit = 1;			bits.setImplicitBit(1);
	return bits;			return bits;
	}			}

	static FPBits<long double> negInf() {			static FPBits<long double> negInf() {
	FPBits<long double> bits(0.0l);			FPBits<long double> bits(0.0l);
	bits.encoding.exponent = maxExponent;			bits.setUnbiasedExponent(maxExponent);
	bits.encoding.implicitBit = 1;			bits.setImplicitBit(1);
	bits.encoding.sign = 1;			bits.setSign(1);
	return bits;			return bits;
	}			}

	static long double buildNaN(UIntType v) {			static long double buildNaN(UIntType v) {
	FPBits<long double> bits(0.0l);			FPBits<long double> bits(0.0l);
	bits.encoding.exponent = maxExponent;			bits.setUnbiasedExponent(maxExponent);
	bits.encoding.implicitBit = 1;			bits.setImplicitBit(1);
	bits.encoding.mantissa = v;			bits.setMantissa(v);
	return bits;			return bits;
	}			}
	};			};

	static_assert(			static_assert(
	sizeof(FPBits<long double>) == sizeof(long double),			sizeof(FPBits<long double>) == sizeof(long double),
	"Internal long double representation does not match the machine format.");			"Internal long double representation does not match the machine format.");

	} // namespace fputil			} // namespace fputil
	} // namespace __llvm_libc			} // namespace __llvm_libc

	#endif // LLVM_LIBC_UTILS_FPUTIL_LONG_DOUBLE_BITS_X86_H			#endif // LLVM_LIBC_UTILS_FPUTIL_LONG_DOUBLE_BITS_X86_H

libc/utils/FPUtil/ManipulationFunctions.h

[42 lines not shown]	template <typename T,
cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>		cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
static inline T modf(T x, T &iptr) {		static inline T modf(T x, T &iptr) {
FPBits<T> bits(x);		FPBits<T> bits(x);
if (bits.isZero() || bits.isNaN()) {		if (bits.isZero() || bits.isNaN()) {
iptr = x;		iptr = x;
return x;		return x;
} else if (bits.isInf()) {		} else if (bits.isInf()) {
iptr = x;		iptr = x;
return bits.encoding.sign ? T(FPBits<T>::negZero()) : T(FPBits<T>::zero());		return bits.getSign() ? T(FPBits<T>::negZero()) : T(FPBits<T>::zero());
} else {		} else {
iptr = trunc(x);		iptr = trunc(x);
if (x == iptr) {		if (x == iptr) {
// If x is already an integer value, then return zero with the right		// If x is already an integer value, then return zero with the right
// sign.		// sign.
return bits.encoding.sign ? T(FPBits<T>::negZero())		return bits.getSign() ? T(FPBits<T>::negZero()) : T(FPBits<T>::zero());
: T(FPBits<T>::zero());
} else {		} else {
return x - iptr;		return x - iptr;
}		}
}		}
}		}

template <typename T,		template <typename T,
cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>		cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
static inline T copysign(T x, T y) {		static inline T copysign(T x, T y) {
FPBits<T> xbits(x);		FPBits<T> xbits(x);
xbits.encoding.sign = FPBits<T>(y).encoding.sign;		xbits.setSign(FPBits<T>(y).getSign());
return T(xbits);		return T(xbits);
}		}

template <typename T,		template <typename T,
cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>		cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
static inline int ilogb(T x) {		static inline int ilogb(T x) {
// TODO: Raise appropriate floating point exceptions and set errno to the		// TODO: Raise appropriate floating point exceptions and set errno to the
// an appropriate error value wherever relevant.		// an appropriate error value wherever relevant.
[50 lines not shown]	static inline T ldexp(T x, int exp) {
// NormalFloat uses int32_t to store the true exponent value. We should ensure		// NormalFloat uses int32_t to store the true exponent value. We should ensure
// that adding |exp| to it does not lead to integer rollover. But, if |exp|		// that adding |exp| to it does not lead to integer rollover. But, if |exp|
// value is larger the exponent range for type T, then we can return infinity		// value is larger the exponent range for type T, then we can return infinity
// early. Because the result of the ldexp operation can be a subnormal number,		// early. Because the result of the ldexp operation can be a subnormal number,
// we need to accommodate the (mantissaWidht + 1) worth of shift in		// we need to accommodate the (mantissaWidht + 1) worth of shift in
// calculating the limit.		// calculating the limit.
int expLimit = FPBits<T>::maxExponent + MantissaWidth<T>::value + 1;		int expLimit = FPBits<T>::maxExponent + MantissaWidth<T>::value + 1;
if (exp > expLimit)		if (exp > expLimit)
return bits.encoding.sign ? T(FPBits<T>::negInf()) : T(FPBits<T>::inf());		return bits.getSign() ? T(FPBits<T>::negInf()) : T(FPBits<T>::inf());

// Similarly on the negative side we return zero early if |exp| is too small.		// Similarly on the negative side we return zero early if |exp| is too small.
if (exp < -expLimit)		if (exp < -expLimit)
return bits.encoding.sign ? T(FPBits<T>::negZero()) : T(FPBits<T>::zero());		return bits.getSign() ? T(FPBits<T>::negZero()) : T(FPBits<T>::zero());

// For all other values, NormalFloat to T conversion handles it the right way.		// For all other values, NormalFloat to T conversion handles it the right way.
NormalFloat<T> normal(bits);		NormalFloat<T> normal(bits);
normal.exponent += exp;		normal.exponent += exp;
return normal;		return normal;
}		}

template <typename T,		template <typename T,
[38 lines not shown]

libc/utils/FPUtil/NearestIntegerOperations.h

[37 lines not shown]	static inline T trunc(T x) {

// If the exponent is greater than the most negative mantissa		// If the exponent is greater than the most negative mantissa
// exponent, then x is already an integer.		// exponent, then x is already an integer.
if (exponent >= static_cast<int>(MantissaWidth<T>::value))		if (exponent >= static_cast<int>(MantissaWidth<T>::value))
return x;		return x;

// If the exponent is such that abs(x) is less than 1, then return 0.		// If the exponent is such that abs(x) is less than 1, then return 0.
if (exponent <= -1) {		if (exponent <= -1) {
if (bits.encoding.sign)		if (bits.getSign())
return T(-0.0);		return T(-0.0);
else		else
return T(0.0);		return T(0.0);
}		}

int trimSize = MantissaWidth<T>::value - exponent;		int trimSize = MantissaWidth<T>::value - exponent;
bits.encoding.mantissa = (bits.encoding.mantissa >> trimSize) << trimSize;		bits.setMantissa((bits.getMantissa() >> trimSize) << trimSize);
return T(bits);		return T(bits);
}		}

template <typename T,		template <typename T,
cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>		cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
static inline T ceil(T x) {		static inline T ceil(T x) {
FPBits<T> bits(x);		FPBits<T> bits(x);

// If x is infinity NaN or zero, return it.		// If x is infinity NaN or zero, return it.
if (bits.isInfOrNaN() || bits.isZero())		if (bits.isInfOrNaN() || bits.isZero())
return x;		return x;

bool isNeg = bits.encoding.sign;		bool isNeg = bits.getSign();
int exponent = bits.getExponent();		int exponent = bits.getExponent();

// If the exponent is greater than the most negative mantissa		// If the exponent is greater than the most negative mantissa
// exponent, then x is already an integer.		// exponent, then x is already an integer.
if (exponent >= static_cast<int>(MantissaWidth<T>::value))		if (exponent >= static_cast<int>(MantissaWidth<T>::value))
return x;		return x;

if (exponent <= -1) {		if (exponent <= -1) {
if (isNeg)		if (isNeg)
return T(-0.0);		return T(-0.0);
else		else
return T(1.0);		return T(1.0);
}		}

uint32_t trimSize = MantissaWidth<T>::value - exponent;		uint32_t trimSize = MantissaWidth<T>::value - exponent;
bits.encoding.mantissa = (bits.encoding.mantissa >> trimSize) << trimSize;		bits.setMantissa((bits.getMantissa() >> trimSize) << trimSize);
T truncValue = T(bits);		T truncValue = T(bits);

// If x is already an integer, return it.		// If x is already an integer, return it.
if (truncValue == x)		if (truncValue == x)
return x;		return x;

// If x is negative, the ceil operation is equivalent to the trunc operation.		// If x is negative, the ceil operation is equivalent to the trunc operation.
if (isNeg)		if (isNeg)
return truncValue;		return truncValue;

return truncValue + T(1.0);		return truncValue + T(1.0);
}		}

template <typename T,		template <typename T,
cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>		cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
static inline T floor(T x) {		static inline T floor(T x) {
FPBits<T> bits(x);		FPBits<T> bits(x);
if (bits.encoding.sign) {		if (bits.getSign()) {
return -ceil(-x);		return -ceil(-x);
} else {		} else {
return trunc(x);		return trunc(x);
}		}
}		}

template <typename T,		template <typename T,
cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>		cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
static inline T round(T x) {		static inline T round(T x) {
using UIntType = typename FPBits<T>::UIntType;		using UIntType = typename FPBits<T>::UIntType;
FPBits<T> bits(x);		FPBits<T> bits(x);

// If x is infinity NaN or zero, return it.		// If x is infinity NaN or zero, return it.
if (bits.isInfOrNaN() || bits.isZero())		if (bits.isInfOrNaN() || bits.isZero())
return x;		return x;

bool isNeg = bits.encoding.sign;		bool isNeg = bits.getSign();
int exponent = bits.getExponent();		int exponent = bits.getExponent();

// If the exponent is greater than the most negative mantissa		// If the exponent is greater than the most negative mantissa
// exponent, then x is already an integer.		// exponent, then x is already an integer.
if (exponent >= static_cast<int>(MantissaWidth<T>::value))		if (exponent >= static_cast<int>(MantissaWidth<T>::value))
return x;		return x;

if (exponent == -1) {		if (exponent == -1) {
// Absolute value of x is greater than or equal to 0.5 but less than 1.		// Absolute value of x is greater than or equal to 0.5 but less than 1.
if (isNeg)		if (isNeg)
return T(-1.0);		return T(-1.0);
else		else
return T(1.0);		return T(1.0);
}		}

if (exponent <= -2) {		if (exponent <= -2) {
// Absolute value of x is less than 0.5.		// Absolute value of x is less than 0.5.
if (isNeg)		if (isNeg)
return T(-0.0);		return T(-0.0);
else		else
return T(0.0);		return T(0.0);
}		}

uint32_t trimSize = MantissaWidth<T>::value - exponent;		uint32_t trimSize = MantissaWidth<T>::value - exponent;
bool halfBitSet = bits.encoding.mantissa & (UIntType(1) << (trimSize - 1));		bool halfBitSet = bits.getMantissa() & (UIntType(1) << (trimSize - 1));
bits.encoding.mantissa = (bits.encoding.mantissa >> trimSize) << trimSize;		bits.setMantissa((bits.getMantissa() >> trimSize) << trimSize);
T truncValue = T(bits);		T truncValue = T(bits);

// If x is already an integer, return it.		// If x is already an integer, return it.
if (truncValue == x)		if (truncValue == x)
return x;		return x;

if (!halfBitSet) {		if (!halfBitSet) {
// Fractional part is less than 0.5 so round value is the		// Fractional part is less than 0.5 so round value is the
[9 lines not shown]
static inline T roundUsingCurrentRoundingMode(T x) {		static inline T roundUsingCurrentRoundingMode(T x) {
using UIntType = typename FPBits<T>::UIntType;		using UIntType = typename FPBits<T>::UIntType;
FPBits<T> bits(x);		FPBits<T> bits(x);

// If x is infinity NaN or zero, return it.		// If x is infinity NaN or zero, return it.
if (bits.isInfOrNaN() || bits.isZero())		if (bits.isInfOrNaN() || bits.isZero())
return x;		return x;

bool isNeg = bits.encoding.sign;		bool isNeg = bits.getSign();
int exponent = bits.getExponent();		int exponent = bits.getExponent();
int roundingMode = getRound();		int roundingMode = getRound();

// If the exponent is greater than the most negative mantissa		// If the exponent is greater than the most negative mantissa
// exponent, then x is already an integer.		// exponent, then x is already an integer.
if (exponent >= static_cast<int>(MantissaWidth<T>::value))		if (exponent >= static_cast<int>(MantissaWidth<T>::value))
return x;		return x;

if (exponent <= -1) {		if (exponent <= -1) {
switch (roundingMode) {		switch (roundingMode) {
case FE_DOWNWARD:		case FE_DOWNWARD:
return isNeg ? T(-1.0) : T(0.0);		return isNeg ? T(-1.0) : T(0.0);
case FE_UPWARD:		case FE_UPWARD:
return isNeg ? T(-0.0) : T(1.0);		return isNeg ? T(-0.0) : T(1.0);
case FE_TOWARDZERO:		case FE_TOWARDZERO:
return isNeg ? T(-0.0) : T(0.0);		return isNeg ? T(-0.0) : T(0.0);
case FE_TONEAREST:		case FE_TONEAREST:
if (exponent <= -2 || bits.encoding.mantissa == 0)		if (exponent <= -2 || bits.getMantissa() == 0)
return isNeg ? T(-0.0) : T(0.0); // abs(x) <= 0.5		return isNeg ? T(-0.0) : T(0.0); // abs(x) <= 0.5
else		else
return isNeg ? T(-1.0) : T(1.0); // abs(x) > 0.5		return isNeg ? T(-1.0) : T(1.0); // abs(x) > 0.5
default:		default:
__builtin_unreachable();		__builtin_unreachable();
}		}
}		}

uint32_t trimSize = MantissaWidth<T>::value - exponent;		uint32_t trimSize = MantissaWidth<T>::value - exponent;
FPBits<T> newBits = bits;		FPBits<T> newBits = bits;
newBits.encoding.mantissa = (bits.encoding.mantissa >> trimSize) << trimSize;		newBits.setMantissa((bits.getMantissa() >> trimSize) << trimSize);
T truncValue = T(newBits);		T truncValue = T(newBits);

// If x is already an integer, return it.		// If x is already an integer, return it.
if (truncValue == x)		if (truncValue == x)
return x;		return x;

UIntType trimValue = bits.encoding.mantissa & ((UIntType(1) << trimSize) - 1);		UIntType trimValue = bits.getMantissa() & ((UIntType(1) << trimSize) - 1);
UIntType halfValue = (UIntType(1) << (trimSize - 1));		UIntType halfValue = (UIntType(1) << (trimSize - 1));
// If exponent is 0, trimSize will be equal to the mantissa width, and		// If exponent is 0, trimSize will be equal to the mantissa width, and
// `truncIsOdd` will not be correct. So, we handle it as a special case		// `truncIsOdd` will not be correct. So, we handle it as a special case
// below.		// below.
UIntType truncIsOdd = newBits.encoding.mantissa & (UIntType(1) << trimSize);		UIntType truncIsOdd = newBits.getMantissa() & (UIntType(1) << trimSize);

switch (roundingMode) {		switch (roundingMode) {
case FE_DOWNWARD:		case FE_DOWNWARD:
return isNeg ? truncValue - T(1.0) : truncValue;		return isNeg ? truncValue - T(1.0) : truncValue;
case FE_UPWARD:		case FE_UPWARD:
return isNeg ? truncValue : truncValue + T(1.0);		return isNeg ? truncValue : truncValue + T(1.0);
case FE_TOWARDZERO:		case FE_TOWARDZERO:
return truncValue;		return truncValue;
[31 lines not shown]
#endif		#endif
#if math_errhandling & MATH_ERREXCEPT		#if math_errhandling & MATH_ERREXCEPT
raiseExcept(FE_INVALID);		raiseExcept(FE_INVALID);
#endif		#endif
};		};

if (bits.isInfOrNaN()) {		if (bits.isInfOrNaN()) {
setDomainErrorAndRaiseInvalid();		setDomainErrorAndRaiseInvalid();
return bits.encoding.sign ? IntegerMin : IntegerMax;		return bits.getSign() ? IntegerMin : IntegerMax;
}		}

int exponent = bits.getExponent();		int exponent = bits.getExponent();
constexpr int exponentLimit = sizeof(I) * 8 - 1;		constexpr int exponentLimit = sizeof(I) * 8 - 1;
if (exponent > exponentLimit) {		if (exponent > exponentLimit) {
setDomainErrorAndRaiseInvalid();		setDomainErrorAndRaiseInvalid();
return bits.encoding.sign ? IntegerMin : IntegerMax;		return bits.getSign() ? IntegerMin : IntegerMax;
} else if (exponent == exponentLimit) {		} else if (exponent == exponentLimit) {
if (bits.encoding.sign == 0 || bits.encoding.mantissa != 0) {		if (bits.getSign() == 0 || bits.getMantissa() != 0) {
setDomainErrorAndRaiseInvalid();		setDomainErrorAndRaiseInvalid();
return bits.encoding.sign ? IntegerMin : IntegerMax;		return bits.getSign() ? IntegerMin : IntegerMax;
}		}
// If the control reaches here, then it means that the rounded		// If the control reaches here, then it means that the rounded
// value is the most negative number for the signed integer type I.		// value is the most negative number for the signed integer type I.
}		}

// For all other cases, if `x` can fit in the integer type `I`,		// For all other cases, if `x` can fit in the integer type `I`,
// we just return `x`. Implicit conversion will convert the		// we just return `x`. Implicit conversion will convert the
// floating point value to the exact integer value.		// floating point value to the exact integer value.
[26 lines not shown]
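
The `trunc` routine earlier in this file clears the low `trimSize` mantissa bits to discard the fractional part. A minimal standalone sketch of that idea for `float` follows; it assumes the IEEE-754 binary32 layout, and `truncSketch` is a hypothetical name that does not use the `FPBits` class from this patch.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical illustration only: truncates a float toward zero by masking
// mantissa bits, assuming the IEEE-754 binary32 layout (1/8/23 bits).
inline float truncSketch(float x) {
  constexpr int MantissaWidth = 23;
  constexpr uint32_t MantissaMask = (uint32_t(1) << MantissaWidth) - 1;

  uint32_t bits;
  std::memcpy(&bits, &x, sizeof(bits));

  const int storedExponent = int((bits >> MantissaWidth) & 0xFF);
  if (storedExponent == 0xFF)
    return x; // Infinity or NaN: return the input unchanged.

  const int exponent = storedExponent - 127; // Remove the exponent bias.

  // All mantissa bits already represent integer digits.
  if (exponent >= MantissaWidth)
    return x;

  // abs(x) < 1 (this also covers zero and subnormals): return a signed zero.
  if (exponent <= -1)
    return (bits >> 31) ? -0.0f : 0.0f;

  // Clear the trimSize low mantissa bits, which hold the fractional part.
  const int trimSize = MantissaWidth - exponent;
  const uint32_t mantissa = ((bits & MantissaMask) >> trimSize) << trimSize;
  bits = (bits & ~MantissaMask) | mantissa;

  float result;
  std::memcpy(&result, &bits, sizeof(result));
  return result;
}
```

Calling `truncSketch(2.75f)` takes the masking path with `trimSize` equal to 22, while `truncSketch(0.5f)` returns from the early signed-zero branch.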

libc/utils/FPUtil/NextAfterLongDoubleX86.h

[24 lines not shown]	static inline long double nextafter(long double from, long double to) {
FPBits toBits(to);		FPBits toBits(to);
if (toBits.isNaN())		if (toBits.isNaN())
return to;		return to;

if (from == to)		if (from == to)
return to;		return to;

// Convert pseudo subnormal number to normal number.		// Convert pseudo subnormal number to normal number.
if (fromBits.encoding.implicitBit == 1 && fromBits.encoding.exponent == 0) {		if (fromBits.getImplicitBit() == 1 && fromBits.getUnbiasedExponent() == 0) {
fromBits.encoding.exponent = 1;		fromBits.setUnbiasedExponent(1);
}		}

using UIntType = FPBits::UIntType;		using UIntType = FPBits::UIntType;
constexpr UIntType signVal = (UIntType(1) << 79);		constexpr UIntType signVal = (UIntType(1) << 79);
constexpr UIntType mantissaMask =		constexpr UIntType mantissaMask =
(UIntType(1) << MantissaWidth<long double>::value) - 1;		(UIntType(1) << MantissaWidth<long double>::value) - 1;
UIntType intVal = fromBits.uintval();		UIntType intVal = fromBits.uintval();
if (from < 0.0l) {		if (from < 0.0l) {
if (from > to) {		if (from > to) {
if (intVal == (signVal + FPBits::maxSubnormal)) {		if (intVal == (signVal + FPBits::maxSubnormal)) {
// We deal with normal/subnormal boundary separately to avoid		// We deal with normal/subnormal boundary separately to avoid
// dealing with the implicit bit.		// dealing with the implicit bit.
intVal = signVal + FPBits::minNormal;		intVal = signVal + FPBits::minNormal;
} else if ((intVal & mantissaMask) == mantissaMask) {		} else if ((intVal & mantissaMask) == mantissaMask) {
fromBits.encoding.mantissa = 0;		fromBits.setMantissa(0);
// Incrementing exponent might overflow the value to infinity,		// Incrementing exponent might overflow the value to infinity,
// which is what is expected. Since NaNs are handled separately,		// which is what is expected. Since NaNs are handled separately,
// it will never overflow "beyond" infinity.		// it will never overflow "beyond" infinity.
++fromBits.encoding.exponent;		fromBits.setUnbiasedExponent(fromBits.getUnbiasedExponent() + 1);
		lntue: I'm a bit curious about the generated assembly of this line. @sivachandra: can we check if there is any regression with this one? I think no regression for O2 and O3 would be good enough.
		hedingarcia (Author): The object files generated before and after this patch are the same; at least, after running the check there was no difference between the files at the O2 optimization level.
return fromBits;		return fromBits;
} else {		} else {
++intVal;		++intVal;
}		}
} else {		} else {
if (intVal == (signVal + FPBits::minNormal)) {		if (intVal == (signVal + FPBits::minNormal)) {
// We deal with normal/subnormal boundary separately to avoid		// We deal with normal/subnormal boundary separately to avoid
// dealing with the implicit bit.		// dealing with the implicit bit.
intVal = signVal + FPBits::maxSubnormal;		intVal = signVal + FPBits::maxSubnormal;
} else if ((intVal & mantissaMask) == 0) {		} else if ((intVal & mantissaMask) == 0) {
fromBits.encoding.mantissa = mantissaMask;		fromBits.setMantissa(mantissaMask);
// from == 0 is handled separately so decrementing the exponent will not		// from == 0 is handled separately so decrementing the exponent will not
// lead to underflow.		// lead to underflow.
--fromBits.encoding.exponent;		fromBits.setUnbiasedExponent(fromBits.getUnbiasedExponent() - 1);
return fromBits;		return fromBits;
} else {		} else {
--intVal;		--intVal;
}		}
}		}
} else if (from == 0.0l) {		} else if (from == 0.0l) {
if (from > to)		if (from > to)
intVal = signVal + 1;		intVal = signVal + 1;
else		else
intVal = 1;		intVal = 1;
} else {		} else {
if (from > to) {		if (from > to) {
if (intVal == FPBits::minNormal) {		if (intVal == FPBits::minNormal) {
intVal = FPBits::maxSubnormal;		intVal = FPBits::maxSubnormal;
} else if ((intVal & mantissaMask) == 0) {		} else if ((intVal & mantissaMask) == 0) {
fromBits.encoding.mantissa = mantissaMask;		fromBits.setMantissa(mantissaMask);
// from == 0 is handled separately so decrementing the exponent will not		// from == 0 is handled separately so decrementing the exponent will not
// lead to underflow.		// lead to underflow.
--fromBits.encoding.exponent;		fromBits.setUnbiasedExponent(fromBits.getUnbiasedExponent() - 1);
return fromBits;		return fromBits;
} else {		} else {
--intVal;		--intVal;
}		}
} else {		} else {
if (intVal == FPBits::maxSubnormal) {		if (intVal == FPBits::maxSubnormal) {
intVal = FPBits::minNormal;		intVal = FPBits::minNormal;
} else if ((intVal & mantissaMask) == mantissaMask) {		} else if ((intVal & mantissaMask) == mantissaMask) {
fromBits.encoding.mantissa = 0;		fromBits.setMantissa(0);
// Incrementing exponent might overflow the value to infinity,		// Incrementing exponent might overflow the value to infinity,
// which is what is expected. Since NaNs are handled separately,		// which is what is expected. Since NaNs are handled separately,
// it will never overflow "beyond" infinity.		// it will never overflow "beyond" infinity.
++fromBits.encoding.exponent;		fromBits.setUnbiasedExponent(fromBits.getUnbiasedExponent() + 1);
return fromBits;		return fromBits;
} else {		} else {
++intVal;		++intVal;
}		}
}		}
}		}

return *reinterpret_cast<long double *>(&intVal);		return *reinterpret_cast<long double *>(&intVal);
// TODO: Raise floating point exceptions as required by the standard.		// TODO: Raise floating point exceptions as required by the standard.
}		}

} // namespace fputil		} // namespace fputil
} // namespace __llvm_libc		} // namespace __llvm_libc

#endif // LLVM_LIBC_UTILS_FPUTIL_NEXT_AFTER_LONG_DOUBLE_X86_H		#endif // LLVM_LIBC_UTILS_FPUTIL_NEXT_AFTER_LONG_DOUBLE_X86_H
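
The right-hand column of this diff replaces direct bit-field access (`encoding.mantissa`, `encoding.exponent`, `encoding.sign`) with getter and setter calls. One way such accessors could be layered over a single integer is sketched below for `float`; `DemoBits` and its masks are hypothetical and are not the `FPBits` definition from this patch. As in the diff, `getUnbiasedExponent` here returns the raw stored exponent field.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical stand-in for FPBits<float>, assuming IEEE-754 binary32.
// The whole encoding lives in one integer, so the object has the same size
// as the float it describes on any compiler, without packed attributes.
struct DemoBits {
  uint32_t bits;

  static constexpr uint32_t MantissaWidth = 23;
  static constexpr uint32_t MantissaMask = (uint32_t(1) << MantissaWidth) - 1;
  static constexpr uint32_t ExponentMask = uint32_t(0xFF) << MantissaWidth;
  static constexpr uint32_t SignMask = uint32_t(1) << 31;

  explicit DemoBits(float f) { std::memcpy(&bits, &f, sizeof(bits)); }

  uint32_t getMantissa() const { return bits & MantissaMask; }
  void setMantissa(uint32_t m) {
    bits = (bits & ~MantissaMask) | (m & MantissaMask);
  }

  // Returns the stored (biased) exponent field, matching how the diff
  // subtracts exponentBias from getUnbiasedExponent().
  uint16_t getUnbiasedExponent() const {
    return uint16_t((bits & ExponentMask) >> MantissaWidth);
  }
  void setUnbiasedExponent(uint16_t e) {
    bits = (bits & ~ExponentMask) |
           ((uint32_t(e) << MantissaWidth) & ExponentMask);
  }

  bool getSign() const { return (bits & SignMask) != 0; }
  void setSign(bool s) { bits = (bits & ~SignMask) | (s ? SignMask : 0u); }

  float value() const {
    float f;
    std::memcpy(&f, &bits, sizeof(f));
    return f;
  }
};

static_assert(sizeof(DemoBits) == sizeof(float),
              "The encoding must occupy exactly as many bytes as a float.");
```

With this layout, an increment such as `++fromBits.encoding.exponent` becomes a read-modify-write through the accessors, which is the form the new code takes.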

libc/utils/FPUtil/NormalFloat.h

[91 lines not shown]	operator T() const {
int biasedExponent = exponent + FPBits<T>::exponentBias;		int biasedExponent = exponent + FPBits<T>::exponentBias;
// Max exponent is of the form 0xFF...E. That is why -2 and not -1.		// Max exponent is of the form 0xFF...E. That is why -2 and not -1.
constexpr int maxExponentValue = (1 << ExponentWidth<T>::value) - 2;		constexpr int maxExponentValue = (1 << ExponentWidth<T>::value) - 2;
if (biasedExponent > maxExponentValue) {		if (biasedExponent > maxExponentValue) {
return sign ? T(FPBits<T>::negInf()) : T(FPBits<T>::inf());		return sign ? T(FPBits<T>::negInf()) : T(FPBits<T>::inf());
}		}

FPBits<T> result(T(0.0));		FPBits<T> result(T(0.0));
result.encoding.sign = sign;		result.setSign(sign);

constexpr int subnormalExponent = -FPBits<T>::exponentBias + 1;		constexpr int subnormalExponent = -FPBits<T>::exponentBias + 1;
if (exponent < subnormalExponent) {		if (exponent < subnormalExponent) {
unsigned shift = subnormalExponent - exponent;		unsigned shift = subnormalExponent - exponent;
// Since exponent < subnormalExponent, shift is strictly greater than		// Since exponent < subnormalExponent, shift is strictly greater than
// zero.		// zero.
if (shift <= MantissaWidth<T>::value + 1) {		if (shift <= MantissaWidth<T>::value + 1) {
// Generate a subnormal number. Might lead to loss of precision.		// Generate a subnormal number. Might lead to loss of precision.
// We round to nearest and round halfway cases to even.		// We round to nearest and round halfway cases to even.
const UIntType shiftOutMask = (UIntType(1) << shift) - 1;		const UIntType shiftOutMask = (UIntType(1) << shift) - 1;
const UIntType shiftOutValue = mantissa & shiftOutMask;		const UIntType shiftOutValue = mantissa & shiftOutMask;
const UIntType halfwayValue = UIntType(1) << (shift - 1);		const UIntType halfwayValue = UIntType(1) << (shift - 1);
result.encoding.exponent = 0;		result.setUnbiasedExponent(0);
result.encoding.mantissa = mantissa >> shift;		result.setMantissa(mantissa >> shift);
UIntType newMantissa = result.encoding.mantissa;		UIntType newMantissa = result.getMantissa();
if (shiftOutValue > halfwayValue) {		if (shiftOutValue > halfwayValue) {
newMantissa += 1;		newMantissa += 1;
} else if (shiftOutValue == halfwayValue) {		} else if (shiftOutValue == halfwayValue) {
// Round to even.		// Round to even.
if (result.encoding.mantissa & 0x1)		if (result.getMantissa() & 0x1)
newMantissa += 1;		newMantissa += 1;
}		}
result.encoding.mantissa = newMantissa;		result.setMantissa(newMantissa);
// Adding 1 to mantissa can lead to overflow. This can only happen if		// Adding 1 to mantissa can lead to overflow. This can only happen if
// mantissa was all ones (0b111..11). For such a case, we will carry		// mantissa was all ones (0b111..11). For such a case, we will carry
// the overflow into the exponent.		// the overflow into the exponent.
if (newMantissa == one)		if (newMantissa == one)
result.encoding.exponent = 1;		result.setUnbiasedExponent(1);
return T(result);		return T(result);
} else {		} else {
return T(result);		return T(result);
}		}
}		}

result.encoding.exponent = exponent + FPBits<T>::exponentBias;		result.setUnbiasedExponent(exponent + FPBits<T>::exponentBias);
result.encoding.mantissa = mantissa;		result.setMantissa(mantissa);
return T(result);		return T(result);
}		}

private:		private:
void initFromBits(FPBits<T> bits) {		void initFromBits(FPBits<T> bits) {
sign = bits.encoding.sign;		sign = bits.getSign();

if (bits.isInfOrNaN() || bits.isZero()) {		if (bits.isInfOrNaN() || bits.isZero()) {
// Ignore special bit patterns. Implementations deal with them separately		// Ignore special bit patterns. Implementations deal with them separately
// anyway so this should not be a problem.		// anyway so this should not be a problem.
exponent = 0;		exponent = 0;
mantissa = 0;		mantissa = 0;
return;		return;
}		}

// Normalize subnormal numbers.		// Normalize subnormal numbers.
if (bits.encoding.exponent == 0) {		if (bits.getUnbiasedExponent() == 0) {
unsigned shift = evaluateNormalizationShift(bits.encoding.mantissa);		unsigned shift = evaluateNormalizationShift(bits.getMantissa());
mantissa = UIntType(bits.encoding.mantissa) << shift;		mantissa = UIntType(bits.getMantissa()) << shift;
exponent = 1 - FPBits<T>::exponentBias - shift;		exponent = 1 - FPBits<T>::exponentBias - shift;
} else {		} else {
exponent = bits.encoding.exponent - FPBits<T>::exponentBias;		exponent = bits.getUnbiasedExponent() - FPBits<T>::exponentBias;
mantissa = one | bits.encoding.mantissa;		mantissa = one | bits.getMantissa();
}		}
}		}

unsigned evaluateNormalizationShift(UIntType m) {		unsigned evaluateNormalizationShift(UIntType m) {
unsigned shift = 0;		unsigned shift = 0;
for (; (one & m) == 0 && (shift < MantissaWidth<T>::value);		for (; (one & m) == 0 && (shift < MantissaWidth<T>::value);
m <<= 1, ++shift)		m <<= 1, ++shift)
;		;
return shift;		return shift;
}		}
};		};

#ifdef SPECIAL_X86_LONG_DOUBLE		#ifdef SPECIAL_X86_LONG_DOUBLE
template <>		template <>
inline void NormalFloat<long double>::initFromBits(FPBits<long double> bits) {		inline void NormalFloat<long double>::initFromBits(FPBits<long double> bits) {
sign = bits.encoding.sign;		sign = bits.getSign();

if (bits.isInfOrNaN() || bits.isZero()) {		if (bits.isInfOrNaN() || bits.isZero()) {
// Ignore special bit patterns. Implementations deal with them separately		// Ignore special bit patterns. Implementations deal with them separately
// anyway so this should not be a problem.		// anyway so this should not be a problem.
exponent = 0;		exponent = 0;
mantissa = 0;		mantissa = 0;
return;		return;
}		}

if (bits.encoding.exponent == 0) {		if (bits.getUnbiasedExponent() == 0) {
if (bits.encoding.implicitBit == 0) {		if (bits.getImplicitBit() == 0) {
// Since we ignore zero value, the mantissa in this case is non-zero.		// Since we ignore zero value, the mantissa in this case is non-zero.
int normalizationShift =		int normalizationShift = evaluateNormalizationShift(bits.getMantissa());
evaluateNormalizationShift(bits.encoding.mantissa);
exponent = -16382 - normalizationShift;		exponent = -16382 - normalizationShift;
mantissa = (bits.encoding.mantissa << normalizationShift);		mantissa = (bits.getMantissa() << normalizationShift);
} else {		} else {
exponent = -16382;		exponent = -16382;
mantissa = one | bits.encoding.mantissa;		mantissa = one | bits.getMantissa();
}		}
} else {		} else {
if (bits.encoding.implicitBit == 0) {		if (bits.getImplicitBit() == 0) {
// Invalid number so just store 0 similar to a NaN.		// Invalid number so just store 0 similar to a NaN.
exponent = 0;		exponent = 0;
mantissa = 0;		mantissa = 0;
} else {		} else {
exponent = bits.encoding.exponent - 16383;		exponent = bits.getUnbiasedExponent() - 16383;
mantissa = one | bits.encoding.mantissa;		mantissa = one | bits.getMantissa();
}		}
}		}
}		}

template <> inline NormalFloat<long double>::operator long double() const {		template <> inline NormalFloat<long double>::operator long double() const {
int biasedExponent = exponent + FPBits<long double>::exponentBias;		int biasedExponent = exponent + FPBits<long double>::exponentBias;
// Max exponent is of the form 0xFF...E. That is why -2 and not -1.		// Max exponent is of the form 0xFF...E. That is why -2 and not -1.
constexpr int maxExponentValue = (1 << ExponentWidth<long double>::value) - 2;		constexpr int maxExponentValue = (1 << ExponentWidth<long double>::value) - 2;
if (biasedExponent > maxExponentValue) {		if (biasedExponent > maxExponentValue) {
return sign ? FPBits<long double>::negInf() : FPBits<long double>::inf();		return sign ? FPBits<long double>::negInf() : FPBits<long double>::inf();
}		}

FPBits<long double> result(0.0l);		FPBits<long double> result(0.0l);
result.encoding.sign = sign;		result.setSign(sign);

constexpr int subnormalExponent = -FPBits<long double>::exponentBias + 1;		constexpr int subnormalExponent = -FPBits<long double>::exponentBias + 1;
if (exponent < subnormalExponent) {		if (exponent < subnormalExponent) {
unsigned shift = subnormalExponent - exponent;		unsigned shift = subnormalExponent - exponent;
if (shift <= MantissaWidth<long double>::value + 1) {		if (shift <= MantissaWidth<long double>::value + 1) {
// Generate a subnormal number. Might lead to loss of precision.		// Generate a subnormal number. Might lead to loss of precision.
// We round to nearest and round halfway cases to even.		// We round to nearest and round halfway cases to even.
const UIntType shiftOutMask = (UIntType(1) << shift) - 1;		const UIntType shiftOutMask = (UIntType(1) << shift) - 1;
const UIntType shiftOutValue = mantissa & shiftOutMask;		const UIntType shiftOutValue = mantissa & shiftOutMask;
const UIntType halfwayValue = UIntType(1) << (shift - 1);		const UIntType halfwayValue = UIntType(1) << (shift - 1);
result.encoding.exponent = 0;		result.setUnbiasedExponent(0);
result.encoding.mantissa = mantissa >> shift;		result.setMantissa(mantissa >> shift);
UIntType newMantissa = result.encoding.mantissa;		UIntType newMantissa = result.getMantissa();
if (shiftOutValue > halfwayValue) {		if (shiftOutValue > halfwayValue) {
newMantissa += 1;		newMantissa += 1;
} else if (shiftOutValue == halfwayValue) {		} else if (shiftOutValue == halfwayValue) {
// Round to even.		// Round to even.
if (result.encoding.mantissa & 0x1)		if (result.getMantissa() & 0x1)
newMantissa += 1;		newMantissa += 1;
}		}
result.encoding.mantissa = newMantissa;		result.setMantissa(newMantissa);
// Adding 1 to mantissa can lead to overflow. This can only happen if		// Adding 1 to mantissa can lead to overflow. This can only happen if
// mantissa was all ones (0b111..11). For such a case, we will carry		// mantissa was all ones (0b111..11). For such a case, we will carry
// the overflow into the exponent and set the implicit bit to 1.		// the overflow into the exponent and set the implicit bit to 1.
if (newMantissa == one) {		if (newMantissa == one) {
result.encoding.exponent = 1;		result.setUnbiasedExponent(1);
result.encoding.implicitBit = 1;		result.setImplicitBit(1);
} else {		} else {
result.encoding.implicitBit = 0;		result.setImplicitBit(0);
}		}
return static_cast<long double>(result);		return static_cast<long double>(result);
} else {		} else {
return static_cast<long double>(result);		return static_cast<long double>(result);
}		}
}		}

result.encoding.exponent = biasedExponent;		result.setUnbiasedExponent(biasedExponent);
result.encoding.mantissa = mantissa;		result.setMantissa(mantissa);
result.encoding.implicitBit = 1;		result.setImplicitBit(1);
return static_cast<long double>(result);		return static_cast<long double>(result);
}		}
#endif // SPECIAL_X86_LONG_DOUBLE		#endif // SPECIAL_X86_LONG_DOUBLE

} // namespace fputil		} // namespace fputil
} // namespace __llvm_libc		} // namespace __llvm_libc

#endif // LLVM_LIBC_UTILS_FPUTIL_NORMAL_FLOAT_H		#endif // LLVM_LIBC_UTILS_FPUTIL_NORMAL_FLOAT_H
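
When `operator T()` above produces a subnormal result, it shifts the mantissa right and rounds to nearest, with halfway cases going to even. The rounding step in isolation might look like the following sketch; `shiftRightRoundNearestEven` is a hypothetical helper name, and the 64-bit width is an assumption made for illustration.

```cpp
#include <cstdint>

// Hypothetical helper: shifts mantissa right by shift bits and rounds the
// result to nearest, resolving ties toward an even result.
// Precondition (assumed by this sketch): 1 <= shift <= 63.
inline uint64_t shiftRightRoundNearestEven(uint64_t mantissa, unsigned shift) {
  const uint64_t shiftOutMask = (uint64_t(1) << shift) - 1;
  const uint64_t shiftOutValue = mantissa & shiftOutMask; // Discarded bits.
  const uint64_t halfwayValue = uint64_t(1) << (shift - 1);

  uint64_t result = mantissa >> shift;
  if (shiftOutValue > halfwayValue) {
    result += 1; // More than halfway: round up.
  } else if (shiftOutValue == halfwayValue && (result & 0x1)) {
    result += 1; // Exactly halfway and odd: round up to the even neighbour.
  }
  return result;
}
```

A carry out of the top mantissa bit is not handled here; in the diff that case is detected separately by comparing `newMantissa` against `one` and bumping the exponent.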

libc/utils/FPUtil/Sqrt.h

[96 lines not shown]	template <typename T,
cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>		cpp::EnableIfType<cpp::IsFloatingPointType<T>::Value, int> = 0>
static inline T sqrt(T x) {		static inline T sqrt(T x) {
using UIntType = typename FPBits<T>::UIntType;		using UIntType = typename FPBits<T>::UIntType;
constexpr UIntType One = UIntType(1) << MantissaWidth<T>::value;		constexpr UIntType One = UIntType(1) << MantissaWidth<T>::value;

FPBits<T> bits(x);		FPBits<T> bits(x);

if (bits.isInfOrNaN()) {		if (bits.isInfOrNaN()) {
if (bits.encoding.sign && (bits.encoding.mantissa == 0)) {		if (bits.getSign() && (bits.getMantissa() == 0)) {
// sqrt(-Inf) = NaN		// sqrt(-Inf) = NaN
return FPBits<T>::buildNaN(One >> 1);		return FPBits<T>::buildNaN(One >> 1);
} else {		} else {
// sqrt(NaN) = NaN		// sqrt(NaN) = NaN
// sqrt(+Inf) = +Inf		// sqrt(+Inf) = +Inf
return x;		return x;
}		}
} else if (bits.isZero()) {		} else if (bits.isZero()) {
// sqrt(+0) = +0		// sqrt(+0) = +0
// sqrt(-0) = -0		// sqrt(-0) = -0
return x;		return x;
} else if (bits.encoding.sign) {		} else if (bits.getSign()) {
// sqrt( negative numbers ) = NaN		// sqrt( negative numbers ) = NaN
return FPBits<T>::buildNaN(One >> 1);		return FPBits<T>::buildNaN(One >> 1);
} else {		} else {
int xExp = bits.getExponent();		int xExp = bits.getExponent();
UIntType xMant = bits.encoding.mantissa;		UIntType xMant = bits.getMantissa();

// Step 1a: Normalize denormal input and append hidden bit to the mantissa		// Step 1a: Normalize denormal input and append hidden bit to the mantissa
if (bits.encoding.exponent == 0) {		if (bits.getUnbiasedExponent() == 0) {
++xExp; // let xExp be the correct exponent of One bit.		++xExp; // let xExp be the correct exponent of One bit.
internal::normalize<T>(xExp, xMant);		internal::normalize<T>(xExp, xMant);
} else {		} else {
xMant |= One;		xMant |= One;
}		}

// Step 1b: Make sure the exponent is even.		// Step 1b: Make sure the exponent is even.
if (xExp & 1) {		if (xExp & 1) {
[59 lines not shown]

libc/utils/FPUtil/SqrtLongDoubleX86.h

[44 lines not shown]
template <> inline long double sqrt<long double, 0>(long double x) {		template <> inline long double sqrt<long double, 0>(long double x) {
using UIntType = typename FPBits<long double>::UIntType;		using UIntType = typename FPBits<long double>::UIntType;
constexpr UIntType One = UIntType(1)		constexpr UIntType One = UIntType(1)
<< int(MantissaWidth<long double>::value);		<< int(MantissaWidth<long double>::value);

FPBits<long double> bits(x);		FPBits<long double> bits(x);

if (bits.isInfOrNaN()) {		if (bits.isInfOrNaN()) {
if (bits.encoding.sign && (bits.encoding.mantissa == 0)) {		if (bits.getSign() && (bits.getMantissa() == 0)) {
// sqrt(-Inf) = NaN		// sqrt(-Inf) = NaN
return FPBits<long double>::buildNaN(One >> 1);		return FPBits<long double>::buildNaN(One >> 1);
} else {		} else {
// sqrt(NaN) = NaN		// sqrt(NaN) = NaN
// sqrt(+Inf) = +Inf		// sqrt(+Inf) = +Inf
return x;		return x;
}		}
} else if (bits.isZero()) {		} else if (bits.isZero()) {
// sqrt(+0) = +0		// sqrt(+0) = +0
// sqrt(-0) = -0		// sqrt(-0) = -0
return x;		return x;
} else if (bits.encoding.sign) {		} else if (bits.getSign()) {
// sqrt( negative numbers ) = NaN		// sqrt( negative numbers ) = NaN
return FPBits<long double>::buildNaN(One >> 1);		return FPBits<long double>::buildNaN(One >> 1);
} else {		} else {
int xExp = bits.getExponent();		int xExp = bits.getExponent();
UIntType xMant = bits.encoding.mantissa;		UIntType xMant = bits.getMantissa();

// Step 1a: Normalize denormal input		// Step 1a: Normalize denormal input
if (bits.encoding.implicitBit) {		if (bits.getImplicitBit()) {
xMant |= One;		xMant |= One;
} else if (bits.encoding.exponent == 0) {		} else if (bits.getUnbiasedExponent() == 0) {
internal::normalize<long double>(xExp, xMant);		internal::normalize<long double>(xExp, xMant);
}		}

// Step 1b: Make sure the exponent is even.		// Step 1b: Make sure the exponent is even.
if (xExp & 1) {		if (xExp & 1) {
--xExp;		--xExp;
xMant <<= 1;		xMant <<= 1;
}		}
[39 lines not shown; enclosing scope: if (bits.isInfOrNaN()) {]

// Round to nearest, ties to even		// Round to nearest, ties to even
if (rb && (lsb || (r != 0))) {		if (rb && (lsb || (r != 0))) {
++y;		++y;
}		}

// Extract output		// Extract output
FPBits<long double> out(0.0L);		FPBits<long double> out(0.0L);
out.encoding.exponent = xExp;		out.setUnbiasedExponent(xExp);
out.encoding.implicitBit = 1;		out.setImplicitBit(1);
out.encoding.mantissa = (y & (One - 1));		out.setMantissa((y & (One - 1)));

return out;		return out;
}		}
}		}

} // namespace fputil		} // namespace fputil
} // namespace __llvm_libc		} // namespace __llvm_libc

#endif // LLVM_LIBC_UTILS_FPUTIL_SQRT_LONG_DOUBLE_X86_H		#endif // LLVM_LIBC_UTILS_FPUTIL_SQRT_LONG_DOUBLE_X86_H
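
Note on the change above: throughout this file the right-hand column replaces direct bit-field access (bits.encoding.sign, bits.encoding.mantissa) with accessor calls (bits.getSign(), bits.getMantissa(), bits.setMantissa(...)). The following is a minimal sketch of how such accessors can be built with masks and shifts over a single integer, so the object has the same size as the floating point type on any compiler. It is an illustration only: DoubleBits, getRawExponent and value are hypothetical names, it covers double only, and it is not the FPBits implementation from this revision.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical illustration; not the FPBits class from this patch.
// All fields live in one 64-bit integer, so no packing attribute is needed.
struct DoubleBits {
  uint64_t bits;

  static constexpr int MantissaWidth = 52;
  static constexpr uint64_t MantissaMask = (uint64_t(1) << MantissaWidth) - 1;
  static constexpr uint64_t ExponentMask = uint64_t(0x7FF) << MantissaWidth;
  static constexpr uint64_t SignMask = uint64_t(1) << 63;

  // Copy the object representation of x into the integer.
  explicit DoubleBits(double x) { std::memcpy(&bits, &x, sizeof(bits)); }

  bool getSign() const { return (bits & SignMask) != 0; }

  // Returns the exponent field exactly as stored (bias still included).
  uint16_t getRawExponent() const {
    return static_cast<uint16_t>((bits & ExponentMask) >> MantissaWidth);
  }

  uint64_t getMantissa() const { return bits & MantissaMask; }

  // Overwrites only the mantissa field; excess high bits of m are dropped.
  void setMantissa(uint64_t m) {
    bits = (bits & ~MantissaMask) | (m & MantissaMask);
  }

  // Convert the stored bits back to a double.
  double value() const {
    double x;
    std::memcpy(&x, &bits, sizeof(x));
    return x;
  }
};

static_assert(sizeof(DoubleBits) == sizeof(double),
              "bit container must match the size of double");
```

With this layout, an in-place update such as ++bits.encoding.mantissa has to be written as a read followed by a write, for example b.setMantissa(b.getMantissa() + 1).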

libc/utils/FPUtil/TestHelpers.cpp

	describeValue(const char *label, ValType value,			describeValue(const char *label, ValType value,
	testutils::StreamWrapper &stream) {			testutils::StreamWrapper &stream) {
	stream << label;			stream << label;

	FPBits<ValType> bits(value);			FPBits<ValType> bits(value);
	if (bits.isNaN()) {			if (bits.isNaN()) {
	stream << "(NaN)";			stream << "(NaN)";
	} else if (bits.isInf()) {			} else if (bits.isInf()) {
	if (bits.encoding.sign)			if (bits.getSign())
	stream << "(-Infinity)";			stream << "(-Infinity)";
	else			else
	stream << "(+Infinity)";			stream << "(+Infinity)";
	} else {			} else {
	constexpr int exponentWidthInHex =			constexpr int exponentWidthInHex =
	(fputil::ExponentWidth<ValType>::value - 1) / 4 + 1;			(fputil::ExponentWidth<ValType>::value - 1) / 4 + 1;
	constexpr int mantissaWidthInHex =			constexpr int mantissaWidthInHex =
	(fputil::MantissaWidth<ValType>::value - 1) / 4 + 1;			(fputil::MantissaWidth<ValType>::value - 1) / 4 + 1;

	stream << "Sign: " << (bits.encoding.sign ? '1' : '0') << ", "			stream << "Sign: " << (bits.getSign() ? '1' : '0') << ", "
	<< "Exponent: 0x"			<< "Exponent: 0x"
	<< uintToHex<uint16_t>(bits.encoding.exponent, exponentWidthInHex)			<< uintToHex<uint16_t>(bits.getUnbiasedExponent(),
				exponentWidthInHex)
	<< ", "			<< ", "
	<< "Mantissa: 0x"			<< "Mantissa: 0x"
	<< uintToHex<typename fputil::FPBits<ValType>::UIntType>(			<< uintToHex<typename fputil::FPBits<ValType>::UIntType>(
	bits.encoding.mantissa, mantissaWidthInHex);			bits.getMantissa(), mantissaWidthInHex);
	}			}

	stream << '\n';			stream << '\n';
	}			}

	template void describeValue<float>(const char *, float,			template void describeValue<float>(const char *, float,
	testutils::StreamWrapper &);			testutils::StreamWrapper &);
	template void describeValue<double>(const char *, double,			template void describeValue<double>(const char *, double,
	testutils::StreamWrapper &);			testutils::StreamWrapper &);
	template void describeValue<long double>(const char *, long double,			template void describeValue<long double>(const char *, long double,
	testutils::StreamWrapper &);			testutils::StreamWrapper &);

	} // namespace testing			} // namespace testing
	} // namespace fputil			} // namespace fputil
	} // namespace __llvm_libc			} // namespace __llvm_libc
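
Note on describeValue above: the expression (width - 1) / 4 + 1 is the number of hexadecimal digits needed to print a field of the given bit width, that is, the width divided by 4 and rounded up. A small sketch of that arithmetic follows; hexDigitsFor is a hypothetical helper name and the widths in the checks are the standard IEEE 754 single and double precision field widths, not values read from this patch.

```cpp
#include <cassert>

// Hypothetical helper: hex digits needed for a field of bitWidth bits
// (bitWidth >= 1). Same arithmetic as exponentWidthInHex above.
constexpr int hexDigitsFor(int bitWidth) { return (bitWidth - 1) / 4 + 1; }

// float: 8 exponent bits, 23 mantissa bits.
static_assert(hexDigitsFor(8) == 2, "8 bits fit in 2 hex digits");
static_assert(hexDigitsFor(23) == 6, "23 bits need 6 hex digits");
// double: 11 exponent bits, 52 mantissa bits.
static_assert(hexDigitsFor(11) == 3, "11 bits need 3 hex digits");
static_assert(hexDigitsFor(52) == 13, "52 bits fit in 13 hex digits");
```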

libc/utils/FPUtil/generic/FMA.h

[41 lines not shown; enclosing scope: if (!(bit_sum.isInfOrNaN() || bit_sum.isZero())) {]
// Dekker's 2Sum algorithm to find t such that sum - t = prod + z exactly,		// Dekker's 2Sum algorithm to find t such that sum - t = prod + z exactly,
// assuming the (default) rounding mode is round-to-the-nearest,		// assuming the (default) rounding mode is round-to-the-nearest,
// tie-to-even. Moreover, t satisfies the condition that t < eps(sum),		// tie-to-even. Moreover, t satisfies the condition that t < eps(sum),
// i.e., t.exponent < sum.exponent - 52. So if t is not 0, meaning rounding		// i.e., t.exponent < sum.exponent - 52. So if t is not 0, meaning rounding
// occurs when computing the sum, we just need to use t to adjust (any) last		// occurs when computing the sum, we just need to use t to adjust (any) last
// bit of sum, so that the sticky bits used when rounding sum to float are		// bit of sum, so that the sticky bits used when rounding sum to float are
// correct (when it matters).		// correct (when it matters).
fputil::FPBits<double> t(		fputil::FPBits<double> t(
(bit_prod.encoding.exponent >= bitz.encoding.exponent)		(bit_prod.getUnbiasedExponent() >= bitz.getUnbiasedExponent())
? ((double(bit_sum) - double(bit_prod)) - double(bitz))		? ((double(bit_sum) - double(bit_prod)) - double(bitz))
: ((double(bit_sum) - double(bitz)) - double(bit_prod)));		: ((double(bit_sum) - double(bitz)) - double(bit_prod)));

// Update sticky bits if t != 0.0 and the least (52 - 23 - 1 = 28) bits are		// Update sticky bits if t != 0.0 and the least (52 - 23 - 1 = 28) bits are
// zero.		// zero.
if (!t.isZero() && ((bit_sum.encoding.mantissa & 0xfff'ffffULL) == 0)) {		if (!t.isZero() && ((bit_sum.getMantissa() & 0xfff'ffffULL) == 0)) {
if (bit_sum.encoding.sign != t.encoding.sign) {		if (bit_sum.getSign() != t.getSign()) {
++bit_sum.encoding.mantissa;		bit_sum.setMantissa(bit_sum.getMantissa() + 1);
} else if (bit_sum.encoding.mantissa) {		} else if (bit_sum.getMantissa()) {
--bit_sum.encoding.mantissa;		bit_sum.setMantissa(bit_sum.getMantissa() - 1);
}		}
}		}
}		}

return static_cast<float>(static_cast<double>(bit_sum));		return static_cast<float>(static_cast<double>(bit_sum));
}		}

} // namespace generic		} // namespace generic
} // namespace fputil		} // namespace fputil
} // namespace __llvm_libc		} // namespace __llvm_libc

#endif // LLVM_LIBC_UTILS_FPUTIL_GENERIC_FMA_H		#endif // LLVM_LIBC_UTILS_FPUTIL_GENERIC_FMA_H
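
Note on the FMA.h comment above: it describes recovering the rounding error t of sum = prod + z so that sum - t equals prod + z exactly. The following is a minimal sketch of that error-recovery step on plain doubles. It assumes round-to-nearest and strict IEEE 754 double arithmetic (no -ffast-math, no extended-precision intermediates). twoSumError is a hypothetical name, and it orders the operands by magnitude with std::fabs, whereas the patch compares the exponent fields through getUnbiasedExponent().

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper, not part of this patch.
// Given sum = fl(a + b), returns t such that sum - t == a + b exactly
// (in real arithmetic), under the assumptions stated above.
inline double twoSumError(double a, double b, double sum) {
  // Subtract the larger-magnitude operand first: that difference is exact,
  // so the second subtraction yields the rounding error of the addition.
  return (std::fabs(a) >= std::fabs(b)) ? ((sum - a) - b)
                                        : ((sum - b) - a);
}
```

For example, with a = 1.0 and b = 2^-60 the rounded sum is 1.0 and the helper returns -2^-60, which is the part of b that the addition discarded. A zero result means the addition was exact, which is the case the patch tests with t.isZero().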