This is an archive of the discontinued LLVM Phabricator instance.

Differential D133823

Add APFloat and MLIR type support for fp8 (e5m2).
ClosedPublic

Authored by stellaraccident on Sep 13 2022, 5:43 PM.

Details

Reviewers

rriddle
jholewinski
gchakrabarti
bkramer
nicolasvasilache
rengolin
jpienaar
mehdi_amini

Commits

rGe28b15b572b5: Add APFloat and MLIR type support for fp8 (e5m2).
rG2dc68b539825: Add APFloat and MLIR type support for fp8 (e5m2).

Summary

This is a first step towards a high-level representation for fp8 types
that have been built into hardware with near-term roadmaps. Like the
BFLOAT16 type, the family of fp8 types is inspired by IEEE-754 binary
floating point formats but, due to the size limits, has been tweaked in
various ways in order to maximally use the range/precision in various
scenarios. The list of variants is small/finite and bounded by real
hardware.

This patch introduces the E5M2 FP8 format as proposed by Nvidia, ARM,
and Intel in the paper: https://arxiv.org/pdf/2209.05433.pdf

As the more conformant of the two implemented datatypes, we are plumbing
it through LLVM's APFloat type and MLIR's type system first as a
template. It will be followed by the range optimized E4M3 FP8 format
described in the paper. Since that format deviates further from the
IEEE-754 norms, it may require more debate and implementation
complexity.

Given that we see two parts of the FP8 implementation space represented
by these cases, we are recommending naming of:

F8M<N> : For FP8 types that can be conceived of as following the same rules as FP16 but with a smaller number of mantissa/exponent bits. Including the number of mantissa bits in the type name is enough to fully specify the type. This naming scheme is used to represent the E5M2 type described in the paper.
F8M<N>F : For FP8 types such as E4M3 which only support finite values.
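
To make the "same rules as FP16" idea concrete, here is a minimal sketch of decoding one E5M2 byte. It assumes the 1-5-2 sign/exponent/mantissa layout with an exponent bias of 15 that the paper describes. The name `decodeE5M2` is hypothetical and is not part of this patch or of LLVM.

```cpp
#include <cmath>
#include <cstdint>
#include <limits>

// Hypothetical helper (not an LLVM API): decode one E5M2 byte to float.
// Assumed layout: 1 sign bit, 5 exponent bits (bias 15), 2 mantissa bits.
float decodeE5M2(std::uint8_t bits) {
  const int sign = bits >> 7;
  const int exponent = (bits >> 2) & 0x1f;
  const int mantissa = bits & 0x3;
  float magnitude;
  if (exponent == 0x1f) {
    // All-ones exponent: infinity if the mantissa is zero, NaN otherwise.
    magnitude = mantissa == 0 ? std::numeric_limits<float>::infinity()
                              : std::numeric_limits<float>::quiet_NaN();
  } else if (exponent == 0) {
    // Zero or subnormal: no implicit leading one, exponent fixed at -14.
    magnitude = std::ldexp(static_cast<float>(mantissa) / 4.0f, -14);
  } else {
    // Normal number: implicit leading one plus two fraction bits.
    magnitude =
        std::ldexp(1.0f + static_cast<float>(mantissa) / 4.0f, exponent - 15);
  }
  return sign ? -magnitude : magnitude;
}
```

For example, `decodeE5M2(0x3C)` is 1.0 and `decodeE5M2(0x7B)` is 57344, the largest finite value under these assumptions.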

The first of these (this patch) seems fairly non-controversial. The
second is previewed here to illustrate options for extending to the
other known variant (but can be discussed in detail in the patch
which implements it).

Many conversations about these types focus on the Machine-Learning
ecosystem where they are used to represent mixed-datatype computations
at a high level. At that level (which is why we also expose them in
MLIR), it is important to retain the actual type definition so that when
lowering to actual kernels or target specific code, the correct
promotions, casts and rescalings can be done as needed. We expect that
most LLVM backends will only experience these types as opaque I8
values that are applicable to some instructions.
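
As one illustration of the kind of cast a lowering might perform: E5M2 uses the same five exponent bits as IEEE half, so its bit pattern is the upper byte of a half. The sketch below widens E5M2 to half losslessly and narrows half to E5M2 with round-to-nearest-even. Both function names are hypothetical and nothing here is taken from the patch; a real backend may choose different rounding or NaN handling.

```cpp
#include <cstdint>

// Hypothetical helper: widen an E5M2 bit pattern to an IEEE half bit pattern.
// The eight added low mantissa bits are zero, so the value is unchanged.
std::uint16_t widenE5M2ToHalf(std::uint8_t bits) {
  return static_cast<std::uint16_t>(bits << 8);
}

// Hypothetical helper: narrow an IEEE half bit pattern to E5M2 using
// round-to-nearest-even on the eight dropped mantissa bits.
std::uint8_t narrowHalfToE5M2(std::uint16_t half) {
  const bool isNaN = (half & 0x7C00) == 0x7C00 && (half & 0x03FF) != 0;
  if (isNaN) {
    // Keep the sign and exponent, and force a nonzero mantissa so the
    // result stays a NaN even if only low payload bits were set.
    return static_cast<std::uint8_t>((half >> 8) | 0x02);
  }
  unsigned upper = half >> 8;   // sign, exponent, top two mantissa bits
  unsigned lower = half & 0xFF; // bits that will be dropped
  // Round up when above the halfway point, or exactly halfway and odd.
  // A carry out of the mantissa correctly bumps the exponent, and the
  // largest finite value rounds up to infinity.
  if (lower > 0x80 || (lower == 0x80 && (upper & 1)))
    upper += 1;
  return static_cast<std::uint8_t>(upper);
}
```

Because the widening direction is exact, `narrowHalfToE5M2(widenE5M2ToHalf(b))` returns `b` for every non-NaN byte `b` in this sketch.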

MLIR does not make it particularly easy to add new floating point types
(i.e. the FloatType hierarchy is not open). Given the need to fully
model FloatTypes and make them interop with tooling, such types will
always be "heavy-weight" and it is not expected that a highly open type
system will be particularly helpful. There are also a bounded number of
floating point types in use for current and upcoming hardware, and we
can just implement them like this (perhaps looking for some cosmetic
ways to reduce the number of places that need to change). Creating a
more generic mechanism for extending floating point types seems like it
wouldn't be worth it and we should just deal with defining them one by
one on an as-needed basis when real hardware implements a new scheme.
Hopefully, with some additional production use and complete software
stacks, hardware makers will converge on a set of such types that is not
terribly divergent at the level that the compiler cares about.

(I cleaned up some old formatting and sorted some items for this case.
If we converge on landing this in some form, I will land the format-only
changes as a separate NFC commit.)

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

stellaraccident created this revision. Sep 13 2022, 5:43 PM

Herald added a reviewer: rriddle. Sep 13 2022, 5:43 PM

Herald added a project: Restricted Project.

Herald added subscribers: bzcheeseman, sdasgup3, wenzhicui and 20 others.

stellaraccident added a parent revision: D133824: NFC: Run clang-format on APFloatTest. Sep 13 2022, 5:50 PM

Harbormaster completed remote builds in B186498: Diff 459934. Sep 13 2022, 7:20 PM

stellaraccident published this revision for review. Sep 14 2022, 9:05 AM

Herald added projects: Restricted Project, Restricted Project. Sep 14 2022, 9:05 AM

Herald added subscribers: llvm-commits, stephenneuendorffer, nicolasvasilache.

stellaraccident added reviewers: jholewinski, gchakrabarti. Sep 14 2022, 9:07 AM

stellaraccident added a reviewer: bkramer.

I imagine APFloat changes likely require an RFC (though I'm not sure). Also, did you accidentally reformat all of the APFloat tests (4k lines change o.O)

In D133823#3789844, @rriddle wrote:

I imagine APFloat changes likely require an RFC (though I'm not sure). Also, did you accidentally reformat all of the APFloat tests (4k lines change o.O)

Sorry: noted this patch in the RFC but didn't put the reverse link here: https://discourse.llvm.org/t/rfc-add-apfloat-and-mlir-type-support-for-fp8-e5m2/65279

Ugh, yeah, APFloatTest was very not formatted and clang-format "helped". I tried to stack an NFC clang-format under this one but seem to have messed something up. Could you offer any advice on what to do (I was thinking of just pre-landing an NFC clang-format of APFloatTest and then rebasing on top of that).

In D133823#3790482, @stellaraccident wrote:

In D133823#3789844, @rriddle wrote:

I imagine APFloat changes likely require an RFC (though I'm not sure). Also, did you accidentally reformat all of the APFloat tests (4k lines change o.O)

Sorry: noted this patch in the RFC but didn't put the reverse link here: https://discourse.llvm.org/t/rfc-add-apfloat-and-mlir-type-support-for-fp8-e5m2/65279

Ugh, yeah, APFloatTest was very not formatted and clang-format "helped". I tried to stack an NFC clang-format under this one but seem to have messed something up. Could you offer any advice on what to do (I was thinking of just pre-landing an NFC clang-format of APFloatTest and then rebasing on top of that).

There is a script to only run clang-format on the area you changed. https://clang.llvm.org/docs/ClangFormat.html#script-for-patch-reformatting

Format and apply rename suggestions.

Herald added a reviewer: nicolasvasilache. Sep 23 2022, 4:07 PM

Herald added a subscriber: zero9178.

Harbormaster completed remote builds in B188487: Diff 462606. Sep 23 2022, 4:07 PM

git-clang-format

Harbormaster completed remote builds in B188489: Diff 462608. Sep 23 2022, 4:10 PM

Formatted and naming suggestions from RFC applied. Ready for review.

Code looks good. Were the concerns about naming resolved?

This revision is now accepted and ready to land. Sep 24 2022, 3:12 AM

In D133823#3813169, @bkramer wrote:

Code looks good. Were the concerns about naming resolved?

I believe so for this one, which is fairly uncontroversial. Would still like a review by someone on the MLIR side, though.

stellaraccident removed a parent revision: D133824: NFC: Run clang-format on APFloatTest. Sep 26 2022, 5:13 AM

Honestly, there isn't much meat here, other than the decision to support and the name bikeshedding, both of which I think are well covered in the RFC.

Most of the code consists of mechanical changes for adding a new type, and in that respect it is perfectly fine.

The main questions for me are:

Do we want float8? YES!
Do we want this specific variant? Yes, IMO.
Is adding a single variant (amongst so many) worth it? Yes, at the very least for initial support. Adding others or changing this should be easy.
Do we want a more generic implementation which we derive specific cases from? Maybe. Regardless, this is an initial implementation and we can generalise later, if desired.
Is the end goal to have only concrete cases listed? Strong YES from me. This should be less than 1/2 dozen and we can cope with that, even if we don't have a generic implementation.
Does this patch satisfy all the constraints above? Yes, IMO.

The only issue that could be raised for adding a specific one is that of backwards compatibility. As long as we're sure what we want before forking the next version (Jan-ish), it should be fine to experiment and gather opinions.

So, my approval of it is exactly that: my support for this change. As usual, please wait for more approvals to make sure we have enough support for the specific way you implemented it.

Renato

In D133823#3815492, @rengolin wrote:

Honestly, there isn't much meat here, other than the decision to support and the name bikeshedding, both of which I think are well covered in the RFC.

I scoped it down to the hardest topic in computer science: what to call it ;)

stellaraccident added reviewers: jpienaar, mehdi_amini. Sep 26 2022, 2:00 PM

Bkramer; was your intention to review both the APFloat and MLIR side of the patch? Just making sure that the MLIR side had a proper look before submitting and it wasn't clear to me from the commentary.

In D133823#3820513, @stellaraccident wrote:

Bkramer; was your intention to review both the APFloat and MLIR side of the patch? Just making sure that the MLIR side had a proper look before submitting and it wasn't clear to me from the commentary.

Both the LLVM and MLIR sides look good to me (but I have less experience on the MLIR side).

Looks good thanks, naming discussions always fun.

mlir/test/lib/Dialect/Test/TestOps.td
Line 197: I'll file an issue if you haven't already.

Rebase.

This revision was landed with ongoing or failed builds. Oct 2 2022, 5:25 PM

Closed by commit rG2dc68b539825: Add APFloat and MLIR type support for fp8 (e5m2). (authored by stellaraccident).

This revision was automatically updated to reflect the committed changes.

stellaraccident added a commit: rG2dc68b539825: Add APFloat and MLIR type support for fp8 (e5m2).

chapuni added a subscriber: chapuni. Oct 2 2022, 6:54 PM

chapuni added inline comments.

llvm/include/llvm/ADT/APFloat.h
Line 160: This triggers a warning in clang/lib/AST/MicrosoftMangle.cpp:mangleFloat. Any idea?

Harbormaster completed remote builds in B189907: Diff 464579. Oct 2 2022, 6:55 PM

vitalybuka added a reverting change: rGe68c7a99176d: Revert "Add APFloat and MLIR type support for fp8 (e5m2).". Oct 2 2022, 9:23 PM

vitalybuka reopened this revision. Oct 2 2022, 9:23 PM

This revision is now accepted and ready to land. Oct 2 2022, 9:23 PM

stellaraccident added inline comments. Oct 3 2022, 10:02 AM

llvm/include/llvm/ADT/APFloat.h
Line 160: I'm not familiar with that code, but it appears to assume that every APFloat semantic maps to an MSVC mangling, which can't be valid. It looks like errors of this kind are handled within that file with llvm_unreachable, so we should probably add a default with that.

craig.topper added inline comments. Oct 3 2022, 10:45 AM

llvm/include/llvm/ADT/APFloat.h
Line 160: Probably best to use llvm_unreachable for `S_Float8E5M2` instead of adding a default. That will keep the fully covered switch warning if another semantic is added in the future.
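
The trade-off in the comment above can be sketched in isolation. With one explicit case per enumerator and no `default`, compilers that implement `-Wswitch` (Clang and GCC) can flag the switch when a new enumerator is added later; a `default` label would silence that warning. The enum and function below are simplified stand-ins rather than the real `APFloat` code, and `std::abort()` stands in for `llvm_unreachable`.

```cpp
#include <cstdlib>

// Simplified stand-in for APFloatBase::Semantics (assumption, not LLVM code).
enum class Semantics { IEEEhalf, BFloat, Float8E5M2 };

// Hypothetical mangling helper. Every enumerator has an explicit case and
// there is no default, so adding an enumerator later makes -Wswitch report
// this switch as no longer fully covered.
char mangleCode(Semantics S) {
  switch (S) {
  case Semantics::IEEEhalf:
    return 'V';
  case Semantics::BFloat:
    return 'W';
  case Semantics::Float8E5M2:
    // No mangling is defined for this type; treat reaching it as a bug.
    std::abort();
  }
  // Reached only if S holds a value outside the enumerators.
  std::abort();
}
```

Had the unsupported type been handled with `default:` instead, a future enumerator would silently fall into the abort path with no compile-time hint.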

Add fix to MicrosoftMangle.cpp that caused buildbot failure.

Herald added a project: Restricted Project. Oct 4 2022, 5:14 PM

Herald added a subscriber: cfe-commits.

Switch to explicit case per comment from ctopper.

Remove break after llvm_unreachable for consistency with other switches in file.

jpienaar accepted this revision. Oct 4 2022, 5:26 PM

This revision was landed with ongoing or failed builds. Oct 4 2022, 5:40 PM

Closed by commit rGe28b15b572b5: Add APFloat and MLIR type support for fp8 (e5m2). (authored by stellaraccident).

This revision was automatically updated to reflect the committed changes.

stellaraccident added a commit: rGe28b15b572b5: Add APFloat and MLIR type support for fp8 (e5m2).

Ran the full clang/llvm/mlir test suite in debug mode just to be safe.

Harbormaster completed remote builds in B190372: Diff 465233. Oct 4 2022, 6:24 PM

reedwm mentioned this in D137760: Add FP8 E4M3 support to APFloat. Nov 9 2022, 8:28 PM

bkramer mentioned this in rG88eb3c62f25d: Add FP8 E4M3 support to APFloat. Nov 15 2022, 11:27 AM

reedwm mentioned this in D138075: Add Float8E4M3FN type to MLIR. Nov 15 2022, 4:05 PM

bkramer mentioned this in rGe08ca4bb1dfe: Add Float8E4M3FN type to MLIR. Nov 16 2022, 1:30 AM

LuoYuanke added a subscriber: LuoYuanke. Nov 29 2022, 6:07 PM

Herald added a subscriber: Moerafaat. Nov 29 2022, 6:07 PM

ezhulenev mentioned this in D140088: Add LLVM type support for fp8. Dec 15 2022, 10:25 AM

jakeh-gc mentioned this in D141432: Add two additional float8 types to MLIR and APFloat. Jan 10 2023, 2:02 PM

jakeh-gc mentioned this in D143744: Add Float8E5M2FNUZ and Float8E4M3FNUZ types to MLIR. Feb 10 2023, 8:15 AM

chrisjackson mentioned this in rG96267b6b8840: [mlir] Add Float8E5M2FNUZ and Float8E4M3FNUZ types to MLIR. Feb 13 2023, 10:26 AM

Revision Contents

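Path                                      Size

clang/lib/AST/MicrosoftMangle.cpp         2 lines
llvm/include/llvm/ADT/APFloat.h           10 lines
llvm/lib/Support/APFloat.cpp              86 lines
llvm/unittests/ADT/APFloatTest.cpp        105 lines
mlir/include/mlir-c/BuiltinTypes.h        7 lines
mlir/include/mlir/IR/Builders.h           1 line
mlir/include/mlir/IR/BuiltinTypes.h       9 lines
mlir/include/mlir/IR/BuiltinTypes.td      22 lines
mlir/include/mlir/IR/Types.h              1 line
mlir/lib/AsmParser/TokenKinds.def         1 line
mlir/lib/AsmParser/TypeParser.cpp         4 lines
mlir/lib/CAPI/IR/BuiltinTypes.cpp         8 lines
mlir/lib/IR/AsmPrinter.cpp                1 line
mlir/lib/IR/Builders.cpp                  4 lines
mlir/lib/IR/BuiltinTypes.cpp              4 lines
mlir/lib/IR/MLIRContext.cpp               5 lines
mlir/lib/IR/Types.cpp                     1 line
mlir/test/IR/attribute.mlir               36 lines
mlir/test/lib/Dialect/Test/TestOps.td     8 lines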

Diff 465240

clang/lib/AST/MicrosoftMangle.cpp

@@ (832 lines not shown) @@ void MicrosoftCXXNameMangler::mangleFloat(llvm::APFloat Number) {

   // The following are all Clang extensions. We try to pick manglings that are
   // unlikely to conflict with MSVC's scheme.
   case APFloat::S_IEEEhalf: Out << 'V'; break;
   case APFloat::S_BFloat: Out << 'W'; break;
   case APFloat::S_x87DoubleExtended: Out << 'X'; break;
   case APFloat::S_IEEEquad: Out << 'Y'; break;
   case APFloat::S_PPCDoubleDouble: Out << 'Z'; break;
+  case APFloat::S_Float8E5M2:
+    llvm_unreachable("Tried to mangle unexpected APFloat semantics");
   }

   mangleBits(Number.bitcastToAPInt());
 }

 void MicrosoftCXXNameMangler::mangleBits(llvm::APInt Value) {
   if (Value == 0)
     Out << "A@";
@@ (3,117 lines not shown) @@

llvm/include/llvm/ADT/APFloat.h

@@ (147 lines not shown) @@ struct APFloatBase {

   /// \name Floating Point Semantics.
   /// @{
   enum Semantics {
     S_IEEEhalf,
     S_BFloat,
     S_IEEEsingle,
     S_IEEEdouble,
-    S_x87DoubleExtended,
     S_IEEEquad,
     S_PPCDoubleDouble,
-    S_MaxSemantics = S_PPCDoubleDouble
+    // 8-bit floating point number following IEEE-754 conventions with bit
+    // layout S1E5M2 as described in https://arxiv.org/abs/2209.05433
+    S_Float8E5M2,
+    S_x87DoubleExtended,
+    S_MaxSemantics = S_x87DoubleExtended,
   };

   static const llvm::fltSemantics &EnumToSemantics(Semantics S);
   static Semantics SemanticsToEnum(const llvm::fltSemantics &Sem);

   static const fltSemantics &IEEEhalf() LLVM_READNONE;
   static const fltSemantics &BFloat() LLVM_READNONE;
   static const fltSemantics &IEEEsingle() LLVM_READNONE;
   static const fltSemantics &IEEEdouble() LLVM_READNONE;
   static const fltSemantics &IEEEquad() LLVM_READNONE;
   static const fltSemantics &PPCDoubleDouble() LLVM_READNONE;
+  static const fltSemantics &Float8E5M2() LLVM_READNONE;
   static const fltSemantics &x87DoubleExtended() LLVM_READNONE;

   /// A Pseudo fltsemantic used to construct APFloats that cannot conflict with
   /// anything real.
   static const fltSemantics &Bogus() LLVM_READNONE;

   /// @}

@@ (368 lines not shown) @@ private:

   APInt convertHalfAPFloatToAPInt() const;
   APInt convertBFloatAPFloatToAPInt() const;
   APInt convertFloatAPFloatToAPInt() const;
   APInt convertDoubleAPFloatToAPInt() const;
   APInt convertQuadrupleAPFloatToAPInt() const;
   APInt convertF80LongDoubleAPFloatToAPInt() const;
   APInt convertPPCDoubleDoubleAPFloatToAPInt() const;
+  APInt convertFloat8E5M2APFloatToAPInt() const;
   void initFromAPInt(const fltSemantics *Sem, const APInt &api);
   void initFromHalfAPInt(const APInt &api);
   void initFromBFloatAPInt(const APInt &api);
   void initFromFloatAPInt(const APInt &api);
   void initFromDoubleAPInt(const APInt &api);
   void initFromQuadrupleAPInt(const APInt &api);
   void initFromF80LongDoubleAPInt(const APInt &api);
   void initFromPPCDoubleDoubleAPInt(const APInt &api);
+  void initFromFloat8E5M2APInt(const APInt &api);

   void assign(const IEEEFloat &);
   void copySignificand(const IEEEFloat &);
   void freeSignificand();

   /// Note: this must be the first data member.
   /// The semantics that this value obeys.
   const fltSemantics *semantics;
@@ (774 lines not shown) @@

llvm/lib/Support/APFloat.cpp

@@ (74 lines not shown) @@ struct fltSemantics {
   }
 };

 static const fltSemantics semIEEEhalf = {15, -14, 11, 16};
 static const fltSemantics semBFloat = {127, -126, 8, 16};
 static const fltSemantics semIEEEsingle = {127, -126, 24, 32};
 static const fltSemantics semIEEEdouble = {1023, -1022, 53, 64};
 static const fltSemantics semIEEEquad = {16383, -16382, 113, 128};
+static const fltSemantics semFloat8E5M2 = {15, -14, 3, 8};
 static const fltSemantics semX87DoubleExtended = {16383, -16382, 64, 80};
 static const fltSemantics semBogus = {0, 0, 0, 0};

 /* The IBM double-double semantics. Such a number consists of a pair of IEEE
    64-bit doubles (Hi, Lo), where |Hi| > |Lo|, and if normal,
    (double)(Hi + Lo) == Hi. The numeric value it's modeling is Hi + Lo.
    Therefore it has two 53-bit mantissa parts that aren't necessarily adjacent
    to each other, and two 11-bit exponents.
@@ (35 lines not shown) @@ const llvm::fltSemantics &APFloatBase::EnumToSemantics(Semantics S) {
   case S_IEEEhalf:
     return IEEEhalf();
   case S_BFloat:
     return BFloat();
   case S_IEEEsingle:
     return IEEEsingle();
   case S_IEEEdouble:
     return IEEEdouble();
-  case S_x87DoubleExtended:
-    return x87DoubleExtended();
   case S_IEEEquad:
     return IEEEquad();
   case S_PPCDoubleDouble:
     return PPCDoubleDouble();
+  case S_Float8E5M2:
+    return Float8E5M2();
+  case S_x87DoubleExtended:
+    return x87DoubleExtended();
   }
   llvm_unreachable("Unrecognised floating semantics");
 }

 APFloatBase::Semantics
 APFloatBase::SemanticsToEnum(const llvm::fltSemantics &Sem) {
   if (&Sem == &llvm::APFloat::IEEEhalf())
     return S_IEEEhalf;
   else if (&Sem == &llvm::APFloat::BFloat())
     return S_BFloat;
   else if (&Sem == &llvm::APFloat::IEEEsingle())
     return S_IEEEsingle;
   else if (&Sem == &llvm::APFloat::IEEEdouble())
     return S_IEEEdouble;
-  else if (&Sem == &llvm::APFloat::x87DoubleExtended())
-    return S_x87DoubleExtended;
   else if (&Sem == &llvm::APFloat::IEEEquad())
     return S_IEEEquad;
   else if (&Sem == &llvm::APFloat::PPCDoubleDouble())
     return S_PPCDoubleDouble;
+  else if (&Sem == &llvm::APFloat::Float8E5M2())
+    return S_Float8E5M2;
+  else if (&Sem == &llvm::APFloat::x87DoubleExtended())
+    return S_x87DoubleExtended;
   else
     llvm_unreachable("Unknown floating semantics");
 }

 const fltSemantics &APFloatBase::IEEEhalf() {
   return semIEEEhalf;
 }
 const fltSemantics &APFloatBase::BFloat() {
   return semBFloat;
 }
 const fltSemantics &APFloatBase::IEEEsingle() {
   return semIEEEsingle;
 }
 const fltSemantics &APFloatBase::IEEEdouble() {
   return semIEEEdouble;
 }
-const fltSemantics &APFloatBase::IEEEquad() {
-  return semIEEEquad;
+const fltSemantics &APFloatBase::IEEEquad() { return semIEEEquad; }
+const fltSemantics &APFloatBase::PPCDoubleDouble() {
+  return semPPCDoubleDouble;
 }
+const fltSemantics &APFloatBase::Float8E5M2() { return semFloat8E5M2; }
 const fltSemantics &APFloatBase::x87DoubleExtended() {
   return semX87DoubleExtended;
 }
-const fltSemantics &APFloatBase::Bogus() {
-  return semBogus;
-}
-const fltSemantics &APFloatBase::PPCDoubleDouble() {
-  return semPPCDoubleDouble;
-}
+const fltSemantics &APFloatBase::Bogus() { return semBogus; }

 constexpr RoundingMode APFloatBase::rmNearestTiesToEven;
 constexpr RoundingMode APFloatBase::rmTowardPositive;
 constexpr RoundingMode APFloatBase::rmTowardNegative;
 constexpr RoundingMode APFloatBase::rmTowardZero;
 constexpr RoundingMode APFloatBase::rmNearestTiesToAway;

 /* A tight upper bound on number of parts required to hold the value
@@ (3,152 lines not shown) @@ if (isFiniteNonZero()) {
     myexponent = 0x1f;
     mysignificand = (uint32_t)*significandParts();
   }

   return APInt(16, (((sign&1) << 15) | ((myexponent&0x1f) << 10) |
                     (mysignificand & 0x3ff)));
 }

+APInt IEEEFloat::convertFloat8E5M2APFloatToAPInt() const {
+  assert(semantics == (const llvm::fltSemantics *)&semFloat8E5M2);
+  assert(partCount() == 1);
+
+  uint32_t myexponent, mysignificand;
+
+  if (isFiniteNonZero()) {
+    myexponent = exponent + 15; // bias
+    mysignificand = (uint32_t)*significandParts();
+    if (myexponent == 1 && !(mysignificand & 0x4))
+      myexponent = 0; // denormal
+  } else if (category == fcZero) {
+    myexponent = 0;
+    mysignificand = 0;
+  } else if (category == fcInfinity) {
+    myexponent = 0x1f;
+    mysignificand = 0;
+  } else {
+    assert(category == fcNaN && "Unknown category!");
+    myexponent = 0x1f;
+    mysignificand = (uint32_t)*significandParts();
+  }
+
+  return APInt(8, (((sign & 1) << 7) | ((myexponent & 0x1f) << 2) |
+                   (mysignificand & 0x3)));
+}

 // This function creates an APInt that is just a bit map of the floating
 // point constant as it would appear in memory. It is not a conversion,
 // and treating the result as a normal integer is unlikely to be useful.

 APInt IEEEFloat::bitcastToAPInt() const {
   if (semantics == (const llvm::fltSemantics*)&semIEEEhalf)
     return convertHalfAPFloatToAPInt();

   if (semantics == (const llvm::fltSemantics *)&semBFloat)
     return convertBFloatAPFloatToAPInt();

   if (semantics == (const llvm::fltSemantics*)&semIEEEsingle)
     return convertFloatAPFloatToAPInt();

   if (semantics == (const llvm::fltSemantics*)&semIEEEdouble)
     return convertDoubleAPFloatToAPInt();

   if (semantics == (const llvm::fltSemantics*)&semIEEEquad)
     return convertQuadrupleAPFloatToAPInt();

   if (semantics == (const llvm::fltSemantics *)&semPPCDoubleDoubleLegacy)
     return convertPPCDoubleDoubleAPFloatToAPInt();

+  if (semantics == (const llvm::fltSemantics *)&semFloat8E5M2)
+    return convertFloat8E5M2APFloatToAPInt();
+
   assert(semantics == (const llvm::fltSemantics*)&semX87DoubleExtended &&
          "unknown format!");
   return convertF80LongDoubleAPFloatToAPInt();
 }

 float IEEEFloat::convertToFloat() const {
   assert(semantics == (const llvm::fltSemantics*)&semIEEEsingle &&
          "Float semantics are not IEEEsingle");
@@ (211 lines not shown) @@ if (myexponent==0 && mysignificand==0) {
     *significandParts() = mysignificand;
     if (myexponent==0) // denormal
       exponent = -14;
     else
       *significandParts() |= 0x400; // integer bit
   }
 }

+void IEEEFloat::initFromFloat8E5M2APInt(const APInt &api) {
+  uint32_t i = (uint32_t)*api.getRawData();
+  uint32_t myexponent = (i >> 2) & 0x1f;
+  uint32_t mysignificand = i & 0x3;
+
+  initialize(&semFloat8E5M2);
+  assert(partCount() == 1);
+
+  sign = i >> 7;
+  if (myexponent == 0 && mysignificand == 0) {
+    makeZero(sign);
+  } else if (myexponent == 0x1f && mysignificand == 0) {
+    makeInf(sign);
+  } else if (myexponent == 0x1f && mysignificand != 0) {
+    category = fcNaN;
+    exponent = exponentNaN();
+    *significandParts() = mysignificand;
+  } else {
+    category = fcNormal;
+    exponent = myexponent - 15; // bias
+    *significandParts() = mysignificand;
+    if (myexponent == 0) // denormal
+      exponent = -14;
+    else
+      *significandParts() |= 0x4; // integer bit
+  }
+}

/// Treat api as containing the bits of a floating point number. Currently		/// Treat api as containing the bits of a floating point number. Currently
/// we infer the floating point type from the size of the APInt. The		/// we infer the floating point type from the size of the APInt. The
/// isIEEE argument distinguishes between PPC128 and IEEE128 (not meaningful		/// isIEEE argument distinguishes between PPC128 and IEEE128 (not meaningful
/// when the size is anything else).		/// when the size is anything else).
void IEEEFloat::initFromAPInt(const fltSemantics *Sem, const APInt &api) {		void IEEEFloat::initFromAPInt(const fltSemantics *Sem, const APInt &api) {
assert(api.getBitWidth() == Sem->sizeInBits);		assert(api.getBitWidth() == Sem->sizeInBits);
if (Sem == &semIEEEhalf)		if (Sem == &semIEEEhalf)
return initFromHalfAPInt(api);		return initFromHalfAPInt(api);
if (Sem == &semBFloat)		if (Sem == &semBFloat)
return initFromBFloatAPInt(api);		return initFromBFloatAPInt(api);
if (Sem == &semIEEEsingle)		if (Sem == &semIEEEsingle)
return initFromFloatAPInt(api);		return initFromFloatAPInt(api);
if (Sem == &semIEEEdouble)		if (Sem == &semIEEEdouble)
return initFromDoubleAPInt(api);		return initFromDoubleAPInt(api);
if (Sem == &semX87DoubleExtended)		if (Sem == &semX87DoubleExtended)
return initFromF80LongDoubleAPInt(api);		return initFromF80LongDoubleAPInt(api);
if (Sem == &semIEEEquad)		if (Sem == &semIEEEquad)
return initFromQuadrupleAPInt(api);		return initFromQuadrupleAPInt(api);
if (Sem == &semPPCDoubleDoubleLegacy)		if (Sem == &semPPCDoubleDoubleLegacy)
return initFromPPCDoubleDoubleAPInt(api);		return initFromPPCDoubleDoubleAPInt(api);
		if (Sem == &semFloat8E5M2)
		return initFromFloat8E5M2APInt(api);

llvm_unreachable(nullptr);		llvm_unreachable(nullptr);
}		}

/// Make this number the largest magnitude normal number in the given		/// Make this number the largest magnitude normal number in the given
/// semantics.		/// semantics.
void IEEEFloat::makeLargest(bool Negative) {		void IEEEFloat::makeLargest(bool Negative) {
// We want (in interchange format):		// We want (in interchange format):
[... 1,297 lines hidden ...]
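
As a standalone illustration of the decoding done by `initFromFloat8E5M2APInt` above, the sketch below maps one E5M2 byte to a `double` using the same case split (zero, infinity, NaN, denormal, normal). It deliberately does not use APFloat. `decodeFloat8E5M2` is a hypothetical helper name, not part of this patch, and unlike the APFloat code it does not preserve NaN payloads or the NaN sign.

```cpp
#include <cmath>
#include <cstdint>
#include <limits>

// Hypothetical helper (not part of LLVM): decode one E5M2 byte to a double.
// Layout assumed from the patch: 1 sign bit, 5 exponent bits, 2 mantissa bits.
double decodeFloat8E5M2(std::uint8_t bits) {
  const unsigned exponentField = (bits >> 2) & 0x1f; // 5 exponent bits
  const unsigned mantissaField = bits & 0x3;         // 2 mantissa bits
  const double sign = (bits >> 7) ? -1.0 : 1.0;

  if (exponentField == 0x1f) {
    if (mantissaField == 0)
      return sign * std::numeric_limits<double>::infinity();
    // Any nonzero mantissa is a NaN; payload and sign are dropped here.
    return std::numeric_limits<double>::quiet_NaN();
  }
  if (exponentField == 0) {
    // Zero or denormal: no integer bit, exponent fixed at -14.
    // value = (mantissa / 4) * 2^-14 = mantissa * 2^-16
    return sign * std::ldexp(static_cast<double>(mantissaField), -16);
  }
  // Normal: integer bit 0x4 is set and the exponent bias is 15.
  // value = ((4 + mantissa) / 4) * 2^(exponentField - 15)
  return sign * std::ldexp(static_cast<double>(0x4 | mantissaField),
                           static_cast<int>(exponentField) - 15 - 2);
}
```

For example, `0x3C` (exponent field 15, mantissa 0) decodes to 1.0, and `0x7B` (exponent field 30, mantissa 3) decodes to 57344, the largest finite value.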

llvm/unittests/ADT/APFloatTest.cpp

[... 1,746 lines hidden ...]

TEST(APFloatTest, getZero) {		TEST(APFloatTest, getZero) {
struct {		struct {
const fltSemantics *semantics;		const fltSemantics *semantics;
const bool sign;		const bool sign;
const unsigned long long bitPattern[2];		const unsigned long long bitPattern[2];
const unsigned bitPatternLength;		const unsigned bitPatternLength;
} const GetZeroTest[] = {		} const GetZeroTest[] = {
{ &APFloat::IEEEhalf(), false, {0, 0}, 1},		{&APFloat::IEEEhalf(), false, {0, 0}, 1},
{ &APFloat::IEEEhalf(), true, {0x8000ULL, 0}, 1},		{&APFloat::IEEEhalf(), true, {0x8000ULL, 0}, 1},
{ &APFloat::IEEEsingle(), false, {0, 0}, 1},		{&APFloat::IEEEsingle(), false, {0, 0}, 1},
{ &APFloat::IEEEsingle(), true, {0x80000000ULL, 0}, 1},		{&APFloat::IEEEsingle(), true, {0x80000000ULL, 0}, 1},
{ &APFloat::IEEEdouble(), false, {0, 0}, 1},		{&APFloat::IEEEdouble(), false, {0, 0}, 1},
{ &APFloat::IEEEdouble(), true, {0x8000000000000000ULL, 0}, 1},		{&APFloat::IEEEdouble(), true, {0x8000000000000000ULL, 0}, 1},
{ &APFloat::IEEEquad(), false, {0, 0}, 2},		{&APFloat::IEEEquad(), false, {0, 0}, 2},
{ &APFloat::IEEEquad(), true, {0, 0x8000000000000000ULL}, 2},		{&APFloat::IEEEquad(), true, {0, 0x8000000000000000ULL}, 2},
{ &APFloat::PPCDoubleDouble(), false, {0, 0}, 2},		{&APFloat::PPCDoubleDouble(), false, {0, 0}, 2},
{ &APFloat::PPCDoubleDouble(), true, {0x8000000000000000ULL, 0}, 2},		{&APFloat::PPCDoubleDouble(), true, {0x8000000000000000ULL, 0}, 2},
{ &APFloat::x87DoubleExtended(), false, {0, 0}, 2},		{&APFloat::x87DoubleExtended(), false, {0, 0}, 2},
{ &APFloat::x87DoubleExtended(), true, {0, 0x8000ULL}, 2},		{&APFloat::x87DoubleExtended(), true, {0, 0x8000ULL}, 2},
		{&APFloat::Float8E5M2(), false, {0, 0}, 1},
		{&APFloat::Float8E5M2(), true, {0x80ULL, 0}, 1},
};		};
const unsigned NumGetZeroTests = 12;		const unsigned NumGetZeroTests = 14;
for (unsigned i = 0; i < NumGetZeroTests; ++i) {		for (unsigned i = 0; i < NumGetZeroTests; ++i) {
APFloat test = APFloat::getZero(*GetZeroTest[i].semantics,		APFloat test = APFloat::getZero(*GetZeroTest[i].semantics,
GetZeroTest[i].sign);		GetZeroTest[i].sign);
const char *pattern = GetZeroTest[i].sign? "-0x0p+0" : "0x0p+0";		const char *pattern = GetZeroTest[i].sign? "-0x0p+0" : "0x0p+0";
APFloat expected = APFloat(*GetZeroTest[i].semantics,		APFloat expected = APFloat(*GetZeroTest[i].semantics,
pattern);		pattern);
[... 2,974 lines hidden ...]
}		}

TEST(APFloatTest, x87Next) {		TEST(APFloatTest, x87Next) {
APFloat F(APFloat::x87DoubleExtended(), "-1.0");		APFloat F(APFloat::x87DoubleExtended(), "-1.0");
F.next(false);		F.next(false);
EXPECT_TRUE(ilogb(F) == -1);		EXPECT_TRUE(ilogb(F) == -1);
}		}

TEST(APFloatTest, ToDouble) {		TEST(APFloatTest, IEEEdoubleToDouble) {
APFloat DPosZero(0.0);		APFloat DPosZero(0.0);
APFloat DPosZeroToDouble(DPosZero.convertToDouble());		APFloat DPosZeroToDouble(DPosZero.convertToDouble());
EXPECT_TRUE(DPosZeroToDouble.isPosZero());		EXPECT_TRUE(DPosZeroToDouble.isPosZero());
APFloat DNegZero(-0.0);		APFloat DNegZero(-0.0);
APFloat DNegZeroToDouble(DNegZero.convertToDouble());		APFloat DNegZeroToDouble(DNegZero.convertToDouble());
EXPECT_TRUE(DNegZeroToDouble.isNegZero());		EXPECT_TRUE(DNegZeroToDouble.isNegZero());

APFloat DOne(1.0);		APFloat DOne(1.0);
[... 19 lines hidden ...]	TEST(APFloatTest, IEEEdoubleToDouble) {

APFloat DPosInf = APFloat::getInf(APFloat::IEEEdouble());		APFloat DPosInf = APFloat::getInf(APFloat::IEEEdouble());
EXPECT_EQ(std::numeric_limits<double>::infinity(), DPosInf.convertToDouble());		EXPECT_EQ(std::numeric_limits<double>::infinity(), DPosInf.convertToDouble());
APFloat DNegInf = APFloat::getInf(APFloat::IEEEdouble(), true);		APFloat DNegInf = APFloat::getInf(APFloat::IEEEdouble(), true);
EXPECT_EQ(-std::numeric_limits<double>::infinity(),		EXPECT_EQ(-std::numeric_limits<double>::infinity(),
DNegInf.convertToDouble());		DNegInf.convertToDouble());
APFloat DQNaN = APFloat::getQNaN(APFloat::IEEEdouble());		APFloat DQNaN = APFloat::getQNaN(APFloat::IEEEdouble());
EXPECT_TRUE(std::isnan(DQNaN.convertToDouble()));		EXPECT_TRUE(std::isnan(DQNaN.convertToDouble()));
		}

		TEST(APFloatTest, IEEEsingleToDouble) {
APFloat FPosZero(0.0F);		APFloat FPosZero(0.0F);
APFloat FPosZeroToDouble(FPosZero.convertToDouble());		APFloat FPosZeroToDouble(FPosZero.convertToDouble());
EXPECT_TRUE(FPosZeroToDouble.isPosZero());		EXPECT_TRUE(FPosZeroToDouble.isPosZero());
APFloat FNegZero(-0.0F);		APFloat FNegZero(-0.0F);
APFloat FNegZeroToDouble(FNegZero.convertToDouble());		APFloat FNegZeroToDouble(FNegZero.convertToDouble());
EXPECT_TRUE(FNegZeroToDouble.isNegZero());		EXPECT_TRUE(FNegZeroToDouble.isNegZero());

APFloat FOne(1.0F);		APFloat FOne(1.0F);
[... 18 lines hidden ...]	TEST(APFloatTest, IEEEsingleToDouble) {

APFloat FPosInf = APFloat::getInf(APFloat::IEEEsingle());		APFloat FPosInf = APFloat::getInf(APFloat::IEEEsingle());
EXPECT_EQ(std::numeric_limits<double>::infinity(), FPosInf.convertToDouble());		EXPECT_EQ(std::numeric_limits<double>::infinity(), FPosInf.convertToDouble());
APFloat FNegInf = APFloat::getInf(APFloat::IEEEsingle(), true);		APFloat FNegInf = APFloat::getInf(APFloat::IEEEsingle(), true);
EXPECT_EQ(-std::numeric_limits<double>::infinity(),		EXPECT_EQ(-std::numeric_limits<double>::infinity(),
FNegInf.convertToDouble());		FNegInf.convertToDouble());
APFloat FQNaN = APFloat::getQNaN(APFloat::IEEEsingle());		APFloat FQNaN = APFloat::getQNaN(APFloat::IEEEsingle());
EXPECT_TRUE(std::isnan(FQNaN.convertToDouble()));		EXPECT_TRUE(std::isnan(FQNaN.convertToDouble()));
		}

		TEST(APFloatTest, IEEEhalfToDouble) {
APFloat HPosZero = APFloat::getZero(APFloat::IEEEhalf());		APFloat HPosZero = APFloat::getZero(APFloat::IEEEhalf());
APFloat HPosZeroToDouble(HPosZero.convertToDouble());		APFloat HPosZeroToDouble(HPosZero.convertToDouble());
EXPECT_TRUE(HPosZeroToDouble.isPosZero());		EXPECT_TRUE(HPosZeroToDouble.isPosZero());
APFloat HNegZero = APFloat::getZero(APFloat::IEEEhalf(), true);		APFloat HNegZero = APFloat::getZero(APFloat::IEEEhalf(), true);
APFloat HNegZeroToDouble(HNegZero.convertToDouble());		APFloat HNegZeroToDouble(HNegZero.convertToDouble());
EXPECT_TRUE(HNegZeroToDouble.isNegZero());		EXPECT_TRUE(HNegZeroToDouble.isNegZero());

APFloat HOne(APFloat::IEEEhalf(), "1.0");		APFloat HOne(APFloat::IEEEhalf(), "1.0");
[... 25 lines hidden ...]	TEST(APFloatTest, IEEEhalfToDouble) {
EXPECT_TRUE(std::isnan(HQNaN.convertToDouble()));		EXPECT_TRUE(std::isnan(HQNaN.convertToDouble()));

APFloat BPosZero = APFloat::getZero(APFloat::IEEEhalf());		APFloat BPosZero = APFloat::getZero(APFloat::IEEEhalf());
APFloat BPosZeroToDouble(BPosZero.convertToDouble());		APFloat BPosZeroToDouble(BPosZero.convertToDouble());
EXPECT_TRUE(BPosZeroToDouble.isPosZero());		EXPECT_TRUE(BPosZeroToDouble.isPosZero());
APFloat BNegZero = APFloat::getZero(APFloat::IEEEhalf(), true);		APFloat BNegZero = APFloat::getZero(APFloat::IEEEhalf(), true);
APFloat BNegZeroToDouble(BNegZero.convertToDouble());		APFloat BNegZeroToDouble(BNegZero.convertToDouble());
EXPECT_TRUE(BNegZeroToDouble.isNegZero());		EXPECT_TRUE(BNegZeroToDouble.isNegZero());
		}

		TEST(APFloatTest, BFloatToDouble) {
APFloat BOne(APFloat::BFloat(), "1.0");		APFloat BOne(APFloat::BFloat(), "1.0");
EXPECT_EQ(1.0, BOne.convertToDouble());		EXPECT_EQ(1.0, BOne.convertToDouble());
APFloat BPosLargest = APFloat::getLargest(APFloat::BFloat(), false);		APFloat BPosLargest = APFloat::getLargest(APFloat::BFloat(), false);
EXPECT_EQ(/*0x1.FEp127*/ 3.3895313892515355e+38,		EXPECT_EQ(/*0x1.FEp127*/ 3.3895313892515355e+38,
BPosLargest.convertToDouble());		BPosLargest.convertToDouble());
APFloat BNegLargest = APFloat::getLargest(APFloat::BFloat(), true);		APFloat BNegLargest = APFloat::getLargest(APFloat::BFloat(), true);
EXPECT_EQ(/*-0x1.FEp127*/ -3.3895313892515355e+38,		EXPECT_EQ(/*-0x1.FEp127*/ -3.3895313892515355e+38,
BNegLargest.convertToDouble());		BNegLargest.convertToDouble());
[... 17 lines hidden ...]	TEST(APFloatTest, BFloatToDouble) {
EXPECT_EQ(std::numeric_limits<double>::infinity(), BPosInf.convertToDouble());		EXPECT_EQ(std::numeric_limits<double>::infinity(), BPosInf.convertToDouble());
APFloat BNegInf = APFloat::getInf(APFloat::BFloat(), true);		APFloat BNegInf = APFloat::getInf(APFloat::BFloat(), true);
EXPECT_EQ(-std::numeric_limits<double>::infinity(),		EXPECT_EQ(-std::numeric_limits<double>::infinity(),
BNegInf.convertToDouble());		BNegInf.convertToDouble());
APFloat BQNaN = APFloat::getQNaN(APFloat::BFloat());		APFloat BQNaN = APFloat::getQNaN(APFloat::BFloat());
EXPECT_TRUE(std::isnan(BQNaN.convertToDouble()));		EXPECT_TRUE(std::isnan(BQNaN.convertToDouble()));
}		}

TEST(APFloatTest, ToFloat) {		TEST(APFloatTest, Float8E5M2ToDouble) {
		APFloat One(APFloat::Float8E5M2(), "1.0");
		EXPECT_EQ(1.0, One.convertToDouble());
		APFloat Two(APFloat::Float8E5M2(), "2.0");
		EXPECT_EQ(2.0, Two.convertToDouble());
		APFloat PosLargest = APFloat::getLargest(APFloat::Float8E5M2(), false);
		EXPECT_EQ(5.734400e+04, PosLargest.convertToDouble());
		APFloat NegLargest = APFloat::getLargest(APFloat::Float8E5M2(), true);
		EXPECT_EQ(-5.734400e+04, NegLargest.convertToDouble());
		APFloat PosSmallest =
		APFloat::getSmallestNormalized(APFloat::Float8E5M2(), false);
		EXPECT_EQ(0x1.p-14, PosSmallest.convertToDouble());
		APFloat NegSmallest =
		APFloat::getSmallestNormalized(APFloat::Float8E5M2(), true);
		EXPECT_EQ(-0x1.p-14, NegSmallest.convertToDouble());

		APFloat SmallestDenorm = APFloat::getSmallest(APFloat::Float8E5M2(), false);
		EXPECT_TRUE(SmallestDenorm.isDenormal());
		EXPECT_EQ(0x1p-16, SmallestDenorm.convertToDouble());

		APFloat PosInf = APFloat::getInf(APFloat::Float8E5M2());
		EXPECT_EQ(std::numeric_limits<double>::infinity(), PosInf.convertToDouble());
		APFloat NegInf = APFloat::getInf(APFloat::Float8E5M2(), true);
		EXPECT_EQ(-std::numeric_limits<double>::infinity(), NegInf.convertToDouble());
		APFloat QNaN = APFloat::getQNaN(APFloat::Float8E5M2());
		EXPECT_TRUE(std::isnan(QNaN.convertToDouble()));
		}

		TEST(APFloatTest, IEEEsingleToFloat) {
APFloat FPosZero(0.0F);		APFloat FPosZero(0.0F);
APFloat FPosZeroToFloat(FPosZero.convertToFloat());		APFloat FPosZeroToFloat(FPosZero.convertToFloat());
EXPECT_TRUE(FPosZeroToFloat.isPosZero());		EXPECT_TRUE(FPosZeroToFloat.isPosZero());
APFloat FNegZero(-0.0F);		APFloat FNegZero(-0.0F);
APFloat FNegZeroToFloat(FNegZero.convertToFloat());		APFloat FNegZeroToFloat(FNegZero.convertToFloat());
EXPECT_TRUE(FNegZeroToFloat.isNegZero());		EXPECT_TRUE(FNegZeroToFloat.isNegZero());

APFloat FOne(1.0F);		APFloat FOne(1.0F);
[... 17 lines hidden ...]	EXPECT_EQ(/*0x1.FFFFFEp-126*/ 2.3509885615147286e-38F,
FLargestDenorm.convertToFloat());		FLargestDenorm.convertToFloat());

APFloat FPosInf = APFloat::getInf(APFloat::IEEEsingle());		APFloat FPosInf = APFloat::getInf(APFloat::IEEEsingle());
EXPECT_EQ(std::numeric_limits<float>::infinity(), FPosInf.convertToFloat());		EXPECT_EQ(std::numeric_limits<float>::infinity(), FPosInf.convertToFloat());
APFloat FNegInf = APFloat::getInf(APFloat::IEEEsingle(), true);		APFloat FNegInf = APFloat::getInf(APFloat::IEEEsingle(), true);
EXPECT_EQ(-std::numeric_limits<float>::infinity(), FNegInf.convertToFloat());		EXPECT_EQ(-std::numeric_limits<float>::infinity(), FNegInf.convertToFloat());
APFloat FQNaN = APFloat::getQNaN(APFloat::IEEEsingle());		APFloat FQNaN = APFloat::getQNaN(APFloat::IEEEsingle());
EXPECT_TRUE(std::isnan(FQNaN.convertToFloat()));		EXPECT_TRUE(std::isnan(FQNaN.convertToFloat()));
		}

		TEST(APFloatTest, IEEEhalfToFloat) {
APFloat HPosZero = APFloat::getZero(APFloat::IEEEhalf());		APFloat HPosZero = APFloat::getZero(APFloat::IEEEhalf());
APFloat HPosZeroToFloat(HPosZero.convertToFloat());		APFloat HPosZeroToFloat(HPosZero.convertToFloat());
EXPECT_TRUE(HPosZeroToFloat.isPosZero());		EXPECT_TRUE(HPosZeroToFloat.isPosZero());
APFloat HNegZero = APFloat::getZero(APFloat::IEEEhalf(), true);		APFloat HNegZero = APFloat::getZero(APFloat::IEEEhalf(), true);
APFloat HNegZeroToFloat(HNegZero.convertToFloat());		APFloat HNegZeroToFloat(HNegZero.convertToFloat());
EXPECT_TRUE(HNegZeroToFloat.isNegZero());		EXPECT_TRUE(HNegZeroToFloat.isNegZero());

APFloat HOne(APFloat::IEEEhalf(), "1.0");		APFloat HOne(APFloat::IEEEhalf(), "1.0");
[... 17 lines hidden ...]	EXPECT_EQ(/*0x1.FFCp-14*/ 0.00012201070785522461F,
HLargestDenorm.convertToFloat());		HLargestDenorm.convertToFloat());

APFloat HPosInf = APFloat::getInf(APFloat::IEEEhalf());		APFloat HPosInf = APFloat::getInf(APFloat::IEEEhalf());
EXPECT_EQ(std::numeric_limits<float>::infinity(), HPosInf.convertToFloat());		EXPECT_EQ(std::numeric_limits<float>::infinity(), HPosInf.convertToFloat());
APFloat HNegInf = APFloat::getInf(APFloat::IEEEhalf(), true);		APFloat HNegInf = APFloat::getInf(APFloat::IEEEhalf(), true);
EXPECT_EQ(-std::numeric_limits<float>::infinity(), HNegInf.convertToFloat());		EXPECT_EQ(-std::numeric_limits<float>::infinity(), HNegInf.convertToFloat());
APFloat HQNaN = APFloat::getQNaN(APFloat::IEEEhalf());		APFloat HQNaN = APFloat::getQNaN(APFloat::IEEEhalf());
EXPECT_TRUE(std::isnan(HQNaN.convertToFloat()));		EXPECT_TRUE(std::isnan(HQNaN.convertToFloat()));
		}

		TEST(APFloatTest, BFloatToFloat) {
APFloat BPosZero = APFloat::getZero(APFloat::BFloat());		APFloat BPosZero = APFloat::getZero(APFloat::BFloat());
APFloat BPosZeroToDouble(BPosZero.convertToFloat());		APFloat BPosZeroToDouble(BPosZero.convertToFloat());
EXPECT_TRUE(BPosZeroToDouble.isPosZero());		EXPECT_TRUE(BPosZeroToDouble.isPosZero());
APFloat BNegZero = APFloat::getZero(APFloat::BFloat(), true);		APFloat BNegZero = APFloat::getZero(APFloat::BFloat(), true);
APFloat BNegZeroToDouble(BNegZero.convertToFloat());		APFloat BNegZeroToDouble(BNegZero.convertToFloat());
EXPECT_TRUE(BNegZeroToDouble.isNegZero());		EXPECT_TRUE(BNegZeroToDouble.isNegZero());

APFloat BOne(APFloat::BFloat(), "1.0");		APFloat BOne(APFloat::BFloat(), "1.0");
[... 22 lines hidden ...]	TEST(APFloatTest, BFloatToFloat) {

APFloat BPosInf = APFloat::getInf(APFloat::BFloat());		APFloat BPosInf = APFloat::getInf(APFloat::BFloat());
EXPECT_EQ(std::numeric_limits<float>::infinity(), BPosInf.convertToFloat());		EXPECT_EQ(std::numeric_limits<float>::infinity(), BPosInf.convertToFloat());
APFloat BNegInf = APFloat::getInf(APFloat::BFloat(), true);		APFloat BNegInf = APFloat::getInf(APFloat::BFloat(), true);
EXPECT_EQ(-std::numeric_limits<float>::infinity(), BNegInf.convertToFloat());		EXPECT_EQ(-std::numeric_limits<float>::infinity(), BNegInf.convertToFloat());
APFloat BQNaN = APFloat::getQNaN(APFloat::BFloat());		APFloat BQNaN = APFloat::getQNaN(APFloat::BFloat());
EXPECT_TRUE(std::isnan(BQNaN.convertToFloat()));		EXPECT_TRUE(std::isnan(BQNaN.convertToFloat()));
}		}

		TEST(APFloatTest, Float8E5M2ToFloat) {
		APFloat PosZero = APFloat::getZero(APFloat::Float8E5M2());
		APFloat PosZeroToFloat(PosZero.convertToFloat());
		EXPECT_TRUE(PosZeroToFloat.isPosZero());
		APFloat NegZero = APFloat::getZero(APFloat::Float8E5M2(), true);
		APFloat NegZeroToFloat(NegZero.convertToFloat());
		EXPECT_TRUE(NegZeroToFloat.isNegZero());

		APFloat One(APFloat::Float8E5M2(), "1.0");
		EXPECT_EQ(1.0F, One.convertToFloat());
		APFloat Two(APFloat::Float8E5M2(), "2.0");
		EXPECT_EQ(2.0F, Two.convertToFloat());

		APFloat PosLargest = APFloat::getLargest(APFloat::Float8E5M2(), false);
		EXPECT_EQ(5.734400e+04, PosLargest.convertToFloat());
		APFloat NegLargest = APFloat::getLargest(APFloat::Float8E5M2(), true);
		EXPECT_EQ(-5.734400e+04, NegLargest.convertToFloat());
		APFloat PosSmallest =
		APFloat::getSmallestNormalized(APFloat::Float8E5M2(), false);
		EXPECT_EQ(0x1.p-14, PosSmallest.convertToFloat());
		APFloat NegSmallest =
		APFloat::getSmallestNormalized(APFloat::Float8E5M2(), true);
		EXPECT_EQ(-0x1.p-14, NegSmallest.convertToFloat());

		APFloat SmallestDenorm = APFloat::getSmallest(APFloat::Float8E5M2(), false);
		EXPECT_TRUE(SmallestDenorm.isDenormal());
		EXPECT_EQ(0x1.p-16, SmallestDenorm.convertToFloat());

		APFloat PosInf = APFloat::getInf(APFloat::Float8E5M2());
		EXPECT_EQ(std::numeric_limits<float>::infinity(), PosInf.convertToFloat());
		APFloat NegInf = APFloat::getInf(APFloat::Float8E5M2(), true);
		EXPECT_EQ(-std::numeric_limits<float>::infinity(), NegInf.convertToFloat());
		APFloat QNaN = APFloat::getQNaN(APFloat::Float8E5M2());
		EXPECT_TRUE(std::isnan(QNaN.convertToFloat()));
}		}

		} // namespace
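
The constants expected by the `Float8E5M2ToDouble` and `Float8E5M2ToFloat` tests above (57344, 2^-14, 2^-16) follow directly from the format parameters. The sketch below derives them for an IEEE-like format, assuming the all-ones exponent is reserved for Inf/NaN as it is for E5M2 in this patch. `SmallFloatFormat`, `kE5M2`, and the function names are hypothetical and exist only for illustration.

```cpp
#include <cmath>

// Hypothetical description of an IEEE-like format: the all-ones exponent is
// assumed to be reserved for Inf/NaN and the all-zeros exponent for
// zero/denormals.
struct SmallFloatFormat {
  int exponentBits;
  int mantissaBits;
};

// E5M2 as introduced by this patch: 5 exponent bits, 2 mantissa bits.
constexpr SmallFloatFormat kE5M2{5, 2};

// IEEE-style bias: 2^(exponentBits - 1) - 1, which is 15 for E5M2.
int exponentBias(const SmallFloatFormat &f) {
  return (1 << (f.exponentBits - 1)) - 1;
}

// Largest finite value: significand 1.11...1b with the largest non-reserved
// exponent, whose unbiased value equals the bias.
double largestFinite(const SmallFloatFormat &f) {
  const double significand = 2.0 - std::ldexp(1.0, -f.mantissaBits);
  return std::ldexp(significand, exponentBias(f));
}

// Smallest normalized value: significand 1.0 with exponent field 1.
double smallestNormalized(const SmallFloatFormat &f) {
  return std::ldexp(1.0, 1 - exponentBias(f));
}

// Smallest denormal: only the lowest mantissa bit set, exponent field 0.
double smallestDenormal(const SmallFloatFormat &f) {
  return std::ldexp(1.0, 1 - exponentBias(f) - f.mantissaBits);
}
```

With `kE5M2`, `largestFinite` gives 1.75 * 2^15 = 57344, `smallestNormalized` gives 2^-14, and `smallestDenormal` gives 2^-16, matching the values the tests check.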

mlir/include/mlir-c/BuiltinTypes.h

	[... 61 lines hidden ...]
	/// Creates an index type in the given context. The type is owned by the			/// Creates an index type in the given context. The type is owned by the
	/// context.			/// context.
	MLIR_CAPI_EXPORTED MlirType mlirIndexTypeGet(MlirContext ctx);			MLIR_CAPI_EXPORTED MlirType mlirIndexTypeGet(MlirContext ctx);

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Floating-point types.			// Floating-point types.
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

				/// Checks whether the given type is an f8E5M2 type.
				MLIR_CAPI_EXPORTED bool mlirTypeIsAFloat8E5M2(MlirType type);

				/// Creates an f8E5M2 type in the given context. The type is owned by the
				/// context.
				MLIR_CAPI_EXPORTED MlirType mlirFloat8E5M2TypeGet(MlirContext ctx);

	/// Checks whether the given type is a bf16 type.			/// Checks whether the given type is a bf16 type.
	MLIR_CAPI_EXPORTED bool mlirTypeIsABF16(MlirType type);			MLIR_CAPI_EXPORTED bool mlirTypeIsABF16(MlirType type);

	/// Creates a bf16 type in the given context. The type is owned by the			/// Creates a bf16 type in the given context. The type is owned by the
	/// context.			/// context.
	MLIR_CAPI_EXPORTED MlirType mlirBF16TypeGet(MlirContext ctx);			MLIR_CAPI_EXPORTED MlirType mlirBF16TypeGet(MlirContext ctx);

	/// Checks whether the given type is an f16 type.			/// Checks whether the given type is an f16 type.
	[... 287 lines hidden ...]

mlir/include/mlir/IR/Builders.h

[... 53 lines hidden ...]	public:
MLIRContext *getContext() const { return context; }		MLIRContext *getContext() const { return context; }

// Locations.		// Locations.
Location getUnknownLoc();		Location getUnknownLoc();
Location getFusedLoc(ArrayRef<Location> locs,		Location getFusedLoc(ArrayRef<Location> locs,
Attribute metadata = Attribute());		Attribute metadata = Attribute());

// Types.		// Types.
		FloatType getFloat8E5M2Type();
FloatType getBF16Type();		FloatType getBF16Type();
FloatType getF16Type();		FloatType getF16Type();
FloatType getF32Type();		FloatType getF32Type();
FloatType getF64Type();		FloatType getF64Type();
FloatType getF80Type();		FloatType getF80Type();
FloatType getF128Type();		FloatType getF128Type();

IndexType getIndexType();		IndexType getIndexType();
[... 478 lines hidden ...]

mlir/include/mlir/IR/BuiltinTypes.h

[... 40 lines hidden ...]	public:

// Convenience factories.		// Convenience factories.
static FloatType getBF16(MLIRContext *ctx);		static FloatType getBF16(MLIRContext *ctx);
static FloatType getF16(MLIRContext *ctx);		static FloatType getF16(MLIRContext *ctx);
static FloatType getF32(MLIRContext *ctx);		static FloatType getF32(MLIRContext *ctx);
static FloatType getF64(MLIRContext *ctx);		static FloatType getF64(MLIRContext *ctx);
static FloatType getF80(MLIRContext *ctx);		static FloatType getF80(MLIRContext *ctx);
static FloatType getF128(MLIRContext *ctx);		static FloatType getF128(MLIRContext *ctx);
		static FloatType getFloat8E5M2(MLIRContext *ctx);

/// Methods to support type inquiry through isa, cast, and dyn_cast.		/// Methods to support type inquiry through isa, cast, and dyn_cast.
static bool classof(Type type);		static bool classof(Type type);

/// Return the bitwidth of this float type.		/// Return the bitwidth of this float type.
unsigned getWidth();		unsigned getWidth();

/// Return the width of the mantissa of this type.		/// Return the width of the mantissa of this type.
[... 311 lines hidden ...]

inline bool BaseMemRefType::isValidElementType(Type type) {		inline bool BaseMemRefType::isValidElementType(Type type) {
return type.isIntOrIndexOrFloat() ||		return type.isIntOrIndexOrFloat() ||
type.isa<ComplexType, MemRefType, VectorType, UnrankedMemRefType>() ||		type.isa<ComplexType, MemRefType, VectorType, UnrankedMemRefType>() ||
type.isa<MemRefElementTypeInterface>();		type.isa<MemRefElementTypeInterface>();
}		}

inline bool FloatType::classof(Type type) {		inline bool FloatType::classof(Type type) {
return type.isa<BFloat16Type, Float16Type, Float32Type, Float64Type,		return type.isa<Float8E5M2Type, BFloat16Type, Float16Type, Float32Type,
Float80Type, Float128Type>();		Float64Type, Float80Type, Float128Type>();
		}

		inline FloatType FloatType::getFloat8E5M2(MLIRContext *ctx) {
		return Float8E5M2Type::get(ctx);
}		}

inline FloatType FloatType::getBF16(MLIRContext *ctx) {		inline FloatType FloatType::getBF16(MLIRContext *ctx) {
return BFloat16Type::get(ctx);		return BFloat16Type::get(ctx);
}		}

inline FloatType FloatType::getF16(MLIRContext *ctx) {		inline FloatType FloatType::getF16(MLIRContext *ctx) {
return Float16Type::get(ctx);		return Float16Type::get(ctx);
[... 78 lines hidden ...]

mlir/include/mlir/IR/BuiltinTypes.td

	[... 71 lines hidden ...]
	class Builtin_FloatType<string name>			class Builtin_FloatType<string name>
	: Builtin_Type<name, /*traits=*/[], "::mlir::FloatType"> {			: Builtin_Type<name, /*traits=*/[], "::mlir::FloatType"> {
	let extraClassDeclaration = [{			let extraClassDeclaration = [{
	static }] # name # [{Type get(MLIRContext *context);			static }] # name # [{Type get(MLIRContext *context);
	}];			}];
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
				// Float8E5M2Type

				def Builtin_Float8E5M2 : Builtin_FloatType<"Float8E5M2"> {
				let summary = "8-bit floating point with 2-bit mantissa";
				let description = [{
				An 8-bit floating point type with 1 sign bit, 5 exponent bits, and 2
				mantissa bits. This is not a standard type as defined by IEEE-754, but it
				follows similar conventions with the following characteristics:

				* bit encoding: S1E5M2
				* exponent bias: 15
				* infinities: supported with exponent set to all 1s and mantissa 0s
				* NaNs: supported with exponent bits set to all 1s and mantissa of
				(01, 10, or 11)
				* denormals: supported with exponent bits set to all 0s and a nonzero
				mantissa

				Described in: https://arxiv.org/abs/2209.05433
				}];
				}


				//===----------------------------------------------------------------------===//
	// BFloat16Type			// BFloat16Type

	def Builtin_BFloat16 : Builtin_FloatType<"BFloat16"> {			def Builtin_BFloat16 : Builtin_FloatType<"BFloat16"> {
	let summary = "bfloat16 floating-point type";			let summary = "bfloat16 floating-point type";
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Float16Type			// Float16Type
	[... 917 lines hidden ...]
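
The characteristics listed in the `Float8E5M2` description above partition all 256 encodings into five classes. The following sketch classifies a byte according to those rules and counts each class. The enum and function names are hypothetical and not part of MLIR or LLVM.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Hypothetical classification of an E5M2 encoding, following the rules in the
// type description: S1E5M2 layout, Inf/NaN at all-ones exponent, zero and
// denormals at all-zeros exponent.
enum class E5M2Class { Zero, Denormal, Normal, Infinity, NaN };

E5M2Class classifyFloat8E5M2(std::uint8_t bits) {
  const unsigned exponentField = (bits >> 2) & 0x1f;
  const unsigned mantissaField = bits & 0x3;
  if (exponentField == 0)
    return mantissaField == 0 ? E5M2Class::Zero : E5M2Class::Denormal;
  if (exponentField == 0x1f)
    return mantissaField == 0 ? E5M2Class::Infinity : E5M2Class::NaN;
  return E5M2Class::Normal;
}

// Count how many of the 256 encodings fall into each class, indexed by the
// enumerator order of E5M2Class.
std::array<int, 5> countFloat8E5M2Classes() {
  std::array<int, 5> counts{};
  for (unsigned bits = 0; bits < 256; ++bits) {
    const E5M2Class c = classifyFloat8E5M2(static_cast<std::uint8_t>(bits));
    ++counts[static_cast<std::size_t>(c)];
  }
  return counts;
}
```

Counting both signs, this gives 2 zeros, 6 denormals, 240 normal values, 2 infinities, and 6 NaN encodings.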

mlir/include/mlir/IR/Types.h

[... 117 lines hidden ...]	public:
MLIRContext *getContext() const;		MLIRContext *getContext() const;

/// Get the dialect this type is registered to.		/// Get the dialect this type is registered to.
Dialect &getDialect() const { return impl->getAbstractType().getDialect(); }		Dialect &getDialect() const { return impl->getAbstractType().getDialect(); }

// Convenience predicates. This is only for floating point types,		// Convenience predicates. This is only for floating point types,
// derived types should use isa/dyn_cast.		// derived types should use isa/dyn_cast.
bool isIndex() const;		bool isIndex() const;
		bool isFloat8E5M2() const;
bool isBF16() const;		bool isBF16() const;
bool isF16() const;		bool isF16() const;
bool isF32() const;		bool isF32() const;
bool isF64() const;		bool isF64() const;
bool isF80() const;		bool isF80() const;
bool isF128() const;		bool isF128() const;

/// Return true if this is an integer type with the specified width.		/// Return true if this is an integer type with the specified width.
[... 222 lines hidden ...]

mlir/lib/AsmParser/TokenKinds.def

	[... 87 lines hidden ...]
	TOK_KEYWORD(ceildiv)			TOK_KEYWORD(ceildiv)
	TOK_KEYWORD(complex)			TOK_KEYWORD(complex)
	TOK_KEYWORD(dense)			TOK_KEYWORD(dense)
	TOK_KEYWORD(dense_resource)			TOK_KEYWORD(dense_resource)
	TOK_KEYWORD(f16)			TOK_KEYWORD(f16)
	TOK_KEYWORD(f32)			TOK_KEYWORD(f32)
	TOK_KEYWORD(f64)			TOK_KEYWORD(f64)
	TOK_KEYWORD(f80)			TOK_KEYWORD(f80)
				TOK_KEYWORD(f8E5M2)
	TOK_KEYWORD(f128)			TOK_KEYWORD(f128)
	TOK_KEYWORD(false)			TOK_KEYWORD(false)
	TOK_KEYWORD(floordiv)			TOK_KEYWORD(floordiv)
	TOK_KEYWORD(for)			TOK_KEYWORD(for)
	TOK_KEYWORD(func)			TOK_KEYWORD(func)
	TOK_KEYWORD(index)			TOK_KEYWORD(index)
	TOK_KEYWORD(loc)			TOK_KEYWORD(loc)
	TOK_KEYWORD(max)			TOK_KEYWORD(max)
	[... 23 lines hidden ...]

mlir/lib/AsmParser/TypeParser.cpp

[... 24 lines hidden ...]	OptionalParseResult Parser::parseOptionalType(Type &type) {
switch (getToken().getKind()) {		switch (getToken().getKind()) {
case Token::l_paren:		case Token::l_paren:
case Token::kw_memref:		case Token::kw_memref:
case Token::kw_tensor:		case Token::kw_tensor:
case Token::kw_complex:		case Token::kw_complex:
case Token::kw_tuple:		case Token::kw_tuple:
case Token::kw_vector:		case Token::kw_vector:
case Token::inttype:		case Token::inttype:
		case Token::kw_f8E5M2:
case Token::kw_bf16:		case Token::kw_bf16:
case Token::kw_f16:		case Token::kw_f16:
case Token::kw_f32:		case Token::kw_f32:
case Token::kw_f64:		case Token::kw_f64:
case Token::kw_f80:		case Token::kw_f80:
case Token::kw_f128:		case Token::kw_f128:
case Token::kw_index:		case Token::kw_index:
case Token::kw_none:		case Token::kw_none:
[... 240 lines hidden ...]	case Token::inttype: {
if (Optional<bool> signedness = getToken().getIntTypeSignedness())		if (Optional<bool> signedness = getToken().getIntTypeSignedness())
signSemantics = *signedness ? IntegerType::Signed : IntegerType::Unsigned;		signSemantics = *signedness ? IntegerType::Signed : IntegerType::Unsigned;

consumeToken(Token::inttype);		consumeToken(Token::inttype);
return IntegerType::get(getContext(), *width, signSemantics);		return IntegerType::get(getContext(), *width, signSemantics);
}		}

// float-type		// float-type
		case Token::kw_f8E5M2:
		consumeToken(Token::kw_f8E5M2);
		return builder.getFloat8E5M2Type();
case Token::kw_bf16:		case Token::kw_bf16:
consumeToken(Token::kw_bf16);		consumeToken(Token::kw_bf16);
return builder.getBF16Type();		return builder.getBF16Type();
case Token::kw_f16:		case Token::kw_f16:
consumeToken(Token::kw_f16);		consumeToken(Token::kw_f16);
return builder.getF16Type();		return builder.getF16Type();
case Token::kw_f32:		case Token::kw_f32:
consumeToken(Token::kw_f32);		consumeToken(Token::kw_f32);
[... 283 lines hidden ...]

mlir/lib/CAPI/IR/BuiltinTypes.cpp

	[... 62 lines hidden ...]
	MlirType mlirIndexTypeGet(MlirContext ctx) {			MlirType mlirIndexTypeGet(MlirContext ctx) {
	return wrap(IndexType::get(unwrap(ctx)));			return wrap(IndexType::get(unwrap(ctx)));
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Floating-point types.			// Floating-point types.
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

				bool mlirTypeIsAFloat8E5M2(MlirType type) {
				return unwrap(type).isFloat8E5M2();
				}

				MlirType mlirFloat8E5M2TypeGet(MlirContext ctx) {
				return wrap(FloatType::getFloat8E5M2(unwrap(ctx)));
				}

	bool mlirTypeIsABF16(MlirType type) { return unwrap(type).isBF16(); }			bool mlirTypeIsABF16(MlirType type) { return unwrap(type).isBF16(); }

	MlirType mlirBF16TypeGet(MlirContext ctx) {			MlirType mlirBF16TypeGet(MlirContext ctx) {
	return wrap(FloatType::getBF16(unwrap(ctx)));			return wrap(FloatType::getBF16(unwrap(ctx)));
	}			}

	bool mlirTypeIsAF16(MlirType type) { return unwrap(type).isF16(); }			bool mlirTypeIsAF16(MlirType type) { return unwrap(type).isF16(); }

	[... 309 lines hidden ...]

mlir/lib/IR/AsmPrinter.cpp

[... 2,173 lines hidden ...]	if (succeeded(printAlias(type)))
return;		return;

TypeSwitch<Type>(type)		TypeSwitch<Type>(type)
.Case<OpaqueType>([&](OpaqueType opaqueTy) {		.Case<OpaqueType>([&](OpaqueType opaqueTy) {
printDialectSymbol(os, "!", opaqueTy.getDialectNamespace(),		printDialectSymbol(os, "!", opaqueTy.getDialectNamespace(),
opaqueTy.getTypeData());		opaqueTy.getTypeData());
})		})
.Case<IndexType>([&](Type) { os << "index"; })		.Case<IndexType>([&](Type) { os << "index"; })
		.Case<Float8E5M2Type>([&](Type) { os << "f8E5M2"; })
.Case<BFloat16Type>([&](Type) { os << "bf16"; })		.Case<BFloat16Type>([&](Type) { os << "bf16"; })
.Case<Float16Type>([&](Type) { os << "f16"; })		.Case<Float16Type>([&](Type) { os << "f16"; })
.Case<Float32Type>([&](Type) { os << "f32"; })		.Case<Float32Type>([&](Type) { os << "f32"; })
.Case<Float64Type>([&](Type) { os << "f64"; })		.Case<Float64Type>([&](Type) { os << "f64"; })
.Case<Float80Type>([&](Type) { os << "f80"; })		.Case<Float80Type>([&](Type) { os << "f80"; })
.Case<Float128Type>([&](Type) { os << "f128"; })		.Case<Float128Type>([&](Type) { os << "f128"; })
.Case<IntegerType>([&](IntegerType integerTy) {		.Case<IntegerType>([&](IntegerType integerTy) {
if (integerTy.isSigned())		if (integerTy.isSigned())
[... 1,272 lines hidden ...]

mlir/lib/IR/Builders.cpp

 Location Builder::getFusedLoc(ArrayRef<Location> locs, Attribute metadata) {
   return FusedLoc::get(locs, metadata, context);
 }

 //===----------------------------------------------------------------------===//
 // Types.
 //===----------------------------------------------------------------------===//

+FloatType Builder::getFloat8E5M2Type() {
+  return FloatType::getFloat8E5M2(context);
+}
+
 FloatType Builder::getBF16Type() { return FloatType::getBF16(context); }

 FloatType Builder::getF16Type() { return FloatType::getF16(context); }

 FloatType Builder::getF32Type() { return FloatType::getF32(context); }

 FloatType Builder::getF64Type() { return FloatType::getF64(context); }

mlir/lib/IR/BuiltinTypes.cpp

@@ IntegerType IntegerType::scaleElementBitwidth(unsigned scale) {
   return IntegerType::get(getContext(), scale * getWidth(), getSignedness());
 }

 //===----------------------------------------------------------------------===//
 // Float Type
 //===----------------------------------------------------------------------===//

 unsigned FloatType::getWidth() {
+  if (isa<Float8E5M2Type>())
+    return 8;
   if (isa<Float16Type, BFloat16Type>())
     return 16;
   if (isa<Float32Type>())
     return 32;
   if (isa<Float64Type>())
     return 64;
   if (isa<Float80Type>())
     return 80;
   if (isa<Float128Type>())
     return 128;
   llvm_unreachable("unexpected float type");
 }

 /// Returns the floating semantics for the given type.
 const llvm::fltSemantics &FloatType::getFloatSemantics() {
+  if (isa<Float8E5M2Type>())
+    return APFloat::Float8E5M2();
   if (isa<BFloat16Type>())
     return APFloat::BFloat();
   if (isa<Float16Type>())
     return APFloat::IEEEhalf();
   if (isa<Float32Type>())
     return APFloat::IEEEsingle();
   if (isa<Float64Type>())
     return APFloat::IEEEdouble();
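
Note on the 8-bit width and the `Float8E5M2` semantics selected above: the format name implies 1 sign bit, 5 exponent bits, and 2 mantissa bits. The following is a minimal standalone sketch of how such a byte maps to a value. It is not code from this patch and does not use APFloat. The names `decodeE5M2` and `checkDecodeE5M2` are hypothetical, and the sketch assumes the IEEE-style conventions (exponent bias 15, subnormals, infinities, NaNs) that the summary attributes to E5M2.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <limits>

// Hypothetical helper, not part of the patch: decode one E5M2 byte to float.
// Assumed layout: 1 sign bit, 5 exponent bits (bias 15), 2 mantissa bits.
float decodeE5M2(std::uint8_t bits) {
  const int sign = (bits >> 7) & 0x1;
  const int exponent = (bits >> 2) & 0x1F;
  const int mantissa = bits & 0x3;
  const float signScale = sign ? -1.0f : 1.0f;

  if (exponent == 0x1F) {
    // All-ones exponent: infinity when the mantissa is zero, NaN otherwise.
    if (mantissa == 0)
      return signScale * std::numeric_limits<float>::infinity();
    return std::numeric_limits<float>::quiet_NaN();
  }
  if (exponent == 0) {
    // Zero and subnormals: no implicit leading one, fixed exponent of -14.
    return signScale * std::ldexp(static_cast<float>(mantissa) / 4.0f, -14);
  }
  // Normal numbers: implicit leading one, exponent stored with bias 15.
  return signScale *
         std::ldexp(1.0f + static_cast<float>(mantissa) / 4.0f, exponent - 15);
}

// Self-check with values that follow directly from the assumed layout.
void checkDecodeE5M2() {
  assert(decodeE5M2(0x3C) == 1.0f);      // exponent field 15, mantissa 0
  assert(decodeE5M2(0x40) == 2.0f);      // exponent field 16, mantissa 0
  assert(decodeE5M2(0x7B) == 57344.0f);  // largest finite: 1.75 * 2^15
  assert(std::isinf(decodeE5M2(0x7C)));  // all-ones exponent, mantissa 0
  assert(std::isnan(decodeE5M2(0x7E)));  // all-ones exponent, mantissa != 0
}
```

Under these assumptions the constant `2.` used in the `f8E5M2` test case in this revision is exactly representable (byte `0x40`), which is consistent with it round-tripping as `2.000000e+00`.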

mlir/lib/IR/MLIRContext.cpp

@@ #endif
   //===--------------------------------------------------------------------===//
   // Type uniquing
   //===--------------------------------------------------------------------===//

   DenseMap<TypeID, AbstractType *> registeredTypes;
   StorageUniquer typeUniquer;

   /// Cached Type Instances.
+  Float8E5M2Type f8E5M2Ty;
   BFloat16Type bf16Ty;
   Float16Type f16Ty;
   Float32Type f32Ty;
   Float64Type f64Ty;
   Float80Type f80Ty;
   Float128Type f128Ty;
   IndexType indexTy;
   IntegerType int1Ty, int8Ty, int16Ty, int32Ty, int64Ty, int128Ty;
@@ MLIRContext::MLIRContext(const DialectRegistry &registry, Threading setting)
   // Ensure the builtin dialect is always pre-loaded.
   getOrLoadDialect<BuiltinDialect>();

   // Initialize several common attributes and types to avoid the need to lock
   // the context when accessing them.

   //// Types.
   /// Floating-point Types.
+  impl->f8E5M2Ty = TypeUniquer::get<Float8E5M2Type>(this);
   impl->bf16Ty = TypeUniquer::get<BFloat16Type>(this);
   impl->f16Ty = TypeUniquer::get<Float16Type>(this);
   impl->f32Ty = TypeUniquer::get<Float32Type>(this);
   impl->f64Ty = TypeUniquer::get<Float64Type>(this);
   impl->f80Ty = TypeUniquer::get<Float80Type>(this);
   impl->f128Ty = TypeUniquer::get<Float128Type>(this);
   /// Index Type.
   impl->indexTy = TypeUniquer::get<IndexType>(this);
@@
 //===----------------------------------------------------------------------===//
 // Type uniquing
 //===----------------------------------------------------------------------===//

 /// Returns the storage uniquer used for constructing type storage instances.
 /// This should not be used directly.
 StorageUniquer &MLIRContext::getTypeUniquer() { return getImpl().typeUniquer; }

+Float8E5M2Type Float8E5M2Type::get(MLIRContext *context) {
+  return context->getImpl().f8E5M2Ty;
+}
 BFloat16Type BFloat16Type::get(MLIRContext *context) {
   return context->getImpl().bf16Ty;
 }
 Float16Type Float16Type::get(MLIRContext *context) {
   return context->getImpl().f16Ty;
 }
 Float32Type Float32Type::get(MLIRContext *context) {
   return context->getImpl().f32Ty;

mlir/lib/IR/Types.cpp

 using namespace mlir::detail;

 //===----------------------------------------------------------------------===//
 // Type
 //===----------------------------------------------------------------------===//

 MLIRContext *Type::getContext() const { return getDialect().getContext(); }

+bool Type::isFloat8E5M2() const { return isa<Float8E5M2Type>(); }
 bool Type::isBF16() const { return isa<BFloat16Type>(); }
 bool Type::isF16() const { return isa<Float16Type>(); }
 bool Type::isF32() const { return isa<Float32Type>(); }
 bool Type::isF64() const { return isa<Float64Type>(); }
 bool Type::isF80() const { return isa<Float80Type>(); }
 bool Type::isF128() const { return isa<Float128Type>(); }

 bool Type::isIndex() const { return isa<IndexType>(); }

mlir/test/IR/attribute.mlir

@@ func.func @any_attr_of_fail() {
   } : () -> ()

   return
 }

 // -----

 //===----------------------------------------------------------------------===//
+// Test float attributes
+//===----------------------------------------------------------------------===//
+
+func.func @float_attrs_pass() {
+  "test.float_attrs"() {
+    // CHECK: float_attr = 2.000000e+00 : f8E5M2
+    float_attr = 2. : f8E5M2
+  } : () -> ()
+  "test.float_attrs"() {
+    // CHECK: float_attr = 2.000000e+00 : f16
+    float_attr = 2. : f16
+  } : () -> ()
+  "test.float_attrs"() {
+    // CHECK: float_attr = 2.000000e+00 : bf16
+    float_attr = 2. : bf16
+  } : () -> ()
+  "test.float_attrs"() {
+    // CHECK: float_attr = 2.000000e+00 : f32
+    float_attr = 2. : f32
+  } : () -> ()
+  "test.float_attrs"() {
+    // CHECK: float_attr = 2.000000e+00 : f64
+    float_attr = 2. : f64
+  } : () -> ()
+  "test.float_attrs"() {
+    // CHECK: float_attr = 2.000000e+00 : f80
+    float_attr = 2. : f80
+  } : () -> ()
+  "test.float_attrs"() {
+    // CHECK: float_attr = 2.000000e+00 : f128
+    float_attr = 2. : f128
+  } : () -> ()
+  return
+}
+
+//===----------------------------------------------------------------------===//
 // Test integer attributes
 //===----------------------------------------------------------------------===//

 func.func @int_attrs_pass() {
   "test.int_attrs"() {
     // CHECK: any_i32_attr = 5 : ui32
     any_i32_attr = 5 : ui32,
     // CHECK-SAME: index_attr = 8 : index

mlir/test/lib/Dialect/Test/TestOps.td

 def TypeArrayAttrWithDefaultOp : TEST_Op<"type_array_attr_with_default"> {
   let arguments = (ins DefaultValuedAttr<TypeArrayAttr, "{}">:$attr);
 }
 def TypeStringAttrWithTypeOp : TEST_Op<"string_attr_with_type"> {
   let arguments = (ins TypedStrAttr<AnyType>:$attr);
   let assemblyFormat = "$attr attr-dict";
 }

+def FloatAttrOp : TEST_Op<"float_attrs"> {
+  // TODO: Clean up the OpBase float type and attribute selectors so they
+  // can express all of the types.
    Inline comment (on the TODO above) from jpienaar: I'll file an issue if you haven't already.
+  let arguments = (ins
+    AnyAttr:$float_attr
+  );
+}
+
 def I32Case5: I32EnumAttrCase<"case5", 5>;
 def I32Case10: I32EnumAttrCase<"case10", 10>;

 def SomeI32Enum: I32EnumAttr<
   "SomeI32Enum", "", [I32Case5, I32Case10]>;

 def I32EnumAttrOp : TEST_Op<"i32_enum_attr"> {
   let arguments = (ins SomeI32Enum:$attr);

Diff 465240
