This is an archive of the discontinued LLVM Phabricator instance.

Differential D74729

[FPEnv] Intrinsic for setting rounding mode
ClosedPublic

Authored by sepavloff on Feb 17 2020, 10:16 AM.

Details

Reviewers

andrew.w.kaylor
kpn
cameron.mcinally
craig.topper
RKSimon
jdoerfert

Commits

rGbf416d166bdd: [FPEnv] Intrinsic for setting rounding mode

Summary

To set a non-default rounding mode, a user usually calls the function 'fesetround'
from the standard C library. This approach has some disadvantages.

It creates an unnecessary dependency on libc. Setting the rounding mode requires only a few instructions and could be done by the compiler. Sometimes the standard C library is not even available, as in the case of GPU or AI cores that execute small kernels.
The compiler could generate more efficient code if it knew that a particular call just sets the rounding mode.

This change introduces a new IR intrinsic, 'llvm.set.rounding', which
sets the current rounding mode, similar to 'fesetround'. It differs
from the latter, however, because it is a lower-level facility:

'llvm.set.rounding' does not return any value, whereas 'fesetround' returns a non-zero value in the case of failure. In glibc, 'fesetround' reports failure if its argument is invalid or unsupported, or if floating-point operations are unavailable on the hardware. The compiler usually knows which core it generates code for and can validate arguments in many cases.
The rounding mode is specified in 'fesetround' using constants like 'FE_TONEAREST', which are target-dependent. It is inconvenient to work with such constants at the IR level.

The C standard provides a target-independent way to specify the rounding mode, which
is used in FLT_ROUNDS; however, it does not define a standard way to set
the rounding mode using this encoding.

This change implements only the IR intrinsic. Lowering it to machine code is
target-specific and will be implemented later. Mapping of 'fesetround'
to 'llvm.set.rounding' is also not implemented here.
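
The difference between the two encodings can be illustrated with a small C++ sketch. It assumes a hosted target whose `<cfenv>` defines the usual `FE_*` macros. The helper names `toFenvConstant` and `setRoundingByEncoding` are hypothetical and are not part of this patch; they only show how the target-independent FLT_ROUNDS-style values (0 to 3) would relate to the target-dependent constants accepted by 'fesetround'.

```cpp
#include <cfenv>
#include <optional>

// Hypothetical helper (not part of the patch): translate the
// target-independent FLT_ROUNDS-style encoding into the target-dependent
// <cfenv> constant. Returns std::nullopt when this target's <cfenv> has no
// matching FE_* macro.
std::optional<int> toFenvConstant(int Mode) {
  switch (Mode) {
#ifdef FE_TOWARDZERO
  case 0: return FE_TOWARDZERO;  // toward zero
#endif
#ifdef FE_TONEAREST
  case 1: return FE_TONEAREST;   // to nearest, ties to even
#endif
#ifdef FE_UPWARD
  case 2: return FE_UPWARD;      // toward positive infinity
#endif
#ifdef FE_DOWNWARD
  case 3: return FE_DOWNWARD;    // toward negative infinity
#endif
  default:
    // Mode 4 (to nearest, ties away from zero) and any target-specific
    // values have no standard FE_* macro, so they are rejected here.
    return std::nullopt;
  }
}

// Hypothetical libc-based stand-in for setting the rounding mode by its
// target-independent encoding. Returns true on success.
bool setRoundingByEncoding(int Mode) {
  std::optional<int> FE = toFenvConstant(Mode);
  if (!FE)
    return false;
  return std::fesetround(*FE) == 0;
}
```

Unlike this libc-based sketch, the intrinsic itself returns no status; a caller that needs one has to validate the mode before the call.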

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sepavloff created this revision. Feb 17 2020, 10:16 AM

Herald added a project: Restricted Project. Feb 17 2020, 10:16 AM

Herald added subscribers: jdoerfert, hiraditya.

Harbormaster completed remote builds in B46651: Diff 245003. Feb 17 2020, 10:21 AM

sepavloff added a child revision: D74730: [FPEnv][X86] Implement lowering of llvm.set.rounding. Feb 17 2020, 10:26 AM

sepavloff mentioned this in D77379: [FPEnv] Use single enum to represent rounding mode. Apr 6 2020, 3:10 AM

Rebased patch

Harbormaster failed remote builds in B54769: Diff 260257! Apr 27 2020, 3:43 AM

Add missed change

sepavloff added a reviewer: RKSimon. Apr 27 2020, 5:18 AM

Harbormaster failed remote builds in B54778: Diff 260275! Apr 27 2020, 5:20 AM

sepavloff mentioned this in D74730: [FPEnv][X86] Implement lowering of llvm.set.rounding. Apr 27 2020, 5:25 AM

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

In D74729#2005777, @kpn wrote:

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

AMDGPU can

In D74729#2005862, @arsenm wrote:

In D74729#2005777, @kpn wrote:

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

AMDGPU can

Is the fpsetround() function available on AMDGPU? Put another way, is this intrinsic needed on AMDGPU?

In D74729#2005862, @arsenm wrote:

In D74729#2005777, @kpn wrote:

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

AMDGPU can

Originally we had 1 instruction that can change all aspects of the FP environment (which has a heavy runtime cost). The newest subtargets also have 2 additional and faster instructions that can separately set the rounding mode and the denormal mode. Therefore the decision of how to lower this is different per-subtarget, and therefore an abstracted and legalizable intrinsic is useful.

In D74729#2005867, @kpn wrote:

In D74729#2005862, @arsenm wrote:

In D74729#2005777, @kpn wrote:

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

AMDGPU can

Is the fpsetround() function available on AMDGPU? Put another way, is this intrinsic needed on AMDGPU?

We have no ISA libcalls of any kind, everything is through instructions (with different handling per-subtarget depending on what you're setting)

I think we need to be clear in the documentation that this is _not_ a substitute for the constrained FP intrinsics. Can you please add that to your new documentation? Include a link to the "Floating-Point Environment" section in the language ref.

Add to the IR verifier checks that the input is a constant that matches the documentation. The type and input values all need to be checked.

arsenm added inline comments. Apr 27 2020, 12:37 PM

llvm/docs/LangRef.rst
18397–18401	I'm wondering if this should be more opaque, and broader for the entire FP environment (not just the rounding mode). For AMDGPU we have a number of additional bits in the FP environment. We also have the denormal mode, enabling FP exceptions, and a few more exotic target specific FP mode bits.

RKSimon added inline comments. Apr 28 2020, 5:43 AM

llvm/docs/LangRef.rst
18371	Remove "read or " ?
18397–18401	@arsenm Would these extra bits be exclusive modes or would you need this to support target specific mode combos?

In D74729#2005958, @kpn wrote:

I think we need to be clear in the documentation that this is _not_ a substitute for the constrained FP intrinsics. Can you please add that to your new documentation? Include a link to the "Floating-Point Environment" section in the language ref.

It cannot be such a substitute. IIUC, constrained intrinsics were introduced to represent variants of the corresponding C functions that operate in a non-default FP environment, to distinguish them from the "ordinary" variants, which are pure functions. Functions like set_rounding always access the FP environment; they always have side effects and must be properly ordered.

Add to the IR verifier checks that the input is a constant that matches the documentation. The type and input values all need to be checked.

The type of the input value is already checked by generic code. As for the values, there are two notes.

The input value may be a variable, not a constant.
A target may support non-standard rounding modes.

However, constant values are limited to 3 bits, and one of the encodable values (Dynamic) cannot be used as an argument. So adding a check to the IR verifier makes sense.

llvm/docs/LangRef.rst
18371	This is a section for a group of intrinsics. For now there is only one intrinsic in it, which indeed only writes the FP environment. It makes sense to implement intrinsics for standard C functions like `fegetmode`, `fetestexcept` and others. Actually, the intrinsic `flt_rounds` may be documented here. I will add documentation for it.
18397–18401	> I'm wondering if this should be more opaque, and broader for the entire FP environment (not just the rounding mode).
	The C library defines the function `fesetmode`, which sets all control modes, not just rounding. It makes sense to introduce an intrinsic for it, which would serve these purposes.

In D74729#2008093, @sepavloff wrote:

In D74729#2005958, @kpn wrote:

I think we need to be clear in the documentation that this is _not_ a substitute for the constrained FP intrinsics. Can you please add that to your new documentation? Include a link to the "Floating-Point Environment" section in the language ref.

It cannot be such a substitute. IIUC, constrained intrinsics were introduced to represent variants of the corresponding C functions that operate in a non-default FP environment, to distinguish them from the "ordinary" variants, which are pure functions. Functions like set_rounding always access the FP environment; they always have side effects and must be properly ordered.

Sure, you know that, and I know that, but someone who hasn't been following along the past couple of years may not know that. A sentence or two tying things together won't hurt. Something along the lines of "Altering the rounding mode requires special care. See 'Floating-Point Environment'.", with a link to that section of the documentation.

Updated patch

Added verification code,
Added note to documentation.

In D74729#2008437, @kpn wrote:

In D74729#2008093, @sepavloff wrote:

In D74729#2005958, @kpn wrote:

I think we need to be clear in the documentation that this is _not_ a substitute for the constrained FP intrinsics. Can you please add that to your new documentation? Include a link to the "Floating-Point Environment" section in the language ref.

It cannot be such a substitute. IIUC, constrained intrinsics were introduced to represent variants of the corresponding C functions that operate in a non-default FP environment, to distinguish them from the "ordinary" variants, which are pure functions. Functions like set_rounding always access the FP environment; they always have side effects and must be properly ordered.

Sure, you know that, and I know that, but someone who hasn't been following along the past couple of years may not know that. A sentence or two tying things together won't hurt. Something along the lines of "Altering the rounding mode requires special care. See 'Floating-Point Environment'.", with a link to that section of the documentation.

Ah, got it! Added note to the documentation. Thank you!

llvm/docs/LangRef.rst
18371	Added `flt_rounds` in D79322.

Harbormaster failed remote builds in B55626: Diff 261789! May 4 2020, 6:22 AM

arsenm added inline comments. May 6 2020, 3:29 PM

llvm/docs/LangRef.rst
18397–18401	We have the rounding mode controls as presented here; however, they are broken down by FP type. We can separately set the rounding mode for f32 and f64/f16, so there are two different settings. We also have the denormal mode, for inputs and outputs, also broken down by type in the same way. The denormal handling and per-type handling I think deserve consideration here. We have 2 additional target specific FP bits nothing else would need to really think about, but it would be nice if you could set the exact mode you want in a single intrinsic call. I'm less interested in these though.
18397–18401	Oh, we also have a bit to turn on/off fp exceptions which is probably generally interesting

arsenm added inline comments. May 6 2020, 3:33 PM

llvm/docs/LangRef.rst
18397–18401	And by a bit, I mean a mask of bits for different FP exception types

Rebased patch

Harbormaster failed remote builds in B58004: Diff 266439! May 27 2020, 1:35 AM

craig.topper added inline comments. May 27 2020, 11:40 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
6197	Don't you need to call getRoot not DAG.getRoot()?
llvm/lib/IR/Verifier.cpp
4973	Is the argument intended to always be a constant or we're just verifying it when we can? The latter seems unusual.

Updated patch

Use getRoot() instead of DAG.getRoot().

sepavloff marked 2 inline comments as done. May 28 2020, 5:09 AM

sepavloff added inline comments.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
6197	Indeed. Thank you!
llvm/lib/IR/Verifier.cpp
4973	The argument may be a variable. If it is a constant, it must be a valid rounding mode. It is expected to be a value of type `RoundingMode`. Values from 0 to 4 denote IEEE rounding modes; they may be followed by target-specific rounding modes. The argument value must be less than `RoundingMode::Dynamic`, which is currently 7. I am not sure this code is useful enough, since even for a constant argument its validity cannot be fully verified due to non-IEEE rounding modes. Probably we should remove this check.
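
The rule described in this comment can be sketched as a standalone C++ function. This is only an illustration of the check under discussion, not the Verifier code from the patch (the check was later dropped from the Verifier). The enum below mirrors the values given in the comment, with enumerator names assumed to match `llvm::RoundingMode`; `isAcceptableSetRoundingArg` is a hypothetical name, and an empty optional stands in for a non-constant argument.

```cpp
#include <cstdint>
#include <optional>

// Redeclared locally so the sketch is self-contained. The numeric values
// follow the review comment (IEEE modes 0 to 4, Dynamic = 7); the
// enumerator names are an assumption.
enum class RoundingMode : int8_t {
  TowardZero = 0,
  NearestTiesToEven = 1,
  TowardPositive = 2,
  TowardNegative = 3,
  NearestTiesToAway = 4,
  Dynamic = 7,
};

// Hypothetical check: ConstArg is empty when the argument is a variable.
bool isAcceptableSetRoundingArg(std::optional<uint64_t> ConstArg) {
  // A variable argument cannot be validated statically.
  if (!ConstArg)
    return true;
  // Values 0 to 4 are IEEE modes; 5 and 6 are left for target-specific
  // modes; Dynamic (7) and anything above it are rejected.
  return *ConstArg < static_cast<uint64_t>(RoundingMode::Dynamic);
}
```

Values 5 and 6 pass this check although only a specific target can tell whether they are meaningful, which is the weakness raised in the comment.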

Rebased patch

Harbormaster failed remote builds in B59640: Diff 269541! Jun 9 2020, 9:52 AM

I'm wondering if this should be more opaque, and broader for the entire FP environment (not just the rounding mode). For AMDGPU we have a number of additional bits in the FP environment. We also have the denormal mode, enabling FP exceptions, and a few more exotic target specific FP mode bits.

The intrinsics get.fpmode and set.fpmode introduced in D82525 may be used for this case.

Herald added a reviewer: jdoerfert. Jul 2 2020, 4:47 AM

Rebased patch

Harbormaster completed remote builds in B63359: Diff 276313. Jul 7 2020, 11:42 PM

Rebased patch

Harbormaster failed remote builds in B65755: Diff 280784! Jul 26 2020, 10:38 PM

Rebased patch

Harbormaster completed remote builds in B68128: Diff 285114. Aug 12 2020, 10:20 AM

Updated patch

Harbormaster completed remote builds in B73310: Diff 294927. Sep 29 2020, 4:36 AM

Removed clang-tidy warning

Harbormaster completed remote builds in B73676: Diff 295606. Oct 1 2020, 10:12 AM

Any feedback is appreciated.

Herald added a subscriber: pengfei. Oct 4 2020, 11:02 PM

RKSimon added inline comments. Oct 5 2020, 11:37 AM

llvm/lib/IR/Verifier.cpp
4973	I'm OK with this being dropped - @craig.topper @kpn ?

craig.topper added inline comments. Oct 5 2020, 2:25 PM

llvm/lib/IR/Verifier.cpp
4973	I'm fine removing it

Removed check from Verifier

Harbormaster completed remote builds in B74095: Diff 296365. Oct 5 2020, 11:53 PM

Ping.

RKSimon added inline comments. Oct 23 2020, 8:52 AM

llvm/include/llvm/CodeGen/ISDOpcodes.h
708	Sorry for the bikeshedding - but if SET_ROUNDING is supposed to match FLT_ROUNDS - shouldn't it have a more similar name?

sepavloff added inline comments. Oct 26 2020, 8:42 AM

llvm/include/llvm/CodeGen/ISDOpcodes.h
708	It is `FLT_ROUNDS_` that has the "wrong" name. It is named after the macro `FLT_ROUNDS`, which is defined by C99. To get better names, `FLT_ROUNDS_` must be renamed, not `SET_ROUNDING`.

ok, I've no more questions @arsenm @kpn?

llvm/include/llvm/CodeGen/ISDOpcodes.h
708	OK - add a TODO comment by FLT_ROUNDS_ then?

In D74729#2354279, @RKSimon wrote:

ok, I've no more questions @arsenm @kpn?

Nothing from me.

Added TODO.

sepavloff added inline comments. Oct 26 2020, 11:17 PM

llvm/include/llvm/CodeGen/ISDOpcodes.h
708	Done.

Harbormaster completed remote builds in B76510: Diff 300901. Oct 27 2020, 12:48 AM

craig.topper added inline comments. Oct 27 2020, 7:45 PM

llvm/include/llvm/IR/IRBuilder.h
897	Do we need this? I don't think we provide IRBuilder for all intrinsics. Just common ones.

Do we need type legalization support for targets like RISCV where i32 isn't a legal type in SelectionDAG?

Removed method IRBuilderBase::createSetRounding.

Harbormaster completed remote builds in B76693: Diff 301210. Oct 28 2020, 3:41 AM

In D74729#2358110, @craig.topper wrote:

Do we need type legalization support for targets like RISCV where i32 isn't a legal type in SelectionDAG?

Lowering of the intrinsic is anyway custom. Default promotion to i64 is OK.

In D74729#2358796, @sepavloff wrote:

In D74729#2358110, @craig.topper wrote:

Do we need type legalization support for targets like RISCV where i32 isn't a legal type in SelectionDAG?

Lowering of the intrinsic is anyway custom. Default promotion to i64 is OK.

There is no default promotion code. Each opcode that needs to be promoted by type legalization must be handled in LegalizeIntegerTypes.cpp

Add support in DAGTypeLegalizer::PromoteIntegerOperand

Harbormaster completed remote builds in B77673: Diff 303062. Nov 5 2020, 2:44 AM

sepavloff added a child revision: D91242: [RISCV] Custom lowering of SET_ROUNDING. Nov 11 2020, 2:38 AM

In D74729#2359928, @craig.topper wrote:

In D74729#2358796, @sepavloff wrote:

In D74729#2358110, @craig.topper wrote:

Do we need type legalization support for targets like RISCV where i32 isn't a legal type in SelectionDAG?

Lowering of the intrinsic is anyway custom. Default promotion to i64 is OK.

There is no default promotion code. Each opcode that needs to be promoted by type legalization must be handled in LegalizeIntegerTypes.cpp

Handling of SET_ROUNDING on RISCV is implemented in D91242.

Ping.

LGTM - any other comments? @craig.topper is the RISCV followup at D91242 OK do you think? It'll probably be what other targets end up using as reference.

This revision was not accepted when it landed; it landed in state Needs Review. Jan 31 2021, 8:29 PM

This revision was landed with ongoing or failed builds.

Closed by commit rGbf416d166bdd: [FPEnv] Intrinsic for setting rounding mode (authored by sepavloff).

This revision was automatically updated to reflect the committed changes.

sepavloff added a commit: rGbf416d166bdd: [FPEnv] Intrinsic for setting rounding mode.

sepavloff mentioned this in D83036: [X86][FPEnv] Lowering of {get,set,reset}_fpmode. Mar 5 2021, 10:17 AM

xiongji90 mentioned this in D144454: Add builtin for llvm set rounding. Feb 28 2023, 12:51 AM

Revision Contents

Path	Size
llvm/docs/LangRef.rst	31 lines
llvm/include/llvm/CodeGen/ISDOpcodes.h	10 lines
llvm/include/llvm/IR/IRBuilder.h	5 lines
llvm/include/llvm/IR/Intrinsics.td	1 line
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp	6 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp	5 lines
llvm/lib/IR/Verifier.cpp	6 lines
llvm/unittests/IR/IRBuilderTest.cpp	5 lines

Diff 266439

llvm/docs/LangRef.rst

  This function returns the same values as the libm ``trunc`` functions
  would and handles error conditions in the same way.


  Floating Point Environment Manipulation intrinsics
  --------------------------------------------------

  These functions read or write floating point environment, such as rounding
  mode or state of floating point exceptions. Altering the floating point
  environment requires special care. See :ref:`Floating Point Environment <floatenv>`.

  '``llvm.flt.rounds``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

  Syntax:
  """""""
  [9 unchanged lines not shown]

  Semantics:
  """"""""""

  The '``llvm.flt.rounds``' intrinsic returns the current rounding mode.
  Encoding of the returned values is same as the result of ``FLT_ROUNDS``,
  specified by C standard:

  ::

      0 - toward zero
      1 - to nearest, ties to even
      2 - toward positive infinity
      3 - toward negative infinity
      4 - to nearest, ties away from zero

  Other values may be used to represent additional rounding modes, supported by a
  target. These values are target-specific.

+
+ '``llvm.set.rounding``' Intrinsic
+ ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+ Syntax:
+ """""""
+
+ ::
+
+     declare void @llvm.set.rounding(i32 <val>)
+
+ Overview:
+ """""""""
+
+ The '``llvm.set.rounding``' intrinsic sets current rounding mode.
+
+ Arguments:
+ """"""""""
+
+ The argument is the required rounding mode. Encoding of rounding mode is
+ the same as used by '``llvm.flt.rounds``'.
+
+ Semantics:
+ """"""""""
+
+ The '``llvm.set.rounding``' intrinsic sets the current rounding mode. It is
+ similar to C library function 'fesetround', however this intrinsic does not
+ return any value and uses platform-independent representation of IEEE rounding
+ modes.
+

  General Intrinsics
  ------------------

  This class of intrinsics is designed to be generic and has no specific
  purpose.

  '``llvm.var.annotation``' Intrinsic
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

llvm/include/llvm/CodeGen/ISDOpcodes.h

@@ enum NodeType {
    ///
    /// The TRUNC = 1 case is used in cases where we know that the value will
    /// not be modified by the node, because Y is not using any of the extra
    /// precision of source type. This allows certain transformations like
    /// FP_EXTEND(FP_ROUND(X,1)) -> X which are not safe for
    /// FP_EXTEND(FP_ROUND(X,0)) because the extra bits aren't removed.
    FP_ROUND,

-   /// FLT_ROUNDS_ - Returns current rounding mode:
+   /// Returns current rounding mode:
    /// -1 Undefined
    /// 0 Round to 0
-   /// 1 Round to nearest
+   /// 1 Round to nearest, ties to even
    /// 2 Round to +inf
    /// 3 Round to -inf
+   /// 4 Round to nearest, ties to zero
    /// Result is rounding mode and chain. Input is a chain.
    FLT_ROUNDS_,
+
+   /// Set rounding mode.
+   /// The first operand is a chain pointer. The second specifies the required
+   /// rounding mode, encoded in the same way as used in '``FLT_ROUNDS_``'.
+   SET_ROUNDING,

    /// X = FP_EXTEND(Y) - Extend a smaller FP type into a larger FP type.
    FP_EXTEND,

    /// BITCAST - This operator converts between integer, vector and FP
    /// values, as if the value was stored to memory with one type and loaded
    /// from the same address with the other type (or equivalently for vector
    /// format conversions, etc). The source and result are required to have
    /// the same bit size (e.g. f32 <-> i32). This can also be used for

llvm/include/llvm/IR/IRBuilder.h

@@ CallInst *CreateMinimum(Value *LHS, Value *RHS, const Twine &Name = "") {
      return CreateBinaryIntrinsic(Intrinsic::minimum, LHS, RHS, nullptr, Name);
    }

    /// Create call to the maximum intrinsic.
    CallInst *CreateMaximum(Value *LHS, Value *RHS, const Twine &Name = "") {
      return CreateBinaryIntrinsic(Intrinsic::maximum, LHS, RHS, nullptr, Name);
    }

+   /// Create call to the set_rounding intrinsic.
+   CallInst *CreateSetRounding(Value *RM, const Twine &Name = "") {
+     return CreateIntrinsic(Intrinsic::set_rounding, {}, {RM}, nullptr, Name);
+   }
+
  private:
    /// Create a call to a masked intrinsic with given Id.
    CallInst *CreateMaskedIntrinsic(Intrinsic::ID Id, ArrayRef<Value *> Ops,
                                    ArrayRef<Type *> OverloadedTypes,
                                    const Twine &Name = "");

    Value *getCastedInt8PtrValue(Value *Ptr);


llvm/include/llvm/IR/Intrinsics.td

@@ def int_objectsize : Intrinsic<[llvm_anyint_ty],
      [IntrNoMem, IntrSpeculatable, IntrWillReturn, ImmArg<1>, ImmArg<2>, ImmArg<3>]>,
      GCCBuiltin<"__builtin_object_size">;

  //===--------------- Access to Floating Point Environment -----------------===//
  //

  let IntrProperties = [IntrInaccessibleMemOnly, IntrWillReturn] in {
    def int_flt_rounds : Intrinsic<[llvm_i32_ty], []>;
+   def int_set_rounding : Intrinsic<[], [llvm_i32_ty]>;
  }

  //===--------------- Constrained Floating Point Intrinsics ----------------===//
  //

  let IntrProperties = [IntrInaccessibleMemOnly, IntrWillReturn] in {
    def int_experimental_constrained_fadd : Intrinsic<[ llvm_anyfloat_ty ],
      [ LLVMMatchType<0>,

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

@@ ... @@ setValue(&I, DAG.getNode(ISD::BITCAST, sdl, MVT::i16,
        MVT::i32))));
    return;
  case Intrinsic::convert_from_fp16:
    setValue(&I, DAG.getNode(ISD::FP_EXTEND, sdl,
        TLI.getValueType(DAG.getDataLayout(), I.getType()),
        DAG.getNode(ISD::BITCAST, sdl, MVT::f16,
            getValue(I.getArgOperand(0)))));
    return;
+ case Intrinsic::set_rounding:
+   Res = DAG.getNode(ISD::SET_ROUNDING, sdl, MVT::Other,
+       {DAG.getRoot(), getValue(I.getArgOperand(0))});

craig.topper (inline comment, not done): Don't you need to call getRoot, not DAG.getRoot()?
sepavloff (author, done): Indeed. Thank you!

+   setValue(&I, Res);
+   DAG.setRoot(Res.getValue(0));
+   return;
  case Intrinsic::pcmarker: {
    SDValue Tmp = getValue(I.getArgOperand(0));
    DAG.setRoot(DAG.getNode(ISD::PCMARKER, sdl, MVT::Other, getRoot(), Tmp));
    return;
  }
  case Intrinsic::readcyclecounter: {
    SDValue Op = getRoot();
    Res = DAG.getNode(ISD::READCYCLECOUNTER, sdl,
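The distinction raised in the inline comment can be illustrated with a toy model. This is only a sketch and not the LLVM API: `ToyDAG`, `ToyBuilder`, `emitChained`, and both `lowerWith...` functions are hypothetical names. It assumes that the builder-level `getRoot()` attaches pending chained nodes (such as pending loads) to the chain before returning it, whereas `DAG.getRoot()` returns the current root unchanged.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

using Chain = std::vector<std::string>;

// Toy stand-in for the DAG: it only remembers the nodes ordered on the chain.
struct ToyDAG {
  Chain Root{"entry"};
  const Chain &getRoot() const { return Root; }
  void setRoot(Chain NewRoot) { Root = std::move(NewRoot); }
};

// Toy stand-in for the builder, which may hold nodes not yet on the chain.
struct ToyBuilder {
  ToyDAG DAG;
  Chain Pending; // nodes created but not yet attached to the root chain

  // Builder-level accessor: attach pending nodes first, then return the root.
  Chain getRoot() {
    Chain R = DAG.getRoot();
    R.insert(R.end(), Pending.begin(), Pending.end());
    Pending.clear();
    DAG.setRoot(R);
    return R;
  }
};

// Emit a chained node after the given root and make it the new root.
Chain emitChained(ToyDAG &DAG, Chain Root, const std::string &Node) {
  Root.push_back(Node);
  DAG.setRoot(Root);
  return Root;
}

// Variant from the reviewed diff: the pending load is not ordered before
// the rounding-mode change.
Chain lowerWithDagRoot() {
  ToyBuilder B;
  B.Pending.push_back("load");
  return emitChained(B.DAG, B.DAG.getRoot(), "set_rounding");
}

// Variant suggested by the reviewer: the pending load is flushed first.
Chain lowerWithBuilderRoot() {
  ToyBuilder B;
  B.Pending.push_back("load");
  return emitChained(B.DAG, B.getRoot(), "set_rounding");
}
```

In this model `lowerWithDagRoot()` produces a chain in which `set_rounding` is not ordered after the pending load, while `lowerWithBuilderRoot()` places the load ahead of it.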

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

  //===- SelectionDAGDumper.cpp - Implement SelectionDAG::dump() ------------===//

Lint: clang-format suggested style edits found.

  //
  // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
  // See https://llvm.org/LICENSE.txt for license information.
  // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
  //
  //===----------------------------------------------------------------------===//
  //
  // This implements the SelectionDAG::dump method and friends.
@@ ... @@ #endif
  case ISD::ANY_EXTEND: return "any_extend";
  case ISD::SIGN_EXTEND_INREG: return "sign_extend_inreg";
  case ISD::ANY_EXTEND_VECTOR_INREG: return "any_extend_vector_inreg";
  case ISD::SIGN_EXTEND_VECTOR_INREG: return "sign_extend_vector_inreg";
  case ISD::ZERO_EXTEND_VECTOR_INREG: return "zero_extend_vector_inreg";
  case ISD::TRUNCATE: return "truncate";
  case ISD::FP_ROUND: return "fp_round";
  case ISD::STRICT_FP_ROUND: return "strict_fp_round";
- case ISD::FLT_ROUNDS_: return "flt_rounds";
  case ISD::FP_EXTEND: return "fp_extend";
  case ISD::STRICT_FP_EXTEND: return "strict_fp_extend";

  case ISD::SINT_TO_FP: return "sint_to_fp";
  case ISD::STRICT_SINT_TO_FP: return "strict_sint_to_fp";
  case ISD::UINT_TO_FP: return "uint_to_fp";
  case ISD::STRICT_UINT_TO_FP: return "strict_uint_to_fp";
  case ISD::FP_TO_SINT: return "fp_to_sint";
@@ ... @@ #endif
  case ISD::GC_TRANSITION_END: return "gc_transition.end";
  case ISD::GET_DYNAMIC_AREA_OFFSET: return "get.dynamic.area.offset";
  case ISD::FREEZE: return "freeze";
  case ISD::PREALLOCATED_SETUP:
    return "call_setup";
  case ISD::PREALLOCATED_ARG:
    return "call_alloc";

+ // Floating point environment manipulation
+ case ISD::FLT_ROUNDS_: return "flt_rounds";
+ case ISD::SET_ROUNDING: return "set_rounding";
+
  // Bit manipulation
  case ISD::ABS: return "abs";
  case ISD::BITREVERSE: return "bitreverse";
  case ISD::BSWAP: return "bswap";
  case ISD::CTPOP: return "ctpop";
  case ISD::CTTZ: return "cttz";
  case ISD::CTTZ_ZERO_UNDEF: return "cttz_zero_undef";
  case ISD::CTLZ: return "ctlz";

llvm/lib/IR/Verifier.cpp

@@ ... @@ case Intrinsic::matrix_columnwise_store: {
      default:
        llvm_unreachable("unexpected intrinsic");
      }
      Assert(TypeToCheck->getNumElements() ==
                 NumRows->getZExtValue() * NumColumns->getZExtValue(),
             "result of a matrix operation does not fit in the returned vector");
      break;
    }
+   case Intrinsic::set_rounding: {
+     if (auto RM = dyn_cast<ConstantInt>(Call.getArgOperand(0))) {

craig.topper (inline comment, not done): Is the argument intended to always be a constant, or are we just verifying it when we can? The latter seems unusual.
sepavloff (author, done): The argument may be a variable. If it is a constant, it must be a valid rounding mode. It is expected to be a value of type `RoundingMode`. Values from 0 to 4 denote IEEE rounding modes; they may be followed by target-specific rounding modes. The argument value must be less than `RoundingMode::Dynamic`, which is now 7. I am not sure this code is useful enough, since even for a constant argument its validity cannot be verified because of non-IEEE rounding modes. We should probably remove this check.
RKSimon (inline comment, not done): I'm OK with this being dropped - @craig.topper @kpn ?
craig.topper (inline comment, not done): I'm fine removing it.

+       Assert(RM->getZExtValue() < static_cast<unsigned>(RoundingMode::Dynamic),
+              "invalid value of rounding mode");
+     }
+   }
    };
  }

  /// Carefully grab the subprogram from a local scope.
  ///
  /// This carefully grabs the subprogram from a local scope, avoiding the
  /// built-in assertions that would typically fire.
  static DISubprogram *getSubprogram(Metadata *LocalScope) {
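The check discussed in the inline comments can be tried outside LLVM. The sketch below is a standalone approximation, not the Verifier code: `ToyRoundingMode`, `isAcceptedConstant`, and `toFenvConstant` are hypothetical names. The numeric values follow the encoding described in the review (0 to 4 for IEEE modes, 7 for `Dynamic`), and the `FE_*` macros are assumed to come from the host `<cfenv>`, so each one is guarded in case the platform does not define it.

```cpp
#include <cassert>
#include <cfenv>
#include <cstdint>

// Hypothetical local copy of the encoding described in the review;
// this is not llvm::RoundingMode itself.
enum class ToyRoundingMode : int {
  TowardZero = 0,
  NearestTiesToEven = 1,
  TowardPositive = 2,
  TowardNegative = 3,
  NearestTiesToAway = 4,
  Dynamic = 7
};

// Same comparison as the Assert in the diff: any constant below Dynamic passes.
bool isAcceptedConstant(std::uint64_t V) {
  return V < static_cast<unsigned>(ToyRoundingMode::Dynamic);
}

// Maps the target-independent value to a <cfenv> constant where the host
// defines one. Returns false when no standard C constant corresponds.
bool toFenvConstant(std::uint64_t V, int &Out) {
  switch (V) {
#ifdef FE_TOWARDZERO
  case 0: Out = FE_TOWARDZERO; return true;
#endif
#ifdef FE_TONEAREST
  case 1: Out = FE_TONEAREST; return true;
#endif
#ifdef FE_UPWARD
  case 2: Out = FE_UPWARD; return true;
#endif
#ifdef FE_DOWNWARD
  case 3: Out = FE_DOWNWARD; return true;
#endif
  default: return false; // ties-to-away and target-specific modes
  }
}
```

The values 5 and 6 pass `isAcceptedConstant` even though no IEEE mode is assigned to them, which is the weakness of the check that sepavloff points out.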

llvm/unittests/IR/IRBuilderTest.cpp

@@ ... @@ TEST_F(IRBuilderTest, Intrinsics) {
    EXPECT_TRUE(II->hasNoInfs());
    EXPECT_FALSE(II->hasNoNaNs());

    Call = Builder.CreateUnaryIntrinsic(Intrinsic::roundeven, V);
    II = cast<IntrinsicInst>(Call);
    EXPECT_EQ(II->getIntrinsicID(), Intrinsic::roundeven);
    EXPECT_FALSE(II->hasNoInfs());
    EXPECT_FALSE(II->hasNoNaNs());
+
+   Call = Builder.CreateSetRounding(
+       Builder.getInt32(static_cast<uint32_t>(RoundingMode::TowardZero)));
+   II = cast<IntrinsicInst>(Call);
+   EXPECT_EQ(II->getIntrinsicID(), Intrinsic::set_rounding);
  }

  TEST_F(IRBuilderTest, IntrinsicsWithScalableVectors) {
    IRBuilder<> Builder(BB);
    CallInst *Call;
    FunctionType *FTy;

    // Test scalable flag isn't dropped for intrinsic that is explicitly defined

Diff 266439
