This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
docs/
3/9
LangRef.rst
-
include/llvm/
-
llvm/
-
CodeGen/
2/4
ISDOpcodes.h
-
IR/
1
IRBuilder.h
-
Intrinsics.td
-
lib/
-
CodeGen/SelectionDAG/
-
SelectionDAG/
1/2
SelectionDAGBuilder.cpp
-
SelectionDAGDumper.cpp
-
IR/
1/4
Verifier.cpp
-
unittests/IR/
-
IR/
-
IRBuilderTest.cpp

Differential D74729

[FPEnv] Intrinsic for setting rounding mode
ClosedPublic

Authored by sepavloff on Feb 17 2020, 10:16 AM.

Download Raw Diff

Details

Reviewers

andrew.w.kaylor
kpn
cameron.mcinally
craig.topper
RKSimon
jdoerfert

Commits

rGbf416d166bdd: [FPEnv] Intrinsic for setting rounding mode

Summary

To set non-default rounding mode user usually calls function 'fesetround'
from standard C library. This way has some disadvantages.

It creates unnecessary dependency on libc. On the other hand, setting rounding mode requires few instruction and could be made by compiler. Sometimes standard C library even is not available, like in the case of GPU or AI cores that execute small kernels.
Compiler could generate more effective code if it know that particular call just sets rounding mode.

This change introduces new IR intrinsic, namely 'llvm.set.rounding', which
sets current rounding mode, similar to 'fesetround'. It however differs
from the latter, because it is a lower level facility:

'llvm.set.rounding' does not return any value, whereas 'fesetround' returns non-zero value in the case of failure. In glibc 'fesetround' reports failure if its argument is invalid or unsupported or if floating point operations are unavailable on the hardware. Compiler usually knows what core it generates code for and it can validate arguments in many cases.
Rounding mode is specified in 'fesetround' using constants like 'FE_TONEAREST', which are target dependent. It is inconvenient to work with such constants at IR level.

C standard provides a target-independent way to specify rounding mode, it
is used in FLT_ROUNDS, however it does not define standard way to set
rounding mode using this encoding.

This change implements only IR intrinsic. Lowering it to machine code is
target-specific and will be implemented latter. Mapping of 'fesetround'
to 'llvm.set.rounding' is also not implemented here.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sepavloff created this revision.Feb 17 2020, 10:16 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 17 2020, 10:16 AM

Herald added subscribers: jdoerfert, hiraditya. · View Herald Transcript

Harbormaster completed remote builds in B46651: Diff 245003.Feb 17 2020, 10:21 AM

sepavloff added a child revision: D74730: [FPEnv][X86] Implement lowering of llvm.set.rounding.Feb 17 2020, 10:26 AM

sepavloff mentioned this in D77379: [FPEnv] Use single enum to represent rounding mode.Apr 6 2020, 3:10 AM

Rebased patch

Harbormaster failed remote builds in B54769: Diff 260257!Apr 27 2020, 3:43 AM

Add missed change

sepavloff added a reviewer: RKSimon.Apr 27 2020, 5:18 AM

Harbormaster failed remote builds in B54778: Diff 260275!Apr 27 2020, 5:20 AM

sepavloff mentioned this in D74730: [FPEnv][X86] Implement lowering of llvm.set.rounding.Apr 27 2020, 5:25 AM

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

In D74729#2005777, @kpn wrote:

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

AMDGPU can

In D74729#2005862, @arsenm wrote:

In D74729#2005777, @kpn wrote:

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

AMDGPU can

Is the fpsetround() function available on AMDGPU? Put another way, is this intrinsic needed on AMDGPU?

In D74729#2005862, @arsenm wrote:

In D74729#2005777, @kpn wrote:

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

AMDGPU can

Originally we had 1 instruction that can change all aspects of the FP environment (which has a heavy runtime cost). The newest subtargets also have 2 additional and faster instructions that can separately set the rounding mode, and denormal mode. Therefore the decision of how to lower this is different per-subtarget, and therefore an abstracted and legalizable intrinsic is useful

In D74729#2005867, @kpn wrote:

In D74729#2005862, @arsenm wrote:

In D74729#2005777, @kpn wrote:

Are there GPUs or AI cores _targeted by llvm_ that support changing the rounding mode at run-time?

AMDGPU can

Is the fpsetround() function available on AMDGPU? Put another way, is this intrinsic needed on AMDGPU?

We have no ISA libcalls of any kind, everything is through instructions (with different handling per-subtarget depending on what you're setting)

I think we need to be clear in the documentation that this is _not_ a substitute for the constrained FP intrinsics. Can you please add that to your new documentation? Include a link to the "Floating-Point Environment" section in the language ref.

Add to the IR verifier checks that the input is a constant that matches the documentation. The type and input values all need to be checked.

arsenm added inline comments.Apr 27 2020, 12:37 PM

llvm/docs/LangRef.rst
18283–18287	I'm wondering if this should be more opaque, and broader for the entire FP environment (not just the rounding mode). For AMDGPU we have a number of additional bits in the FP environment. We also have the denormal mode, enabling FP exceptions, and a few more exotic target specific FP mode bits.

RKSimon added inline comments.Apr 28 2020, 5:43 AM

llvm/docs/LangRef.rst
18257	Remove "read or " ?
18283–18287	@arsenm Would these extra bits be exclusive modes or would you need this to support target specific mode combos?

In D74729#2005958, @kpn wrote:

I think we need to be clear in the documentation that this is _not_ a substitute for the constrained FP intrinsics. Can you please add that to your new documentation? Include a link to the "Floating-Point Environment" section in the language ref.

It cannot be such substitute. IIUC constrained intrinsics were introduced to represent variants of corresponding C functions that operate in non-default FP environment to distinguish them from "ordinary" variants, that are pure functions. Functions like set_rounding always access FP environment, it always have side effect and must be properly ordered.

Add to the IR verifier checks that the input is a constant that matches the documentation. The type and input values all need to be checked.

The type of input value is already checked by generic code. As for values, there are two notes.

Input values may be a variable, not constant.
A target may support non-standard rounding modes.

However constant values are limited by 3 bits, of which one (Dynamic) cannot be used as argument. So adding check to IR verifier makes sense.

llvm/docs/LangRef.rst
18257	This is a section for group of intrinsics. Now there is only one intrinsic in it, which indeed only writes FP environment. It makes sense to implement intrinsics fo standard C functions, like `fegetmode`, `fetestexcept` and others. Actually the intrinsic `flt_rounds` may be documented here. I will add documentation for it.
18283–18287	I'm wondering if this should be more opaque, and broader for the entire FP environment (not just the rounding mode). C library defined function `fesetmode`, which sets all control modes, not just rounding. It make sense to introduce intrinsic for it, which would serve these purposes.

In D74729#2008093, @sepavloff wrote:

In D74729#2005958, @kpn wrote:

I think we need to be clear in the documentation that this is _not_ a substitute for the constrained FP intrinsics. Can you please add that to your new documentation? Include a link to the "Floating-Point Environment" section in the language ref.

It cannot be such substitute. IIUC constrained intrinsics were introduced to represent variants of corresponding C functions that operate in non-default FP environment to distinguish them from "ordinary" variants, that are pure functions. Functions like set_rounding always access FP environment, it always have side effect and must be properly ordered.

Sure, you know that, and I know that, but someone who hasn't been following along the past couple of years may not know that. A sentence or two tying things together won't hurt. Something along the lines of "Altering the rounding mode requires special care. See 'Floating-Point Environment'.", with a link to that section of the documentation.

Updated patch

Added verification code,
Added note to documentation.

In D74729#2008437, @kpn wrote:

In D74729#2008093, @sepavloff wrote:

In D74729#2005958, @kpn wrote:

I think we need to be clear in the documentation that this is _not_ a substitute for the constrained FP intrinsics. Can you please add that to your new documentation? Include a link to the "Floating-Point Environment" section in the language ref.

It cannot be such substitute. IIUC constrained intrinsics were introduced to represent variants of corresponding C functions that operate in non-default FP environment to distinguish them from "ordinary" variants, that are pure functions. Functions like set_rounding always access FP environment, it always have side effect and must be properly ordered.

Sure, you know that, and I know that, but someone who hasn't been following along the past couple of years may not know that. A sentence or two tying things together won't hurt. Something along the lines of "Altering the rounding mode requires special care. See 'Floating-Point Environment'.", with a link to that section of the documentation.

Ah, got it! Added note to the documentation. Thank you!

llvm/docs/LangRef.rst
18257	Added `flt_rounds` in D79322.

Harbormaster failed remote builds in B55626: Diff 261789!May 4 2020, 6:22 AM

arsenm added inline comments.May 6 2020, 3:29 PM

llvm/docs/LangRef.rst
18283–18287	We have the rounding mode controls as presented here, however they are broken down by FP type. We can separately set the rounding mode for f32 and f64/f16, so there are two different settings. We also have the denormal mode, for inputs and outputs, also broken down by type in the same way. The denormal handling and per-type handling I think deserve consideration here We have 2 additional target specific FP bits nothing else would need to really think about, but it would be nice if you could set the exact mode you want in a single intrinsic call. I'm less interested in these though
18283–18287	Oh, we also have a bit to turn on/off fp exceptions which is probably generally interesting

arsenm added inline comments.May 6 2020, 3:33 PM

llvm/docs/LangRef.rst
18283–18287	And by a bit, I mean a mask of bits for different FP exception types

Rebased patch

Harbormaster failed remote builds in B58004: Diff 266439!May 27 2020, 1:35 AM

craig.topper added inline comments.May 27 2020, 11:40 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
6152	Don't you need to call getRoot not DAG.getRoot()?
llvm/lib/IR/Verifier.cpp
4941	Is the argument intended to always be a constant or we're just verifying it when we can? The latter seems unusual.

Updated patch

Use getRoot() instead of DAG.getRoot().

sepavloff marked 2 inline comments as done.May 28 2020, 5:09 AM

sepavloff added inline comments.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
6152	Indeed. Thank you!
llvm/lib/IR/Verifier.cpp
4941	The argument may be a variable. If it is a constant, it must be a valid rounding mode. It is expected to be a value of type `RoundingMode`. Values from 0 to 4 denote IEEE rounding modes, they may be followed by target-specific rounding modes. The argument value must be less than `RoundingMode::Dynamic`, which now if 7. I am hesitating if this code is useful enough, as even for constant argument its validity cannot be verified due to non-IEEE rounding modes. Probably we should remove this check.

Rebased patch

Harbormaster failed remote builds in B59640: Diff 269541!Jun 9 2020, 9:52 AM

I'm wondering if this should be more opaque, and broader for the entire FP environment (not just the rounding mode). For AMDGPU we have a number of additional bits in the FP environment. We also have the denormal mode, enabling FP exceptions, and a few more exotic target specific FP mode bits.

Instrinsics get.fpmode and set.fpmode introduced in D82525 may be used for this case.

Herald added a reviewer: jdoerfert. · View Herald TranscriptJul 2 2020, 4:47 AM

Rebased patch

Harbormaster completed remote builds in B63359: Diff 276313.Jul 7 2020, 11:42 PM

Rebased patch

Harbormaster failed remote builds in B65755: Diff 280784!Jul 26 2020, 10:38 PM

Rebased patch

Harbormaster completed remote builds in B68128: Diff 285114.Aug 12 2020, 10:20 AM

Updated patch

Harbormaster completed remote builds in B73310: Diff 294927.Sep 29 2020, 4:36 AM

Removed clang-tidy warning

Harbormaster completed remote builds in B73676: Diff 295606.Oct 1 2020, 10:12 AM

Any feedback is appreciated.

Herald added a subscriber: pengfei. · View Herald TranscriptOct 4 2020, 11:02 PM

RKSimon added inline comments.Oct 5 2020, 11:37 AM

llvm/lib/IR/Verifier.cpp
4941	I'm OK with this being dropped - @craig.topper @kpn ?

craig.topper added inline comments.Oct 5 2020, 2:25 PM

llvm/lib/IR/Verifier.cpp
4941	I'm fine removing it

Removed check from Verifier

Harbormaster completed remote builds in B74095: Diff 296365.Oct 5 2020, 11:53 PM

Ping.

RKSimon added inline comments.Oct 23 2020, 8:52 AM

llvm/include/llvm/CodeGen/ISDOpcodes.h
1022	Sorry for the bikeshedding - but if SET_ROUNDING is supposed to match FLT_ROUNDS - shouldn't it have a more similar name?

sepavloff added inline comments.Oct 26 2020, 8:42 AM

llvm/include/llvm/CodeGen/ISDOpcodes.h
1022	It is `FLT_ROUNDS_` that has "wrong" name. It is named after the macro `FLT_ROUNDS`, which is defined by C99. To get better names `FLT_ROUNDS_` must be renamed not `SET_ROUNDING`.

ok, I've no more questions @arsenm @kpn?

llvm/include/llvm/CodeGen/ISDOpcodes.h
1022	OK - add a TODO comment by FLT_ROUNDS_ then?

In D74729#2354279, @RKSimon wrote:

ok, I've no more questions @arsenm @kpn?

Nothing from me.

Added TODO.

sepavloff added inline comments.Oct 26 2020, 11:17 PM

llvm/include/llvm/CodeGen/ISDOpcodes.h
1022	Done.

Harbormaster completed remote builds in B76510: Diff 300901.Oct 27 2020, 12:48 AM

craig.topper added inline comments.Oct 27 2020, 7:45 PM

llvm/include/llvm/IR/IRBuilder.h
881	Do we need this? I don't think we provide IRBuilder for all intrinsics. Just common ones.

Do we need type legalization support for targets like RISCV where i32 isn't a legal type in SelectionDAG?

Removed method IRBuilderBase::createSetRounding.

Harbormaster completed remote builds in B76693: Diff 301210.Oct 28 2020, 3:41 AM

In D74729#2358110, @craig.topper wrote:

Do we need type legalization support for targets like RISCV where i32 isn't a legal type in SelectionDAG?

Lowering of the intrinsic is anyway custom. Default promotion to i64 is OK.

In D74729#2358796, @sepavloff wrote:

In D74729#2358110, @craig.topper wrote:

Do we need type legalization support for targets like RISCV where i32 isn't a legal type in SelectionDAG?

Lowering of the intrinsic is anyway custom. Default promotion to i64 is OK.

There is no default promotion code. Each opcode that needs to be promoted by type legalization must be handled in LegalizeIntegerTypes.cpp

Add support in DAGTypeLegalizer::PromoteIntegerOperand

Harbormaster completed remote builds in B77673: Diff 303062.Nov 5 2020, 2:44 AM

sepavloff added a child revision: D91242: [RISCV] Custom lowering of SET_ROUNDING.Nov 11 2020, 2:38 AM

In D74729#2359928, @craig.topper wrote:

In D74729#2358796, @sepavloff wrote:

In D74729#2358110, @craig.topper wrote:

Do we need type legalization support for targets like RISCV where i32 isn't a legal type in SelectionDAG?

Lowering of the intrinsic is anyway custom. Default promotion to i64 is OK.

There is no default promotion code. Each opcode that needs to be promoted by type legalization must be handled in LegalizeIntegerTypes.cpp

Handling of SET_ROUNDING on RISCV is implemented in D91242.

Ping.

LGTM - any other comments? @craig.topper is the RISCV followup at D91242 OK do you think? It'll probably be what other targets end up using as reference.

This revision was not accepted when it landed; it landed in state Needs Review.Jan 31 2021, 8:29 PM

This revision was landed with ongoing or failed builds.

Closed by commit rGbf416d166bdd: [FPEnv] Intrinsic for setting rounding mode (authored by sepavloff). · Explain Why

This revision was automatically updated to reflect the committed changes.

sepavloff added a commit: rGbf416d166bdd: [FPEnv] Intrinsic for setting rounding mode.

sepavloff mentioned this in D83036: [X86][FPEnv] Lowering of {get,set,reset}_fpmode.Mar 5 2021, 10:17 AM

xiongji90 mentioned this in D144454: Add builtin for llvm set rounding.Feb 28 2023, 12:51 AM

Revision Contents

Path

Size

llvm/

docs/

LangRef.rst

44 lines

include/

llvm/

CodeGen/

ISDOpcodes.h

6 lines

IR/

IRBuilder.h

5 lines

Intrinsics.td

4 lines

lib/

CodeGen/

SelectionDAG/

SelectionDAGBuilder.cpp

6 lines

SelectionDAGDumper.cpp

3 lines

IR/

Verifier.cpp

6 lines

unittests/

IR/

IRBuilderTest.cpp

5 lines

Diff 261789

llvm/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 18,245 Lines • ▼ Show 20 Lines

	Semantics:			Semantics:
	""""""""""			""""""""""

	This function returns the same values as the libm ``trunc`` functions			This function returns the same values as the libm ``trunc`` functions
	would and handles error conditions in the same way.			would and handles error conditions in the same way.


				Floating Point Environment Manipulation intrinsics
				--------------------------------------------------

				These functions read or write floating point environment, such as rounding
				RKSimonUnsubmitted Not Done Reply Inline Actions Remove "read or " ? RKSimon: Remove "read or " ?
				sepavloffAuthorUnsubmitted Done Reply Inline Actions This is a section for group of intrinsics. Now there is only one intrinsic in it, which indeed only writes FP environment. It makes sense to implement intrinsics fo standard C functions, like `fegetmode`, `fetestexcept` and others. Actually the intrinsic `flt_rounds` may be documented here. I will add documentation for it. sepavloff: This is a section for group of intrinsics. Now there is only one intrinsic in it, which indeed…
				sepavloffAuthorUnsubmitted Done Reply Inline Actions Added `flt_rounds` in D79322. sepavloff: Added `flt_rounds` in D79322.
				mode or state of floating point exceptions. Altering the floating point
				environment requires special care. See :ref:`Floating Point Environment <floatenv>`.

				'``llvm.set.rounding``' Intrinsic
				^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

				Syntax:
				"""""""

				::

				declare void @llvm.set.rounding(i32 <val>)

				Overview:
				"""""""""

				The '``llvm.set.rounding``' intrinsic sets current rounding mode.

				Arguments:
				""""""""""

				The argument is the required rounding mode. Encoding of rounding mode is
				compatible with the values returned by ``FLT_ROUNDS``:

				::

				0 - toward zero
				1 - to nearest, ties to even
				2 - toward positive infinity
				3 - toward negative infinity
				arsenmUnsubmitted Not Done Reply Inline Actions I'm wondering if this should be more opaque, and broader for the entire FP environment (not just the rounding mode). For AMDGPU we have a number of additional bits in the FP environment. We also have the denormal mode, enabling FP exceptions, and a few more exotic target specific FP mode bits. arsenm: I'm wondering if this should be more opaque, and broader for the entire FP environment (not…
				RKSimonUnsubmitted Not Done Reply Inline Actions @arsenm Would these extra bits be exclusive modes or would you need this to support target specific mode combos? RKSimon: @arsenm Would these extra bits be exclusive modes or would you need this to support target…
				sepavloffAuthorUnsubmitted Done Reply Inline Actions I'm wondering if this should be more opaque, and broader for the entire FP environment (not just the rounding mode). C library defined function `fesetmode`, which sets all control modes, not just rounding. It make sense to introduce intrinsic for it, which would serve these purposes. sepavloff: > I'm wondering if this should be more opaque, and broader for the entire FP environment (not…
				arsenmUnsubmitted Not Done Reply Inline Actions We have the rounding mode controls as presented here, however they are broken down by FP type. We can separately set the rounding mode for f32 and f64/f16, so there are two different settings. We also have the denormal mode, for inputs and outputs, also broken down by type in the same way. The denormal handling and per-type handling I think deserve consideration here We have 2 additional target specific FP bits nothing else would need to really think about, but it would be nice if you could set the exact mode you want in a single intrinsic call. I'm less interested in these though arsenm: We have the rounding mode controls as presented here, however they are broken down by FP type.
				arsenmUnsubmitted Not Done Reply Inline Actions Oh, we also have a bit to turn on/off fp exceptions which is probably generally interesting arsenm: Oh, we also have a bit to turn on/off fp exceptions which is probably generally interesting
				arsenmUnsubmitted Not Done Reply Inline Actions And by a bit, I mean a mask of bits for different FP exception types arsenm: And by a bit, I mean a mask of bits for different FP exception types
				4 - to nearest, ties away from zero

				Semantics:
				""""""""""

				The '``llvm.set.rounding``' intrinsic sets the current rounding mode. It is
				similar to C library function 'fesetround', however this intrinsic does not
				return any value and uses platform-independent representation of rounding modes.


	General Intrinsics			General Intrinsics
	------------------			------------------

	This class of intrinsics is designed to be generic and has no specific			This class of intrinsics is designed to be generic and has no specific
	purpose.			purpose.

	'``llvm.var.annotation``' Intrinsic			'``llvm.var.annotation``' Intrinsic
	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
	▲ Show 20 Lines • Show All 1,549 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/ISDOpcodes.h

//===-- llvm/CodeGen/ISDOpcodes.h - CodeGen opcodes -------------- C++ --===//		//===-- llvm/CodeGen/ISDOpcodes.h - CodeGen opcodes -------------- C++ --===//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
Show All 23 Lines	namespace ISD {
/// instruction sets as much as possible, and only use target-dependent		/// instruction sets as much as possible, and only use target-dependent
/// operators when they have special requirements.		/// operators when they have special requirements.
///		///
/// Finally, during and after selection proper, SNodes may use special		/// Finally, during and after selection proper, SNodes may use special
/// operator codes that correspond directly with MachineInstr opcodes. These		/// operator codes that correspond directly with MachineInstr opcodes. These
/// are used to represent selected instructions. See the isMachineOpcode()		/// are used to represent selected instructions. See the isMachineOpcode()
/// and getMachineOpcode() member functions of SDNode.		/// and getMachineOpcode() member functions of SDNode.
///		///
enum NodeType {		enum NodeType {
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - enum NodeType { - /// DELETED_NODE - This is an illegal value that is used to catch - /// errors. This opcode is not a legal opcode for any node. - DELETED_NODE, - - /// EntryToken - This is the marker used to indicate the start of a region. - EntryToken, - - /// TokenFactor - This node takes multiple tokens as input and produces a - /// single token result. This is used to represent the fact that the operand 1295 diff lines are omitted. See full diff. Lint: Pre-merge checks: clang-format: please reformat the code ``` - enum NodeType { - /// DELETED_NODE - This is…
/// DELETED_NODE - This is an illegal value that is used to catch		/// DELETED_NODE - This is an illegal value that is used to catch
/// errors. This opcode is not a legal opcode for any node.		/// errors. This opcode is not a legal opcode for any node.
DELETED_NODE,		DELETED_NODE,

/// EntryToken - This is the marker used to indicate the start of a region.		/// EntryToken - This is the marker used to indicate the start of a region.
EntryToken,		EntryToken,

/// TokenFactor - This node takes multiple tokens as input and produces a		/// TokenFactor - This node takes multiple tokens as input and produces a
▲ Show 20 Lines • Show All 623 Lines • ▼ Show 20 Lines	enum NodeType {
/// FMINIMUM/FMAXIMUM - NaN-propagating minimum/maximum that also treat -0.0		/// FMINIMUM/FMAXIMUM - NaN-propagating minimum/maximum that also treat -0.0
/// as less than 0.0. While FMINNUM_IEEE/FMAXNUM_IEEE follow IEEE 754-2008		/// as less than 0.0. While FMINNUM_IEEE/FMAXNUM_IEEE follow IEEE 754-2008
/// semantics, FMINIMUM/FMAXIMUM follow IEEE 754-2018 draft semantics.		/// semantics, FMINIMUM/FMAXIMUM follow IEEE 754-2018 draft semantics.
FMINIMUM, FMAXIMUM,		FMINIMUM, FMAXIMUM,

/// FSINCOS - Compute both fsin and fcos as a single operation.		/// FSINCOS - Compute both fsin and fcos as a single operation.
FSINCOS,		FSINCOS,

		/// Set rounding mode.
		/// The first operand is a chain pointer. The second specifies the required
		/// rounding mode, encoded in the same way as in the intrinsic
		/// 'set_rounding'.
		SET_ROUNDING,

/// LOAD and STORE have token chains as their first operand, then the same		/// LOAD and STORE have token chains as their first operand, then the same
/// operands as an LLVM load/store instruction, then an offset node that		/// operands as an LLVM load/store instruction, then an offset node that
/// is added / subtracted from the base pointer to form the address (for		/// is added / subtracted from the base pointer to form the address (for
/// indexed memory ops).		/// indexed memory ops).
LOAD, STORE,		LOAD, STORE,

/// DYNAMIC_STACKALLOC - Allocate some number of bytes on the stack aligned		/// DYNAMIC_STACKALLOC - Allocate some number of bytes on the stack aligned
/// to a specified boundary. This node always has two return values: a new		/// to a specified boundary. This node always has two return values: a new
▲ Show 20 Lines • Show All 279 Lines • ▼ Show 20 Lines	namespace ISD {
/// this value. Those that do must not be less than this value, and can		/// this value. Those that do must not be less than this value, and can
/// be used with SelectionDAG::getMemIntrinsicNode.		/// be used with SelectionDAG::getMemIntrinsicNode.
static const int FIRST_TARGET_MEMORY_OPCODE = BUILTIN_OP_END+500;		static const int FIRST_TARGET_MEMORY_OPCODE = BUILTIN_OP_END+500;

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
/// MemIndexedMode enum - This enum defines the load / store indexed		/// MemIndexedMode enum - This enum defines the load / store indexed
/// addressing modes.		/// addressing modes.
///		///
/// UNINDEXED "Normal" load / store. The effective address is already		/// UNINDEXED "Normal" load / store. The effective address is already
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - /// UNINDEXED "Normal" load / store. The effective address is already - /// computed and is available in the base pointer. The offset - /// operand is always undefined. In addition to producing a - /// chain, an unindexed load produces one value (result of the - /// load); an unindexed store does not produce a value. + /// The TRUNC = 1 case is used in cases where we know that the value will + /// not be modified by the node, because Y is not using any of the extra + /// precision of source type. This allows certain transformations like + /// STRICT_FP_EXTEND(STRICT_FP_ROUND(X,1)) -> X which are not safe for + /// STRICT_FP_EXTEND(STRICT_FP_ROUND(X,0)) because the extra bits aren't 134 diff lines are omitted. See full diff. Lint: Pre-merge checks: clang-format: please reformat the code ``` - /// UNINDEXED "Normal" load / store. The…
/// computed and is available in the base pointer. The offset		/// computed and is available in the base pointer. The offset
/// operand is always undefined. In addition to producing a		/// operand is always undefined. In addition to producing a
/// chain, an unindexed load produces one value (result of the		/// chain, an unindexed load produces one value (result of the
/// load); an unindexed store does not produce a value.		/// load); an unindexed store does not produce a value.
///		///
/// PRE_INC Similar to the unindexed mode where the effective address is		/// PRE_INC Similar to the unindexed mode where the effective address is
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - /// PRE_INC Similar to the unindexed mode where the effective address is - /// PRE_DEC the value of the base pointer add / subtract the offset. - /// It considers the computation as being folded into the load / - /// store operation (i.e. the load / store does the address - /// computation as well as performing the memory transaction). - /// The base operand is always undefined. In addition to - /// producing a chain, pre-indexed load produces two values - /// (result of the load and the result of the address - /// computation); a pre-indexed store produces one value (result - /// of the address computation). 138 diff lines are omitted. See full diff. Lint: Pre-merge checks: clang-format: please reformat the code ``` - /// PRE_INC Similar to the unindexed mode…
/// PRE_DEC the value of the base pointer add / subtract the offset.		/// PRE_DEC the value of the base pointer add / subtract the offset.
/// It considers the computation as being folded into the load /		/// It considers the computation as being folded into the load /
/// store operation (i.e. the load / store does the address		/// store operation (i.e. the load / store does the address
/// computation as well as performing the memory transaction).		/// computation as well as performing the memory transaction).
/// The base operand is always undefined. In addition to		/// The base operand is always undefined. In addition to
/// producing a chain, pre-indexed load produces two values		/// producing a chain, pre-indexed load produces two values
/// (result of the load and the result of the address		/// (result of the load and the result of the address
/// computation); a pre-indexed store produces one value (result		/// computation); a pre-indexed store produces one value (result
/// of the address computation).		/// of the address computation).
///		///
/// POST_INC The effective address is the value of the base pointer. The		/// POST_INC The effective address is the value of the base pointer. The
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - /// POST_INC The effective address is the value of the base pointer. The - /// POST_DEC value of the offset operand is then added to / subtracted - /// from the base after memory transaction. In addition to - /// producing a chain, post-indexed load produces two values - /// (the result of the load and the result of the base +/- offset - /// computation); a post-indexed store produces one value (the - /// the result of the base +/- offset computation). - enum MemIndexedMode { - UNINDEXED = 0, - PRE_INC, 36 diff lines are omitted. See full diff. Lint: Pre-merge checks: clang-format: please reformat the code ``` - /// POST_INC The effective address is the…
/// POST_DEC value of the offset operand is then added to / subtracted		/// POST_DEC value of the offset operand is then added to / subtracted
/// from the base after memory transaction. In addition to		/// from the base after memory transaction. In addition to
/// producing a chain, post-indexed load produces two values		/// producing a chain, post-indexed load produces two values
/// (the result of the load and the result of the base +/- offset		/// (the result of the load and the result of the base +/- offset
/// computation); a post-indexed store produces one value (the		/// computation); a post-indexed store produces one value (the
/// the result of the base +/- offset computation).		/// the result of the base +/- offset computation).
enum MemIndexedMode {		enum MemIndexedMode {
UNINDEXED = 0,		UNINDEXED = 0,
PRE_INC,		PRE_INC,
PRE_DEC,		PRE_DEC,
POST_INC,		POST_INC,
POST_DEC		POST_DEC
};		};

static const int LAST_INDEXED_MODE = POST_DEC + 1;		static const int LAST_INDEXED_MODE = POST_DEC + 1;

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
/// MemIndexType enum - This enum defines how to interpret MGATHER/SCATTER's		/// MemIndexType enum - This enum defines how to interpret MGATHER/SCATTER's
/// index parameter when calculating addresses.		/// index parameter when calculating addresses.
///		///
/// SIGNED_SCALED Addr = Base + ((signed)Index * sizeof(element))		/// SIGNED_SCALED Addr = Base + ((signed)Index * sizeof(element))
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - /// SIGNED_SCALED Addr = Base + ((signed)Index * sizeof(element)) - /// SIGNED_UNSCALED Addr = Base + (signed)Index - /// UNSIGNED_SCALED Addr = Base + ((unsigned)Index * sizeof(element)) - /// UNSIGNED_UNSCALED Addr = Base + (unsigned)Index - enum MemIndexType { - SIGNED_SCALED = 0, - SIGNED_UNSCALED, - UNSIGNED_SCALED, - UNSIGNED_UNSCALED - }; 83 diff lines are omitted. See full diff. Lint: Pre-merge checks: clang-format: please reformat the code ``` - /// SIGNED_SCALED Addr = Base +…
/// SIGNED_UNSCALED Addr = Base + (signed)Index		/// SIGNED_UNSCALED Addr = Base + (signed)Index
/// UNSIGNED_SCALED Addr = Base + ((unsigned)Index * sizeof(element))		/// UNSIGNED_SCALED Addr = Base + ((unsigned)Index * sizeof(element))
/// UNSIGNED_UNSCALED Addr = Base + (unsigned)Index		/// UNSIGNED_UNSCALED Addr = Base + (unsigned)Index
		RKSimonUnsubmitted Not Done Reply Inline Actions Sorry for the bikeshedding - but if SET_ROUNDING is supposed to match FLT_ROUNDS - shouldn't it have a more similar name? RKSimon: Sorry for the bikeshedding - but if SET_ROUNDING is supposed to match FLT_ROUNDS - shouldn't it…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions It is `FLT_ROUNDS_` that has "wrong" name. It is named after the macro `FLT_ROUNDS`, which is defined by C99. To get better names `FLT_ROUNDS_` must be renamed not `SET_ROUNDING`. sepavloff: It is `FLT_ROUNDS_` that has "wrong" name. It is named after the macro `FLT_ROUNDS`, which is…
		RKSimonUnsubmitted Not Done Reply Inline Actions OK - add a TODO comment by FLT_ROUNDS_ then? RKSimon: OK - add a TODO comment by FLT_ROUNDS_ then?
		sepavloffAuthorUnsubmitted Done Reply Inline Actions Done. sepavloff: Done.
enum MemIndexType {		enum MemIndexType {
SIGNED_SCALED = 0,		SIGNED_SCALED = 0,
SIGNED_UNSCALED,		SIGNED_UNSCALED,
UNSIGNED_SCALED,		UNSIGNED_SCALED,
UNSIGNED_UNSCALED		UNSIGNED_UNSCALED
};		};

static const int LAST_MEM_INDEX_TYPE = UNSIGNED_UNSCALED + 1;		static const int LAST_MEM_INDEX_TYPE = UNSIGNED_UNSCALED + 1;
Show All 22 Lines
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
/// ISD::CondCode enum - These are ordered carefully to make the bitfields		/// ISD::CondCode enum - These are ordered carefully to make the bitfields
/// below work out, when considering SETFALSE (something that never exists		/// below work out, when considering SETFALSE (something that never exists
/// dynamically) as 0. "U" -> Unsigned (for integer operands) or Unordered		/// dynamically) as 0. "U" -> Unsigned (for integer operands) or Unordered
/// (for floating point), "L" -> Less than, "G" -> Greater than, "E" -> Equal		/// (for floating point), "L" -> Less than, "G" -> Greater than, "E" -> Equal
/// to. If the "N" column is 1, the result of the comparison is undefined if		/// to. If the "N" column is 1, the result of the comparison is undefined if
/// the input is a NAN.		/// the input is a NAN.
///		///
/// All of these (except for the 'always folded ops') should be handled for		/// All of these (except for the 'always folded ops') should be handled for
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - /// All of these (except for the 'always folded ops') should be handled for - /// floating point. For integer, only the SETEQ,SETNE,SETLT,SETLE,SETGT, - /// SETGE,SETULT,SETULE,SETUGT, and SETUGE opcodes are used. + /// The return value of (FMINNUM 0.0, -0.0) could be either 0.0 or -0.0. + FMINNUM, + FMAXNUM, + + /// FMINNUM_IEEE/FMAXNUM_IEEE - Perform floating-point minimum or maximum on + /// two values, following the IEEE-754 2008 definition. This differs from + /// FMINNUM/FMAXNUM in the handling of signaling NaNs. If one input is a 73 diff lines are omitted. See full diff. Lint: Pre-merge checks: clang-format: please reformat the code ``` - /// All of these (except for the 'always folded…
/// floating point. For integer, only the SETEQ,SETNE,SETLT,SETLE,SETGT,		/// floating point. For integer, only the SETEQ,SETNE,SETLT,SETLE,SETGT,
/// SETGE,SETULT,SETULE,SETUGT, and SETUGE opcodes are used.		/// SETGE,SETULT,SETULE,SETUGT, and SETUGE opcodes are used.
///		///
/// Note that these are laid out in a specific order to allow bit-twiddling		/// Note that these are laid out in a specific order to allow bit-twiddling
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - /// Note that these are laid out in a specific order to allow bit-twiddling - /// to transform conditions. - enum CondCode { - // Opcode N U L G E Intuitive operation - SETFALSE, // 0 0 0 0 Always false (always folded) - SETOEQ, // 0 0 0 1 True if ordered and equal - SETOGT, // 0 0 1 0 True if ordered and greater than - SETOGE, // 0 0 1 1 True if ordered and greater than or equal - SETOLT, // 0 1 0 0 True if ordered and less than - SETOLE, // 0 1 0 1 True if ordered and less than or equal 391 diff lines are omitted. See full diff. Lint: Pre-merge checks: clang-format: please reformat the code ``` - /// Note that these are laid out in a specific…
/// to transform conditions.		/// to transform conditions.
enum CondCode {		enum CondCode {
// Opcode N U L G E Intuitive operation		// Opcode N U L G E Intuitive operation
SETFALSE, // 0 0 0 0 Always false (always folded)		SETFALSE, // 0 0 0 0 Always false (always folded)
SETOEQ, // 0 0 0 1 True if ordered and equal		SETOEQ, // 0 0 0 1 True if ordered and equal
SETOGT, // 0 0 1 0 True if ordered and greater than		SETOGT, // 0 0 1 0 True if ordered and greater than
SETOGE, // 0 0 1 1 True if ordered and greater than or equal		SETOGE, // 0 0 1 1 True if ordered and greater than or equal
SETOLT, // 0 1 0 0 True if ordered and less than		SETOLT, // 0 1 0 0 True if ordered and less than
▲ Show 20 Lines • Show All 83 Lines • Show Last 20 Lines

llvm/include/llvm/IR/IRBuilder.h

//===- llvm/IRBuilder.h - Builder for LLVM Instructions ---------- C++ --===//		//===- llvm/IRBuilder.h - Builder for LLVM Instructions ---------- C++ --===//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
▲ Show 20 Lines • Show All 864 Lines • ▼ Show 20 Lines	CallInst CreateMinimum(Value LHS, Value *RHS, const Twine &Name = "") {
return CreateBinaryIntrinsic(Intrinsic::minimum, LHS, RHS, nullptr, Name);		return CreateBinaryIntrinsic(Intrinsic::minimum, LHS, RHS, nullptr, Name);
}		}

/// Create call to the maximum intrinsic.		/// Create call to the maximum intrinsic.
CallInst CreateMaximum(Value LHS, Value *RHS, const Twine &Name = "") {		CallInst CreateMaximum(Value LHS, Value *RHS, const Twine &Name = "") {
return CreateBinaryIntrinsic(Intrinsic::maximum, LHS, RHS, nullptr, Name);		return CreateBinaryIntrinsic(Intrinsic::maximum, LHS, RHS, nullptr, Name);
}		}

		/// Create call to the set_rounding intrinsic.
		craig.topperUnsubmitted Not Done Reply Inline Actions Do we need this? I don't think we provide IRBuilder for all intrinsics. Just common ones. craig.topper: Do we need this? I don't think we provide IRBuilder for all intrinsics. Just common ones.
		CallInst CreateSetRounding(Value RM, const Twine &Name = "") {
		return CreateIntrinsic(Intrinsic::set_rounding, {}, { RM }, nullptr, Name);
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - return CreateIntrinsic(Intrinsic::set_rounding, {}, { RM }, nullptr, Name); + return CreateIntrinsic(Intrinsic::set_rounding, {}, {RM}, nullptr, Name); Lint: Pre-merge checks: clang-format: please reformat the code ``` - return CreateIntrinsic(Intrinsic::set_rounding…
		}

private:		private:
/// Create a call to a masked intrinsic with given Id.		/// Create a call to a masked intrinsic with given Id.
CallInst CreateMaskedIntrinsic(Intrinsic::ID Id, ArrayRef<Value > Ops,		CallInst CreateMaskedIntrinsic(Intrinsic::ID Id, ArrayRef<Value > Ops,
ArrayRef<Type *> OverloadedTypes,		ArrayRef<Type *> OverloadedTypes,
const Twine &Name = "");		const Twine &Name = "");

Value getCastedInt8PtrValue(Value Ptr);		Value getCastedInt8PtrValue(Value Ptr);

▲ Show 20 Lines • Show All 1,715 Lines • Show Last 20 Lines

llvm/include/llvm/IR/Intrinsics.td

	Show First 20 Lines • Show All 605 Lines • ▼ Show 20 Lines

	// Internal interface for object size checking			// Internal interface for object size checking
	def int_objectsize : Intrinsic<[llvm_anyint_ty],			def int_objectsize : Intrinsic<[llvm_anyint_ty],
	[llvm_anyptr_ty, llvm_i1_ty,			[llvm_anyptr_ty, llvm_i1_ty,
	llvm_i1_ty, llvm_i1_ty],			llvm_i1_ty, llvm_i1_ty],
	[IntrNoMem, IntrSpeculatable, IntrWillReturn, ImmArg<1>, ImmArg<2>, ImmArg<3>]>,			[IntrNoMem, IntrSpeculatable, IntrWillReturn, ImmArg<1>, ImmArg<2>, ImmArg<3>]>,
	GCCBuiltin<"__builtin_object_size">;			GCCBuiltin<"__builtin_object_size">;

				def int_set_rounding : Intrinsic<[],
				[llvm_i32_ty],
				[IntrInaccessibleMemOnly, IntrWillReturn]>;

	//===--------------- Constrained Floating Point Intrinsics ----------------===//			//===--------------- Constrained Floating Point Intrinsics ----------------===//
	//			//

	let IntrProperties = [IntrInaccessibleMemOnly, IntrWillReturn] in {			let IntrProperties = [IntrInaccessibleMemOnly, IntrWillReturn] in {
	def int_experimental_constrained_fadd : Intrinsic<[ llvm_anyfloat_ty ],			def int_experimental_constrained_fadd : Intrinsic<[ llvm_anyfloat_ty ],
	[ LLVMMatchType<0>,			[ LLVMMatchType<0>,
	LLVMMatchType<0>,			LLVMMatchType<0>,
	llvm_metadata_ty,			llvm_metadata_ty,
	▲ Show 20 Lines • Show All 858 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

//===- SelectionDAGBuilder.cpp - Selection-DAG building -------------------===//		//===- SelectionDAGBuilder.cpp - Selection-DAG building -------------------===//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
▲ Show 20 Lines • Show All 6,133 Lines • ▼ Show 20 Lines	setValue(&I, DAG.getNode(ISD::BITCAST, sdl, MVT::i16,
MVT::i32))));		MVT::i32))));
return;		return;
case Intrinsic::convert_from_fp16:		case Intrinsic::convert_from_fp16:
setValue(&I, DAG.getNode(ISD::FP_EXTEND, sdl,		setValue(&I, DAG.getNode(ISD::FP_EXTEND, sdl,
TLI.getValueType(DAG.getDataLayout(), I.getType()),		TLI.getValueType(DAG.getDataLayout(), I.getType()),
DAG.getNode(ISD::BITCAST, sdl, MVT::f16,		DAG.getNode(ISD::BITCAST, sdl, MVT::f16,
getValue(I.getArgOperand(0)))));		getValue(I.getArgOperand(0)))));
return;		return;
		case Intrinsic::set_rounding:
		Res = DAG.getNode(ISD::SET_ROUNDING, sdl, MVT::Other,
		{ DAG.getRoot(), getValue(I.getArgOperand(0)) });
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - { DAG.getRoot(), getValue(I.getArgOperand(0)) }); + {DAG.getRoot(), getValue(I.getArgOperand(0))}); Lint: Pre-merge checks: clang-format: please reformat the code ``` - { DAG.getRoot(), getValue(I.
		craig.topperUnsubmitted Not Done Reply Inline Actions Don't you need to call getRoot not DAG.getRoot()? craig.topper: Don't you need to call getRoot not DAG.getRoot()?
		sepavloffAuthorUnsubmitted Done Reply Inline Actions Indeed. Thank you! sepavloff: Indeed. Thank you!
		setValue(&I, Res);
		DAG.setRoot(Res.getValue(0));
		return;
case Intrinsic::pcmarker: {		case Intrinsic::pcmarker: {
SDValue Tmp = getValue(I.getArgOperand(0));		SDValue Tmp = getValue(I.getArgOperand(0));
DAG.setRoot(DAG.getNode(ISD::PCMARKER, sdl, MVT::Other, getRoot(), Tmp));		DAG.setRoot(DAG.getNode(ISD::PCMARKER, sdl, MVT::Other, getRoot(), Tmp));
return;		return;
}		}
case Intrinsic::readcyclecounter: {		case Intrinsic::readcyclecounter: {
SDValue Op = getRoot();		SDValue Op = getRoot();
Res = DAG.getNode(ISD::READCYCLECOUNTER, sdl,		Res = DAG.getNode(ISD::READCYCLECOUNTER, sdl,
▲ Show 20 Lines • Show All 4,407 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

//===- SelectionDAGDumper.cpp - Implement SelectionDAG::dump() ------------===//		//===- SelectionDAGDumper.cpp - Implement SelectionDAG::dump() ------------===//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
▲ Show 20 Lines • Show All 380 Lines • ▼ Show 20 Lines	#endif
case ISD::DEBUGTRAP: return "debugtrap";		case ISD::DEBUGTRAP: return "debugtrap";
case ISD::LIFETIME_START: return "lifetime.start";		case ISD::LIFETIME_START: return "lifetime.start";
case ISD::LIFETIME_END: return "lifetime.end";		case ISD::LIFETIME_END: return "lifetime.end";
case ISD::GC_TRANSITION_START: return "gc_transition.start";		case ISD::GC_TRANSITION_START: return "gc_transition.start";
case ISD::GC_TRANSITION_END: return "gc_transition.end";		case ISD::GC_TRANSITION_END: return "gc_transition.end";
case ISD::GET_DYNAMIC_AREA_OFFSET: return "get.dynamic.area.offset";		case ISD::GET_DYNAMIC_AREA_OFFSET: return "get.dynamic.area.offset";
case ISD::FREEZE: return "freeze";		case ISD::FREEZE: return "freeze";

		// Floating point environment manipulation
		case ISD::SET_ROUNDING: return "set_rounding";
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - case ISD::SET_ROUNDING: return "set_rounding"; + case ISD::SET_ROUNDING: + return "set_rounding"; Lint: Pre-merge checks: clang-format: please reformat the code ``` - case ISD::SET_ROUNDING: return…

// Bit manipulation		// Bit manipulation
case ISD::ABS: return "abs";		case ISD::ABS: return "abs";
case ISD::BITREVERSE: return "bitreverse";		case ISD::BITREVERSE: return "bitreverse";
case ISD::BSWAP: return "bswap";		case ISD::BSWAP: return "bswap";
case ISD::CTPOP: return "ctpop";		case ISD::CTPOP: return "ctpop";
case ISD::CTTZ: return "cttz";		case ISD::CTTZ: return "cttz";
case ISD::CTTZ_ZERO_UNDEF: return "cttz_zero_undef";		case ISD::CTTZ_ZERO_UNDEF: return "cttz_zero_undef";
case ISD::CTLZ: return "ctlz";		case ISD::CTLZ: return "ctlz";
▲ Show 20 Lines • Show All 582 Lines • Show Last 20 Lines

llvm/lib/IR/Verifier.cpp

//===-- Verifier.cpp - Implement the Module Verifier -----------------------==//		//===-- Verifier.cpp - Implement the Module Verifier -----------------------==//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
▲ Show 20 Lines • Show All 4,923 Lines • ▼ Show 20 Lines	case Intrinsic::matrix_columnwise_store: {
default:		default:
llvm_unreachable("unexpected intrinsic");		llvm_unreachable("unexpected intrinsic");
}		}
Assert(TypeToCheck->getNumElements() ==		Assert(TypeToCheck->getNumElements() ==
NumRows->getZExtValue() * NumColumns->getZExtValue(),		NumRows->getZExtValue() * NumColumns->getZExtValue(),
"result of a matrix operation does not fit in the returned vector");		"result of a matrix operation does not fit in the returned vector");
break;		break;
}		}
		case Intrinsic::set_rounding: {
		if (auto RM = dyn_cast<ConstantInt>(Call.getArgOperand(0))) {
		craig.topperUnsubmitted Not Done Reply Inline Actions Is the argument intended to always be a constant or we're just verifying it when we can? The latter seems unusual. craig.topper: Is the argument intended to always be a constant or we're just verifying it when we can? The…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions The argument may be a variable. If it is a constant, it must be a valid rounding mode. It is expected to be a value of type `RoundingMode`. Values from 0 to 4 denote IEEE rounding modes, they may be followed by target-specific rounding modes. The argument value must be less than `RoundingMode::Dynamic`, which now if 7. I am hesitating if this code is useful enough, as even for constant argument its validity cannot be verified due to non-IEEE rounding modes. Probably we should remove this check. sepavloff: The argument may be a variable. If it is a constant, it must be a valid rounding mode. It is…
		RKSimonUnsubmitted Not Done Reply Inline Actions I'm OK with this being dropped - @craig.topper @kpn ? RKSimon: I'm OK with this being dropped - @craig.topper @kpn ?
		craig.topperUnsubmitted Not Done Reply Inline Actions I'm fine removing it craig.topper: I'm fine removing it
		Assert(RM->getZExtValue() < static_cast<unsigned>(RoundingMode::Dynamic),
		"invalid value of rounding mode");
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - "invalid value of rounding mode"); + "invalid value of rounding mode"); Lint: Pre-merge checks: clang-format: please reformat the code ``` - "invalid value of rounding mode"); +…
		}
		}
};		};
}		}

/// Carefully grab the subprogram from a local scope.		/// Carefully grab the subprogram from a local scope.
///		///
/// This carefully grabs the subprogram from a local scope, avoiding the		/// This carefully grabs the subprogram from a local scope, avoiding the
/// built-in assertions that would typically fire.		/// built-in assertions that would typically fire.
static DISubprogram getSubprogram(Metadata LocalScope) {		static DISubprogram getSubprogram(Metadata LocalScope) {
▲ Show 20 Lines • Show All 806 Lines • Show Last 20 Lines

llvm/unittests/IR/IRBuilderTest.cpp

//===- llvm/unittest/IR/IRBuilderTest.cpp - IRBuilder tests ---------------===//		//===- llvm/unittest/IR/IRBuilderTest.cpp - IRBuilder tests ---------------===//
		Lint: Lint Inline Actions clang-format not found in user's PATH; not linting file. Lint: Lint: clang-format not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

▲ Show 20 Lines • Show All 107 Lines • ▼ Show 20 Lines	TEST_F(IRBuilderTest, Intrinsics) {
EXPECT_TRUE(II->hasNoInfs());		EXPECT_TRUE(II->hasNoInfs());
EXPECT_FALSE(II->hasNoNaNs());		EXPECT_FALSE(II->hasNoNaNs());

Call = Builder.CreateIntrinsic(Intrinsic::fma, {V->getType()}, {V, V, V}, I);		Call = Builder.CreateIntrinsic(Intrinsic::fma, {V->getType()}, {V, V, V}, I);
II = cast<IntrinsicInst>(Call);		II = cast<IntrinsicInst>(Call);
EXPECT_EQ(II->getIntrinsicID(), Intrinsic::fma);		EXPECT_EQ(II->getIntrinsicID(), Intrinsic::fma);
EXPECT_TRUE(II->hasNoInfs());		EXPECT_TRUE(II->hasNoInfs());
EXPECT_FALSE(II->hasNoNaNs());		EXPECT_FALSE(II->hasNoNaNs());

		Call = Builder.CreateSetRounding(Builder.getInt32(
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - Call = Builder.CreateSetRounding(Builder.getInt32( - static_cast<uint32_t>(RoundingMode::TowardZero))); + Call = Builder.CreateSetRounding( + Builder.getInt32(static_cast<uint32_t>(RoundingMode::TowardZero))); Lint: Pre-merge checks: clang-format: please reformat the code ``` - Call = Builder.CreateSetRounding(Builder.getInt32…
		static_cast<uint32_t>(RoundingMode::TowardZero)));
		II = cast<IntrinsicInst>(Call);
		EXPECT_EQ(II->getIntrinsicID(), Intrinsic::set_rounding);
}		}

TEST_F(IRBuilderTest, IntrinsicsWithScalableVectors) {		TEST_F(IRBuilderTest, IntrinsicsWithScalableVectors) {
IRBuilder<> Builder(BB);		IRBuilder<> Builder(BB);
CallInst *Call;		CallInst *Call;
FunctionType *FTy;		FunctionType *FTy;

// Test scalable flag isn't dropped for intrinsic that is explicitly defined		// Test scalable flag isn't dropped for intrinsic that is explicitly defined
▲ Show 20 Lines • Show All 808 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[FPEnv] Intrinsic for setting rounding modeClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 261789

llvm/docs/LangRef.rst

llvm/include/llvm/CodeGen/ISDOpcodes.h

llvm/include/llvm/IR/IRBuilder.h

llvm/include/llvm/IR/Intrinsics.td

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

llvm/lib/IR/Verifier.cpp

llvm/unittests/IR/IRBuilderTest.cpp

[FPEnv] Intrinsic for setting rounding mode
ClosedPublic