This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
docs/
-
LangRef.rst
-
include/llvm/
-
llvm/
-
CodeGen/
-
ISDOpcodes.h
-
IR/
-
Intrinsics.td
-
RuntimeLibcalls.def
-
Target/
-
TargetSelectionDAG.td
-
lib/CodeGen/
-
CodeGen/
-
SelectionDAG/
1/6
LegalizeDAG.cpp
-
SelectionDAGBuilder.cpp
-
SelectionDAGDumper.cpp
-
TargetLoweringBase.cpp
-
test/CodeGen/X86/
-
CodeGen/
-
X86/
-
fpenv.ll

Differential D82525

[FPEnv] Intrinsics for access to FP control modes
ClosedPublic

Authored by sepavloff on Jun 25 2020, 1:30 AM.

Download Raw Diff

Details

Reviewers

andrew.w.kaylor
efriedma
kpn
cameron.mcinally
uweigand
jdoerfert
arsenm

Commits

rG6862f0fab121: [FPEnv] Intrinsics for access to FP control modes

Summary

The change introduces intrinsics 'get_fpmode', 'set_fpmode' and
'reset_fpmode'. They manage all target dynamic floating-point control
modes, which include, for instance, rounding direction, precision,
treatment of denormals and so on. The intrinsics do the same
operations as the C library functions 'fegetmode' and 'fesetmode'. By
default they are lowered to calls to these functions.

Two main use cases are supported by this implementation.

Local modification of the control modes. In this case the code

usually has a pattern (in pseudocode):

saved_modes = get_fpmode()
set_fpmode(<new_modes>)
...
<do operations with new modes>
...
set_fpmode(saved_modes)

In the case when it is known that the current FP environment is default,
the code may be shorter:

set_fpmode(<new_modes>)
...
<do operations with new modes>
...
reset_fpmode()

Such patterns appear not only in user code but also in implementations
of various FP controlling pragmas. In particular, the implementation of
#pragma STDC FENV_ROUND requires similar code if the target does not
support static rounding mode.

Portable control of FP modes. Usually FP control modes are set by

write to some control register. Different targets have different
layout of this register, the way the register is accessed also may be
different. Using set of target-specific definitions for the control
register bits together with these intrinsic functions provides enough
portable way to handle control modes across wide range of hardware.

This change defines only llvm intrinsic function, which implement the
access required for the aforementioned use cases.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sepavloff created this revision.Jun 25 2020, 1:30 AM

Herald added a reviewer: jdoerfert. · View Herald TranscriptJun 25 2020, 1:30 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: jdoerfert, hiraditya. · View Herald Transcript

Harbormaster failed remote builds in B61669: Diff 273245!Jun 25 2020, 2:30 AM

Fixed legalization of SET_FPMODE

sepavloff added a child revision: D83036: [X86][FPEnv] Lowering of {get,set,reset}_fpmode.Jul 2 2020, 4:30 AM

sepavloff added a reviewer: arsenm.Jul 2 2020, 4:33 AM

Herald added a subscriber: wdng. · View Herald TranscriptJul 2 2020, 4:33 AM

sepavloff mentioned this in D74729: [FPEnv] Intrinsic for setting rounding mode.Jul 2 2020, 4:47 AM

Harbormaster failed remote builds in B62654: Diff 275055!Jul 2 2020, 5:20 AM

Missed change

Harbormaster failed remote builds in B62837: Diff 275373!Jul 3 2020, 6:58 AM

Rebased patch

Harbormaster completed remote builds in B66335: Diff 281827.Jul 30 2020, 2:44 AM

Ping.

Rebased patch

Harbormaster completed remote builds in B68111: Diff 285081.Aug 12 2020, 8:30 AM

Rebased patch

Harbormaster completed remote builds in B70249: Diff 289188.Sep 1 2020, 9:36 AM

Get rid of clang-tidy warnings

Harbormaster completed remote builds in B70349: Diff 289360.Sep 2 2020, 1:05 AM

Ping.

Rebased and simplified a bit.

sepavloff edited the summary of this revision. (Show Details)Mar 3 2021, 5:05 AM

sepavloff edited the summary of this revision. (Show Details)

From langref it isn't obvious if the following transform is valid or not

%z = fadd_strict %x, %y
call @llvm.set.fpmode.i16(i16 %fpenv)
  =>
call @llvm.set.fpmode.i16(i16 %fpenv)
%z = fadd_strict %x, %y

Harbormaster completed remote builds in B91789: Diff 327751.Mar 3 2021, 8:46 AM

craig.topper added a subscriber: craig.topper.Mar 3 2021, 9:01 AM

craig.topper added inline comments.

llvm/test/CodeGen/Generic/fpenv.ll
35 ↗	(On Diff #327751)	Is this missing the instructions that copy %fpenv into the stack temporary?

Is there any guarantee that femode_t will be the same layout for a given target in different C library implementations?

Extended documentation, fixed chain treatment.

In D82525#2599988, @lebedev.ri wrote:
From langref it isn't obvious if the following transform is valid or not
%z = fadd_strict %x, %y
call @llvm.set.fpmode.i16(i16 %fpenv)
  =>
call @llvm.set.fpmode.i16(i16 %fpenv)
%z = fadd_strict %x, %y

Short mention about function ordering is added to the paragraph "Floating Point Environment Manipulation intrinsics".

In D82525#2600598, @craig.topper wrote:

Is there any guarantee that femode_t will be the same layout for a given target in different C library implementations?

Strictly speaking there is no such guarantee. However the obvious implementation of femode_t is the type used to store content of FP control register. Most of 16 targets supported by glibc use unsigned int as femode_t. Exceptions are alpha, ia64, sparc (unsigned long) and powerpc (double). In these cases femode_t is identical to fenv_t.

llvm/test/CodeGen/Generic/fpenv.ll
35 ↗	(On Diff #327751)	Indeed, due to incorrect chain argument supplied to the library function call, the store to stack disappeared. Thank you for the catch!

Harbormaster completed remote builds in B91979: Diff 328037.Mar 4 2021, 7:24 AM

In D82525#2603275, @sepavloff wrote:
In D82525#2599988, @lebedev.ri wrote:
From langref it isn't obvious if the following transform is valid or not
%z = fadd_strict %x, %y
call @llvm.set.fpmode.i16(i16 %fpenv)
  =>
call @llvm.set.fpmode.i16(i16 %fpenv)
%z = fadd_strict %x, %y
Short mention about function ordering is added to the paragraph "Floating Point Environment Manipulation intrinsics".

In D82525#2600598, @craig.topper wrote:

Is there any guarantee that femode_t will be the same layout for a given target in different C library implementations?

Strictly speaking there is no such guarantee. However the obvious implementation of femode_t is the type used to store content of FP control register. Most of 16 targets supported by glibc use unsigned int as femode_t. Exceptions are alpha, ia64, sparc (unsigned long) and powerpc (double). In these cases femode_t is identical to fenv_t.

Isn't X86 using this struct which is 8 bytes?

typedef struct
  {
    unsigned short int __control_word;
    unsigned short int __glibc_reserved;
    unsigned int __mxcsr;
  }
femode_t;

In D82525#2603611, @craig.topper wrote:
In D82525#2603275, @sepavloff wrote:

In D82525#2600598, @craig.topper wrote:

Is there any guarantee that femode_t will be the same layout for a given target in different C library implementations?

Strictly speaking there is no such guarantee. However the obvious implementation of femode_t is the type used to store content of FP control register. Most of 16 targets supported by glibc use unsigned int as femode_t. Exceptions are alpha, ia64, sparc (unsigned long) and powerpc (double). In these cases femode_t is identical to fenv_t.

Isn't X86 using this struct which is 8 bytes?
typedef struct
  {
    unsigned short int __control_word;
    unsigned short int __glibc_reserved;
    unsigned int __mxcsr;
  }
femode_t;

Sure. I forget to mention x86.

There are cases when size of fenv_t differs in different libraries. ARM uses unsigned int in glibc but unsigned long in musl.

In D82525#2605358, @sepavloff wrote:
In D82525#2603611, @craig.topper wrote:
In D82525#2603275, @sepavloff wrote:

In D82525#2600598, @craig.topper wrote:

Is there any guarantee that femode_t will be the same layout for a given target in different C library implementations?

Strictly speaking there is no such guarantee. However the obvious implementation of femode_t is the type used to store content of FP control register. Most of 16 targets supported by glibc use unsigned int as femode_t. Exceptions are alpha, ia64, sparc (unsigned long) and powerpc (double). In these cases femode_t is identical to fenv_t.

Isn't X86 using this struct which is 8 bytes?
typedef struct
  {
    unsigned short int __control_word;
    unsigned short int __glibc_reserved;
    unsigned int __mxcsr;
  }
femode_t;
Sure. I forget to mention x86.

There are cases when size of fenv_t differs in different libraries. ARM uses unsigned int in glibc but unsigned long in musl.

Is unsigned long 32-bits in this case?

In D82525#2605442, @craig.topper wrote:

In D82525#2605358, @sepavloff wrote:

In D82525#2603611, @craig.topper wrote:

In D82525#2603275, @sepavloff wrote:

In D82525#2600598, @craig.topper wrote:

Is there any guarantee that femode_t will be the same layout for a given target in different C library implementations?

There are cases when size of fenv_t differs in different libraries. ARM uses unsigned int in glibc but unsigned long in musl.

Is unsigned long 32-bits in this case?

Yes, ARM gcc 10.2(linux) generates 4 for sizeof(unsigned long).

qiucf added a subscriber: qiucf.Mar 15 2021, 1:27 AM

Added helper functions

These are methods of IRBuilder: createGetFPMode, which get size of fp modes from
DataLayout, createSetFPMode and createResetFPMode.

sepavloff added a parent revision: D71741: Add size of FP environment to DataLayout.Mar 17 2021, 9:39 AM

Harbormaster completed remote builds in B94262: Diff 331293.Mar 17 2021, 9:42 AM

Rebased patch

Harbormaster completed remote builds in B96274: Diff 334095.Mar 30 2021, 4:26 AM

Ping.

sepavloff mentioned this in D99083: [RISCV] Introduce floating point control and state registers.Apr 7 2021, 8:47 PM

Any feedback?

Ping.

sepavloff edited the summary of this revision. (Show Details)Nov 25 2021, 8:50 AM

Updated patch

Rebased.
Get rid of using DataLayout to determine the size of control modes. It limits the usage of the intrinsics to some extent, because an IR transformation that would create a call to llvm.get.fpmode or llvm.set.fpmode must somehow know the size for current target. But for the main use cases it should be enough, only TargetInfo needs to be extended so that clang could know the size.
The test that checks default lowering was rewritten using soft-float option.

Harbormaster completed remote builds in B136086: Diff 389817.Nov 25 2021, 9:05 AM

sepavloff removed a parent revision: D71741: Add size of FP environment to DataLayout.Nov 25 2021, 9:11 AM

Can this be abandoned with get_fpenv / set_fpenv already pushed?

This revision now requires changes to proceed.Jun 15 2023, 9:36 AM

Herald added a project: Restricted Project. · View Herald TranscriptJun 15 2023, 9:36 AM

Rebased patch

Herald added a subscriber: pengfei. · View Herald TranscriptJul 9 2023, 10:20 PM

Harbormaster completed remote builds in B244046: Diff 538506.Jul 9 2023, 11:37 PM

In D82525#4425234, @arsenm wrote:

Can this be abandoned with get_fpenv / set_fpenv already pushed?

These are different intrinsics, and they correspond to different C functions. Control modes are a part of FP environment, but the latter contains also status bits. If a target has single FP status and control register, both fegetenv and fegetmode are implemented as reading from that register in glibc, but in fegetmode bits not related to control modes are cleared. If a target have different status and control registers, or FP environment is large in size (as on X86), get_fpmode/set_fpmode can be have faster implementation than get_fpenv/set_fpenv.

I just realized how much I had confused this with llvm.get.fpenv. I think I implemented this one as llvm.get.fpenv in D152710. Can you clarify in the LangRef for get/set fpenv what the "current floating-point environment" means? I was interpreting this as the mode register, not including the dynamic exception mask

arsenm mentioned this in D152710: AMDGPU: Implement llvm.get.fpmode.Jul 11 2023, 12:31 PM

Could you also handle GlobalISel?

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
4858–4860	You don't need to bother with this typed pointer dance anymore

arsenm requested changes to this revision.Aug 17 2023, 3:45 PM

arsenm added inline comments.

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
4834	This should be getStack, not getFixedStack
4848	This should be getStack, not getFixedStack

This revision now requires changes to proceed.Aug 17 2023, 3:45 PM

sepavloff added inline comments.Aug 18 2023, 12:00 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
4848	Could you please explain a bit, why it should be `getStack`? It is not used anywhere in this file. Function `PerformInsertVectorEltInMemory` uses `getFixedStack` for temporary space, what is the difference between it and these cases?

Rebase, define custom SDNodes for the intrinsics and use opaque pointer

Harbormaster completed remote builds in B253411: Diff 551402.Aug 18 2023, 12:58 AM

arsenm requested changes to this revision.Aug 18 2023, 5:48 AM

arsenm added inline comments.

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
4848	Nevermind this is correct. We have confusing overlapping terminology. MachineFrameInfo refers to fixed stack objects as frame indexes as known fixed offsets for incoming stack arguments but apparently MachineMemOperand uses getFixedStack for any FrameIndex reference and getStack for offsets from SP
4858–4860	Don't think the alloca address space is correct for this. I guess use the default? Also just use TLI.getPointerTy, you don't need to go through the IR type

This revision now requires changes to proceed.Aug 18 2023, 5:48 AM

Address reviewer's notes

LGTM

This revision is now accepted and ready to land.Aug 18 2023, 10:46 AM

Harbormaster completed remote builds in B253530: Diff 551564.Aug 18 2023, 12:46 PM

This revision was landed with ongoing or failed builds.Aug 24 2023, 1:53 AM

Closed by commit rG6862f0fab121: [FPEnv] Intrinsics for access to FP control modes (authored by sepavloff). · Explain Why

This revision was automatically updated to reflect the committed changes.

sepavloff added a commit: rG6862f0fab121: [FPEnv] Intrinsics for access to FP control modes.

Revision Contents

Path

Size

llvm/

docs/

LangRef.rst

95 lines

include/

llvm/

CodeGen/

ISDOpcodes.h

13 lines

IR/

Intrinsics.td

3 lines

RuntimeLibcalls.def

4 lines

Target/

TargetSelectionDAG.td

10 lines

lib/

CodeGen/

SelectionDAG/

LegalizeDAG.cpp

41 lines

SelectionDAGBuilder.cpp

19 lines

SelectionDAGDumper.cpp

3 lines

TargetLoweringBase.cpp

6 lines

test/

CodeGen/

X86/

fpenv.ll

103 lines

Diff 553043

llvm/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 25,563 Lines • ▼ Show 20 Lines
	Semantics:			Semantics:
	""""""""""			""""""""""

	The '``llvm.reset.fpenv``' intrinsic sets the current floating-point environment			The '``llvm.reset.fpenv``' intrinsic sets the current floating-point environment
	to default state. It is similar to the call 'fesetenv(FE_DFL_ENV)', except it			to default state. It is similar to the call 'fesetenv(FE_DFL_ENV)', except it
	does not return any value.			does not return any value.


				'``llvm.get.fpmode``' Intrinsic
				^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

				Syntax:
				"""""""

				The '``llvm.get.fpmode``' intrinsic returns bits of the current floating-point
				control modes. The return value type is platform-specific.

				::

				declare <integer_type> @llvm.get.fpmode()

				Overview:
				"""""""""

				The '``llvm.get.fpmode``' intrinsic reads the current dynamic floating-point
				control modes and returns it as an integer value.

				Arguments:
				""""""""""

				None.

				Semantics:
				""""""""""

				The '``llvm.get.fpmode``' intrinsic reads the current dynamic floating-point
				control modes, such as rounding direction, precision, treatment of denormals and
				so on. It is similar to the C library function 'fegetmode', however this
				function does not store the set of control modes into memory but returns it as
				an integer value. Interpretation of the bits in this value is target-dependent.

				'``llvm.set.fpmode``' Intrinsic
				^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

				Syntax:
				"""""""

				The '``llvm.set.fpmode``' intrinsic sets the current floating-point control modes.

				::

				declare void @llvm.set.fpmode(<integer_type> <val>)

				Overview:
				"""""""""

				The '``llvm.set.fpmode``' intrinsic sets the current dynamic floating-point
				control modes.

				Arguments:
				""""""""""

				The argument is a set of floating-point control modes, represented as an integer
				value in a target-dependent way.

				Semantics:
				""""""""""

				The '``llvm.set.fpmode``' intrinsic sets the current dynamic floating-point
				control modes to the state specified by the argument, which must be obtained by
				a call to '``llvm.get.fpmode``' or constructed in a target-specific way. It is
				similar to the C library function 'fesetmode', however this function does not
				read the set of control modes from memory but gets it as integer value.

				'``llvm.reset.fpmode``' Intrinsic
				^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

				Syntax:
				"""""""

				::

				declare void @llvm.reset.fpmode()

				Overview:
				"""""""""

				The '``llvm.reset.fpmode``' intrinsic sets the default dynamic floating-point
				control modes.

				Arguments:
				""""""""""

				None.

				Semantics:
				""""""""""

				The '``llvm.reset.fpmode``' intrinsic sets the current dynamic floating-point
				environment to default state. It is similar to the C library function call
				'fesetmode(FE_DFL_MODE)', however this function does not return any value.


	Floating-Point Test Intrinsics			Floating-Point Test Intrinsics
	------------------------------			------------------------------

	These functions get properties of floating-point values.			These functions get properties of floating-point values.


	.. _llvm.is.fpclass:			.. _llvm.is.fpclass:

	▲ Show 20 Lines • Show All 1,833 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/ISDOpcodes.h

Show First 20 Lines • Show All 998 Lines • ▼ Show 20 Lines	enum NodeType {
/// to. The result is a token chain.		/// to. The result is a token chain.
GET_FPENV_MEM,		GET_FPENV_MEM,

/// Sets the current floating point environment. The first operand is a token		/// Sets the current floating point environment. The first operand is a token
/// chain, the second is a pointer to memory, where FP environment is loaded		/// chain, the second is a pointer to memory, where FP environment is loaded
/// from. The result is a token chain.		/// from. The result is a token chain.
SET_FPENV_MEM,		SET_FPENV_MEM,

		/// Reads the current dynamic floating-point control modes. The operand is
		/// a token chain.
		GET_FPMODE,

		/// Sets the current dynamic floating-point control modes. The first operand
		/// is a token chain, the second is control modes set represented as integer
		/// value.
		SET_FPMODE,

		/// Sets default dynamic floating-point control modes. The operand is a
		/// token chain.
		RESET_FPMODE,

/// LOAD and STORE have token chains as their first operand, then the same		/// LOAD and STORE have token chains as their first operand, then the same
/// operands as an LLVM load/store instruction, then an offset node that		/// operands as an LLVM load/store instruction, then an offset node that
/// is added / subtracted from the base pointer to form the address (for		/// is added / subtracted from the base pointer to form the address (for
/// indexed memory ops).		/// indexed memory ops).
LOAD,		LOAD,
STORE,		STORE,

/// DYNAMIC_STACKALLOC - Allocate some number of bytes on the stack aligned		/// DYNAMIC_STACKALLOC - Allocate some number of bytes on the stack aligned
▲ Show 20 Lines • Show All 574 Lines • Show Last 20 Lines

llvm/include/llvm/IR/Intrinsics.td

	Show First 20 Lines • Show All 1,067 Lines • ▼ Show 20 Lines
	//			//

	let IntrProperties = [IntrInaccessibleMemOnly, IntrWillReturn] in {			let IntrProperties = [IntrInaccessibleMemOnly, IntrWillReturn] in {
	def int_get_rounding : DefaultAttrsIntrinsic<[llvm_i32_ty], []>;			def int_get_rounding : DefaultAttrsIntrinsic<[llvm_i32_ty], []>;
	def int_set_rounding : DefaultAttrsIntrinsic<[], [llvm_i32_ty]>;			def int_set_rounding : DefaultAttrsIntrinsic<[], [llvm_i32_ty]>;
	def int_get_fpenv : DefaultAttrsIntrinsic<[llvm_anyint_ty], []>;			def int_get_fpenv : DefaultAttrsIntrinsic<[llvm_anyint_ty], []>;
	def int_set_fpenv : DefaultAttrsIntrinsic<[], [llvm_anyint_ty]>;			def int_set_fpenv : DefaultAttrsIntrinsic<[], [llvm_anyint_ty]>;
	def int_reset_fpenv : DefaultAttrsIntrinsic<[], []>;			def int_reset_fpenv : DefaultAttrsIntrinsic<[], []>;
				def int_get_fpmode : DefaultAttrsIntrinsic<[llvm_anyint_ty], []>;
				def int_set_fpmode : DefaultAttrsIntrinsic<[], [llvm_anyint_ty]>;
				def int_reset_fpmode : DefaultAttrsIntrinsic<[], []>;
	}			}

	//===--------------- Floating Point Properties ----------------------------===//			//===--------------- Floating Point Properties ----------------------------===//
	//			//

	def int_is_fpclass			def int_is_fpclass
	: DefaultAttrsIntrinsic<[LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],			: DefaultAttrsIntrinsic<[LLVMScalarOrSameVectorWidth<0, llvm_i1_ty>],
	[llvm_anyfloat_ty, llvm_i32_ty],			[llvm_anyfloat_ty, llvm_i32_ty],
	▲ Show 20 Lines • Show All 1,469 Lines • Show Last 20 Lines

llvm/include/llvm/IR/RuntimeLibcalls.def

	Show First 20 Lines • Show All 288 Lines • ▼ Show 20 Lines
	HANDLE_LIBCALL(FREXP_F80, "frexpl")			HANDLE_LIBCALL(FREXP_F80, "frexpl")
	HANDLE_LIBCALL(FREXP_F128, "frexpl")			HANDLE_LIBCALL(FREXP_F128, "frexpl")
	HANDLE_LIBCALL(FREXP_PPCF128, "frexpl")			HANDLE_LIBCALL(FREXP_PPCF128, "frexpl")

	// Floating point environment			// Floating point environment
	HANDLE_LIBCALL(FEGETENV, "fegetenv")			HANDLE_LIBCALL(FEGETENV, "fegetenv")
	HANDLE_LIBCALL(FESETENV, "fesetenv")			HANDLE_LIBCALL(FESETENV, "fesetenv")

				// Floating point control modes
				HANDLE_LIBCALL(FEGETMODE, "fegetmode")
				HANDLE_LIBCALL(FESETMODE, "fesetmode")

	// Conversion			// Conversion
	HANDLE_LIBCALL(FPEXT_F32_PPCF128, "__gcc_stoq")			HANDLE_LIBCALL(FPEXT_F32_PPCF128, "__gcc_stoq")
	HANDLE_LIBCALL(FPEXT_F64_PPCF128, "__gcc_dtoq")			HANDLE_LIBCALL(FPEXT_F64_PPCF128, "__gcc_dtoq")
	HANDLE_LIBCALL(FPEXT_F80_F128, "__extendxftf2")			HANDLE_LIBCALL(FPEXT_F80_F128, "__extendxftf2")
	HANDLE_LIBCALL(FPEXT_F64_F128, "__extenddftf2")			HANDLE_LIBCALL(FPEXT_F64_F128, "__extenddftf2")
	HANDLE_LIBCALL(FPEXT_F32_F128, "__extendsftf2")			HANDLE_LIBCALL(FPEXT_F32_F128, "__extendsftf2")
	HANDLE_LIBCALL(FPEXT_F16_F128, "__extendhftf2")			HANDLE_LIBCALL(FPEXT_F16_F128, "__extendhftf2")
	HANDLE_LIBCALL(FPEXT_F16_F80, "__extendhfxf2")			HANDLE_LIBCALL(FPEXT_F16_F80, "__extendhfxf2")
	▲ Show 20 Lines • Show All 304 Lines • Show Last 20 Lines

llvm/include/llvm/Target/TargetSelectionDAG.td

Show First 20 Lines • Show All 170 Lines • ▼ Show 20 Lines	def SDTFPToIntOp : SDTypeProfile<1, 1, [ // fp_to_[su]int
SDTCisInt<0>, SDTCisFP<1>, SDTCisSameNumEltsAs<0, 1>		SDTCisInt<0>, SDTCisFP<1>, SDTCisSameNumEltsAs<0, 1>
]>;		]>;
def SDTFPToIntSatOp : SDTypeProfile<1, 2, [ // fp_to_[su]int_sat		def SDTFPToIntSatOp : SDTypeProfile<1, 2, [ // fp_to_[su]int_sat
SDTCisInt<0>, SDTCisFP<1>, SDTCisSameNumEltsAs<0, 1>, SDTCisVT<2, OtherVT>		SDTCisInt<0>, SDTCisFP<1>, SDTCisSameNumEltsAs<0, 1>, SDTCisVT<2, OtherVT>
]>;		]>;
def SDTFPExpOp : SDTypeProfile<1, 2, [ // ldexp		def SDTFPExpOp : SDTypeProfile<1, 2, [ // ldexp
SDTCisSameAs<0, 1>, SDTCisFP<0>, SDTCisInt<2>		SDTCisSameAs<0, 1>, SDTCisFP<0>, SDTCisInt<2>
]>;		]>;
		def SDTGetFPStateOp : SDTypeProfile<1, 0, [ // get_fpenv, get_fpmode
		SDTCisInt<0>
		]>;
		def SDTSetFPStateOp : SDTypeProfile<0, 1, [ // set_fpenv, set_fpmode
		SDTCisInt<0>
		]>;
def SDTExtInreg : SDTypeProfile<1, 2, [ // sext_inreg		def SDTExtInreg : SDTypeProfile<1, 2, [ // sext_inreg
SDTCisSameAs<0, 1>, SDTCisInt<0>, SDTCisVT<2, OtherVT>,		SDTCisSameAs<0, 1>, SDTCisInt<0>, SDTCisVT<2, OtherVT>,
SDTCisVTSmallerThanOp<2, 1>		SDTCisVTSmallerThanOp<2, 1>
]>;		]>;
def SDTExtInvec : SDTypeProfile<1, 1, [ // sext_invec		def SDTExtInvec : SDTypeProfile<1, 1, [ // sext_invec
SDTCisInt<0>, SDTCisVec<0>, SDTCisInt<1>, SDTCisVec<1>,		SDTCisInt<0>, SDTCisVec<0>, SDTCisInt<1>, SDTCisVec<1>,
SDTCisOpSmallerThanOp<1, 0>		SDTCisOpSmallerThanOp<1, 0>
]>;		]>;
▲ Show 20 Lines • Show All 418 Lines • ▼ Show 20 Lines	def strict_fp_to_uint : SDNode<"ISD::STRICT_FP_TO_UINT",
SDTFPToIntOp, [SDNPHasChain]>;		SDTFPToIntOp, [SDNPHasChain]>;
def strict_sint_to_fp : SDNode<"ISD::STRICT_SINT_TO_FP",		def strict_sint_to_fp : SDNode<"ISD::STRICT_SINT_TO_FP",
SDTIntToFPOp, [SDNPHasChain]>;		SDTIntToFPOp, [SDNPHasChain]>;
def strict_uint_to_fp : SDNode<"ISD::STRICT_UINT_TO_FP",		def strict_uint_to_fp : SDNode<"ISD::STRICT_UINT_TO_FP",
SDTIntToFPOp, [SDNPHasChain]>;		SDTIntToFPOp, [SDNPHasChain]>;
def strict_fsetcc : SDNode<"ISD::STRICT_FSETCC", SDTSetCC, [SDNPHasChain]>;		def strict_fsetcc : SDNode<"ISD::STRICT_FSETCC", SDTSetCC, [SDNPHasChain]>;
def strict_fsetccs : SDNode<"ISD::STRICT_FSETCCS", SDTSetCC, [SDNPHasChain]>;		def strict_fsetccs : SDNode<"ISD::STRICT_FSETCCS", SDTSetCC, [SDNPHasChain]>;

		def get_fpmode : SDNode<"ISD::GET_FPMODE", SDTGetFPStateOp, [SDNPHasChain]>;
		def set_fpmode : SDNode<"ISD::SET_FPMODE", SDTSetFPStateOp, [SDNPHasChain]>;
		def reset_fpmode : SDNode<"ISD::RESET_FPMODE", SDTNone, [SDNPHasChain]>;

def setcc : SDNode<"ISD::SETCC" , SDTSetCC>;		def setcc : SDNode<"ISD::SETCC" , SDTSetCC>;
def select : SDNode<"ISD::SELECT" , SDTSelect>;		def select : SDNode<"ISD::SELECT" , SDTSelect>;
def vselect : SDNode<"ISD::VSELECT" , SDTVSelect>;		def vselect : SDNode<"ISD::VSELECT" , SDTVSelect>;
def selectcc : SDNode<"ISD::SELECT_CC" , SDTSelectCC>;		def selectcc : SDNode<"ISD::SELECT_CC" , SDTSelectCC>;

def brcc : SDNode<"ISD::BR_CC" , SDTBrCC, [SDNPHasChain]>;		def brcc : SDNode<"ISD::BR_CC" , SDTBrCC, [SDNPHasChain]>;
def brcond : SDNode<"ISD::BRCOND" , SDTBrcond, [SDNPHasChain]>;		def brcond : SDNode<"ISD::BRCOND" , SDTBrcond, [SDNPHasChain]>;
def brind : SDNode<"ISD::BRIND" , SDTBrind, [SDNPHasChain]>;		def brind : SDNode<"ISD::BRIND" , SDTBrind, [SDNPHasChain]>;
▲ Show 20 Lines • Show All 1,286 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

Show First 20 Lines • Show All 1,001 Lines • ▼ Show 20 Lines	case ISD::GET_DYNAMIC_AREA_OFFSET:
break;		break;
case ISD::VAARG:		case ISD::VAARG:
Action = TLI.getOperationAction(Node->getOpcode(),		Action = TLI.getOperationAction(Node->getOpcode(),
Node->getValueType(0));		Node->getValueType(0));
if (Action != TargetLowering::Promote)		if (Action != TargetLowering::Promote)
Action = TLI.getOperationAction(Node->getOpcode(), MVT::Other);		Action = TLI.getOperationAction(Node->getOpcode(), MVT::Other);
break;		break;
case ISD::SET_FPENV:		case ISD::SET_FPENV:
		case ISD::SET_FPMODE:
Action = TLI.getOperationAction(Node->getOpcode(),		Action = TLI.getOperationAction(Node->getOpcode(),
Node->getOperand(1).getValueType());		Node->getOperand(1).getValueType());
break;		break;
case ISD::FP_TO_FP16:		case ISD::FP_TO_FP16:
case ISD::FP_TO_BF16:		case ISD::FP_TO_BF16:
case ISD::SINT_TO_FP:		case ISD::SINT_TO_FP:
case ISD::UINT_TO_FP:		case ISD::UINT_TO_FP:
case ISD::EXTRACT_VECTOR_ELT:		case ISD::EXTRACT_VECTOR_ELT:
▲ Show 20 Lines • Show All 3,797 Lines • ▼ Show 20 Lines	void SelectionDAGLegalize::ConvertNodeToLibcall(SDNode *Node) {
}		}
case ISD::SET_FPENV_MEM: {		case ISD::SET_FPENV_MEM: {
SDValue Chain = Node->getOperand(0);		SDValue Chain = Node->getOperand(0);
SDValue EnvPtr = Node->getOperand(1);		SDValue EnvPtr = Node->getOperand(1);
Results.push_back(		Results.push_back(
DAG.makeStateFunctionCall(RTLIB::FESETENV, EnvPtr, Chain, dl));		DAG.makeStateFunctionCall(RTLIB::FESETENV, EnvPtr, Chain, dl));
break;		break;
}		}
		case ISD::GET_FPMODE: {
		// Call fegetmode, which saves control modes into a stack slot. Then load
		// the value to return from the stack.
		EVT ModeVT = Node->getValueType(0);
		SDValue StackPtr = DAG.CreateStackTemporary(ModeVT);
		int SPFI = cast<FrameIndexSDNode>(StackPtr.getNode())->getIndex();
		SDValue Chain = DAG.makeStateFunctionCall(RTLIB::FEGETMODE, StackPtr,
		Node->getOperand(0), dl);
		SDValue LdInst = DAG.getLoad(
		ModeVT, dl, Chain, StackPtr,
		MachinePointerInfo::getFixedStack(DAG.getMachineFunction(), SPFI));
		arsenmUnsubmitted Not Done Reply Inline Actions This should be getStack, not getFixedStack arsenm: This should be getStack, not getFixedStack
		Results.push_back(LdInst);
		Results.push_back(LdInst.getValue(1));
		break;
		}
		case ISD::SET_FPMODE: {
		// Move control modes to stack slot and then call fesetmode with the pointer
		// to the slot as argument.
		SDValue Mode = Node->getOperand(1);
		EVT ModeVT = Mode.getValueType();
		SDValue StackPtr = DAG.CreateStackTemporary(ModeVT);
		int SPFI = cast<FrameIndexSDNode>(StackPtr.getNode())->getIndex();
		SDValue StInst = DAG.getStore(
		Node->getOperand(0), dl, Mode, StackPtr,
		MachinePointerInfo::getFixedStack(DAG.getMachineFunction(), SPFI));
		arsenmUnsubmitted Not Done Reply Inline Actions This should be getStack, not getFixedStack arsenm: This should be getStack, not getFixedStack
		sepavloffAuthorUnsubmitted Done Reply Inline Actions Could you please explain a bit, why it should be `getStack`? It is not used anywhere in this file. Function `PerformInsertVectorEltInMemory` uses `getFixedStack` for temporary space, what is the difference between it and these cases? sepavloff: Could you please explain a bit, why it should be `getStack`? It is not used anywhere in this…
		arsenmUnsubmitted Not Done Reply Inline Actions Nevermind this is correct. We have confusing overlapping terminology. MachineFrameInfo refers to fixed stack objects as frame indexes as known fixed offsets for incoming stack arguments but apparently MachineMemOperand uses getFixedStack for any FrameIndex reference and getStack for offsets from SP arsenm: Nevermind this is correct. We have confusing overlapping terminology. MachineFrameInfo refers…
		Results.push_back(
		DAG.makeStateFunctionCall(RTLIB::FESETMODE, StackPtr, StInst, dl));
		break;
		}
		case ISD::RESET_FPMODE: {
		// It is legalized to a call 'fesetmode(FE_DFL_MODE)'. On most targets
		// FE_DFL_MODE is defined as '((const femode_t *) -1)' in glibc. If not, the
		// target must provide custom lowering.
		const DataLayout &DL = DAG.getDataLayout();
		EVT PtrTy = TLI.getPointerTy(DL);
		SDValue Mode = DAG.getConstant(-1LL, dl, PtrTy);
		Results.push_back(DAG.makeStateFunctionCall(RTLIB::FESETMODE, Mode,
		arsenmUnsubmitted Not Done Reply Inline Actions You don't need to bother with this typed pointer dance anymore arsenm: You don't need to bother with this typed pointer dance anymore
		arsenmUnsubmitted Not Done Reply Inline Actions Don't think the alloca address space is correct for this. I guess use the default? Also just use TLI.getPointerTy, you don't need to go through the IR type arsenm: Don't think the alloca address space is correct for this. I guess use the default? Also just…
		Node->getOperand(0), dl));
		break;
		}
}		}

// Replace the original node with the legalized result.		// Replace the original node with the legalized result.
if (!Results.empty()) {		if (!Results.empty()) {
LLVM_DEBUG(dbgs() << "Successfully converted node to libcall\n");		LLVM_DEBUG(dbgs() << "Successfully converted node to libcall\n");
ReplaceNode(Node, Results.data());		ReplaceNode(Node, Results.data());
} else		} else
LLVM_DEBUG(dbgs() << "Could not convert node to libcall\n");		LLVM_DEBUG(dbgs() << "Could not convert node to libcall\n");
▲ Show 20 Lines • Show All 720 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 6,659 Lines • ▼ Show 20 Lines	if (TLI.isOperationLegalOrCustom(ISD::SET_FPENV, EnvVT)) {
Chain = DAG.getSetFPEnv(Chain, sdl, Temp, EnvVT, MMO);		Chain = DAG.getSetFPEnv(Chain, sdl, Temp, EnvVT, MMO);
}		}
DAG.setRoot(Chain);		DAG.setRoot(Chain);
return;		return;
}		}
case Intrinsic::reset_fpenv:		case Intrinsic::reset_fpenv:
DAG.setRoot(DAG.getNode(ISD::RESET_FPENV, sdl, MVT::Other, getRoot()));		DAG.setRoot(DAG.getNode(ISD::RESET_FPENV, sdl, MVT::Other, getRoot()));
return;		return;
		case Intrinsic::get_fpmode:
		Res = DAG.getNode(
		ISD::GET_FPMODE, sdl,
		DAG.getVTList(TLI.getValueType(DAG.getDataLayout(), I.getType()),
		MVT::Other),
		DAG.getRoot());
		setValue(&I, Res);
		DAG.setRoot(Res.getValue(1));
		return;
		case Intrinsic::set_fpmode:
		Res = DAG.getNode(ISD::SET_FPMODE, sdl, MVT::Other, {DAG.getRoot()},
		getValue(I.getArgOperand(0)));
		DAG.setRoot(Res);
		return;
		case Intrinsic::reset_fpmode: {
		Res = DAG.getNode(ISD::RESET_FPMODE, sdl, MVT::Other, getRoot());
		DAG.setRoot(Res);
		return;
		}
case Intrinsic::pcmarker: {		case Intrinsic::pcmarker: {
SDValue Tmp = getValue(I.getArgOperand(0));		SDValue Tmp = getValue(I.getArgOperand(0));
DAG.setRoot(DAG.getNode(ISD::PCMARKER, sdl, MVT::Other, getRoot(), Tmp));		DAG.setRoot(DAG.getNode(ISD::PCMARKER, sdl, MVT::Other, getRoot(), Tmp));
return;		return;
}		}
case Intrinsic::readcyclecounter: {		case Intrinsic::readcyclecounter: {
SDValue Op = getRoot();		SDValue Op = getRoot();
Res = DAG.getNode(ISD::READCYCLECOUNTER, sdl,		Res = DAG.getNode(ISD::READCYCLECOUNTER, sdl,
▲ Show 20 Lines • Show All 5,302 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

Show First 20 Lines • Show All 433 Lines • ▼ Show 20 Lines	#endif
// Floating point environment manipulation		// Floating point environment manipulation
case ISD::GET_ROUNDING: return "get_rounding";		case ISD::GET_ROUNDING: return "get_rounding";
case ISD::SET_ROUNDING: return "set_rounding";		case ISD::SET_ROUNDING: return "set_rounding";
case ISD::GET_FPENV: return "get_fpenv";		case ISD::GET_FPENV: return "get_fpenv";
case ISD::SET_FPENV: return "set_fpenv";		case ISD::SET_FPENV: return "set_fpenv";
case ISD::RESET_FPENV: return "reset_fpenv";		case ISD::RESET_FPENV: return "reset_fpenv";
case ISD::GET_FPENV_MEM: return "get_fpenv_mem";		case ISD::GET_FPENV_MEM: return "get_fpenv_mem";
case ISD::SET_FPENV_MEM: return "set_fpenv_mem";		case ISD::SET_FPENV_MEM: return "set_fpenv_mem";
		case ISD::GET_FPMODE: return "get_fpmode";
		case ISD::SET_FPMODE: return "set_fpmode";
		case ISD::RESET_FPMODE: return "reset_fpmode";

// Bit manipulation		// Bit manipulation
case ISD::ABS: return "abs";		case ISD::ABS: return "abs";
case ISD::BITREVERSE: return "bitreverse";		case ISD::BITREVERSE: return "bitreverse";
case ISD::BSWAP: return "bswap";		case ISD::BSWAP: return "bswap";
case ISD::CTPOP: return "ctpop";		case ISD::CTPOP: return "ctpop";
case ISD::CTTZ: return "cttz";		case ISD::CTTZ: return "cttz";
case ISD::CTTZ_ZERO_UNDEF: return "cttz_zero_undef";		case ISD::CTTZ_ZERO_UNDEF: return "cttz_zero_undef";
▲ Show 20 Lines • Show All 647 Lines • Show Last 20 Lines

llvm/lib/CodeGen/TargetLoweringBase.cpp

Show First 20 Lines • Show All 936 Lines • ▼ Show 20 Lines	#include "llvm/IR/VPIntrinsics.def"
// On most systems, DEBUGTRAP and TRAP have no difference. The "Expand"		// On most systems, DEBUGTRAP and TRAP have no difference. The "Expand"
// here is to inform DAG Legalizer to replace DEBUGTRAP with TRAP.		// here is to inform DAG Legalizer to replace DEBUGTRAP with TRAP.
setOperationAction(ISD::DEBUGTRAP, MVT::Other, Expand);		setOperationAction(ISD::DEBUGTRAP, MVT::Other, Expand);

setOperationAction(ISD::UBSANTRAP, MVT::Other, Expand);		setOperationAction(ISD::UBSANTRAP, MVT::Other, Expand);

setOperationAction(ISD::GET_FPENV_MEM, MVT::Other, Expand);		setOperationAction(ISD::GET_FPENV_MEM, MVT::Other, Expand);
setOperationAction(ISD::SET_FPENV_MEM, MVT::Other, Expand);		setOperationAction(ISD::SET_FPENV_MEM, MVT::Other, Expand);

		for (MVT VT : {MVT::i8, MVT::i16, MVT::i32, MVT::i64}) {
		setOperationAction(ISD::GET_FPMODE, VT, Expand);
		setOperationAction(ISD::SET_FPMODE, VT, Expand);
		}
		setOperationAction(ISD::RESET_FPMODE, MVT::Other, Expand);
}		}

MVT TargetLoweringBase::getScalarShiftAmountTy(const DataLayout &DL,		MVT TargetLoweringBase::getScalarShiftAmountTy(const DataLayout &DL,
EVT) const {		EVT) const {
return MVT::getIntegerVT(DL.getPointerSizeInBits(0));		return MVT::getIntegerVT(DL.getPointerSizeInBits(0));
}		}

EVT TargetLoweringBase::getShiftAmountTy(EVT LHSTy, const DataLayout &DL,		EVT TargetLoweringBase::getShiftAmountTy(EVT LHSTy, const DataLayout &DL,
▲ Show 20 Lines • Show All 1,453 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/fpenv.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc -mtriple=i686-unknown-linux-gnu -mattr=-sse -verify-machineinstrs < %s \| FileCheck %s -check-prefix=X86-NOSSE			; RUN: llc -mtriple=i686-unknown-linux-gnu -mattr=-sse -verify-machineinstrs < %s \| FileCheck %s -check-prefix=X86-NOSSE
	; RUN: llc -mtriple=i686-unknown-linux-gnu -mattr=+sse -verify-machineinstrs < %s \| FileCheck %s -check-prefix=X86-SSE			; RUN: llc -mtriple=i686-unknown-linux-gnu -mattr=+sse -verify-machineinstrs < %s \| FileCheck %s -check-prefix=X86-SSE
	; RUN: llc -mtriple=x86_64-unknown-linux-gnu -verify-machineinstrs < %s \| FileCheck %s -check-prefix=X64			; RUN: llc -mtriple=x86_64-unknown-linux-gnu -verify-machineinstrs < %s \| FileCheck %s -check-prefix=X64

	declare void @llvm.set.rounding(i32 %x)			declare void @llvm.set.rounding(i32 %x)
	declare i256 @llvm.get.fpenv.i256()			declare i256 @llvm.get.fpenv.i256()
	declare void @llvm.set.fpenv.i256(i256 %fpenv)			declare void @llvm.set.fpenv.i256(i256 %fpenv)
	declare void @llvm.reset.fpenv()			declare void @llvm.reset.fpenv()
				declare i32 @llvm.get.fpmode.i32()
				declare void @llvm.set.fpmode.i32(i32 %fpmode)
				declare void @llvm.reset.fpmode()

	define void @func_01() nounwind {			define void @func_01() nounwind {
	; X86-NOSSE-LABEL: func_01:			; X86-NOSSE-LABEL: func_01:
	; X86-NOSSE: # %bb.0:			; X86-NOSSE: # %bb.0:
	; X86-NOSSE-NEXT: pushl %eax			; X86-NOSSE-NEXT: pushl %eax
	; X86-NOSSE-NEXT: fnstcw (%esp)			; X86-NOSSE-NEXT: fnstcw (%esp)
	; X86-NOSSE-NEXT: orb $12, {{[0-9]+}}(%esp)			; X86-NOSSE-NEXT: orb $12, {{[0-9]+}}(%esp)
	; X86-NOSSE-NEXT: fldcw (%esp)			; X86-NOSSE-NEXT: fldcw (%esp)
	▲ Show 20 Lines • Show All 397 Lines • ▼ Show 20 Lines
	; X64-NEXT: fldenv {{\.?LCPI[0-9]+_[0-9]+}}(%rip)			; X64-NEXT: fldenv {{\.?LCPI[0-9]+_[0-9]+}}(%rip)
	; X64-NEXT: ldmxcsr {{\.?LCPI[0-9]+_[0-9]+}}+28(%rip)			; X64-NEXT: ldmxcsr {{\.?LCPI[0-9]+_[0-9]+}}+28(%rip)
	; X64-NEXT: retq			; X64-NEXT: retq
	entry:			entry:
	call void @llvm.reset.fpenv()			call void @llvm.reset.fpenv()
	ret void			ret void
	}			}

				define i32 @func_get_fpmode() #0 {
				; X86-NOSSE-LABEL: func_get_fpmode:
				; X86-NOSSE: # %bb.0: # %entry
				; X86-NOSSE-NEXT: subl $12, %esp
				; X86-NOSSE-NEXT: leal {{[0-9]+}}(%esp), %eax
				; X86-NOSSE-NEXT: movl %eax, (%esp)
				; X86-NOSSE-NEXT: calll fegetmode
				; X86-NOSSE-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NOSSE-NEXT: addl $12, %esp
				; X86-NOSSE-NEXT: retl
				;
				; X86-SSE-LABEL: func_get_fpmode:
				; X86-SSE: # %bb.0: # %entry
				; X86-SSE-NEXT: subl $12, %esp
				; X86-SSE-NEXT: leal {{[0-9]+}}(%esp), %eax
				; X86-SSE-NEXT: movl %eax, (%esp)
				; X86-SSE-NEXT: calll fegetmode
				; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-SSE-NEXT: addl $12, %esp
				; X86-SSE-NEXT: retl
				;
				; X64-LABEL: func_get_fpmode:
				; X64: # %bb.0: # %entry
				; X64-NEXT: pushq %rax
				; X64-NEXT: leaq {{[0-9]+}}(%rsp), %rdi
				; X64-NEXT: callq fegetmode@PLT
				; X64-NEXT: movl {{[0-9]+}}(%rsp), %eax
				; X64-NEXT: popq %rcx
				; X64-NEXT: retq
				entry:
				%fpmode = call i32 @llvm.get.fpmode.i32()
				ret i32 %fpmode
				}

				define void @func_set_fpmode(i32 %fpmode) #0 {
				; X86-NOSSE-LABEL: func_set_fpmode:
				; X86-NOSSE: # %bb.0: # %entry
				; X86-NOSSE-NEXT: subl $12, %esp
				; X86-NOSSE-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-NOSSE-NEXT: movl %eax, {{[0-9]+}}(%esp)
				; X86-NOSSE-NEXT: leal {{[0-9]+}}(%esp), %eax
				; X86-NOSSE-NEXT: movl %eax, (%esp)
				; X86-NOSSE-NEXT: calll fesetmode
				; X86-NOSSE-NEXT: addl $12, %esp
				; X86-NOSSE-NEXT: retl
				;
				; X86-SSE-LABEL: func_set_fpmode:
				; X86-SSE: # %bb.0: # %entry
				; X86-SSE-NEXT: subl $12, %esp
				; X86-SSE-NEXT: movl {{[0-9]+}}(%esp), %eax
				; X86-SSE-NEXT: movl %eax, {{[0-9]+}}(%esp)
				; X86-SSE-NEXT: leal {{[0-9]+}}(%esp), %eax
				; X86-SSE-NEXT: movl %eax, (%esp)
				; X86-SSE-NEXT: calll fesetmode
				; X86-SSE-NEXT: addl $12, %esp
				; X86-SSE-NEXT: retl
				;
				; X64-LABEL: func_set_fpmode:
				; X64: # %bb.0: # %entry
				; X64-NEXT: pushq %rax
				; X64-NEXT: movl %edi, {{[0-9]+}}(%rsp)
				; X64-NEXT: leaq {{[0-9]+}}(%rsp), %rdi
				; X64-NEXT: callq fesetmode@PLT
				; X64-NEXT: popq %rax
				; X64-NEXT: retq
				entry:
				call void @llvm.set.fpmode.i32(i32 %fpmode)
				ret void
				}


				define void @func_reset() #0 {
				; X86-NOSSE-LABEL: func_reset:
				; X86-NOSSE: # %bb.0: # %entry
				; X86-NOSSE-NEXT: subl $12, %esp
				; X86-NOSSE-NEXT: movl $-1, (%esp)
				; X86-NOSSE-NEXT: calll fesetmode
				; X86-NOSSE-NEXT: addl $12, %esp
				; X86-NOSSE-NEXT: retl
				;
				; X86-SSE-LABEL: func_reset:
				; X86-SSE: # %bb.0: # %entry
				; X86-SSE-NEXT: subl $12, %esp
				; X86-SSE-NEXT: movl $-1, (%esp)
				; X86-SSE-NEXT: calll fesetmode
				; X86-SSE-NEXT: addl $12, %esp
				; X86-SSE-NEXT: retl
				;
				; X64-LABEL: func_reset:
				; X64: # %bb.0: # %entry
				; X64-NEXT: pushq %rax
				; X64-NEXT: movq $-1, %rdi
				; X64-NEXT: callq fesetmode@PLT
				; X64-NEXT: popq %rax
				; X64-NEXT: retq
				entry:
				call void @llvm.reset.fpmode()
				ret void
				}

	attributes #0 = { nounwind "use-soft-float"="true" }			attributes #0 = { nounwind "use-soft-float"="true" }

This is an archive of the discontinued LLVM Phabricator instance.

[FPEnv] Intrinsics for access to FP control modesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 553043

llvm/docs/LangRef.rst

llvm/include/llvm/CodeGen/ISDOpcodes.h

llvm/include/llvm/IR/Intrinsics.td

llvm/include/llvm/IR/RuntimeLibcalls.def

llvm/include/llvm/Target/TargetSelectionDAG.td

llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

llvm/lib/CodeGen/TargetLoweringBase.cpp

llvm/test/CodeGen/X86/fpenv.ll

[FPEnv] Intrinsics for access to FP control modes
ClosedPublic