This is an archive of the discontinued LLVM Phabricator instance.

llvm/lib/Target/X86/X86ISelLowering.cpp
721	Add f32/f64 too?
22707	Why only f80?
22762	Can we define it with a new name? This gives the impression that the check conditions can be permuted, but the code doesn't seem to support it.
22765	ditto.
22769	Define it in `FPSWFlag` ?
22808	This is equivalent to the assert at line 22751; it would be better placed there, with a message.
22848	This only works for fp80. We should check the type first. Can we check exception #IA for f32 and f64?
llvm/test/CodeGen/X86/x86-is_fpclass-fp80.ll
9	Where's fucomp generated from? I didn't see it in the code.
21	Why `fucompi`?

Address reviewer's notes.

sepavloff added inline comments. Nov 9 2021, 9:25 AM

llvm/lib/Target/X86/X86ISelLowering.cpp
721	There is concern that FXAM can be slow (https://reviews.llvm.org/D104853#2839746), so it is used only for f80, where x87 is used anyway. For other types the default lowering is used, which performs the tests with ordinary floating-point operations or integer arithmetic. The resulting code for x86 is shown in the test in D112025.
22707	You are right, this check is not needed. Removed.
22762	Actually the checks can be permuted. I reformatted the comments, hopefully they make this code clearer.
22769	I would like to, but I cannot come up with a good name for it. Added a comment describing why this constant is used.
22808	The assert only checked that for each `CCToCompare` there is a corresponding `ExpectedCC`. There are not many cases here, so it seems that this check is safe to remove.
22848	Fixed.
llvm/test/CodeGen/X86/x86-is_fpclass-fp80.ll
9	It appears as a result of the call `DAG.getSetCC(DL, ResultVT, Arg, Arg, ISD::CondCode::SETUNE)`, which is called at the beginning of `lowerIS_FPCLASS` if exceptions are ignored.
21	It is also the result of `getSetCC`. This code must be generated for unordered comparison.
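
The `SETUNE` self-comparison described in the replies for lines 9 and 21 relies on the IEEE 754 rule that a NaN compares unordered with every value, including itself. A minimal C++ sketch of the same idea follows. The name `isNaNByCompare` is hypothetical and not part of the patch, and the sketch assumes the compiler is not run with `-ffast-math` or a similar option that lets it assume NaNs never occur.

```cpp
#include <limits>

// Hypothetical helper (not from the patch): true iff x is a NaN.
// "x != x" is an unordered-or-not-equal test, the source-level analogue
// of comparing the argument with itself using SETUNE.
bool isNaNByCompare(long double x) {
  return x != x;
}
```

Because such a comparison is an ordinary floating-point compare, it is only used in the patch when exceptions are ignored.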

Harbormaster completed remote builds in B133280: Diff 385847. Nov 9 2021, 9:52 AM

pengfei added inline comments. Nov 9 2021, 10:18 PM

llvm/lib/Target/X86/X86ISelLowering.cpp
721	Given that, and since we still need to distinguish sNaN from qNaN, why don't we use integer arithmetic for `f80` too? We just need to check the high 32 bits for all classes except fcZero and fcInf.
22704	This should also help for nan combined with other flags.
22712–22725	Can we assert that the type is only `f80`?
22762	But the switch doesn't enumerate all possibilities, e.g., ISD::fcZero | ISD::fcInf. I don't think silently breaking for these cases is correct.
22848	I don't understand. If we only customize for `f80`, why do we always add code for `f32` and `f64`?
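
The concern about combinations such as `fcZero | fcInf` can be illustrated with a sketch of a general fallback in the style of the patch's "series of checks": each class group is tested separately and the partial results are OR-ed. The status-word constants are taken from the patch's `FPSWFlag` enum. The mask bits and the names `testGroup` and `matchesMask` are hypothetical, and only two groups are shown.

```cpp
#include <cstdint>

// Status-word bits as defined by the patch's FPSWFlag enum.
constexpr std::uint16_t C0 = 0x0100, C1 = 0x0200, C2 = 0x0400, C3 = 0x4000;
constexpr std::uint16_t AllCondCodeBits = C0 | C1 | C2 | C3;
constexpr std::uint16_t AllClassBits = C0 | C2 | C3;

// Hypothetical class-mask bits for this sketch only (not LLVM's ISD::fc* values).
enum : unsigned {
  NegInf = 1u << 0, PosInf = 1u << 1,
  NegZero = 1u << 2, PosZero = 1u << 3,
  Inf = NegInf | PosInf, Zero = NegZero | PosZero
};

// Tests one class group; `cls` is the FXAM encoding of that class.
static bool testGroup(std::uint16_t fpsw, unsigned part, unsigned neg,
                      unsigned pos, std::uint16_t cls) {
  if (part == 0)
    return false;                                   // group not requested
  const std::uint16_t withSign = fpsw & AllCondCodeBits;
  if (part == pos)
    return withSign == cls;                         // sign bit (C1) must be clear
  if (part == neg)
    return withSign == (cls | C1);                  // sign bit (C1) must be set
  return (fpsw & AllClassBits) == cls;              // either sign
}

// General case: OR the partial results of all requested groups.
bool matchesMask(std::uint16_t fpsw, unsigned mask) {
  return testGroup(fpsw, mask & Inf, NegInf, PosInf, C0 | C2) ||
         testGroup(fpsw, mask & Zero, NegZero, PosZero, C3);
}
```

With this structure a mask that no fast path recognizes still gets a correct answer, for example `matchesMask(fpsw, Zero | Inf)`.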

Remove support of f32 and f64

sepavloff added inline comments. Nov 10 2021, 7:34 AM

llvm/lib/Target/X86/X86ISelLowering.cpp
721	There is no `i80`, so doing integer arithmetic on `f80` is complicated. The existing default lowering simply crashes on `f80`. Also, for many classes, such as `fcZero`, `fcSubnormal`, `fcInf` or `fcSNan`, we need to analyze all three words of `f80`. A single `FXAM` instruction does the whole job and seems faster even though its latency is high.
22704	It looks faster to execute `FXAM` and then analyze its result in the integer pipeline than to do the checks in x87 registers and move the results to integer registers for analysis.
22712–22725	Done just at the start of the function.
22762	This switch only processes some combinations that are simple to implement. For all other combinations the general case is applied. This patch was tested using the runtime tests from D112933. They contain all combinations of two basic tests and many other combinations.
22848	There was some hesitation about whether `FXAM` could be used for types other than `f80` in some cases. Support for these types is now removed.
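
To make the `FXAM`-based approach concrete, here is a small sketch of how the status word stored by `FNSTSW` can be decoded with integer operations. The bit values match the `FPSWFlag` enum added by this patch and the class encodings follow the documented `FXAM` condition codes. The names `FpKind`, `decodeFxam`, `isNegative` and `isNanOrUnsupported` are hypothetical and do not appear in the patch.

```cpp
#include <cstdint>

// Condition-code bits of the x87 status word, as in the patch's FPSWFlag enum.
constexpr std::uint16_t C0 = 0x0100, C1 = 0x0200, C2 = 0x0400, C3 = 0x4000;
constexpr std::uint16_t AllClassBits = C0 | C2 | C3;

enum class FpKind { Unsupported, NaN, Normal, Inf, Zero, Empty, Subnormal };

// Hypothetical decoder: maps the (C3, C2, C0) bits set by FXAM to a class.
FpKind decodeFxam(std::uint16_t fpsw) {
  switch (fpsw & AllClassBits) {
  case 0:       return FpKind::Unsupported;
  case C0:      return FpKind::NaN;
  case C2:      return FpKind::Normal;
  case C0 | C2: return FpKind::Inf;
  case C3:      return FpKind::Zero;
  case C3 | C0: return FpKind::Empty;
  case C3 | C2: return FpKind::Subnormal;
  default:      return FpKind::Unsupported; // C3|C2|C0 is not an FXAM result
  }
}

// C1 holds the sign of the examined value.
bool isNegative(std::uint16_t fpsw) { return (fpsw & C1) != 0; }

// Mirrors the patch's "<= ClassNaN" comparison, which treats unsupported
// encodings (class bits all zero) as NaN for compatibility with glibc.
bool isNanOrUnsupported(std::uint16_t fpsw) {
  return (fpsw & AllClassBits) <= C0;
}
```

A single masked comparison is enough for each basic class, which is why the lowering only needs `AND` and `SETCC` nodes after the status word is in `AX`.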

Harbormaster completed remote builds in B133470: Diff 386144. Nov 10 2021, 7:36 AM

pengfei added inline comments. Nov 10 2021, 6:27 PM

llvm/lib/Target/X86/X86ISelLowering.cpp
721	I mean we still customize `f80` here, but extend the code for checking S/QNAN to more classes by loading bits [79:48] rather than [63:32]. We can easily check whether the remaining 48 bits are zero for `fcZero` and `fcInf`. Not only for performance, but also we should lower it when `f80` is used without `x87` enabled. Although it's rare of such case, it's supported by compiler.
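
A rough sketch of what an integer-only classification of an `f80` value could look like is given below. It is not what the patch does. It assumes the 80 bits have already been split into a 16-bit sign/exponent word and a 64-bit significand, and the names `X87Bits`, `F80Class` and `classifyF80` are hypothetical. Pseudo-denormals are lumped together with subnormals as a simplification.

```cpp
#include <cstdint>

// Hypothetical container for the raw 80 bits of an x87 extended value.
struct X87Bits {
  std::uint16_t signExp;      // bit 15: sign, bits 14..0: biased exponent
  std::uint64_t significand;  // bit 63: explicit integer bit, bits 62..0: fraction
};

enum class F80Class { Zero, Subnormal, Normal, Inf, QNaN, SNaN, Unsupported };

F80Class classifyF80(X87Bits v) {
  const std::uint16_t exp = v.signExp & 0x7FFF;
  const bool intBit = (v.significand >> 63) != 0;
  const std::uint64_t frac = v.significand & 0x7FFFFFFFFFFFFFFFULL;

  if (exp == 0) {
    if (v.significand == 0)
      return F80Class::Zero;
    return F80Class::Subnormal;   // simplification: includes pseudo-denormals
  }
  if (!intBit)
    return F80Class::Unsupported; // unnormals, pseudo-NaNs, pseudo-infinities
  if (exp != 0x7FFF)
    return F80Class::Normal;
  if (frac == 0)
    return F80Class::Inf;
  // Bit 62 of the significand is the quiet bit.
  return ((frac >> 62) & 1) ? F80Class::QNaN : F80Class::SNaN;
}
```

The quiet-bit test corresponds to the mask `0x40000000` that the patch applies to the 32-bit word loaded at byte offset 4, which is bit 62 of the significand.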

sepavloff added inline comments. Nov 11 2021, 2:35 AM

llvm/lib/Target/X86/X86ISelLowering.cpp
721	"but also we should lower it when f80 is used without x87 enabled. Although it's rare of such case, it's supported by compiler." How can this mode be activated? If I provide the option `-mno-x87`, the compiler fails in `TargetLoweringBase::getRegClassFor` at: assert(RC && "This value type is not natively supported!");

pengfei added inline comments. Nov 11 2021, 6:13 AM

llvm/lib/Target/X86/X86ISelLowering.cpp
721	Yeah, Clang may still have some problems handling such cases, but GCC does support it: https://godbolt.org/z/aza81fKx4 We shouldn't make it worse :)

sepavloff added inline comments. Nov 11 2021, 8:29 AM

llvm/lib/Target/X86/X86ISelLowering.cpp
721	As I understand it, this is what `-msoft-float` should do, but it doesn't in clang. Things are already bad :) If FP operations in this case are implemented by library functions, it must be easier, more efficient and more reliable to implement a special function for this intrinsic than to emulate the FP operations in the compiler.

Updated patch because its dependency changed.

Harbormaster completed remote builds in B150682: Diff 410288. Feb 21 2022, 5:45 AM

sepavloff added inline comments. Feb 21 2022, 5:52 AM

llvm/lib/Target/X86/X86ISelLowering.cpp
721	Adding a new library function is not an option, because we cannot add it to libgcc. The base patch (D112025) was changed to support fp80 as well. Functions with the suffix "soft" in the test file `is_fpclass-fp80.ll` demonstrate the lowering for the case when the x87 unit is not available.

sepavloff mentioned this in D112025: Intrinsic for checking floating point class. Mar 18 2022, 4:43 AM

Revision Contents

Path	Size
llvm/lib/Target/X86/MCTargetDesc/X86BaseInfo.h	19 lines
llvm/lib/Target/X86/X86ISelLowering.h	1 line
llvm/lib/Target/X86/X86ISelLowering.cpp	265 lines
llvm/test/CodeGen/X86/x86-is_fpclass-fp80.ll	497 lines

Diff 385523

llvm/lib/Target/X86/MCTargetDesc/X86BaseInfo.h

//===-- X86BaseInfo.h - Top level definitions for X86 -------- --- C++ --===//		//===-- X86BaseInfo.h - Top level definitions for X86 -------- --- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file contains small standalone helper functions and enum definitions for		// This file contains small standalone helper functions and enum definitions for
[...]	enum CondCode {
// which can't be represented on x86 with a single condition. These		// which can't be represented on x86 with a single condition. These
// are never used in MachineInstrs and are inverses of one another.		// are never used in MachineInstrs and are inverses of one another.
COND_NE_OR_P,		COND_NE_OR_P,
COND_E_AND_NP,		COND_E_AND_NP,

COND_INVALID		COND_INVALID
};		};

		/// Bits of x87 FPU Status Register.
		enum FPSWFlag {
		// Condition code.
		C0 = 0x0100,
		C1 = 0x0200,
		C2 = 0x0400,
		C3 = 0x4000,

		// Bits set by FXAM.
		AllCondCodeBits = C0 | C1 | C2 | C3,
		AllClassBits = C0 | C2 | C3,
		ClassNaN = C0,
		ClassInf = C0 | C2,
		ClassNormal = C2,
		ClassZero = C3,
		ClassSubnormal = C2 | C3,
		Negative = C1
		};

// The classification for the first instruction in macro fusion.		// The classification for the first instruction in macro fusion.
enum class FirstMacroFusionInstKind {		enum class FirstMacroFusionInstKind {
// TEST		// TEST
Test,		Test,
// CMP		// CMP
Cmp,		Cmp,
// AND		// AND
And,		And,
[...]

llvm/lib/Target/X86/X86ISelLowering.h

[...]	private:
SDValue LowerWin64_FP_TO_INT128(SDValue Op, SelectionDAG &DAG,		SDValue LowerWin64_FP_TO_INT128(SDValue Op, SelectionDAG &DAG,
SDValue &Chain) const;		SDValue &Chain) const;
SDValue LowerWin64_INT128_TO_FP(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerWin64_INT128_TO_FP(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerGC_TRANSITION(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerGC_TRANSITION(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerINTRINSIC_WO_CHAIN(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerINTRINSIC_WO_CHAIN(SDValue Op, SelectionDAG &DAG) const;
SDValue lowerFaddFsub(SDValue Op, SelectionDAG &DAG) const;		SDValue lowerFaddFsub(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerFP_EXTEND(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerFP_EXTEND(SDValue Op, SelectionDAG &DAG) const;
SDValue LowerFP_ROUND(SDValue Op, SelectionDAG &DAG) const;		SDValue LowerFP_ROUND(SDValue Op, SelectionDAG &DAG) const;
		SDValue lowerIS_FPCLASS(SDValue Op, SelectionDAG &DAG) const;

SDValue		SDValue
LowerFormalArguments(SDValue Chain, CallingConv::ID CallConv, bool isVarArg,		LowerFormalArguments(SDValue Chain, CallingConv::ID CallConv, bool isVarArg,
const SmallVectorImpl<ISD::InputArg> &Ins,		const SmallVectorImpl<ISD::InputArg> &Ins,
const SDLoc &dl, SelectionDAG &DAG,		const SDLoc &dl, SelectionDAG &DAG,
SmallVectorImpl<SDValue> &InVals) const override;		SmallVectorImpl<SDValue> &InVals) const override;
SDValue LowerCall(CallLoweringInfo &CLI,		SDValue LowerCall(CallLoweringInfo &CLI,
SmallVectorImpl<SDValue> &InVals) const override;		SmallVectorImpl<SDValue> &InVals) const override;
[...]

llvm/lib/Target/X86/X86ISelLowering.cpp


[...]	if (UseX87) {
setOperationAction(ISD::FTRUNC, MVT::f80, Expand);		setOperationAction(ISD::FTRUNC, MVT::f80, Expand);
setOperationAction(ISD::FRINT, MVT::f80, Expand);		setOperationAction(ISD::FRINT, MVT::f80, Expand);
setOperationAction(ISD::FNEARBYINT, MVT::f80, Expand);		setOperationAction(ISD::FNEARBYINT, MVT::f80, Expand);
setOperationAction(ISD::FMA, MVT::f80, Expand);		setOperationAction(ISD::FMA, MVT::f80, Expand);
setOperationAction(ISD::LROUND, MVT::f80, Expand);		setOperationAction(ISD::LROUND, MVT::f80, Expand);
setOperationAction(ISD::LLROUND, MVT::f80, Expand);		setOperationAction(ISD::LLROUND, MVT::f80, Expand);
setOperationAction(ISD::LRINT, MVT::f80, Custom);		setOperationAction(ISD::LRINT, MVT::f80, Custom);
setOperationAction(ISD::LLRINT, MVT::f80, Custom);		setOperationAction(ISD::LLRINT, MVT::f80, Custom);
		setOperationAction(ISD::IS_FPCLASS, MVT::f80, Custom);

// Handle constrained floating-point operations of scalar.		// Handle constrained floating-point operations of scalar.
setOperationAction(ISD::STRICT_FADD , MVT::f80, Legal);		setOperationAction(ISD::STRICT_FADD , MVT::f80, Legal);
setOperationAction(ISD::STRICT_FSUB , MVT::f80, Legal);		setOperationAction(ISD::STRICT_FSUB , MVT::f80, Legal);
setOperationAction(ISD::STRICT_FMUL , MVT::f80, Legal);		setOperationAction(ISD::STRICT_FMUL , MVT::f80, Legal);
setOperationAction(ISD::STRICT_FDIV , MVT::f80, Legal);		setOperationAction(ISD::STRICT_FDIV , MVT::f80, Legal);
setOperationAction(ISD::STRICT_FSQRT , MVT::f80, Legal);		setOperationAction(ISD::STRICT_FSQRT , MVT::f80, Legal);
setOperationAction(ISD::STRICT_FP_EXTEND, MVT::f80, Legal);		setOperationAction(ISD::STRICT_FP_EXTEND, MVT::f80, Legal);
[...]	static SDValue LowerFGETSIGN(SDValue Op, SelectionDAG &DAG) {
MVT VecVT = (OpVT == MVT::f32 ? MVT::v4f32 : MVT::v2f64);		MVT VecVT = (OpVT == MVT::f32 ? MVT::v4f32 : MVT::v2f64);
SDValue Res = DAG.getNode(ISD::SCALAR_TO_VECTOR, dl, VecVT, N0);		SDValue Res = DAG.getNode(ISD::SCALAR_TO_VECTOR, dl, VecVT, N0);
Res = DAG.getNode(X86ISD::MOVMSK, dl, MVT::i32, Res);		Res = DAG.getNode(X86ISD::MOVMSK, dl, MVT::i32, Res);
Res = DAG.getZExtOrTrunc(Res, dl, VT);		Res = DAG.getZExtOrTrunc(Res, dl, VT);
Res = DAG.getNode(ISD::AND, dl, VT, Res, DAG.getConstant(1, dl, VT));		Res = DAG.getNode(ISD::AND, dl, VT, Res, DAG.getConstant(1, dl, VT));
return Res;		return Res;
}		}

		SDValue X86TargetLowering::lowerIS_FPCLASS(SDValue Op,
		SelectionDAG &DAG) const {
		SDLoc DL(Op);
		MVT ResultVT = Op.getSimpleValueType();
		SDValue Arg = Op.getOperand(0);
		MVT ArgVT = Arg.getSimpleValueType();
		auto CNode = cast<ConstantSDNode>(Op.getOperand(1));
		unsigned Check = CNode->getZExtValue();

		if (Check == ISD::fcNan) {
		// If exceptions are ignored, use unordered comparison. It treats
		// unsupported values as NaNs, which is compatible with glibc.
		if (ArgVT == MVT::f80 && Op->getFlags().hasNoFPExcept())
		return DAG.getSetCC(DL, ResultVT, Arg, Arg, ISD::CondCode::SETUNE);
		}

		// Determine classification of the argument using instruction FXAM.
		unsigned Opc;
		switch (ArgVT.SimpleTy) {
		default:
		llvm_unreachable("Unexpected type!");
		case MVT::f32:
		Opc = X86::XAM_Fp32;
		break;
		case MVT::f64:
		Opc = X86::XAM_Fp64;
		break;
		case MVT::f80:
		Opc = X86::XAM_Fp80;
		break;
		}
		SDValue Test(DAG.getMachineNode(Opc, DL, MVT::Glue, Arg), 0);

		// Move FPSW to AX.
		SDValue FPSW =
		SDValue(DAG.getMachineNode(X86::FNSTSW16r, DL, MVT::i16, Test), 0);

		// Recognize the case when the FP class test is an inversion of a simpler
		// test, like "inf|normal|subnormal|zero" == !"nan".
		bool IsInverted = false;
		if (unsigned InvertedCheck = checkInvertedFPClass(Check)) {
		IsInverted = true;
		Check = InvertedCheck;
		}

		// Mask irrelevant bits in FPSW.
		SDValue ClassCCWithoutSign =
		DAG.getNode(ISD::AND, DL, MVT::i16, FPSW,
		DAG.getConstant(X86::AllClassBits, DL, MVT::i16));
		FPSW = DAG.getNode(ISD::AND, DL, MVT::i16, FPSW,
		DAG.getConstant(X86::AllCondCodeBits, DL, MVT::i16));

		// Process the simple cases, when only one compare is required for
		// classification.
		SDValue CCToCompare;
		switch (Check) {
		default:
		break;
		case ISD::fcZero:
		case ISD::fcSubnormal:
		case ISD::fcNormal:
		case ISD::fcInf:
		case ISD::fcNan:
		CCToCompare = ClassCCWithoutSign;
		break;
		case ISD::fcFinite:
		// isfinite == !(isnan || isinf)
		case ISD::fcInf | ISD::fcNan:
		// isnan == (C3 == 0, C2 == 0, C0 == 1)
		// isinf == (C3 == 0, C2 == 1, C0 == 1)
		case ISD::fcZero | ISD::fcSubnormal:
		// iszero == (C3 == 1, C2 == 0, C0 == 0)
		// issubnormal == (C3 == 1, C2 == 1, C0 == 0)
		CCToCompare = DAG.getNode(ISD::AND, DL, MVT::i16, FPSW,
		DAG.getConstant(X86::C3 | X86::C0, DL, MVT::i16));
		break;
		}
		unsigned ExpectedCC = 0;
		switch (Check) {
		default:
		break;
		case ISD::fcFinite:
		// isfinite == !(isnan || isinf)
		IsInverted = !IsInverted;
		LLVM_FALLTHROUGH;
		case ISD::fcInf | ISD::fcNan:
		// For this test the bit C2 is cleared, so check for NaN will check for Inf
		// also.
		ExpectedCC = X86::ClassNaN;
		break;
		case ISD::fcZero:
		case ISD::fcZero | ISD::fcSubnormal:
		ExpectedCC = X86::ClassZero;
		break;
		case ISD::fcSubnormal:
		ExpectedCC = X86::ClassSubnormal;
		break;
		case ISD::fcNormal:
		ExpectedCC = X86::ClassNormal;
		break;
		case ISD::fcInf:
		ExpectedCC = X86::ClassInf;
		break;
		case ISD::fcNan:
		// For compatibility with glibc treat unsupported formats as NaN.
		return DAG.getSetCC(DL, ResultVT, ClassCCWithoutSign,
		DAG.getConstant(X86::ClassNaN, DL, MVT::i16),
		IsInverted ? ISD::SETGT : ISD::SETLE);
		}
		if (ExpectedCC)
		return DAG.getSetCC(DL, ResultVT, CCToCompare,
		DAG.getConstant(ExpectedCC, DL, MVT::i16),
		IsInverted ? ISD::SETNE : ISD::SETEQ);
		assert(!CCToCompare);

		// The general case is implemented as series of checks.
		SDValue Res;
		unsigned PartialCheck;

		PartialCheck = Check & ISD::fcNan;
		if (PartialCheck) {
		Res =
		DAG.getSetCC(DL, ResultVT, ClassCCWithoutSign,
		DAG.getConstant(X86::ClassNaN, DL, MVT::i16), ISD::SETEQ);
		if (PartialCheck == ISD::fcSNan || PartialCheck == ISD::fcQNan) {
		// FXAM does not provide information whether a NaN value is signaling, so
		// to distinguish between signaling and quiet NaNs we need to analyze
		// mantissa.

		// Store FP value to stack.
		SDValue Chain = DAG.getEntryNode();
		MachineFunction &MF = DAG.getMachineFunction();
		Align StackAlign = Subtarget.getFrameLowering()->getStackAlign();
		unsigned SSFISize =
		alignTo(Arg.getValueType().getStoreSize(), StackAlign);
		int SSFI =
		MF.getFrameInfo().CreateStackObject(SSFISize, StackAlign, false);
		auto PtrVT = getPointerTy(MF.getDataLayout());
		SDValue StackSlot = DAG.getFrameIndex(SSFI, PtrVT);
		MachineMemOperand *StoreMMO = MF.getMachineMemOperand(
		MachinePointerInfo::getFixedStack(DAG.getMachineFunction(), SSFI),
		MachineMemOperand::MOStore, SSFISize, StackAlign);

		Chain = DAG.getStore(Chain, DL, Arg, StackSlot, StoreMMO);

		// Load the high half of the mantissa.
		SDValue OffsetSlot =
		DAG.getMemBasePlusOffset(StackSlot, TypeSize::Fixed(4), DL);
		SDValue AsInt = DAG.getLoad(
		MVT::i32, DL, Chain, OffsetSlot,
		MachinePointerInfo::getFixedStack(MF, SSFI).getWithOffset(4));

		// Check the quiet NaN bit.
		APInt QNaNBitMask(32, 0x40000000);
		SDValue QNaNBitMaskV = DAG.getConstant(QNaNBitMask, DL, MVT::i32);
		SDValue QNaNBitV =
		DAG.getNode(ISD::AND, DL, MVT::i32, AsInt, QNaNBitMaskV);
		ISD::CondCode CC = (PartialCheck == ISD::fcSNan) ? ISD::CondCode::SETEQ
		: ISD::CondCode::SETNE;
		SDValue QNaNBitTest = DAG.getSetCC(DL, ResultVT, QNaNBitV,
		DAG.getConstant(0, DL, MVT::i32), CC);
		Res = DAG.getNode(ISD::AND, DL, ResultVT, Res, QNaNBitTest);
		}
		}

		PartialCheck = Check & ISD::fcInf;
		if (PartialCheck) {
		SDValue PartialRes;
		if (PartialCheck == ISD::fcPosInf) {
		PartialRes = DAG.getSetCC(DL, ResultVT, FPSW,
		DAG.getConstant(X86::ClassInf, DL, MVT::i16),
		ISD::SETEQ);
		} else if (PartialCheck == ISD::fcNegInf) {
		PartialRes = DAG.getSetCC(
		DL, ResultVT, FPSW,
		DAG.getConstant(X86::ClassInf | X86::Negative, DL, MVT::i16),
		ISD::SETEQ);
		} else {
		PartialRes = DAG.getSetCC(DL, ResultVT, ClassCCWithoutSign,
		DAG.getConstant(X86::ClassInf, DL, MVT::i16),
		ISD::SETEQ);
		}
		if (Res)
		Res = DAG.getNode(ISD::OR, DL, ResultVT, Res, PartialRes);
		else
		Res = PartialRes;
		}

		PartialCheck = Check & ISD::fcNormal;
		if (PartialCheck) {
		SDValue PartialRes;
		if (PartialCheck == ISD::fcPosNormal) {
		PartialRes = DAG.getSetCC(DL, ResultVT, FPSW,
		DAG.getConstant(X86::ClassNormal, DL, MVT::i16),
		ISD::SETEQ);
		} else if (PartialCheck == ISD::fcNegNormal) {
		PartialRes = DAG.getSetCC(
		DL, ResultVT, FPSW,
		DAG.getConstant(X86::ClassNormal | X86::Negative, DL, MVT::i16),
		ISD::SETEQ);
		} else {
		PartialRes = DAG.getSetCC(DL, ResultVT, ClassCCWithoutSign,
		DAG.getConstant(X86::ClassNormal, DL, MVT::i16),
		ISD::SETEQ);
		}
		if (Res)
		Res = DAG.getNode(ISD::OR, DL, ResultVT, Res, PartialRes);
		else
		Res = PartialRes;
		}

		PartialCheck = Check & ISD::fcSubnormal;
		if (PartialCheck) {
		SDValue PartialRes;
		if (PartialCheck == ISD::fcPosSubnormal) {
		PartialRes = DAG.getSetCC(
		DL, ResultVT, FPSW,
		DAG.getConstant(X86::ClassSubnormal, DL, MVT::i16), ISD::SETEQ);
		} else if (PartialCheck == ISD::fcNegSubnormal) {
		PartialRes = DAG.getSetCC(
		DL, ResultVT, FPSW,
		DAG.getConstant(X86::ClassSubnormal | X86::Negative, DL, MVT::i16),
		ISD::SETEQ);
		} else {
		PartialRes = DAG.getSetCC(
		DL, ResultVT, ClassCCWithoutSign,
		DAG.getConstant(X86::ClassSubnormal, DL, MVT::i16), ISD::SETEQ);
		}
		if (Res)
		Res = DAG.getNode(ISD::OR, DL, ResultVT, Res, PartialRes);
		else
		Res = PartialRes;
		}

		PartialCheck = Check & ISD::fcZero;
		if (PartialCheck) {
		SDValue PartialRes;
		if (PartialCheck == ISD::fcPosZero) {
		PartialRes = DAG.getSetCC(DL, ResultVT, FPSW,
		DAG.getConstant(X86::ClassZero, DL, MVT::i16),
		ISD::SETEQ);
		} else if (PartialCheck == ISD::fcNegZero) {
		PartialRes = DAG.getSetCC(
		DL, ResultVT, FPSW,
		DAG.getConstant(X86::ClassZero | X86::Negative, DL, MVT::i16),
		ISD::SETEQ);
		} else {
		PartialRes = DAG.getSetCC(DL, ResultVT, ClassCCWithoutSign,
		DAG.getConstant(X86::ClassZero, DL, MVT::i16),
		ISD::SETEQ);
		}
		if (Res)
		Res = DAG.getNode(ISD::OR, DL, ResultVT, Res, PartialRes);
		else
		Res = PartialRes;
		}

		if (IsInverted)
		Res = DAG.getSetCC(DL, ResultVT, Res, DAG.getConstant(0, DL, ResultVT),
		ISD::SETEQ);
		return Res;
		}

/// Helper for creating a X86ISD::SETCC node.		/// Helper for creating a X86ISD::SETCC node.
static SDValue getSETCC(X86::CondCode Cond, SDValue EFLAGS, const SDLoc &dl,		static SDValue getSETCC(X86::CondCode Cond, SDValue EFLAGS, const SDLoc &dl,
SelectionDAG &DAG) {		SelectionDAG &DAG) {
return DAG.getNode(X86ISD::SETCC, dl, MVT::i8,		return DAG.getNode(X86ISD::SETCC, dl, MVT::i8,
DAG.getTargetConstant(Cond, dl, MVT::i8), EFLAGS);		DAG.getTargetConstant(Cond, dl, MVT::i8), EFLAGS);
}		}

/// Helper for matching OR(EXTRACTELT(X,0),OR(EXTRACTELT(X,1),...))		/// Helper for matching OR(EXTRACTELT(X,0),OR(EXTRACTELT(X,1),...))
[...]	SDValue X86TargetLowering::LowerOperation(SDValue Op, SelectionDAG &DAG) const {
case ISD::FADD:		case ISD::FADD:
case ISD::FSUB: return lowerFaddFsub(Op, DAG);		case ISD::FSUB: return lowerFaddFsub(Op, DAG);
case ISD::STRICT_FROUND:		case ISD::STRICT_FROUND:
case ISD::FROUND: return LowerFROUND(Op, DAG);		case ISD::FROUND: return LowerFROUND(Op, DAG);
case ISD::FABS:		case ISD::FABS:
case ISD::FNEG: return LowerFABSorFNEG(Op, DAG);		case ISD::FNEG: return LowerFABSorFNEG(Op, DAG);
case ISD::FCOPYSIGN: return LowerFCOPYSIGN(Op, DAG);		case ISD::FCOPYSIGN: return LowerFCOPYSIGN(Op, DAG);
case ISD::FGETSIGN: return LowerFGETSIGN(Op, DAG);		case ISD::FGETSIGN: return LowerFGETSIGN(Op, DAG);
		case ISD::IS_FPCLASS: return lowerIS_FPCLASS(Op, DAG);
case ISD::LRINT:		case ISD::LRINT:
case ISD::LLRINT: return LowerLRINT_LLRINT(Op, DAG);		case ISD::LLRINT: return LowerLRINT_LLRINT(Op, DAG);
case ISD::SETCC:		case ISD::SETCC:
case ISD::STRICT_FSETCC:		case ISD::STRICT_FSETCC:
case ISD::STRICT_FSETCCS: return LowerSETCC(Op, DAG);		case ISD::STRICT_FSETCCS: return LowerSETCC(Op, DAG);
case ISD::SETCCCARRY: return LowerSETCCCARRY(Op, DAG);		case ISD::SETCCCARRY: return LowerSETCCCARRY(Op, DAG);
case ISD::SELECT: return LowerSELECT(Op, DAG);		case ISD::SELECT: return LowerSELECT(Op, DAG);
case ISD::BRCOND: return LowerBRCOND(Op, DAG);		case ISD::BRCOND: return LowerBRCOND(Op, DAG);
case ISD::JumpTable: return LowerJumpTable(Op, DAG);		case ISD::JumpTable: return LowerJumpTable(Op, DAG);
case ISD::VASTART: return LowerVASTART(Op, DAG);		case ISD::VASTART: return LowerVASTART(Op, DAG);
case ISD::VAARG: return LowerVAARG(Op, DAG);		case ISD::VAARG: return LowerVAARG(Op, DAG);
case ISD::VACOPY: return LowerVACOPY(Op, Subtarget, DAG);		case ISD::VACOPY: return LowerVACOPY(Op, Subtarget, DAG);
case ISD::INTRINSIC_WO_CHAIN: return LowerINTRINSIC_WO_CHAIN(Op, DAG);		case ISD::INTRINSIC_WO_CHAIN: return LowerINTRINSIC_WO_CHAIN(Op, DAG);
case ISD::INTRINSIC_VOID:		case ISD::INTRINSIC_VOID:
case ISD::INTRINSIC_W_CHAIN: return LowerINTRINSIC_W_CHAIN(Op, Subtarget, DAG);		case ISD::INTRINSIC_W_CHAIN: return LowerINTRINSIC_W_CHAIN(Op, Subtarget, DAG);
case ISD::RETURNADDR: return LowerRETURNADDR(Op, DAG);		case ISD::RETURNADDR: return LowerRETURNADDR(Op, DAG);
case ISD::ADDROFRETURNADDR: return LowerADDROFRETURNADDR(Op, DAG);		case ISD::ADDROFRETURNADDR: return LowerADDROFRETURNADDR(Op, DAG);
case ISD::FRAMEADDR: return LowerFRAMEADDR(Op, DAG);		case ISD::FRAMEADDR: return LowerFRAMEADDR(Op, DAG);
case ISD::FRAME_TO_ARGS_OFFSET:		case ISD::FRAME_TO_ARGS_OFFSET:
▲ Show 20 Lines • Show All 22,893 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/x86-is_fpclass-fp80.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
				; RUN: llc < %s -mtriple=i686-linux | FileCheck %s -check-prefix=CHECK-32
				; RUN: llc < %s -mtriple=x86_64-linux | FileCheck %s -check-prefix=CHECK-64

				define i1 @isnan_f(x86_fp80 %x) {
				; CHECK-32-LABEL: isnan_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fucomp %st(0)
				pengfei: Where's fucomp generated from? I didn't see it in the code.
				sepavloff (author): It appears as a result of the call `DAG.getSetCC(DL, ResultVT, Arg, Arg, ISD::CondCode::SETUNE)`, which is called at the beginning of `lowerIS_FPCLASS` if exceptions are ignored.
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: # kill: def $ah killed $ah killed $ax
				; CHECK-32-NEXT: sahf
				; CHECK-32-NEXT: setp %cl
				; CHECK-32-NEXT: setne %al
				; CHECK-32-NEXT: orb %cl, %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: isnan_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fucompi %st(0), %st
				pengfei: Why `fucompi`?
				sepavloff (author): It is also the result of `getSetCC`. This code must be generated for unordered comparison.
				; CHECK-64-NEXT: setp %cl
				; CHECK-64-NEXT: setne %al
				; CHECK-64-NEXT: orb %cl, %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"nan")
				ret i1 %0
				}
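
The `setp`/`setne`/`orb` sequence checked above computes "unordered or not equal" for a value compared with itself, which is what `SETUNE` on `(Arg, Arg)` asks for. A minimal C++ sketch of the same idea follows. It assumes default floating-point semantics (no `-ffast-math`), and `isNaNBySelfCompare` is a hypothetical name, not something taken from the patch.

```cpp
#include <cassert>
#include <limits>

// Hypothetical helper, not part of the patch. A value compares unequal to
// itself only when the comparison is unordered, that is, when it is a NaN.
// Assumes the compiler honours IEEE comparison semantics (no -ffast-math).
inline bool isNaNBySelfCompare(long double X) { return X != X; }

// Small self-check covering a NaN, an infinity and an ordinary value.
inline void checkSelfCompare() {
  using Lim = std::numeric_limits<long double>;
  assert(isNaNBySelfCompare(Lim::quiet_NaN()));
  assert(!isNaNBySelfCompare(Lim::infinity()));
  assert(!isNaNBySelfCompare(1.0L));
}
```

On targets where `long double` is the 80-bit x87 type, a compiler may lower this comparison to an `fucomp` or `fucompi` sequence similar to the checked one, though the exact instructions depend on the target and flags.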

				define i1 @isnot_nan_f(x86_fp80 %x) {
				; CHECK-32-LABEL: isnot_nan_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-32-NEXT: cmpl $257, %eax # imm = 0x101
				; CHECK-32-NEXT: setge %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: isnot_nan_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-64-NEXT: cmpl $257, %eax # imm = 0x101
				; CHECK-64-NEXT: setge %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"zero|subnormal|normal|inf")
				ret i1 %0
				}
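
The constants in these checks can be read against the FXAM condition-code encoding: C0, C2 and C3 sit at bits 8, 10 and 14 of the x87 status word, so the mask `0x4500` keeps only the class bits. The sketch below decodes them and models the `cmpl $257` / `setge` test from `isnot_nan_f`. All names are illustrative and are not taken from the patch.

```cpp
#include <cassert>
#include <cstdint>

// Illustrative names; the patch may organise these constants differently.
enum class FxamClass { Unsupported, NaN, Normal, Infinity, Zero, Empty, Denormal };

constexpr std::uint16_t C0 = 0x0100, C2 = 0x0400, C3 = 0x4000;
constexpr std::uint16_t ClassMask = C3 | C2 | C0; // 0x4500

// Decode the class that FXAM leaves in the status word.
constexpr FxamClass decodeFxam(std::uint16_t StatusWord) {
  switch (StatusWord & ClassMask) {
  case 0x0100: return FxamClass::NaN;
  case 0x0400: return FxamClass::Normal;
  case 0x0500: return FxamClass::Infinity;
  case 0x4000: return FxamClass::Zero;
  case 0x4100: return FxamClass::Empty;
  case 0x4400: return FxamClass::Denormal;
  default:     return FxamClass::Unsupported; // 0x0000; 0x4500 is lumped in here as an assumption
  }
}

// Model of "andl $0x4500; cmpl $0x101; setge": every encoding above the NaN
// value counts as "not NaN"; Unsupported (0x0000) and NaN (0x0100) do not.
constexpr bool isNotNaNFromFxam(std::uint16_t StatusWord) {
  return (StatusWord & ClassMask) >= 0x0101;
}

static_assert(decodeFxam(0x0500) == FxamClass::Infinity, "C2|C0 means infinity");
static_assert(!isNotNaNFromFxam(0x0100), "NaN encoding is rejected");
```

The mask also discards unrelated status bits such as the stack-top field, so a raw status word like `0x3D00` decodes the same way as `0x0500`.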

				define i1 @issignaling_f(x86_fp80 %x) {
				; CHECK-32-LABEL: issignaling_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: subl $28, %esp
				; CHECK-32-NEXT: .cfi_def_cfa_offset 32
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fld %st(0)
				; CHECK-32-NEXT: fstpt (%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-32-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-32-NEXT: sete %cl
				; CHECK-32-NEXT: testl $1073741824, {{[0-9]+}}(%esp) # imm = 0x40000000
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: andb %cl, %al
				; CHECK-32-NEXT: addl $28, %esp
				; CHECK-32-NEXT: .cfi_def_cfa_offset 4
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: issignaling_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fld %st(0)
				; CHECK-64-NEXT: fstpt -{{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-64-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-64-NEXT: sete %cl
				; CHECK-64-NEXT: testl $1073741824, -{{[0-9]+}}(%rsp) # imm = 0x40000000
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: andb %cl, %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"snan")
				ret i1 %0
				}

				define i1 @isnot_signaling_f(x86_fp80 %x) {
				; CHECK-32-LABEL: isnot_signaling_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: subl $28, %esp
				; CHECK-32-NEXT: .cfi_def_cfa_offset 32
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fld %st(0)
				; CHECK-32-NEXT: fstpt (%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-32-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: testl $1073741824, {{[0-9]+}}(%esp) # imm = 0x40000000
				; CHECK-32-NEXT: sete %cl
				; CHECK-32-NEXT: testb %cl, %al
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: addl $28, %esp
				; CHECK-32-NEXT: .cfi_def_cfa_offset 4
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: isnot_signaling_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fld %st(0)
				; CHECK-64-NEXT: fstpt -{{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-64-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: testl $1073741824, -{{[0-9]+}}(%rsp) # imm = 0x40000000
				; CHECK-64-NEXT: sete %cl
				; CHECK-64-NEXT: testb %cl, %al
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"zero|subnormal|normal|inf|qnan")
				ret i1 %0
				}

				define i1 @isquiet_f(x86_fp80 %x) {
				; CHECK-32-LABEL: isquiet_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: subl $28, %esp
				; CHECK-32-NEXT: .cfi_def_cfa_offset 32
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fld %st(0)
				; CHECK-32-NEXT: fstpt (%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-32-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-32-NEXT: sete %cl
				; CHECK-32-NEXT: testl $1073741824, {{[0-9]+}}(%esp) # imm = 0x40000000
				; CHECK-32-NEXT: setne %al
				; CHECK-32-NEXT: andb %cl, %al
				; CHECK-32-NEXT: addl $28, %esp
				; CHECK-32-NEXT: .cfi_def_cfa_offset 4
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: isquiet_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fld %st(0)
				; CHECK-64-NEXT: fstpt -{{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-64-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-64-NEXT: sete %cl
				; CHECK-64-NEXT: testl $1073741824, -{{[0-9]+}}(%rsp) # imm = 0x40000000
				; CHECK-64-NEXT: setne %al
				; CHECK-64-NEXT: andb %cl, %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"qnan")
				ret i1 %0
				}
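
The signaling and quiet checks combine two facts: FXAM reports the NaN class (masked value `0x100`), and the `testl $0x40000000` reads one bit from the spilled copy of the value. If that `testl` addresses the dword at byte offset 4 of the stored `x86_fp80` (the offset is hidden behind the FileCheck regex, so this is an assumption), the tested bit is bit 62 of the 64-bit significand, the quiet bit. A sketch of the logic on plain integers, with hypothetical names:

```cpp
#include <cassert>
#include <cstdint>

// Bit 30 of the upper significand dword, i.e. bit 62 of the significand.
constexpr std::uint32_t QuietBitInHighDword = 0x40000000;

// Hypothetical model: IsNaNClass stands for "FXAM reported the NaN class".
constexpr bool quietBitSet(std::uint64_t Significand) {
  return (static_cast<std::uint32_t>(Significand >> 32) & QuietBitInHighDword) != 0;
}

// issignaling_f: "sete %cl" (NaN class) AND "sete %al" (quiet bit clear).
constexpr bool isSignalingNaN(bool IsNaNClass, std::uint64_t Significand) {
  return IsNaNClass && !quietBitSet(Significand);
}

// isquiet_f: "sete %cl" (NaN class) AND "setne %al" (quiet bit set).
constexpr bool isQuietNaN(bool IsNaNClass, std::uint64_t Significand) {
  return IsNaNClass && quietBitSet(Significand);
}

static_assert(isQuietNaN(true, 0xC000000000000000ULL), "bit 62 set");
static_assert(isSignalingNaN(true, 0xA000000000000000ULL), "bit 62 clear");
```

`isnot_signaling_f` is the negation of the first predicate: its `testb %cl, %al` / `sete %al` pair yields true unless both conditions hold.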

				define i1 @isinf_f(x86_fp80 %x) {
				; CHECK-32-LABEL: isinf_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-32-NEXT: cmpl $1280, %eax # imm = 0x500
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: isinf_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-64-NEXT: cmpl $1280, %eax # imm = 0x500
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"inf")
				ret i1 %0
				}

				define i1 @is_plus_inf_f(x86_fp80 %x) {
				; CHECK-32-LABEL: is_plus_inf_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $18176, %eax # imm = 0x4700
				; CHECK-32-NEXT: cmpl $1280, %eax # imm = 0x500
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: is_plus_inf_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $18176, %eax # imm = 0x4700
				; CHECK-64-NEXT: cmpl $1280, %eax # imm = 0x500
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"+inf")
				ret i1 %0
				}

				define i1 @is_minus_inf_f(x86_fp80 %x) {
				; CHECK-32-LABEL: is_minus_inf_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $18176, %eax # imm = 0x4700
				; CHECK-32-NEXT: cmpl $1792, %eax # imm = 0x700
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: is_minus_inf_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $18176, %eax # imm = 0x4700
				; CHECK-64-NEXT: cmpl $1792, %eax # imm = 0x700
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"-inf")
				ret i1 %0
				}

				define i1 @isfinite_f(x86_fp80 %x) {
				; CHECK-32-LABEL: isfinite_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $16640, %eax # imm = 0x4100
				; CHECK-32-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-32-NEXT: setne %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: isfinite_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $16640, %eax # imm = 0x4100
				; CHECK-64-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-64-NEXT: setne %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"finite")
				ret i1 %0
				}
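
`isfinite_f` masks with `0x4100` (C3 and C0 only) and tests for a result different from `0x100`. Dropping C2 from the mask makes the NaN encoding (`0x100`) and the Infinity encoding (`0x500`) collapse to the same value, so a single comparison rejects both. A small sketch that checks this against each FXAM class encoding; the function and constant names are hypothetical:

```cpp
#include <cassert>
#include <cstdint>

// FXAM class encodings in the status word (C3 = 0x4000, C2 = 0x0400, C0 = 0x0100).
constexpr std::uint16_t NaNBits = 0x0100, NormalBits = 0x0400, InfBits = 0x0500,
                        ZeroBits = 0x4000, DenormalBits = 0x4400;

// Model of "andl $0x4100; cmpl $0x100; setne".
constexpr bool isFiniteFromFxam(std::uint16_t StatusWord) {
  return (StatusWord & 0x4100) != 0x0100;
}

static_assert(!isFiniteFromFxam(NaNBits), "NaN is not finite");
static_assert(!isFiniteFromFxam(InfBits), "infinity is not finite");
static_assert(isFiniteFromFxam(NormalBits), "normal values are finite");
static_assert(isFiniteFromFxam(ZeroBits), "zero is finite");
static_assert(isFiniteFromFxam(DenormalBits), "denormals are finite");
```

Under this formula the Unsupported (`0x0000`) and Empty (`0x4100`) encodings also pass the test, since neither masks down to `0x100`.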

				define i1 @is_plus_finite_f(x86_fp80 %x) {
				; CHECK-32-LABEL: is_plus_finite_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $18176, %eax # imm = 0x4700
				; CHECK-32-NEXT: cmpl $17408, %eax # imm = 0x4400
				; CHECK-32-NEXT: sete %cl
				; CHECK-32-NEXT: cmpl $1024, %eax # imm = 0x400
				; CHECK-32-NEXT: sete %dl
				; CHECK-32-NEXT: cmpl $16384, %eax # imm = 0x4000
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: orb %cl, %al
				; CHECK-32-NEXT: orb %dl, %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: is_plus_finite_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $18176, %eax # imm = 0x4700
				; CHECK-64-NEXT: cmpl $17408, %eax # imm = 0x4400
				; CHECK-64-NEXT: sete %cl
				; CHECK-64-NEXT: cmpl $1024, %eax # imm = 0x400
				; CHECK-64-NEXT: sete %dl
				; CHECK-64-NEXT: cmpl $16384, %eax # imm = 0x4000
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: orb %cl, %al
				; CHECK-64-NEXT: orb %dl, %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"+finite")
				ret i1 %0
				}

				define i1 @isnormal_f(x86_fp80 %x) {
				; CHECK-32-LABEL: isnormal_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-32-NEXT: cmpl $1024, %eax # imm = 0x400
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: isnormal_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-64-NEXT: cmpl $1024, %eax # imm = 0x400
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"normal")
				ret i1 %0
				}

				define i1 @issubnormal_f(x86_fp80 %x) {
				; CHECK-32-LABEL: issubnormal_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-32-NEXT: cmpl $17408, %eax # imm = 0x4400
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: issubnormal_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-64-NEXT: cmpl $17408, %eax # imm = 0x4400
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"subnormal")
				ret i1 %0
				}

				define i1 @iszero_f(x86_fp80 %x) {
				; CHECK-32-LABEL: iszero_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-32-NEXT: cmpl $16384, %eax # imm = 0x4000
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: iszero_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-64-NEXT: cmpl $16384, %eax # imm = 0x4000
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"zero")
				ret i1 %0
				}

				define i1 @is_minus_zero_f(x86_fp80 %x) {
				; CHECK-32-LABEL: is_minus_zero_f:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $18176, %eax # imm = 0x4700
				; CHECK-32-NEXT: cmpl $16896, %eax # imm = 0x4200
				; CHECK-32-NEXT: sete %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: is_minus_zero_f:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $18176, %eax # imm = 0x4700
				; CHECK-64-NEXT: cmpl $16896, %eax # imm = 0x4200
				; CHECK-64-NEXT: sete %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"-zero")
				ret i1 %0
				}
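
The sign-sensitive tests (`+inf`, `-inf`, `+finite`, `-zero`) widen the mask from `0x4500` to `0x4700`. The extra bit is C1 (`0x200`), where FXAM stores the sign of the examined value, so the same class encodings appear with `0x200` added for negative inputs. A sketch of the comparisons used in the checks, with hypothetical names:

```cpp
#include <cassert>
#include <cstdint>

constexpr std::uint16_t SignBit = 0x0200;         // C1: sign of the value
constexpr std::uint16_t SignedClassMask = 0x4700; // C3 | C2 | C1 | C0

// Class bits plus sign, as left in %eax after "andl $0x4700".
constexpr int signedClass(std::uint16_t StatusWord) {
  return StatusWord & SignedClassMask;
}

constexpr bool isPlusInf(std::uint16_t SW) { return signedClass(SW) == 0x0500; }
constexpr bool isMinusInf(std::uint16_t SW) { return signedClass(SW) == (0x0500 | SignBit); }
constexpr bool isMinusZero(std::uint16_t SW) { return signedClass(SW) == (0x4000 | SignBit); }

// is_plus_finite_f ORs three equality tests: +denormal, +normal, +zero.
constexpr bool isPlusFinite(std::uint16_t SW) {
  return signedClass(SW) == 0x4400 || signedClass(SW) == 0x0400 ||
         signedClass(SW) == 0x4000;
}

static_assert(isMinusInf(0x0700), "matches cmpl $1792");
static_assert(isMinusZero(0x4200), "matches cmpl $16896");
```

Because the sign bit is part of the compared value, a negative normal number (`0x0600`) fails all three `+finite` comparisons without any separate sign test.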



				define i1 @isnan_f_strictfp(x86_fp80 %x) strictfp {
				; CHECK-32-LABEL: isnan_f_strictfp:
				; CHECK-32: # %bb.0: # %entry
				; CHECK-32-NEXT: fldt {{[0-9]+}}(%esp)
				; CHECK-32-NEXT: fxam
				; CHECK-32-NEXT: fnstsw %ax
				; CHECK-32-NEXT: fstp %st(0)
				; CHECK-32-NEXT: wait
				; CHECK-32-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-32-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-32-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-32-NEXT: setle %al
				; CHECK-32-NEXT: retl
				;
				; CHECK-64-LABEL: isnan_f_strictfp:
				; CHECK-64: # %bb.0: # %entry
				; CHECK-64-NEXT: fldt {{[0-9]+}}(%rsp)
				; CHECK-64-NEXT: fxam
				; CHECK-64-NEXT: fnstsw %ax
				; CHECK-64-NEXT: fstp %st(0)
				; CHECK-64-NEXT: wait
				; CHECK-64-NEXT: # kill: def $ax killed $ax def $eax
				; CHECK-64-NEXT: andl $17664, %eax # imm = 0x4500
				; CHECK-64-NEXT: cmpl $256, %eax # imm = 0x100
				; CHECK-64-NEXT: setle %al
				; CHECK-64-NEXT: retq
				entry:
				%0 = tail call i1 @llvm.is.fpclass.f80(x86_fp80 %x, metadata !"nan")
				ret i1 %0
				}
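
In the `strictfp` variant the NaN test goes through `fxam` rather than a self-comparison, which is consistent with the author's note that the `getSetCC` path is taken only when exceptions are ignored. The checked `cmpl $256` / `setle` pair accepts the masked values `0x000` and `0x100`, that is, the Unsupported and NaN encodings. A sketch with a hypothetical function name:

```cpp
#include <cassert>
#include <cstdint>

// Model of "andl $0x4500; cmpl $0x100; setle" from isnan_f_strictfp.
// Only the Unsupported (0x0000) and NaN (0x0100) encodings are <= 0x100.
constexpr bool isNaNStrictFromFxam(std::uint16_t StatusWord) {
  return (StatusWord & 0x4500) <= 0x0100;
}

static_assert(isNaNStrictFromFxam(0x0100), "NaN encoding");
static_assert(isNaNStrictFromFxam(0x0000), "Unsupported encoding is accepted too");
static_assert(!isNaNStrictFromFxam(0x0400), "normal values are rejected");
```

This is the exact complement of the `cmpl $257` / `setge` test in `isnot_nan_f`, so the two functions partition the seven FXAM encodings the same way.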

				declare i1 @llvm.is.fpclass.f80(x86_fp80, metadata)

[X86] Custom lowering of llvm.is_fpclass for x86_fp80 (Needs Review, Public)

Revision Contents

Diff 385523

llvm/lib/Target/X86/MCTargetDesc/X86BaseInfo.h

llvm/lib/Target/X86/X86ISelLowering.h

llvm/lib/Target/X86/X86ISelLowering.cpp

llvm/test/CodeGen/X86/x86-is_fpclass-fp80.ll