This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/CodeGen/
-
CodeGen/
-
CGExprScalar.cpp
-
test/CodeGen/
-
CodeGen/
-
fpconstrained-cmp.c
-
llvm/include/llvm/IR/
-
include/
-
llvm/
-
IR/
1
IRBuilder.h

Differential D71467

[FPEnv] Generate constrained FP comparisons from clang
ClosedPublic

Authored by uweigand on Dec 13 2019, 7:05 AM.

Download Raw Diff

Details

Reviewers

kpn
andrew.w.kaylor
craig.topper
cameron.mcinally
RKSimon
spatel
rjmccall
rsmith

Commits

rG76e9c2a9870e: [FPEnv] Generate constrained FP comparisons from clang

Summary

Update the IRBuilder to generate constrained FP comparisons in CreateFCmp when IsFPConstrained is true, similar to the other places in the IRBuilder.

Also, add a new CreateFCmpS to emit signaling FP comparisons, and use it in clang where comparisons are supposed to be signaling (currently, only when emitting code for the <, <=, >, >= operators). Most other places are supposed to emit quiet comparisons, including the equality comparisons, the various builtins like isless, and uses of floating-point values in boolean contexts. A few places that I haven't touched may need some extra thought (e.g. are comparisons implicitly generated to implement sanitizer checks supposed to be signaling?).

I've noticed two potential problems while implementing this:

There is currently no way to add fast-math flags to a constrained FP comparison, since this is implemented as an intrinsic call that returns a boolean type, and FMF are only allowed for calls returning a floating-point type. However, given the discussion around https://bugs.llvm.org/show_bug.cgi?id=42179, it seems that FCmp itself really shouldn't have any FMF either, so this is probably OK.

Using builtins like __builtin_isless on a "float" type will implicitly convert the float argument to a double; apparently this is because the builtin is declared as having a variable argument list? In any case, that means that even though a quiet comparison is generated, the semantics still isn't correct for quiet NaNs, as that implicit conversion will already signal an exception. This probably needs to be fixed, but I guess that can be done as a separate patch.

Diff Detail

Event Timeline

uweigand created this revision.Dec 13 2019, 7:05 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptDec 13 2019, 7:05 AM

Herald added subscribers: llvm-commits, cfe-commits. · View Herald Transcript

The bug with __builtin_isless should be a really easy fix; the builtin just needs to be flagged as having custom type-checking, and then we need to make sure we do appropriate promotions on the arguments (but we probably do).

In D71467#1784286, @rjmccall wrote:

The bug with __builtin_isless should be a really easy fix; the builtin just needs to be flagged as having custom type-checking, and then we need to make sure we do appropriate promotions on the arguments (but we probably do).

I think I convinced @erichkeane to look at it on Monday.

In D71467#1784338, @craig.topper wrote:

In D71467#1784286, @rjmccall wrote:

The bug with __builtin_isless should be a really easy fix; the builtin just needs to be flagged as having custom type-checking, and then we need to make sure we do appropriate promotions on the arguments (but we probably do).

I think I convinced @erichkeane to look at it on Monday.

Everything but fpclassify is pretty trivial, I just needed to write a test but needed to go home. I'll commit that Monday when I get to it. Fpclassify will take a touch longer since the int arguments need to be dealt with, but that shouldn't be more than a little work.

pengfei added a subscriber: pengfei.Dec 15 2019, 6:14 PM

erichkeane mentioned this in rGf02d6dd6c7af: Fix floating point builtins to not promote float->double.Dec 16 2019, 7:20 AM

In D71467#1784398, @erichkeane wrote:

In D71467#1784338, @craig.topper wrote:

In D71467#1784286, @rjmccall wrote:

The bug with __builtin_isless should be a really easy fix; the builtin just needs to be flagged as having custom type-checking, and then we need to make sure we do appropriate promotions on the arguments (but we probably do).

I think I convinced @erichkeane to look at it on Monday.

Everything but fpclassify is pretty trivial, I just needed to write a test but needed to go home. I'll commit that Monday when I get to it. Fpclassify will take a touch longer since the int arguments need to be dealt with, but that shouldn't be more than a little work.

I did the compare operators that didn't work right, and will do a separate patch for the fp-classification type ones: f02d6dd6c7afc08f871a623c0411f2d77ed6acf8

Added float (f32) test cases.

In D71467#1785943, @erichkeane wrote:

I did the compare operators that didn't work right, and will do a separate patch for the fp-classification type ones: f02d6dd6c7afc08f871a623c0411f2d77ed6acf8

Thanks! Now I'm getting the correct output for the float test cases as well, and I've added them to the patch.

As to fp-classification, I think there is an additional complication here: according to IEEE and the proposed C2x standard, these builtins should never raise any exception, not even when receiving a signaling NaN as input. Strictly speaking, this means that they cannot possibly be implemented in terms of any comparison operation.

Now, on SystemZ (and many other platforms, I think) there are in fact specialized instructions that will implement the required semantics without raising any exceptions, but there seems to be no way to represent those at the LLVM IR level. We'll probably need some extensions here (some new IR-level builtins?) ...

(But I'd say that problem is unrelated to this patch, so I'd prefer to decouple that problem from the question of whether this patch is the right solution for comparisons.)

In D71467#1786188, @uweigand wrote:

In D71467#1785943, @erichkeane wrote:

I did the compare operators that didn't work right, and will do a separate patch for the fp-classification type ones: f02d6dd6c7afc08f871a623c0411f2d77ed6acf8

Thanks! Now I'm getting the correct output for the float test cases as well, and I've added them to the patch.

As to fp-classification, I think there is an additional complication here: according to IEEE and the proposed C2x standard, these builtins should never raise any exception, not even when receiving a signaling NaN as input. Strictly speaking, this means that they cannot possibly be implemented in terms of any comparison operation.

Now, on SystemZ (and many other platforms, I think) there are in fact specialized instructions that will implement the required semantics without raising any exceptions, but there seems to be no way to represent those at the LLVM IR level. We'll probably need some extensions here (some new IR-level builtins?) ...

(But I'd say that problem is unrelated to this patch, so I'd prefer to decouple that problem from the question of whether this patch is the right solution for comparisons.)

__builtin_fpclassify/isfinite/isinf/isinf_sign/isnan/isnormal/signbit are all implemented the same as the OTHER ones, except there is a strange fixup step in SEMA that removes the float->double cast. It is IMO the wrong way to do it.

I don't think it would modify the IR at all or the AST, but I'm also working on removing that hack (which is what I meant by the fp-classification type ones).

I hope the work I've done already is sufficient to unblock this patch.

In D71467#1786192, @erichkeane wrote:

__builtin_fpclassify/isfinite/isinf/isinf_sign/isnan/isnormal/signbit are all implemented the same as the OTHER ones, except there is a strange fixup step in SEMA that removes the float->double cast. It is IMO the wrong way to do it.

I don't think it would modify the IR at all or the AST, but I'm also working on removing that hack (which is what I meant by the fp-classification type ones).

I hope the work I've done already is sufficient to unblock this patch.

Yes, this patch is no longer blocked, thanks!

What I was trying to say is that there is a fundamental difference between the comparison builtins like isless, isgreater, etc. and the classification builtins like isinf, isnan, etc.

The former should result in comparison instructions being generated, the only difference between the builtin and a regular "<" operator is that the builtin emits a quiet compare while the operator emits a signaling compare in strict mode.

However, the latter (classification macros) should not actually emit any comparison instructions in strict mode, because the classification macros may never trap, but all comparison instructions do. So the basic idea of implementing e.g. isinf(x) as "fabs(x) == infinity" (like the comment in CGBuiltin.cpp currently says) is fundamentally wrong in strict mode.

Ping?

Ping again.

LGTM

This revision is now accepted and ready to land.Jan 9 2020, 6:41 PM

Closed by commit rG76e9c2a9870e: [FPEnv] Generate constrained FP comparisons from clang (authored by uweigand). · Explain WhyJan 10 2020, 5:37 AM

This revision was automatically updated to reflect the committed changes.

Is this approach going to work with scope-local strictness? We need a way to do a comparison that has the non-strict properties but appears in a function that enables strictness elsewhere.

llvm/include/llvm/IR/IRBuilder.h
2342	Can you make a helper method for the common code in the non-constrained paths here? Please document the difference between these two methods.

uweigand mentioned this in rG6aca3e8dfa22: [FPEnv] Add some comments to IRBuilder.h.Jan 14 2020, 5:26 AM

In D71467#1817939, @rjmccall wrote:

Is this approach going to work with scope-local strictness? We need a way to do a comparison that has the non-strict properties but appears in a function that enables strictness elsewhere.

Well, just like for all the other FP builder methods, you can use the setIsFPConstrained method on the builder object to switch between strict and non-strict mode. Does this not suffice, or is there anything particular about the comparisons that would require anything extra?

Please document the difference between these two methods.

OK, checked in header file comments as 6aca3e8.

Can you make a helper method for the common code in the non-constrained paths here?

Would you prefer something like

private:
  Value *CreateFCmpHelper(CmpInst::Predicate P, Value *LHS, Value *RHS,
                          const Twine &Name, MDNode *FPMathTag) {
    if (auto *LC = dyn_cast<Constant>(LHS))
      if (auto *RC = dyn_cast<Constant>(RHS))
        return Insert(Folder.CreateFCmp(P, LC, RC), Name);
    return Insert(setFPAttrs(new FCmpInst(P, LHS, RHS), FPMathTag, FMF), Name);
  }

public:
  Value *CreateFCmp(CmpInst::Predicate P, Value *LHS, Value *RHS,
                    const Twine &Name = "", MDNode *FPMathTag = nullptr) {
    if (IsFPConstrained)
      return CreateConstrainedFPCmp(Intrinsic::experimental_constrained_fcmp,
                                    P, LHS, RHS, Name);

    return CreateFCmpHelper(P, LHS, RHS, Name, FPMathTag);
  }
  [...]

or rather something like:

private:
  Value *CreateFCmpHelper(CmpInst::Predicate P, Value *LHS, Value *RHS,
                          bool IsSignaling, const Twine &Name, MDNode *FPMathTag) {
    if (IsFPConstrained)
      return CreateConstrainedFPCmp(IsSignaling ? Intrinsic::experimental_constrained_fcmps
                                                : Intrinsic::experimental_constrained_fcmp,
                                    P, LHS, RHS, Name);

    if (auto *LC = dyn_cast<Constant>(LHS))
      if (auto *RC = dyn_cast<Constant>(RHS))
        return Insert(Folder.CreateFCmp(P, LC, RC), Name);
    return Insert(setFPAttrs(new FCmpInst(P, LHS, RHS), FPMathTag, FMF), Name);
  }

public:
  Value *CreateFCmp(CmpInst::Predicate P, Value *LHS, Value *RHS,
                    const Twine &Name = "", MDNode *FPMathTag = nullptr) {
    return CreateFCmpHelper(P, LHS, RHS, false, Name, FPMathTag);
  }
  [...]

or maybe simply have CreateFCmpS call CreateFCmp directly in the non-strict case?

Well, just like for all the other FP builder methods, you can use the setIsFPConstrained method on the builder object to switch between strict and non-strict mode. Does this not suffice, or is there anything particular about the comparisons that would require anything extra?

Ah, sorry, I forgot that IsFPConstrained is about whether we emit the intrinsics at all and not whether the intrinsics are currently recording real constraints.

I think I have a slight preference for the second option, where there's a single method that does all the work for the two cases.

uweigand mentioned this in rG870137d207f7: [FPEnv] Address post-commit review comment for D71467.Jan 15 2020, 6:11 AM

In D71467#1820589, @rjmccall wrote:

I think I have a slight preference for the second option, where there's a single method that does all the work for the two cases.

OK, now checked in as 870137d .

Revision Contents

Path

Size

clang/

lib/

CodeGen/

CGExprScalar.cpp

28 lines

test/

CodeGen/

fpconstrained-cmp.c

151 lines

llvm/

include/

llvm/

IR/

IRBuilder.h

41 lines

Diff 233796

clang/lib/CodeGen/CGExprScalar.cpp

Show First 20 Lines • Show All 791 Lines • ▼ Show 20 Lines	#define HANDLEBINOP(OP) \
HANDLEBINOP(And)		HANDLEBINOP(And)
HANDLEBINOP(Xor)		HANDLEBINOP(Xor)
HANDLEBINOP(Or)		HANDLEBINOP(Or)
#undef HANDLEBINOP		#undef HANDLEBINOP

// Comparisons.		// Comparisons.
Value EmitCompare(const BinaryOperator E, llvm::CmpInst::Predicate UICmpOpc,		Value EmitCompare(const BinaryOperator E, llvm::CmpInst::Predicate UICmpOpc,
llvm::CmpInst::Predicate SICmpOpc,		llvm::CmpInst::Predicate SICmpOpc,
llvm::CmpInst::Predicate FCmpOpc);		llvm::CmpInst::Predicate FCmpOpc, bool IsSignaling);
#define VISITCOMP(CODE, UI, SI, FP) \		#define VISITCOMP(CODE, UI, SI, FP, SIG) \
Value VisitBin##CODE(const BinaryOperator E) { \		Value VisitBin##CODE(const BinaryOperator E) { \
return EmitCompare(E, llvm::ICmpInst::UI, llvm::ICmpInst::SI, \		return EmitCompare(E, llvm::ICmpInst::UI, llvm::ICmpInst::SI, \
llvm::FCmpInst::FP); }		llvm::FCmpInst::FP, SIG); }
VISITCOMP(LT, ICMP_ULT, ICMP_SLT, FCMP_OLT)		VISITCOMP(LT, ICMP_ULT, ICMP_SLT, FCMP_OLT, true)
VISITCOMP(GT, ICMP_UGT, ICMP_SGT, FCMP_OGT)		VISITCOMP(GT, ICMP_UGT, ICMP_SGT, FCMP_OGT, true)
VISITCOMP(LE, ICMP_ULE, ICMP_SLE, FCMP_OLE)		VISITCOMP(LE, ICMP_ULE, ICMP_SLE, FCMP_OLE, true)
VISITCOMP(GE, ICMP_UGE, ICMP_SGE, FCMP_OGE)		VISITCOMP(GE, ICMP_UGE, ICMP_SGE, FCMP_OGE, true)
VISITCOMP(EQ, ICMP_EQ , ICMP_EQ , FCMP_OEQ)		VISITCOMP(EQ, ICMP_EQ , ICMP_EQ , FCMP_OEQ, false)
VISITCOMP(NE, ICMP_NE , ICMP_NE , FCMP_UNE)		VISITCOMP(NE, ICMP_NE , ICMP_NE , FCMP_UNE, false)
#undef VISITCOMP		#undef VISITCOMP

Value VisitBinAssign (const BinaryOperator E);		Value VisitBinAssign (const BinaryOperator E);

Value VisitBinLAnd (const BinaryOperator E);		Value VisitBinLAnd (const BinaryOperator E);
Value VisitBinLOr (const BinaryOperator E);		Value VisitBinLOr (const BinaryOperator E);
Value VisitBinComma (const BinaryOperator E);		Value VisitBinComma (const BinaryOperator E);

▲ Show 20 Lines • Show All 2,964 Lines • ▼ Show 20 Lines	case BuiltinType::Double:
return (IT == VCMPEQ) ? llvm::Intrinsic::ppc_vsx_xvcmpeqdp_p :		return (IT == VCMPEQ) ? llvm::Intrinsic::ppc_vsx_xvcmpeqdp_p :
llvm::Intrinsic::ppc_vsx_xvcmpgtdp_p;		llvm::Intrinsic::ppc_vsx_xvcmpgtdp_p;
}		}
}		}

Value ScalarExprEmitter::EmitCompare(const BinaryOperator E,		Value ScalarExprEmitter::EmitCompare(const BinaryOperator E,
llvm::CmpInst::Predicate UICmpOpc,		llvm::CmpInst::Predicate UICmpOpc,
llvm::CmpInst::Predicate SICmpOpc,		llvm::CmpInst::Predicate SICmpOpc,
llvm::CmpInst::Predicate FCmpOpc) {		llvm::CmpInst::Predicate FCmpOpc,
		bool IsSignaling) {
TestAndClearIgnoreResultAssign();		TestAndClearIgnoreResultAssign();
Value *Result;		Value *Result;
QualType LHSTy = E->getLHS()->getType();		QualType LHSTy = E->getLHS()->getType();
QualType RHSTy = E->getRHS()->getType();		QualType RHSTy = E->getRHS()->getType();
if (const MemberPointerType *MPT = LHSTy->getAs<MemberPointerType>()) {		if (const MemberPointerType *MPT = LHSTy->getAs<MemberPointerType>()) {
assert(E->getOpcode() == BO_EQ \|\|		assert(E->getOpcode() == BO_EQ \|\|
E->getOpcode() == BO_NE);		E->getOpcode() == BO_NE);
Value *LHS = CGF.EmitScalarExpr(E->getLHS());		Value *LHS = CGF.EmitScalarExpr(E->getLHS());
▲ Show 20 Lines • Show All 79 Lines • ▼ Show 20 Lines	if (LHSTy->isVectorType() && !E->getType()->isVectorType()) {
Result = Builder.CreateTrunc(Result, Builder.getInt1Ty());		Result = Builder.CreateTrunc(Result, Builder.getInt1Ty());
return EmitScalarConversion(Result, CGF.getContext().BoolTy, E->getType(),		return EmitScalarConversion(Result, CGF.getContext().BoolTy, E->getType(),
E->getExprLoc());		E->getExprLoc());
}		}

if (BOInfo.isFixedPointBinOp()) {		if (BOInfo.isFixedPointBinOp()) {
Result = EmitFixedPointBinOp(BOInfo);		Result = EmitFixedPointBinOp(BOInfo);
} else if (LHS->getType()->isFPOrFPVectorTy()) {		} else if (LHS->getType()->isFPOrFPVectorTy()) {
		if (!IsSignaling)
Result = Builder.CreateFCmp(FCmpOpc, LHS, RHS, "cmp");		Result = Builder.CreateFCmp(FCmpOpc, LHS, RHS, "cmp");
		else
		Result = Builder.CreateFCmpS(FCmpOpc, LHS, RHS, "cmp");
} else if (LHSTy->hasSignedIntegerRepresentation()) {		} else if (LHSTy->hasSignedIntegerRepresentation()) {
Result = Builder.CreateICmp(SICmpOpc, LHS, RHS, "cmp");		Result = Builder.CreateICmp(SICmpOpc, LHS, RHS, "cmp");
} else {		} else {
// Unsigned integers and pointers.		// Unsigned integers and pointers.

if (CGF.CGM.getCodeGenOpts().StrictVTablePointers &&		if (CGF.CGM.getCodeGenOpts().StrictVTablePointers &&
!isa<llvm::ConstantPointerNull>(LHS) &&		!isa<llvm::ConstantPointerNull>(LHS) &&
!isa<llvm::ConstantPointerNull>(RHS)) {		!isa<llvm::ConstantPointerNull>(RHS)) {
Show All 40 Lines	if (auto *CTy = RHSTy->getAs<ComplexType>()) {
RHS.first = Visit(E->getRHS());		RHS.first = Visit(E->getRHS());
RHS.second = llvm::Constant::getNullValue(RHS.first->getType());		RHS.second = llvm::Constant::getNullValue(RHS.first->getType());
assert(CGF.getContext().hasSameUnqualifiedType(CETy, RHSTy) &&		assert(CGF.getContext().hasSameUnqualifiedType(CETy, RHSTy) &&
"The element types must always match.");		"The element types must always match.");
}		}

Value ResultR, ResultI;		Value ResultR, ResultI;
if (CETy->isRealFloatingType()) {		if (CETy->isRealFloatingType()) {
		// As complex comparisons can only be equality comparisons, they
		// are never signaling comparisons.
ResultR = Builder.CreateFCmp(FCmpOpc, LHS.first, RHS.first, "cmp.r");		ResultR = Builder.CreateFCmp(FCmpOpc, LHS.first, RHS.first, "cmp.r");
ResultI = Builder.CreateFCmp(FCmpOpc, LHS.second, RHS.second, "cmp.i");		ResultI = Builder.CreateFCmp(FCmpOpc, LHS.second, RHS.second, "cmp.i");
} else {		} else {
// Complex comparisons can only be equality comparisons. As such, signed		// Complex comparisons can only be equality comparisons. As such, signed
// and unsigned opcodes are the same.		// and unsigned opcodes are the same.
ResultR = Builder.CreateICmp(UICmpOpc, LHS.first, RHS.first, "cmp.r");		ResultR = Builder.CreateICmp(UICmpOpc, LHS.first, RHS.first, "cmp.r");
ResultI = Builder.CreateICmp(UICmpOpc, LHS.second, RHS.second, "cmp.i");		ResultI = Builder.CreateICmp(UICmpOpc, LHS.second, RHS.second, "cmp.i");
}		}
▲ Show 20 Lines • Show All 878 Lines • Show Last 20 Lines

clang/test/CodeGen/fpconstrained-cmp.c

This file was added.

				// RUN: %clang_cc1 -ffp-exception-behavior=ignore -emit-llvm -o - %s \| FileCheck %s -check-prefix=CHECK -check-prefix=FCMP
				// RUN: %clang_cc1 -ffp-exception-behavior=strict -emit-llvm -o - %s \| FileCheck %s -check-prefix=CHECK -check-prefix=EXCEPT
				// RUN: %clang_cc1 -ffp-exception-behavior=maytrap -emit-llvm -o - %s \| FileCheck %s -check-prefix=CHECK -check-prefix=MAYTRAP
				// RUN: %clang_cc1 -frounding-math -ffp-exception-behavior=ignore -emit-llvm -o - %s \| FileCheck %s -check-prefix=CHECK -check-prefix=IGNORE
				// RUN: %clang_cc1 -frounding-math -ffp-exception-behavior=strict -emit-llvm -o - %s \| FileCheck %s -check-prefix=CHECK -check-prefix=EXCEPT
				// RUN: %clang_cc1 -frounding-math -ffp-exception-behavior=maytrap -emit-llvm -o - %s \| FileCheck %s -check-prefix=CHECK -check-prefix=MAYTRAP

				_Bool QuietEqual(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @QuietEqual(double %f1, double %f2)

				// FCMP: fcmp oeq double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"oeq", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"oeq", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"oeq", metadata !"fpexcept.maytrap")
				return f1 == f2;

				// CHECK: ret
				}

				_Bool QuietNotEqual(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @QuietNotEqual(double %f1, double %f2)

				// FCMP: fcmp une double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"une", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"une", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"une", metadata !"fpexcept.maytrap")
				return f1 != f2;

				// CHECK: ret
				}

				_Bool SignalingLess(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @SignalingLess(double %f1, double %f2)

				// FCMP: fcmp olt double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"olt", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"olt", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"olt", metadata !"fpexcept.maytrap")
				return f1 < f2;

				// CHECK: ret
				}

				_Bool SignalingLessEqual(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @SignalingLessEqual(double %f1, double %f2)

				// FCMP: fcmp ole double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"ole", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"ole", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"ole", metadata !"fpexcept.maytrap")
				return f1 <= f2;

				// CHECK: ret
				}

				_Bool SignalingGreater(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @SignalingGreater(double %f1, double %f2)

				// FCMP: fcmp ogt double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"ogt", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"ogt", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"ogt", metadata !"fpexcept.maytrap")
				return f1 > f2;

				// CHECK: ret
				}

				_Bool SignalingGreaterEqual(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @SignalingGreaterEqual(double %f1, double %f2)

				// FCMP: fcmp oge double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"oge", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"oge", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmps.f64(double %{{.}}, double %{{.}}, metadata !"oge", metadata !"fpexcept.maytrap")
				return f1 >= f2;

				// CHECK: ret
				}

				_Bool QuietLess(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @QuietLess(double %f1, double %f2)

				// FCMP: fcmp olt double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"olt", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"olt", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"olt", metadata !"fpexcept.maytrap")
				return __builtin_isless(f1, f2);

				// CHECK: ret
				}

				_Bool QuietLessEqual(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @QuietLessEqual(double %f1, double %f2)

				// FCMP: fcmp ole double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"ole", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"ole", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"ole", metadata !"fpexcept.maytrap")
				return __builtin_islessequal(f1, f2);

				// CHECK: ret
				}

				_Bool QuietGreater(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @QuietGreater(double %f1, double %f2)

				// FCMP: fcmp ogt double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"ogt", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"ogt", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"ogt", metadata !"fpexcept.maytrap")
				return __builtin_isgreater(f1, f2);

				// CHECK: ret
				}

				_Bool QuietGreaterEqual(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @QuietGreaterEqual(double %f1, double %f2)

				// FCMP: fcmp oge double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"oge", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"oge", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"oge", metadata !"fpexcept.maytrap")
				return __builtin_isgreaterequal(f1, f2);

				// CHECK: ret
				}

				_Bool QuietLessGreater(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @QuietLessGreater(double %f1, double %f2)

				// FCMP: fcmp one double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"one", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"one", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"one", metadata !"fpexcept.maytrap")
				return __builtin_islessgreater(f1, f2);

				// CHECK: ret
				}

				_Bool QuietUnordered(double f1, double f2) {
				// CHECK-LABEL: define {{.*}}i1 @QuietUnordered(double %f1, double %f2)

				// FCMP: fcmp uno double %{{.}}, %{{.}}
				// IGNORE: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"uno", metadata !"fpexcept.ignore")
				// EXCEPT: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"uno", metadata !"fpexcept.strict")
				// MAYTRAP: call i1 @llvm.experimental.constrained.fcmp.f64(double %{{.}}, double %{{.}}, metadata !"uno", metadata !"fpexcept.maytrap")
				return __builtin_isunordered(f1, f2);

				// CHECK: ret
				}

llvm/include/llvm/IR/IRBuilder.h

Show First 20 Lines • Show All 1,155 Lines • ▼ Show 20 Lines	Value *getConstrainedFPExcept(Optional<fp::ExceptionBehavior> Except) {

Optional<StringRef> ExceptStr = ExceptionBehaviorToStr(UseExcept);		Optional<StringRef> ExceptStr = ExceptionBehaviorToStr(UseExcept);
assert(ExceptStr.hasValue() && "Garbage strict exception behavior!");		assert(ExceptStr.hasValue() && "Garbage strict exception behavior!");
auto *ExceptMDS = MDString::get(Context, ExceptStr.getValue());		auto *ExceptMDS = MDString::get(Context, ExceptStr.getValue());

return MetadataAsValue::get(Context, ExceptMDS);		return MetadataAsValue::get(Context, ExceptMDS);
}		}

		Value *getConstrainedFPPredicate(CmpInst::Predicate Predicate) {
		assert(CmpInst::isFPPredicate(Predicate) &&
		Predicate != CmpInst::FCMP_FALSE &&
		Predicate != CmpInst::FCMP_TRUE &&
		"Invalid constrained FP comparison predicate!");

		StringRef PredicateStr = CmpInst::getPredicateName(Predicate);
		auto *PredicateMDS = MDString::get(Context, PredicateStr);

		return MetadataAsValue::get(Context, PredicateMDS);
		}

public:		public:
Value CreateAdd(Value LHS, Value *RHS, const Twine &Name = "",		Value CreateAdd(Value LHS, Value *RHS, const Twine &Name = "",
bool HasNUW = false, bool HasNSW = false) {		bool HasNUW = false, bool HasNSW = false) {
if (auto *LC = dyn_cast<Constant>(LHS))		if (auto *LC = dyn_cast<Constant>(LHS))
if (auto *RC = dyn_cast<Constant>(RHS))		if (auto *RC = dyn_cast<Constant>(RHS))
return Insert(Folder.CreateAdd(LC, RC, HasNUW, HasNSW), Name);		return Insert(Folder.CreateAdd(LC, RC, HasNUW, HasNSW), Name);
return CreateInsertNUWNSWBinOp(Instruction::Add, LHS, RHS, Name,		return CreateInsertNUWNSWBinOp(Instruction::Add, LHS, RHS, Name,
HasNUW, HasNSW);		HasNUW, HasNSW);
▲ Show 20 Lines • Show All 1,131 Lines • ▼ Show 20 Lines	Value CreateICmp(CmpInst::Predicate P, Value LHS, Value *RHS,
if (auto *LC = dyn_cast<Constant>(LHS))		if (auto *LC = dyn_cast<Constant>(LHS))
if (auto *RC = dyn_cast<Constant>(RHS))		if (auto *RC = dyn_cast<Constant>(RHS))
return Insert(Folder.CreateICmp(P, LC, RC), Name);		return Insert(Folder.CreateICmp(P, LC, RC), Name);
return Insert(new ICmpInst(P, LHS, RHS), Name);		return Insert(new ICmpInst(P, LHS, RHS), Name);
}		}

Value CreateFCmp(CmpInst::Predicate P, Value LHS, Value *RHS,		Value CreateFCmp(CmpInst::Predicate P, Value LHS, Value *RHS,
const Twine &Name = "", MDNode *FPMathTag = nullptr) {		const Twine &Name = "", MDNode *FPMathTag = nullptr) {
		if (IsFPConstrained)
		return CreateConstrainedFPCmp(Intrinsic::experimental_constrained_fcmp,
		P, LHS, RHS, Name);

		if (auto *LC = dyn_cast<Constant>(LHS))
		if (auto *RC = dyn_cast<Constant>(RHS))
		return Insert(Folder.CreateFCmp(P, LC, RC), Name);
		return Insert(setFPAttrs(new FCmpInst(P, LHS, RHS), FPMathTag, FMF), Name);
		}

		Value CreateFCmpS(CmpInst::Predicate P, Value LHS, Value *RHS,
		const Twine &Name = "", MDNode *FPMathTag = nullptr) {
		if (IsFPConstrained)
		return CreateConstrainedFPCmp(Intrinsic::experimental_constrained_fcmps,
		P, LHS, RHS, Name);

if (auto *LC = dyn_cast<Constant>(LHS))		if (auto *LC = dyn_cast<Constant>(LHS))
if (auto *RC = dyn_cast<Constant>(RHS))		if (auto *RC = dyn_cast<Constant>(RHS))
return Insert(Folder.CreateFCmp(P, LC, RC), Name);		return Insert(Folder.CreateFCmp(P, LC, RC), Name);
return Insert(setFPAttrs(new FCmpInst(P, LHS, RHS), FPMathTag, FMF), Name);		return Insert(setFPAttrs(new FCmpInst(P, LHS, RHS), FPMathTag, FMF), Name);
		rjmccallUnsubmitted Not Done Reply Inline Actions Can you make a helper method for the common code in the non-constrained paths here? Please document the difference between these two methods. rjmccall: Can you make a helper method for the common code in the non-constrained paths here? Please…
}		}

		CallInst *CreateConstrainedFPCmp(
		Intrinsic::ID ID, CmpInst::Predicate P, Value L, Value R,
		const Twine &Name = "",
		Optional<fp::ExceptionBehavior> Except = None) {
		Value *PredicateV = getConstrainedFPPredicate(P);
		Value *ExceptV = getConstrainedFPExcept(Except);

		CallInst *C = CreateIntrinsic(ID, {L->getType()},
		{L, R, PredicateV, ExceptV}, nullptr, Name);
		setConstrainedFPCallAttr(C);
		return C;
		}

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Instruction creation methods: Other Instructions		// Instruction creation methods: Other Instructions
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

PHINode CreatePHI(Type Ty, unsigned NumReservedValues,		PHINode CreatePHI(Type Ty, unsigned NumReservedValues,
const Twine &Name = "") {		const Twine &Name = "") {
PHINode *Phi = PHINode::Create(Ty, NumReservedValues);		PHINode *Phi = PHINode::Create(Ty, NumReservedValues);
if (isa<FPMathOperator>(Phi))		if (isa<FPMathOperator>(Phi))
▲ Show 20 Lines • Show All 476 Lines • Show Last 20 Lines