This is an archive of the discontinued LLVM Phabricator instance.

Differential D102673

[ConstantFolding] Fold constrained arithmetic intrinsics
ClosedPublic

Authored by sepavloff on May 18 2021, 1:39 AM.

Details

Reviewers

kpn
craig.topper
foad
arsenm
bkramer
scanon
spatel

Commits

rG1c64b5dc5ea8: [ConstantFolding] Fold constrained arithmetic intrinsics

Summary

Constant-fold the constrained variants of the operations fadd, fsub, fmul,
fdiv, frem, fma and fmuladd.

The change also lays the groundwork for removing unused constrained
intrinsics. They are declared as accessing memory to model their interaction
with the floating point environment, so they were not removed, because they
were treated as having side effects. Now constrained intrinsics that have
"fpexcept.ignore" as their exception behavior are removed if they have no uses.
Intrinsics with an exception behavior other than "fpexcept.ignore" can be
removed if it is known that they do not raise floating point exceptions. This
happens during constant folding: when evaluation raises no exception, the
attributes of the intrinsic are changed so that it is no longer treated as
accessing memory.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sepavloff created this revision.May 18 2021, 1:39 AM

Herald added subscribers: hiraditya, nemanjai. May 18 2021, 1:39 AM

sepavloff requested review of this revision. May 18 2021, 1:39 AM

Herald added a project: Restricted Project. May 18 2021, 1:39 AM

Herald added a subscriber: wdng.

Harbormaster completed remote builds in B104955: Diff 346074.May 18 2021, 1:40 AM

sepavloff added a parent revision: D102672: [ConstantFolding] Use APFloat for constant folding. NFC.May 18 2021, 1:41 AM

sepavloff mentioned this in D102672: [ConstantFolding] Use APFloat for constant folding. NFC.May 18 2021, 10:10 AM

Rebased

Harbormaster completed remote builds in B105869: Diff 347343.May 24 2021, 4:10 AM

kpn added inline comments.May 26 2021, 8:07 AM

llvm/lib/Transforms/Utils/Local.cpp
497	What about "maytrap"? The language reference explicitly calls out constant folding as allowed under "maytrap" since it doesn't introduce any new exceptions.
llvm/test/CodeGen/PowerPC/vector-constrained-fp-intrinsics.ll
240 ↗	(On Diff #347343)	I hate to lose the testing we're currently doing with this file. Can you split out from this file the tests that these changes hit, and then change this existing test to take arguments instead of constants? That way we preserve testing the SDAG but also test that your constant folding works correctly.

nemanjai added inline comments.May 26 2021, 9:19 AM

llvm/test/CodeGen/PowerPC/vector-constrained-fp-intrinsics.ll
240 ↗	(On Diff #347343)	I agree. I'm not sure why this test case was written to only test this with constants, but it now becomes clear that it is an unfortunate choice.

kpn mentioned this in D103169: [FPEnv][InstSimplify] Constrained FP support for NaN.May 26 2021, 2:23 PM

Changes to the test llvm/test/CodeGen/PowerPC/vector-constrained-fp-intrinsics.ll are presented in D103259.

sepavloff mentioned this in D103259: [PowerPC] Split tests for constrained intrinsics.May 31 2021, 4:24 AM

Allow folding for 'maytrap'. Rebased

sepavloff added inline comments.Jun 1 2021, 3:35 AM

llvm/lib/Transforms/Utils/Local.cpp
497	I think you are right. According to the documentation, constant folding is possible in this case.

Harbormaster completed remote builds in B106998: Diff 348918.Jun 1 2021, 3:38 AM

foad added inline comments.Jun 4 2021, 2:02 AM

llvm/lib/Analysis/ConstantFolding.cpp
2465	You don't need the "if", just do it unconditionally?

spatel mentioned this in rG8a4d05ddb3ff: [ConstantFolding] add copysign tests for more FP types; NFC.Jun 4 2021, 8:47 AM

spatel added a subscriber: spatel.Jun 4 2021, 8:54 AM

spatel added inline comments.

llvm/lib/Analysis/ConstantFolding.cpp
2424	From the tests, I'm assuming that this diff is here to support bfloat, but this change affects more than constrained intrinsics and more than only bfloat. Either we need to limit this enhancement, or we need to split this into its own patch and add more tests to make sure it works as expected. I put in some basic coverage for copysign here: 8a4d05ddb3ff I think things work as expected for simple cases like that one, but I don't know what happens if we are using the host mathlib to evaluate more complex functions (for example "pow" is in the switch under here).

Addressed reviewers' comments, rebased

sepavloff added inline comments.Jun 7 2021, 4:05 AM

llvm/lib/Analysis/ConstantFolding.cpp
2424	I put the check `if (!Ty->isHalfTy() && !Ty->isFloatTy() && !Ty->isDoubleTy()) return nullptr;` below. Constrained intrinsics have tests for the bfloat type in this patch. Tests for the other functions will be added later, and these checks will then be removed.
2465	Yes, you are right. Updated.

Harbormaster completed remote builds in B107949: Diff 350233.Jun 7 2021, 4:37 AM

spatel added inline comments.Jun 8 2021, 5:51 AM

llvm/lib/Analysis/ConstantFolding.cpp
2424	Thanks. I'm worried that someone could come back after this patch and add yet another `if` or `switch` and miss the differences in type handling. How about adding a helper function to deal with ConstrainedFP and calling it from the very first clause in this function? That also raises a question: what is the behavior of these intrinsics with undef and/or poison operands?

Move common code into separate function

vdsered added a subscriber: vdsered.Jun 9 2021, 12:56 AM

sepavloff added inline comments.Jun 9 2021, 12:56 AM

llvm/lib/Analysis/ConstantFolding.cpp
2424	> How about adding a helper function to deal with ConstrainedFP and calling it from the very first clause in this function?
The shared code is moved to `getEvaluationRoundingMode`.
> That also raises a question: what is the behavior of these intrinsics with undef and/or poison operands?
Constrained intrinsics should not differ from their counterparts in handling undef/poison. @kpn is working on D103169, which eventually should implement the relevant folding.

Harbormaster completed remote builds in B108350: Diff 350801.Jun 9 2021, 1:28 AM

sepavloff added a child revision: D103956: [ConstantFolding] Enable folding of min/max/copysign for all floats.Jun 9 2021, 5:21 AM

sepavloff removed a child revision: D103956: [ConstantFolding] Enable folding of min/max/copysign for all floats.Jun 9 2021, 10:59 AM

Swap sequence of patches

sepavloff added a parent revision: D103956: [ConstantFolding] Enable folding of min/max/copysign for all floats.Jun 9 2021, 11:01 AM

Harbormaster completed remote builds in B108461: Diff 350957.Jun 9 2021, 11:46 AM

spatel mentioned this in D104383: [InstSimplify] propagate poison through FP ops.Jun 16 2021, 7:09 AM

spatel mentioned this in rGce95200b7942: [InstSimplify] propagate poison through FP ops.Jun 16 2021, 8:33 AM

Rebased

Harbormaster completed remote builds in B109531: Diff 352458.Jun 16 2021, 6:37 PM

Is there anything I can do for this patch?

Ping @scanon to comment on whether the exception vs. fold-ability logic seems correct.

llvm/lib/Analysis/ConstantFolding.cpp

1910–1911

I think this would be easier to read if we made it more like the code above here:

Optional<RoundingMode> ORM = CI->getRoundingMode();
// If no rounding mode is specified by the intrinsic or the mode is dynamic,
// try to evaluate using the default mode. If it does not raise an inexact
// exception, rounding was not applied so the result is independent of
// rounding mode. 
if (!ORM || *ORM == RoundingMode::Dynamic) 
  return RoundingMode::NearestTiesToEven;
// Use the mode specified by the intrinsic.
return *ORM;

That's assuming I'm reading it correctly right now - are there tests with no RM specified on the intrinsic?
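
For readers who want to try the suggested control flow outside LLVM, here is a minimal standalone sketch. It assumes C++17 and replaces `Optional<RoundingMode>` with `std::optional` and a stand-in enum. The namespace `rm_sketch` and the function name `evaluationRoundingMode` are hypothetical and not part of the patch.

```cpp
#include <optional>

namespace rm_sketch {

// Stand-in for llvm::RoundingMode; only the values needed for the sketch.
enum class RoundingMode {
  TowardZero,
  NearestTiesToEven,
  TowardPositive,
  TowardNegative,
  Dynamic
};

// Hypothetical model of the helper discussed above: pick the rounding mode
// used for compile-time evaluation of a constrained intrinsic.
inline RoundingMode
evaluationRoundingMode(std::optional<RoundingMode> declared) {
  // No mode on the intrinsic, or a mode only known at run time: evaluate
  // speculatively with the default mode.
  if (!declared || *declared == RoundingMode::Dynamic)
    return RoundingMode::NearestTiesToEven;
  // Otherwise use the mode specified by the intrinsic.
  return *declared;
}

} // namespace rm_sketch
```

In this shape a missing mode and `Dynamic` both map to `NearestTiesToEven`, and any explicitly specified static mode is passed through unchanged.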

Addressed reviewer's note. Rebased

sepavloff marked an inline comment as done.Jun 25 2021, 3:52 AM

sepavloff added inline comments.

llvm/lib/Analysis/ConstantFolding.cpp
1910–1911	> That's assuming I'm reading it correctly right now - are there tests with no RM specified on the intrinsic?
No, all affected intrinsics have such an argument. But such a check is made in other places, so I put it here as well.

Harbormaster completed remote builds in B110970: Diff 354463.Jun 25 2021, 4:19 AM

Can you fix the formatting, so we don't have clang-tidy warnings? Those pop-up boxes are getting in the way of reading the code here in Phab.

llvm/lib/Analysis/ConstantFolding.cpp
1910–1911	I am not understanding something then. If the metadata args for rounding mode and exception behavior are required by the LangRef: https://llvm.org/docs/LangRef.html#llvm-experimental-constrained-fadd-intrinsic ...then why is the result of `getRoundingMode()` or `getExceptionBehavior()` an `Optional` value? If it is not valid IR without those args, we shouldn't be adding compile-time checks for things that can't happen.

Updated patch

Rebased,
Get rid of some clang-tidy warnings,
Updated test fdiv-strict.ll

Harbormaster completed remote builds in B113507: Diff 357945.Jul 12 2021, 8:30 AM

sepavloff added inline comments.Jul 12 2021, 9:18 AM

llvm/lib/Analysis/ConstantFolding.cpp
1910–1911	> I am not understanding something then. If the metadata args for rounding mode and exception behavior are required by the LangRef: https://llvm.org/docs/LangRef.html#llvm-experimental-constrained-fadd-intrinsic ...then why is the result of `getRoundingMode()` or `getExceptionBehavior()` an `Optional` value? If it is not valid IR without those args, we shouldn't be adding compile-time checks for things that can't happen.
This function processes any `ConstrainedFPIntrinsic`. Some of them do not have rounding mode arguments, like the constrained variants of `floor` and `trunc`. `frem` also does not depend on the rounding mode, although it has a rounding mode argument, which will probably be removed some day. Returning `Optional` from `getRoundingMode()` and `getExceptionBehavior()` is now part of the `ConstrainedFPIntrinsic` interface. It is safer to process the results of these functions in a more general way.

spatel added inline comments.Jul 13 2021, 6:40 AM

llvm/lib/Analysis/ConstantFolding.cpp
1910–1911	Ok - thanks for explaining. I didn't think of those other intrinsics. Presumably, we'll add constant folding for those later, so this code will be shared.
1912	This code comment does not look accurate. When we speculatively evaluate the expression, we check that any exception was not raised, not just the inexact exception, right? This raises a question: if we evaluate the expression using NearestTiesToEven, then are we guaranteed that all potential exceptions (even underflow) are identical to any other rounding mode?
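
The decision being discussed here can be modelled without LLVM. The sketch below is a simplified standalone version of the `mayFoldConstrained` logic as it appears in the diff on this page, assuming C++17. The namespace, the enums, the `FoldDecision` struct and the `raisedException` flag (standing in for `St != APFloat::opOK`) are illustrative stand-ins, and the `markReadNone` field only records where the real code calls `addAttribute`.

```cpp
#include <optional>

namespace fold_sketch {

// Stand-ins for llvm::RoundingMode and fp::ExceptionBehavior.
enum class RoundingMode { NearestTiesToEven, TowardZero, Dynamic };
enum class ExceptionBehavior { Ignore, MayTrap, Strict };

struct FoldDecision {
  bool fold;         // may the call be replaced by the computed constant?
  bool markReadNone; // would the real code mark the call as side-effect free?
};

// raisedException is true if evaluating the constant operands set any
// floating-point status flag, not only "inexact".
inline FoldDecision mayFold(std::optional<RoundingMode> rm,
                            std::optional<ExceptionBehavior> eb,
                            bool raisedException) {
  if (!raisedException) {
    // No flag is changed, so folding is safe under every exception behavior.
    const bool mark = eb && *eb != ExceptionBehavior::Ignore;
    return {true, mark};
  }
  // An exception was raised, so the result may depend on the rounding mode.
  if (!rm || *rm == RoundingMode::Dynamic)
    return {false, false};
  // Only strict exception behavior requires the flags to be set at run time.
  if (!eb || *eb != ExceptionBehavior::Strict)
    return {true, false};
  return {false, false};
}

} // namespace fold_sketch
```

Under this model, a call with dynamic rounding and strict exceptions still folds when evaluation raises nothing, while a call with a static rounding mode and strict exceptions is left alone as soon as any flag would be raised.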

Rebased and reword comment

sepavloff added inline comments.Jul 15 2021, 4:27 AM

llvm/lib/Analysis/ConstantFolding.cpp
1912	> This code comment does not look accurate. When we speculatively evaluate the expression, we check that any exception was not raised, not just the inexact exception, right?
Yes. If exceptions are tracked and constant evaluation can raise any of them, the constant expression is not folded. The relevant logic is implemented by `mayFoldConstrained`, which is defined above. In that case the operation has to run so that it sets the hardware state, which is expected as a side effect of the evaluation. The purpose of this rounding mode substitution is to enable constant folding of expressions like `1.0 + 1.0` even when the rounding mode is dynamic and therefore unknown at compile time. I tried to reword the comment to make it clearer.
> This raises a question: if we evaluate the expression using NearestTiesToEven, then are we guaranteed that all potential exceptions (even underflow) are identical to any other rounding mode?
An IEEE-754 compliant system evaluates an operation in two steps. First it calculates an intermediate result as if both the exponent range and the precision were unbounded. Then this result is rounded if it cannot be represented in the chosen floating point format. The first step does not depend on the rounding mode, but the second does. Of the five FP exceptions, `division-by-zero` and `invalid` obviously do not depend on the rounding mode. `Overflow` is raised when the magnitude of the intermediate result is too large to be represented in the chosen floating point format. Similar considerations apply to `underflow` as well. As for the `inexact` exception, if it is raised, the intermediate result cannot be represented exactly and the rounding step changed it to one of the two nearby values. The direction is determined by the rounding mode, but rounding is required in every rounding mode.
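
The `1.0 + 1.0` case can be observed at run time with the standard `<cfenv>` interface. The following is a small sketch rather than anything from the patch: the namespace `fenv_sketch`, the `Outcome` struct and the helper `addUnder` are hypothetical. It assumes a target that defines `FE_UPWARD` and `FE_DOWNWARD`, and it uses `volatile` so that the compiler performs the addition at run time instead of folding it.

```cpp
#include <cfenv>

namespace fenv_sketch {

struct Outcome {
  double value; // result of the addition
  bool inexact; // was the "inexact" flag raised by the addition?
};

// Adds a and b at run time under the given rounding mode (an FE_* constant)
// and reports whether rounding actually took place.
inline Outcome addUnder(int mode, double a, double b) {
  volatile double x = a; // volatile operands prevent compile-time folding
  volatile double y = b;
  const int saved = std::fegetround();
  std::fesetround(mode);
  std::feclearexcept(FE_ALL_EXCEPT);
  volatile double r = x + y; // stored before the flags are inspected
  const bool inexact = std::fetestexcept(FE_INEXACT) != 0;
  std::fesetround(saved); // restore the caller's rounding mode
  return {r, inexact};
}

} // namespace fenv_sketch
```

Calling `addUnder(FE_UPWARD, 1.0, 1.0)` and `addUnder(FE_DOWNWARD, 1.0, 1.0)` should both yield `2.0` with the inexact flag clear, because the exact sum is representable. For `1.0 + 1e-30` the flag is raised and the two modes give different results, which is the situation where folding under a dynamic rounding mode is not possible.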

Harbormaster completed remote builds in B114203: Diff 358919.Jul 15 2021, 5:07 AM

LGTM - see inline for some test file suggestions.
Might want to get a 2nd opinion though - I haven't used constrained ops or FP exceptions in C outside of toy examples.

llvm/test/Transforms/InstSimplify/constfold-constrained.ll
1	If there's not too much noise, I recommend using utils/update_test_checks.py to auto-generate the FileCheck lines in this file - it should eliminate typos and make it easier to update the file with new tests. So (1) add new tests, (2) auto-generate the baseline CHECKs for all tests, (3) apply this patch and re-run the script, so we just have the test diffs.
llvm/test/Transforms/InstSimplify/fdiv-strictfp.ll
27 ↗	(On Diff #358919)	Fix this test to not be misleading as a preliminary/NFC commit?

This revision is now accepted and ready to land.Jul 15 2021, 9:52 AM

Thanks!

llvm/test/Transforms/InstSimplify/constfold-constrained.ll
1	(1) and (2) are implemented in https://reviews.llvm.org/rGa0b4f424f564
llvm/test/Transforms/InstSimplify/fdiv-strictfp.ll
27 ↗	(On Diff #358919)	This typo is fixed in https://reviews.llvm.org/rG39a36999f9b4

qiucf added a subscriber: qiucf.Jul 19 2021, 2:48 AM

I just got over to @kpn 's question on llvm-dev:
https://lists.llvm.org/pipermail/llvm-dev/2021-July/151727.html

And this patch should have tests with SNaN and QNaN inputs, so we have coverage for those cases (let me know if they are/were here, but I missed it).

define float @fadd.except.strict_qnan_qnan(float %x) {
  %r = call float @llvm.experimental.constrained.fadd.f32(float 0x7ff8000000000000, float 0x7ff8000000000000, metadata !"round.dynamic", metadata !"fpexcept.strict")
  ret float %r
}

define float @fadd.except.strict_snan_qnan(float %x) {
  %r = call float @llvm.experimental.constrained.fadd.f32(float 0x7ff4000000000000, float 0x7ff8000000000000, metadata !"round.dynamic", metadata !"fpexcept.strict")
  ret float %r
}
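
The two operand constants in these tests differ only in the quiet bit. LLVM IR writes hexadecimal `float` constants as the bit pattern of a double, so the patterns can be classified as IEEE-754 binary64 values. The sketch below is only an illustration: the namespace `nan_sketch` and the function `classify` are hypothetical names, and it assumes the common convention in which a set most significant fraction bit marks a quiet NaN.

```cpp
#include <cstdint>

namespace nan_sketch {

enum class NaNKind { NotNaN, Quiet, Signaling };

// Classifies an IEEE-754 binary64 bit pattern.
inline NaNKind classify(std::uint64_t bits) {
  const std::uint64_t exponentMask = 0x7ff0000000000000ULL;
  const std::uint64_t fractionMask = 0x000fffffffffffffULL;
  const std::uint64_t quietBit = 0x0008000000000000ULL;
  // A NaN has an all-ones exponent and a non-zero fraction.
  if ((bits & exponentMask) != exponentMask || (bits & fractionMask) == 0)
    return NaNKind::NotNaN; // finite value or infinity
  return (bits & quietBit) ? NaNKind::Quiet : NaNKind::Signaling;
}

} // namespace nan_sketch
```

With this helper, `0x7ff8000000000000` classifies as a quiet NaN and `0x7ff4000000000000` as a signaling NaN, matching the `qnan` and `snan` names used in the proposed tests.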

This revision was landed with ongoing or failed builds.Jul 23 2021, 12:40 AM

Closed by commit rG1c64b5dc5ea8: [ConstantFolding] Fold constrained arithmetic intrinsics (authored by sepavloff).

This revision was automatically updated to reflect the committed changes.

sepavloff added a commit: rG1c64b5dc5ea8: [ConstantFolding] Fold constrained arithmetic intrinsics.

In D102673#2892961, @spatel wrote:

I just got over to @kpn 's question on llvm-dev:
https://lists.llvm.org/pipermail/llvm-dev/2021-July/151727.html

And this patch should have tests with SNaN and QNaN inputs, so we have coverage for those cases (let me know if they are/were here, but I missed it).

This patch intentionally avoided checking NaN inputs because at the same time @kpn implemented https://reviews.llvm.org/D103169. That patch must have required tests.

Revision Contents

Path	Size
llvm/lib/Analysis/ConstantFolding.cpp	114 lines
llvm/lib/Transforms/Utils/Local.cpp	10 lines
llvm/test/Transforms/InstSimplify/constfold-constrained.ll	223 lines

Diff 350801

llvm/lib/Analysis/ConstantFolding.cpp

[1,592 lines hidden] bool llvm::canConstantFoldCallTo(const CallBase *Call, const Function *F) {
   case Intrinsic::floor:
   case Intrinsic::round:
   case Intrinsic::roundeven:
   case Intrinsic::trunc:
   case Intrinsic::nearbyint:
   case Intrinsic::rint:
   // Constrained intrinsics can be folded if FP environment is known
   // to compiler.
+  case Intrinsic::experimental_constrained_fma:
+  case Intrinsic::experimental_constrained_fmuladd:
+  case Intrinsic::experimental_constrained_fadd:
+  case Intrinsic::experimental_constrained_fsub:
+  case Intrinsic::experimental_constrained_fmul:
+  case Intrinsic::experimental_constrained_fdiv:
+  case Intrinsic::experimental_constrained_frem:
   case Intrinsic::experimental_constrained_ceil:
   case Intrinsic::experimental_constrained_floor:
   case Intrinsic::experimental_constrained_round:
   case Intrinsic::experimental_constrained_roundeven:
   case Intrinsic::experimental_constrained_trunc:
   case Intrinsic::experimental_constrained_nearbyint:
   case Intrinsic::experimental_constrained_rint:
     return true;
[245 lines hidden] static bool getConstIntOrUndef(Value *Op, const APInt *&C) {
   }
   if (isa<UndefValue>(Op)) {
     C = nullptr;
     return true;
   }
   return false;
 }

+/// Checks if the given intrinsic call, which evaluates to constant, is allowed
+/// to be folded.
+///
+/// \param CI Constrained intrinsic call.
+/// \param St Exception flags raised during constant evaluation.
+static bool mayFoldConstrained(ConstrainedFPIntrinsic *CI,
+                               APFloat::opStatus St) {
+  Optional<RoundingMode> ORM = CI->getRoundingMode();
+  Optional<fp::ExceptionBehavior> EB = CI->getExceptionBehavior();
+
+  // If the operation does not change exception status flags, it is safe
+  // to fold.
+  if (St == APFloat::opStatus::opOK) {
+    // When FP exceptions are not ignored, intrinsic call will not be
+    // eliminated, because it is considered as having side effect. But we
+    // know that its evaluation does not raise exceptions, so side effect
+    // is absent. To allow removing the call, mark it as not accessing memory.
+    if (EB && *EB != fp::ExceptionBehavior::ebIgnore)
+      CI->addAttribute(AttributeList::FunctionIndex, Attribute::ReadNone);
+    return true;
+  }
+
+  // If evaluation raised FP exception, the result can depend on rounding
+  // mode. If the latter is unknown, folding is not possible.
+  if (!ORM || *ORM == RoundingMode::Dynamic)
+    return false;
+
+  // If FP exceptions are ignored, fold the call, even if such exception is
+  // raised.
+  if (!EB || *EB != fp::ExceptionBehavior::ebStrict)
+    return true;
+
+  // Leave the calculation for runtime so that exception flags be correctly set
+  // in hardware.
+  return false;
+}
+
+/// Returns the rounding mode that should be used for constant evaluation.
+static RoundingMode
+getEvaluationRoundingMode(const ConstrainedFPIntrinsic *CI) {
+  RoundingMode RM = RoundingMode::Dynamic;
+  Optional<RoundingMode> ORM = CI->getRoundingMode();
+  if (ORM)
+    RM = *ORM;
+  if (RM == RoundingMode::Dynamic)
+    // Even if the rounding mode is unknown, try evaluating the operation.
+    // If it does not raise inexact exception, rounding was not applied
+    // so the result does not depend on rounding mode.
+    RM = RoundingMode::NearestTiesToEven;
+  return RM;
+}
+
 static Constant *ConstantFoldScalarCall1(StringRef Name,
                                          Intrinsic::ID IntrinsicID,
                                          Type *Ty,
                                          ArrayRef<Constant *> Operands,
                                          const TargetLibraryInfo *TLI,
                                          const CallBase *Call) {
   assert(Operands.size() == 1 && "Wrong number of operands.");

[487 lines hidden] case Intrinsic::minimum:
         return Operands[1];
       if (IsOp1Undef)
         return Operands[0];
       break;
     }
   }

   if (auto *Op1 = dyn_cast<ConstantFP>(Operands[0])) {
-    if (!Ty->isHalfTy() && !Ty->isFloatTy() && !Ty->isDoubleTy())
+    if (!Ty->isFloatingPointTy())
       return nullptr;
     APFloat Op1V = Op1->getValueAPF();

     if (auto *Op2 = dyn_cast<ConstantFP>(Operands[1])) {
       if (Op2->getType() != Op1->getType())
         return nullptr;
       APFloat Op2V = Op2->getValueAPF();

+      if (auto ConstrIntr = dyn_cast<ConstrainedFPIntrinsic>(Call)) {
+        RoundingMode RM = getEvaluationRoundingMode(ConstrIntr);
+        APFloat Res = Op1V;
+        APFloat::opStatus St;
+        switch (IntrinsicID) {
+        default:
+          return nullptr;
+        case Intrinsic::experimental_constrained_fadd:
+          St = Res.add(Op2V, RM);
+          break;
+        case Intrinsic::experimental_constrained_fsub:
+          St = Res.subtract(Op2V, RM);
+          break;
+        case Intrinsic::experimental_constrained_fmul:
+          St = Res.multiply(Op2V, RM);
+          break;
+        case Intrinsic::experimental_constrained_fdiv:
+          St = Res.divide(Op2V, RM);
+          break;
+        case Intrinsic::experimental_constrained_frem:
+          St = Res.remainder(Op2V);
+          break;
+        }
+        if (mayFoldConstrained(const_cast<ConstrainedFPIntrinsic *>(ConstrIntr),
+                               St))
+          return ConstantFP::get(Ty->getContext(), Res);
+        return nullptr;
+      }
+
+      if (!Ty->isHalfTy() && !Ty->isFloatTy() && !Ty->isDoubleTy())
+        return nullptr;
+
       switch (IntrinsicID) {
       default:
         break;
       case Intrinsic::pow:
         return ConstantFoldBinaryFP(pow, Op1V, Op2V, Ty);
       case Intrinsic::copysign:
         return ConstantFP::get(Ty->getContext(), APFloat::copySign(Op1V, Op2V));
       case Intrinsic::minnum:
         return ConstantFP::get(Ty->getContext(), minnum(Op1V, Op2V));
[46 lines hidden] if (auto *Op2 = dyn_cast<ConstantFP>(Operands[1])) {
       case LibFunc_atan2f:
       case LibFunc_atan2_finite:
       case LibFunc_atan2f_finite:
         if (TLI->has(Func))
           return ConstantFoldBinaryFP(atan2, Op1V, Op2V, Ty);
         break;
       }
     } else if (auto *Op2C = dyn_cast<ConstantInt>(Operands[1])) {
+      if (!Ty->isHalfTy() && !Ty->isFloatTy() && !Ty->isDoubleTy())
+        return nullptr;
       if (IntrinsicID == Intrinsic::powi && Ty->isHalfTy())
         return ConstantFP::get(
             Ty->getContext(),
             APFloat((float)std::pow((float)Op1V.convertToDouble(),
                                     (int)Op2C->getZExtValue())));
       if (IntrinsicID == Intrinsic::powi && Ty->isFloatTy())
         return ConstantFP::get(
             Ty->getContext(),
▲ Show 20 Lines • Show All 319 Lines • ▼ Show 20 Lines	static Constant *ConstantFoldScalarCall3(StringRef Name,
assert(Operands.size() == 3 && "Wrong number of operands.");		assert(Operands.size() == 3 && "Wrong number of operands.");

if (const auto *Op1 = dyn_cast<ConstantFP>(Operands[0])) {		if (const auto *Op1 = dyn_cast<ConstantFP>(Operands[0])) {
if (const auto *Op2 = dyn_cast<ConstantFP>(Operands[1])) {		if (const auto *Op2 = dyn_cast<ConstantFP>(Operands[1])) {
if (const auto *Op3 = dyn_cast<ConstantFP>(Operands[2])) {		if (const auto *Op3 = dyn_cast<ConstantFP>(Operands[2])) {
const APFloat &C1 = Op1->getValueAPF();		const APFloat &C1 = Op1->getValueAPF();
const APFloat &C2 = Op2->getValueAPF();		const APFloat &C2 = Op2->getValueAPF();
const APFloat &C3 = Op3->getValueAPF();		const APFloat &C3 = Op3->getValueAPF();

		if (auto ConstrIntr = dyn_cast<ConstrainedFPIntrinsic>(Call)) {
		Lint: Pre-merge checks: clang-tidy: warning: 'auto ConstrIntr' can be declared as 'const auto *ConstrIntr' [llvm-qualified-auto]
		RoundingMode RM = getEvaluationRoundingMode(ConstrIntr);
		APFloat Res = C1;
		APFloat::opStatus St;
		switch (IntrinsicID) {
		default:
		return nullptr;
		case Intrinsic::experimental_constrained_fma:
		case Intrinsic::experimental_constrained_fmuladd:
		St = Res.fusedMultiplyAdd(C2, C3, RM);
		break;
		}
		if (mayFoldConstrained(
		const_cast<ConstrainedFPIntrinsic *>(ConstrIntr), St))
		return ConstantFP::get(Ty->getContext(), Res);
		return nullptr;
		}

switch (IntrinsicID) {
default: break;
case Intrinsic::amdgcn_fma_legacy: {
// The legacy behaviour is that multiplying +/- 0.0 by anything, even
// NaN or infinity, gives +0.0.
if (C1.isZero() || C2.isZero()) {
// It's tempting to just return C3 here, but that would give the
// wrong result if C3 was -0.0.
[422 lines not shown]

llvm/lib/Transforms/Utils/Local.cpp

[480 lines not shown; enclosing context: bool llvm::wouldInstructionBeTriviallyDead(Instruction *I,]
if (CallInst *CI = isFreeCall(I, TLI))
if (Constant *C = dyn_cast<Constant>(CI->getArgOperand(0)))
return C->isNullValue() || isa<UndefValue>(C);

if (auto *Call = dyn_cast<CallBase>(I))
if (isMathLibCallNoop(Call, TLI))
return true;

		// To express possible interaction with floating point environment constrained
		// intrinsics are described as if they access memory. So they look like having
		// side effect but actually do not have it unless they raise floating point
		// exception. If FP exceptions are ignored, the intrinsic may be deleted.
		if (auto *CI = dyn_cast<ConstrainedFPIntrinsic>(I)) {
		Optional<fp::ExceptionBehavior> EB = CI->getExceptionBehavior();
		if (!EB || *EB == fp::ExceptionBehavior::ebIgnore)
		return true;
		}
		kpn: What about "maytrap"? The language reference explicitly calls out constant folding as allowed under "maytrap" since it doesn't introduce any new exceptions.
		sepavloff (author): I think you are right. According to the documentation, constant folding is possible in this case.
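
One way to read the conclusion of this thread is that only "fpexcept.strict" needs to be treated as an observable side effect when deciding whether an unused constrained intrinsic can go away. The following is a minimal standalone sketch of that reading, not code from the patch: the enum `ExceptionBehavior` and the function `isUnusedConstrainedCallRemovable` are hypothetical stand-ins for `fp::ExceptionBehavior` and the check inside `wouldInstructionBeTriviallyDead`.

```cpp
#include <optional>

// Hypothetical stand-in for llvm::fp::ExceptionBehavior (assumption, not the
// real LLVM type).
enum class ExceptionBehavior { Ignore, MayTrap, Strict };

// Models the decision for a constrained intrinsic call that has no uses.
// An empty optional mirrors the `!EB` case in the diff above and is treated
// as removable, exactly as the original condition does.
constexpr bool
isUnusedConstrainedCallRemovable(std::optional<ExceptionBehavior> EB) {
  // Under "ignore" and "maytrap" dropping a possible exception is allowed;
  // under "strict" the call must stay so that flags are raised at run time.
  return !EB || *EB != ExceptionBehavior::Strict;
}
```

Compared with the condition in the diff, the only difference in this sketch is that `MayTrap` is accepted alongside `Ignore`.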

return false;
}

/// RecursivelyDeleteTriviallyDeadInstructions - If the specified value is a
/// trivially dead instruction, delete it. If that makes any of its operands
/// trivially dead, delete them too, recursively. Return true if any
/// instructions were deleted.
bool llvm::RecursivelyDeleteTriviallyDeadInstructions(
[2,878 lines not shown]

llvm/test/Transforms/InstSimplify/constfold-constrained.ll

; RUN: opt < %s -instsimplify -S | FileCheck %s
		spatel: If there's not too much noise, I recommend using utils/update_test_checks.py to auto-generate the FileCheck lines in this file - it should eliminate typos and make it easier to update the file with new tests. So (1) add new tests, (2) auto-generate the baseline CHECKs for all tests, (3) apply this patch and re-run the script, so we just have the test diffs.
		sepavloff (author): (1) and (2) are implemented in https://reviews.llvm.org/rGa0b4f424f564


; Verify that floor(10.1) is folded to 10.0 when the exception behavior is 'ignore'.
define double @floor_01() #0 {
entry:
%result = call double @llvm.experimental.constrained.floor.f64(
double 1.010000e+01,
metadata !"fpexcept.ignore") #0
[217 lines not shown; enclosing context: %result = call double @llvm.experimental.constrained.rint.f64(]
double 1.010000e+01,
metadata !"round.tonearest",
metadata !"fpexcept.maytrap") #0
ret double %result
; CHECK-LABEL: @rint_03
; CHECK: ret double 1.000000e+01
}

		define float @fadd_01() #0 {
		entry:
		%result = call float @llvm.experimental.constrained.fadd.f32(
		float 1.000000e+01,
		float 2.000000e+01,
		metadata !"round.tonearest",
		metadata !"fpexcept.ignore") #0
		ret float %result
		; CHECK-LABEL: @fadd_01
		; CHECK: ret float 3.000000e+01
		}

		; Inexact result does not prevent from folding if exceptions are ignored and
		; rounding mode is known.
		define double @fadd_02() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fadd.f64(
		double 1.0,
		double 0x3FF0000000000001,
		metadata !"round.tonearest",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @fadd_02
		; CHECK: ret double 2.000000e+00
		}

		define double @fadd_03() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fadd.f64(
		double 1.0,
		double 0x3FF0000000000001,
		metadata !"round.upward",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @fadd_03
		; CHECK: ret double 0x4000000000000001
		}
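
The expected constants in fadd_02 and fadd_03 differ only because of the rounding mode: 1.0 + 0x3FF0000000000001 is exactly halfway between two representable doubles, so round-to-nearest-even gives 2.0 while rounding upward gives the next double above it. The sketch below reproduces this at run time with `<cfenv>`. It is an illustration only and does not use LLVM; the helper names `fromBits`, `toBits` and `addUnderRounding` are made up for this example, and it assumes a target whose hardware addition honours the `<cfenv>` rounding mode (the `volatile` operands are there to keep the compiler from folding the sum itself).

```cpp
#include <cassert>
#include <cfenv>
#include <cstdint>
#include <cstring>

// Reinterpret a 64-bit pattern as a double.
double fromBits(std::uint64_t Bits) {
  double D;
  std::memcpy(&D, &Bits, sizeof D);
  return D;
}

// Reinterpret a double as its 64-bit pattern.
std::uint64_t toBits(double D) {
  std::uint64_t Bits;
  std::memcpy(&Bits, &D, sizeof Bits);
  return Bits;
}

// Adds A and B at run time under the given <cfenv> rounding mode
// (FE_TONEAREST, FE_UPWARD, ...) and returns the bit pattern of the sum.
// The previous rounding mode is restored before returning.
std::uint64_t addUnderRounding(double A, double B, int Mode) {
  volatile double X = A, Y = B; // volatile: force a run-time addition
  const int Saved = std::fegetround();
  const int RC = std::fesetround(Mode);
  assert(RC == 0 && "rounding mode not supported on this target");
  (void)RC;
  volatile double Sum = X + Y;
  std::fesetround(Saved);
  return toBits(Sum);
}
```

Calling `addUnderRounding(fromBits(0x3FF0000000000000), fromBits(0x3FF0000000000001), FE_TONEAREST)` corresponds to fadd_02, and the same call with `FE_UPWARD` corresponds to fadd_03.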

		; Inexact result prevents from folding if exceptions may be checked.
		define double @fadd_04() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fadd.f64(
		double 1.0,
		double 0x3FF0000000000001,
		metadata !"round.tonearest",
		metadata !"fpexcept.strict") #0
		ret double %result
		; CHECK-LABEL: @fadd_04
		; CHECK: ret double %result
		}

		; If result is exact, folding is allowed even if exceptions may be checked.
		define double @fadd_05() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fadd.f64(
		double 1.0,
		double 2.0,
		metadata !"round.tonearest",
		metadata !"fpexcept.strict") #0
		ret double %result
		; CHECK-LABEL: @fadd_05
		; CHECK: ret double 3.000000e+00
		}

		; Dynamic rounding mode does not prevent from folding if the result is exact.
		define double @fadd_06() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fadd.f64(
		double 1.0,
		double 2.0,
		metadata !"round.dynamic",
		metadata !"fpexcept.strict") #0
		ret double %result
		; CHECK-LABEL: @fadd_06
		; CHECK: ret double 3.000000e+00
		}

		; Inexact results prevents from folding if rounding mode is unknown.
		define double @fadd_07() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fadd.f64(
		double 1.0,
		double 0x3FF0000000000001,
		metadata !"round.dynamic",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @fadd_07
		; CHECK: ret double %result
		}

		; Infinite result does not prevent from folding unless exceptions are tracked.
		define double @fadd_08() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fadd.f64(
		double 0x7fEFFFFFFFFFFFFF,
		double 0x7fEFFFFFFFFFFFFF,
		metadata !"round.tonearest",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @fadd_08
		; CHECK: ret double 0x7FF0000000000000
		}

		define double @fadd_09() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fadd.f64(
		double 0x7fEFFFFFFFFFFFFF,
		double 0x7fEFFFFFFFFFFFFF,
		metadata !"round.tonearest",
		metadata !"fpexcept.strict") #0
		ret double %result
		; CHECK-LABEL: @fadd_09
		; CHECK: ret double %result
		}

		define half @fadd_10() #0 {
		entry:
		%result = call half @llvm.experimental.constrained.fadd.f16(
		half 1.0,
		half 2.0,
		metadata !"round.tonearest",
		metadata !"fpexcept.ignore") #0
		ret half %result
		; CHECK-LABEL: @fadd_10
		; CHECK: ret half 0xH4200
		}

		define bfloat @fadd_11() #0 {
		entry:
		%result = call bfloat @llvm.experimental.constrained.fadd.bf16(
		bfloat 1.0,
		bfloat 2.0,
		metadata !"round.tonearest",
		metadata !"fpexcept.ignore") #0
		ret bfloat %result
		; CHECK-LABEL: @fadd_11
		; CHECK: ret bfloat 0xR4040
		}

		define double @fsub_01() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fsub.f64(
		double 1.0,
		double 2.0,
		metadata !"round.tonearest",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @fsub_01
		; CHECK: ret double -1.000000e+00
		}

		define double @fmul_01() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fmul.f64(
		double 1.0,
		double 2.0,
		metadata !"round.tonearest",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @fmul_01
		; CHECK: ret double 2.000000e+00
		}

		define double @fdiv_01() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fdiv.f64(
		double 1.0,
		double 2.0,
		metadata !"round.tonearest",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @fdiv_01
		; CHECK: ret double 5.000000e-01
		}

		define double @frem_01() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.frem.f64(
		double 1.0,
		double 2.0,
		metadata !"round.dynamic",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @frem_01
		; CHECK: ret double 1.000000e+00
		}

		define double @fma_01() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fma.f64(
		double 1.0,
		double 2.0,
		double 3.0,
		metadata !"round.dynamic",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @fma_01
		; CHECK: ret double 5.000000e+00
		}

		define double @fmuladd_01() #0 {
		entry:
		%result = call double @llvm.experimental.constrained.fmuladd.f64(
		double 1.0,
		double 2.0,
		double 3.0,
		metadata !"round.dynamic",
		metadata !"fpexcept.ignore") #0
		ret double %result
		; CHECK-LABEL: @fmuladd_01
		; CHECK: ret double 5.000000e+00
		}


attributes #0 = { strictfp }

declare double @llvm.experimental.constrained.nearbyint.f64(double, metadata, metadata)
declare double @llvm.experimental.constrained.floor.f64(double, metadata)
declare double @llvm.experimental.constrained.ceil.f64(double, metadata)
declare double @llvm.experimental.constrained.trunc.f64(double, metadata)
declare double @llvm.experimental.constrained.round.f64(double, metadata)
declare double @llvm.experimental.constrained.rint.f64(double, metadata, metadata)
		declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)
		declare float @llvm.experimental.constrained.fadd.f32(float, float, metadata, metadata)
		declare half @llvm.experimental.constrained.fadd.f16(half, half, metadata, metadata)
		declare bfloat @llvm.experimental.constrained.fadd.bf16(bfloat, bfloat, metadata, metadata)
		declare double @llvm.experimental.constrained.fsub.f64(double, double, metadata, metadata)
		declare double @llvm.experimental.constrained.fmul.f64(double, double, metadata, metadata)
		declare double @llvm.experimental.constrained.fdiv.f64(double, double, metadata, metadata)
		declare double @llvm.experimental.constrained.frem.f64(double, double, metadata, metadata)
		declare double @llvm.experimental.constrained.fma.f64(double, double, double, metadata, metadata)
		declare double @llvm.experimental.constrained.fmuladd.f64(double, double, double, metadata, metadata)

Revision Contents

Diff 350801
