This is an archive of the discontinued LLVM Phabricator instance.

[doc] Clarify constrained fcmps behavior
ClosedPublic

Authored by thopre on Apr 22 2021, 5:21 AM.

Download Raw Diff

Details

Reviewers

uweigand
craig.topper
cameron.mcinally
andrew.w.kaylor
kpn
kristof.beyls

Commits

rG2fdedf905a6b: [doc] Clarify constrained fcmps behavior

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

thopre created this revision.Apr 22 2021, 5:21 AM

Herald added a subscriber: jdoerfert. · View Herald TranscriptApr 22 2021, 5:21 AM

thopre requested review of this revision.Apr 22 2021, 5:21 AM

Herald added a project: Restricted Project. · View Herald TranscriptApr 22 2021, 5:21 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

I don't believe this change is correct. Note that even if an exception is raised, the instruction may still have a result (if exceptions are configured to only set a flag bit and not actually trap). This result will still be different for ordered vs. unordered comparisons.

In D101053#2708365, @uweigand wrote:

I don't believe this change is correct. Note that even if an exception is raised, the instruction may still have a result (if exceptions are configured to only set a flag bit and not actually trap). This result will still be different for ordered vs. unordered comparisons.

Oh I see, I guess it needs clarification even more then. I'll adapt the patch accordingly, thanks.

Harbormaster completed remote builds in B100235: Diff 339579.Apr 22 2021, 6:31 AM

Fix clarification

Harbormaster completed remote builds in B100261: Diff 339615.Apr 22 2021, 8:43 AM

The statement is now correct, but it's still not completely clear to me why it is necessary. All floating-point intrinsics may still produce a result in addition to raising an exception (if they do), so the definition of the result is always independent of the question what -if any- exceptions are raised. Why does this need to be called out here specifically?

In D101053#2709157, @uweigand wrote:

The statement is now correct, but it's still not completely clear to me why it is necessary. All floating-point intrinsics may still produce a result in addition to raising an exception (if they do), so the definition of the result is always independent of the question what -if any- exceptions are raised. Why does this need to be called out here specifically?

Other constrained operation have meaning irregardless of whether a trap is produced for an exception: they limit the sort of optimisation that can be done with FP operation. The distinction between ordered and unordered for constrained fcmps AFAIK only makes sense if no trap happens. I can also add a general reminder that exception does not necessarily implies trap at the beginning of the constrained intrinsics section if it makes more sense but I think we should have something here *as well* because that's where it's important to remember it.

Said another way, realising my mistake that exception != trap I would not change any of the STRICTFP node lowering except for STRICT_FSETCCS.

B.t.w. even if the raised exception does cause an actual trap, the result of the instruction might _still_ be accessed. This is now of course platform-specific, but e.g. on Linux it is possible to inspect the result of the instruction in the trap handler if one is installed; it is also possible for the trap handler to resume execution so that the result may also still be used by the original code afterwards. In those cases, even if there is a trap, there would still be difference between fcmp and fcmps visible.

But in any case, I guess if this extra text clarifies things for you, it might also be helpful to others, so I certainly don't mind adding it.

This revision is now accepted and ready to land.Apr 23 2021, 3:21 AM

Closed by commit rG2fdedf905a6b: [doc] Clarify constrained fcmps behavior (authored by thopre). · Explain WhyApr 23 2021, 3:55 AM

This revision was automatically updated to reflect the committed changes.

thopre added a commit: rG2fdedf905a6b: [doc] Clarify constrained fcmps behavior.

Revision Contents

Path

Size

llvm/

docs/

LangRef.rst

6 lines

Diff 339970

llvm/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 19,279 Lines • ▼ Show 20 Lines
	- "``une``": yields ``true`` if either operand is a NAN or ``op1`` is			- "``une``": yields ``true`` if either operand is a NAN or ``op1`` is
	not equal to ``op2``.			not equal to ``op2``.
	- "``uno``": yields ``true`` if either operand is a NAN.			- "``uno``": yields ``true`` if either operand is a NAN.

	The quiet comparison operation performed by			The quiet comparison operation performed by
	'``llvm.experimental.constrained.fcmp``' will only raise an exception			'``llvm.experimental.constrained.fcmp``' will only raise an exception
	if either operand is a SNAN. The signaling comparison operation			if either operand is a SNAN. The signaling comparison operation
	performed by '``llvm.experimental.constrained.fcmps``' will raise an			performed by '``llvm.experimental.constrained.fcmps``' will raise an
	exception if either operand is a NAN (QNAN or SNAN).			exception if either operand is a NAN (QNAN or SNAN). Such an exception
				does not preclude a result being produced (e.g. exception might only
				set a flag), therefore the distinction between ordered and unordered
				comparisons is also relevant for the
				'``llvm.experimental.constrained.fcmps``' intrinsic.

	'``llvm.experimental.constrained.fmuladd``' Intrinsic			'``llvm.experimental.constrained.fmuladd``' Intrinsic
	^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^			^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

	Syntax:			Syntax:
	"""""""			"""""""

	::			::
	▲ Show 20 Lines • Show All 2,753 Lines • Show Last 20 Lines