⚙ D91972 Improve STRICT_FSETCC codegen in absence of no NaN

thopre created this revision. Nov 23 2020, 8:33 AM

Herald added a project: Restricted Project. Nov 23 2020, 8:33 AM

Herald added subscribers: llvm-commits, hiraditya.

thopre requested review of this revision. Nov 23 2020, 8:33 AM

Harbormaster completed remote builds in B79803: Diff 307091. Nov 23 2020, 9:10 AM

Make FPMO const

Harbormaster completed remote builds in B79924: Diff 307303. Nov 24 2020, 4:04 AM

Rebase

Harbormaster completed remote builds in B80698: Diff 308721. Dec 1 2020, 12:06 PM

Ping?

SjoerdMeijer added inline comments. Feb 8 2021, 4:33 AM

llvm/test/CodeGen/AArch64/arm64-fcmp-no-nans-opt.ll
2	Since these tests run with `-enable-no-nans-fp-math`, I am wondering which condition we are testing in `if ((FPMO && FPMO->hasNoNaNs()) || TM.Options.NoNaNsFPMath)`. I guess that is `TM.Options.NoNaNsFPMath`, so do we have or need checks for `FPMO->hasNoNaNs()`?
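
The concern above can be illustrated with a small, self-contained model. This is only a sketch: `canIgnoreNaNs` and `instrFlagIsObservable` are hypothetical helper names, and the three booleans merely stand in for "the instruction is an FP math operation", its per-instruction `nnan` flag, and `TM.Options.NoNaNsFPMath` (assumed here to be what `-enable-no-nans-fp-math` sets).

```cpp
// Hypothetical model of the condition quoted in the review comment:
//   (FPMO && FPMO->hasNoNaNs()) || TM.Options.NoNaNsFPMath
// HasFPMO      : the instruction is a floating-point math operation
// InstrNoNaNs  : the per-instruction no-NaN flag
// GlobalNoNaNs : stand-in for TM.Options.NoNaNsFPMath
inline bool canIgnoreNaNs(bool HasFPMO, bool InstrNoNaNs, bool GlobalNoNaNs) {
  return (HasFPMO && InstrNoNaNs) || GlobalNoNaNs;
}

// Returns true when flipping the per-instruction flag changes the outcome,
// i.e. when a test could actually exercise the hasNoNaNs() half of the check.
inline bool instrFlagIsObservable(bool GlobalNoNaNs) {
  return canIgnoreNaNs(true, true, GlobalNoNaNs) !=
         canIgnoreNaNs(true, false, GlobalNoNaNs);
}
```

In this model, `instrFlagIsObservable(true)` is false: once the global option is on, the per-instruction flag cannot change the result, so a test that always passes the global option only covers the second half of the condition.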

Don't check for instruction nonan flag since there's none for constrained fcmp

In D91972#2548455, @thopre wrote:

Don't check for instruction nonan flag since there's none for constrained fcmp

But that check seems useful for float instructions? In other words, can we keep the check for FPMO->hasNoNaNs(), but just add some new tests for that?

Harbormaster completed remote builds in B88276: Diff 322093. Feb 8 2021, 6:49 AM

thopre added inline comments. Feb 8 2021, 7:01 AM

llvm/test/CodeGen/AArch64/arm64-fcmp-no-nans-opt.ll
2	Ah yes, this only makes sense for regular fcmp which can have fast math flag `nnan`. Constrained fcmp cannot.

In D91972#2548500, @SjoerdMeijer wrote:

In D91972#2548455, @thopre wrote:

Don't check for instruction nonan flag since there's none for constrained fcmp

But that check seems useful for float instructions? In other words, can we keep the check for FPMO->hasNoNaNs(), but just add some new tests for that?

Constrained FCmp instructions cannot be given fast-math flags in textual IR currently. I'm not sure if there's a way for the flag to be propagated from somewhere else. I'll try to dig a bit more. Bear with me

Rename testcase

Harbormaster completed remote builds in B88349: Diff 322205. Feb 8 2021, 2:06 PM

In D91972#2548536, @thopre wrote:

In D91972#2548500, @SjoerdMeijer wrote:

In D91972#2548455, @thopre wrote:

Don't check for instruction nonan flag since there's none for constrained fcmp

But that check seems useful for float instructions? In other words, can we keep the check for FPMO->hasNoNaNs(), but just add some new tests for that?

Constrained FCmp instructions cannot be given fast-math flags in textual IR currently. I'm not sure if there's a way for the flag to be propagated from somewhere else. I'll try to dig a bit more. Bear with me

Constrained fcmp can have such flags, since any FPMathOperator can have the NoNaN flag, but I could not find a way to set it from looking at the code. There is no parameter attribute that would achieve it, no easy way to express it with llvm.assume, and no obvious combine that would work. Since there's no way to test that code, I think it's safer to leave it out. Worst case, it's an optimisation that does not happen and we can patch it later.

SjoerdMeijer accepted this revision. Feb 9 2021, 1:22 AM

SjoerdMeijer added inline comments.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
7010	Nit: perhaps just make this an if-else.
llvm/test/CodeGen/AArch64/arm64-constrained-fcmp-no-nans-opt.ll
3 ↗	(On Diff #322205)	Do we need to add tests for .f16 and .f64 just for completeness?

This revision is now accepted and ready to land. Feb 9 2021, 1:22 AM

thopre marked an inline comment as done. Feb 9 2021, 2:45 AM

thopre added inline comments.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

7010

I'm not sure what you mean, something like:

ISD::CondCode Condition = getFCmpCondCode(FPCmp->getPredicate());
if (TM.Options.NoNaNsFPMath)
  Opers.push_back(DAG.getCondCode(getFCmpCodeWithoutNaN(Condition)));
else
  Opers.push_back(DAG.getCondCode(Condition));

? I'd prefer to keep the current code, since it mirrors what is done in visitFCmp and better reflects that, even if a better condition code can be used (due to no NaN), we are still going to push it. YMMV of course.
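
To make the two shapes under discussion concrete, here is a minimal, self-contained sketch. Everything in it is a hypothetical stand-in rather than LLVM's API: the `CondCode` enum covers only a few predicates, and `withoutNaN` only models the idea behind `getFCmpCodeWithoutNaN` (when NaNs cannot occur, the ordered and unordered variants of a predicate collapse to the same code). The "refine, then push once" form is assumed to correspond to the current code described in the comment above.

```cpp
// Toy stand-in for a handful of condition codes (not LLVM's ISD::CondCode).
enum class CondCode { OEQ, UEQ, OLT, ULT, EQ, LT };

// Models the idea of getFCmpCodeWithoutNaN: without NaNs, ordered and
// unordered variants of the same comparison are interchangeable.
inline CondCode withoutNaN(CondCode CC) {
  switch (CC) {
  case CondCode::OEQ:
  case CondCode::UEQ:
    return CondCode::EQ;
  case CondCode::OLT:
  case CondCode::ULT:
    return CondCode::LT;
  default:
    return CC; // already NaN-agnostic, leave unchanged
  }
}

// "Refine, then push once": optionally improve the code, then hand it on
// from a single place.
inline CondCode pickRefineThenPush(CondCode CC, bool NoNaNsFPMath) {
  if (NoNaNsFPMath)
    CC = withoutNaN(CC);
  return CC; // stands in for the single push_back site
}

// "If-else": one hand-off per branch, as suggested in the nit.
inline CondCode pickIfElse(CondCode CC, bool NoNaNsFPMath) {
  if (NoNaNsFPMath)
    return withoutNaN(CC);
  else
    return CC;
}
```

Both helpers return the same code for every input in this model, so the choice between them is about readability and consistency with the neighbouring code, not behaviour.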

llvm/test/CodeGen/AArch64/arm64-constrained-fcmp-no-nans-opt.ll

3 ↗

(On Diff #322205)

I've only done the f64 because f16 does not appear to be supported on AArch64:

LLVM ERROR: Cannot select: 0x55d983a0fc58: f16,ch = AArch64ISD::STRICT_FCMP 0x55d9839a8938, 0x55d983a0fab8, 0x55d983a0fb88
  0x55d983a0fab8: f16,ch = CopyFromReg 0x55d9839a8938, Register:f16 %0
    0x55d983a0fa50: f16 = Register %0
  0x55d983a0fb88: f16,ch = CopyFromReg 0x55d9839a8938, Register:f16 %1
    0x55d983a0fb20: f16 = Register %1
In function: f16_constrained_fcmp_ueq

Happy to add it later if someone adds that support.

Add f64 tests

In D91972#2550797, @thopre wrote:

Add f64 tests

llvm/test/CodeGen/AArch64/arm64-constrained-fcmp-no-nans-opt.ll
3 ↗	(On Diff #322205)	It is supported if you add `-mattr=+fullfp16` to the command line.

SjoerdMeijer added inline comments. Feb 9 2021, 3:13 AM

llvm/test/CodeGen/AArch64/arm64-constrained-fcmp-no-nans-opt.ll
3 ↗	(On Diff #322205)	And without it, it shouldn't crash, so that is a problem, but perhaps a different one.

Add f16 cases

LGTM

In D91972#2550830, @SjoerdMeijer wrote:

LGTM

Great, thanks Sjoerd!

This revision was landed with ongoing or failed builds. Feb 9 2021, 3:18 AM

Closed by commit rGb7b61a7b5bc6: Improve STRICT_FSETCC codegen in absence of no NaN (authored by thopre).

This revision was automatically updated to reflect the committed changes.

thopre added a commit: rGb7b61a7b5bc6: Improve STRICT_FSETCC codegen in absence of no NaN.

thopre added a reverting change: rGa50ab8672d16: Revert STRICT_FCMP nonan optimisation. Feb 9 2021, 3:28 AM

In D91972#2550832, @thopre wrote:

In D91972#2550830, @SjoerdMeijer wrote:

LGTM

Great, thanks Sjoerd!

I had to revert it because the testcase was failing on the bot. I rebased the patch yesterday and it was passing locally with f16 and f64, so maybe something changed between yesterday and today. I'll try to reproduce and reopen the diff if it needs changing.

Harbormaster completed remote builds in B88432: Diff 322324. Feb 9 2021, 3:34 AM

Harbormaster completed remote builds in B88434: Diff 322327. Feb 9 2021, 3:50 AM

thopre reopened this revision. Feb 9 2021, 7:31 AM

This revision is now accepted and ready to land. Feb 9 2021, 7:31 AM

Remove f16 tests, since f16 is not yet supported by the AArch64 backend: https://reviews.llvm.org/source/llvm-github/browse/main/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp$2277

In D91972#2551343, @thopre wrote:

Remove f16 tests which is not yet supported by AArch64 backend: https://reviews.llvm.org/source/llvm-github/browse/main/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp$2277

@SjoerdMeijer Ok to commit without f16 tests given the above? I've tested in debug mode this time, and thus with assertions on, and that passes without issue (it fails in that mode with the f16 cases).

Harbormaster completed remote builds in B88468: Diff 322389. Feb 9 2021, 8:30 AM

In D91972#2551345, @thopre wrote:

In D91972#2551343, @thopre wrote:

Remove f16 tests which is not yet supported by AArch64 backend: https://reviews.llvm.org/source/llvm-github/browse/main/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp$2277

@SjoerdMeijer Ok to commit without f16 tests given the above? I've tested in debug mode this time, and thus with assertions on, and that passes without issue (it fails in that mode with the f16 cases).

Ah, ok, sure.

Bonus points for following up to fix the f16 case then. ;-)

This revision was landed with ongoing or failed builds. Feb 11 2021, 6:19 AM

Closed by commit rGbad0290ce374: Improve STRICT_FSETCC codegen in absence of no NaN (authored by thopre).

This revision was automatically updated to reflect the committed changes.

thopre added a commit: rGbad0290ce374: Improve STRICT_FSETCC codegen in absence of no NaN.

This is an archive of the discontinued LLVM Phabricator instance.

Improve STRICT_FSETCC codegen in absence of no NaN
ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 307303

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/test/CodeGen/AArch64/arm64-fcmp-no-nans-opt.ll
