This is an archive of the discontinued LLVM Phabricator instance.

Differential D127964

[DCE] Eliminate no-op atan and atan2 calls
Closed, Public

Authored by mohammed-nurulhoque on Jun 16 2022, 7:26 AM.

Details

Reviewers

sepavloff
efriedma
spatel
dcandler
shchenz

Commits

rG30abc1a6a18e: [ConstantFolding] Eliminate atan and atan2 calls

Summary

From the opengroup specifications, atan2 can fail if the result underflows and atan can fail if the argument is subnormal.
In other cases we can eliminate the call to atan/atan2.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

mohammed-nurulhoque created this revision. Jun 16 2022, 7:26 AM

Herald added a project: Restricted Project. Jun 16 2022, 7:26 AM

Herald added a subscriber: hiraditya.

mohammed-nurulhoque requested review of this revision. Jun 16 2022, 7:26 AM

Herald added a subscriber: llvm-commits.

mohammed-nurulhoque added reviewers: eli.friedman, sepavloff. Jun 16 2022, 7:30 AM

mohammed-nurulhoque edited the summary of this revision.

Harbormaster completed remote builds in B170253: Diff 437531. Jun 16 2022, 8:29 AM

The opengroup specification (https://pubs.opengroup.org/onlinepubs/9699919799/functions/atan.html) states:

If x is subnormal, a range error may occur

It does not state that error shall occur, which means this behavior is optional. So constant evaluation hardly needs changes.

nikic edited reviewers, added: efriedma, spatel; removed: eli.friedman. Jun 17 2022, 12:30 AM

nikic added a subscriber: nikic.

nikic added inline comments.

llvm/test/Transforms/InstCombine/elimAtan.ll
2 ↗	(On Diff #437531)	Please run only `-instcombine`.

Changed the tests to only invoke -early-cse and moved the file to the earlyCSE tests folder

In D127964#3591112, @sepavloff wrote:
The opengroup specification (https://pubs.opengroup.org/onlinepubs/9699919799/functions/atan.html) states:
If x is subnormal, a range error may occur
It does not state that error shall occur, which means this behavior is optional. So constant evaluation hardly needs changes.

Yes. But before this patch, it doesn't consider any atan* call as No-Op, which means it's assuming all atan* as potentially failing.
This patch strictly limits the cases where an error might be expected

Harbormaster completed remote builds in B170470: Diff 437843. Jun 17 2022, 4:27 AM

ping

sepavloff added inline comments. Jul 6 2022, 10:08 AM

llvm/lib/Analysis/ConstantFolding.cpp
3302	Should we check for denormal argument? C11 standard (http://www.open-std.org/jtc1/sc22/wg14/www/docs/n1570.pdf, 7.12.4.3) does not mention any error condition. GLIBC documentation (https://www.gnu.org/software/libc/manual/html_node/Inverse-Trig-Functions.html) also says nothing about it. POSIX (https://pubs.opengroup.org/onlinepubs/9699919799/functions/atan.html) says range error is optional. Can we safely assume `atan` never sets error?

Updated, so atan & atan2 are always no-op. Since the errors are optional, never raising an error is compliant. It's also consistent with how the other math functions with optional errors are currently implemented.
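
As an illustration of what "optional" means in practice, here is a minimal sketch (not part of the patch) of how one could check what a given host libm reports for a subnormal argument. `AtanProbe`, `probeAtan` and `exampleAtanProbe` are hypothetical names. Whether `errno` or `FE_UNDERFLOW` is actually set is implementation-dependent, so the sketch records them rather than assuming a value.

```cpp
#include <cassert>
#include <cerrno>
#include <cfenv>
#include <cmath>
#include <limits>

// Hypothetical helper, not LLVM code: records what the host libm reports
// for one atan call.
struct AtanProbe {
  double value;   // result of atan(x)
  int err;        // errno right after the call (0 if left untouched)
  bool underflow; // true if FE_UNDERFLOW was raised
};

AtanProbe probeAtan(double x) {
  // Reading through a volatile forces a runtime call rather than
  // compile-time evaluation.
  volatile double input = x;
  errno = 0;
  std::feclearexcept(FE_ALL_EXCEPT);
  double result = std::atan(input);
  int err = errno;
  bool underflow = std::fetestexcept(FE_UNDERFLOW) != 0;
  return {result, err, underflow};
}

// Usage: only properties expected on common IEEE 754 hosts are asserted.
void exampleAtanProbe() {
  const double tiny = std::numeric_limits<double>::denorm_min();
  AtanProbe p = probeAtan(tiny);
  assert(p.value > 0.0 && p.value <= tiny); // atan(x) is about x for tiny x
  assert(p.err == 0 || p.err == ERANGE);    // POSIX: a range error "may" occur
}
```

Treating the call as a no-op corresponds to assuming that `err` is always 0 here, which the "may" wording permits.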

mohammed-nurulhoque marked an inline comment as done. Jul 12 2022, 12:05 PM

sepavloff added inline comments. Jul 13 2022, 11:10 AM

llvm/lib/Analysis/ConstantFolding.cpp
3371–3374	Both GLIBC and POSIX docs say that atan2(0, 0) is 0. But C11 says (7.12.4.4p2): "... A domain error may occur if both arguments are zero." Could you please elaborate on this question?

mohammed-nurulhoque added inline comments. Jul 14 2022, 1:27 AM

llvm/lib/Analysis/ConstantFolding.cpp
3371–3374	The C11 still says the error is optional, so I think it's correct not to raise the error. Furthermore, POSIX docs say explicitly "If both arguments are 0, a domain error shall not occur." Other than that, out of these 3, POSIX descriptions look the most complete here, as it enumerates a lot of corner cases, whereas GLIBC & C docs are high-level, so I'm inclined to go with the POSIX description here.

sepavloff added inline comments. Jul 15 2022, 6:16 AM

llvm/lib/Analysis/ConstantFolding.cpp
3371–3374	Please put here a comment saying that POSIX, GLIBC and MSVC assume atan2(0, 0) is 0, C11 says that a domain error may occur but does not require that.
llvm/test/Transforms/EarlyCSE/atan.ll
21	Why it is not constfolded? C11 (F.10.1.3) states: atan(±∞) returns ±π /2.
32	Why it is not constfolded? C11 (F.10p11) states: Functions with a NaN argument return a NaN result and raise no floating-point exception, except where stated otherwise.
49	Why it is not removed? The value is not used and it should not have side effect, no?

mohammed-nurulhoque updated this revision to Diff 445761. Jul 19 2022, 3:22 AM

mohammed-nurulhoque updated this revision to Diff 445764. Jul 19 2022, 3:30 AM

Add atan2 tests for Inf & NaN

mohammed-nurulhoque added inline comments. Jul 19 2022, 3:55 AM

llvm/test/Transforms/EarlyCSE/atan.ll
21	infinity and NaN are not folded because in `ConstantFoldScalarCall1` there's this snippet:

    /// We only fold functions with finite arguments. Folding NaN and inf is
    /// likely to be aborted with an exception anyway, and some host libms
    /// have known errors raising exceptions.
    if (!U.isFinite())
      return nullptr;

This is too conservative as your example shows, but changing it will affect most of the floating point functions, so it's probably best left for a separate commit. Interestingly, the same wasn't done for calls with 2 arguments as atan2 with Inf&NaN arguments are folded as the newly added test cases show.
49	It's removed indeed. I accidentally uploaded the wrong diff.
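
The asymmetry described in the comment on line 21 can be modelled outside LLVM. The following is a simplified sketch on plain `double` values. The real guard works on `APFloat` inside `ConstantFoldScalarCall1`, and `foldUnaryIfFinite`, `foldBinaryUnguarded` and `exampleFiniteGuard` are hypothetical names used only for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <optional>

// One-argument path: refuse to fold unless the argument is finite.
// Mirrors the quoted `if (!U.isFinite()) return nullptr;` check.
template <typename Fn>
std::optional<double> foldUnaryIfFinite(Fn hostFn, double arg) {
  if (!std::isfinite(arg))
    return std::nullopt;
  return hostFn(arg);
}

// Two-argument path as described in the thread: no finiteness guard,
// so Inf and NaN inputs are passed straight to the host function.
template <typename Fn>
std::optional<double> foldBinaryUnguarded(Fn hostFn, double a, double b) {
  return hostFn(a, b);
}

void exampleFiniteGuard() {
  auto atanFn = [](double v) { return std::atan(v); };
  auto atan2Fn = [](double y, double x) { return std::atan2(y, x); };
  assert(!foldUnaryIfFinite(atanFn, INFINITY));       // not folded
  assert(!foldUnaryIfFinite(atanFn, NAN));            // not folded
  assert(foldUnaryIfFinite(atanFn, 1.0).has_value()); // folded
  assert(foldBinaryUnguarded(atan2Fn, INFINITY, INFINITY).has_value());
}
```

This is why `callatanInf` and `callatanNaN` keep their calls in the test file while `callatan2_Inf` and `callatan2_NaN` are reduced to constants.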

Harbormaster completed remote builds in B176217: Diff 445768. Jul 19 2022, 4:25 AM

There's an earlier check in canConstantFoldCallTo() that appears to prevent folding on "atanl" (and other long double calls), but it would be good to include a test like this to confirm:

define x86_fp80 @atanl_x86(x86_fp80 %x) {
  %call = call x86_fp80 @atanl(x86_fp80 noundef 0xK3FFF8CCCCCCCCCCCCCCD)
  ret x86_fp80 %call
}

llvm/test/Transforms/EarlyCSE/atan.ll
2	Unless there's a reason/difference, I'd expect tests like this to be in the folder tests/Transforms/InstSimplify/ConstProp with similar tests for mathlib calls and use "RUN: opt -passes=instsimplify..."

add a long double test

mohammed-nurulhoque added inline comments. Aug 2 2022, 4:35 AM

llvm/test/Transforms/EarlyCSE/atan.ll
2	instsimplify does the "replacing the use of the call return value with a constant" part, but does not do the "eliminating the now redundant call" part. This patch adds the elimination not the replacement with constant.
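
To make the "is the call removable" decision concrete, here is a small standalone model of the checks visible in the ConstantFolding.cpp diff of this revision. It is a sketch on plain `double` values, not the LLVM implementation: `MathFn`, `isNoopUnary`, `isNoopBinary` and `exampleNoop` are hypothetical names, and the enum covers only a few functions.

```cpp
#include <cassert>
#include <cmath>

// Illustrative subset of libm functions; a stand-in for LLVM's LibFunc
// values, not the real type.
enum class MathFn { Asin, Acos, Atan, Atan2, Fmod };

// One-argument calls: true means "removable if the result is unused".
bool isNoopUnary(MathFn fn, double op) {
  switch (fn) {
  case MathFn::Atan:
    return true; // the optional range error is treated as never occurring
  case MathFn::Asin:
  case MathFn::Acos:
    return !(op < -1.0 || op > 1.0); // domain error outside [-1, 1]
  default:
    return false; // unknown: assume the call may set errno
  }
}

// Two-argument calls, following the same pattern.
bool isNoopBinary(MathFn fn, double op0, double op1) {
  switch (fn) {
  case MathFn::Atan2:
    return true; // as in the reviewed diff
  case MathFn::Fmod:
    return std::isnan(op0) || std::isnan(op1) ||
           (!std::isinf(op0) && op1 != 0.0);
  default:
    return false;
  }
}

void exampleNoop() {
  assert(isNoopUnary(MathFn::Atan, 1e300));
  assert(!isNoopUnary(MathFn::Asin, 2.0)); // asin(2) may raise a domain error
  assert(isNoopBinary(MathFn::Atan2, 1.0, 2.0));
  assert(!isNoopBinary(MathFn::Fmod, 1.0, 0.0));
}
```

Computing the replacement constant is a separate step, so a call can have its result replaced by a constant and still stay in the IR until a check like this says it has no side effects.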

Harbormaster completed remote builds in B178748: Diff 449260. Aug 2 2022, 6:50 AM

spatel added inline comments. Aug 5 2022, 1:27 PM

llvm/test/Transforms/EarlyCSE/atan.ll
2	Ah, I see now that the existing files are running -early-cse, but misplaced in the InstSimplify test directory. It will be easier to tell exactly what is changing if we pre-commit the tests with baseline results. Do you have commit access?

mohammed-nurulhoque added inline comments. Aug 8 2022, 1:45 AM

llvm/test/Transforms/EarlyCSE/atan.ll
2	Sounds good. I don't have commit access.

spatel mentioned this in rGb53d44fe4741: [EarlyCSE][ConstantFolding] add tests for atan/atan2; NFC. Aug 8 2022, 6:25 AM

Baseline tests committed:
b53d44fe474

Please rebase and add test comments to make it obvious why things are changing (or not changing).

Rebase tests on committed baseline tests

Harbormaster completed remote builds in B179908: Diff 450796. Aug 8 2022, 7:13 AM

I moved the existing (misplaced) test files:
59f3b3d7963b93
...so this will need another update. Also, see my suggestion to add test comments, so the current behavior is explained.

There's an open question about what to do with denormals. We recently added denorm-attribute-aware folding with:
D116952
D128647
...and we mentioned that it is a work-in-progress to get consistent behavior for intrinsics and libcalls. Exclude denorm inputs in this patch?

The denormal question will affect the exact constant that will be substituted for the call, but it doesn't affect whether atan/atan2 is free from side-effects, no?
This patch only changes whether atan/atan2 is safe to remove, it doesn't change how we substitute constants.

In D127964#3709349, @mohammed-nurulhoque wrote:

There's an open question about what to do with denormals. We recently added denorm-attribute-aware folding with:
D116952
D128647
...and we mentioned that it is a work-in-progress to get consistent behavior for intrinsics and libcalls. Exclude denorm inputs in this patch?

The denormal question will affect the exact constant that will be substituted for the call, but it doesn't affect whether atan/atan2 is free from side-effects, no?
This patch only changes whether atan/atan2 is safe to remove, it doesn't change how we substitute constants.

Denorms are mentioned specifically in the POSIX doc cited in an earlier comment:
https://pubs.opengroup.org/onlinepubs/9699919799/functions/atan.html

These functions may fail if
Range Error
  [MX] [Option Start] The value of x is subnormal.

But you're proposing that we take the permissive side of "may fail", so I guess that's fine. Please add test comments to make that clear.

rebase on moved tests & comment on error behaviour

LGTM - see inline comment for one more suggestion.

llvm/test/Transforms/EarlyCSE/atan.ll
14	Label this test and the next one with a TODO comment?

This revision is now accepted and ready to land. Aug 9 2022, 7:44 AM

Updated TODO comments. Thank you for the review. I don't have write permissions, so please land this patch. Here are my details:
Name: Mohammed Nurul Hoque
email: mohammed.nurulhoque@imgtec.com

Harbormaster completed remote builds in B180174: Diff 451152. Aug 9 2022, 11:05 AM

This revision was landed with ongoing or failed builds. Aug 10 2022, 8:01 AM

Closed by commit rG30abc1a6a18e: [ConstantFolding] Eliminate atan and atan2 calls (authored by mohammed-nurulhoque, committed by spatel).

This revision was automatically updated to reflect the committed changes.

spatel added a commit: rG30abc1a6a18e: [ConstantFolding] Eliminate atan and atan2 calls.

The llvm/test/Transforms/EarlyCSE/atan.ll test FAILs on Solaris (both sparcv9 and amd64):

/vol/llvm/src/llvm-project/local/llvm/test/Transforms/EarlyCSE/atan.ll:55:15: error: CHECK-NEXT: expected string not found in input
; CHECK-NEXT: ret float -0.000000e+00
              ^

Comparing to the Linux/x86_64 version of the output, the only difference is

--- /homes/ro/atan.ll.x86_64	2022-08-18 12:38:48.313115000 +0200
+++ /homes/ro/atan.ll.sparcv9	2022-08-18 11:50:22.976866000 +0200
@@ -25,7 +25,8 @@
 }
 
 define float @callatan2_00() {
-  ret float -0.000000e+00
+  %call = call float @atan2f(float -0.000000e+00, float 0.000000e+00)
+  ret float %call
 }
 
 define float @callatan2_x0() {

Any suggestion on how to handle this? Just XFAIL the test on Solaris?

In D127964#3731720, @ro wrote:
The llvm/test/Transforms/EarlyCSE/atan.ll test FAILs on Solaris (both sparcv9 and amd64):
/vol/llvm/src/llvm-project/local/llvm/test/Transforms/EarlyCSE/atan.ll:55:15: error: CHECK-NEXT: expected string not found in input
; CHECK-NEXT: ret float -0.000000e+00
              ^
Comparing to the Linux/x86_64 version of the output, the only difference is
--- /homes/ro/atan.ll.x86_64	2022-08-18 12:38:48.313115000 +0200
+++ /homes/ro/atan.ll.sparcv9	2022-08-18 11:50:22.976866000 +0200
@@ -25,7 +25,8 @@
 }
 
 define float @callatan2_00() {
-  ret float -0.000000e+00
+  %call = call float @atan2f(float -0.000000e+00, float 0.000000e+00)
+  ret float %call
 }
 
 define float @callatan2_x0() {
Any suggestion on how to handle this? Just XFAIL the test on Solaris?

That would be the quick fix to get things passing. But does that mean the mathlib on Solaris is setting errno on "atan2(-0.0, 0.0)" ?

IEEE says "atan2(±0, +0) is ±0".
POSIX has that case as optional - https://pubs.opengroup.org/onlinepubs/9699919799/functions/atan2.html :
"If y is ±0 and x is +0, ±0 shall be returned."
And also says:
"If both arguments are 0, a domain error shall not occur."
...but that's in the same optional block.

So the more general fix would be to disallow constant folding on this case (and also on "atan2(+0.0, 0.0)")?

In D127964#3732256, @spatel wrote:

That would be the quick fix to get things passing. But does that mean the mathlib on Solaris is setting errno on "atan2(-0.0, 0.0)" ?

IEEE says "atan2(±0, +0) is ±0".
POSIX has that case as optional - https://pubs.opengroup.org/onlinepubs/9699919799/functions/atan2.html :
"If y is ±0 and x is +0, ±0 shall be returned."
And also says:
"If both arguments are 0, a domain error shall not occur."
...but that's in the same optional block.

I've just checked: the difference is between gcc 11.3.0 and clang 15.0.0 (and older):

With gcc, I get both the expected return values and errno is 0.
With clang, the values remain correct, but somehow errno is set to EDOM.

However, this only happens because gcc, unlike clang, evaluates the atan2 calls at compile time. When compiling with -fno-builtin-atan2, both clang and gcc binaries behave the same.
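
A runtime check of this kind can be sketched as follows. This is an illustrative reconstruction, not the program used in the thread: `Atan2Probe`, `probeAtan2` and `exampleAtan2Probe` are hypothetical names. The `volatile` locals are there so that the compiler cannot evaluate the call at compile time, which is the effect described above for gcc.

```cpp
#include <cassert>
#include <cerrno>
#include <cmath>

// Hypothetical helper: result and errno of one runtime atan2 call.
struct Atan2Probe {
  double value;
  int err; // errno after the call; 0 if the libm left it untouched
};

Atan2Probe probeAtan2(double y, double x) {
  // volatile reads keep the arguments unknown at compile time, so the
  // library routine is really called.
  volatile double vy = y;
  volatile double vx = x;
  errno = 0;
  double result = std::atan2(vy, vx);
  return {result, errno};
}

void exampleAtan2Probe() {
  Atan2Probe p = probeAtan2(-0.0, 0.0);
  // The thread reports the returned value as correct on all platforms.
  assert(p.value == 0.0 && std::signbit(p.value));
  // errno differs: 0 on some libms, EDOM on Solaris per the thread.
  assert(p.err == 0 || p.err == EDOM);
}
```

Both outcomes for `err` are allowed by the C standard wording quoted in this thread, so the sketch accepts either.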

I've found a copy of the Solaris 9 libm sources and am looking at the atan2 implementation and error handling right now.

I've looked around some more and it seems the Solaris libm acts within the C standard: all of C99, p.219, C11, p.239, and C17, p.147 state

A domain error may occur if both arguments are zero.

I've also found the atan2 docs on cppreference.com which says the same, adding

If the implementation supports IEEE floating-point arithmetic (IEC 60559),

    If x and y are both zero, domain error does not occur
    If x and y are both zero, range error does not occur either

IEC 60559 support thus seems to be lacking on Solaris and, given that it's optional, the LLVM testsuite should cope either way.

spatel mentioned this in rG7f1262a322c0: [EarlyCSE][ConstantFolding] do not constant fold atan2(+/-0.0, +/-0.0). Aug 19 2022, 9:31 AM

In D127964#3735267, @ro wrote:
I've looked around some more and it seems the Solaris libm acts within the C standard: all of C99, p.219, C11, p.239, and C17, p.147 state
A domain error may occur if both arguments are zero.
I've also found the atan2 docs on cppreference.com which says the same, adding
If the implementation supports IEEE floating-point arithmetic (IEC 60559),

    If x and y are both zero, domain error does not occur
    If x and y are both zero, range error does not occur either
IEC 60559 support thus seems to be lacking on Solaris and, given that it's optional, the LLVM testsuite should cope either way.

Thanks for checking. I added tests and avoided folding on these patterns:
4bff1037bbfc3
7f1262a322c0

In the initial review, we misinterpreted the POSIX docs because the optional behavior specifier for raising errors is easily missed. If there are other cases where this patch overstepped, we should fix those too.

In D127964#3735654, @spatel wrote:

Thanks for checking. I added tests and avoided folding on these patterns:
4bff1037bbfc3
7f1262a322c0

In the initial review, we misinterpreted the POSIX docs because the optional behavior specifier for raising errors is easily missed. If there are other cases where this patch overstepped, we should fix those too.

Unfortunately, the fix isn't complete yet: I still get a FAIL on Solaris/amd64 (atan.ll.x86_64 is the Linux/x86_64 output, atan.ll.amd64 the Solaris/amd64 one):

--- atan.ll.x86_64	2022-08-19 23:01:10.257646000 +0200
+++ atan.ll.amd64	2022-08-19 23:00:57.605261000 +0200
@@ -26,22 +26,22 @@
 
 define float @callatan2_00() {
   %call = call float @atan2f(float 0.000000e+00, float 0.000000e+00)
-  ret float 0.000000e+00
+  ret float %call
 }
 
 define float @callatan2_n00() {
   %call = call float @atan2f(float -0.000000e+00, float 0.000000e+00)
-  ret float -0.000000e+00
+  ret float %call
 }
 
 define float @callatan2_0n0() {
   %call = call float @atan2f(float 0.000000e+00, float -0.000000e+00)
-  ret float 0x400921FB60000000
+  ret float %call
 }
 
 define float @callatan2_n0n0() {
   %call = call float @atan2f(float -0.000000e+00, float -0.000000e+00)
-  ret float 0xC00921FB60000000
+  ret float %call
 }
 
 define float @callatan2_x0() {

In D127964#3736290, @ro wrote:

Unfortunately, the fix isn't complete yet: I still get a FAIL on Solaris/amd64 (atan.ll.x86_64 is the Linux/x86_64 output, atan.ll.amd64 the Solaris/amd64 one):

IIUC, that bug existed before this patch; it was just exposed with the additional tests. I.e., because the library raises an exception on atan2(0, 0), we don't want to assume any particular constant-folded result.

A similar corner-case came up in the earlier comments: we avoid constant folding on 1-argument libm calls that potentially raise exceptions with a NaN/Inf input, but that check is missing on the path that handles 2-argument calls. And now we need a special-case for atan2 to bail out when both inputs are zero.
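
The special case described here can be modelled in a few lines. This is a hedged sketch on plain `double` values, not the code of the follow-up commit: the real folder operates on `APFloat`, and `foldAtan2` and `exampleFoldAtan2` are hypothetical names.

```cpp
#include <cassert>
#include <cmath>
#include <optional>

// Hypothetical model of the bail-out: decline to fold when both inputs
// are zero, because the host libm may report a domain error there.
std::optional<double> foldAtan2(double y, double x) {
  // `== 0.0` is true for both +0.0 and -0.0, so all four sign
  // combinations are rejected.
  if (y == 0.0 && x == 0.0)
    return std::nullopt;
  return std::atan2(y, x);
}

void exampleFoldAtan2() {
  assert(!foldAtan2(0.0, 0.0));
  assert(!foldAtan2(-0.0, 0.0));
  assert(!foldAtan2(0.0, -0.0));
  assert(!foldAtan2(-0.0, -0.0));
  assert(foldAtan2(1.0, 0.0).has_value()); // only one zero: still folded
}
```

Returning "no result" leaves the original call in place, so whatever the target's libm does for atan2(0, 0) at run time is preserved.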

spatel mentioned this in rG2981a9490277: [EarlyCSE][ConstantFolding] do not constant fold atan2(+/-0.0, +/-0.0), part 2. Aug 20 2022, 7:17 AM

Are tests passing on Solaris after 2981a9490277a7920936d287c?

In D127964#3737997, @spatel wrote:

Are tests passing on Solaris after 2981a9490277a7920936d287c?

They are indeed. The Solaris/amd64 is (almost) reliably green again after quite some time, and the Solaris/sparcv9 one is just one other (unrelated) failure away.

Thanks a lot.

Revision Contents

Path                                       Size
llvm/lib/Analysis/ConstantFolding.cpp      15 lines
llvm/test/Transforms/EarlyCSE/atan.ll      15 lines
llvm/test/Transforms/EarlyCSE/math-1.ll    1 line
llvm/test/Transforms/EarlyCSE/math-2.ll    1 line

Diff 451469

llvm/lib/Analysis/ConstantFolding.cpp

@@ 3,290 lines not shown @@ if (ConstantFP *OpC = dyn_cast<ConstantFP>(Call->getArgOperand(0))) {
         // FIXME: Stop using the host math library.
         // FIXME: The computation isn't done in the right precision.
         Type *Ty = OpC->getType();
         if (Ty->isDoubleTy() || Ty->isFloatTy() || Ty->isHalfTy())
           return ConstantFoldFP(tan, OpC->getValueAPF(), Ty) != nullptr;
         break;
       }
 
+      case LibFunc_atan:
+      case LibFunc_atanf:
+      case LibFunc_atanl:
+        // Per POSIX, this MAY fail if Op is denormal. We choose not failing.
+        return true;
+
+
       case LibFunc_asinl:
       case LibFunc_asin:
       case LibFunc_asinf:
       case LibFunc_acosl:
       case LibFunc_acos:
       case LibFunc_acosf:
         return !(Op < APFloat(Op.getSemantics(), "-1") ||
                  Op > APFloat(Op.getSemantics(), "1"));
@@ 49 lines not shown @@ if (Op0C && Op1C) {
       case LibFunc_fmod:
       case LibFunc_fmodf:
       case LibFunc_remainderl:
       case LibFunc_remainder:
       case LibFunc_remainderf:
         return Op0.isNaN() || Op1.isNaN() ||
                (!Op0.isInfinity() && !Op1.isZero());
 
+      case LibFunc_atan2:
+      case LibFunc_atan2f:
+      case LibFunc_atan2l:
+        // POSIX, GLIBC and MSVC dictate atan2(0,0) is 0 and no error is raised.
+        // C11 says that a domain error may optionally occur.
+        // This is consistent with both.
+        return true;
+
       default:
         break;
       }
     }
   }
 
   return false;
 }
 
 void TargetFolder::anchor() {}

llvm/test/Transforms/EarlyCSE/atan.ll

 ; XFAIL: system-aix
 ; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
 ; RUN: opt -early-cse -S < %s | FileCheck %s
 
 define float @callatan0() {
 ; CHECK-LABEL: @callatan0(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atanf(float -0.000000e+00)
 ; CHECK-NEXT: ret float -0.000000e+00
 ;
   %call = call float @atanf(float -0.0)
   ret float %call
 }
 
+; TODO: constant should be folded
 define float @callatanInf() {
 ; CHECK-LABEL: @callatanInf(
 ; CHECK-NEXT: [[CALL:%.*]] = call float @atanf(float 0x7FF0000000000000)
 ; CHECK-NEXT: ret float [[CALL]]
 ;
   %call = call float @atanf(float 0x7FF0000000000000)
   ret float %call
 }
 
+; TODO: constant should be folded
 define float @callatanNaN() {
 ; CHECK-LABEL: @callatanNaN(
 ; CHECK-NEXT: [[CALL:%.*]] = call float @atanf(float 0x7FF8000000000000)
 ; CHECK-NEXT: ret float [[CALL]]
 ;
   %call = call float @atanf(float 0x7FF8000000000000)
   ret float %call
 }
 
+; POSIX: May fail with Range Error. We choose not to fail.
 define float @callatanDenorm() {
 ; CHECK-LABEL: @callatanDenorm(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atanf(float 0x37A16C2000000000)
 ; CHECK-NEXT: ret float 0x37A16C2000000000
 ;
   %call = call float @atanf(float 0x37A16C2000000000)
   ret float %call
 }
 
-; long double calls currently not folded
+; TODO: long double calls currently not folded
 define x86_fp80 @atanl_x86(x86_fp80 %x) {
 ; CHECK-LABEL: @atanl_x86(
 ; CHECK-NEXT: [[CALL:%.*]] = call x86_fp80 @atanl(x86_fp80 noundef 0xK3FFF8CCCCCCCCCCCCCCD)
 ; CHECK-NEXT: ret x86_fp80 [[CALL]]
 ;
   %call = call x86_fp80 @atanl(x86_fp80 noundef 0xK3FFF8CCCCCCCCCCCCCCD)
   ret x86_fp80 %call
 }
 
 define float @callatan2_00() {
 ; CHECK-LABEL: @callatan2_00(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atan2f(float -0.000000e+00, float 0.000000e+00)
 ; CHECK-NEXT: ret float -0.000000e+00
 ;
   %call = call float @atan2f(float -0.0, float 0.0)
   ret float %call
 }
 
 define float @callatan2_x0() {
 ; CHECK-LABEL: @callatan2_x0(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atan2f(float 1.000000e+00, float -0.000000e+00)
 ; CHECK-NEXT: ret float 0x3FF921FB60000000
 ;
   %call = call float @atan2f(float 1.0, float -0.000000e+00)
   ret float %call
 }
 
 define float @callatan2_0x() {
 ; CHECK-LABEL: @callatan2_0x(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atan2f(float -0.000000e+00, float 1.000000e+00)
 ; CHECK-NEXT: ret float -0.000000e+00
 ;
   %call = call float @atan2f(float -0.0, float 1.0)
   ret float %call
 }
 
 define float @callatan2_xx() {
 ; CHECK-LABEL: @callatan2_xx(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atan2f(float -1.000000e+00, float 1.000000e+00)
 ; CHECK-NEXT: ret float 0xBFE921FB60000000
 ;
   %call = call float @atan2f(float -1.0, float 1.0)
   ret float %call
 }
 
 define float @callatan2_denorm() {
 ; CHECK-LABEL: @callatan2_denorm(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atan2f(float 0x39B4484C00000000, float 1.000000e+10)
 ; CHECK-NEXT: ret float 0x37A16C2000000000
 ;
   %call = call float @atan2f(float 0x39B4484C00000000, float 1.0e+10)
   ret float %call
 }
 
 define float @callatan2_flush_to_zero() {
 ; CHECK-LABEL: @callatan2_flush_to_zero(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atan2f(float 0x39B4484C00000000, float 0x4415AF1D80000000)
 ; CHECK-NEXT: ret float 0.000000e+00
 ;
   %call = call float @atan2f(float 0x39B4484C00000000, float 0x4415AF1D80000000)
   ret float %call
 }
 
 define float @callatan2_NaN() {
 ; CHECK-LABEL: @callatan2_NaN(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atan2f(float 0x7FF8000000000000, float 0x7FF8000000000000)
 ; CHECK-NEXT: ret float 0x7FF8000000000000
 ;
   %call = call float @atan2f(float 0x7FF8000000000000, float 0x7FF8000000000000)
   ret float %call
 }
 
 define float @callatan2_Inf() {
 ; CHECK-LABEL: @callatan2_Inf(
-; CHECK-NEXT: [[CALL:%.*]] = call float @atan2f(float 0x7FF0000000000000, float 0x7FF0000000000000)
 ; CHECK-NEXT: ret float 0x3FE921FB60000000
 ;
   %call = call float @atan2f(float 0x7FF0000000000000, float 0x7FF0000000000000)
   ret float %call
 }
 
 declare dso_local float @atanf(float) #0
 declare dso_local x86_fp80 @atanl(x86_fp80) #0
 
 declare dso_local float @atan2f(float, float) #0
 
 attributes #0 = { nofree nounwind willreturn }

llvm/test/Transforms/EarlyCSE/math-1.ll

@@ 16 lines not shown @@
 ;
   %res = tail call fast float @asinf(float 1.0)
   ret float %res
 }
 
 declare double @atan(double) #0
 define double @f_atan() {
 ; CHECK-LABEL: @f_atan(
-; CHECK-NEXT: [[RES:%.*]] = tail call fast double @atan(double 1.000000e+00)
 ; CHECK-NEXT: ret double 0x3FE921FB
 ;
   %res = tail call fast double @atan(double 1.0)
   ret double %res
 }
 
 declare float @cosf(float) #0
 define float @f_cosf() {
@@ 164 lines not shown @@

llvm/test/Transforms/EarlyCSE/math-2.ll

 ; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
 ; RUN: opt -early-cse -earlycse-debug-hash -S -o - %s | FileCheck %s
 
 declare double @atan2(double, double) #0
 define double @f_atan2() {
 ; CHECK-LABEL: @f_atan2(
-; CHECK-NEXT: [[RES:%.*]] = tail call fast double @atan2(double 1.000000e+00, double 2.000000e+00)
 ; CHECK-NEXT: ret double 0x3FDDAC6{{.+}}
 ;
   %res = tail call fast double @atan2(double 1.0, double 2.0)
   ret double %res
 }
 
 declare float @fmodf(float, float) #0
 define float @f_fmodf() {
@@ 87 lines not shown @@