This is an archive of the discontinued LLVM Phabricator instance.

Differential D116952

[ConstantFolding] Respect denormal handling mode attributes when folding instructions
Closed, Public

Authored by dcandler on Jan 10 2022, 9:15 AM.

Details

Reviewers

spatel
craig.topper
nikic
lebedev.ri

Commits

rGd3919a8cc503: [ConstantFolding] Respect denormal handling mode attributes when folding…

Summary

Depending on the environment, a floating point instruction should
treat denormal inputs as zero, and/or flush a denormal output to zero.
Denormals are not currently accounted for when an instruction gets
folded to a constant, which can lead to differences in output between
a folded and an unfolded instruction when running on the target. The
denormal handling mode can be set by the function level attribute
denormal-fp-math, which this patch uses to determine whether any
denormal inputs to or outputs from folding should be zero, and to
set the sign of that zero appropriately.
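The behaviour described in the summary can be sketched outside LLVM as follows. This is a minimal standalone illustration, not the patch's code: `Mode`, `flush`, and `foldFAdd` are hypothetical names, and the host's `double` addition stands in for LLVM's constant arithmetic, assuming the host itself runs with IEEE gradual underflow.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-in for the three handling modes discussed in this review.
enum class Mode { IEEE, PreserveSign, PositiveZero };

// Replace a denormal value with a zero chosen by the mode.
// Non-denormal values, and every value in IEEE mode, pass through unchanged.
inline double flush(double V, Mode M) {
  if (M == Mode::IEEE || std::fpclassify(V) != FP_SUBNORMAL)
    return V;
  if (M == Mode::PositiveZero)
    return 0.0;
  return std::copysign(0.0, V); // PreserveSign: zero with the original sign
}

// "Fold" an fadd: flush both inputs, add, then flush the result.
// Input and output modes are separate because they can be set separately.
inline double foldFAdd(double LHS, double RHS, Mode Out, Mode In) {
  double Sum = flush(LHS, In) + flush(RHS, In);
  return flush(Sum, Out);
}
```

With the smallest normal double negated as one operand and the smallest denormal as the other, the four combinations exercised in the checks give four distinct results: a negative denormal, -0.0, +0.0, or the untouched negative normal.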

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dcandler created this revision. Jan 10 2022, 9:15 AM

Herald added a subscriber: hiraditya. Jan 10 2022, 9:15 AM

dcandler requested review of this revision. Jan 10 2022, 9:15 AM

Herald added a project: Restricted Project. Jan 10 2022, 9:15 AM

Herald added a subscriber: llvm-commits.

Does "original operation" mean what hardware would do? Not all targets support flushing denormals to zero. Even within X86, SSE can flush denormals to zero, but X87 can't.

Harbormaster completed remote builds in B142463: Diff 398664. Jan 10 2022, 10:01 AM

Thanks for highlighting that. Producing the appropriate result for the hardware was what I meant, so based on that I would have to rework this to handle different targets.

It doesn't look like constant folding currently has such target knowledge, and the only obvious solution I can see would be to use TargetTransformInfo to determine if the target should flush denormals from folding. However this would also mean passing TTI around much more in order to pass it down through to the constant folding functions wherever they get used.

lenary added a subscriber: lenary. Jan 13 2022, 4:31 AM

I've updated the patch to use TargetTransformInfo to determine whether the target supports flushing to zero. I'm slightly wary, as there is a warning discouraging the use of TargetTransformInfo, but I'm unsure of an alternative.

lenary added inline comments. Jan 20 2022, 5:22 AM

llvm/lib/Analysis/ConstantFolding.cpp
1004	You should probably also explain that the `return nullptr` means if you don't have a TTI, you are explicitly preventing any constant folding of operations that produce denormals, rather than continuing to fold to the denormal value.

Harbormaster completed remote builds in B144562: Diff 401607. Jan 20 2022, 5:55 AM

Ping for any other comments.

I've also added the suggested comment in the code. And yes, one result of this patch would indeed be that calls to constant folding from passes other than instruction combining would not be able to fold floating point instructions that result in a denormal - since they lack TTI. While it would be possible to add TTI to those passes, that becomes a significantly larger change and I was unsure how necessary that would be if instruction combining can already handle this case.

Harbormaster completed remote builds in B146043: Diff 403669. Jan 27 2022, 12:11 PM

Definitely not a fan of this change, on multiple levels. I don't particularly look forward to having a TTI dependency in constant folding, but more immediately, I don't think I really understand the LangRef basis for this change.

Could you explain which constituent part of fast FMF specifically allows this optimization? I don't see anything related to denormals, and interpreting afn to apply to primitive operations would be a bit of a stretch.

How does this relate to the denormal-fp-math function attribute? I would have thought that this is the one that controls denormal flushing behavior.

This revision now requires changes to proceed. Jan 29 2022, 1:27 AM

Sorry, I didn't look at this review sooner, but I agree with @nikic. The TTI warning comment in instcombine was supposed to prevent this kind of proposal, and this seems to be mixing up seemingly unrelated pieces of FP behavior. It might help to see a source/motivating example to understand if there's any realistic solution to the problem.

Thanks for taking a look, it does appear I misunderstood a few things.

The original motivation was that downstream we found a case where the output of the compiled program is different between O0 and O1: at O0 a floating point instruction executes on the target and produces a zero, but at O1 the instruction gets constant folded to a denormal value. So I have been exploring whether it would be possible to handle denormals at the time of folding, since this occurs before other combination passes (e.g. DAGCombiner).

The denormal-fp-math attribute does contain the relevant information about the floating point environment, and should already be accessible during folding via the instruction pointer (assuming it belongs to a function at the time). So I believe I could potentially rework this to avoid pulling in target info by checking denormal-fp-math instead, if that would be more acceptable. Reading the language reference, the attribute doesn't mandate flushing denormals to zero, but does suggest inputs should be treated as zero, which constant folding also does not currently respect. On testing, this can lead to similar differences in output when folding a floating point instruction where one input is a denormal, so it may make sense to check the inputs as well as the output in ConstantFoldInstOperands.

If we can use the function attribute to get the desired result, I think that would be fine.
In an ideal world, we would have all of the FP settings in one place, but FMF became part of the bonus bits in an instruction, and there's not enough space there to represent variations like denorm or sqrt specializations.
If the attribute is not specified as needed, then we should clarify/enhance that in LangRef.

I've updated the patch with a new version which now takes the denormal handling mode from the function attribute, and adjusted the title/summary to reflect this. It supports different settings for the inputs and outputs of the instruction, and handles both flushing to positive zero and flushing to a zero that preserves the sign.
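As a rough illustration of how such an attribute string splits into an output mode and an input mode, here is a standalone sketch. `Mode`, `DenormalSetting`, `parseMode`, and `parseDenormalAttr` are hypothetical names, not LLVM's own parsing code. It assumes the value is an "output,input" pair and that a single value applies to both, which matches the Alive output later in this thread, where "positive-zero" is printed as "positive-zero,positive-zero".

```cpp
#include <cassert>
#include <string>

// Hypothetical representation of one component of the attribute value.
enum class Mode { IEEE, PreserveSign, PositiveZero, Invalid };

// Output mode first, input mode second.
struct DenormalSetting {
  Mode Output;
  Mode Input;
};

// Map one component string to a mode; anything unrecognised is Invalid.
inline Mode parseMode(const std::string &S) {
  if (S == "ieee")
    return Mode::IEEE;
  if (S == "preserve-sign")
    return Mode::PreserveSign;
  if (S == "positive-zero")
    return Mode::PositiveZero;
  return Mode::Invalid;
}

// Split "output,input"; if there is no comma, reuse the output mode for inputs.
inline DenormalSetting parseDenormalAttr(const std::string &Attr) {
  std::string::size_type Comma = Attr.find(',');
  Mode Out = parseMode(Attr.substr(0, Comma));
  Mode In = (Comma == std::string::npos) ? Out
                                         : parseMode(Attr.substr(Comma + 1));
  return {Out, In};
}
```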

While testing I found the denormal-fp-math-f32 attribute was being set unexpectedly, but I've created a separate patch to deal with that: https://reviews.llvm.org/D122589. It should not be an issue for this patch however, since it simply uses the attributes as given.

Herald added a project: Restricted Project. Mar 28 2022, 8:52 AM

Herald added a subscriber: StephenFan.

ConstantFoldBinaryOpOperands has other callers; are you planning to fix each of them separately?

Assuming IEEE denormal handling if we can't find the parent Function seems a bit dubious, but maybe it's the best we can do for now. It's not clear to me how this interacts with floating-point constant expressions (e.g. ConstantExpr::getFAdd). I guess we could just kill off floating-point constant expressions, since they aren't really useful in practice, but that's a non-trivial effort.

Harbormaster completed remote builds in B156574: Diff 418604. Mar 29 2022, 1:02 PM

When I originally checked, the other calls to ConstantFoldBinaryOpOperands did not look like they would be handling floating point instructions, although on a second look I had missed InstructionSimplify::foldOrCommuteConstant. The same approach should work there too though, so I can expand the patch to cover that usage.

If the instruction lacks a parent function, then the alternative to defaulting to IEEE would be not folding at all. The impact of that could be limited to only those instructions where a denormal is detected in the input/output; if there's no denormal, there's no need for a parent function so folding can proceed.

Constant expressions won't currently be affected by this, although potentially they could also follow the principle of only getting folded if denormals are not involved.

dcandler mentioned this in D122589: Additionally set f32 mode with denormal-fp-math. Mar 30 2022, 9:35 AM

In D116952#3417056, @dcandler wrote:

If the instruction lacks a parent function, then the alternative to defaulting to IEEE would be not folding at all. The impact of that could be limited to only those instructions where a denormal is detected in the input/output; if there's no denormal, there's no need for a parent function so folding can proceed.

Maybe... refusing to fold only when we see a denormal might lead to bugs which only show up for specific constants, though. I think the current approach of just continuing to do IEEE folding is fine as an incremental step.

Constant expressions won't currently be affected by this, although potentially they could also follow the principle of only getting folded if denormals are not involved.

My concern with constant expressions here is mostly optimizations turning instructions into constant expressions, e.g. TargetFolder/ConstantFolder. If frontends create constant expressions, it's less important what happens.

spatel added inline comments. Mar 30 2022, 11:21 AM

llvm/lib/Analysis/ConstantFolding.cpp
1002	fneg is not a computational FP operation; it's a signbit operation. For example on x86 with SSE, it's implemented with a vector integer xor instruction, so it is not affected by denorm FP mode. I'm not sure what happens on targets that have a real fneg instruction. Either way, we need at least one test to check the behavior.

I've removed FNeg from the changes; indeed it wasn't affected by denormal mode wherever I tried. That did allow me to refactor slightly to wrap around just ConstantFoldBinaryOpOperands, and fall back to that when dealing with a constant expression or functionless instruction just as before. So for the moment the denormal mode information is only applied in situations where it is available, which is one step forward at least.
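The point about fneg being a sign-bit operation can be illustrated with a standalone sketch. `bitsOf`, `fromBits`, and `fnegViaXor` are hypothetical helpers; the xor mirrors the integer-xor implementation mentioned for x86 with SSE, and is not a claim about how any other target lowers fneg.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Raw IEEE-754 bit pattern of a double.
inline std::uint64_t bitsOf(double V) {
  std::uint64_t B;
  std::memcpy(&B, &V, sizeof B);
  return B;
}

// Rebuild a double from a raw bit pattern.
inline double fromBits(std::uint64_t B) {
  double V;
  std::memcpy(&V, &B, sizeof V);
  return V;
}

// Negate by flipping only the sign bit; no floating point arithmetic is
// performed, so a denormal payload is left exactly as it was.
inline double fnegViaXor(double V) {
  return fromBits(bitsOf(V) ^ (std::uint64_t{1} << 63));
}
```

Because only bit 63 changes, a denormal input stays a denormal with the same significand bits, which is why a flushing mode has nothing to act on here.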

Harbormaster completed remote builds in B158705: Diff 421535. Apr 8 2022, 9:43 AM

Please add a testcase for opt -instsimplify.

llvm/lib/Analysis/InstructionSimplify.cpp
615	Maybe we should just return early if CxtI is null, instead of falling back to ConstantFoldBinaryOpOperands?

In D116952#3439943, @efriedma wrote:

Please add a testcase for opt -instsimplify.

Right - I don't think we want any -instcombine tests with this patch. It should be completely testable from -instsimplify. And we should vary the opcodes (not just fmul), so we have at least partial coverage for each one (plus a negative test for "fneg").

arsenm added a subscriber: arsenm. Apr 18 2022, 1:37 PM

arsenm added inline comments.

llvm/lib/Analysis/ConstantFolding.cpp
1327	Don't need llvm::
llvm/test/Transforms/InstCombine/AArch64/constant-fold-fp-denormal.ll
3 ↗	(On Diff #421535)	Don't need a specific target here
4 ↗	(On Diff #421535)	Would it be helpful to have some constantexpr cases in a global initializer?

I've moved the tests to instsimplify, and expanded them out to cover more cases and additional instructions. While there may be some overlap, this structures it a bit better and ensures cases don't get conflated: depending on the instruction some zero results can be obtained from either the input getting zeroed or the output getting zeroed, so it's better to test both separately.

dcandler marked 3 inline comments as done. May 3 2022, 1:56 AM

dcandler added inline comments.

llvm/lib/Analysis/InstructionSimplify.cpp
615	Returning early here would change the result for existing cases where there is no instruction pointer, so constant expressions won't get folded where they would before.
llvm/test/Transforms/InstCombine/AArch64/constant-fold-fp-denormal.ll
4 ↗	(On Diff #421535)	I don't think it's necessary to test those if they aren't going to be affected by setting the function attribute.

Harbormaster completed remote builds in B162392: Diff 426602. May 3 2022, 2:36 AM

I don't understand why we have (duplicate?) tests with -instcombine. Also, why are there ARM and AArch test file variants? IIUC, the target makes no difference - the behavior is completely specified by the function attributes.
I like that we're testing each opcode with each attribute combination for thoroughness, but I'd prefer to have that all in one file rather than split by opcode. Wouldn't it be easier to see the progression if the tests were ordered based on the attributes rather than opcode? I.e., if we're testing that an input is flushed to zero, then that denorm constant could be repeated N times in a row independently of the opcode.

Once we have the right set of tests in place, you can pre-commit them with baseline CHECK lines, and then it will be easier to see how this patch changes functionality (and we can add test comments to explain more if needed).

Sorry, the instcombine tests are from the previous version and shouldn't have been included in the diff.

Originally I put the tests in ARM/AArch64 because those seemed the relevant ones where you'd expect to see the attributes with all the different modes, but you're right; having the tests in both is redundant, and no target is needed at all really when the test is specifying the attributes. So I will combine the opcodes into one file, and can move them down a folder.

On ordering, while one set of inputs would work for multiple attributes/opcodes when testing the inputs are correctly flushed, testing that the output is flushed requires specific inputs for each opcode/attribute combination to produce a subnormal output. Where possible, I tried to pick input values relevant to the opcode where one set of inputs produced different results based on the attribute and grouped based on that, since then the effect of the attribute is visible at a glance. For example with fadd, the same inputs and opcode can produce four different results depending on which attribute is used, and the result of the input getting flushed is distinct from the result when the output is flushed. Keeping those tests together felt more readable than continually changing the inputs to order by attribute first.

Tests moved out and pre-committed in https://reviews.llvm.org/D125807

dcandler added a parent revision: D125807: [ConstantFolding] Pre-commit tests showing denormal handling during folding. May 17 2022, 9:49 AM

Harbormaster completed remote builds in B164924: Diff 430100. May 17 2022, 10:49 AM

Ping

LGTM - thanks for the thorough tests!
See inline for some minor cleanups.

llvm/lib/Analysis/ConstantFolding.cpp
1011	typo: separately. Hopefully, we'll get rid of FP constant expressions, so there won't be a discrepancy in the future.
1012	"if a constant"
1318	typo: separately
1366	typo: instruction

This revision was not accepted when it landed; it landed in state Needs Review. Jun 20 2022, 8:43 AM

This revision was landed with ongoing or failed builds.

Closed by commit rGd3919a8cc503: [ConstantFolding] Respect denormal handling mode attributes when folding… (authored by dcandler).

This revision was automatically updated to reflect the committed changes.

dcandler added a commit: rGd3919a8cc503: [ConstantFolding] Respect denormal handling mode attributes when folding….

define zeroext i1 @foo() #0 {
  %_add = fadd fast double 1.264810e-321, 3.789480e-321
  %_res = fcmp fast une double %_add, 5.054290e-321
  ret i1 %_res
}

attributes #0 = { "denormal-fp-math"="positive-zero" }

llc 1.ll -mtriple=powerpc64le-unknown-linux-gnu

Hi, this patch causes mis-compile for above case, now %_res is true while before this patch it is false. We can not handle the denormal constantFP in fcmp? Will the denormal constantFP be in other opcodes as well?

Thanks.

In D116952#3600716, @shchenz wrote:
define zeroext i1 @foo() #0 {
  %_add = fadd fast double 1.264810e-321, 3.789480e-321
  %_res = fcmp fast une double %_add, 5.054290e-321
  ret i1 %_res
}

attributes #0 = { "denormal-fp-math"="positive-zero" }
llc 1.ll -mtriple=powerpc64le-unknown-linux-gnu
Hi, this patch causes mis-compile for above case, now %_res is true while before this patch it is false. We can not handle the denormal constantFP in fcmp? Will the denormal constantFP be in other opcodes as well?

Alive says that LLVM is correct:

define i1 @foo() denormal-fp-math=positive-zero,positive-zero {
  %_add = fadd double 0.000000, 0.000000, exceptions=ignore
  %_res = fcmp une double %_add, 0.000000
  ret i1 %_res
}
=>
define i1 @foo() noread nowrite nofree willreturn denormal-fp-math=positive-zero,positive-zero {
  ret i1 1
}
Transformation seems to be correct!

In D116952#3601629, @nlopes wrote:
Alive says that LLVM is correct:
define i1 @foo() denormal-fp-math=positive-zero,positive-zero {
  %_add = fadd double 0.000000, 0.000000, exceptions=ignore
  %_res = fcmp une double %_add, 0.000000
  ret i1 %_res
}
=>
define i1 @foo() noread nowrite nofree willreturn denormal-fp-math=positive-zero,positive-zero {
  ret i1 1
}
Transformation seems to be correct!

Alive 2 says "ERROR: Couldn't prove the correctness of the transformation"...

https://alive2.llvm.org/ce/z/GPLj4Z

And from the semantic of the case, %_add not equal to 5.054290e-321 should be wrong, (1.264810e-321 + 3.789480e-321 == 5.054290e-321), so we should expect false here?

In D116952#3601819, @shchenz wrote:
Alive 2 says "ERROR: Couldn't prove the correctness of the transformation"...

https://alive2.llvm.org/ce/z/GPLj4Z

The online version is outdated, sorry.

And from the semantic of the case, %_add not equal to 5.054290e-321 should be wrong, (1.264810e-321 + 3.789480e-321 == 5.054290e-321), so we should expect false here?

%_add = #x00000000000003ff
Which is a subnormal, so per the function attribute it is changed to +0.0.
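The bit pattern quoted here can be checked by hand. In a double, a denormal's raw bits are its value counted in units of 2^-1074, so adding two small positive denormals is the same as adding their bit patterns as integers. The sketch below is a standalone illustration, assuming the compiler rounds the decimal literals from the test case to the nearest double; `bitsOf`, `fromBits`, and `isDenormal` are hypothetical helper names.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Raw IEEE-754 bit pattern of a double.
inline std::uint64_t bitsOf(double V) {
  std::uint64_t B;
  std::memcpy(&B, &V, sizeof B);
  return B;
}

// Rebuild a double from a raw bit pattern.
inline double fromBits(std::uint64_t B) {
  double V;
  std::memcpy(&V, &B, sizeof V);
  return V;
}

// True if V is a denormal (subnormal) value.
inline bool isDenormal(double V) {
  return std::fpclassify(V) == FP_SUBNORMAL;
}

// The operands of the reported fadd are 0x100 and 0x2ff units of 2^-1074;
// their sum, 0x3ff units, is the constant the fcmp compares against.
inline std::uint64_t reportedSumBits() {
  return bitsOf(1.264810e-321) + bitsOf(3.789480e-321);
}
```

Under IEEE handling the sum equals the comparison constant, so `une` would be false. With "positive-zero" the denormal sum is replaced by +0.0 before the comparison, while the folded fcmp at this point in the thread still sees the constant as a nonzero denormal, which is how the folded result becomes true.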

Hi @nlopes, thanks for providing the useful info. However I am still not very clear about how to deal with our internal failure after this patch.

define i1 @foo() denormal-fp-math=positive-zero,positive-zero {
  %_add = fadd double 0.000000, 0.000000, exceptions=ignore
  %_res = fcmp une double %_add, 0.000000
  ret i1 %_res
}
=>
define i1 @foo() noread nowrite nofree willreturn denormal-fp-math=positive-zero,positive-zero {
  ret i1 1
}

The alive result is very confusing. I don't understand why 0.000000 + 0.000000 != 0x000000 when denormal-fp-math=positive-zero, could you help to explain?
I know you said online Alive2 is outdated, but seems the online Alive2 gets opposite result for the above 0.000000 case, it verifies ret i1 0 as the valid transformation. https://alive2.llvm.org/ce/z/PjhR3U

There is a C case too:

int main(void)
{
  double a = 1.264810e-321;
  double b = 3.789480e-321;

  return (a + b != 5.054290e-321);
}

clang 1.c -Ofast -fdenormal-fp-math=positive-zero without this patch, it gets 0 and with this patch, it gets 1. Our internal test expects 0 here.

I also tested above case with XLC(-Ofast -qnostrict)/GCC(-Ofast -funsafe-math-optimizations) on PowerPC, they both get 0.

Could you please tell me what's wrong with our internal failure? Thanks in advance. @nlopes @dcandler

In D116952#3603700, @shchenz wrote:
Hi @nlopes, thanks for providing the useful info. However I am still not very clear about how to deal with our internal failure after this patch.
define i1 @foo() denormal-fp-math=positive-zero,positive-zero {
  %_add = fadd double 0.000000, 0.000000, exceptions=ignore
  %_res = fcmp une double %_add, 0.000000
  ret i1 %_res
}
=>
define i1 @foo() noread nowrite nofree willreturn denormal-fp-math=positive-zero,positive-zero {
  ret i1 1
}
The alive result is very confusing. I don't understand why 0.000000 + 0.000000 != 0x000000 when denormal-fp-math=positive-zero, could you help to explain?

True, the output isn't great (floats are truncated, hence the 0.000000, which is not what's underneath). But what matters is the final result.

I know you said online Alive2 is outdated, but seems the online Alive2 gets opposite result for the above 0.000000 case, it verifies ret i1 0 as the valid transformation. https://alive2.llvm.org/ce/z/PjhR3U

The online version of Alive2 doesn't implement the denormal-fp-math attribute.

There is a C case too:
int main(void)
{
  double a = 1.264810e-321;
  double b = 3.789480e-321;

  return (a + b != 5.054290e-321);
}
clang 1.c -Ofast -fdenormal-fp-math=positive-zero without this patch, it gets 0 and with this patch, it gets 1. Our internal test expects 0 here.

Look at the generated assembly with -O0 with and without -fdenormal-fp-math. There's no difference. So it seems that this flag doesn't guarantee anything (it's best effort) or it's not fully implemented yet.
Nevertheless, your internal test is wrong. Check the math here: https://en.wikipedia.org/wiki/Double-precision_floating-point_format#Exponent_encoding

Thanks. I need some time to have a better understanding.

So GCC/XLC both returning 0 for the C case is caused by -fdenormal-fp-math=positive-zero not implemented or not used in the command line? I tested with clang, without -fdenormal-fp-math=positive-zero, it also returns 0 with this patch.

dcandler mentioned this in rGd3919a8cc503: [ConstantFolding] Respect denormal handling mode attributes when folding…. Jun 23 2022, 5:13 AM

In D116952#3600716, @shchenz wrote:

Hi, this patch causes mis-compile for above case, now %_res is true while before this patch it is false. We can not handle the denormal constantFP in fcmp? Will the denormal constantFP be in other opcodes as well?

Catching up after being out for a few days...
Yes, we'll need to make more of FP constant-folding aware of these function attributes to get consistent results.
D128647 looks like it will fix fcmp. We'll need something similar for constant-folded FP intrinsics and libcalls too. And there was a comment about updating LangRef to document the behavior (the flushing mode does not affect signbit ops like fneg/fabs/copysign).

Thanks for confirming.

spatel mentioned this in D128647: [InstructionSimplify] handle denormal constant input for fcmp. Jun 29 2022, 7:27 AM

spatel mentioned this in D127964: [DCE] Eliminate no-op atan and atan2 calls. Aug 8 2022, 7:31 AM

Revision Contents

Path	Size
llvm/include/llvm/Analysis/ConstantFolding.h	7 lines
llvm/lib/Analysis/ConstantFolding.cpp	75 lines
llvm/lib/Analysis/InstructionSimplify.cpp	14 lines
llvm/test/Transforms/InstSimplify/constant-fold-fp-denormal.ll	208 lines

Diff 438423

llvm/include/llvm/Analysis/ConstantFolding.h

...
 Constant *ConstantFoldUnaryOpOperand(unsigned Opcode, Constant *Op,
                                      const DataLayout &DL);

 /// Attempt to constant fold a binary operation with the specified
 /// operands. If it fails, it returns a constant expression of the specified
 /// operands.
 Constant *ConstantFoldBinaryOpOperands(unsigned Opcode, Constant *LHS,
                                        Constant *RHS, const DataLayout &DL);

+/// Attempt to constant fold a floating point binary operation with the
+/// specified operands, applying the denormal handling mod to the operands. If
+/// it fails, it returns a constant expression of the specified operands.
+Constant *ConstantFoldFPInstOperands(unsigned Opcode, Constant *LHS,
+                                     Constant *RHS, const DataLayout &DL,
+                                     const Instruction *I);
+
 /// Attempt to constant fold a select instruction with the specified
 /// operands. The constant result is returned if successful; if not, null is
 /// returned.
 Constant *ConstantFoldSelectInstruction(Constant *Cond, Constant *V1,
                                         Constant *V2);

 /// Attempt to constant fold a cast with the specified operand. If it
 /// fails, it returns a constant expression of the specified operand.
...

llvm/lib/Analysis/ConstantFolding.cpp

Show First 20 Lines • Show All 993 Lines • ▼ Show 20 Lines	Constant ConstantFoldInstOperandsImpl(const Value InstOrCE, unsigned Opcode,
ArrayRef<Constant *> Ops,		ArrayRef<Constant *> Ops,
const DataLayout &DL,		const DataLayout &DL,
const TargetLibraryInfo *TLI) {		const TargetLibraryInfo *TLI) {
Type *DestTy = InstOrCE->getType();		Type *DestTy = InstOrCE->getType();

if (Instruction::isUnaryOp(Opcode))		if (Instruction::isUnaryOp(Opcode))
return ConstantFoldUnaryOpOperand(Opcode, Ops[0], DL);		return ConstantFoldUnaryOpOperand(Opcode, Ops[0], DL);

if (Instruction::isBinaryOp(Opcode))		if (Instruction::isBinaryOp(Opcode)) {
		spatelUnsubmitted Done Reply Inline Actions fneg is not a computational FP operation; it's a signbit operation. For example on x86 with SSE, it's implemented with a vector integer xor instruction, so it is not affected by denorm FP mode. I'm not sure what happens on targets that have a real fneg instruction. Either way, we need at least one test to check the behavior. spatel: fneg is not a computational FP operation; it's a signbit operation. For example on x86 with SSE…
		switch (Opcode) {
		default:
		lenaryUnsubmitted Not Done Reply Inline Actions You should probably also explain that the `return nullptr` means if you don't have a TTI, you are explicitly preventing any constant folding of operations that produce denormals, rather than continuing to fold to the denormal value. lenary: You should probably also explain that the `return nullptr` means if you don't have a TTI, you…
		break;
		case Instruction::FAdd:
		case Instruction::FSub:
		case Instruction::FMul:
		case Instruction::FDiv:
		case Instruction::FRem:
		// Handle floating point instructions separately to account for denormals
		spatelUnsubmitted Not Done Reply Inline Actions typo: separately Hopefully, we'll get rid of FP constant expressions, so there won't be a discrepancy in the future. spatel: typo: separately Hopefully, we'll get rid of FP constant expressions, so there won't be a…
		// TODO: If a constant expression is being folded rather than an
		spatel (Not Done): "if a constant"
		// instruction, denormals will not be flushed/treated as zero
		if (const auto *I = dyn_cast<Instruction>(InstOrCE)) {
		return ConstantFoldFPInstOperands(Opcode, Ops[0], Ops[1], DL, I);
		}
		}
return ConstantFoldBinaryOpOperands(Opcode, Ops[0], Ops[1], DL);		return ConstantFoldBinaryOpOperands(Opcode, Ops[0], Ops[1], DL);
		}

if (Instruction::isCast(Opcode))		if (Instruction::isCast(Opcode))
return ConstantFoldCastOperand(Opcode, Ops[0], DestTy, DL);		return ConstantFoldCastOperand(Opcode, Ops[0], DestTy, DL);

if (auto *GEP = dyn_cast<GEPOperator>(InstOrCE)) {		if (auto *GEP = dyn_cast<GEPOperator>(InstOrCE)) {
if (Constant *C = SymbolicallyEvaluateGEP(GEP, Ops, DL, TLI))		if (Constant *C = SymbolicallyEvaluateGEP(GEP, Ops, DL, TLI))
return C;		return C;

Constant *llvm::ConstantFoldBinaryOpOperands(unsigned Opcode, Constant *LHS,
assert(Instruction::isBinaryOp(Opcode));		assert(Instruction::isBinaryOp(Opcode));
if (isa<ConstantExpr>(LHS) || isa<ConstantExpr>(RHS))		if (isa<ConstantExpr>(LHS) || isa<ConstantExpr>(RHS))
if (Constant *C = SymbolicallyEvaluateBinop(Opcode, LHS, RHS, DL))		if (Constant *C = SymbolicallyEvaluateBinop(Opcode, LHS, RHS, DL))
return C;		return C;

return ConstantExpr::get(Opcode, LHS, RHS);		return ConstantExpr::get(Opcode, LHS, RHS);
}		}

		// Check whether a constant is a floating point denormal that should be flushed
		// to zero according to the denormal handling mode set in the function
		// attributes. If so, return a zero with the correct sign, otherwise return the
		// original constant. Inputs and outputs to floating point instructions can have
		// their mode set separately, so the direction is also needed.
		spatel (Not Done): typo: separately
		Constant *FlushFPConstant(Constant *Operand, const llvm::Function *F,
		bool IsOutput) {
		if (F == nullptr)
		return Operand;
		if (auto *CFP = dyn_cast<ConstantFP>(Operand)) {
		const APFloat &APF = CFP->getValueAPF();
		Type *Ty = CFP->getType();
		DenormalMode DenormMode = F->getDenormalMode(Ty->getFltSemantics());
		DenormalMode::DenormalModeKind Mode =
		arsenm (Done): Don't need llvm::
		IsOutput ? DenormMode.Output : DenormMode.Input;
		switch (Mode) {
		default:
		llvm_unreachable("unknown denormal mode");
		return Operand;
		case DenormalMode::IEEE:
		return Operand;
		case DenormalMode::PreserveSign:
		if (APF.isDenormal()) {
		return ConstantFP::get(
		Ty->getContext(),
		APFloat::getZero(Ty->getFltSemantics(), APF.isNegative()));
		}
		return Operand;
		case DenormalMode::PositiveZero:
		if (APF.isDenormal()) {
		return ConstantFP::get(Ty->getContext(),
		APFloat::getZero(Ty->getFltSemantics(), false));
		}
		return Operand;
		}
		}
		return Operand;
		}

		Constant *llvm::ConstantFoldFPInstOperands(unsigned Opcode, Constant *LHS,
		Constant *RHS, const DataLayout &DL,
		const Instruction *I) {
		if (auto *BB = I->getParent()) {
		if (auto *F = BB->getParent()) {
		if (Instruction::isBinaryOp(Opcode)) {
		Constant *Op0 = FlushFPConstant(LHS, F, false);
		Constant *Op1 = FlushFPConstant(RHS, F, false);
		Constant *C = ConstantFoldBinaryOpOperands(Opcode, Op0, Op1, DL);
		return FlushFPConstant(C, F, true);
		}
		}
		}
		// If instruction lacks a parent/function and the denormal mode cannot be
		spatel (Not Done): typo: instruction
		// determined, use the default (IEEE).
		return ConstantFoldBinaryOpOperands(Opcode, LHS, RHS, DL);
		}

Constant *llvm::ConstantFoldCastOperand(unsigned Opcode, Constant *C,		Constant *llvm::ConstantFoldCastOperand(unsigned Opcode, Constant *C,
Type *DestTy, const DataLayout &DL) {		Type *DestTy, const DataLayout &DL) {
assert(Instruction::isCast(Opcode));		assert(Instruction::isCast(Opcode));
switch (Opcode) {		switch (Opcode) {
default:		default:
llvm_unreachable("Missing case");		llvm_unreachable("Missing case");
case Instruction::PtrToInt:		case Instruction::PtrToInt:
if (auto *CE = dyn_cast<ConstantExpr>(C)) {		if (auto *CE = dyn_cast<ConstantExpr>(C)) {
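
The flush-then-fold-then-flush order used by ConstantFoldFPInstOperands above can be illustrated outside LLVM. The following is only a minimal sketch on host `float` values, not the patch's code: the `Mode` enum, `flushDenormal`, `foldFAdd` and `demoFold` are hypothetical names, and the sketch assumes the host itself evaluates `float` arithmetic in IEEE mode (no FTZ/DAZ, no fast-math).

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-in for the three denormal modes handled by the patch.
enum class Mode { IEEE, PreserveSign, PositiveZero };

// Replace a denormal with a zero chosen by the mode; pass other values through.
float flushDenormal(float V, Mode M) {
  if (M == Mode::IEEE || std::fpclassify(V) != FP_SUBNORMAL)
    return V;
  if (M == Mode::PreserveSign)
    return std::copysign(0.0f, V); // zero that keeps the sign of V
  return 0.0f;                     // PositiveZero: always +0.0
}

// Flush both inputs with the input mode, add, then flush the result with the
// output mode. This mirrors the order of operations in the patch for fadd.
float foldFAdd(float L, float R, Mode In, Mode Out) {
  float Sum = flushDenormal(L, In) + flushDenormal(R, In);
  return flushDenormal(Sum, Out);
}

// Usage example based on the fadd tests: -(smallest normal) + denormal.
void demoFold() {
  const float MinNormal = std::ldexp(1.0f, -126); // smallest normal float
  const float Denormal = std::ldexp(1.0f, -127);  // a float denormal

  // IEEE in and out: the denormal result is kept.
  assert(foldFAdd(-MinNormal, Denormal, Mode::IEEE, Mode::IEEE) == -Denormal);

  // Output flushed to positive zero.
  float PZ = foldFAdd(-MinNormal, Denormal, Mode::IEEE, Mode::PositiveZero);
  assert(PZ == 0.0f && !std::signbit(PZ));

  // Output flushed to a sign-preserving zero.
  float PS = foldFAdd(-MinNormal, Denormal, Mode::IEEE, Mode::PreserveSign);
  assert(PS == 0.0f && std::signbit(PS));

  // Denormal input treated as zero: the normal operand comes back unchanged.
  assert(foldFAdd(-MinNormal, Denormal, Mode::PositiveZero, Mode::IEEE) ==
         -MinNormal);
}
```

Keeping the input and output modes as separate parameters reflects the IsOutput flag in FlushFPConstant, which selects between the Input and Output fields of the function's denormal mode.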

llvm/lib/Analysis/InstructionSimplify.cpp

static Value *threadCmpOverPHI(CmpInst::Predicate Pred, Value *LHS, Value *RHS,

return CommonValue;		return CommonValue;
}		}

static Constant *foldOrCommuteConstant(Instruction::BinaryOps Opcode,		static Constant *foldOrCommuteConstant(Instruction::BinaryOps Opcode,
Value *&Op0, Value *&Op1,		Value *&Op0, Value *&Op1,
const SimplifyQuery &Q) {		const SimplifyQuery &Q) {
if (auto *CLHS = dyn_cast<Constant>(Op0)) {		if (auto *CLHS = dyn_cast<Constant>(Op0)) {
if (auto *CRHS = dyn_cast<Constant>(Op1))		if (auto *CRHS = dyn_cast<Constant>(Op1)) {
		switch (Opcode) {
		default:
		break;
		case Instruction::FAdd:
		case Instruction::FSub:
		case Instruction::FMul:
		case Instruction::FDiv:
		case Instruction::FRem:
		if (Q.CxtI != nullptr)
		efriedma (Not Done): Maybe we should just return early if CxtI is null, instead of falling back to ConstantFoldBinaryOpOperands?
		dcandler (Author, Done): Returning early here would change the result for existing cases where there is no instruction pointer, so constant expressions won't get folded where they would before.
		return ConstantFoldFPInstOperands(Opcode, CLHS, CRHS, Q.DL, Q.CxtI);
		}
return ConstantFoldBinaryOpOperands(Opcode, CLHS, CRHS, Q.DL);		return ConstantFoldBinaryOpOperands(Opcode, CLHS, CRHS, Q.DL);
		}

// Canonicalize the constant to the RHS if this is a commutative operation.		// Canonicalize the constant to the RHS if this is a commutative operation.
if (Instruction::isCommutative(Opcode))		if (Instruction::isCommutative(Opcode))
std::swap(Op0, Op1);		std::swap(Op0, Op1);
}		}
return nullptr;		return nullptr;
}		}

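
The tests in constant-fold-fp-denormal.ll write `float` constants as 16-digit hexadecimal values, which LLVM IR interprets as the bit pattern of a double. A short sketch can confirm which operands are denormal at `float` precision and what the IEEE result of the fadd tests should be. The helper names `floatFromIRHex`, `isFloatDenormal` and `demoDecode` are hypothetical, and the sketch assumes the host uses IEEE-754 binary64 for `double` and does not flush denormals.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// Reinterpret a 64-bit pattern as a double, then narrow it to float.
// Assumption: double is IEEE-754 binary64 on the host.
float floatFromIRHex(std::uint64_t Bits) {
  double D;
  std::memcpy(&D, &Bits, sizeof D);
  return static_cast<float>(D);
}

// True if the constant is a denormal once it is viewed as a float.
bool isFloatDenormal(std::uint64_t Bits) {
  return std::fpclassify(floatFromIRHex(Bits)) == FP_SUBNORMAL;
}

void demoDecode() {
  // 0x3810000000000000 is 2^-126, the smallest normal float.
  assert(floatFromIRHex(0x3810000000000000ULL) == std::ldexp(1.0f, -126));
  assert(!isFloatDenormal(0x3810000000000000ULL));

  // 0x3800000000000000 is 2^-127, which is denormal as a float.
  assert(isFloatDenormal(0x3800000000000000ULL));

  // fadd float 0xB810000000000000, 0x3800000000000000 in IEEE mode:
  // -2^-126 + 2^-127 = -2^-127, i.e. 0xB800000000000000.
  float Sum = floatFromIRHex(0xB810000000000000ULL) +
              floatFromIRHex(0x3800000000000000ULL);
  assert(Sum == floatFromIRHex(0xB800000000000000ULL));
  assert(isFloatDenormal(0xB800000000000000ULL));
}
```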

llvm/test/Transforms/InstSimplify/constant-fold-fp-denormal.ll

	Show All 15 Lines
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fadd float 0xB810000000000000, 0x3800000000000000			%result = fadd float 0xB810000000000000, 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fadd_pzero_out() #1 {			define float @test_float_fadd_pzero_out() #1 {
	; CHECK-LABEL: @test_float_fadd_pzero_out(			; CHECK-LABEL: @test_float_fadd_pzero_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = fadd float 0xB810000000000000, 0x3800000000000000			%result = fadd float 0xB810000000000000, 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fadd_psign_out() #2 {			define float @test_float_fadd_psign_out() #2 {
	; CHECK-LABEL: @test_float_fadd_psign_out(			; CHECK-LABEL: @test_float_fadd_psign_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = fadd float 0xB810000000000000, 0x3800000000000000			%result = fadd float 0xB810000000000000, 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fadd_pzero_in() #3 {			define float @test_float_fadd_pzero_in() #3 {
	; CHECK-LABEL: @test_float_fadd_pzero_in(			; CHECK-LABEL: @test_float_fadd_pzero_in(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB810000000000000
	; default ieee mode leaves result as a denormal			; denormal operand is treated as zero
				; normal operand added to zero results in the same operand as a result
	%result = fadd float 0xB810000000000000, 0x3800000000000000			%result = fadd float 0xB810000000000000, 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fadd_psign_in() #4 {			define float @test_float_fadd_psign_in() #4 {
	; CHECK-LABEL: @test_float_fadd_psign_in(			; CHECK-LABEL: @test_float_fadd_psign_in(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB810000000000000
	; default ieee mode leaves result as a denormal			; denormal operand is treated as zero
				; normal operand added to zero results in the same operand as a result
	%result = fadd float 0xB810000000000000, 0x3800000000000000			%result = fadd float 0xB810000000000000, 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fadd_pzero_f32_out() #5 {			define float @test_float_fadd_pzero_f32_out() #5 {
	; CHECK-LABEL: @test_float_fadd_pzero_f32_out(			; CHECK-LABEL: @test_float_fadd_pzero_f32_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
				; f32 only attribute should flush float output
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fadd float 0xB810000000000000, 0x3800000000000000			%result = fadd float 0xB810000000000000, 0x3800000000000000
	ret float %result			ret float %result
	}			}

	define double @test_double_fadd_ieee() #0 {			define double @test_double_fadd_ieee() #0 {
	; CHECK-LABEL: @test_double_fadd_ieee(			; CHECK-LABEL: @test_double_fadd_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fadd double 0x8010000000000000, 0x8000000000000			%result = fadd double 0x8010000000000000, 0x8000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fadd_pzero_out() #1 {			define double @test_double_fadd_pzero_out() #1 {
	; CHECK-LABEL: @test_double_fadd_pzero_out(			; CHECK-LABEL: @test_double_fadd_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = fadd double 0x8010000000000000, 0x8000000000000			%result = fadd double 0x8010000000000000, 0x8000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fadd_psign_out() #2 {			define double @test_double_fadd_psign_out() #2 {
	; CHECK-LABEL: @test_double_fadd_psign_out(			; CHECK-LABEL: @test_double_fadd_psign_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = fadd double 0x8010000000000000, 0x8000000000000			%result = fadd double 0x8010000000000000, 0x8000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fadd_pzero_in() #3 {			define double @test_double_fadd_pzero_in() #3 {
	; CHECK-LABEL: @test_double_fadd_pzero_in(			; CHECK-LABEL: @test_double_fadd_pzero_in(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8010000000000000
	; default ieee mode leaves result as a denormal			; denormal operand is treated as zero
				; normal operand added to zero results in the same operand as a result
	%result = fadd double 0x8010000000000000, 0x8000000000000			%result = fadd double 0x8010000000000000, 0x8000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fadd_psign_in() #4 {			define double @test_double_fadd_psign_in() #4 {
	; CHECK-LABEL: @test_double_fadd_psign_in(			; CHECK-LABEL: @test_double_fadd_psign_in(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8010000000000000
	; default ieee mode leaves result as a denormal			; denormal operand is treated as zero
				; normal operand added to zero results in the same operand as a result
	%result = fadd double 0x8010000000000000, 0x8000000000000			%result = fadd double 0x8010000000000000, 0x8000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fadd_f32_ieee() #5 {			define double @test_double_fadd_f32_ieee() #5 {
	; CHECK-LABEL: @test_double_fadd_f32_ieee(			; CHECK-LABEL: @test_double_fadd_f32_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
				; f32 only attribute should not flush doubles
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fadd double 0x8010000000000000, 0x8000000000000			%result = fadd double 0x8010000000000000, 0x8000000000000
	ret double %result			ret double %result
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	; fsub tests			; fsub tests
	; Normal operand subtracted from denormal operand produces denormal result			; Normal operand subtracted from denormal operand produces denormal result
	; If denormal outputs should be flushed to zero, the result should be zero.			; If denormal outputs should be flushed to zero, the result should be zero.
	; If denormal inputs should be treated as zero, the result should be the			; If denormal inputs should be treated as zero, the result should be the
	; negated normal operand (zero minus the original operand).			; negated normal operand (zero minus the original operand).
	; ============================================================================ ;			; ============================================================================ ;

	define float @test_float_fsub_ieee() #0 {			define float @test_float_fsub_ieee() #0 {
	; CHECK-LABEL: @test_float_fsub_ieee(			; CHECK-LABEL: @test_float_fsub_ieee(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fsub float 0x3800000000000000, 0x3810000000000000			%result = fsub float 0x3800000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fsub_pzero_out() #1 {			define float @test_float_fsub_pzero_out() #1 {
	; CHECK-LABEL: @test_float_fsub_pzero_out(			; CHECK-LABEL: @test_float_fsub_pzero_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = fsub float 0x3800000000000000, 0x3810000000000000			%result = fsub float 0x3800000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fsub_psign_out() #2 {			define float @test_float_fsub_psign_out() #2 {
	; CHECK-LABEL: @test_float_fsub_psign_out(			; CHECK-LABEL: @test_float_fsub_psign_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = fsub float 0x3800000000000000, 0x3810000000000000			%result = fsub float 0x3800000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fsub_pzero_in() #3 {			define float @test_float_fsub_pzero_in() #3 {
	; CHECK-LABEL: @test_float_fsub_pzero_in(			; CHECK-LABEL: @test_float_fsub_pzero_in(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB810000000000000
	; default ieee mode leaves result as a denormal			; denormal operand is treated as zero
				; normal operand subtracted from zero produces the same operand, negated
	%result = fsub float 0x3800000000000000, 0x3810000000000000			%result = fsub float 0x3800000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fsub_psign_in() #4 {			define float @test_float_fsub_psign_in() #4 {
	; CHECK-LABEL: @test_float_fsub_psign_in(			; CHECK-LABEL: @test_float_fsub_psign_in(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB810000000000000
	; default ieee mode leaves result as a denormal			; denormal operand is treated as zero
				; normal operand subtracted from zero produces the same operand, negated
	%result = fsub float 0x3800000000000000, 0x3810000000000000			%result = fsub float 0x3800000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_fsub_pzero_f32_out() #5 {			define float @test_float_fsub_pzero_f32_out() #5 {
	; CHECK-LABEL: @test_float_fsub_pzero_f32_out(			; CHECK-LABEL: @test_float_fsub_pzero_f32_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a denormal			; f32 only attribute should flush float output
				; same as pzero_out above
	%result = fsub float 0x3800000000000000, 0x3810000000000000			%result = fsub float 0x3800000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define double @test_double_fsub_ieee() #0 {			define double @test_double_fsub_ieee() #0 {
	; CHECK-LABEL: @test_double_fsub_ieee(			; CHECK-LABEL: @test_double_fsub_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fsub double 0x8000000000000, 0x10000000000000			%result = fsub double 0x8000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fsub_pzero_out() #1 {			define double @test_double_fsub_pzero_out() #1 {
	; CHECK-LABEL: @test_double_fsub_pzero_out(			; CHECK-LABEL: @test_double_fsub_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = fsub double 0x8000000000000, 0x10000000000000			%result = fsub double 0x8000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fsub_psign_out() #2 {			define double @test_double_fsub_psign_out() #2 {
	; CHECK-LABEL: @test_double_fsub_psign_out(			; CHECK-LABEL: @test_double_fsub_psign_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = fsub double 0x8000000000000, 0x10000000000000			%result = fsub double 0x8000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fsub_pzero_in() #3 {			define double @test_double_fsub_pzero_in() #3 {
	; CHECK-LABEL: @test_double_fsub_pzero_in(			; CHECK-LABEL: @test_double_fsub_pzero_in(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8010000000000000
	; default ieee mode leaves result as a denormal			; denormal operand is treated as zero
				; normal operand subtracted from zero produces the same operand, negated
	%result = fsub double 0x8000000000000, 0x10000000000000			%result = fsub double 0x8000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fsub_psign_in() #4 {			define double @test_double_fsub_psign_in() #4 {
	; CHECK-LABEL: @test_double_fsub_psign_in(			; CHECK-LABEL: @test_double_fsub_psign_in(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8010000000000000
	; default ieee mode leaves result as a denormal			; denormal operand is treated as zero
				; normal operand subtracted from zero produces the same operand, negated
	%result = fsub double 0x8000000000000, 0x10000000000000			%result = fsub double 0x8000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_fsub_f32_ieee() #5 {			define double @test_double_fsub_f32_ieee() #5 {
	; CHECK-LABEL: @test_double_fsub_f32_ieee(			; CHECK-LABEL: @test_double_fsub_f32_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
				; f32 only attribute should not flush doubles
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fsub double 0x8000000000000, 0x10000000000000			%result = fsub double 0x8000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	; fmul tests			; fmul tests
	; Output modes are tested by multiplying the smallest normal number by 0.5,			; Output modes are tested by multiplying the smallest normal number by 0.5,
	Show All 9 Lines
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fmul float 0x3810000000000000, -5.000000e-01			%result = fmul float 0x3810000000000000, -5.000000e-01
	ret float %result			ret float %result
	}			}

	define float @test_float_fmul_pzero_out() #1 {			define float @test_float_fmul_pzero_out() #1 {
	; CHECK-LABEL: @test_float_fmul_pzero_out(			; CHECK-LABEL: @test_float_fmul_pzero_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = fmul float 0x3810000000000000, -5.000000e-01			%result = fmul float 0x3810000000000000, -5.000000e-01
	ret float %result			ret float %result
	}			}

	define float @test_float_fmul_psign_out() #2 {			define float @test_float_fmul_psign_out() #2 {
	; CHECK-LABEL: @test_float_fmul_psign_out(			; CHECK-LABEL: @test_float_fmul_psign_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = fmul float 0x3810000000000000, -5.000000e-01			%result = fmul float 0x3810000000000000, -5.000000e-01
	ret float %result			ret float %result
	}			}

	define float @test_float_fmul_pzero_in() #3 {			define float @test_float_fmul_pzero_in() #3 {
	; CHECK-LABEL: @test_float_fmul_pzero_in(			; CHECK-LABEL: @test_float_fmul_pzero_in(
	; CHECK-NEXT: ret float 0xB810000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a normal			; denormal operand is treated as positive zero
				; anything multiplied by zero gives a zero result
	%result = fmul float 0xB800000000000000, 2.000000e-00			%result = fmul float 0xB800000000000000, 2.000000e-00
	ret float %result			ret float %result
	}			}

	define float @test_float_fmul_psign_in() #4 {			define float @test_float_fmul_psign_in() #4 {
	; CHECK-LABEL: @test_float_fmul_psign_in(			; CHECK-LABEL: @test_float_fmul_psign_in(
	; CHECK-NEXT: ret float 0xB810000000000000			; CHECK-NEXT: ret float -0.000000e+00
	; default ieee mode leaves result as a normal			; denormal operand is treated as signed zero
				; anything multiplied by zero gives a zero result
	%result = fmul float 0xB800000000000000, 2.000000e-00			%result = fmul float 0xB800000000000000, 2.000000e-00
	ret float %result			ret float %result
	}			}

	define float @test_float_fmul_pzero_f32_out() #1 {			define float @test_float_fmul_pzero_f32_out() #1 {
	; CHECK-LABEL: @test_float_fmul_pzero_f32_out(			; CHECK-LABEL: @test_float_fmul_pzero_f32_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a denormal			; f32 only attribute should flush float output
				; same as pzero_out above
	%result = fmul float 0x3810000000000000, -5.000000e-01			%result = fmul float 0x3810000000000000, -5.000000e-01
	ret float %result			ret float %result
	}			}

	define double @test_double_fmul_ieee() #0 {			define double @test_double_fmul_ieee() #0 {
	; CHECK-LABEL: @test_double_fmul_ieee(			; CHECK-LABEL: @test_double_fmul_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fmul double 0x10000000000000, -5.000000e-01			%result = fmul double 0x10000000000000, -5.000000e-01
	ret double %result			ret double %result
	}			}

	define double @test_double_fmul_pzero_out() #1 {			define double @test_double_fmul_pzero_out() #1 {
	; CHECK-LABEL: @test_double_fmul_pzero_out(			; CHECK-LABEL: @test_double_fmul_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = fmul double 0x10000000000000, -5.000000e-01			%result = fmul double 0x10000000000000, -5.000000e-01
	ret double %result			ret double %result
	}			}

	define double @test_double_fmul_psign_out() #2 {			define double @test_double_fmul_psign_out() #2 {
	; CHECK-LABEL: @test_double_fmul_psign_out(			; CHECK-LABEL: @test_double_fmul_psign_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = fmul double 0x10000000000000, -5.000000e-01			%result = fmul double 0x10000000000000, -5.000000e-01
	ret double %result			ret double %result
	}			}

	define double @test_double_fmul_pzero_in() #3 {			define double @test_double_fmul_pzero_in() #3 {
	; CHECK-LABEL: @test_double_fmul_pzero_in(			; CHECK-LABEL: @test_double_fmul_pzero_in(
	; CHECK-NEXT: ret double 0x8010000000000000			; CHECK-NEXT: ret double 0.000000e+00
	; default ieee mode leaves result as a normal			; denormal operand is treated as positive zero
				; anything multiplied by zero gives a zero result
	%result = fmul double 0x8008000000000000, 2.000000e-00			%result = fmul double 0x8008000000000000, 2.000000e-00
	ret double %result			ret double %result
	}			}

	define double @test_double_fmul_psign_in() #4 {			define double @test_double_fmul_psign_in() #4 {
	; CHECK-LABEL: @test_double_fmul_psign_in(			; CHECK-LABEL: @test_double_fmul_psign_in(
	; CHECK-NEXT: ret double 0x8010000000000000			; CHECK-NEXT: ret double -0.000000e+00
	; default ieee mode leaves result as a normal			; denormal operand is treated as signed zero
				; anything multiplied by zero gives a zero result
	%result = fmul double 0x8008000000000000, 2.000000e-00			%result = fmul double 0x8008000000000000, 2.000000e-00
	ret double %result			ret double %result
	}			}

	define double @test_double_fmul_f32_ieee() #5 {			define double @test_double_fmul_f32_ieee() #5 {
	; CHECK-LABEL: @test_double_fmul_f32_ieee(			; CHECK-LABEL: @test_double_fmul_f32_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
				; f32 only attribute should not flush doubles
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fmul double 0x10000000000000, -5.000000e-01			%result = fmul double 0x10000000000000, -5.000000e-01
	ret double %result			ret double %result
	}			}

	; ============================================================================ ;			; ============================================================================ ;
	; fdiv tests			; fdiv tests
	; Output modes are tested by dividing the smallest normal number by 2,			; Output modes are tested by dividing the smallest normal number by 2,
	Show All 9 Lines
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fdiv float 0x3810000000000000, -2.000000e-00			%result = fdiv float 0x3810000000000000, -2.000000e-00
	ret float %result			ret float %result
	}			}

	define float @test_float_fdiv_pzero_out() #1 {			define float @test_float_fdiv_pzero_out() #1 {
	; CHECK-LABEL: @test_float_fdiv_pzero_out(			; CHECK-LABEL: @test_float_fdiv_pzero_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = fdiv float 0x3810000000000000, -2.000000e-00			%result = fdiv float 0x3810000000000000, -2.000000e-00
	ret float %result			ret float %result
	}			}

	define float @test_float_fdiv_psign_out() #2 {			define float @test_float_fdiv_psign_out() #2 {
	; CHECK-LABEL: @test_float_fdiv_psign_out(			; CHECK-LABEL: @test_float_fdiv_psign_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = fdiv float 0x3810000000000000, -2.000000e-00			%result = fdiv float 0x3810000000000000, -2.000000e-00
	ret float %result			ret float %result
	}			}

	define float @test_float_fdiv_pzero_in() #3 {			define float @test_float_fdiv_pzero_in() #3 {
	; CHECK-LABEL: @test_float_fdiv_pzero_in(			; CHECK-LABEL: @test_float_fdiv_pzero_in(
	; CHECK-NEXT: ret float 0xB810000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a normal			; denormal operand is treated as zero
				; zero divided by anything gives a zero result
	%result = fdiv float 0xB800000000000000, 5.000000e-01			%result = fdiv float 0xB800000000000000, 5.000000e-01
	ret float %result			ret float %result
	}			}

	define float @test_float_fdiv_psign_in() #4 {			define float @test_float_fdiv_psign_in() #4 {
	; CHECK-LABEL: @test_float_fdiv_psign_in(			; CHECK-LABEL: @test_float_fdiv_psign_in(
	; CHECK-NEXT: ret float 0xB7F0000000000000			; CHECK-NEXT: ret float -0.000000e+00
	; default ieee mode leaves result as a normal			; denormal operand is treated as zero
				; zero divided by anything gives a zero result
	%result = fmul float 0xB800000000000000, 5.000000e-01			%result = fmul float 0xB800000000000000, 5.000000e-01
	ret float %result			ret float %result
	}			}

	define float @test_float_fdiv_pzero_f32_out() #1 {			define float @test_float_fdiv_pzero_f32_out() #1 {
	; CHECK-LABEL: @test_float_fdiv_pzero_f32_out(			; CHECK-LABEL: @test_float_fdiv_pzero_f32_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a denormal			; f32 only attribute should flush float output
				; same as pzero_out above
	%result = fdiv float 0x3810000000000000, -2.000000e-00			%result = fdiv float 0x3810000000000000, -2.000000e-00
	ret float %result			ret float %result
	}			}

	define double @test_double_fdiv_ieee() #0 {			define double @test_double_fdiv_ieee() #0 {
	; CHECK-LABEL: @test_double_fdiv_ieee(			; CHECK-LABEL: @test_double_fdiv_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fdiv double 0x10000000000000, -2.000000e-00			%result = fdiv double 0x10000000000000, -2.000000e-00
	ret double %result			ret double %result
	}			}

	define double @test_double_fdiv_pzero_out() #1 {			define double @test_double_fdiv_pzero_out() #1 {
	; CHECK-LABEL: @test_double_fdiv_pzero_out(			; CHECK-LABEL: @test_double_fdiv_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = fdiv double 0x10000000000000, -2.000000e-00			%result = fdiv double 0x10000000000000, -2.000000e-00
	ret double %result			ret double %result
	}			}

	define double @test_double_fdiv_psign_out() #2 {			define double @test_double_fdiv_psign_out() #2 {
	; CHECK-LABEL: @test_double_fdiv_psign_out(			; CHECK-LABEL: @test_double_fdiv_psign_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = fdiv double 0x10000000000000, -2.000000e-00			%result = fdiv double 0x10000000000000, -2.000000e-00
	ret double %result			ret double %result
	}			}

	define double @test_double_fdiv_pzero_in() #3 {			define double @test_double_fdiv_pzero_in() #3 {
	; CHECK-LABEL: @test_double_fdiv_pzero_in(			; CHECK-LABEL: @test_double_fdiv_pzero_in(
	; CHECK-NEXT: ret double 0x8010000000000000			; CHECK-NEXT: ret double 0.000000e+00
	; default ieee mode leaves result as a normal			; denormal operand is treated as zero
				; zero divided by anything gives a zero result
	%result = fdiv double 0x8008000000000000, 5.000000e-01			%result = fdiv double 0x8008000000000000, 5.000000e-01
	ret double %result			ret double %result
	}			}

	define double @test_double_fdiv_psign_in() #4 {			define double @test_double_fdiv_psign_in() #4 {
	; CHECK-LABEL: @test_double_fdiv_psign_in(			; CHECK-LABEL: @test_double_fdiv_psign_in(
	; CHECK-NEXT: ret double 0x8010000000000000			; CHECK-NEXT: ret double -0.000000e+00
	; default ieee mode leaves result as a normal			; denormal operand is treated as zero
				; zero divided by anything gives a zero result
	%result = fdiv double 0x8008000000000000, 5.000000e-01			%result = fdiv double 0x8008000000000000, 5.000000e-01
	ret double %result			ret double %result
	}			}

	define double @test_double_fdiv_f32_ieee() #5 {			define double @test_double_fdiv_f32_ieee() #5 {
	; CHECK-LABEL: @test_double_fdiv_f32_ieee(			; CHECK-LABEL: @test_double_fdiv_f32_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
				; f32 only attribute should not flush doubles
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = fdiv double 0x10000000000000, -2.000000e-00			%result = fdiv double 0x10000000000000, -2.000000e-00
	ret double %result			ret double %result
	}			}
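
The expected constants in the double fdiv tests above can be cross-checked outside LLVM. The following is only a rough model, not the code in ConstantFolding.cpp: it assumes the two halves of the denormal-fp-math attribute ("output,input") can be modelled as a flush applied to the operands before the division and to the result after it. The helper names (`bits`, `from_bits`, `flush`, `fold_fdiv`) are hypothetical, and the mapping of attribute groups #1 to #4 onto modes is inferred from the test names, since the attribute definitions are not shown here.

```python
import math
import struct

MIN_NORMAL = 2.0 ** -1022  # smallest positive normal double


def bits(x: float) -> int:
    """Return the IEEE-754 bit pattern of a double."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]


def from_bits(b: int) -> float:
    """Build a double from its IEEE-754 bit pattern."""
    return struct.unpack("<d", struct.pack("<Q", b))[0]


def is_denormal(x: float) -> bool:
    return x != 0.0 and abs(x) < MIN_NORMAL


def flush(x: float, mode: str) -> float:
    """Apply one denormal mode ("ieee", "positive-zero", "preserve-sign")."""
    if mode == "ieee" or not is_denormal(x):
        return x
    if mode == "positive-zero":
        return 0.0
    if mode == "preserve-sign":
        return math.copysign(0.0, x)
    raise ValueError(f"unknown denormal mode: {mode}")


def fold_fdiv(a: float, b: float, out_mode: str = "ieee", in_mode: str = "ieee") -> float:
    """Model of folding fdiv: flush inputs, divide, flush the output.

    Limitation of this sketch: Python raises ZeroDivisionError when the
    divisor is (or is flushed to) zero, where IEEE division gives inf or NaN.
    """
    a, b = flush(a, in_mode), flush(b, in_mode)
    return flush(a / b, out_mode)


smallest_normal = from_bits(0x0010000000000000)
neg_denormal = from_bits(0x8008000000000000)

# Output modes: normal / -2.0 produces a negative denormal.
print(hex(bits(fold_fdiv(smallest_normal, -2.0))))                            # 0x8008000000000000
print(fold_fdiv(smallest_normal, -2.0, out_mode="positive-zero"))             # 0.0
print(fold_fdiv(smallest_normal, -2.0, out_mode="preserve-sign"))             # -0.0

# Input modes: the denormal operand is replaced by a zero before dividing.
print(hex(bits(fold_fdiv(neg_denormal, 0.5))))                                # 0x8010000000000000
print(fold_fdiv(neg_denormal, 0.5, in_mode="positive-zero"))                  # 0.0
print(fold_fdiv(neg_denormal, 0.5, in_mode="preserve-sign"))                  # -0.0
```

Only the double cases are modelled because Python floats are doubles; the float tests would additionally need rounding to single precision.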

	; ============================================================================ ;			; ============================================================================ ;
	; frem tests			; frem tests
	; Output modes are tested by using two small normal numbers to produce a			; Output modes are tested by using two small normal numbers to produce a
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0xB800000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = frem float 0xB818000000000000, 0x3810000000000000			%result = frem float 0xB818000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_frem_pzero_out() #1 {			define float @test_float_frem_pzero_out() #1 {
	; CHECK-LABEL: @test_float_frem_pzero_out(			; CHECK-LABEL: @test_float_frem_pzero_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = frem float 0xB818000000000000, 0x3810000000000000			%result = frem float 0xB818000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_frem_psign_out() #2 {			define float @test_float_frem_psign_out() #2 {
	; CHECK-LABEL: @test_float_frem_psign_out(			; CHECK-LABEL: @test_float_frem_psign_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = frem float 0xB818000000000000, 0x3810000000000000			%result = frem float 0xB818000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define float @test_float_frem_ieee_in() #0 {			define float @test_float_frem_ieee_in() #0 {
	; CHECK-LABEL: @test_float_frem_ieee_in(			; CHECK-LABEL: @test_float_frem_ieee_in(
	; CHECK-NEXT: ret float 0x3800000000000000			; CHECK-NEXT: ret float 0x3800000000000000
	; default ieee mode leaves result same as input			; default ieee mode leaves result same as input
	%result = frem float 0x3800000000000000, 2.000000e+00			%result = frem float 0x3800000000000000, 2.000000e+00
	ret float %result			ret float %result
	}			}

	define float @test_float_frem_pzero_in() #3 {			define float @test_float_frem_pzero_in() #3 {
	; CHECK-LABEL: @test_float_frem_pzero_in(			; CHECK-LABEL: @test_float_frem_pzero_in(
	; CHECK-NEXT: ret float 0x3800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result same as input			; denormal operand is treated as zero
				; remainder is now zero
	%result = frem float 0x3800000000000000, 2.000000e+00			%result = frem float 0x3800000000000000, 2.000000e+00
	ret float %result			ret float %result
	}			}

	define float @test_float_frem_psign_in() #4 {			define float @test_float_frem_psign_in() #4 {
	; CHECK-LABEL: @test_float_frem_psign_in(			; CHECK-LABEL: @test_float_frem_psign_in(
	; CHECK-NEXT: ret float 0x3800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result same as input			; denormal operand is treated as zero
				; remainder is now zero
	%result = frem float 0x3800000000000000, 2.000000e+00			%result = frem float 0x3800000000000000, 2.000000e+00
	ret float %result			ret float %result
	}			}

	define float @test_float_frem_pzero_f32_out() #1 {			define float @test_float_frem_pzero_f32_out() #1 {
	; CHECK-LABEL: @test_float_frem_pzero_f32_out(			; CHECK-LABEL: @test_float_frem_pzero_f32_out(
	; CHECK-NEXT: ret float 0xB800000000000000			; CHECK-NEXT: ret float 0.000000e+00
	; default ieee mode leaves result as a denormal			; f32 only attribute should flush float output
				; same as pzero_out above
	%result = frem float 0xB818000000000000, 0x3810000000000000			%result = frem float 0xB818000000000000, 0x3810000000000000
	ret float %result			ret float %result
	}			}

	define double @test_double_frem_ieee_out() #0 {			define double @test_double_frem_ieee_out() #0 {
	; CHECK-LABEL: @test_double_frem_ieee_out(			; CHECK-LABEL: @test_double_frem_ieee_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = frem double 0x8018000000000000, 0x10000000000000			%result = frem double 0x8018000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_frem_pzero_out() #1 {			define double @test_double_frem_pzero_out() #1 {
	; CHECK-LABEL: @test_double_frem_pzero_out(			; CHECK-LABEL: @test_double_frem_pzero_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to positive zero
	%result = frem double 0x8018000000000000, 0x10000000000000			%result = frem double 0x8018000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_frem_psign_out() #2 {			define double @test_double_frem_psign_out() #2 {
	; CHECK-LABEL: @test_double_frem_psign_out(			; CHECK-LABEL: @test_double_frem_psign_out(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double -0.000000e+00
	; default ieee mode leaves result as a denormal			; denormal result is flushed to sign preserved zero
	%result = frem double 0x8018000000000000, 0x10000000000000			%result = frem double 0x8018000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}

	define double @test_double_frem_ieee_in() #0 {			define double @test_double_frem_ieee_in() #0 {
	; CHECK-LABEL: @test_double_frem_ieee_in(			; CHECK-LABEL: @test_double_frem_ieee_in(
	; CHECK-NEXT: ret double 0x8000000000000			; CHECK-NEXT: ret double 0x8000000000000
	; default ieee mode leaves result same as input			; default ieee mode leaves result same as input
	%result = frem double 0x8000000000000, 2.000000e+00			%result = frem double 0x8000000000000, 2.000000e+00
	ret double %result			ret double %result
	}			}

	define double @test_double_frem_pzero_in() #3 {			define double @test_double_frem_pzero_in() #3 {
	; CHECK-LABEL: @test_double_frem_pzero_in(			; CHECK-LABEL: @test_double_frem_pzero_in(
	; CHECK-NEXT: ret double 0x8000000000000			; CHECK-NEXT: ret double 0.000000e+00
	; default ieee mode leaves result same as input			; denormal operand is treated as zero
				; remainder is now zero
	%result = frem double 0x8000000000000, 2.000000e+00			%result = frem double 0x8000000000000, 2.000000e+00
	ret double %result			ret double %result
	}			}

	define double @test_double_frem_psign_in() #4 {			define double @test_double_frem_psign_in() #4 {
	; CHECK-LABEL: @test_double_frem_psign_in(			; CHECK-LABEL: @test_double_frem_psign_in(
	; CHECK-NEXT: ret double 0x8000000000000			; CHECK-NEXT: ret double 0.000000e+00
	; default ieee mode leaves result same as input			; denormal operand is treated as zero
				; remainder is now zero
	%result = frem double 0x8000000000000, 2.000000e+00			%result = frem double 0x8000000000000, 2.000000e+00
	ret double %result			ret double %result
	}			}

	define double @test_double_frem_f32_ieee() #5 {			define double @test_double_frem_f32_ieee() #5 {
	; CHECK-LABEL: @test_double_frem_f32_ieee(			; CHECK-LABEL: @test_double_frem_f32_ieee(
	; CHECK-NEXT: ret double 0x8008000000000000			; CHECK-NEXT: ret double 0x8008000000000000
				; f32 only attribute should not flush doubles
	; default ieee mode leaves result as a denormal			; default ieee mode leaves result as a denormal
	%result = frem double 0x8018000000000000, 0x10000000000000			%result = frem double 0x8018000000000000, 0x10000000000000
	ret double %result			ret double %result
	}			}
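
The double frem expectations can be checked the same way. This is a sketch under two assumptions: that frem behaves like C `fmod` (the result takes the sign of the dividend), and that the denormal mode is a flush of the operands before and of the result after the operation. The names `f64`, `u64`, `flush` and `fold_frem` are made up for illustration and are not LLVM APIs.

```python
import math
import struct


def f64(b: int) -> float:
    """Build a double from its IEEE-754 bit pattern."""
    return struct.unpack("<d", struct.pack("<Q", b))[0]


def u64(x: float) -> int:
    """Return the IEEE-754 bit pattern of a double."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]


def flush(x: float, mode: str) -> float:
    """Replace a denormal by a zero according to the given mode."""
    denormal = x != 0.0 and abs(x) < 2.0 ** -1022
    if mode == "ieee" or not denormal:
        return x
    if mode == "positive-zero":
        return 0.0
    if mode == "preserve-sign":
        return math.copysign(0.0, x)
    raise ValueError(f"unknown denormal mode: {mode}")


def fold_frem(a: float, b: float, out_mode: str = "ieee", in_mode: str = "ieee") -> float:
    """Model of folding frem: flush inputs, take fmod, flush the output."""
    return flush(math.fmod(flush(a, in_mode), flush(b, in_mode)), out_mode)


# Output modes: -1.5 * 2^-1022 rem 2^-1022 leaves the denormal -2^-1023.
a, b = f64(0x8018000000000000), f64(0x0010000000000000)
print(hex(u64(fold_frem(a, b))))                      # 0x8008000000000000
print(fold_frem(a, b, out_mode="positive-zero"))      # 0.0
print(fold_frem(a, b, out_mode="preserve-sign"))      # -0.0

# Input modes: the positive denormal dividend becomes +0.0 in both modes,
# which is why the pzero_in and psign_in tests expect the same result.
d = f64(0x0008000000000000)
print(hex(u64(fold_frem(d, 2.0))))                    # 0x8000000000000
print(fold_frem(d, 2.0, in_mode="positive-zero"))     # 0.0
print(fold_frem(d, 2.0, in_mode="preserve-sign"))     # 0.0
```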

	; ============================================================================ ;			; ============================================================================ ;
	; fneg tests			; fneg tests
	; fneg should NOT be affected by denormal handling mode			; fneg should NOT be affected by denormal handling mode