This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Analysis/
-
Analysis/
-
InstructionSimplify.cpp
-
test/Transforms/InstSimplify/
-
Transforms/
-
InstSimplify/
1/1
fast-math.ll

Differential D43765

[InstSimplify] loosen FMF for sqrt(X) * sqrt(X) --> X
ClosedPublic

Authored by spatel on Feb 26 2018, 7:47 AM.

Download Raw Diff

Details

Reviewers

arsenm
wristow
codeman.consulting
efriedma
hfinkel
scanon

Commits

rG95ec4a4dfe46: [InstSimplify] loosen FMF for sqrt(X) * sqrt(X) --> X
rL327796: [InstSimplify] loosen FMF for sqrt(X) * sqrt(X) --> X

Summary

If my fast-math understanding is correct, 'reassoc' alone is not enough because that doesn't give us the freedom to get the negative number cases wrong.

I considered that we might not even need 'reassoc' here, but if we eliminate the sqrt calls, then we may differ in the last bit by eliminating the rounding that occurs in those calls.

Diff Detail

Event Timeline

spatel created this revision.Feb 26 2018, 7:47 AM

Herald added subscribers: wdng, mcrosier. · View Herald TranscriptFeb 26 2018, 7:47 AM

Ping.

IIRC Intrinsic::sqrt is undef for negative inputs (unlike the sqrt libcall), so we don't need FMF.noNaNs to license this transformation.

In D43765#1027289, @scanon wrote:

IIRC Intrinsic::sqrt is undef for negative inputs (unlike the sqrt libcall), so we don't need FMF.noNaNs to license this transformation.

Although IMO if we're fixing all of the math in LLVM I would like to fix this. At least do something like ctlz/cttz for whether negative is undef

In D43765#1027302, @arsenm wrote:

In D43765#1027289, @scanon wrote:

IIRC Intrinsic::sqrt is undef for negative inputs (unlike the sqrt libcall), so we don't need FMF.noNaNs to license this transformation.

Although IMO if we're fixing all of the math in LLVM I would like to fix this. At least do something like ctlz/cttz for whether negative is undef

Sqrt was fixed:
https://reviews.llvm.org/D28797

In D43765#1027330, @spatel wrote:

In D43765#1027302, @arsenm wrote:

In D43765#1027289, @scanon wrote:

IIRC Intrinsic::sqrt is undef for negative inputs (unlike the sqrt libcall), so we don't need FMF.noNaNs to license this transformation.

Although IMO if we're fixing all of the math in LLVM I would like to fix this. At least do something like ctlz/cttz for whether negative is undef

Sqrt was fixed:
https://reviews.llvm.org/D28797

Also, note that we convert sqrt and other libcalls to the LLVM intrinsics when possible in clang:
D39204 (and follow-up commits listed there)
...so we don't have to muddy the code looking for sqrt libcall patterns too. If errno can be set by a libcall, then no amount of FMF should override that; if errno can't be set by the libcall, it should have been converted to the LLVM intrinsic.

Ping * 2.

efriedma added inline comments.Mar 12 2018, 12:06 PM

test/Transforms/InstSimplify/fast-math.ll
219	Maybe add a test that we don't do this without the relevant fast-math flags?

Patch updated:
Include negative tests to show that we're only doing the transform when all 3 of the required FMF are present.

LGTM

This revision is now accepted and ready to land.Mar 15 2018, 6:42 PM

Closed by commit rL327796: [InstSimplify] loosen FMF for sqrt(X) * sqrt(X) --> X (authored by spatel). · Explain WhyMar 18 2018, 7:14 AM

This revision was automatically updated to reflect the committed changes.

spatel mentioned this in rL327797: [InstCombine] add nnan requirement for sqrt(x) * sqrt(y) -> sqrt(x*y).Mar 18 2018, 7:36 AM

Revision Contents

Path

Size

lib/

Analysis/

InstructionSimplify.cpp

9 lines

test/

Transforms/

InstSimplify/

fast-math.ll

4 lines

Diff 135904

lib/Analysis/InstructionSimplify.cpp

Show First 20 Lines • Show All 4,245 Lines • ▼ Show 20 Lines	static Value SimplifyFMulInst(Value Op0, Value *Op1, FastMathFlags FMF,
// fmul X, 1.0 ==> X		// fmul X, 1.0 ==> X
if (match(Op1, m_FPOne()))		if (match(Op1, m_FPOne()))
return Op0;		return Op0;

// fmul nnan nsz X, 0 ==> 0		// fmul nnan nsz X, 0 ==> 0
if (FMF.noNaNs() && FMF.noSignedZeros() && match(Op1, m_AnyZero()))		if (FMF.noNaNs() && FMF.noSignedZeros() && match(Op1, m_AnyZero()))
return Op1;		return Op1;

// sqrt(X) * sqrt(X) --> X		// sqrt(X) * sqrt(X) --> X, if we can:
		// 1. Remove the intermediate rounding (reassociate).
		// 2. Ignore non-zero negative numbers because sqrt would produce NAN.
		// 3. Ignore -0.0 because sqrt(-0.0) == -0.0, but -0.0 * -0.0 == 0.0.
Value *X;		Value *X;
if (FMF.isFast() && Op0 == Op1 &&		if (Op0 == Op1 && match(Op0, m_Intrinsic<Intrinsic::sqrt>(m_Value(X))) &&
match(Op0, m_Intrinsic<Intrinsic::sqrt>(m_Value(X))))		FMF.allowReassoc() && FMF.noNaNs() && FMF.noSignedZeros())
return X;		return X;

return nullptr;		return nullptr;
}		}

Value llvm::SimplifyFAddInst(Value Op0, Value *Op1, FastMathFlags FMF,		Value llvm::SimplifyFAddInst(Value Op0, Value *Op1, FastMathFlags FMF,
const SimplifyQuery &Q) {		const SimplifyQuery &Q) {
return ::SimplifyFAddInst(Op0, Op1, FMF, Q, RecursionLimit);		return ::SimplifyFAddInst(Op0, Op1, FMF, Q, RecursionLimit);
▲ Show 20 Lines • Show All 749 Lines • Show Last 20 Lines

test/Transforms/InstSimplify/fast-math.ll

	Show First 20 Lines • Show All 199 Lines • ▼ Show 20 Lines
	; CHECK: ret float -1.000000e+00			; CHECK: ret float -1.000000e+00
	;			;
	%neg = fsub float 0.000000e+00, %f			%neg = fsub float 0.000000e+00, %f
	%div = fdiv nnan float %f, %neg			%div = fdiv nnan float %f, %neg
	ret float %div			ret float %div
	}			}

	; PR21126: http://llvm.org/bugs/show_bug.cgi?id=21126			; PR21126: http://llvm.org/bugs/show_bug.cgi?id=21126
	; With unsafe/fast math, sqrt(X) * sqrt(X) is just X.			; With loose math, sqrt(X) * sqrt(X) is just X.

	declare double @llvm.sqrt.f64(double)			declare double @llvm.sqrt.f64(double)

	define double @sqrt_squared(double %f) {			define double @sqrt_squared(double %f) {
	; CHECK-LABEL: @sqrt_squared(			; CHECK-LABEL: @sqrt_squared(
	; CHECK-NEXT: ret double [[F:%.*]]			; CHECK-NEXT: ret double [[F:%.*]]
	;			;
	%sqrt = call double @llvm.sqrt.f64(double %f)			%sqrt = call double @llvm.sqrt.f64(double %f)
	%mul = fmul fast double %sqrt, %sqrt			%mul = fmul reassoc nnan nsz double %sqrt, %sqrt
	ret double %mul			ret double %mul
	}			}
				efriedmaUnsubmitted Done Reply Inline Actions Maybe add a test that we don't do this without the relevant fast-math flags? efriedma: Maybe add a test that we don't do this without the relevant fast-math flags?