This is an archive of the discontinued LLVM Phabricator instance.

[SimplifyLibCalls] require fast-math-flags for pow(X, -0.5) transforms
ClosedPublic

Authored by spatel on Dec 19 2019, 7:22 AM.

Details

Summary

As discussed in PR44330:
https://bugs.llvm.org/show_bug.cgi?id=44330
...the transform from pow(X, -0.5) libcall/intrinsic to reciprocal square root can result in small deviations from the expected result due to differences in the pow() implementation and/or the extra rounding step from the division.
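
The deviation described above can be measured with a small experiment. The following is a minimal sketch, not part of the patch: the helper names `recip_sqrt`, `ulp_distance`, and `count_mismatches` are hypothetical, and the sampled interval [1, 2) is an arbitrary choice. It compares the bits of pow(x, -0.5) against 1.0 / sqrt(x) and counts the inputs where they differ.

```cpp
#include <cmath>
#include <cstdint>
#include <cstring>

// Reciprocal square root as the transform would compute it: sqrt rounds
// once, and the division rounds a second time.
double recip_sqrt(double x) { return 1.0 / std::sqrt(x); }

// Distance in units in the last place between two positive finite doubles.
// For such values the raw bit patterns are ordered like the values themselves.
std::int64_t ulp_distance(double a, double b) {
    std::int64_t ia, ib;
    std::memcpy(&ia, &a, sizeof ia);
    std::memcpy(&ib, &b, sizeof ib);
    return ia > ib ? ia - ib : ib - ia;
}

// Counts sample inputs in [1, 2) where pow(x, -0.5) and 1/sqrt(x) disagree.
// The count depends on the libm in use; a compiler that performs the
// transform itself (for example under -ffast-math) would report zero.
int count_mismatches(int samples) {
    int mismatches = 0;
    for (int i = 0; i < samples; ++i) {
        double x = 1.0 + static_cast<double>(i) / samples;
        if (ulp_distance(std::pow(x, -0.5), recip_sqrt(x)) != 0)
            ++mismatches;
    }
    return mismatches;
}
```

Calling `count_mismatches(1000000)` in a build without fast-math gives a rough idea of how often the two forms disagree on a given platform; no particular count is claimed here.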

This patch proposes to allow that difference with either the 'approximate functions' or 'reassociate' FMF:
http://llvm.org/docs/LangRef.html#fast-math-flags

In practice, this likely means that the code is compiled with all of 'fast' (-ffast-math), but I have preserved the existing specializations for -0.0/-INF that enable generating safe code if those special values are allowed simultaneously with allowing approximation/reassociation.
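
To see why -0.0 and -INF need special handling, recall that C's Annex F specifies pow(-0.0, -0.5) as +INF and pow(-INF, -0.5) as +0.0, whereas a plain 1.0 / sqrt(x) yields -INF and NaN for those inputs. The sketch below shows one way to guard both cases. It is an illustration under that assumption, the name `guarded_recip_sqrt` is hypothetical, and it is not claimed to match the exact code the patch preserves.

```cpp
#include <cmath>
#include <limits>

// Reciprocal square root that agrees with pow(x, -0.5) on the special
// inputs -0.0 and -infinity.
double guarded_recip_sqrt(double x) {
    const double inf = std::numeric_limits<double>::infinity();
    // sqrt(-inf) is NaN, so substitute +inf for that one input.
    // fabs() turns sqrt(-0.0) == -0.0 into +0.0 so the division gives +inf.
    double root = (x == -inf) ? inf : std::fabs(std::sqrt(x));
    return 1.0 / root;
}
```

When infinities or signed zeros are themselves ruled out by fast-math-flags, the corresponding guard becomes unnecessary and the plain 1.0 / sqrt(x) form suffices.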

The question about whether a similar restriction is needed for the non-reciprocal case -- pow(X, 0.5) -- is deferred. That transform is allowed without FMF currently, and this patch does not change that behavior.
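
The resulting policy can be summarized as a small decision function. This is a toy model only: `ToyFMF`, `PowLowering`, and `choose_lowering` are hypothetical names that do not exist in LLVM, and the model ignores every other precondition the real transform checks.

```cpp
// Possible outcomes for a pow() call in this simplified model.
enum class PowLowering { KeepPow, Sqrt, RecipSqrt };

// Hypothetical stand-in for the two fast-math-flags this patch consults.
struct ToyFMF {
    bool afn;      // 'approximate functions'
    bool reassoc;  // 'reassociate'
};

PowLowering choose_lowering(double exponent, ToyFMF fmf) {
    // pow(X, 0.5) is still converted without any flags.
    if (exponent == 0.5)
        return PowLowering::Sqrt;
    // pow(X, -0.5) now needs either flag before becoming 1/sqrt(X).
    if (exponent == -0.5)
        return (fmf.afn || fmf.reassoc) ? PowLowering::RecipSqrt
                                        : PowLowering::KeepPow;
    return PowLowering::KeepPow;
}
```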

Event Timeline

spatel created this revision. Dec 19 2019, 7:22 AM
This revision is now accepted and ready to land. Dec 19 2019, 2:27 PM
This revision was automatically updated to reflect the committed changes.