This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Remove pointless libcall optimization of fma/mad
ClosedPublic

Authored by arsenm on Jul 31 2023, 5:16 AM.

Details

Reviewers
rampitec
jhuber6
vpykhtin
dfukalov
foad
yaxunl
Group Reviewers
Restricted Project
Summary

After the library is linked and trivially inlined, the generic fma and
fmuladd intrinsics already handle these cases, and with precise flag
handling. This was requiring all fast math flags when we really just
need nsz for the fma(a, b, 0) case.

Diff Detail