This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Don't require all flags to expand fast powr
ClosedPublic

Authored by arsenm on Aug 25 2023, 4:06 PM.

Details

Reviewers
rampitec
foad
jmmartinez
Group Reviewers
Restricted Project
Summary

This was requiring all fast math flags, which is practically
useless. This wouldn't fire using all the standard OpenCL fast math
flags. This only needs afn nnan and ninf.

Diff Detail

Event Timeline

arsenm created this revision.Aug 25 2023, 4:06 PM
Herald added a project: Restricted Project. · View Herald TranscriptAug 25 2023, 4:06 PM
arsenm requested review of this revision.Aug 25 2023, 4:06 PM
Herald added a project: Restricted Project. · View Herald TranscriptAug 25 2023, 4:06 PM
Herald added a subscriber: wdng. · View Herald Transcript
rampitec added inline comments.Aug 25 2023, 4:13 PM
llvm/lib/Target/AMDGPU/AMDGPULibCalls.cpp
802

I do not see why CF, CINT, and CZero cannot be null.

rampitec accepted this revision.Aug 25 2023, 4:18 PM
rampitec added inline comments.
llvm/lib/Target/AMDGPU/AMDGPULibCalls.cpp
802

Ah, OK, it seems to be checked with every use.

This revision is now accepted and ready to land.Aug 25 2023, 4:18 PM