This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Drop unsafe 1/sqrt -> rsq combine
Closed · Public

Authored by arsenm on Aug 15 2023, 3:56 PM.

Details

Reviewers
foad
rampitec
Group Reviewers
Restricted Project
Summary

AMDGPUCodeGenPrepare implements a safer version of this that handles
denormals correctly.
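
As a rough illustration of why a direct 1/sqrt(x) -> rsq(x) rewrite can go wrong, the sketch below models a reciprocal-square-root operation that treats denormal inputs as zero, then compares it with a variant that scales the input out of the denormal range first. The function names (`model_hw_rsq`, `naive_rsq`, `scaled_rsq`), the flush-to-zero behavior of the model, and the 2^24 / 2^12 scale factors are assumptions chosen for this example. This is not the code from this revision or from AMDGPUCodeGenPrepare.

```cpp
#include <cassert>
#include <cfloat>
#include <cmath>

// Hypothetical model of a hardware rsq that flushes denormal inputs to zero.
// This is an assumption for illustration, not a description of a specific ISA.
float model_hw_rsq(float x) {
    if (std::fpclassify(x) == FP_SUBNORMAL)
        x = std::copysign(0.0f, x);          // denormal input treated as zero
    if (x == 0.0f)
        return std::copysign(INFINITY, x);   // 1/sqrt(0) is infinite
    return 1.0f / std::sqrt(x);
}

// Naive combine: replace 1/sqrt(x) with rsq(x) directly.
float naive_rsq(float x) { return model_hw_rsq(x); }

// Scaled variant: move denormal inputs into the normal range before rsq,
// then undo the scaling. Since 1/sqrt(x * 2^24) == 2^-12 / sqrt(x),
// multiplying the result by 2^12 recovers 1/sqrt(x).
float scaled_rsq(float x) {
    assert(x >= 0.0f);                        // sketch only handles non-negative input
    const bool need_scale = x < FLT_MIN;      // FLT_MIN is the smallest normal float
    const float in = need_scale ? std::ldexp(x, 24) : x;
    const float r = model_hw_rsq(in);
    return need_scale ? std::ldexp(r, 12) : r;
}
```

Under this model, a denormal input such as 2^-130 makes `naive_rsq` return infinity, while `scaled_rsq` returns the finite value 2^65. Normal inputs take the unscaled path in both versions.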

Event Timeline

arsenm created this revision. Aug 15 2023, 3:56 PM
Herald added a project: Restricted Project. Aug 15 2023, 3:56 PM
arsenm requested review of this revision. Aug 15 2023, 3:56 PM
Herald added a subscriber: wdng.
arsenm retitled this revision from "AMDGPU: Drop unsafe 1/rcp -> rsq combine" to "AMDGPU: Drop unsafe 1/sqrt -> rsq combine". Aug 15 2023, 3:56 PM
rampitec accepted this revision. Aug 15 2023, 4:01 PM
This revision is now accepted and ready to land. Aug 15 2023, 4:01 PM