We were basing the decision to defer the fast case to codegen on the fdiv
itself, and not looking for a foldable sqrt input.
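As an illustration of the shape being described, here is a minimal LLVM IR sketch of an fdiv whose denominator is a sqrt call. The function name, the `contract` flags, and the `!fpmath` accuracy value are assumptions chosen for illustration and are not taken from the revision; the exact conditions under which AMDGPUCodeGenPrepare forms an rsq are defined by the patch itself.

```llvm
; Hypothetical input: a reciprocal of a square root.
; The fast-math flags and !fpmath value are illustrative assumptions.
define float @rsq_candidate(float %x) {
  ; The sqrt feeding the division is the "foldable sqrt input".
  %sqrt = call contract float @llvm.sqrt.f32(float %x), !fpmath !0
  ; Inspecting only this fdiv does not reveal that the pair can be folded.
  %rsq = fdiv contract float 1.0, %sqrt, !fpmath !0
  ret float %rsq
}

declare float @llvm.sqrt.f32(float)

!0 = !{float 2.5}
```

With a pattern like this, a pass that looks through the fdiv to its sqrt operand could replace both instructions with a single call such as `llvm.amdgcn.rsq.f32`, rather than deferring the division to codegen.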
Differential D158127
AMDGPU: Permit more rsq formation in AMDGPUCodeGenPrepare
Closed, Public. Authored by arsenm on Aug 16 2023, 3:58 PM.
Revision Contents
Diff 550922
llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.ll
llvm/test/CodeGen/AMDGPU/fdiv_flags.f32.ll
llvm/test/CodeGen/AMDGPU/rsq.f32.ll