The exp2 and log2 intrinsics codegen correctly for f16 and f32, but not
for f64. Using them directly prevents losing fast math flags on the way
to the underlying intrinsic.
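For context, the test files in this revision cover the pow, pown and powr libcall simplifications. A minimal sketch of the identity such a simplification can rely on, assuming the expansion pow(x, y) = exp2(y * log2(x)) for positive x (this page does not state the exact expansion, and `pow_via_exp2_log2` is a hypothetical name, not a function from the patch):

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper, not taken from AMDGPULibCalls.cpp:
// computes pow(x, y) through exp2 and log2 in single precision.
// Only meaningful for x > 0, since log2 of a non-positive value
// is not a finite real number.
float pow_via_exp2_log2(float x, float y) {
    return std::exp2(y * std::log2(x));
}

// Compares the expansion against std::pow within a relative tolerance.
bool matches_pow(float x, float y, float rel_tol) {
    float expected = std::pow(x, y);
    float actual = pow_via_exp2_log2(x, y);
    return std::fabs(actual - expected) <= rel_tol * std::fabs(expected);
}
```

The sketch only illustrates the arithmetic. Fast math flags live on the IR call instructions and have no portable C++ equivalent, so they are not modeled here.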
Details
- Reviewers: rampitec, jmmartinez
- Group Reviewers: Restricted Project
Diff Detail
Paths
Differential D158997
AMDGPU: Use exp2 and log2 intrinsics directly for f16/f32
Closed, Public. Authored by arsenm on Aug 28 2023, 8:24 AM.
Event Timeline
- arsenm added a child revision: D158998: AMDGPU: Use pown instead of pow if known integral. (Aug 28 2023, 8:28 AM)
- This revision is now accepted and ready to land. (Aug 28 2023, 10:35 AM)
Revision Contents
Diff 553950
llvm/lib/Target/AMDGPU/AMDGPULibCalls.cpp
llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-pow-codegen.ll
llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-pow.ll
llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-pown.ll
llvm/test/CodeGen/AMDGPU/amdgpu-simplify-libcall-powr.ll
llvm/test/CodeGen/AMDGPU/simplify-libcalls.ll