If result of fmul(b,c) has one use, in almost all cases (except denormals are
IEEE) the pair of operations will be fused in one fma/mad/mac/etc.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Paths
| Differential D83919
[AMDGPU][CostModel] Improve cost estimation for fused {fadd|fsub}(a,fmul(b,c)) ClosedPublic Authored by dfukalov on Jul 15 2020, 4:25 PM.
Details Summary If result of fmul(b,c) has one use, in almost all cases (except denormals are
Diff Detail
Event TimelineThis revision is now accepted and ready to land.Jul 15 2020, 4:28 PM
Closed by commit rG76a0c0ee6ffa: [AMDGPU][CostModel] Improve cost estimation for fused {fadd|fsub}(a,fmul(b,c)) (authored by dfukalov). · Explain WhyJul 15 2020, 5:07 PM This revision was automatically updated to reflect the committed changes. dfukalov marked an inline comment as done.
Revision Contents
Diff 278339 llvm/include/llvm/CodeGen/BasicTTIImpl.h
llvm/lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
llvm/test/Analysis/CostModel/AMDGPU/fused_costs.ll
|
Extra newlines