DAGCombiner::visitFCOPYSIGN does
// copysign(x, fp_extend(y)) -> copysign(x, y)
when x has f64 type and y - f32 the backend cannot select fcopysign. Adding the pattern fixes rodinia\myocite test failure.
Paths
| Differential D25827
[AMDGPU] add fcopysign(f64, f32) pattern ClosedPublic Authored by vpykhtin on Oct 20 2016, 8:24 AM.
Details Summary DAGCombiner::visitFCOPYSIGN does // copysign(x, fp_extend(y)) -> copysign(x, y) when x has f64 type and y - f32 the backend cannot select fcopysign. Adding the pattern fixes rodinia\myocite test failure.
Diff Detail
Event Timelinevpykhtin updated this object. This revision is now accepted and ready to land.Oct 20 2016, 8:34 AM Closed by commit rL284743: [AMDGPU] add fcopysign(f64, f32) pattern (authored by vpykhtin). · Explain WhyOct 20 2016, 9:27 AM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 75313 llvm/trunk/lib/Target/AMDGPU/AMDGPUInstructions.td
llvm/trunk/test/CodeGen/AMDGPU/fcopysign.f64.ll
|