There are no packed instructions for min3 or max3. So, performMinMaxCombine should not optimize vectors of f16 to min3/max3.
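The restriction in the summary can be pictured with a toy model. This is a minimal sketch, not the code from the patch: `ToyType`, `Elem`, `canFormMin3Max3`, and `demo` are hypothetical names invented here, and the only rule modeled is that vector types are never folded into a three-operand min3/max3.

```cpp
#include <cassert>

// Hypothetical stand-in for a value type: element kind plus lane count.
// None of these names come from LLVM; they exist only for this sketch.
enum class Elem { F16, F32 };

struct ToyType {
    Elem elem;
    unsigned lanes;  // 1 means scalar, anything larger means vector

    bool isVector() const { return lanes > 1; }
};

// Decide whether min(min(a, b), c) of this type may be folded into a single
// three-operand min3 (likewise for max3).
// Assumption modeled here: only scalar types have such an instruction, so
// every vector type, including a 2 x f16 vector, is rejected.
bool canFormMin3Max3(const ToyType &t) {
    return !t.isVector();
}

// Small demonstration of the rule on scalar and vector f16.
void demo() {
    assert(canFormMin3Max3(ToyType{Elem::F16, 1}));   // scalar f16: fold allowed
    assert(!canFormMin3Max3(ToyType{Elem::F16, 2}));  // 2 x f16: left as nested min/max
}
```

In this model a vector operand simply keeps its nested two-operand min/max form, which is the behavior the summary asks for.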
Differential D45219
[AMDGPU] performMinMaxCombine should not optimize patterns of vectors to min3/max3
Closed, Public. Authored by FarhanaAleen on Apr 3 2018, 11:31 AM.
Event Timeline
Herald added subscribers: t-tye, tpr, dstuttard and 5 others. Apr 3 2018, 11:31 AM
Closed by commit rL329131: [AMDGPU] performMinMaxCombine should not optimize patterns of vectors to… (authored by faaleen).
This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 140834
lib/Target/AMDGPU/SIISelLowering.cpp
test/CodeGen/AMDGPU/fmax3.ll
test/CodeGen/AMDGPU/fmin3.ll
Needs check lines