Mutate the node's result type during selection when the
type doesn't matter. This avoids an intermediate bitcast
node on targets with legal i16/f16.
Also fixes missing output modifiers on v_cvt_pkrtz_f16_f32,
which I assume are OK.
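As a rough illustration of the idea in the summary, here is a toy C++ model that contrasts wrapping a node in a bitcast with retagging its result type in place. It is only a sketch: `Node`, `Graph`, `viaBitcast`, and `mutateType` are hypothetical names invented for this example and are not LLVM APIs, and the model assumes both types are 32 bits wide so the retagging is harmless.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Toy value types (hypothetical). Both are 32 bits wide in this model.
enum class VT { I32, V2F16 };

constexpr unsigned bitWidth(VT) { return 32; }

// A minimal graph node: an opcode name, a result type, and operands.
struct Node {
    std::string opcode;
    VT type;
    std::vector<Node*> operands;
};

// Owns all nodes so raw pointers between nodes stay valid.
struct Graph {
    std::vector<std::unique_ptr<Node>> nodes;

    Node* add(std::string opcode, VT type, std::vector<Node*> operands = {}) {
        nodes.push_back(std::make_unique<Node>(
            Node{std::move(opcode), type, std::move(operands)}));
        return nodes.back().get();
    }
};

// Strategy A: keep the node's type and add a separate bitcast node
// that produces the wanted type. Costs one extra node.
Node* viaBitcast(Graph& g, Node* n, VT wanted) {
    if (n->type == wanted)
        return n;
    return g.add("bitcast", wanted, {n});
}

// Strategy B: change the node's result type in place. This is only
// sound when the type does not change which bits are produced, which
// this toy approximates by requiring equal bit widths.
Node* mutateType(Node* n, VT wanted) {
    assert(bitWidth(n->type) == bitWidth(wanted));
    n->type = wanted;
    return n;
}
```

In this model, strategy A leaves two nodes in the graph (the pack node and the bitcast), while strategy B leaves the single original node with a new type tag, which is the saving the summary describes.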
Differential D50069
AMDGPU: Improve hack for packing conversion ops
Closed · Public · Authored by arsenm on Jul 31 2018, 7:55 AM.
Event Timeline
Herald added subscribers: t-tye, tpr, dstuttard and 3 others. · Jul 31 2018, 7:55 AM
This revision is now accepted and ready to land. · Jul 31 2018, 9:19 AM
Revision Contents
Diff 158266
lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
lib/Target/AMDGPU/SIISelLowering.cpp
lib/Target/AMDGPU/SIInstrInfo.td
lib/Target/AMDGPU/VOP2Instructions.td
test/CodeGen/AMDGPU/clamp-modifier.ll
test/CodeGen/AMDGPU/coalescer-subranges-another-prune-error.mir
test/CodeGen/AMDGPU/couldnt-join-subrange-3.mir
test/CodeGen/AMDGPU/subreg-split-live-in-error.mir