Avoids instructions to pack a vector when the source is really
a scalar being broadcast.
Also be smarter and look for per-component fneg.
Doesn't yet handle scalar from upper half of register
or other swizzles.
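As a rough illustration of the broadcast case described above, here is a minimal standalone C++ sketch. It is not the code from this revision and does not use LLVM's SelectionDAG API. The names `SrcMod`, `Lane`, `PackedSource` and `selectBroadcast` are hypothetical, and the modifier bit values are made up for this example. It assumes each lane of a two-element vector is described by the scalar it was built from plus a flag saying whether that lane was wrapped in an fneg.

```cpp
// Toy model only: hypothetical types and bit values, not LLVM's encoding.
enum SrcMod : unsigned {
  NegLo   = 1u << 0,  // negate the value feeding the low lane
  NegHi   = 1u << 1,  // negate the value feeding the high lane
  OpSelHi = 1u << 2   // high lane reads the upper half of the register
};

// One element of a two-element vector operand.
struct Lane {
  int scalarId;   // identifies the scalar this lane was built from
  bool negated;   // true if this lane alone was wrapped in an fneg
};

// The operand as it would be handed to a packed instruction.
struct PackedSource {
  int scalarId;   // register holding the scalar in its low half
  unsigned mods;  // combination of SrcMod bits
};

// Returns true and fills `out` when both lanes come from the same scalar,
// so the scalar can be used directly instead of packing a vector first.
// Returns false when the lanes differ and a real packed vector is needed.
bool selectBroadcast(Lane lo, Lane hi, PackedSource &out) {
  if (lo.scalarId != hi.scalarId)
    return false;

  unsigned mods = 0;
  if (lo.negated)
    mods |= NegLo;  // per-component fneg on the low lane
  if (hi.negated)
    mods |= NegHi;  // per-component fneg on the high lane

  // OpSelHi stays clear: both lanes read the low half of the register.
  // A scalar living in the upper half is not handled, matching the
  // limitation stated in the summary.
  out.scalarId = lo.scalarId;
  out.mods = mods;
  return true;
}
```

In this model, a vector built as `{x, -x}` yields the scalar `x` with only `NegHi` set, while a vector built from two different scalars is rejected and would still need to be packed.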
D32606: AMDGPU: Try to use op_sel when selecting packed instructions
Closed, Public
Authored by arsenm on Apr 27 2017, 11:26 AM.
Event Timeline
Herald added subscribers: t-tye, tpr, dstuttard and 4 others. (Apr 27 2017, 11:26 AM)
arsenm added a child revision: D32630: AMDGPU: Make better use of op_sel with high components. (Apr 27 2017, 8:17 PM)
This revision is now accepted and ready to land. (May 17 2017, 7:51 AM)
Revision Contents
Diff 96984
lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp
test/CodeGen/AMDGPU/immv216.ll
test/CodeGen/AMDGPU/packed-op-sel.ll