This propagates an sgpr->vgpr copy into an operand of a
v_fma/v_mad that is generated from v_fmac, provided the fold
doesn't break the constant bus restriction.
Change-Id: I39cacf43205e15982f7405c805d67b7a8b11d2a9
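The constant bus check mentioned in the summary can be illustrated with a small standalone sketch. This is not the code from the patch and does not use LLVM's classes: `Operand`, `OperandKind`, `countConstantBusUses`, and `canFoldSGPR` are hypothetical names, literal constants are left out for brevity, and the bus limit is passed in as a parameter on the assumption that it depends on the subtarget.

```cpp
#include <cstddef>
#include <set>
#include <vector>

// Hypothetical operand model for illustration; not LLVM's MachineOperand.
enum class OperandKind { SGPR, VGPR, InlineConstant };

struct Operand {
  OperandKind Kind;
  unsigned Reg; // Register number; ignored for InlineConstant.
};

// Counts the distinct SGPRs read by an instruction. Reading the same SGPR
// in several operands is counted as a single constant bus use.
std::size_t countConstantBusUses(const std::vector<Operand> &Ops) {
  std::set<unsigned> SGPRs;
  for (const Operand &Op : Ops)
    if (Op.Kind == OperandKind::SGPR)
      SGPRs.insert(Op.Reg);
  return SGPRs.size();
}

// Returns true if replacing operand OpIdx with SGPR NewSGPR keeps the
// instruction within BusLimit constant bus uses. Ops is taken by value so
// the caller's operand list is left unchanged.
bool canFoldSGPR(std::vector<Operand> Ops, std::size_t OpIdx,
                 unsigned NewSGPR, std::size_t BusLimit) {
  if (OpIdx >= Ops.size())
    return false;
  Ops[OpIdx] = {OperandKind::SGPR, NewSGPR};
  return countConstantBusUses(Ops) <= BusLimit;
}
```

With a limit of one, folding a second, different SGPR into an instruction that already reads an SGPR is rejected, while folding the same SGPR again is allowed.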
Differential D66666
[AMDGPU] Remove unnecessary movs for v_fmac operands
Authored by rtaylor on Aug 23 2019, 10:17 AM.
Event Timeline
I somewhat expected this to be handled in SIFoldOperands, since constants are already folded there and this is essentially the same problem. It will always save an instruction. This is the version RA uses, and I'm not sure I expect it to do any folding.
Added a condition in foldOperands to allow tied subreg folding.
Can you also add a MIR test for this? Testing the 16-bit case would also be nice.
Can you fold this opcode list into a canFoldTiedSrcOp (or something similar) predicate function?
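A minimal sketch of what such a predicate could look like is given here. The `Opcode` enumeration is a hypothetical stand-in for the backend's opcode enum, and the set of cases is an assumption for illustration; the actual opcode list in the patch may differ.

```cpp
// Hypothetical stand-in for the backend's opcode enumeration; the real
// opcode list checked in the patch may differ.
enum class Opcode { V_MAC_F32, V_MAC_F16, V_FMAC_F32, V_FMAC_F16, V_ADD_F32 };

// Returns true for opcodes whose tied source operand is a candidate for
// folding. Keeping the list in one predicate gives callers a single place
// to query and a single place to extend.
bool canFoldTiedSrcOp(Opcode Opc) {
  switch (Opc) {
  case Opcode::V_MAC_F32:
  case Opcode::V_MAC_F16:
  case Opcode::V_FMAC_F32:
  case Opcode::V_FMAC_F16:
    return true;
  default:
    return false;
  }
}
```

A call site would then test `canFoldTiedSrcOp(Opc)` instead of repeating the opcode comparisons inline.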