When lowering a COPY to a target instruction and transferring its
implicit operands, don't add any duplicate implicit operands. This
affects AMDGPU, where both the original COPY and the lowered instruction
(for example V_MOV_B32_e32) can have an implicit use of $exec.
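
The duplicate check can be sketched roughly as follows. This is a
standalone toy model, not the code in this patch: Operand, Instr,
hasIdenticalOperand, transferImplicitOperands and countImplicitOperands
are hypothetical stand-ins written for illustration and do not
correspond to LLVM's MachineOperand or MachineInstr APIs.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy stand-in for a machine operand (hypothetical, not LLVM's MachineOperand).
struct Operand {
  std::string Reg;
  bool IsDef;
  bool IsImplicit;
};

inline bool operator==(const Operand &A, const Operand &B) {
  return A.Reg == B.Reg && A.IsDef == B.IsDef && A.IsImplicit == B.IsImplicit;
}

// Toy stand-in for a machine instruction (hypothetical).
struct Instr {
  std::string Opcode;
  std::vector<Operand> Operands;
};

// Returns true if MI already carries an operand identical to Op.
bool hasIdenticalOperand(const Instr &MI, const Operand &Op) {
  for (const Operand &Existing : MI.Operands)
    if (Existing == Op)
      return true;
  return false;
}

// Copies the implicit operands of Copy onto Lowered, skipping any that
// Lowered already has (for example an implicit use of $exec that the
// target instruction's description added when it was built).
void transferImplicitOperands(const Instr &Copy, Instr &Lowered) {
  for (const Operand &Op : Copy.Operands) {
    if (!Op.IsImplicit)
      continue; // Explicit operands are handled by the lowering itself.
    assert(Op.IsImplicit && "only implicit operands are transferred");
    if (hasIdenticalOperand(Lowered, Op))
      continue; // Already present: adding it again would duplicate it.
    Lowered.Operands.push_back(Op);
  }
}

// Counts implicit operands, the kind of count a later check might rely on.
std::size_t countImplicitOperands(const Instr &MI) {
  std::size_t N = 0;
  for (const Operand &Op : MI.Operands)
    if (Op.IsImplicit)
      ++N;
  return N;
}
```

In this model, transferring from a COPY with an implicit use of $exec
onto a V_MOV_B32_e32 that already has one leaves a single implicit
operand on the lowered instruction instead of two.
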
The duplicate operand is mostly a cosmetic issue, but it could
conceivably have caused missed optimizations in
SIInstrInfo::isReallyTriviallyReMaterializable, where we rely on counting
the number of implicit operands.
This feels like a hack and we should have avoided this