When SCC is dead, but VCC is required then replace s_and / s_andn2
with s_mov into VCC when mask value is 0 or -1.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
llvm/lib/Target/AMDGPU/SIPreEmitPeephole.cpp | ||
---|---|---|
140–153 | Can we do this earlier? Removing the SCC def earlier would be more useful |
llvm/lib/Target/AMDGPU/SIPreEmitPeephole.cpp | ||
---|---|---|
140–153 | To the best of my understanding, the earliest this optimisation becomes available is after "Branch Probability Basic Block Placement". Which is not much earlier. |
Can we do this earlier? Removing the SCC def earlier would be more useful