When SCC is dead but VCC is still required, replace s_and / s_andn2
with an s_mov into VCC when the mask value is 0 or -1.
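
As a rough illustration of why the fold is sound, here is a minimal standalone sketch of the arithmetic. It is not the code in SIPreEmitPeephole.cpp: `MaskOp`, `MovSrc`, `foldToMov`, `evalMaskOp` and `evalMov` are hypothetical names invented for this example, and it assumes 64-bit masks and that s_andn2 computes `x & ~mask`. The SCC-dead precondition is modelled as a plain flag, on the understanding that s_and / s_andn2 write SCC while s_mov does not.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Hypothetical model, not LLVM API: the two scalar ops named in the summary.
enum class MaskOp { And, AndN2 };

// What the replacement s_mov would copy into VCC.
enum class MovSrc { Zero, OtherOperand };

// Reference semantics (assumed): s_and is x & mask, s_andn2 is x & ~mask.
constexpr uint64_t evalMaskOp(MaskOp op, uint64_t x, uint64_t mask) {
  return op == MaskOp::And ? (x & mask) : (x & ~mask);
}

// Value produced by the replacement s_mov for a given non-mask operand x.
constexpr uint64_t evalMov(MovSrc src, uint64_t x) {
  return src == MovSrc::OtherOperand ? x : 0;
}

// Returns the s_mov source if the fold applies, std::nullopt otherwise.
// The fold drops the SCC write, so it is only allowed when SCC is dead.
inline std::optional<MovSrc> foldToMov(MaskOp op, uint64_t mask, bool sccDead) {
  if (!sccDead)
    return std::nullopt;
  const bool allOnes = mask == ~uint64_t{0};
  const bool allZero = mask == 0;
  if (!allOnes && !allZero)
    return std::nullopt; // only the constant masks 0 and -1 are handled
  // And with -1 and AndN2 with 0 both pass the other operand through;
  // And with 0 and AndN2 with -1 both produce zero.
  const bool passThrough = (op == MaskOp::And) ? allOnes : allZero;
  return passThrough ? MovSrc::OtherOperand : MovSrc::Zero;
}

// Small self-check: the folded s_mov matches the original op for one sample x.
inline void selfCheck() {
  const uint64_t x = 0x00F0;
  const auto src = foldToMov(MaskOp::And, ~uint64_t{0}, /*sccDead=*/true);
  assert(src.has_value());
  assert(evalMov(*src, x) == evalMaskOp(MaskOp::And, x, ~uint64_t{0}));
}
```

In this model, any mask other than 0 or -1, or a live SCC, leaves the original instruction in place.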
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Unit Tests
Time | Test | |
---|---|---|
1,410 ms | windows > LLVM.CodeGen/AMDGPU::Unknown Unit Message ("") |
Event Timeline
llvm/lib/Target/AMDGPU/SIPreEmitPeephole.cpp | ||
---|---|---|
140–141 | Can we do this earlier? Removing the SCC def earlier would be more useful |
llvm/lib/Target/AMDGPU/SIPreEmitPeephole.cpp | ||
---|---|---|
140–141 | To the best of my understanding, the earliest this optimisation becomes available is after "Branch Probability Basic Block Placement", which is not much earlier. |