We found that it might be beneficial to have the SIOptimizeExecMasking
pass detect more cases where v_cmp, s_and_saveexec patterns can be
transformed to s_mov, v_cmpx patterns. Currently, the search range
for finding a fitting v_cmp instruction is 5, however, this is doubled
to 10 here.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
llvm/lib/Target/AMDGPU/SIOptimizeExecMasking.cpp | ||
---|---|---|
322 | Pre-existing problem: debug instructions (A->isDebugInstr()) should not count towards the search limit, because we need to get identical codegen with and without debug instructions present. |
Pre-existing problem: debug instructions (A->isDebugInstr()) should not count towards the search limit, because we need to get identical codegen with and without debug instructions present.