We can eliminate old value if bound_ctrl = 1 and row_mask = bank_mask = 0xf.
This is alternative implementation working with the intrinsic in InstCombine.
Original review for past-ISel optimization: D46570.
Details
Details
Diff Detail
Diff Detail
- Repository
- rL LLVM
Event Timeline
lib/Transforms/InstCombine/InstCombineCalls.cpp | ||
---|---|---|
3480–3482 ↗ | (On Diff #145754) | I would prefer to use checked dyn_casts here rather than crash on invalid intrinsic uses |
test/CodeGen/AMDGPU/llvm.amdgcn.update.dpp.ll | ||
2–3 ↗ | (On Diff #145754) | These should go in the test/InstCombine/AMDGPU/amdgcn-intrinsics.ll |