We can eliminate old value if bound_ctrl = 1 and row_mask = bank_mask = 0xf.
This is alternative implementation working with the intrinsic in InstCombine.
Original review for past-ISel optimization: D46570.
Details
Details
Diff Detail
Diff Detail
Paths
| Differential D46596
[AMDGPU] Optimze old value of v_mov_b32_dpp ClosedPublic Authored by rampitec on May 8 2018, 12:01 PM.
Details Summary We can eliminate old value if bound_ctrl = 1 and row_mask = bank_mask = 0xf.
Diff Detail Event TimelineHerald added subscribers: t-tye, tpr, dstuttard and 2 others. · View Herald TranscriptMay 8 2018, 12:01 PM rampitec marked 2 inline comments as done. This revision is now accepted and ready to land.May 21 2018, 11:23 PM Closed by commit rL332956: [AMDGPU] Optimze old value of v_mov_b32_dpp (authored by rampitec). · Explain WhyMay 22 2018, 1:08 AM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 145754 lib/Transforms/InstCombine/InstCombineCalls.cpp
test/CodeGen/AMDGPU/llvm.amdgcn.update.dpp.ll |
I would prefer to use checked dyn_casts here rather than crash on invalid intrinsic uses