global/scratch_load will return in order they are issued. No
need to insert a s_waitcnt for WAW hazard.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
LGTM. @nhaehnle already explained offline that the basic idea is sound.
llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp | ||
---|---|---|
144 | I guess this is OK but it I really don't know what we mean by "VMEM" any more. | |
152 | I don't think you need to change this line. Global and scratch will already fail the isMIMG test. | |
1190 | Don't need the outer parentheses here. |
llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp | ||
---|---|---|
144 | Yes, good idea. |
Since this change one of our bots has been timing out trying to build SIInsertWaitcnts.cpp https://lab.llvm.org/buildbot/#/builders/182/builds/4502.
Not your problem, but just so you know you can ignore the failure email. We will find the underlying issue ourselves.
I guess this is OK but it I really don't know what we mean by "VMEM" any more.