On GFX9 and earlier, flat memory ops may decrement VMCNT out-of-order as well as LGKMCNT out-of-order.
Details
Details
Diff Detail
Diff Detail
Event Timeline
lib/Target/AMDGPU/SIInsertWaitcnts.cpp | ||
---|---|---|
740–741 | Update comment to: // If there is a pending FLAT operation, this is a VM or // LGKM waitcnt, and the target can report early // completion, then we need to force a waitcnt 0.} |
test/CodeGen/AMDGPU/waitcnt.mir | ||
---|---|---|
33–37 | Since this is a change for a specific subtarget, should this test have multiple sets of check lines instead of just changing these |
Update comment to: