As of gfx7, VMEM operations do not increment the export counter and the input registers are available in the next instruction; update the waitcnt pass to take this into account.
Details
Details
Diff Detail
Diff Detail
- Repository
- rL LLVM
Paths
| Differential D46067
[AMDGPU][Waitcnt] Take ISA target into account for s_waitcnt expcnt instr generation ClosedPublic Authored by msearles on Apr 25 2018, 8:51 AM.
Details Summary As of gfx7, VMEM operations do not increment the export counter and the input registers are available in the next instruction; update the waitcnt pass to take this into account.
Diff Detail
Event TimelineHerald added subscribers: tpr, dstuttard, yaxunl and 4 others. · View Herald TranscriptApr 25 2018, 8:51 AM This revision is now accepted and ready to land.Apr 25 2018, 9:09 AM Closed by commit rL330954: [AMDGPU][Waitcnt] As of gfx7, VMEM operations do not increment the export… (authored by msearles). · Explain WhyApr 26 2018, 9:14 AM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 144134 llvm/trunk/lib/Target/AMDGPU/AMDGPUSubtarget.h
llvm/trunk/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
llvm/trunk/test/CodeGen/AMDGPU/insert_vector_elt.ll
llvm/trunk/test/CodeGen/AMDGPU/llvm.amdgcn.buffer.store.format.ll
llvm/trunk/test/CodeGen/AMDGPU/llvm.amdgcn.buffer.store.ll
llvm/trunk/test/CodeGen/AMDGPU/llvm.amdgcn.image.ll
llvm/trunk/test/CodeGen/AMDGPU/llvm.amdgcn.tbuffer.store.ll
|