-amdgpu-waitcnt-forcezero={1|0}  Force all waitcnt instrs to be emitted as s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
-amdgpu-waitcnt-forceexp=<n>  Force emit a s_waitcnt expcnt(0) before the first <n> instrs
-amdgpu-waitcnt-forcelgkm=<n>  Force emit a s_waitcnt lgkmcnt(0) before the first <n> instrs
-amdgpu-waitcnt-forcevm=<n>  Force emit a s_waitcnt vmcnt(0) before the first <n> instrs
Details
- Reviewers
  arsenm, kzhuravl
- Commits
- rGd6d5a2571fa4: [AMDGPU] Add options for waitcnt pass debugging; add instr count in debug…
rGa84d23489ad5: [AMDGPU] Add options for waitcnt pass debugging; add instr count in debug…
rL323788: [AMDGPU] Add options for waitcnt pass debugging; add instr count in debug…
rL320084: [AMDGPU] Add options for waitcnt pass debugging; add instr count in debug…
Diff Detail
- Repository
- rL LLVM
Event Timeline
This will be pretty useful. Thanks!
lib/Target/AMDGPU/SIInsertWaitcnts.cpp
Lines | Diff | Comment |
---|---|---|
426–428 | (On Diff #123055) | tabs? |
1168–1174 | (On Diff #123055) | tabs? |
1186–1189 | (On Diff #123055) | tabs? |
lib/Target/AMDGPU/SIInsertWaitcnts.cpp
Lines | Diff | Comment |
---|---|---|
424 | (On Diff #123055) | const? |
The patch was submitted and then reverted because it caused a buildbot failure ("private field 'InstCnt' is not used"). InstCnt is now a function-scope static instead.
Reopening: the patch was submitted and then reverted because it caused a buildbot failure ("private field 'InstCnt' is not used"). InstCnt is now a function-scope static instead.
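For readers unfamiliar with this kind of failure, here is a minimal sketch of the pattern the fix describes. It is not the code from SIInsertWaitcnts.cpp: the struct `WaitcntPassSketch` and the method `visitInstruction` are hypothetical names used only for illustration. The assumption is that the counter was a private member referenced only from debug-only code, so builds that compile that code out saw an unused private field; moving the counter into the function that uses it avoids the diagnostic.

```cpp
#include <string>

// Hypothetical stand-in for the pass; NOT the actual SIInsertWaitcnts code.
struct WaitcntPassSketch {
  // Before (assumed shape of the problem): a private member such as
  //   int InstCnt = 0;
  // that is referenced only from debug-only code can trigger
  // "private field 'InstCnt' is not used" when that code is compiled out.

  // After: the counter is a function-scope static inside the one function
  // that needs it, so there is no class member left to be reported unused.
  int visitInstruction(const std::string &Name) {
    static int InstCnt = 0; // initialized once, persists across calls
    (void)Name;             // the name would only feed debug output
    return ++InstCnt;       // running count of instructions seen so far
  }
};
```

One consequence of this design is worth noting: a function-scope static is shared by every instance of the class and is never reset between runs, which is usually acceptable for a debug-only instruction counter but would not be for state the pass depends on.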