On hardware that can back off a barrier when an exception occurs while
traps are enabled, it is not necessary to wait for all outstanding memory
operations before a barrier. Add a new subtarget feature that tracks
which hardware has this ability.
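The decision described above can be sketched in a few lines. This is a toy model, not the code from the patch: `Op`, `ToySubtarget`, `HasBackOffBarrier`, and `insertBarrierWaits` are hypothetical names made up for illustration, and the single "wait for everything" instruction is a simplification of the real wait counters.

```cpp
#include <vector>

// Toy instruction kinds; these are not LLVM's real opcodes.
enum class Op { Load, Store, WaitAll, Barrier, Other };

// Hypothetical stand-in for the new subtarget feature query.
struct ToySubtarget {
  bool HasBackOffBarrier;
};

// Returns a copy of Prog with a WaitAll placed before each Barrier that has
// outstanding memory operations, unless the hardware can back off barriers.
std::vector<Op> insertBarrierWaits(const std::vector<Op> &Prog,
                                   const ToySubtarget &ST) {
  std::vector<Op> Out;
  bool Pending = false; // Any memory op issued since the last WaitAll?
  for (Op I : Prog) {
    if (I == Op::Barrier && Pending && !ST.HasBackOffBarrier) {
      Out.push_back(Op::WaitAll);
      Pending = false;
    }
    if (I == Op::Load || I == Op::Store)
      Pending = true;
    else if (I == Op::WaitAll)
      Pending = false;
    Out.push_back(I);
  }
  return Out;
}
```

With `HasBackOffBarrier` set, the program passes through unchanged. Without it, a wait is placed in front of any barrier that follows an unwaited load or store.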
Details
- Reviewers
- rampitec, arsenm, foad
- Group Reviewers
- Restricted Project 
- Commits
- rG2c82a126d762: [AMDGPU] Omit unnecessary waitcnt before barriers
Diff Detail
- Repository
- rG LLVM Github Monorepo