This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Omit unnecessary waitcnt before barriers
ClosedPublic

Authored by kerbowa on Jul 28 2022, 11:45 AM.

Details

Summary

It is not necessary to wait for all outstanding memory operations before
barriers on hardware that can back off of the barrier in the event of an
exception when traps are enabled. Add a new subtarget feature which
tracks which HW has this ability.

Diff Detail

Event Timeline

kerbowa created this revision.Jul 28 2022, 11:45 AM
Herald added a project: Restricted Project. · View Herald TranscriptJul 28 2022, 11:45 AM
kerbowa requested review of this revision.Jul 28 2022, 11:45 AM
Herald added a project: Restricted Project. · View Herald TranscriptJul 28 2022, 11:45 AM

What's the resolution of mesa problem?

What's the resolution of mesa problem?

I believe they manually insert waitcnt now. I forwarded the email discussion.

rampitec accepted this revision.Jul 28 2022, 12:35 PM
This revision is now accepted and ready to land.Jul 28 2022, 12:35 PM
This revision was landed with ongoing or failed builds.Jul 29 2022, 11:21 AM
This revision was automatically updated to reflect the committed changes.