This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Avoid most waitcnts before calls
ClosedPublic

Authored by arsenm on Jun 14 2019, 1:21 PM.

Details

Summary

Currently you get extra waits, because waits are inserted for the
register dependencies of the call, and the function prolog waits on
everything.

Currently waits are still inserted on returns. It may make sense to
not do this, and wait in the caller instead.

Diff Detail

Event Timeline

arsenm created this revision.Jun 14 2019, 1:21 PM
arsenm updated this revision to Diff 204839.Jun 14 2019, 1:22 PM

Fix leftover junk in comment

arsenm updated this revision to Diff 204840.Jun 14 2019, 1:30 PM

Handle tail call correctly in case of future changes

This revision is now accepted and ready to land.Jun 14 2019, 1:30 PM
arsenm closed this revision.Jun 14 2019, 2:49 PM

r363465