This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Run SIInsertWaits as pre-emit pass
Closed, Public

Authored by arsenm on Jun 27 2015, 2:40 PM.

Details

Reviewers
tstellarAMD
Summary

Running this pass after the scheduler allows the waits
to be placed later, so other ALU instructions can execute
during the time that would otherwise be spent waiting.

When combined with enabling the post-RA scheduler, this
gives about a 20% improvement on sgemm.
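
The effect described in the summary can be illustrated with a toy model. This is only a sketch and not the SIInsertWaits implementation: the `Instr` class, the `insert_waits` and `count_cycles` helpers, the 4-cycle `LOAD_LATENCY`, and the one-cycle issue cost are all assumptions made for illustration, and the "scheduled" order is written by hand to stand in for what a scheduler might produce.

```python
from dataclasses import dataclass

LOAD_LATENCY = 4  # hypothetical: cycles after issue until a load's result is ready


@dataclass(frozen=True)
class Instr:
    name: str
    kind: str          # "load", "alu", or "wait"
    dst: str = ""      # register written, if any
    srcs: tuple = ()   # registers read


def insert_waits(program):
    """Place a wait immediately before the first use of a pending load result."""
    pending = set()
    out = []
    for ins in program:
        if any(src in pending for src in ins.srcs):
            out.append(Instr("s_waitcnt", "wait"))
            pending.clear()  # toy model: one wait drains every outstanding load
        if ins.kind == "load":
            pending.add(ins.dst)
        out.append(ins)
    return out


def count_cycles(program):
    """Loads and ALU ops issue in one cycle; a wait only stalls until loads finish."""
    cycle = 0
    ready_at = 0  # cycle at which the last outstanding load completes
    for ins in program:
        if ins.kind == "wait":
            cycle = max(cycle, ready_at)
        else:
            cycle += 1
            if ins.kind == "load":
                ready_at = max(ready_at, cycle + LOAD_LATENCY)
    return cycle


load = Instr("load", "load", dst="v0")
use = Instr("add", "alu", dst="v1", srcs=("v0",))
independent = [Instr(f"alu{i}", "alu", dst=f"v{i + 2}") for i in range(3)]

# Original order: the use directly follows the load.
unscheduled = [load, use, *independent]
# Hand-written stand-in for a scheduled order: independent ALU work is hoisted.
scheduled = [load, *independent, use]

early = insert_waits(unscheduled)  # waits placed on the unscheduled order
late = insert_waits(scheduled)     # waits placed after scheduling

print(count_cycles(early), count_cycles(late))  # prints "9 6"
```

In this model the wait in the unscheduled order stalls for the full load latency, while in the scheduled order the three independent ALU instructions cover most of that latency before the wait is reached.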

Event Timeline

arsenm updated this revision to Diff 28632. Jun 27 2015, 2:40 PM
arsenm retitled this revision to AMDGPU: Run SIInsertWaits as pre-emit pass.
arsenm updated this object.
arsenm edited the test plan for this revision.
arsenm added a subscriber: Unknown Object (MLST).
This revision is now accepted and ready to land. Jul 6 2015, 7:20 AM
arsenm closed this revision. Jul 6 2015, 10:02 AM

r241473