This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Remove unnecessary s_waitcnt between VMEM loads
ClosedPublic

Authored by foad on Apr 30 2020, 8:02 AM.

Details

Summary

VMEM loads of the same type (sampler vs no sampler) are guaranteed to
write their result registers in order, so there is no need for an
s_waitcnt even if they write to overlapping vgprs.

Diff Detail

Event Timeline

foad created this revision.Apr 30 2020, 8:02 AM
Herald added a project: Restricted Project. · View Herald TranscriptApr 30 2020, 8:02 AM
This revision is now accepted and ready to land.Apr 30 2020, 11:02 AM
This revision was automatically updated to reflect the committed changes.