This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Fold frame index into MUBUF
ClosedPublic

Authored by arsenm on Jun 21 2019, 5:07 PM.

Details

Reviewers
rampitec
Summary

This matters for byval uses outside of the entry block, which appear
as copies.

Previously, the only folding done was during selection, which could
not see the underlying frame index. For any uses outside the entry
block, the frame index was materialized in the entry block relative to
he global scratch wave offset.

This may produce worse code in cases where the offset ends up not
fitting in the MUBUF offset field. A better heuristic would be helpfu
for extreme frames.

Diff Detail

Event Timeline

arsenm created this revision.Jun 21 2019, 5:07 PM
This revision is now accepted and ready to land.Jun 21 2019, 7:13 PM
arsenm closed this revision.Jun 24 2019, 7:54 AM

r364185