This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Propagate amdgpu-flat-work-group-size attributes
Closed, Public

Authored by arsenm on Oct 19 2020, 2:04 PM.

Details

Summary

Fixes the register counts in called functions being overly
conservative. Ideally this would do a conservative merge of the
callers' ranges, but for now it just clones the function.

Also fixes the pass not being functional when run standalone.
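
For illustration only, the "conservative range merge" mentioned above could look roughly like the sketch below. It is not code from this revision. It assumes the attribute value has the "min,max" form (for example "amdgpu-flat-work-group-size"="1,256") and that a conservative merge means the smallest range covering both inputs. SizeRange, parseRange and mergeRanges are hypothetical names, not LLVM API.

```cpp
#include <algorithm>
#include <exception>
#include <optional>
#include <string>
#include <utility>

// Hypothetical type, not LLVM API: a flat work-group size range [Min, Max].
using SizeRange = std::pair<unsigned, unsigned>;

// Parse an attribute value of the assumed form "min,max", e.g. "1,256".
// Returns std::nullopt if the value is malformed or Min > Max.
std::optional<SizeRange> parseRange(const std::string &Value) {
  const auto Comma = Value.find(',');
  if (Comma == std::string::npos)
    return std::nullopt;
  const std::string MinStr = Value.substr(0, Comma);
  const std::string MaxStr = Value.substr(Comma + 1);
  try {
    std::size_t MinEnd = 0, MaxEnd = 0;
    const unsigned long Min = std::stoul(MinStr, &MinEnd);
    const unsigned long Max = std::stoul(MaxStr, &MaxEnd);
    // Reject trailing junk and inverted ranges.
    if (MinEnd != MinStr.size() || MaxEnd != MaxStr.size() || Min > Max)
      return std::nullopt;
    return SizeRange{static_cast<unsigned>(Min), static_cast<unsigned>(Max)};
  } catch (const std::exception &) {
    // std::stoul throws on empty or non-numeric input.
    return std::nullopt;
  }
}

// Assumed meaning of "conservative": the smallest range that contains
// both caller ranges, so the callee stays valid for either caller.
SizeRange mergeRanges(const SizeRange &A, const SizeRange &B) {
  return {std::min(A.first, B.first), std::max(A.second, B.second)};
}
```

Under these assumptions, merging callers with ranges "64,128" and "128,1024" gives a single callee range of 64 to 1024. Cloning instead keeps one copy of the callee per distinct caller range.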

Event Timeline

arsenm created this revision. Oct 19 2020, 2:04 PM
Herald added a project: Restricted Project. Oct 19 2020, 2:04 PM
arsenm requested review of this revision. Oct 19 2020, 2:04 PM
rampitec accepted this revision. Oct 19 2020, 2:08 PM

LGTM.

Cloning might be a good thing anyway. One call stack will use the conservative, slower version and another will use a faster version.

This revision is now accepted and ready to land. Oct 19 2020, 2:08 PM