Fixes being overly conservative with the register counts in called
functions. This should try to do a conservative range merge, but for
now just clone.
Also fix not being able to functionally run the pass standalone.
Paths
| Differential D89737
AMDGPU: Propagate amdgpu-flat-work-group-size attributes ClosedPublic Authored by arsenm on Oct 19 2020, 2:04 PM.
Details
Diff Detail Event TimelineComment Actions LGTM. Cloning might be a good thing anyway. One call stack will use conservative slow version and some other a faster version. This revision is now accepted and ready to land.Oct 19 2020, 2:08 PM
Revision Contents
Diff 299150 llvm/lib/Target/AMDGPU/AMDGPUPropagateAttributes.cpp
llvm/test/CodeGen/AMDGPU/propagate-attributes-flat-work-group-size.ll
|