This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Correct maximum possible private allocation size
ClosedPublic

Authored by arsenm on May 23 2019, 6:23 AM.

Details

Summary

We were assuming a much larger possible per-wave visible stack
allocation than is possible:

https://github.com/RadeonOpenCompute/ROCR-Runtime/blob/master/src/core/runtime/amd_gpu_agent.cpp#L70

Based on this, we can assume the high 15 bits of a frame index or sret
are 0. The frame index value is the per-lane offset, so the maximum
frame index value is MAX_WAVE_SCRATCH / wavesize.

Remove the corresponding subtarget feature and option that made
this configurable.

Diff Detail

Event Timeline

arsenm created this revision.May 23 2019, 6:23 AM
This revision is now accepted and ready to land.May 23 2019, 11:09 AM
arsenm closed this revision.May 23 2019, 4:54 PM

r361541