This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Enable shouldConsiderGEPOffsetSplit
Needs ReviewPublic

Authored by arsenm on Oct 18 2019, 6:17 PM.

Details

Reviewers
rampitec
vpykhtin
Summary

The results in the new test (copied from the AArch64 test for the
hook) are slightly worse, due to now using a scalar phi for the loop
bounds add which requires copying to VGPRs for the pointer operand.

schedule-regpressure-limit2 is somewhat degenerate looking, but this
gives a much better result (the function ends up about half the size
as before). However it is no longer matching the tested register
limits, and uses a lot more VGPRs in the minreg case.

Diff Detail

Event Timeline

arsenm created this revision.Oct 18 2019, 6:17 PM

The RP blow up doesn't look good. This is probably our biggest problem, code size does not compensate for it. Given the purpose of the test to be able to fit into a minimal register footprint, I think it will lead to quite bad regressions. We need something smarter than that.