This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Re-organize the outer loop of SILoadStoreOptimizer
Closed, Public

Authored by nhaehnle on Nov 22 2017, 4:24 AM.

Details

Summary

The entire algorithm operates per basic block, so re-optimizing a basic
block immediately, rather than in a separate loop, should be better for
cache locality.

I don't have performance measurements.
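
The two loop shapes contrasted in the summary can be sketched roughly as follows. This is a toy model and not the code from the patch: `Block`, `optimizeBlock`, `runSeparateLoops`, and `runImmediate` are hypothetical names, and a "basic block" is reduced to a list of access widths that a pass may merge pairwise. It only illustrates why both shapes compute the same result while the second one finishes with a block before moving on.

```cpp
#include <cstddef>
#include <vector>

// Toy stand-in for a basic block: each entry is the width of one memory access.
using Block = std::vector<int>;

// Hypothetical single pass: merge adjacent accesses of equal width (up to 4).
// Sets `again` when a merge result could itself be merged by another pass.
bool optimizeBlock(Block &block, bool &again) {
  bool modified = false;
  Block out;
  for (std::size_t i = 0; i < block.size(); ++i) {
    if (i + 1 < block.size() && block[i] == block[i + 1] && block[i] < 4) {
      out.push_back(block[i] * 2);
      again = again || block[i] * 2 < 4;
      modified = true;
      ++i; // skip the partner that was just merged
    } else {
      out.push_back(block[i]);
    }
  }
  block = out;
  return modified;
}

// Shape 1: sweep all blocks once, then revisit flagged blocks in a separate loop.
bool runSeparateLoops(std::vector<Block> &fn) {
  bool modified = false;
  std::vector<Block *> revisit;
  for (Block &b : fn) {
    bool again = false;
    modified |= optimizeBlock(b, again);
    if (again)
      revisit.push_back(&b);
  }
  for (Block *b : revisit) {
    bool again;
    do {
      again = false;
      modified |= optimizeBlock(*b, again);
    } while (again);
  }
  return modified;
}

// Shape 2: re-optimize each block immediately, while it is still being worked on.
bool runImmediate(std::vector<Block> &fn) {
  bool modified = false;
  for (Block &b : fn) {
    bool again;
    do {
      again = false;
      modified |= optimizeBlock(b, again);
    } while (again);
  }
  return modified;
}
```

In this sketch both drivers reach the same final state. The difference is only the order of visits: the second shape touches each block's data in one stretch instead of returning to it after every other block has been processed, which is the locality argument made above.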

Change-Id: I85106570bd623c4ff277faaa50ee43258e1ddcc5

Event Timeline

nhaehnle created this revision. Nov 22 2017, 4:24 AM
This revision is now accepted and ready to land. Nov 22 2017, 11:17 AM
This revision was automatically updated to reflect the committed changes.