This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU][LoopUnroll] Increase BB size to analyze for complete unroll.
ClosedPublic

Authored by dfukalov on Aug 19 2020, 2:15 PM.

Details

Summary

The UnrollMaxBlockToAnalyze parameter is used at the stage when we do not yet
have any information about the cost of a loop body basic block (BB). In some
cases, e.g. for a simple loop such as

for (int i = 0; i < 32; ++i) {
  D = Arr2[i * 8 + C1];
  Arr1[i * 64 + C2] += C3 * D;
  Arr1[i * 64 + C2 + 2048] += C4 * D;
}

the current default parameter value is too small to let the deeper cost
analysis run, so the loop is not completely unrolled.
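
For readers who want to experiment with the loop from the summary, here is a minimal, self-contained C++17 sketch of it next to a source-level "completely unrolled" equivalent. It only illustrates what complete unrolling means for this loop shape; it is not the LLVM pass, and it says nothing about how UnrollMaxBlockToAnalyze is implemented. The element type, the values chosen for C1..C4, the array sizes, and all function names are assumptions made for illustration; the review does not specify them.

```cpp
#include <array>
#include <cstddef>
#include <utility>

// Hypothetical stand-ins for C1..C4 from the summary (values are assumptions).
constexpr std::size_t C1 = 1, C2 = 3;
constexpr float C3 = 2.0f, C4 = 0.5f;

constexpr std::size_t TripCount = 32;
// Smallest sizes that keep every access of the loop in bounds.
constexpr std::size_t Arr2Size = (TripCount - 1) * 8 + C1 + 1;
constexpr std::size_t Arr1Size = (TripCount - 1) * 64 + C2 + 2048 + 1;

// With 32 iterations and a stride of 64, the first store stays below the
// +2048 offset of the second store, so the two stores never touch the
// same element.
static_assert((TripCount - 1) * 64 < 2048, "store ranges would overlap");

using Arr1T = std::array<float, Arr1Size>;
using Arr2T = std::array<float, Arr2Size>;

// One iteration of the loop body from the summary.
inline void body(Arr1T &Arr1, const Arr2T &Arr2, std::size_t i) {
  const float D = Arr2[i * 8 + C1];
  Arr1[i * 64 + C2] += C3 * D;
  Arr1[i * 64 + C2 + 2048] += C4 * D;
}

// The loop as written in the summary.
void updateRolled(Arr1T &Arr1, const Arr2T &Arr2) {
  for (std::size_t i = 0; i < TripCount; ++i)
    body(Arr1, Arr2, i);
}

// Source-level complete unroll: the fold expression expands to one call of
// body() per compile-time index 0..31, with no loop left.
template <std::size_t... I>
void updateUnrolledImpl(Arr1T &Arr1, const Arr2T &Arr2,
                        std::index_sequence<I...>) {
  (body(Arr1, Arr2, I), ...);
}

void updateUnrolled(Arr1T &Arr1, const Arr2T &Arr2) {
  updateUnrolledImpl(Arr1, Arr2, std::make_index_sequence<TripCount>{});
}

// Runs both versions on the same input and compares the results. The input
// values are small integers so every product and sum is exact in float.
bool unrolledMatchesRolled() {
  Arr2T Arr2{};
  for (std::size_t k = 0; k < Arr2.size(); ++k)
    Arr2[k] = static_cast<float>(k % 16);
  Arr1T Rolled{}, Unrolled{};
  updateRolled(Rolled, Arr2);
  updateUnrolled(Unrolled, Arr2);
  return Rolled == Unrolled;
}
```

Because the trip count is a compile-time constant and every index is an affine function of i, each address in the unrolled form is a known constant offset, which is the kind of simplification a deeper unroll cost analysis can take into account.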

Event Timeline

dfukalov created this revision. Aug 19 2020, 2:15 PM
Herald added a project: Restricted Project. Aug 19 2020, 2:15 PM
dfukalov requested review of this revision. Aug 19 2020, 2:15 PM
rampitec accepted this revision. Aug 19 2020, 3:13 PM
This revision is now accepted and ready to land. Aug 19 2020, 3:13 PM
This revision was landed with ongoing or failed builds. Aug 20 2020, 12:42 AM
This revision was automatically updated to reflect the committed changes.