This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Avoid second rescheduling for some regions
ClosedPublic

Authored by rampitec on Feb 25 2021, 3:52 PM.

Details

Summary

If a region was not constrained by high register pressure
and was not rescheduled without clustering, we can skip
rescheduling it in the ClusteredLowOccupancyReschedule stage.

This improves scheduling speed by 25% on some kernels.
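
The skip condition described above can be sketched as a small predicate. This is only an illustration under stated assumptions: the struct, its field names, and both functions are hypothetical and are not taken from the patch or from the AMDGPU scheduler sources.

```cpp
#include <cassert>
#include <vector>

// Hypothetical per-region bookkeeping; these names are illustrative
// and do not come from the actual patch.
struct RegionInfo {
  bool ConstrainedByHighRP;          // scheduling was limited by high register pressure
  bool RescheduledWithoutClustering; // an earlier stage rescheduled it unclustered
};

// A region can skip the second (clustered, low-occupancy) reschedule
// only if neither condition holds.
bool canSkipClusteredReschedule(const RegionInfo &R) {
  return !R.ConstrainedByHighRP && !R.RescheduledWithoutClustering;
}

// Count the regions that would still go through the second pass.
unsigned countRegionsToReschedule(const std::vector<RegionInfo> &Regions) {
  unsigned Count = 0;
  for (const RegionInfo &R : Regions)
    if (!canSkipClusteredReschedule(R))
      ++Count;
  return Count;
}

// Small usage example: only the untouched, unconstrained region is skipped.
void exampleUsage() {
  std::vector<RegionInfo> Regions = {
      {false, false}, // skipped
      {true, false},  // rescheduled: constrained by register pressure
      {false, true},  // rescheduled: was rescheduled without clustering
  };
  assert(countRegionsToReschedule(Regions) == 2);
}
```

The saving comes from the first kind of region: the more regions that are neither pressure-constrained nor previously rescheduled, the fewer second passes the scheduler has to run.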

Event Timeline

rampitec created this revision. Feb 25 2021, 3:52 PM
rampitec requested review of this revision. Feb 25 2021, 3:52 PM
Herald added a project: Restricted Project. Feb 25 2021, 3:52 PM
Herald added a subscriber: wdng.

Looks good too, but as with the previous patch I think setting HasClusteredNodes to true to avoid discovering its value is a bit misleading.

vpykhtin accepted this revision. Feb 26 2021, 12:08 PM

Following the discussion at https://reviews.llvm.org/D97342.

This revision is now accepted and ready to land. Feb 26 2021, 12:08 PM
This revision was landed with ongoing or failed builds. Feb 26 2021, 12:55 PM
This revision was automatically updated to reflect the committed changes.