This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Refactor getWavesPerEU to separate flat workgroup size query
Closed · Public

Authored by arsenm on Sep 17 2021, 12:58 PM.

Details

Summary

Add an overload to pass the flat workgroup range in separately. This
will allow the attributor to use the assumed value for
amdgpu-flat-workgroup-sizes when inferring amdgpu-waves-per-eu.
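
As a rough illustration of the shape of this refactor, here is a minimal standalone sketch. It is not the code from this revision: `ToyFunction`, `ToySubtarget`, the hardware constants, and the occupancy arithmetic are all hypothetical stand-ins chosen for the example. Only the idea comes from the summary above: the existing query delegates to a new overload that takes the flat workgroup range as a parameter, so a caller such as the attributor could pass an assumed range instead of the one stored on the function.

```cpp
#include <algorithm>
#include <utility>

// {min, max} pair, used for both work-item counts and waves per EU.
using Range = std::pair<unsigned, unsigned>;

// Hypothetical stand-in for a function carrying a flat workgroup size range.
struct ToyFunction {
  Range FlatWorkGroupSizes;
};

// Hypothetical stand-in for a subtarget; the constants are invented.
struct ToySubtarget {
  unsigned WavefrontSize = 64;
  unsigned EUsPerCU = 4;
  unsigned MaxWavesPerEU = 10;

  // Reads the range recorded on the function itself.
  Range getFlatWorkGroupSizes(const ToyFunction &F) const {
    return F.FlatWorkGroupSizes;
  }

  // New-style overload: the caller supplies the flat workgroup range.
  // The arithmetic is a toy model, not the real occupancy computation.
  Range getWavesPerEU(const ToyFunction &F, Range FlatWorkGroupSizes) const {
    (void)F; // A real implementation would also consult F's attributes.
    unsigned WavesPerGroup =
        (FlatWorkGroupSizes.second + WavefrontSize - 1) / WavefrontSize;
    unsigned MinWaves = (WavesPerGroup + EUsPerCU - 1) / EUsPerCU;
    return {std::min(MinWaves, MaxWavesPerEU), MaxWavesPerEU};
  }

  // Original-style entry point: looks up the range, then delegates.
  Range getWavesPerEU(const ToyFunction &F) const {
    return getWavesPerEU(F, getFlatWorkGroupSizes(F));
  }
};
```

In this sketch, a caller holding a tighter assumed range (say `{1, 256}`) can call the two-argument overload directly, while existing callers keep using the one-argument form unchanged.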

Diff Detail

Unit Tests: Failed

Event Timeline

arsenm created this revision. Sep 17 2021, 12:58 PM
arsenm requested review of this revision. Sep 17 2021, 12:58 PM
Herald added a reviewer: sstefan1.
Herald added a reviewer: baziotis.
Herald added a project: Restricted Project.
Herald added a subscriber: wdng.
jdoerfert accepted this revision. Sep 17 2021, 2:01 PM

LG. One suggestion for the documentation, though it only makes sense because you use it from the Attributor.

llvm/lib/Target/AMDGPU/AMDGPUSubtarget.h:102
This revision is now accepted and ready to land. Sep 17 2021, 2:01 PM