Add an overload to pass the flat workgroup range in separately. This
will allow the attributor to use the assumed value for
amdgpu-flat-workgroup-sizes when inferring amdgpu-waves-per-eu.
Details
Details
Diff Detail
Diff Detail
Unit Tests
Unit Tests
Time | Test | |
---|---|---|
840 ms | x64 debian > libomp.lock::omp_init_lock.c |
Event Timeline
Comment Actions
LG, one suggestion for the documentation which only makes sense as you use it from the Attributor though.
llvm/lib/Target/AMDGPU/AMDGPUSubtarget.h | ||
---|---|---|
102 |