This is a performance hint that can be applied to kernels
to attempt to limit the number of used registers. This adds two attributes,
amdgpu_num_vgpr and amdgpu_num_sgpr. Alternatively, there could only be one
with two arguments. That version would also be usable for Evergreen, which could
ignore the second argument as a request for the number of SGPRs.
Details
Details
- Reviewers
arsenm
Diff Detail
Diff Detail