MFMA instructions shall not be scheduled back to back
to avoid MAI SIMD stall. Tell post-RA schedule we would
prefer some other instruction instead.
Details
Details
- Reviewers
kerbowa - Commits
- rG13b63be47223: [AMDGPU] prefer non-mfma in post-RA schedule
Diff Detail
Diff Detail
Event Timeline
Comment Actions
Minor nit, but test should disable-power-sched otherwise I think MFMA will be scheduled apart anyway.
Comment Actions
Actually MFMA would need more height to ensure post-RA tries to schedule them first and together. You could have their results be implicit uses on later instruction like S_ENDPGM or something.
Comment Actions
W/o this patch they are scheduled back to back:
S_ADD_U32
S_ADDC_U32
V_MFMA_F32_32X32X1F32
V_MFMA_F32_32X32X1F32
S_CMP_LG_U32