This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Use v_fma_f16 on GFX10
Closed, Public

Authored by foad on Dec 14 2021, 7:52 AM.

Details

Summary

Teach convertToThreeAddress to use the V_FMA_F16_gfx9 pseudo (i.e. the
standard instruction in GFX9 onwards) instead of V_FMA_F16 (the legacy
pseudo for GFX8 compatibility, which is no longer supported in GFX10).
This follows the example of macToMad in SIFoldOperands.
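
A minimal standalone sketch of the opcode choice described above, not the code from this patch. The enums and the function selectFmaF16Pseudo are hypothetical stand-ins for the real AMDGPU opcode and subtarget types, and the GFX10 threshold is an assumption taken from the revision title; the actual condition used in convertToThreeAddress is not shown on this page.

```cpp
// Hypothetical stand-ins for the two pseudos named in the summary.
enum class Opcode {
  V_FMA_F16,      // legacy pseudo kept for GFX8 compatibility
  V_FMA_F16_gfx9  // standard instruction from GFX9 onwards
};

// Hypothetical subtarget generations, ordered oldest to newest.
enum class Generation { GFX8, GFX9, GFX10 };

// Pick the three-address f16 FMA pseudo for a given generation.
// Assumption: the legacy pseudo is only avoided where it is unsupported,
// i.e. on GFX10 and later.
constexpr Opcode selectFmaF16Pseudo(Generation Gen) {
  return Gen >= Generation::GFX10 ? Opcode::V_FMA_F16_gfx9
                                  : Opcode::V_FMA_F16;
}
```

With these definitions, selectFmaF16Pseudo(Generation::GFX10) yields Opcode::V_FMA_F16_gfx9, while Generation::GFX8 still yields the legacy Opcode::V_FMA_F16.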

Event Timeline

foad created this revision. Dec 14 2021, 7:52 AM
foad requested review of this revision. Dec 14 2021, 7:52 AM
Herald added a project: Restricted Project. Dec 14 2021, 7:52 AM
foad added a comment. Dec 14 2021, 7:54 AM

We should probably rename the pseudos:

  • V_FMA_F16_gfx9 -> V_FMA_F16
  • V_FMA_F16 -> V_FMA_F16_vi

and the same for the other 16-bit ones that were renamed in GFX9.

arsenm accepted this revision. Dec 14 2021, 10:34 AM
This revision is now accepted and ready to land. Dec 14 2021, 10:34 AM
This revision was landed with ongoing or failed builds. Dec 15 2021, 5:15 AM
This revision was automatically updated to reflect the committed changes.