This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] gfx940: support V_FMAMK_F32 and V_FMAAK_F32
ClosedPublic

Authored by rampitec on Mar 1 2022, 1:42 PM.

Diff Detail

Event Timeline

rampitec created this revision.Mar 1 2022, 1:42 PM
Herald added a project: Restricted Project. · View Herald TranscriptMar 1 2022, 1:42 PM
rampitec requested review of this revision.Mar 1 2022, 1:42 PM
Herald added a project: Restricted Project. · View Herald TranscriptMar 1 2022, 1:42 PM
Herald added a subscriber: wdng. · View Herald Transcript
foad added inline comments.Mar 2 2022, 12:57 AM
llvm/lib/Target/AMDGPU/VOP2Instructions.td
1747

Isn't AssemblerPredicate redundant if you've already set SubtargetPredicate? I thought they all got concatenated into one list of predicates.

foad added inline comments.Mar 2 2022, 12:58 AM
llvm/test/CodeGen/AMDGPU/madak.ll
162

Isn't this supposed to use fmaak now?

rampitec marked an inline comment as done.Mar 2 2022, 10:25 AM
rampitec added inline comments.
llvm/test/CodeGen/AMDGPU/madak.ll
162

It exceeds constant bus limit. This is OK for gfx10 where the limit is 2.

rampitec updated this revision to Diff 412484.Mar 2 2022, 10:42 AM
rampitec marked 2 inline comments as done.

Removed redundant AssemblerPredicate.

kzhuravl accepted this revision.Mar 7 2022, 11:09 AM

LGTM, thanks

This revision is now accepted and ready to land.Mar 7 2022, 11:09 AM
This revision was landed with ongoing or failed builds.Mar 7 2022, 11:31 AM
This revision was automatically updated to reflect the committed changes.