This is an archive of the discontinued LLVM Phabricator instance.

[NFC][AMDGPU] Improve fused fmul+fadd tests.
ClosedPublic

Authored by dfukalov on Jul 29 2020, 6:13 PM.

Diff Detail

Event Timeline

dfukalov created this revision.Jul 29 2020, 6:13 PM
Herald added a project: Restricted Project. · View Herald TranscriptJul 29 2020, 6:13 PM
Herald added subscribers: kerbowa, t-tye, tpr and 5 others. · View Herald Transcript
dfukalov requested review of this revision.Jul 29 2020, 6:13 PM
arsenm added inline comments.Jul 29 2020, 6:14 PM
llvm/test/CodeGen/AMDGPU/fmuladd.f16.ll
10

This probably should also gain gfx1030 lines

dfukalov added inline comments.Jul 30 2020, 3:53 AM
llvm/test/CodeGen/AMDGPU/fmuladd.f16.ll
10

My understanding was that tests RUN lines are actually represent a different targets in terms of features set. I guess, gfx1030 has no difference with gfx1010 in fused f16 operations capabilities/features. So gfx1030 will test the same code as gfx1010?

Otherwise, if the tests are to cover all modes/features, we'll have to add extra RUN lines for each gfx90x, gfx10xx and, probably, older targets.

arsenm added inline comments.Jul 30 2020, 8:20 AM
llvm/test/CodeGen/AMDGPU/fmuladd.f16.ll
10

No, the v_mad/v_mac instructions were removed in gfx1030 (but gfx1010 does have them), so the cases with f16 denormals disabled will differ

rampitec added inline comments.Jul 30 2020, 9:55 AM
llvm/test/CodeGen/AMDGPU/fmuladd.f16.ll
10

That is only f32 mad/mac were removed.

This revision is now accepted and ready to land.Jul 30 2020, 9:57 AM
This revision was landed with ongoing or failed builds.Jul 30 2020, 6:00 PM
This revision was automatically updated to reflect the committed changes.