This is an archive of the discontinued LLVM Phabricator instance.

[mlir][Transform] Add support for mma.sync m16n8k16 f16 rewrite.
ClosedPublic

Authored by nicolasvasilache on Jun 21 2023, 7:49 AM.

Details

Summary

This PR adds support for the m16n8k16 f16 case.
At this point, the support is mostly mechanical and could be Tablegen'd to all cases.
Until then, this can be populated as needed on a case-by-case basis.

Depends on: D153420

Diff Detail

Event Timeline

Herald added a project: Restricted Project. · View Herald TranscriptJun 21 2023, 7:49 AM
nicolasvasilache requested review of this revision.Jun 21 2023, 7:49 AM
Herald added a project: Restricted Project. · View Herald Transcript
nicolasvasilache edited the summary of this revision. (Show Details)

Update

springerm accepted this revision.Jun 21 2023, 8:06 AM
springerm added inline comments.
mlir/test/Integration/GPU/CUDA/TensorCore/transform-mma-sync-matmul-f16-f16-accum.mlir
4–14

why are these all separate mlir-opt calls?

This revision is now accepted and ready to land.Jun 21 2023, 8:06 AM
This revision was landed with ongoing or failed builds.Jun 26 2023, 9:46 AM
This revision was automatically updated to reflect the committed changes.