This PR adds support for the m16n8k16 f16 case.
At this point, the support is mostly mechanical and could be Tablegen'd to all cases.
Until then, this can be populated as needed on a case-by-case basis.
Depends on: D153420
Paths
| Differential D153428
[mlir][Transform] Add support for mma.sync m16n8k16 f16 rewrite. ClosedPublic Authored by nicolasvasilache on Jun 21 2023, 7:49 AM.
Details Summary This PR adds support for the m16n8k16 f16 case. Depends on: D153420
Diff Detail
Event TimelineThis revision is now accepted and ready to land.Jun 21 2023, 8:06 AM This revision was landed with ongoing or failed builds.Jun 26 2023, 9:46 AM Closed by commit rG1660f2174d59: [mlir][Transform] Add support for mma.sync m16n8k16 f16 rewrite. (authored by nicolasvasilache). · Explain Why This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 533259 mlir/lib/Dialect/NVGPU/TransformOps/NVGPUTransformOps.cpp
mlir/test/Dialect/NVGPU/transform-matmul-to-nvvm.mlir
mlir/test/Integration/GPU/CUDA/TensorCore/transform-mma-sync-matmul-f16-f16-accum.mlir
|
why are these all separate mlir-opt calls?