- Adds verification for nvgpu.mma.sync op
- Adds tests to mlir/test/Dialect/NVGPU/invalid.mlir
- nvgpu.mma.sync verifier caught a bug and triggered a failure in m16n8k4_tf32_f32 variant in mlir/test/Conversion/NVGPUToNVVM/nvgpu-to-nvvm.mlir
- The output shape of vector holding thread-level accumulators was inconsistent and fixed in this change
Details
Diff Detail
Event Timeline
mlir/lib/Dialect/NVGPU/IR/NVGPUDialect.cpp | ||
---|---|---|
115 | Please spell out those types: | |
120 | same here spell those out | |
129 | should be aType == bType | |
148 | I don't see i4 nor f64 in the condition |
mlir/lib/Dialect/NVGPU/IR/NVGPUDialect.cpp | ||
---|---|---|
157 | skip braces for single line if: |
Please spell out those types:
https://llvm.org/docs/CodingStandards.html#use-auto-type-deduction-to-make-code-more-readable