Wave Matrix Multiply Accumulate (WMMA) is an instruction that accelerates
matrix multiplication on RDNA3 architectures. LLVM already provides a
set of intrinsics that generate WMMA instructions. This change uses those
intrinsics to enable the feature in MLIR.
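For readers new to the operation, here is a minimal scalar sketch of what a matrix multiply-accumulate computes, D = A * B + C, on a single tile. The 16x16 tile size, the `float` element type, and the names `kTile`, `Tile`, and `matmulAccumulate` are assumptions chosen for illustration; they are not taken from this change or from the intrinsic definitions.

```cpp
#include <array>
#include <cstddef>

// Assumed tile edge, for illustration only.
constexpr std::size_t kTile = 16;

using Tile = std::array<std::array<float, kTile>, kTile>;

// Scalar reference for D = A * B + C on one tile.
// A hardware WMMA instruction performs the same arithmetic
// cooperatively across the lanes of a wave.
Tile matmulAccumulate(const Tile &a, const Tile &b, const Tile &c) {
  Tile d = c; // start from the accumulator
  for (std::size_t i = 0; i < kTile; ++i)
    for (std::size_t j = 0; j < kTile; ++j)
      for (std::size_t k = 0; k < kTile; ++k)
        d[i][j] += a[i][k] * b[k][j];
  return d;
}
```

The sketch only pins down the arithmetic; how operands are distributed across lanes is left to the intrinsics.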
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Great to see this is coming in.
mlir/lib/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.cpp | ||
---|---|---|
368 | I expect the flag is less likely to be misused than the input type is to be set incorrectly at some point along the lowering path. | |
544 | Does it check sourceA.type == sourceB.type? |
@jungpark-mlir Ping.
mlir/lib/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.cpp | ||
---|---|---|
544 | Yeah, we have an AllTypesMatch<["sourceA", "sourceB"]> on the op definition. |
I expect the flag is less likely to be misused than the input type is to be set incorrectly at some point along the lowering path.
I'd feel slightly better having unsignedA/B as optional and ignoring the elemType when unsignedA/B is given, but it's no big deal.
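The suggestion above can be sketched as a small precedence rule: an explicitly supplied flag wins, and the element type is consulted only when the flag is absent. This is a hypothetical illustration of the reviewer's idea, not code from this change; `ElemKind` and `treatAsUnsigned` are made-up names, and the real lowering works on MLIR types and attributes rather than an enum.

```cpp
#include <optional>

// Hypothetical stand-in for the operand element type.
enum class ElemKind { SignedInt, UnsignedInt, Float };

// Hypothetical helper: if the optional unsigned flag is given,
// use it and ignore the element type; otherwise derive the
// signedness from the element type.
bool treatAsUnsigned(std::optional<bool> unsignedFlag, ElemKind elemKind) {
  if (unsignedFlag.has_value())
    return *unsignedFlag;
  return elemKind == ElemKind::UnsignedInt;
}
```

With this rule, a type that was set incorrectly earlier in the lowering path cannot override a flag the user set explicitly.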