Add a test case to test the complete execution of WMMA ops on a Nvidia GPU with tensor cores.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Not that I'm aware of. I wouldn't hold back this revision, but let me know if you want me to move everything afterwards.
mlir/test/mlir-cuda-runner/wmma-matmul.mlir | ||
---|---|---|
8–10 ↗ | (On Diff #318909) | All alignment attributes have a typo. |
Changes in this diff :-
1.) Modify the test case to use the !gpu.mmafragment type introduced in revision D95330.
Changes in this diff :-
1.) Change type of ldm attribute in load/store fragment from i32 to index.
Changes in this diff :-
1.) Make changes to operate with the newly intoduced gpu.mma_matrix type.
I just realized that getting these test cases in will mean check-mlir will fail for all those without tensor cores on GPUs (if they are configuring with NVPTX)! Can we add an -DMLIR_ENABLE_CUDA_TENSOR_CORES and have these tests run under that? @ftynse @ThomasRaoux
Correct, for example we have some systems on our CI running CUDA execution tests that don't have tensor cores.
mlir/test/Integration/GPU/CUDA/wmma-matmul-f16.mlir | ||
---|---|---|
1–7 ↗ | (On Diff #347047) | Please configure to run these tests only under a -DMLIR_ENABLE_CUDA_TENSOR_CORES. |
Changes in this diff:-
1.) Add flag to enable/disable Tensor core WMMA tests. 2.) Rebase on upstream/main.
Nit: Tensor core -> CUDA tensor core