This diff adds the SDDMM cuSPARSE op, its roundtrip test, and its lowering to the wrapper library.
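For readers unfamiliar with the operation, the following is a minimal CPU reference sketch of what SDDMM (sampled dense-dense matrix multiplication) computes, following the general form cuSPARSE documents for it: C = alpha * (A * B) sampled at the sparsity pattern of C, plus beta * C. The `Csr` struct and the `sddmmReference` function are hypothetical names introduced here for illustration only; they are not part of this diff, of MLIR, or of cuSPARSE, and the sketch assumes non-transposed, row-major dense inputs.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical CSR container for illustration; not a type from MLIR or cuSPARSE.
struct Csr {
  std::size_t rows, cols;
  std::vector<std::size_t> rowPtr; // size rows + 1
  std::vector<std::size_t> colIdx; // size nnz
  std::vector<double> vals;        // size nnz
};

// Reference SDDMM: for every stored entry (i, j) of C,
//   C(i, j) = alpha * dot(A(i, :), B(:, j)) + beta * C(i, j).
// A is rows x k and B is k x cols, both dense and row-major.
// Entries outside the sparsity pattern of C are never computed.
void sddmmReference(double alpha, const std::vector<double> &a,
                    const std::vector<double> &b, std::size_t k, double beta,
                    Csr &c) {
  for (std::size_t i = 0; i < c.rows; ++i) {
    for (std::size_t p = c.rowPtr[i]; p < c.rowPtr[i + 1]; ++p) {
      const std::size_t j = c.colIdx[p];
      double dot = 0.0;
      for (std::size_t t = 0; t < k; ++t)
        dot += a[i * k + t] * b[t * c.cols + j];
      c.vals[p] = alpha * dot + beta * c.vals[p];
    }
  }
}
```

The output keeps the sparsity pattern of C unchanged, which is why the op takes two dense operands and one sparse operand.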
Repository: rG LLVM Github Monorepo
Event Timeline
I would split this up: add the new cuSPARSE op + roundtrip + lowering to the wrapper lib in this revision,
and then add the actual recognition during sparsification + test + integration test in one or two follow-up revisions.
mlir/lib/Dialect/SparseTensor/Transforms/SparseGPUCodegen.cpp | ||
---|---|---|
694 | Note that I made a small refinement in https://reviews.llvm.org/D151404 that seems safer: keep everything on the same stream (so no blocking wait here) and move the copy to after the copy-in-only data and the buffer are released |
mlir/lib/Dialect/SparseTensor/Transforms/SparseGPUCodegen.cpp | ||
---|---|---|
694 | Noted. I will update this and mark it done once I incorporate the rebased pull into this diff. |
mlir/lib/Dialect/SparseTensor/Transforms/SparseGPUCodegen.cpp | ||
---|---|---|
698 | Sparse output will be more complex than this, btw |
mlir/include/mlir/Dialect/GPU/IR/GPUOps.td | ||
---|---|---|
2135 | bad indentation here |
mlir/include/mlir/Dialect/GPU/IR/GPUOps.td | ||
---|---|---|
2135 | Good catch. Thank you! |
matrix -> matrices (since we now have dense A and B)