Example of using 2:4 sparsity on NVidia GPU
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/sparse-mma-2-4-f16.mlir | ||
---|---|---|
2 | I'm not sure exactly how lit parses these lines, so I'm sure it'd mess up the double-quotes, but... It'd be a lot easier to read this if it were broken up across a few lines, like how lines 3~6 are | |
59 | Is there any particular reason for using affine.apply instead of arith.floordiv? (ditto for everywhere below) |
mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/sparse-mma-2-4-f16.mlir | ||
---|---|---|
2 | Yeah, you hit the nail on the head that I was very afraid breaking up the string value. | |
59 | My understanding is that both yield the same code eventually, and that affine.apply's are more idiomatic for the GPU thread expressions (see e.g. affine.delinearize_index below). So following one style seems the preferred way, but I am easily convinced otherwise ;-) |
The bot is broken (and has been for >2 weeks now!): https://lab.llvm.org/buildbot/#/builders/61/builds/42062 ; please make sure you don't filter out the notifications.
Note: this seems to have broken the build bots https://lab.llvm.org/buildbot/#/builders/61/builds/41349 and follow ups added more sparse failures https://lab.llvm.org/buildbot/#/builders/61/builds/42062
I don't think I got any bot notification about this?
This test requires A100 GPU's to be present.
Is there any way to extend
if not config.enable_cuda_runner: config.unsupported = True
to have that restriction? In bazel, this would be
"requires-gpu-sm80",
I'm not sure exactly how lit parses these lines, so I'm sure it'd mess up the double-quotes, but... It'd be a lot easier to read this if it were broken up across a few lines, like how lines 3~6 are