When targeting NVIDIA GPUs, being able to see the generated PTX is important. Currently, we don't have a simple way to do that.
This patch adds a dump-ptx option to the gpu-to-cubin pass. One can use it like gpu-to-cubin{chip=sm_90 features=+ptx80 dump-ptx}.
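As a rough illustration of how the option string from the summary is put together, here is a minimal Python sketch. The helper names gpu_to_cubin_options and nest_in_gpu_module are hypothetical and not part of MLIR. The nesting under builtin.module(gpu.module(...)) is an assumption about where the pass is anchored, so check it against the test file in this revision before relying on it.

```python
def gpu_to_cubin_options(chip, features, dump_ptx=False):
    """Format gpu-to-cubin and its options as a textual pipeline element.

    Hypothetical helper: it only builds a string and does not call MLIR.
    """
    opts = [f"chip={chip}", f"features={features}"]
    if dump_ptx:
        # Written as a bare flag, exactly as in the revision summary.
        opts.append("dump-ptx")
    return "gpu-to-cubin{" + " ".join(opts) + "}"


def nest_in_gpu_module(pass_text):
    """Wrap a pass in a nested pipeline.

    Assumption: the pass runs on gpu.module ops inside a builtin.module.
    """
    return f"builtin.module(gpu.module({pass_text}))"


spec = gpu_to_cubin_options("sm_90", "+ptx80", dump_ptx=True)
print(spec)  # gpu-to-cubin{chip=sm_90 features=+ptx80 dump-ptx}
print(nest_in_gpu_module(spec))
```

The resulting string would typically be handed to mlir-opt through its -pass-pipeline option. Leaving dump_ptx at its default yields the same pipeline without the PTX dump.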
Differential D155166
[mlir][gpu] Add dump-ptx option
Closed. Public. Authored by guraypp on Jul 13 2023, 2:47 AM.
Event Timeline
This revision is now accepted and ready to land. (Jul 13 2023, 9:39 AM)
Closed by commit rG22a32f7d9cb1: [mlir][gpu] Add dump-ptx option (authored by guraypp). (Jul 13 2023, 12:15 PM)
This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 539977
mlir/include/mlir/Dialect/GPU/Transforms/Passes.h
mlir/lib/Dialect/GPU/Transforms/SerializeToCubin.cpp
mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/dump-ptx.mlir