This is an archive of the discontinued LLVM Phabricator instance.

[mlir][gpu] Add dump-ptx option
ClosedPublic

Authored by guraypp on Jul 13 2023, 2:47 AM.

Details

Summary

When targeting NVIDIA GPUs, seeing the generated PTX is important. Currently, we don't have simple way to do it.

This work adds dump-ptx to gpu-to-cubin pass. One can use it like gpu-to-cubin{chip=sm_90 features=+ptx80 dump-ptx}.

Diff Detail

Event Timeline

guraypp created this revision.Jul 13 2023, 2:47 AM
guraypp requested review of this revision.Jul 13 2023, 2:47 AM
Herald added a project: Restricted Project. · View Herald Transcript
guraypp updated this revision to Diff 539917.Jul 13 2023, 3:04 AM

add an example

guraypp updated this revision to Diff 539977.Jul 13 2023, 5:41 AM

move the test to integration

nicolasvasilache accepted this revision.Jul 13 2023, 9:39 AM

Great, thanks!

This revision is now accepted and ready to land.Jul 13 2023, 9:39 AM
This revision was automatically updated to reflect the committed changes.