This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Nvidia Hopper TMA load integration test
ClosedPublic

Authored by guraypp on Jul 20 2023, 6:40 AM.

Details

Summary

This work introduces sm90 integration testing and adds a single test.

Depends on : D155825 D155680 D155563 D155453

Diff Detail

Event Timeline

guraypp created this revision.Jul 20 2023, 6:40 AM
Herald added a project: Restricted Project. · View Herald TranscriptJul 20 2023, 6:40 AM
guraypp requested review of this revision.Jul 20 2023, 6:40 AM
Herald added a project: Restricted Project. · View Herald Transcript
guraypp added inline comments.Jul 20 2023, 6:45 AM
mlir/test/Integration/GPU/CUDA/sm90/tmaload.mlir
3

The next step will be using -test-lower-to-nvvm pass.

nice!

mlir/test/Integration/GPU/CUDA/sm90/tmaload.mlir
75

nit: can we rename %10 into %is_thread_0 for legibility ?

88

can we test with another thread here too ?
e.g. %is_thread_0_or_thread_42 and CHECK-DAG that both see the same value?

98

nit: nl

This revision is now accepted and ready to land.Jul 21 2023, 12:04 AM
guraypp added inline comments.Jul 21 2023, 12:34 AM
mlir/test/Integration/GPU/CUDA/sm90/tmaload.mlir
88

Yes we can. I will do it in a better way. I will implement elect instruction that will select fastest available thread.

@mehdi_amini @kerrmudgeon related to https://reviews.llvm.org/D155463, it seems we have some difficulties running this e2e example through while also using -test-lower-to-nvvm instead of the gynormous CLI.

If you are presently interested in this line of work and have cycles, we'd happily take some help :)

guraypp updated this revision to Diff 545163.Jul 28 2023, 7:43 AM

Text PTX not the runtime

nicolasvasilache accepted this revision.Jul 28 2023, 1:53 PM
This revision was automatically updated to reflect the committed changes.