Add a CL option to pass sm version in MLIR CUDA Runner. There are currently
no checks on the compatibility/validity of PTX and SM version. This is just naive addition
to check the working of WMMA tensor core operations. Add CL option to pass index-bitwidth
for LLVM lowering passes on the device side.
Details
Details
- Reviewers
bondhugula ftynse
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Unit Tests
Unit Tests
Event Timeline
Comment Actions
Changes in this diff:-
1.) Add CL option to pass index-bitwidth for LLVM lowering passes on the device side.
Comment Actions
Closing revision, As recent changes(removal of mlir-cuda-runner and others) have baked the functionality of this patch into convert-gpu-to-nvvm and gpu-to-cubin.