This work introduce cp.async.bulk.tensor.shared.cluster.global in NVVM dialect that executes load using TMA.
Depends on D155056
Paths
| Differential D155060
[mlir][nvvm] Add `cp.async.bulk.tensor.shared.cluster.global` ClosedPublic Authored by guraypp on Jul 12 2023, 2:49 AM.
Details Summary This work introduce cp.async.bulk.tensor.shared.cluster.global in NVVM dialect that executes load using TMA. Depends on D155056
Diff Detail
Event Timelinenicolasvasilache added inline comments. This revision is now accepted and ready to land.Jul 17 2023, 8:07 AM This revision was landed with ongoing or failed builds.Jul 17 2023, 8:10 AM Closed by commit rG28555793b1e5: [mlir][nvvm] Add `cp.async.bulk.tensor.shared.cluster.global` (authored by guraypp). · Explain Why This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 540975 mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
mlir/test/Conversion/NVVMToLLVM/nvvm-to-llvm.mlir
|
spurious include ? (should be transitively included already)