Add attribute to be able to generate the intrinsic version of async copy
generating a copy with l1 bypass. This correspond to
cp.async.cg.shared.global in ptx.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Paths
| Differential D125241
[mlir][nvvm] Add attribute to nvvm.cpAsyncOp to control l1 bypass ClosedPublic Authored by ThomasRaoux on May 9 2022, 8:50 AM.
Details Summary Add attribute to be able to generate the intrinsic version of async copy
Diff Detail
Event TimelineThis revision is now accepted and ready to land.May 9 2022, 10:40 AM Closed by commit rG09fc685ce680: [mlir][nvvm] Add attribute to nvvm.cpAsyncOp to control l1 bypass (authored by ThomasRaoux). · Explain WhyMay 9 2022, 12:35 PM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 428166 mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
mlir/test/Dialect/LLVMIR/invalid.mlir
mlir/test/Dialect/LLVMIR/nvvm.mlir
mlir/test/Target/LLVMIR/nvvmir.mlir
|