This patch adds support for cache all (.ca) in conversion from nvgpu-to-nvvm for inline asm cp.async.
For lower than 16 bytes cp.async cache global is not allowed and cache all is required to generate a valid ptx.
Paths
| Differential D148604
[mlir][NVGPU] Support cache all (.ca) in nvgpu.device_async_copy ClosedPublic Authored by manishucsd on Apr 17 2023, 10:47 PM.
Details Summary This patch adds support for cache all (.ca) in conversion from nvgpu-to-nvvm for inline asm cp.async. For lower than 16 bytes cp.async cache global is not allowed and cache all is required to generate a valid ptx.
Diff Detail
Event TimelineThis revision is now accepted and ready to land.Apr 18 2023, 5:01 AM Closed by commit rG95cb9862a8dc: [mlir][NVGPU] Support cache all (.ca) in nvgpu.device_async_copy (authored by nicolasvasilache). · Explain WhyApr 18 2023, 5:01 AM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 514609 mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp
mlir/test/Conversion/NVGPUToNVVM/nvgpu-to-nvvm.mlir
|