This is an archive of the discontinued LLVM Phabricator instance.

[mlir][NVGPU] Support cache all (.ca) in nvgpu.device_async_copy
ClosedPublic

Authored by manishucsd on Apr 17 2023, 10:47 PM.

Details

Summary

This patch adds support for cache all (.ca) in conversion from nvgpu-to-nvvm for inline asm cp.async.

For lower than 16 bytes cp.async cache global is not allowed and cache all is required to generate a valid ptx.

Diff Detail

Event Timeline

manishucsd created this revision.Apr 17 2023, 10:47 PM
Herald added a project: Restricted Project. · View Herald TranscriptApr 17 2023, 10:47 PM
manishucsd requested review of this revision.Apr 17 2023, 10:47 PM
nicolasvasilache accepted this revision.Apr 18 2023, 5:01 AM

Thanks!

Some nits, fixing them.

This revision is now accepted and ready to land.Apr 18 2023, 5:01 AM