This is an archive of the discontinued LLVM Phabricator instance.

[MLIR][NVGPU] Handling Offset in `nvgpu.tma.async.load`
ClosedPublic

Authored by guraypp on Aug 8 2023, 2:38 AM.

Details

Summary

When using nvgpu.tma.async.load Op to asynchronously load data into shared memory, it fails to account for provided offsets, potentially leading to incorrect memory access. Using offset is common practice especially with the dynamic shared memory. This work addresses the problem by ensuring proper consideration of offsets.

Diff Detail

Event Timeline

guraypp created this revision.Aug 8 2023, 2:38 AM
Herald added a project: Restricted Project. · View Herald TranscriptAug 8 2023, 2:38 AM
guraypp requested review of this revision.Aug 8 2023, 2:38 AM
nicolasvasilache accepted this revision.Aug 8 2023, 4:23 AM
This revision is now accepted and ready to land.Aug 8 2023, 4:23 AM
This revision was automatically updated to reflect the committed changes.