This is an archive of the discontinued LLVM Phabricator instance.

[mlir][cuda runtime] Set Max Dynamic Shared Memory Attribute
ClosedPublic

Authored by guraypp on Aug 2 2023, 3:57 AM.

Details

Summary

This work addresses larger shared memory usage in the MLIR CUDA runtime. When a kernel's shared memory usage exceeds 48KB, the CU_FUNC_ATTRIBUTE_MAX_DYNAMIC_SHARED_SIZE_BYTES attribute of the CUDA kernel must be set accordingly before launch. This patch sets the attribute as required. It also adds some debug prints for better visibility and troubleshooting.
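As a rough illustration of the opt-in logic described in the summary, here is a minimal C++ sketch. It is not the code from this revision. The names `configureDynamicSmem`, `SetMaxDynamicSmemFn`, and `kDefaultSmemLimitBytes` are hypothetical, and the driver call is abstracted behind a callback so the sketch compiles without the CUDA toolkit. In a real runtime the callback would presumably wrap `cuFuncSetAttribute` with `CU_FUNC_ATTRIBUTE_MAX_DYNAMIC_SHARED_SIZE_BYTES`, and the device maximum would come from `cuDeviceGetAttribute` with `CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK_OPTIN`. Whether the actual patch sets the attribute only above 48KB or unconditionally is not stated here, so the threshold check is an assumption.

```cpp
#include <cstdio>
#include <functional>

// Default per-block shared memory limit that needs no opt-in (48 KB).
constexpr int kDefaultSmemLimitBytes = 48 * 1024;

// Hypothetical stand-in for the driver call
//   cuFuncSetAttribute(function,
//                      CU_FUNC_ATTRIBUTE_MAX_DYNAMIC_SHARED_SIZE_BYTES, bytes);
// The callback returns true on success.
using SetMaxDynamicSmemFn = std::function<bool(int bytes)>;

// Prepares a kernel that requests `smemBytes` of dynamic shared memory.
// `deviceOptInMaxBytes` is the largest opt-in size the device reports.
// Returns true if the kernel can be launched with the requested size.
bool configureDynamicSmem(int smemBytes, int deviceOptInMaxBytes,
                          const SetMaxDynamicSmemFn &setAttr) {
  // Requests within the default limit need no attribute change.
  if (smemBytes <= kDefaultSmemLimitBytes)
    return true;

  // Debug print for troubleshooting: the device cannot satisfy the request.
  if (smemBytes > deviceOptInMaxBytes) {
    std::fprintf(stderr,
                 "Requested %d bytes of shared memory, device maximum is %d\n",
                 smemBytes, deviceOptInMaxBytes);
    return false;
  }

  // Opt in to the larger dynamic shared memory size for this kernel.
  return setAttr(smemBytes);
}
```

A caller would invoke `configureDynamicSmem` right before launching the kernel and skip the launch if it returns false.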

Diff Detail

Event Timeline

guraypp created this revision. Aug 2 2023, 3:57 AM
Herald added a project: Restricted Project. Aug 2 2023, 3:57 AM
guraypp requested review of this revision. Aug 2 2023, 3:57 AM
nicolasvasilache accepted this revision. Aug 2 2023, 4:58 AM
This revision is now accepted and ready to land. Aug 2 2023, 4:58 AM