This is an archive of the discontinued LLVM Phabricator instance.

[OpenMP][CUDA] Refine the logic to determine grid size
ClosedPublic

Authored by tianshilei1992 on Feb 8 2022, 9:53 PM.

Details

Summary

This patch refines the logic to determine grid size as previous method
can escape the check of whether CudaBlocksPerGrid could be greater than the actual
hardware limit.

Diff Detail

Event Timeline

tianshilei1992 created this revision.Feb 8 2022, 9:53 PM
tianshilei1992 requested review of this revision.Feb 8 2022, 9:53 PM
Herald added a project: Restricted Project. · View Herald TranscriptFeb 8 2022, 9:53 PM
tianshilei1992 edited the summary of this revision. (Show Details)Feb 9 2022, 9:08 PM
jdoerfert accepted this revision.Feb 10 2022, 11:11 AM

LG, please make a ticket and backport to 14

This revision is now accepted and ready to land.Feb 10 2022, 11:11 AM