This patch refines the logic that determines the grid size, because the previous
method could bypass the check of whether CudaBlocksPerGrid exceeds the actual
hardware limit.
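As an illustration of the idea only (this is not the code from the patch), the sketch below shows one way to structure the selection so that no path can skip the hardware check: every branch computes a candidate block count, and a single clamp is applied at the end. The function name `pickBlocksPerGrid` and all of its parameters are hypothetical and do not come from `rtl.cpp`.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical helper, NOT the code from D119311.
// RequestedBlocks:   block count requested by the caller (<= 0 means unspecified).
// LoopTripCount:     number of loop iterations (0 means unknown).
// ThreadsPerBlock:   threads launched per block.
// DefaultBlocks:     fallback when nothing else is known.
// HardwareMaxBlocks: device limit on blocks per grid (assumed to be queried elsewhere).
inline unsigned pickBlocksPerGrid(int RequestedBlocks, uint64_t LoopTripCount,
                                  unsigned ThreadsPerBlock,
                                  unsigned DefaultBlocks,
                                  unsigned HardwareMaxBlocks) {
  assert(HardwareMaxBlocks > 0 && "device must allow at least one block");

  // Use a 64-bit candidate so a large trip count cannot wrap around
  // before it is compared against the limit.
  uint64_t Blocks;
  if (RequestedBlocks > 0)
    Blocks = static_cast<uint64_t>(RequestedBlocks);
  else if (LoopTripCount > 0 && ThreadsPerBlock > 0)
    Blocks = (LoopTripCount - 1) / ThreadsPerBlock + 1; // ceiling division
  else
    Blocks = DefaultBlocks;

  // Single clamp shared by every branch above, so none can escape the check.
  return static_cast<unsigned>(
      std::min<uint64_t>(Blocks, HardwareMaxBlocks));
}
```

Under these assumptions, a request for 100000 blocks on a device that allows 65535 yields 65535, while a trip count of 1000 with 128 threads per block yields 8.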
Details
Diff Detail
- Repository: rG LLVM Github Monorepo
D119311: [OpenMP][CUDA] Refine the logic to determine grid size
Closed, Public. Authored by tianshilei1992 on Feb 8 2022, 9:53 PM.
Event Timeline
- Feb 8 2022, 9:53 PM: Herald added subscribers: carlosgalvezp, guansong, yaxunl.
- Feb 10 2022, 11:11 AM: This revision is now accepted and ready to land.
- Feb 10 2022, 11:13 AM: Closed by commit rGf6685f774697: [OpenMP][CUDA] Refine the logic to determine grid size (authored by tianshilei1992).
- This revision was automatically updated to reflect the committed changes.
Diff 407612 openmp/libomptarget/plugins/cuda/src/rtl.cpp