This patch refines the logic to determine grid size as previous method
can escape the check of whether CudaBlocksPerGrid could be greater than the actual
hardware limit.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Paths
| Differential D119311
[OpenMP][CUDA] Refine the logic to determine grid size ClosedPublic Authored by tianshilei1992 on Feb 8 2022, 9:53 PM.
Details Summary This patch refines the logic to determine grid size as previous method
Diff Detail
Event TimelineHerald added subscribers: carlosgalvezp, guansong, yaxunl. · View Herald TranscriptFeb 8 2022, 9:53 PM This revision is now accepted and ready to land.Feb 10 2022, 11:11 AM Closed by commit rGf6685f774697: [OpenMP][CUDA] Refine the logic to determine grid size (authored by tianshilei1992). · Explain WhyFeb 10 2022, 11:13 AM This revision was automatically updated to reflect the committed changes.
Diff 407612 openmp/libomptarget/plugins/cuda/src/rtl.cpp
|