All nowait series of interfaces in libomptarget accept four more arguments (int32_t depNum, void *depList, int32_t noAliasDepNum, void *noAliasDepList) compared with their counterparts w/o nowait. These extra arguments were expected for dependence resolution, potentially lowered to device side. Current implementation calls libomp function __kmpc_omp_taskwait. However, the front end simply ignores them, that these four arguments are not emitted at all. As a consequence, the depNum and noAliasDepNum are garbage, which could lead to unnecessary task wait.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Now that target nowait operations are handled via hidden helper tasks which make sure that dependencies are satisfied, these calls to __kmpc_omp_taskwait are not only redundant but also hurt performance as they serialize execution unconditionally. LGTM. Can you edit the patch description and explain exactly what you mean and why this patch is needed? Thanks!