This is an archive of the discontinued LLVM Phabricator instance.

[OpenMP][NVPTX] Simplify codegen for orphaned parallel, NFCI.
ClosedPublic

Authored by Hahnfeld on Oct 1 2018, 10:09 AM.

Details

Summary

Worker threads fork off to the compiler generated worker function
directly after entering the kernel function. Hence, there is no
need to check whether the current thread is the master if we are
outside of a parallel region (neither SPMD nor parallel_level > 0).

Diff Detail

Event Timeline

Hahnfeld created this revision.Oct 1 2018, 10:09 AM
This revision is now accepted and ready to land.Oct 1 2018, 11:32 AM
This revision was automatically updated to reflect the committed changes.