The noinline for non-SPMD parallel functions is probably not necessary
but as long as we use it we should put it on the outermost parallel
function, which is the wrapper, not the actual outlined function.
Resolves PR49752
Paths
| Differential D99506
[OpenMP][NFC] Move the `noinline` to the parallel entry point ClosedPublic Authored by jdoerfert on Mar 29 2021, 7:37 AM.
Details Summary The noinline for non-SPMD parallel functions is probably not necessary Resolves PR49752
Diff Detail
Event TimelineComment Actions Context in https://bugs.llvm.org/show_bug.cgi?id=49752 is that this resolves a regression in stack usage from D94315. This change looks good. I'm not totally sold on using a function call boundary to convey invariants on ICV, but that's an existing property. This revision is now accepted and ready to land.Mar 29 2021, 8:09 AM Comment Actions
There is a test for the presence of noinline, let me make it more explicit. Closed by commit rG03cc8a1ba050: [OpenMP][NFC] Move the `noinline` to the parallel entry point (authored by jdoerfert). · Explain WhyMar 29 2021, 11:12 PM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 334054 clang/lib/CodeGen/CGOpenMPRuntimeGPU.cpp
clang/test/OpenMP/nvptx_parallel_codegen.cpp
|