This patch adds the AlwaysInline attribute to the __kmpc_parallel_51
device runtime call. This improves inlining heuristics which encourages
the indirect function pointer arguemnt to also be inlined. This greatly
improves performance for a few applications whose outlined regions were
not inlined otherwise.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo