This is an archive of the discontinued LLVM Phabricator instance.

[OpenMP] Add an option to always inline OpenMP device functions.
ClosedPublic

Authored by jhuber6 on Aug 31 2021, 12:04 PM.

Details

Summary

Performance on GPU targets can be highly variable, sometimes inlining
everything hurts performance and sometimes it greatly improves it. Add
an option to toggle this behaviour to better investigate it.

Diff Detail

Event Timeline

jhuber6 created this revision.Aug 31 2021, 12:04 PM
jhuber6 requested review of this revision.Aug 31 2021, 12:04 PM
Herald added a project: Restricted Project. · View Herald TranscriptAug 31 2021, 12:04 PM
jdoerfert accepted this revision.Aug 31 2021, 12:13 PM

Add a test please. LGTM.

llvm/lib/Transforms/IPO/OpenMPOpt.cpp
4490–4491
This revision is now accepted and ready to land.Aug 31 2021, 12:13 PM
jhuber6 updated this revision to Diff 369767.Aug 31 2021, 12:55 PM

Adding test.