This is an archive of the discontinued LLVM Phabricator instance.

[libomptarget][amdgpu] Call into deviceRTL instead of ockl
ClosedPublic

Authored by JonChesterfield on Dec 15 2020, 4:11 PM.

Details

Summary

[libomptarget][amdgpu] Call into deviceRTL instead of ockl

Amdgpu codegen presently emits a call into ockl. The same functionality
is already present in the deviceRTL. Adds an amdgpu specific entry point
to avoid the dependency. This lets simple openmp code (specifically, that
which doesn't use libm) run without rocm device libraries installed.

Diff Detail

Event Timeline

JonChesterfield requested review of this revision.Dec 15 2020, 4:11 PM
Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptDec 15 2020, 4:11 PM
JonChesterfield planned changes to this revision.Dec 16 2020, 8:48 AM

There's a codegen test that checks for __ockl_get_local_size. Testing a change to that test out of tree now, probably need to update said test before landing this.

  • update test, fix whitespace
ronlieb accepted this revision.Jan 4 2021, 8:10 AM

lgtm

This revision is now accepted and ready to land.Jan 4 2021, 8:10 AM
This revision was landed with ongoing or failed builds.Jan 4 2021, 8:49 AM
This revision was automatically updated to reflect the committed changes.