This is an archive of the discontinued LLVM Phabricator instance.

[libomptarget][AMDGPU] Remove MaxParallelLevel
ClosedPublic

Authored by pdhaliwal on Dec 1 2020, 11:23 PM.

Details

Summary

Removes MaxParallelLevel references from rtl.cpp and drops
resulting dead code.

Diff Detail

Event Timeline

pdhaliwal created this revision.Dec 1 2020, 11:23 PM
Herald added a project: Restricted Project. · View Herald TranscriptDec 1 2020, 11:23 PM
pdhaliwal requested review of this revision.Dec 1 2020, 11:23 PM
JonChesterfield accepted this revision.EditedDec 2 2020, 5:31 AM

Very good, thank you.

Some context. Nested parallel is implemented by a transform to sequential execution by one parallel region. This transform uses linked call frames, allocated by malloc from the gpu within deviceRTL.

A previous version of amdgpu's deviceRTL had a buggy malloc implementation. The code that this patch deletes worked around that by allocating the gpu memory from the host instead.

That work around is now gone from the downstream fork (because malloc is no longer buggy), thus also removed from here.

This revision is now accepted and ready to land.Dec 2 2020, 5:31 AM
This revision was landed with ongoing or failed builds.Dec 2 2020, 9:27 PM
This revision was automatically updated to reflect the committed changes.