Removes MaxParallelLevel references from rtl.cpp and drops
resulting dead code.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Very good, thank you.
Some context. Nested parallel is implemented by a transform to sequential execution by one parallel region. This transform uses linked call frames, allocated by malloc from the gpu within deviceRTL.
A previous version of amdgpu's deviceRTL had a buggy malloc implementation. The code that this patch deletes worked around that by allocating the gpu memory from the host instead.
That work around is now gone from the downstream fork (because malloc is no longer buggy), thus also removed from here.