If a program uses thread states, it is going to be rather slow anyway. If
it does not, we want to avoid wasting shared memory. This patch introduces
a slight penalty (malloc + indirection) on the slow path and reduces
resource usage on the fast path.
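A minimal host-side sketch of the trade-off described above, not the actual DeviceRTL code: `TeamState`, `ThreadState`, `MaxThreads`, and the three helper functions are hypothetical names chosen for illustration, and the synchronization a real device runtime would need is omitted. The team keeps only one pointer; the per-thread table is allocated the first time a thread asks for its own state.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>

// Hypothetical stand-ins for the runtime types (assumptions, not the real definitions).
struct ThreadState {
  int Level = 0;
};

constexpr std::size_t MaxThreads = 1024; // assumed upper bound on threads per team

struct TeamState {
  // Fast path cost: a single pointer, null until some thread needs its own state.
  ThreadState **ThreadStates = nullptr;
};

// Fast path: one null check, no per-thread storage reserved up front.
inline ThreadState *lookupThreadState(const TeamState &Team, std::size_t Tid) {
  assert(Tid < MaxThreads && "thread id out of range");
  if (!Team.ThreadStates)
    return nullptr;
  return Team.ThreadStates[Tid];
}

// Slow path: allocate the pointer table on first use (the malloc), then
// reach the state through that table (the indirection).
inline ThreadState *getOrCreateThreadState(TeamState &Team, std::size_t Tid) {
  assert(Tid < MaxThreads && "thread id out of range");
  if (!Team.ThreadStates) {
    void *Mem = std::malloc(MaxThreads * sizeof(ThreadState *));
    if (!Mem)
      return nullptr;
    Team.ThreadStates = static_cast<ThreadState **>(Mem);
    for (std::size_t I = 0; I < MaxThreads; ++I)
      Team.ThreadStates[I] = nullptr;
  }
  if (!Team.ThreadStates[Tid])
    Team.ThreadStates[Tid] = new ThreadState();
  return Team.ThreadStates[Tid];
}

// Free every per-thread state and the table itself.
inline void releaseThreadStates(TeamState &Team) {
  if (!Team.ThreadStates)
    return;
  for (std::size_t I = 0; I < MaxThreads; ++I)
    delete Team.ThreadStates[I];
  std::free(Team.ThreadStates);
  Team.ThreadStates = nullptr;
}
```

In this sketch a team that never creates a thread state pays for one pointer instead of `MaxThreads` entries, which is the resource saving the summary refers to.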
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Add assertion
| openmp/libomptarget/DeviceRTL/include/State.h | |
|---|---|
| 112 | Not without rewriting a lot. We use the pointer to build a list of nested thread states. Otherwise we would need to either set it up so that the list is built with its head in the outermost array, or traverse the list on every lookup. Neither seems worth it now. |
Does it work if we just have ThreadStateTy *ThreadStates?
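
To make the reply above concrete, here is a hedged sketch of how one pointer per slot can serve as the head of a per-thread list of nested states. `NestedThreadState`, `Previous`, `NumSlots`, `pushState`, and `popState` are hypothetical names for illustration and are not taken from `State.h`.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical node type: each nested state links to the one it replaced.
struct NestedThreadState {
  int Level;
  NestedThreadState *Previous; // enclosing state, or nullptr for the outermost
};

constexpr std::size_t NumSlots = 4; // assumed number of threads, kept small here

// Each slot holds the head of one thread's list, i.e. its innermost state.
// Entering a nested level only redirects the slot pointer.
inline void pushState(NestedThreadState **Slots, std::size_t Tid,
                      NestedThreadState *New) {
  assert(Tid < NumSlots && "thread id out of range");
  New->Previous = Slots[Tid];
  Slots[Tid] = New;
}

// Leaving a nested level restores the enclosing state as the head.
inline NestedThreadState *popState(NestedThreadState **Slots, std::size_t Tid) {
  assert(Tid < NumSlots && "thread id out of range");
  NestedThreadState *Top = Slots[Tid];
  assert(Top && "no state to pop");
  Slots[Tid] = Top->Previous;
  return Top;
}
```

With an array of state objects instead of pointers, the slot itself would have to be the list head, so the current state would either be copied into the slot on every push or found by walking the list on each lookup.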