Patch improves performance of the full runtime mode by moving
number-of-threads counter to the shared memory. It also allows to save
global memory.
Details
Details
Diff Detail
Diff Detail
- Repository
- rOMP OpenMP
- Build Status
Buildable 31385 Build 31384: arc lint + arc unit
Event Timeline
Comment Actions
Looks good.
libomptarget/deviceRTLs/nvptx/src/libcall.cu | ||
---|---|---|
33–36 | Can you make this comment clearer? What is the first parallel region and what are the other parallel regions? I suppose you mean L1 parallel vs nested? |
Can you make this comment clearer? What is the first parallel region and what are the other parallel regions? I suppose you mean L1 parallel vs nested?