Implemented simple and lightweight runtime support for SPMD mode-based
constructs. It adds support for L2 sequential parallelism wihtout full
runtime support. Also, patch fixes some use cases for
uninitialized|lightweight runtime.
Details
Diff Detail
- Repository
- rL LLVM
Event Timeline
Great that we don't need any additional entry points. I suppose LLVM is just optimizing away unneeded code when the runtime is inlined?
openmp/trunk/libomptarget/deviceRTLs/nvptx/src/omptarget-nvptx.cu | ||
---|---|---|
192–200 | Again looking at this code: Shouldn't all threads return, ie should it be outside of if (threadId == 0)? |
openmp/trunk/libomptarget/deviceRTLs/nvptx/src/omptarget-nvptx.cu | ||
---|---|---|
192–200 | You're right, that was an original intention. Could commit a quick fix? |
openmp/trunk/libomptarget/deviceRTLs/nvptx/src/omptarget-nvptx.cu | ||
---|---|---|
192–200 | Yes please. |
openmp/trunk/libomptarget/deviceRTLs/nvptx/src/omptarget-nvptx.cu | ||
---|---|---|
192–200 | I meant could you commit the fix? :) I don't have an access to my laptop. |
openmp/trunk/libomptarget/deviceRTLs/nvptx/src/omptarget-nvptx.cu | ||
---|---|---|
192–200 | Ah, done in r341328. Thanks for your quick replies! |
openmp/trunk/libomptarget/deviceRTLs/nvptx/src/loop.cu | ||
---|---|---|
146–167 | Has this change related to kmp_sched_static_balanced_chunk slipped into this patch on commit? I can't find it in the original review. (No need to revert, just curious...) |
openmp/trunk/libomptarget/deviceRTLs/nvptx/src/loop.cu | ||
---|---|---|
146–167 | Yes, during testing I found some inconsistency between clang and NVPTX libomp abd fixed all of them in a single patch. |
Has this change related to kmp_sched_static_balanced_chunk slipped into this patch on commit? I can't find it in the original review.
(No need to revert, just curious...)