Details
- Reviewers
jdoerfert tianshilei1992
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Making the usage use dynamic memory as well too, now the usage only uses as much
shared memory are there are threads in the block.
openmp/libomptarget/plugins/cuda/src/rtl.cpp | ||
---|---|---|
1240 | +? Not *? |
openmp/libomptarget/plugins/cuda/src/rtl.cpp | ||
---|---|---|
1240 | This is a linear region of memory, we allocate a byte per-thread to save the memory usage per-thread. This allocated enough memory for the stack, and enough memory for all threads that are active. |
LG. Two comments.
openmp/libomptarget/plugins/cuda/src/rtl.cpp | ||
---|---|---|
932 | We should add a TODO here. It's unreasonable that we copy stuff from the device even though the host has the image with the information. I know this is how we do it for other stuff too, in general seems sub-optimal. | |
1241 | Put these things in separate variables with explanation what they mean and how the size is computed. In the current way this is just magic. |
clang-tidy: error: 'Interface.h' file not found [clang-diagnostic-error]
not useful