Details
- Reviewers
jdoerfert tianshilei1992
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Making the usage use dynamic memory as well too, now the usage only uses as much
shared memory are there are threads in the block.
| openmp/libomptarget/plugins/cuda/src/rtl.cpp | ||
|---|---|---|
| 1240 | +? Not *? | |
| openmp/libomptarget/plugins/cuda/src/rtl.cpp | ||
|---|---|---|
| 1240 | This is a linear region of memory, we allocate a byte per-thread to save the memory usage per-thread. This allocated enough memory for the stack, and enough memory for all threads that are active. | |
LG. Two comments.
| openmp/libomptarget/plugins/cuda/src/rtl.cpp | ||
|---|---|---|
| 932 | We should add a TODO here. It's unreasonable that we copy stuff from the device even though the host has the image with the information. I know this is how we do it for other stuff too, in general seems sub-optimal. | |
| 1241 | Put these things in separate variables with explanation what they mean and how the size is computed. In the current way this is just magic. | |
clang-tidy: error: 'Interface.h' file not found [clang-diagnostic-error]
not useful