Currently we provide the send_n and recv_n functions. These were
somewhat divergent and not tested on the GPU. This patch changes the
support to be more common. We do this my making the CPU provide an array
equal the to at least the lane size while the GPU can rely on the
private memory address of its stack variables. This allows us to send
data back and forth generically.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
How does deallocation happen here? I see the lambda using malloc on one path and new on another, but can't find a corresponding free for one of them
Comment Actions
The server does a delete []. The malloc on the GPU is currently a bump pointer so free is a no-op.