[libomptarget][nvptx] Replace cuda atomic primitives with clang intrinsics
Tested by diff of IR generated for target_impl.cu before and after. NFC. Part
of removing deviceRTL build time dependency on cuda SDK.
Differential D95294
[libomptarget][nvptx] Replace cuda atomic primitives with clang intrinsics JonChesterfield on Jan 23 2021, 12:28 PM. Authored by
Details [libomptarget][nvptx] Replace cuda atomic primitives with clang intrinsics Tested by diff of IR generated for target_impl.cu before and after. NFC. Part
Diff Detail
Event Timeline
|
__syncwarp is left. It can be also simply replaced by __nvvm_bar_warp_sync(mask).