This patch makes prep for dropping CUDA when compiling deviceRTLs.
CUDA intrinsics are replaced by NVVM intrinsics which refers to code in
__clang_cuda_intrinsics.h. We don't want to directly include it because in the
near future we're going to switch to OpenMP and by then the header cannot be
used anymore.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
The cuda_intrisics header would need to be substantially refactored to support including from openmp. Doesn't presently seem worthwhile for four straightforward functions.
openmp/libomptarget/deviceRTLs/nvptx/src/target_impl.cu | ||
---|---|---|
83 | The expression ((WARPSIZE - Width) << 8) | 0x1f) occurs on both branches, maybe assign it to a local variable before the #if |
The expression ((WARPSIZE - Width) << 8) | 0x1f) occurs on both branches, maybe assign it to a local variable before the #if