This is very similar to 7f1e6fcff942, just fixing a left-over.
With this, it should be possible to use both, -x cuda and -fopenmp in the same invocation,
enabling to use both OpenMP, targeting CPU, and CUDA, targeting the GPU.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
You need to define & undefine the macro around the includes of __clang_cuda_complex_builtins.h in clang/lib/Headers/openmp_wrappers/complex and clang/lib/Headers/openmp_wrappers/complex.h. (see also rG7f1e6fcff942) That should fix the tests. Assuming the tests pass, LGTM.
For release 11, can you file a bug and prepare a patch that applies to the relevant branch?
Comment Actions
Thanks, will land it later.
For the bug see: https://bugs.llvm.org/show_bug.cgi?id=48014
Do I have to create a new phabricator review, too?
I'm currently building the release/11.x branch with the patch cherry-picked (which worked flawlessly).
Comment Actions
CC tstellar@redhat.com in the bug and ask for it to be merged. Attach the patch file to the bug. At least that is (one way) how it works I think.