Enable the -nocudalib flag for the OpenMP device offloading toolchain as well. Currently it can only be used for the CUDA toolchain.
Details
Details
- Reviewers
Hahnfeld ABataev carlo.bertolli caomhin hfinkel tra - Commits
- rG20789a5f096e: [OpenMP] Enable the existing nocudalib flag for OpenMP offloading toolchain.
rC314164: [OpenMP] Enable the existing nocudalib flag for OpenMP offloading toolchain.
rL314164: [OpenMP] Enable the existing nocudalib flag for OpenMP offloading toolchain.
Diff Detail
Diff Detail
- Repository
- rL LLVM
Event Timeline
lib/Driver/ToolChains/Cuda.cpp | ||
---|---|---|
255–257 | The purpose of the original assert was to catch a programming error and this change negates that purpose. I don't understand why it would be OK to end up with an unknown GPU architecture if -nocudalib is specified. |
Comment Actions
One small nit. LGTM otherwise.
test/Driver/openmp-offload-gpu.c | ||
---|---|---|
133 | Please split this RUN line further. |
The purpose of the original assert was to catch a programming error and this change negates that purpose.
Perhaps I'm missing something. Could you elaborate on what's the motivation for this particular change?
I don't understand why it would be OK to end up with an unknown GPU architecture if -nocudalib is specified.
You still do want to pass *some* specific GPU arch to ptxas and that has nothing to do with whether you happen to have suitable libdevice.