I'm using clang to compile CUDA code. And just found that clang doesn't support the per-thread stream option for NV CUDA. I don't know if there is another solution.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Looking at CUDA headers, it appears that changing only compiler-generated-glue may be insufficient. A lot of other CUDA API calls need to be changed to _ptsz variant and for that we need to have CUDA_API_PER_THREAD_DEFAULT_STREAM defined.
Another point that I don't get is why the libcxx CI failed. Anyone know the reason this message?
"
Running global pre-checkout hook
Preparing working directory
Running global post-checkout hook
Running commands
$ trap 'kill -- $$' INT TERM QUIT; libcxx/utils/ci/generate-buildkite-pipeline | buildkite-agent pipeline upload
2023-07-11 02:36:45 INFO Reading pipeline config from STDIN
2023-07-11 02:36:46 INFO Updating BUILDKITE_COMMIT to "a297905cd83911c8a03f060cb9d96bc99aae3f8c"
2023-07-11 02:36:46 FATAL Pipeline parsing of "(stdin)" failed (Expected identifier to start with a letter, got ')
🚨 Error: The command exited with status 1
user command error: exit status 1
"
Can anyone push this?
I can help with this. How do you want your commit to be attributed? The patch currently has boxu.zhang <boxu.zhang@hotmail.com>. Do you want it to be changed to something else?