The patch only plumbs through the option necessary for targeting sm_86 GPUs w/o adding any new functionality.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
One nit below. Seems reasonable in general.
So we add ptx72 but it's not used with sm_86, interesting.
clang/lib/Driver/ToolChains/Cuda.cpp | ||
---|---|---|
103 | leftover? |
Comment Actions
So we add ptx72 but it's not used with sm_86, interesting.
ptx71 is the minimum/default requited PTX version for sm_86. If we compile with CUDA-11.2, clang will set the '+ptx72' as we may potentially need it in order to link in libdevice bitcode or if source code (e.g CUDA headers) has inline asm with instructions supported by CUDA-11.2.
leftover?