Replaced CUDA builtin vars with LLVM intrinsics such that we don't need
definitions of those intrinsics.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Nice. Particularly like getting rid of these because they use field access syntax to make a function call, which clang does with a Microsoft extension.
clang-format: please reformat the code
-DEVICE int GetNumberOfBlocksInKernel() { return __nvvm_read_ptx_sreg_nctaid_x(); } +DEVICE int GetNumberOfBlocksInKernel() { + return __nvvm_read_ptx_sreg_nctaid_x(); +}