This work introduces special registers such as cluster ID, dimensions, and more for managing CTA clusters, which are groups of CTAsthat can synchronize and communicate through shared memory. This is for Nvidia's sm_90 capability.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo