Adds apply_patterns.gpu.unroll_vectors_subgroup_mma which allows
specifying a native MMA shape of m, n, and k to unroll to,
greedily unrolling the inner most dimension of contractions and other
vector operations based on expected usage.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
This would fit better in nvgpu transform ops I believe, as it has heuristics specifically for those cases.
Approving conditioned on moving to nvgpu.
Comment Actions
Because this first targets gpu.subgroup_mma ops (which has a shared target of SPIR-V Cooperative Matrix) I moved it to gpu instead.