This fixes two regressions introduced by https://reviews.llvm.org/D155941:
(1) it may be that more dimensions are extracted from an existing
scf.forall op than there are dimensions to map to, so these iterators
should be combined with zip and not zip_longest; and (2) if
generate_gpu_launch is set, the grid has to be 3D, but previously,
lower-dimensional grids were also accepted, so this patch trivially
extends these grids with unit dimensions.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Abandoning because 92f088d335e4a55f10e74c541ac52ec2a4d4ceeb fixes the issues and has landed.