This is an archive of the discontinued LLVM Phabricator instance.

[mlir][Linalg] Improve conv vectorization for the stride==1 case.
ClosedPublic

Authored by nicolasvasilache on Oct 20 2021, 7:35 AM.

Details

Summary

In the stride == 1 case, conv1d reads contiguous data along the input dimension. This can be advantageaously used to bulk memory transfers and compute while avoiding unrolling. Experimentally, this can yield speedups of up to 50%.

Diff Detail

Event Timeline

nicolasvasilache requested review of this revision.Oct 20 2021, 7:35 AM
This revision is now accepted and ready to land.Oct 20 2021, 9:37 AM
This revision was landed with ongoing or failed builds.Oct 21 2021, 8:18 AM
This revision was automatically updated to reflect the committed changes.