This is more of a proof of concept for now although it is correct and does not result in noticeable perf degradations.
This is what a better decoupling of transfer read/write from vectorization of conv would look like. This form is close to ready to plop into a new vector.conv op and the vector.transfer operations to be generalized as part of generic vectorization once the properties ConvolutionOpInterface are inferred from the indexing maps.