As noted in https://reviews.llvm.org/D111220#3048397,
after masked memory intrinsic has been scalarized,
we may have an opportunity to improve the IR,
because the pointers were scalarized via extractelement,
but we might be able to fold them into getelementptr's.
We'd also need to schedule another VectorCombine run
somewhere after ScalarizeMaskedMemIntrin in backend IR phase.