This effectively splits the scheduling WriteVecMaskedStore(Y) classes
into four different classes (one per each variant).
VecMaskedStores are now correctly marked as 'unsupported' by the bdver2
and btver2 models (since those are only used for AVX2 VPMASKMOV variants).
No functional change intended.
TBH it's better if we can keep the integer vector entries all together, not put in the float vector sections - same for all other models