Split out of D6202.
For 0-lane stores, we used to generate stuff like:
fmov w8, s0 str w8, [x0, x1, lsl #2]
instead of:
str s0, [x0, x1, lsl #2]
To correct that: for store lane 0 patterns, directly match to STR <subreg>0
Byte-sized instructions don't have the special case for a 0 index, because FPR8s are defined to have untyped content.