For smaller types (e.g. 16-bit), we don't import patterns for unscaled loads and stores.
Teach the selector to select these itself.
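For context, here is a rough sketch of when an unscaled form is the one that fits, assuming the usual AArch64 immediate ranges: the scaled unsigned-offset form (LDRH/STRH) takes a 12-bit unsigned immediate in units of the access size, and the unscaled form (LDURH/STURH) takes a signed 9-bit byte offset. The names `AddrForm`, `fitsScaled`, `fitsUnscaled` and `pickAddrForm` are hypothetical and for illustration only; this is not the selector's actual code.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical illustration only; none of these names exist in LLVM.
enum class AddrForm {
  ScaledUnsigned,           // e.g. LDRH/STRH with an unsigned scaled offset
  UnscaledSigned,           // e.g. LDURH/STURH with a signed byte offset
  NeedsMaterializedAddress  // offset fits neither immediate form
};

// Scaled form: non-negative offset, a multiple of the access size,
// and at most 4095 once divided by the access size (12-bit unsigned).
constexpr bool fitsScaled(int64_t Offset, int64_t SizeBytes) {
  return Offset >= 0 && Offset % SizeBytes == 0 && Offset / SizeBytes <= 4095;
}

// Unscaled form: signed 9-bit byte offset, no alignment requirement.
constexpr bool fitsUnscaled(int64_t Offset) {
  return Offset >= -256 && Offset <= 255;
}

// Prefer the scaled form when it fits, then fall back to the unscaled one.
constexpr AddrForm pickAddrForm(int64_t Offset, int64_t SizeBytes) {
  if (fitsScaled(Offset, SizeBytes))
    return AddrForm::ScaledUnsigned;
  if (fitsUnscaled(Offset))
    return AddrForm::UnscaledSigned;
  return AddrForm::NeedsMaterializedAddress;
}
```

Under these assumptions, a 16-bit access at byte offset -2 or 1 can only use the unscaled form, which is the kind of case the new testcases are meant to cover.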
We didn't really have any testcases for unscaled loads and stores anyway, so add some showing what we can select.
(Nothing for STURBi or LDURBi, since we need more legalizer support for s8s to get those.)
This is a 0.2% code size improvement on 7zip at -O3. (Geomean -0.0%)
selectUnscaledLoadStoreOp can take a bool IsStore as its first argument instead of a full opcode.